BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy8713
         (309 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|194384502|dbj|BAG59411.1| unnamed protein product [Homo sapiens]
          Length = 273

 Score =  221 bits (563), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 120/268 (44%), Positives = 160/268 (59%), Gaps = 47/268 (17%)

Query: 44  NSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREI 103
           ++  N+  ++LK   G      L   + P+ + ++E D  LPA+FD+R +WP CPTI+EI
Sbjct: 45  HNFYNVDMSYLKRLCGTF----LGGPKPPQRVMFTE-DLKLPASFDAREQWPQCPTIKEI 99

Query: 104 RDQGSCGSCWGCRPYEIAPCEH--HVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLN 161
           RDQGSCGSCW     E        HVNG+RP C   +G TPKC + C+  Y   YK+D +
Sbjct: 100 RDQGSCGSCWAFGAVEAISDRICIHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKH 158

Query: 162 FGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTI 221
           +G  SYSVS++EK IM EIY++GPVEGAF+V+ D +LYKSG +                 
Sbjct: 159 YGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------- 201

Query: 222 RDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWG 281
                                + +G+ +GGHAIRILGWG +  +   YWL+ANSWNTDWG
Sbjct: 202 --------------------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWG 239

Query: 282 DNGLFKILRGKDECGIESSITAGVPKLD 309
           DNG FKILRG+D CGIES + AG+P+ D
Sbjct: 240 DNGFFKILRGQDHCGIESEVVAGIPRTD 267


>gi|496317|dbj|BAA04103.1| Sarcophaga pro-cathepsin B [Sarcophaga peregrina]
          Length = 344

 Score =  216 bits (551), Expect = 9e-54,   Method: Compositional matrix adjust.
 Identities = 113/210 (53%), Positives = 133/210 (63%), Gaps = 39/210 (18%)

Query: 99  TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
           T + I   G  GS  GCRPYEIAPCEHHVNGTRP CD   G TP C  ECQ++YDV YK 
Sbjct: 174 TRKGIVSGGPYGSSQGCRPYEIAPCEHHVNGTRPPCDGEHGKTPSCRHECQKSYDVDYKT 233

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
           D +FG+KSYSV  N K I KEI ++GPVEGAFTV++DLILYK G +              
Sbjct: 234 DKHFGSKSYSVKRNVKDIQKEIMQNGPVEGAFTVYEDLILYKDGVY-------------- 279

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
                                   +  G+ LGGHAIRILGWG + K+   YWLIANSWNT
Sbjct: 280 -----------------------QHVHGRELGGHAIRILGWGVENKT--PYWLIANSWNT 314

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPKL 308
           DWG+NG FK+LRG+D CGIES+I AG+PK+
Sbjct: 315 DWGNNGFFKMLRGEDHCGIESAIAAGLPKV 344



 Score = 81.3 bits (199), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 38/75 (50%), Positives = 47/75 (62%), Gaps = 3/75 (4%)

Query: 43  KNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEV---DEDLPANFDSRTKWPNCPT 99
           +N   ++PR+H +  MGVHPD +        L+   EV   D D+P  FD+R  WPNCPT
Sbjct: 48  RNYDKSVPRSHFRRLMGVHPDAHKFTLHEKSLVLGEEVGLADSDVPEEFDARKAWPNCPT 107

Query: 100 IREIRDQGSCGSCWG 114
           I EIRDQGSCGSCW 
Sbjct: 108 IGEIRDQGSCGSCWA 122



 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 25/32 (78%), Positives = 26/32 (81%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CGFGCNGGFPG AW YW + GIVSGG YGS Q
Sbjct: 157 CGFGCNGGFPGAAWAYWTRKGIVSGGPYGSSQ 188


>gi|347972086|ref|XP_313835.5| AGAP004533-PA [Anopheles gambiae str. PEST]
 gi|333469165|gb|EAA09183.5| AGAP004533-PA [Anopheles gambiae str. PEST]
          Length = 337

 Score =  214 bits (546), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 110/202 (54%), Positives = 133/202 (65%), Gaps = 39/202 (19%)

Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
           G  GS  GC+PY IAPCEHHVNGTRPSC+   G TPKCV++CQE+Y+VPY+KD  FGA S
Sbjct: 175 GPFGSNLGCQPYAIAPCEHHVNGTRPSCEGEGGKTPKCVKKCQESYNVPYQKDKRFGASS 234

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           YS++ +E  I KEI  +GPVEGAFTV++DL+ YK G +                      
Sbjct: 235 YSIARHEAQIQKEIMTNGPVEGAFTVYEDLLHYKEGVY---------------------- 272

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
                           + +GK LGGHAIRILGWG +  +  KYWLIANSWN+DWGDNG F
Sbjct: 273 ---------------QHVTGKMLGGHAIRILGWGVENGT--KYWLIANSWNSDWGDNGFF 315

Query: 287 KILRGKDECGIESSITAGVPKL 308
           KILRG+D  GIESSI+AG+PKL
Sbjct: 316 KILRGEDHLGIESSISAGLPKL 337



 Score = 58.2 bits (139), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 30/83 (36%), Positives = 41/83 (49%), Gaps = 3/83 (3%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPA 68
           CGFGCNGGFPG AW YWV+ G+VSGG +GS    +         H+    G  P      
Sbjct: 150 CGFGCNGGFPGAAWSYWVRKGLVSGGPFGSNLGCQPYAIAPCEHHVN---GTRPSCEGEG 206

Query: 69  NRLPELIGYSEVDEDLPANFDSR 91
            + P+ +   +   ++P   D R
Sbjct: 207 GKTPKCVKKCQESYNVPYQKDKR 229


>gi|47217183|emb|CAG11019.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 351

 Score =  214 bits (545), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 124/299 (41%), Positives = 149/299 (49%), Gaps = 91/299 (30%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPA 68
           CG GCNGG+P  AW +WV  G+VSGG Y S       +  I  +     + V  D+  P 
Sbjct: 144 CGMGCNGGYPSSAWNFWVSDGLVSGGLYDSH------IGRIQVSLCVLLLAVDRDFVSP- 196

Query: 69  NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVN 128
                                                        GCRPY I PCEHHVN
Sbjct: 197 ---------------------------------------------GCRPYTIPPCEHHVN 211

Query: 129 GTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEG 188
           G+RPSC    G TP+C+  C+  Y   YK+D +FG  SYSVSS E  I +EIY++GPVEG
Sbjct: 212 GSRPSCSGEGGDTPECIFRCEAGYSPSYKQDKHFGKTSYSVSSEEDEIKQEIYKNGPVEG 271

Query: 189 AFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKA 248
           AFTV++D +LYKSG +                                      + SG A
Sbjct: 272 AFTVYEDFVLYKSGVY-------------------------------------QHVSGSA 294

Query: 249 LGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPK 307
           LGGHAI++LGWGE+  +   YWL ANSWNTDWGDNG FKILRG D CGIES I AG PK
Sbjct: 295 LGGHAIKMLGWGEE--NGVPYWLCANSWNTDWGDNGFFKILRGADHCGIESEIVAGNPK 351



 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 33/71 (46%), Positives = 45/71 (63%), Gaps = 5/71 (7%)

Query: 44  NSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREI 103
           ++  N+  +++K   G      L   +LP +I Y+  D  LP  FDSR +WPNCPT++EI
Sbjct: 44  HNFHNVDYSYVKKLCGTL----LKGPKLPLMIRYAG-DIKLPKEFDSREQWPNCPTLKEI 98

Query: 104 RDQGSCGSCWG 114
           RDQGSCGSCW 
Sbjct: 99  RDQGSCGSCWA 109


>gi|125981197|ref|XP_001354605.1| GA10694 [Drosophila pseudoobscura pseudoobscura]
 gi|54642915|gb|EAL31659.1| GA10694 [Drosophila pseudoobscura pseudoobscura]
          Length = 338

 Score =  213 bits (543), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 115/210 (54%), Positives = 131/210 (62%), Gaps = 39/210 (18%)

Query: 99  TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
           T + I   G  GS  GCRPYEIAPCEHHVNGTRP C  S G TP C  +CQ +Y V Y K
Sbjct: 168 TRKGIVSGGPYGSTQGCRPYEIAPCEHHVNGTRPPC--SHGSTPSCQHKCQASYSVEYAK 225

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
           D NFG+KSYSV  N   I +EI  +GPVEGAFTV++DLILYKSG +              
Sbjct: 226 DKNFGSKSYSVRRNVAEIQQEIMTNGPVEGAFTVYEDLILYKSGVY-------------- 271

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
                                   ++ GK LGGHAIRILGWG   +SK  YWLI NSWNT
Sbjct: 272 -----------------------QHEHGKELGGHAIRILGWGVWGESKVPYWLIGNSWNT 308

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPKL 308
           DWGDNG F+ILRG+D CGIESSI+AG+PKL
Sbjct: 309 DWGDNGFFRILRGQDHCGIESSISAGLPKL 338



 Score = 77.8 bits (190), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 36/77 (46%), Positives = 48/77 (62%), Gaps = 3/77 (3%)

Query: 40  QAEKNSLSNIPRAHLKSWMGVHPD---YNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
           Q  +N   ++   +++  MGVHPD   + LP  R+     Y++   D+P  FD+R  WPN
Sbjct: 39  QVGRNFKESVSEEYIRGLMGVHPDAHKFALPEKRIVLGDLYADDGVDIPEEFDARKAWPN 98

Query: 97  CPTIREIRDQGSCGSCW 113
           CPTI EIRDQGSCGSCW
Sbjct: 99  CPTIGEIRDQGSCGSCW 115



 Score = 61.2 bits (147), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 25/34 (73%), Positives = 27/34 (79%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
            +CGFGCNGGFPG AW YW + GIVSGG YGS Q
Sbjct: 149 HICGFGCNGGFPGAAWSYWTRKGIVSGGPYGSTQ 182


>gi|312374701|gb|EFR22198.1| hypothetical protein AND_15621 [Anopheles darlingi]
          Length = 335

 Score =  209 bits (533), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 108/202 (53%), Positives = 130/202 (64%), Gaps = 39/202 (19%)

Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
           G  GS  GC+PY IAPCEHHVNGTRPSC+   G TPKCV++CQ++Y VPY KD  +G+KS
Sbjct: 173 GPFGSNLGCQPYAIAPCEHHVNGTRPSCEGEGGKTPKCVKKCQDSYTVPYAKDKRYGSKS 232

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           YS+  +E  I KEI  +GPVEGAFTV++DL+ YK G +                      
Sbjct: 233 YSIPRHEDQIRKEIMTNGPVEGAFTVYEDLLHYKEGVY---------------------- 270

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
                           + +GK LGGHAIRILGWG +  +  KYWLIANSWN+DWGDNG F
Sbjct: 271 ---------------QHVTGKMLGGHAIRILGWGVENNT--KYWLIANSWNSDWGDNGFF 313

Query: 287 KILRGKDECGIESSITAGVPKL 308
           KILRG+D  GIESSI AG+PKL
Sbjct: 314 KILRGEDHLGIESSIAAGLPKL 335



 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 23/30 (76%), Positives = 25/30 (83%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CGFGCNGGFPG AW YWV  G+VSGG +GS
Sbjct: 148 CGFGCNGGFPGAAWSYWVHKGLVSGGPFGS 177


>gi|195438776|ref|XP_002067308.1| GK16352 [Drosophila willistoni]
 gi|194163393|gb|EDW78294.1| GK16352 [Drosophila willistoni]
          Length = 340

 Score =  209 bits (532), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 114/210 (54%), Positives = 131/210 (62%), Gaps = 39/210 (18%)

Query: 99  TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
           T + I   G+ GS  GCRPYEI PCEHHVNGTRP C  S G TP+C   C+ +Y V YKK
Sbjct: 170 TRKGIVSGGNFGSQQGCRPYEIEPCEHHVNGTRPPC--SSGSTPRCQHVCESSYKVDYKK 227

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
           D NFG+KSYS+ +N   I KEI  +GPVEGAFTV++DLILYKSG +              
Sbjct: 228 DKNFGSKSYSIKNNVLDIQKEIMNNGPVEGAFTVYEDLILYKSGVY-------------- 273

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
                                   +  GK LGGHAIRILGWG     K  YWLIANSWNT
Sbjct: 274 -----------------------EHVHGKELGGHAIRILGWGVWGDEKIPYWLIANSWNT 310

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPKL 308
           DWGDNG F+I+RGKD CGIESSI+AG+PKL
Sbjct: 311 DWGDNGFFRIVRGKDHCGIESSISAGLPKL 340



 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 36/77 (46%), Positives = 48/77 (62%), Gaps = 7/77 (9%)

Query: 43  KNSLSNIPRAHLKSWMGVHPD---YNLPANRLPELIG--YSEVDEDLPANFDSRTKWPNC 97
           +N   ++   +++  MGVHPD   + LP     E++G    + D D+P  FD+R KW NC
Sbjct: 44  RNFHESVSEKYIRGLMGVHPDADKFALPDKM--EVLGKLVEDSDSDIPTEFDAREKWSNC 101

Query: 98  PTIREIRDQGSCGSCWG 114
           PTI EIRDQGSCGSCW 
Sbjct: 102 PTIGEIRDQGSCGSCWA 118



 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 24/33 (72%), Positives = 27/33 (81%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
           CGFGCNGGFPG AW YW + GIVSGG +GS+Q 
Sbjct: 153 CGFGCNGGFPGAAWSYWTRKGIVSGGNFGSQQG 185


>gi|195058549|ref|XP_001995463.1| GH17748 [Drosophila grimshawi]
 gi|193896249|gb|EDV95115.1| GH17748 [Drosophila grimshawi]
          Length = 340

 Score =  208 bits (530), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 132/334 (39%), Positives = 166/334 (49%), Gaps = 107/334 (32%)

Query: 43  KNSLSNIPRAHLKSWMGVHPD---YNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPT 99
           +N  + +   H+++ MGVHPD   + LP  R  EL+G    D+DLP  FDS   WPNCPT
Sbjct: 46  RNFDAAVSEHHIRALMGVHPDSHKFTLPEKR--ELLGADGEDKDLPEEFDSSKNWPNCPT 103

Query: 100 IREIRDQGSCGSCWGCRPYE----------------------IAPCEHH----VNGTRP- 132
           IREIRDQGSCGSCW     E                      +  C H      NG  P 
Sbjct: 104 IREIRDQGSCGSCWAFGAVEAMSDRVCIHSNATVNFHFSADDLVTCCHTCGFGCNGGFPG 163

Query: 133 ---------------SCDASKGHTPKCVRECQENYDVP---------------------- 155
                          S ++++G  P  V  C+ + D P                      
Sbjct: 164 AAWSYWTTRGIVSGGSYNSTEGCRPYEVEPCEHHVDGPRPPCHSGSTPHCKHQCQPNYSV 223

Query: 156 -YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAM 214
            Y+KD +FGA SYS++ N ++I +EI  +GPVEGAFTV++DLILYK+G +          
Sbjct: 224 DYEKDKHFGASSYSINRNPRNIQREIMTNGPVEGAFTVYEDLILYKTGVY---------- 273

Query: 215 SLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIAN 274
                                       +  GK LGGHAIRI+GWG   +SK  YWLIAN
Sbjct: 274 ---------------------------QHVHGKQLGGHAIRIIGWGVWGESKVPYWLIAN 306

Query: 275 SWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           SWNTDWGDNG F+ILRGKD CGIES I+AG+PKL
Sbjct: 307 SWNTDWGDNGFFRILRGKDHCGIESQISAGLPKL 340



 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 23/32 (71%), Positives = 25/32 (78%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CGFGCNGGFPG AW YW   GIVSGG+Y S +
Sbjct: 153 CGFGCNGGFPGAAWSYWTTRGIVSGGSYNSTE 184


>gi|14141821|gb|AAK07477.2|AF329480_1 probable cathepsin B-like cysteine proteinase precursor [Glossina
           morsitans morsitans]
 gi|289743431|gb|ADD20463.1| putative cathepsin B-like cysteine proteinase precursor [Glossina
           morsitans morsitans]
          Length = 340

 Score =  207 bits (528), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 108/206 (52%), Positives = 131/206 (63%), Gaps = 39/206 (18%)

Query: 103 IRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
           I   G  GS  GCRPYEIAPCEHHVNGTRP C+   G TP+C  +CQ +Y V YK D +F
Sbjct: 174 IVSGGPYGSSQGCRPYEIAPCEHHVNGTRPPCEKEYGKTPRCQHKCQASYKVDYKTDKHF 233

Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
           G+++YS+S N   I +EI  HGPVEGAFTV++DLILYK G                    
Sbjct: 234 GSRAYSISKNVHDIQEEIMTHGPVEGAFTVYEDLILYKDG-------------------- 273

Query: 223 DNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
                        V++ +     GK LGGHAIRI+GWG ++     YWL+ANSWNTDWG+
Sbjct: 274 -------------VYEHV----HGKELGGHAIRIIGWGVEKDI--PYWLVANSWNTDWGN 314

Query: 283 NGLFKILRGKDECGIESSITAGVPKL 308
           NG FKILRGKD CGIESSI+AG+PK+
Sbjct: 315 NGFFKILRGKDHCGIESSISAGLPKI 340



 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 26/32 (81%), Positives = 27/32 (84%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CGFGCNGGFPG AW YWV+ GIVSGG YGS Q
Sbjct: 153 CGFGCNGGFPGAAWSYWVRKGIVSGGPYGSSQ 184


>gi|91078958|ref|XP_974220.1| PREDICTED: similar to cathepsin b [Tribolium castaneum]
 gi|270004841|gb|EFA01289.1| cathepsin B precursor [Tribolium castaneum]
          Length = 334

 Score =  207 bits (527), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 107/205 (52%), Positives = 129/205 (62%), Gaps = 39/205 (19%)

Query: 103 IRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
           I   GS GS  GCRPYEIAPCEHHVNGTRP C      TP C ++C++ Y+VPYKKD NF
Sbjct: 166 IVSGGSFGSNQGCRPYEIAPCEHHVNGTRPPCTGDDNKTPSCKQQCEKGYNVPYKKDKNF 225

Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
           G ++YS+SS  + I KEI  +GPVEGAF V++DL+ YK G +                  
Sbjct: 226 GKEAYSISSEVQQIQKEIMTNGPVEGAFEVYEDLLSYKKGVY------------------ 267

Query: 223 DNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
                               +  G+ALGGHAIRILGWG ++ +   YWLIANSWN+DWGD
Sbjct: 268 -------------------QHVKGEALGGHAIRILGWGTEKGT--PYWLIANSWNSDWGD 306

Query: 283 NGLFKILRGKDECGIESSITAGVPK 307
           NG FKILRG+D CGIESSI AG+PK
Sbjct: 307 NGTFKILRGEDHCGIESSIVAGIPK 331



 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 24/32 (75%), Positives = 26/32 (81%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CG GCNGGFPG AW YWV  GIVSGG++GS Q
Sbjct: 145 CGMGCNGGFPGAAWHYWVNKGIVSGGSFGSNQ 176


>gi|289743429|gb|ADD20462.1| putative cathepsin B-like cysteine proteinase precursor [Glossina
           morsitans morsitans]
          Length = 340

 Score =  206 bits (525), Expect = 8e-51,   Method: Compositional matrix adjust.
 Identities = 108/206 (52%), Positives = 132/206 (64%), Gaps = 39/206 (18%)

Query: 103 IRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
           I   G  GS  GCRPYEIAPCEHHVNGTRP C+   G TP+C  +CQ +Y V YK D +F
Sbjct: 174 IVSGGPYGSSQGCRPYEIAPCEHHVNGTRPPCEKEYGKTPRCQHKCQASYKVDYKTDKHF 233

Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
           G+++YS+S N + I  EI  +GPVEGAFTV++DLILYK G                    
Sbjct: 234 GSRAYSISKNVRDIQGEIMTNGPVEGAFTVYEDLILYKDG-------------------- 273

Query: 223 DNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
                        V++ +     GK LGGHAIRI+GWG ++ +   YWLIANSWNTDWG+
Sbjct: 274 -------------VYEHV----HGKELGGHAIRIIGWGVEKDT--PYWLIANSWNTDWGN 314

Query: 283 NGLFKILRGKDECGIESSITAGVPKL 308
           NG FKILRGKD CGIESSI+AG+PK+
Sbjct: 315 NGFFKILRGKDHCGIESSISAGLPKI 340



 Score = 61.2 bits (147), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 26/33 (78%), Positives = 27/33 (81%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
           CGFGCNGGFPG AW YWV+ GIVSGG YGS Q 
Sbjct: 153 CGFGCNGGFPGAAWGYWVRKGIVSGGPYGSSQG 185


>gi|157167366|ref|XP_001653890.1| cathepsin b [Aedes aegypti]
 gi|54289254|gb|AAV31917.1| lysosomal cathepsin B [Aedes aegypti]
 gi|108874249|gb|EAT38474.1| AAEL009637-PA [Aedes aegypti]
          Length = 340

 Score =  206 bits (525), Expect = 8e-51,   Method: Compositional matrix adjust.
 Identities = 106/202 (52%), Positives = 131/202 (64%), Gaps = 39/202 (19%)

Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
           G  GS  GC+PY IAPCEHHVNG+RPSC+   G TPKCV++CQ +Y+VPY KD  +G  S
Sbjct: 178 GPFGSDQGCQPYAIAPCEHHVNGSRPSCEGEGGKTPKCVKKCQASYNVPYAKDKMYGKSS 237

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           YS++++EK I KEI  +GPVEGAFTV++DL+ YK G +                      
Sbjct: 238 YSIANHEKQIQKEIMTNGPVEGAFTVYEDLLNYKEGVYH--------------------- 276

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
                           +  GK LGGHAIRILGWG ++ +  KYWLIANSWN+DWGDNG F
Sbjct: 277 ----------------HVHGKMLGGHAIRILGWGVEDGT--KYWLIANSWNSDWGDNGFF 318

Query: 287 KILRGKDECGIESSITAGVPKL 308
           KILRG+D  GIESSI AG+PK+
Sbjct: 319 KILRGEDHLGIESSIAAGLPKV 340



 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 24/32 (75%), Positives = 27/32 (84%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CGFGCNGGFPG AW YWV+ G+VSGG +GS Q
Sbjct: 153 CGFGCNGGFPGAAWSYWVRKGLVSGGPFGSDQ 184


>gi|18921171|ref|NP_572920.1| cathepsin B1, isoform A [Drosophila melanogaster]
 gi|7292926|gb|AAF48317.1| cathepsin B1, isoform A [Drosophila melanogaster]
 gi|16767940|gb|AAL28188.1| GH06546p [Drosophila melanogaster]
 gi|220944992|gb|ACL85039.1| CG10992-PA [synthetic construct]
 gi|220954816|gb|ACL89951.1| CG10992-PA [synthetic construct]
          Length = 340

 Score =  205 bits (521), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 111/210 (52%), Positives = 129/210 (61%), Gaps = 38/210 (18%)

Query: 99  TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
           T + I   G  GS  GCRPYEI+PCEHHVNGTRP C A  G TPKC   CQ  Y V Y K
Sbjct: 169 TRKGIVSGGPYGSNQGCRPYEISPCEHHVNGTRPPC-AHGGRTPKCSHVCQSGYTVDYAK 227

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
           D +FG+KSYSV  N + I +EI  +GPVEGAFTV++DLILYK G +              
Sbjct: 228 DKHFGSKSYSVRRNVREIQEEIMTNGPVEGAFTVYEDLILYKDGVY-------------- 273

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
                                   ++ GK LGGHAIRILGWG   + K  YWLI NSWNT
Sbjct: 274 -----------------------QHEHGKELGGHAIRILGWGVWGEEKIPYWLIGNSWNT 310

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPKL 308
           DWGD+G F+ILRG+D CGIESSI+AG+PKL
Sbjct: 311 DWGDHGFFRILRGQDHCGIESSISAGLPKL 340



 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 25/33 (75%), Positives = 26/33 (78%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
           CGFGCNGGFPG AW YW + GIVSGG YGS Q 
Sbjct: 152 CGFGCNGGFPGAAWSYWTRKGIVSGGPYGSNQG 184


>gi|442616292|ref|NP_001259536.1| cathepsin B1, isoform B [Drosophila melanogaster]
 gi|440216755|gb|AGB95378.1| cathepsin B1, isoform B [Drosophila melanogaster]
          Length = 330

 Score =  205 bits (521), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 111/210 (52%), Positives = 129/210 (61%), Gaps = 38/210 (18%)

Query: 99  TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
           T + I   G  GS  GCRPYEI+PCEHHVNGTRP C A  G TPKC   CQ  Y V Y K
Sbjct: 159 TRKGIVSGGPYGSNQGCRPYEISPCEHHVNGTRPPC-AHGGRTPKCSHVCQSGYTVDYAK 217

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
           D +FG+KSYSV  N + I +EI  +GPVEGAFTV++DLILYK G +              
Sbjct: 218 DKHFGSKSYSVRRNVREIQEEIMTNGPVEGAFTVYEDLILYKDGVY-------------- 263

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
                                   ++ GK LGGHAIRILGWG   + K  YWLI NSWNT
Sbjct: 264 -----------------------QHEHGKELGGHAIRILGWGVWGEEKIPYWLIGNSWNT 300

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPKL 308
           DWGD+G F+ILRG+D CGIESSI+AG+PKL
Sbjct: 301 DWGDHGFFRILRGQDHCGIESSISAGLPKL 330



 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 25/33 (75%), Positives = 26/33 (78%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
           CGFGCNGGFPG AW YW + GIVSGG YGS Q 
Sbjct: 142 CGFGCNGGFPGAAWSYWTRKGIVSGGPYGSNQG 174


>gi|194895314|ref|XP_001978227.1| GG19486 [Drosophila erecta]
 gi|190649876|gb|EDV47154.1| GG19486 [Drosophila erecta]
          Length = 340

 Score =  204 bits (520), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 111/210 (52%), Positives = 128/210 (60%), Gaps = 38/210 (18%)

Query: 99  TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
           T + I   G  GS  GCRPYEIAPCEHHVNGTRP C    G TPKC   C+  Y V Y K
Sbjct: 169 TRKGIVSGGPYGSNQGCRPYEIAPCEHHVNGTRPPCGHGGG-TPKCSHVCESGYTVDYAK 227

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
           D +FG+KSYSV  N + I +EI  +GPVEGAFTV++DLILYK G +              
Sbjct: 228 DKHFGSKSYSVKRNVRDIQEEIMTNGPVEGAFTVYEDLILYKDGVY-------------- 273

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
                                   ++ GK LGGHAIRILGWG   + K  YWLI NSWNT
Sbjct: 274 -----------------------QHQHGKELGGHAIRILGWGVWGEEKIPYWLIGNSWNT 310

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPKL 308
           DWGDNG F+ILRG+D CGIESSI+AG+PKL
Sbjct: 311 DWGDNGFFRILRGQDHCGIESSISAGLPKL 340



 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 25/33 (75%), Positives = 26/33 (78%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
           CGFGCNGGFPG AW YW + GIVSGG YGS Q 
Sbjct: 152 CGFGCNGGFPGAAWSYWTRKGIVSGGPYGSNQG 184


>gi|195478432|ref|XP_002100515.1| GE16138 [Drosophila yakuba]
 gi|194188039|gb|EDX01623.1| GE16138 [Drosophila yakuba]
          Length = 340

 Score =  204 bits (519), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 111/210 (52%), Positives = 128/210 (60%), Gaps = 38/210 (18%)

Query: 99  TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
           T + I   G  GS  GCRPYEI+PCEHHVNGTRP C A  G TPKC   CQ +Y V Y K
Sbjct: 169 TRKGIVSGGPYGSNQGCRPYEISPCEHHVNGTRPPC-AHGGATPKCSHVCQSSYTVDYAK 227

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
           D +FG+KSYSV  N + I +EI  +GPVEGAFTV++DLILYK G +              
Sbjct: 228 DKHFGSKSYSVRRNVRDIQEEIMTNGPVEGAFTVYEDLILYKDGVY-------------- 273

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
                                   ++ GK LGGHAIRILGWG     K  YWLI NSWNT
Sbjct: 274 -----------------------QHEHGKELGGHAIRILGWGVWGDEKIPYWLIGNSWNT 310

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPKL 308
           DWGD G F+ILRG+D CGIESSI+AG+PKL
Sbjct: 311 DWGDQGFFRILRGQDHCGIESSISAGLPKL 340



 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 25/32 (78%), Positives = 26/32 (81%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CGFGCNGGFPG AW YW + GIVSGG YGS Q
Sbjct: 152 CGFGCNGGFPGAAWSYWTRKGIVSGGPYGSNQ 183


>gi|195566634|ref|XP_002106884.1| GD15875 [Drosophila simulans]
 gi|194204277|gb|EDX17853.1| GD15875 [Drosophila simulans]
          Length = 340

 Score =  204 bits (518), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 111/210 (52%), Positives = 129/210 (61%), Gaps = 38/210 (18%)

Query: 99  TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
           T + I   G  GS  GCRPYEI+PCEHHVNGTRP C A  G TPKC   CQ +Y V Y K
Sbjct: 169 TRKGIVSGGPYGSNQGCRPYEISPCEHHVNGTRPPC-AHGGGTPKCSHVCQSSYTVDYAK 227

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
           D +FG+KSYSV  N + I +EI  +GPVEGAFTV++DLILYK G +              
Sbjct: 228 DKHFGSKSYSVKRNVREIQEEIMTNGPVEGAFTVYEDLILYKDGVY-------------- 273

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
                                   ++ GK LGGHAIRILGWG     K  YWLI NSWNT
Sbjct: 274 -----------------------QHEHGKELGGHAIRILGWGVWGDEKIPYWLIGNSWNT 310

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPKL 308
           DWGD+G F+ILRG+D CGIESSI+AG+PKL
Sbjct: 311 DWGDHGFFRILRGQDHCGIESSISAGLPKL 340



 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 25/33 (75%), Positives = 26/33 (78%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
           CGFGCNGGFPG AW YW + GIVSGG YGS Q 
Sbjct: 152 CGFGCNGGFPGAAWSYWTRKGIVSGGPYGSNQG 184


>gi|195393194|ref|XP_002055239.1| GJ19262 [Drosophila virilis]
 gi|194149749|gb|EDW65440.1| GJ19262 [Drosophila virilis]
          Length = 338

 Score =  203 bits (517), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 107/210 (50%), Positives = 131/210 (62%), Gaps = 39/210 (18%)

Query: 99  TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
           T + I   GS GS  GCRPYE+ PCEHHVNGTRP C +  G TP+C+ +C+  Y V Y K
Sbjct: 168 THKGIVSGGSYGSKEGCRPYEVEPCEHHVNGTRPPCHS--GSTPRCMHKCESGYSVDYAK 225

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
           D +FGAK+YSV+ N   I +EI  +GPVEGAFTV++DLILYK+G +              
Sbjct: 226 DKHFGAKAYSVNRNPLDIQREIMTNGPVEGAFTVYEDLILYKTGVY-------------- 271

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
                                   +  G+ LGGHAIRILGWG    +K  YWLI NSWNT
Sbjct: 272 -----------------------QHVHGRQLGGHAIRILGWGVWGDNKVPYWLIGNSWNT 308

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPKL 308
           DWGDNG F+ILRG+D CGIES+I+AG+PKL
Sbjct: 309 DWGDNGFFRILRGEDHCGIESAISAGLPKL 338



 Score = 74.3 bits (181), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 38/75 (50%), Positives = 48/75 (64%), Gaps = 5/75 (6%)

Query: 43  KNSLSNIPRAHLKSWMGVHPD---YNLPANRLPELIGYSEVDE-DLPANFDSRTKWPNCP 98
           +N  +++   H++  MGVHPD   + LP  +   L    E D  DLP  FD+RT WP+CP
Sbjct: 42  RNFDASVSEHHIRGLMGVHPDAHKFTLP-EKSQVLGNLMEADGGDLPEEFDARTAWPDCP 100

Query: 99  TIREIRDQGSCGSCW 113
           TI EIRDQGSCGSCW
Sbjct: 101 TIGEIRDQGSCGSCW 115



 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 25/32 (78%), Positives = 27/32 (84%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CGFGCNGGFPG AW YW   GIVSGG+YGSK+
Sbjct: 151 CGFGCNGGFPGAAWSYWTHKGIVSGGSYGSKE 182


>gi|269146930|gb|ACZ28411.1| cathepsin b [Simulium nigrimanum]
          Length = 168

 Score =  203 bits (516), Expect = 8e-50,   Method: Compositional matrix adjust.
 Identities = 103/201 (51%), Positives = 127/201 (63%), Gaps = 39/201 (19%)

Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
           G  GS  GC PY+IAPCEHHVNGTRP+C+  +G TPKC++ CQ +Y V Y++D ++GAKS
Sbjct: 7   GPFGSNQGCHPYKIAPCEHHVNGTRPACNGEEGKTPKCIKHCQASYTVAYEQDKSYGAKS 66

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           YSV  +   I KEI  +GPVEGAFTV++DL+ YK G +                      
Sbjct: 67  YSVPHHVAQIQKEIMTNGPVEGAFTVYEDLVQYKDGVY---------------------- 104

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
                           + +GK LGGHAIRILGWG +  +   YWLIANSWNTDWG+NG F
Sbjct: 105 ---------------QHVTGKMLGGHAIRILGWGVE--NDVPYWLIANSWNTDWGNNGFF 147

Query: 287 KILRGKDECGIESSITAGVPK 307
           KILRG D CGIES I+AG+PK
Sbjct: 148 KILRGSDHCGIESQISAGIPK 168


>gi|195352458|ref|XP_002042729.1| GM17589 [Drosophila sechellia]
 gi|194126760|gb|EDW48803.1| GM17589 [Drosophila sechellia]
          Length = 340

 Score =  203 bits (516), Expect = 9e-50,   Method: Compositional matrix adjust.
 Identities = 110/210 (52%), Positives = 128/210 (60%), Gaps = 38/210 (18%)

Query: 99  TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
           T + I   G  GS  GCRPYEI+PCEHHVNGTRP C    G TPKC   CQ +Y V Y K
Sbjct: 169 TRKGIVSGGPYGSNQGCRPYEISPCEHHVNGTRPPCANGSG-TPKCSHVCQSSYTVDYAK 227

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
           D +FG+KSYSV  N + I +EI  +GPVEGAFTV++DLILYK G +              
Sbjct: 228 DKHFGSKSYSVKRNVREIQEEIMTNGPVEGAFTVYEDLILYKDGVY-------------- 273

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
                                   ++ GK LGGHAIRILGWG     K  YWLI NSWNT
Sbjct: 274 -----------------------QHEHGKELGGHAIRILGWGVWGNEKIPYWLIGNSWNT 310

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPKL 308
           DWGD+G F+ILRG+D CGIESSI+AG+PKL
Sbjct: 311 DWGDHGFFRILRGQDHCGIESSISAGLPKL 340



 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 25/33 (75%), Positives = 26/33 (78%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
           CGFGCNGGFPG AW YW + GIVSGG YGS Q 
Sbjct: 152 CGFGCNGGFPGAAWSYWTRKGIVSGGPYGSNQG 184


>gi|185135431|ref|NP_001117776.1| procathepsin B precursor [Oncorhynchus mykiss]
 gi|14582897|gb|AAK69705.1|AF358667_1 procathepsin B [Oncorhynchus mykiss]
          Length = 330

 Score =  201 bits (511), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 102/199 (51%), Positives = 125/199 (62%), Gaps = 40/199 (20%)

Query: 110 GSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSV 169
           GS  GCRPY IAPCEHHVNGTRP C   +G TPKCV EC   Y   YKKD  FG ++YSV
Sbjct: 172 GSNIGCRPYSIAPCEHHVNGTRPPC-TGEGDTPKCVSECNAGYTPSYKKDKRFGKQTYSV 230

Query: 170 SSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLG 229
              E+ IM E+Y++GPVE AF+V++D +LYK+G +                         
Sbjct: 231 PPKEQQIMTELYKNGPVEAAFSVYEDFLLYKTGVY------------------------- 265

Query: 230 AEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKIL 289
                        + +G+ LGGHAI+ILGWG++  +   YWL+ANSWNTDWGDNG FKIL
Sbjct: 266 ------------QHVTGQMLGGHAIKILGWGKENNT--PYWLVANSWNTDWGDNGFFKIL 311

Query: 290 RGKDECGIESSITAGVPKL 308
           RGKDECGIES I AG+P+L
Sbjct: 312 RGKDECGIESEIVAGIPRL 330



 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 20/30 (66%), Positives = 23/30 (76%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GC GGFP  AW YW +SG+V+GG YGS
Sbjct: 144 CGMGCMGGFPSAAWDYWAESGLVTGGLYGS 173


>gi|45822203|emb|CAE47498.1| cathepsin B-like proteinase [Diabrotica virgifera virgifera]
          Length = 328

 Score =  201 bits (510), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 100/201 (49%), Positives = 126/201 (62%), Gaps = 39/201 (19%)

Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
           G  G+  GCRPYEI PCEHH NG+RP+CDAS+G+TPKC + C+ NY + Y  DL+FG+K+
Sbjct: 167 GQYGTKQGCRPYEIPPCEHHTNGSRPACDASEGNTPKCAKSCESNYKINYSNDLHFGSKA 226

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           YS+SS+ K I  EI ++GPVEGAF+V+ D + YK+G +                      
Sbjct: 227 YSISSDVKQIQAEILQNGPVEGAFSVYADFVNYKTGVY---------------------- 264

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
                           +  G+ LGGHAIRI GWG +  +   YWLIANSWNTDWGD+G F
Sbjct: 265 ---------------QHIKGQFLGGHAIRIFGWGVENNT--PYWLIANSWNTDWGDSGTF 307

Query: 287 KILRGKDECGIESSITAGVPK 307
           KILRG D CGIES I AG+PK
Sbjct: 308 KILRGSDHCGIESGIVAGLPK 328



 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 33/74 (44%), Positives = 48/74 (64%), Gaps = 3/74 (4%)

Query: 41  AEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTI 100
           A +N   +    ++   MGV PD+    N +P ++ +     ++PA+FD+R +WP+CPTI
Sbjct: 37  AGRNFAQDKSMDYIIKLMGVLPDHK---NYMPPVLTHKLEALEIPADFDARQQWPHCPTI 93

Query: 101 REIRDQGSCGSCWG 114
           REIRDQGSCGSCW 
Sbjct: 94  REIRDQGSCGSCWA 107



 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 23/32 (71%), Positives = 27/32 (84%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CG GCNGG+PG AW YWV+ G+VSGG YG+KQ
Sbjct: 142 CGMGCNGGYPGAAWHYWVRKGLVSGGQYGTKQ 173


>gi|194766882|ref|XP_001965553.1| GF22391 [Drosophila ananassae]
 gi|190619544|gb|EDV35068.1| GF22391 [Drosophila ananassae]
          Length = 342

 Score =  200 bits (509), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 109/210 (51%), Positives = 129/210 (61%), Gaps = 38/210 (18%)

Query: 99  TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
           T + I   G  GS  GCRPYEIAPCEHHVNGTR  C+     TPKC  +C+  Y+V Y K
Sbjct: 170 TRKGIVSGGRYGSKTGCRPYEIAPCEHHVNGTRAPCNHDS-KTPKCQHQCEAGYNVEYSK 228

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
           D +FG+KSYSV  N + I +EI  +GPVEGAFTV++DLILYKSG +              
Sbjct: 229 DKHFGSKSYSVRRNVRDIQEEIMTNGPVEGAFTVYEDLILYKSGVY-------------- 274

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
                                   ++ GK LGGHAIRILGWG   K +  YWLIANSWN 
Sbjct: 275 -----------------------QHEHGKELGGHAIRILGWGVWGKEEVPYWLIANSWND 311

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPKL 308
           DWGD G F+ILRG+D CGIESSI+AG+PKL
Sbjct: 312 DWGDKGFFRILRGEDHCGIESSISAGLPKL 341



 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 39/77 (50%), Positives = 50/77 (64%), Gaps = 2/77 (2%)

Query: 40  QAEKNSLSNIPRAHLKSWMGVHPD-YNLPANRLPELIGY-SEVDEDLPANFDSRTKWPNC 97
           QA +N    +   +++  MGVHPD Y        E++GY S+  +D+P  FD+R KWPNC
Sbjct: 42  QAGRNFDEGVSEEYIRGLMGVHPDAYKFALPDKQEVLGYLSQKVDDIPKEFDAREKWPNC 101

Query: 98  PTIREIRDQGSCGSCWG 114
           PTI EIRDQGSCGSCW 
Sbjct: 102 PTINEIRDQGSCGSCWA 118



 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 25/31 (80%), Positives = 26/31 (83%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
           CGFGCNGGFPG AW YW + GIVSGG YGSK
Sbjct: 153 CGFGCNGGFPGAAWSYWTRKGIVSGGRYGSK 183


>gi|195130519|ref|XP_002009699.1| GI15503 [Drosophila mojavensis]
 gi|193908149|gb|EDW07016.1| GI15503 [Drosophila mojavensis]
          Length = 342

 Score =  199 bits (507), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 107/210 (50%), Positives = 127/210 (60%), Gaps = 39/210 (18%)

Query: 99  TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
           T + I   GS  S  GCRPYEI PCEHHVNGTRP C    G TP C  +C+ +Y V Y K
Sbjct: 172 THKGIVSGGSYNSNEGCRPYEIEPCEHHVNGTRPPC--KNGRTPSCKHQCESSYSVDYAK 229

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
           D +FG+KSYS+  N + I +EI  +GPVEGAFTV++DLILYKSG +              
Sbjct: 230 DKHFGSKSYSIRRNPREIQREIMTNGPVEGAFTVYEDLILYKSGVY-------------- 275

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
                                   +  GK LGGHAIRILGWG    SK  YWLI NSWNT
Sbjct: 276 -----------------------KHVHGKELGGHAIRILGWGVWGDSKVPYWLIGNSWNT 312

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPKL 308
           DWGDNG F+I+RG+D CGIES+I+AG+P L
Sbjct: 313 DWGDNGFFRIVRGEDHCGIESAISAGLPAL 342



 Score = 79.0 bits (193), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 39/76 (51%), Positives = 52/76 (68%), Gaps = 7/76 (9%)

Query: 43  KNSLSNIPRAHLKSWMGVHPD---YNLP--ANRLPELIGYSEVDEDLPANFDSRTKWPNC 97
           +N  +++   H++  MGVHPD   + LP  +  L  L+G  +  +DLP +FD+RT WPNC
Sbjct: 46  RNFDASVSEGHIRGLMGVHPDAHKFTLPEKSQVLGNLVG--DDGDDLPESFDARTAWPNC 103

Query: 98  PTIREIRDQGSCGSCW 113
           PTI EIRDQGSCGSCW
Sbjct: 104 PTIGEIRDQGSCGSCW 119



 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 23/33 (69%), Positives = 25/33 (75%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
           CGFGCNGGFPG AW YW   GIVSGG+Y S + 
Sbjct: 155 CGFGCNGGFPGAAWSYWTHKGIVSGGSYNSNEG 187


>gi|351695295|gb|EHA98213.1| Cathepsin B [Heterocephalus glaber]
          Length = 340

 Score =  197 bits (502), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 101/196 (51%), Positives = 122/196 (62%), Gaps = 39/196 (19%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNGTRP C    G TPKC + C+  Y   YK+D +FG  SYSVSSNE
Sbjct: 178 GCRPYSIPPCEHHVNGTRPQCTGEGGDTPKCSKTCEPGYSPSYKEDKHFGYDSYSVSSNE 237

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAFTVF D ++YK+G                               
Sbjct: 238 KEIMAEIYKNGPVEGAFTVFSDFLMYKTG------------------------------- 266

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
             V+  L    +G+ LGGHAIRILGWG++  +   YWL+ NSWN DWGD+G FKI+RG+D
Sbjct: 267 --VYKHL----AGEMLGGHAIRILGWGKE--NGVPYWLVGNSWNVDWGDSGFFKIVRGED 318

Query: 294 ECGIESSITAGVPKLD 309
            CGIES I AG+P+ D
Sbjct: 319 HCGIESEIVAGIPRTD 334



 Score = 48.5 bits (114), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 20/30 (66%), Positives = 23/30 (76%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGG+P  AW+YW K G+VSGG Y S
Sbjct: 146 CGDGCNGGYPSAAWKYWTKKGLVSGGLYDS 175


>gi|148229459|ref|NP_001079570.1| cathepsin B precursor [Xenopus laevis]
 gi|28277314|gb|AAH44689.1| MGC53360 protein [Xenopus laevis]
          Length = 333

 Score =  197 bits (500), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 100/194 (51%), Positives = 120/194 (61%), Gaps = 39/194 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP+C   +G TPKCV++C+E Y   Y  D +FG  SY V ++E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPACKGEEGDTPKCVKQCEEGYSPAYGTDKHFGTTSYGVPTSE 237

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAF V+ D  LYKSG +                             
Sbjct: 238 KEIMAEIYKNGPVEGAFLVYADFPLYKSGVY----------------------------- 268

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    +++G+ LGGHAI+ILGWG +  +   YWL ANSWNTDWGDNG FKILRGKD
Sbjct: 269 --------QHETGEELGGHAIKILGWGVENGT--PYWLCANSWNTDWGDNGFFKILRGKD 318

Query: 294 ECGIESSITAGVPK 307
            CGIES I AGVPK
Sbjct: 319 HCGIESEIVAGVPK 332



 Score = 47.0 bits (110), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 18/30 (60%), Positives = 24/30 (80%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGG+P  AW++W ++G+VSGG Y S
Sbjct: 146 CGMGCNGGYPSGAWQFWTETGLVSGGLYDS 175


>gi|348534156|ref|XP_003454569.1| PREDICTED: cathepsin B-like [Oreochromis niloticus]
          Length = 330

 Score =  197 bits (500), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 98/194 (50%), Positives = 121/194 (62%), Gaps = 39/194 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY IAPCEHHVNG+RP C    G TP+CVR+C+  Y   Y +D ++G  SYSV S+E
Sbjct: 176 GCRPYTIAPCEHHVNGSRPPCTGEGGDTPECVRQCESGYTPSYIQDKHYGKTSYSVPSDE 235

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           + I  EIY++GPVEGAFTV++D +LYK+G +                             
Sbjct: 236 QQIQTEIYKNGPVEGAFTVYEDFLLYKTGVY----------------------------- 266

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + SG A+GGHAI++LGWGE+  +   YWL ANSWNTDWGDNG FKILRG D
Sbjct: 267 --------QHVSGSAVGGHAIKVLGWGEENGT--PYWLCANSWNTDWGDNGYFKILRGSD 316

Query: 294 ECGIESSITAGVPK 307
            CGIES I AG+PK
Sbjct: 317 HCGIESEIVAGIPK 330



 Score = 45.8 bits (107), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 18/30 (60%), Positives = 21/30 (70%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGG+P  AW +W   G+VSGG Y S
Sbjct: 144 CGMGCNGGYPSAAWDFWASEGLVSGGLYES 173


>gi|170028910|ref|XP_001842337.1| cathepsin L [Culex quinquefasciatus]
 gi|167879387|gb|EDS42770.1| cathepsin L [Culex quinquefasciatus]
          Length = 334

 Score =  197 bits (500), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 103/202 (50%), Positives = 130/202 (64%), Gaps = 40/202 (19%)

Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
           G  GS  GC+PY I+PCEHHVNGTR  C+  +G TPKCV++CQ +Y+VPY KD  FG  S
Sbjct: 173 GPYGSDQGCQPYAISPCEHHVNGTRGPCNG-EGKTPKCVKKCQASYNVPYAKDKFFGKSS 231

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           YS++S+E+ I KE++ +GPVEGAFTV++DL+ YK G +                      
Sbjct: 232 YSIASHEQQIQKELFTNGPVEGAFTVYEDLLNYKEGVY---------------------- 269

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
                           + +GK LGGHAIRILGWG +  +  K+WLIANSWN+DWGDNG F
Sbjct: 270 ---------------QHTAGKMLGGHAIRILGWGVENDT--KFWLIANSWNSDWGDNGYF 312

Query: 287 KILRGKDECGIESSITAGVPKL 308
           KILRG D  GIESSI AG+PK+
Sbjct: 313 KILRGSDHLGIESSIAAGLPKV 334



 Score = 60.5 bits (145), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 25/33 (75%), Positives = 27/33 (81%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
           CGFGCNGGFPG AW YWV+ G+VSGG YGS Q 
Sbjct: 148 CGFGCNGGFPGAAWSYWVRKGLVSGGPYGSDQG 180


>gi|213514196|ref|NP_001133994.1| Cathepsin B precursor [Salmo salar]
 gi|209156086|gb|ACI34275.1| Cathepsin B precursor [Salmo salar]
          Length = 330

 Score =  196 bits (498), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 97/193 (50%), Positives = 122/193 (63%), Gaps = 39/193 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNGTRP C   +G TP+C  +C+  Y   YK+D +FG +SYSV S+E
Sbjct: 176 GCRPYSIPPCEHHVNGTRPPCKGEEGDTPQCTNQCEPGYTPGYKQDKHFGKRSYSVPSDE 235

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IMKE+Y++GPVEGAFTV++D +LYKSG +                             
Sbjct: 236 KEIMKELYKNGPVEGAFTVYEDFLLYKSGVY----------------------------- 266

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + SG A+GGHAI++LGWGE+      YWL ANSWNTDWG+NG FKI+RG+D
Sbjct: 267 --------RHVSGSAVGGHAIKVLGWGEE--GGIPYWLAANSWNTDWGENGFFKIVRGED 316

Query: 294 ECGIESSITAGVP 306
            CGIES + AG+P
Sbjct: 317 HCGIESEMVAGIP 329



 Score = 43.5 bits (101), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 18/30 (60%), Positives = 21/30 (70%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGG+P  A  +W K G+VSGG Y S
Sbjct: 144 CGMGCNGGYPSAACDFWTKEGLVSGGLYDS 173


>gi|45361295|ref|NP_989225.1| cathepsin B precursor [Xenopus (Silurana) tropicalis]
 gi|38969948|gb|AAH63365.1| hypothetical protein MGC75969 [Xenopus (Silurana) tropicalis]
          Length = 333

 Score =  196 bits (498), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 99/194 (51%), Positives = 121/194 (62%), Gaps = 39/194 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP+C   +G TPKCV++C++ Y   Y  D +FGA SY V S+E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPACKGEEGDTPKCVKQCEDGYAPVYGSDKHFGATSYGVPSSE 237

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAF V+ D  +YKSG +                             
Sbjct: 238 KEIMAEIYKNGPVEGAFLVYADFPMYKSGVY----------------------------- 268

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    +++G+ LGGHAI+ILGWG +  +   YWL ANSWNTDWGDNG FKILRGKD
Sbjct: 269 --------QHETGEELGGHAIKILGWGVENGT--PYWLCANSWNTDWGDNGFFKILRGKD 318

Query: 294 ECGIESSITAGVPK 307
            CGIES I AG+PK
Sbjct: 319 HCGIESEIVAGIPK 332



 Score = 47.0 bits (110), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 18/30 (60%), Positives = 24/30 (80%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGG+P  AW++W ++G+VSGG Y S
Sbjct: 146 CGMGCNGGYPSGAWKFWTETGLVSGGLYDS 175


>gi|148222779|ref|NP_001080410.1| uncharacterized protein LOC380102 precursor [Xenopus laevis]
 gi|28302291|gb|AAH46667.1| Cg10992 protein [Xenopus laevis]
          Length = 333

 Score =  196 bits (497), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 99/194 (51%), Positives = 120/194 (61%), Gaps = 39/194 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RPSC   +G TPKC++ C+E Y   Y  D +FGA SY V S+E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPSCKGEEGDTPKCMKTCEEGYTPAYGSDKHFGATSYGVPSSE 237

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM +IY++GPVEGAF V+ D  LYKSG +                             
Sbjct: 238 KEIMADIYKNGPVEGAFVVYADFPLYKSGVY----------------------------- 268

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    +++G+ LGGHAI+ILGWG +  +   YWL ANSWNTDWGDNG FKILRGKD
Sbjct: 269 --------QHETGEELGGHAIKILGWGVENGT--PYWLCANSWNTDWGDNGFFKILRGKD 318

Query: 294 ECGIESSITAGVPK 307
            CGIES + AG+PK
Sbjct: 319 HCGIESEVVAGIPK 332



 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 19/30 (63%), Positives = 24/30 (80%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGG+P  AWR+W ++G+VSGG Y S
Sbjct: 146 CGMGCNGGYPSGAWRFWTETGLVSGGLYDS 175


>gi|74179506|dbj|BAE44111.1| cathepsin B preproprotein [Cyprinus carpio]
          Length = 330

 Score =  194 bits (494), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 100/194 (51%), Positives = 119/194 (61%), Gaps = 39/194 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C    G TP C   C+  Y   YK+D +FG  SYSV SN+
Sbjct: 176 GCRPYTIEPCEHHVNGSRPPCTGEGGDTPNCDMSCEPGYSPSYKQDKHFGKTSYSVPSNQ 235

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IMKE+Y++GPVEGAFTV++D + YKSG +                             
Sbjct: 236 KDIMKELYKNGPVEGAFTVYEDFLSYKSGVY----------------------------- 266

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + SG ALGGHAI+ILGWGE+  +   YWL ANSWNTDWGDNG FKILRG+D
Sbjct: 267 --------QHVSGPALGGHAIKILGWGEE--NGVPYWLAANSWNTDWGDNGYFKILRGED 316

Query: 294 ECGIESSITAGVPK 307
            CGIES I AG+P+
Sbjct: 317 HCGIESEIVAGIPQ 330



 Score = 45.1 bits (105), Expect = 0.047,   Method: Compositional matrix adjust.
 Identities = 17/30 (56%), Positives = 21/30 (70%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGG+P  AW +W   G+V+GG Y S
Sbjct: 144 CGMGCNGGYPSAAWDFWSSDGLVTGGLYNS 173


>gi|160333103|ref|NP_001103948.1| capthepsin B, b precursor [Danio rerio]
 gi|133777414|gb|AAI15255.1| Ctsbb protein [Danio rerio]
          Length = 326

 Score =  194 bits (492), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 100/195 (51%), Positives = 123/195 (63%), Gaps = 40/195 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY IAPCEHHVNGTRP C   +  TPKC   C   Y VPYK+D +FG+K Y+V S++
Sbjct: 172 GCRPYSIAPCEHHVNGTRPPCSGEQ-DTPKCTGVCIPKYSVPYKQDKHFGSKVYNVPSDQ 230

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           + IM E+Y +GPVE AFTV++D  LYKSG                               
Sbjct: 231 QQIMTELYTNGPVEAAFTVYEDFPLYKSG------------------------------- 259

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
             V+  L    +G ALGGHA++ILGWGE+  +   +WL+ANSWN+DWGDNG FKILRG D
Sbjct: 260 --VYQHL----TGSALGGHAVKILGWGEENGT--PFWLVANSWNSDWGDNGYFKILRGHD 311

Query: 294 ECGIESSITAGVPKL 308
           ECGIES + AG+PKL
Sbjct: 312 ECGIESEMVAGLPKL 326



 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 20/30 (66%), Positives = 24/30 (80%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CGFGC+GGFP  AW YW +SG+V+GG Y S
Sbjct: 140 CGFGCSGGFPAEAWDYWRRSGLVTGGLYNS 169


>gi|9955277|pdb|1QDQ|A Chain A, X-Ray Crystal Structure Of Bovine Cathepsin B-Ca074
           Complex
          Length = 253

 Score =  193 bits (491), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 99/193 (51%), Positives = 121/193 (62%), Gaps = 40/193 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D +FG  SYSV++NE
Sbjct: 99  GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVANNE 157

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAF+V+ D +LYKSG +                             
Sbjct: 158 KEIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 188

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + SG+ +GGHAIRILGWG +  +   YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 189 --------QHVSGEIMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 238

Query: 294 ECGIESSITAGVP 306
            CGIES I AG+P
Sbjct: 239 HCGIESEIVAGMP 251



 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 24/30 (80%), Positives = 28/30 (93%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
           LP +FD+R +WPNCPTI+EIRDQGSCGSCW
Sbjct: 1   LPESFDAREQWPNCPTIKEIRDQGSCGSCW 30


>gi|354471594|ref|XP_003498026.1| PREDICTED: cathepsin B-like [Cricetulus griseus]
 gi|344254255|gb|EGW10359.1| Cathepsin B [Cricetulus griseus]
          Length = 339

 Score =  193 bits (490), Expect = 9e-47,   Method: Compositional matrix adjust.
 Identities = 99/196 (50%), Positives = 121/196 (61%), Gaps = 40/196 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D ++G  SYSVS+NE
Sbjct: 178 GCLPYTIPPCEHHVNGSRPQC-TGEGDTPKCTKSCEAGYSPSYKEDKHYGYTSYSVSNNE 236

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAFTVF D + YKSG +                             
Sbjct: 237 KEIMAEIYKNGPVEGAFTVFSDFLTYKSGVY----------------------------- 267

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    +++G  +GGHAIRILGWG +  +   YWL+ANSWN DWGDNGLFKILRG+D
Sbjct: 268 --------KHEAGDIMGGHAIRILGWGVE--NSVPYWLVANSWNVDWGDNGLFKILRGED 317

Query: 294 ECGIESSITAGVPKLD 309
            CGIES I AG+P+ D
Sbjct: 318 HCGIESEIVAGIPRTD 333



 Score = 47.4 bits (111), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 19/30 (63%), Positives = 23/30 (76%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGG+P  AW +W+K G+VSGG Y S
Sbjct: 146 CGDGCNGGYPSGAWNFWIKKGLVSGGLYNS 175


>gi|332244666|ref|XP_003271495.1| PREDICTED: cathepsin B [Nomascus leucogenys]
          Length = 351

 Score =  192 bits (489), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 97/196 (49%), Positives = 123/196 (62%), Gaps = 40/196 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D ++G  SYSVS++E
Sbjct: 190 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 248

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAF+V+ D +LYKSG +                             
Sbjct: 249 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 279

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G+ +GGHAIRILGWG +  +   YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 280 --------QHITGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 329

Query: 294 ECGIESSITAGVPKLD 309
            CGIES + AG+P+ D
Sbjct: 330 HCGIESEVVAGIPRTD 345



 Score = 46.6 bits (109), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 18/31 (58%), Positives = 23/31 (74%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           +CG GCNGG+P  AW +W + G+VSGG Y S
Sbjct: 157 MCGDGCNGGYPAEAWNFWTRKGLVSGGLYDS 187


>gi|157833437|pdb|1PBH|A Chain A, Crystal Structure Of Human Recombinant Procathepsin B At
           3.2 Angstrom Resolution
 gi|157835646|pdb|2PBH|A Chain A, Crystal Structure Of Human Procathepsin B At 3.3 Angstrom
           Resolution
 gi|157836863|pdb|3PBH|A Chain A, Refined Crystal Structure Of Human Procathepsin B At 2.5
           Angstrom Resolution
          Length = 317

 Score =  192 bits (489), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 97/196 (49%), Positives = 123/196 (62%), Gaps = 40/196 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D ++G  SYSVS++E
Sbjct: 162 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 220

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAF+V+ D +LYKSG +                             
Sbjct: 221 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 251

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G+ +GGHAIRILGWG +  +   YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 252 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 301

Query: 294 ECGIESSITAGVPKLD 309
            CGIES + AG+P+ D
Sbjct: 302 HCGIESEVVAGIPRTD 317



 Score = 46.6 bits (109), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 18/31 (58%), Positives = 23/31 (74%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           +CG GCNGG+P  AW +W + G+VSGG Y S
Sbjct: 129 MCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 159


>gi|193783549|dbj|BAG53460.1| unnamed protein product [Homo sapiens]
          Length = 276

 Score =  192 bits (489), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 97/196 (49%), Positives = 123/196 (62%), Gaps = 40/196 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D ++G  SYSVS++E
Sbjct: 115 GCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 173

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAF+V+ D +LYKSG +                             
Sbjct: 174 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 204

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G+ +GGHAIRILGWG +  +   YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 205 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 254

Query: 294 ECGIESSITAGVPKLD 309
            CGIES + AG+P+ D
Sbjct: 255 HCGIESEVVAGIPRTD 270



 Score = 42.4 bits (98), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 18/33 (54%), Positives = 22/33 (66%)

Query: 6   IRLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           I  C F CNGG+P  AW +W + G+VSGG Y S
Sbjct: 80  ITGCLFSCNGGYPAEAWNFWTRKGLVSGGLYES 112


>gi|4503139|ref|NP_001899.1| cathepsin B preproprotein [Homo sapiens]
 gi|22538431|ref|NP_680090.1| cathepsin B preproprotein [Homo sapiens]
 gi|22538433|ref|NP_680091.1| cathepsin B preproprotein [Homo sapiens]
 gi|22538435|ref|NP_680092.1| cathepsin B preproprotein [Homo sapiens]
 gi|22538437|ref|NP_680093.1| cathepsin B preproprotein [Homo sapiens]
 gi|68067549|sp|P07858.3|CATB_HUMAN RecName: Full=Cathepsin B; AltName: Full=APP secretase; Short=APPS;
           AltName: Full=Cathepsin B1; Contains: RecName:
           Full=Cathepsin B light chain; Contains: RecName:
           Full=Cathepsin B heavy chain; Flags: Precursor
 gi|291888|gb|AAC37547.1| cathepsin B [Homo sapiens]
 gi|63102437|gb|AAH95408.1| Cathepsin B [Homo sapiens]
 gi|119586034|gb|EAW65630.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|119586036|gb|EAW65632.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|119586037|gb|EAW65633.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|119586038|gb|EAW65634.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|119586039|gb|EAW65635.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|119586040|gb|EAW65636.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|168277954|dbj|BAG10955.1| cathepsin B precursor [synthetic construct]
 gi|193786804|dbj|BAG52127.1| unnamed protein product [Homo sapiens]
          Length = 339

 Score =  192 bits (489), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 97/196 (49%), Positives = 123/196 (62%), Gaps = 40/196 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D ++G  SYSVS++E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 236

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAF+V+ D +LYKSG +                             
Sbjct: 237 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 267

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G+ +GGHAIRILGWG +  +   YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 317

Query: 294 ECGIESSITAGVPKLD 309
            CGIES + AG+P+ D
Sbjct: 318 HCGIESEVVAGIPRTD 333



 Score = 46.6 bits (109), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 18/31 (58%), Positives = 23/31 (74%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           +CG GCNGG+P  AW +W + G+VSGG Y S
Sbjct: 145 MCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 175


>gi|181192|gb|AAA52129.1| preprocathepsin B [Homo sapiens]
 gi|193787271|dbj|BAG52477.1| unnamed protein product [Homo sapiens]
          Length = 339

 Score =  192 bits (489), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 97/196 (49%), Positives = 123/196 (62%), Gaps = 40/196 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D ++G  SYSVS++E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 236

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAF+V+ D +LYKSG +                             
Sbjct: 237 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 267

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G+ +GGHAIRILGWG +  +   YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 317

Query: 294 ECGIESSITAGVPKLD 309
            CGIES + AG+P+ D
Sbjct: 318 HCGIESEVVAGIPRTD 333



 Score = 46.6 bits (109), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 18/31 (58%), Positives = 23/31 (74%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           +CG GCNGG+P  AW +W + G+VSGG Y S
Sbjct: 145 MCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 175


>gi|332862712|ref|XP_003317964.1| PREDICTED: cathepsin B isoform 1 [Pan troglodytes]
 gi|332862714|ref|XP_003317965.1| PREDICTED: cathepsin B isoform 2 [Pan troglodytes]
 gi|332862716|ref|XP_003317966.1| PREDICTED: cathepsin B isoform 3 [Pan troglodytes]
 gi|332862718|ref|XP_519607.3| PREDICTED: cathepsin B isoform 5 [Pan troglodytes]
 gi|410057614|ref|XP_003954244.1| PREDICTED: cathepsin B [Pan troglodytes]
 gi|410262606|gb|JAA19269.1| cathepsin B [Pan troglodytes]
 gi|410262608|gb|JAA19270.1| cathepsin B [Pan troglodytes]
 gi|410359820|gb|JAA44654.1| cathepsin B [Pan troglodytes]
 gi|410359822|gb|JAA44655.1| cathepsin B [Pan troglodytes]
 gi|410359824|gb|JAA44656.1| cathepsin B [Pan troglodytes]
 gi|410359826|gb|JAA44657.1| cathepsin B [Pan troglodytes]
 gi|410359828|gb|JAA44658.1| cathepsin B [Pan troglodytes]
          Length = 339

 Score =  192 bits (489), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 97/196 (49%), Positives = 123/196 (62%), Gaps = 40/196 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D ++G  SYSVS++E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 236

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAF+V+ D +LYKSG +                             
Sbjct: 237 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 267

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G+ +GGHAIRILGWG +  +   YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 317

Query: 294 ECGIESSITAGVPKLD 309
            CGIES + AG+P+ D
Sbjct: 318 HCGIESEVVAGIPRTD 333



 Score = 46.6 bits (109), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 18/31 (58%), Positives = 23/31 (74%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           +CG GCNGG+P  AW +W + G+VSGG Y S
Sbjct: 145 MCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 175


>gi|440913587|gb|ELR63025.1| Cathepsin B [Bos grunniens mutus]
          Length = 335

 Score =  192 bits (489), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 98/193 (50%), Positives = 120/193 (62%), Gaps = 40/193 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D +FG  SYSV++NE
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVANNE 236

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAF+V+ D +LYKSG +                             
Sbjct: 237 KEIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 267

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + SG+ +GGHAIRILGWG +  +   YWL+ NSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVSGEIMGGHAIRILGWGVENGT--PYWLVGNSWNTDWGDNGFFKILRGQD 317

Query: 294 ECGIESSITAGVP 306
            CGIES I AG+P
Sbjct: 318 HCGIESEIVAGMP 330



 Score = 64.3 bits (155), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 33/83 (39%), Positives = 43/83 (51%), Gaps = 12/83 (14%)

Query: 44  NSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYS------------EVDEDLPANFDSR 91
           + L N       +W   H  YN+  + + +L G                D  LP +FD+R
Sbjct: 28  DELVNFVNKQNTTWKAGHNFYNVDLSYVKKLCGTILGGPKLPQRDAFAADVVLPESFDAR 87

Query: 92  TKWPNCPTIREIRDQGSCGSCWG 114
            +WPNCPTI+EIRDQGSCGSCW 
Sbjct: 88  KQWPNCPTIKEIRDQGSCGSCWA 110


>gi|27806671|ref|NP_776456.1| cathepsin B precursor [Bos taurus]
 gi|115312124|sp|P07688.5|CATB_BOVIN RecName: Full=Cathepsin B; AltName: Full=BCSB; Contains: RecName:
           Full=Cathepsin B light chain; Contains: RecName:
           Full=Cathepsin B heavy chain; Flags: Precursor
 gi|289402|gb|AAA03064.1| cathepsin B [Bos taurus]
 gi|809479|gb|AAA80198.1| cathepsin B [Bos taurus]
 gi|296484950|tpg|DAA27065.1| TPA: cathepsin B precursor [Bos taurus]
          Length = 335

 Score =  192 bits (489), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 98/193 (50%), Positives = 120/193 (62%), Gaps = 40/193 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D +FG  SYSV++NE
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVANNE 236

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAF+V+ D +LYKSG +                             
Sbjct: 237 KEIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 267

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + SG+ +GGHAIRILGWG +  +   YWL+ NSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVSGEIMGGHAIRILGWGVENGT--PYWLVGNSWNTDWGDNGFFKILRGQD 317

Query: 294 ECGIESSITAGVP 306
            CGIES I AG+P
Sbjct: 318 HCGIESEIVAGMP 330



 Score = 64.3 bits (155), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 33/83 (39%), Positives = 43/83 (51%), Gaps = 12/83 (14%)

Query: 44  NSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYS------------EVDEDLPANFDSR 91
           + L N       +W   H  YN+  + + +L G                D  LP +FD+R
Sbjct: 28  DELVNFVNKQNTTWKAGHNFYNVDLSYVKKLCGAILGGPKLPQRDAFAADVVLPESFDAR 87

Query: 92  TKWPNCPTIREIRDQGSCGSCWG 114
            +WPNCPTI+EIRDQGSCGSCW 
Sbjct: 88  EQWPNCPTIKEIRDQGSCGSCWA 110


>gi|194387364|dbj|BAG60046.1| unnamed protein product [Homo sapiens]
          Length = 245

 Score =  192 bits (489), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 97/196 (49%), Positives = 123/196 (62%), Gaps = 40/196 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D ++G  SYSVS++E
Sbjct: 84  GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 142

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAF+V+ D +LYKSG +                             
Sbjct: 143 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 173

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G+ +GGHAIRILGWG +  +   YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 174 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 223

Query: 294 ECGIESSITAGVPKLD 309
            CGIES + AG+P+ D
Sbjct: 224 HCGIESEVVAGIPRTD 239



 Score = 45.8 bits (107), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 18/31 (58%), Positives = 23/31 (74%)

Query: 8  LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
          +CG GCNGG+P  AW +W + G+VSGG Y S
Sbjct: 51 MCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 81


>gi|426358853|ref|XP_004046705.1| PREDICTED: cathepsin B isoform 1 [Gorilla gorilla gorilla]
 gi|426358855|ref|XP_004046706.1| PREDICTED: cathepsin B isoform 2 [Gorilla gorilla gorilla]
 gi|426358857|ref|XP_004046707.1| PREDICTED: cathepsin B isoform 3 [Gorilla gorilla gorilla]
          Length = 339

 Score =  192 bits (489), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 97/196 (49%), Positives = 123/196 (62%), Gaps = 40/196 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D ++G  SYSVS++E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 236

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAF+V+ D +LYKSG +                             
Sbjct: 237 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 267

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G+ +GGHAIRILGWG +  +   YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 317

Query: 294 ECGIESSITAGVPKLD 309
            CGIES + AG+P+ D
Sbjct: 318 HCGIESEVVAGIPRTD 333



 Score = 46.6 bits (109), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 18/31 (58%), Positives = 23/31 (74%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           +CG GCNGG+P  AW +W + G+VSGG Y S
Sbjct: 145 MCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 175


>gi|397467300|ref|XP_003805362.1| PREDICTED: cathepsin B [Pan paniscus]
          Length = 339

 Score =  192 bits (489), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 97/196 (49%), Positives = 123/196 (62%), Gaps = 40/196 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D ++G  SYSVS++E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 236

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAF+V+ D +LYKSG +                             
Sbjct: 237 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 267

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G+ +GGHAIRILGWG +  +   YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 317

Query: 294 ECGIESSITAGVPKLD 309
            CGIES + AG+P+ D
Sbjct: 318 HCGIESEVVAGIPRTD 333



 Score = 46.6 bits (109), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 18/31 (58%), Positives = 23/31 (74%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           +CG GCNGG+P  AW +W + G+VSGG Y S
Sbjct: 145 MCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 175


>gi|60816353|gb|AAX36379.1| cathepsin B [synthetic construct]
 gi|61358313|gb|AAX41546.1| cathepsin B [synthetic construct]
          Length = 339

 Score =  192 bits (489), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 97/196 (49%), Positives = 123/196 (62%), Gaps = 40/196 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D ++G  SYSVS++E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 236

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAF+V+ D +LYKSG +                             
Sbjct: 237 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 267

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G+ +GGHAIRILGWG +  +   YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 317

Query: 294 ECGIESSITAGVPKLD 309
            CGIES + AG+P+ D
Sbjct: 318 HCGIESEVVAGIPRTD 333



 Score = 45.4 bits (106), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 18/30 (60%), Positives = 22/30 (73%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGG+P  AW +W + G+VSGG Y S
Sbjct: 146 CGDGCNGGYPAEAWNFWTRKGLVSGGLYES 175


>gi|16307393|gb|AAH10240.1| Cathepsin B [Homo sapiens]
          Length = 339

 Score =  192 bits (488), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 97/196 (49%), Positives = 123/196 (62%), Gaps = 40/196 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D ++G  SYSVS++E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 236

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAF+V+ D +LYKSG +                             
Sbjct: 237 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 267

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G+ +GGHAIRILGWG +  +   YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 317

Query: 294 ECGIESSITAGVPKLD 309
            CGIES + AG+P+ D
Sbjct: 318 HCGIESEVVAGIPRTD 333



 Score = 46.2 bits (108), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 18/31 (58%), Positives = 23/31 (74%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           +CG GCNGG+P  AW +W + G+VSGG Y S
Sbjct: 145 MCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 175


>gi|262368170|pdb|3K9M|A Chain A, Cathepsin B In Complex With Stefin A
 gi|262368172|pdb|3K9M|B Chain B, Cathepsin B In Complex With Stefin A
          Length = 254

 Score =  192 bits (488), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 97/196 (49%), Positives = 123/196 (62%), Gaps = 40/196 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D ++G  SYSVS++E
Sbjct: 99  GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 157

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAF+V+ D +LYKSG +                             
Sbjct: 158 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 188

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G+ +GGHAIRILGWG +  +   YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 189 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 238

Query: 294 ECGIESSITAGVPKLD 309
            CGIES + AG+P+ D
Sbjct: 239 HCGIESEVVAGIPRTD 254



 Score = 46.2 bits (108), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 18/31 (58%), Positives = 23/31 (74%)

Query: 8  LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
          +CG GCNGG+P  AW +W + G+VSGG Y S
Sbjct: 66 MCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 96


>gi|999909|pdb|1HUC|B Chain B, The Refined 2.15 Angstroms X-Ray Crystal Structure Of
           Human Liver Cathepsin B: The Structural Basis For Its
           Specificity
 gi|999911|pdb|1HUC|D Chain D, The Refined 2.15 Angstroms X-Ray Crystal Structure Of
           Human Liver Cathepsin B: The Structural Basis For Its
           Specificity
 gi|1421164|pdb|1CSB|B Chain B, Crystal Structure Of Cathepsin B Inhibited With Ca030 At
           2.1 Angstroms Resolution: A Basis For The Design Of
           Specific Epoxysuccinyl Inhibitors
 gi|1421167|pdb|1CSB|E Chain E, Crystal Structure Of Cathepsin B Inhibited With Ca030 At
           2.1 Angstroms Resolution: A Basis For The Design Of
           Specific Epoxysuccinyl Inhibitors
 gi|122920711|pdb|2IPP|B Chain B, Crystal Structure Of The Tetragonal Form Of Human Liver
           Cathepsin B
          Length = 205

 Score =  192 bits (488), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 97/196 (49%), Positives = 123/196 (62%), Gaps = 40/196 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D ++G  SYSVS++E
Sbjct: 50  GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 108

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAF+V+ D +LYKSG +                             
Sbjct: 109 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 139

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G+ +GGHAIRILGWG +  +   YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 140 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 189

Query: 294 ECGIESSITAGVPKLD 309
            CGIES + AG+P+ D
Sbjct: 190 HCGIESEVVAGIPRTD 205



 Score = 45.8 bits (107), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 18/31 (58%), Positives = 23/31 (74%)

Query: 8  LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
          +CG GCNGG+P  AW +W + G+VSGG Y S
Sbjct: 17 MCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 47


>gi|30583753|gb|AAP36125.1| Homo sapiens cathepsin B [synthetic construct]
 gi|61370555|gb|AAX43516.1| cathepsin B [synthetic construct]
          Length = 340

 Score =  192 bits (488), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 97/196 (49%), Positives = 123/196 (62%), Gaps = 40/196 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D ++G  SYSVS++E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 236

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAF+V+ D +LYKSG +                             
Sbjct: 237 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 267

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G+ +GGHAIRILGWG +  +   YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 317

Query: 294 ECGIESSITAGVPKLD 309
            CGIES + AG+P+ D
Sbjct: 318 HCGIESEVVAGIPRTD 333



 Score = 46.2 bits (108), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 18/31 (58%), Positives = 23/31 (74%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           +CG GCNGG+P  AW +W + G+VSGG Y S
Sbjct: 145 MCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 175


>gi|333361087|pdb|3AI8|B Chain B, Cathepsin B In Complex With The Nitroxoline
 gi|333361088|pdb|3AI8|A Chain A, Cathepsin B In Complex With The Nitroxoline
          Length = 256

 Score =  192 bits (488), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 97/196 (49%), Positives = 123/196 (62%), Gaps = 40/196 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D ++G  SYSVS++E
Sbjct: 101 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 159

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAF+V+ D +LYKSG +                             
Sbjct: 160 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 190

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G+ +GGHAIRILGWG +  +   YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 191 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 240

Query: 294 ECGIESSITAGVPKLD 309
            CGIES + AG+P+ D
Sbjct: 241 HCGIESEVVAGIPRTD 256



 Score = 46.2 bits (108), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 18/31 (58%), Positives = 23/31 (74%)

Query: 8  LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
          +CG GCNGG+P  AW +W + G+VSGG Y S
Sbjct: 68 MCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 98


>gi|24158605|pdb|1GMY|A Chain A, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
 gi|24158606|pdb|1GMY|B Chain B, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
 gi|24158607|pdb|1GMY|C Chain C, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
          Length = 261

 Score =  192 bits (488), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 97/196 (49%), Positives = 123/196 (62%), Gaps = 40/196 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D ++G  SYSVS++E
Sbjct: 100 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 158

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAF+V+ D +LYKSG +                             
Sbjct: 159 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 189

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G+ +GGHAIRILGWG +  +   YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 190 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 239

Query: 294 ECGIESSITAGVPKLD 309
            CGIES + AG+P+ D
Sbjct: 240 HCGIESEVVAGIPRTD 255



 Score = 46.2 bits (108), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 18/31 (58%), Positives = 23/31 (74%)

Query: 8  LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
          +CG GCNGG+P  AW +W + G+VSGG Y S
Sbjct: 67 MCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 97


>gi|402877481|ref|XP_003902454.1| PREDICTED: cathepsin B [Papio anubis]
          Length = 339

 Score =  192 bits (488), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 97/196 (49%), Positives = 123/196 (62%), Gaps = 40/196 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D ++G  SYSVS++E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 236

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAF+V+ D +LYKSG +                             
Sbjct: 237 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 267

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G+ +GGHAIRILGWG +  +   YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 317

Query: 294 ECGIESSITAGVPKLD 309
            CGIES + AG+P+ D
Sbjct: 318 HCGIESEVVAGIPRTD 333



 Score = 46.2 bits (108), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 18/31 (58%), Positives = 23/31 (74%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           +CG GCNGG+P  AW +W + G+VSGG Y S
Sbjct: 145 MCGDGCNGGYPAGAWNFWTRKGLVSGGLYDS 175


>gi|302564570|ref|NP_001181828.1| cathepsin B precursor [Macaca mulatta]
          Length = 339

 Score =  192 bits (488), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 97/196 (49%), Positives = 123/196 (62%), Gaps = 40/196 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D ++G  SYSVS++E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 236

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAF+V+ D +LYKSG +                             
Sbjct: 237 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 267

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G+ +GGHAIRILGWG +  +   YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 317

Query: 294 ECGIESSITAGVPKLD 309
            CGIES + AG+P+ D
Sbjct: 318 HCGIESEVVAGIPRTD 333



 Score = 46.2 bits (108), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 18/31 (58%), Positives = 23/31 (74%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           +CG GCNGG+P  AW +W + G+VSGG Y S
Sbjct: 145 MCGDGCNGGYPAGAWNFWTRKGLVSGGLYDS 175


>gi|343961899|dbj|BAK62537.1| cathepsin B precursor [Pan troglodytes]
          Length = 195

 Score =  192 bits (488), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 97/196 (49%), Positives = 123/196 (62%), Gaps = 40/196 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D ++G  SYSVS++E
Sbjct: 34  GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 92

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAF+V+ D +LYKSG +                             
Sbjct: 93  KGIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 123

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G+ +GGHAIRILGWG +  +   YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 124 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 173

Query: 294 ECGIESSITAGVPKLD 309
            CGIES + AG+P+ D
Sbjct: 174 HCGIESEVVAGIPRTD 189



 Score = 45.4 bits (106), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 18/31 (58%), Positives = 23/31 (74%)

Query: 8  LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
          +CG GCNGG+P  AW +W + G+VSGG Y S
Sbjct: 1  MCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 31


>gi|73586701|gb|AAI02998.1| CTSB protein [Bos taurus]
          Length = 335

 Score =  192 bits (488), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 98/193 (50%), Positives = 120/193 (62%), Gaps = 40/193 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D +FG  SYSV++NE
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVANNE 236

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAF+V+ D +LYKSG +                             
Sbjct: 237 KEIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 267

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + SG+ +GGHAIRILGWG +  +   YWL+ NSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVSGEIMGGHAIRILGWGVENGT--PYWLVGNSWNTDWGDNGFFKILRGQD 317

Query: 294 ECGIESSITAGVP 306
            CGIES I AG+P
Sbjct: 318 HCGIESEIVAGMP 330



 Score = 47.4 bits (111), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 20/30 (66%), Positives = 22/30 (73%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGGFP  AW +W K G+VSGG Y S
Sbjct: 146 CGDGCNGGFPSGAWNFWTKKGLVSGGLYNS 175


>gi|355697726|gb|EHH28274.1| Cathepsin B [Macaca mulatta]
          Length = 339

 Score =  192 bits (488), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 97/196 (49%), Positives = 123/196 (62%), Gaps = 40/196 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D ++G  SYSVS++E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 236

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAF+V+ D +LYKSG +                             
Sbjct: 237 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 267

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G+ +GGHAIRILGWG +  +   YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 317

Query: 294 ECGIESSITAGVPKLD 309
            CGIES + AG+P+ D
Sbjct: 318 HCGIESEVVAGIPRTD 333



 Score = 42.0 bits (97), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 17/31 (54%), Positives = 22/31 (70%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           +CG GCNGG+P  AW +  + G+VSGG Y S
Sbjct: 145 MCGDGCNGGYPAGAWNFLTRKGLVSGGLYDS 175


>gi|296221607|ref|XP_002756833.1| PREDICTED: cathepsin B, partial [Callithrix jacchus]
          Length = 330

 Score =  192 bits (488), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 96/196 (48%), Positives = 122/196 (62%), Gaps = 40/196 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D ++G  SYSVS+NE
Sbjct: 169 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKSCEPGYSPTYKQDKHYGYDSYSVSNNE 227

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           + IM EIY++GPVEGAF+V+ D +LYKSG +                             
Sbjct: 228 RDIMAEIYKNGPVEGAFSVYADFLLYKSGVY----------------------------- 258

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G+ +GGHAIRILGWG +  +   YWL+ NSWNTDWGDNG FKILRG+D
Sbjct: 259 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVGNSWNTDWGDNGFFKILRGQD 308

Query: 294 ECGIESSITAGVPKLD 309
            CGIES + AG+P+ D
Sbjct: 309 HCGIESEVVAGIPRTD 324



 Score = 47.0 bits (110), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 18/31 (58%), Positives = 23/31 (74%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           +CG GCNGG+P  AW +W + G+VSGG Y S
Sbjct: 136 MCGDGCNGGYPAEAWNFWTRKGLVSGGLYDS 166


>gi|75076082|sp|Q4R5M2.1|CATB_MACFA RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
           light chain; Contains: RecName: Full=Cathepsin B heavy
           chain; Flags: Precursor
 gi|67970521|dbj|BAE01603.1| unnamed protein product [Macaca fascicularis]
 gi|355779504|gb|EHH63980.1| Cathepsin B [Macaca fascicularis]
 gi|383411999|gb|AFH29213.1| cathepsin B preproprotein [Macaca mulatta]
 gi|384942194|gb|AFI34702.1| cathepsin B preproprotein [Macaca mulatta]
          Length = 339

 Score =  192 bits (488), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 97/196 (49%), Positives = 123/196 (62%), Gaps = 40/196 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D ++G  SYSVS++E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 236

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAF+V+ D +LYKSG +                             
Sbjct: 237 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 267

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G+ +GGHAIRILGWG +  +   YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 317

Query: 294 ECGIESSITAGVPKLD 309
            CGIES + AG+P+ D
Sbjct: 318 HCGIESEVVAGIPRTD 333



 Score = 46.2 bits (108), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 18/31 (58%), Positives = 23/31 (74%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           +CG GCNGG+P  AW +W + G+VSGG Y S
Sbjct: 145 MCGDGCNGGYPAGAWNFWTRKGLVSGGLYDS 175


>gi|344195776|gb|AEM98130.1| cathepsin B [Cynoglossus semilaevis]
          Length = 332

 Score =  192 bits (487), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 95/194 (48%), Positives = 119/194 (61%), Gaps = 39/194 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY IAPCEHHVNG+RP C    G TP+C ++C+  Y   Y +D ++G  SYSV  +E
Sbjct: 176 GCRPYSIAPCEHHVNGSRPPCTGEGGDTPQCTKKCEAGYTPGYTQDKHYGKLSYSVDDSE 235

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K I  EIY++GPVEGAFTV++D +LYK+G +                             
Sbjct: 236 KEIQLEIYKNGPVEGAFTVYEDFLLYKTGVY----------------------------- 266

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G A+GGHAI++LGWGE+  +   YWL ANSWNTDWGDNG FKILRG D
Sbjct: 267 --------QHVTGSAVGGHAIKVLGWGEENGT--PYWLCANSWNTDWGDNGFFKILRGSD 316

Query: 294 ECGIESSITAGVPK 307
            CGIES I AG+PK
Sbjct: 317 HCGIESEIVAGIPK 330



 Score = 46.6 bits (109), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 18/30 (60%), Positives = 21/30 (70%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGG+P  AW +W   G+VSGG Y S
Sbjct: 144 CGMGCNGGYPSAAWEFWTTDGLVSGGLYDS 173


>gi|410912140|ref|XP_003969548.1| PREDICTED: cathepsin B-like [Takifugu rubripes]
          Length = 246

 Score =  192 bits (487), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 97/194 (50%), Positives = 119/194 (61%), Gaps = 39/194 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RPSC    G TP+CV  C+  Y   YK+D ++G  SYSVSS+E
Sbjct: 92  GCRPYTIPPCEHHVNGSRPSCSGEGGETPQCVYRCEAGYTPSYKQDKHYGKTSYSVSSDE 151

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
             I  EIY++GPVEGAFTV++D +LYK+G +                             
Sbjct: 152 DDIKHEIYKNGPVEGAFTVYEDFVLYKTGVY----------------------------- 182

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G ALGGHAI+ILGWGE+  +   YWL ANSWNTDWG+NG FKILRG +
Sbjct: 183 --------QHVTGSALGGHAIKILGWGEE--NGIPYWLCANSWNTDWGNNGFFKILRGSN 232

Query: 294 ECGIESSITAGVPK 307
            CGIES I AG+P 
Sbjct: 233 HCGIESEIVAGIPN 246



 Score = 47.4 bits (111), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 19/30 (63%), Positives = 22/30 (73%)

Query: 9  CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
          CG GCNGG+P  AW +W K G+VSGG Y S
Sbjct: 60 CGMGCNGGYPSAAWDFWTKDGLVSGGLYDS 89


>gi|223646922|gb|ACN10219.1| Cathepsin B precursor [Salmo salar]
 gi|223647940|gb|ACN10728.1| Cathepsin B precursor [Salmo salar]
 gi|223672785|gb|ACN12574.1| Cathepsin B precursor [Salmo salar]
          Length = 330

 Score =  192 bits (487), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 96/193 (49%), Positives = 119/193 (61%), Gaps = 39/193 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNGTRP C   +G TP+C  +C+  Y   YK+D +FG  SYS+ S E
Sbjct: 176 GCRPYSIPPCEHHVNGTRPPCTGEEGDTPQCSNQCETGYTPGYKQDKHFGKNSYSLPSEE 235

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           + IM E+ ++GPVEGAFTV++D +LYKSG +                             
Sbjct: 236 QQIMAELLKNGPVEGAFTVYEDFLLYKSGVY----------------------------- 266

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + SG A+GGHAI++LGWGE+  +   YWL ANSWNTDWG+NG FKILRGKD
Sbjct: 267 --------QHVSGSAVGGHAIKVLGWGEEGGT--PYWLAANSWNTDWGENGFFKILRGKD 316

Query: 294 ECGIESSITAGVP 306
            CGIES + AGVP
Sbjct: 317 HCGIESEMVAGVP 329



 Score = 45.1 bits (105), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 17/30 (56%), Positives = 21/30 (70%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGG+P  AW +W   G+V+GG Y S
Sbjct: 144 CGMGCNGGYPSAAWDFWTTEGLVTGGLYDS 173


>gi|181178|gb|AAA52125.1| lysosomal proteinase cathepsin B, partial [Homo sapiens]
          Length = 209

 Score =  192 bits (487), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 97/196 (49%), Positives = 123/196 (62%), Gaps = 40/196 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D ++G  SYSVS++E
Sbjct: 48  GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 106

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAF+V+ D +LYKSG +                             
Sbjct: 107 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 137

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G+ +GGHAIRILGWG +  +   YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 138 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 187

Query: 294 ECGIESSITAGVPKLD 309
            CGIES + AG+P+ D
Sbjct: 188 HCGIESEVVAGIPRTD 203



 Score = 45.8 bits (107), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 18/31 (58%), Positives = 23/31 (74%)

Query: 8  LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
          +CG GCNGG+P  AW +W + G+VSGG Y S
Sbjct: 15 MCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 45


>gi|197098184|ref|NP_001126573.1| cathepsin B precursor [Pongo abelii]
 gi|75061687|sp|Q5R6D1.1|CATB_PONAB RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
           light chain; Contains: RecName: Full=Cathepsin B heavy
           chain; Flags: Precursor
 gi|55731764|emb|CAH92586.1| hypothetical protein [Pongo abelii]
 gi|55731953|emb|CAH92685.1| hypothetical protein [Pongo abelii]
          Length = 339

 Score =  191 bits (486), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 96/196 (48%), Positives = 123/196 (62%), Gaps = 40/196 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D ++G  SYSVS++E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 236

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           + IM EIY++GPVEGAF+V+ D +LYKSG +                             
Sbjct: 237 RDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 267

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G+ +GGHAIRILGWG +  +   YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 317

Query: 294 ECGIESSITAGVPKLD 309
            CGIES + AG+P+ D
Sbjct: 318 HCGIESEVVAGIPRTD 333



 Score = 46.6 bits (109), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 18/31 (58%), Positives = 23/31 (74%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           +CG GCNGG+P  AW +W + G+VSGG Y S
Sbjct: 145 MCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 175


>gi|28373366|pdb|1ITO|A Chain A, Crystal Structure Analysis Of Bovine Spleen Cathepsin B-
           E64c Complex
 gi|88192750|pdb|2DC6|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-ca073 Complex
 gi|88192751|pdb|2DC7|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-ca042 Complex
 gi|88192752|pdb|2DC8|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-ca059 Complex
 gi|88192753|pdb|2DC9|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-Ca074me Complex
 gi|88192754|pdb|2DCA|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-ca075 Complex
 gi|88192755|pdb|2DCB|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-Ca076 Complex
 gi|88192756|pdb|2DCC|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-Ca077 Complex
 gi|88192757|pdb|2DCD|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-Ca078 Complex
          Length = 256

 Score =  191 bits (486), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 98/193 (50%), Positives = 120/193 (62%), Gaps = 40/193 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D +FG  SYSV++NE
Sbjct: 99  GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVANNE 157

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAF+V+ D +LYKSG +                             
Sbjct: 158 KEIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 188

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + SG+ +GGHAIRILGWG +  +   YWL+ NSWNTDWGDNG FKILRG+D
Sbjct: 189 --------QHVSGEIMGGHAIRILGWGVENGT--PYWLVGNSWNTDWGDNGFFKILRGQD 238

Query: 294 ECGIESSITAGVP 306
            CGIES I AG+P
Sbjct: 239 HCGIESEIVAGMP 251



 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 24/30 (80%), Positives = 28/30 (93%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
           LP +FD+R +WPNCPTI+EIRDQGSCGSCW
Sbjct: 1   LPESFDAREQWPNCPTIKEIRDQGSCGSCW 30


>gi|444525951|gb|ELV14228.1| Cathepsin B [Tupaia chinensis]
          Length = 339

 Score =  191 bits (486), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 97/196 (49%), Positives = 120/196 (61%), Gaps = 40/196 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D ++G  SYSV   E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKSCEPGYSSSYKEDKHYGYSSYSVPGIE 236

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAF+V+ D +LYKSG +                             
Sbjct: 237 KEIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 267

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G+ +GGHAIRILGWG +  +   YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVTGEMMGGHAIRILGWGTENGT--PYWLVANSWNTDWGDNGFFKILRGQD 317

Query: 294 ECGIESSITAGVPKLD 309
            CGIES I AG+P+ D
Sbjct: 318 HCGIESEIVAGIPRTD 333


>gi|48425700|pdb|1SP4|B Chain B, Crystal Structure Of Ns-134 In Complex With Bovine
           Cathepsin B: A Two Headed Epoxysuccinyl Inhibitor
           Extends Along The Whole Active Site Cleft
          Length = 205

 Score =  191 bits (486), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 98/193 (50%), Positives = 120/193 (62%), Gaps = 40/193 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D +FG  SYSV++NE
Sbjct: 51  GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCNKTCEPGYSPSYKEDKHFGCSSYSVANNE 109

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAF+V+ D +LYKSG +                             
Sbjct: 110 KEIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 140

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + SG+ +GGHAIRILGWG +  +   YWL+ NSWNTDWGDNG FKILRG+D
Sbjct: 141 --------QHVSGEIMGGHAIRILGWGVENGT--PYWLVGNSWNTDWGDNGFFKILRGQD 190

Query: 294 ECGIESSITAGVP 306
            CGIES I AG+P
Sbjct: 191 HCGIESEIVAGMP 203


>gi|379067374|gb|AFC90100.1| cathepsin B [Capra hircus]
          Length = 335

 Score =  191 bits (486), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 99/193 (51%), Positives = 118/193 (61%), Gaps = 40/193 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK D +FG  SYSVSSNE
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPSYKDDKHFGCSSYSVSSNE 236

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAF+V+ D +LYKSG +                             
Sbjct: 237 KEIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 267

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + SG+ +GGHAIRILGWG +  +   YWL+ NSWNTDWGD G FKILRG+D
Sbjct: 268 --------QHVSGEMMGGHAIRILGWGVENDT--PYWLVGNSWNTDWGDKGFFKILRGQD 317

Query: 294 ECGIESSITAGVP 306
            CGIES I AG+P
Sbjct: 318 HCGIESEIVAGMP 330



 Score = 47.4 bits (111), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 20/30 (66%), Positives = 22/30 (73%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGGFP  AW +W K G+VSGG Y S
Sbjct: 146 CGDGCNGGFPSGAWNFWTKKGLVSGGLYDS 175


>gi|50540542|ref|NP_998501.1| cathepsin B, a precursor [Danio rerio]
 gi|34784038|gb|AAH56688.1| Cathepsin B, a [Danio rerio]
 gi|37681773|gb|AAQ97764.1| cathepsin B [Danio rerio]
 gi|41351445|gb|AAH65589.1| Cathepsin B, a [Danio rerio]
          Length = 330

 Score =  191 bits (486), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 97/193 (50%), Positives = 118/193 (61%), Gaps = 39/193 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C    G TP C  +C+  Y   YK+D +FG  SYSV SN+
Sbjct: 176 GCRPYTIEPCEHHVNGSRPPCSGEGGDTPNCDMKCEPGYSPSYKQDKHFGKTSYSVPSNQ 235

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            SIM E++++GPVEGAFTV++D +LYKSG +                             
Sbjct: 236 NSIMAELFKNGPVEGAFTVYEDFLLYKSGVY----------------------------- 266

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + SG  +GGHAI+ILGWGE+  +   YWL ANSWNTDWGDNG FKILRG+D
Sbjct: 267 --------QHMSGSPVGGHAIKILGWGEE--NGVPYWLAANSWNTDWGDNGYFKILRGED 316

Query: 294 ECGIESSITAGVP 306
            CGIES I AG+P
Sbjct: 317 HCGIESEIVAGIP 329



 Score = 45.4 bits (106), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 17/30 (56%), Positives = 21/30 (70%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGG+P  AW +W   G+V+GG Y S
Sbjct: 144 CGMGCNGGYPSAAWDFWATEGLVTGGLYNS 173


>gi|426220597|ref|XP_004004501.1| PREDICTED: cathepsin B [Ovis aries]
          Length = 335

 Score =  191 bits (486), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 99/193 (51%), Positives = 118/193 (61%), Gaps = 40/193 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK D +FG  SYSVSSNE
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPSYKDDKHFGCSSYSVSSNE 236

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAF+V+ D +LYKSG +                             
Sbjct: 237 KEIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 267

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + SG+ +GGHAIRILGWG +  +   YWL+ NSWNTDWGD G FKILRG+D
Sbjct: 268 --------QHVSGEMMGGHAIRILGWGVENDT--PYWLVGNSWNTDWGDKGFFKILRGQD 317

Query: 294 ECGIESSITAGVP 306
            CGIES I AG+P
Sbjct: 318 HCGIESEIVAGMP 330



 Score = 47.4 bits (111), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 20/30 (66%), Positives = 22/30 (73%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGGFP  AW +W K G+VSGG Y S
Sbjct: 146 CGDGCNGGFPSGAWNFWTKKGLVSGGLYDS 175


>gi|326916753|ref|XP_003204669.1| PREDICTED: cathepsin B-like [Meleagris gallopavo]
          Length = 340

 Score =  191 bits (485), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 96/196 (48%), Positives = 118/196 (60%), Gaps = 39/196 (19%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C    G TP+C R C+  Y   YK+D ++G  SY V  +E
Sbjct: 178 GCRPYTIPPCEHHVNGSRPPCTGEGGETPRCSRHCEPGYSPSYKEDKHYGITSYGVPRSE 237

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAF V++D ++YKSG +                             
Sbjct: 238 KEIMAEIYKNGPVEGAFIVYEDFLMYKSGVY----------------------------- 268

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + SG+ +GGHAIRILGWG +  +   YWL ANSWNTDWGDNG FKILRG+D
Sbjct: 269 --------QHVSGEQVGGHAIRILGWGVENGT--PYWLAANSWNTDWGDNGFFKILRGED 318

Query: 294 ECGIESSITAGVPKLD 309
            CGIES I AGVP+ +
Sbjct: 319 HCGIESEIVAGVPRTE 334



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 20/30 (66%), Positives = 23/30 (76%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGG+P  AWRYW + G+VSGG Y S
Sbjct: 146 CGMGCNGGYPSGAWRYWTERGLVSGGLYDS 175


>gi|348587350|ref|XP_003479431.1| PREDICTED: cathepsin B-like [Cavia porcellus]
          Length = 340

 Score =  191 bits (485), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 100/211 (47%), Positives = 122/211 (57%), Gaps = 39/211 (18%)

Query: 99  TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
           T + +   G  GS  GCRPY I PCEHHVNGTRP C    G TPKC + C+  Y   YK+
Sbjct: 163 TRKGLVSGGLYGSHVGCRPYSIPPCEHHVNGTRPKCTGEGGDTPKCSKTCEPGYSPSYKE 222

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
           D  +G  SYSV S EK IM EIY++GPVE AF+VF D + YKSG +              
Sbjct: 223 DKYYGYSSYSVPSTEKEIMAEIYKNGPVEAAFSVFSDFLTYKSGVY-------------- 268

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
                                   + +G+ LGGHAIRILGWG++  +   YWL+ NSWN 
Sbjct: 269 -----------------------KHVAGEVLGGHAIRILGWGKE--NGVPYWLVGNSWNV 303

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPKLD 309
           DWGDNG FKILRG+D CGIES + AG+P+ D
Sbjct: 304 DWGDNGFFKILRGEDHCGIESEVVAGIPRTD 334



 Score = 51.2 bits (121), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 21/31 (67%), Positives = 25/31 (80%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           LCG GCNGG+P  AW+YW + G+VSGG YGS
Sbjct: 145 LCGEGCNGGYPTEAWKYWTRKGLVSGGLYGS 175


>gi|327322926|gb|AEA48884.1| cathepsin B [Oplegnathus fasciatus]
          Length = 330

 Score =  191 bits (485), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 95/194 (48%), Positives = 119/194 (61%), Gaps = 39/194 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY IAPCEHHVNG+RPSC    G TP+C+ +C+  Y   YK+D +FG  SY+V S+E
Sbjct: 176 GCRPYTIAPCEHHVNGSRPSCTGEGGDTPQCITKCEAGYTPSYKEDKHFGKTSYTVLSDE 235

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           + I  EI+++GPVEGAF V++D +LYKSG +                             
Sbjct: 236 EQIQSEIFKNGPVEGAFIVYEDFVLYKSGVY----------------------------- 266

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + SG A+GGHAI+ILGWG ++     YWL ANSWNTDWGDNG FK LRG D
Sbjct: 267 --------QHVSGSAVGGHAIKILGWGVEDGV--PYWLCANSWNTDWGDNGFFKFLRGSD 316

Query: 294 ECGIESSITAGVPK 307
            CGIES + AG+PK
Sbjct: 317 HCGIESEVVAGIPK 330



 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 19/30 (63%), Positives = 22/30 (73%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGG+P  AW +W K G+VSGG Y S
Sbjct: 144 CGMGCNGGYPSAAWDFWTKEGLVSGGLYDS 173


>gi|158261501|dbj|BAF82928.1| unnamed protein product [Homo sapiens]
          Length = 339

 Score =  191 bits (485), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 96/196 (48%), Positives = 122/196 (62%), Gaps = 40/196 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D ++G  SYSVS++E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 236

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GP EGAF+V+ D +LYKSG +                             
Sbjct: 237 KDIMAEIYKNGPAEGAFSVYSDFLLYKSGVY----------------------------- 267

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G+ +GGHAIRILGWG +  +   YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 317

Query: 294 ECGIESSITAGVPKLD 309
            CGIES + AG+P+ D
Sbjct: 318 HCGIESEVVAGIPRTD 333



 Score = 46.6 bits (109), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 18/31 (58%), Positives = 23/31 (74%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           +CG GCNGG+P  AW +W + G+VSGG Y S
Sbjct: 145 MCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 175


>gi|345790427|ref|XP_543203.3| PREDICTED: cathepsin B [Canis lupus familiaris]
          Length = 339

 Score =  191 bits (485), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 97/196 (49%), Positives = 120/196 (61%), Gaps = 40/196 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D ++G  SYSVS NE
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPSYKEDKHYGCSSYSVSDNE 236

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVE AFTV+ D +LYKSG +                             
Sbjct: 237 KEIMAEIYKNGPVEAAFTVYSDFLLYKSGVY----------------------------- 267

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G+ +GGHA+RILGWG ++ +   YWL+ NSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVTGEMMGGHAVRILGWGVEDGT--PYWLVGNSWNTDWGDNGFFKILRGRD 317

Query: 294 ECGIESSITAGVPKLD 309
            CGIES I AG+P  D
Sbjct: 318 HCGIESEIVAGIPCTD 333



 Score = 48.1 bits (113), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 20/30 (66%), Positives = 22/30 (73%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGGFP  AW +W K G+VSGG Y S
Sbjct: 146 CGDGCNGGFPAEAWNFWTKQGLVSGGLYDS 175


>gi|344281458|ref|XP_003412496.1| PREDICTED: cathepsin B-like [Loxodonta africana]
          Length = 340

 Score =  191 bits (484), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 95/196 (48%), Positives = 119/196 (60%), Gaps = 39/196 (19%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C    G TPKC + C+  Y   YK+D ++G  SY V S+E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPCKGEGGETPKCSKTCEPGYSPSYKEDKHYGYSSYGVPSSE 237

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           + IM EIY++GPVEGAF+V+ D ++YKSG +                             
Sbjct: 238 QEIMAEIYKNGPVEGAFSVYTDFLVYKSGVY----------------------------- 268

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G+ +GGHAIRILGWG +  +   YWL ANSWNTDWGDNG FKILRG+D
Sbjct: 269 --------QHVTGEEVGGHAIRILGWGVENGT--PYWLAANSWNTDWGDNGFFKILRGQD 318

Query: 294 ECGIESSITAGVPKLD 309
            CGIES I AG+P+ D
Sbjct: 319 HCGIESEIVAGIPRTD 334



 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 20/30 (66%), Positives = 22/30 (73%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGGFP  AW +W K G+VSGG Y S
Sbjct: 146 CGDGCNGGFPAGAWNFWTKKGLVSGGLYDS 175


>gi|403307501|ref|XP_003944231.1| PREDICTED: cathepsin B [Saimiri boliviensis boliviensis]
          Length = 351

 Score =  190 bits (483), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 95/196 (48%), Positives = 122/196 (62%), Gaps = 40/196 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D ++G  SYSVS++E
Sbjct: 190 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKSCEPGYTPTYKQDKHYGYNSYSVSNSE 248

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           + IM EIY++GPVEGAF+V+ D +LYKSG +                             
Sbjct: 249 RDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 279

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G+ +GGHAIRILGWG +  +   YWL+ NSWNTDWGDNG FKILRG+D
Sbjct: 280 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVGNSWNTDWGDNGFFKILRGQD 329

Query: 294 ECGIESSITAGVPKLD 309
            CGIES + AG+P+ D
Sbjct: 330 HCGIESEVVAGIPRTD 345



 Score = 46.6 bits (109), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 18/31 (58%), Positives = 23/31 (74%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           +CG GCNGG+P  AW +W + G+VSGG Y S
Sbjct: 157 MCGDGCNGGYPAEAWNFWTRKGLVSGGLYDS 187


>gi|226821413|gb|ACO82382.1| cathepsin B [Lutjanus argentimaculatus]
          Length = 330

 Score =  190 bits (483), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 95/194 (48%), Positives = 118/194 (60%), Gaps = 39/194 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C    G TP+C+ +C+  Y   Y++D ++G  SYSV S+E
Sbjct: 176 GCRPYTIPPCEHHVNGSRPPCTGEGGDTPQCLSQCEAGYTPSYREDKHYGKTSYSVLSDE 235

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
             I  EIY++GPVEGAFTV++D +LYKSG +                             
Sbjct: 236 AEIQYEIYKNGPVEGAFTVYEDFVLYKSGVY----------------------------- 266

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + SG A+GGHAI++LGWGE+  +   YWL ANSWNTDWGDNG FK LRG D
Sbjct: 267 --------QHVSGSAVGGHAIKVLGWGEE--NGVPYWLCANSWNTDWGDNGFFKFLRGSD 316

Query: 294 ECGIESSITAGVPK 307
            CGIES I AG+PK
Sbjct: 317 HCGIESEIVAGIPK 330



 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 19/30 (63%), Positives = 22/30 (73%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGG+P  AW +W K G+VSGG Y S
Sbjct: 144 CGMGCNGGYPSAAWDFWTKEGLVSGGLYDS 173


>gi|327281751|ref|XP_003225610.1| PREDICTED: cathepsin B-like [Anolis carolinensis]
          Length = 330

 Score =  190 bits (482), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 93/194 (47%), Positives = 118/194 (60%), Gaps = 39/194 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHH NGTRP C    G TP+CV++C++ Y   YK+D ++G  SY +  +E
Sbjct: 168 GCRPYSIPPCEHHTNGTRPPCSGEGGETPECVKKCEDGYTPAYKQDKHYGVTSYGIPRSE 227

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAF V+ D ++YKSG +                             
Sbjct: 228 KEIMAEIYKNGPVEGAFVVYSDFLMYKSGVY----------------------------- 258

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + SG+ +GGHAIRILGWG D  +   YWL ANSWNTDWG++G F+ILRG+D
Sbjct: 259 --------QHVSGEEVGGHAIRILGWGVDNGT--PYWLAANSWNTDWGEDGFFRILRGQD 308

Query: 294 ECGIESSITAGVPK 307
            CGIES I AG+PK
Sbjct: 309 HCGIESEIVAGIPK 322



 Score = 48.1 bits (113), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 19/30 (63%), Positives = 23/30 (76%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGG+P  AW+YW + G+VSGG Y S
Sbjct: 136 CGMGCNGGYPSGAWKYWTEKGLVSGGLYDS 165


>gi|27882093|gb|AAH44517.1| Zgc:55862 [Danio rerio]
          Length = 330

 Score =  190 bits (482), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 97/193 (50%), Positives = 117/193 (60%), Gaps = 39/193 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C    G TP C  +C+  Y   YK+D +FG  SYSV SN+
Sbjct: 176 GCRPYTIEPCEHHVNGSRPPCTGEGGDTPNCDMKCEPGYSPLYKEDKHFGKTSYSVPSNQ 235

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
             IM E++++GPVE AFTV++D +LYKSG +                             
Sbjct: 236 NGIMAELFKNGPVEAAFTVYEDFLLYKSGVY----------------------------- 266

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + SG ALGGHAI+ILGWGE+  +   YWL ANSWNTDWGDNG FKILRG+D
Sbjct: 267 --------QHMSGSALGGHAIKILGWGEE--NGVPYWLAANSWNTDWGDNGYFKILRGED 316

Query: 294 ECGIESSITAGVP 306
            CGIES I AG+P
Sbjct: 317 HCGIESEIVAGIP 329



 Score = 45.4 bits (106), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 17/30 (56%), Positives = 21/30 (70%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGG+P  AW +W   G+V+GG Y S
Sbjct: 144 CGMGCNGGYPSAAWDFWTTDGLVTGGLYNS 173


>gi|449267314|gb|EMC78276.1| Cathepsin B [Columba livia]
          Length = 340

 Score =  190 bits (482), Expect = 9e-46,   Method: Compositional matrix adjust.
 Identities = 94/196 (47%), Positives = 117/196 (59%), Gaps = 39/196 (19%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C    G TP+C R C+  Y   YK+D ++G  SY V  +E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPCTGEGGETPRCSRHCEPGYSPSYKEDKHYGITSYGVPRSE 237

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAF V++D ++YKSG +                             
Sbjct: 238 KEIMAEIYKNGPVEGAFIVYEDFLMYKSGVY----------------------------- 268

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G+ +GGHAIR+LGWG D  +   YWL ANSWNTDWGDNG FKILRG+D
Sbjct: 269 --------QHVTGEQVGGHAIRLLGWGVDNGT--PYWLAANSWNTDWGDNGFFKILRGED 318

Query: 294 ECGIESSITAGVPKLD 309
            CGIES I AG+P  +
Sbjct: 319 HCGIESEIVAGIPSTE 334



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 20/30 (66%), Positives = 23/30 (76%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGG+P  AWRYW + G+VSGG Y S
Sbjct: 146 CGMGCNGGYPSGAWRYWTEKGLVSGGLYDS 175


>gi|395842321|ref|XP_003793966.1| PREDICTED: cathepsin B [Otolemur garnettii]
          Length = 339

 Score =  190 bits (482), Expect = 9e-46,   Method: Compositional matrix adjust.
 Identities = 97/196 (49%), Positives = 122/196 (62%), Gaps = 40/196 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP+C   +G TPKC + C+  Y   YK+D +FG  SYS+ +NE
Sbjct: 178 GCRPYSIPPCEHHVNGSRPAC-TGEGDTPKCSKTCEPGYSPTYKEDKHFGYTSYSLPTNE 236

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
             IM EIY++GPVEGAF+V+ D +LYKSG                               
Sbjct: 237 WEIMAEIYKNGPVEGAFSVYSDFLLYKSG------------------------------- 265

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
             V+  L    +G  +GGHAIRILGWGE+  +   YWL+ANSWNTDWGD G F+ILRG+D
Sbjct: 266 --VYQHL----TGDMMGGHAIRILGWGEE--NGVPYWLVANSWNTDWGDGGFFRILRGQD 317

Query: 294 ECGIESSITAGVPKLD 309
            CGIES + AG+P+ D
Sbjct: 318 HCGIESEVVAGIPRTD 333



 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 30/71 (42%), Positives = 47/71 (66%), Gaps = 5/71 (7%)

Query: 44  NSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREI 103
           ++  N+  ++LK   G      L   +LP+ + +++ D +LP +FD+R +W +CPTI+EI
Sbjct: 45  HNFRNVDMSYLKRLCGSF----LGGPKLPQRVKFAK-DMNLPKSFDAREQWSHCPTIKEI 99

Query: 104 RDQGSCGSCWG 114
           RDQGSCGSCW 
Sbjct: 100 RDQGSCGSCWA 110


>gi|417399216|gb|JAA46636.1| Putative cathepsin b [Desmodus rotundus]
          Length = 340

 Score =  189 bits (481), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 94/193 (48%), Positives = 117/193 (60%), Gaps = 39/193 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C    G TPKC + C+  Y   YK+D +FG  +YSV S+E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPCSGEGGDTPKCSKICEPGYSPSYKEDKHFGCDTYSVPSDE 237

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVE AF+V+ D +LYKSG +                             
Sbjct: 238 KEIMVEIYKNGPVEAAFSVYSDFLLYKSGVY----------------------------- 268

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G+ +GGHA+RILGWG +  +   YWL+ NSWNTDWGDNG FKILRG+D
Sbjct: 269 --------QHVTGEMVGGHAVRILGWGVENGT--PYWLVGNSWNTDWGDNGFFKILRGRD 318

Query: 294 ECGIESSITAGVP 306
            CGIES I AG+P
Sbjct: 319 HCGIESEIVAGIP 331



 Score = 46.6 bits (109), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 20/30 (66%), Positives = 22/30 (73%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGGFP  AW +W K G+VSGG Y S
Sbjct: 146 CGEGCNGGFPSGAWNFWKKQGLVSGGLYDS 175


>gi|147906534|ref|NP_001090927.1| cathepsin B precursor [Sus scrofa]
 gi|187470655|sp|A1E295.1|CATB_PIG RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
           light chain; Contains: RecName: Full=Cathepsin B heavy
           chain; Flags: Precursor
 gi|118490058|gb|ABK96810.1| cathepsin B [Sus scrofa]
          Length = 335

 Score =  189 bits (480), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 97/193 (50%), Positives = 117/193 (60%), Gaps = 40/193 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D +FG  SYS+S NE
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNE 236

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAFTV+ D + YKSG +                             
Sbjct: 237 KEIMAEIYKNGPVEGAFTVYSDFLQYKSGVY----------------------------- 267

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G  +GGHAIRILGWG +  +   YWL+ NSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVTGDLMGGHAIRILGWGVENGT--PYWLVGNSWNTDWGDNGFFKILRGQD 317

Query: 294 ECGIESSITAGVP 306
            CGIES I AG+P
Sbjct: 318 HCGIESEIVAGIP 330



 Score = 47.0 bits (110), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 20/30 (66%), Positives = 22/30 (73%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGGFP  AW +W K G+VSGG Y S
Sbjct: 146 CGDGCNGGFPSGAWNFWTKKGLVSGGLYDS 175


>gi|431918315|gb|ELK17542.1| Cathepsin B [Pteropus alecto]
          Length = 359

 Score =  189 bits (480), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 94/193 (48%), Positives = 117/193 (60%), Gaps = 39/193 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C    G TPKC R C+  Y   YK+D +FG  SYSV S+E
Sbjct: 201 GCRPYSIPPCEHHVNGSRPPCTGEGGSTPKCSRICEAGYTPSYKEDKHFGCSSYSVPSSE 260

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
             IM EIY++GPVE AF+V+ D +LYKSG +                             
Sbjct: 261 TEIMAEIYKNGPVEAAFSVYSDFLLYKSGVY----------------------------- 291

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G+ +GGHA+RILGWG ++ +   YWL+ NSWNTDWGD+G FKILRG+D
Sbjct: 292 --------QHVTGEMMGGHAVRILGWGVEDGT--PYWLVGNSWNTDWGDSGFFKILRGQD 341

Query: 294 ECGIESSITAGVP 306
            CGIES I AG+P
Sbjct: 342 HCGIESEIVAGLP 354



 Score = 47.4 bits (111), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 20/30 (66%), Positives = 22/30 (73%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGGFP  AW +W K G+VSGG Y S
Sbjct: 169 CGEGCNGGFPSGAWNFWTKKGLVSGGLYDS 198


>gi|171948776|gb|ACB59245.1| cathepsin B [Sus scrofa]
          Length = 335

 Score =  189 bits (480), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 97/193 (50%), Positives = 117/193 (60%), Gaps = 40/193 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D +FG  SYS+S NE
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNE 236

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAFTV+ D + YKSG +                             
Sbjct: 237 KEIMAEIYKNGPVEGAFTVYSDFLQYKSGVY----------------------------- 267

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G  +GGHAIRILGWG +  +   YWL+ NSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVTGDLMGGHAIRILGWGVENGT--PYWLVGNSWNTDWGDNGFFKILRGQD 317

Query: 294 ECGIESSITAGVP 306
            CGIES I AG+P
Sbjct: 318 HCGIESEIVAGIP 330



 Score = 47.0 bits (110), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 20/30 (66%), Positives = 22/30 (73%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGGFP  AW +W K G+VSGG Y S
Sbjct: 146 CGDGCNGGFPSGAWNFWTKKGLVSGGLYDS 175


>gi|432852559|ref|XP_004067308.1| PREDICTED: cathepsin B-like [Oryzias latipes]
          Length = 330

 Score =  189 bits (480), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 95/194 (48%), Positives = 115/194 (59%), Gaps = 39/194 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNGTRP C    G TP+C+ +C+  Y   YKKD ++G  SYSV +NE
Sbjct: 176 GCRPYTIPPCEHHVNGTRPPCTGEGGDTPQCINQCESGYTPSYKKDKHYGKTSYSVEANE 235

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
             I  EIY++GPVEGAF V++D  +YKSG +                             
Sbjct: 236 NQIQTEIYKNGPVEGAFMVYEDFPMYKSGVY----------------------------- 266

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + SG  +GGHAI+ILGWG ++     YWL ANSWNTDWGDNG FKILRG D
Sbjct: 267 --------QHVSGSLIGGHAIKILGWGVEDGV--PYWLCANSWNTDWGDNGYFKILRGSD 316

Query: 294 ECGIESSITAGVPK 307
            CGIES + AG+PK
Sbjct: 317 HCGIESEVVAGIPK 330



 Score = 46.6 bits (109), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 18/30 (60%), Positives = 22/30 (73%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGG+P  AW +W K G+V+GG Y S
Sbjct: 144 CGMGCNGGYPTAAWDFWTKEGLVTGGLYDS 173


>gi|443692853|gb|ELT94358.1| hypothetical protein CAPTEDRAFT_221292 [Capitella teleta]
          Length = 374

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 100/201 (49%), Positives = 122/201 (60%), Gaps = 40/201 (19%)

Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
           G  G+  GCRPY IAPCEHHVNGTR  C + +G TPKC R C++ Y V Y+ D NFG  +
Sbjct: 212 GQYGTHQGCRPYSIAPCEHHVNGTRLPC-SGEGPTPKCERTCEKGYKVKYEDDKNFGYTA 270

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           YSV ++EK IM EI  +GPVEGAFTV+ D   YKSG +                      
Sbjct: 271 YSVDNDEKQIMTEIMTNGPVEGAFTVYADFPTYKSGVY---------------------- 308

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
                           + SG  LGGHAIR+LGWG ++ +   YWL+ANSWN+DWGDNG F
Sbjct: 309 ---------------QHVSGGELGGHAIRVLGWGVEDGT--PYWLVANSWNSDWGDNGFF 351

Query: 287 KILRGKDECGIESSITAGVPK 307
           KILRG++ECGIE  I AG+PK
Sbjct: 352 KILRGQNECGIEGEIVAGLPK 372



 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 20/32 (62%), Positives = 24/32 (75%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CG GCNGGFP  AW Y+  +G+VSGG YG+ Q
Sbjct: 187 CGMGCNGGFPPAAWEYFRDTGLVSGGQYGTHQ 218


>gi|189096178|pdb|3CBJ|A Chain A, Chagasin-cathepsin B Complex
 gi|189096180|pdb|3CBK|A Chain A, Chagasin-Cathepsin B
          Length = 266

 Score =  188 bits (478), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 96/196 (48%), Positives = 121/196 (61%), Gaps = 40/196 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCE HVNG RP C   +G TPKC + C+  Y   YK+D ++G  SYSVS++E
Sbjct: 105 GCRPYSIPPCEAHVNGARPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 163

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAF+V+ D +LYKSG +                             
Sbjct: 164 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 194

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G+ +GGHAIRILGWG +  +   YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 195 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 244

Query: 294 ECGIESSITAGVPKLD 309
            CGIES + AG+P+ D
Sbjct: 245 HCGIESEVVAGIPRTD 260



 Score = 46.2 bits (108), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 18/31 (58%), Positives = 23/31 (74%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           +CG GCNGG+P  AW +W + G+VSGG Y S
Sbjct: 72  MCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 102


>gi|239792046|dbj|BAH72408.1| ACYPI000003 [Acyrthosiphon pisum]
          Length = 182

 Score =  188 bits (477), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 101/217 (46%), Positives = 127/217 (58%), Gaps = 39/217 (17%)

Query: 90  SRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQ 149
           S+ +  N    + I   G  GS  GC PYEIAPCEHHVNGTR  C    G TP CV++C+
Sbjct: 4   SQEQHGNYCKTKGIVSGGPYGSNMGCIPYEIAPCEHHVNGTRGPCKEG-GKTPTCVKKCE 62

Query: 150 ENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGN 209
           E Y VPY +DL+ G  +YS+ ++   I +EIY +GPVEGAFTV++D I Y++G +     
Sbjct: 63  EGYKVPYAQDLHHGKSAYSIRNDVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVY----- 117

Query: 210 ETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKY 269
                                            + +GKALGGHAIRILGWG  +  +  Y
Sbjct: 118 --------------------------------KHVAGKALGGHAIRILGWGV-QNGEIPY 144

Query: 270 WLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           WL+ANSWNTDWG +G FKILRG DECGIE  I AG+P
Sbjct: 145 WLVANSWNTDWGSDGFFKILRGSDECGIEGQINAGLP 181


>gi|395507317|ref|XP_003757972.1| PREDICTED: cathepsin B [Sarcophilus harrisii]
          Length = 342

 Score =  188 bits (477), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 94/196 (47%), Positives = 118/196 (60%), Gaps = 39/196 (19%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP+C    G TPKC ++C+  Y   YK D ++G  +Y+V S+E
Sbjct: 180 GCRPYSIPPCEHHVNGSRPACTGEGGDTPKCNKKCEAGYSPDYKDDKHYGTTAYNVPSSE 239

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAF V+ D + YKSG +                             
Sbjct: 240 KEIMAEIYKNGPVEGAFIVYADFLQYKSGVY----------------------------- 270

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G  LGGHAIR+LGWG ++     YWL ANSWNTDWGDNG FKILRGKD
Sbjct: 271 --------QHVTGDMLGGHAIRVLGWGVEDGV--PYWLAANSWNTDWGDNGFFKILRGKD 320

Query: 294 ECGIESSITAGVPKLD 309
            CGIES + AG+P+ +
Sbjct: 321 HCGIESEMVAGIPRTE 336



 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 29/75 (38%), Positives = 36/75 (48%), Gaps = 19/75 (25%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAY-------------------GSKQAEKNSLSNI 49
           CG GCNGGFP  AW+YW+K G+VSGG Y                   GS+ A      + 
Sbjct: 148 CGEGCNGGFPAGAWKYWIKKGLVSGGLYDSHVGCRPYSIPPCEHHVNGSRPACTGEGGDT 207

Query: 50  PRAHLKSWMGVHPDY 64
           P+ + K   G  PDY
Sbjct: 208 PKCNKKCEAGYSPDY 222


>gi|116177489|gb|ABJ80691.1| cathepsin B [Hippoglossus hippoglossus]
          Length = 330

 Score =  187 bits (475), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 93/194 (47%), Positives = 115/194 (59%), Gaps = 39/194 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C    G TPKCV  C+  Y   Y KD ++G  SYSV ++ 
Sbjct: 176 GCRPYTIPPCEHHVNGSRPHCSGEGGDTPKCVHSCEAGYSPTYTKDKHYGKSSYSVEASV 235

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           + I  EI ++GPVEGAF V++D ++YKSG +                             
Sbjct: 236 EQIQAEISQNGPVEGAFIVYEDFVMYKSGVY----------------------------- 266

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G ALGGHAI++LGWGE++     YWL ANSWNTDWG+NG FKILRG D
Sbjct: 267 --------QHTTGSALGGHAIKVLGWGEEDGV--PYWLCANSWNTDWGENGFFKILRGSD 316

Query: 294 ECGIESSITAGVPK 307
            CGIES I AG+PK
Sbjct: 317 HCGIESEIVAGIPK 330



 Score = 48.1 bits (113), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 19/30 (63%), Positives = 22/30 (73%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGG+P  AW +W K G+VSGG Y S
Sbjct: 144 CGMGCNGGYPSAAWDFWTKEGLVSGGLYNS 173


>gi|187097096|ref|NP_001119608.1| cathepsin B-348 precursor [Acyrthosiphon pisum]
 gi|161343833|tpg|DAA06097.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 342

 Score =  187 bits (474), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 99/206 (48%), Positives = 123/206 (59%), Gaps = 39/206 (18%)

Query: 101 REIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDL 160
           + I   G  GS  GC PYEIAPCEHHVNGTR  C    G TP CV++C+E Y VPY +DL
Sbjct: 175 KGIVSGGPYGSNMGCIPYEIAPCEHHVNGTRGPCKEG-GKTPTCVKKCEEGYKVPYAQDL 233

Query: 161 NFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWT 220
           + G  +YS+ ++   I +EIY +GPVEGAFTV++D I Y++G +                
Sbjct: 234 HHGKSAYSIRNDVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVY---------------- 277

Query: 221 IRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDW 280
                                 + +GKALGGHAIRILGWG  +  +  YWL+ANSWNTDW
Sbjct: 278 ---------------------KHVAGKALGGHAIRILGWGV-QNGEIPYWLVANSWNTDW 315

Query: 281 GDNGLFKILRGKDECGIESSITAGVP 306
           G +G FKILRG DECGIE  I AG+P
Sbjct: 316 GSDGFFKILRGSDECGIEGQINAGLP 341



 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 24/32 (75%), Positives = 24/32 (75%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CGFGCNGGFPG AW YW   GIVSGG YGS  
Sbjct: 156 CGFGCNGGFPGAAWNYWKTKGIVSGGPYGSNM 187


>gi|321452279|gb|EFX63703.1| hypothetical protein DAPPUDRAFT_306608 [Daphnia pulex]
          Length = 340

 Score =  187 bits (474), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 124/337 (36%), Positives = 154/337 (45%), Gaps = 110/337 (32%)

Query: 40  QAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEV--DEDLPANFDSRTKWPNC 97
           +A +N   N P   L   MGVHPD NL    +P L   S++  ++ +P  FD+R +WP+C
Sbjct: 46  KAGRNFGKNFPMGALTQMMGVHPDSNL---YMPPLKNVSQMYSNQAIPEAFDAREQWPDC 102

Query: 98  PTIREIRDQGSCGSCWGCRPYEIA--------------------------PCEHHVNGTR 131
           PTI+EIRDQGSCGSCW     E                             C    NG  
Sbjct: 103 PTIQEIRDQGSCGSCWAFGAVEAMSDRICIHSKGEVNAHLSAENLVSCCYTCGFGCNGGF 162

Query: 132 PSC----------------DASKGHTPKCVRECQEN------------------------ 151
           P                  ++S+G  P  +  C+ +                        
Sbjct: 163 PGAAWSHWVKKGIVTGGNFNSSQGCQPYIIPACEHHTTGDRPPCSEGGGTPKCLKTCEDG 222

Query: 152 YDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNET 211
           Y V Y +DL++GA SYSV    + I  EI  +GPVEGA TV++D   YKSG +       
Sbjct: 223 YTVDYTQDLHYGASSYSVHKRMEDIQLEIMNNGPVEGALTVYEDFPTYKSGVY------- 275

Query: 212 TAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWL 271
                                          +  GKALGGHAIRILGWG +E     YWL
Sbjct: 276 ------------------------------QHVHGKALGGHAIRILGWGVEEGV--PYWL 303

Query: 272 IANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           IANSWNTDWGDNG  K+LRGKD CGIES ITAG+PKL
Sbjct: 304 IANSWNTDWGDNGYIKLLRGKDHCGIESQITAGLPKL 340



 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 23/32 (71%), Positives = 26/32 (81%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CGFGCNGGFPG AW +WVK GIV+GG + S Q
Sbjct: 154 CGFGCNGGFPGAAWSHWVKKGIVTGGNFNSSQ 185


>gi|225708580|gb|ACO10136.1| Cathepsin B precursor [Osmerus mordax]
          Length = 329

 Score =  186 bits (473), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 98/199 (49%), Positives = 122/199 (61%), Gaps = 40/199 (20%)

Query: 110 GSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSV 169
           GS  GCRPY I PCEHHVNGTRP C   +G TPKC  +C + Y   Y+KD  FG K+YSV
Sbjct: 171 GSNKGCRPYSIPPCEHHVNGTRPPCQG-EGDTPKCQTKCIDGYTPAYEKDKYFGKKTYSV 229

Query: 170 SSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLG 229
            S ++ IM E+Y++GPVE AF+V++D +LYKSG                           
Sbjct: 230 PSKQEQIMTELYKNGPVEAAFSVYEDFLLYKSG--------------------------- 262

Query: 230 AEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKIL 289
                 V+  L    +G  LGGHAI+ILGWG++  +   YWL ANSWNTDWG+ G FKIL
Sbjct: 263 ------VYQHL----TGDMLGGHAIKILGWGKENNT--PYWLAANSWNTDWGNQGFFKIL 310

Query: 290 RGKDECGIESSITAGVPKL 308
           RG DECGIES + AG+P+L
Sbjct: 311 RGGDECGIESEVVAGIPQL 329



 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 20/32 (62%), Positives = 24/32 (75%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CG GC GG+P  AW YW KSG+V+GG YGS +
Sbjct: 143 CGMGCFGGYPSAAWEYWAKSGLVTGGLYGSNK 174


>gi|195165479|ref|XP_002023566.1| GL19846 [Drosophila persimilis]
 gi|194105700|gb|EDW27743.1| GL19846 [Drosophila persimilis]
          Length = 329

 Score =  186 bits (473), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 107/210 (50%), Positives = 121/210 (57%), Gaps = 48/210 (22%)

Query: 99  TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
           T + I   G  GS  GCRPYEIAPCEHHVNGTRP C  S G TP C  +CQ +Y V Y K
Sbjct: 168 TRKGIVSGGPYGSTQGCRPYEIAPCEHHVNGTRPPC--SHGSTPSCQHKCQASYSVEYAK 225

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
           D NFG+KSYSV  N   I +EI  +GPVEGAFTV++DLILYKSG +              
Sbjct: 226 DKNFGSKSYSVRRNVAEIQQEIMTNGPVEGAFTVYEDLILYKSGVY-------------- 271

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
                                   ++ GK LGGHAIRILGWG   +SK  YWLI NSWNT
Sbjct: 272 -----------------------QHEHGKELGGHAIRILGWGVWGESKVPYWLIGNSWNT 308

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPKL 308
           DWGDN         D CGIESSI+AG+  L
Sbjct: 309 DWGDN---------DHCGIESSISAGLSHL 329



 Score = 77.4 bits (189), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 36/77 (46%), Positives = 48/77 (62%), Gaps = 3/77 (3%)

Query: 40  QAEKNSLSNIPRAHLKSWMGVHPD---YNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
           Q  +N   ++   +++  MGVHPD   + LP  R+     Y++   D+P  FD+R  WPN
Sbjct: 39  QVGRNFKESVSEEYIRGLMGVHPDAHKFALPEKRIVLGDLYADDGIDIPEEFDARKAWPN 98

Query: 97  CPTIREIRDQGSCGSCW 113
           CPTI EIRDQGSCGSCW
Sbjct: 99  CPTIGEIRDQGSCGSCW 115



 Score = 61.2 bits (147), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 25/34 (73%), Positives = 27/34 (79%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
            +CGFGCNGGFPG AW YW + GIVSGG YGS Q
Sbjct: 149 HICGFGCNGGFPGAAWSYWTRKGIVSGGPYGSTQ 182


>gi|410916585|ref|XP_003971767.1| PREDICTED: cathepsin B-like [Takifugu rubripes]
          Length = 328

 Score =  186 bits (473), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 98/208 (47%), Positives = 123/208 (59%), Gaps = 40/208 (19%)

Query: 99  TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
           T + +   G CGS  GCRPY IAPCEHHVNGTRP C  ++  TPKC ++C + Y   Y K
Sbjct: 159 TKKGLVTGGLCGSEVGCRPYSIAPCEHHVNGTRPPCQGTQ-ETPKCEKKCIDGYLTSYLK 217

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
           D +FG +SYS+ S ++ IM E+Y++GPVE AFTV+ D +LYK+G +              
Sbjct: 218 DKHFGKRSYSLPSQQEQIMTELYKNGPVEAAFTVYADFLLYKTGVY-------------- 263

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
                                   + +G+ LGGHAI+ILGWGE+  S   YWL ANSWN 
Sbjct: 264 -----------------------QHVTGEVLGGHAIKILGWGEE--SGTPYWLAANSWNG 298

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVP 306
           DWGD G FKI RG DECGIES + AG P
Sbjct: 299 DWGDKGFFKIKRGNDECGIESEMVAGTP 326



 Score = 45.4 bits (106), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 17/31 (54%), Positives = 23/31 (74%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
           CG GC+GG+P  AW +W K G+V+GG  GS+
Sbjct: 142 CGMGCSGGYPSSAWEFWTKKGLVTGGLCGSE 172


>gi|330434688|gb|AEC22812.1| cathepsin B [Macrobrachium nipponense]
          Length = 331

 Score =  186 bits (472), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 100/207 (48%), Positives = 123/207 (59%), Gaps = 40/207 (19%)

Query: 103 IRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
           I   G+  S  GC+PYEIAPCEHHV+G RP C A  G TPKC + C+ NY V Y+ DL+ 
Sbjct: 165 IVSGGAFNSTQGCQPYEIAPCEHHVSGPRPKC-AEGGSTPKCHKNCESNYVVDYESDLHH 223

Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
           G+K YSV  +E  I  +I  +GPVEGAFTV+ D + YKSG +                  
Sbjct: 224 GSKHYSVDKDETQIKYDIMTNGPVEGAFTVYVDFLHYKSGVY------------------ 265

Query: 223 DNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
                               +  G  LGGHAIR+LGWGE++ +   YWL ANSWNTDWGD
Sbjct: 266 -------------------QHTHGLPLGGHAIRVLGWGEEDGT--PYWLCANSWNTDWGD 304

Query: 283 NGLFKILRGKDECGIESSITAGVPKLD 309
           NG FKILRG D CGIES I+AG+PK++
Sbjct: 305 NGYFKILRGSDHCGIESEISAGLPKVE 331



 Score = 60.8 bits (146), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 26/34 (76%), Positives = 29/34 (85%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
            LCGFGCNGGFPG A++YWV SGIVSGGA+ S Q
Sbjct: 142 HLCGFGCNGGFPGAAFQYWVHSGIVSGGAFNSTQ 175


>gi|25988674|gb|AAN76202.1| lysosomal cysteine proteinase cathepsin B/green fluorescent protein
           EGFP fusion protein [synthetic construct]
          Length = 578

 Score =  186 bits (472), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 95/194 (48%), Positives = 118/194 (60%), Gaps = 40/194 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D ++G  SYSVS +E
Sbjct: 178 GCLPYTIPPCEHHVNGSRPPC-TGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSE 236

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAFTVF D + YKSG +                             
Sbjct: 237 KEIMAEIYKNGPVEGAFTVFSDFLTYKSGVY----------------------------- 267

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    +++G  +GGHAIRILGWG +  +   YWL+ANSWN DWGDNG FKILRG++
Sbjct: 268 --------KHEAGDVMGGHAIRILGWGIE--NGVPYWLVANSWNVDWGDNGFFKILRGEN 317

Query: 294 ECGIESSITAGVPK 307
            CGIES I AG+P+
Sbjct: 318 HCGIESEIVAGIPR 331



 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 37/75 (49%), Positives = 49/75 (65%), Gaps = 6/75 (8%)

Query: 40  QAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPT 99
           QA +N   N+  ++LK   G      L   +LPE +G+SE D +LP +FD+R +W NCPT
Sbjct: 42  QAGRN-FYNVDISYLKKLCGT----VLGGPKLPERVGFSE-DINLPESFDAREQWSNCPT 95

Query: 100 IREIRDQGSCGSCWG 114
           I +IRDQGSCGSCW 
Sbjct: 96  IAQIRDQGSCGSCWA 110



 Score = 47.0 bits (110), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 18/30 (60%), Positives = 22/30 (73%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGG+P  AW +W + G+VSGG Y S
Sbjct: 146 CGDGCNGGYPSGAWNFWTRKGLVSGGVYNS 175


>gi|51038793|gb|AAT94175.1| cathepsin B [Paralichthys olivaceus]
 gi|121053785|gb|ABM47001.1| cathepsin B [Paralichthys olivaceus]
          Length = 330

 Score =  185 bits (470), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 92/194 (47%), Positives = 116/194 (59%), Gaps = 39/194 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I+PCEHHVNG+RP C    G TP+C+  C+  Y   YK+D ++G  SYSV  + 
Sbjct: 176 GCRPYTISPCEHHVNGSRPPCTGEGGDTPECISRCEAGYSPSYKQDKHYGKSSYSVEGSV 235

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           + I  EI ++GPVEGAFTV++D ++YKSG +                             
Sbjct: 236 EQIQAEISKNGPVEGAFTVYEDFVMYKSGVY----------------------------- 266

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + SG  LGGHAI++LGWGE++     YWL ANSWNTDWGDNG FKILRG +
Sbjct: 267 --------QHVSGSVLGGHAIKVLGWGEEDGI--PYWLCANSWNTDWGDNGFFKILRGSN 316

Query: 294 ECGIESSITAGVPK 307
            CGIES I AG+PK
Sbjct: 317 HCGIESEIVAGIPK 330



 Score = 47.4 bits (111), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 19/30 (63%), Positives = 22/30 (73%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGG+P  AW +W K G+VSGG Y S
Sbjct: 144 CGMGCNGGYPSSAWDFWTKEGLVSGGLYNS 173


>gi|389608541|dbj|BAM17880.1| cathepsin B [Papilio xuthus]
          Length = 334

 Score =  185 bits (470), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 100/203 (49%), Positives = 120/203 (59%), Gaps = 40/203 (19%)

Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
           GS  S  GCRPYEI PCEHHV G R  C      TPKCV+EC+  Y VPYK+D ++G   
Sbjct: 172 GSYNSSQGCRPYEIPPCEHHVPGNRLPCSGDT-KTPKCVKECESGYKVPYKQDKHYGKHV 230

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           YSV   E  I  E+Y++GPVEGAFTV+ DL+ YKSG +                      
Sbjct: 231 YSVRGGEDHIKAELYKNGPVEGAFTVYADLLSYKSGVY---------------------- 268

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
                           + +G ALGGHAI+I+GWG +  +  KYWLIANSWN+DWGDNG F
Sbjct: 269 ---------------KHVTGDALGGHAIKIMGWGVE--NGNKYWLIANSWNSDWGDNGFF 311

Query: 287 KILRGKDECGIESSITAGVPKLD 309
           KILRG+D CGIESSI AG P  +
Sbjct: 312 KILRGEDHCGIESSIVAGEPLFN 334



 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 27/65 (41%), Positives = 36/65 (55%), Gaps = 12/65 (18%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLP 67
           +CG GCNGG P +AW YW   G+VSGG+Y S Q  +     IP            ++++P
Sbjct: 146 ICGLGCNGGMPTLAWEYWKHFGLVSGGSYNSSQGCRP--YEIPPC----------EHHVP 193

Query: 68  ANRLP 72
            NRLP
Sbjct: 194 GNRLP 198


>gi|46195455|ref|NP_990702.1| cathepsin B precursor [Gallus gallus]
 gi|1168790|sp|P43233.1|CATB_CHICK RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Contains:
           RecName: Full=Cathepsin B light chain; Contains:
           RecName: Full=Cathepsin B heavy chain; Flags: Precursor
 gi|603203|gb|AAA87075.1| cathepsin B [Gallus gallus]
          Length = 340

 Score =  184 bits (468), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 93/196 (47%), Positives = 116/196 (59%), Gaps = 39/196 (19%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCR Y I PCEHHVNG+RP C    G TP+C R C+  Y   YK+D ++G  SY V  +E
Sbjct: 178 GCRAYTIPPCEHHVNGSRPPCTGEGGETPRCSRHCEPGYSPSYKEDKHYGITSYGVPRSE 237

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAF V++D ++YKSG +                             
Sbjct: 238 KEIMAEIYKNGPVEGAFIVYEDFLMYKSGVY----------------------------- 268

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + SG+ +GGHAIRILGWG +  +   YWL ANSWNTDWG  G FKILRG+D
Sbjct: 269 --------QHVSGEQVGGHAIRILGWGVENGT--PYWLAANSWNTDWGITGFFKILRGED 318

Query: 294 ECGIESSITAGVPKLD 309
            CGIES I AGVP+++
Sbjct: 319 HCGIESEIVAGVPRME 334



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 20/30 (66%), Positives = 23/30 (76%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGG+P  AWRYW + G+VSGG Y S
Sbjct: 146 CGMGCNGGYPSGAWRYWTERGLVSGGLYDS 175


>gi|1705630|sp|P00787.2|CATB_RAT RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; AltName:
           Full=RSG-2; Contains: RecName: Full=Cathepsin B light
           chain; Contains: RecName: Full=Cathepsin B heavy chain;
           Flags: Precursor
 gi|1524328|emb|CAA57792.1| cathepsin b [Rattus norvegicus]
          Length = 339

 Score =  184 bits (468), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 95/194 (48%), Positives = 118/194 (60%), Gaps = 40/194 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D ++G  SYSVS +E
Sbjct: 178 GCLPYTIPPCEHHVNGSRPPC-TGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSE 236

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAFTVF D + YKSG +                             
Sbjct: 237 KEIMAEIYKNGPVEGAFTVFSDFLTYKSGVY----------------------------- 267

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    +++G  +GGHAIRILGWG +  +   YWL+ANSWN DWGDNG FKILRG++
Sbjct: 268 --------KHEAGDVMGGHAIRILGWGIE--NGVPYWLVANSWNVDWGDNGFFKILRGEN 317

Query: 294 ECGIESSITAGVPK 307
            CGIES I AG+P+
Sbjct: 318 HCGIESEIVAGIPR 331



 Score = 45.8 bits (107), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 18/30 (60%), Positives = 22/30 (73%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGG+P  AW +W + G+VSGG Y S
Sbjct: 146 CGDGCNGGYPSGAWNFWTRKGLVSGGVYNS 175


>gi|82830420|ref|NP_072119.2| cathepsin B preproprotein [Rattus norvegicus]
 gi|47939014|gb|AAH72490.1| Cathepsin B [Rattus norvegicus]
 gi|149030258|gb|EDL85314.1| rCG52258, isoform CRA_a [Rattus norvegicus]
          Length = 339

 Score =  184 bits (468), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 95/194 (48%), Positives = 118/194 (60%), Gaps = 40/194 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D ++G  SYSVS +E
Sbjct: 178 GCLPYTIPPCEHHVNGSRPPC-TGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSE 236

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAFTVF D + YKSG +                             
Sbjct: 237 KEIMAEIYKNGPVEGAFTVFSDFLTYKSGVY----------------------------- 267

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    +++G  +GGHAIRILGWG +  +   YWL+ANSWN DWGDNG FKILRG++
Sbjct: 268 --------KHEAGDVMGGHAIRILGWGIE--NGVPYWLVANSWNVDWGDNGFFKILRGEN 317

Query: 294 ECGIESSITAGVPK 307
            CGIES I AG+P+
Sbjct: 318 HCGIESEIVAGIPR 331



 Score = 45.8 bits (107), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 18/30 (60%), Positives = 22/30 (73%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGG+P  AW +W + G+VSGG Y S
Sbjct: 146 CGDGCNGGYPSGAWNFWTRKGLVSGGVYNS 175


>gi|203648|gb|AAA40993.1| cathepsin (EC 3.4.22.1), partial [Rattus norvegicus]
          Length = 271

 Score =  184 bits (467), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 95/194 (48%), Positives = 118/194 (60%), Gaps = 40/194 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D ++G  SYSVS +E
Sbjct: 110 GCLPYTIPPCEHHVNGSRPPC-TGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSE 168

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAFTVF D + YKSG +                             
Sbjct: 169 KEIMAEIYKNGPVEGAFTVFSDFLTYKSGVY----------------------------- 199

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    +++G  +GGHAIRILGWG +  +   YWL+ANSWN DWGDNG FKILRG++
Sbjct: 200 --------KHEAGDVMGGHAIRILGWGIE--NGVPYWLVANSWNVDWGDNGFFKILRGEN 249

Query: 294 ECGIESSITAGVPK 307
            CGIES I AG+P+
Sbjct: 250 HCGIESEIVAGIPR 263



 Score = 45.4 bits (106), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 18/30 (60%), Positives = 22/30 (73%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGG+P  AW +W + G+VSGG Y S
Sbjct: 78  CGDGCNGGYPSGAWNFWTRKGLVSGGVYNS 107


>gi|1942645|pdb|1MIR|A Chain A, Rat Procathepsin B
 gi|1942646|pdb|1MIR|B Chain B, Rat Procathepsin B
          Length = 322

 Score =  184 bits (467), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 95/194 (48%), Positives = 117/194 (60%), Gaps = 40/194 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY I PCEHHVNG RP C   +G TPKC + C+  Y   YK+D ++G  SYSVS +E
Sbjct: 161 GCLPYTIPPCEHHVNGARPPC-TGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSE 219

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAFTVF D + YKSG +                             
Sbjct: 220 KEIMAEIYKNGPVEGAFTVFSDFLTYKSGVY----------------------------- 250

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    +++G  +GGHAIRILGWG +  +   YWL+ANSWN DWGDNG FKILRG++
Sbjct: 251 --------KHEAGDVMGGHAIRILGWGIE--NGVPYWLVANSWNADWGDNGFFKILRGEN 300

Query: 294 ECGIESSITAGVPK 307
            CGIES I AG+P+
Sbjct: 301 HCGIESEIVAGIPR 314



 Score = 45.8 bits (107), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 18/30 (60%), Positives = 22/30 (73%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGG+P  AW +W + G+VSGG Y S
Sbjct: 129 CGDGCNGGYPSGAWNFWTRKGLVSGGVYNS 158


>gi|1311050|pdb|1CPJ|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B-Inhibitor Complex: Implications For
           Structure- Based Inhibitor Design
 gi|1311051|pdb|1CPJ|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B-Inhibitor Complex: Implications For
           Structure- Based Inhibitor Design
 gi|1421561|pdb|1THE|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B- Inhibitor Complex: Implications For
           Structure-Based Inhibitor Design
 gi|1421562|pdb|1THE|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B- Inhibitor Complex: Implications For
           Structure-Based Inhibitor Design
          Length = 260

 Score =  184 bits (467), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 95/194 (48%), Positives = 117/194 (60%), Gaps = 40/194 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY I PCEHHVNG RP C   +G TPKC + C+  Y   YK+D ++G  SYSVS +E
Sbjct: 105 GCLPYTIPPCEHHVNGARPPC-TGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSE 163

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAFTVF D + YKSG +                             
Sbjct: 164 KEIMAEIYKNGPVEGAFTVFSDFLTYKSGVY----------------------------- 194

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    +++G  +GGHAIRILGWG +  +   YWL+ANSWN DWGDNG FKILRG++
Sbjct: 195 --------KHEAGDVMGGHAIRILGWGIE--NGVPYWLVANSWNADWGDNGFFKILRGEN 244

Query: 294 ECGIESSITAGVPK 307
            CGIES I AG+P+
Sbjct: 245 HCGIESEIVAGIPR 258



 Score = 45.4 bits (106), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 18/30 (60%), Positives = 22/30 (73%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGG+P  AW +W + G+VSGG Y S
Sbjct: 73  CGDGCNGGYPSGAWNFWTRKGLVSGGVYNS 102


>gi|1127275|pdb|1CTE|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B-Inhibitor Complex: Implications For
           Structure- Based Inhibitor Design
 gi|1127276|pdb|1CTE|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B-Inhibitor Complex: Implications For
           Structure- Based Inhibitor Design
          Length = 254

 Score =  184 bits (466), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 95/194 (48%), Positives = 117/194 (60%), Gaps = 40/194 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY I PCEHHVNG RP C   +G TPKC + C+  Y   YK+D ++G  SYSVS +E
Sbjct: 99  GCLPYTIPPCEHHVNGARPPC-TGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSE 157

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAFTVF D + YKSG +                             
Sbjct: 158 KEIMAEIYKNGPVEGAFTVFSDFLTYKSGVY----------------------------- 188

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    +++G  +GGHAIRILGWG +  +   YWL+ANSWN DWGDNG FKILRG++
Sbjct: 189 --------KHEAGDVMGGHAIRILGWGIE--NGVPYWLVANSWNADWGDNGFFKILRGEN 238

Query: 294 ECGIESSITAGVPK 307
            CGIES I AG+P+
Sbjct: 239 HCGIESEIVAGIPR 252



 Score = 45.4 bits (106), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 18/30 (60%), Positives = 22/30 (73%)

Query: 9  CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
          CG GCNGG+P  AW +W + G+VSGG Y S
Sbjct: 67 CGDGCNGGYPSGAWNFWTRKGLVSGGVYNS 96


>gi|410956528|ref|XP_003984894.1| PREDICTED: cathepsin B [Felis catus]
          Length = 339

 Score =  183 bits (465), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 92/193 (47%), Positives = 118/193 (61%), Gaps = 40/193 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D ++G  SYSVS++E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYTPSYKEDKHYGCNSYSVSNSE 236

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVE AF+VF D + YKSG +                             
Sbjct: 237 KEIMAEIYKNGPVEAAFSVFSDFLQYKSGVY----------------------------- 267

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G+ +GGHA+RILGWG +  +   YWL+ NSWNTDWGD+G FKILRG+D
Sbjct: 268 --------QHVTGEMMGGHAVRILGWGVENDT--PYWLVGNSWNTDWGDHGFFKILRGRD 317

Query: 294 ECGIESSITAGVP 306
            CGIES + AG+P
Sbjct: 318 HCGIESEVVAGIP 330



 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 20/30 (66%), Positives = 22/30 (73%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGGFP  AW +W K G+VSGG Y S
Sbjct: 146 CGDGCNGGFPAEAWNFWTKQGLVSGGLYDS 175


>gi|37788265|gb|AAO64472.1| cathepsin B precursor [Fundulus heteroclitus]
          Length = 330

 Score =  183 bits (464), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 92/194 (47%), Positives = 115/194 (59%), Gaps = 39/194 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C    G TP+CV +C+  Y   Y+KD ++G  SY V S E
Sbjct: 176 GCRPYTIEPCEHHVNGSRPPCTGEGGDTPECVTQCEAGYTPSYQKDKHYGKTSYGVPSEE 235

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           + I  EIY++GPVEGAF V++D   YKSG +                             
Sbjct: 236 EQIQSEIYKNGPVEGAFIVYEDFPSYKSGVY----------------------------- 266

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G ALGGHAI+++GWGE+  +   YWL ANSWNTDWGDNG FKILRG +
Sbjct: 267 --------QHVTGSALGGHAIKMIGWGEE--NGVPYWLCANSWNTDWGDNGFFKILRGSN 316

Query: 294 ECGIESSITAGVPK 307
            CGIES + AG+PK
Sbjct: 317 HCGIESEVVAGIPK 330



 Score = 46.2 bits (108), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 17/30 (56%), Positives = 22/30 (73%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGG+P  AW +W + G+V+GG Y S
Sbjct: 144 CGMGCNGGYPANAWEFWTEQGLVTGGLYNS 173


>gi|161343863|tpg|DAA06112.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 340

 Score =  182 bits (463), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 97/206 (47%), Positives = 123/206 (59%), Gaps = 39/206 (18%)

Query: 101 REIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDL 160
           + I   G  GS  GC PYEIAPCEHHVNGTR  C    G TP CV++C++ Y VPY +DL
Sbjct: 173 KGIVSGGPYGSKMGCIPYEIAPCEHHVNGTRGPCKEG-GKTPACVKKCEDGYKVPYAQDL 231

Query: 161 NFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWT 220
           + G  +YS+ ++   I +EIY +GPVEGAFTV++D I Y++G +                
Sbjct: 232 HRGKSAYSLGNDVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVY---------------- 275

Query: 221 IRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDW 280
                                 + +GKALGGHAIRILGWG  +  +  YWL+ANSWN+DW
Sbjct: 276 ---------------------KHVAGKALGGHAIRILGWGV-QNGEIPYWLVANSWNSDW 313

Query: 281 GDNGLFKILRGKDECGIESSITAGVP 306
           G +G FKILRG DECGIE  I AG+P
Sbjct: 314 GSDGFFKILRGSDECGIEGQINAGLP 339



 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 26/34 (76%), Positives = 26/34 (76%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           R CGFGCNGGFPG AW YW   GIVSGG YGSK 
Sbjct: 152 RTCGFGCNGGFPGAAWHYWKTKGIVSGGPYGSKM 185


>gi|56759588|gb|AAW28820.1| Parcxpwnx02 [Periplaneta americana]
          Length = 343

 Score =  182 bits (463), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 95/206 (46%), Positives = 123/206 (59%), Gaps = 41/206 (19%)

Query: 103 IRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
           I   GS  S  GC+PY I PCEHHVNGTR  C   +G TP+CV+ C+E YDVPY KD +F
Sbjct: 179 IVSGGSYNSHQGCQPYAIEPCEHHVNGTRKPC--GEGDTPRCVKRCEEGYDVPYGKDRHF 236

Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
           G  +Y+V  + K+I KE+  +GP E A TV+DD + Y++G +                  
Sbjct: 237 GKSAYAVPGSVKAIQKELLLNGPAEAALTVYDDFLHYRTGVY------------------ 278

Query: 223 DNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
                               + SG ALGGHA+R+LGWG ++ +   YWL+ANSWN DWGD
Sbjct: 279 -------------------QHVSGGALGGHAVRLLGWGVEDGT--PYWLLANSWNYDWGD 317

Query: 283 NGLFKILRGKDECGIESSITAGVPKL 308
           NG F+ILRG+DECGIES I  G+PK+
Sbjct: 318 NGYFRILRGQDECGIESDINGGLPKV 343



 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 24/33 (72%), Positives = 26/33 (78%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
           CGFGCNGG PG AW YWV +GIVSGG+Y S Q 
Sbjct: 158 CGFGCNGGEPGAAWDYWVSTGIVSGGSYNSHQG 190


>gi|126303983|ref|XP_001381634.1| PREDICTED: cathepsin B-like [Monodelphis domestica]
          Length = 337

 Score =  182 bits (462), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 95/203 (46%), Positives = 118/203 (58%), Gaps = 53/203 (26%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP+C   +G TP C ++C+E Y   YK D N+G+ SYSV S+E
Sbjct: 179 GCRPYSIPPCEHHVNGSRPACTGEEGDTPTCRKKCEEGYSTQYKDDKNYGSTSYSVPSSE 238

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           + IM EIY++GPV                                            EGA
Sbjct: 239 QEIMAEIYKNGPV--------------------------------------------EGA 254

Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
           F+V++D + YKSG       + LGGHAIRILGWG +  +  +YWL ANSWN DWGDNG F
Sbjct: 255 FSVYEDFLHYKSGVYQHVAGEMLGGHAIRILGWGVE--NGIRYWLAANSWNIDWGDNGFF 312

Query: 287 KILRGKDECGIESSITAGVPKLD 309
           K LRGK+ CGIES I AG+P+ D
Sbjct: 313 KFLRGKNHCGIESEIIAGIPRTD 335



 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 20/30 (66%), Positives = 22/30 (73%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGGFP  AW +W K G+VSGG Y S
Sbjct: 147 CGDGCNGGFPAGAWNFWTKKGLVSGGLYDS 176


>gi|6681079|ref|NP_031824.1| cathepsin B preproprotein [Mus musculus]
 gi|115712|sp|P10605.2|CATB_MOUSE RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Contains:
           RecName: Full=Cathepsin B light chain; Contains:
           RecName: Full=Cathepsin B heavy chain; Flags: Precursor
 gi|239907|gb|AAB20536.1| preprocathepsin B [Mus sp.]
 gi|309152|gb|AAA37375.1| cathepsin B [Mus musculus]
 gi|13879360|gb|AAH06656.1| Cathepsin B [Mus musculus]
 gi|26350521|dbj|BAC38900.1| unnamed protein product [Mus musculus]
 gi|74180941|dbj|BAE27751.1| unnamed protein product [Mus musculus]
 gi|74191261|dbj|BAE39458.1| unnamed protein product [Mus musculus]
 gi|74198944|dbj|BAE30691.1| unnamed protein product [Mus musculus]
 gi|74208073|dbj|BAE29144.1| unnamed protein product [Mus musculus]
 gi|148704123|gb|EDL36070.1| cathepsin B, isoform CRA_a [Mus musculus]
          Length = 339

 Score =  182 bits (462), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 95/196 (48%), Positives = 118/196 (60%), Gaps = 40/196 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY I PCEHHVNG+RP C   +G TP+C + C+  Y   YK+D +FG  SYSVS++ 
Sbjct: 178 GCLPYTIPPCEHHVNGSRPPC-TGEGDTPRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSV 236

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAFTVF D + YKSG +                             
Sbjct: 237 KEIMAEIYKNGPVEGAFTVFSDFLTYKSGVY----------------------------- 267

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    +++G  +GGHAIRILGWG +  +   YWL ANSWN DWGDNG FKILRG++
Sbjct: 268 --------KHEAGDMMGGHAIRILGWGVE--NGVPYWLAANSWNLDWGDNGFFKILRGEN 317

Query: 294 ECGIESSITAGVPKLD 309
            CGIES I AG+P+ D
Sbjct: 318 HCGIESEIVAGIPRTD 333



 Score = 46.2 bits (108), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 19/30 (63%), Positives = 22/30 (73%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGG+P  AW +W K G+VSGG Y S
Sbjct: 146 CGDGCNGGYPSGAWSFWTKKGLVSGGVYNS 175


>gi|308390275|gb|ADO32581.1| cathepsin B [Marsupenaeus japonicus]
          Length = 332

 Score =  181 bits (459), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 121/341 (35%), Positives = 159/341 (46%), Gaps = 117/341 (34%)

Query: 40  QAEKNSLSNIPRAHLKSWMGVHPD--YNLP---ANRLPELIGYSEVDEDLPANFDSRTKW 94
           +A +N   ++   + +  MGVHPD  Y++P   A+++PE       + D+P  FDSR  W
Sbjct: 38  EAGRNFNRHLSIRYFRRLMGVHPDSKYHMPGYEAHKIPE-------NFDMPKEFDSRAAW 90

Query: 95  PNCPTIREIRDQGSCGSCWGCRPYEIAP--------------------------CEHHVN 128
           P CPTI EIRDQGSCGSCW     E+                            C    N
Sbjct: 91  PMCPTIGEIRDQGSCGSCWAFGAVEVMSDRQCIHSKGKSNFHYSSENLVSCCHLCGFGCN 150

Query: 129 GTRP----------------SCDASKGHTPKCVRECQEN--------------------- 151
           G  P                S ++++G  P  +  C+ +                     
Sbjct: 151 GGFPGAAFKYWVHSGIVSGGSFNSTQGCQPYEIAPCEHHVPGPRPKCSEGGGTPKCVKRC 210

Query: 152 ---YDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPG 208
              Y V Y+ DL+ G K+YS+  +E  I  EI ++GPVEGAFTV+ D + YKSG +    
Sbjct: 211 ENGYTVDYESDLHHGGKAYSIMKDEDQIKYEIMKNGPVEGAFTVYVDFLHYKSGVY---- 266

Query: 209 NETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEK 268
                                             ++ G  LGGHAIRILGWGE+  +   
Sbjct: 267 ---------------------------------QHRHGLPLGGHAIRILGWGEENGT--P 291

Query: 269 YWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKLD 309
           YWL ANSWNTDWGDNGLFKILRG D CGIES I+AG+PKL+
Sbjct: 292 YWLCANSWNTDWGDNGLFKILRGSDHCGIESEISAGLPKLN 332



 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 25/35 (71%), Positives = 29/35 (82%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
            LCGFGCNGGFPG A++YWV SGIVSGG++ S Q 
Sbjct: 143 HLCGFGCNGGFPGAAFKYWVHSGIVSGGSFNSTQG 177


>gi|432946172|ref|XP_004083803.1| PREDICTED: cathepsin B-like [Oryzias latipes]
          Length = 330

 Score =  181 bits (458), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 89/193 (46%), Positives = 115/193 (59%), Gaps = 40/193 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY +APCEHHVNG+RP C      TPKCV +C   Y + Y KD +FG +SYS+ S +
Sbjct: 176 GCRPYTLAPCEHHVNGSRPPCQGEV-ETPKCVTQCNNGYSLSYPKDKHFGQRSYSIPSQQ 234

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           + IM E+Y++GPVE AF+V+ D +LYK+G +                             
Sbjct: 235 EQIMTELYKNGPVEAAFSVYADFLLYKNGVY----------------------------- 265

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G  LGGHA++ILGWGE+  +   YWL+ANSWN+DWGD G FKI RG D
Sbjct: 266 --------QHVTGDMLGGHAVKILGWGEENGT--PYWLVANSWNSDWGDKGFFKIKRGND 315

Query: 294 ECGIESSITAGVP 306
           ECGIES + AG P
Sbjct: 316 ECGIESEMVAGAP 328



 Score = 44.7 bits (104), Expect = 0.054,   Method: Compositional matrix adjust.
 Identities = 17/31 (54%), Positives = 21/31 (67%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
           CG GC GGFP  AW +W   G+V+GG + SK
Sbjct: 144 CGMGCFGGFPSAAWEFWTNKGLVTGGLFDSK 174


>gi|309202|gb|AAA37494.1| mouse preprocathepsin B [Mus musculus]
          Length = 339

 Score =  179 bits (454), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 94/196 (47%), Positives = 117/196 (59%), Gaps = 40/196 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY I PCEHHVNG+RP C   +G TP+C + C+  Y   YK+D +FG  SYSVS++ 
Sbjct: 178 GCLPYTIPPCEHHVNGSRPPC-TGEGDTPRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSV 236

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAFTVF D + YKSG +                             
Sbjct: 237 KEIMAEIYKNGPVEGAFTVFSDFLTYKSGVY----------------------------- 267

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    +++G  +GGHAIRIL WG +  +   YWL ANSWN DWGDNG FKILRG++
Sbjct: 268 --------KHEAGDMMGGHAIRILVWGVE--NGVPYWLAANSWNLDWGDNGFFKILRGEN 317

Query: 294 ECGIESSITAGVPKLD 309
            CGIES I AG+P+ D
Sbjct: 318 HCGIESEIVAGIPRTD 333



 Score = 46.2 bits (108), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 19/30 (63%), Positives = 22/30 (73%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGG+P  AW +W K G+VSGG Y S
Sbjct: 146 CGDGCNGGYPSGAWNFWTKKGLVSGGVYDS 175


>gi|74213457|dbj|BAE35542.1| unnamed protein product [Mus musculus]
          Length = 339

 Score =  179 bits (453), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 94/196 (47%), Positives = 117/196 (59%), Gaps = 40/196 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY I PCEHHVNG+RP C   +G T +C + C+  Y   YK+D +FG  SYSVS++ 
Sbjct: 178 GCLPYTIPPCEHHVNGSRPPC-TGEGDTHRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSV 236

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAFTVF D + YKSG +                             
Sbjct: 237 KEIMAEIYKNGPVEGAFTVFSDFLTYKSGVY----------------------------- 267

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    +++G  +GGHAIRILGWG +  +   YWL ANSWN DWGDNG FKILRG++
Sbjct: 268 --------KHEAGDMMGGHAIRILGWGVE--NGVPYWLAANSWNLDWGDNGFFKILRGEN 317

Query: 294 ECGIESSITAGVPKLD 309
            CGIES I AG+P+ D
Sbjct: 318 HCGIESEIVAGIPRTD 333



 Score = 46.2 bits (108), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 19/30 (63%), Positives = 22/30 (73%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGG+P  AW +W K G+VSGG Y S
Sbjct: 146 CGDGCNGGYPSGAWSFWTKKGLVSGGVYNS 175


>gi|74221319|dbj|BAE42140.1| unnamed protein product [Mus musculus]
          Length = 339

 Score =  179 bits (453), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 94/196 (47%), Positives = 116/196 (59%), Gaps = 40/196 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY I PCEHHVNG+RP C   +G TP+C + C+  Y   YK+D +FG  SYSVS++ 
Sbjct: 178 GCLPYTIPPCEHHVNGSRPPC-TGEGDTPRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSV 236

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++ PVEGAFTVF D + YKSG +                             
Sbjct: 237 KEIMAEIYKNDPVEGAFTVFSDFLTYKSGVY----------------------------- 267

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    +++G  +GGHAIRILGWG    +   YWL ANSWN DWGDNG FKILRG++
Sbjct: 268 --------KHEAGDMMGGHAIRILGWG--VGNGVPYWLAANSWNLDWGDNGFFKILRGEN 317

Query: 294 ECGIESSITAGVPKLD 309
            CGIES I AG+P+ D
Sbjct: 318 HCGIESEIVAGIPRTD 333



 Score = 46.2 bits (108), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 19/30 (63%), Positives = 22/30 (73%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGG+P  AW +W K G+VSGG Y S
Sbjct: 146 CGDGCNGGYPSGAWSFWTKKGLVSGGVYNS 175


>gi|225711544|gb|ACO11618.1| Cathepsin B precursor [Caligus rogercresseyi]
          Length = 332

 Score =  178 bits (451), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 99/210 (47%), Positives = 121/210 (57%), Gaps = 41/210 (19%)

Query: 99  TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQ-ENYDVPYK 157
           T + +   G  GS  GC+PY+I PCEHHVNGTR  C A  G TPKC R C+ ENY VPY 
Sbjct: 163 TSKGLVSGGLYGSHSGCQPYDIEPCEHHVNGTRQPC-AEGGRTPKCHRTCENENYSVPYD 221

Query: 158 KDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLI 217
           KDL+FG  SYS+ S+ K I  EI ++GPVE AF+V+ D +  KSG +             
Sbjct: 222 KDLSFGRSSYSIRSDPKQIQLEIMDNGPVEAAFSVYSDFMNDKSGVY------------- 268

Query: 218 KWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWN 277
                                    +  G  LGGHAIRILGWG ++ +   YWL+ANSWN
Sbjct: 269 ------------------------RHVKGSLLGGHAIRILGWGVEKGT--PYWLVANSWN 302

Query: 278 TDWGDNGLFKILRGKDECGIESSITAGVPK 307
           TDWGD G FKILRG D CGIE S+  G+P+
Sbjct: 303 TDWGDKGTFKILRGSDHCGIEGSVVTGLPR 332



 Score = 57.4 bits (137), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 23/30 (76%), Positives = 25/30 (83%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CGFGCNGGFPG AW+YW   G+VSGG YGS
Sbjct: 146 CGFGCNGGFPGAAWKYWTSKGLVSGGLYGS 175


>gi|225713216|gb|ACO12454.1| Cathepsin B precursor [Lepeophtheirus salmonis]
 gi|290561811|gb|ADD38303.1| Cathepsin B [Lepeophtheirus salmonis]
          Length = 333

 Score =  177 bits (450), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 97/199 (48%), Positives = 116/199 (58%), Gaps = 41/199 (20%)

Query: 110 GSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQ-ENYDVPYKKDLNFGAKSYS 168
           GS  GC+PY IAPCEHH NGTRP C    G TPKC   C+ E+Y +PY+KD +FG  SYS
Sbjct: 175 GSHKGCQPYAIAPCEHHANGTRPPCSGG-GRTPKCHTFCENEDYSLPYEKDKSFGRSSYS 233

Query: 169 VSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQL 228
           V S+ K I  EI  +GPVE AF+V+ D + YKSG +                        
Sbjct: 234 VKSDPKQIQLEIMNNGPVEAAFSVYSDFLNYKSGVY------------------------ 269

Query: 229 GAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKI 288
                         +  G  LGGHAIRILGWG +  +   YWL+ANSWNTDWGDNG FKI
Sbjct: 270 -------------RHVKGSLLGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGTFKI 314

Query: 289 LRGKDECGIESSITAGVPK 307
           L+G D CGIE SI AG+P+
Sbjct: 315 LKGSDHCGIEGSIVAGLPQ 333



 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 23/32 (71%), Positives = 26/32 (81%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CGFGCNGGFPG AW +W K G+VSGG YGS +
Sbjct: 147 CGFGCNGGFPGAAWSFWKKKGLVSGGLYGSHK 178


>gi|301776581|ref|XP_002923704.1| PREDICTED: cathepsin B-like [Ailuropoda melanoleuca]
 gi|281347694|gb|EFB23278.1| hypothetical protein PANDA_012896 [Ailuropoda melanoleuca]
          Length = 339

 Score =  177 bits (450), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 97/196 (49%), Positives = 120/196 (61%), Gaps = 40/196 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D ++G  SYSVSS+E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKFCEPGYTPSYKEDKHYGCSSYSVSSSE 236

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVE AFTV+ D +LYKSG +                             
Sbjct: 237 KEIMAEIYKNGPVEAAFTVYSDFLLYKSGVY----------------------------- 267

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G+ +GGHA+RILGWG +  +   YWL+ NSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVTGEMMGGHAVRILGWGVENGT--PYWLVGNSWNTDWGDNGFFKILRGRD 317

Query: 294 ECGIESSITAGVPKLD 309
            CGIES I AG+P  D
Sbjct: 318 HCGIESEIVAGIPCTD 333



 Score = 47.4 bits (111), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 20/30 (66%), Positives = 22/30 (73%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGGFP  AW +W K G+VSGG Y S
Sbjct: 146 CGDGCNGGFPAEAWNFWTKQGLVSGGLYES 175


>gi|154089579|gb|ABS57370.1| cathepsin B2 [Trichobilharzia regenti]
          Length = 344

 Score =  177 bits (449), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 93/195 (47%), Positives = 115/195 (58%), Gaps = 40/195 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PYE  PCEHHV G RPSC+     TPKC   CQ  Y++PY KD  +G   Y V SN+
Sbjct: 189 GCQPYEFPPCEHHVVGPRPSCEGDV-ETPKCKTTCQPGYNIPYNKDKWYGKTVYRVHSNQ 247

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           ++IMKE+ EHGPVE  F V+ D   YKSG +                             
Sbjct: 248 EAIMKEVKEHGPVEVDFEVYADFPNYKSGVY----------------------------- 278

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + SG  LGGHA+R+LGWGE+  +   YWLIANSWN+DWGDNG FKI+RG++
Sbjct: 279 --------QHVSGGLLGGHAVRLLGWGEE--NGVPYWLIANSWNSDWGDNGYFKIIRGRN 328

Query: 294 ECGIESSITAGVPKL 308
           ECGIES + AG+PKL
Sbjct: 329 ECGIESDVNAGIPKL 343



 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 30/62 (48%), Positives = 39/62 (62%), Gaps = 3/62 (4%)

Query: 54  LKSWMGVHPDYNLPANRLPEL-IGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSC 112
           ++  +G  PD N     LP L  GY+   ++LP  FD+R  WP+CP+I EIRDQ SCGSC
Sbjct: 63  IRRMLGALPDPN--GGHLPTLCTGYTPSLDELPKEFDARKYWPHCPSISEIRDQSSCGSC 120

Query: 113 WG 114
           W 
Sbjct: 121 WA 122



 Score = 46.6 bits (109), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 19/28 (67%), Positives = 21/28 (75%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
           CG GCNGGFP  AW YW +SGIV+G  Y
Sbjct: 157 CGMGCNGGFPHSAWSYWKRSGIVTGDLY 184


>gi|325302580|dbj|BAJ83490.1| cathepsin B-like peptidase [Echinococcus multilocularis]
          Length = 351

 Score =  177 bits (448), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 93/193 (48%), Positives = 110/193 (56%), Gaps = 40/193 (20%)

Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEK 174
           CR YEI PCEHHVNGTRP C+     TPKC   CQE Y VPYKKD ++  K YSV SNE 
Sbjct: 198 CRAYEIPPCEHHVNGTRPPCEGD-APTPKCKNVCQEEYKVPYKKDKHYAVKVYSVHSNED 256

Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
           +I  E+  HGPVE  F V+ D   YKSG +                              
Sbjct: 257 AIKHELITHGPVEADFEVYADFPTYKSGVY------------------------------ 286

Query: 235 TVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE 294
                   + SG  LGGHAI+++GWGE++     YWL ANSWNTDWG+ G FKILRGK+ 
Sbjct: 287 -------QHVSGALLGGHAIKLMGWGEEDGV--PYWLCANSWNTDWGEGGFFKILRGKNH 337

Query: 295 CGIESSITAGVPK 307
           CGIES I AG+P+
Sbjct: 338 CGIESDIVAGIPQ 350



 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 21/33 (63%), Positives = 24/33 (72%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
           R CG GCNGGFP  AW +W   G+VSGG YG+K
Sbjct: 163 RDCGMGCNGGFPSQAWNFWKHEGLVSGGLYGTK 195


>gi|149698064|ref|XP_001498242.1| PREDICTED: cathepsin B [Equus caballus]
          Length = 340

 Score =  176 bits (447), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 95/196 (48%), Positives = 117/196 (59%), Gaps = 39/196 (19%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C    G TPKC + C+  Y   YK+D ++G  SYSVSS+E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPCTGEGGDTPKCSKICEPGYSPSYKEDKHYGCSSYSVSSSE 237

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EI+++GPVE AFTV+ D + YKSG +                             
Sbjct: 238 KEIMAEIFKNGPVEAAFTVYSDFLQYKSGVY----------------------------- 268

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G  +GGHA+RILGWG +  +   YWL+ NSWNTDWGDNG FKILRG+D
Sbjct: 269 --------QHVAGDMMGGHAVRILGWGVENGT--PYWLVGNSWNTDWGDNGFFKILRGQD 318

Query: 294 ECGIESSITAGVPKLD 309
            CGIES I AG+P  D
Sbjct: 319 HCGIESEIVAGIPCTD 334



 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 33/70 (47%), Positives = 47/70 (67%), Gaps = 5/70 (7%)

Query: 44  NSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREI 103
           ++  N+  +++K   G      L   +LP+ + ++E D  LP NFD+R +WPNCPTI+EI
Sbjct: 45  HNFHNVDLSYVKRLCGTF----LGGPKLPQRVWFAE-DVVLPENFDAREQWPNCPTIKEI 99

Query: 104 RDQGSCGSCW 113
           RDQGSCGSCW
Sbjct: 100 RDQGSCGSCW 109



 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 20/30 (66%), Positives = 22/30 (73%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGGFP  AW +W K G+VSGG Y S
Sbjct: 146 CGDGCNGGFPAEAWNFWTKQGLVSGGLYDS 175


>gi|161671340|gb|ABX75522.1| cathepsin b [Lycosa singoriensis]
          Length = 247

 Score =  176 bits (447), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 94/194 (48%), Positives = 116/194 (59%), Gaps = 40/194 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PYEI  CEHH  G RP C +    TPKCV  C++ Y+  Y+ D +FG KSYS+ S E
Sbjct: 93  GCQPYEIPACEHHTTGDRPPC-SDIVDTPKCVHLCEKGYNTSYRDDKHFGKKSYSIESLE 151

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           + I  EI+++GPVEGAF+V+ D I YKSG +                             
Sbjct: 152 QQIQTEIFKNGPVEGAFSVYSDFINYKSGVY----------------------------- 182

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + SG++LGGHAIR+LGWG +  +   YWL ANSWNTDWGD G FKILRG D
Sbjct: 183 --------QHHSGESLGGHAIRVLGWGYE--NDVPYWLCANSWNTDWGDKGYFKILRGSD 232

Query: 294 ECGIESSITAGVPK 307
           ECGIESSI AG+PK
Sbjct: 233 ECGIESSIVAGIPK 246



 Score = 43.5 bits (101), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 17/30 (56%), Positives = 21/30 (70%)

Query: 9  CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
          CG GC+GGFP  AW +WV  GI +GG + S
Sbjct: 61 CGMGCDGGFPPSAWEFWVDKGIATGGLWNS 90


>gi|126681075|gb|ABO26563.1| cathepsin B-like cysteine protease form 1 [Ixodes ricinus]
          Length = 337

 Score =  176 bits (445), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 92/194 (47%), Positives = 114/194 (58%), Gaps = 40/194 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY +APCEHH  G+ P+C  +   TPKCV  C++ Y   Y+ D +FG K YS+SSNE
Sbjct: 182 GCKPYSLAPCEHHTKGSLPNCTGTVP-TPKCVHLCRKGYGKDYQHDKHFGKKVYSISSNE 240

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K I  EI+++GPVE  FTV+ D + YKSG +                             
Sbjct: 241 KQIQTEIFKNGPVEADFTVYADFLSYKSGVY----------------------------- 271

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + SG  LGGHAIRILGWG +  +   YWL+ANSWN DWGD+G FKILRGKD
Sbjct: 272 --------QHHSGDVLGGHAIRILGWGTENGT--PYWLVANSWNEDWGDHGYFKILRGKD 321

Query: 294 ECGIESSITAGVPK 307
           ECGIE  I AG+PK
Sbjct: 322 ECGIEDDINAGIPK 335



 Score = 46.6 bits (109), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 18/30 (60%), Positives = 23/30 (76%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GC+GG+P  AW YW +SG+VS G YG+
Sbjct: 150 CGAGCDGGYPAAAWEYWKESGLVSDGLYGT 179


>gi|355681635|gb|AER96808.1| cathepsin B [Mustela putorius furo]
          Length = 338

 Score =  176 bits (445), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 95/196 (48%), Positives = 120/196 (61%), Gaps = 40/196 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D ++G  SYSVSS+E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYTPSYKEDKHYGCSSYSVSSSE 236

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVE AF+V+ D ++YKSG +                             
Sbjct: 237 KEIMAEIYKNGPVEAAFSVYSDFLMYKSGVY----------------------------- 267

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G+ +GGHA+RILGWG +  +   YWL+ NSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVTGEMMGGHAVRILGWGVENGT--PYWLVGNSWNTDWGDNGFFKILRGQD 317

Query: 294 ECGIESSITAGVPKLD 309
            CGIES I AG+P  D
Sbjct: 318 HCGIESEIVAGIPCTD 333



 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 32/71 (45%), Positives = 47/71 (66%), Gaps = 5/71 (7%)

Query: 44  NSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREI 103
           ++  N+ +++LK   G      L   + P+ + ++E +  LP +FDSR +WPNCPTI+EI
Sbjct: 45  HNFHNVDQSYLKKLCGTF----LGGPKPPQRLWFAE-NMILPESFDSREQWPNCPTIKEI 99

Query: 104 RDQGSCGSCWG 114
           RDQGSCGSCW 
Sbjct: 100 RDQGSCGSCWA 110



 Score = 45.8 bits (107), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 19/30 (63%), Positives = 21/30 (70%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGGFP  AW +W   G+VSGG Y S
Sbjct: 146 CGDGCNGGFPAEAWNFWTXXGLVSGGLYDS 175


>gi|225717770|gb|ACO14731.1| Cathepsin B precursor [Caligus clemensi]
          Length = 331

 Score =  175 bits (444), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 95/199 (47%), Positives = 117/199 (58%), Gaps = 41/199 (20%)

Query: 110 GSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVREC-QENYDVPYKKDLNFGAKSYS 168
           GS  GC+PY I PCEHHVNGTR  C A  G TPKC + C  +NY + Y+KDL+FG  SYS
Sbjct: 173 GSHKGCQPYLIEPCEHHVNGTRKPC-AEGGRTPKCHKTCDNKNYPISYEKDLSFGRSSYS 231

Query: 169 VSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQL 228
           + S+ K I  +I  +GPVE AF+V+ D + YKSG +                        
Sbjct: 232 IRSDPKQIQMDIMTNGPVEAAFSVYSDFMSYKSGVY------------------------ 267

Query: 229 GAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKI 288
                         +  G  LGGHAIRILGWG ++ +   YWL+ANSWNTDWGDNG FKI
Sbjct: 268 -------------RHVKGSLLGGHAIRILGWGMEKGT--PYWLVANSWNTDWGDNGTFKI 312

Query: 289 LRGKDECGIESSITAGVPK 307
           LRG D CGIE S+ AG+P+
Sbjct: 313 LRGSDHCGIEDSVVAGLPR 331



 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 23/32 (71%), Positives = 26/32 (81%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CGFGCNGGFPG AWR+W   G+VSGG YGS +
Sbjct: 145 CGFGCNGGFPGAAWRFWENKGLVSGGLYGSHK 176


>gi|195729973|gb|ACG50797.1| cathepsin B2 [Trichobilharzia szidati]
          Length = 344

 Score =  175 bits (444), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 92/195 (47%), Positives = 114/195 (58%), Gaps = 40/195 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PYE  PCEHHV G RPSC      TPKC   CQ  Y++PY KD  +G   Y V SN+
Sbjct: 189 GCQPYEFPPCEHHVVGPRPSCGGDV-ETPKCKTTCQPGYNIPYNKDKWYGKTVYRVHSNQ 247

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           ++IMKE+ +HGPVE  F V+ D   YKSG +                             
Sbjct: 248 EAIMKEVMDHGPVEVDFEVYADFPNYKSGVY----------------------------- 278

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + SG  LGGHA+R+LGWGE+  +   YWLIANSWN+DWGDNG FKI+RG++
Sbjct: 279 --------QHVSGGLLGGHAVRLLGWGEE--NGVPYWLIANSWNSDWGDNGYFKIIRGRN 328

Query: 294 ECGIESSITAGVPKL 308
           ECGIES + AG+PKL
Sbjct: 329 ECGIESDVNAGIPKL 343



 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 30/62 (48%), Positives = 39/62 (62%), Gaps = 3/62 (4%)

Query: 54  LKSWMGVHPDYNLPANRLPEL-IGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSC 112
           ++  +G  PD N     LP L  GY+   ++LP  FD+R  WP+CP+I EIRDQ SCGSC
Sbjct: 63  IRRMLGALPDPN--GGYLPTLCTGYTPSLDELPKEFDARKHWPHCPSISEIRDQSSCGSC 120

Query: 113 WG 114
           W 
Sbjct: 121 WA 122



 Score = 47.0 bits (110), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 19/30 (63%), Positives = 22/30 (73%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGGFP  AW YW +SGIV+G  Y +
Sbjct: 157 CGMGCNGGFPHSAWSYWKRSGIVTGDLYNT 186


>gi|389611087|dbj|BAM19154.1| cathepsin B [Papilio polytes]
          Length = 334

 Score =  175 bits (443), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 99/210 (47%), Positives = 119/210 (56%), Gaps = 54/210 (25%)

Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
           GS  S  GCRPYEI PCEHHV G R  C      TPKC+++C++NY+V YK+D ++G   
Sbjct: 172 GSYNSTQGCRPYEIPPCEHHVPGNRLPCSGDT-KTPKCIKKCEDNYNVAYKQDKHYGKHI 230

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           YSV   E  I  E+Y++GPVE                                       
Sbjct: 231 YSVRGGEDHIKAELYKNGPVE--------------------------------------- 251

Query: 227 QLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
                GAFTV+ DL+ YKSG        ALGGHAI+I+GWG +  +K  YWLIANSWN+D
Sbjct: 252 -----GAFTVYADLLSYKSGVYKHVAGDALGGHAIKIMGWGVENGNK--YWLIANSWNSD 304

Query: 280 WGDNGLFKILRGKDECGIESSITAGVPKLD 309
           WGDNG FKILRG+D CGIESSI AG P LD
Sbjct: 305 WGDNGFFKILRGEDHCGIESSIVAGEPLLD 334


>gi|240992699|ref|XP_002404474.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
 gi|215491571|gb|EEC01212.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
          Length = 337

 Score =  175 bits (443), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 92/198 (46%), Positives = 117/198 (59%), Gaps = 40/198 (20%)

Query: 110 GSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSV 169
           G+  GC+PY +APCEHH  G+ P+C  +   TPKCV  C++ Y   Y+ D +FG K YS+
Sbjct: 178 GTSDGCKPYSLAPCEHHTKGSLPNCTGTVP-TPKCVHLCRKGYGKDYQDDKHFGRKVYSI 236

Query: 170 SSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLG 229
           SS+EK I  EI+++GPVE  FTV+ D + YKSG +                         
Sbjct: 237 SSDEKQIQTEIFKNGPVEADFTVYADFLSYKSGVY------------------------- 271

Query: 230 AEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKIL 289
                        ++SG  LGGHAIRILGWG +  +   YWL+ANSWN DWGD+G FKIL
Sbjct: 272 ------------QHQSGDVLGGHAIRILGWGTENGT--PYWLVANSWNEDWGDHGYFKIL 317

Query: 290 RGKDECGIESSITAGVPK 307
           RGKDECGIE  I AG+PK
Sbjct: 318 RGKDECGIEDDINAGIPK 335


>gi|50657025|emb|CAH04630.1| cathepsin B [Suberites domuncula]
          Length = 331

 Score =  175 bits (443), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 90/194 (46%), Positives = 115/194 (59%), Gaps = 39/194 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY IA C+HHV G +  C + + HTP+C + C+  YDV ++KD +FGA +YSV S+ 
Sbjct: 173 GCQPYLIASCDHHVVGKKQPCASKEEHTPRCSKTCEAGYDVSFEKDKHFGASAYSVRSSV 232

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           ++I  EI  +GPVEGAFTV+ D   YKSG +                             
Sbjct: 233 EAIQTEIMTNGPVEGAFTVYADFPTYKSGVY----------------------------- 263

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + SG  LGGHAIRILGWG +  +   YWL+ANSWN DWG  G FKI+RGKD
Sbjct: 264 --------QHTSGAMLGGHAIRILGWGTENGT--PYWLVANSWNEDWGAMGYFKIIRGKD 313

Query: 294 ECGIESSITAGVPK 307
           +CGIES ITAG+PK
Sbjct: 314 DCGIESQITAGMPK 327



 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 19/32 (59%), Positives = 25/32 (78%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CG GCNGG+ G AWRY+  +G+V+GG Y SK+
Sbjct: 141 CGMGCNGGYLGAAWRYFEHTGLVTGGQYNSKE 172


>gi|146217390|gb|ABQ10737.1| cathepsin B [Penaeus monodon]
          Length = 331

 Score =  174 bits (442), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 99/214 (46%), Positives = 119/214 (55%), Gaps = 54/214 (25%)

Query: 103 IRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
           I   GS  S  GC+PYEIAPCEHHV+G RP C    G TPKC + C++ Y V Y+ DL+ 
Sbjct: 165 IVSGGSFNSTQGCQPYEIAPCEHHVSGPRPKCSEGGG-TPKCAKTCEKGYIVDYESDLHH 223

Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
           G K+YS+  +E  I  EI  +GPV                                    
Sbjct: 224 GGKAYSIMKDEDQIKYEIMNNGPV------------------------------------ 247

Query: 223 DNTSQLGAEGAFTVFDDLILYKSGK-------ALGGHAIRILGWGEDEKSKEKYWLIANS 275
                   EGAFTV+ D + YKSG         LGGHAIR+LGWGE+  +   YWL ANS
Sbjct: 248 --------EGAFTVYVDFLHYKSGVYQHRHGLPLGGHAIRVLGWGEENGTP--YWLCANS 297

Query: 276 WNTDWGDNGLFKILRGKDECGIESSITAGVPKLD 309
           WNTDWGDNGLFKILRG D CGIES I+AG+PK++
Sbjct: 298 WNTDWGDNGLFKILRGSDHCGIESEISAGLPKVN 331



 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 25/34 (73%), Positives = 29/34 (85%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
            LCGFGCNGGFPG A++YWV SGIVSGG++ S Q
Sbjct: 142 HLCGFGCNGGFPGAAFKYWVHSGIVSGGSFNSTQ 175


>gi|298370749|gb|ADI80349.1| cathepsin B [Litopenaeus vannamei]
          Length = 331

 Score =  174 bits (442), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 118/341 (34%), Positives = 157/341 (46%), Gaps = 117/341 (34%)

Query: 40  QAEKNSLSNIPRAHLKSWMGVHPD--YNLP---ANRLPELIGYSEVDEDLPANFDSRTKW 94
           +A +N   ++   + +  MGVHPD  Y++P    +++PE       + +LP  FDSR  W
Sbjct: 37  EAGRNFNKHLSIRYFRRLMGVHPDSKYHMPKYEVHQIPE-------NFELPKEFDSRAAW 89

Query: 95  PNCPTIREIRDQGSCGSCWGCRPYEIAP--------------------------CEHHVN 128
           P CPTI EIRDQGSCGSCW     E+                            C    N
Sbjct: 90  PMCPTIGEIRDQGSCGSCWAFGAVEVMSDRQCIHSKGKSNFHYSAENLVSCCHLCGFGCN 149

Query: 129 GTRP----------------SCDASKGHTPKCVRECQENYDVP----------------- 155
           G  P                S ++++G  P  +  C+ +   P                 
Sbjct: 150 GGFPGAAFKYWVHSGIVSGGSFNSTQGCQPYEIAPCEHHVPGPRPKCSEGGGTPKCAKTC 209

Query: 156 -------YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPG 208
                  Y+ DL+ G K+YS+  +E  I  EI ++GPVEGAFTV+ D + YKSG +    
Sbjct: 210 EKGYIVDYESDLHHGGKAYSIMKDEDQIKYEIMKNGPVEGAFTVYVDFLHYKSGVY---- 265

Query: 209 NETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEK 268
                                             ++ G  LGGHAIR+LGWGE+  +   
Sbjct: 266 ---------------------------------QHRHGLPLGGHAIRVLGWGEENGT--P 290

Query: 269 YWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKLD 309
           YWL ANSWNTDWGDNGLFKILRG D CGIES I+AG+PKL+
Sbjct: 291 YWLCANSWNTDWGDNGLFKILRGSDHCGIESEISAGLPKLN 331



 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 25/35 (71%), Positives = 29/35 (82%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
            LCGFGCNGGFPG A++YWV SGIVSGG++ S Q 
Sbjct: 142 HLCGFGCNGGFPGAAFKYWVHSGIVSGGSFNSTQG 176


>gi|260786791|ref|XP_002588440.1| hypothetical protein BRAFLDRAFT_199166 [Branchiostoma floridae]
 gi|229273602|gb|EEN44451.1| hypothetical protein BRAFLDRAFT_199166 [Branchiostoma floridae]
          Length = 332

 Score =  174 bits (442), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 90/203 (44%), Positives = 119/203 (58%), Gaps = 40/203 (19%)

Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
           G  GS  GC+PYEIAPCEHH+NG+RP+C   +  TP+C + C+  Y+V + KD ++   +
Sbjct: 170 GPYGSMQGCQPYEIAPCEHHINGSRPACGKIEP-TPRCKKTCESGYNVTFNKDKHYAKSA 228

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           YSVSS  + I  EI  +GPVE AFTV+ D   YKSG +                      
Sbjct: 229 YSVSSKVQQIQMEIMTNGPVEAAFTVYADFPHYKSGVY---------------------- 266

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
                           ++SG  LGGHA++++GWG +  +   YWLIANSWN+DWGD G F
Sbjct: 267 ---------------QHESGAELGGHAVKMIGWGMEGST--PYWLIANSWNSDWGDMGFF 309

Query: 287 KILRGKDECGIESSITAGVPKLD 309
           KILRG+DECGIE  I AG P++D
Sbjct: 310 KILRGQDECGIERDIVAGEPRMD 332



 Score = 50.8 bits (120), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 20/32 (62%), Positives = 24/32 (75%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CG GC+GGFP  AW YW + G+V+GG YGS Q
Sbjct: 145 CGMGCHGGFPEAAWEYWKQDGLVTGGPYGSMQ 176


>gi|198429088|ref|XP_002120307.1| PREDICTED: similar to cathepsin B [Ciona intestinalis]
          Length = 364

 Score =  174 bits (442), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 93/199 (46%), Positives = 113/199 (56%), Gaps = 40/199 (20%)

Query: 110 GSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSV 169
           GS  GC PY+I PCEHHV G RP C    G TP CV +C+ N  + Y +D ++G  SY+V
Sbjct: 206 GSKTGCLPYQIKPCEHHVPGDRPKCSEGGG-TPSCVSKCKGNTTIHYNQDKHYGLSSYAV 264

Query: 170 SSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLG 229
            S+   I  EI  HGPVEGAFTV+ D   YKSG +                         
Sbjct: 265 GSDPTQIQTEIMTHGPVEGAFTVYADFPTYKSGVY------------------------- 299

Query: 230 AEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKIL 289
                        + +G  LGGHAIRILGWG +  +   YWL+ANSWNTDWGD G FKIL
Sbjct: 300 ------------KHVTGGVLGGHAIRILGWGSE--NGVAYWLVANSWNTDWGDKGYFKIL 345

Query: 290 RGKDECGIESSITAGVPKL 308
           RG DECGIESS+ AG+P++
Sbjct: 346 RGSDECGIESSVVAGIPQI 364



 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 22/31 (70%), Positives = 25/31 (80%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
           CG GCNGGFPG AW+YW   G+V+GG YGSK
Sbjct: 178 CGDGCNGGFPGSAWKYWNSDGLVTGGLYGSK 208


>gi|333408990|gb|AEF32260.1| cathepsin B [Cristaria plicata]
          Length = 347

 Score =  174 bits (440), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 91/198 (45%), Positives = 116/198 (58%), Gaps = 40/198 (20%)

Query: 111 SCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVS 170
           S  GC+PY I  C+HHV G    C   +  TPKC ++C+ NY+V YK D ++G  SYSV 
Sbjct: 189 SSQGCQPYMIPACDHHVVGHLQPCPKEEAKTPKCSKKCEANYNVTYKDDKHYGKNSYSVD 248

Query: 171 SNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA 230
           S EK IM EI  +GPVE AFTV++D + YKSG +                          
Sbjct: 249 SVEK-IMTEIMTNGPVEAAFTVYEDFLSYKSGVY-------------------------- 281

Query: 231 EGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILR 290
                       +++G+ LGGHA++ILGWGED  +   YW++ANSWN DWG+ G F ILR
Sbjct: 282 -----------QHRTGQELGGHAVKILGWGEDNGT--PYWIVANSWNPDWGNQGFFNILR 328

Query: 291 GKDECGIESSITAGVPKL 308
           GKDECGIES I AG+PKL
Sbjct: 329 GKDECGIESQIVAGLPKL 346



 Score = 47.4 bits (111), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 19/32 (59%), Positives = 23/32 (71%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CG GC GGFP  AWRY+ + G+V+GG Y S Q
Sbjct: 160 CGEGCQGGFPAEAWRYYEREGLVTGGLYNSSQ 191


>gi|291385792|ref|XP_002709482.1| PREDICTED: cathepsin B [Oryctolagus cuniculus]
          Length = 339

 Score =  173 bits (439), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 95/196 (48%), Positives = 121/196 (61%), Gaps = 40/196 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY I PCEHHVNG+RP+C   +G TP+C + C+  Y   YK+D ++G  SYSVSS+E
Sbjct: 178 GCKPYSIPPCEHHVNGSRPAC-TGEGDTPRCSKTCEPGYSPSYKEDKHYGYSSYSVSSDE 236

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
             I  EIY++GPVEGAFTV+ D ++YKSG +                             
Sbjct: 237 NEIKAEIYKNGPVEGAFTVYSDFLMYKSGVY----------------------------- 267

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G  +GGHAIRILGWGE+  +   YWL+ANSWNTDWGD G FKILRG+D
Sbjct: 268 --------QHTTGDIMGGHAIRILGWGEE--NGVPYWLVANSWNTDWGDKGFFKILRGQD 317

Query: 294 ECGIESSITAGVPKLD 309
            CGIES I AG+P+ D
Sbjct: 318 HCGIESEIVAGIPRTD 333



 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 32/71 (45%), Positives = 46/71 (64%), Gaps = 5/71 (7%)

Query: 44  NSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREI 103
           ++  N+  ++LK   G      L   +LP  + +++ D  LP +FD+R +WPNCPTI+EI
Sbjct: 45  HNFFNVEVSYLKKLCGTF----LGGPKLPRRVEFAD-DIKLPESFDAREQWPNCPTIKEI 99

Query: 104 RDQGSCGSCWG 114
           RDQGSCGSCW 
Sbjct: 100 RDQGSCGSCWA 110


>gi|183988832|gb|ACC66065.1| cathepsin B [Antheraea assama]
          Length = 287

 Score =  173 bits (438), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 94/194 (48%), Positives = 115/194 (59%), Gaps = 40/194 (20%)

Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
           G+  S  GCRPYEI PCEHHV G R  C+     TPKC + C+ +Y VP+KKD  +G   
Sbjct: 134 GNYNSSQGCRPYEIPPCEHHVPGNRMPCNGDT-KTPKCEKTCESSYTVPFKKDKRYGKHV 192

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           YSVS +E +I  E++++GPVEGAFTV+ DL+ YKSG +                      
Sbjct: 193 YSVSGHEDNIKAELFKNGPVEGAFTVYSDLLSYKSGVY---------------------- 230

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
                           +  G ALGGHAI+ILGWG +  S  KYWLIANSWN+DWGDNG  
Sbjct: 231 ---------------QHTHGNALGGHAIKILGWGVENGS--KYWLIANSWNSDWGDNGFL 273

Query: 287 KILRGKDECGIESS 300
           KILRG+D CGIESS
Sbjct: 274 KILRGEDHCGIESS 287


>gi|227293|prf||1701299A cathepsin B
          Length = 339

 Score =  173 bits (438), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 92/196 (46%), Positives = 115/196 (58%), Gaps = 40/196 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY I PCEHHVNG+RP C   +G T +C + C+  Y   YK+D +FG  SYSVS++ 
Sbjct: 178 GCLPYTIPPCEHHVNGSRPPC-TGEGDTRRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSV 236

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAFTVF D + YKSG +                             
Sbjct: 237 KKIMAEIYKNGPVEGAFTVFSDFLTYKSGVY----------------------------- 267

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    +++G  +GGHAIRIL WG +  +   YW  ANSWN DWGDNG FKILRG++
Sbjct: 268 --------KHEAGDMMGGHAIRILVWGVE--NGVPYWAAANSWNLDWGDNGFFKILRGEN 317

Query: 294 ECGIESSITAGVPKLD 309
            CGIES I AG+P+ D
Sbjct: 318 HCGIESEIVAGIPRTD 333



 Score = 45.4 bits (106), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 19/30 (63%), Positives = 22/30 (73%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGG+P  AW +W K G+VSGG Y S
Sbjct: 146 CGDGCNGGYPSGAWNFWTKKGLVSGGYYDS 175


>gi|46812327|gb|AAT02230.1| cathepsin B-like proteinase [Triatoma dimidiata]
          Length = 332

 Score =  172 bits (437), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 89/206 (43%), Positives = 120/206 (58%), Gaps = 40/206 (19%)

Query: 103 IRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
           I   G+ GS  GC+PY IAPCEHH+ G+RP C   +GHT  C ++C++ Y +PY KDL++
Sbjct: 167 IVSGGNYGSKEGCQPYSIAPCEHHIPGSRPPCRG-EGHTADCRKQCEKGYSIPYDKDLHY 225

Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
               YS   + K I  EI ++GPVE AF V++DL+ YK G +                  
Sbjct: 226 AEFVYSTERDVKEIQTEILKNGPVEAAFFVYEDLLTYKEGVY------------------ 267

Query: 223 DNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
                               + +G  +GGHAI+ILGWG +  +   YWLIANSWNTDWG+
Sbjct: 268 -------------------KHVAGAPVGGHAIKILGWGVENGT--PYWLIANSWNTDWGN 306

Query: 283 NGLFKILRGKDECGIESSITAGVPKL 308
           NG FKILRG DECGIE  ++AG+P++
Sbjct: 307 NGFFKILRGSDECGIEIDVSAGLPRI 332



 Score = 48.1 bits (113), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 22/32 (68%), Positives = 23/32 (71%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CG GC GG PG AW YW   GIVSGG YGSK+
Sbjct: 146 CGAGCFGGDPGSAWEYWRDVGIVSGGNYGSKE 177


>gi|56462338|gb|AAV91452.1| cysteine peptidase 2 cathepsin-B-like [Lonomia obliqua]
          Length = 338

 Score =  172 bits (435), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 98/213 (46%), Positives = 118/213 (55%), Gaps = 54/213 (25%)

Query: 103 IRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
           I   GS  S  GC PYE+ PCEHHV G R  C+     TPKC + C+  Y+VP+KKD ++
Sbjct: 170 IVSGGSYNSTQGCIPYEVPPCEHHVPGNRLPCNGDT-KTPKCQKTCEAGYNVPFKKDKHY 228

Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
           G   YSVS NE +I  E++++GPVE                                   
Sbjct: 229 GKHVYSVSGNEDNIKAELFKNGPVE----------------------------------- 253

Query: 223 DNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANS 275
                    GAFTV+ DL+ YKSG        ALGGHA++ILGWG +  SK  YWLIANS
Sbjct: 254 ---------GAFTVYSDLLSYKSGVYQHTDGSALGGHAVKILGWGVENGSK--YWLIANS 302

Query: 276 WNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           WN+DWGDNG FKILRG+D CGIESSI  G P L
Sbjct: 303 WNSDWGDNGFFKILRGEDHCGIESSIVTGEPLL 335


>gi|112983908|ref|NP_001036850.1| cathepsin B precursor [Bombyx mori]
 gi|13548667|dbj|BAB40804.1| cathepsin B [Bombyx mori]
          Length = 337

 Score =  172 bits (435), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 97/210 (46%), Positives = 116/210 (55%), Gaps = 54/210 (25%)

Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
           GS  S  GCRPYEI PCEHHV G R  C      TPKC ++C+  YDV YK+D  +G   
Sbjct: 173 GSYNSSQGCRPYEIPPCEHHVPGNRMPCSGDT-KTPKCTKKCESGYDVNYKQDKQYGKHV 231

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           Y+VS +E  I  E++++GPVE                                       
Sbjct: 232 YTVSGDEDHIRAELFKNGPVE--------------------------------------- 252

Query: 227 QLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
                GAFTV+ DL+ YKSG        ALGGHA++ILGWG +  +K  YWLIANSWN+D
Sbjct: 253 -----GAFTVYSDLLSYKSGVYKHTQGDALGGHAVKILGWGVENDNK--YWLIANSWNSD 305

Query: 280 WGDNGLFKILRGKDECGIESSITAGVPKLD 309
           WGDNG FKILRG+D CGIESSI  G P LD
Sbjct: 306 WGDNGFFKILRGEDHCGIESSIVTGEPFLD 335


>gi|55793941|gb|AAV65881.1| cathepsin B1 isotype 1 precursor [Trichobilharzia regenti]
          Length = 342

 Score =  171 bits (434), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 88/193 (45%), Positives = 109/193 (56%), Gaps = 39/193 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY    CEHH  G  P C      TPKC ++CQ+ Y  PYKKD  +G  SY+V +NE
Sbjct: 187 GCQPYPFPKCEHHTTGKYPECGEKIYKTPKCHQKCQKGYKTPYKKDKYYGRMSYNVLNNE 246

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            +I KEI  HGPVE AFTV  D + YKSG                               
Sbjct: 247 NAIKKEIMMHGPVEAAFTVHSDFLNYKSG------------------------------- 275

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                 +  Y +G  +GGHA+RI+GWG ++K+   YWLIANSWN DWG+ G F+ILRGKD
Sbjct: 276 ------IYKYMTGAEIGGHAVRIIGWGVEKKT--PYWLIANSWNEDWGEKGYFRILRGKD 327

Query: 294 ECGIESSITAGVP 306
           ECGIES +T G+P
Sbjct: 328 ECGIESEVTGGLP 340



 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 18/27 (66%), Positives = 21/27 (77%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           CG GC GGFPG AW YWV+ GIV+G +
Sbjct: 155 CGLGCQGGFPGAAWDYWVEDGIVTGSS 181


>gi|55793945|gb|AAV65883.1| cathepsin B1 isotype 3 precursor [Trichobilharzia regenti]
          Length = 342

 Score =  171 bits (433), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 88/193 (45%), Positives = 109/193 (56%), Gaps = 39/193 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY    CEHH  G  P C      TPKC ++CQ+ Y  PYKKD  +G  SY+V +NE
Sbjct: 187 GCQPYPFPKCEHHTTGKYPECGEKIYKTPKCHQKCQKGYKTPYKKDKYYGRMSYNVLNNE 246

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            +I KEI  HGPVE AFTV  D + YKSG                               
Sbjct: 247 NAIKKEIMMHGPVEAAFTVHSDFLNYKSG------------------------------- 275

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                 +  Y +G  +GGHA+RI+GWG ++K+   YWLIANSWN DWG+ G F+ILRGKD
Sbjct: 276 ------IYKYMTGAEIGGHAVRIIGWGVEKKT--PYWLIANSWNEDWGEKGYFRILRGKD 327

Query: 294 ECGIESSITAGVP 306
           ECGIES +T G+P
Sbjct: 328 ECGIESEVTGGLP 340



 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 18/27 (66%), Positives = 21/27 (77%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           CG GC GGFPG AW YWV+ GIV+G +
Sbjct: 155 CGLGCQGGFPGAAWDYWVEDGIVTGSS 181


>gi|118424551|gb|ABK90823.1| cathepsin B-like cysteine proteinase [Spodoptera exigua]
          Length = 341

 Score =  171 bits (433), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 99/209 (47%), Positives = 116/209 (55%), Gaps = 54/209 (25%)

Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
           GS  S  GCRPYEI PCEHHV G R  C+     TPKC + C+ +Y+V Y KD  +G   
Sbjct: 177 GSYNSSQGCRPYEIPPCEHHVPGNRMPCNGDS-KTPKCHKTCESSYNVDYHKDKRYGKHV 235

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           YSVSS E  I  E+Y++GPVE                                       
Sbjct: 236 YSVSSKEDHIKAELYKNGPVE--------------------------------------- 256

Query: 227 QLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
                GAFTV+ DL+ YK+G        ALGGHAI+ILGWG +  +K  YWLIANSWN+D
Sbjct: 257 -----GAFTVYSDLLNYKNGVYKHTVGNALGGHAIKILGWGVENGNK--YWLIANSWNSD 309

Query: 280 WGDNGLFKILRGKDECGIESSITAGVPKL 308
           WGDNG FKILRG+D CGIESSI AG P L
Sbjct: 310 WGDNGFFKILRGEDHCGIESSIVAGEPLL 338


>gi|226472810|emb|CAX71091.1| cathepsin B [Schistosoma japonicum]
          Length = 348

 Score =  170 bits (431), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 90/195 (46%), Positives = 111/195 (56%), Gaps = 40/195 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PYE  PCEHH  G  P CD     TP C R CQ  Y+V Y+ D  +G   Y V SN+
Sbjct: 192 GCQPYEFPPCEHHTLGPLPVCDGDV-ETPPCKRTCQAGYNVSYENDKWYGKVVYRVKSNQ 250

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           ++IMKE+ +HGPVE  F V+ D   YKSG +                             
Sbjct: 251 EAIMKELMQHGPVEVDFEVYADFPNYKSGVY----------------------------- 281

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + SG  LGGHA+R+LGWGE+  +   YWLIANSWNTDWGDNG FKI+RGK+
Sbjct: 282 --------QHVSGALLGGHAVRLLGWGEE--NNVPYWLIANSWNTDWGDNGYFKIIRGKN 331

Query: 294 ECGIESSITAGVPKL 308
           ECGIES + AG+PK+
Sbjct: 332 ECGIESDVNAGIPKI 346



 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 27/62 (43%), Positives = 37/62 (59%), Gaps = 3/62 (4%)

Query: 54  LKSWMGVHPDYNLPANRLPEL-IGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSC 112
           ++  +G  PD N    +L  L  GY     +LP +FD+R +W +CP+I EIRDQ SCGS 
Sbjct: 66  IRRMLGALPDPN--GEQLETLCTGYELTVNELPKSFDARKEWTHCPSISEIRDQSSCGSY 123

Query: 113 WG 114
           W 
Sbjct: 124 WA 125



 Score = 45.1 bits (105), Expect = 0.043,   Method: Compositional matrix adjust.
 Identities = 18/30 (60%), Positives = 20/30 (66%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGGFP  AW YW   GIV+G  Y +
Sbjct: 160 CGMGCNGGFPHSAWLYWKNQGIVTGDLYNT 189


>gi|226472800|emb|CAX71086.1| cathepsin B [Schistosoma japonicum]
 gi|226472804|emb|CAX71088.1| cathepsin B [Schistosoma japonicum]
          Length = 348

 Score =  170 bits (431), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 90/195 (46%), Positives = 111/195 (56%), Gaps = 40/195 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PYE  PCEHH  G  P CD     TP C R CQ  Y+V Y+ D  +G   Y V SN+
Sbjct: 192 GCQPYEFPPCEHHTLGPLPVCDGDV-ETPPCKRTCQAGYNVSYENDKWYGKVVYRVKSNQ 250

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           ++IMKE+ +HGPVE  F V+ D   YKSG +                             
Sbjct: 251 EAIMKELMQHGPVEVDFEVYADFPNYKSGVY----------------------------- 281

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + SG  LGGHA+R+LGWGE+  +   YWLIANSWNTDWGDNG FKI+RGK+
Sbjct: 282 --------QHVSGALLGGHAVRLLGWGEE--NNVPYWLIANSWNTDWGDNGYFKIIRGKN 331

Query: 294 ECGIESSITAGVPKL 308
           ECGIES + AG+PK+
Sbjct: 332 ECGIESDVNAGIPKI 346



 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 28/62 (45%), Positives = 38/62 (61%), Gaps = 3/62 (4%)

Query: 54  LKSWMGVHPDYNLPANRLPEL-IGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSC 112
           ++  +G  PD N    +L  L  GY     +LP +FD+R +W +CP+I EIRDQ SCGSC
Sbjct: 66  IRRMLGALPDPN--GEQLETLCTGYELTLNELPKSFDARKEWTHCPSISEIRDQSSCGSC 123

Query: 113 WG 114
           W 
Sbjct: 124 WA 125



 Score = 45.1 bits (105), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 18/30 (60%), Positives = 20/30 (66%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGGFP  AW YW   GIV+G  Y +
Sbjct: 160 CGMGCNGGFPHSAWLYWKNQGIVTGDLYNT 189


>gi|30995341|gb|AAO59414.2| cathepsin B endopeptidase [Schistosoma japonicum]
 gi|226472794|emb|CAX71083.1| cathepsin B [Schistosoma japonicum]
 gi|226472796|emb|CAX71084.1| cathepsin B [Schistosoma japonicum]
 gi|226472798|emb|CAX71085.1| cathepsin B [Schistosoma japonicum]
 gi|226472802|emb|CAX71087.1| cathepsin B [Schistosoma japonicum]
 gi|226472806|emb|CAX71089.1| cathepsin B [Schistosoma japonicum]
          Length = 348

 Score =  170 bits (431), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 90/195 (46%), Positives = 111/195 (56%), Gaps = 40/195 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PYE  PCEHH  G  P CD     TP C R CQ  Y+V Y+ D  +G   Y V SN+
Sbjct: 192 GCQPYEFPPCEHHTLGPLPVCDGDV-ETPPCKRTCQAGYNVSYENDKWYGKVVYRVKSNQ 250

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           ++IMKE+ +HGPVE  F V+ D   YKSG +                             
Sbjct: 251 EAIMKELMQHGPVEVDFEVYADFPNYKSGVY----------------------------- 281

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + SG  LGGHA+R+LGWGE+  +   YWLIANSWNTDWGDNG FKI+RGK+
Sbjct: 282 --------QHVSGALLGGHAVRLLGWGEE--NNVPYWLIANSWNTDWGDNGYFKIIRGKN 331

Query: 294 ECGIESSITAGVPKL 308
           ECGIES + AG+PK+
Sbjct: 332 ECGIESDVNAGIPKI 346



 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 28/62 (45%), Positives = 38/62 (61%), Gaps = 3/62 (4%)

Query: 54  LKSWMGVHPDYNLPANRLPEL-IGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSC 112
           ++  +G  PD N    +L  L  GY     +LP +FD+R +W +CP+I EIRDQ SCGSC
Sbjct: 66  IRRMLGALPDPN--GEQLETLCTGYELTLNELPKSFDARKEWTHCPSISEIRDQSSCGSC 123

Query: 113 WG 114
           W 
Sbjct: 124 WA 125



 Score = 45.1 bits (105), Expect = 0.043,   Method: Compositional matrix adjust.
 Identities = 18/30 (60%), Positives = 20/30 (66%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGGFP  AW YW   GIV+G  Y +
Sbjct: 160 CGMGCNGGFPHSAWLYWKNQGIVTGDLYNT 189


>gi|119887749|gb|ABM05925.1| cathepsin B-like cysteine proteinase [Helicoverpa assulta]
          Length = 338

 Score =  170 bits (430), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 96/209 (45%), Positives = 116/209 (55%), Gaps = 54/209 (25%)

Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
           GS  S  GCRPYEI PCEHHV G R  C+     TPKC + C+ NY+V Y+KD  +G   
Sbjct: 174 GSYNSSQGCRPYEIPPCEHHVPGNRMPCNGDS-KTPKCEKTCESNYNVDYRKDKRYGKHV 232

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           +SVSS E  I  E++++GPVE                                       
Sbjct: 233 FSVSSKEDHIRAELFKNGPVE--------------------------------------- 253

Query: 227 QLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
                GAFTV+ DL+ YK+G        ALGGHA++ILGWG +  +K  YWLIANSWN+D
Sbjct: 254 -----GAFTVYSDLLNYKTGVYKHTIGDALGGHAVKILGWGVENGNK--YWLIANSWNSD 306

Query: 280 WGDNGLFKILRGKDECGIESSITAGVPKL 308
           WGDNG FKILRG+D CGIESSI AG P  
Sbjct: 307 WGDNGFFKILRGEDHCGIESSIVAGEPMF 335



 Score = 80.9 bits (198), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 38/75 (50%), Positives = 50/75 (66%), Gaps = 2/75 (2%)

Query: 40  QAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPT 99
           +A +N   + P AH+K   GV PDY+L  ++L ++    E+   LP NFD R KWPNCPT
Sbjct: 42  KAGRNFPEHTPFAHIKKLAGVLPDYHL--SKLSKVEHEDELIASLPENFDPRDKWPNCPT 99

Query: 100 IREIRDQGSCGSCWG 114
           + E+RDQGSCGSCW 
Sbjct: 100 LNEVRDQGSCGSCWA 114


>gi|7537454|gb|AAF35867.2| cathepsin B-like cysteine proteinase [Helicoverpa armigera]
          Length = 338

 Score =  169 bits (429), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 96/209 (45%), Positives = 116/209 (55%), Gaps = 54/209 (25%)

Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
           GS  S  GCRPYEI PCEHHV G R  C+     TPKC + C+ NY+V Y+KD  +G   
Sbjct: 174 GSYNSSQGCRPYEIPPCEHHVPGNRMPCNGDS-KTPKCEKTCESNYNVDYRKDKRYGKHV 232

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           +SVSS E  I  E++++GPVE                                       
Sbjct: 233 FSVSSKEDHIRAELFKNGPVE--------------------------------------- 253

Query: 227 QLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
                GAFTV+ DL+ YK+G        ALGGHA++ILGWG +  +K  YWLIANSWN+D
Sbjct: 254 -----GAFTVYSDLLNYKTGVYKHTIGDALGGHAVKILGWGVENGNK--YWLIANSWNSD 306

Query: 280 WGDNGLFKILRGKDECGIESSITAGVPKL 308
           WGDNG FKILRG+D CGIESSI AG P  
Sbjct: 307 WGDNGFFKILRGEDHCGIESSIVAGEPMF 335



 Score = 80.5 bits (197), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 38/75 (50%), Positives = 50/75 (66%), Gaps = 2/75 (2%)

Query: 40  QAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPT 99
           +A +N   + P AH+K   GV PDY+L  ++L ++    E+   LP NFD R KWPNCPT
Sbjct: 42  KAGRNFPEHTPFAHIKRLAGVLPDYHL--SKLSKVEHEDELIASLPENFDPRDKWPNCPT 99

Query: 100 IREIRDQGSCGSCWG 114
           + E+RDQGSCGSCW 
Sbjct: 100 LNEVRDQGSCGSCWA 114


>gi|254746338|emb|CAX16634.1| putative C1A cysteine protease precursor [Manduca sexta]
          Length = 337

 Score =  168 bits (426), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 98/213 (46%), Positives = 115/213 (53%), Gaps = 54/213 (25%)

Query: 103 IRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
           I   G+  S  GCRPYEI PCEHHV G R  C      TPKC + C+  Y+V YKKD  +
Sbjct: 169 IVSGGNYNSTQGCRPYEIPPCEHHVPGNRMPCSGDT-KTPKCQKNCENGYNVMYKKDKRY 227

Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
           G   YSVS+ E  I  E+Y++GPVE                                   
Sbjct: 228 GKHVYSVSAGEDHIRAELYKNGPVE----------------------------------- 252

Query: 223 DNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANS 275
                    GAFTV+ DL+ YKSG        ALGGHAI+ILGWG +  +K  YWL+ANS
Sbjct: 253 ---------GAFTVYADLLAYKSGVYKHIQGDALGGHAIKILGWGVENDNK--YWLVANS 301

Query: 276 WNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           WNTDWGDNG FKILRG++ CGIE SI AG P L
Sbjct: 302 WNTDWGDNGFFKILRGENHCGIEGSIIAGEPLL 334



 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 27/65 (41%), Positives = 35/65 (53%), Gaps = 12/65 (18%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLP 67
           +CG GCNGG P +AW YW   GIVSGG Y S Q  +     IP            ++++P
Sbjct: 147 ICGLGCNGGIPSLAWEYWKHFGIVSGGNYNSTQGCRP--YEIPPC----------EHHVP 194

Query: 68  ANRLP 72
            NR+P
Sbjct: 195 GNRMP 199


>gi|55793947|gb|AAV65884.1| cathepsin B1 isotype 4 precursor [Trichobilharzia regenti]
          Length = 342

 Score =  168 bits (426), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 86/194 (44%), Positives = 111/194 (57%), Gaps = 39/194 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY    CEHH  G  P C      TPKC ++CQ+ Y  PYKKD  +G  SY+V +NE
Sbjct: 187 GCQPYPFPKCEHHTTGKYPECGEKIYKTPKCHQKCQKGYKTPYKKDKYYGRMSYNVLNNE 246

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            +I KEI  HGPVE AFTV  D + YKSG                               
Sbjct: 247 NAIKKEIMMHGPVEVAFTVHSDFLNYKSG------------------------------- 275

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                 +  Y +G  +G HA+RI+GWG ++K+   YWLIANSWN DWG+ G F++LRGKD
Sbjct: 276 ------IYKYMTGAEIGEHAVRIIGWGVEKKT--PYWLIANSWNEDWGEKGYFRMLRGKD 327

Query: 294 ECGIESSITAGVPK 307
           ECGIES++T+G+P+
Sbjct: 328 ECGIESAVTSGLPR 341



 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 18/27 (66%), Positives = 21/27 (77%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           CG GC GGFPG AW YWV+ GIV+G +
Sbjct: 155 CGLGCQGGFPGAAWDYWVEDGIVTGSS 181


>gi|226468762|emb|CAX76409.1| cathepsin B [Schistosoma japonicum]
 gi|257206178|emb|CAX82740.1| cathepsin B [Schistosoma japonicum]
          Length = 348

 Score =  167 bits (424), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 89/195 (45%), Positives = 111/195 (56%), Gaps = 40/195 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PYE  PCEH+  G  P CD     TP C R CQ  Y+V Y+ D  +G   Y V SN+
Sbjct: 192 GCQPYEFPPCEHNTLGPLPVCDGDV-ETPPCKRTCQAGYNVSYENDKWYGKVVYRVKSNQ 250

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           ++IMKE+ +HGPVE  F V+ D   YKSG +                             
Sbjct: 251 EAIMKELMQHGPVEVDFEVYADFPNYKSGVY----------------------------- 281

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + SG  LGGHA+R+LGWGE+  +   YWLIANSWNTDWGDNG FKI+RGK+
Sbjct: 282 --------QHVSGALLGGHAVRLLGWGEE--NNVPYWLIANSWNTDWGDNGYFKIIRGKN 331

Query: 294 ECGIESSITAGVPKL 308
           ECGIES + AG+PK+
Sbjct: 332 ECGIESDVNAGIPKI 346



 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 28/62 (45%), Positives = 38/62 (61%), Gaps = 3/62 (4%)

Query: 54  LKSWMGVHPDYNLPANRLPEL-IGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSC 112
           ++  +G  PD N    +L  L  GY     +LP +FD+R +W +CP+I EIRDQ SCGSC
Sbjct: 66  IRRMLGALPDPN--GEQLETLCTGYELTLNELPKSFDARKEWTHCPSISEIRDQSSCGSC 123

Query: 113 WG 114
           W 
Sbjct: 124 WA 125



 Score = 45.4 bits (106), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 22/51 (43%), Positives = 26/51 (50%), Gaps = 9/51 (17%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA---------EKNSLSNIP 50
           CG GCNGGFP  AW YW   GIV+G  Y +            E N+L  +P
Sbjct: 160 CGMGCNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPYEFPPCEHNTLGPLP 210


>gi|405971658|gb|EKC36483.1| Cathepsin B [Crassostrea gigas]
          Length = 341

 Score =  167 bits (423), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 91/195 (46%), Positives = 112/195 (57%), Gaps = 40/195 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY I  C+HHV G    C  S G TPKC   C+  Y+V Y+KD ++G+ +YSV   E
Sbjct: 186 GCLPYTIKACDHHVVGKLQPCSKSIGPTPKCKHTCEAGYNVTYEKDKHYGSSAYSVHGVE 245

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EI  +GPVEGAFTV+ D   YKSG +                             
Sbjct: 246 K-IMTEIMTNGPVEGAFTVYADFPQYKSGVY----------------------------- 275

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G+ LGGHAI+ILGWG +  + + YWL+ANSWN DWGD G FKILRG+D
Sbjct: 276 --------KHTTGQPLGGHAIKILGWGTE--NGDDYWLVANSWNPDWGDQGFFKILRGQD 325

Query: 294 ECGIESSITAGVPKL 308
           ECGIES I+AG PKL
Sbjct: 326 ECGIESQISAGEPKL 340



 Score = 48.1 bits (113), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 21/38 (55%), Positives = 24/38 (63%)

Query: 3   TQQIRLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           T   R CG GC GGFP  AW Y+ K G+V+GG Y S Q
Sbjct: 148 TSCCRTCGNGCEGGFPSAAWSYYKKDGLVTGGQYNSHQ 185


>gi|256077361|ref|XP_002574974.1| SmCB2 peptidase (C01 family) [Schistosoma mansoni]
 gi|18181863|emb|CAC85211.2| cathepsin B endopeptidase [Schistosoma mansoni]
 gi|353231645|emb|CCD79000.1| SmCB2 peptidase (C01 family) [Schistosoma mansoni]
          Length = 347

 Score =  167 bits (422), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 88/195 (45%), Positives = 111/195 (56%), Gaps = 40/195 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PYE  PCEHHV G  PSCD     TP C   CQ  Y++PY+KD  +G K Y + SN 
Sbjct: 191 GCQPYEFPPCEHHVIGPLPSCDGDV-ETPSCKTNCQPGYNIPYEKDKWYGEKVYRIHSNP 249

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           ++IM E+  +GPVE  F V+ D   YKSG +                             
Sbjct: 250 EAIMLELMRNGPVEVDFEVYADFPNYKSGVY----------------------------- 280

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + SG  LGGHA+R+LGWGE+  +   YWLIANSWN+DWGD G FKI+RGK+
Sbjct: 281 --------QHVSGALLGGHAVRLLGWGEE--NNVPYWLIANSWNSDWGDKGYFKIVRGKN 330

Query: 294 ECGIESSITAGVPKL 308
           ECGIES + AG+PK+
Sbjct: 331 ECGIESDVNAGIPKI 345



 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 26/61 (42%), Positives = 38/61 (62%), Gaps = 3/61 (4%)

Query: 54  LKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
           ++  +G  PD   P     E +    + ++LP +FD+R +WP+CP+I EIRDQ SCGSCW
Sbjct: 67  IRRMLGALPD---PNGEQLETLCTGYISDELPKSFDARVEWPHCPSISEIRDQSSCGSCW 123

Query: 114 G 114
            
Sbjct: 124 A 124



 Score = 45.1 bits (105), Expect = 0.047,   Method: Compositional matrix adjust.
 Identities = 18/30 (60%), Positives = 20/30 (66%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGGFP  AW YW   GIV+G  Y +
Sbjct: 159 CGMGCNGGFPHSAWLYWKNQGIVTGDLYNT 188


>gi|183988834|gb|ACC66066.1| cathepsin B [Samia ricini]
          Length = 283

 Score =  167 bits (422), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 88/191 (46%), Positives = 112/191 (58%), Gaps = 40/191 (20%)

Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
           G+  S  GCRPYEI PCEHHV G R  C+     TPKC + C+ +Y+VP+KKD  +G   
Sbjct: 133 GNYNSSQGCRPYEIPPCEHHVPGNRMPCNGDT-KTPKCQKNCESSYNVPFKKDKRYGKHV 191

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           YSVS +E  I  E++++GPVE AFTV+ DL+ YK+G +                      
Sbjct: 192 YSVSGHEDHIKAELFKNGPVEAAFTVYSDLLSYKNGVY---------------------- 229

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
                           +  G ALGGHAI+I+GWG +  +  KYWLIANSWN+DWGDNG F
Sbjct: 230 ---------------KHTEGNALGGHAIKIIGWGVE--NNNKYWLIANSWNSDWGDNGFF 272

Query: 287 KILRGKDECGI 297
           KILRG+D CGI
Sbjct: 273 KILRGEDHCGI 283



 Score = 53.9 bits (128), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 23/36 (63%), Positives = 25/36 (69%)

Query: 79  EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           E+   LP  FD R KWP C T+ EIRDQGSCGSCW 
Sbjct: 38  ELIATLPEIFDPRDKWPECLTLNEIRDQGSCGSCWA 73


>gi|240992702|ref|XP_002404475.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
 gi|215491572|gb|EEC01213.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
          Length = 337

 Score =  166 bits (421), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 88/194 (45%), Positives = 110/194 (56%), Gaps = 40/194 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY +APCEHH  G+ P+C  +   TPKCV  C++ Y   Y+ D +FG K YS+SS+E
Sbjct: 182 GCKPYSLAPCEHHTKGSLPNCTGTVP-TPKCVHLCRKGYGKDYQDDKHFGKKVYSISSDE 240

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K I  EI+++GPVE  F V  D + YKSG +                             
Sbjct: 241 KQIQTEIFKNGPVEADFIVLADFLSYKSGVY----------------------------- 271

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + S   +GGHAIRILGWG +  +   YWL ANSWN DWGD+G FKILRGKD
Sbjct: 272 --------QHHSDDVIGGHAIRILGWGTENGT--PYWLAANSWNEDWGDHGYFKILRGKD 321

Query: 294 ECGIESSITAGVPK 307
           ECGIE  I AG+PK
Sbjct: 322 ECGIEEDINAGIPK 335



 Score = 48.1 bits (113), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 20/35 (57%), Positives = 24/35 (68%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK 43
           CG GCNGG P  AW YW +SG+V+GG YG+    K
Sbjct: 150 CGAGCNGGTPAAAWEYWKESGLVTGGLYGTNDGCK 184


>gi|338815385|gb|AEJ08755.1| cathepsin B [Crassostrea ariakensis]
          Length = 341

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 90/195 (46%), Positives = 111/195 (56%), Gaps = 40/195 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY I  C+HHV G    C    G TPKC   C+  Y+V Y+KD ++G  +YSV   E
Sbjct: 186 GCQPYTIKACDHHVVGKLQPCSKDIGPTPKCKHTCEAGYNVTYEKDKHYGMSAYSVHGVE 245

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EI  +GPVEGAFTV+ D   YKSG +                             
Sbjct: 246 K-IMTEIMTNGPVEGAFTVYADFPQYKSGVY----------------------------- 275

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G+ LGGHAI+ILGWG +  + + YWL+ANSWN DWGD G FKILRG+D
Sbjct: 276 --------KHTTGQPLGGHAIKILGWGTE--NGDDYWLVANSWNPDWGDQGFFKILRGQD 325

Query: 294 ECGIESSITAGVPKL 308
           ECGIES I+AG PKL
Sbjct: 326 ECGIESQISAGEPKL 340



 Score = 47.4 bits (111), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 20/38 (52%), Positives = 24/38 (63%)

Query: 3   TQQIRLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           T   R CG GC GGFP  AW Y+ + G+V+GG Y S Q
Sbjct: 148 TSCCRTCGNGCEGGFPSAAWSYYKRDGLVTGGQYNSHQ 185


>gi|156365510|ref|XP_001626688.1| predicted protein [Nematostella vectensis]
 gi|156213574|gb|EDO34588.1| predicted protein [Nematostella vectensis]
          Length = 259

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 90/195 (46%), Positives = 109/195 (55%), Gaps = 40/195 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY+IA C+HHV G    C      TPKC R+C+  Y+V Y  D +FG  +YSV S+ 
Sbjct: 101 GCQPYKIAACDHHVVGKLKPCKGDS-PTPKCERKCEAGYNVSYSDDKHFGQSAYSVRSDP 159

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
             I KEI  +GPVEGAFTV+ D   YKSG +                             
Sbjct: 160 AEIQKEIMTNGPVEGAFTVYADFPTYKSGVY----------------------------- 190

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + SG ALGGHAI+ILGWGE+  +   YWL+ANSWN+DWGD G FKI RG D
Sbjct: 191 --------QHTSGSALGGHAIKILGWGEENGT--PYWLVANSWNSDWGDEGFFKIKRGND 240

Query: 294 ECGIESSITAGVPKL 308
           ECGIES I  G+PK 
Sbjct: 241 ECGIESGIVGGLPKF 255



 Score = 43.1 bits (100), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 17/35 (48%), Positives = 22/35 (62%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
             CG GCNGG+P  AW +W   G+V+GG Y S + 
Sbjct: 67  ETCGMGCNGGYPESAWDHWKSKGLVTGGQYDSHKG 101


>gi|325302582|dbj|BAJ83491.1| cathepsin B-like peptidase [Echinococcus multilocularis]
          Length = 338

 Score =  166 bits (419), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 90/210 (42%), Positives = 115/210 (54%), Gaps = 40/210 (19%)

Query: 99  TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
            I  I   G  GS  GCRPYEI PCEHH +G RP C  +   TPKC R+C E++D  Y+ 
Sbjct: 167 AIDGIVSGGLYGSHVGCRPYEIPPCEHHTSGNRPDCKGNS-KTPKCQRQCVESFDGKYQA 225

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
           D +F +  Y+V ++E+ IM EI  +GPVE  F V+ D + YKSG +              
Sbjct: 226 DKHFASNVYNVRASEEDIMNEILVYGPVEADFIVYADFLTYKSGVY-------------- 271

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
                                   +  G  LGGHA++ILGWGE+  +   YWL ANSWNT
Sbjct: 272 -----------------------QHVKGGFLGGHAVKILGWGEE--NGVPYWLCANSWNT 306

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPKL 308
           DWGD G FKILRG + C IE+ I AG+PK+
Sbjct: 307 DWGDGGFFKILRGYNHCKIEADINAGIPKI 336



 Score = 57.4 bits (137), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 22/30 (73%), Positives = 26/30 (86%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
           LP+ FD+R  WP+CPTI EIRDQG+CGSCW
Sbjct: 84  LPSEFDARKAWPDCPTIGEIRDQGTCGSCW 113



 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 23/31 (74%), Positives = 23/31 (74%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
            CGFGCNGG P  AWRYW   GIVSGG YGS
Sbjct: 149 FCGFGCNGGLPENAWRYWAIDGIVSGGLYGS 179


>gi|121309133|dbj|BAF43801.1| Longipain [Haemaphysalis longicornis]
          Length = 341

 Score =  165 bits (417), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 90/194 (46%), Positives = 110/194 (56%), Gaps = 39/194 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY I  C+HHVNGT   CD S   TP+CVR C++ Y+V +  D ++G KSYSV SN 
Sbjct: 186 GCMPYPIKACDHHVNGTLGPCDKSIPPTPRCVRMCRKGYNVDFADDKHYGKKSYSVPSNV 245

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
             I  EI  +GPVE  FTV+ D  LYKSG +                 + +T Q      
Sbjct: 246 TQIQVEIMTNGPVEADFTVYADFPLYKSGVY-----------------QRHTDQ------ 282

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                         ALGGHAIR+LGWG ++     YWL ANSWNT+WGD G FKILRG D
Sbjct: 283 --------------ALGGHAIRLLGWGVEKGV--PYWLAANSWNTEWGDKGFFKILRGSD 326

Query: 294 ECGIESSITAGVPK 307
           ECGIE  + AG+P+
Sbjct: 327 ECGIEDDVVAGIPR 340



 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 22/32 (68%), Positives = 24/32 (75%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CG GCNGGFPG AW YWV  GIV+GG Y S +
Sbjct: 154 CGSGCNGGFPGAAWSYWVHKGIVTGGNYDSDE 185


>gi|56756587|gb|AAW26466.1| unknown [Schistosoma japonicum]
          Length = 216

 Score =  165 bits (417), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 84/194 (43%), Positives = 113/194 (58%), Gaps = 39/194 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY    CEHH  G  P+C      TP+C ++CQ+ Y  PYK+D ++G +SY+V SNE
Sbjct: 61  GCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQKCQKGYKTPYKQDKHYGDESYNVISNE 120

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K+I KEI  +GPVE AF V++D + YKSG                               
Sbjct: 121 KAIQKEIMMNGPVEAAFDVYEDFLNYKSG------------------------------- 149

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                 +  + +G  +GGHAIRI+GWG   K +  YWLIANSWN DWG+ GLF+I+RG+D
Sbjct: 150 ------IYRHVTGSIVGGHAIRIIGWG--VKKRTPYWLIANSWNEDWGEKGLFRIVRGRD 201

Query: 294 ECGIESSITAGVPK 307
           EC IES++ AG+ K
Sbjct: 202 ECSIESNVVAGLIK 215



 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 19/27 (70%), Positives = 22/27 (81%)

Query: 9  CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
          CG GC GGFPG+AW YWV  GIV+GG+
Sbjct: 29 CGQGCQGGFPGVAWDYWVTQGIVTGGS 55


>gi|427785213|gb|JAA58058.1| Putative cathepsin l culex quinquefasciatus cathepsin l
           [Rhipicephalus pulchellus]
          Length = 346

 Score =  165 bits (417), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 89/194 (45%), Positives = 106/194 (54%), Gaps = 39/194 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY I  C+HHVNGT   CD     TP+CV  C++ YDV Y  D ++G  SYSV S E
Sbjct: 191 GCMPYPIKACDHHVNGTLGPCDKKIPPTPRCVHMCRKGYDVDYHDDKHYGKSSYSVPSEE 250

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K I  EI  +GPVE  FTV+ D + YKSG +    +E                       
Sbjct: 251 KQIQAEIMTNGPVEADFTVYSDFVHYKSGVYQRHTDE----------------------- 287

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                         ALGGHAIR+LGWG +  +   YWL ANSWNT+WGD G FKILRG D
Sbjct: 288 --------------ALGGHAIRLLGWGVE--NGVPYWLAANSWNTEWGDKGFFKILRGSD 331

Query: 294 ECGIESSITAGVPK 307
           ECGIE  + AG+PK
Sbjct: 332 ECGIEDDVVAGLPK 345



 Score = 54.7 bits (130), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 23/32 (71%), Positives = 26/32 (81%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           R CG GCNGGFPG AW +WVK+GIV+GG Y S
Sbjct: 157 RTCGNGCNGGFPGSAWSFWVKTGIVTGGNYDS 188


>gi|357613937|gb|EHJ68797.1| cathepsin B-like cysteine proteinase [Danaus plexippus]
          Length = 334

 Score =  164 bits (416), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 114/322 (35%), Positives = 146/322 (45%), Gaps = 107/322 (33%)

Query: 54  LKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
           +K  MG   D  L  ++L  +    +   +LP NFD R KWPNCPT+ EIRDQGSCGSCW
Sbjct: 54  IKKLMGALEDKYL--HKLYTVEHDDDTINNLPENFDPRDKWPNCPTLNEIRDQGSCGSCW 111

Query: 114 GCRPYEIAPCEH--HVNGTR-------------PSC------------------------ 134
                E     +  + NGT+             P C                        
Sbjct: 112 AFGAVEAMTDRYCTYSNGTKHFHFSAEDLLSCCPVCGLGCNGGIPSFAWEYWKHFGIVSG 171

Query: 135 ---DASKGHTPKCVRECQEN------------------------YDVPYKKDLNFGAKSY 167
              ++S+G  P  +  C+ +                        Y   YK D  +G   Y
Sbjct: 172 GNYNSSQGCLPYEIPPCEHHVPGNRIPCNGETSTPKCHRSCRKEYTNSYKSDKKYGKHVY 231

Query: 168 SVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQ 227
           SV   E+ I  EI+++GPVEGAFTV+ DL+ YKSG +                       
Sbjct: 232 SVGGGEEHIKAEIFKNGPVEGAFTVYADLLTYKSGVY----------------------- 268

Query: 228 LGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFK 287
                          +  G+ALGGHAI+I+GWG +  +  KYWLIANSWN+DWGDNG FK
Sbjct: 269 --------------KHTEGEALGGHAIKIMGWGVE--NGNKYWLIANSWNSDWGDNGFFK 312

Query: 288 ILRGKDECGIESSITAGVPKLD 309
           ILRG+D CGIESSI AG P  D
Sbjct: 313 ILRGEDHCGIESSIVAGEPSYD 334



 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 21/34 (61%), Positives = 22/34 (64%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
           +CG GCNGG P  AW YW   GIVSGG Y S Q 
Sbjct: 146 VCGLGCNGGIPSFAWEYWKHFGIVSGGNYNSSQG 179


>gi|160688716|gb|ABX45136.1| cathepsin B-like cysteine protease 2 [Callosobruchus maculatus]
          Length = 260

 Score =  164 bits (415), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 92/236 (38%), Positives = 124/236 (52%), Gaps = 46/236 (19%)

Query: 73  ELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRP 132
           E I + +  +DLP  FD+R +W  C +I+EIRDQ  CGSCWGC  Y +  C        P
Sbjct: 70  ETIFHEDDGKDLPEEFDARKQWSKCESIKEIRDQSGCGSCWGCMSYPLPRC-------NP 122

Query: 133 SCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN-EKSIMKEIYEHGPVEGAFT 191
           SC  +    P C +EC +   + Y++D ++  ++Y + S  E+ I  EI ++GPV  +FT
Sbjct: 123 SC-KTLYDAPTCKKECDKGSPLKYEEDKHYAKQAYRIMSKVERQIQLEIIKNGPVVASFT 181

Query: 192 VFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGG 251
           V+ D I Y SG +   G                                      K LGG
Sbjct: 182 VYADFIHYLSGVYKFDGE------------------------------------SKLLGG 205

Query: 252 HAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPK 307
           HA+RI+GWG  E     YWL++NSWN  WGD GLFKI RGK+ECGIE  ITAG+P+
Sbjct: 206 HAVRIIGWG-IENGTYPYWLVSNSWNERWGDQGLFKIWRGKNECGIEEEITAGLPR 260


>gi|346472613|gb|AEO36151.1| hypothetical protein [Amblyomma maculatum]
          Length = 373

 Score =  164 bits (415), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 87/194 (44%), Positives = 108/194 (55%), Gaps = 39/194 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY I  C+HHVNGT   CD +   TP+CVR C++ YDV +  D ++G  +YSV +  
Sbjct: 218 GCMPYPIKACDHHVNGTLGPCDKTIPPTPRCVRMCRKGYDVDFMDDKHYGRHAYSVPAKA 277

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K I  EI  +GPVE  FTV++D + YKSG +                             
Sbjct: 278 KQIQAEIMMNGPVEADFTVYEDFLHYKSGVY----------------------------- 308

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                      +  ALGGHAIR+LGWG +  +   YWL ANSWNT+WGD G FKILRG D
Sbjct: 309 --------QRHTDSALGGHAIRLLGWGVE--NGVPYWLAANSWNTEWGDKGFFKILRGSD 358

Query: 294 ECGIESSITAGVPK 307
           ECGIES I AG+PK
Sbjct: 359 ECGIESDIVAGLPK 372



 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 22/32 (68%), Positives = 24/32 (75%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CG GCNGGFPG AW YWV  GIV+GG Y S +
Sbjct: 186 CGAGCNGGFPGSAWSYWVHKGIVTGGNYDSDE 217


>gi|38147393|gb|AAR12009.1| cathepsin B-like proteinase [Triatoma infestans]
          Length = 332

 Score =  164 bits (415), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 91/213 (42%), Positives = 117/213 (54%), Gaps = 54/213 (25%)

Query: 103 IRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
           I   G+ GS  GC+PY IAPCEHHV G RP+C + +G TP C  +C +   + Y KDL +
Sbjct: 167 IVSGGNYGSKQGCQPYSIAPCEHHVPGPRPAC-SGEGSTPDCRNQCDKRSGISYDKDLYY 225

Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
           G  +YS+    K I  EI ++GPVE                                   
Sbjct: 226 GESAYSLEDEAKQIQAEILKNGPVEA---------------------------------- 251

Query: 223 DNTSQLGAEGAFTVFDDLILYK-------SGKALGGHAIRILGWGEDEKSKEKYWLIANS 275
                     AFTV++DL+ YK       +G  LGGHAI+ILGWG +  +   YWL+ANS
Sbjct: 252 ----------AFTVYEDLVNYKEGVYQHVAGSVLGGHAIKILGWGVENDTP--YWLVANS 299

Query: 276 WNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           WNTDWG+NG FKILRGKDECGIE  ++AG+P+L
Sbjct: 300 WNTDWGNNGFFKILRGKDECGIEIDVSAGLPRL 332



 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 23/32 (71%), Positives = 25/32 (78%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CGFGC+GG+P  AW YW   GIVSGG YGSKQ
Sbjct: 146 CGFGCDGGYPASAWDYWQNVGIVSGGNYGSKQ 177


>gi|31872149|gb|AAP59456.1| cathepsin B precursor [Araneus ventricosus]
          Length = 334

 Score =  164 bits (415), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 112/304 (36%), Positives = 155/304 (50%), Gaps = 47/304 (15%)

Query: 40  QAEKNSLSNIPRAHLKSWMGVHPD---YNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
           +A +N    +   +++  +GVH D   Y LP+ R         V  DLP +FDSR +WPN
Sbjct: 43  KAGRNFHEGVTMKYIRGLLGVHKDNHKYRLPSIR-------HAVPGDLPESFDSREQWPN 95

Query: 97  CPTIREIRDQGSCGSCWGCRPYEIAPCEH--HVNGTRPSCDASKGHTPKCVREC------ 148
           CPTI EIRDQGSCGSCW     E     H  H NG + + + S      C   C      
Sbjct: 96  CPTISEIRDQGSCGSCWAFGAAEAMSDRHCIHSNG-KVNVEISAEDLLTCCDSCGMGCNG 154

Query: 149 ---QENYDVPYKKDL--------NFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDL- 196
                 ++    K L        + G + Y+++S E     ++   G +           
Sbjct: 155 GFPGSAWEYWVDKGLVTGGLYNSHVGCQPYTIASCEHHTKGKLPPCGDIVDTPQCVHMCE 214

Query: 197 ----ILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLG-AEGAFTVFDDLILYKS------ 245
               + Y++ ++F  G ++ ++   +  I+   S  G  E AFTV+ D + YKS      
Sbjct: 215 KGYNVSYRADKYF--GKKSYSIDEQEDQIKTEISTNGPVEAAFTVYADFVTYKSGVYRHV 272

Query: 246 -GKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAG 304
            G+ +GGHA+RILGWG +  S   YWL+ANSWNTDWGD G FKILRG DECGIESSI AG
Sbjct: 273 TGEEMGGHAVRILGWGTE--SGTPYWLVANSWNTDWGDKGYFKILRGSDECGIESSIVAG 330

Query: 305 VPKL 308
           +PK+
Sbjct: 331 LPKV 334



 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 21/30 (70%), Positives = 23/30 (76%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGGFPG AW YWV  G+V+GG Y S
Sbjct: 148 CGMGCNGGFPGSAWEYWVDKGLVTGGLYNS 177


>gi|741376|prf||2007265A cathepsin B
          Length = 153

 Score =  164 bits (414), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 89/194 (45%), Positives = 112/194 (57%), Gaps = 54/194 (27%)

Query: 123 CEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYE 182
           CEHHVNG+RP C   +G TPKC + C+  Y   YK+D ++G  SYSVS++EK IM EIY+
Sbjct: 1   CEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYK 59

Query: 183 HGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLIL 242
           +GPVE                                            GAF+V+ D +L
Sbjct: 60  NGPVE--------------------------------------------GAFSVYSDFLL 75

Query: 243 YKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDEC 295
           YKSG       + +GGHAIRILGWG +  +   YWL+ANSWNTDWGDNG FKILRG+D C
Sbjct: 76  YKSGVYQHVTGEMMGGHAIRILGWGVENGTP--YWLVANSWNTDWGDNGFFKILRGQDHC 133

Query: 296 GIESSITAGVPKLD 309
           GIES + AG+P+ D
Sbjct: 134 GIESEVVAGIPRTD 147


>gi|306992171|gb|ADN19566.1| cathepsin B-like proteinase [Spodoptera frugiperda]
          Length = 341

 Score =  164 bits (414), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 97/209 (46%), Positives = 114/209 (54%), Gaps = 54/209 (25%)

Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
           GS  S  GCRPYEI PCEHHV G R  C+     TPKC + C+ +Y V Y KD  +G   
Sbjct: 177 GSYNSGQGCRPYEIPPCEHHVPGNRVPCNGDS-KTPKCHKTCEASYSVDYHKDKRYGKHV 235

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           YSVSS E  I  E++++GPVE                                       
Sbjct: 236 YSVSSKEDHIKAELFKNGPVE--------------------------------------- 256

Query: 227 QLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
                GAFTV+ DL+ YK+G        ALGGHAI+ILGWG +  +K  Y LIANSWN+D
Sbjct: 257 -----GAFTVYSDLLNYKNGVYKHTVGNALGGHAIKILGWGVENGNK--YRLIANSWNSD 309

Query: 280 WGDNGLFKILRGKDECGIESSITAGVPKL 308
           WGDNG FKILRG+D CGIESSI AG P L
Sbjct: 310 WGDNGFFKILRGEDHCGIESSIVAGEPLL 338


>gi|170028912|ref|XP_001842338.1| oryzain gamma chain [Culex quinquefasciatus]
 gi|167879388|gb|EDS42771.1| oryzain gamma chain [Culex quinquefasciatus]
          Length = 333

 Score =  163 bits (413), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 89/202 (44%), Positives = 113/202 (55%), Gaps = 41/202 (20%)

Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
           G  GS  GCRPY I PC H  NG +  C  S   TPKC+++C   Y+VPY KD +FG  +
Sbjct: 173 GPFGSDQGCRPYTIEPCVHVENGAQSPCKDSI--TPKCIKKCLPGYNVPYAKDKSFGKST 230

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           YS++++E+ I KEI+ +GPVE  FTVFDD   YK G                        
Sbjct: 231 YSIANDERQIRKEIFTNGPVEATFTVFDDFASYKHG------------------------ 266

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
                        +  + SG   G HA+RILGWG +  +  KYWL ANSWN+DWGDNG F
Sbjct: 267 -------------IYQHTSGNLAGEHAVRILGWGVENGT--KYWLAANSWNSDWGDNGYF 311

Query: 287 KILRGKDECGIESSITAGVPKL 308
           KILRG +   IES+I AG+PK+
Sbjct: 312 KILRGSNHVDIESAIVAGLPKV 333



 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 18/32 (56%), Positives = 25/32 (78%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CG GC+GG PG  W++W++ G+VSGG +GS Q
Sbjct: 148 CGHGCDGGAPGAGWKHWIEKGLVSGGPFGSDQ 179


>gi|313233819|emb|CBY09988.1| unnamed protein product [Oikopleura dioica]
          Length = 356

 Score =  163 bits (413), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 86/200 (43%), Positives = 110/200 (55%), Gaps = 53/200 (26%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY I PCEHH  G RP C   +G TPKC  +C + Y   + +D ++G+ +Y + +NE
Sbjct: 192 GCQPYAIEPCEHHTEGDRPPCTGEEGTTPKCSHKCVDGYTGNFAQDKHYGSVAYRIPANE 251

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K+IM EIY++GPV                                            EGA
Sbjct: 252 KAIMNEIYKNGPV--------------------------------------------EGA 267

Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
           F V++D   YKSG        ALGGHAIR+LGWGE+  + EKYWL  NSWNTDWG+NG F
Sbjct: 268 FIVYEDFPTYKSGVYSHHTGSALGGHAIRVLGWGEE--NGEKYWLCGNSWNTDWGNNGFF 325

Query: 287 KILRGKDECGIESSITAGVP 306
           KI RG +ECGIES +  G+P
Sbjct: 326 KIKRGVNECGIESEMVGGIP 345



 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 37/77 (48%), Positives = 46/77 (59%), Gaps = 6/77 (7%)

Query: 38  SKQAEKNSLSNIPRAHLKSWMG-VHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
           S +A  N  SN    H+    G +  D  LP N L      ++ D +LPANFDSR  WP+
Sbjct: 53  SWKAGANFNSNYAPKHVAGLCGTIMGDDRLPVNHL-----LNDADLELPANFDSREAWPD 107

Query: 97  CPTIREIRDQGSCGSCW 113
           CP+I E+RDQGSCGSCW
Sbjct: 108 CPSISEVRDQGSCGSCW 124



 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 20/29 (68%), Positives = 24/29 (82%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
           +CG GCNGGFP  AW YWV++G+VSGG Y
Sbjct: 160 VCGNGCNGGFPQAAWEYWVQNGLVSGGLY 188


>gi|56758040|gb|AAW27160.1| unknown [Schistosoma japonicum]
          Length = 216

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 82/194 (42%), Positives = 113/194 (58%), Gaps = 39/194 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY    CEHH  G  P+C      TP+C + CQ+ Y  PY++D ++G +SY+V SNE
Sbjct: 61  GCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDESYNVISNE 120

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K+I KEI  +GPVE AF V++D + YKSG                               
Sbjct: 121 KAIQKEIMMNGPVEAAFDVYEDFLNYKSG------------------------------- 149

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                 +  + +G  +GGHAIRI+GWG ++++   YWLIANSWN DWG+ GLF+I+RG+D
Sbjct: 150 ------IYRHVTGSIVGGHAIRIIGWGVEKRT--PYWLIANSWNEDWGEKGLFRIVRGRD 201

Query: 294 ECGIESSITAGVPK 307
           EC IES + AG+ K
Sbjct: 202 ECSIESHVVAGLIK 215



 Score = 47.4 bits (111), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 19/27 (70%), Positives = 21/27 (77%)

Query: 9  CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
          CG GC GGFPG AW YWV  GIV+GG+
Sbjct: 29 CGDGCQGGFPGQAWDYWVTQGIVTGGS 55


>gi|449667614|ref|XP_002166962.2| PREDICTED: cathepsin B-like [Hydra magnipapillata]
          Length = 330

 Score =  162 bits (411), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 84/195 (43%), Positives = 113/195 (57%), Gaps = 40/195 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY I  C+HHV  ++  C+ S   TPKC + C++ Y++ YK D ++G  SYS+++++
Sbjct: 176 GCQPYAIPACDHHVPHSKNPCNGSLP-TPKCEKVCEKGYNITYKNDKHYGVTSYSINNDQ 234

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
             IM+EI  +GPVE AFTVF D   YKSG +                             
Sbjct: 235 NEIMREIMTNGPVEAAFTVFADFPNYKSGVY----------------------------- 265

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + SG+ LGGHAI+ILGWG +  +   YWL+ANSWN  WGDNG FKILRG D
Sbjct: 266 --------QHVSGEELGGHAIKILGWGVENNT--PYWLVANSWNPSWGDNGFFKILRGSD 315

Query: 294 ECGIESSITAGVPKL 308
           ECGIE  + AG+PK+
Sbjct: 316 ECGIEDEVVAGLPKV 330


>gi|323147412|gb|ADX32985.1| cathepsin B [Pinctada fucata]
          Length = 366

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 86/195 (44%), Positives = 109/195 (55%), Gaps = 40/195 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY +  C+HHV G    C   + HTP C  EC+  Y+V Y KD ++GA +YSV   +
Sbjct: 211 GCQPYTVKACDHHVVGKLQPCSKKEEHTPVCKHECESGYNVSYTKDKHYGATAYSVRGVQ 270

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           + IM EI  +GPVEGAFTV+ D   YKSG +                             
Sbjct: 271 Q-IMTEIMTNGPVEGAFTVYADFPQYKSGVY----------------------------- 300

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G  LGGHAI+I+GWG +    + YWL+ANSWN DWG+ G FKILRG+D
Sbjct: 301 --------KHTTGSPLGGHAIKIMGWGTE--GGDDYWLVANSWNPDWGNQGTFKILRGRD 350

Query: 294 ECGIESSITAGVPKL 308
           ECGIES I AG PKL
Sbjct: 351 ECGIESQIAAGEPKL 365



 Score = 44.7 bits (104), Expect = 0.056,   Method: Compositional matrix adjust.
 Identities = 20/39 (51%), Positives = 24/39 (61%)

Query: 3   TQQIRLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
           T   R CG GCNGGF   AW Y+ + G+V+GG Y S Q 
Sbjct: 173 TSCCRSCGNGCNGGFLSGAWEYYKRDGLVTGGQYNSHQG 211


>gi|221107055|ref|XP_002166984.1| PREDICTED: cathepsin B-like [Hydra magnipapillata]
          Length = 330

 Score =  161 bits (407), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 86/195 (44%), Positives = 113/195 (57%), Gaps = 40/195 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PYEI  CEHH +G++  C+ S+  TPKC R C+E Y+V Y  D +  +  YS++++E
Sbjct: 176 GCQPYEIPSCEHHTSGSKKPCEGSE-PTPKCKRSCREGYNVSYSDDKHKVSSHYSIANDE 234

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           + I  EIY +GPVE AFTV+ D   YKSG +                             
Sbjct: 235 EQIKNEIYLNGPVEAAFTVYSDFPNYKSGVY----------------------------- 265

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    Y +G ALGGHAI+ILGWG +  +   YWL+ANSWN DWGD G FKILRG +
Sbjct: 266 --------KYTTGNALGGHAIKILGWGVE--NNVPYWLVANSWNPDWGDKGFFKILRGSN 315

Query: 294 ECGIESSITAGVPKL 308
           ECGIE+S+ AG+  L
Sbjct: 316 ECGIEASVVAGMVLL 330



 Score = 48.1 bits (113), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 21/35 (60%), Positives = 25/35 (71%), Gaps = 1/35 (2%)

Query: 80  VDEDLPANFDSRTKW-PNCPTIREIRDQGSCGSCW 113
           V   LP ++D+R KW   CP+  EIRDQGSCGSCW
Sbjct: 73  VIATLPDSYDTREKWGSTCPSTTEIRDQGSCGSCW 107



 Score = 39.7 bits (91), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 17/32 (53%), Positives = 22/32 (68%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CGFGCNGG  G AW ++  +G V+GG Y S +
Sbjct: 144 CGFGCNGGRLGPAWNFFKYAGAVTGGQYNSSE 175


>gi|346470617|gb|AEO35153.1| hypothetical protein [Amblyomma maculatum]
          Length = 335

 Score =  160 bits (406), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 85/194 (43%), Positives = 109/194 (56%), Gaps = 40/194 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY   PCEHH  G  P+C   K  TP+CVR+C++ Y+  Y +D ++  K Y++S++E
Sbjct: 181 GCQPYYFPPCEHHTVGPLPNCTGIKP-TPQCVRDCRKGYEKSYSEDKHYAKKVYTLSADE 239

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
             I  EI+++GPVE  FTV+ D + YKSG +                             
Sbjct: 240 TQIKTEIFKNGPVEADFTVYADFVSYKSGVY----------------------------- 270

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                      S  ALGGHAIRILGWG +  +   YWL+ANSWN DWGD G FKILRG D
Sbjct: 271 --------QRHSDDALGGHAIRILGWGTE--NGVPYWLVANSWNEDWGDKGYFKILRGND 320

Query: 294 ECGIESSITAGVPK 307
           ECGIE  I AG+PK
Sbjct: 321 ECGIEDDINAGIPK 334



 Score = 45.1 bits (105), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 17/30 (56%), Positives = 22/30 (73%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGG+P  AW ++   GIV+GG YG+
Sbjct: 149 CGAGCNGGYPAAAWEFYKTDGIVTGGLYGT 178


>gi|380791571|gb|AFE67661.1| cathepsin B preproprotein, partial [Macaca mulatta]
          Length = 311

 Score =  160 bits (404), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 83/174 (47%), Positives = 105/174 (60%), Gaps = 40/174 (22%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D ++G  SYSVS++E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 236

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAF+V+ D +LYKSG +                             
Sbjct: 237 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 267

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFK 287
                    + +G+ +GGHAIRILGWG +  +   YWL+ANSWNTDWGDNG FK
Sbjct: 268 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFK 311



 Score = 46.2 bits (108), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 18/31 (58%), Positives = 23/31 (74%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           +CG GCNGG+P  AW +W + G+VSGG Y S
Sbjct: 145 MCGDGCNGGYPAGAWNFWTRKGLVSGGLYDS 175


>gi|22531389|emb|CAD44625.1| cathepsin B1 isotype 2 [Schistosoma mansoni]
          Length = 340

 Score =  160 bits (404), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 81/190 (42%), Positives = 109/190 (57%), Gaps = 39/190 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY    CEHH  G  P C +    TP+C + CQ+ Y  PY +D + G  SY+V ++E
Sbjct: 186 GCEPYPFPKCEHHTKGKYPPCGSKIYKTPRCKQTCQKKYKTPYTQDKHRGKSSYNVKNDE 245

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K+I KEI ++GPVE +FTV++D + YKSG                               
Sbjct: 246 KAIQKEIMKYGPVEASFTVYEDFLNYKSG------------------------------- 274

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                 +  + +G+ALGGHAIRI+GWG + K+   YWLIANSWN DWG+NG F+I+RG+D
Sbjct: 275 ------IYKHITGEALGGHAIRIIGWGVENKT--PYWLIANSWNEDWGENGYFRIVRGRD 326

Query: 294 ECGIESSITA 303
           EC IES + A
Sbjct: 327 ECFIESEVIA 336



 Score = 40.8 bits (94), Expect = 0.73,   Method: Compositional matrix adjust.
 Identities = 16/27 (59%), Positives = 19/27 (70%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           CG GC GG  G AW +WVK GIV+G +
Sbjct: 154 CGLGCEGGILGPAWDFWVKEGIVTGSS 180


>gi|195729971|gb|ACG50796.1| cathepsin B1 [Trichobilharzia szidati]
          Length = 342

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 86/200 (43%), Positives = 107/200 (53%), Gaps = 53/200 (26%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY    CEHH  G  P+C      TPKC ++CQ+ Y  PYKKD  +G  SY+V S E
Sbjct: 187 GCQPYPFPKCEHHTKGKYPACGEKIYKTPKCQQKCQKGYKTPYKKDKYYGKLSYNVLSKE 246

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            +I KEI  HGPVE A                                            
Sbjct: 247 DAIKKEIMMHGPVEAA-------------------------------------------- 262

Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
           FTV+ D + YKSG         +GGHA+RI+GWG ++K+   YWLIANSWN DWG+ G F
Sbjct: 263 FTVYSDFLNYKSGIYKHMKGTVIGGHAVRIIGWGVEKKTP--YWLIANSWNEDWGEKGYF 320

Query: 287 KILRGKDECGIESSITAGVP 306
           +ILRGKD CGIES++TAG+P
Sbjct: 321 RILRGKDVCGIESAVTAGLP 340



 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 22/52 (42%), Positives = 29/52 (55%), Gaps = 1/52 (1%)

Query: 63  DYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           D  L   R P  + +  V  ++P++FDSR KW  C +I  IRDQ  CG CW 
Sbjct: 70  DEELRKKRRP-TVDHQNVSLEIPSSFDSRKKWRQCKSISNIRDQSRCGPCWA 120



 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 18/27 (66%), Positives = 21/27 (77%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           CG GC GGFPG AW YWV+ GIV+G +
Sbjct: 155 CGLGCQGGFPGAAWDYWVEEGIVTGSS 181


>gi|308504233|ref|XP_003114300.1| hypothetical protein CRE_27039 [Caenorhabditis remanei]
 gi|308261685|gb|EFP05638.1| hypothetical protein CRE_27039 [Caenorhabditis remanei]
          Length = 351

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 82/196 (41%), Positives = 108/196 (55%), Gaps = 40/196 (20%)

Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
           GC+PY   PCEHHVNGT    C ++   T KC R CQ  Y + Y +DL+FG  +Y+VS  
Sbjct: 195 GCKPYPYPPCEHHVNGTHYKPCPSNMYPTDKCERSCQAGYALTYTQDLHFGQSAYAVSKK 254

Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
              I KEI  HGPVE AF+V++D   Y  G +                            
Sbjct: 255 VTEIQKEIMTHGPVEVAFSVYEDFEHYSGGVY---------------------------- 286

Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
                    ++ +G +LGGHA+++LGWG D  +   YWL ANSWN DWG+NG F+I+RG 
Sbjct: 287 ---------VHTAGASLGGHAVKMLGWGVDNGTP--YWLCANSWNEDWGENGYFRIIRGV 335

Query: 293 DECGIESSITAGVPKL 308
           +ECGIES +  G+PKL
Sbjct: 336 NECGIESGVVGGIPKL 351



 Score = 60.8 bits (146), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 30/74 (40%), Positives = 39/74 (52%)

Query: 46  LSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRD 105
            S+ P    K  MG          R+ E+     +D  +P +FDSR +WPNCP+I +IRD
Sbjct: 59  FSSYPDTIKKQLMGAKMIEIPDEYRVFEMTHPEVLDAAIPDSFDSRAQWPNCPSISKIRD 118

Query: 106 QGSCGSCWGCRPYE 119
           Q SCGSCW     E
Sbjct: 119 QSSCGSCWAVSAAE 132



 Score = 46.2 bits (108), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 20/36 (55%), Positives = 26/36 (72%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK 43
           +CG GCNGG+P  AWR++VK G V+GG+Y  K   K
Sbjct: 162 VCGNGCNGGYPIEAWRHYVKKGYVTGGSYQEKTGCK 197


>gi|226471002|emb|CAX70582.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 83/192 (43%), Positives = 110/192 (57%), Gaps = 39/192 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY    CEH   G  P+C      TP+C + CQ+ Y  PYK+D ++G +SY+V SNE
Sbjct: 187 GCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYKQDKHYGDESYNVISNE 246

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K+I KEI  +GPVE AF V++D + YKSG                               
Sbjct: 247 KAIQKEIMMYGPVEAAFDVYEDFLNYKSG------------------------------- 275

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                 +  + +G  +GGHAIRI+GWG  EK K  YWLIANSWN DWG+ GLF+++RG+D
Sbjct: 276 ------IYRHVTGSIVGGHAIRIIGWGV-EKGK-PYWLIANSWNEDWGEKGLFRMVRGRD 327

Query: 294 ECGIESSITAGV 305
           EC IES + AG+
Sbjct: 328 ECSIESHVVAGL 339



 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 22/52 (42%), Positives = 34/52 (65%), Gaps = 1/52 (1%)

Query: 63  DYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           D  +   R P  + + +++ ++P+ FDSR KWP+C +I +IRDQ  CGSCW 
Sbjct: 70  DAEMKRKRRP-TVDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWA 120


>gi|34979797|gb|AAQ83887.1| cathepsin B [Branchiostoma belcheri tsingtauense]
          Length = 332

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 89/210 (42%), Positives = 113/210 (53%), Gaps = 54/210 (25%)

Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
           G  GS  GC+PYEI PCEHH+NG+RP+C   +  TP+C + C+  Y+V + KD ++   +
Sbjct: 170 GPYGSHQGCQPYEIKPCEHHINGSRPACGKLEP-TPRCKKSCESGYNVTFAKDKHYAKTA 228

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           YSVSS  + I  EI  +GPVE A                                     
Sbjct: 229 YSVSSKVQQIQMEIMTNGPVEAA------------------------------------- 251

Query: 227 QLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
                  FTV+ D   YKSG         LGGHA++++GWG +  +   YWLIANSWNTD
Sbjct: 252 -------FTVYADFPHYKSGVYQHESGAELGGHAVKMIGWGTEGSTP--YWLIANSWNTD 302

Query: 280 WGDNGLFKILRGKDECGIESSITAGVPKLD 309
           WG+ G FKILRG+DECGIE  I AG PKLD
Sbjct: 303 WGNMGFFKILRGQDECGIERDIVAGEPKLD 332



 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 21/34 (61%), Positives = 25/34 (73%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           + CG GCNGGFP  AW YW + G+V+GG YGS Q
Sbjct: 143 KSCGNGCNGGFPEAAWEYWKRDGLVTGGPYGSHQ 176


>gi|341904470|gb|EGT60303.1| hypothetical protein CAEBREN_20420 [Caenorhabditis brenneri]
          Length = 351

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 82/196 (41%), Positives = 107/196 (54%), Gaps = 40/196 (20%)

Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
           GC+PY   PCEHHVNGT    C +    T KC R CQ  Y + YK+DL+FG  +Y+VS  
Sbjct: 195 GCKPYPYPPCEHHVNGTHYKPCPSDMYPTDKCERSCQAGYSLTYKQDLHFGQSAYAVSKK 254

Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
              I KEI  +GPVE AFTV+ D  +Y  G +                            
Sbjct: 255 ATEIQKEIMTNGPVEVAFTVYADFEVYSGGVY---------------------------- 286

Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
                    ++ +G +LGGHA+++LGWG D  +   YWL ANSWN DWG+NG F+I+RG 
Sbjct: 287 ---------VHTAGASLGGHAVKMLGWGVDNGT--PYWLCANSWNEDWGENGYFRIIRGV 335

Query: 293 DECGIESSITAGVPKL 308
           +ECGIE  +  G+PKL
Sbjct: 336 NECGIEHGVVGGIPKL 351



 Score = 45.8 bits (107), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 19/31 (61%), Positives = 25/31 (80%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
           CG GCNGG+P  AWR++VK+G V+GG+Y  K
Sbjct: 163 CGNGCNGGYPIEAWRHYVKNGYVTGGSYQEK 193


>gi|268555790|ref|XP_002635884.1| Hypothetical protein CBG01104 [Caenorhabditis briggsae]
          Length = 337

 Score =  158 bits (399), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 85/205 (41%), Positives = 113/205 (55%), Gaps = 42/205 (20%)

Query: 107 GSCGSCWGCRPYEIAPCEHHVNG-TRPSCDASKGHTPKCVREC--QENYDVPYKKDLNFG 163
           GS  S +GC+PY IAPC   VNG T P C A +  TP+C   C  + +Y V Y+KD ++G
Sbjct: 168 GSYESQYGCKPYSIAPCGQTVNGVTWPKCPAQEEATPECASHCTSKSSYSVAYEKDKHYG 227

Query: 164 AKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRD 223
             +Y V   E  I  EI +HGPVE  F V+ D   YKSG                     
Sbjct: 228 LSAYPVGRKEAQIQTEILQHGPVEAGFLVYSDFYRYKSG--------------------- 266

Query: 224 NTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDN 283
                           +  + SG+ LGGHA++ILGWG +  +K  YWL+ANSWN +WG+ 
Sbjct: 267 ----------------IYTHVSGQELGGHAVKILGWGVENGTK--YWLVANSWNINWGEK 308

Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
           G F+ILRG++ECGIES++ AG+P L
Sbjct: 309 GYFRILRGRNECGIESAVVAGIPDL 333


>gi|268557308|ref|XP_002636643.1| Hypothetical protein CBG23351 [Caenorhabditis briggsae]
          Length = 351

 Score =  157 bits (398), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 82/196 (41%), Positives = 106/196 (54%), Gaps = 40/196 (20%)

Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
           GC+PY   PCEHHVNGT    C ++   T KC   CQ  Y + Y +DL+FG  +Y+VS  
Sbjct: 195 GCKPYPYPPCEHHVNGTHYKPCPSNMYPTDKCEHSCQAGYPLTYTQDLHFGQSAYAVSKK 254

Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
              I KEI  HGPVE AFTV++D   Y  G +                            
Sbjct: 255 PAEIQKEIMTHGPVEVAFTVYEDFEHYSGGVY---------------------------- 286

Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
                    ++ +G +LGGHA+++LGWG D  +   YWL ANSWN DWG+NG F+I+RG 
Sbjct: 287 ---------VHTAGASLGGHAVKMLGWGVDNGTP--YWLCANSWNEDWGENGYFRIIRGV 335

Query: 293 DECGIESSITAGVPKL 308
           +ECGIES +  G PKL
Sbjct: 336 NECGIESGVVGGTPKL 351



 Score = 46.6 bits (109), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 20/36 (55%), Positives = 26/36 (72%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK 43
           +CG GCNGG+P  AWR++VK G V+GG+Y  K   K
Sbjct: 162 VCGNGCNGGYPIEAWRHYVKKGYVTGGSYQEKSGCK 197


>gi|356984175|gb|AET43950.1| cathepsin B, partial [Reishia clavigera]
          Length = 209

 Score =  157 bits (398), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 85/195 (43%), Positives = 111/195 (56%), Gaps = 41/195 (21%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY IA C+HHV G    C    G TP+C ++C+  Y+V +K D ++G +SYSVSS  
Sbjct: 56  GCQPYLIAACDHHVVGKLKPCKGD-GKTPRCEKKCEAGYNVTFKDDKHYGQRSYSVSS-V 113

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
             IM+E+   GPVE AFTV+ D + Y SG +                             
Sbjct: 114 NDIMEELVTRGPVEAAFTVYSDFLQYHSGVY----------------------------- 144

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G ALGGHA++ILG+G +  + +KYWL+ANSWN DWGD G FKILRG D
Sbjct: 145 --------RHTTGSALGGHAVKILGYGVE--NGDKYWLVANSWNPDWGDQGFFKILRGVD 194

Query: 294 ECGIESSITAGVPKL 308
           ECGIE  I AG PK+
Sbjct: 195 ECGIEGQIVAGEPKV 209



 Score = 43.1 bits (100), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 18/32 (56%), Positives = 22/32 (68%)

Query: 9  CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
          CG GCNGG+P  AW  +   G+V+GG Y SKQ
Sbjct: 24 CGDGCNGGYPSAAWEVFDHDGVVTGGQYNSKQ 55


>gi|194246067|gb|ACF35525.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
           variabilis]
          Length = 192

 Score =  157 bits (398), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 86/194 (44%), Positives = 105/194 (54%), Gaps = 40/194 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY   PCEHH  G  P+C   K  TP+C + C+E Y   Y +D +FG K YS+SS+E
Sbjct: 35  GCQPYYFPPCEHHTVGPLPNCTGIK-PTPECAKTCREGYQKSYTRDKHFGKKVYSISSDE 93

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
             I  EIY++GPVE  F+V+ D   YKSG +                             
Sbjct: 94  TQIKTEIYKNGPVEADFSVYADFPSYKSGVY----------------------------- 124

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                      S + LGGHAIRILGWG ++     YWL+ANSWN DWGD G FKI RG D
Sbjct: 125 --------QRHSEEMLGGHAIRILGWGTEDGV--PYWLVANSWNEDWGDKGYFKIRRGND 174

Query: 294 ECGIESSITAGVPK 307
           ECGIE  I AG+PK
Sbjct: 175 ECGIEDDINAGIPK 188



 Score = 42.0 bits (97), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 16/33 (48%), Positives = 23/33 (69%)

Query: 9  CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
          CG GCNGG+P  AW+++    IV+GG YG++  
Sbjct: 3  CGSGCNGGYPSAAWQFYKDEDIVTGGLYGTEDG 35


>gi|87246247|gb|ABD35300.1| cathepsin B-like cysteine protease [Triatoma infestans]
          Length = 333

 Score =  157 bits (397), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 81/206 (39%), Positives = 117/206 (56%), Gaps = 40/206 (19%)

Query: 103 IRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
           I   G+ GS  GC+PY IAPCEH ++G+ P+C      TPKC ++C++ Y +PY K   +
Sbjct: 168 IVSGGNYGSKQGCQPYSIAPCEHSIHGSSPACGGVT-DTPKCKKQCEKGYSIPYDKAFYY 226

Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
           G   Y++ ++ + I  EI ++GP+  +F V++DL  YK                      
Sbjct: 227 GQPGYAIPNDAQKIQAEILKNGPIVASFLVYEDLFSYK---------------------- 264

Query: 223 DNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
                   EG +        + +G+ LGGH I+I GWG +  +   YWL+ANSWNTDWG+
Sbjct: 265 --------EGVYQ-------HVAGEFLGGHVIKIFGWGIENGTP--YWLVANSWNTDWGN 307

Query: 283 NGLFKILRGKDECGIESSITAGVPKL 308
           NG FKI RGKDECGIE  ++AG+P+L
Sbjct: 308 NGFFKIPRGKDECGIEIDVSAGLPRL 333



 Score = 47.0 bits (110), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 23/33 (69%), Positives = 23/33 (69%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
           CG GC GG P  AW YW K GIVSGG YGSKQ 
Sbjct: 147 CGDGCLGGSPESAWEYWHKFGIVSGGNYGSKQG 179


>gi|226473754|emb|CAX71562.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 329

 Score =  157 bits (396), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 93/303 (30%), Positives = 136/303 (44%), Gaps = 94/303 (31%)

Query: 59  GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSC--------- 109
           G   D NL   R P  + + +++ ++P++FDSR KWP C +I +IRDQ  C         
Sbjct: 66  GRKEDPNLRQKRRP-TVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWAVSAV 124

Query: 110 ---------------------------GSCW------------------GCRPYEIAPCE 124
                                      G  W                  GCRPY    C+
Sbjct: 125 GAISDRICIQSGGKQSYCGSGCDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCD 184

Query: 125 HHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHG 184
           H V G   +C      TP+C + CQ+ Y+  Y++D ++G  SY+V S E  I K+I  HG
Sbjct: 185 HFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHG 244

Query: 185 PVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYK 244
           PVE    +++D + YKSG                                     +  Y 
Sbjct: 245 PVEAYLEIYEDFLNYKSG-------------------------------------IYRYT 267

Query: 245 SGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAG 304
           +G+ + GHA+R++GWG +  +   YWL AN+WN DWG+ G F+I+RG++EC IES I AG
Sbjct: 268 TGQFISGHAVRLIGWGVENGT--AYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAG 325

Query: 305 VPK 307
           + K
Sbjct: 326 LIK 328



 Score = 41.6 bits (96), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 17/27 (62%), Positives = 21/27 (77%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           CG GC+GGF G +W YWV  GIV+GG+
Sbjct: 142 CGSGCDGGFLGPSWDYWVLRGIVTGGS 168


>gi|308512693|gb|ADO33000.1| cathepsin B [Biston betularia]
          Length = 217

 Score =  157 bits (396), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 92/209 (44%), Positives = 111/209 (53%), Gaps = 54/209 (25%)

Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
           G+  S  GC PY I PCEHHV G R  C+     TPKC + C+  Y+V YKKD  +G   
Sbjct: 53  GNYNSSQGCSPYVIPPCEHHVPGNRLPCNGDT-KTPKCSKTCENGYNVLYKKDKRYGKHV 111

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           Y+V   E  I  E++++GPVE A                                     
Sbjct: 112 YAVRGGEDHIKAELFKNGPVEAA------------------------------------- 134

Query: 227 QLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
                  FTV+ DL+ YKSG        ALGGHAI+I+GWG +  +K  YWLIANSWNTD
Sbjct: 135 -------FTVYADLLAYKSGVYKHVEGDALGGHAIKIIGWGVENGNK--YWLIANSWNTD 185

Query: 280 WGDNGLFKILRGKDECGIESSITAGVPKL 308
           WG+NG FKILRG+D CGIESSI AG P L
Sbjct: 186 WGNNGFFKILRGEDHCGIESSIVAGEPLL 214


>gi|241998314|ref|XP_002433800.1| longipain, putative [Ixodes scapularis]
 gi|215495559|gb|EEC05200.1| longipain, putative [Ixodes scapularis]
          Length = 339

 Score =  156 bits (395), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 90/201 (44%), Positives = 109/201 (54%), Gaps = 54/201 (26%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY +  C+HHVNGT   C      TPKCVR C++ Y++ +K D ++G  SYSVSSNE
Sbjct: 185 GCMPYPVPSCDHHVNGTLGPC-GQDPPTPKCVRLCRKGYNIDFKDDKHYGKSSYSVSSNE 243

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
             I  EI ++GPV                                            EGA
Sbjct: 244 TQIQMEIMKNGPV--------------------------------------------EGA 259

Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
           FTV+ D  LYKSG        ALGGHAIRILGWG +  +   +WL+ANSWNT+WGD G F
Sbjct: 260 FTVYADFPLYKSGVYKSHSTDALGGHAIRILGWGVE--NGVPFWLVANSWNTEWGDKGYF 317

Query: 287 KILRGKDECGIESSITAGVPK 307
           KILRG +ECGIE  I AG+PK
Sbjct: 318 KILRGSNECGIEEDIVAGIPK 338



 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 21/33 (63%), Positives = 25/33 (75%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
           CG GCNGGFPG AW YWV+ GIV+GG Y + + 
Sbjct: 153 CGSGCNGGFPGAAWSYWVEKGIVTGGNYDTDEG 185


>gi|49036808|gb|AAT48985.1| cathepsin B-like proteinase [Triatoma vitticeps]
          Length = 332

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 79/206 (38%), Positives = 115/206 (55%), Gaps = 40/206 (19%)

Query: 103 IRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
           I   G+ GS  GC+PY IAPCEH + G+RP+C+  +  TPKC ++C++ Y +PY  DL +
Sbjct: 167 IVSGGNYGSKQGCQPYSIAPCEHSIPGSRPACEGVR-DTPKCKKQCEKGYGIPYGDDLCY 225

Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
           G   Y++ ++ + I  EI ++GP+  +  V+                             
Sbjct: 226 GQPGYTIENDAQKIQAEILKNGPIVASILVY----------------------------- 256

Query: 223 DNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
                   E  F+    +  + +G+ LGGH I+ILGWG +  +   YWL+ANSWNTDWG+
Sbjct: 257 --------EDLFSYKAGVYQHVAGEVLGGHVIKILGWGVENDTP--YWLVANSWNTDWGN 306

Query: 283 NGLFKILRGKDECGIESSITAGVPKL 308
           NG FKILRG DECGIE  I AG+P++
Sbjct: 307 NGFFKILRGSDECGIEDQIVAGIPRV 332



 Score = 46.6 bits (109), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 22/32 (68%), Positives = 23/32 (71%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CG+GC GG    AW YW K GIVSGG YGSKQ
Sbjct: 146 CGYGCLGGSAENAWEYWHKFGIVSGGNYGSKQ 177


>gi|56753605|gb|AAW25005.1| SJCHGC02852 protein [Schistosoma japonicum]
          Length = 346

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 83/195 (42%), Positives = 103/195 (52%), Gaps = 38/195 (19%)

Query: 114 GCRPYEIAPCEHHVNG-TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
           GC+PY    C HH       SC+     TP+C + CQ +Y + Y+ D  +G  SY V+S+
Sbjct: 189 GCQPYPFPECIHHSTSINHSSCEVKYYSTPECYQTCQPDYAIQYENDKYYGKSSYYVTSD 248

Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
           E SIMKEI  +GPVE  F VFDD + YK+G +                            
Sbjct: 249 EVSIMKEILLNGPVEATFYVFDDFLNYKTGVY---------------------------- 280

Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
                     Y +G  LGGHAIRI+GWG    +   YWL ANSWN  WGD G FKILRG 
Sbjct: 281 ---------KYVTGSLLGGHAIRIIGWGVSTLNHTPYWLCANSWNKQWGDKGYFKILRGS 331

Query: 293 DECGIESSITAGVPK 307
           +ECGIES +TAG+PK
Sbjct: 332 NECGIESMVTAGLPK 346



 Score = 50.8 bits (120), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 20/27 (74%), Positives = 22/27 (81%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           CGFGCNGG PGMAW YW   GIV+GG+
Sbjct: 157 CGFGCNGGIPGMAWDYWKDEGIVTGGS 183


>gi|268557292|ref|XP_002636635.1| C. briggsae CBR-CPR-1 protein [Caenorhabditis briggsae]
          Length = 330

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 95/288 (32%), Positives = 130/288 (45%), Gaps = 98/288 (34%)

Query: 80  VDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAP----------------- 122
           V   +PA+FDSRT+W  C +I+ IR+Q +CGSCW     EI                   
Sbjct: 82  VLASIPASFDSRTQWSECKSIKLIRNQATCGSCWAFGAAEIISDRTCIETKGAQQPIISP 141

Query: 123 --------------CE---------------------HHVNGTRP-------SCDASKGH 140
                         CE                     +H  G +P       S +  +  
Sbjct: 142 DDLLSCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGAGCKPYPIAPCTSGNCPESK 201

Query: 141 TPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYK 200
           TP C   CQ  Y   Y KD +FGA +Y+V+ +  +I  EI  +GPVE AFTV++D   YK
Sbjct: 202 TPACSLSCQSGYSTAYAKDKHFGASAYAVARSVAAIQTEIMTNGPVEAAFTVYEDFYKYK 261

Query: 201 SGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWG 260
           SG +                                      + +GKALGGHAI+I+GWG
Sbjct: 262 SGVY-------------------------------------KHTAGKALGGHAIKIIGWG 284

Query: 261 EDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
            +  S   YWL+ANSW T+WG++G FKILRG D+CGIE ++ AG  ++
Sbjct: 285 TE--SGSPYWLVANSWGTNWGESGFFKILRGDDQCGIEGAVVAGKARV 330


>gi|32566081|ref|NP_506002.2| Protein CPR-1 [Caenorhabditis elegans]
 gi|32172429|sp|P25807.2|CPR1_CAEEL RecName: Full=Gut-specific cysteine proteinase; Flags: Precursor
 gi|1395200|gb|AAB88058.1| gut-specific cysteine protease-1 [Caenorhabditis elegans]
 gi|24817276|emb|CAB01410.2| Protein CPR-1 [Caenorhabditis elegans]
          Length = 329

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 97/288 (33%), Positives = 129/288 (44%), Gaps = 98/288 (34%)

Query: 80  VDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW-----------------GCRPYEIAP 122
           V   +PA FDSRT+W  C +I+ IRDQ +CGSCW                 G +   I+P
Sbjct: 81  VLASVPATFDSRTQWSECKSIKLIRDQATCGSCWAFGAAEMISDRTCIETKGAQQPIISP 140

Query: 123 --------------CE---------------------HHVNGTRP-------SCDASKGH 140
                         CE                     +H  G +P       S +  +  
Sbjct: 141 DDLLSCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGAGCKPYPIAPCTSGNCPESK 200

Query: 141 TPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYK 200
           TP C   CQ  Y   Y KD +FG  +Y+V  N  SI  EIY +GPVE AF+V++D   YK
Sbjct: 201 TPSCSMSCQSGYSTAYAKDKHFGVSAYAVPKNAASIQAEIYANGPVEAAFSVYEDFYKYK 260

Query: 201 SGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWG 260
           SG +                                      + +GK LGGHAI+I+GWG
Sbjct: 261 SGVY-------------------------------------KHTAGKYLGGHAIKIIGWG 283

Query: 261 EDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
            +  S   YWL+ANSW  +WG++G FKI RG D+CGIES++ AG  K+
Sbjct: 284 TE--SGSPYWLVANSWGVNWGESGFFKIYRGDDQCGIESAVVAGKAKV 329


>gi|55793949|gb|AAV65885.1| cathepsin B1 isotype 5 precursor [Trichobilharzia regenti]
          Length = 342

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 79/194 (40%), Positives = 107/194 (55%), Gaps = 39/194 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY    CEHH  G  P C       PKC ++CQ+ Y  PY+KD  +G  SY++  NE
Sbjct: 187 GCQPYPFPKCEHHTKGRYPECGEIIYMKPKCHQKCQKGYKTPYEKDKYYGKVSYNLLKNE 246

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            SI KEI  HGPVE +F V  D + YKSG                               
Sbjct: 247 DSIKKEIMMHGPVEASFRVHSDFLNYKSG------------------------------- 275

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                 +  + +G  +G H +RI+GWG ++++   YWLIANSWN DWG+ G F++LRGKD
Sbjct: 276 ------IYKHMTGIDIGSHVVRIIGWGVEKET--PYWLIANSWNEDWGEKGYFRMLRGKD 327

Query: 294 ECGIESSITAGVPK 307
           ECGIES++T+G+P+
Sbjct: 328 ECGIESAVTSGLPR 341



 Score = 48.5 bits (114), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 18/27 (66%), Positives = 22/27 (81%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           CG GC  GFPG+AW YWV+ GIV+GG+
Sbjct: 155 CGLGCQMGFPGIAWDYWVQEGIVTGGS 181


>gi|256086863|ref|XP_002579605.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
 gi|353228447|emb|CCD74618.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 271

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 85/196 (43%), Positives = 110/196 (56%), Gaps = 39/196 (19%)

Query: 114 GCRPYEIAPCEHHVNG-TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
           GC+PY    C HH +  + P C++    TP+C   CQ++Y  PYKKD  +G  SY+V+S 
Sbjct: 112 GCQPYPFPECNHHSSSKSYPPCESYYFPTPECHETCQDDYGKPYKKDKFYGKSSYNVASE 171

Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
           E SIMKEI  +GPVEG F V++D + YKSG +                            
Sbjct: 172 EISIMKEILLNGPVEGGFYVYEDFLNYKSGVY---------------------------- 203

Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
                     + +G  LGGHAIRI+GWG  +++   YWL ANSWN  WGD G FKILRG 
Sbjct: 204 ---------KHITGSYLGGHAIRIIGWG-IQQNHIPYWLCANSWNNQWGDQGYFKILRGT 253

Query: 293 DECGIESSITAGVPKL 308
           +ECGIES +TAG+P L
Sbjct: 254 NECGIESMVTAGLPNL 269



 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 19/27 (70%), Positives = 21/27 (77%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           CGFGC GG PGMAW YW   GIV+GG+
Sbjct: 80  CGFGCRGGIPGMAWDYWKYEGIVTGGS 106


>gi|56756436|gb|AAW26391.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 101/312 (32%), Positives = 144/312 (46%), Gaps = 107/312 (34%)

Query: 63  DYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYE--- 119
           D  +   R P  + + +++ ++P+ FDSR KWP+C +I +IRDQ  CGSCW     E   
Sbjct: 70  DAEMKRKRRP-TVDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAFGAVEAMT 128

Query: 120 --------------------IAPCEHHVNGTR---------------------------- 131
                               I+ CE   +G +                            
Sbjct: 129 DRICIQSGGQQSAELSALDLISCCEDCGDGCKGGFPGQAWDYWVKRGIVTGGSEENHTGC 188

Query: 132 -----PSCD-ASKGHTPKC----------VRECQENYDVPYKKDLNFGAKSYSVSSNEKS 175
                P C+  +KG  P C           + CQ+ Y  PY++D ++G + Y+V SNEK+
Sbjct: 189 QPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDQRYNVISNEKA 248

Query: 176 IMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFT 235
           I +EI  +GPVE AF V++D + YKSG                                 
Sbjct: 249 IQREIMMYGPVEAAFDVYEDFLNYKSG--------------------------------- 275

Query: 236 VFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDEC 295
               +  + +G  +GGHAIRI+GWG  EK K  YWLIANSWN DWG+ GLF+++RG+DEC
Sbjct: 276 ----IYRHVTGSIVGGHAIRIIGWGV-EKGK-PYWLIANSWNEDWGEKGLFRMVRGRDEC 329

Query: 296 GIESSITAGVPK 307
            IES + AG+ K
Sbjct: 330 SIESHVVAGLIK 341



 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 20/27 (74%), Positives = 22/27 (81%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           CG GC GGFPG AW YWVK GIV+GG+
Sbjct: 155 CGDGCKGGFPGQAWDYWVKRGIVTGGS 181


>gi|49036806|gb|AAT48984.1| cathepsin B-like proteinase [Triatoma sordida]
          Length = 331

 Score =  155 bits (391), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 89/213 (41%), Positives = 115/213 (53%), Gaps = 55/213 (25%)

Query: 103 IRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
           I   G+ GS  GC+PY IAPCEHHV G+RP+C +  G TP C  +C E   + Y +D  +
Sbjct: 167 IVSGGNYGSKQGCQPYSIAPCEHHVPGSRPAC-SGGGDTPDCRNQCDEGSGISYDQDHYY 225

Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
           G   Y++    K I  EI ++GPVE A                                 
Sbjct: 226 GETVYTLDE-AKQIQAEILKNGPVEAA--------------------------------- 251

Query: 223 DNTSQLGAEGAFTVFDDLILYK-------SGKALGGHAIRILGWGEDEKSKEKYWLIANS 275
                      FTV++DL+ YK       +G+ALGGHAI+ILGWG +  +   YWL+ANS
Sbjct: 252 -----------FTVYEDLLNYKEGVYQHVAGEALGGHAIKILGWGVENDTP--YWLVANS 298

Query: 276 WNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           WNTDWG+NG FKILRG DECGIE  I AG+P++
Sbjct: 299 WNTDWGNNGFFKILRGSDECGIEDQIVAGLPRV 331



 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 23/32 (71%), Positives = 25/32 (78%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CG+GC+GGFP  AW YW   GIVSGG YGSKQ
Sbjct: 146 CGYGCDGGFPASAWDYWQNEGIVSGGNYGSKQ 177


>gi|55793943|gb|AAV65882.1| cathepsin B1 isotype 2 precursor [Trichobilharzia regenti]
          Length = 342

 Score =  155 bits (391), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 85/200 (42%), Positives = 104/200 (52%), Gaps = 53/200 (26%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY    CEHH  G  P C      TPKC ++CQ+ Y  PY KD  +G  SY+V +NE
Sbjct: 187 GCQPYPFPKCEHHTTGKYPECGEKIYKTPKCHQKCQKGYKTPYGKDKYYGRMSYNVLNNE 246

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            +I KEI  HGPVE A                                            
Sbjct: 247 NAIKKEIMMHGPVEAA-------------------------------------------- 262

Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
           FTV  D + YKSG         +GGHA+RI+GWG ++K+   YWLIANSWN DWG+ G F
Sbjct: 263 FTVHSDFLNYKSGIYKYMTGAEIGGHAVRIIGWGVEKKTP--YWLIANSWNEDWGEKGYF 320

Query: 287 KILRGKDECGIESSITAGVP 306
           +ILRGKDECGIES +T G+P
Sbjct: 321 RILRGKDECGIESEVTGGLP 340



 Score = 47.8 bits (112), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 18/27 (66%), Positives = 21/27 (77%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           CG GC GGFPG AW YWV+ GIV+G +
Sbjct: 155 CGLGCQGGFPGAAWDYWVEDGIVTGSS 181


>gi|194246059|gb|ACF35521.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
           variabilis]
          Length = 217

 Score =  154 bits (390), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 85/194 (43%), Positives = 105/194 (54%), Gaps = 40/194 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY   PCEHH  G  P+C   K  TP+C + C+E Y+  Y +D +FG K YS+SS+E
Sbjct: 60  GCQPYYFPPCEHHTVGPLPNCTGIK-PTPECAKTCREGYEKSYTRDKHFGKKVYSISSDE 118

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
             I  EI ++GPVE  F V+ D   YKSG +                             
Sbjct: 119 TQIKTEICKNGPVEADFNVYADFPSYKSGVY----------------------------- 149

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                      S + LGGHAIRILGWG ++     YWL+ANSWN DWGD G FKI RG D
Sbjct: 150 --------QRHSKEMLGGHAIRILGWGTEDGV--PYWLVANSWNEDWGDKGYFKIRRGND 199

Query: 294 ECGIESSITAGVPK 307
           ECGIE+ I AG+PK
Sbjct: 200 ECGIENDINAGIPK 213



 Score = 44.7 bits (104), Expect = 0.061,   Method: Compositional matrix adjust.
 Identities = 17/33 (51%), Positives = 24/33 (72%)

Query: 9  CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
          CG GCNGG+P  AW+++   GIV+GG YG++  
Sbjct: 28 CGSGCNGGYPSAAWQFYKDEGIVTGGLYGTEDG 60


>gi|308488550|ref|XP_003106469.1| hypothetical protein CRE_16049 [Caenorhabditis remanei]
 gi|308253819|gb|EFO97771.1| hypothetical protein CRE_16049 [Caenorhabditis remanei]
          Length = 205

 Score =  154 bits (389), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 85/206 (41%), Positives = 111/206 (53%), Gaps = 42/206 (20%)

Query: 107 GSCGSCWGCRPYEIAPCEHHVNG-TRPSCDASKGHTPKCVRECQENYDVP--YKKDLNFG 163
           GS  S +GC+PY IAPC   VNG T P C      TPKCV  C  N   P  Y +D +FG
Sbjct: 35  GSYESQFGCKPYSIAPCGQTVNGVTWPKCPEDTEPTPKCVEACTSNNTYPTGYLQDKHFG 94

Query: 164 AKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRD 223
           A +Y+V    + I  EI  HGP+E AFTV++D   Y +G +                   
Sbjct: 95  ATAYAVGKKVEQIQTEILAHGPIEVAFTVYEDFYQYTTGVY------------------- 135

Query: 224 NTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDN 283
                             ++ +GK+LGGHA++ILGWG D  +   YWL+ANSWN +WG+ 
Sbjct: 136 ------------------VHTAGKSLGGHAVKILGWGVDNGTP--YWLVANSWNVNWGEK 175

Query: 284 GLFKILRGKDECGIESSITAGVPKLD 309
           G F+I+RG +ECGIE S  AG+P LD
Sbjct: 176 GYFRIIRGLNECGIEHSAVAGLPDLD 201


>gi|341904369|gb|EGT60202.1| hypothetical protein CAEBREN_08101 [Caenorhabditis brenneri]
          Length = 330

 Score =  154 bits (389), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 98/288 (34%), Positives = 130/288 (45%), Gaps = 102/288 (35%)

Query: 82  EDLPANFDSRTKWPNCPTIREIRDQGSCGSCW-----------------GCRPYEIAP-- 122
           + +PA+FDSRT W  C +I+ IR+Q +CGSCW                 G +   I+P  
Sbjct: 84  DTIPASFDSRTHWSECKSIKLIRNQATCGSCWAFGAAEVISDRTCIETKGAQQPIISPDD 143

Query: 123 ------------CE---------------------HHVNGTRP---------SCDASKGH 140
                       CE                     +H  G +P         SC  SK  
Sbjct: 144 LLSCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGAGCKPYPIAPCTSGSCPESK-- 201

Query: 141 TPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYK 200
           TP C   CQ  Y   Y KD +FG  +Y+V+    SI  EI  +GPVE AFTV++D   YK
Sbjct: 202 TPACSLSCQSGYTTAYAKDKHFGTSAYAVAKKVASIQTEIMTNGPVEAAFTVYEDFYKYK 261

Query: 201 SGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWG 260
           SG +                                      + +GKALGGHAI+I+GWG
Sbjct: 262 SGVY-------------------------------------KHTAGKALGGHAIKIIGWG 284

Query: 261 EDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
            +  S   YWL+ANSW T WG++G FKI RG D+CGIES++ AG  ++
Sbjct: 285 TE--SGSPYWLVANSWGTSWGESGFFKIFRGDDQCGIESAVVAGKARV 330


>gi|308504375|ref|XP_003114371.1| CRE-CPR-1 protein [Caenorhabditis remanei]
 gi|308261756|gb|EFP05709.1| CRE-CPR-1 protein [Caenorhabditis remanei]
          Length = 366

 Score =  153 bits (387), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 96/296 (32%), Positives = 131/296 (44%), Gaps = 101/296 (34%)

Query: 75  IGYSEVD---EDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG----------------- 114
           I  +EVD   + +PA+FDSRT W  C +I+ IRDQ +CGSCW                  
Sbjct: 110 IRATEVDTVLDTIPASFDSRTHWSECKSIKLIRDQATCGSCWAFGAAEVISDRTCIETKG 169

Query: 115 -----CRPYEIAPC------------------------------EHHVNGTRP------- 132
                  P ++  C                              ++H  G +P       
Sbjct: 170 AQQPIISPDDLLSCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGAGCKPYPIAPCT 229

Query: 133 SCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTV 192
           S +  +  TP C   CQ  Y   Y KD +FG  +Y+V+    SI  EI  +GPVE AFTV
Sbjct: 230 SGNCPESKTPSCSLSCQSGYTTAYAKDKHFGTSAYAVARKVASIQTEIMTNGPVEAAFTV 289

Query: 193 FDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGH 252
           ++D   YKSG +                                      + +GKALGGH
Sbjct: 290 YEDFYKYKSGVY-------------------------------------KHTAGKALGGH 312

Query: 253 AIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           AI+I+GWG +  S   YWL+ANSW   WG++G F+I RG D+CGIES++ AG  K+
Sbjct: 313 AIKIIGWGTE--SGSPYWLVANSWGNSWGESGFFRIFRGDDQCGIESAVVAGKAKV 366



 Score = 37.4 bits (85), Expect = 9.0,   Method: Compositional matrix adjust.
 Identities = 15/28 (53%), Positives = 19/28 (67%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
           CG GC GG+P  A R+W   G+V+GG Y
Sbjct: 188 CGNGCEGGYPIQALRWWDSKGVVTGGDY 215


>gi|167538317|ref|XP_001750823.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163770644|gb|EDQ84327.1| predicted protein [Monosiga brevicollis MX1]
          Length = 341

 Score =  153 bits (387), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 87/213 (40%), Positives = 112/213 (52%), Gaps = 54/213 (25%)

Query: 103 IRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
           I   G+  S  GC+PY +  C+HHV+G  P+C + +G TP C + C+  Y+  Y  D +F
Sbjct: 176 IVTGGNYNSSQGCQPYSLPNCDHHVSGQYPAC-SGEGPTPACKKSCEAGYNNTYSNDKHF 234

Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
           GA +YSV+     I  EI  +GPVE                                   
Sbjct: 235 GATAYSVAGEADKIATEIMTNGPVE----------------------------------- 259

Query: 223 DNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANS 275
                    GAFTV++DL+ YKSG       + LGGHAI+I+GWG +  S   YW +ANS
Sbjct: 260 ---------GAFTVYEDLLTYKSGVYQHTTGQVLGGHAIKIIGWGVE--SGVDYWWVANS 308

Query: 276 WNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           WN DWGDNG FKI +G DECGIES I AG+PKL
Sbjct: 309 WNNDWGDNGFFKIKKGVDECGIESQIVAGMPKL 341



 Score = 42.0 bits (97), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 18/40 (45%), Positives = 26/40 (65%)

Query: 1   MYTQQIRLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           + T  +  CG GC+GG+P  AW ++  +GIV+GG Y S Q
Sbjct: 147 LMTCCLFTCGSGCSGGYPSAAWSWFKTTGIVTGGNYNSSQ 186


>gi|268555788|ref|XP_002635883.1| C. briggsae CBR-CPR-5 protein [Caenorhabditis briggsae]
          Length = 345

 Score =  153 bits (387), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 85/205 (41%), Positives = 111/205 (54%), Gaps = 42/205 (20%)

Query: 107 GSCGSCWGCRPYEIAPCEHHVNG-TRPSCDASKGHTPKCVRECQEN--YDVPYKKDLNFG 163
           GS  S +GC+PY IAPC   VNG T P C      TPKCV  C  N  Y  PY +D +FG
Sbjct: 176 GSYESQFGCKPYSIAPCGQTVNGVTWPKCPDDTEPTPKCVEACTSNNTYPTPYLQDKHFG 235

Query: 164 AKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRD 223
           A +Y+V    + I  EI ++GPVE AFTV++D   Y +G +                   
Sbjct: 236 ATAYAVGKKVEQIQTEILKNGPVEVAFTVYEDFYQYTTGVY------------------- 276

Query: 224 NTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDN 283
                             ++ SG +LGGHA++ILGWG D  +   YWL+ANSWN +WG+ 
Sbjct: 277 ------------------VHTSGASLGGHAVKILGWGVDNGTP--YWLVANSWNVNWGEK 316

Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
           G F+I+RG +ECGIE S  AG+P L
Sbjct: 317 GYFRIIRGLNECGIEHSAVAGIPDL 341


>gi|341878049|gb|EGT33984.1| CBN-CPR-1 protein [Caenorhabditis brenneri]
          Length = 330

 Score =  153 bits (387), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 98/288 (34%), Positives = 130/288 (45%), Gaps = 102/288 (35%)

Query: 82  EDLPANFDSRTKWPNCPTIREIRDQGSCGSCW-----------------GCRPYEIAP-- 122
           + +PA+FDSRT W  C +I+ IR+Q +CGSCW                 G +   I+P  
Sbjct: 84  DTIPASFDSRTHWSECKSIKLIRNQATCGSCWAFGAAEVISDRTCIETKGAQQPIISPDD 143

Query: 123 ------------CE---------------------HHVNGTRP---------SCDASKGH 140
                       CE                     +H  G +P         SC  SK  
Sbjct: 144 LLSCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGAGCKPYPIAPCTSGSCPESK-- 201

Query: 141 TPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYK 200
           TP C   CQ  Y   Y KD +FG  +Y+V+    SI  EI  +GPVE AFTV++D   YK
Sbjct: 202 TPACSLSCQPGYTTAYAKDKHFGTSAYAVAKKVASIQTEIMTNGPVEAAFTVYEDFYKYK 261

Query: 201 SGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWG 260
           SG +                                      + +GKALGGHAI+I+GWG
Sbjct: 262 SGVY-------------------------------------KHTAGKALGGHAIKIIGWG 284

Query: 261 EDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
            +  S   YWL+ANSW T WG++G FKI RG D+CGIES++ AG  ++
Sbjct: 285 TE--SGSPYWLVANSWGTSWGESGFFKIFRGDDQCGIESAVVAGKARV 330


>gi|268558600|ref|XP_002637291.1| C. briggsae CBR-CPR-4 protein [Caenorhabditis briggsae]
          Length = 335

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 98/301 (32%), Positives = 131/301 (43%), Gaps = 112/301 (37%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPA 68
           CG+GC GG+P  AW+Y VKSG  +GG+Y S+                   G  P    P 
Sbjct: 146 CGYGCEGGYPINAWKYLVKSGFCTGGSYVSQ------------------FGCKPYSLAPC 187

Query: 69  NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVN 128
               E +G               T WP+CP                              
Sbjct: 188 G---ETVG--------------NTTWPDCP------------------------------ 200

Query: 129 GTRPSCDASKGHTPKCVREC-QENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVE 187
                      +TP CV +C   NY++ YK D +FG+ +Y+V      I  EI  HGPVE
Sbjct: 201 -------QDGYNTPSCVNKCTNNNYNIAYKDDKHFGSTAYAVGKKVAQIQAEILAHGPVE 253

Query: 188 GAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGK 247
            AFTV++D   YKSG +                                     ++ +G+
Sbjct: 254 AAFTVYEDFYQYKSGVY-------------------------------------VHTTGQ 276

Query: 248 ALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPK 307
            LGGHAIRILGWG D  +   YWL+ANSWN +WG+NG F+I+RG +ECGIE ++  GVPK
Sbjct: 277 ELGGHAIRILGWGTDNGT--PYWLVANSWNVNWGENGYFRIIRGTNECGIEHAVVGGVPK 334

Query: 308 L 308
           +
Sbjct: 335 V 335


>gi|341891084|gb|EGT47019.1| CBN-CPR-4 protein [Caenorhabditis brenneri]
          Length = 335

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 99/301 (32%), Positives = 130/301 (43%), Gaps = 112/301 (37%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPA 68
           CG+GC GG+P  AW+Y VKSG  +GG+Y ++                   G  P    P 
Sbjct: 146 CGYGCEGGYPINAWKYLVKSGFCTGGSYEAQ------------------FGCKPYSLAPC 187

Query: 69  NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVN 128
               E +G               T WP CPT                             
Sbjct: 188 G---ETVG--------------NTTWPACPT----------------------------- 201

Query: 129 GTRPSCDASKGHTPKCVREC-QENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVE 187
                       TP CV +C   NY+V YK D +FG+ +Y+V      I  EI  HGPVE
Sbjct: 202 --------DGYDTPACVNKCTNSNYNVAYKDDKHFGSTAYAVGKKVAQIQAEIIAHGPVE 253

Query: 188 GAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGK 247
            AFTV++D   YKSG +                                     ++ +G+
Sbjct: 254 AAFTVYEDFYQYKSGVY-------------------------------------VHTTGE 276

Query: 248 ALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPK 307
            LGGHAIRILGWG D  +   YWL+ANSWN +WG+NG F+I+RG +ECGIE ++  GVPK
Sbjct: 277 ELGGHAIRILGWGTDNGT--PYWLVANSWNVNWGENGYFRIIRGTNECGIEHAVVGGVPK 334

Query: 308 L 308
           +
Sbjct: 335 V 335


>gi|326515156|dbj|BAK03491.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 471

 Score =  152 bits (385), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 79/195 (40%), Positives = 106/195 (54%), Gaps = 42/195 (21%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY++  C+HHV G    C   +  TP C   CQ N    +  D +FGA SYSV +++
Sbjct: 314 GCYPYQLQACDHHVTGKYQPCGDIQ-PTPACANSCQNN--ATWSSDKHFGASSYSVGTDQ 370

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           +SIM EIY +GPVE ++ V+ D + YKSG +                             
Sbjct: 371 QSIMTEIYTNGPVEASYDVYADFVSYKSGVY----------------------------- 401

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G  LGGHA++I+GWG D  +   YW++ANSWN DWG+NG F ILRG D
Sbjct: 402 --------QHVTGDYLGGHAVKIIGWGVDGST--PYWIVANSWNNDWGNNGFFNILRGSD 451

Query: 294 ECGIESSITAGVPKL 308
           ECGIE  I AG+PK+
Sbjct: 452 ECGIEDGIVAGIPKV 466



 Score = 42.0 bits (97), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 16/32 (50%), Positives = 22/32 (68%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CG GC GG+P  AW Y+  +G+V+GG + S Q
Sbjct: 282 CGMGCEGGYPSAAWDYFQSTGLVTGGDWNSNQ 313


>gi|341888137|gb|EGT44072.1| hypothetical protein CAEBREN_10156 [Caenorhabditis brenneri]
          Length = 344

 Score =  152 bits (383), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 83/206 (40%), Positives = 112/206 (54%), Gaps = 42/206 (20%)

Query: 107 GSCGSCWGCRPYEIAPCEHHVNG-TRPSCDASKGHTPKCVRECQENYDVP--YKKDLNFG 163
           GS  S +GC+PY IAPC   VNG T P C      TPKCV  C  N+  P  Y +D +FG
Sbjct: 175 GSYESQFGCKPYSIAPCGQTVNGVTWPKCPEDTEPTPKCVDACTSNHTYPTAYLQDKHFG 234

Query: 164 AKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRD 223
           A +Y+V    + I  EI ++GP+E AFTV++D   Y +G +                   
Sbjct: 235 ATAYAVGKKVEQIQTEILKNGPIEVAFTVYEDFYQYTTGVY------------------- 275

Query: 224 NTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDN 283
                             ++ +G +LGGHA++ILGWG D  +   YWL+ANSWN +WG+ 
Sbjct: 276 ------------------VHTAGASLGGHAVKILGWGVDNGTP--YWLVANSWNINWGEK 315

Query: 284 GLFKILRGKDECGIESSITAGVPKLD 309
           G F+I+RG +ECGIE S  AG+P LD
Sbjct: 316 GYFRIIRGLNECGIEHSAVAGIPDLD 341


>gi|341900876|gb|EGT56811.1| hypothetical protein CAEBREN_29569 [Caenorhabditis brenneri]
          Length = 344

 Score =  152 bits (383), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 83/206 (40%), Positives = 112/206 (54%), Gaps = 42/206 (20%)

Query: 107 GSCGSCWGCRPYEIAPCEHHVNG-TRPSCDASKGHTPKCVRECQENYDVP--YKKDLNFG 163
           GS  S +GC+PY IAPC   VNG T P C      TPKCV  C  N+  P  Y +D +FG
Sbjct: 175 GSYESQFGCKPYSIAPCGQTVNGVTWPKCPEDTEPTPKCVDACTSNHTYPTAYLQDKHFG 234

Query: 164 AKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRD 223
           A +Y+V    + I  EI ++GP+E AFTV++D   Y +G +                   
Sbjct: 235 ATAYAVGKKVEQIQTEILKNGPIEVAFTVYEDFYQYTTGVY------------------- 275

Query: 224 NTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDN 283
                             ++ +G +LGGHA++ILGWG D  +   YWL+ANSWN +WG+ 
Sbjct: 276 ------------------VHTAGASLGGHAVKILGWGVDNGTP--YWLVANSWNINWGEK 315

Query: 284 GLFKILRGKDECGIESSITAGVPKLD 309
           G F+I+RG +ECGIE S  AG+P LD
Sbjct: 316 GYFRIIRGLNECGIEHSAVAGIPDLD 341


>gi|241154720|ref|XP_002407359.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
 gi|215494103|gb|EEC03744.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
          Length = 337

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 85/209 (40%), Positives = 111/209 (53%), Gaps = 40/209 (19%)

Query: 101 REIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDL 160
           R I   G  G+  GC+PY +APCE+H     P+C     HTP+CV  C++ YD  Y++D 
Sbjct: 169 RGIVSGGLYGTPDGCKPYSLAPCEYHTKCRIPNC-IPIVHTPECVHHCRKGYDKDYQEDK 227

Query: 161 NFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWT 220
           +FG K YS+S +EK I  EI+ +GPVE  F V+ D + YKSG +    N+   M      
Sbjct: 228 HFGQKVYSISRDEKQIQTEIFTNGPVEADFHVYGDFLCYKSGVYQRHSNDGRGM------ 281

Query: 221 IRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDW 280
                                          HAIRILGWG +  +   YWL ANSWN +W
Sbjct: 282 -------------------------------HAIRILGWGTENGT--PYWLAANSWNENW 308

Query: 281 GDNGLFKILRGKDECGIESSITAGVPKLD 309
           GD G FKILR  +ECGIE  I AG+PK++
Sbjct: 309 GDKGYFKILRRTNECGIEEHIYAGIPKIE 337



 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 30/75 (40%), Positives = 45/75 (60%), Gaps = 3/75 (4%)

Query: 40  QAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPT 99
           +A  N    I  ++++  +GVHP       RL E + + E+ +DLP +FD+R KW +C +
Sbjct: 44  KAGSNFDKCISMSYIRGLLGVHPKSE--EYRLAEFV-HEEIPDDLPESFDARAKWSHCDS 100

Query: 100 IREIRDQGSCGSCWG 114
           I  IRDQ +CGSCW 
Sbjct: 101 IHLIRDQSTCGSCWA 115


>gi|118153|sp|P25792.1|CYSP_SCHMA RecName: Full=Cathepsin B-like cysteine proteinase; AltName:
           Full=Antigen Sm31; Flags: Precursor
 gi|160950|gb|AAA29865.1| cathepsin B [Schistosoma mansoni]
          Length = 340

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 81/198 (40%), Positives = 106/198 (53%), Gaps = 53/198 (26%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY    CEHH  G  P C +   +TP+C + CQ  Y  PY +D + G  SY+V ++E
Sbjct: 186 GCEPYPFPKCEHHTKGKYPPCGSKIYNTPRCKQTCQRKYKTPYTQDKHRGKSSYNVKNDE 245

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K+I KEI ++GPVE +                                            
Sbjct: 246 KAIQKEIMKYGPVEAS-------------------------------------------- 261

Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
           FTV++D + YKSG       +ALGGHAIRI+GWG + K+   YWLIANSWN DWG+NG F
Sbjct: 262 FTVYEDFLNYKSGIYKHITGEALGGHAIRIIGWGVENKTP--YWLIANSWNEDWGENGYF 319

Query: 287 KILRGKDECGIESSITAG 304
           +I+RG+DEC IES + AG
Sbjct: 320 RIVRGRDECSIESEVIAG 337



 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 23/49 (46%), Positives = 33/49 (67%), Gaps = 1/49 (2%)

Query: 65  NLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
           +L   R P  + +++ + ++P+NFDSR KWP C +I  IRDQ  CGSCW
Sbjct: 71  DLRRKRRP-TVDHNDWNVEIPSNFDSRKKWPGCKSIATIRDQSRCGSCW 118



 Score = 40.4 bits (93), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 16/27 (59%), Positives = 18/27 (66%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           CG GC GG  G AW YWVK GIV+  +
Sbjct: 154 CGLGCEGGILGPAWDYWVKEGIVTASS 180


>gi|341888136|gb|EGT44071.1| hypothetical protein CAEBREN_13576 [Caenorhabditis brenneri]
          Length = 337

 Score =  151 bits (381), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 82/205 (40%), Positives = 113/205 (55%), Gaps = 42/205 (20%)

Query: 107 GSCGSCWGCRPYEIAPCEHHVNG-TRPSCDASKGHTPKCVREC--QENYDVPYKKDLNFG 163
           GS  S +GC+PY IAPC   VNG T P C A +  TP+CV++C  + +Y VPY +D ++G
Sbjct: 168 GSYESQYGCKPYSIAPCGQTVNGVTWPKCAADEVATPECVKQCTSKSDYAVPYDQDKHYG 227

Query: 164 AKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRD 223
           + +Y++  N   I  EI  +GPVE  F V+ D   YKSG +                   
Sbjct: 228 SSAYAIRQNVAQIQTEIMRNGPVEVGFLVYSDFYQYKSGIY------------------- 268

Query: 224 NTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDN 283
                              + +G+ LGGHA++ILGWG +  +   YWL ANSWN +WG+ 
Sbjct: 269 ------------------KHVAGRELGGHAVKILGWGVENGT--PYWLAANSWNVNWGEK 308

Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
           G F+I RG +ECGIESS+ AG+P L
Sbjct: 309 GYFRIRRGTNECGIESSVVAGIPDL 333



 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 19/31 (61%), Positives = 25/31 (80%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
           CG GC GG+P  AWRYWV +G+V+GG+Y S+
Sbjct: 143 CGDGCEGGYPIQAWRYWVHNGLVTGGSYESQ 173


>gi|339242313|ref|XP_003377082.1| Gut-specific cysteine proteinase [Trichinella spiralis]
 gi|316974149|gb|EFV57673.1| Gut-specific cysteine proteinase [Trichinella spiralis]
          Length = 517

 Score =  151 bits (381), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 92/275 (33%), Positives = 121/275 (44%), Gaps = 87/275 (31%)

Query: 82  EDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVN------------- 128
           + LP +FDSR KWP C  IR IRDQ +CGSCW      +    H +              
Sbjct: 278 KKLPKHFDSREKWPECEWIRFIRDQSNCGSCWAVSAASVMTDRHCIASKGQETPYISDEQ 337

Query: 129 -------------------------GTRPSCD----------ASKGHTPKCVRECQENYD 153
                                    G +  C           +    TP C  +CQ +YD
Sbjct: 338 ILACGMIPSPFNYWKKMGIATGGPYGDKSCCQPYSIAPCSKCSYTASTPSCKYDCQADYD 397

Query: 154 VPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTA 213
           +P   D  + ++ Y VSSN+  IM EIY HGPV   F V++D   Y SG +     +TT 
Sbjct: 398 IPISDDKFYASEHYHVSSNQYEIMNEIYTHGPVVAGFIVYEDFTYYISGIY----QQTTY 453

Query: 214 MSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIA 273
           +                                 A+GGHAIRI+GWGE+  +   YWLIA
Sbjct: 454 V---------------------------------AMGGHAIRIIGWGEE--NGIPYWLIA 478

Query: 274 NSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           NSWNT +G+ G F+I RG +EC IES +  G+PKL
Sbjct: 479 NSWNTTFGEKGFFRIRRGTNECRIESEVYTGIPKL 513



 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 37/91 (40%), Positives = 50/91 (54%), Gaps = 9/91 (9%)

Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEK 174
           C PY I+PC       RP   A     PKC R CQ +Y++  K+D  +G   Y V+ +E 
Sbjct: 99  CLPYSISPCTM----CRPYMLA-----PKCQRTCQASYNLSLKRDKYYGKSHYYVNQDEF 149

Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFF 205
            IM+EIY+ GPV   F V+ D + Y SG+F 
Sbjct: 150 DIMQEIYQRGPVVAGFKVYHDFLYYISGQFI 180


>gi|196009263|ref|XP_002114497.1| expressed hypothetical protein [Trichoplax adhaerens]
 gi|190583516|gb|EDV23587.1| expressed hypothetical protein [Trichoplax adhaerens]
          Length = 333

 Score =  151 bits (381), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 87/201 (43%), Positives = 108/201 (53%), Gaps = 54/201 (26%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PYEI  CEHHV G   +C   +  TPKC ++CQ  Y+  + +D +FG KSYS+++N 
Sbjct: 179 GCQPYEIPKCEHHVKGPFKAC-GKELPTPKCSQKCQPGYNKTFNQDKHFGKKSYSITNNI 237

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           + I KEI  +GPVE                                             A
Sbjct: 238 QQIQKEIMMNGPVEA--------------------------------------------A 253

Query: 234 FTVFDDLILYKSGK-------ALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
           FTV+ D   YKSG         LGGHA++ILGWG +  +   YWLIANSWN  WGD G F
Sbjct: 254 FTVYADFPSYKSGVYQHTTGGPLGGHAVKILGWGTENNTP--YWLIANSWNPTWGDKGYF 311

Query: 287 KILRGKDECGIESSITAGVPK 307
           KI+RGKDECGIESSI AG+PK
Sbjct: 312 KIIRGKDECGIESSIVAGMPK 332



 Score = 48.1 bits (113), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 20/32 (62%), Positives = 23/32 (71%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CG GCNGGF   AW YWV +GIV+GG Y S +
Sbjct: 147 CGMGCNGGFLPQAWHYWVNNGIVTGGQYHSHK 178


>gi|407080581|gb|AFS89610.1| procathepsin B precursor [Phenacoccus solenopsis]
          Length = 309

 Score =  151 bits (381), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 83/205 (40%), Positives = 111/205 (54%), Gaps = 39/205 (19%)

Query: 103 IRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
           I   GS GS  GC+PY + PCEHH  G R +C    G TP C R CQ +Y + Y+ DL+F
Sbjct: 137 IVSGGSYGSKEGCQPYHLPPCEHHRAGPRRNC-TKYGPTPSCARVCQPDYKISYEDDLHF 195

Query: 163 GAKSYSVS-SNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTI 221
           G + Y+++  NEK I  EI+ +GPVE     ++D   Y+SG +                 
Sbjct: 196 GKQWYALAPHNEKIIRTEIFHNGPVEATMAAYEDFYTYESGIYH---------------- 239

Query: 222 RDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWG 281
                    EG F                 HA++I+GWG D+K+   YWL+ANS+NTDWG
Sbjct: 240 -------HIEGTFVC--------------DHAVKIIGWGTDKKTNTPYWLVANSFNTDWG 278

Query: 282 DNGLFKILRGKDECGIESSITAGVP 306
           + G FKI RG +ECGIE+ ITAG+P
Sbjct: 279 EYGFFKIKRGVNECGIENKITAGIP 303



 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 29/48 (60%), Positives = 34/48 (70%), Gaps = 3/48 (6%)

Query: 66  LPANRLPEL---IGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCG 110
           L  N  P+L      S++ E+LP  FDSR +WPNCPTIREIRDQGSCG
Sbjct: 30  LKPNVTPDLEPPFVVSKISENLPDEFDSRVRWPNCPTIREIRDQGSCG 77



 Score = 41.2 bits (95), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 19/32 (59%), Positives = 23/32 (71%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           C  GC G    +AW +WVK GIVSGG+YGSK+
Sbjct: 116 CEKGCLGCDHHLAWDHWVKHGIVSGGSYGSKE 147


>gi|56753443|gb|AAW24925.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  151 bits (381), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 82/199 (41%), Positives = 108/199 (54%), Gaps = 53/199 (26%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY    CEHH  G  P+C      TP+C ++CQ+ Y  PY++D N+G + Y+V SNE
Sbjct: 187 GCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQKCQKGYKTPYEQDKNYGDQRYNVISNE 246

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K+I +EI  +GPVE A                                            
Sbjct: 247 KAIQREIMMYGPVEAA-------------------------------------------- 262

Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
           F V++D + YKSG         +GGHAIRI+GWG  EK K  YWLIANSWN DWG+NGLF
Sbjct: 263 FDVYEDFLNYKSGIYRHVAGSIVGGHAIRIIGWGV-EKGK-PYWLIANSWNEDWGENGLF 320

Query: 287 KILRGKDECGIESSITAGV 305
           +++RG+DEC IES + AG+
Sbjct: 321 RMVRGRDECSIESHVVAGL 339



 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 20/27 (74%), Positives = 23/27 (85%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           CG GC GGFPG+AW YWVK GIV+GG+
Sbjct: 155 CGDGCQGGFPGVAWDYWVKRGIVTGGS 181


>gi|256090368|ref|XP_002581167.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
 gi|22531387|emb|CAD44624.1| cathepsin B1 isotype 1 [Schistosoma mansoni]
 gi|353228442|emb|CCD74613.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 340

 Score =  151 bits (381), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 81/198 (40%), Positives = 105/198 (53%), Gaps = 53/198 (26%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY    CEHH  G  P C +    TP+C + CQ+ Y  PY +D + G  SY+V ++E
Sbjct: 186 GCEPYPFPKCEHHTKGKYPPCGSKIYKTPRCKQTCQKKYKTPYTQDKHRGKSSYNVKNDE 245

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K+I KEI ++GPVE                                              
Sbjct: 246 KAIQKEIMKYGPVEAG-------------------------------------------- 261

Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
           FTV++D + YKSG       + LGGHAIRI+GWG + K+   YWLIANSWN DWG+NG F
Sbjct: 262 FTVYEDFLNYKSGIYKHITGETLGGHAIRIIGWGVENKTP--YWLIANSWNEDWGENGYF 319

Query: 287 KILRGKDECGIESSITAG 304
           +I+RG+DEC IES +TAG
Sbjct: 320 RIVRGRDECSIESEVTAG 337



 Score = 42.0 bits (97), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 17/27 (62%), Positives = 19/27 (70%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           CG GC GG  G AW YWVK GIV+G +
Sbjct: 154 CGLGCEGGILGPAWDYWVKEGIVTGSS 180


>gi|390357905|ref|XP_003729132.1| PREDICTED: cathepsin B-like [Strongylocentrotus purpuratus]
          Length = 354

 Score =  150 bits (380), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 101/297 (34%), Positives = 137/297 (46%), Gaps = 66/297 (22%)

Query: 18  PGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGY 77
           PG AW Y+  +GIV+GG + S Q  +         H+    G            P     
Sbjct: 117 PGSAWEYYKDTGIVTGGQWNSSQGCQPYQIKSCDHHVNGTKGP------CQGEGPTPECK 170

Query: 78  SEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDAS 137
            + +   P +     K         I   G   S  GC+PY+I  C+HHVNGT+  C   
Sbjct: 171 HKCNGGFPGSAWEYYK------DTGIVTGGQWNSSQGCQPYQIKSCDHHVNGTKGPCQG- 223

Query: 138 KGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLI 197
           +G TP+C  +C+ +Y  PY++D ++     S+S+N ++   EI  +GPVE          
Sbjct: 224 EGPTPECKHKCEASYSTPYEQDKHYALSVNSISNNPEATQTEIMTNGPVEAD-------- 275

Query: 198 LYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALG 250
                                               FTV++D   YKSG         LG
Sbjct: 276 ------------------------------------FTVYEDFPTYKSGVYQHTTGGVLG 299

Query: 251 GHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPK 307
           GHAI+ILGWG +E +K  YWL+ANSWN +WGDNG FKILRG +ECGIES I  G+PK
Sbjct: 300 GHAIKILGWGVEEGTK--YWLVANSWNNEWGDNGFFKILRGSNECGIESDINFGIPK 354



 Score = 43.1 bits (100), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 18/33 (54%), Positives = 22/33 (66%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
           C   CNGGFPG AW Y+  +GIV+GG + S Q 
Sbjct: 169 CKHKCNGGFPGSAWEYYKDTGIVTGGQWNSSQG 201


>gi|55793951|gb|AAV65886.1| cathepsin B1 isotype 6 precursor [Trichobilharzia regenti]
          Length = 342

 Score =  150 bits (379), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 78/194 (40%), Positives = 104/194 (53%), Gaps = 39/194 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY    CEH+  G  P+C      TPKC ++CQ+ Y  PYKKD ++G  +Y+V +NE
Sbjct: 187 GCQPYPFPKCEHNTTGKYPACGQKIYETPKCQKKCQKGYKTPYKKDKHYGKVAYNVPNNE 246

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            SI KEI  HGPV   FTV+ D + YKSG                               
Sbjct: 247 DSIKKEIMMHGPVGSFFTVYSDFLNYKSG------------------------------- 275

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                 +  +  G  +G H +RI+GWG ++ +   YWLIANSWN  WG+ G F+ILRGKD
Sbjct: 276 ------IYKHMKGTEIGVHTVRIVGWGVEKGT--PYWLIANSWNEGWGEKGYFRILRGKD 327

Query: 294 ECGIESSITAGVPK 307
           EC IES +  G+P+
Sbjct: 328 ECDIESLVIGGLPR 341


>gi|343197337|pdb|3QSD|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
           Complex With Ca074 Inhibitor
 gi|343197588|pdb|3S3Q|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
           Complex With K11017 Inhibitor
 gi|343197589|pdb|3S3R|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
           Complex With K11777 Inhibitor
 gi|343197590|pdb|3S3R|B Chain B, Structure Of Cathepsin B1 From Schistosoma Mansoni In
           Complex With K11777 Inhibitor
 gi|343197591|pdb|3S3R|C Chain C, Structure Of Cathepsin B1 From Schistosoma Mansoni In
           Complex With K11777 Inhibitor
          Length = 254

 Score =  150 bits (379), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 81/198 (40%), Positives = 105/198 (53%), Gaps = 53/198 (26%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY    CEHH  G  P C +    TP+C + CQ+ Y  PY +D + G  SY+V ++E
Sbjct: 100 GCEPYPFPKCEHHTKGKYPPCGSKIYKTPRCKQTCQKKYKTPYTQDKHRGKSSYNVKNDE 159

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K+I KEI ++GPVE                                              
Sbjct: 160 KAIQKEIMKYGPVEAG-------------------------------------------- 175

Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
           FTV++D + YKSG       + LGGHAIRI+GWG + K+   YWLIANSWN DWG+NG F
Sbjct: 176 FTVYEDFLNYKSGIYKHITGETLGGHAIRIIGWGVENKAP--YWLIANSWNEDWGENGYF 233

Query: 287 KILRGKDECGIESSITAG 304
           +I+RG+DEC IES +TAG
Sbjct: 234 RIVRGRDECSIESEVTAG 251



 Score = 41.2 bits (95), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 17/27 (62%), Positives = 19/27 (70%)

Query: 9  CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
          CG GC GG  G AW YWVK GIV+G +
Sbjct: 68 CGLGCEGGILGPAWDYWVKEGIVTGSS 94


>gi|256090364|ref|XP_002581165.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
 gi|353228444|emb|CCD74615.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 303

 Score =  150 bits (378), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 94/289 (32%), Positives = 126/289 (43%), Gaps = 108/289 (37%)

Query: 65  NLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSC------------ 112
           +L   R P  + +++ + ++P++FDSR KWP C +I  IRDQ  CGSC            
Sbjct: 71  DLRRTRRP-TVDHNDWNVEIPSSFDSRKKWPRCKSIATIRDQSRCGSCCAFGAVEAMSER 129

Query: 113 ------------------------------WGCRPYEIAPCEHHVNGTRPSCDASKGHTP 142
                                          GC PY    CEH   G  P C +    TP
Sbjct: 130 SCIQSGGKQNVELSAVDLEGIVTGSSKENNTGCEPYPFPKCEHFTKGQYPPCGSKIYKTP 189

Query: 143 KCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSG 202
           +C   CQ+ Y   Y +D              ++I KEI ++GPVE +             
Sbjct: 190 RCKTTCQKRYKTSYAQD------------KHRAIQKEIMKYGPVEAS------------- 224

Query: 203 RFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIR 255
                                          FTV++D + YKSG       + LGGHAIR
Sbjct: 225 -------------------------------FTVYEDFLNYKSGIYKHITGETLGGHAIR 253

Query: 256 ILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAG 304
           I+GWG + K+   YWLIANSWN DWG+NG F+I+RG+DEC IES +TAG
Sbjct: 254 IIGWGVENKTP--YWLIANSWNEDWGENGYFRIVRGRDECSIESEVTAG 300


>gi|170787211|gb|ACB38229.1| cathepsin B [Meretrix meretrix]
          Length = 337

 Score =  150 bits (378), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 85/196 (43%), Positives = 107/196 (54%), Gaps = 41/196 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PYEI  C+HHV G    C    G TP+C +EC+  Y+  Y KD +     ++V   E
Sbjct: 183 GCLPYEIKACDHHVVGKLQPCKGD-GPTPRCKKECESGYNNTYSKDEHHAKTVHAVEGVE 241

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           + IM EI  +GPVE AFTV+ D   YKSG +                             
Sbjct: 242 Q-IMTEIMTNGPVEAAFTVYSDFPTYKSGVY----------------------------- 271

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    +KSG  LGGHAI+ LGWG ++   + YWL+ANSWN DWGDNG FKILRG+D
Sbjct: 272 --------EHKSGGPLGGHAIKTLGWGNED--GKDYWLVANSWNPDWGDNGFFKILRGRD 321

Query: 294 ECGIESSITAGVPKLD 309
           ECGIES+I AG+  L+
Sbjct: 322 ECGIESNIVAGMMVLE 337



 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 49/79 (62%), Gaps = 8/79 (10%)

Query: 40  QAEKNSLSNIPRA----HLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWP 95
           +A   +  N+P      ++KS  G +P    P  + P  +   EV +DLP  FD+RT+WP
Sbjct: 42  KATTENFKNVPYKGRMDYVKSLCGANP--APPEMKFP--VKEIEVPKDLPDTFDARTQWP 97

Query: 96  NCPTIREIRDQGSCGSCWG 114
           +CP+++E+RDQG+CGSCW 
Sbjct: 98  DCPSLKEVRDQGACGSCWA 116



 Score = 44.3 bits (103), Expect = 0.080,   Method: Compositional matrix adjust.
 Identities = 21/38 (55%), Positives = 23/38 (60%)

Query: 3   TQQIRLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           T   R CG GCNGGF   AW Y  + GIV+GG Y S Q
Sbjct: 145 TSCCRTCGNGCNGGFLEGAWNYLKRDGIVTGGPYNSHQ 182


>gi|256052329|ref|XP_002569725.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
 gi|353228436|emb|CCD74607.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 345

 Score =  150 bits (378), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 81/198 (40%), Positives = 106/198 (53%), Gaps = 53/198 (26%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY    CEHH  G  P C +    TP+C + CQ+ Y  PY +D + G  SY+V ++E
Sbjct: 191 GCEPYPFPKCEHHTKGKYPPCGSKIYKTPRCKQTCQKKYKTPYTQDKHRGKSSYNVKNDE 250

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K+I KEI ++GPVE +                                            
Sbjct: 251 KAIQKEIMKYGPVEAS-------------------------------------------- 266

Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
           FTV++D + YKSG       +ALGGHAIRI+GWG + K+   YWLIANSWN DWG+NG F
Sbjct: 267 FTVYEDFLNYKSGIYKHITGEALGGHAIRIIGWGVENKTP--YWLIANSWNEDWGENGYF 324

Query: 287 KILRGKDECGIESSITAG 304
           +I+RG+DEC IES + AG
Sbjct: 325 RIVRGRDECFIESEVIAG 342



 Score = 40.8 bits (94), Expect = 0.78,   Method: Compositional matrix adjust.
 Identities = 16/27 (59%), Positives = 19/27 (70%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           CG GC GG  G AW +WVK GIV+G +
Sbjct: 159 CGLGCEGGILGPAWDFWVKEGIVTGSS 185


>gi|17565164|ref|NP_503383.1| Protein CPR-5 [Caenorhabditis elegans]
 gi|1169086|sp|P43509.1|CPR5_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 5; AltName:
           Full=Cysteine protease-related 5; Flags: Precursor
 gi|671713|gb|AAA98786.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|675502|gb|AAA98784.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|351059399|emb|CCD74289.1| Protein CPR-5 [Caenorhabditis elegans]
          Length = 344

 Score =  150 bits (378), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 80/205 (39%), Positives = 111/205 (54%), Gaps = 42/205 (20%)

Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVREC--QENYDVPYKKDLNFG 163
           GS  + +GC+PY IAPC   VNG + P+C      TPKCV  C  + NY  PY +D +FG
Sbjct: 175 GSYETQFGCKPYSIAPCGETVNGVKWPACPEDTEPTPKCVDSCTSKNNYATPYLQDKHFG 234

Query: 164 AKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRD 223
           + +Y+V    + I  EI  +GP+E AFTV++D   Y +G +                   
Sbjct: 235 STAYAVGKKVEQIQTEILTNGPIEVAFTVYEDFYQYTTGVY------------------- 275

Query: 224 NTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDN 283
                             ++ +G +LGGHA++ILGWG D  +   YWL+ANSWN  WG+ 
Sbjct: 276 ------------------VHTAGASLGGHAVKILGWGVDNGTP--YWLVANSWNVAWGEK 315

Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
           G F+I+RG +ECGIE S  AG+P L
Sbjct: 316 GYFRIIRGLNECGIEHSAVAGIPDL 340


>gi|308500570|ref|XP_003112470.1| CRE-CPR-4 protein [Caenorhabditis remanei]
 gi|308267038|gb|EFP10991.1| CRE-CPR-4 protein [Caenorhabditis remanei]
          Length = 335

 Score =  149 bits (377), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 97/301 (32%), Positives = 131/301 (43%), Gaps = 112/301 (37%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPA 68
           CG+GC+GG+P  AW+Y VKSG  +GG+Y ++                   G  P    P 
Sbjct: 146 CGYGCDGGYPINAWKYLVKSGFCTGGSYEAQ------------------FGCKPYSLAPC 187

Query: 69  NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVN 128
               E +G                 WP+CP      D G                     
Sbjct: 188 G---ETVG--------------NVTWPDCP------DDGY-------------------- 204

Query: 129 GTRPSCDASKGHTPKCVRECQEN-YDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVE 187
                      +TP CV +C    Y+  YK D +FG+ +Y+V      I  EI  HGPVE
Sbjct: 205 -----------NTPACVNKCTNTKYNTAYKDDKHFGSTAYAVGKKVAQIQAEIIAHGPVE 253

Query: 188 GAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGK 247
            AFTV++D   YKSG +                                     ++ +G+
Sbjct: 254 AAFTVYEDFYQYKSGVY-------------------------------------VHTTGQ 276

Query: 248 ALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPK 307
            LGGHAIRILGWG D  +   YWL+ANSWN +WG+NG F+I+RG +ECGIE ++  GVPK
Sbjct: 277 ELGGHAIRILGWGTDNGT--PYWLVANSWNVNWGENGYFRIIRGTNECGIEHAVVGGVPK 334

Query: 308 L 308
           +
Sbjct: 335 V 335


>gi|146165818|ref|XP_001015807.2| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|146145394|gb|EAR95562.2| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 338

 Score =  149 bits (376), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 79/193 (40%), Positives = 110/193 (56%), Gaps = 40/193 (20%)

Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENY-DVPYKKDLNFGAKSYSVSSNE 173
           C+PY   PC+HHV G  P C   K  TPKCV++C   Y +  Y++DL+  +K Y + +N 
Sbjct: 185 CKPYVFPPCDHHVVGQYPPCGPIKP-TPKCVKQCNSQYTEKTYQQDLHHPSKVYQLPNNA 243

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           ++I +EI  HGPV+ +F V  D + YKSG +                IRD          
Sbjct: 244 EAIQREIMAHGPVQASFRVASDFLTYKSGVY----------------IRD---------- 277

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                        K  GGH+++I+GWG ++ +   YWLIANSWN DWG+NGLFK+LRGK+
Sbjct: 278 ----------PKLKYEGGHSVKIIGWGVEQGT--PYWLIANSWNEDWGENGLFKMLRGKN 325

Query: 294 ECGIESSITAGVP 306
           ECGIE+ + AG+P
Sbjct: 326 ECGIEAEVVAGLP 338



 Score = 40.8 bits (94), Expect = 0.80,   Method: Compositional matrix adjust.
 Identities = 15/29 (51%), Positives = 20/29 (68%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYG 37
           CG GC GG+P  AW+Y   +G+ +GG YG
Sbjct: 152 CGNGCQGGYPSAAWKYMKATGVSTGGLYG 180


>gi|157058767|gb|ABV03141.1| cathepsin B-348 [Sitobion avenae]
          Length = 252

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 82/180 (45%), Positives = 106/180 (58%), Gaps = 39/180 (21%)

Query: 101 REIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDL 160
           + I   G  GS  GC PYEIAPCEHHVNGTR  C    G TPKCV++C++ Y VPY++DL
Sbjct: 112 KGIVSGGPYGSNMGCIPYEIAPCEHHVNGTRGPCKEG-GKTPKCVKKCEDGYKVPYEQDL 170

Query: 161 NFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWT 220
           + G  +YS+S++   I +EIY +GPVEGAFTV++D I Y++G +                
Sbjct: 171 HRGKSAYSLSNDVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVY---------------- 214

Query: 221 IRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDW 280
                                 + +GKALGGHAIRILGWG  +  +  YWL+ANSWNTDW
Sbjct: 215 ---------------------KHVAGKALGGHAIRILGWGV-QNGEIPYWLVANSWNTDW 252



 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 24/30 (80%), Positives = 24/30 (80%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CGFGCNGGFPG AW YW   GIVSGG YGS
Sbjct: 93  CGFGCNGGFPGAAWHYWKTKGIVSGGPYGS 122


>gi|56759504|gb|AAW27892.1| unknown [Schistosoma japonicum]
          Length = 279

 Score =  149 bits (375), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 78/201 (38%), Positives = 109/201 (54%), Gaps = 53/201 (26%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY    CEHH  G  P+C      TP+C + CQ+ Y  PY++D ++G +SY+V +NE
Sbjct: 124 GCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGEESYNVQNNE 183

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K I ++I  +GPVE A                                            
Sbjct: 184 KVIQRDIMMYGPVEAA-------------------------------------------- 199

Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
           F V++D + YKSG         +GGHAIRI+GWG ++++   YWLIANSWN DWG+ GLF
Sbjct: 200 FDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVEKRTP--YWLIANSWNEDWGEKGLF 257

Query: 287 KILRGKDECGIESSITAGVPK 307
           +I+RG+DEC IES++ AG+ K
Sbjct: 258 RIVRGRDECSIESNVVAGLIK 278



 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 20/27 (74%), Positives = 23/27 (85%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           CG GC GGFPG+AW YWVK GIV+GG+
Sbjct: 92  CGQGCQGGFPGVAWDYWVKRGIVTGGS 118


>gi|1169189|sp|P43157.1|CYSP_SCHJA RecName: Full=Cathepsin B-like cysteine proteinase; AltName:
           Full=Antigen Sj31; Flags: Precursor
 gi|11167|emb|CAA50305.1| cathepsin B [Schistosoma japonicum]
          Length = 342

 Score =  147 bits (372), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 77/201 (38%), Positives = 108/201 (53%), Gaps = 53/201 (26%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY    CEHH  G  P+C      TP+C + CQ+ Y  PY++D ++G +SY+V +NE
Sbjct: 187 GCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDESYNVQNNE 246

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K I ++I  +GPVE A                                            
Sbjct: 247 KVIQRDIMMYGPVEAA-------------------------------------------- 262

Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
           F V++D + YKSG         +GGHAIRI+GWG ++++   YWLIANSWN DWG+ GLF
Sbjct: 263 FDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVEKRTP--YWLIANSWNEDWGEKGLF 320

Query: 287 KILRGKDECGIESSITAGVPK 307
           +++RG+DEC IES + AG+ K
Sbjct: 321 RMVRGRDECSIESDVVAGLIK 341



 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 20/27 (74%), Positives = 23/27 (85%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           CG GC GGFPG+AW YWVK GIV+GG+
Sbjct: 155 CGDGCQGGFPGVAWDYWVKRGIVTGGS 181


>gi|56758658|gb|AAW27469.1| unknown [Schistosoma japonicum]
          Length = 181

 Score =  147 bits (372), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 83/214 (38%), Positives = 113/214 (52%), Gaps = 53/214 (24%)

Query: 101 REIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDL 160
           R I   GS  +  GC+PY    CEH   G  P+C      TP+C ++CQ+ Y  PY++D 
Sbjct: 13  RGIVTGGSKENHTGCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQKCQKGYKTPYEQDK 72

Query: 161 NFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWT 220
           N+G + Y+V SN K+I KEI  +GPVE A                               
Sbjct: 73  NYGDQRYNVISNAKAIQKEIMMNGPVEAA------------------------------- 101

Query: 221 IRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIA 273
                        F V++D + YKSG         +GGHAIRI+GWG ++++   YWLIA
Sbjct: 102 -------------FDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVEKRTP--YWLIA 146

Query: 274 NSWNTDWGDNGLFKILRGKDECGIESSITAGVPK 307
           NSWN DWG+ GLF+I+RG+DEC IES++ AG+ K
Sbjct: 147 NSWNEDWGEKGLFRIVRGRDECSIESNVVAGLIK 180


>gi|392920988|ref|NP_506011.2| Protein F57F5.1 [Caenorhabditis elegans]
 gi|206994319|emb|CAB00098.2| Protein F57F5.1 [Caenorhabditis elegans]
          Length = 351

 Score =  147 bits (371), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 82/196 (41%), Positives = 108/196 (55%), Gaps = 40/196 (20%)

Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
           GC+PY   PCEHHVNGT    C ++   T KC R CQ  Y + Y++DL+FG  +Y+VS  
Sbjct: 195 GCKPYPYPPCEHHVNGTHYKPCPSNMYPTDKCERSCQAGYALTYQQDLHFGQSAYAVSKK 254

Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
              I KEI  HGPVE AFTV++D   Y  G +                            
Sbjct: 255 AAEIQKEIMTHGPVEVAFTVYEDFEHYSGGVY---------------------------- 286

Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
                    ++ +G +LGGHA+++LGWG D  +   YWL ANSWN DWG+NG F+I+RG 
Sbjct: 287 ---------VHTAGASLGGHAVKMLGWGVDNGT--PYWLCANSWNEDWGENGYFRIIRGV 335

Query: 293 DECGIESSITAGVPKL 308
           +ECGIE  +  G+PKL
Sbjct: 336 NECGIEGGVVGGIPKL 351



 Score = 60.8 bits (146), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 34/88 (38%), Positives = 47/88 (53%), Gaps = 12/88 (13%)

Query: 44  NSLSNIPRAHLKSWMGVHPDY---NLPANRLPEL--------IGYSEV-DEDLPANFDSR 91
           N +    +A L S+   +PD     L   ++ E+        + + EV D  +P +FDSR
Sbjct: 45  NKVQTSFKAELGSYFSSYPDTIKKQLMGAKMVEIPEEYRVFEMTHPEVEDAAVPDSFDSR 104

Query: 92  TKWPNCPTIREIRDQGSCGSCWGCRPYE 119
           T WPNCP+I +IRDQ SCGSCW     E
Sbjct: 105 TAWPNCPSISKIRDQSSCGSCWAVSAAE 132



 Score = 46.6 bits (109), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 19/32 (59%), Positives = 25/32 (78%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
           +CG GCNGG+P  AWR++VK G V+GG+Y  K
Sbjct: 162 VCGNGCNGGYPIEAWRHYVKKGYVTGGSYQDK 193


>gi|121073168|gb|ABM47070.1| cathepsin B1 [Clonorchis sinensis]
 gi|358341105|dbj|GAA29748.2| cathepsin B [Clonorchis sinensis]
          Length = 339

 Score =  147 bits (371), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 79/194 (40%), Positives = 106/194 (54%), Gaps = 40/194 (20%)

Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEK 174
           CRPY   PCEHHV G R  C      TP+CV++CQ  Y   Y+ D  +G K+YS+ S+++
Sbjct: 186 CRPYSFPPCEHHVVGPRKPCTGDPT-TPQCVKKCQPEYPKTYENDKWYGLKAYSIHSDQE 244

Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
           +IM+++  +GP+E  F V+ D   Y SG +                              
Sbjct: 245 AIMRDLMTYGPLEVDFEVYADFPSYSSGVY------------------------------ 274

Query: 235 TVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE 294
                   + +G  LGGHA+R++GWG ++ +   YWLIANSWNTDWGD G FKI RG +E
Sbjct: 275 -------RHVAGGLLGGHAVRLVGWGVEDGAD--YWLIANSWNTDWGDGGYFKIRRGVNE 325

Query: 295 CGIESSITAGVPKL 308
           CGIES   AG PKL
Sbjct: 326 CGIESDANAGHPKL 339



 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 20/34 (58%), Positives = 27/34 (79%)

Query: 81  DEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           +++LP +FD+R KWP C +I EIRDQ +CGSCW 
Sbjct: 85  EQELPESFDAREKWPYCSSIAEIRDQSNCGSCWA 118



 Score = 45.8 bits (107), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 16/30 (53%), Positives = 22/30 (73%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GC GG+P  AW YWV++G+V+G  Y +
Sbjct: 153 CGMGCQGGYPAQAWEYWVRNGLVTGDLYNT 182


>gi|38639325|gb|AAR25800.1| cathepsin B-like cysteine proteinase [Solanum tuberosum]
          Length = 354

 Score =  147 bits (370), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 102/324 (31%), Positives = 145/324 (44%), Gaps = 96/324 (29%)

Query: 37  GSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
           G K A    LSN   +  K  +GV P        +P L  +  + E LP  FD+R  WP 
Sbjct: 55  GWKAAFNPQLSNFTVSQFKRLLGVKPAREGDLEGIPVLT-HPRLKE-LPKEFDARKAWPQ 112

Query: 97  CPTIREIRDQGSCGSCWG-----------CRPYEIA------------------------ 121
           C TI +I DQG CGSCW            C  Y ++                        
Sbjct: 113 CSTIGKILDQGHCGSCWAFGAVESLSDRFCIHYNLSISLSVNDLLACCSFLCGSGCDGGY 172

Query: 122 ----------------PCEHHVNGT---RPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
                            C+ + + T    P C+     TPKC R+C +  +V ++K  ++
Sbjct: 173 PIAAWRYFKRSGVVTEECDPYFDTTGCSHPGCEPLYP-TPKCHRKCVKG-NVLWRKSKHY 230

Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
           G  +Y VS + +SIM E+Y++GPVE +FTV++D   YKSG +                  
Sbjct: 231 GVNAYRVSHDPQSIMAEVYKNGPVEVSFTVYEDFAHYKSGVY------------------ 272

Query: 223 DNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
                               + +G  +GGHA++++GWG  E+  E YWLI NSWN  WG+
Sbjct: 273 -------------------KHVTGGNMGGHAVKLIGWGTSEQG-EDYWLIVNSWNRGWGE 312

Query: 283 NGLFKILRGKDECGIESSITAGVP 306
           +G FKI RG +ECGIE S+ AG+P
Sbjct: 313 DGYFKIRRGTNECGIEHSVVAGLP 336



 Score = 39.7 bits (91), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 15/25 (60%), Positives = 21/25 (84%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVS 32
           LCG GC+GG+P  AWRY+ +SG+V+
Sbjct: 163 LCGSGCDGGYPIAAWRYFKRSGVVT 187


>gi|76576341|gb|ABA53864.1| cathepsin B-like cysteine protease 2 [Parelaphostrongylus tenuis]
          Length = 344

 Score =  147 bits (370), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 78/190 (41%), Positives = 98/190 (51%), Gaps = 39/190 (20%)

Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEK 174
           CRPYEI PC HH N T          TP CV  CQ  Y + Y  D  FG  SY++ S+  
Sbjct: 192 CRPYEIPPCGHHRNETFYGNCTQIADTPDCVTTCQAGYPISYDDDKTFGKDSYTIESSVT 251

Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
           +I KEI  +GPV  AF V++D   Y  G                                
Sbjct: 252 AIQKEIMTYGPVTAAFIVYEDFFHYHRG-------------------------------- 279

Query: 235 TVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE 294
                +  + SG   GGHA+RILGWGE++ +   YWL+ANSWNTDWG+NG F+ILRG +E
Sbjct: 280 -----IYKHVSGGEEGGHAVRILGWGEEKGTA--YWLVANSWNTDWGENGYFRILRGSNE 332

Query: 295 CGIESSITAG 304
           CGIE ++ AG
Sbjct: 333 CGIEENVVAG 342



 Score = 47.0 bits (110), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 18/33 (54%), Positives = 27/33 (81%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
           CG GC+GG+P  AW Y+V++G+V+GG YG+K +
Sbjct: 159 CGDGCDGGYPISAWEYFVETGVVTGGLYGTKDS 191


>gi|313229093|emb|CBY18245.1| unnamed protein product [Oikopleura dioica]
          Length = 355

 Score =  147 bits (370), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 110/345 (31%), Positives = 147/345 (42%), Gaps = 122/345 (35%)

Query: 32  SGGAYGSKQAEKNSLSNIPRAHLKSWMGV--HPDYNLPANRLP--ELIGYSEVDEDLPAN 87
           +G   G    E+ +L ++     +SW+G   + DY+ P  + P  +L+G      D+PA 
Sbjct: 50  AGWTAGENFHEQTTLEDV-----RSWLGAWSNKDYDWP-QKYPHDDLVG------DIPAT 97

Query: 88  FDSRTKWPNCPTIREIRDQGSCGSCW---------------------------------- 113
           FDSR+ W +C  I +IRDQG CGSCW                                  
Sbjct: 98  FDSRSNWSDCSVIGKIRDQGGCGSCWAFGAAEAISDRICIASKGATDVMYAAEDVLSCCL 157

Query: 114 ----GCR-PYEIAPCEHHVN---------GTRPSCD------------------ASKGHT 141
               GC   Y +A  E+ V          GT+ +C                      G T
Sbjct: 158 TCGNGCNGGYPLAAMEYFVTRGLVTGGLYGTKDTCQPYTLEACEHHVPGDRPPCTEGGGT 217

Query: 142 PKCVRECQENYDV-PYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYK 200
           PKC  +C  +Y    YK D   G K+YSV ++   I +EI  +GPVE AFTV+ D   YK
Sbjct: 218 PKCSHQCIPDYTTKAYKDDKVHGHKAYSVPNDVGKIQQEIMHYGPVEAAFTVYSDFPSYK 277

Query: 201 SGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWG 260
           SG +                                      + SG  LGGHAI+I+GWG
Sbjct: 278 SGVY-------------------------------------RHTSGSELGGHAIKIIGWG 300

Query: 261 EDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGV 305
            +    + YWLI NSWN+DWGD G FKILRG +ECGIE  + A  
Sbjct: 301 TE--GGDDYWLINNSWNSDWGDKGTFKILRGSNECGIEGEVVAAT 343



 Score = 42.4 bits (98), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 18/31 (58%), Positives = 23/31 (74%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
           CG GCNGG+P  A  Y+V  G+V+GG YG+K
Sbjct: 159 CGNGCNGGYPLAAMEYFVTRGLVTGGLYGTK 189


>gi|1008858|gb|AAA79004.1| cathepsin B-like thiol protease [Aedes aegypti]
          Length = 342

 Score =  146 bits (369), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 102/319 (31%), Positives = 141/319 (44%), Gaps = 102/319 (31%)

Query: 50  PRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSC 109
           P A+ K   GV  D  L   RL   I  +  D  LP +FD+R KW  CP++  IR+QG C
Sbjct: 57  PAAYFK---GVLYD-RLGETRLAPAILVNPQDIQLPESFDARQKWSQCPSLNVIRNQGCC 112

Query: 110 GSCW--------------------------------------GCRPYEIAPC-----EHH 126
           GSCW                                      GC+   + P      E  
Sbjct: 113 GSCWAISAASAMTDRWCIKSKGKEQFSFGATDMLACCHACGDGCKGGYLGPAWQFWVEQG 172

Query: 127 VNGTRP-------------SCDAS--KGHTPKCVRECQENYDVP-YKKDLNFGAKSYSVS 170
           V+   P              CDAS  +  TPKC + CQ  Y+V    +D  +G  +YS+ 
Sbjct: 173 VSSGGPYNSRQGCHPYPIDVCDASGEEADTPKCSKRCQSGYNVTDVWQDRRYGRVAYSIP 232

Query: 171 SNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA 230
           ++E+ IM+EIY +GPV+ AF  + DL  YKSG +                          
Sbjct: 233 NDEQKIMEEIYINGPVQAAFMTYQDLHAYKSGVY-------------------------- 266

Query: 231 EGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILR 290
                       +  G   GGHA++++GWG +  +  KYWL+ANSW  DWGDNG FKI+R
Sbjct: 267 -----------RHVWGHMAGGHAVKLMGWGVE--NGLKYWLVANSWGDDWGDNGFFKIVR 313

Query: 291 GKDECGIESSITAGVPKLD 309
           G++ CGIE  + AG+P  +
Sbjct: 314 GENHCGIEKDVHAGLPSFN 332



 Score = 44.3 bits (103), Expect = 0.065,   Method: Compositional matrix adjust.
 Identities = 18/32 (56%), Positives = 24/32 (75%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CG GC GG+ G AW++WV+ G+ SGG Y S+Q
Sbjct: 152 CGDGCKGGYLGPAWQFWVEQGVSSGGPYNSRQ 183


>gi|356572872|ref|XP_003554589.1| PREDICTED: cathepsin B-like [Glycine max]
          Length = 356

 Score =  146 bits (369), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 100/315 (31%), Positives = 136/315 (43%), Gaps = 96/315 (30%)

Query: 46  LSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRD 105
            SN      K  +GV P         P +     +   LP NFD+RT W  C TI  I D
Sbjct: 64  FSNYTVEQFKRLLGVKPTPKKELRSTPAISHPKSLK--LPKNFDARTAWSQCSTIGRILD 121

Query: 106 QGSCGSCWG-----------CRPYEI----------APC--------------------E 124
           QG CGSCW            C  +++          A C                     
Sbjct: 122 QGHCGSCWAFGAVESLSDRFCIHFDVNISLSVNDLLACCGFLCGSGCDGGYPLYAWQYLA 181

Query: 125 HH-------------VNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSS 171
           HH             +  + P C+ +   TPKCV++C     V +KK  ++   +Y VSS
Sbjct: 182 HHGVVTEECDPYFDQIGCSHPGCEPAY-RTPKCVKKCVSGNQV-WKKSKHYSVNAYRVSS 239

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           +   IM E+Y++GPVE AFTV++D   YKSG +                           
Sbjct: 240 DPHDIMTEVYKNGPVEVAFTVYEDFAHYKSGVY--------------------------- 272

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
                      + +G  LGGHA++++GWG  E   E YWL+AN WN +WGD+G FKI RG
Sbjct: 273 ----------KHITGYELGGHAVKLIGWGTTEDG-EDYWLLANQWNREWGDDGYFKIRRG 321

Query: 292 KDECGIESSITAGVP 306
            +ECGIE  +TAG+P
Sbjct: 322 TNECGIEEDVTAGLP 336


>gi|300176937|emb|CBK25506.2| unnamed protein product [Blastocystis hominis]
          Length = 320

 Score =  146 bits (369), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 77/194 (39%), Positives = 110/194 (56%), Gaps = 40/194 (20%)

Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEK 174
           C  YE   C+HHV G  P C  ++  TP+CV +CQE Y V YKKD +F  ++Y V SN +
Sbjct: 167 CNAYEFPKCDHHVEGKYPPCGETQ-PTPECVEKCQEGYPVEYKKDKHFFGEAYHVPSNVE 225

Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
           +I  E+  +GP+E  F+V++D + YKSG                                
Sbjct: 226 AIKTELMTNGPIEVDFSVYEDFMTYKSG-------------------------------- 253

Query: 235 TVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE 294
                +  + +GK LGGHA++++GWG ++    +YW IANSWN DWG+NG F+I+ GK+E
Sbjct: 254 -----IYQHVAGKYLGGHAVKLVGWGVEDGV--EYWKIANSWNEDWGENGYFRIIAGKNE 306

Query: 295 CGIESSITAGVPKL 308
           CGIES   AG+P+L
Sbjct: 307 CGIESDGVAGIPEL 320



 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 19/31 (61%), Positives = 25/31 (80%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
           CGFGCNGG+P MAW ++  +G+ +GG YGSK
Sbjct: 134 CGFGCNGGWPSMAWSWFHSTGVTTGGEYGSK 164


>gi|157167283|ref|XP_001658486.1| cathepsin b [Aedes aegypti]
 gi|108876477|gb|EAT40702.1| AAEL007599-PA [Aedes aegypti]
          Length = 342

 Score =  146 bits (369), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 102/319 (31%), Positives = 141/319 (44%), Gaps = 102/319 (31%)

Query: 50  PRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSC 109
           P A+ K   GV  D  L   RL   I  +  D  LP +FD+R KW  CP++  IR+QG C
Sbjct: 57  PAAYFK---GVLYD-RLGETRLAPAILVNPQDIQLPESFDARQKWSQCPSLNVIRNQGCC 112

Query: 110 GSCW--------------------------------------GCRPYEIAPC-----EHH 126
           GSCW                                      GC+   + P      E  
Sbjct: 113 GSCWAISAASAMTDRWCIKSKGKEQFSFGATDMLACCHACGDGCKGGYLGPAWQFWVEQG 172

Query: 127 VNGTRP-------------SCDAS--KGHTPKCVRECQENYDVP-YKKDLNFGAKSYSVS 170
           V+   P              CDAS  +  TPKC + CQ  Y+V    +D  +G  +YS+ 
Sbjct: 173 VSSGGPYNSRQGCHPYPIDVCDASGEEADTPKCSKRCQSGYNVTDVWQDRRYGRVAYSIP 232

Query: 171 SNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA 230
           ++E+ IM+EIY +GPV+ AF  + DL  YKSG +                          
Sbjct: 233 NDEQKIMEEIYINGPVQAAFMTYQDLHAYKSGVY-------------------------- 266

Query: 231 EGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILR 290
                       +  G   GGHA++++GWG +  +  KYWL+ANSW  DWGDNG FKI+R
Sbjct: 267 -----------RHVWGHMAGGHAVKLMGWGVE--NGLKYWLVANSWGDDWGDNGFFKIVR 313

Query: 291 GKDECGIESSITAGVPKLD 309
           G++ CGIE  + AG+P  +
Sbjct: 314 GENHCGIEKDVHAGLPSFN 332



 Score = 44.3 bits (103), Expect = 0.062,   Method: Compositional matrix adjust.
 Identities = 18/32 (56%), Positives = 24/32 (75%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CG GC GG+ G AW++WV+ G+ SGG Y S+Q
Sbjct: 152 CGDGCKGGYLGPAWQFWVEQGVSSGGPYNSRQ 183


>gi|308488328|ref|XP_003106358.1| hypothetical protein CRE_16047 [Caenorhabditis remanei]
 gi|308253708|gb|EFO97660.1| hypothetical protein CRE_16047 [Caenorhabditis remanei]
          Length = 343

 Score =  146 bits (369), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 88/213 (41%), Positives = 110/213 (51%), Gaps = 56/213 (26%)

Query: 107 GSCGSCWGCRPYEIAPCEHHVNG-TRPSCDASKGHTPKCVRECQEN--YDVPYKKDLNFG 163
           GS  S +GC+PY IAPC   VNG T P C  S   TPKCV  C  N  Y +PY+KD ++G
Sbjct: 174 GSYESQFGCKPYSIAPCGQTVNGVTWPKCPNSDADTPKCVDHCTSNSSYPIPYEKDKHYG 233

Query: 164 AKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRD 223
           A +Y+VS     I  EI ++GPVE  F                                 
Sbjct: 234 ATAYAVSRKVDQIQSEILKNGPVEVGF--------------------------------- 260

Query: 224 NTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSW 276
                      TV+ D   YKSG         LGGHA+++LGWG D  +   YWL ANSW
Sbjct: 261 -----------TVYADFYQYKSGVYVHVAGPELGGHAVKLLGWGVDNGTP--YWLAANSW 307

Query: 277 NTDWGDNGLFKILRGKDECGIESSITAGVPKLD 309
           NT+WG+NG F+ILRG +ECGIES + AG+P L+
Sbjct: 308 NTNWGENGYFRILRGVNECGIESQVVAGMPDLE 340



 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 19/31 (61%), Positives = 26/31 (83%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
           CG GC GG+P  AW+YWVK+G+V+GG+Y S+
Sbjct: 149 CGDGCEGGYPIQAWKYWVKNGLVTGGSYESQ 179


>gi|17565162|ref|NP_503382.1| Protein W07B8.4 [Caenorhabditis elegans]
 gi|351059398|emb|CCD74288.1| Protein W07B8.4 [Caenorhabditis elegans]
          Length = 335

 Score =  146 bits (368), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 98/301 (32%), Positives = 129/301 (42%), Gaps = 109/301 (36%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPA 68
           CG GC GG+P  AWRYWVK+G+V+GG++ S+                   G  P    P 
Sbjct: 141 CGDGCEGGYPIQAWRYWVKNGLVTGGSFESQ------------------YGCKPYSIAPC 182

Query: 69  NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVN 128
                               D  T WP CP   +I D                 CEHH  
Sbjct: 183 GE----------------TIDGVT-WPECPM--KISDT--------------PKCEHHCT 209

Query: 129 GTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEG 188
           G                     +Y +PY +D +FGA +Y++  + K I  EI  HGPVE 
Sbjct: 210 G-------------------NNSYPIPYDQDKHFGASAYAIGRSAKQIQTEILAHGPVEV 250

Query: 189 AFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKA 248
            F V++D  LYK+G +                                      + +G  
Sbjct: 251 GFIVYEDFYLYKTGIY-------------------------------------THVAGGE 273

Query: 249 LGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           LGGHA+++LGWG D  +   YWL ANSWNT WG+ G F+ILRG DECGIES+  AG+P L
Sbjct: 274 LGGHAVKMLGWGVDNGT--PYWLAANSWNTVWGEKGYFRILRGVDECGIESAAVAGMPDL 331

Query: 309 D 309
           +
Sbjct: 332 N 332


>gi|390994431|gb|AFM37365.1| cathepsin B2 [Dictyocaulus viviparus]
          Length = 346

 Score =  146 bits (368), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 80/209 (38%), Positives = 109/209 (52%), Gaps = 40/209 (19%)

Query: 101 REIRDQGSCGSCWGCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENYDVPYKKD 159
           + I   GS  S  GC+PY   PCEHH NGT    C      T  C  +CQ  Y   Y  D
Sbjct: 177 KGIVSGGSYTSKSGCKPYPFPPCEHHTNGTHYHPCPKDLYPTNTCEHKCQSGYATAYTND 236

Query: 160 LNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKW 219
             +GAK+Y+V++  K+I KEI  HGPVE A+ V++D   Y  G                 
Sbjct: 237 KRYGAKAYTVAARVKAIQKEIMLHGPVEVAYDVYEDFEHYLKG----------------- 279

Query: 220 TIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
                               +  + +G  LGGHA++++GWG +  +   YW+ +NSWN+D
Sbjct: 280 --------------------IYKHTAGSYLGGHAVKMIGWGTE--NGIPYWICSNSWNSD 317

Query: 280 WGDNGLFKILRGKDECGIESSITAGVPKL 308
           WG+NG F+ILRG DECGIES + AG+PK+
Sbjct: 318 WGENGFFRILRGTDECGIESGVVAGLPKI 346



 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 24/35 (68%), Positives = 27/35 (77%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK 43
           CGFGC+GGFP  AW YWV+ GIVSGG+Y SK   K
Sbjct: 158 CGFGCDGGFPYAAWNYWVEKGIVSGGSYTSKSGCK 192


>gi|1777779|gb|AAB40605.1| cathepsin B-like cysteine proteinase [Ascaris suum]
 gi|324515014|gb|ADY46062.1| Cathepsin B cysteine proteinase 6 [Ascaris suum]
          Length = 398

 Score =  146 bits (368), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 79/198 (39%), Positives = 107/198 (54%), Gaps = 41/198 (20%)

Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENY-DVPYKKDLNFGAKSYSVSS 171
           GC+PY   PCEHH N T    C      TPKC ++C + Y +  Y +D  FG  +Y V  
Sbjct: 218 GCKPYPFPPCEHHSNKTHYQPCKHDLYPTPKCEKKCLDIYTEKTYAEDKFFGETAYGVED 277

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           +  SI KEI  HGPVE AF V++D ++Y  G                             
Sbjct: 278 DVTSIQKEILTHGPVEVAFEVYEDFLMYDGG----------------------------- 308

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
                   + ++  GK  GGHA+++LGWG ++     YWL+ANSWNTDWG++G F+I+RG
Sbjct: 309 --------IYVHTGGKIGGGHAVKMLGWGVEQGVP--YWLVANSWNTDWGEDGFFRIIRG 358

Query: 292 KDECGIESSITAGVPKLD 309
            DECGIESS+  G+PKL+
Sbjct: 359 IDECGIESSVVGGLPKLN 376



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 21/35 (60%), Positives = 25/35 (71%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK 43
           CGFGC+GG P  AW+YWVK GIV+G  +  KQ  K
Sbjct: 186 CGFGCDGGDPMAAWKYWVKEGIVTGSNFTMKQGCK 220


>gi|388500062|gb|AFK38097.1| unknown [Lotus japonicus]
          Length = 357

 Score =  145 bits (367), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 100/315 (31%), Positives = 134/315 (42%), Gaps = 96/315 (30%)

Query: 46  LSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRD 105
            SN   A  K  +GV P         P +     +   LP +FD+RT W  C TI  I D
Sbjct: 65  FSNYTVAQFKRLLGVKPSPKKELRSTPVVSHPRSLK--LPKSFDARTAWSQCSTIGRILD 122

Query: 106 QGSCGSCWGCRPYE---------------------IAPC--------------------E 124
           QG CGSCW     E                     +A C                     
Sbjct: 123 QGHCGSCWAFGAVESLSDRFCIHLDVNVSLSVNDLLACCGFLCGSGCDGGYPLYAWRYLA 182

Query: 125 HH-------------VNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSS 171
           HH             +  + P C+ +   TPKCVR+C +   + +KK   F   +YSV S
Sbjct: 183 HHGVVTEECDPYFDQIGCSHPGCEPAY-QTPKCVRKCVKGNQI-WKKSKYFSVNAYSVKS 240

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           +   IM E+Y++GPVE AFTV++D   YKSG +                           
Sbjct: 241 DPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVY--------------------------- 273

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
                      + +G  LGGHA++++GWG  ++  E YWLIAN WN  WGD+G F I RG
Sbjct: 274 ----------KHITGSQLGGHAVKLIGWGTTDEG-EDYWLIANQWNRSWGDDGYFMIRRG 322

Query: 292 KDECGIESSITAGVP 306
            +ECGIE  +TAG+P
Sbjct: 323 TNECGIEEDVTAGLP 337


>gi|329668994|gb|AEB96385.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
          Length = 316

 Score =  145 bits (367), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 80/191 (41%), Positives = 99/191 (51%), Gaps = 39/191 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
            CRPYEI PC  H N T  S    +  TP C   CQ  Y + Y  D  +G  +YSVS++ 
Sbjct: 163 ACRPYEIPPCGIHKNETFYSNCTQEIDTPDCKTTCQAGYPISYDDDKTYGKTAYSVSNSV 222

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            +I KEI  +GPV  AFTV+DD   YK+G                               
Sbjct: 223 HAIQKEIMTYGPVVAAFTVYDDFFHYKTG------------------------------- 251

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                 +  + SG   GGHA+RILGWG  ++    YWL+ANSWNTDWG+NG F+ILRG D
Sbjct: 252 ------IYKHVSGAEAGGHAVRILGWG--QQGGVPYWLVANSWNTDWGENGYFRILRGSD 303

Query: 294 ECGIESSITAG 304
           ECGIE  + AG
Sbjct: 304 ECGIEDGVVAG 314



 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 26/64 (40%), Positives = 35/64 (54%), Gaps = 9/64 (14%)

Query: 78  SEVD-EDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDA 136
           +E+D   +P +FD+R  WP+CP+I  IRDQ  CGSCW     E+         +   C A
Sbjct: 59  TEIDGSKIPDSFDARVTWPHCPSISYIRDQSQCGSCWAFSSAEVM--------SDRVCIA 110

Query: 137 SKGH 140
           S GH
Sbjct: 111 SHGH 114



 Score = 47.0 bits (110), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 18/32 (56%), Positives = 28/32 (87%)

Query: 10  GFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
           G+GC+GG+P  AW+Y+V++G+V+GG YG+K A
Sbjct: 132 GYGCDGGWPVSAWQYFVETGVVTGGLYGTKDA 163


>gi|356505709|ref|XP_003521632.1| PREDICTED: cathepsin B-like [Glycine max]
          Length = 357

 Score =  145 bits (367), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 100/316 (31%), Positives = 137/316 (43%), Gaps = 98/316 (31%)

Query: 46  LSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDE-DLPANFDSRTKWPNCPTIREIR 104
            SN      K  +GV P   +P   L      S      LP NFD+RT W  C TI  I 
Sbjct: 65  FSNYTVEQFKRLLGVKP---MPKKELRSTPAISHPKTLKLPKNFDARTAWSQCSTIGRIL 121

Query: 105 DQGSCGSCWG-----------CRPYEI----------APC-------------------- 123
           DQG CGSCW            C  +++          A C                    
Sbjct: 122 DQGHCGSCWAFGAVESLSDRFCIHFDVNISLSVNDLLACCGFLCGSGCDGGYPLYAWRYL 181

Query: 124 EHH-------------VNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVS 170
            HH             +  + P C+ +   TPKCV++C     V +KK  ++   +Y V+
Sbjct: 182 AHHGVVTEECDPYFDQIGCSHPGCEPAY-RTPKCVKKCVSGNQV-WKKSKHYSVSAYRVN 239

Query: 171 SNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA 230
           S+   IM E+Y++GPVE AFTV++D   YKSG +                          
Sbjct: 240 SDPHDIMAEVYKNGPVEVAFTVYEDFAYYKSGVY-------------------------- 273

Query: 231 EGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILR 290
                       + +G  LGGHA++++GWG  +   E YWL+AN WN +WGD+G FKI R
Sbjct: 274 -----------KHITGYELGGHAVKLIGWGTTDDG-EDYWLLANQWNREWGDDGYFKIRR 321

Query: 291 GKDECGIESSITAGVP 306
           G +ECGIE  +TAG+P
Sbjct: 322 GTNECGIEEDVTAGLP 337



 Score = 37.7 bits (86), Expect = 7.5,   Method: Compositional matrix adjust.
 Identities = 14/25 (56%), Positives = 18/25 (72%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVS 32
           LCG GC+GG+P  AWRY    G+V+
Sbjct: 164 LCGSGCDGGYPLYAWRYLAHHGVVT 188


>gi|118358710|ref|XP_001012596.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89294363|gb|EAR92351.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 346

 Score =  145 bits (367), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 77/195 (39%), Positives = 105/195 (53%), Gaps = 43/195 (22%)

Query: 115 CRPYEIAPCEHHVNG-TRPSCDASKGHTPKCVRECQEN--YDVPYKKDLNFGAKSYSVSS 171
           C+ Y +APC HHV     P C      TP CV+ C  N  Y +PY KDL+ G+K+YS+  
Sbjct: 190 CQAYSLAPCAHHVTSDVYPPCTGELP-TPPCVKSCDSNSTYTIPYPKDLHKGSKAYSIDQ 248

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           NE++IM EI  +GP+E AFTV++D + YKSG +                           
Sbjct: 249 NEQAIMTEIQTNGPIEVAFTVYEDFLTYKSGVY--------------------------- 281

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
                      + +G  LGGHA++++GWG +  +   YW+I NSWN  WGD G FKILRG
Sbjct: 282 ----------QHVTGSELGGHAVKMVGWGVENGT--PYWIIVNSWNESWGDKGTFKILRG 329

Query: 292 KDECGIESSITAGVP 306
           ++ECGIES     +P
Sbjct: 330 QNECGIESECVTALP 344



 Score = 40.0 bits (92), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 16/30 (53%), Positives = 23/30 (76%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CGFGC+GG+P  A  Y+V +G+V+G  YG+
Sbjct: 157 CGFGCDGGWPEAAMDYYVNNGLVTGDLYGN 186


>gi|118365170|ref|XP_001015806.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89297573|gb|EAR95561.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 340

 Score =  145 bits (367), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 76/193 (39%), Positives = 109/193 (56%), Gaps = 40/193 (20%)

Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDV-PYKKDLNFGAKSYSVSSNE 173
           C+PY   PC+HHV G    C   +  TP+CV+EC   Y    Y+KDL+F +++YS+  N 
Sbjct: 187 CKPYIFPPCDHHVTGQYQPCGPIQP-TPQCVKECNSEYTQNTYEKDLHFASQTYSIKQNV 245

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           ++I +EI  HGPV+ +F V  D + YKSG +                IR+          
Sbjct: 246 QAIQREIMAHGPVQASFKVAADFLTYKSGVY----------------IRN---------- 279

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                        K  GGH+++I+GWG++  +   YWLIANSWN DWG+ GLF++LRG++
Sbjct: 280 ----------PKLKYEGGHSVKIIGWGKEGNT--PYWLIANSWNEDWGEKGLFRMLRGRN 327

Query: 294 ECGIESSITAGVP 306
           ECGIE+ I AG+P
Sbjct: 328 ECGIEAQIVAGLP 340



 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 18/33 (54%), Positives = 25/33 (75%)

Query: 82  EDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           + +P  FD+R +WPNC +I+ IRDQ +CGSCW 
Sbjct: 86  DPIPEFFDAREQWPNCQSIKLIRDQSTCGSCWA 118



 Score = 40.4 bits (93), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 15/29 (51%), Positives = 19/29 (65%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYG 37
           CG GC GG+P  AW Y  + G+ +GG YG
Sbjct: 154 CGMGCKGGYPSAAWGYMKRQGVSTGGLYG 182


>gi|226471008|emb|CAX70585.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  145 bits (367), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 80/199 (40%), Positives = 106/199 (53%), Gaps = 53/199 (26%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY    CEH   G  P+C      TP+C + CQ+ Y  PY++D ++G + Y+V SNE
Sbjct: 187 GCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDQRYNVISNE 246

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K+I +EI  +GPVE A                                            
Sbjct: 247 KAIQREIMMYGPVEAA-------------------------------------------- 262

Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
           F V++D + YKSG         +GGHAIRI+GWG  EK K  YWLIANSWN DWG+NGLF
Sbjct: 263 FDVYEDFLNYKSGIYRHVAGSIVGGHAIRIIGWGV-EKGK-PYWLIANSWNEDWGENGLF 320

Query: 287 KILRGKDECGIESSITAGV 305
           +++RG+DEC IES + AG+
Sbjct: 321 RMVRGRDECSIESHVVAGL 339



 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 22/52 (42%), Positives = 34/52 (65%), Gaps = 1/52 (1%)

Query: 63  DYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           D  +   R P  + + +++ ++P+ FDSR KWP+C +I +IRDQ  CGSCW 
Sbjct: 70  DAEMKRKRRP-TVDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWA 120


>gi|226471006|emb|CAX70584.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  145 bits (366), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 80/199 (40%), Positives = 106/199 (53%), Gaps = 53/199 (26%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY    CEH   G  P+C      TP+C + CQ+ Y  PY++D ++G + Y+V SNE
Sbjct: 187 GCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDQRYNVISNE 246

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K+I +EI  +GPVE A                                            
Sbjct: 247 KAIQREIMMYGPVEAA-------------------------------------------- 262

Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
           F V++D + YKSG         +GGHAIRI+GWG  EK K  YWLIANSWN DWG+NGLF
Sbjct: 263 FDVYEDFLNYKSGIYRHVAGSIVGGHAIRIIGWGV-EKGK-PYWLIANSWNEDWGENGLF 320

Query: 287 KILRGKDECGIESSITAGV 305
           +++RG+DEC IES + AG+
Sbjct: 321 RMVRGRDECSIESHVVAGL 339



 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 22/52 (42%), Positives = 34/52 (65%), Gaps = 1/52 (1%)

Query: 63  DYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           D  +   R P  + + +++ ++P+ FDSR KWP+C +I +IRDQ  CGSCW 
Sbjct: 70  DAEMKRKRRP-TVDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWA 120


>gi|17559068|ref|NP_504682.1| Protein CPR-4 [Caenorhabditis elegans]
 gi|1169085|sp|P43508.1|CPR4_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 4; AltName:
           Full=Cysteine protease-related 4; Flags: Precursor
 gi|675500|gb|AAA98785.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|695293|gb|AAA98783.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|351063163|emb|CCD71204.1| Protein CPR-4 [Caenorhabditis elegans]
          Length = 335

 Score =  145 bits (365), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 96/300 (32%), Positives = 131/300 (43%), Gaps = 110/300 (36%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPA 68
           CG+GC GG+P  AW+Y VKSG  +GG+Y ++                   G  P    P 
Sbjct: 146 CGYGCEGGYPINAWKYLVKSGFCTGGSYEAQ------------------FGCKPYSLAPC 187

Query: 69  NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVN 128
               E +G                 WP+CP      D G          Y+   C +   
Sbjct: 188 G---ETVG--------------NVTWPSCP------DDG----------YDTPACVN--- 211

Query: 129 GTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEG 188
                         KC     +NY+V Y  D +FG+ +Y+V      I  EI  HGPVE 
Sbjct: 212 --------------KCT---NKNYNVAYTADKHFGSTAYAVGKKVSQIQAEIIAHGPVEA 254

Query: 189 AFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKA 248
           AFTV++D   YK+G +                                     ++ +G+ 
Sbjct: 255 AFTVYEDFYQYKTGVY-------------------------------------VHTTGQE 277

Query: 249 LGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           LGGHAIRILGWG D  +   YWL+ANSWN +WG+NG F+I+RG +ECGIE ++  GVPK+
Sbjct: 278 LGGHAIRILGWGTDNGT--PYWLVANSWNVNWGENGYFRIIRGTNECGIEHAVVGGVPKV 335


>gi|226471004|emb|CAX70583.1| Cysteine PRotease related protein [Schistosoma japonicum]
          Length = 304

 Score =  145 bits (365), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 77/199 (38%), Positives = 106/199 (53%), Gaps = 53/199 (26%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY    CEH   G  P+C      TP+C + CQ+ Y  PY++D ++G + Y+V SNE
Sbjct: 149 GCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDQRYNVISNE 208

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K+I +EI  +GPVE A                                            
Sbjct: 209 KAIQREIMMYGPVEAA-------------------------------------------- 224

Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
           F V++D + YKSG         +GGHAIRI+GWG ++++   YWLIANSWN DWG+ GLF
Sbjct: 225 FDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVEKRTP--YWLIANSWNEDWGEKGLF 282

Query: 287 KILRGKDECGIESSITAGV 305
           +I+RG+DEC IES + AG+
Sbjct: 283 RIVRGRDECSIESHVVAGL 301



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 20/27 (74%), Positives = 22/27 (81%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           CG GC GGFPG AW YWVK GIV+GG+
Sbjct: 117 CGDGCKGGFPGQAWDYWVKRGIVTGGS 143


>gi|86279343|gb|ABC88767.1| putative cathepsin B-like proteinase [Tenebrio molitor]
          Length = 321

 Score =  144 bits (364), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 92/308 (29%), Positives = 138/308 (44%), Gaps = 96/308 (31%)

Query: 54  LKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
           L  ++G+HPD     N  P ++ ++    D+P +FD+RTKWPNC ++  IRDQG+CGSCW
Sbjct: 57  LNGFIGLHPD----PNYKPPVLVHTFNARDVPESFDARTKWPNCDSLNRIRDQGACGSCW 112

Query: 114 GCRP--------------------------------------YEIAPCEHHVN------- 128
                                                     Y ++  + ++N       
Sbjct: 113 AFASIESMSDRICIHSSGSAQFMFSPEDLLSCCTSCGDCGGGYMMSALDFYINEGIVSGG 172

Query: 129 ------GTRP-SCDA-SKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEI 180
                 G RP + DA  +G TP C + C+  Y   Y  D ++G+  Y VSS    I  E+
Sbjct: 173 DVNSNEGCRPYTADAHDQGQTPACTKSCRNGYSTSYSADKHYGSNDYVVSSVIDQIQYEV 232

Query: 181 YEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDL 240
             +GP+   F VF D   Y SG +                                    
Sbjct: 233 MTNGPIIVNFEVFQDFYNYVSGVY------------------------------------ 256

Query: 241 ILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESS 300
             + SG+++G H ++I+GWG +  +   YWLIANSW + WGD+G FK+LRG++ECGIE+ 
Sbjct: 257 -RHVSGESVGFHVVKIVGWGVE--NGVPYWLIANSWGSSWGDHGFFKMLRGQNECGIENY 313

Query: 301 ITAGVPKL 308
             A +P+L
Sbjct: 314 PYAVMPRL 321


>gi|326427908|gb|EGD73478.1| cathepsin B [Salpingoeca sp. ATCC 50818]
          Length = 341

 Score =  144 bits (363), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 78/195 (40%), Positives = 104/195 (53%), Gaps = 41/195 (21%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY +A CEHH  G    C      TP C R C++ Y+V Y  D +FGA SY V   +
Sbjct: 188 GCQPYSLAKCEHHTTGPYKPC-GDIVPTPACKRSCRQGYNVTYPNDKHFGASSYGVRGVD 246

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           + I  EI  +GPVE AFTV+ D + YKSG +                             
Sbjct: 247 Q-IATEIMTNGPVEAAFTVYSDFLSYKSGVY----------------------------- 276

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + SG+ LGGHAI+I+GWG  + +   YW++ANSWN  WG++G F I +G D
Sbjct: 277 --------QHTSGQPLGGHAIKIIGWGVQDGT--DYWIVANSWNDSWGNDGFFWIKKGTD 326

Query: 294 ECGIESSITAGVPKL 308
           ECGIES + AG+PK+
Sbjct: 327 ECGIESQVVAGLPKV 341



 Score = 48.5 bits (114), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 20/32 (62%), Positives = 22/32 (68%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CG GCNGG+P  AW YW   GIV+GG Y S Q
Sbjct: 156 CGDGCNGGYPAAAWEYWKNQGIVTGGQYDSNQ 187


>gi|442754445|gb|JAA69382.1| Putative cathepsin b precursor [Ixodes ricinus]
          Length = 340

 Score =  144 bits (363), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 89/201 (44%), Positives = 105/201 (52%), Gaps = 54/201 (26%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY +  C+HHVNGT   C      TPKCVR C++ Y+V +K D ++G  SYSV    
Sbjct: 186 GCMPYPVPSCDHHVNGTLGPC-GQDPPTPKCVRLCRKGYNVDFKDDKHYGKSSYSV---- 240

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
                                            P NET     I+  I  N      EGA
Sbjct: 241 ---------------------------------PSNETQ----IQMEIMKNGP---VEGA 260

Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
           FTV+ D  LYKSG        ALGGHAIRILGWG +  +   YWL+ANSWNT+WGD G F
Sbjct: 261 FTVYADFPLYKSGVYKSHSTDALGGHAIRILGWGVE--NDVPYWLVANSWNTEWGDKGYF 318

Query: 287 KILRGKDECGIESSITAGVPK 307
           KILRG +ECGIE  I AG+PK
Sbjct: 319 KILRGSNECGIEEDIVAGIPK 339



 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 20/32 (62%), Positives = 23/32 (71%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CG GCNGGFP  AW YWV  GIV+GG Y + +
Sbjct: 154 CGSGCNGGFPAAAWSYWVDKGIVTGGNYDTDE 185


>gi|332376204|gb|AEE63242.1| unknown [Dendroctonus ponderosae]
          Length = 338

 Score =  144 bits (363), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 84/209 (40%), Positives = 110/209 (52%), Gaps = 57/209 (27%)

Query: 107 GSCGSCWGCRPYEIAPCEHHVN-GTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAK 165
           G+  S  GC+ Y + PCEHHV  G+RP C +    TP+CVR C E+  + Y + L FG +
Sbjct: 177 GAYNSSQGCKDYSLEPCEHHVEVGSRPQCSSLNFDTPECVRSCYES-SLDYTESLTFG-Q 234

Query: 166 SYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNT 225
             S  +NEK +  EI ++GP+E A                                    
Sbjct: 235 QVSTFTNEKQMQLEILKNGPIEAA------------------------------------ 258

Query: 226 SQLGAEGAFTVFDDLILYKSG--------KALGGHAIRILGWGEDEKSKEKYWLIANSWN 277
                   FTV++D + YKSG        +++GGHAI++LGWG +E +K  YWLIANSWN
Sbjct: 259 --------FTVYNDFLSYKSGVYQATAQDESVGGHAIKVLGWGVEEGTK--YWLIANSWN 308

Query: 278 TDWGDNGLFKILRGKDECGIESSITAGVP 306
           TDWGDNG FK LRG D CGIES   A +P
Sbjct: 309 TDWGDNGYFKFLRGVDHCGIESETAASLP 337



 Score = 45.8 bits (107), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 19/36 (52%), Positives = 23/36 (63%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKN 44
           CG GC+GG+    W YW   GIV+GGAY S Q  K+
Sbjct: 152 CGLGCDGGYVAEPWDYWRTDGIVTGGAYNSSQGCKD 187


>gi|56752811|gb|AAW24617.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  144 bits (362), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 80/201 (39%), Positives = 106/201 (52%), Gaps = 53/201 (26%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY    CEH   G  P+C      TP+C + CQ+ Y  PY++D ++G + Y+V SNE
Sbjct: 187 GCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDQRYNVISNE 246

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K+I +EI  +GPVE A                                            
Sbjct: 247 KAIQREIMMYGPVEAA-------------------------------------------- 262

Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
           F V++D + YKSG         +GGHAIRI+GWG  EK K  YWLIANSWN DWG+ GLF
Sbjct: 263 FDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGV-EKGK-PYWLIANSWNEDWGEKGLF 320

Query: 287 KILRGKDECGIESSITAGVPK 307
           +++RG+DEC IES + AG+ K
Sbjct: 321 RMVRGRDECSIESHVVAGLIK 341



 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 22/52 (42%), Positives = 34/52 (65%), Gaps = 1/52 (1%)

Query: 63  DYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           D  +   R P  + + +++ ++P+ FDSR KWP+C +I +IRDQ  CGSCW 
Sbjct: 70  DAEMKRKRRP-TVDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWA 120


>gi|242001640|ref|XP_002435463.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
 gi|215498799|gb|EEC08293.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
          Length = 223

 Score =  144 bits (362), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 80/203 (39%), Positives = 105/203 (51%), Gaps = 54/203 (26%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY +APCEH   G+ P C  +   TPKC R+C+E Y+  Y  D  F    YS++ +E
Sbjct: 68  GCKPYSLAPCEHSSQGSLPECVGTL-PTPKCKRQCREGYERSYDDDKYFAKNVYSINGSE 126

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K I  EI+++GPVE                                              
Sbjct: 127 KQIRTEIFQNGPVEAE-------------------------------------------- 142

Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
           FT + D + YKSG         +G HAIRILGWG ++ +   YWL+ANSWN DWGD+G F
Sbjct: 143 FTAYADFLSYKSGVYQHHSRDIIGRHAIRILGWGSEDNNP--YWLLANSWNEDWGDHGYF 200

Query: 287 KILRGKDECGIESSITAGVPKLD 309
           K+LRG +EC IES + AG+PKLD
Sbjct: 201 KMLRGVNECDIESFVNAGIPKLD 223



 Score = 40.8 bits (94), Expect = 0.72,   Method: Compositional matrix adjust.
 Identities = 17/35 (48%), Positives = 22/35 (62%)

Query: 9  CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK 43
          CG GC+GG    AW+YW  +G+VSGG Y +    K
Sbjct: 36 CGSGCSGGVSAAAWQYWKDAGLVSGGLYNTTDGCK 70


>gi|402594312|gb|EJW88238.1| cathepsin B5 [Wuchereria bancrofti]
          Length = 407

 Score =  144 bits (362), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 81/198 (40%), Positives = 104/198 (52%), Gaps = 43/198 (21%)

Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
           GCRPY   PCEHH N T    C      TPKC R+C +NY  PYK D  +G ++Y+V ++
Sbjct: 233 GCRPYPFPPCEHHNNKTHYEPCKHDLYPTPKCDRQCDKNYKKPYKADKYYGEQAYNVEND 292

Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
            + I KEI   GPVE +F V+ D + Y  G                              
Sbjct: 293 VELIQKEIMTLGPVEASFEVYTDFLHYIGG------------------------------ 322

Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDN---GLFKIL 289
                  +  + +G   GGHA++ILGWG D+     YWL ANSWNTDWG++   G F+IL
Sbjct: 323 -------IYKHVAGSVGGGHAVKILGWGIDQGV--SYWLAANSWNTDWGEDVFSGYFRIL 373

Query: 290 RGKDECGIESSITAGVPK 307
           RG DECGIES I AG+P+
Sbjct: 374 RGVDECGIESGIVAGIPR 391



 Score = 45.8 bits (107), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 19/30 (63%), Positives = 22/30 (73%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
           + CGFGC GG P  AW+YWV SGIV+G  Y
Sbjct: 199 KTCGFGCFGGEPMAAWKYWVLSGIVTGSDY 228


>gi|256052331|ref|XP_002569726.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
 gi|353228435|emb|CCD74606.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 319

 Score =  144 bits (362), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 75/190 (39%), Positives = 101/190 (53%), Gaps = 39/190 (20%)

Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEK 174
           C+PY    CEHH  G  P+C      TP C   CQ++Y  PY +D + G   Y+V ++EK
Sbjct: 165 CQPYPFPKCEHHTKGKYPACFEEIYKTPNCENTCQKSYKTPYAQDKHRGKSRYNVKNDEK 224

Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
           +I KEI ++GPVE  F V++D + YKSG                                
Sbjct: 225 AIQKEIMKYGPVEANFIVYEDFLNYKSG-------------------------------- 252

Query: 235 TVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE 294
                +  + +GK +  HAIRI+GWG +  +   YWLI NSWN DWG+NG F+ILRG+ E
Sbjct: 253 -----IYKHITGKLVSWHAIRIIGWGVENNT--PYWLIPNSWNEDWGENGNFRILRGRHE 305

Query: 295 CGIESSITAG 304
           C IES +TAG
Sbjct: 306 CSIESEVTAG 315



 Score = 43.1 bits (100), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 17/27 (62%), Positives = 20/27 (74%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           CG G  GGFP +AW YWVK GIV+G +
Sbjct: 132 CGDGFEGGFPALAWDYWVKEGIVTGSS 158


>gi|56752997|gb|AAW24710.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  143 bits (361), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 79/199 (39%), Positives = 105/199 (52%), Gaps = 53/199 (26%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY    CEH   G  P+C      TP+C + CQ+ Y  PY++D ++G + Y+V SNE
Sbjct: 187 GCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDQRYNVISNE 246

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K+I +EI  +GPVE A                                            
Sbjct: 247 KAIQREIMMYGPVEAA-------------------------------------------- 262

Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
           F V++D + YKSG         +GGHAIRI+GWG  EK K  YWLIANSWN DWG+ GLF
Sbjct: 263 FDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGV-EKGK-PYWLIANSWNEDWGEKGLF 320

Query: 287 KILRGKDECGIESSITAGV 305
           +++RG+DEC IES + AG+
Sbjct: 321 RMVRGRDECSIESHVVAGL 339



 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 23/52 (44%), Positives = 35/52 (67%), Gaps = 1/52 (1%)

Query: 63  DYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           D  +  NR P  + + +++ ++P+ FDSR KWP+C +I +IRDQ  CGSCW 
Sbjct: 70  DAEMKRNRRP-TVDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWA 120


>gi|194352768|emb|CAQ00112.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
 gi|326488519|dbj|BAJ93928.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326508126|dbj|BAJ99330.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 355

 Score =  143 bits (360), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 100/315 (31%), Positives = 134/315 (42%), Gaps = 96/315 (30%)

Query: 46  LSNIPRAHLKSWMGVHPDYNLPANRLPEL-IGYSEVDEDLPANFDSRTKWPNCPTIREIR 104
            +N      K  +GV P    P   L  + I       DLP  FD+RT+W +C TI  I 
Sbjct: 65  FANYTIEQFKHILGVKP---TPPGLLAGVPIKTHPKSADLPKEFDARTQWSSCSTIGNIL 121

Query: 105 DQGSCGSCWGCRPYEIAP-------------------------CEHHVNGTRP------- 132
           DQG CG+CW     E                            C    NG  P       
Sbjct: 122 DQGHCGACWAFAAVESLQDRFCIHLNMSVSLSVNDLLACCGFLCGSGCNGGYPISAWRYF 181

Query: 133 --------SCDASKGHT-------------PKCVRECQENYDVPYKKDLNFGAKSYSVSS 171
                    CD     T             PKC R+C+    V +KK+ +F   +Y V S
Sbjct: 182 RRSGVVTEECDPYFDQTGCQHPGCEPAYPTPKCHRKCKVENQV-WKKNKHFSVNAYRVHS 240

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           N   IM E+Y++GPVE AFTV++D   YKSG +                           
Sbjct: 241 NPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVY--------------------------- 273

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
                      + +G  +GGHA++++GWG  + + E YWL+AN WN  WGD+G FKI+RG
Sbjct: 274 ----------KHITGGVMGGHAVKLIGWGTSD-AGEDYWLLANQWNRGWGDDGYFKIIRG 322

Query: 292 KDECGIESSITAGVP 306
           K+ECGIE  +TAG+P
Sbjct: 323 KNECGIEEDVTAGMP 337



 Score = 40.8 bits (94), Expect = 0.80,   Method: Compositional matrix adjust.
 Identities = 16/25 (64%), Positives = 21/25 (84%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVS 32
           LCG GCNGG+P  AWRY+ +SG+V+
Sbjct: 164 LCGSGCNGGYPISAWRYFRRSGVVT 188


>gi|339236191|ref|XP_003379650.1| cathepsin B [Trichinella spiralis]
 gi|316977649|gb|EFV60721.1| cathepsin B [Trichinella spiralis]
          Length = 356

 Score =  143 bits (360), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 78/201 (38%), Positives = 101/201 (50%), Gaps = 53/201 (26%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY  APC HH NGT   C      TP C + CQ  Y + Y KD  +G K+YS+ +  
Sbjct: 199 GCRPYPFAPCNHHSNGTYGPCSHDLEPTPVCKKACQSTYKIQYNKDKYYGLKAYSLHNKA 258

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
             + KE+  +GP+E A                                            
Sbjct: 259 SDLQKELMMNGPMEVA-------------------------------------------- 274

Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
           F V++D +LYK+G         LGGHA+R+LGWGE+  +   YWL+ANSWNT+WGD G F
Sbjct: 275 FEVYEDFLLYKTGVYQHHTGSVLGGHAVRLLGWGEE--NGVPYWLLANSWNTEWGDKGFF 332

Query: 287 KILRGKDECGIESSITAGVPK 307
           KI RG++ECGIES   AG+ K
Sbjct: 333 KIYRGRNECGIESEAVAGLYK 353



 Score = 71.2 bits (173), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 32/77 (41%), Positives = 42/77 (54%), Gaps = 1/77 (1%)

Query: 39  KQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLP-ELIGYSEVDEDLPANFDSRTKWPNC 97
           K         +P   ++  MGV     L  N +P  +I Y  +D ++P  FDSR +WP C
Sbjct: 56  KAGRNPYFETVPSHVIQGMMGVRRSSKLETNSIPLPVISYEHIDMEIPVEFDSRKQWPYC 115

Query: 98  PTIREIRDQGSCGSCWG 114
           PTI EIRDQ +CGSCW 
Sbjct: 116 PTIGEIRDQSNCGSCWA 132



 Score = 47.0 bits (110), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 18/32 (56%), Positives = 24/32 (75%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           ++CGFGC GG P  AW +WVK G+V+GG Y +
Sbjct: 165 KICGFGCQGGDPHQAWSFWVKYGLVTGGNYTT 196


>gi|211853248|emb|CAP17587.1| cathepsin-like protein 4 [Crateromorpha meyeri]
          Length = 325

 Score =  143 bits (360), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 75/192 (39%), Positives = 101/192 (52%), Gaps = 53/192 (27%)

Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEK 174
           C+PY +  CEHH+NG++P+C +    TP+CV  C   Y   Y++DL++G  +YSV     
Sbjct: 180 CQPYPLPSCEHHINGSKPACPSKIAKTPECVHTCHAGYPTSYEQDLHYGESAYSVRRRVA 239

Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
            I  EI  +GPVE A                                            F
Sbjct: 240 EIQTEIMTNGPVEAA--------------------------------------------F 255

Query: 235 TVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFK 287
           TV+ D   YKSG       + LGGHA++++GWGE++     YWLIANSWN+DWGD+G FK
Sbjct: 256 TVYADFPAYKSGVYKRHSLRQLGGHAVKMIGWGEEDGIP--YWLIANSWNSDWGDHGYFK 313

Query: 288 ILRGKDECGIES 299
           I+RG+DECGIES
Sbjct: 314 IVRGQDECGIES 325



 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/56 (50%), Positives = 33/56 (58%), Gaps = 7/56 (12%)

Query: 59  GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           GV     LP + LP L       ED+P  FDSRT+WP+C TI  I DQ +CGSCW 
Sbjct: 62  GVKGSIPLPLSDLPVL-------EDIPDMFDSRTQWPDCKTIGLIEDQSNCGSCWA 110



 Score = 47.0 bits (110), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 20/44 (45%), Positives = 25/44 (56%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIP 50
           R CG GC GGF G AW YW + G+V+GG Y     E ++    P
Sbjct: 141 RNCGNGCEGGFLGAAWNYWKQEGLVTGGLYNPSATESDTCQPYP 184


>gi|91078960|ref|XP_974244.1| PREDICTED: similar to putative cathepsin B-like proteinase
           [Tribolium castaneum]
 gi|270004840|gb|EFA01288.1| cathepsin B precursor [Tribolium castaneum]
          Length = 319

 Score =  143 bits (360), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 95/308 (30%), Positives = 139/308 (45%), Gaps = 96/308 (31%)

Query: 54  LKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGS-- 111
           L  ++G+HPD     N +PE I ++   +D+P  FD+R KWP C ++  IRDQGSCGS  
Sbjct: 55  LNGFLGLHPD----PNYMPEKIKHNFNPQDIPKTFDARKKWPKCDSLNRIRDQGSCGSCW 110

Query: 112 --------------------------------CWGCRP----YEIAPCEHHVN------- 128
                                           C  C      Y +A  + ++        
Sbjct: 111 AFAAVETMSDRICIHSSGAKKFFFSAEDLLSCCTACGSCSGGYMMAAFDFYIKQGVVSGG 170

Query: 129 ------GTRP-SCDA-SKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEI 180
                 G RP + DA  KG TP C + C++ Y   Y  D ++G+K Y V +   +I  EI
Sbjct: 171 DLNSNEGCRPYTADAHDKGVTPSCTKSCRKGYPTSYSSDKHYGSKDYIVDAGVSNIQYEI 230

Query: 181 YEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDL 240
             +GP+  +F V+ D   Y SG +                                    
Sbjct: 231 MTNGPIIVSFKVYQDFYNYGSGVYH----------------------------------- 255

Query: 241 ILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESS 300
             + SG   G H ++I+GWG +++  + YWLIANSW + WG++G FKILRGK+ECGIE++
Sbjct: 256 --HVSGNYTGNHIVKIVGWGTEKE--QDYWLIANSWGSSWGEHGFFKILRGKNECGIENN 311

Query: 301 ITAGVPKL 308
             A +PKL
Sbjct: 312 PYAVLPKL 319


>gi|312271211|gb|ADQ57303.1| cathepsin B-like cysteine proteinase 1 [Angiostrongylus
           cantonensis]
          Length = 394

 Score =  142 bits (359), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 78/196 (39%), Positives = 104/196 (53%), Gaps = 41/196 (20%)

Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENY-DVPYKKDLNFGAKSYSVSS 171
           GC+PY   PCEHH N TR   C      TPKC ++C  +Y +  Y  D  +G  +Y V +
Sbjct: 218 GCKPYPFPPCEHHSNKTRFDPCRHDLYPTPKCSKKCVPSYKEKNYDDDRFYGRTAYGVKN 277

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           +  +I KEI  HGPVE AF V++D + Y  G                             
Sbjct: 278 DVAAIQKEILTHGPVEVAFEVYEDFLHYAGG----------------------------- 308

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
                   + ++  GK  GGHA++++GWG D+ +   YWLIANSWNTDWG+ G F+ILRG
Sbjct: 309 --------IYVHTGGKLGGGHAVKLIGWGIDQGTP--YWLIANSWNTDWGEEGFFRILRG 358

Query: 292 KDECGIESSITAGVPK 307
            DECGIES +  G+PK
Sbjct: 359 VDECGIESGVVGGIPK 374



 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 25/57 (43%), Positives = 34/57 (59%), Gaps = 1/57 (1%)

Query: 58  MGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           MGV+ + +L       L    ++D D+P  FD+R  W NC +I+ IRDQ SCGSCW 
Sbjct: 96  MGVN-NVHLSVKAKQHLSSTKDLDIDIPETFDARQHWSNCQSIKNIRDQSSCGSCWA 151



 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 20/37 (54%), Positives = 24/37 (64%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK 43
           R CGFGC GG P  AW+YWV  GIV+G  + + Q  K
Sbjct: 184 RTCGFGCEGGDPMFAWQYWVDHGIVTGSNFTANQGCK 220


>gi|157058763|gb|ABV03139.1| cathepsin B-348 [Acyrthosiphon pisum]
          Length = 248

 Score =  142 bits (359), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 79/178 (44%), Positives = 101/178 (56%), Gaps = 39/178 (21%)

Query: 101 REIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDL 160
           + I   G  GS  GC PYEIAPCEHHVNGTR  C    G TP CV++C+E Y VPY +DL
Sbjct: 110 KGIVSGGPYGSNMGCIPYEIAPCEHHVNGTRGPCKEG-GKTPTCVKKCEEGYKVPYAQDL 168

Query: 161 NFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWT 220
           + G  +YS+ ++   I +EIY +GPVEGAFTV++D I Y++G +                
Sbjct: 169 HHGKSAYSIRNDVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVY---------------- 212

Query: 221 IRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
                                 + +GKALGGHAIRILGWG  +  +  YWL+ANSWNT
Sbjct: 213 ---------------------KHVAGKALGGHAIRILGWGV-QNGEIPYWLVANSWNT 248



 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 24/30 (80%), Positives = 24/30 (80%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CGFGCNGGFPG AW YW   GIVSGG YGS
Sbjct: 91  CGFGCNGGFPGAAWNYWKTKGIVSGGPYGS 120


>gi|339242629|ref|XP_003377240.1| Gut-specific cysteine proteinase [Trichinella spiralis]
 gi|316973974|gb|EFV57515.1| Gut-specific cysteine proteinase [Trichinella spiralis]
          Length = 325

 Score =  142 bits (358), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 87/287 (30%), Positives = 128/287 (44%), Gaps = 95/287 (33%)

Query: 78  SEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAP---CEHHVNGTRP-- 132
           + +  DLP   D+R +WP C  I  +RDQ +CGSCW      +     C   +   +P  
Sbjct: 78  ANLSVDLPFEMDARKRWPQCKYIGFVRDQANCGSCWAVSSASVMTDRICIESIAAKQPLL 137

Query: 133 --------------SCD-----------ASKG--------------------------HT 141
                          CD           A++G                           T
Sbjct: 138 SEEELVSCCKICGYGCDGGYPDKAFIYWATRGIPTGGPYGSTKGCKPYSIGSNSEDEAET 197

Query: 142 PKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKS 201
           P C R+C   Y     +D +FG K Y V+SNE+ IM+E+Y++GPV  AF V++D + Y  
Sbjct: 198 PLCTRQCINEYPYNLSQDRHFGEKPYWVNSNEEQIMQELYKNGPVVVAFNVYEDFMYYIK 257

Query: 202 GRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE 261
           G +                                      ++ GK LGGHA++++GWG 
Sbjct: 258 GVY-------------------------------------EHRFGKFLGGHAVKLIGWGI 280

Query: 262 DEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           +  + +KYWLI+NSWNT WG+NG FKI+RGK+ C IES + AG+ ++
Sbjct: 281 E--NSKKYWLISNSWNTTWGENGFFKIIRGKNCCAIESYVVAGMARI 325



 Score = 44.7 bits (104), Expect = 0.048,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 26/37 (70%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK 43
           ++CG+GC+GG+P  A+ YW   GI +GG YGS +  K
Sbjct: 147 KICGYGCDGGYPDKAFIYWATRGIPTGGPYGSTKGCK 183


>gi|349956183|dbj|GAA30948.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 337

 Score =  142 bits (358), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 79/195 (40%), Positives = 103/195 (52%), Gaps = 40/195 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCR Y    C HH +   P C      TP CV++C +  D  Y  D      +Y+V + +
Sbjct: 177 GCRSYPFPRCSHHGSKKYPPCSHRIYDTPNCVQKC-DTPDTDYATDKTRANITYNVKAKQ 235

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            +IMKEI  +GPVE AF V++D + YKSG +F                            
Sbjct: 236 NAIMKEIMINGPVEAAFQVYEDFLGYKSGVYF---------------------------- 267

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    +  G  LGGHAIRILGWGE+  +   YWLIANSWN  WG++G FK+LRGK+
Sbjct: 268 ---------HSDGTLLGGHAIRILGWGEE--NGVAYWLIANSWNDGWGEDGCFKMLRGKN 316

Query: 294 ECGIESSITAGVPKL 308
           ECGIE  +TAG+P+L
Sbjct: 317 ECGIEDEVTAGLPEL 331



 Score = 45.1 bits (105), Expect = 0.047,   Method: Compositional matrix adjust.
 Identities = 17/27 (62%), Positives = 21/27 (77%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           CGFGC GGFP +AW +W   GIV+GG+
Sbjct: 145 CGFGCQGGFPPIAWDFWQTEGIVTGGS 171


>gi|126116630|gb|ABN79675.1| cathepsin B3 [Clonorchis sinensis]
          Length = 337

 Score =  142 bits (358), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 79/195 (40%), Positives = 103/195 (52%), Gaps = 40/195 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCR Y    C HH +   P C      TP CV++C +  D  Y  D      +Y+V + +
Sbjct: 177 GCRSYPFPRCSHHGSKKYPPCSHRIYDTPNCVQKC-DTPDTDYATDKTRANITYNVKAKQ 235

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            +IMKEI  +GPVE AF V++D + YKSG +F                            
Sbjct: 236 NAIMKEIMINGPVEAAFQVYEDFLGYKSGVYF---------------------------- 267

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    +  G  LGGHAIRILGWGE+  +   YWLIANSWN  WG++G FK+LRGK+
Sbjct: 268 ---------HSDGTLLGGHAIRILGWGEE--NGVAYWLIANSWNDGWGEDGYFKMLRGKN 316

Query: 294 ECGIESSITAGVPKL 308
           ECGIE  +TAG+P+L
Sbjct: 317 ECGIEDEVTAGLPEL 331



 Score = 44.3 bits (103), Expect = 0.078,   Method: Compositional matrix adjust.
 Identities = 17/27 (62%), Positives = 20/27 (74%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           CGFGC GGFP  AW +W   GIV+GG+
Sbjct: 145 CGFGCQGGFPPTAWDFWQTEGIVTGGS 171


>gi|167541036|gb|ABZ82028.1| cathepsin B endopeptidase [Clonorchis sinensis]
          Length = 228

 Score =  142 bits (357), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 79/195 (40%), Positives = 103/195 (52%), Gaps = 40/195 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCR Y    C HH +   P C      TP CV++C +  D  Y  D      +Y+V + +
Sbjct: 68  GCRSYPFPRCSHHGSKKYPPCSHRIYDTPNCVQKC-DTPDTDYATDKTRANITYNVKAKQ 126

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            +IMKEI  +GPVE AF V++D + YKSG +F                            
Sbjct: 127 NAIMKEIMINGPVEAAFQVYEDFLGYKSGVYF---------------------------- 158

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    +  G  LGGHAIRILGWGE+  +   YWLIANSWN  WG++G FK+LRGK+
Sbjct: 159 ---------HSDGTLLGGHAIRILGWGEE--NGVAYWLIANSWNDGWGEDGYFKMLRGKN 207

Query: 294 ECGIESSITAGVPKL 308
           ECGIE  +TAG+P+L
Sbjct: 208 ECGIEDEVTAGLPEL 222



 Score = 43.9 bits (102), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 17/27 (62%), Positives = 20/27 (74%)

Query: 9  CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
          CGFGC GGFP  AW +W   GIV+GG+
Sbjct: 36 CGFGCQGGFPPTAWDFWQTEGIVTGGS 62


>gi|38373697|gb|AAR19103.1| cathepsin B [Uronema marinum]
          Length = 350

 Score =  142 bits (357), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 79/195 (40%), Positives = 103/195 (52%), Gaps = 42/195 (21%)

Query: 115 CRPYEIAPCEHHVNGTRPSC-DASKGHTPKCVRECQENYDV-PYKKDLNFGAKSYSVSSN 172
           C+PY   PC HHV G   +C D  + +TPKC  EC   Y    Y++DL+ G  SYSV  +
Sbjct: 192 CQPYSFPPCSHHVQGEYQACTDLPQFNTPKCYTECNSQYTQNSYEQDLHKGVSSYSVPKS 251

Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
           E+ I  EIY++G    +F V+ D + Y SG                  +  NTS      
Sbjct: 252 EEQIKAEIYQYGSTTASFNVYSDFLTYSSG------------------VYQNTS------ 287

Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
                        G  +GGHAI++LGWG +  +   YWL ANSWN+ WG+NG FKILRG 
Sbjct: 288 -------------GSYMGGHAIKMLGWGVENGTP--YWLCANSWNSSWGENGFFKILRGS 332

Query: 293 DECGIESSITAG-VP 306
           +ECGIES + AG VP
Sbjct: 333 NECGIESGMVAGFVP 347



 Score = 43.5 bits (101), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 17/28 (60%), Positives = 21/28 (75%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
           CG GCNGG+   AW Y+VK+G+VSG  Y
Sbjct: 154 CGMGCNGGYTAGAWNYYVKTGLVSGNLY 181


>gi|28971815|dbj|BAC65419.1| cathepsin B [Pandalus borealis]
          Length = 328

 Score =  141 bits (356), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 100/334 (29%), Positives = 145/334 (43%), Gaps = 113/334 (33%)

Query: 40  QAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPT 99
           +A +N   +I +  LKS   V  + ++P   L  +    E+    P  FD+R +WP+CP 
Sbjct: 36  KAGRNFAKDISKDFLKSLNCVRKNPDIPKLPLKNVTPTKEI----PVEFDAREQWPHCPC 91

Query: 100 IREIRDQGSCGSCWG-----------CRPYE-----------IAPC-------------- 123
           I EIRDQG+CGSCW            C   E           +A C              
Sbjct: 92  IDEIRDQGNCGSCWAVSAASVMTDRTCIDTEGLVDFRFSSENVAACCTECGNACYGGDED 151

Query: 124 --------EHHVNGTRPSCDASKGHTPKCVRECQENYDVP-------------------- 155
                   +  V+G R   ++++G  P  V EC+ + + P                    
Sbjct: 152 TAFTHWVTKGFVSGGRH--NSNEGCQPYSVEECEHHIEGPRPPCEGDMPELVCSETCHEE 209

Query: 156 ----YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNET 211
               Y++DL +G ++Y +  +   I +EI  +GPV  AF V+DD + YKSG +       
Sbjct: 210 YGKTYEEDLEYGLEAYVLPQDVTQIQEEIMTNGPVTAAFAVYDDFLSYKSGVY------- 262

Query: 212 TAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWL 271
                                          +++G   G HA+R++GWGE+E +   YWL
Sbjct: 263 ------------------------------QHETGLLDGYHAVRVIGWGEEEGT--PYWL 290

Query: 272 IANSWNTDWGDNGLFKILRGKDECGIESSITAGV 305
           +ANSWNTDWGDNGLFKILRG DEC  E  + A  
Sbjct: 291 VANSWNTDWGDNGLFKILRGSDECEFEGDMAAAT 324


>gi|390994429|gb|AFM37364.1| cathepsin B1 [Dictyocaulus viviparus]
          Length = 350

 Score =  141 bits (355), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 71/195 (36%), Positives = 105/195 (53%), Gaps = 39/195 (20%)

Query: 115 CRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           C+PY   PC HH N      C      TP+C + CQ  Y   Y+ D  +G  +Y++ +NE
Sbjct: 194 CKPYAFHPCGHHRNEIYYGECPKEIFPTPQCTQSCQAGYASDYEDDKIYGKSAYALPNNE 253

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K+I +EI  +GPV+ AF V++D   Y+SG                               
Sbjct: 254 KAIQREIMTNGPVQAAFMVYEDFSRYRSG------------------------------- 282

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                 + ++ +G+  GGHA++++GWG D+    KYWL ANSWN+DWG+NG F+I+RG D
Sbjct: 283 ------IYVHTAGRREGGHAVKLIGWGVDDDGN-KYWLAANSWNSDWGENGYFRIVRGVD 335

Query: 294 ECGIESSITAGVPKL 308
            CGIES++ AG+P +
Sbjct: 336 HCGIESAVVAGMPDV 350



 Score = 41.6 bits (96), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 17/35 (48%), Positives = 22/35 (62%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK 43
           CG GC GG+P  AWRY++  G+ +GG Y  K   K
Sbjct: 161 CGRGCRGGYPIEAWRYFMLHGVCTGGHYAEKDVCK 195


>gi|118358706|ref|XP_001012594.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89294361|gb|EAR92349.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 346

 Score =  141 bits (355), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 74/195 (37%), Positives = 104/195 (53%), Gaps = 43/195 (22%)

Query: 115 CRPYEIAPCEHHVNG-TRPSCDASKGHTPKCVRECQEN--YDVPYKKDLNFGAKSYSVSS 171
           C+ Y  APC HHV     P C      TP C+  C  N  + +PY KD++ G+K+Y ++ 
Sbjct: 190 CQAYTFAPCAHHVTSDIYPPCTGELP-TPPCINSCDSNSTHTIPYSKDIHRGSKAYGIAK 248

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           +EK+IM EIY++GP+E A TV++D + YK+G +                           
Sbjct: 249 DEKAIMAEIYKNGPIEVALTVYEDFLTYKTGVY--------------------------- 281

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
                      + +G  LGGHA++++GWG +  +   YW I NSWN  WGD G FKILRG
Sbjct: 282 ----------QHVTGDELGGHAVKMVGWGVENGT--PYWTIVNSWNESWGDKGTFKILRG 329

Query: 292 KDECGIESSITAGVP 306
           K+ECGIESS    +P
Sbjct: 330 KNECGIESSCVTALP 344


>gi|268570495|ref|XP_002648548.1| Hypothetical protein CBG24861 [Caenorhabditis briggsae]
          Length = 323

 Score =  141 bits (355), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 91/307 (29%), Positives = 128/307 (41%), Gaps = 96/307 (31%)

Query: 59  GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW----- 113
           G++  Y  P +        + V   +P +FDSRT+W NC +I  IRDQ  CGSCW     
Sbjct: 56  GMNVKYAAPHSDEIRSTEVNNVLPFIPPSFDSRTRWSNCTSIEMIRDQAQCGSCWAFSTA 115

Query: 114 ------------GCRPYEIAPCEHHVNGTRPSCDASKGHTP------------------- 142
                       G +   I+P +          D  KG  P                   
Sbjct: 116 EVISDRICIATKGTQQPTISPTDMLACCGNSCGDGCKGRYPIQAFRWWNSRGVVTGGDFR 175

Query: 143 ---------------------KCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIY 181
                                 C   CQ  Y   Y KD  FG  +Y+V+ N  +I  EI 
Sbjct: 176 GSGCRPYPFAPCISCPEEKTPTCSLSCQFGYSTAYAKDKRFGVSAYAVARNVAAIQTEIM 235

Query: 182 EHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLI 241
            +GPV GAFT+++D+  YKSG +                                     
Sbjct: 236 TNGPVVGAFTMYEDMYKYKSGVY------------------------------------- 258

Query: 242 LYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSI 301
            + +G+ LGGHAI+I+GWG   ++   YWLIANSW  +WG+NG  K+ RG +ECGIE ++
Sbjct: 259 RHTAGRLLGGHAIKIIGWG--TQNGIPYWLIANSWGANWGENGFLKMRRGVNECGIERAV 316

Query: 302 TAGVPKL 308
            AG+P++
Sbjct: 317 VAGMPRV 323


>gi|255040225|gb|ACT99885.1| cathepsin B2 [Opisthorchis viverrini]
          Length = 337

 Score =  140 bits (354), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 77/195 (39%), Positives = 105/195 (53%), Gaps = 40/195 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCR Y    C HH +   P C      TPKCV +C +  ++ Y+ D      +Y+V  ++
Sbjct: 177 GCRSYPFPKCSHHGSKKYPPCPHRIYDTPKCVPKC-DTPNIDYETDKTRANITYNVQRSQ 235

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            +IMKEI  +GPVE AF V++D   YK G +F                            
Sbjct: 236 MAIMKEIMINGPVEAAFEVYEDFFGYKQGVYF---------------------------- 267

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G+ +GGHAIRILGWGE+  +   YWLIANSWN  WG++G FK+LRGK+
Sbjct: 268 ---------HSTGEFIGGHAIRILGWGEENGT--PYWLIANSWNEGWGEDGYFKMLRGKN 316

Query: 294 ECGIESSITAGVPKL 308
           ECGIE  +TAG+P+L
Sbjct: 317 ECGIEDEVTAGLPEL 331



 Score = 42.4 bits (98), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 16/27 (59%), Positives = 20/27 (74%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           CGFGC GG+P  AW +W   GIV+GG+
Sbjct: 145 CGFGCQGGYPPAAWDFWQAYGIVTGGS 171


>gi|324507953|gb|ADY43363.1| Cathepsin B cysteine proteinase 6 [Ascaris suum]
          Length = 352

 Score =  140 bits (354), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 79/199 (39%), Positives = 107/199 (53%), Gaps = 43/199 (21%)

Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENY-DVPYKKDLNFGAKSYSVSS 171
           GC+PY   PCEHH N T    C      TPKC ++C + Y +  Y +D  FG  +Y V  
Sbjct: 177 GCKPYPFPPCEHHSNKTHYQPCKHDLYPTPKCEKKCLDIYTEKTYAEDKFFGETAYGVED 236

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           +  SI KEI  HGPVE AF V++D                                    
Sbjct: 237 DVTSIQKEILTHGPVEVAFEVYED------------------------------------ 260

Query: 232 GAFTVFDD-LILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILR 290
             F ++D  + ++  GK  GGHA+++LGWG ++     YWL+ANSWNTDWG++G F+I+R
Sbjct: 261 --FLMYDGGIYVHTGGKIGGGHAVKMLGWGVEQGVP--YWLVANSWNTDWGEDGFFRIIR 316

Query: 291 GKDECGIESSITAGVPKLD 309
           G DECGIESS+  G+PKL+
Sbjct: 317 GIDECGIESSVVGGLPKLN 335



 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 21/37 (56%), Positives = 26/37 (70%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK 43
           + CGFGC+GG P  AW+YWVK GIV+G  +  KQ  K
Sbjct: 143 KSCGFGCDGGDPMAAWKYWVKEGIVTGSNFTMKQGCK 179


>gi|390994433|gb|AFM37366.1| cathepsin B3 [Dictyocaulus viviparus]
          Length = 342

 Score =  140 bits (354), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 92/297 (30%), Positives = 132/297 (44%), Gaps = 113/297 (38%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPA 68
           CGFGC+GGFP  AW Y+V +G+V+GG YG+K        N  R +  S  G HP+     
Sbjct: 158 CGFGCDGGFPDAAWEYFVSTGVVTGGLYGTK--------NACRPYEISPCGNHPN----- 204

Query: 69  NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVN 128
                                  T + NC  +                            
Sbjct: 205 ----------------------ETFYRNCTGV---------------------------- 214

Query: 129 GTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEG 188
            + PSC  S          CQ+ Y V YK D   G KSY+++++  +I K+I +HGP+  
Sbjct: 215 -STPSCKTS----------CQKGYPVSYKDDKTRGRKSYNLANSVSAIQKDILKHGPLVA 263

Query: 189 AFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKA 248
            F+V++D + YK G                                     +  Y  G  
Sbjct: 264 TFSVYEDFMYYKKG-------------------------------------IYRYTHGGY 286

Query: 249 LGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGV 305
            GGHA+RILGWG +  +  KYW+IANSWNTDWG++G F+++RG ++CGIE S++AG+
Sbjct: 287 EGGHAVRILGWGVE--NNVKYWIIANSWNTDWGEDGFFRMVRGINDCGIEESVSAGL 341



 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 22/43 (51%), Positives = 29/43 (67%)

Query: 72  PELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           P+L    E    +P +FD+RT+WP+CP+I  IRDQ  CGSCW 
Sbjct: 81  PQLQENEEDTAGIPESFDARTQWPHCPSISLIRDQADCGSCWA 123


>gi|268555420|ref|XP_002635699.1| Hypothetical protein CBG22436 [Caenorhabditis briggsae]
          Length = 317

 Score =  140 bits (354), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 98/305 (32%), Positives = 130/305 (42%), Gaps = 101/305 (33%)

Query: 63  DYNLPANRLPELIG--YSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW------- 113
           D    A   PEL     + V   +P  FD+RT+WPNC +I+ IR+Q +CGSCW       
Sbjct: 52  DVKYAAPHSPELRASQVNTVLPSIPTYFDARTRWPNCRSIKMIRNQATCGSCWAFGAAEV 111

Query: 114 ----------GCRPYEIAP----------CEHHVNGTRP--------------------- 132
                     G +   I+P          C +   G  P                     
Sbjct: 112 MSDRICIASMGTKQPIISPTDLLSCCGNFCGYGCKGASPLQAFRWWNKKGVVTGGDYRGS 171

Query: 133 --------SCDA---SKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIY 181
                    C A   +K  TP+C   CQ  Y   Y KD  FG  +Y V  +  +I  EI 
Sbjct: 172 GCKPYPFAPCTALPCTKSETPRCSLNCQPAYSKAYSKDKYFGTPAYIVGMDVAAIQTEI- 230

Query: 182 EHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLI 241
            +GPVE AF V+DD   Y+SG +                                     
Sbjct: 231 TNGPVEAAFIVYDDFNHYRSGVY------------------------------------- 253

Query: 242 LYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSI 301
            + +GK +GGHA++I+GWG   ++   YWL+ANSW   WG+NG FK+LRG DECGIES+I
Sbjct: 254 RHVAGKLVGGHAVKIIGWG--IQNGAPYWLMANSWGPYWGENGFFKMLRGVDECGIESTI 311

Query: 302 TAGVP 306
            AG P
Sbjct: 312 VAGKP 316



 Score = 38.1 bits (87), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 15/29 (51%), Positives = 20/29 (68%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
            CG+GC G  P  A+R+W K G+V+GG Y
Sbjct: 140 FCGYGCKGASPLQAFRWWNKKGVVTGGDY 168


>gi|193209594|ref|NP_001123113.1| Protein CPR-6, isoform c [Caenorhabditis elegans]
 gi|351058222|emb|CCD65637.1| Protein CPR-6, isoform c [Caenorhabditis elegans]
          Length = 369

 Score =  140 bits (354), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 76/198 (38%), Positives = 105/198 (53%), Gaps = 41/198 (20%)

Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENY-DVPYKKDLNFGAKSYSVSS 171
           GC+PY   PCEHH   T    C      TPKC ++C  +Y D  Y +D  FGA +Y V  
Sbjct: 192 GCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKCEKKCVSDYTDKTYSEDKFFGASAYGVKD 251

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           + ++I KE+  HGP+E AF V++D + Y  G +                           
Sbjct: 252 DVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVY--------------------------- 284

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
                     ++  GK  GGHA++++GWG D+     YW +ANSWNTDWG++G F+ILRG
Sbjct: 285 ----------VHTGGKLGGGHAVKLIGWGIDDGIP--YWTVANSWNTDWGEDGFFRILRG 332

Query: 292 KDECGIESSITAGVPKLD 309
            DECGIES +  G+PKL+
Sbjct: 333 VDECGIESGVVGGIPKLN 350



 Score = 54.7 bits (130), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 21/36 (58%), Positives = 27/36 (75%)

Query: 79  EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           ++D D+P +FDSR  WP C +I+ IRDQ SCGSCW 
Sbjct: 90  DLDLDIPESFDSRDNWPKCDSIKVIRDQSSCGSCWA 125



 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 22/37 (59%), Positives = 25/37 (67%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK 43
           + CGFGCNGG P  AWRYWVK GIV+G  Y +    K
Sbjct: 158 KSCGFGCNGGDPLAAWRYWVKDGIVTGSNYTANNGCK 194


>gi|71984043|ref|NP_001024426.1| Protein CPR-6, isoform b [Caenorhabditis elegans]
 gi|351058214|emb|CCD65629.1| Protein CPR-6, isoform b [Caenorhabditis elegans]
          Length = 378

 Score =  140 bits (354), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 76/198 (38%), Positives = 105/198 (53%), Gaps = 41/198 (20%)

Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENY-DVPYKKDLNFGAKSYSVSS 171
           GC+PY   PCEHH   T    C      TPKC ++C  +Y D  Y +D  FGA +Y V  
Sbjct: 201 GCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKCEKKCVSDYTDKTYSEDKFFGASAYGVKD 260

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           + ++I KE+  HGP+E AF V++D + Y  G +                           
Sbjct: 261 DVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVY--------------------------- 293

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
                     ++  GK  GGHA++++GWG D+     YW +ANSWNTDWG++G F+ILRG
Sbjct: 294 ----------VHTGGKLGGGHAVKLIGWGIDDGIP--YWTVANSWNTDWGEDGFFRILRG 341

Query: 292 KDECGIESSITAGVPKLD 309
            DECGIES +  G+PKL+
Sbjct: 342 VDECGIESGVVGGIPKLN 359



 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 21/36 (58%), Positives = 27/36 (75%)

Query: 79  EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           ++D D+P +FDSR  WP C +I+ IRDQ SCGSCW 
Sbjct: 99  DLDLDIPESFDSRDNWPKCDSIKVIRDQSSCGSCWA 134



 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 22/37 (59%), Positives = 25/37 (67%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK 43
           + CGFGCNGG P  AWRYWVK GIV+G  Y +    K
Sbjct: 167 KSCGFGCNGGDPLAAWRYWVKDGIVTGSNYTANNGCK 203


>gi|25146613|ref|NP_741818.1| Protein CPR-6, isoform a [Caenorhabditis elegans]
 gi|1169087|sp|P43510.1|CPR6_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 6; AltName:
           Full=Cysteine protease-related 6; Flags: Precursor
 gi|671715|gb|AAA98787.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|695294|gb|AAA98789.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|351058213|emb|CCD65628.1| Protein CPR-6, isoform a [Caenorhabditis elegans]
          Length = 379

 Score =  140 bits (353), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 76/198 (38%), Positives = 105/198 (53%), Gaps = 41/198 (20%)

Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENY-DVPYKKDLNFGAKSYSVSS 171
           GC+PY   PCEHH   T    C      TPKC ++C  +Y D  Y +D  FGA +Y V  
Sbjct: 202 GCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKCEKKCVSDYTDKTYSEDKFFGASAYGVKD 261

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           + ++I KE+  HGP+E AF V++D + Y  G +                           
Sbjct: 262 DVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVY--------------------------- 294

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
                     ++  GK  GGHA++++GWG D+     YW +ANSWNTDWG++G F+ILRG
Sbjct: 295 ----------VHTGGKLGGGHAVKLIGWGIDDGIP--YWTVANSWNTDWGEDGFFRILRG 342

Query: 292 KDECGIESSITAGVPKLD 309
            DECGIES +  G+PKL+
Sbjct: 343 VDECGIESGVVGGIPKLN 360



 Score = 54.7 bits (130), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 21/36 (58%), Positives = 27/36 (75%)

Query: 79  EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           ++D D+P +FDSR  WP C +I+ IRDQ SCGSCW 
Sbjct: 100 DLDLDIPESFDSRDNWPKCDSIKVIRDQSSCGSCWA 135



 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 22/37 (59%), Positives = 25/37 (67%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK 43
           + CGFGCNGG P  AWRYWVK GIV+G  Y +    K
Sbjct: 168 KSCGFGCNGGDPLAAWRYWVKDGIVTGSNYTANNGCK 204


>gi|345308|pir||S31909 cathepsin B-like cysteine proteinase (EC 3.4.22.-) - fluke
           (Schistosoma japonicum)
          Length = 316

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 71/191 (37%), Positives = 99/191 (51%), Gaps = 39/191 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY    CEHH  G  PSC      TP+C R+CQ+ Y  PY+ D ++G  S +V  NE
Sbjct: 161 GCQPYPFPKCEHHSKGKYPSCGDKMYKTPQCKRKCQKGYKTPYEHDKHYGGISINVIKNE 220

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            +I KEI  +GPVE    +F+D + YKSG                               
Sbjct: 221 SAIQKEIMMYGPVEAYLLIFEDFLNYKSG------------------------------- 249

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                 +  Y +G  +G H +RI+GWG +  +   YWL AN+WN DWG+ G F+I+RG++
Sbjct: 250 ------IYRYTTGSFVGEHYVRIIGWGIENGT--AYWLAANTWNEDWGEKGYFRIVRGRN 301

Query: 294 ECGIESSITAG 304
           EC +ES + AG
Sbjct: 302 ECSVESVVVAG 312



 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 23/56 (41%), Positives = 33/56 (58%), Gaps = 1/56 (1%)

Query: 59  GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           G   D NL   R P  + + ++  ++P++FDSR KWP C +I +IRDQ  C S W 
Sbjct: 40  GRREDPNLRQKRRP-TVDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSRCASSWA 94



 Score = 47.0 bits (110), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 19/27 (70%), Positives = 22/27 (81%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           CG GC+GGFPG AW YWV  GIV+GG+
Sbjct: 129 CGSGCDGGFPGPAWDYWVSHGIVTGGS 155


>gi|121073189|gb|ABM47071.1| cathepsin B2 [Clonorchis sinensis]
 gi|358341868|dbj|GAA36574.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 343

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 79/195 (40%), Positives = 99/195 (50%), Gaps = 40/195 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY    C+HH  G  P C      TPKCV+ C +   + Y+KD      SY+V  +E
Sbjct: 183 GCRPYPFPKCQHHSQGHYPPCPRRIYPTPKCVKHC-DTPKIDYQKDKTRANTSYNVHQSE 241

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            +IMKEI  +GPVE  F V +D   YKSG                               
Sbjct: 242 VAIMKEILLNGPVEATFEVHEDFPEYKSG------------------------------- 270

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                 +  +  G ++GGHAIRILGWGE+  +   YWLIANSWN DWG+ G  + LRG +
Sbjct: 271 ------IYFHAWGGSVGGHAIRILGWGEE--NGVPYWLIANSWNEDWGEKGYLRFLRGHN 322

Query: 294 ECGIESSITAGVPKL 308
           ECGIE   TAG+P L
Sbjct: 323 ECGIEEEATAGLPDL 337


>gi|157058769|gb|ABV03142.1| cathepsin B-348 [Myzus persicae]
          Length = 246

 Score =  139 bits (351), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 78/178 (43%), Positives = 101/178 (56%), Gaps = 39/178 (21%)

Query: 101 REIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDL 160
           + I   G  GS  GC PYEIAPCEHHVNGTR  C    G TP CV++C++ Y VPY +DL
Sbjct: 108 KGIVSGGPYGSKMGCIPYEIAPCEHHVNGTRGPCKEG-GKTPACVKKCEDGYKVPYAQDL 166

Query: 161 NFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWT 220
           + G  +YS+ ++   I +EIY +GPVEGAFTV++D I Y++G +                
Sbjct: 167 HRGKSAYSLGNDVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVY---------------- 210

Query: 221 IRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
                                 + +GKALGGHAIRILGWG  +  +  YWL+ANSWNT
Sbjct: 211 ---------------------KHVAGKALGGHAIRILGWGV-QNGEIPYWLVANSWNT 246



 Score = 74.7 bits (182), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 29/44 (65%), Positives = 36/44 (81%)

Query: 70  RLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
           +L +L+ Y++   DLP NFD+R  WPNCPTIRE+RDQGSCGSCW
Sbjct: 10  KLEQLVSYTDTPTDLPENFDAREHWPNCPTIREVRDQGSCGSCW 53



 Score = 57.8 bits (138), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 25/32 (78%), Positives = 25/32 (78%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CGFGCNGGFPG AW YW   GIVSGG YGSK 
Sbjct: 89  CGFGCNGGFPGAAWHYWKTKGIVSGGPYGSKM 120


>gi|393909827|gb|EJD75608.1| cysteine endopeptidase [Loa loa]
          Length = 383

 Score =  139 bits (351), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 81/202 (40%), Positives = 101/202 (50%), Gaps = 54/202 (26%)

Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
           GCRPY   PCEHH N T    C      TPKCV++C +NY   YK D  +G + Y+V SN
Sbjct: 220 GCRPYPFPPCEHHNNKTHYEPCKHDLYPTPKCVKKCDKNYGKSYKADKYYGEQVYNVESN 279

Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
            +SI KEI   GPVE +                                           
Sbjct: 280 VESIQKEIMTLGPVEAS------------------------------------------- 296

Query: 233 AFTVFDDLILYK-------SGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
            F V+ D + Y        +G   GGHA+++LGWG D+     YWL ANSWNTDWG++G 
Sbjct: 297 -FEVYTDFLYYTGGIYKHVAGSMGGGHAVKVLGWGIDQGVP--YWLAANSWNTDWGEDGY 353

Query: 286 FKILRGKDECGIESSITAGVPK 307
           F+ILRG +ECGIES I AG+PK
Sbjct: 354 FRILRGVNECGIESGIIAGIPK 375



 Score = 50.8 bits (120), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 18/36 (50%), Positives = 24/36 (66%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYE 119
           +P +FD+R  WP C ++R +RDQ SCGSCW     E
Sbjct: 123 IPESFDARKHWPECASLRNVRDQSSCGSCWAVAAVE 158



 Score = 44.7 bits (104), Expect = 0.061,   Method: Compositional matrix adjust.
 Identities = 18/30 (60%), Positives = 21/30 (70%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
           + CGFGC GG P  AW+YWV  GIV+G  Y
Sbjct: 186 KTCGFGCFGGEPMAAWKYWVLRGIVTGSEY 215


>gi|3929733|emb|CAA77178.1| cathepsin B [Homo sapiens]
          Length = 195

 Score =  139 bits (351), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 74/164 (45%), Positives = 96/164 (58%), Gaps = 40/164 (24%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D ++G  SYSVS++E
Sbjct: 72  GCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKICEPGYSPTYKQDKHYGYDSYSVSNSE 130

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAF+V+ D +LYKSG +                             
Sbjct: 131 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 161

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWN 277
                    + +G+ +GGHAIRILGWG +  +   YWL+ANSWN
Sbjct: 162 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWN 195



 Score = 45.4 bits (106), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 18/31 (58%), Positives = 23/31 (74%)

Query: 8  LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
          +CG GCNGG+P  AW +W + G+VSGG Y S
Sbjct: 39 MCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 69


>gi|225437812|ref|XP_002281936.1| PREDICTED: cathepsin B-like isoform 1 [Vitis vinifera]
 gi|359480250|ref|XP_003632421.1| PREDICTED: cathepsin B-like [Vitis vinifera]
          Length = 358

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 97/323 (30%), Positives = 136/323 (42%), Gaps = 94/323 (29%)

Query: 37  GSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
           G K A     SN         +GV P        +P +     +   LP +FD+RT WP 
Sbjct: 56  GWKAAMNPRFSNYSVGQFMHLLGVKPTLQKDLEGVPVITHPKTLK--LPKHFDARTAWPQ 113

Query: 97  CPTIREIRDQGSCGSCWGCRPYE---------------------IAPCE----------- 124
           C TI +I DQG CGSCW     E                     +A C            
Sbjct: 114 CSTIGKILDQGHCGSCWAFGAVESLSDRFCIHFGMNISLSVNDLLACCGFLCGSGCDGGY 173

Query: 125 ---------HHVNGTR---PSCDASKGHTP---------KCVRECQENYDVPYKKDLNFG 163
                    HH   T    P  DA+    P         KCVR+C +   + ++K   +G
Sbjct: 174 PLYAWRYFIHHGVVTEECDPYFDATGCSHPGCEPGYPTPKCVRKCTDENQL-WRKAKRYG 232

Query: 164 AKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRD 223
             +Y +SS+   IM E+Y++GPVE AFTV++D   Y+SG +                   
Sbjct: 233 QSAYRISSDPYQIMAEVYKNGPVEVAFTVYEDFAHYESGVY------------------- 273

Query: 224 NTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDN 283
                              Y +G  +GGHA++++GWG  +   E YW++AN WN +WGD+
Sbjct: 274 ------------------RYTTGDVMGGHAVKLIGWGTTDDG-EDYWILANQWNRNWGDD 314

Query: 284 GLFKILRGKDECGIESSITAGVP 306
           G F I RG +ECGIE  + AG+P
Sbjct: 315 GYFMIRRGVNECGIEEGVVAGLP 337



 Score = 38.9 bits (89), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 14/25 (56%), Positives = 20/25 (80%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVS 32
           LCG GC+GG+P  AWRY++  G+V+
Sbjct: 164 LCGSGCDGGYPLYAWRYFIHHGVVT 188


>gi|226469950|emb|CAX70256.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 72/191 (37%), Positives = 99/191 (51%), Gaps = 39/191 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY    CEHH  G  PSC      TP+C R+CQ+ Y  PY+ D ++G  S +V  NE
Sbjct: 187 GCQPYPFPKCEHHSIGKYPSCGDKIYKTPQCKRKCQKGYTTPYEHDKHYGGISINVIKNE 246

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            +I KEI  +GPVE    +F+D + YKSG                               
Sbjct: 247 SAIQKEIMMYGPVEAYLLIFEDFLNYKSG------------------------------- 275

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                 +  Y +G  +G H +RI+GWG +  +   YWL AN+WN DWG+ G F+I+RG++
Sbjct: 276 ------IYRYTTGSFVGEHYVRIIGWGIENGT--AYWLAANTWNEDWGEKGYFRIVRGRN 327

Query: 294 ECGIESSITAG 304
           EC IES + AG
Sbjct: 328 ECSIESVVVAG 338



 Score = 47.4 bits (111), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 19/27 (70%), Positives = 22/27 (81%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           CG GC+GGFPG AW YWV  GIV+GG+
Sbjct: 155 CGSGCDGGFPGPAWDYWVSHGIVTGGS 181


>gi|94958151|gb|ABF47216.1| cathepsin B [Nicotiana benthamiana]
          Length = 356

 Score =  139 bits (349), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 102/327 (31%), Positives = 135/327 (41%), Gaps = 102/327 (31%)

Query: 37  GSKQAEKNSLSNIPRAHLKSWMGVHPDYN-----LPANRLPELIGYSEVDEDLPANFDSR 91
           G K A     SN   +  K  +GV P        +P    P+L+       +LP  FD+R
Sbjct: 55  GWKAALNPRFSNFTVSQFKRLLGVKPTRKGDLKGIPILTHPKLL-------ELPQEFDAR 107

Query: 92  TKWPNCPTIREIRDQGSCGSCWGCRPYEIAP---CEHH-------VNGTRPSC-----DA 136
             WPNC TI  I DQG CGSCW     E      C H+        N     C     D 
Sbjct: 108 VAWPNCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHYGLNISLSANDLLACCGFLCGDG 167

Query: 137 SKGHTP-----KCVRE--------------------CQENYDVP------------YKKD 159
             G  P       VR+                    C+  Y  P            + K 
Sbjct: 168 CDGGYPLQAWKYFVRKGVVTDECDPYFDNEGCSHPGCEPAYPTPKCHRKCVKQNLLWSKS 227

Query: 160 LNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKW 219
            +FG  +Y +SS+  SIM E+Y++GPVE +FTV++D   YKSG +               
Sbjct: 228 KHFGVNAYMISSDPHSIMTELYKNGPVEVSFTVYEDFAHYKSGVY--------------- 272

Query: 220 TIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
                                  + +G  +GGHA++++GWG  E   E YWL+AN WN  
Sbjct: 273 ----------------------KHVTGDVMGGHAVKLIGWGTSEDG-EDYWLLANQWNRG 309

Query: 280 WGDNGLFKILRGKDECGIESSITAGVP 306
           WGD+G FKI RG DEC IE  + AG+P
Sbjct: 310 WGDDGYFKIRRGTDECEIEDEVVAGLP 336



 Score = 39.7 bits (91), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 14/25 (56%), Positives = 21/25 (84%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVS 32
           LCG GC+GG+P  AW+Y+V+ G+V+
Sbjct: 163 LCGDGCDGGYPLQAWKYFVRKGVVT 187


>gi|56756475|gb|AAW26410.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  139 bits (349), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 75/209 (35%), Positives = 108/209 (51%), Gaps = 39/209 (18%)

Query: 99  TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
            +R I   GS  +  GCRPY    C+H V G   +C      TP+C + CQ+ Y+  Y++
Sbjct: 172 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQ 231

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
           D ++G  SY+V S E  I K+I  HGPVE    +++D + YKSG                
Sbjct: 232 DKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG---------------- 275

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
                                +  Y +GK + GHA+R++GWG +  +   YWL AN+WN 
Sbjct: 276 ---------------------IYRYTTGKYISGHAVRLIGWGVENGT--AYWLAANTWNE 312

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
           DWG+ G F+I+RG++EC IES I AG+ K
Sbjct: 313 DWGEKGYFRIVRGRNECLIESEIAAGLIK 341



 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 1/56 (1%)

Query: 59  GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           G   D NL   R P  + + +++ ++P++FDSR KWP C +I +IRDQ  CGS W 
Sbjct: 66  GRREDPNLRQKRRP-TVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120



 Score = 42.4 bits (98), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 17/29 (58%), Positives = 22/29 (75%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           + CG GC+GGF G +W YWV  GIV+GG+
Sbjct: 153 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 181


>gi|217072748|gb|ACJ84734.1| unknown [Medicago truncatula]
 gi|388505480|gb|AFK40806.1| unknown [Medicago truncatula]
          Length = 359

 Score =  139 bits (349), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 92/284 (32%), Positives = 128/284 (45%), Gaps = 108/284 (38%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCWG-----------CRPYEI------------ 120
           LP  FD+RT W  C TI +I DQG CGSCW            C  +++            
Sbjct: 103 LPKEFDARTAWSQCSTIGKILDQGHCGSCWAFGAVESLQDRFCIHFDMNISLSVNDLLAC 162

Query: 121 ------APCE------------HH-------------VNGTRPSCDASKGHTPKCVRECQ 149
                 A C+            HH             +  + P C+ +   TPKCVR+C 
Sbjct: 163 CGFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPAY-QTPKCVRKCV 221

Query: 150 ENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGN 209
           +   + +K+  ++  K+Y V S+ + IM E+Y++GPVE A                    
Sbjct: 222 KGNQI-WKRSKHYSVKAYRVKSDPQDIMAEVYKNGPVEVA-------------------- 260

Query: 210 ETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGED 262
                                   FTVF+D   YKSG        ALGGHA++++GWG  
Sbjct: 261 ------------------------FTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTS 296

Query: 263 EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           ++  E YWL+AN WNT+WGD+G FKI RG +ECGIE  +TAG+P
Sbjct: 297 DEG-EDYWLLANQWNTNWGDDGYFKIKRGTNECGIEDDVTAGLP 339


>gi|226474180|emb|CAX71576.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  139 bits (349), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 74/209 (35%), Positives = 108/209 (51%), Gaps = 39/209 (18%)

Query: 99  TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
            +R I   GS  +  GCRPY    C+H V G   +C      TP+C + CQ+ Y+  Y++
Sbjct: 172 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYETPQCKQTCQKGYNTSYEQ 231

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
           D ++G  SY+V S E  I K+I  HGPVE    +++D + YKSG                
Sbjct: 232 DKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG---------------- 275

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
                                +  Y +G+ + GHA+R++GWG +  +   YWL AN+WN 
Sbjct: 276 ---------------------IYRYTTGQFISGHAVRLIGWGVENGT--AYWLAANTWNE 312

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
           DWG+ G F+I+RG++EC IES I AG+ K
Sbjct: 313 DWGEKGYFRIVRGRNECSIESEIAAGLIK 341



 Score = 42.4 bits (98), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 17/29 (58%), Positives = 22/29 (75%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           + CG GC+GGF G +W YWV  GIV+GG+
Sbjct: 153 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 181


>gi|56756410|gb|AAW26378.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  138 bits (348), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 74/209 (35%), Positives = 108/209 (51%), Gaps = 39/209 (18%)

Query: 99  TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
            +R I   GS  +  GCRPY    C+H V G   +C      TP+C + CQ+ Y+  Y++
Sbjct: 172 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQ 231

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
           D ++G  SY+V S E  I K+I  HGPVE    +++D + YKSG                
Sbjct: 232 DKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG---------------- 275

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
                                +  Y +G+ + GHA+R++GWG +  +   YWL AN+WN 
Sbjct: 276 ---------------------IYRYTTGQFISGHAVRLIGWGVENGT--AYWLAANTWNE 312

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
           DWG+ G F+I+RG++EC IES I AG+ K
Sbjct: 313 DWGEKGYFRIVRGRNECSIESEIAAGLIK 341



 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 24/56 (42%), Positives = 34/56 (60%), Gaps = 1/56 (1%)

Query: 59  GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           G   D NL   R P  + + ++  ++P++FDSR KWP C +I +IRDQ  CGS W 
Sbjct: 66  GRKEDPNLRQKRRP-TVDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120



 Score = 42.4 bits (98), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 17/29 (58%), Positives = 22/29 (75%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           + CG GC+GGF G +W YWV  GIV+GG+
Sbjct: 153 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 181


>gi|224285427|gb|ACN40436.1| unknown [Picea sitchensis]
          Length = 350

 Score =  138 bits (348), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 99/328 (30%), Positives = 139/328 (42%), Gaps = 105/328 (32%)

Query: 37  GSKQAEKNSLSNIPRAHLKSWMGVHPD-----YNLPANRLPELIGYSEVDEDLPANFDSR 91
           G K    +  SN      K  +GV P       N+P    P+ I       +LP  FD+R
Sbjct: 51  GWKAGMNSRFSNHTVGQFKRLLGVLPTPRNFLENVPVITYPKGI-------NLPKQFDAR 103

Query: 92  TKWPNCPTIREIRDQGSCGSCWGCRPYEIAP---CEHH-VNGTRP-----SC-------- 134
             WP C +++ I DQG CGSCW     E      C HH VN T       +C        
Sbjct: 104 EAWPQCTSVQTILDQGHCGSCWAFGAVEALSDRFCIHHKVNVTLSENDLVACCGFMCGDG 163

Query: 135 ----------------------------DASKGH--------TPKCVRECQENYDVPYKK 158
                                       DA   H        TP+CV++C++  +  +  
Sbjct: 164 CDGGYPISAWQYFISTGVVTAECDPYFDDAGCQHPGCEPLYPTPQCVKQCKDE-NQKWGN 222

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
              F A +Y +SS    IM E+Y +GPVE +F+V++D   YKSG +              
Sbjct: 223 SKRFSATAYRISSKPYDIMAEVYTNGPVEVSFSVYEDFAHYKSGVY-------------- 268

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
                                   Y  G  +GGHA++++GWG ++ +   YWL+ANSWNT
Sbjct: 269 -----------------------KYTKGDYMGGHAVKLVGWGTEDGT--DYWLVANSWNT 303

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVP 306
            WG++G FKI RG +ECGIE  + AG+P
Sbjct: 304 AWGEDGYFKIARGSNECGIEGDVVAGMP 331


>gi|56757646|gb|AAW26973.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  138 bits (348), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 74/209 (35%), Positives = 107/209 (51%), Gaps = 39/209 (18%)

Query: 99  TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
            +R I   GS  +  GCRPY    C+H V G   +C      TP+C + CQ+ Y+  Y++
Sbjct: 172 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQ 231

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
           D ++G  SY+V S E    K+I  HGPVE    +++D + YKSG                
Sbjct: 232 DKHYGGFSYNVLSGESVFQKDIMMHGPVEAYLEIYEDFLNYKSG---------------- 275

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
                                +  Y +GK + GHA+R++GWG +  +   YWL AN+WN 
Sbjct: 276 ---------------------IYRYTTGKYISGHAVRLIGWGVENGT--AYWLAANTWNE 312

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
           DWG+ G F+I+RG++EC IES I AG+ K
Sbjct: 313 DWGEKGYFRIVRGRNECSIESEIAAGLIK 341



 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 1/56 (1%)

Query: 59  GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           G   D NL   R P  + + +++ ++P++FDSR KWP C +I +IRDQ  CGS W 
Sbjct: 66  GRREDPNLREKRRP-TVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120



 Score = 42.4 bits (98), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 17/29 (58%), Positives = 22/29 (75%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           + CG GC+GGF G +W YWV  GIV+GG+
Sbjct: 153 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 181


>gi|56756907|gb|AAW26625.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  138 bits (348), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 75/209 (35%), Positives = 108/209 (51%), Gaps = 39/209 (18%)

Query: 99  TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
            +R I   GS  +  GCRPY    C+H V G   +C      TP+C + CQ+ Y+  Y++
Sbjct: 172 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQ 231

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
           D ++G  SY+V S E  I K+I  HGPVE    +++D + YKSG                
Sbjct: 232 DKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG---------------- 275

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
                                +  Y +GK + GHA+R++GWG +  +   YWL AN+WN 
Sbjct: 276 ---------------------IYRYTTGKYISGHAVRLIGWGVENGT--AYWLAANTWNE 312

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
           DWG+ G F+I+RG++EC IES I AG+ K
Sbjct: 313 DWGEKGYFRIVRGRNECLIESEIAAGLIK 341



 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 1/56 (1%)

Query: 59  GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           G   D NL   R P  + + +++ ++P++FDSR KWP C +I +IRDQ  CGS W 
Sbjct: 66  GRREDPNLREKRRP-TVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120



 Score = 42.4 bits (98), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 17/29 (58%), Positives = 22/29 (75%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           + CG GC+GGF G +W YWV  GIV+GG+
Sbjct: 153 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 181


>gi|357511629|ref|XP_003626103.1| Cathepsin B [Medicago truncatula]
 gi|87240982|gb|ABD32840.1| Peptidase C1A, papain; Somatotropin hormone; Peptidase C1,
           propeptide [Medicago truncatula]
 gi|355501118|gb|AES82321.1| Cathepsin B [Medicago truncatula]
          Length = 357

 Score =  138 bits (348), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 92/284 (32%), Positives = 128/284 (45%), Gaps = 108/284 (38%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCWG-----------CRPYEI------------ 120
           LP  FD+RT W  C TI +I DQG CGSCW            C  +++            
Sbjct: 101 LPKEFDARTAWSQCSTIGKILDQGHCGSCWAFGAVESLQDRFCIHFDMNISLSVNDLLAC 160

Query: 121 ------APCE------------HH-------------VNGTRPSCDASKGHTPKCVRECQ 149
                 A C+            HH             +  + P C+ +   TPKCVR+C 
Sbjct: 161 CGFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPAY-QTPKCVRKCV 219

Query: 150 ENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGN 209
           +   + +K+  ++  K+Y V S+ + IM E+Y++GPVE A                    
Sbjct: 220 KGNQI-WKRSKHYSVKAYRVKSDPQDIMAEVYKNGPVEVA-------------------- 258

Query: 210 ETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGED 262
                                   FTVF+D   YKSG        ALGGHA++++GWG  
Sbjct: 259 ------------------------FTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTS 294

Query: 263 EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           ++  E YWL+AN WNT+WGD+G FKI RG +ECGIE  +TAG+P
Sbjct: 295 DEG-EDYWLLANQWNTNWGDDGYFKIKRGTNECGIEDDVTAGLP 337


>gi|56752809|gb|AAW24616.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  138 bits (348), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 74/209 (35%), Positives = 108/209 (51%), Gaps = 39/209 (18%)

Query: 99  TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
            +R I   GS  +  GCRPY    C+H V G   +C      TP+C + CQ+ Y+  Y++
Sbjct: 172 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQ 231

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
           D ++G  SY+V S E  I K+I  HGPVE    +++D + YKSG                
Sbjct: 232 DKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG---------------- 275

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
                                +  Y +G+ + GHA+R++GWG +  +   YWL AN+WN 
Sbjct: 276 ---------------------IYRYTTGQFISGHAVRLIGWGVENGT--AYWLAANTWNE 312

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
           DWG+ G F+I+RG++EC IES I AG+ K
Sbjct: 313 DWGEKGYFRIVRGRNECSIESEIAAGLIK 341



 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 24/56 (42%), Positives = 34/56 (60%), Gaps = 1/56 (1%)

Query: 59  GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           G   D NL   R P  + + ++  ++P++FDSR KWP C +I +IRDQ  CGS W 
Sbjct: 66  GRREDPNLREKRRP-TVDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120



 Score = 42.4 bits (98), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 17/29 (58%), Positives = 22/29 (75%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           + CG GC+GGF G +W YWV  GIV+GG+
Sbjct: 153 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 181


>gi|226473756|emb|CAX71563.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  138 bits (348), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 75/209 (35%), Positives = 108/209 (51%), Gaps = 39/209 (18%)

Query: 99  TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
            +R I   GS  +  GCRPY    C+H V G   +C      TP+C + CQ+ Y+  Y++
Sbjct: 172 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQ 231

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
           D ++G  SY+V S E  I K+I  HGPVE    +++D + YKSG                
Sbjct: 232 DKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG---------------- 275

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
                                +  Y +GK + GHA+R++GWG +  +   YWL AN+WN 
Sbjct: 276 ---------------------IYRYTTGKYISGHAVRLIGWGVENGT--AYWLAANTWNE 312

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
           DWG+ G F+I+RG++EC IES I AG+ K
Sbjct: 313 DWGEKGYFRIVRGRNECLIESEIAAGLIK 341



 Score = 54.3 bits (129), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 24/56 (42%), Positives = 34/56 (60%), Gaps = 1/56 (1%)

Query: 59  GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           G   D NL   R P  + + ++  ++P++FDSR KWP C +I +IRDQ  CGS W 
Sbjct: 66  GRREDPNLREKRRP-TVDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120



 Score = 42.4 bits (98), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 17/29 (58%), Positives = 22/29 (75%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           + CG GC+GGF G +W YWV  GIV+GG+
Sbjct: 153 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 181


>gi|171474007|gb|AAX31052.2| SJCHGC09761 protein [Schistosoma japonicum]
          Length = 342

 Score =  138 bits (348), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 75/209 (35%), Positives = 108/209 (51%), Gaps = 39/209 (18%)

Query: 99  TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
            +R I   GS  +  GCRPY    C+H V G   +C      TP+C + CQ+ Y+  Y++
Sbjct: 172 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQ 231

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
           D ++G  SY+V S E  I K+I  HGPVE    +++D + YKSG                
Sbjct: 232 DKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG---------------- 275

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
                                +  Y +GK + GHA+R++GWG +  +   YWL AN+WN 
Sbjct: 276 ---------------------IYRYTTGKYISGHAVRLIGWGVENGT--AYWLAANTWNE 312

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
           DWG+ G F+I+RG++EC IES I AG+ K
Sbjct: 313 DWGEKGYFRIVRGRNECLIESEIAAGLIK 341



 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 25/56 (44%), Positives = 35/56 (62%), Gaps = 1/56 (1%)

Query: 59  GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           G   D NL   R P  I + +++ ++P++FDSR KWP C +I +IRDQ  CGS W 
Sbjct: 66  GRREDPNLREKRRP-TIDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120



 Score = 42.4 bits (98), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 17/29 (58%), Positives = 22/29 (75%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           + CG GC+GGF G +W YWV  GIV+GG+
Sbjct: 153 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 181


>gi|226474184|emb|CAX71578.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  138 bits (348), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 75/209 (35%), Positives = 108/209 (51%), Gaps = 39/209 (18%)

Query: 99  TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
            +R I   GS  +  GCRPY    C+H V G   +C      TP+C + CQ+ Y+  Y++
Sbjct: 172 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQ 231

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
           D ++G  SY+V S E  I K+I  HGPVE    +++D + YKSG                
Sbjct: 232 DKHYGGFSYNVLSVESVIQKDIMVHGPVEAYLEIYEDFLNYKSG---------------- 275

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
                                +  Y +GK + GHA+R++GWG +  +   YWL AN+WN 
Sbjct: 276 ---------------------IYRYTTGKYISGHAVRLIGWGVENGT--AYWLAANTWNE 312

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
           DWG+ G F+I+RG++EC IES I AG+ K
Sbjct: 313 DWGEKGYFRIVRGRNECLIESEIAAGLIK 341



 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 1/56 (1%)

Query: 59  GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           G   D NL   R P  + + +++ ++P++FDSR KWP C +I +IRDQ  CGS W 
Sbjct: 66  GRKEDPNLREKRRP-TVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120



 Score = 42.4 bits (98), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 17/29 (58%), Positives = 22/29 (75%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           + CG GC+GGF G +W YWV  GIV+GG+
Sbjct: 153 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 181


>gi|56754499|gb|AAW25437.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  138 bits (348), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 75/209 (35%), Positives = 108/209 (51%), Gaps = 39/209 (18%)

Query: 99  TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
            +R I   GS  +  GCRPY    C+H V G   +C      TP+C + CQ+ Y+  Y++
Sbjct: 172 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQ 231

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
           D ++G  SY+V S E  I K+I  HGPVE    +++D + YKSG                
Sbjct: 232 DKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG---------------- 275

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
                                +  Y +GK + GHA+R++GWG +  +   YWL AN+WN 
Sbjct: 276 ---------------------IYRYTTGKYISGHAVRLIGWGVENGT--AYWLAANTWNE 312

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
           DWG+ G F+I+RG++EC IES I AG+ K
Sbjct: 313 DWGEKGYFRIVRGRNECLIESEIAAGLIK 341



 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 1/56 (1%)

Query: 59  GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           G   D NL   R P  + + +++ ++P++FDSR KWP C +I +IRDQ  CGS W 
Sbjct: 66  GRREDPNLREKRRPT-VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120



 Score = 42.4 bits (98), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 17/29 (58%), Positives = 22/29 (75%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           + CG GC+GGF G +W YWV  GIV+GG+
Sbjct: 153 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 181


>gi|312271213|gb|ADQ57304.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
          Length = 347

 Score =  138 bits (347), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 75/195 (38%), Positives = 101/195 (51%), Gaps = 40/195 (20%)

Query: 114 GCRPYEIAPCEHHV-NGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
           GC+PY   PCEHH+       C      T  C  +CQ+ Y + Y  D ++GA  Y+V+ +
Sbjct: 192 GCKPYPYPPCEHHIPEHHYKKCPKDIYPTNTCEYKCQDGYSISYNSDKHYGASVYAVAQD 251

Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
             SI KEI  +GPVE AF V++D   Y SG                              
Sbjct: 252 VASIQKEIMTNGPVEVAFDVYEDFEHYSSG------------------------------ 281

Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
                  +  + +G  LGGHA+++LGWG +  +   YW+ ANSWN+DWG+NG F+ILRG 
Sbjct: 282 -------IYKHTTGDYLGGHAVKMLGWGTENGTD--YWICANSWNSDWGENGFFRILRGV 332

Query: 293 DECGIESSITAGVPK 307
           DEC IESS+ AG PK
Sbjct: 333 DECQIESSVVAGEPK 347



 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 25/52 (48%), Positives = 29/52 (55%), Gaps = 5/52 (9%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK-----NSLSNIPRAHLK 55
           CGFGC+GG P  AW YWV +GIV+G  Y SK   K         +IP  H K
Sbjct: 160 CGFGCDGGDPYAAWSYWVSNGIVTGSNYTSKSGCKPYPYPPCEHHIPEHHYK 211


>gi|226473762|emb|CAX71566.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
 gi|226474170|emb|CAX71571.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  138 bits (347), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 75/209 (35%), Positives = 108/209 (51%), Gaps = 39/209 (18%)

Query: 99  TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
            +R I   GS  +  GCRPY    C+H V G   +C      TP+C + CQ+ Y+  Y++
Sbjct: 172 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQ 231

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
           D ++G  SY+V S E  I K+I  HGPVE    +++D + YKSG                
Sbjct: 232 DKHYGGFSYNVLSVESVIQKDIMMHGPVEAYIEIYEDFLNYKSG---------------- 275

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
                                +  Y +GK + GHA+R++GWG +  +   YWL AN+WN 
Sbjct: 276 ---------------------IYRYTTGKYISGHAVRLIGWGVENGT--AYWLAANTWNE 312

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
           DWG+ G F+I+RG++EC IES I AG+ K
Sbjct: 313 DWGEKGYFRIVRGRNECLIESEIAAGLIK 341



 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 1/56 (1%)

Query: 59  GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           G   D NL   R P  + + +++ ++P++FDSR KWP C +I +IRDQ  CGS W 
Sbjct: 66  GRREDPNLREKRRP-TVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120



 Score = 42.4 bits (98), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 17/29 (58%), Positives = 22/29 (75%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           + CG GC+GGF G +W YWV  GIV+GG+
Sbjct: 153 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 181


>gi|226474176|emb|CAX71574.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  138 bits (347), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 74/209 (35%), Positives = 108/209 (51%), Gaps = 39/209 (18%)

Query: 99  TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
            +R I   GS  +  GCRPY    C+H V G   +C      TP+C + CQ+ Y+  Y++
Sbjct: 172 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQ 231

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
           D ++G  SY+V S E  I K+I  HGPVE    +++D + YKSG                
Sbjct: 232 DKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG---------------- 275

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
                                +  Y +GK + GHA+R++GWG +  +   YWL AN+WN 
Sbjct: 276 ---------------------IYRYTTGKYISGHAVRLIGWGVENGT--AYWLAANTWNE 312

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
           DWG+ G F+I+RG++EC I+S I AG+ K
Sbjct: 313 DWGEKGYFRIVRGRNECSIDSEIAAGLIK 341



 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 1/56 (1%)

Query: 59  GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           G   D NL   R P  + + +++ ++P++FDSR KWP C +I +IRDQ  CGS W 
Sbjct: 66  GRKEDPNLREKRRP-TVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120



 Score = 42.4 bits (98), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 17/29 (58%), Positives = 22/29 (75%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           + CG GC+GGF G +W YWV  GIV+GG+
Sbjct: 153 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 181


>gi|226469952|emb|CAX70257.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  138 bits (347), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 71/191 (37%), Positives = 99/191 (51%), Gaps = 39/191 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY    CEHH  G  PSC      TP+C R+CQ+ Y  PY+ D ++G  + +V  NE
Sbjct: 187 GCQPYPFPKCEHHSIGKYPSCGDKMYKTPQCKRKCQKGYTTPYEHDKHYGGIAINVIKNE 246

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            +I KEI  +GPVE    +F+D + YKSG                               
Sbjct: 247 LAIQKEIMMYGPVEAYLLIFEDFLNYKSG------------------------------- 275

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                 +  Y +G  +G H +RI+GWG +  +   YWL AN+WN DWG+ G F+I+RG++
Sbjct: 276 ------IYKYTTGSFVGEHYVRIIGWGIENGT--AYWLAANTWNEDWGEKGYFRIVRGRN 327

Query: 294 ECGIESSITAG 304
           EC IES + AG
Sbjct: 328 ECSIESVVVAG 338



 Score = 53.9 bits (128), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 24/56 (42%), Positives = 34/56 (60%), Gaps = 1/56 (1%)

Query: 59  GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           G   D NL   R P  + + ++  ++P++FDSR KWP C +I +IRDQ  CGS W 
Sbjct: 66  GRREDPNLREKRRP-TVDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSRCGSSWA 120



 Score = 47.4 bits (111), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 19/27 (70%), Positives = 22/27 (81%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           CG GC+GGFPG AW YWV  GIV+GG+
Sbjct: 155 CGSGCDGGFPGPAWDYWVSHGIVTGGS 181


>gi|312091331|ref|XP_003146940.1| cathepsin B [Loa loa]
          Length = 249

 Score =  138 bits (347), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 81/202 (40%), Positives = 100/202 (49%), Gaps = 54/202 (26%)

Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
           GCRPY   PCEHH N T    C      TPKCV++C +NY   YK D  +G   Y+V SN
Sbjct: 86  GCRPYPFPPCEHHNNKTHYEPCKHDLYPTPKCVKKCDKNYGKSYKADKYYGQSVYNVESN 145

Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
            +SI KEI   GPVE +                                           
Sbjct: 146 VESIQKEIMTLGPVEAS------------------------------------------- 162

Query: 233 AFTVFDDLILYK-------SGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
            F V+ D + Y        +G   GGHA+++LGWG D+     YWL ANSWNTDWG++G 
Sbjct: 163 -FEVYTDFLYYTGGIYKHVAGSMGGGHAVKVLGWGIDQGVP--YWLAANSWNTDWGEDGY 219

Query: 286 FKILRGKDECGIESSITAGVPK 307
           F+ILRG +ECGIES I AG+PK
Sbjct: 220 FRILRGVNECGIESGIIAGIPK 241



 Score = 43.5 bits (101), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 18/30 (60%), Positives = 21/30 (70%)

Query: 7  RLCGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
          + CGFGC GG P  AW+YWV  GIV+G  Y
Sbjct: 52 KTCGFGCFGGEPMAAWKYWVLRGIVTGSEY 81


>gi|56756114|gb|AAW26235.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  138 bits (347), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 71/194 (36%), Positives = 102/194 (52%), Gaps = 39/194 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY    C+H V G   +C      TP+C + CQ+ Y+  Y++D ++G  SY+V S E
Sbjct: 187 GCRPYPFPKCDHFVKGKYRACGDKLYETPQCKQTCQKGYNTSYEQDKHYGGFSYNVLSVE 246

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
             I K+I  HGPVE    +++D + YKSG                               
Sbjct: 247 SVIQKDIMMHGPVEAYLEIYEDFLNYKSG------------------------------- 275

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                 +  Y +GK + GHA+R++GWG +  +   YWL AN+WN DWG+ G F+I+RG++
Sbjct: 276 ------IYRYTTGKYISGHAVRLIGWGVENGT--AYWLAANTWNEDWGEKGYFRIVRGRN 327

Query: 294 ECGIESSITAGVPK 307
           EC IES I AG+ K
Sbjct: 328 ECLIESEIAAGLIK 341



 Score = 37.7 bits (86), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 15/27 (55%), Positives = 20/27 (74%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           CG GC+GG+   +W YWV  GIV+GG+
Sbjct: 155 CGSGCDGGYFLPSWDYWVSHGIVTGGS 181


>gi|226474164|emb|CAX71568.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
 gi|226474166|emb|CAX71569.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  137 bits (346), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 74/209 (35%), Positives = 108/209 (51%), Gaps = 39/209 (18%)

Query: 99  TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
            +R I   GS  +  GCRPY    C+H V G   +C      TP+C + CQ+ Y+  Y++
Sbjct: 172 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQICQKGYNTSYEQ 231

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
           D ++G  SY+V S E  I K+I  HGPVE    +++D + YKSG                
Sbjct: 232 DKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG---------------- 275

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
                                +  Y +G+ + GHA+R++GWG +  +   YWL AN+WN 
Sbjct: 276 ---------------------IYRYTTGQFISGHAVRLIGWGVENGT--AYWLAANTWNE 312

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
           DWG+ G F+I+RG++EC IES I AG+ K
Sbjct: 313 DWGEKGYFRIVRGRNECSIESEIAAGLIK 341



 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 1/56 (1%)

Query: 59  GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           G   D NL   R P  + + +++ ++P++FDSR KWP C +I +IRDQ  CGS W 
Sbjct: 66  GRREDPNLREKRRP-TVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120



 Score = 42.4 bits (98), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 17/29 (58%), Positives = 22/29 (75%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           + CG GC+GGF G +W YWV  GIV+GG+
Sbjct: 153 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 181


>gi|157167281|ref|XP_001658485.1| cathepsin b [Aedes aegypti]
 gi|108876476|gb|EAT40701.1| AAEL007585-PA [Aedes aegypti]
          Length = 386

 Score =  137 bits (346), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 95/304 (31%), Positives = 137/304 (45%), Gaps = 100/304 (32%)

Query: 65  NLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW----------- 113
           +L   +LP  I     D DLP  FD+R KWP CP++REIRDQG CGSCW           
Sbjct: 106 DLERTKLPLGIMADVEDLDLPDTFDAREKWPECPSLREIRDQGCCGSCWAVSAASAMTDR 165

Query: 114 ---------------------------GCRPYEIAPC-----EHHVNGTRPSCDASKGHT 141
                                      GCR   + P      E  ++   P  ++ +G  
Sbjct: 166 WCVRSKGKEQFIFGSLDLLSCCHSCGQGCRGGTLGPAWQFWVEKGLSSGGP-LNSRQGCH 224

Query: 142 PKCVRECQ---ENYDVP--------------YKKDLNFGAKSYSVSSNEKSIMKEIYEHG 184
           P  + EC+   E+ D P                +D ++G  +YS+ ++E+ IM+EI+ +G
Sbjct: 225 PYPIGECRIPGEDEDTPKCSNKCRSGYNVTDVWQDRHYGRVAYSLPNDERKIMEEIFING 284

Query: 185 PVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYK 244
           PV+ AF  + DL  YKSG                                     +  + 
Sbjct: 285 PVQAAFHTYLDLHAYKSG-------------------------------------IYRHV 307

Query: 245 SGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAG 304
            G   GGHA+++LGWG +  +  KYWL+ANSW  +WG+NG FKI+RG++ CGIE +I AG
Sbjct: 308 WGPLSGGHAVKLLGWGVE--NGVKYWLVANSWGREWGENGFFKIVRGENHCGIEENIHAG 365

Query: 305 VPKL 308
           +P  
Sbjct: 366 LPNF 369



 Score = 39.7 bits (91), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 17/32 (53%), Positives = 22/32 (68%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CG GC GG  G AW++WV+ G+ SGG   S+Q
Sbjct: 190 CGQGCRGGTLGPAWQFWVEKGLSSGGPLNSRQ 221


>gi|226474160|emb|CAX71567.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  137 bits (346), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 74/209 (35%), Positives = 108/209 (51%), Gaps = 39/209 (18%)

Query: 99  TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
            +R I   GS  +  GCRPY    C+H V G   +C      TP+C + CQ+ Y+  Y++
Sbjct: 172 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQICQKGYNTSYEQ 231

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
           D ++G  SY+V S E  I K+I  HGPVE    +++D + YKSG                
Sbjct: 232 DKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG---------------- 275

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
                                +  Y +G+ + GHA+R++GWG +  +   YWL AN+WN 
Sbjct: 276 ---------------------IYRYTTGQFISGHAVRLIGWGVENGT--AYWLAANTWNE 312

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
           DWG+ G F+I+RG++EC IES I AG+ K
Sbjct: 313 DWGEKGYFRIVRGRNECSIESEIAAGLIK 341



 Score = 54.7 bits (130), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 24/56 (42%), Positives = 34/56 (60%), Gaps = 1/56 (1%)

Query: 59  GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           G   D NL   R P  + + ++  ++P++FDSR KWP C +I +IRDQ  CGS W 
Sbjct: 66  GRREDPNLREKRRP-TVDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120



 Score = 42.4 bits (98), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 17/29 (58%), Positives = 22/29 (75%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           + CG GC+GGF G +W YWV  GIV+GG+
Sbjct: 153 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 181


>gi|116784401|gb|ABK23329.1| unknown [Picea sitchensis]
          Length = 350

 Score =  137 bits (346), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 100/325 (30%), Positives = 140/325 (43%), Gaps = 99/325 (30%)

Query: 37  GSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPEL--IGYSEVDEDLPANFDSRTKW 94
           G K    +  SN      K  +GV P    P N L  +  I Y +   +LP  FD+R  W
Sbjct: 51  GWKAGMNSRFSNHTVGQFKRLLGVLP---TPRNFLENVPVITYPK-GMNLPKQFDAREAW 106

Query: 95  PNCPTIREIRDQGSCGSCWGCRPYEIAP---CEHH-VNGTRP-----SC----------- 134
           P C +++ I DQG CGSCW     E      C HH VN T       +C           
Sbjct: 107 PQCTSVQTILDQGHCGSCWAFGAVEALSDRFCIHHKVNVTLSENDLVACCGFMCGDGCDG 166

Query: 135 -------------------------DASKGH--------TPKCVRECQENYDVPYKKDLN 161
                                    DA   H        TP+CV++C++  +  +     
Sbjct: 167 GYPISAWQYFISTGVVTAECDPYFDDAGCQHPGCEPLYPTPQCVKQCKDE-NQKWGNSKR 225

Query: 162 FGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTI 221
           F A +Y +SS    IM E+Y +GPVE +F+V++D   YKSG +                 
Sbjct: 226 FSATAYRISSKPYDIMAEVYTNGPVEVSFSVYEDFAHYKSGVY----------------- 268

Query: 222 RDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWG 281
                                Y  G  +GGHA++++GWG ++ +   YWL+ANSWNT WG
Sbjct: 269 --------------------KYTKGDYMGGHAVKLVGWGTEDGT--DYWLVANSWNTAWG 306

Query: 282 DNGLFKILRGKDECGIESSITAGVP 306
           ++G FKI RG +ECGIE  + AG+P
Sbjct: 307 EDGYFKIARGSNECGIEGDVVAGMP 331


>gi|300176938|emb|CBK25507.2| unnamed protein product [Blastocystis hominis]
          Length = 320

 Score =  137 bits (346), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 73/194 (37%), Positives = 103/194 (53%), Gaps = 40/194 (20%)

Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEK 174
           C  Y    CEHH  G  P C  S+  TP+CV++CQE Y V Y+KD +F  ++Y V     
Sbjct: 167 CNAYSFPKCEHHAEGKYPPCGESQ-ETPECVKQCQEGYPVEYEKDKHFFGEAYYVQGGID 225

Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
           +I  E+  +GP+E +F V++D + YKSG                                
Sbjct: 226 AIKTELMTNGPLEVSFFVYEDFLTYKSG-------------------------------- 253

Query: 235 TVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE 294
                +  + +GK LGGHA++++GWG ++    +YW IANSWN DWG+NG F+I+ GK E
Sbjct: 254 -----IYQHVAGKYLGGHAVKLVGWGVEDGI--EYWKIANSWNEDWGENGYFRIVAGKGE 306

Query: 295 CGIESSITAGVPKL 308
           CGIE     G+PKL
Sbjct: 307 CGIEVGPIGGIPKL 320



 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 35/92 (38%), Positives = 43/92 (46%), Gaps = 16/92 (17%)

Query: 49  IPRAHLKSWMGV-HPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQG 107
           IP      ++GV   D  LP+  +           DLP +FD   KWP CP+++EIRDQ 
Sbjct: 40  IPTRDYTQYLGVLFGDRQLPSKTIV-------ARGDLPESFDPVEKWPECPSLKEIRDQS 92

Query: 108 SCGSCWGCRPYEIAPCEHHVNGTRPSCDASKG 139
            CGSCW     E A        T   C ASKG
Sbjct: 93  VCGSCWAFGAAEAA--------TDRLCIASKG 116



 Score = 45.1 bits (105), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 18/31 (58%), Positives = 25/31 (80%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
           CGFGC+GG+  MAWR++  +G+ +GG YGSK
Sbjct: 134 CGFGCDGGWLDMAWRWFQSTGVTTGGEYGSK 164


>gi|226469948|emb|CAX70255.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  137 bits (346), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 71/191 (37%), Positives = 98/191 (51%), Gaps = 39/191 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY    CEHH  G  PSC      TP+C R+CQ+ Y  PY+ D ++G  S +V  NE
Sbjct: 187 GCQPYPFPKCEHHSIGKYPSCGDKIYKTPQCKRKCQKGYTTPYEHDKHYGGISINVIKNE 246

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            +I  EI  +GPVE    +F+D + YKSG                               
Sbjct: 247 SAIQNEIMMYGPVEAYLLIFEDFLNYKSG------------------------------- 275

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                 +  Y +G  +G H +RI+GWG +  +   YWL AN+WN DWG+ G F+I+RG++
Sbjct: 276 ------IYRYTTGSFVGEHYVRIIGWGIENGT--AYWLAANTWNEDWGEKGYFRIVRGRN 327

Query: 294 ECGIESSITAG 304
           EC IES + AG
Sbjct: 328 ECSIESVVVAG 338



 Score = 47.4 bits (111), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 19/27 (70%), Positives = 22/27 (81%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           CG GC+GGFPG AW YWV  GIV+GG+
Sbjct: 155 CGSGCDGGFPGPAWDYWVSHGIVTGGS 181


>gi|56755451|gb|AAW25905.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  137 bits (346), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 74/209 (35%), Positives = 108/209 (51%), Gaps = 39/209 (18%)

Query: 99  TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
            +R I   GS  +  GCRPY    C+H V G   +C      TP+C + CQ+ Y+  Y++
Sbjct: 172 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQICQKGYNTSYEQ 231

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
           D ++G  SY+V S E  I K+I  HGPVE    +++D + YKSG                
Sbjct: 232 DKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG---------------- 275

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
                                +  Y +G+ + GHA+R++GWG +  +   YWL AN+WN 
Sbjct: 276 ---------------------IYRYTTGQFISGHAVRLIGWGVENGT--AYWLAANTWNE 312

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
           DWG+ G F+I+RG++EC IES I AG+ K
Sbjct: 313 DWGEKGYFRIVRGRNECSIESEIAAGLIK 341



 Score = 54.3 bits (129), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 24/56 (42%), Positives = 34/56 (60%), Gaps = 1/56 (1%)

Query: 59  GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           G   D NL   R P  + + ++  ++P++FDSR KWP C +I +IRDQ  CGS W 
Sbjct: 66  GRKEDPNLRQKRRPT-VDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120



 Score = 42.4 bits (98), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 17/29 (58%), Positives = 22/29 (75%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           + CG GC+GGF G +W YWV  GIV+GG+
Sbjct: 153 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 181


>gi|217073630|gb|ACJ85175.1| unknown [Medicago truncatula]
          Length = 359

 Score =  137 bits (346), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 91/284 (32%), Positives = 127/284 (44%), Gaps = 108/284 (38%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCWG-----------CRPYEI------------ 120
           LP  FD+R  W  C TI +I DQG CGSCW            C  +++            
Sbjct: 103 LPKEFDARAAWSQCSTIGKILDQGHCGSCWAFGAVESLQDRFCSHFDMNISLSVNDLLAC 162

Query: 121 ------APCE------------HH-------------VNGTRPSCDASKGHTPKCVRECQ 149
                 A C+            HH             +  + P C+ +   TPKCVR+C 
Sbjct: 163 CGFLCGAGCDGGTPIYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPAY-QTPKCVRKCV 221

Query: 150 ENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGN 209
           +   + +K+  ++  K+Y V S+ + IM E+Y++GPVE A                    
Sbjct: 222 KGNQI-WKRSKHYSVKAYRVKSDPQDIMTEVYKNGPVEVA-------------------- 260

Query: 210 ETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGED 262
                                   FTVF+D   YKSG        ALGGHA++++GWG  
Sbjct: 261 ------------------------FTVFEDFAHYKSGVYKHITGSALGGHAVKLIGWGTS 296

Query: 263 EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           ++  E YWL+AN WNT+WGD+G FKI RG +ECGIE  +TAG+P
Sbjct: 297 DEG-EDYWLLANQWNTNWGDDGYFKIKRGTNECGIEDDVTAGLP 339


>gi|56752925|gb|AAW24674.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  137 bits (345), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 72/194 (37%), Positives = 101/194 (52%), Gaps = 39/194 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY    C+H V G   +C      TP+C + CQ+ Y+  Y++D ++G  SYSV   E
Sbjct: 187 GCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGGFSYSVIGVE 246

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            +I KEI  +GPVE    +++D + YKSG                               
Sbjct: 247 SAIQKEIMMYGPVEAYLQIYEDFLNYKSG------------------------------- 275

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                 +  Y +GK + GHA+R++GWG +  +   YWL AN+WN DWG+ G F+I+RG+D
Sbjct: 276 ------IYRYTTGKYISGHAVRLIGWGVENGT--SYWLAANTWNEDWGEKGYFRIVRGRD 327

Query: 294 ECGIESSITAGVPK 307
           EC IES I AG  K
Sbjct: 328 ECLIESFIVAGQIK 341



 Score = 43.1 bits (100), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 17/27 (62%), Positives = 21/27 (77%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           CG GC+GG  G +W YWVK GIV+GG+
Sbjct: 155 CGSGCDGGVTGYSWDYWVKHGIVTGGS 181


>gi|226474174|emb|CAX71573.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  137 bits (345), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 74/209 (35%), Positives = 108/209 (51%), Gaps = 39/209 (18%)

Query: 99  TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
            +R I   GS  +  GCRPY    C+H V G   +C      TP+C + CQ+ Y+  Y++
Sbjct: 172 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQ 231

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
           D ++G  SY+V S E  I K+I  HGPVE    +++D + YKSG                
Sbjct: 232 DKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG---------------- 275

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
                                +  Y +G+ + GHA+R++GWG +  +   YWL AN+WN 
Sbjct: 276 ---------------------IYRYTTGQFISGHAVRLIGWGVENGT--AYWLAANTWNE 312

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
           DWG+ G F+I+RG++EC IES I AG+ K
Sbjct: 313 DWGEKGYFRIVRGRNECLIESEIAAGLIK 341



 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 1/56 (1%)

Query: 59  GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           G   D NL   R P  + + +++ ++P++FDSR KWP C +I +IRDQ  CGS W 
Sbjct: 66  GRKEDPNLREKRRP-TVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120



 Score = 42.4 bits (98), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 17/29 (58%), Positives = 22/29 (75%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           + CG GC+GGF G +W YWV  GIV+GG+
Sbjct: 153 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 181


>gi|226473758|emb|CAX71564.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  137 bits (345), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 74/209 (35%), Positives = 108/209 (51%), Gaps = 39/209 (18%)

Query: 99  TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
            +R I   GS  +  GCRPY    C+H V G   +C      TP+C + CQ+ Y+  Y++
Sbjct: 172 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQ 231

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
           D ++G  SY+V S E  I K+I  HGPVE    +++D + YKSG                
Sbjct: 232 DKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG---------------- 275

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
                                +  Y +G+ + GHA+R++GWG +  +   YWL AN+WN 
Sbjct: 276 ---------------------IYRYTTGQFISGHAVRLIGWGVENGT--AYWLAANTWNE 312

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
           DWG+ G F+I+RG++EC IES I AG+ K
Sbjct: 313 DWGEKGYFRIVRGRNECLIESEIAAGLIK 341



 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 1/56 (1%)

Query: 59  GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           G   D NL   R P  + + +++ ++P++FDSR KWP C +I +IRDQ  CGS W 
Sbjct: 66  GRREDPNLRQKRRPT-VDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120



 Score = 42.4 bits (98), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 17/29 (58%), Positives = 22/29 (75%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           + CG GC+GGF G +W YWV  GIV+GG+
Sbjct: 153 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 181


>gi|226474172|emb|CAX71572.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  137 bits (345), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 74/209 (35%), Positives = 108/209 (51%), Gaps = 39/209 (18%)

Query: 99  TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
            +R I   GS  +  GCRPY    C+H V G   +C      TP+C + CQ+ Y+  Y++
Sbjct: 172 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQ 231

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
           D ++G  SY+V S E  I K+I  HGPVE    +++D + YKSG                
Sbjct: 232 DKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG---------------- 275

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
                                +  Y +G+ + GHA+R++GWG +  +   YWL AN+WN 
Sbjct: 276 ---------------------IYRYTTGQFISGHAVRLIGWGVENGT--AYWLAANTWNE 312

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
           DWG+ G F+I+RG++EC IES I AG+ K
Sbjct: 313 DWGEKGYFRIVRGRNECLIESEIAAGLIK 341



 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 1/56 (1%)

Query: 59  GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           G   D NL   R P  + + +++ ++P++FDSR KWP C +I +IRDQ  CGS W 
Sbjct: 66  GRKEDPNLREKRRP-TVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120



 Score = 42.4 bits (98), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 17/29 (58%), Positives = 22/29 (75%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           + CG GC+GGF G +W YWV  GIV+GG+
Sbjct: 153 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 181


>gi|56758716|gb|AAW27498.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  137 bits (344), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 72/194 (37%), Positives = 101/194 (52%), Gaps = 39/194 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY    C+H V G   +C      TP+C + CQ+ Y+  Y++D ++G  SYSV   E
Sbjct: 187 GCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGGFSYSVIGVE 246

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            +I KEI  +GPVE    +++D + YKSG                               
Sbjct: 247 SAIQKEIMMYGPVEAYLEIYEDFLNYKSG------------------------------- 275

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                 +  Y +GK + GHA+R++GWG +  +   YWL AN+WN DWG+ G F+I+RG+D
Sbjct: 276 ------IYRYTTGKYISGHAVRLIGWGVENGT--AYWLAANTWNEDWGEKGYFRIVRGRD 327

Query: 294 ECGIESSITAGVPK 307
           EC IES I AG  K
Sbjct: 328 ECLIESFIVAGQIK 341



 Score = 43.1 bits (100), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 17/27 (62%), Positives = 21/27 (77%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           CG GC+GG  G +W YWVK GIV+GG+
Sbjct: 155 CGSGCDGGVTGYSWDYWVKHGIVTGGS 181


>gi|56756380|gb|AAW26363.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  137 bits (344), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 74/209 (35%), Positives = 108/209 (51%), Gaps = 39/209 (18%)

Query: 99  TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
            +R I   GS  +  GCRPY    C+H V G   +C      TP+C + CQ+ Y+  Y++
Sbjct: 172 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQ 231

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
           D ++G  SY+V S E  I K+I  HGPVE    +++D + YKSG                
Sbjct: 232 DKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG---------------- 275

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
                                +  Y +G+ + GHA+R++GWG +  +   YWL AN+WN 
Sbjct: 276 ---------------------IYRYTTGQFISGHAVRLIGWGVENGT--AYWLAANTWNE 312

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
           DWG+ G F+I+RG++EC IES I AG+ K
Sbjct: 313 DWGEKGYFRIVRGRNECLIESEIAAGLIK 341



 Score = 54.7 bits (130), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 1/56 (1%)

Query: 59  GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           G   D NL   R P  + + +++ ++P++FDSR KWP C +I +IRDQ  CGS W 
Sbjct: 66  GRKEDPNLRQRRRP-TVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120



 Score = 42.0 bits (97), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 17/27 (62%), Positives = 21/27 (77%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           CG GC+GGF G +W YWV  GIV+GG+
Sbjct: 155 CGSGCDGGFLGPSWDYWVLRGIVTGGS 181


>gi|326492684|dbj|BAJ90198.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 355

 Score =  137 bits (344), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 98/315 (31%), Positives = 132/315 (41%), Gaps = 96/315 (30%)

Query: 46  LSNIPRAHLKSWMGVHPDYNLPANRLPEL-IGYSEVDEDLPANFDSRTKWPNCPTIREIR 104
            +N      K  +GV P    P   L  + I       DLP  FD+RT+W +C TI  I 
Sbjct: 65  FANYTIEQFKHILGVKP---TPPGLLAGVPIKTHPKSADLPKEFDARTQWSSCSTIGNIL 121

Query: 105 DQGSCGSCWGCRPYEIAP-------------------------CEHHVNGTRP------- 132
           DQG CG+CW     E                            C    NG  P       
Sbjct: 122 DQGHCGACWAFAAVESLQDRFCIHLNMSVSLSVNDLLACCGFLCGSGCNGGYPISAWRYF 181

Query: 133 --------SCDASKGHT-------------PKCVRECQENYDVPYKKDLNFGAKSYSVSS 171
                    CD     T             PKC R+C+    V +KK+ +    +Y V S
Sbjct: 182 RRSGVVTEECDPYFDQTGCQHPGCEPAYPTPKCHRKCKVENQV-WKKNKHSSVNAYRVHS 240

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           N   IM E+Y++GPVE AFTV++D   YKSG +                           
Sbjct: 241 NPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVY--------------------------- 273

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
                      + +G  +GGHA++++GWG  + + E YWL+AN WN  WG +G FKI+RG
Sbjct: 274 ----------KHITGGVMGGHAVKLIGWGTSD-AGEDYWLLANQWNRGWGGDGYFKIIRG 322

Query: 292 KDECGIESSITAGVP 306
           K+ECGIE  +TAG+P
Sbjct: 323 KNECGIEEDVTAGMP 337



 Score = 40.8 bits (94), Expect = 0.81,   Method: Compositional matrix adjust.
 Identities = 16/25 (64%), Positives = 21/25 (84%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVS 32
           LCG GCNGG+P  AWRY+ +SG+V+
Sbjct: 164 LCGSGCNGGYPISAWRYFRRSGVVT 188


>gi|157111449|ref|XP_001651570.1| cathepsin b [Aedes aegypti]
 gi|108868331|gb|EAT32556.1| AAEL015312-PA [Aedes aegypti]
          Length = 386

 Score =  137 bits (344), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 94/304 (30%), Positives = 137/304 (45%), Gaps = 100/304 (32%)

Query: 65  NLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW----------- 113
           +L   +LP  I     D DLP  FD+R KWP CP++REIRDQG CGSCW           
Sbjct: 106 DLERTKLPLGIMADVEDLDLPDTFDAREKWPECPSLREIRDQGCCGSCWAVSAASAMTDR 165

Query: 114 ---------------------------GCRPYEIAPC-----EHHVNGTRPSCDASKGHT 141
                                      GCR   + P      E  ++   P  ++ +G  
Sbjct: 166 WCVRSKGKEQFIFGSLDLLSCCHSCGQGCRGGTLGPAWQFWVEKGLSSGGP-LNSRQGCH 224

Query: 142 PKCVRECQ---ENYDVP--------------YKKDLNFGAKSYSVSSNEKSIMKEIYEHG 184
           P  + EC+   E+ D P                +D ++G  +YS+ ++E+ IM+EI+ +G
Sbjct: 225 PYPIGECRIPGEDEDTPKCSNKCRSGYNVTDVWQDRHYGRVAYSLPNDERKIMEEIFING 284

Query: 185 PVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYK 244
           PV+ AF  + DL  YKSG                                     +  + 
Sbjct: 285 PVQAAFHTYLDLHAYKSG-------------------------------------IYRHV 307

Query: 245 SGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAG 304
            G   GGHA+++LGWG +  +  KYWL+ANSW  +WG+NG FK++RG++ CGIE +I AG
Sbjct: 308 WGPLSGGHAVKLLGWGVE--NGVKYWLVANSWGREWGENGFFKMVRGENHCGIEENIHAG 365

Query: 305 VPKL 308
           +P  
Sbjct: 366 LPNF 369



 Score = 39.7 bits (91), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 17/32 (53%), Positives = 22/32 (68%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CG GC GG  G AW++WV+ G+ SGG   S+Q
Sbjct: 190 CGQGCRGGTLGPAWQFWVEKGLSSGGPLNSRQ 221


>gi|157131748|ref|XP_001662318.1| cathepsin b [Aedes aegypti]
 gi|108871395|gb|EAT35620.1| AAEL012216-PA [Aedes aegypti]
          Length = 386

 Score =  137 bits (344), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 94/304 (30%), Positives = 137/304 (45%), Gaps = 100/304 (32%)

Query: 65  NLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW----------- 113
           +L   +LP  I     D DLP  FD+R KWP CP++REIRDQG CGSCW           
Sbjct: 106 DLERTKLPLGIMADVEDLDLPDTFDAREKWPECPSLREIRDQGCCGSCWAVSAASAMTDR 165

Query: 114 ---------------------------GCRPYEIAPC-----EHHVNGTRPSCDASKGHT 141
                                      GCR   + P      E  ++   P  ++ +G  
Sbjct: 166 WCVRSKGKEQFIFGSLDLLSCCHSCGQGCRGGTLGPAWQFWVEKGLSSGGP-LNSRQGCH 224

Query: 142 PKCVRECQ---ENYDVP--------------YKKDLNFGAKSYSVSSNEKSIMKEIYEHG 184
           P  + EC+   E+ D P                +D ++G  +YS+ ++E+ IM+EI+ +G
Sbjct: 225 PYPIGECRIPGEDEDTPKCSNKCRSGYNVTDVWQDRHYGRVAYSLPNDERKIMEEIFING 284

Query: 185 PVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYK 244
           PV+ AF  + DL  YKSG                                     +  + 
Sbjct: 285 PVQAAFHTYLDLHAYKSG-------------------------------------IYRHV 307

Query: 245 SGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAG 304
            G   GGHA+++LGWG +  +  KYWL+ANSW  +WG+NG FK++RG++ CGIE +I AG
Sbjct: 308 WGPLSGGHAVKLLGWGVE--NGVKYWLVANSWGREWGENGFFKMVRGENHCGIEENIHAG 365

Query: 305 VPKL 308
           +P  
Sbjct: 366 LPNF 369



 Score = 39.7 bits (91), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 17/32 (53%), Positives = 22/32 (68%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CG GC GG  G AW++WV+ G+ SGG   S+Q
Sbjct: 190 CGQGCRGGTLGPAWQFWVEKGLSSGGPLNSRQ 221


>gi|56752787|gb|AAW24605.1| unknown [Schistosoma japonicum]
          Length = 309

 Score =  136 bits (343), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 74/209 (35%), Positives = 108/209 (51%), Gaps = 39/209 (18%)

Query: 99  TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
            +R I   GS  +  GCRPY    C+H V G   +C      TP+C + CQ+ Y+  Y++
Sbjct: 139 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQ 198

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
           D ++G  SY+V S E  I K+I  HGPVE    +++D + YKSG                
Sbjct: 199 DKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG---------------- 242

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
                                +  Y +G+ + GHA+R++GWG +  +   YWL AN+WN 
Sbjct: 243 ---------------------IYRYTTGQFISGHAVRLIGWGVENGT--AYWLAANTWNE 279

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
           DWG+ G F+I+RG++EC IES I AG+ K
Sbjct: 280 DWGEKGYFRIVRGRNECLIESEIAAGLIK 308



 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 1/56 (1%)

Query: 59  GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           G   D NL   R P  + + +++ ++P++FDSR KWP C +I +IRDQ  CGS W 
Sbjct: 33  GRREDPNLREKRRP-TVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 87



 Score = 42.0 bits (97), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 17/29 (58%), Positives = 22/29 (75%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           + CG GC+GGF G +W YWV  GIV+GG+
Sbjct: 120 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 148


>gi|282400164|ref|NP_001164205.1| cathepsin B precursor [Tribolium castaneum]
 gi|270004839|gb|EFA01287.1| cathepsin B precursor [Tribolium castaneum]
          Length = 335

 Score =  136 bits (343), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 82/217 (37%), Positives = 107/217 (49%), Gaps = 55/217 (25%)

Query: 99  TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
           T+  I   G+     GC+ Y  APCEHHV+G  P C  +K  TP C +EC     + Y+ 
Sbjct: 167 TVNGIVTGGNYEDTNGCKAYSFAPCEHHVDGDLPPCGPTKP-TPDCKKECDSGSSLTYQN 225

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
           DL  G+ +Y +    K I  EI  +GPVE +                             
Sbjct: 226 DLTHGS-NYGIDPYPKQIQTEIMTNGPVEAS----------------------------- 255

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWL 271
                          F+V++D + YKSG       +  GGHAI+ILGWG +  +   YWL
Sbjct: 256 ---------------FSVYEDFLSYKSGVYQHLEGEYAGGHAIKILGWGVENDTP--YWL 298

Query: 272 IANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           +ANSWN DWGD G FKILRG +ECGIE SI AG+P+L
Sbjct: 299 VANSWNEDWGDKGYFKILRGSNECGIEGSIVAGIPEL 335


>gi|226474178|emb|CAX71575.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 74/209 (35%), Positives = 107/209 (51%), Gaps = 39/209 (18%)

Query: 99  TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
            +R I   GS  +  GCRPY    C+H V G   +C      TP+C + CQ+ Y+  Y++
Sbjct: 172 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQ 231

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
           D ++G  SY+V S E  I K+I  HGPVE    +++D + YKSG                
Sbjct: 232 DKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG---------------- 275

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
                                +  Y +G+ + GHA+R++GWG +  +   YWL AN+WN 
Sbjct: 276 ---------------------IYRYTTGQFISGHAVRLIGWGVENGT--AYWLAANTWNE 312

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
           DWG+ G F+I+RG++EC IES I AG  K
Sbjct: 313 DWGEKGYFRIVRGRNECSIESEIAAGRIK 341



 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 1/56 (1%)

Query: 59  GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           G   D NL   R P  + + +++ ++P++FDSR KWP C +I +IRDQ  CGS W 
Sbjct: 66  GRREDPNLREKRRP-TVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120



 Score = 42.4 bits (98), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 17/29 (58%), Positives = 22/29 (75%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           + CG GC+GGF G +W YWV  GIV+GG+
Sbjct: 153 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 181


>gi|226474182|emb|CAX71577.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 73/209 (34%), Positives = 107/209 (51%), Gaps = 39/209 (18%)

Query: 99  TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
            +R I   GS  +   CRPY    C+H V G   +C      TP+C + CQ+ Y+  Y++
Sbjct: 172 VLRGIVTGGSKENHTSCRPYPFPKCDHFVKGKYRACGDKLYETPQCKQTCQKGYNTSYEQ 231

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
           D ++G  SY+V S E  I K+I  HGPVE    +++D + YKSG                
Sbjct: 232 DKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG---------------- 275

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
                                +  Y +G+ + GHA+R++GWG +  +   YWL AN+WN 
Sbjct: 276 ---------------------IYRYTTGQFISGHAVRLIGWGVENGT--AYWLAANTWNE 312

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
           DWG+ G F+I+RG++EC IES I AG+ K
Sbjct: 313 DWGEKGYFRIVRGRNECSIESEIAAGLIK 341



 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 1/56 (1%)

Query: 59  GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           G   D NL   R P  + + +++ ++P++FDSR KWP C +I +IRDQ  CGS W 
Sbjct: 66  GRREDPNLREKRRP-TVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120



 Score = 42.4 bits (98), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 17/29 (58%), Positives = 22/29 (75%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           + CG GC+GGF G +W YWV  GIV+GG+
Sbjct: 153 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 181


>gi|197725747|gb|ACH73069.1| cathepsin B precursor [Epinephelus coioides]
          Length = 333

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 78/193 (40%), Positives = 100/193 (51%), Gaps = 43/193 (22%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNGTRP C    G TP+C+ +C+  Y   YK D ++G  SYSV S+E
Sbjct: 176 GCRPYTIPPCEHHVNGTRPPCTGEGGDTPQCILQCESGYTPSYKADKHYGKSSYSVPSDE 235

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           + I  EIY++GPVEGAFTV++D +LYK+G +                             
Sbjct: 236 EQIQSEIYKNGPVEGAFTVYEDFLLYKTGVY----------------------------- 266

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G A+GGHAI+   W  +E       L     +TDWGD        G D
Sbjct: 267 --------QHMTGSAVGGHAIK--SWLGEEVCS---LLALCHSDTDWGDMVSLSS-AGSD 312

Query: 294 ECGIESSITAGVP 306
            CGIES I AG+P
Sbjct: 313 HCGIESEIVAGIP 325


>gi|195437434|ref|XP_002066645.1| GK24603 [Drosophila willistoni]
 gi|194162730|gb|EDW77631.1| GK24603 [Drosophila willistoni]
          Length = 341

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 77/209 (36%), Positives = 104/209 (49%), Gaps = 53/209 (25%)

Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
           G  GS  GC+PY I PC+H  NG+RP C    G   +C   C+ +Y V +++D NF +K 
Sbjct: 179 GDYGSQQGCQPYTIEPCDHSGNGSRPVCTVGGG--VRCQHLCEPSYKVDFQRDKNFASKV 236

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           YS+S++   I KEI  +GPV+   T                                   
Sbjct: 237 YSISNDVLEIQKEIMTNGPVQAILT----------------------------------- 261

Query: 227 QLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
                    V++D + YK+G       + +G HA+RILGWG     K  YWL+ANSW +D
Sbjct: 262 ---------VYEDFLSYKTGVYYHLEGEKVGPHAVRILGWGVWGTKKVPYWLVANSWGSD 312

Query: 280 WGDNGLFKILRGKDECGIESSITAGVPKL 308
           WGDNG F I RG++ C IE  I AG+PKL
Sbjct: 313 WGDNGFFHIFRGENHCDIEGYIMAGLPKL 341



 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 34/76 (44%), Positives = 48/76 (63%), Gaps = 4/76 (5%)

Query: 43  KNSLSNIPRAHLKSWMGVHPD-YNLPANRLPELIGYSEVD---EDLPANFDSRTKWPNCP 98
           +N   +I   +L+  MGVH + Y  P     E++G S+ +    DLP +FD+R +W +CP
Sbjct: 44  RNFHESISEKYLRGLMGVHEESYKYPLPDKQEVLGESDDEISLADLPVDFDARLRWTSCP 103

Query: 99  TIREIRDQGSCGSCWG 114
           TI EIR+QGSCGSCW 
Sbjct: 104 TISEIREQGSCGSCWA 119



 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 21/33 (63%), Positives = 26/33 (78%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           +CGF C GG+PG AW YW + G+VSGG YGS+Q
Sbjct: 153 ICGFACQGGYPGAAWAYWARKGLVSGGDYGSQQ 185


>gi|56759488|gb|AAW27884.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 74/209 (35%), Positives = 107/209 (51%), Gaps = 39/209 (18%)

Query: 99  TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
            +R I   GS  +  GCRPY    C+H V G   +C      TP+C + CQ+ Y+  Y++
Sbjct: 172 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQ 231

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
           D ++G  SY+V   E  I K+I  HGPVE    +++D + YKSG                
Sbjct: 232 DKHYGGFSYNVLGIESVIQKDIMMHGPVEAYLEIYEDFLNYKSG---------------- 275

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
                                +  Y +GK + GHA+R++GWG +  +   YWL AN+WN 
Sbjct: 276 ---------------------IYRYTTGKYISGHAVRLIGWGVENGT--AYWLAANTWNE 312

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
           DWG+ G F+I+RG++EC IES I AG+ K
Sbjct: 313 DWGEKGYFRIVRGRNECLIESEIAAGLIK 341



 Score = 42.4 bits (98), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 17/29 (58%), Positives = 22/29 (75%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           + CG GC+GGF G +W YWV  GIV+GG+
Sbjct: 153 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 181


>gi|56758864|gb|AAW27572.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 74/209 (35%), Positives = 107/209 (51%), Gaps = 39/209 (18%)

Query: 99  TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
            +R I   GS  +  GCRPY    C+H V G   +C      TP+C + CQ+ Y+  Y++
Sbjct: 172 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQ 231

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
           D ++G  SY+V   E  I K+I  HGPVE    +++D + YKSG                
Sbjct: 232 DKHYGGFSYNVLGIESVIQKDIMMHGPVEAYLEIYEDFLNYKSG---------------- 275

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
                                +  Y +GK + GHA+R++GWG +  +   YWL AN+WN 
Sbjct: 276 ---------------------IYRYTTGKYISGHAVRLIGWGVENGT--AYWLAANTWNE 312

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
           DWG+ G F+I+RG++EC IES I AG+ K
Sbjct: 313 DWGEKGYFRIVRGRNECLIESEIAAGLIK 341



 Score = 42.4 bits (98), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 17/29 (58%), Positives = 22/29 (75%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           + CG GC+GGF G +W YWV  GIV+GG+
Sbjct: 153 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 181


>gi|14582576|gb|AAK69541.1|AF283476_1 cathepsin B-like cysteine proteinase [Ipomoea batatas]
          Length = 352

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 94/315 (29%), Positives = 139/315 (44%), Gaps = 97/315 (30%)

Query: 46  LSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRD 105
            S+   +  K  +GV         R P +    E++  LP  FD+RT WP C +I +I D
Sbjct: 60  FSDFTVSQFKRLLGVKKAPKSLLKRTPVVTHSKEIE--LPKTFDARTAWPQCLSIADILD 117

Query: 106 QGSCGSCW-------------------------------------GCR-PYEIAP----- 122
           QG CGSCW                                     GC   Y IA      
Sbjct: 118 QGHCGSCWAFGAVESLTDRFCIHYGTNVTLSVNDLLACCGFLCGEGCDGGYPIAAWQYFK 177

Query: 123 --------CEHHVNGT---RPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSS 171
                   C+ + + T    P C+ +   TP C ++C +  ++ + +  +F   +Y V+S
Sbjct: 178 RTGVVTSECDPYFDQTGCSHPGCEPAYP-TPACEKKCVKK-NLLWSESKHFSVNAYRVNS 235

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           ++ SIM E+Y +GP E +FTV++D   YKSG +                           
Sbjct: 236 DQHSIMTEVYTNGPAEVSFTVYEDFAHYKSGVY--------------------------- 268

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
                      + +G  +GGHA++++GWG  E   E YWL+AN WN  WGD+G FKI+RG
Sbjct: 269 ----------KHVTGSEMGGHAVKLIGWGTSEDG-EDYWLLANQWNRSWGDDGYFKIIRG 317

Query: 292 KDECGIESSITAGVP 306
            +ECGIE  +TAG+P
Sbjct: 318 TNECGIE-DVTAGMP 331



 Score = 37.4 bits (85), Expect = 7.7,   Method: Compositional matrix adjust.
 Identities = 13/25 (52%), Positives = 21/25 (84%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVS 32
           LCG GC+GG+P  AW+Y+ ++G+V+
Sbjct: 159 LCGEGCDGGYPIAAWQYFKRTGVVT 183


>gi|224064400|ref|XP_002301457.1| predicted protein [Populus trichocarpa]
 gi|222843183|gb|EEE80730.1| predicted protein [Populus trichocarpa]
          Length = 357

 Score =  136 bits (342), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 102/331 (30%), Positives = 135/331 (40%), Gaps = 110/331 (33%)

Query: 37  GSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
           G K    +  SN   A  K  +GV P        +P +     +   LP  FD+RT WP 
Sbjct: 56  GWKATMNHHFSNYTVAQFKYLLGVKPTPKEELRGIPVISHPKSLR--LPEEFDARTAWPQ 113

Query: 97  CPTIREIRDQGSCGSCWGCRPYE---------------------IAPC------------ 123
           C TI +I DQG CGSCW     E                     +A C            
Sbjct: 114 CSTIGKILDQGHCGSCWAFGAVESLSDRFCIHYGMNISLSVNDLLACCGFLCGSGCNGGY 173

Query: 124 --------EHH-------------VNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
                    HH             +  + P C+     TPKC R+C  N +  +KK  ++
Sbjct: 174 PISAWRYFVHHGVVTEECDPYFDDIGCSHPGCEPGYP-TPKCARKCV-NKNQLWKKSKHY 231

Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
           G K Y + S+ +SIM EIY++GPVE A                                 
Sbjct: 232 GVKPYRIDSDPESIMAEIYKNGPVEVA--------------------------------- 258

Query: 223 DNTSQLGAEGAFTVFDDLILYKSGK-------ALGGHAIRILGWGEDEKSKEKYWLIANS 275
                      FTV++D   YKSG         +GGHA++++GWG  E   E YWL+AN 
Sbjct: 259 -----------FTVYEDFAHYKSGVYKHITGGMMGGHAVKLIGWGTSEDG-EAYWLLANQ 306

Query: 276 WNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           WN  WGD+G FKI RG +ECGIE  + AG+P
Sbjct: 307 WNRGWGDDGYFKIRRGTNECGIEGDVVAGLP 337



 Score = 40.8 bits (94), Expect = 0.81,   Method: Compositional matrix adjust.
 Identities = 16/25 (64%), Positives = 20/25 (80%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVS 32
           LCG GCNGG+P  AWRY+V  G+V+
Sbjct: 164 LCGSGCNGGYPISAWRYFVHHGVVT 188


>gi|341887135|gb|EGT43070.1| hypothetical protein CAEBREN_13756 [Caenorhabditis brenneri]
          Length = 398

 Score =  136 bits (342), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 73/198 (36%), Positives = 103/198 (52%), Gaps = 41/198 (20%)

Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENY-DVPYKKDLNFGAKSYSVSS 171
           GC+PY   PCEHH   T    C      TPKC + C   Y D  Y +D  +G+ +Y V  
Sbjct: 217 GCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKCEKRCNAEYTDKTYSEDKFYGSSAYGVKD 276

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           + ++I KE+  HGP+E AF V++D + Y  G +                           
Sbjct: 277 DVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVY--------------------------- 309

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
                     ++  GK  GGHA++++GWG ++     YW +ANSWNTDWG++G F+ILRG
Sbjct: 310 ----------VHTGGKLGGGHAVKLIGWGIEDGIP--YWTVANSWNTDWGEDGFFRILRG 357

Query: 292 KDECGIESSITAGVPKLD 309
            DECGIES +  G+PKL+
Sbjct: 358 VDECGIESGVVGGIPKLN 375



 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 21/36 (58%), Positives = 27/36 (75%)

Query: 79  EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           ++D D+P +FDSR  WP C +I+ IRDQ SCGSCW 
Sbjct: 115 DLDMDIPESFDSRENWPKCESIKAIRDQSSCGSCWA 150



 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 21/30 (70%), Positives = 23/30 (76%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
           R CGFGCNGG P  AWRYWVK GIV+G  +
Sbjct: 183 RSCGFGCNGGDPLAAWRYWVKDGIVTGSNF 212


>gi|22535408|emb|CAC87118.1| cathepsin B-like protease [Nilaparvata lugens]
          Length = 347

 Score =  136 bits (342), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 75/195 (38%), Positives = 103/195 (52%), Gaps = 40/195 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGH-TPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
           GC+PY IAPCEHH+ G++P+C AS    TP C   C     + Y+KD   G  +Y V   
Sbjct: 189 GCQPYPIAPCEHHMEGSKPNCSASPTEPTPACETTCTHGSSLAYQKDRQKGKSAYLVPVG 248

Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
           EK    EI+++GP+  AF V++D  +YKSG +                      +   E 
Sbjct: 249 EKQTQLEIFKNGPIVAAFKVYEDFFMYKSGVY----------------------KRHPES 286

Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
            F               G HA++++GWG  E++   YWL+ NSW+ DWGD GLFKI RG 
Sbjct: 287 PFR--------------GRHAVKVIGWG--EQNGLPYWLVQNSWDYDWGDKGLFKIARG- 329

Query: 293 DECGIESSITAGVPK 307
           +EC  E S+TAG+PK
Sbjct: 330 NECDFEKSMTAGLPK 344



 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 32/91 (35%), Positives = 48/91 (52%), Gaps = 22/91 (24%)

Query: 44  NSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDL------------------- 84
           ++++N P++  K+    HPD   P + L  L+G SE++ +L                   
Sbjct: 34  DAINNNPKSTWKAGHNFHPD--TPMSYLQGLLGVSELESNLADLDKYEEMEENEENKKIK 91

Query: 85  -PANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
            P  FD+R KW  C ++REIRDQG+CGSCW 
Sbjct: 92  VPKYFDARKKWKKCKSLREIRDQGNCGSCWA 122



 Score = 42.0 bits (97), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 17/30 (56%), Positives = 21/30 (70%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CGFGC GGFP  AW +  + G+V+GG Y S
Sbjct: 157 CGFGCEGGFPDAAWVFIKRHGLVTGGDYHS 186


>gi|5031250|gb|AAD38132.1|AF127592_1 vitellogenic cathepsin-B like protease [Aedes aegypti]
          Length = 386

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 94/304 (30%), Positives = 136/304 (44%), Gaps = 100/304 (32%)

Query: 65  NLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW----------- 113
           +L   +LP  I     D DLP  FD+R KWP CP++REIRDQG CGSCW           
Sbjct: 106 DLERTKLPLGIMADVEDLDLPDTFDAREKWPECPSLREIRDQGCCGSCWAVSAASAMTDR 165

Query: 114 ---------------------------GCRPYEIAPC-----EHHVNGTRPSCDASKGHT 141
                                      GCR   + P      E  ++   P  ++ +G  
Sbjct: 166 WCVRSKGKEQFIFGSLDLLSCCHSCGQGCRGGTLGPAWQFWVEKGLSSGGP-LNSRQGCH 224

Query: 142 PKCVRECQ---ENYDVP--------------YKKDLNFGAKSYSVSSNEKSIMKEIYEHG 184
           P  + EC+   E+ D P                +D + G  +YS+ ++E+ IM+EI+ +G
Sbjct: 225 PYPIGECRIPGEDEDTPKCSNKCRSGYNVTDVWQDRHIGRVAYSLPNDERKIMEEIFING 284

Query: 185 PVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYK 244
           PV+ AF  + DL  YKSG                                     +  + 
Sbjct: 285 PVQAAFHTYLDLHAYKSG-------------------------------------IYRHV 307

Query: 245 SGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAG 304
            G   GGHA+++LGWG +  +  KYWL+ANSW  +WG+NG FK++RG++ CGIE +I AG
Sbjct: 308 WGPLSGGHAVKLLGWGVE--NGVKYWLVANSWGREWGENGFFKMVRGENHCGIEENIHAG 365

Query: 305 VPKL 308
           +P  
Sbjct: 366 LPNF 369



 Score = 39.7 bits (91), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 17/32 (53%), Positives = 22/32 (68%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CG GC GG  G AW++WV+ G+ SGG   S+Q
Sbjct: 190 CGQGCRGGTLGPAWQFWVEKGLSSGGPLNSRQ 221


>gi|226474168|emb|CAX71570.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 73/209 (34%), Positives = 107/209 (51%), Gaps = 39/209 (18%)

Query: 99  TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
            +R I   GS  +  GCRPY    C+H V G   +C      TP+C + CQ+ Y+  Y++
Sbjct: 172 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQ 231

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
           D ++G  SY+V S E  I K+I  HGP E    +++D + YKSG                
Sbjct: 232 DKHYGGFSYNVLSVESVIQKDIMMHGPAEAYLEIYEDFLNYKSG---------------- 275

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
                                +  Y +G+ + GHA+R++GWG +  +   YWL AN+WN 
Sbjct: 276 ---------------------IYRYTTGQFISGHAVRLIGWGVENGT--AYWLAANTWNE 312

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
           DWG+ G F+I+RG++EC IES I AG+ K
Sbjct: 313 DWGEKGYFRIVRGRNECLIESEIAAGLIK 341



 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 1/56 (1%)

Query: 59  GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           G   D NL   R P  + + +++ ++P++FDSR KWP C +I +IRDQ  CGS W 
Sbjct: 66  GRKEDPNLREKRRP-TVDHHDLNVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120



 Score = 42.4 bits (98), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 17/29 (58%), Positives = 22/29 (75%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           + CG GC+GGF G +W YWV  GIV+GG+
Sbjct: 153 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 181


>gi|343476048|emb|CCD12737.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score =  135 bits (340), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 105/299 (35%), Positives = 131/299 (43%), Gaps = 42/299 (14%)

Query: 39  KQAEKNSLSNIPRAHLKSWMGV--HPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
           K      + NI  A  +   G       +LP  R  E     ++  +LP +FDS  KWPN
Sbjct: 47  KAVYNGKMQNITFAEARRLTGAFRRKTSSLPPVRFTE----EQLRTELPESFDSAEKWPN 102

Query: 97  CPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPY 156
           CPTIREI DQ +CGSCW           H   G       S  H   C ++C +  D  Y
Sbjct: 103 CPTIREIADQSACGSCWAVSTASAISDRHCTVGGVQQLRISAAHLLSCCKDCGDGCDGGY 162

Query: 157 KKDLNFGAKSYSVSSNEKSIMKEIY--EHGPVEGAFTVFDDLILYKSGRFFVPGNETT-- 212
                  A  Y VS    S   + Y   H    G          Y    F  P   TT  
Sbjct: 163 PDS----AWEYYVSHGLASSYCQPYPFPHCGHHGGKGKKPPCSKYD---FHTPKCNTTCT 215

Query: 213 --AMSLIKWTIRDNTSQLGAEG--------------AFTVFDDLILYK-------SGKAL 249
             A+ LIK+   D+   L  E               AF V+ D + YK       SG  L
Sbjct: 216 DKAIPLIKYRGNDSYVLLHGEDDFKRELYFNGPFVVAFQVYSDFLAYKTGVYRHVSGDFL 275

Query: 250 GGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           GGHA+RI+GWG+   +   YW IANSW+TDWG NG F ILRG +ECGIES+  AG+P +
Sbjct: 276 GGHAVRIVGWGKLNGT--PYWKIANSWDTDWGMNGHFLILRGNNECGIESTGYAGLPAI 332


>gi|329669000|gb|AEB96388.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
          Length = 232

 Score =  135 bits (340), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 74/195 (37%), Positives = 100/195 (51%), Gaps = 40/195 (20%)

Query: 114 GCRPYEIAPCEHHV-NGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
           GC+PY   PCEHH+       C      T  C  +CQ+ Y + Y  D ++GA  Y+V+ +
Sbjct: 77  GCKPYPYPPCEHHIPEHHYKKCPKDIYPTNTCEYKCQDGYSISYNSDKHYGASVYAVAQD 136

Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
             SI KEI  +GPVE AF V++D   Y SG                              
Sbjct: 137 VASIQKEIMTNGPVEVAFDVYEDFEHYSSG------------------------------ 166

Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
                  +  + +G  LGGHA+++LGWG +  +   YW+ ANSWN+DWG+NG F+ILRG 
Sbjct: 167 -------IYKHTTGDYLGGHAVKMLGWGTENGTD--YWICANSWNSDWGENGFFRILRGV 217

Query: 293 DECGIESSITAGVPK 307
           DEC IES + AG PK
Sbjct: 218 DECEIESGVVAGEPK 232



 Score = 44.7 bits (104), Expect = 0.056,   Method: Compositional matrix adjust.
 Identities = 24/52 (46%), Positives = 28/52 (53%), Gaps = 5/52 (9%)

Query: 9  CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK-----NSLSNIPRAHLK 55
          CGFGC+G  P  AW YWV +GIV+G  Y SK   K         +IP  H K
Sbjct: 45 CGFGCDGRDPYAAWSYWVSNGIVTGSNYTSKSGCKPYPYPPCEHHIPEHHYK 96


>gi|340380665|ref|XP_003388842.1| PREDICTED: cathepsin B-like [Amphimedon queenslandica]
          Length = 333

 Score =  135 bits (339), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 94/258 (36%), Positives = 126/258 (48%), Gaps = 43/258 (16%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPK 143
           +P  FD+R KW +CP+I +IRDQGSCGSCW     E A  + +    + +   S  +   
Sbjct: 82  IPDTFDARQKWSDCPSISDIRDQGSCGSCWALGAVE-AMSDRYCVSFQENVHISAENLMT 140

Query: 144 CVREC---------QENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEH--GPVE----- 187
           C + C         Q+ ++   K  L  G +  S    +  ++ +   H  GP E     
Sbjct: 141 CCKFCGNGCAGGFLQQAWEYWVKDGLVTGGQYGSDEGCQPYLIPKCNHHEPGPYENCTGE 200

Query: 188 -----------GAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTV 236
                        +T   +  L+   + +    E  A   I+  I  N      EGAFTV
Sbjct: 201 GKTPQCERTCRSGYTTSYEADLHYGEKAYAVHREVEA---IQTEIMTNGP---VEGAFTV 254

Query: 237 FDDLILYKS-------GKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKIL 289
           + D   YKS       G ALGGHAIRILGWG +  +   YWLIANSWN  WGD G FK++
Sbjct: 255 YSDFPTYKSGVYQHVVGHALGGHAIRILGWGTE--NGVPYWLIANSWNPSWGDKGYFKMI 312

Query: 290 RGKDECGIESSITAGVPK 307
           RGKD+CGIES+I AG PK
Sbjct: 313 RGKDDCGIESNIVAGTPK 330



 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 23/47 (48%), Positives = 30/47 (63%), Gaps = 2/47 (4%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAH 53
           + CG GC GGF   AW YWVK G+V+GG YGS +  +  L  IP+ +
Sbjct: 143 KFCGNGCAGGFLQQAWEYWVKDGLVTGGQYGSDEGCQPYL--IPKCN 187


>gi|308511959|ref|XP_003118162.1| CRE-CPR-6 protein [Caenorhabditis remanei]
 gi|308238808|gb|EFO82760.1| CRE-CPR-6 protein [Caenorhabditis remanei]
          Length = 387

 Score =  135 bits (339), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 75/198 (37%), Positives = 104/198 (52%), Gaps = 41/198 (20%)

Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENY-DVPYKKDLNFGAKSYSVSS 171
           GC+PY   PCEHH   T    C      TPKC ++C  +Y D  Y +D  +GA +Y V  
Sbjct: 202 GCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKCEKKCIADYTDKTYSEDKFYGASAYGVKD 261

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           + ++I KE+  HGP+E AF V++D + Y  G +                           
Sbjct: 262 DVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVY--------------------------- 294

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
                     ++  GK  GGHA++++GWG +  +   YW  ANSWNTDWG++G F+ILRG
Sbjct: 295 ----------VHTGGKLGGGHAVKLVGWGIE--NGIPYWTCANSWNTDWGEDGFFRILRG 342

Query: 292 KDECGIESSITAGVPKLD 309
            DECGIES +  GVPKL+
Sbjct: 343 VDECGIESGVVGGVPKLN 360



 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 23/36 (63%), Positives = 27/36 (75%)

Query: 79  EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           ++D D+P NFDSR  WP C +IR IRDQ SCGSCW 
Sbjct: 100 DLDMDIPENFDSRENWPKCQSIRNIRDQSSCGSCWA 135



 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 23/37 (62%), Positives = 25/37 (67%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK 43
           R CGFGCNGG P  AWRYWVK GIV+G  Y +    K
Sbjct: 168 RSCGFGCNGGDPLAAWRYWVKDGIVTGSNYTANSGCK 204


>gi|226497010|ref|NP_001150152.1| LOC100283781 precursor [Zea mays]
 gi|195637168|gb|ACG38052.1| cathepsin B-like cysteine proteinase 3 precursor [Zea mays]
          Length = 347

 Score =  134 bits (338), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 96/315 (30%), Positives = 134/315 (42%), Gaps = 96/315 (30%)

Query: 46  LSNIPRAHLKSWMGVHPDYNLPANRLPEL-IGYSEVDEDLPANFDSRTKWPNCPTIREIR 104
            SN   A  K  +GV P    P N L  + +       +LP  FD+R+ W  C TI  I 
Sbjct: 57  FSNYTIAQFKHILGVKP---APQNALSNVPVKTYSRSLELPKEFDARSAWSRCSTIGNIL 113

Query: 105 DQGSCGSCWGCRPYEIAP---CEH-------HVNGTRPSC-----DASKGHTP------- 142
           DQG CGSCW     E      C H        VN     C     D   G  P       
Sbjct: 114 DQGHCGSCWAFGAVECLQDRFCIHLNMSILLSVNDLLACCGFMCGDGCDGGYPIEAWRYF 173

Query: 143 -------------------------------KCVRECQENYDVPYKKDLNFGAKSYSVSS 171
                                          KC ++C+E   V +++  +F   +Y ++S
Sbjct: 174 VQNGVVTDECDPYFDPVGCKHPGCEPAYPTPKCEKKCKEQNQV-WQEKKHFSIDAYRINS 232

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           +   IM E+Y++GPVE AFTV++D   YKSG +                           
Sbjct: 233 DPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVY--------------------------- 265

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
                      + +G  +GGHA++++GWG  + + E YWL+AN WN  WGD+G FKI+RG
Sbjct: 266 ----------KHITGGIMGGHAVKLIGWGTSD-AGEDYWLLANQWNRGWGDDGYFKIIRG 314

Query: 292 KDECGIESSITAGVP 306
           K+ECGIE  + AG+P
Sbjct: 315 KNECGIEEGVVAGMP 329



 Score = 38.5 bits (88), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 14/25 (56%), Positives = 22/25 (88%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVS 32
           +CG GC+GG+P  AWRY+V++G+V+
Sbjct: 156 MCGDGCDGGYPIEAWRYFVQNGVVT 180


>gi|268566077|ref|XP_002647467.1| Hypothetical protein CBG06539 [Caenorhabditis briggsae]
          Length = 332

 Score =  134 bits (337), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 87/283 (30%), Positives = 121/283 (42%), Gaps = 96/283 (33%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGS-----------------CGSCWGCR-----PYEIA 121
           +P  FD+RTKWP C +I+ IR+Q +                 C +  G R     P ++ 
Sbjct: 87  IPETFDARTKWPKCKSIKLIRNQANCGSCWAFGAAEVISDRICIATKGARQPVISPMDMV 146

Query: 122 PC------------------------------EHHVNGTRP-----SCDASKGHTPKCVR 146
            C                              ++  +G +P     S       TP+C  
Sbjct: 147 DCCGEYCGYGCDGGYSIQALRWWVFDGVVTGGDYQGDGCKPYQFCNSAGCPDAVTPECAL 206

Query: 147 ECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFV 206
            CQ  Y+  Y KD NFG  +Y V     +I  +I  +GPVE +F V++D   YKSG    
Sbjct: 207 SCQSKYNTEYAKDKNFGTSAYYVGMTVNAIQTDIMTNGPVEASFKVYEDFYKYKSG---- 262

Query: 207 PGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSK 266
                                            +  Y +GK LGGHAI+I+GWG +  + 
Sbjct: 263 ---------------------------------VYKYIAGKMLGGHAIKIIGWGTENGTA 289

Query: 267 EKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKLD 309
             YWLIANSW T WG+NG FKI RG +ECGIE+++ AG   +D
Sbjct: 290 --YWLIANSWGTKWGENGFFKIRRGVNECGIENNVVAGKADVD 330



 Score = 38.9 bits (89), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 15/28 (53%), Positives = 21/28 (75%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
           CG+GC+GG+   A R+WV  G+V+GG Y
Sbjct: 153 CGYGCDGGYSIQALRWWVFDGVVTGGDY 180


>gi|29374025|gb|AAO73003.1| cathepsin B [Fasciola gigantica]
          Length = 339

 Score =  134 bits (337), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 90/300 (30%), Positives = 133/300 (44%), Gaps = 111/300 (37%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPA 68
           CG GC GG+P  AW YW++ GIV+GG + +            R   + WM    D+    
Sbjct: 151 CGQGCRGGYPPKAWDYWMREGIVTGGTWEN------------RTGCQPWMFTKCDH---- 194

Query: 69  NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVN 128
                 +G            DSR K+  CP                          H+  
Sbjct: 195 ------VG------------DSR-KYSRCP--------------------------HYTY 209

Query: 129 GTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEG 188
            T P           C R CQ  Y+  Y++D  +G  SY+V  +E  IM+EI ++GPVE 
Sbjct: 210 PTPP-----------CARACQTGYNKTYEQDKFYGNSSYNVGEHESYIMQEIMKNGPVEV 258

Query: 189 AFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKA 248
            F +F D  +Y+SG                                     +  + +GK 
Sbjct: 259 TFAIFQDFGVYRSG-------------------------------------IYHHVAGKF 281

Query: 249 LGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           +G HA+R++GWG +  +   YWL+ANSWN +WG+NG F+++RG++ECGIES + AG+P+L
Sbjct: 282 IGRHAVRMIGWGVE--NGVNYWLMANSWNEEWGENGYFRMVRGRNECGIESEVVAGMPRL 339



 Score = 62.0 bits (149), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 32/76 (42%), Positives = 40/76 (52%), Gaps = 2/76 (2%)

Query: 39  KQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCP 98
           K A     SN+   H K  +G   +     N L   I +     DLP +FD+R++WP C 
Sbjct: 43  KAARSTRFSNVD--HFKLHLGALSETPEERNALRPTIKHDISKNDLPESFDARSQWPQCW 100

Query: 99  TIREIRDQGSCGSCWG 114
           TI EIRDQ SCGSCW 
Sbjct: 101 TISEIRDQASCGSCWA 116


>gi|384597848|gb|AFI23675.1| cathepsin B, partial [Brugia malayi]
          Length = 319

 Score =  134 bits (337), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 73/185 (39%), Positives = 96/185 (51%), Gaps = 40/185 (21%)

Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
           GCRPY   PCEHH N T    C      TPKC ++C +NY   YK D  +G ++Y+V ++
Sbjct: 174 GCRPYPFPPCEHHSNKTHYEPCKHDLYPTPKCYKQCDKNYTKSYKADKYYGEQAYNVEND 233

Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
            +SI KEI   GPVE +F V+ D + Y SG                              
Sbjct: 234 VESIQKEIMTLGPVEASFEVYTDFLHYTSG------------------------------ 263

Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
                  +  + +G   GGHA++ILGWG D+     YWL ANSWN DWG++G F+ILRG 
Sbjct: 264 -------IYKHVAGSVGGGHAVKILGWGIDQGV--SYWLAANSWNNDWGEDGYFRILRGA 314

Query: 293 DECGI 297
           DECG+
Sbjct: 315 DECGM 319



 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 19/36 (52%), Positives = 24/36 (66%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYE 119
           +P +FD+R  WP C ++R IRDQ SCGSCW     E
Sbjct: 77  IPESFDARKNWPECASLRNIRDQSSCGSCWAVAAVE 112



 Score = 45.8 bits (107), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 19/30 (63%), Positives = 22/30 (73%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
           + CGFGC GG P  AW+YWV SGIV+G  Y
Sbjct: 140 KTCGFGCFGGEPMAAWKYWVLSGIVTGSDY 169


>gi|166030312|gb|ABY78823.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 335

 Score =  134 bits (337), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 105/298 (35%), Positives = 127/298 (42%), Gaps = 41/298 (13%)

Query: 39  KQAEKNSLSNIPRAHLKSWMG--VHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
           K      + NI  A  +   G  +    +LP  R  E     ++  +LP +FDS  KWPN
Sbjct: 47  KAVYNGKMQNITFAEARRLTGARIQKTSSLPPVRFTE----EQLRTELPESFDSAEKWPN 102

Query: 97  CPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPY 156
           CPTIREI DQ +CGSCW           H   G       S  H   C ++C    D  Y
Sbjct: 103 CPTIREIADQSACGSCWAVSTASAISDRHCTVGGVQQLRISAAHLLSCCKDCGYGCDGGY 162

Query: 157 KKDLNFGAKSYSVSSNEKSIMKEIY--EHGPVEGAFTVFDDLILYKSGRFFVPGNETT-- 212
                  A  Y VS    S   + Y   H    G          Y    F  P   TT  
Sbjct: 163 PD----AAWRYYVSHGLASSYCQPYPFPHCDHHGGKGKKPPCSKYD---FHTPKCNTTCT 215

Query: 213 --AMSLIKWTIRDNTSQLGAEG-------------AFTVFDDLILYK-------SGKALG 250
             A+ LIK+    +    G E              AF V+ D   YK       SG  LG
Sbjct: 216 DKAIPLIKYRGNHSYEVHGEEDYKRELYFNGPFVVAFQVYSDFFAYKTGVYRHVSGDVLG 275

Query: 251 GHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           GHA+RI+GWG+   +   YW IANSW+TDWG NG F ILRGKDECGIE    AG P +
Sbjct: 276 GHAVRIVGWGKLNGT--PYWKIANSWDTDWGMNGHFLILRGKDECGIEHQGYAGSPAI 331



 Score = 40.0 bits (92), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 14/24 (58%), Positives = 19/24 (79%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVS 32
           CG+GC+GG+P  AWRY+V  G+ S
Sbjct: 154 CGYGCDGGYPDAAWRYYVSHGLAS 177


>gi|984960|gb|AAC46878.1| cathepsin B proteinase, partial [Ancylostoma caninum]
          Length = 340

 Score =  134 bits (336), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 71/195 (36%), Positives = 100/195 (51%), Gaps = 40/195 (20%)

Query: 115 CRPYEIAPCEHHVNGTRPS-CDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           C+PY   PC +H N      C      TPKC + CQ  Y+  Y +D  F  +SY + SNE
Sbjct: 185 CKPYAFYPCGNHTNERYYGPCPRGLWPTPKCRKACQRKYNKSYNEDKYFATRSYYLPSNE 244

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           +SI +EIY++GPV  AF V+ D   Y+ G                               
Sbjct: 245 RSIREEIYKNGPVVAAFKVYQDFSYYRGG------------------------------- 273

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                 + ++K G   G HA++++GWG +  +   YWLIANSWNTDWG+NG F+I RG +
Sbjct: 274 ------IYVHKWGGQTGAHAVKVVGWGRENGTD--YWLIANSWNTDWGENGYFRIARGSN 325

Query: 294 ECGIESSITAGVPKL 308
           ECGIE  + +GV ++
Sbjct: 326 ECGIEGQMVSGVMRV 340



 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 21/44 (47%), Positives = 27/44 (61%), Gaps = 5/44 (11%)

Query: 71  LPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           L E+ G     +D P +FD+R  WP C +I  IRDQ +CGSCW 
Sbjct: 79  LTEVFG-----DDPPDSFDARAHWPECRSIGTIRDQSACGSCWA 117


>gi|357511627|ref|XP_003626102.1| Cathepsin L-like proteinase [Medicago truncatula]
 gi|355501117|gb|AES82320.1| Cathepsin L-like proteinase [Medicago truncatula]
          Length = 351

 Score =  134 bits (336), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 96/322 (29%), Positives = 137/322 (42%), Gaps = 110/322 (34%)

Query: 46  LSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRD 105
            SN      K  +GV        +  P +     +   LP +FD+RT W  C TI  I D
Sbjct: 59  FSNFTVGQFKRLLGVKQTPRSELSSAPVVTHPKSLK--LPKDFDARTAWSQCSTIGRILD 116

Query: 106 QGSCGSCW-------------------------------------GC---RPY------- 118
           QG CGSCW                                     GC    P+       
Sbjct: 117 QGHCGSCWAFGAVESLSDRFCIHFDMNVSLSVNDILACCGLLCGAGCAGGTPFSAWIYLA 176

Query: 119 -------EIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSS 171
                  E  P    +  + P C+ +   TPKCV++C  N +  ++   ++  K+Y+V+S
Sbjct: 177 HHGVVTEECDPYFDQIGCSHPGCEPTY-RTPKCVKKCV-NGNQLWETSKHYSVKAYTVNS 234

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           + + IM E+Y++GPVE A                                          
Sbjct: 235 DPQDIMAEVYKNGPVEVA------------------------------------------ 252

Query: 232 GAFTVFDDLILYKSGK-------ALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNG 284
             FTV++D   YKSG        ALGGHA++++GWG   +  E YWL+AN WNT+WGD+G
Sbjct: 253 --FTVYEDFAHYKSGVYKHITGFALGGHAVKLVGWGTSHEG-EDYWLLANQWNTNWGDDG 309

Query: 285 LFKILRGKDECGIESSITAGVP 306
            FKI RG +ECGIE+++TAG+P
Sbjct: 310 YFKIKRGTNECGIENAVTAGLP 331


>gi|166030314|gb|ABY78824.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 335

 Score =  134 bits (336), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 108/301 (35%), Positives = 133/301 (44%), Gaps = 47/301 (15%)

Query: 39  KQAEKNSLSNIPRAHLKSWMG--VHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
           K      + NI  A  +   G  +    +LP  R  E     ++  +LP +FDS  KWPN
Sbjct: 47  KAVYNGKMQNITFAEARRLTGARIQKTSSLPPVRFTE----EQLRTELPESFDSAEKWPN 102

Query: 97  CPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPY 156
           CPTIREI DQ +CGSCW           +   G       S  H   C ++C    D  Y
Sbjct: 103 CPTIREIADQSACGSCWAVSTASAISDRYCTVGGVQQLRISAAHLLSCCKDCGYGCDGGY 162

Query: 157 KKDLNFGAKSYSVSSNEKSIMKEIY--EHGPVEGAFTVFDDLILYKSGRFFVPGNETT-- 212
                  A  Y VS    S   + Y   H    G          Y    F  P   TT  
Sbjct: 163 PGT----AWEYYVSHGLASSYCQPYPFPHCGHHGGKGKKPPCSKYD---FHTPKCNTTCT 215

Query: 213 --AMSLIKWTIRDNTSQLGAEG----------------AFTVFDDLILYK-------SGK 247
             A+ LIK+  R N S  G +G                AF V+ D + YK       SG 
Sbjct: 216 DKAIPLIKY--RGNHS-YGLDGEDDYKRELYFNGPFVVAFQVYSDFLAYKTGVYRHVSGD 272

Query: 248 ALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPK 307
            LGGHA+RI+GWG+   +   YW IANSW+TDWG NG F ILRGKDECGIES   AG+P 
Sbjct: 273 VLGGHAVRIVGWGKLNGT--PYWKIANSWDTDWGMNGHFLILRGKDECGIESEGYAGLPA 330

Query: 308 L 308
           +
Sbjct: 331 I 331



 Score = 40.0 bits (92), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 14/24 (58%), Positives = 19/24 (79%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVS 32
           CG+GC+GG+PG AW Y+V  G+ S
Sbjct: 154 CGYGCDGGYPGTAWEYYVSHGLAS 177


>gi|154761391|gb|ABS85545.1| cathepsin B preproprotein [Biomphalaria glabrata]
          Length = 333

 Score =  134 bits (336), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 78/200 (39%), Positives = 102/200 (51%), Gaps = 41/200 (20%)

Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
           G  G+  GC PY +  C+HH  G    C A    TPKC ++C   Y   Y  D   G KS
Sbjct: 175 GQYGTNEGCMPYSLPHCDHHTTGKYQPCPAVV-PTPKCEKKCLTGYPKSYSNDKTRGKKS 233

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           Y V   + SIM+E+ ++GPV  AF V+ D + YK+G                        
Sbjct: 234 YGVRGVQ-SIMQELVDNGPVTAAFDVYSDFLSYKTG------------------------ 268

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
                        +  + +G   GGHA++I+G+G +  S + YWL+ANSWN DWGD G F
Sbjct: 269 -------------VYRHTTGSYEGGHAVKIIGYGTE--SGQDYWLVANSWNEDWGDKGFF 313

Query: 287 KILRGKDECGIESSITAGVP 306
           KI +GKDECGIESSI AG P
Sbjct: 314 KIAKGKDECGIESSIVAGDP 333



 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 18/34 (52%), Positives = 26/34 (76%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           + CG GCNGG+P  AW ++V +G+VSGG YG+ +
Sbjct: 148 KSCGMGCNGGYPAAAWEWYVDTGVVSGGQYGTNE 181


>gi|87240981|gb|ABD32839.1| Peptidase C1A, papain; Somatotropin hormone; Peptidase C1,
           propeptide [Medicago truncatula]
          Length = 356

 Score =  134 bits (336), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 90/284 (31%), Positives = 127/284 (44%), Gaps = 108/284 (38%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCW------------------------------ 113
           LP +FD+RT W  C TI  I DQG CGSCW                              
Sbjct: 100 LPKDFDARTAWSQCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHFDMNVSLSVNDILAC 159

Query: 114 -------GC---RPY--------------EIAPCEHHVNGTRPSCDASKGHTPKCVRECQ 149
                  GC    P+              E  P    +  + P C+ +   TPKCV++C 
Sbjct: 160 CGLLCGAGCAGGTPFSAWIYLAHHGVVTEECDPYFDQIGCSHPGCEPTY-RTPKCVKKCV 218

Query: 150 ENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGN 209
            N +  ++   ++  K+Y+V+S+ + IM E+Y++GPVE A                    
Sbjct: 219 -NGNQLWETSKHYSVKAYTVNSDPQDIMAEVYKNGPVEVA-------------------- 257

Query: 210 ETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGK-------ALGGHAIRILGWGED 262
                                   FTV++D   YKSG        ALGGHA++++GWG  
Sbjct: 258 ------------------------FTVYEDFAHYKSGVYKHITGFALGGHAVKLVGWGTS 293

Query: 263 EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
            +  E YWL+AN WNT+WGD+G FKI RG +ECGIE+++TAG+P
Sbjct: 294 HEG-EDYWLLANQWNTNWGDDGYFKIKRGTNECGIENAVTAGLP 336


>gi|170586854|ref|XP_001898194.1| cathepsin B-like cysteine proteinase [Brugia malayi]
 gi|158594589|gb|EDP33173.1| cathepsin B-like cysteine proteinase, putative [Brugia malayi]
          Length = 384

 Score =  134 bits (336), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 80/207 (38%), Positives = 101/207 (48%), Gaps = 57/207 (27%)

Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
           GCRPY   PCEHH N T    C      TPKC ++C +NY   YK D  +G ++Y+V ++
Sbjct: 218 GCRPYPFPPCEHHSNKTHYEPCKHDLYPTPKCYKQCDKNYTKSYKADKYYGEQAYNVEND 277

Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
            +SI KEI   GPVE +                                           
Sbjct: 278 VESIQKEIMTLGPVEAS------------------------------------------- 294

Query: 233 AFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDN-- 283
            F V+ D + Y SG          GGHA++ILGWG D+     YWL ANSWN DWG++  
Sbjct: 295 -FEVYTDFLHYTSGIYKHVAGSVGGGHAVKILGWGIDQGVS--YWLAANSWNNDWGEDVF 351

Query: 284 -GLFKILRGKDECGIESSITAGVPKLD 309
            G F+ILRG DECGIES I AG+P+ D
Sbjct: 352 SGYFRILRGADECGIESGIVAGIPRKD 378



 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 19/36 (52%), Positives = 24/36 (66%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYE 119
           +P +FD+R  WP C ++R IRDQ SCGSCW     E
Sbjct: 121 IPESFDARKNWPECASLRNIRDQSSCGSCWAVAAVE 156



 Score = 46.2 bits (108), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 19/30 (63%), Positives = 22/30 (73%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
           + CGFGC GG P  AW+YWV SGIV+G  Y
Sbjct: 184 KTCGFGCFGGEPMAAWKYWVLSGIVTGSDY 213


>gi|194246069|gb|ACF35526.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
           variabilis]
          Length = 277

 Score =  134 bits (336), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 75/194 (38%), Positives = 97/194 (50%), Gaps = 42/194 (21%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY   PCEHH  G  P+C  +K  TPKC++ C++ Y+  Y +D  F    YS+ S+E
Sbjct: 122 GCQPYYFPPCEHHTKGPLPNCTDTKP-TPKCLQVCRKGYEKSYSEDKYFAKTVYSLHSDE 180

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
             I  EIY++GPVE  F+V+ D + YKSG +          S   W  R           
Sbjct: 181 TQIKTEIYKNGPVEADFSVYTDFLAYKSGVY-------QRHSYELWEARHQN-------- 225

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                                  LGW    +S    WL+ANSWN DWGD G FKI RG +
Sbjct: 226 -----------------------LGWALKRRS---VWLVANSWNQDWGDKGYFKIRRGNN 259

Query: 294 ECGIESSITAGVPK 307
           ECGIE+ I AG+PK
Sbjct: 260 ECGIENDINAGIPK 273



 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 22/43 (51%), Positives = 30/43 (69%), Gaps = 1/43 (2%)

Query: 70  RLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSC 112
           RLP  + + E+ EDLP +FD+R  W +C +I  IRDQ +CGSC
Sbjct: 12  RLPIRL-HEEIPEDLPESFDAREAWSHCDSIHLIRDQSTCGSC 53



 Score = 43.1 bits (100), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 17/30 (56%), Positives = 21/30 (70%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GC GG+P  AW Y+   GIV+GG YG+
Sbjct: 90  CGMGCFGGYPSAAWDYYKDEGIVTGGLYGT 119


>gi|984958|gb|AAC46877.1| cathepsin B-like proteinase [Ancylostoma caninum]
          Length = 343

 Score =  134 bits (336), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 68/195 (34%), Positives = 102/195 (52%), Gaps = 40/195 (20%)

Query: 115 CRPYEIAPCEHHVNGTRPS-CDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           C+PY   PC HH N      C      TPKC + CQ  Y+  Y++D +F  ++Y + +NE
Sbjct: 188 CKPYAFYPCGHHQNDPYYGPCPGGLWPTPKCRKTCQRKYNKSYQEDKHFATRAYYLPNNE 247

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           ++I +EIY++GPV  AF V+ D   YK G                               
Sbjct: 248 RNIRQEIYKNGPVVAAFRVYQDFSYYKKG------------------------------- 276

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                 + ++K G   G HA++++GWG +  +   YWLIANSWNTDWG++G F+I+RG +
Sbjct: 277 ------IYVHKWGGQTGAHAVKVVGWGRENATD--YWLIANSWNTDWGESGYFRIVRGTN 328

Query: 294 ECGIESSITAGVPKL 308
           ECGIE+ +  G  ++
Sbjct: 329 ECGIEAQMVGGAMRV 343



 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 21/32 (65%), Positives = 24/32 (75%)

Query: 83  DLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           D PA+FD+RT WP C +I  IRDQ SCGSCW 
Sbjct: 88  DPPASFDARTHWPECRSIGTIRDQSSCGSCWA 119



 Score = 37.4 bits (85), Expect = 9.7,   Method: Compositional matrix adjust.
 Identities = 15/35 (42%), Positives = 24/35 (68%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK 43
           CG+GC GG+P  A+++  + G+V+GG Y  K+  K
Sbjct: 155 CGYGCQGGWPIEAYKWMQRDGVVTGGKYRQKKVCK 189


>gi|340380685|ref|XP_003388852.1| PREDICTED: cathepsin B-like [Amphimedon queenslandica]
          Length = 341

 Score =  134 bits (336), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 97/297 (32%), Positives = 128/297 (43%), Gaps = 107/297 (36%)

Query: 79  EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIA----------------- 121
           EV   +P  FD+R KWP+CPTI  +RDQG+CGSCW     E                   
Sbjct: 81  EVPAVIPDTFDARQKWPDCPTIGTVRDQGACGSCWAFGAVEAMSDRYCISFKEQVNISAE 140

Query: 122 -------PCEHHVNGTRPSC--------------------DASKGHTPKCVRECQENYDV 154
                   C    +G  P+                     D++ G  P  + +C  +   
Sbjct: 141 NLLSCCETCGSGCDGGYPAAAWRHWADKLLYEGIVTGGQYDSNAGCQPYTIPKCDHHEPG 200

Query: 155 PY------------------------KKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAF 190
           PY                        + D ++G  SYS+SS+  SI  EI  +GPVEGAF
Sbjct: 201 PYENCSGSQSTPSCKRSCISSYDKSYRSDKHYGKNSYSISSDVSSIQTEIMTNGPVEGAF 260

Query: 191 TVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALG 250
           +V+ D   Y SG +                                      + +G  LG
Sbjct: 261 SVYADFPTYTSGVY-------------------------------------QHTTGSFLG 283

Query: 251 GHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPK 307
           GHAI+ILGWG +  +   YWL+ANSWN  WGD+G FKI+RGKDECGIESSI AG+P+
Sbjct: 284 GHAIKILGWGTE--NGVPYWLVANSWNPSWGDSGFFKIIRGKDECGIESSIVAGMPE 338



 Score = 40.4 bits (93), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 18/34 (52%), Positives = 23/34 (67%), Gaps = 4/34 (11%)

Query: 9   CGFGCNGGFPGMAWRYW----VKSGIVSGGAYGS 38
           CG GC+GG+P  AWR+W    +  GIV+GG Y S
Sbjct: 149 CGSGCDGGYPAAAWRHWADKLLYEGIVTGGQYDS 182


>gi|226473760|emb|CAX71565.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  134 bits (336), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 74/209 (35%), Positives = 107/209 (51%), Gaps = 39/209 (18%)

Query: 99  TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
            +R I   GS  +  GCRPY    C+H V G   +C      TP+C + CQ+ Y+  Y++
Sbjct: 172 VLRGIVTGGSKENHTGCRPYPFPKCDHFVKGKYRACGDKLYKTPQCNQTCQKGYNTSYEQ 231

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
           D ++G  SY+V S E  I K+I  HGPVE    +++D + YKSG                
Sbjct: 232 DKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG---------------- 275

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
                                +  Y +GK + GHA+R++G G +  +   YWL AN+WN 
Sbjct: 276 ---------------------IYRYTTGKYISGHAVRLIGCGVENGT--AYWLAANTWNE 312

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
           DWG+ G F+I+RG++EC IES I AG+ K
Sbjct: 313 DWGEKGYFRIVRGRNECLIESEIAAGLIK 341



 Score = 54.3 bits (129), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 24/56 (42%), Positives = 34/56 (60%), Gaps = 1/56 (1%)

Query: 59  GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           G   D NL   R P  + + ++  ++P++FDSR KWP C +I +IRDQ  CGS W 
Sbjct: 66  GRREDPNLREKRRP-TVDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSQCGSSWA 120



 Score = 42.4 bits (98), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 17/29 (58%), Positives = 22/29 (75%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           + CG GC+GGF G +W YWV  GIV+GG+
Sbjct: 153 KYCGSGCDGGFLGPSWDYWVLRGIVTGGS 181


>gi|27526823|emb|CAD32937.1| pro-cathepsin B2 [Fasciola hepatica]
          Length = 337

 Score =  133 bits (335), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 100/199 (50%), Gaps = 44/199 (22%)

Query: 114 GCRPYEIAPCEHHVNGTRPS---CDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVS 170
           GC PY    C H   G+R     C      TP C   CQ  YD  Y+KD  +G  SY+V 
Sbjct: 173 GCLPYPFPQCRH--PGSRSQLNPCPRYTYPTPSCYPYCQAGYDKTYEKDKVYGKTSYNVD 230

Query: 171 SNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA 230
            +E +IM+EI ++GPVE  F V+ D  +YKSG +                          
Sbjct: 231 RHEYTIMEEIMKNGPVEAGFIVYTDFAVYKSGIYH------------------------- 265

Query: 231 EGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILR 290
                       + SG+  G HAIRI+GWG +  +  KYWL ANSWN  WG+NG F+ILR
Sbjct: 266 ------------HVSGRYAGKHAIRIIGWGVE--NGVKYWLTANSWNVGWGENGYFRILR 311

Query: 291 GKDECGIESSITAGVPKLD 309
           G DEC IES + AG+P+L 
Sbjct: 312 GTDECRIESIVVAGMPRLQ 330



 Score = 57.8 bits (138), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 30/76 (39%), Positives = 40/76 (52%), Gaps = 2/76 (2%)

Query: 39  KQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCP 98
           K A  +   NI   H K  +G+  +           + Y+  D DLP +FD+R KWP C 
Sbjct: 33  KAAPSSRFINI--EHFKQHLGLLEETPEERQTRRPTVRYNVSDNDLPESFDAREKWPLCR 90

Query: 99  TIREIRDQGSCGSCWG 114
           +IR+I DQ SCGSCW 
Sbjct: 91  SIRQIPDQSSCGSCWA 106


>gi|44965401|gb|AAS49537.1| cathepsin B [Latimeria chalumnae]
          Length = 225

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 69/147 (46%), Positives = 85/147 (57%), Gaps = 37/147 (25%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RPSC   +G TPKCV +C+  Y   Y KD +FG+ SY+VSSNE
Sbjct: 111 GCRPYTIPPCEHHVNGSRPSCTGEEGDTPKCVMQCEAGYTPSYFKDKHFGSTSYAVSSNE 170

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
             I  EIY++GPVEGAFTV++D + YKSG +                             
Sbjct: 171 ADIQIEIYKNGPVEGAFTVYEDFLQYKSGVY----------------------------- 201

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWG 260
                    + +G A+GGHAIRILGWG
Sbjct: 202 --------KHVTGDAVGGHAIRILGWG 220



 Score = 67.4 bits (163), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 30/43 (69%), Positives = 34/43 (79%), Gaps = 1/43 (2%)

Query: 71  LPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
           LP  +G +  D  LP NFDSRT+WP CPTI+EIRDQGSCGSCW
Sbjct: 1   LPMKLGMA-TDVKLPENFDSRTQWPKCPTIQEIRDQGSCGSCW 42


>gi|262217337|gb|ACY38050.1| cathepsin B [Dactylis glomerata]
          Length = 348

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 98/313 (31%), Positives = 134/313 (42%), Gaps = 92/313 (29%)

Query: 46  LSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRD 105
           L+N      K  +GV P        +P    YS+  E+LP  FD+R+KW  C TI  I D
Sbjct: 58  LANYTIEQFKHILGVKPTPPGLLAGVPTKT-YSK-SEELPKQFDARSKWSGCSTIGTILD 115

Query: 106 QGSCGSCWGCRPYEIAP---CEHH-------VNGTRPSC-----DASKGHTP-------- 142
           QG CGSCW     E      C H         N     C     D   G  P        
Sbjct: 116 QGHCGSCWAFGAVECLQDRFCIHQNINISLSANDLVACCGFMCGDGCDGGYPIKAWQYFV 175

Query: 143 --KCVRE---------------CQENYDVP------------YKKDLNFGAKSYSVSSNE 173
               V E               C+  YD P            +++  +F   +Y V+S+ 
Sbjct: 176 QSGVVTEECDPYFDQVGCKHPGCEPAYDTPKCEKKCKVQNQVWEEKKHFSINAYRVNSDP 235

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
             IM E+Y++GPVE AFTV++D   YKSG +                             
Sbjct: 236 HDIMAEVYKNGPVEVAFTVYEDFAHYKSGVY----------------------------- 266

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G  +GGHA++++GWG  + + E YWL+AN WN  WGD+G FKI+RGK+
Sbjct: 267 --------KHVTGGVMGGHAVKLIGWGTSD-AGEDYWLLANQWNRGWGDDGYFKIIRGKN 317

Query: 294 ECGIESSITAGVP 306
           ECGIE  + AG+P
Sbjct: 318 ECGIEEEVVAGMP 330



 Score = 38.5 bits (88), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 14/25 (56%), Positives = 22/25 (88%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVS 32
           +CG GC+GG+P  AW+Y+V+SG+V+
Sbjct: 157 MCGDGCDGGYPIKAWQYFVQSGVVT 181


>gi|4204370|gb|AAD11445.1| cathepsin B protease, partial [Fasciola hepatica]
          Length = 247

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 90/300 (30%), Positives = 133/300 (44%), Gaps = 111/300 (37%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPA 68
           CG GC GG+P  AW YW++ GIV+GG + +            R   + WM    D+    
Sbjct: 59  CGQGCRGGYPPKAWDYWMREGIVTGGTWEN------------RTGCQPWMFTKCDH---- 102

Query: 69  NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVN 128
                 +G            DSR K+  CP                          H+  
Sbjct: 103 ------VG------------DSR-KYSRCP--------------------------HYTY 117

Query: 129 GTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEG 188
            T P           C R CQ  Y+  Y++D  +G  SY+V  +E  IM+EI ++GPVE 
Sbjct: 118 PTPP-----------CARACQTGYNKTYEQDKFYGNSSYNVGEHESYIMQEIMKNGPVEV 166

Query: 189 AFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKA 248
            F +F D  +Y+SG                                     +  + +GK 
Sbjct: 167 TFAIFQDFGVYRSG-------------------------------------IYHHVAGKF 189

Query: 249 LGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           +G HA+R++GWG +  +   YWL+ANSWN +WG+NG F+++RG++ECGIES + AG+P+L
Sbjct: 190 IGRHAVRMIGWGVE--NGVNYWLMANSWNEEWGENGYFRMVRGRNECGIESEVVAGMPRL 247


>gi|56757271|gb|AAW26807.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 70/194 (36%), Positives = 100/194 (51%), Gaps = 39/194 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY    C+H V G   +C      TP+C + CQ+ Y+  Y++D ++G  SY+V   E
Sbjct: 187 GCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGEFSYNVIGVE 246

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
             I KEI  +GPVE    +++D + YKSG                               
Sbjct: 247 SVIQKEIMMYGPVEAYLHIYEDFLNYKSG------------------------------- 275

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                 +  Y +G+ + GHA+R++GWG +  +   YWL AN+WN DWG+ G F+I+RG+D
Sbjct: 276 ------IYRYTTGQFISGHAVRLIGWGVENGT--SYWLAANTWNEDWGEKGYFRIVRGRD 327

Query: 294 ECGIESSITAGVPK 307
           EC IES I AG  K
Sbjct: 328 ECLIESFIVAGQIK 341



 Score = 43.1 bits (100), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 17/27 (62%), Positives = 21/27 (77%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           CG GC+GG  G +W YWVK GIV+GG+
Sbjct: 155 CGSGCDGGVTGYSWDYWVKHGIVTGGS 181


>gi|56754307|gb|AAW25341.1| unknown [Schistosoma japonicum]
          Length = 309

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 69/194 (35%), Positives = 101/194 (52%), Gaps = 39/194 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY    C+H V G   +C      TP+C + CQ+ Y+  Y++D ++G  SY+V S E
Sbjct: 154 GCRPYPFPKCDHFVKGKYRACGDKLYKTPQCKQTCQKGYNTSYEQDKHYGGFSYNVLSVE 213

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
             I K+I  HG VE    +++D + YKSG                               
Sbjct: 214 SVIQKDIMMHGTVEAYLEIYEDFLNYKSG------------------------------- 242

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                 +  Y +G+ + GHA+R++GWG +  +   YWL AN+WN DWG+ G F+I+RG++
Sbjct: 243 ------IYRYTTGQFISGHAVRLIGWGVENGT--AYWLAANTWNEDWGEKGYFRIVRGRN 294

Query: 294 ECGIESSITAGVPK 307
           EC IES I AG+ K
Sbjct: 295 ECLIESEIAAGLIK 308



 Score = 41.2 bits (95), Expect = 0.68,   Method: Compositional matrix adjust.
 Identities = 16/27 (59%), Positives = 20/27 (74%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           CG GC+GG  G +W YWV  GIV+GG+
Sbjct: 122 CGSGCDGGVTGYSWDYWVSHGIVTGGS 148


>gi|6165885|gb|AAF04727.1|AF101239_1 cathepsin B-like cysteine proteinase [Ipomoea batatas]
          Length = 352

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 93/315 (29%), Positives = 137/315 (43%), Gaps = 97/315 (30%)

Query: 46  LSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRD 105
            S+   +  K  +GV         R P +    E++  LP  FD+RT WP C +I +I D
Sbjct: 60  FSDFTVSQFKRLLGVKKAPKSLLKRTPVVTHSKEIE--LPKTFDARTAWPQCLSIADILD 117

Query: 106 QGSCGSCW-------------------------------------GCR-PYEIAP----- 122
           QG CGSCW                                     GC   Y IA      
Sbjct: 118 QGHCGSCWAFGAVESLTDRFCIHYGTNVTLSVNDLLACCGFLCGEGCDGGYPIAAWQYFK 177

Query: 123 --------CEHHVNGT---RPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSS 171
                   C+ + + T    P C+ +   TP C ++C +  ++ + +  +F   +Y V+S
Sbjct: 178 RTGVVTSECDPYFDQTGCSHPGCEPAYP-TPACEKKCVKK-NLLWSESKHFSVNAYRVNS 235

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           ++ SIM E+Y +GP E +FTV++D   YKSG +                           
Sbjct: 236 DQHSIMTEVYTNGPAEVSFTVYEDFAHYKSGVY--------------------------- 268

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
                      + +G  +GGHA++++GWG  E   E YWL+AN WN  WG +G FKI+RG
Sbjct: 269 ----------KHVTGSEMGGHAVKLIGWGTSEDG-EDYWLLANQWNRSWGGDGYFKIIRG 317

Query: 292 KDECGIESSITAGVP 306
            +ECGIE  +TAG P
Sbjct: 318 TNECGIE-DVTAGTP 331



 Score = 37.4 bits (85), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 13/25 (52%), Positives = 21/25 (84%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVS 32
           LCG GC+GG+P  AW+Y+ ++G+V+
Sbjct: 159 LCGEGCDGGYPIAAWQYFKRTGVVT 183


>gi|268579855|ref|XP_002644910.1| C. briggsae CBR-CPR-6 protein [Caenorhabditis briggsae]
          Length = 376

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 73/198 (36%), Positives = 103/198 (52%), Gaps = 41/198 (20%)

Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENY-DVPYKKDLNFGAKSYSVSS 171
           GC+PY   PCEHH   T    C      TPKC ++C  +Y D  Y +D  +G  +Y V  
Sbjct: 203 GCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKCEKKCIADYTDKTYSEDKFYGHSAYGVKD 262

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           + ++I KE+  HGP+E AF V++D + Y  G +                           
Sbjct: 263 DVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVY--------------------------- 295

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
                     ++  GK  GGHA++++GWG ++     YW  ANSWNTDWG++G F+ILRG
Sbjct: 296 ----------VHTGGKLGGGHAVKLIGWGIEDGIP--YWTCANSWNTDWGEDGFFRILRG 343

Query: 292 KDECGIESSITAGVPKLD 309
            DECGIES +  G+PKL+
Sbjct: 344 VDECGIESGVVGGIPKLN 361



 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 22/36 (61%), Positives = 27/36 (75%)

Query: 79  EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           ++D D+P +FDSR  WP C +IR IRDQ SCGSCW 
Sbjct: 101 DLDLDIPESFDSRENWPKCQSIRNIRDQSSCGSCWA 136



 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 22/30 (73%), Positives = 23/30 (76%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
           R CGFGCNGG P  AWRYWVK GIV+G  Y
Sbjct: 169 RSCGFGCNGGDPLAAWRYWVKDGIVTGSNY 198


>gi|76576339|gb|ABA53863.1| cathepsin B-like cysteine protease 1 [Parelaphostrongylus tenuis]
          Length = 346

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 70/195 (35%), Positives = 103/195 (52%), Gaps = 40/195 (20%)

Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
           GC+PY   PCEH+++  R   C      T  C  +CQ+NY + Y +D ++GA  Y +  +
Sbjct: 191 GCKPYPYPPCEHYIDAGRYKKCPKDLYPTNTCEYKCQDNYTISYDEDKHYGAYPYVLVGD 250

Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
              I +EI  HGPVE  F V++D   Y SG                              
Sbjct: 251 ASFIQQEIMNHGPVEVTFDVYEDFEHYSSG------------------------------ 280

Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
                  +  + +G+ +G HA+++LGWG +  +   YW+ ANSWN+DWG+NG F+ILRG+
Sbjct: 281 -------IYKHMAGEYVGVHAVKMLGWGTE--NGVDYWICANSWNSDWGENGFFRILRGE 331

Query: 293 DECGIESSITAGVPK 307
           +ECGIES++ AG PK
Sbjct: 332 NECGIESNVVAGKPK 346



 Score = 50.4 bits (119), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 25/70 (35%), Positives = 38/70 (54%), Gaps = 2/70 (2%)

Query: 46  LSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDED-LPANFDSRTKWPNCPTIREIR 104
            +N+PR      MG      LPA        ++++D   +P +FD+RT WP C ++R +R
Sbjct: 56  FANLPRDIKHRLMG-SKYVALPAKYRMNEKTHNDIDNSTIPKSFDARTNWPKCASLRTVR 114

Query: 105 DQGSCGSCWG 114
           DQ +CGS W 
Sbjct: 115 DQSACGSGWA 124



 Score = 39.7 bits (91), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 17/35 (48%), Positives = 20/35 (57%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK 43
           CG+GC GG    AW YW   GIV+G  Y +K   K
Sbjct: 159 CGYGCEGGDTYKAWNYWTTDGIVTGSNYTTKSGCK 193


>gi|40557606|gb|AAR88096.1| cathepsin B-like cysteine protease [Callosobruchus maculatus]
          Length = 330

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 93/298 (31%), Positives = 125/298 (41%), Gaps = 100/298 (33%)

Query: 73  ELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIA----------- 121
           E I + +  +DLP  FD+R +W  C +I+EIRDQ  CGSCW      +            
Sbjct: 70  ETIFHEDDGKDLPEEFDARKQWSKCESIKEIRDQSGCGSCWAVSSASVMSDRICIQSDQK 129

Query: 122 ---------------PCEHHVNGT------------RPSCDASKGH-----------TPK 143
                           C   V+G             + S   S G             P+
Sbjct: 130 NQLRISAADMIECCESCTFSVDGCHGGIPSFTFTEWKDSGFVSGGEYNSTNGCMSYPLPR 189

Query: 144 CVRECQENYDVP-------------YKKDLNFGAKSYSVSSN-EKSIMKEIYEHGPVEGA 189
           C   C+  YD P             Y++D ++  ++Y + S  E+ I  EI ++GPV  +
Sbjct: 190 CNPSCKTLYDAPTCKKECDKGSPLKYEEDKHYAKQAYRIMSKVERQIQLEIIKNGPVVAS 249

Query: 190 FTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKAL 249
           FTV+ D I Y SG +   G                                      K L
Sbjct: 250 FTVYADFIHYLSGVYKFDGE------------------------------------SKLL 273

Query: 250 GGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPK 307
           GGHA+RI+GWG  E     YWL++NSWN  WGD GLFKI RGK+ECGIE  ITAG+P+
Sbjct: 274 GGHAVRIIGWG-IENGTYPYWLVSNSWNERWGDQGLFKIWRGKNECGIEEEITAGLPR 330


>gi|414886870|tpg|DAA62884.1| TPA: cathepsin B-like cysteine proteinase 3 [Zea mays]
          Length = 347

 Score =  132 bits (333), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 95/315 (30%), Positives = 134/315 (42%), Gaps = 96/315 (30%)

Query: 46  LSNIPRAHLKSWMGVHPDYNLPANRLPEL-IGYSEVDEDLPANFDSRTKWPNCPTIREIR 104
            SN   A  K  +GV P    P N L  + +       +LP  FD+R+ W  C TI  I 
Sbjct: 57  FSNYTIAQFKHILGVKP---APQNALSNVPVKTYSRSLELPKEFDARSAWSRCSTIGNIL 113

Query: 105 DQGSCGSCWGCRPYEIAP---CEH-------HVNGTRPSC-----DASKGHTP------- 142
           +QG CGSCW     E      C H        VN     C     D   G  P       
Sbjct: 114 EQGHCGSCWAFGAVECLQDRFCIHLNMSILLSVNDLLACCGFMCGDGCDGGYPIEAWRYF 173

Query: 143 -------------------------------KCVRECQENYDVPYKKDLNFGAKSYSVSS 171
                                          KC ++C+E   V +++  +F   +Y ++S
Sbjct: 174 VQNGVVTDECDPYFDPVGCKHPGCEPAYPTPKCEKKCKEQNQV-WQEKKHFSIDAYRINS 232

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           +   IM E+Y++GPVE AFTV++D   YKSG +                           
Sbjct: 233 DPHDIMAEVYKNGPVEVAFTVYEDFAHYKSGVY--------------------------- 265

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
                      + +G  +GGHA++++GWG  + + E YWL+AN WN  WGD+G FKI+RG
Sbjct: 266 ----------KHITGGIMGGHAVKLIGWGTSD-AGEDYWLLANQWNRGWGDDGYFKIIRG 314

Query: 292 KDECGIESSITAGVP 306
           K+ECGIE  + AG+P
Sbjct: 315 KNECGIEEGVVAGMP 329



 Score = 38.5 bits (88), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 14/25 (56%), Positives = 22/25 (88%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVS 32
           +CG GC+GG+P  AWRY+V++G+V+
Sbjct: 156 MCGDGCDGGYPIEAWRYFVQNGVVT 180


>gi|609175|emb|CAA57522.1| cathepsin B-like cysteine proteinase [Nicotiana rustica]
          Length = 356

 Score =  132 bits (333), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 99/327 (30%), Positives = 134/327 (40%), Gaps = 102/327 (31%)

Query: 37  GSKQAEKNSLSNIPRAHLKSWMGVHPDYN-----LPANRLPELIGYSEVDEDLPANFDSR 91
           G K A     SN   +  K  +GV P        +P    P+L+       +LP  FD+R
Sbjct: 55  GWKAALNPRFSNFTVSQFKRLLGVKPTRKGDLKGIPILTHPKLL-------ELPQEFDAR 107

Query: 92  TKWPNCPTIREIRDQGSCGSCWGCRPYEIAP---CEHH-------VNGTRPSC-----DA 136
             W NC TI  I DQG CGSCW     E      C H+        N     C     D 
Sbjct: 108 VAWSNCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHYGLNISLSANDLYACCGFLCGDG 167

Query: 137 SKGHTP-----KCVRE--------------------CQENYDVP------------YKKD 159
             G  P       VR+                    C+  Y  P            + + 
Sbjct: 168 CDGGYPLQAWKYFVRKGVVTDECDPYFDNEGCSHPGCEPAYPTPKCHRKCVKQNLLWSRS 227

Query: 160 LNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKW 219
            +FG  +Y +SS+  SIM E+Y++GPVE +FTV++D   YKSG +               
Sbjct: 228 KHFGVNAYMISSDPHSIMTEVYKNGPVEVSFTVYEDFAHYKSGVY--------------- 272

Query: 220 TIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
                                  + +G  +GGHA++++GWG  E   E YWL+AN WN  
Sbjct: 273 ----------------------KHVTGDIMGGHAVKLIGWGTSEDG-EDYWLLANQWNRG 309

Query: 280 WGDNGLFKILRGKDECGIESSITAGVP 306
           WGD+G FKI RG +EC IE  + AG+P
Sbjct: 310 WGDDGYFKIRRGTNECEIEDEVVAGLP 336



 Score = 39.3 bits (90), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 14/25 (56%), Positives = 21/25 (84%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVS 32
           LCG GC+GG+P  AW+Y+V+ G+V+
Sbjct: 163 LCGDGCDGGYPLQAWKYFVRKGVVT 187


>gi|91078964|ref|XP_974298.1| PREDICTED: similar to putative cathepsin B-like like proteinase
           [Tribolium castaneum]
 gi|270004838|gb|EFA01286.1| cathepsin B precursor [Tribolium castaneum]
          Length = 335

 Score =  132 bits (333), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 78/202 (38%), Positives = 99/202 (49%), Gaps = 55/202 (27%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+ Y + PCEHH  G  P+C      TP+C +EC    D+ YK DL  G+ +Y  SS+E
Sbjct: 182 GCKAYTVPPCEHHTEGDLPAC-GDIVPTPQCKKECDAGVDIEYKSDLRKGS-AYQTSSDE 239

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
             I  EI  +GPVE                                              
Sbjct: 240 SQIQTEIMTNGPVEAD-------------------------------------------- 255

Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
           F V++D + YKSG          GGHAI+ILGWG ++ +   YWL ANSWN DWGD G F
Sbjct: 256 FDVYEDFLNYKSGVYQQTTGNYAGGHAIKILGWGVEDGTP--YWLAANSWNEDWGDKGYF 313

Query: 287 KILRGKDECGIESSITAGVPKL 308
           KILRG++ECGIES I  G+P +
Sbjct: 314 KILRGQNECGIESDIIGGIPVV 335



 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 26/60 (43%), Positives = 35/60 (58%), Gaps = 8/60 (13%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPA 68
           CG GCNGG+P  AW YW ++GIV+GG Y +K   K + +  P  H       H + +LPA
Sbjct: 150 CGDGCNGGWPAEAWAYWAETGIVTGGKYETKDGCK-AYTVPPCEH-------HTEGDLPA 201


>gi|44965462|gb|AAS49538.1| cathepsin B [Protopterus dolloi]
          Length = 225

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 71/162 (43%), Positives = 92/162 (56%), Gaps = 37/162 (22%)

Query: 99  TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
           T + +   G  GS  GCRPY I PCEHHVNG+RPSC    G TPKCV++C   Y   Y+K
Sbjct: 96  TEKGLVSGGLYGSGIGCRPYTIPPCEHHVNGSRPSCSGEGGDTPKCVQKCDSGYTPAYEK 155

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
           D  +G  +YSV S+ +SIM+EIY+ GPVEGAFTV++D +LYKSG +              
Sbjct: 156 DKIYGQSAYSVPSSPESIMEEIYKDGPVEGAFTVYEDFLLYKSGVY-------------- 201

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWG 260
                                   + +G+A+GGHAI+ILGWG
Sbjct: 202 -----------------------QHHTGEAVGGHAIKILGWG 220



 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 20/30 (66%), Positives = 24/30 (80%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGG+P  AW+YW + G+VSGG YGS
Sbjct: 79  CGMGCNGGYPSGAWQYWTEKGLVSGGLYGS 108


>gi|357116869|ref|XP_003560199.1| PREDICTED: cathepsin B-like [Brachypodium distachyon]
          Length = 350

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 98/322 (30%), Positives = 136/322 (42%), Gaps = 92/322 (28%)

Query: 37  GSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
           G    + +  +N   A  K  +GV P        +P    YS    DLP  FD+R+KW  
Sbjct: 53  GWTAGQNSYFANYTIAQFKHILGVKPTPPGLLRGVPTKT-YSR-STDLPKEFDARSKWSG 110

Query: 97  CPTIREIRDQGSCGSCWGCRPYEIAP---CEH-------HVNGTRPSC-----DASKGHT 141
           C TI  I DQG CGSCW     E      C H        VN     C     D   G  
Sbjct: 111 CSTIGTILDQGHCGSCWAFGAVECLQDRFCIHLNMNISLSVNDLVACCGFMCGDGCDGGY 170

Query: 142 PKCVRE-------------------------CQENYDVP------------YKKDLNFGA 164
           P    +                         C+  Y  P            +++  +F  
Sbjct: 171 PISAWQYLVENGVVTDECDPYFDQVGCKHPGCEPAYPTPACEKKCKVQNQVWQEKKHFSI 230

Query: 165 KSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDN 224
            +Y V+S+   IM E+Y++GPVE AFTV++D   YKSG                      
Sbjct: 231 NAYRVNSDPHDIMAEVYKNGPVEVAFTVYEDFAHYKSG---------------------- 268

Query: 225 TSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNG 284
                      V++ +    +G+ +GGHA++++GWG     K+ YWL+AN WN  WGD+G
Sbjct: 269 -----------VYEHI----TGEMMGGHAVKLIGWGTSADGKD-YWLLANQWNRGWGDDG 312

Query: 285 LFKILRGKDECGIESSITAGVP 306
            FKI+RGK+ECGIE  + AG+P
Sbjct: 313 YFKIIRGKNECGIEEDVVAGMP 334


>gi|56754337|gb|AAW25356.1| SJCHGC00056 protein [Schistosoma japonicum]
          Length = 342

 Score =  132 bits (331), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 72/199 (36%), Positives = 100/199 (50%), Gaps = 53/199 (26%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY    CEH   G  P+C      TP+C + CQ+ Y  P+++D  FG  S +V +NE
Sbjct: 187 GCQPYPFPKCEHLTKGKYPACGTKIYKTPQCKQTCQKGYKTPFEQDKPFGEGSSNVQNNE 246

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K   ++I  +GPVE A                                            
Sbjct: 247 KVFQRDIMMYGPVEAA-------------------------------------------- 262

Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
           F V++D +  KSG         +GGH IRI+GWG ++ +   YWLIANSWN DWG+NGLF
Sbjct: 263 FDVYEDFLNSKSGISRHVTGSIVGGHPIRIIGWGVEKGNP--YWLIANSWNEDWGENGLF 320

Query: 287 KILRGKDECGIESSITAGV 305
           +++RG+DEC IES + AG+
Sbjct: 321 RMVRGRDECSIESHVVAGL 339



 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 22/52 (42%), Positives = 33/52 (63%), Gaps = 1/52 (1%)

Query: 63  DYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           D  +   R P  + +  ++ ++P+ FDSR KWP+C +I +IRDQ  CGSCW 
Sbjct: 70  DAEMKRKRRP-TVDHHNLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWA 120


>gi|21699|emb|CAA46811.1| cathepsin B [Triticum aestivum]
          Length = 353

 Score =  131 bits (330), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 95/316 (30%), Positives = 131/316 (41%), Gaps = 97/316 (30%)

Query: 46  LSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRD 105
            +N      K  +GV P    P   L  +      + DLP  FD+RT+W +C TI  I D
Sbjct: 62  FANYTIEQFKHILGVKP---TPPGLLAGVPIKIHPEMDLPKEFDARTQWSSCSTIGNILD 118

Query: 106 QGSCGSCWGCRPYEIAP-------------------------CEHHVNGTRP-------- 132
           QG CG+CW     E                            C    NG  P        
Sbjct: 119 QGHCGACWAFAAVEALQDRFCIHLNMSVSLSVNDLLACCGFLCGSGCNGGYPISAWRYFR 178

Query: 133 -------SCDASKGHT-------------PKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
                   CD     T             PKC R+C+   +  +K++ +F   +Y V SN
Sbjct: 179 RSGVVTEECDPYFDQTGCQHPGCEPAYPTPKCQRKCKVE-NQAWKENKHFSVNAYRVHSN 237

Query: 173 EKSIMKEIYEHGPVEGAFTVFD--DLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA 230
              IM E+Y++GPVE AFT     D   YKSG +                          
Sbjct: 238 PHDIMAEVYKNGPVEVAFTYCQILDFAHYKSGVY-------------------------- 271

Query: 231 EGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILR 290
                       + +G  +GGHA++++GWG  + + E YWL+AN WN  WGD+G FKI+R
Sbjct: 272 -----------KHITGGVMGGHAVKLIGWGTSD-AGEDYWLLANQWNRGWGDDGYFKIIR 319

Query: 291 GKDECGIESSITAGVP 306
           G++ECGIE  +TAG+P
Sbjct: 320 GENECGIEGDVTAGMP 335



 Score = 40.8 bits (94), Expect = 0.87,   Method: Compositional matrix adjust.
 Identities = 16/25 (64%), Positives = 21/25 (84%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVS 32
           LCG GCNGG+P  AWRY+ +SG+V+
Sbjct: 160 LCGSGCNGGYPISAWRYFRRSGVVT 184


>gi|340501578|gb|EGR28345.1| hypothetical protein IMG5_177790 [Ichthyophthirius multifiliis]
          Length = 356

 Score =  131 bits (329), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 79/205 (38%), Positives = 104/205 (50%), Gaps = 61/205 (29%)

Query: 115 CRPYEIAPCEHHVNGTR-PSCDASKGH--TPKCVRECQENYDV--PYKKDLNFGAKSYSV 169
           C+ Y   PC HHV  T+ P C   KG   TP+C ++C ++  V  PY +DL  G KSYSV
Sbjct: 194 CQAYSFPPCAHHVASTKYPPC---KGEVPTPECKKKCDDDSKVKRPYNEDLYKGQKSYSV 250

Query: 170 SSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLG 229
           SS+ K+IM EI  +GPVE A                                        
Sbjct: 251 SSDPKAIMTEIMNNGPVEVA---------------------------------------- 270

Query: 230 AEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
               FTV++D + YKSG       + LGGHA++++GWG +  +   YWLI NSWN  WGD
Sbjct: 271 ----FTVYEDFVTYKSGVYQHVTGEQLGGHAVKMIGWGVENDTP--YWLIVNSWNETWGD 324

Query: 283 NGLFKILRGKDECGIESSITAGVPK 307
            G FKILRG +ECGIE  +   +P+
Sbjct: 325 QGTFKILRGSNECGIEDEVVTALPQ 349



 Score = 41.6 bits (96), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 16/29 (55%), Positives = 23/29 (79%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYG 37
           CG GCNGG+P  A +Y+VK+G+V+G  +G
Sbjct: 161 CGDGCNGGYPEAAMQYFVKTGLVTGDLFG 189


>gi|335347291|gb|AEH42093.1| cysteine proteinase 6 [Haemonchus contortus]
          Length = 346

 Score =  131 bits (329), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 71/198 (35%), Positives = 100/198 (50%), Gaps = 39/198 (19%)

Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
           G  GS  GCRPY   PC HH N T       +  TP+CV++CQ+ Y   Y++D  +G   
Sbjct: 184 GDYGSKTGCRPYPFHPCGHHGNETYYGECPKEESTPECVKQCQKGYKNSYRRDKTWGEDY 243

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           Y V ++ K+I +EI   GPV  +FTV+DD   Y  G                        
Sbjct: 244 YEVENSVKAIQREIMRSGPVVSSFTVYDDFSYYVKG------------------------ 279

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
                        +  + +GKA G HAI+I+GWG ++     YW+IANSW+ DWG+ G F
Sbjct: 280 -------------IYKHTAGKARGSHAIKIIGWGTEKNV--PYWIIANSWHNDWGEKGFF 324

Query: 287 KILRGKDECGIESSITAG 304
           +++RG + CGIE  + AG
Sbjct: 325 RMVRGTNHCGIEEDVVAG 342



 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 25/46 (54%), Positives = 34/46 (73%)

Query: 69  NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           NR P +   S+  +D+P +FD+RTKWPNC +I+ IRDQ +CGSCW 
Sbjct: 79  NRKPVVEDASDKGDDIPESFDARTKWPNCTSIKHIRDQANCGSCWA 124



 Score = 42.4 bits (98), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 18/31 (58%), Positives = 23/31 (74%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
           CGFGC GG+P  A+ Y+   G+V+GG YGSK
Sbjct: 159 CGFGCEGGWPIDAFEYYSYQGVVTGGDYGSK 189


>gi|166030316|gb|ABY78825.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score =  130 bits (328), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 103/299 (34%), Positives = 128/299 (42%), Gaps = 42/299 (14%)

Query: 39  KQAEKNSLSNIPRAHLKSWMGV--HPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
           K      + NI  A  +   G       +LP  R  E     ++  +LP +FDS  KWPN
Sbjct: 47  KAVYNGKMQNITFAEARRLTGAFRRKTSSLPPVRFTE----EQLRTELPESFDSAEKWPN 102

Query: 97  CPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPY 156
           CPTIREI DQ +CGSCW           H   G       S  H   C ++C +  D  Y
Sbjct: 103 CPTIREIADQSACGSCWAVSTASAISDRHCTVGGVQQLRISAAHLLSCCKDCGDGCDGGY 162

Query: 157 KKDLNFGAKSYSVSSNEKSIMKEIYE--HGPVEGAFTVFDDLILYKSGRFFVPGNETT-- 212
                  A  Y VS    S   + Y   H    G          Y    F  P   TT  
Sbjct: 163 PD----AAWRYYVSHGLASSYCQPYPFPHCGHHGGKGKKPPCSKYD---FHTPKCNTTCT 215

Query: 213 --AMSLIKWTIRDNTSQLGAEG--------------AFTVFDDLILYK-------SGKAL 249
             A+ LI++   D+   L  E               AF VF D + YK       SG  L
Sbjct: 216 DKAIPLIEYRGNDSYVLLHGEDDFKRELYFNGPFVVAFQVFSDFLAYKTGVYRHVSGDFL 275

Query: 250 GGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           GGHA+RI+GWG+   +   YW IANSW+TDWG NG F  LRG +ECGIE    AG+P +
Sbjct: 276 GGHAVRIVGWGKLNGTP--YWKIANSWDTDWGMNGHFLFLRGNNECGIEFEGYAGLPAI 332


>gi|86451908|gb|ABC97349.1| cathepsin B [Streblomastix strix]
          Length = 312

 Score =  130 bits (327), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 92/278 (33%), Positives = 115/278 (41%), Gaps = 93/278 (33%)

Query: 83  DLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAP---------------CEHHV 127
           +LP  FDSRT WPNC  I +I DQG CGSCW    +E+                    H+
Sbjct: 75  NLPDEFDSRTNWPNCQLIGKIYDQGHCGSCWAMSSFEVLQDRFCIKSEGKQTPELSPQHL 134

Query: 128 NGTRPSCDASKG------------------------------------HTPKCVR-ECQE 150
               P C    G                                     TPKC + +C  
Sbjct: 135 TSCTPGCSGCNGGWMSTAFGFMQSNGILGEDCIPYQMGKCKHPGCSTWPTPKCNKTKCYP 194

Query: 151 NYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNE 210
           N       +L   A SYSV SNE  I KEIYE+GPV  +F V++DL +Y+SG +      
Sbjct: 195 NDT--KSTELWHAASSYSVRSNEADIQKEIYENGPVTASFAVYEDLSVYQSGVY------ 246

Query: 211 TTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYW 270
                                           + +G   G HAI+++GWG  +    KYW
Sbjct: 247 -------------------------------QHVTGGFEGLHAIKVVGWGILDGV--KYW 273

Query: 271 LIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
            I NSW  DWG +GL  I RG DECGIES + AG PKL
Sbjct: 274 TIVNSWAEDWGFDGLLLIRRGVDECGIESDVVAGQPKL 311


>gi|224285256|gb|ACN40354.1| unknown [Picea sitchensis]
          Length = 350

 Score =  130 bits (327), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 93/324 (28%), Positives = 134/324 (41%), Gaps = 97/324 (29%)

Query: 37  GSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
           G K A     SN         +GV P        +P  +    +   LP+ FD+R  WP+
Sbjct: 50  GWKAAMSTRFSNYTVREFAHLLGVLPTPQKLLETVPVRVYPKGLK--LPSKFDARKAWPH 107

Query: 97  CPTIREIRDQGSCGSCWGCRPYEIAP---CEH-HVNGT---------------------- 130
           C + R I DQG CGSCW     E      C H  VN T                      
Sbjct: 108 CTSTRSILDQGHCGSCWAFAAVEALSDRFCIHFQVNATLSENDLVACCGFRCGSGCNGGF 167

Query: 131 ----------------------------RPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
                                        P C+ S   TP+CV+ C++N    + K  ++
Sbjct: 168 PLSAWRYFSRRGVVTDECDPYFDNDGCNHPGCEPSYP-TPRCVKNCKDNQRWSHSK--HY 224

Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
            A +Y + S+  +IM E++ +GPVE +F+V++D   Y++G                    
Sbjct: 225 SANAYRIKSDPYNIMAEVFNNGPVEVSFSVYEDFAHYETG-------------------- 264

Query: 223 DNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
                            +  +  G+ LGGHA++++GWG  +   + YWLIANSWNT WG+
Sbjct: 265 -----------------VYKHVQGRYLGGHAVKLIGWGTTDDGID-YWLIANSWNTAWGE 306

Query: 283 NGLFKILRGKDECGIESSITAGVP 306
            G FKI RG +ECGIE    AG+P
Sbjct: 307 GGYFKIARGVNECGIERDPVAGMP 330



 Score = 39.3 bits (90), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 15/24 (62%), Positives = 19/24 (79%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVS 32
           CG GCNGGFP  AWRY+ + G+V+
Sbjct: 159 CGSGCNGGFPLSAWRYFSRRGVVT 182


>gi|159177|gb|AAA29177.1| cysteine proteinase [Haemonchus contortus]
          Length = 342

 Score =  130 bits (326), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 69/195 (35%), Positives = 99/195 (50%), Gaps = 37/195 (18%)

Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEK 174
           CRPY I PC HH N T       +  TP C ++CQ  Y   ++ D   G  +Y V   E+
Sbjct: 183 CRPYPIHPCGHHGNDTYYGECPREAATPPCKKKCQPGYKKIFRMDKRQGKVAYGVEPKEE 242

Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
           +I +EI  HGPV  +F V++D                   SL K  +  +T+        
Sbjct: 243 AIQREILRHGPVVASFAVYEDF------------------SLYKTGVYKHTA-------- 276

Query: 235 TVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE 294
                      G   G HA++++GWG D K+K KYWLIANSW+ DWG+NG F+ +RG ++
Sbjct: 277 -----------GALRGYHAVKMMGWGVDSKTKAKYWLIANSWHNDWGENGYFRFIRGIND 325

Query: 295 CGIESSITAGVPKLD 309
           C IE ++ AG+  +D
Sbjct: 326 CEIEDTVAAGIVDVD 340



 Score = 41.6 bits (96), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 18/31 (58%), Positives = 22/31 (70%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
           CGFGC GG+   AW Y+V  G+VSGG Y +K
Sbjct: 150 CGFGCGGGWSIRAWEYFVYEGVVSGGEYLTK 180



 Score = 37.4 bits (85), Expect = 7.7,   Method: Compositional matrix adjust.
 Identities = 18/36 (50%), Positives = 23/36 (63%), Gaps = 2/36 (5%)

Query: 79  EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           E +ED+P  +D R K+  C T   IRDQ +CGSCW 
Sbjct: 81  EPNEDIPEEYDPREKF-KCSTFY-IRDQANCGSCWA 114


>gi|268560898|ref|XP_002638183.1| Hypothetical protein CBG22612 [Caenorhabditis briggsae]
          Length = 721

 Score =  130 bits (326), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 92/300 (30%), Positives = 129/300 (43%), Gaps = 104/300 (34%)

Query: 67  PANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG------------ 114
           P N LP  +  +      P +FD+R  WPNC +I+ IRDQ  CGSCW             
Sbjct: 69  PNNSLPGSLSRA------PTSFDARDYWPNCKSIKMIRDQAYCGSCWAFGAAEVISDRIC 122

Query: 115 ----------CRPYEIAPCEHHVNGTR--------------------------------P 132
                       P +I  C  + +G +                                 
Sbjct: 123 IQSNGTDQPIISPEDILTCCTNSHGCQGGFVLEAMKFWKSKGVVTGGDFQGDGCIPYSYG 182

Query: 133 SC-DASKGHT-PKCVRECQENYDV-PYKKDLNFGAKSYSVSSNE--KSIMKEIYEHGPVE 187
           SC D     T PKC  ECQ  Y    YK+D  +G+ +Y +S++   ++I  EI  +GPVE
Sbjct: 183 SCSDCHTAQTTPKCKNECQVKYTKNEYKEDKYYGSSAYRLSTSNAVRTIQSEILRNGPVE 242

Query: 188 GAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGK 247
             + V++D   YKSG +                                      Y SG+
Sbjct: 243 ATYQVYEDFYYYKSGVY-------------------------------------EYISGR 265

Query: 248 ALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPK 307
            +GGHA++I+GWG +E     YWLIANSW T +G+NG FK+ RG +ECGIE+ + AG+ K
Sbjct: 266 HMGGHAVKIIGWGVEENV--NYWLIANSWGTGFGENGFFKMRRGNNECGIENYVVAGMAK 323


>gi|170030062|ref|XP_001842909.1| cathepsin B-like thiol protease [Culex quinquefasciatus]
 gi|167865915|gb|EDS29298.1| cathepsin B-like thiol protease [Culex quinquefasciatus]
          Length = 288

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 89/295 (30%), Positives = 124/295 (42%), Gaps = 96/295 (32%)

Query: 69  NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEH--H 126
           N LP L     V   LPA+FD+R KWP CP++ +IR QGSCGSC+      +    +  H
Sbjct: 33  NNLPRLQNQRSV-RALPASFDARQKWPYCPSLNQIRSQGSCGSCYAVSTAAVITDRYCIH 91

Query: 127 VNGTRP----------------SCDASKGHTP---------------------------- 142
             G R                  CD    H                              
Sbjct: 92  SGGERQFYFGSTGYLSCCTDCYKCDGGYVHKTFDYWVKYGLTSGGPYHSGQGCKPYPFGG 151

Query: 143 ---------KCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK-EIYEHGPVEGAFTV 192
                    KC R+CQ  Y + Y +DL  GA SY +   +++ MK EIY++GP+  +F V
Sbjct: 152 ATQDVNIVLKCDRQCQAGYPLTYSQDLKHGASSYILPWGDENAMKAEIYQNGPIVTSFDV 211

Query: 193 FDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGH 252
           + D   Y+SG +                                      + +G   G H
Sbjct: 212 YGDFFQYRSGVY-------------------------------------RHVTGAYKGSH 234

Query: 253 AIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPK 307
           A+R++GWG +  +  KYWL ANSWN  WG+NG FKI+RG++  G+E    AG+PK
Sbjct: 235 AVRVIGWGVE--NGVKYWLCANSWNERWGENGFFKIVRGENHVGVEDISYAGLPK 287


>gi|358341561|dbj|GAA37330.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 347

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 72/194 (37%), Positives = 94/194 (48%), Gaps = 40/194 (20%)

Query: 115 CRPYEIAPCEHH-VNGTR-PSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
           C PY   PC HH   G+  P C      TP+CV ECQ+ Y   Y+ D    + SY++  +
Sbjct: 185 CLPYPFPPCRHHGAKGSEYPPCPEKMYSTPQCVSECQKGYATKYEDDKIRASTSYNLYRS 244

Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
             +I KEI+  GPVE    V+ D   Y  G +                            
Sbjct: 245 VTTIQKEIWMRGPVEATMNVYTDFANYAGGVY---------------------------- 276

Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
                     + +G+ LGGHAIR+LGWG +E     YWL ANSWN  WG+ G F+ILRG 
Sbjct: 277 ---------KHTTGELLGGHAIRLLGWGVEEDGT-PYWLAANSWNPSWGEKGFFRILRGS 326

Query: 293 DECGIESSITAGVP 306
           D CGIES ++AG+P
Sbjct: 327 DHCGIESDVSAGLP 340



 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 26/62 (41%), Positives = 38/62 (61%), Gaps = 2/62 (3%)

Query: 54  LKSWMG-VHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSC 112
           ++S +G +  D N+   R P  I + ++  +LP+ FD+R  WP C TI +IRDQ  CGSC
Sbjct: 56  IRSVLGTMREDQNVKEFRRP-TISHEDITLELPSEFDAREHWPECRTIPQIRDQSGCGSC 114

Query: 113 WG 114
           W 
Sbjct: 115 WA 116


>gi|144952804|gb|ABP04056.1| cathepsin B-4 [Clonorchis sinensis]
          Length = 347

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 72/194 (37%), Positives = 94/194 (48%), Gaps = 40/194 (20%)

Query: 115 CRPYEIAPCEHH-VNGTR-PSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
           C PY   PC HH   G+  P C      TP+CV ECQ+ Y   Y+ D    + SY++  +
Sbjct: 185 CLPYPFPPCRHHGAKGSEYPPCPEKMYSTPQCVSECQKGYATKYEDDKIRASTSYNLYRS 244

Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
             +I KEI+  GPVE    V+ D   Y  G +                            
Sbjct: 245 VTAIQKEIWMRGPVEATMNVYTDFANYAGGVY---------------------------- 276

Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
                     + +G+ LGGHAIR+LGWG +E     YWL ANSWN  WG+ G F+ILRG 
Sbjct: 277 ---------KHTTGELLGGHAIRLLGWGVEEDGT-PYWLAANSWNPSWGEKGFFRILRGS 326

Query: 293 DECGIESSITAGVP 306
           D CGIES ++AG+P
Sbjct: 327 DHCGIESDVSAGLP 340



 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 26/62 (41%), Positives = 38/62 (61%), Gaps = 2/62 (3%)

Query: 54  LKSWMG-VHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSC 112
           ++S +G +  D N+   R P  I + ++  +LP+ FD+R  WP C TI +IRDQ  CGSC
Sbjct: 56  IRSVLGTMREDQNVKEFRRP-TISHEDITLELPSEFDAREHWPECRTIPQIRDQSGCGSC 114

Query: 113 WG 114
           W 
Sbjct: 115 WA 116


>gi|341888694|gb|EGT44629.1| hypothetical protein CAEBREN_31940 [Caenorhabditis brenneri]
          Length = 374

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 73/196 (37%), Positives = 102/196 (52%), Gaps = 48/196 (24%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVP-YKKDLNFGAKSYSVSSN 172
           GC PY  APC+      + SC  ++G TP C   CQ +Y    Y KD +FG  +Y ++++
Sbjct: 194 GCMPYSFAPCK------KDSC--AQGTTPSCKTTCQSSYKTAEYTKDKHFGTTAYKITNS 245

Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
             +I  EIY +GPVE +F V++D   YKSG +                            
Sbjct: 246 VAAIQTEIYHNGPVEASFKVYEDFYKYKSGVY---------------------------- 277

Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
                     Y SGK +GGHA++I+GWG +  +   YWLIANSW T +GD+G FK+ RG 
Sbjct: 278 ---------QYTSGKLVGGHAVKIIGWGTE--NGVDYWLIANSWGTTFGDSGFFKMRRGT 326

Query: 293 DECGIESSITAGVPKL 308
           +E GIE ++ AG  KL
Sbjct: 327 NEVGIEGNVVAGTAKL 342


>gi|193603738|ref|XP_001943652.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
           [Acyrthosiphon pisum]
          Length = 337

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 75/215 (34%), Positives = 104/215 (48%), Gaps = 50/215 (23%)

Query: 101 REIRDQGSC-----GSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRE--CQENYD 153
           + I+  G C     GS  GC+PY I PC  + N    SC      TP+C ++     NY+
Sbjct: 166 KYIKKNGLCTGGEYGSNEGCQPYSIVPCPRNAN----SCSKENEDTPQCYKDQCTNNNYE 221

Query: 154 VPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTA 213
            P   DL +  K YSV    + IM E++++GPV  A  V+DD + YK G +         
Sbjct: 222 TPLVSDLYYAYKVYSVKPKPEIIMSEVFKNGPVVAAMKVYDDFLCYKGGIY--------- 272

Query: 214 MSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIA 273
                                        Y +G   G HA++I+GWGED+     YWL A
Sbjct: 273 ----------------------------QYTTGGLKGDHAVKIMGWGEDDGID--YWLCA 302

Query: 274 NSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           N+W   WG  G+FKI RG++ECGIE+ IT G+PK+
Sbjct: 303 NTWGNSWGMGGMFKIRRGRNECGIENRITGGLPKV 337



 Score = 47.4 bits (111), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 42/83 (50%), Gaps = 14/83 (16%)

Query: 46  LSNIPRAHLKSWMGVHPDY---------NLPANRLPE---LIGYS-EVD-EDLPANFDSR 91
           ++NIP+   K+ +  HP            +P N+L E   L+ Y   +D E LP ++D  
Sbjct: 34  VNNIPKHTWKAGINFHPSLLTNVSHLMGVVPWNKLSEKDILLTYDVSIDLESLPESYDIT 93

Query: 92  TKWPNCPTIREIRDQGSCGSCWG 114
             W  C ++  IRDQ +CGSCW 
Sbjct: 94  QTWSECKSVVSIRDQSNCGSCWA 116


>gi|21693|emb|CAA46810.1| cathepsin B [Triticum aestivum]
          Length = 305

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 95/322 (29%), Positives = 129/322 (40%), Gaps = 110/322 (34%)

Query: 46  LSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRD 105
           L+N      K  +GV P    P  R           E LP  FD+R+KW  C TI +I D
Sbjct: 21  LANYTIEQFKHMLGVKP--TPPGLRAAVRTKTHSRSEQLPKVFDARSKWSGCSTIGKILD 78

Query: 106 QGSCGSCWGCRPYEIAP---CEHH------------------------------------ 126
           QG CGSCW     E      C HH                                    
Sbjct: 79  QGHCGSCWAFGAVECLQDRFCIHHNMNITLSANDLVACCGFMCGDGCDGGYPISAWQYFV 138

Query: 127 ---------------VNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSS 171
                          V    P C+ +   TP C ++C+    V +++  +F   +Y V+S
Sbjct: 139 QNGVVTDECDPYFDQVGCKHPGCEPAYP-TPVCEKKCKVQNQV-WEEKKHFSINAYQVNS 196

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           +   IM E+Y +GPVE A                                          
Sbjct: 197 DPHDIMAEVYNNGPVEVA------------------------------------------ 214

Query: 232 GAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNG 284
             FTV++D   YKSG         +GGHA++++GWG  + + E YWL+AN WN  WGD+G
Sbjct: 215 --FTVYEDFAHYKSGVYKHITGGVMGGHAVKLIGWGTSD-AGEDYWLLANQWNRGWGDDG 271

Query: 285 LFKILRGKDECGIESSITAGVP 306
            FKI+RGK+ECGIE  +TAG+P
Sbjct: 272 YFKIIRGKNECGIEEDVTAGMP 293



 Score = 37.4 bits (85), Expect = 9.4,   Method: Compositional matrix adjust.
 Identities = 13/25 (52%), Positives = 22/25 (88%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVS 32
           +CG GC+GG+P  AW+Y+V++G+V+
Sbjct: 120 MCGDGCDGGYPISAWQYFVQNGVVT 144


>gi|189502866|gb|ACE06814.1| unknown [Schistosoma japonicum]
          Length = 121

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 70/157 (44%), Positives = 90/157 (57%), Gaps = 39/157 (24%)

Query: 152 YDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNET 211
           Y+V Y+ D  +G   Y V SN+++IMKE+ +HGPVE  F V+ D   YKSG +       
Sbjct: 2   YNVSYENDKWYGKVVYRVKSNQEAIMKELMQHGPVEVDFEVYADFPNYKSGVY------- 54

Query: 212 TAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWL 271
                                          + SG  LGGHA+R+LGWGE+  +   YWL
Sbjct: 55  ------------------------------QHVSGALLGGHAVRLLGWGEE--NNVPYWL 82

Query: 272 IANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           IANSWNTDWGDNG FKI+RGK+ECGIES + AG+PK+
Sbjct: 83  IANSWNTDWGDNGYFKIIRGKNECGIESDVNAGIPKI 119


>gi|149030260|gb|EDL85316.1| rCG52258, isoform CRA_c [Rattus norvegicus]
          Length = 130

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 73/167 (43%), Positives = 89/167 (53%), Gaps = 53/167 (31%)

Query: 148 CQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVP 207
           C+  Y   YK+D ++G  SYSVS +EK IM EIY++GPVE                    
Sbjct: 2   CEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVE-------------------- 41

Query: 208 GNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWG 260
                                   GAFTVF D + YKSG         +GGHAIRILGWG
Sbjct: 42  ------------------------GAFTVFSDFLTYKSGVYKHEAGDVMGGHAIRILGWG 77

Query: 261 EDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPK 307
            +  +   YWL+ANSWN DWGDNG FKILRG++ CGIES I AG+P+
Sbjct: 78  IE--NGVPYWLVANSWNVDWGDNGFFKILRGENHCGIESEIVAGIPR 122


>gi|132566367|gb|ABO34080.1| cathepsin B5 [Clonorchis sinensis]
          Length = 343

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 80/202 (39%), Positives = 98/202 (48%), Gaps = 54/202 (26%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCR Y    CEHHV G  P C      TP+CV++C +  DV Y +D      SY++ ++E
Sbjct: 183 GCRSYPFPKCEHHVQGHYPPCPRELYPTPECVQQC-DTPDVGYLEDKTRANMSYNIYASE 241

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            SIMKEI   GPVE                                              
Sbjct: 242 ISIMKEIMLRGPVEAI-------------------------------------------- 257

Query: 234 FTVFDDLILYKSG---KALG----GHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
           FT+++D + Y SG    ALG    GHA+RILGWGE       YWLIANSWN DWG+ G  
Sbjct: 258 FTMYEDFLRYSSGVYFHALGAPMSGHAVRILGWGE--LGNVPYWLIANSWNEDWGEEGYM 315

Query: 287 KILRGKDECGIESSITAGVPKL 308
           K LRG +ECGIE  +TAG+P L
Sbjct: 316 KFLRGYNECGIEDDVTAGLPYL 337



 Score = 45.4 bits (106), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 17/27 (62%), Positives = 21/27 (77%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           CGFGC GG+P +AW YW   GIV+GG+
Sbjct: 151 CGFGCRGGYPAVAWDYWKTHGIVTGGS 177


>gi|162813|gb|AAA30434.1| cathepsin B, partial [Bos taurus]
          Length = 122

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 73/162 (45%), Positives = 89/162 (54%), Gaps = 53/162 (32%)

Query: 152 YDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNET 211
           Y   YK+D +FG  SYSV++NEK IM EIY++GPVE                        
Sbjct: 2   YSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVE------------------------ 37

Query: 212 TAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEK 264
                               GAF+V+ D +LYKSG       + +GGHAIRILGWG +  
Sbjct: 38  --------------------GAFSVYSDFLLYKSGVYQHVSGEIMGGHAIRILGWGVENG 77

Query: 265 SKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           +   YWL+ NSWNTDWGDNG FKILRG+D CGIES I AG+P
Sbjct: 78  TP--YWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGMP 117


>gi|40643250|emb|CAC83720.1| cathepsin B [Hordeum vulgare subsp. vulgare]
 gi|326494236|dbj|BAJ90387.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326499864|dbj|BAJ90767.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 344

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 95/325 (29%), Positives = 133/325 (40%), Gaps = 112/325 (34%)

Query: 46  LSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEV-DEDLPANFDSRTKWPNCPTIREIR 104
           L+N      K  +GV P    P   L  +   +    E LP  FD+R+KW  C TI +I 
Sbjct: 60  LANYTIEQFKHMLGVKP---TPPGLLAGVRTKTHPRSEQLPKEFDARSKWSGCSTIGKIL 116

Query: 105 DQGSCGSCWGCRPYEIAP---CEHH----------------------------------- 126
           DQG CGSCW     E      C HH                                   
Sbjct: 117 DQGHCGSCWAFGAVECLQDRFCIHHNMNISLSANDLVACCGFMCGDGCDGGYPISAWQYF 176

Query: 127 ----------------VNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVS 170
                           V    P C+ +   TP C ++C+    V +++  +F   +Y V+
Sbjct: 177 VQNGVVTEECDPYFDQVGCKHPGCEPAYP-TPVCEKKCKVQNQV-WQEKKHFSIDAYQVN 234

Query: 171 SNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA 230
           S+   IM E+Y++GPVE A                                         
Sbjct: 235 SDPHDIMAEVYKNGPVEVA----------------------------------------- 253

Query: 231 EGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDN 283
              FTV++D   YKSG         +GGHA++++GWG  + + E YWL+AN WN  WGD+
Sbjct: 254 ---FTVYEDFAHYKSGVYKHITGGVMGGHAVKLIGWGTSD-AGEDYWLLANQWNRGWGDD 309

Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
           G FKI+RGK+ECGIE  +TAG+P +
Sbjct: 310 GYFKIIRGKNECGIEEDVTAGMPSM 334


>gi|157058765|gb|ABV03140.1| cathepsin B-348 [Aulacorthum solani]
          Length = 237

 Score =  128 bits (321), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 70/160 (43%), Positives = 91/160 (56%), Gaps = 38/160 (23%)

Query: 101 REIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDL 160
           + I   G  GS  GC PYE+APCEHHVNGTR  C    G TPKCV++C++ Y VPY +DL
Sbjct: 112 KGIVSGGPYGSNMGCIPYEVAPCEHHVNGTRGPCKEG-GKTPKCVKKCEDGYKVPYAQDL 170

Query: 161 NFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWT 220
           + G  +YS+S++   I +EIY +GPVEGAFTV++D I Y++G +                
Sbjct: 171 HHGKSAYSLSNDVDQIRQEIYTNGPVEGAFTVYEDFIAYRAGVY---------------- 214

Query: 221 IRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWG 260
                                 + +GKALGGHAIRILGWG
Sbjct: 215 ---------------------KHVAGKALGGHAIRILGWG 233



 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 24/30 (80%), Positives = 24/30 (80%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CGFGCNGGFPG AW YW   GIVSGG YGS
Sbjct: 93  CGFGCNGGFPGAAWNYWKTKGIVSGGPYGS 122


>gi|3087801|emb|CAA93277.1| cysteine proteinase [Haemonchus contortus]
          Length = 344

 Score =  128 bits (321), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 69/191 (36%), Positives = 99/191 (51%), Gaps = 40/191 (20%)

Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVS-SNE 173
           CRPY   PC HH N T        G TP+CVR+CQE Y+  Y +D   G  +Y +   + 
Sbjct: 189 CRPYPFHPCGHHGNETYYGECPEDGSTPECVRKCQEGYETEYHEDRVRGEDAYRLPIGSV 248

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K+I KEI  +GPV  AF VFDD   Y+ G                               
Sbjct: 249 KAIQKEIMRNGPVVAAFIVFDDFSFYRKG------------------------------- 277

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                 +  + +G   GGHA++I+GWG +      YW+IANSW++DWG++G F+++RG +
Sbjct: 278 ------IYAHVAGSPRGGHAVKIIGWGTEHGV--PYWIIANSWHSDWGEDGYFRMVRGIN 329

Query: 294 ECGIESSITAG 304
           +CGIE+++ AG
Sbjct: 330 DCGIETNVVAG 340


>gi|297843028|ref|XP_002889395.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335237|gb|EFH65654.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 360

 Score =  128 bits (321), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 96/331 (29%), Positives = 135/331 (40%), Gaps = 110/331 (33%)

Query: 37  GSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
           G K A  +  +N   A  K  +GV P        +P  I   ++   LP  FD+RT W  
Sbjct: 59  GWKAAFNDRFANATVAEFKRLLGVKPTPKTEFLGVP--IVSHDISLKLPKEFDARTAWSQ 116

Query: 97  CPTIREIRDQGSCGSCW-------------------------------------GCRP-Y 118
           C ++  I DQG CGSCW                                     GC   Y
Sbjct: 117 CTSVGRILDQGHCGSCWAFGAVESLSDRFCIKYNMNISLSVNDLLACCGFLCGQGCNGGY 176

Query: 119 EIAP-------------CEHHVNGT---RPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
            IA              C+ + + T    P C+ +   TPKC R+C     + +++  ++
Sbjct: 177 PIAAWRYFKHHGVVTEECDPYFDNTGCSHPGCEPAYP-TPKCARKCVSGNQL-WRESKHY 234

Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
           G  +Y V S+   IM E+Y++GPVE A                                 
Sbjct: 235 GVSAYKVRSHPDDIMAEVYKNGPVEVA--------------------------------- 261

Query: 223 DNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANS 275
                      FTV++D   YKSG         +GGHA++++GWG  +   E YWL+AN 
Sbjct: 262 -----------FTVYEDFAHYKSGVYKHITGTNIGGHAVKLIGWGTSDDG-EDYWLLANQ 309

Query: 276 WNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           WN  WGD+G FKI RG +ECGIE  + AG+P
Sbjct: 310 WNRSWGDDGYFKIRRGTNECGIEHGVVAGLP 340



 Score = 38.1 bits (87), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 15/25 (60%), Positives = 19/25 (76%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVS 32
           LCG GCNGG+P  AWRY+   G+V+
Sbjct: 167 LCGQGCNGGYPIAAWRYFKHHGVVT 191


>gi|339241013|ref|XP_003376432.1| Gut-specific cysteine proteinase [Trichinella spiralis]
 gi|316974853|gb|EFV58323.1| Gut-specific cysteine proteinase [Trichinella spiralis]
          Length = 551

 Score =  127 bits (320), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 87/283 (30%), Positives = 120/283 (42%), Gaps = 97/283 (34%)

Query: 83  DLPANFDSRTKWPNC-PTIREIRDQGSCGSCWGCRPYEI--------------------- 120
           + P  FDSR  WP C   I  I+DQ +CGSCW      +                     
Sbjct: 288 NYPVEFDSRKHWPQCEKVISFIKDQANCGSCWAVSSASVMSDRTCIATDGQFTTLLSDAE 347

Query: 121 -----APCEHHVNGTRP----------------------SC---------DASKGHTPKC 144
                  C +  NG  P                      +C         + S+  TPKC
Sbjct: 348 LLSCCTSCGYGCNGGYPQRTFKYWVYSGMPTGGPYGSNDTCKPYPIPPCSNCSETRTPKC 407

Query: 145 VRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRF 204
            + C   Y +   +D ++G+  Y     EKS+MK+I  +GP+    +V++D + YK    
Sbjct: 408 SKSCISTYPLSLNEDRHYGSTYYQFWLGEKSMMKDISLYGPIVAGMSVYEDFLHYK---- 463

Query: 205 FVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEK 264
                                     EG +T        +SG  LGGHA+RI+GWGE + 
Sbjct: 464 --------------------------EGVYT-------QESGIFLGGHAVRIIGWGEQDN 490

Query: 265 SKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPK 307
               YWL+ANSWNT +G++GLFKI RG DECGIES ++AG  K
Sbjct: 491 I--PYWLVANSWNTTFGEDGLFKIRRGFDECGIESYVSAGRAK 531



 Score = 47.8 bits (112), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 19/35 (54%), Positives = 25/35 (71%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK 43
           CG+GCNGG+P   ++YWV SG+ +GG YGS    K
Sbjct: 355 CGYGCNGGYPQRTFKYWVYSGMPTGGPYGSNDTCK 389


>gi|59895951|gb|AAX11351.1| cathepsin B-like cysteine protease [Oryza sativa Japonica Group]
 gi|125551767|gb|EAY97476.1| hypothetical protein OsI_19406 [Oryza sativa Indica Group]
 gi|215694023|dbj|BAG89222.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215712372|dbj|BAG94499.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215765382|dbj|BAG87079.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222631058|gb|EEE63190.1| hypothetical protein OsJ_17999 [Oryza sativa Japonica Group]
          Length = 358

 Score =  127 bits (320), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 96/322 (29%), Positives = 131/322 (40%), Gaps = 92/322 (28%)

Query: 37  GSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
           G   A     +N   A  K  +GV P  +   N +P  +        LP  FD+R+ W  
Sbjct: 57  GWTAARNPYFANYTTAQFKHILGVKPTPHSVLNDVP--VKTYPRSLMLPKEFDARSAWSQ 114

Query: 97  CPTIREIRDQGSCGSCWGCRPYEIAP---CEHH-------VNGTRPSC-----DASKGHT 141
           C TI  I DQG CGSCW     E      C H        VN     C     D   G  
Sbjct: 115 CNTIGTILDQGHCGSCWAFGAVECLQDRFCIHFNMNISLSVNDLVACCGFMCGDGCDGGY 174

Query: 142 PKC-----VRE--------------------CQENYDVP------------YKKDLNFGA 164
           P       VR                     C+  Y  P            + +  +F  
Sbjct: 175 PIMAWRYFVRNGVVTDECDPYFDQVGCKHPGCEPAYPTPVCEKKCKVQNQVWLEKKHFSV 234

Query: 165 KSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDN 224
            +Y V+S+   IM E+Y++GPVE AFTV++D   YKSG +                    
Sbjct: 235 NAYRVNSDPHDIMAEVYQNGPVEVAFTVYEDFAHYKSGVY-------------------- 274

Query: 225 TSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNG 284
                             + +G  +GGHA++++GWG  + + E YWL+AN WN  WGD+G
Sbjct: 275 -----------------KHITGGMMGGHAVKLIGWGTTD-AGEDYWLLANQWNRGWGDDG 316

Query: 285 LFKILRGKDECGIESSITAGVP 306
            FKI+RG +ECGIE  + AG+P
Sbjct: 317 YFKIIRGTNECGIEEDVVAGMP 338



 Score = 42.4 bits (98), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 15/25 (60%), Positives = 23/25 (92%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVS 32
           +CG GC+GG+P MAWRY+V++G+V+
Sbjct: 165 MCGDGCDGGYPIMAWRYFVRNGVVT 189


>gi|116779190|gb|ABK21175.1| unknown [Picea sitchensis]
 gi|148907952|gb|ABR17096.1| unknown [Picea sitchensis]
 gi|224284884|gb|ACN40172.1| unknown [Picea sitchensis]
          Length = 350

 Score =  127 bits (319), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 97/327 (29%), Positives = 129/327 (39%), Gaps = 103/327 (31%)

Query: 37  GSKQAEKNSLSNIPRAHLKSWMGVHPD-----YNLPANRLPELIGYSEVDEDLPANFDSR 91
           G K    +  SN      K  +GV P       N+P    P+ +       +LP  FD+R
Sbjct: 51  GWKAGMNSRFSNHTVGQFKRLLGVLPTPRNLLENVPVRTYPKGL-------NLPKQFDAR 103

Query: 92  TKWPNCPTIREIRDQGSCGSCWGCRPYEIAP---CEHH-VNGTRPSCDASKGHTPKC--- 144
             WP C ++R I DQG CGSCW     E      C H+ VN T    D       +C   
Sbjct: 104 KAWPQCTSVRTILDQGHCGSCWAFGAVEALSDRFCIHYKVNVTLSENDLVACCGFRCGDG 163

Query: 145 -------------------VRECQENYD------------------VPYKKDLN------ 161
                                EC   +D                  V   KD N      
Sbjct: 164 CDGGYPLSAWQYFISTGVVTAECDPYFDEAGCQHPGCEPLYPTPQCVKQCKDENQNWGNS 223

Query: 162 --FGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKW 219
             F A +Y ++S    IM E+Y  GPVE  F V++D   YKSG +               
Sbjct: 224 KRFSATAYRITSKPYDIMAEVYTKGPVEVDFLVYEDFAHYKSGVY--------------- 268

Query: 220 TIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
                                  Y +G  LGGHA++++GWG +  +   YWL+ANSWNT 
Sbjct: 269 ----------------------KYITGDFLGGHAVKLIGWGTENGT--DYWLVANSWNTA 304

Query: 280 WGDNGLFKILRGKDECGIESSITAGVP 306
           WG++G FKI RG +EC IE  + AG+P
Sbjct: 305 WGEDGYFKIARGSNECSIEEDVVAGMP 331


>gi|428174191|gb|EKX43088.1| hypothetical protein GUITHDRAFT_73372 [Guillardia theta CCMP2712]
          Length = 255

 Score =  127 bits (318), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 89/264 (33%), Positives = 121/264 (45%), Gaps = 40/264 (15%)

Query: 62  PDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIA 121
           PD++ P  ++P+             NFD+RT WP CP+I  IRDQ +CGSCW     E  
Sbjct: 6   PDFDYPNVKIPD-------------NFDARTNWPQCPSIAHIRDQSTCGSCWAFGAVEAM 52

Query: 122 PCEHHV--NGTRPSCDASKGHTPKCVRECQE--NYDVPYKKDLNFGAKSYSVSSNEKSIM 177
                +  NGT     +++     C+ +C    N   P      F     +  S    + 
Sbjct: 53  SDRLCIASNGTVKDELSAEDMLSCCLVQCGMGCNGGFPTGAWRFFKMHGLTTESKYPYVF 112

Query: 178 KEIYEH---------GPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQL 228
                H         GP +            K  R+      + + + I+  I  N    
Sbjct: 113 PPCEHHINKTHYKPCGPSQPTPKCVR--ASEKKPRYHGKSVYSVSPAKIQAEIMTNGP-- 168

Query: 229 GAEGAFTVFDDLILYK-------SGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWG 281
             E AFTV+ D + Y+       SG  LGGHAI+I+GWG +  +  KYWL+ANSWN DWG
Sbjct: 169 -VEAAFTVYQDFLAYQSGVYRHVSGPELGGHAIKIMGWGVE--AGNKYWLVANSWNEDWG 225

Query: 282 DNGLFKILRGKDECGIESSITAGV 305
           D G FKI RG DECGIESS+ AG+
Sbjct: 226 DKGTFKIARGDDECGIESSVVAGM 249


>gi|45822211|emb|CAE47502.1| cathepsin B-like proteinase [Diabrotica virgifera virgifera]
          Length = 331

 Score =  127 bits (318), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 79/213 (37%), Positives = 100/213 (46%), Gaps = 54/213 (25%)

Query: 103 IRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
           I   G  GS  GC+PY + PCEHH  G +  C      TP C  +C ++  + YK +L F
Sbjct: 166 ITTGGLYGSKQGCQPYSLQPCEHHTEGNKVQCSTLDYDTPSCKHKCDDS-ALNYKSELTF 224

Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
           G+ S     +  +I KEI  +GPVE A                                 
Sbjct: 225 GSGSVRNFYSVANIQKEILTNGPVEAA--------------------------------- 251

Query: 223 DNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANS 275
                      F V+ D + YKSG       + LGGHA+RILGWGE+  S   YWL+ANS
Sbjct: 252 -----------FDVYSDFVNYKSGVYQHVAGEYLGGHAVRILGWGEE--SGVPYWLVANS 298

Query: 276 WNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           WN DWGD GLFKI RG +E G E SI A   ++
Sbjct: 299 WNEDWGDKGLFKIRRGNNESGFEDSIVAAPAQV 331



 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 21/32 (65%), Positives = 26/32 (81%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CG+GC GG+P MAW YW+ +GI +GG YGSKQ
Sbjct: 145 CGYGCEGGYPTMAWSYWIDTGITTGGLYGSKQ 176


>gi|268566089|ref|XP_002647469.1| Hypothetical protein CBG06541 [Caenorhabditis briggsae]
          Length = 280

 Score =  127 bits (318), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 75/202 (37%), Positives = 97/202 (48%), Gaps = 61/202 (30%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY  APC  +       C   K  TP C   CQ  Y   Y KD  FG  +Y+V+ N 
Sbjct: 133 GCRPYPFAPCNSY------KCPEEK--TPTCSLSCQFGYSTAYAKDKRFGVSAYAVARNV 184

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            +I  EI  +GPV GA                                            
Sbjct: 185 AAIQTEIMTNGPVVGA-------------------------------------------- 200

Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
           FT+++D+  YKSG       + LGGHAI+I+GWG   ++   YWLIANSW  DWG+NG  
Sbjct: 201 FTMYEDMYKYKSGVYRHTAGRLLGGHAIKIIGWGT--QNGIPYWLIANSWGADWGENGFL 258

Query: 287 KILRGKDECGIESSITAGVPKL 308
           K+ RG +ECGIES++ AG+PK+
Sbjct: 259 KMRRGVNECGIESAVVAGMPKV 280



 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 26/64 (40%), Positives = 37/64 (57%), Gaps = 9/64 (14%)

Query: 230 AEGAFTVFDDLILYK-------SGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
            E +FTV++D  +YK       +G+ +G HAI+I+GWG +  +   YWLIANSW    G 
Sbjct: 6   VEASFTVYEDFYIYKKGVYQYTAGQVVGVHAIKIMGWGTEHGT--DYWLIANSWGAQCGS 63

Query: 283 NGLF 286
              F
Sbjct: 64  CWAF 67


>gi|18378947|ref|NP_563648.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|16226808|gb|AAL16267.1|AF428337_1 At1g02300/T6A9_10 [Arabidopsis thaliana]
 gi|14532526|gb|AAK63991.1| At1g02300/T6A9_10 [Arabidopsis thaliana]
 gi|25090140|gb|AAN72238.1| At1g02300/T6A9_10 [Arabidopsis thaliana]
 gi|332189292|gb|AEE27413.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
          Length = 362

 Score =  127 bits (318), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 95/331 (28%), Positives = 134/331 (40%), Gaps = 110/331 (33%)

Query: 37  GSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
           G K +  +  +N   A  K  +GV P        +P  I   ++   LP  FD+RT W  
Sbjct: 61  GWKASFNDRFANATVAEFKRLLGVKPTPKTEFLGVP--IVSHDISLKLPKEFDARTAWSQ 118

Query: 97  CPTIREIRDQGSCGSCWG-----------CRPYE----------IAPC------------ 123
           C +I  I DQG CGSCW            C  Y           +A C            
Sbjct: 119 CTSIGRILDQGHCGSCWAFGAVESLSDRFCIKYNMNVSLSVNDLLACCGFLCGQGCNGGY 178

Query: 124 --------EHH-------------VNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
                   +HH                + P C+ +   TPKC R+C     + +++  ++
Sbjct: 179 PIAAWRYFKHHGVVTEECDPYFDNTGCSHPGCEPAYP-TPKCARKCVSGNQL-WRESKHY 236

Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
           G  +Y V S+   IM E+Y++GPVE A                                 
Sbjct: 237 GVSAYKVRSHPDDIMAEVYKNGPVEVA--------------------------------- 263

Query: 223 DNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANS 275
                      FTV++D   YKSG         +GGHA++++GWG  +   E YWL+AN 
Sbjct: 264 -----------FTVYEDFAHYKSGVYKHITGTNIGGHAVKLIGWGTSDDG-EDYWLLANQ 311

Query: 276 WNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           WN  WGD+G FKI RG +ECGIE  + AG+P
Sbjct: 312 WNRSWGDDGYFKIRRGTNECGIEHGVVAGLP 342



 Score = 38.1 bits (87), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 15/25 (60%), Positives = 19/25 (76%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVS 32
           LCG GCNGG+P  AWRY+   G+V+
Sbjct: 169 LCGQGCNGGYPIAAWRYFKHHGVVT 193


>gi|48762485|dbj|BAD23812.1| cathepsin B-N1 [Tuberaphis styraci]
          Length = 340

 Score =  127 bits (318), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 103/363 (28%), Positives = 149/363 (41%), Gaps = 133/363 (36%)

Query: 36  YGSKQA---EKNSLSNIPRAHLKSW-MGVHPDYNLPANRLPELIG--------------- 76
           Y ++QA   EK+ ++ I  A+ K+W  GV+ D  L  +   +L+G               
Sbjct: 16  YRTEQAYFLEKDYINQI-NANAKTWKAGVNFDPKLSIDSFVKLLGSKGVQAAKQASPDMF 74

Query: 77  ------YSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG---------------- 114
                 Y+     +P++FD+R KW  C TI E+RDQG CGSCW                 
Sbjct: 75  KTHDEAYNSWSNRIPSSFDARKKWRKCSTIGEVRDQGKCGSCWAFGTSSAFADRLCIATD 134

Query: 115 ------CRPYEIAPCEH--------------------HVNGTRPSCDASKGHTP------ 142
                   P E+A C H                    H   T  + D+ +G  P      
Sbjct: 135 GEFNELLSPEELAFCCHKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPC 194

Query: 143 -------------------KCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEH 183
                              +C R C  N D+ +K+D ++   +Y ++    +I  +I  +
Sbjct: 195 PLDEYGNNTCRGKPAEKNHRCTRMCYGNQDLDFKEDHHYTRDAYYLTYG--TIQNDILAY 252

Query: 184 GPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILY 243
           GP+E +F V+DD   YKSG +    N T                                
Sbjct: 253 GPIEASFEVYDDFPSYKSGVYTKMENATY------------------------------- 281

Query: 244 KSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITA 303
                LGGHA++++GWGE+      YWL+ NSWN  WGD GLFKI RG +ECGI++S T 
Sbjct: 282 -----LGGHAVKLIGWGEEYGV--PYWLLVNSWNDQWGDQGLFKIRRGTNECGIDNSTTG 334

Query: 304 GVP 306
           GVP
Sbjct: 335 GVP 337



 Score = 40.8 bits (94), Expect = 0.87,   Method: Compositional matrix adjust.
 Identities = 17/30 (56%), Positives = 22/30 (73%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CGFGC+GG+P  AW  + K G+V+GG Y S
Sbjct: 153 CGFGCSGGYPIRAWERFKKHGLVTGGNYDS 182


>gi|166030308|gb|ABY78821.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 102/299 (34%), Positives = 128/299 (42%), Gaps = 42/299 (14%)

Query: 39  KQAEKNSLSNIPRAHLKSWMGV--HPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
           K      + NI  A  +   G       +LP  R  E     ++  +LP +FDS  KWPN
Sbjct: 47  KAVYNGKMQNITFAEARRLTGAFRRKTSSLPPVRFTE----EQLRTELPESFDSAEKWPN 102

Query: 97  CPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPY 156
           CPTIREI DQ +CGSCW           +   G       S  H   C  +C +      
Sbjct: 103 CPTIREIADQSACGSCWAVSTASAISDRYCTVGGVQQLRISAAHLMSCCEDCGDG----C 158

Query: 157 KKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETT---- 212
           K      A  Y VS    S   + Y   P  G               F  P   TT    
Sbjct: 159 KGGAPDSAWEYYVSHGLASSYCQPYPF-PHCGHHGGKGKKPPCSKYHFHTPKCNTTCTDK 217

Query: 213 AMSLIKWTIRDNTSQLGAEGA----------------FTVFDDLILYK-------SGKAL 249
           A+ LIK+  R N S +   G                 F V+ D + YK       SG  L
Sbjct: 218 AIPLIKY--RGNNSYMLLNGEDDYKRELYFNGPFVVDFGVYSDFLAYKTGVYRHVSGDVL 275

Query: 250 GGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           GGHA+RI+GWG+   +   YW IANSW+TDWG NG F ILRG +ECGIES+  AG+P +
Sbjct: 276 GGHAVRIVGWGKLNGT--PYWKIANSWDTDWGMNGHFLILRGNNECGIESTGYAGLPAI 332


>gi|118364222|ref|XP_001015333.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89297100|gb|EAR95088.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 341

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 74/200 (37%), Positives = 99/200 (49%), Gaps = 57/200 (28%)

Query: 115 CRPYEIAPCEHHVNG-TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           C+ Y  APC HHV+    P+C      TPKC + C       Y   ++ G+K+YSV   +
Sbjct: 189 CQAYSFAPCAHHVDTPLYPACTGEL-PTPKCAKTCDSGSGQTYT--VHKGSKAYSVGKTQ 245

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           ++IM EI  +GPVE A                                            
Sbjct: 246 EAIMTEIQTNGPVEAA-------------------------------------------- 261

Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
           FTV++D + YKSG       KALGGHAI+I+GWG +  +   YW++ NSWN  WGDNG F
Sbjct: 262 FTVYEDFLNYKSGVYKHVTGKALGGHAIKIVGWGVENNTP--YWIVVNSWNQTWGDNGTF 319

Query: 287 KILRGKDECGIESSITAGVP 306
           KILRGK+ECGIE+ +   +P
Sbjct: 320 KILRGKNECGIEAQVVTALP 339



 Score = 40.4 bits (93), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 16/30 (53%), Positives = 22/30 (73%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGG+P  A  Y+VK+G+V+G  Y +
Sbjct: 156 CGQGCNGGYPASAMSYYVKTGLVTGDLYNT 185


>gi|343474530|emb|CCD13852.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 335

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 98/303 (32%), Positives = 120/303 (39%), Gaps = 104/303 (34%)

Query: 67  PANRLPEL-IGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGS----------------- 108
           P + LP +     E+   LP  FD+  KWPNCPTI EI DQ S                 
Sbjct: 72  PVSVLPRVNFTEEELLAPLPETFDAAEKWPNCPTITEISDQSSCGSCWAVAAATSMTDRY 131

Query: 109 -------------------CGSC------------WG-----------CRPYEIAPCEHH 126
                              CG C            W            C+PY    C H+
Sbjct: 132 CTIHGVRGLRISAADLLACCGDCGYGCLGGDPDMAWAYFSSEGIASGRCQPYPFPRCSHY 191

Query: 127 VNGTR-PSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGP 185
            N T  P C A    TP C   C    D    K    G KSYS+S  E+   +E+Y  GP
Sbjct: 192 TNSTTYPQCSALHLWTPTCNPACT---DSTISKKKYRGLKSYSLS-GEEDFRRELYFRGP 247

Query: 186 VEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKS 245
            +  F V+ DL  YK G +   G                       GAF           
Sbjct: 248 FQAVFDVWSDLFAYKHGVYKHVG-----------------------GAF----------- 273

Query: 246 GKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGV 305
              +G HA+RI+GWG   +S   YW IANSWN +WGD G F +LRG +ECGIE S +AGV
Sbjct: 274 ---IGAHAVRIVGWGN--QSGVPYWKIANSWNAEWGDRGYFFMLRGDNECGIEDSGSAGV 328

Query: 306 PKL 308
           P +
Sbjct: 329 PAI 331


>gi|91089435|ref|XP_966663.1| PREDICTED: similar to AGAP004533-PA [Tribolium castaneum]
 gi|270012706|gb|EFA09154.1| cathepsin B precursor [Tribolium castaneum]
          Length = 320

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 93/324 (28%), Positives = 135/324 (41%), Gaps = 98/324 (30%)

Query: 41  AEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTI 100
           A  N   N P +HL+S  G   D   PA           + E +P NFD+R  WP C +I
Sbjct: 39  AGPNFPPNTPHSHLRSLNGARDD---PAFFTDTETKNVTIPEQIPQNFDARIVWPQCESI 95

Query: 101 REIRDQGSCGSCW-----------------GCRPYEIAP---------CEHHVNGTRPS- 133
           R+IR+QGSCGSCW                   + +E +          C H   G   S 
Sbjct: 96  RKIRNQGSCGSCWAFGAVETMSDRLCIASNATKKFEFSAQDLLACCKECGHGCGGGYSSR 155

Query: 134 ---------------CDASKGHTPKCVRECQEN-------------YDVPYKKDLNFGAK 165
                           + S+G  P  V+  +++             Y   Y +D  +GA+
Sbjct: 156 AWQYWVTDGIVSGGDFNTSQGCHPYSVQAFRDSTTPNCSSFCTNPKYQKNYSEDKRYGAR 215

Query: 166 SYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNT 225
           SY ++ N + I  EI   GPV+ ++ V+DD   Y++G                       
Sbjct: 216 SYRIAKNIEQIQAEIMTSGPVQASYVVYDDFYSYQNG----------------------- 252

Query: 226 SQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD-NG 284
                     V+  ++    G   G H+++ILGWG +  +   YWL+ANSW  DWG   G
Sbjct: 253 ----------VYQHVL----GNVSGRHSVKILGWGRENGT--DYWLVANSWGRDWGRLGG 296

Query: 285 LFKILRGKDECGIESSITAGVPKL 308
            FK LRG++ C IES+I  G PK+
Sbjct: 297 FFKFLRGENHCDIESNILGGDPKI 320



 Score = 45.1 bits (105), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 18/32 (56%), Positives = 22/32 (68%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CG GC GG+   AW+YWV  GIVSGG + + Q
Sbjct: 144 CGHGCGGGYSSRAWQYWVTDGIVSGGDFNTSQ 175


>gi|392922404|ref|NP_507186.3| Protein CPR-2 [Caenorhabditis elegans]
 gi|206994217|emb|CAB04322.3| Protein CPR-2 [Caenorhabditis elegans]
          Length = 326

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 93/335 (27%), Positives = 140/335 (41%), Gaps = 108/335 (32%)

Query: 35  AYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSE---VDEDLPANFDSR 91
           A  + Q E  ++++  + H +S   +H  +N P    P+    +E   V +  P NFD+R
Sbjct: 38  AASTFQTENYAVTH-EKMHTRS---MHEKFNAP---FPDEFRATEREFVLDATPLNFDAR 90

Query: 92  TKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHV--NGTRP----------------- 132
           T+WP C +++ IR+Q +CGSCW     E+      +  NGT+                  
Sbjct: 91  TRWPQCKSMKLIREQSNCGSCWAFSTAEVISDRTCIASNGTQQPIISPTDLLTCCGMSCG 150

Query: 133 -SCDA----------------------SKGHTPKCVREC-----------------QENY 152
             CD                         G  P  +R C                 Q  Y
Sbjct: 151 EGCDGGFPYRAFQWWARRGVVTGGDYLGTGCKPYPIRPCNSDNCVNLQTPPCRLSCQPGY 210

Query: 153 DVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETT 212
              Y  D N+G  +Y V     +I  +IY +GPV  AF V++D   YKSG          
Sbjct: 211 RTTYTNDKNYGNSAYPVPRTVAAIQADIYYNGPVVAAFIVYEDFEKYKSG---------- 260

Query: 213 AMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLI 272
                                      +  + +G++ GGHA++++GWG +  +   YWL 
Sbjct: 261 ---------------------------IYRHIAGRSKGGHAVKLIGWGTERGT--PYWLA 291

Query: 273 ANSWNTDWGDNGLFKILRGKDECGIESSITAGVPK 307
            NSW + WG++G F+ILRG DECGIES I AG+P+
Sbjct: 292 VNSWGSQWGESGTFRILRGVDECGIESRIVAGLPR 326



 Score = 40.0 bits (92), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 15/28 (53%), Positives = 22/28 (78%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
           CG GC+GGFP  A+++W + G+V+GG Y
Sbjct: 149 CGEGCDGGFPYRAFQWWARRGVVTGGDY 176


>gi|119638954|gb|ABL85236.1| cysteine proteinase 2 [Necator americanus]
          Length = 347

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 70/195 (35%), Positives = 100/195 (51%), Gaps = 41/195 (21%)

Query: 115 CRPYEIAPCEHHVN-GTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           C+PY   PC +H +      C      TP C + CQ +Y VPY  D  FG+K+  ++  E
Sbjct: 193 CKPYPFYPCGYHAHLPYYGPCPDGMWPTPTCEKACQSDYTVPYNDDRIFGSKTIVLTGEE 252

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K I +EI+ +GP+   +TV++D   YK+G                               
Sbjct: 253 K-IKREIFNNGPLVATYTVYEDFAYYKNG------------------------------- 280

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                 + +   G+A G HA++I+GWGE+  +  KYWLIANSWNTDWG+NG F++LRG +
Sbjct: 281 ------IYMTGLGRATGAHAVKIIGWGEE--NGVKYWLIANSWNTDWGENGFFRMLRGTN 332

Query: 294 ECGIESSITAGVPKL 308
            C IE S T G  K+
Sbjct: 333 LCDIELSATGGTFKV 347



 Score = 37.7 bits (86), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 15/31 (48%), Positives = 20/31 (64%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
           CG GC  G P  A+ Y ++ G+ SGG YG+K
Sbjct: 160 CGSGCTSGVPRQAFNYAIRKGVCSGGPYGTK 190


>gi|1181143|emb|CAA93278.1| cysteine proteinase [Haemonchus contortus]
          Length = 341

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 78/198 (39%), Positives = 104/198 (52%), Gaps = 53/198 (26%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY   PC HH   T      ++  TPKCVR+CQ++Y   YKKD + G  +Y V ++E
Sbjct: 188 GCRPYPFHPCGHHGKDTYYGECPNEATTPKCVRKCQKSYKKSYKKDRSIGKDAYEVPNSE 247

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K+I +EI ++GPV GA                                            
Sbjct: 248 KAIQREIMKNGPVVGA-------------------------------------------- 263

Query: 234 FTVFDDLILYK-------SGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
           FTV++D   YK       +GKA GGHAI+I+GWG++      YWLIANSW+ DWG+NG F
Sbjct: 264 FTVYEDFSYYKKGIYKHTAGKARGGHAIKIIGWGKE--GGVPYWLIANSWHNDWGENGYF 321

Query: 287 KILRGKDECGIESSITAG 304
           +ILRG + CGIE ++ AG
Sbjct: 322 RILRGSNHCGIEENVVAG 339



 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 23/46 (50%), Positives = 32/46 (69%)

Query: 69  NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           NR P     ++  ED+P +FD+RTKWP C +++ IRDQ +CGSCW 
Sbjct: 75  NRKPVFDDKNDKGEDIPESFDARTKWPKCSSLKHIRDQANCGSCWA 120



 Score = 39.3 bits (90), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 16/28 (57%), Positives = 21/28 (75%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
           CG+GCNGG+P  A+ Y+ K G V+GG Y
Sbjct: 156 CGYGCNGGWPIQAFNYFSKQGAVTGGDY 183


>gi|215687149|dbj|BAG90919.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 403

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 96/315 (30%), Positives = 131/315 (41%), Gaps = 94/315 (29%)

Query: 46  LSNIP--RAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREI 103
           L+N P   A  K  +GV P  +   N +P  +        LP  FD+R+ W  C TI  I
Sbjct: 109 LNNPPVQTAQFKHILGVKPTPHSVLNDVP--VKTYPRSLMLPKEFDARSAWSQCNTIGTI 166

Query: 104 RDQGSCGSCWGCRPYEIAP---CEHH-------VNGTRPSC-----DASKGHTPKC---- 144
            DQG CGSCW     E      C H        VN     C     D   G  P      
Sbjct: 167 LDQGHCGSCWAFGAVECLQDRFCIHFNMNISLSVNDLVACCGFMCGDGCDGGYPIMAWRY 226

Query: 145 -VRE--------------------CQENYDVP------------YKKDLNFGAKSYSVSS 171
            VR                     C+  Y  P            + +  +F   +Y V+S
Sbjct: 227 FVRNGVVTDECDPYFDQVGCKHPGCEPAYPTPVCEKKCKVQNQVWLEKKHFSVNAYRVNS 286

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           +   IM E+Y++GPVE AFTV++D   YKSG +                           
Sbjct: 287 DPHDIMAEVYQNGPVEVAFTVYEDFAHYKSGVY--------------------------- 319

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
                      + +G  +GGHA++++GWG  + + E YWL+AN WN  WGD+G FKI+RG
Sbjct: 320 ----------KHITGGMMGGHAVKLIGWGTTD-AGEDYWLLANQWNRGWGDDGYFKIIRG 368

Query: 292 KDECGIESSITAGVP 306
            +ECGIE  + AG+P
Sbjct: 369 TNECGIEEDVVAGMP 383



 Score = 42.4 bits (98), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 15/25 (60%), Positives = 23/25 (92%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVS 32
           +CG GC+GG+P MAWRY+V++G+V+
Sbjct: 210 MCGDGCDGGYPIMAWRYFVRNGVVT 234


>gi|401758196|gb|AFQ01133.1| cathepsin B [Chilo suppressalis]
          Length = 350

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 99/297 (33%), Positives = 127/297 (42%), Gaps = 97/297 (32%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPA 68
           CG GC GG    AW Y+ K GIVSGG Y                  KS  G  P    P 
Sbjct: 149 CGNGCEGGVLTRAWIYYKKIGIVSGGGY------------------KSKQGCQPYTIPPC 190

Query: 69  NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVN 128
           N L  + G  E  +++P         P C  I  I +Q        C+   I        
Sbjct: 191 NHL--VWGEIEQCKNIPMT-------PKCKNIPVIPEQ--------CKYIPI-------- 225

Query: 129 GTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEG 188
                       TP+C ++C +NY V Y KD + G   Y V  +E  I KEIYE+GPV  
Sbjct: 226 ------------TPECEKKCNKNYKVCYSKDKHRGKSVYRVKKSE--IFKEIYEYGPVTS 271

Query: 189 AFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKA 248
            FTV++D + YK G                                     +  Y SG+ 
Sbjct: 272 YFTVYEDFLNYKEG-------------------------------------IYNYTSGQK 294

Query: 249 LGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILR-GKDECGIESSITAG 304
           LG H+++I+GWGE+   K  YWL ANS+NTDWGD G FKI+R G   CGI  ++ AG
Sbjct: 295 LGLHSVKIIGWGEERGIK--YWLAANSFNTDWGDKGFFKIIREGVGSCGISDNVVAG 349



 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 27/74 (36%), Positives = 38/74 (51%), Gaps = 2/74 (2%)

Query: 41  AEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTI 100
           A +N     P  ++ + MG   D  +  + LP+           P  FD+R  W NCPT+
Sbjct: 43  AGRNFPKKTPLKYIYNLMGTLSDSRM--DNLPQRNYTFSRKTKYPNQFDAREHWKNCPTL 100

Query: 101 REIRDQGSCGSCWG 114
           ++IRDQG CGSCW 
Sbjct: 101 KDIRDQGGCGSCWA 114


>gi|297843026|ref|XP_002889394.1| hypothetical protein ARALYDRAFT_887367 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335236|gb|EFH65653.1| hypothetical protein ARALYDRAFT_887367 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 359

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 92/331 (27%), Positives = 133/331 (40%), Gaps = 110/331 (33%)

Query: 37  GSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
           G K +  +  +N   A  K  +GV P        +P  I   ++   LP  FD+RT W  
Sbjct: 58  GWKASLNDRFANATVAEFKRLLGVKPTPKTAYLGVP--IVRHDLSLKLPKEFDARTAWSQ 115

Query: 97  CPTIREIRDQGSCGSCWG-----------CRPYEI------------------------- 120
           C +I  I DQG CGSCW            C  Y +                         
Sbjct: 116 CTSIPRILDQGHCGSCWAFGAVESLSDRFCIKYNLNVSLSANDVVACCGLLCGLGCNGGF 175

Query: 121 ---------------APCEHHVNGT---RPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
                            C+ + + T    P C+     TPKCVR+C     + + +  ++
Sbjct: 176 PMGAWLYFKYHGVVTEECDPYFDNTGCSHPGCEPGYP-TPKCVRKCVSENQL-WGESKHY 233

Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
           G  +Y ++ + + IM E+Y++GPVE A                                 
Sbjct: 234 GVSAYRINHDPQDIMAEVYKNGPVEVA--------------------------------- 260

Query: 223 DNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANS 275
                      FTV++D   YKSG         +GGHA++++GWG  +   E YWL+AN 
Sbjct: 261 -----------FTVYEDFAHYKSGVYKHITGTKIGGHAVKLIGWGTSDDG-EDYWLLANQ 308

Query: 276 WNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           WN  WGD+G FKI RG +ECGIE  + AG+P
Sbjct: 309 WNRSWGDDGYFKIRRGTNECGIEHGVVAGLP 339


>gi|239938574|gb|ACS36086.1| cysteine proteinase [Haemonchus contortus]
          Length = 253

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 78/198 (39%), Positives = 105/198 (53%), Gaps = 53/198 (26%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY   PC HH   T      ++  TPKCVR+CQ++Y   YKKD + G  +Y V ++E
Sbjct: 100 GCRPYPFHPCGHHGKDTYYGECPNEATTPKCVRKCQKSYKKSYKKDRSIGKDAYEVPNSE 159

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K+I +EI ++GPV GA                                            
Sbjct: 160 KAIQREIMKNGPVVGA-------------------------------------------- 175

Query: 234 FTVFDDLILYK-------SGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
           FTV++D   YK       +GKA GGHAI+I+GWG++  +   YWLIANSW+ DWG+NG F
Sbjct: 176 FTVYEDFSYYKKGIYKHTAGKARGGHAIKIIGWGKE--NGVPYWLIANSWHNDWGENGYF 233

Query: 287 KILRGKDECGIESSITAG 304
           +ILRG + CGIE ++ AG
Sbjct: 234 RILRGSNHCGIEENVVAG 251



 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 18/31 (58%), Positives = 25/31 (80%)

Query: 83  DLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
           D+P +FD+RTKWP C +++ I DQ +CGSCW
Sbjct: 1   DIPESFDARTKWPKCSSLKHIHDQANCGSCW 31



 Score = 38.1 bits (87), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 16/28 (57%), Positives = 21/28 (75%)

Query: 9  CGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
          CG+GCNGG+P  A+ Y+ K G V+GG Y
Sbjct: 68 CGYGCNGGWPIQAFNYFSKQGAVTGGDY 95


>gi|48762493|dbj|BAD23816.1| cathepsin B-N1 [Tuberaphis coreana]
          Length = 340

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 102/364 (28%), Positives = 149/364 (40%), Gaps = 133/364 (36%)

Query: 35  AYGSKQA---EKNSLSNIPRAHLKSW-MGVHPDYNLPANRLPELIG-------------- 76
            Y ++QA   E++ ++ I  A+ K+W  GV+ D  L  +   +L+G              
Sbjct: 15  VYRTEQAYFLEEDYINQI-NANAKTWKAGVNFDPKLSIDSFVKLLGSKGVQAAKQASPDM 73

Query: 77  -------YSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG--------------- 114
                  Y+     +P++FD+R KW  C TI E+RDQG CGSCW                
Sbjct: 74  FKTHDEAYNSWSNRIPSSFDARKKWRKCSTIGEVRDQGKCGSCWAFGTSSAFADRLCIAT 133

Query: 115 -------CRPYEIAPCEH--------------------HVNGTRPSCDASKGHTP----- 142
                    P E+A C H                    H   T  + D+ +G  P     
Sbjct: 134 DGEFNELLSPEELAFCCHKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPP 193

Query: 143 --------------------KCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYE 182
                               +C R C  N D+ +K+D ++   +Y ++    +I  +I  
Sbjct: 194 CPLDEYGNNTCRGKPAEKNHRCTRMCYGNQDLDFKEDHHYTRDAYYLTYG--TIQNDILA 251

Query: 183 HGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLIL 242
           +GP+E +F V+DD   YKSG +    N T                               
Sbjct: 252 YGPIEASFEVYDDFPSYKSGVYTKMENATY------------------------------ 281

Query: 243 YKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSIT 302
                 LGGHA++++GWGE+      YWL+ NSWN  WGD GLFKI RG +ECGI++S T
Sbjct: 282 ------LGGHAVKLIGWGEEYGV--PYWLLVNSWNDQWGDQGLFKIRRGTNECGIDNSTT 333

Query: 303 AGVP 306
            GVP
Sbjct: 334 GGVP 337



 Score = 40.8 bits (94), Expect = 0.80,   Method: Compositional matrix adjust.
 Identities = 17/32 (53%), Positives = 23/32 (71%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CGFGC+GG+P  AW  + K G+V+GG Y S +
Sbjct: 153 CGFGCSGGYPIRAWERFKKHGLVTGGNYDSGE 184


>gi|166030318|gb|ABY78826.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 335

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 98/303 (32%), Positives = 119/303 (39%), Gaps = 104/303 (34%)

Query: 67  PANRLPEL-IGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGS----------------- 108
           P + LP +     E+   LP  FD+  KWPNCPTI EI DQ S                 
Sbjct: 72  PVSVLPRVNFTEEELLAPLPETFDAAEKWPNCPTITEISDQSSCGSCWAVAAATSMTDRY 131

Query: 109 -------------------CGSC------------WG-----------CRPYEIAPCEHH 126
                              CG C            W            C+PY    C H+
Sbjct: 132 CTIHGVRGLRISAADLLACCGDCGYGCLGGDPDMAWAYFSSEGIASGRCQPYPFPRCSHY 191

Query: 127 VNGTR-PSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGP 185
            N T  P C A    TP C   C    D    K    G KSYS S  E+   +E+Y  GP
Sbjct: 192 TNSTTYPQCSALHLWTPTCNPACT---DSTISKKKYRGLKSYSFS-GEEDFRRELYFRGP 247

Query: 186 VEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKS 245
            +  F V+ DL  YK G +   G                       GAF           
Sbjct: 248 FQAVFDVWSDLFAYKHGVYKHVG-----------------------GAF----------- 273

Query: 246 GKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGV 305
              +G HA+RI+GWG   +S   YW IANSWN +WGD G F +LRG +ECGIE S +AGV
Sbjct: 274 ---IGAHAVRIVGWGN--QSGVPYWKIANSWNAEWGDRGYFFMLRGDNECGIEDSGSAGV 328

Query: 306 PKL 308
           P +
Sbjct: 329 PAI 331


>gi|119638965|gb|ABL85237.1| cysteine proteinase 3 [Necator americanus]
          Length = 360

 Score =  125 bits (314), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 92/305 (30%), Positives = 127/305 (41%), Gaps = 106/305 (34%)

Query: 73  ELIGYSEVD--EDLPANFDSRTKWPNCPTIREIRDQGSCGSCW----------------- 113
           E++   ++D  E++P +FD+R KWP C +I  IRDQ  CGSCW                 
Sbjct: 77  EMLKEEDMDFSEEIPVSFDARDKWPKCTSIGFIRDQSHCGSCWAVSSAETMSDRLCVQSN 136

Query: 114 ---------------------GC------RPYE----IAPCEHHVNGTRPSC-------- 134
                                GC      R +E       C   + GT+ SC        
Sbjct: 137 GTIKVLLSDTDILACCPNCGAGCGGGHTIRAWEYFKNTGVCTGGLYGTKDSCKPYAFYPC 196

Query: 135 -DASKGH-------TPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPV 186
            D S G        TPKC + CQ  Y   Y  D  +   +Y +  NE  I  EI  +GPV
Sbjct: 197 KDESYGKCPKDSFPTPKCRKICQYKYSKKYADDKYYANSAYRIPQNETWIKLEIMRNGPV 256

Query: 187 EGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG 246
             +F ++ D   Y+ G +   G                                     G
Sbjct: 257 TASFRIYPDFGFYEKGVYVTSG-------------------------------------G 279

Query: 247 KALGGHAIRILGWGEDEK--SKEKYWLIANSWNTDWGD-NGLFKILRGKDECGIESSITA 303
           + LGGHAI+I+GWG ++   +   YWLIANSW TDWG+ NG F+ILRG++ C IE  + A
Sbjct: 280 RELGGHAIKIIGWGTEKVNGTDLPYWLIANSWGTDWGENNGYFRILRGQNHCQIEQKVIA 339

Query: 304 GVPKL 308
           G+ K+
Sbjct: 340 GMIKV 344



 Score = 37.7 bits (86), Expect = 7.0,   Method: Compositional matrix adjust.
 Identities = 16/35 (45%), Positives = 22/35 (62%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK 43
           CG GC GG    AW Y+  +G+ +GG YG+K + K
Sbjct: 155 CGAGCGGGHTIRAWEYFKNTGVCTGGLYGTKDSCK 189


>gi|204022092|dbj|BAG71143.1| cathepsin B-N2 [Tuberaphis coreana]
          Length = 334

 Score =  125 bits (314), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 102/364 (28%), Positives = 149/364 (40%), Gaps = 133/364 (36%)

Query: 35  AYGSKQA---EKNSLSNIPRAHLKSW-MGVHPDYNLPANRLPELIG-------------- 76
            Y ++QA   E++ ++ I  A+ K+W  GV+ D  L  +   +L+G              
Sbjct: 12  VYRTEQAYFLEEDYINQI-NANAKTWKAGVNFDPKLSIDSFVKLLGSKGVQAAKQASPDM 70

Query: 77  -------YSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG--------------- 114
                  Y+     +P++FD+R KW  C TI E+RDQG CGSCW                
Sbjct: 71  FKTHDEAYNSWSNRIPSSFDARKKWRKCSTIGEVRDQGKCGSCWAFGTSSAFADRLCIAT 130

Query: 115 -------CRPYEIAPCEH--------------------HVNGTRPSCDASKGHTP----- 142
                    P E+A C H                    H   T  + D+ +G  P     
Sbjct: 131 DGEFNELLSPEELAFCCHKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPP 190

Query: 143 --------------------KCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYE 182
                               +C R C  N D+ +K+D ++   +Y ++    +I  +I  
Sbjct: 191 CPLDEYGNNTCRGKPAEKNHRCTRMCYGNQDLDFKEDHHYTRDAYYLTYG--TIQNDILA 248

Query: 183 HGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLIL 242
           +GP+E +F V+DD   YKSG +    N T                               
Sbjct: 249 YGPIEASFEVYDDFPSYKSGVYTKMENATY------------------------------ 278

Query: 243 YKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSIT 302
                 LGGHA++++GWGE+      YWL+ NSWN  WGD GLFKI RG +ECGI++S T
Sbjct: 279 ------LGGHAVKLIGWGEEYGV--PYWLLVNSWNDQWGDQGLFKIRRGTNECGIDNSTT 330

Query: 303 AGVP 306
            GVP
Sbjct: 331 GGVP 334



 Score = 40.8 bits (94), Expect = 0.85,   Method: Compositional matrix adjust.
 Identities = 17/32 (53%), Positives = 23/32 (71%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CGFGC+GG+P  AW  + K G+V+GG Y S +
Sbjct: 150 CGFGCSGGYPIRAWERFKKHGLVTGGNYDSGE 181


>gi|388499754|gb|AFK37943.1| unknown [Lotus japonicus]
          Length = 209

 Score =  125 bits (314), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 70/188 (37%), Positives = 99/188 (52%), Gaps = 40/188 (21%)

Query: 119 EIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
           E  P    +  + P C+ +   TPKCVR+C +   + +KK  +F   +YSV S+   IM 
Sbjct: 42  ECDPYFDQIGCSHPGCEPAY-QTPKCVRKCVKGNQI-WKKSKHFSVNAYSVKSDPYDIMA 99

Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
           E+Y++GPVE AFTV++D   YKSG +                                  
Sbjct: 100 EVYKNGPVEVAFTVYEDFAHYKSGVY---------------------------------- 125

Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
               + +G  LGGHA++++GWG  ++  E YWLIAN WN  WGD+G F I RG +ECGIE
Sbjct: 126 ---KHITGSQLGGHAVKLIGWGTTDEG-EDYWLIANQWNRSWGDDGYFMIRRGTNECGIE 181

Query: 299 SSITAGVP 306
             +TAG+P
Sbjct: 182 EDVTAGLP 189


>gi|19526442|gb|AAL89717.1|AF483623_1 cathepsin B [Apriona germari]
          Length = 324

 Score =  125 bits (314), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 90/320 (28%), Positives = 134/320 (41%), Gaps = 98/320 (30%)

Query: 41  AEKNSLSNIPRA--HLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCP 98
           A KN     P     L   +G++ D N+    LP  + + E    +P +FD+R +WP C 
Sbjct: 44  ARKNFEGRTPEQLKALADVIGINRDPNV---TLP--VVFHEAISGIPDSFDAREQWPFCE 98

Query: 99  TIREIRDQGSCGSCW--------------------------------------GCRP-YE 119
           +IR IRD+G+CGSCW                                      GCR  + 
Sbjct: 99  SIRTIRDEGACGSCWAFAAVEVMSDRLCLASEGRKKFIFSAEEVVSCCTACGGGCRGGFL 158

Query: 120 IAPCEHHVN-------------GTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
             P ++ V              G +P   A  G TP+C + C   Y+  ++KDL     +
Sbjct: 159 NEPYKYWVTNGIPSGGDYGSKLGCKPYTAAVSGETPQCQKACVSGYEKSWEKDLRHATSA 218

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           Y V+     I +EI ++GPV     V++D   Y +G                        
Sbjct: 219 YQVNGGVLQIQREILDNGPVTAYMEVYEDFYSYGTG------------------------ 254

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
                        +  + SG  +GGHA++I+GWG +  +   YW+ ANSW T +G++G F
Sbjct: 255 -------------IYQHTSGSFVGGHAVKIIGWGSE--NDVPYWIAANSWGTGFGEDGFF 299

Query: 287 KILRGKDECGIESSITAGVP 306
           +ILRG +  GIES I AG P
Sbjct: 300 RILRGSNCAGIESYIVAGYP 319



 Score = 41.6 bits (96), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 19/31 (61%), Positives = 22/31 (70%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
           CG GC GGF    ++YWV +GI SGG YGSK
Sbjct: 149 CGGGCRGGFLNEPYKYWVTNGIPSGGDYGSK 179


>gi|239938576|gb|ACS36087.1| cysteine proteinase [Haemonchus contortus]
          Length = 253

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 78/198 (39%), Positives = 104/198 (52%), Gaps = 53/198 (26%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY   PC HH   T      ++  TPKCVR+CQ++Y   YKKD + G  +Y V ++E
Sbjct: 100 GCRPYPFHPCGHHGKDTYYGECPNEATTPKCVRKCQKSYKKSYKKDRSIGKDAYEVPNSE 159

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K+I +EI ++GPV GA                                            
Sbjct: 160 KAIQREIMKNGPVVGA-------------------------------------------- 175

Query: 234 FTVFDDLILYK-------SGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
           FTV++D   YK       +GKA GGHAI+I+GWG++      YWLIANSW+ DWG+NG F
Sbjct: 176 FTVYEDFSYYKKGIYKHTAGKARGGHAIKIIGWGKE--GGVPYWLIANSWHNDWGENGYF 233

Query: 287 KILRGKDECGIESSITAG 304
           +ILRG + CGIE ++ AG
Sbjct: 234 RILRGSNHCGIEENVVAG 251



 Score = 53.9 bits (128), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 19/31 (61%), Positives = 26/31 (83%)

Query: 83  DLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
           D+P +FD+RTKWP C +++ IRDQ +CGSCW
Sbjct: 1   DIPESFDARTKWPKCSSLKHIRDQANCGSCW 31



 Score = 38.1 bits (87), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 16/28 (57%), Positives = 21/28 (75%)

Query: 9  CGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
          CG+GCNGG+P  A+ Y+ K G V+GG Y
Sbjct: 68 CGYGCNGGWPIQAFNYFSKQGAVTGGDY 95


>gi|239938582|gb|ACS36090.1| cysteine proteinase [Haemonchus contortus]
          Length = 346

 Score =  125 bits (313), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 75/205 (36%), Positives = 99/205 (48%), Gaps = 53/205 (25%)

Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
           G  GS  GCRPY   PC HH N T          TPKC R CQ +Y   Y  D ++G  +
Sbjct: 184 GDYGSKDGCRPYPFHPCGHHGNDTYYGECPKGAKTPKCRRRCQRSYKKAYYMDKSYGEDA 243

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           Y V  + K+I +EI ++GPV GA                                     
Sbjct: 244 YEVPHSVKAIQREIMKNGPVVGA------------------------------------- 266

Query: 227 QLGAEGAFTVFDDLILYK-------SGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
                  FTV++D   YK       +G+A GGHAI+I+GWG +  +   YWLIANSW+ D
Sbjct: 267 -------FTVYEDFSYYKKGIYKHTAGQARGGHAIKIIGWGVE--NDVPYWLIANSWHND 317

Query: 280 WGDNGLFKILRGKDECGIESSITAG 304
           WG+ G F+++RG +ECGIE  + AG
Sbjct: 318 WGEEGYFRMIRGINECGIEQEVVAG 342



 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 24/46 (52%), Positives = 32/46 (69%)

Query: 69  NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           NR P +    +  +D+P +FD+RT WPNC +IR IRDQ +CGSCW 
Sbjct: 79  NRKPAVENEDDEGDDIPESFDARTHWPNCTSIRHIRDQANCGSCWA 124



 Score = 38.9 bits (89), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 15/31 (48%), Positives = 23/31 (74%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
           C +GC+GG+P +A+ ++   G V+GG YGSK
Sbjct: 159 CSYGCDGGWPILAFDFYTYEGAVTGGDYGSK 189


>gi|222424744|dbj|BAH20325.1| AT1G02305 [Arabidopsis thaliana]
          Length = 293

 Score =  125 bits (313), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 94/322 (29%), Positives = 131/322 (40%), Gaps = 110/322 (34%)

Query: 46  LSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRD 105
            +N   A  K  +GV P        +P  I   ++   LP  FD+RT W  C +I  I D
Sbjct: 1   FANATVAEFKRLLGVKPTPKTEFLGVP--IVSHDISLKLPKEFDARTAWSQCTSIGRILD 58

Query: 106 QGSCGSCW-------------------------------------GCRP-YEIAP----- 122
           QG CGSCW                                     GC   Y IA      
Sbjct: 59  QGHCGSCWAFGAVESLSDRFCIKYNMNVSLSVNDLLACCGFLCGQGCNGGYPIAAWRYFK 118

Query: 123 --------CEHHVNGT---RPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSS 171
                   C+ + + T    P C+ +   TPKC R+C     + +++  ++G  +Y V S
Sbjct: 119 HHGVVTEECDPYFDNTGCSHPGCEPAYP-TPKCARKCVSGNQL-WRESKHYGVSAYKVRS 176

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           +   IM E+Y++GPVE A                                          
Sbjct: 177 HPDDIMAEVYKNGPVEVA------------------------------------------ 194

Query: 232 GAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNG 284
             FTV++D   YKSG         +GGHA++++GWG  +   E YWL+AN WN  WGD+G
Sbjct: 195 --FTVYEDFAHYKSGVYKHITGTNIGGHAVKLIGWGTSDDG-EDYWLLANQWNRSWGDDG 251

Query: 285 LFKILRGKDECGIESSITAGVP 306
            FKI RG +ECGIE  + AG+P
Sbjct: 252 YFKIRRGTNECGIEHGVVAGLP 273



 Score = 38.1 bits (87), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 15/25 (60%), Positives = 19/25 (76%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVS 32
           LCG GCNGG+P  AWRY+   G+V+
Sbjct: 100 LCGQGCNGGYPIAAWRYFKHHGVVT 124


>gi|239938584|gb|ACS36091.1| cysteine proteinase [Haemonchus contortus]
          Length = 346

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 75/205 (36%), Positives = 99/205 (48%), Gaps = 53/205 (25%)

Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
           G  GS  GCRPY   PC HH N T          TPKC R CQ +Y   Y  D ++G  +
Sbjct: 184 GDYGSKDGCRPYPFHPCGHHGNDTYYGECPKGAKTPKCRRRCQRSYKKAYYMDKSYGEDA 243

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           Y V  + K+I +EI ++GPV GA                                     
Sbjct: 244 YEVPHSVKAIQREIMKNGPVVGA------------------------------------- 266

Query: 227 QLGAEGAFTVFDDLILYK-------SGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
                  FTV++D   YK       +G+A GGHAI+I+GWG +  +   YWLIANSW+ D
Sbjct: 267 -------FTVYEDFSYYKKGIYKHTAGQARGGHAIKIIGWGVE--NDVPYWLIANSWHND 317

Query: 280 WGDNGLFKILRGKDECGIESSITAG 304
           WG+ G F+++RG +ECGIE  + AG
Sbjct: 318 WGEEGYFRMIRGINECGIEQEVVAG 342



 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 24/46 (52%), Positives = 32/46 (69%)

Query: 69  NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           NR P +    +  +D+P +FD+RT WPNC +IR IRDQ +CGSCW 
Sbjct: 79  NRKPAVENEDDEGDDIPESFDARTHWPNCTSIRHIRDQANCGSCWA 124



 Score = 40.8 bits (94), Expect = 0.81,   Method: Compositional matrix adjust.
 Identities = 16/31 (51%), Positives = 24/31 (77%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
           CG+GC+GG+P +A+ ++   G V+GG YGSK
Sbjct: 159 CGYGCDGGWPILAFDFYTYEGAVTGGDYGSK 189


>gi|118429529|gb|ABK91812.1| cathepsin B precursor [Clonorchis sinensis]
          Length = 342

 Score =  124 bits (312), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 72/195 (36%), Positives = 98/195 (50%), Gaps = 40/195 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCR Y    CEH   G  P C      TP+C++ C +  ++ Y+KD      SY+V   E
Sbjct: 183 GCRSYPFPSCEHRGKGQYPPCPHQLYPTPECIKRC-DTKEIDYEKDKTRANISYNVYPAE 241

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           +++MKEI   GPV     V++DL+ YKSG +                             
Sbjct: 242 QAVMKEIMLRGPVGAILHVYEDLLDYKSGVY----------------------------- 272

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
           F V+        G  LG H IRILGWGE++     YWL+ANSWN DWG+ G  ++LR ++
Sbjct: 273 FHVW--------GGHLGEHGIRILGWGEEDGVP--YWLVANSWNEDWGEKGYMRVLRWRN 322

Query: 294 ECGIESSITAGVPKL 308
           ECGI   +TAG+P L
Sbjct: 323 ECGIVDQVTAGLPDL 337



 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 21/34 (61%), Positives = 27/34 (79%)

Query: 81  DEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           ++ LP +FD+R  WP+CP+I EIRDQ SCGSCW 
Sbjct: 83  NQHLPESFDARANWPHCPSISEIRDQSSCGSCWA 116


>gi|255548165|ref|XP_002515139.1| cathepsin B, putative [Ricinus communis]
 gi|223545619|gb|EEF47123.1| cathepsin B, putative [Ricinus communis]
          Length = 376

 Score =  124 bits (312), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 94/341 (27%), Positives = 140/341 (41%), Gaps = 113/341 (33%)

Query: 37  GSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
           G + A    LSN      K  +G  P        +P +I + +  + LP  FD+RT WP+
Sbjct: 56  GWEAAMNPQLSNFTVGQFKYLLGAKPTPKKELMGVP-MISHPKTLK-LPKEFDARTAWPH 113

Query: 97  CPTIREIRDQ-----------------GSCGSCWGCRPYE-------------------- 119
           C TI +I  Q                 G CGSCW     E                    
Sbjct: 114 CSTIGKILGQLLSFYNIFSIFFFLFLEGHCGSCWAFGAVESLSDRFCIHFGMNISLSVND 173

Query: 120 -IAPC--------------------EHH-------------VNGTRPSCDASKGHTPKCV 145
            +A C                     HH             +  + P C+     TPKCV
Sbjct: 174 LLACCGFLCGDGCDGGYPMYAWRYFVHHGVVTEECDPYFDNIGCSHPGCEPGFP-TPKCV 232

Query: 146 RECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFF 205
           R+C +   + +++  ++   +Y +SS+   +M E+Y++GPVE +FTV++D   YKSG + 
Sbjct: 233 RKCIDKNQL-WRQSKHYSVNAYRISSDPHDVMAEVYKNGPVEVSFTVYEDFAHYKSGVY- 290

Query: 206 VPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKS 265
                                                + +G+ +GGHA++++GWG  +  
Sbjct: 291 ------------------------------------KHITGEVMGGHAVKLIGWGTSDNG 314

Query: 266 KEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
            E YWL+AN WN  WGD+G FKI RG +ECGIE    AG+P
Sbjct: 315 -EDYWLLANQWNRGWGDDGYFKIRRGTNECGIEDDAVAGLP 354



 Score = 39.7 bits (91), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 15/25 (60%), Positives = 20/25 (80%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVS 32
           LCG GC+GG+P  AWRY+V  G+V+
Sbjct: 181 LCGDGCDGGYPMYAWRYFVHHGVVT 205


>gi|204022096|dbj|BAG71145.1| cathepsin B-N1 [Tuberaphis sumatrana]
          Length = 334

 Score =  124 bits (311), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 101/363 (27%), Positives = 149/363 (41%), Gaps = 133/363 (36%)

Query: 36  YGSKQA---EKNSLSNIPRAHLKSW-MGVHPDYNLPANRLPELIG--------------- 76
           Y ++QA   E++ ++ I  A+ K+W  GV+ D  L  +   +L+G               
Sbjct: 13  YQTEQAYFLEEDYINQI-NANAKTWKAGVNFDPKLSIDSFVKLLGSKGVQAAKQASPDMF 71

Query: 77  ------YSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG---------------- 114
                 Y+     +P+NFD+R KW  C TI E+RDQG CGSCW                 
Sbjct: 72  KTHDEAYNNWSNRIPSNFDARKKWRKCSTIGEVRDQGHCGSCWAFGTSSAFADRLCIATD 131

Query: 115 ------CRPYEIAPCEH--------------------HVNGTRPSCDASKGHTP------ 142
                   P E+A C H                    H   T  + D+ +G  P      
Sbjct: 132 GEFNELLSPEELAFCCHKCGFGCSGGNPIKAWERFQKHGLVTGGNYDSGEGCQPYKVPPC 191

Query: 143 -------------------KCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEH 183
                              +C R C  N ++ +K+D ++   +Y ++    +I  ++  +
Sbjct: 192 PLDEYGNNTCSGKPAEKNHRCTRMCYGNQNLDFKEDHHYTRDAYYLTYG--TIQYDVLAY 249

Query: 184 GPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILY 243
           GP+E +F V+DD   YKSG +    N T                                
Sbjct: 250 GPIEASFEVYDDFPSYKSGVYTKMENATY------------------------------- 278

Query: 244 KSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITA 303
                LGGHA++++GWGE+      YWL+ NSWN  WGD GLFKI RG +ECGI++S T 
Sbjct: 279 -----LGGHAVKLIGWGEEYGV--PYWLLVNSWNDQWGDQGLFKIRRGTNECGIDNSTTG 331

Query: 304 GVP 306
           GVP
Sbjct: 332 GVP 334



 Score = 38.5 bits (88), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 17/30 (56%), Positives = 21/30 (70%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CGFGC+GG P  AW  + K G+V+GG Y S
Sbjct: 150 CGFGCSGGNPIKAWERFQKHGLVTGGNYDS 179


>gi|328697984|ref|XP_003240502.1| PREDICTED: cathepsin B [Acyrthosiphon pisum]
          Length = 339

 Score =  124 bits (311), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 70/193 (36%), Positives = 93/193 (48%), Gaps = 40/193 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY + PC  + +GT            +C R C  N D+ Y  D  F    Y ++   
Sbjct: 184 GCEPYRVPPCPRNEDGTSSCAGQPIEKNHRCTRMCYGNQDLDYNDDHRFTRDYYYLTYG- 242

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            SI K++  +GP+E +F V+DD   YKSG +    N T                      
Sbjct: 243 -SIQKDVMNYGPIEASFDVYDDFYSYKSGVYQRTPNAT---------------------- 279

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                          LGGHA++++GWG +E     YWL+ NSW+  WGDNGLFKI RG D
Sbjct: 280 --------------KLGGHAVKLIGWGVEEGI--PYWLMVNSWSAQWGDNGLFKIRRGTD 323

Query: 294 ECGIESSITAGVP 306
           ECGI+S+ TAGVP
Sbjct: 324 ECGIDSATTAGVP 336



 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 18/31 (58%), Positives = 23/31 (74%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           +P  FD+R +W +C TI E+RDQG CGSCW 
Sbjct: 87  IPRTFDARRRWRHCKTIGEVRDQGYCGSCWA 117


>gi|328726600|ref|XP_003248962.1| PREDICTED: cathepsin B-like cysteine proteinase-like [Acyrthosiphon
           pisum]
          Length = 169

 Score =  124 bits (311), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 70/193 (36%), Positives = 93/193 (48%), Gaps = 40/193 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY + PC  + +GT            +C R C  N D+ Y  D  F    Y ++   
Sbjct: 14  GCEPYRVPPCPRNEDGTSSCAGQPIEKNHRCTRMCYGNQDLDYNDDHRFTRDYYYLTYG- 72

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            SI K++  +GP+E +F V+DD   YKSG +    N T                      
Sbjct: 73  -SIQKDVMNYGPIEASFDVYDDFYSYKSGVYQRTPNAT---------------------- 109

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                          LGGHA++++GWG +E     YWL+ NSW+  WGDNGLFKI RG D
Sbjct: 110 --------------KLGGHAVKLIGWGVEEGI--PYWLMVNSWSAQWGDNGLFKIRRGTD 153

Query: 294 ECGIESSITAGVP 306
           ECGI+S+ TAGVP
Sbjct: 154 ECGIDSATTAGVP 166


>gi|166030310|gb|ABY78822.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 335

 Score =  124 bits (311), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 101/298 (33%), Positives = 128/298 (42%), Gaps = 41/298 (13%)

Query: 39  KQAEKNSLSNIPRAHLKSWMG--VHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
           K      + NI  A  +   G  +    +LP  R  E     ++  +LP +FDS  KWPN
Sbjct: 47  KAVYNGKMQNITFAEARRLTGARIQKTSSLPPVRFTE----EQLRTELPESFDSAEKWPN 102

Query: 97  CPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPY 156
           CPTIREI DQ +CGSCW           H   G       S  H   C  +C +  D  Y
Sbjct: 103 CPTIREIADQSACGSCWAVSTASAISDRHCTVGGVQQLRISAAHLMSCCEDCGDGCDGGY 162

Query: 157 KKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETT---- 212
                  +  Y VS    S   + Y   P  G               F  P   TT    
Sbjct: 163 PGT----SWEYYVSHGLASSYCQPYPF-PHCGHHGGKGKKPPCSKYHFHTPKCNTTCTDK 217

Query: 213 AMSLIKWTIRDNTS-----------QLGAEGAFT----VFDDLILYK-------SGKALG 250
           A+ LIK+  R N S           +L   G F     V+ D + YK       SG  LG
Sbjct: 218 AIPLIKY--RGNHSYEVHGEDDYKRELYFNGPFVVVFWVYSDFLAYKTGVYRHVSGDFLG 275

Query: 251 GHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           GHA+RI+GWG+   +   YW IANSW+TDWG NG    LRG +ECGIE++  AG P +
Sbjct: 276 GHAVRIVGWGKLNGT--PYWKIANSWDTDWGMNGHLLFLRGNNECGIEAAGYAGSPAI 331


>gi|299471123|emb|CBN78981.1| cathepsin B-like proteinase [Ectocarpus siliculosus]
          Length = 557

 Score =  124 bits (311), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 75/210 (35%), Positives = 100/210 (47%), Gaps = 53/210 (25%)

Query: 105 DQGSCGSCWGCRPYEIAPCEHHVN---GTRPSCDASKGHTPKCVRECQENYDVPYKKDLN 161
           D    G+   C+PYE  PC HHV+      P+C   +  TP+C+ EC E          N
Sbjct: 389 DYADIGTGTTCKPYEFMPCAHHVDPGASGYPACPDGEYPTPECLSECSET---------N 439

Query: 162 FGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTI 221
           F   SY     +K + +E Y    +E    +  D++ Y S                    
Sbjct: 440 FSGGSYG---EDKKMAREAYSLAGIE---NIQRDMMKYGS-------------------- 473

Query: 222 RDNTSQLGAEGAFTVFDDLILY-------KSGKALGGHAIRILGWGEDEKSKEKYWLIAN 274
                      AF+VF D + Y       +SG  +GGHA++++GWG DE S E YWLIAN
Sbjct: 474 --------VTAAFSVFSDFLTYSGGVYTHESGSFMGGHAVKMIGWGTDEVSGEDYWLIAN 525

Query: 275 SWNTDWGDNGLFKILRGKDECGIESSITAG 304
           SWN  WG+ GLF+ILRG +ECGIE  I AG
Sbjct: 526 SWNPSWGEGGLFRILRGVNECGIEGQIVAG 555



 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 23/49 (46%), Positives = 30/49 (61%), Gaps = 1/49 (2%)

Query: 66  LPANRLPELIGYSEVDEDLPANFDSRTKWPNCPT-IREIRDQGSCGSCW 113
           +P  R    +  S  DED+PANFD+R  +P C + I  +RDQ  CGSCW
Sbjct: 262 VPGRRRLTPVAQSSSDEDIPANFDAREAFPECASIIGRVRDQSDCGSCW 310



 Score = 40.0 bits (92), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 17/30 (56%), Positives = 23/30 (76%), Gaps = 2/30 (6%)

Query: 9   CGF--GCNGGFPGMAWRYWVKSGIVSGGAY 36
           CG   GCNGG PG AW+++ K+G+V+GG Y
Sbjct: 361 CGLSMGCNGGQPGSAWKWFTKTGVVTGGDY 390


>gi|118429531|gb|ABK91813.1| cathepsin B-like cysteine proteinase precursor [Clonorchis
           sinensis]
 gi|358331549|dbj|GAA37857.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 343

 Score =  124 bits (311), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 77/200 (38%), Positives = 92/200 (46%), Gaps = 54/200 (27%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCR Y    CEHHV G  P C      TP+CV+ C +   + Y KD      SY++ S+E
Sbjct: 183 GCRSYPFPKCEHHVQGHYPPCPHQYYPTPECVQHC-DTPGIDYVKDKTRANMSYNIYSSE 241

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
             IMKEI   GPVE                                              
Sbjct: 242 ILIMKEIMLRGPVEAV-------------------------------------------- 257

Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
           FTV++D + YK G         L  HAIRILGWGE+      YWLIANSWN DWG+ G  
Sbjct: 258 FTVYEDFLQYKFGVYFHSWGAPLSEHAIRILGWGEE--GDVPYWLIANSWNEDWGEKGYM 315

Query: 287 KILRGKDECGIESSITAGVP 306
           K LRG +ECGIE  +TAG+P
Sbjct: 316 KFLRGLNECGIEDDVTAGLP 335



 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 23/31 (74%), Positives = 26/31 (83%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           LP NFD+RTKWP+CP+I EIRDQ  CGSCW 
Sbjct: 86  LPKNFDARTKWPHCPSISEIRDQSGCGSCWA 116



 Score = 43.9 bits (102), Expect = 0.092,   Method: Compositional matrix adjust.
 Identities = 16/27 (59%), Positives = 22/27 (81%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           CG+GC+GG+P +AW YW   GIV+GG+
Sbjct: 151 CGYGCSGGYPAVAWDYWGAHGIVTGGS 177


>gi|350535627|ref|NP_001233013.1| uncharacterized protein LOC100164982 precursor [Acyrthosiphon
           pisum]
 gi|239789514|dbj|BAH71377.1| ACYPI005957 [Acyrthosiphon pisum]
          Length = 339

 Score =  124 bits (311), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 70/193 (36%), Positives = 93/193 (48%), Gaps = 40/193 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY + PC  + +G        K    +C R C  N D+ Y  D  F    Y ++   
Sbjct: 184 GCEPYRVPPCPRNEDGKSSCAGKPKEKNHRCTRMCYGNQDLDYDDDHRFTRDFYYLTYG- 242

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            SI K++  +GP+E +F V+DD   YKSG +    N T                      
Sbjct: 243 -SIQKDVLNYGPIEASFDVYDDFPSYKSGVYQRTPNAT---------------------- 279

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                          LGGHA++++GWG +E +   YWL+ NSWN  WGDNGLFKI RG D
Sbjct: 280 --------------KLGGHAVKLIGWGVEEGT--PYWLMVNSWNAQWGDNGLFKIRRGTD 323

Query: 294 ECGIESSITAGVP 306
           EC I+S+ TAGVP
Sbjct: 324 ECRIDSATTAGVP 336



 Score = 43.1 bits (100), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 21/44 (47%), Positives = 27/44 (61%), Gaps = 1/44 (2%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS-KQAEKNSLSNIPR 51
           CG GCNGG+P  AW+Y+   G+V+GG Y S K  E   +   PR
Sbjct: 152 CGHGCNGGYPIKAWKYFSTHGLVTGGNYKSGKGCEPYRVPPCPR 195


>gi|427787723|gb|JAA59313.1| Putative cathepsin b-like cysteine protease form 2 [Rhipicephalus
           pulchellus]
          Length = 338

 Score =  124 bits (310), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 72/194 (37%), Positives = 94/194 (48%), Gaps = 39/194 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY I    +   G  P         P C REC+++Y   Y +D ++G K Y++S +E
Sbjct: 180 GCQPYSIHTTRYTTTGLLPPPINDLSPMPPCKRECRKSYGKKYSEDKHYGEKVYTLSGDE 239

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
             I  EI+++GPVE  F V+ D   YKSG +        A S ++               
Sbjct: 240 AQIKTEIFKNGPVEADFAVYADFYSYKSGVY-------QAHSRVR--------------- 277

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                           G HAIRILGWG +  +   YWL ANSW   WGD G FKI RG +
Sbjct: 278 ---------------CGSHAIRILGWGTE--NGVPYWLAANSWTEHWGDKGYFKIRRGNN 320

Query: 294 ECGIESSITAGVPK 307
           ECGIE  I AG+PK
Sbjct: 321 ECGIEEDINAGIPK 334



 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 30/75 (40%), Positives = 46/75 (61%), Gaps = 4/75 (5%)

Query: 40  QAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPT 99
           +A +N   N+P +++K  MGV  +      RLP L+ +S + ++LP +FD+R  W  C +
Sbjct: 43  KAGRNFDKNVPFSYIKGLMGVARN---KTRRLPTLM-HSSIPDNLPESFDARQHWRKCNS 98

Query: 100 IREIRDQGSCGSCWG 114
           I  IRDQ SCG+CW 
Sbjct: 99  IHVIRDQSSCGACWA 113



 Score = 38.9 bits (89), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 15/31 (48%), Positives = 21/31 (67%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
           C  GC GG P  AW ++ + GIV+GG YG++
Sbjct: 148 CRTGCKGGVPSYAWMFYKEKGIVTGGLYGTE 178


>gi|18411686|ref|NP_567215.1| cathepsin B [Arabidopsis thaliana]
 gi|13877861|gb|AAK44008.1|AF370193_1 putative cathepsin B cysteine protease [Arabidopsis thaliana]
 gi|17473834|gb|AAL38343.1| unknown protein [Arabidopsis thaliana]
 gi|21281113|gb|AAM45063.1| putative cathepsin B cysteine protease [Arabidopsis thaliana]
 gi|21554165|gb|AAM63244.1| cathepsin B-like cysteine protease, putative [Arabidopsis thaliana]
 gi|24417490|gb|AAN60355.1| unknown [Arabidopsis thaliana]
 gi|24899725|gb|AAN65077.1| unknown protein [Arabidopsis thaliana]
 gi|51968702|dbj|BAD43043.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51969104|dbj|BAD43244.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51969220|dbj|BAD43302.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970472|dbj|BAD43928.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970630|dbj|BAD44007.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970704|dbj|BAD44044.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970802|dbj|BAD44093.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970974|dbj|BAD44179.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51971008|dbj|BAD44196.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51971116|dbj|BAD44250.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|62320144|dbj|BAD94342.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|110740287|dbj|BAF02040.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|332656652|gb|AEE82052.1| cathepsin B [Arabidopsis thaliana]
          Length = 359

 Score =  123 bits (309), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 98/335 (29%), Positives = 134/335 (40%), Gaps = 118/335 (35%)

Query: 37  GSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDED----LPANFDSRT 92
           G K A  +  SN   A  K  +GV P            +G   V  D    LP  FD+RT
Sbjct: 58  GWKAAINDRFSNATVAEFKRLLGVKP------TPKKHFLGVPIVSHDPSLKLPKAFDART 111

Query: 93  KWPNCPTIREIRDQGSCGSCW-------------------------------------GC 115
            WP C +I  I DQG CGSCW                                     GC
Sbjct: 112 AWPQCTSIGNILDQGHCGSCWAFGAVESLSDRFCIQFGMNISLSVNDLLACCGFRCGDGC 171

Query: 116 RP-YEIAP-------------CEHHVNGT---RPSCDASKGHTPKCVRECQENYDVPYKK 158
              Y IA              C+ + + T    P C+ +   TPKC R+C  +  + + +
Sbjct: 172 DGGYPIAAWQYFSYSGVVTEECDPYFDNTGCSHPGCEPAY-PTPKCSRKCVSDNKL-WSE 229

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
             ++   +Y+V SN + IM E+Y++GPV                                
Sbjct: 230 SKHYSVSTYTVKSNPQDIMAEVYKNGPV-------------------------------- 257

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWL 271
                       E +FTV++D   YKSG         +GGHA++++GWG   +  E YWL
Sbjct: 258 ------------EVSFTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTSSEG-EDYWL 304

Query: 272 IANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           +AN WN  WGD+G F I RG +ECGIE    AG+P
Sbjct: 305 MANQWNRGWGDDGYFMIRRGTNECGIEDEPVAGLP 339


>gi|330805199|ref|XP_003290573.1| hypothetical protein DICPUDRAFT_155103 [Dictyostelium purpureum]
 gi|325079281|gb|EGC32888.1| hypothetical protein DICPUDRAFT_155103 [Dictyostelium purpureum]
          Length = 313

 Score =  123 bits (309), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 93/312 (29%), Positives = 132/312 (42%), Gaps = 110/312 (35%)

Query: 54  LKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
           + S +G     N P+  +P L   ++ +   PA+FDSRT W NC TI  I +Q  CGSCW
Sbjct: 52  IGSLLGFKKSLNRPS--IPVL--NADPNIKAPASFDSRTAWSNCTTIGYIENQARCGSCW 107

Query: 114 GCRPYE--------------------IAPCEHHVNG------------------------ 129
                E                    +  C+   +G                        
Sbjct: 108 AFGAVESAQDRICIHKGLDVQLSFLDLVTCDQSDDGCEGGDDVSAWNFLKKQGVVTQECK 167

Query: 130 --TRPSCDASKG------HTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIY 181
             T P+C  ++       +TP CV++C+ N  + Y +D +  AK YS++S E +IM+EI 
Sbjct: 168 PYTIPTCPPAQQPCLNFVNTPNCVKQCESNSTLIYSQDKHKMAKIYSINSVE-AIMQEIS 226

Query: 182 EHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLI 241
            +GPVE                                              F+V++D +
Sbjct: 227 TNGPVEAC--------------------------------------------FSVYEDFL 242

Query: 242 LYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE 294
            YKSG       K LGGH ++I G+G    +   YW +ANSW T WGDNG+F I RG DE
Sbjct: 243 GYKSGVYQHTTGKFLGGHCVKIFGYGT--LNGVNYWSVANSWTTSWGDNGIFLIKRGSDE 300

Query: 295 CGIESSITAGVP 306
           CGIE  + AG+P
Sbjct: 301 CGIEDEVVAGIP 312


>gi|17559066|ref|NP_506790.1| Protein CPR-3 [Caenorhabditis elegans]
 gi|1169083|sp|P43507.1|CPR3_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 3; AltName:
           Full=Cysteine protease-related 3; Flags: Precursor
 gi|675494|gb|AAA98788.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|675496|gb|AAA98782.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|14530554|emb|CAB61032.2| Protein CPR-3 [Caenorhabditis elegans]
          Length = 370

 Score =  123 bits (309), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 88/290 (30%), Positives = 124/290 (42%), Gaps = 100/290 (34%)

Query: 80  VDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHV--NGTR------ 131
           V E LP  FD+R KWP+C TI+ IR+Q +CGSCW     E+      +  NGT+      
Sbjct: 88  VPEPLPDTFDAREKWPDCNTIKLIRNQATCGSCWAFGAAEVISDRVCIQSNGTQQPVISV 147

Query: 132 ----PSCDASKGHTPK----------------------------------CVREC----- 148
                 C  + G+  K                                  C + C     
Sbjct: 148 EDILSCCGTTCGYGCKGGYSIEALRFWASSGAVTGGDYGGHGCMPYSFAPCTKNCPESTT 207

Query: 149 -------QENYDVPYKK-DLNFGAKSYSVSSNEK--SIMKEIYEHGPVEGAFTVFDDLIL 198
                  Q +Y     K D ++GA +Y V++ +    I  EIY +GPVE ++ V++D   
Sbjct: 208 PSCKTTCQSSYKTEEYKKDKHYGASAYKVTTTKSVTEIQTEIYHYGPVEASYKVYEDFYH 267

Query: 199 YKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILG 258
           YKSG +                                      Y SGK +GGHA++I+G
Sbjct: 268 YKSGVYH-------------------------------------YTSGKLVGGHAVKIIG 290

Query: 259 WGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           WG +  +   YWLIANSW T +G+ G FKI RG +EC IE ++ AG+ KL
Sbjct: 291 WGVE--NGVDYWLIANSWGTSFGEKGFFKIRRGTNECQIEGNVVAGIAKL 338



 Score = 40.0 bits (92), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 16/29 (55%), Positives = 20/29 (68%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYG 37
           CG+GC GG+   A R+W  SG V+GG YG
Sbjct: 158 CGYGCKGGYSIEALRFWASSGAVTGGDYG 186


>gi|187104114|ref|NP_001119617.1| cathepsin B-16A precursor [Acyrthosiphon pisum]
 gi|161343835|tpg|DAA06098.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 340

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 70/193 (36%), Positives = 91/193 (47%), Gaps = 40/193 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY + PC     G             +C R C  N D+ Y  D  F    Y ++   
Sbjct: 185 GCEPYRVPPCPQDEEGKSSCAGKPIEKNHRCTRMCYGNQDLDYNDDHRFTRDYYYLTYG- 243

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            SI K++  +GP+E +F V+DD   YKSG +    N T                      
Sbjct: 244 -SIQKDVMNYGPIEASFDVYDDFPSYKSGVYQRTPNAT---------------------- 280

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                          LGGHA++++GWG +E +   YWL+ NSWN  WGDNGLFKI RG D
Sbjct: 281 --------------KLGGHAVKLIGWGVEEGT--PYWLMVNSWNAQWGDNGLFKIRRGTD 324

Query: 294 ECGIESSITAGVP 306
           ECGI+S+ TAGVP
Sbjct: 325 ECGIDSAATAGVP 337



 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 18/31 (58%), Positives = 23/31 (74%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           +P  FD+R +W +C TI E+RDQG CGSCW 
Sbjct: 88  IPRTFDARRRWRHCKTIGEVRDQGHCGSCWA 118



 Score = 46.2 bits (108), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 19/32 (59%), Positives = 24/32 (75%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CGFGCNGG+P  AW+Y+   GIV+GG Y S +
Sbjct: 153 CGFGCNGGYPIKAWKYFSSHGIVTGGNYKSGE 184


>gi|358331547|dbj|GAA35870.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 508

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 77/197 (39%), Positives = 94/197 (47%), Gaps = 54/197 (27%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCR Y    CEHHV G  P C      TP+CV++C +  DV Y +D      SY++ ++E
Sbjct: 183 GCRSYPFPKCEHHVQGHYPPCPRELYPTPECVQQC-DTPDVGYLEDKTRANMSYNIYASE 241

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            SIMKEI   GPVE                                              
Sbjct: 242 ISIMKEIMLRGPVEAI-------------------------------------------- 257

Query: 234 FTVFDDLILYKSG---KALG----GHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
           FT+++D + Y SG    ALG    GHA+RILGWGE       YWLIANSWN DWG+ G  
Sbjct: 258 FTMYEDFLRYSSGVYFHALGAPMSGHAVRILGWGE--LGNVPYWLIANSWNEDWGEEGYM 315

Query: 287 KILRGKDECGIESSITA 303
           K LRG +ECGIE  +TA
Sbjct: 316 KFLRGYNECGIEDDVTA 332



 Score = 54.3 bits (129), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 21/31 (67%), Positives = 24/31 (77%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           LP NFD+R  WP+C +I EIRDQ SCGSCW 
Sbjct: 86  LPKNFDARKTWPHCSSISEIRDQSSCGSCWA 116



 Score = 46.2 bits (108), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 17/27 (62%), Positives = 21/27 (77%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           CGFGC GG+P +AW YW   GIV+GG+
Sbjct: 151 CGFGCRGGYPAVAWDYWKTHGIVTGGS 177


>gi|342181301|emb|CCC90780.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 335

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 101/298 (33%), Positives = 127/298 (42%), Gaps = 41/298 (13%)

Query: 39  KQAEKNSLSNIPRAHLKSWMG--VHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
           K      + NI  A  +   G  +    +LP  R  E     ++  +LP +FDS  KWPN
Sbjct: 47  KAVYNGKMQNITFAEARRLTGARIQKTSSLPPVRFTE----EQLRTELPESFDSAEKWPN 102

Query: 97  CPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPY 156
           CPTIREI DQ +CGSCW           H   G       S  H   C  +C    D  Y
Sbjct: 103 CPTIREIADQSACGSCWAVSTASAISDRHCTVGGVQQLRISAAHLMSCCEDCGYGCDGGY 162

Query: 157 KKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETT---- 212
                  +  Y VS    S   + Y   P  G               F  P   TT    
Sbjct: 163 PGT----SWEYYVSHGLASSYCQPYPF-PHCGHHGGKGKKPPCSKYHFHTPKCNTTCTDK 217

Query: 213 AMSLIKWTIRDNTS-----------QLGAEGAFT----VFDDLILYK-------SGKALG 250
           A+ LIK+  R N S           +L   G F     V+ D + YK       SG  LG
Sbjct: 218 AIPLIKY--RGNHSYEVHGEDDYKRELYFNGPFVVVFWVYSDFLAYKTGVYRHVSGDFLG 275

Query: 251 GHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           GHA+RI+GWG+   +   YW IANSW+TDWG NG    LRG +ECGIE++  AG P +
Sbjct: 276 GHAVRIVGWGKLNGT--PYWKIANSWDTDWGMNGHLLFLRGNNECGIEAAGYAGSPAI 331



 Score = 38.9 bits (89), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 13/24 (54%), Positives = 19/24 (79%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVS 32
           CG+GC+GG+PG +W Y+V  G+ S
Sbjct: 154 CGYGCDGGYPGTSWEYYVSHGLAS 177


>gi|302764096|ref|XP_002965469.1| hypothetical protein SELMODRAFT_143272 [Selaginella moellendorffii]
 gi|300166283|gb|EFJ32889.1| hypothetical protein SELMODRAFT_143272 [Selaginella moellendorffii]
          Length = 331

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 84/284 (29%), Positives = 123/284 (43%), Gaps = 108/284 (38%)

Query: 83  DLPANFDSRTKWPNCPTIREIRDQGSCGSCW----------------------------- 113
           DLP +FD+R  WP C +I+ I DQG CGSCW                             
Sbjct: 87  DLPKHFDAREAWPQCASIKTILDQGHCGSCWAFGAVEALTDRFCILNNENVSLSENDLVA 146

Query: 114 -------GCR---PYE-----------IAPCEHHVNGT---RPSCDASKGHTPKCVRECQ 149
                  GC    PY             + C+ + +G     P C+     TP CV++C 
Sbjct: 147 CCSSCGFGCEGGYPYAAWEYFAQTGVVTSQCDPYFDGKGCKHPGCEPEY-DTPVCVKQCV 205

Query: 150 ENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGN 209
           +N    ++   +F  ++Y+V+S+   I  EIY++GPVE ++                   
Sbjct: 206 DNEQ--WRDSKHFTVQTYAVNSDIYDIQAEIYKNGPVEVSY------------------- 244

Query: 210 ETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGED 262
                                    TV++D   YKSG       + LGGHA++ +GWG  
Sbjct: 245 -------------------------TVYEDFAHYKSGVYKHVFGQVLGGHAVKFIGWGTT 279

Query: 263 EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           +  K+ YW++ANSWN  WG++G F+I RG +ECGIES   AG+P
Sbjct: 280 DDGKD-YWIVANSWNRSWGEDGFFQISRGSNECGIESEPVAGIP 322



 Score = 38.5 bits (88), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 13/24 (54%), Positives = 19/24 (79%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVS 32
           CGFGC GG+P  AW Y+ ++G+V+
Sbjct: 151 CGFGCEGGYPYAAWEYFAQTGVVT 174


>gi|401415968|ref|XP_003872479.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
           [Leishmania mexicana MHOM/GT/2001/U1103]
 gi|322488703|emb|CBZ23950.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
           [Leishmania mexicana MHOM/GT/2001/U1103]
          Length = 340

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 96/297 (32%), Positives = 122/297 (41%), Gaps = 107/297 (36%)

Query: 72  PELIGYSEVDEDLPANFDSRTKWPNCPTI---REIRDQGS-------------------- 108
           P      E+ +DLP +FD+  KWP C TI   R+  + GS                    
Sbjct: 86  PRNFSVEEMQQDLPESFDASEKWPMCVTIGEIRDQSNCGSCWAIAAVEAMSDRYCTMSGI 145

Query: 109 ----------------CG-SCWG-------------------CRPYEIAPCEHHVNGTR- 131
                           CG  C+G                   C+PY   PC HH N ++ 
Sbjct: 146 PDRRISTTNLLSCCFICGFGCYGGIPAMAWLWWVWVGVTTELCQPYPFGPCSHHGNSSKY 205

Query: 132 PSCDASKGHTPKCVRECQ--ENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGA 189
           P C  +  +TPKC   C   E   V YK     G  SYS+   E+ +M E+  +GP+E A
Sbjct: 206 PPCPNTIYNTPKCNTTCDNVEMELVKYK-----GVSSYSIK-GERELMVELMNNGPLEVA 259

Query: 190 FTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKAL 249
             V+ D + YKSG                                     +  + SG  L
Sbjct: 260 MQVYADFVAYKSG-------------------------------------VYKHVSGDHL 282

Query: 250 GGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           GGHA++++GWG   K    YW IANSWNTDWGD G F I RG DECGIESS  AG P
Sbjct: 283 GGHAVKLVGWGV--KDGIPYWKIANSWNTDWGDKGYFLIQRGNDECGIESSGVAGKP 337



 Score = 39.3 bits (90), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 14/25 (56%), Positives = 18/25 (72%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVS 32
           +CGFGC GG P MAW +WV  G+ +
Sbjct: 161 ICGFGCYGGIPAMAWLWWVWVGVTT 185


>gi|297814171|ref|XP_002874969.1| hypothetical protein ARALYDRAFT_490415 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297320806|gb|EFH51228.1| hypothetical protein ARALYDRAFT_490415 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 359

 Score =  122 bits (307), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 96/335 (28%), Positives = 137/335 (40%), Gaps = 118/335 (35%)

Query: 37  GSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDED----LPANFDSRT 92
           G K A  +  SN   A  K  +GV P            +G   V  D    LP  FD+RT
Sbjct: 58  GWKAAINDRFSNATVAEFKRLLGVKP------TPKKHFLGVPVVSHDPSLKLPKAFDART 111

Query: 93  KWPNCPTIREIRDQGSCGSCW-------------------------------------GC 115
            WP C +I +I DQG CGSCW                                     GC
Sbjct: 112 AWPQCTSIGKILDQGHCGSCWAFGAVESLSDRFCIQFGMNISLSVNDLLACCGFRCGDGC 171

Query: 116 RP-YEIAP-------------CEHHVNGT---RPSCDASKGHTPKCVRECQENYDVPYKK 158
              Y IA              C+ + + T    P C+ +   TP+C+R+C  +  + + +
Sbjct: 172 DGGYPIAAWQYFSYSGVVTEECDPYFDNTGCSHPGCEPAY-PTPRCLRKCVSDNKL-WSE 229

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
             ++   +Y+V+S+ + IM E+Y++GPVE +                             
Sbjct: 230 SKHYSVSTYTVNSSPQDIMAEVYKNGPVEVS----------------------------- 260

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWL 271
                          FTV++D   YKSG         +GGHA++++GWG   +  E YWL
Sbjct: 261 ---------------FTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTSNEG-EDYWL 304

Query: 272 IANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           +AN WN  WGD+G F I RG +ECGIE    AG+P
Sbjct: 305 MANQWNRGWGDDGYFMIRRGTNECGIEDEPVAGLP 339


>gi|256090674|ref|XP_002581308.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 250

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 71/191 (37%), Positives = 91/191 (47%), Gaps = 39/191 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY    C+H  + + P C       P C + C+  Y +PYK D ++G   YS+  NE
Sbjct: 93  GCLPYPFPKCDHRSSNSYPKCGYITYTAPPCTKTCRSGYPIPYKADKHYGRVIYSLRPNE 152

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
             I KEI  +GPVE    V  D + YKSG +                 R  T QL     
Sbjct: 153 SDIRKEIMMNGPVEAGIFVHSDFLNYKSGVY-----------------RHITGQL----- 190

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                          +  H++RI+GWG +  +   YWL ANSWN DWG NG FKILRG +
Sbjct: 191 ---------------VTIHSVRIIGWGIE--NDIPYWLCANSWNEDWGLNGYFKILRGSN 233

Query: 294 ECGIESSITAG 304
           EC IES + AG
Sbjct: 234 ECEIESFVNAG 244


>gi|168000937|ref|XP_001753172.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162695871|gb|EDQ82213.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 347

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 87/294 (29%), Positives = 122/294 (41%), Gaps = 92/294 (31%)

Query: 67  PANRLP---ELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW---------- 113
           PAN+L    E I +      LP  FD+R +W +CPTI +I  QG CGSCW          
Sbjct: 83  PANKLEPSIETISHKHKKLYLPKEFDARKQWSHCPTIGDILGQGHCGSCWAFGAVESLTD 142

Query: 114 ------------------GCRPYEIA-PCE-----------HHVNGTRPSCDASKGHTPK 143
                              C  +E    CE            H       CD        
Sbjct: 143 RFCIHLNESVSLSENDLLACCGFECGYGCEGGYPIRAWKYFKHSGVVTNKCDPYFDQKGC 202

Query: 144 CVRECQENYDVP-----------YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTV 192
               C   Y+ P           + +  + G  +Y +S   + +M E+Y +GPVE AF V
Sbjct: 203 AHPGCYPTYETPKCEKQCVDDEFWVQSKHLGVNAYEMSMEPEDLMAELYTNGPVEVAFEV 262

Query: 193 FDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGH 252
           ++D   YK+G                                 V+  L     G  +GGH
Sbjct: 263 YEDFAHYKTG---------------------------------VYKHLF----GGFMGGH 285

Query: 253 AIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           A++++GWG  +   + YW I NSWNT+WG++GLF+I+RG DECGIES+  AG+P
Sbjct: 286 AVKLIGWGTTDDGVD-YWTIVNSWNTNWGEDGLFRIVRGNDECGIESNAVAGLP 338


>gi|3929817|emb|CAA77181.1| cathepsin B [Mus musculus]
          Length = 194

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 69/161 (42%), Positives = 88/161 (54%), Gaps = 40/161 (24%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY I PCEHHVNG+RP     +G TP+C + C+  Y   YK+D +FG  SYSVS++ 
Sbjct: 74  GCLPYTIPPCEHHVNGSRPPMHG-EGDTPRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSV 132

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K IM EIY++GPVEGAFTVF D + YKSG +                             
Sbjct: 133 KEIMAEIYKNGPVEGAFTVFSDFLTYKSGVY----------------------------- 163

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIAN 274
                    +++G  +GGHAIRILGWG +  +   YWL AN
Sbjct: 164 --------KHEAGDMMGGHAIRILGWGVE--NGVPYWLAAN 194



 Score = 45.4 bits (106), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 19/30 (63%), Positives = 22/30 (73%)

Query: 9  CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
          CG GCNGG+P  AW +W K G+VSGG Y S
Sbjct: 42 CGDGCNGGYPSGAWNFWTKKGLVSGGVYDS 71


>gi|268572243|ref|XP_002648913.1| Hypothetical protein CBG17826 [Caenorhabditis briggsae]
          Length = 323

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 73/202 (36%), Positives = 96/202 (47%), Gaps = 63/202 (31%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY  APC         SC   K  TP C   CQ  Y   Y KD  FG  +Y+V+ N 
Sbjct: 178 GCRPYPFAPC--------ISCPEEK--TPTCSLSCQFGYSTAYAKDKRFGVSAYAVARNV 227

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            +I  EI  +GPV GA                                            
Sbjct: 228 AAIQTEIMTNGPVVGA-------------------------------------------- 243

Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
           FT+++D+  YKSG       + LGGHAI+I+GWG   ++   YWLIANSW  +WG+NG  
Sbjct: 244 FTMYEDMYKYKSGVYRHTAGRLLGGHAIKIIGWGT--QNGIPYWLIANSWGANWGENGFL 301

Query: 287 KILRGKDECGIESSITAGVPKL 308
           K+ RG +ECGIE ++ AG+P++
Sbjct: 302 KMRRGVNECGIERAVVAGMPRV 323


>gi|449489527|ref|XP_004158338.1| PREDICTED: cathepsin B-like [Cucumis sativus]
          Length = 349

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 85/275 (30%), Positives = 115/275 (41%), Gaps = 90/275 (32%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAP---CEHH-------VNGTRPS 133
           LP +FD+R  WP C +I  I DQG CGSCW     E      C H        VN     
Sbjct: 102 LPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLAC 161

Query: 134 C-----DASKGHTPKC-----VRE--------------------CQENYDVP-------- 155
           C     D   G  P       VR                     C+  Y  P        
Sbjct: 162 CGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTGCSHPGCEPAYPTPRCVRHCVD 221

Query: 156 ----YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNET 211
               ++K  ++G  +Y V  +   IM E+Y++GPVE +FTV++D   YKSG +       
Sbjct: 222 KNQIWRKTKHYGVSAYRVKRDPNDIMAEVYKNGPVEVSFTVYEDFAHYKSGVY------- 274

Query: 212 TAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWL 271
                                          + +G  +GGHA++++GWG  +   E YWL
Sbjct: 275 ------------------------------KHITGDVMGGHAVKLIGWGTTDDG-EDYWL 303

Query: 272 IANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           +AN WN  WGD+G FKI RG +ECGIE  + AG+P
Sbjct: 304 LANQWNRGWGDDGYFKIRRGTNECGIEEDVVAGLP 338



 Score = 39.7 bits (91), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 14/25 (56%), Positives = 21/25 (84%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVS 32
           +CG GC+GG+P  AWRY+V+ G+V+
Sbjct: 165 MCGDGCDGGYPISAWRYFVRHGVVT 189


>gi|353228456|emb|CCD74627.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 333

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 68/191 (35%), Positives = 91/191 (47%), Gaps = 39/191 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY    C+H  + + P C       P C + C+  Y +PYK D ++G   YS+  NE
Sbjct: 176 GCLPYPFPKCDHRSSNSYPKCGYITYTAPPCTKTCRSGYPIPYKADKHYGRVIYSLRPNE 235

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
             I KEI  +GPVE    V  D + YKSG +                             
Sbjct: 236 SDIRKEIMMNGPVEAGIFVHSDFLNYKSGVY----------------------------- 266

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    + +G+ +  H++RI+GWG +  +   YWL ANSWN DWG NG FKILRG +
Sbjct: 267 --------RHITGQLVTIHSVRIIGWGIE--NDIPYWLCANSWNEDWGLNGYFKILRGSN 316

Query: 294 ECGIESSITAG 304
           EC IES + AG
Sbjct: 317 ECEIESFVNAG 327


>gi|449446774|ref|XP_004141146.1| PREDICTED: cathepsin B-like [Cucumis sativus]
          Length = 348

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 85/275 (30%), Positives = 115/275 (41%), Gaps = 90/275 (32%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAP---CEHH-------VNGTRPS 133
           LP +FD+R  WP C +I  I DQG CGSCW     E      C H        VN     
Sbjct: 101 LPKSFDAREAWPQCISIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNITLSVNDLLAC 160

Query: 134 C-----DASKGHTPKC-----VRE--------------------CQENYDVP-------- 155
           C     D   G  P       VR                     C+  Y  P        
Sbjct: 161 CGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDTTGCSHPGCEPAYPTPRCVRHCVD 220

Query: 156 ----YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNET 211
               ++K  ++G  +Y V  +   IM E+Y++GPVE +FTV++D   YKSG +       
Sbjct: 221 KNQIWRKTKHYGVSAYRVKRDPNDIMAEVYKNGPVEVSFTVYEDFAHYKSGVY------- 273

Query: 212 TAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWL 271
                                          + +G  +GGHA++++GWG  +   E YWL
Sbjct: 274 ------------------------------KHITGDVMGGHAVKLIGWGTTDDG-EDYWL 302

Query: 272 IANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           +AN WN  WGD+G FKI RG +ECGIE  + AG+P
Sbjct: 303 LANQWNRGWGDDGYFKIRRGTNECGIEEDVVAGLP 337



 Score = 39.3 bits (90), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 14/25 (56%), Positives = 21/25 (84%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVS 32
           +CG GC+GG+P  AWRY+V+ G+V+
Sbjct: 164 MCGDGCDGGYPISAWRYFVRHGVVT 188


>gi|170028916|ref|XP_001842340.1| cathepsin B [Culex quinquefasciatus]
 gi|167879390|gb|EDS42773.1| cathepsin B [Culex quinquefasciatus]
          Length = 339

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 71/195 (36%), Positives = 96/195 (49%), Gaps = 44/195 (22%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY   PC +   G    C   K   PKC+  C   YD  Y+KD  FGA +Y + ++ 
Sbjct: 189 GCKPYPFEPCSYPFVG----CHHEK-KNPKCLHHCINGYDRKYRKDKFFGATAYKIPNDA 243

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           + I  EI  +GPV   F VF+D   Y SG                               
Sbjct: 244 RMIQLEIMTNGPVATGFEVFEDFYFYHSG------------------------------- 272

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
             V+  ++    GK +G HAIRI+GWG +  +   YWLIANS+   WGD G FK+LRG +
Sbjct: 273 --VYKHVV----GKKVGMHAIRIVGWGTENGTP--YWLIANSYGDTWGDKGFFKMLRGSN 324

Query: 294 ECGIESSITAGVPKL 308
             GIES++ AG+P+L
Sbjct: 325 HLGIESTVIAGLPQL 339



 Score = 41.6 bits (96), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 20/36 (55%), Positives = 25/36 (69%), Gaps = 1/36 (2%)

Query: 9   CGFGCNGGF-PGMAWRYWVKSGIVSGGAYGSKQAEK 43
           CG GCNGGF  G A++YWV +G+VSG  Y S +  K
Sbjct: 156 CGNGCNGGFLDGTAFQYWVDAGLVSGAPYNSSEGCK 191


>gi|320166129|gb|EFW43028.1| cathepsin B [Capsaspora owczarzaki ATCC 30864]
          Length = 332

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 73/203 (35%), Positives = 92/203 (45%), Gaps = 57/203 (28%)

Query: 115 CRPYEIAPCEHH-VNGTRPSCDASKGHTPKCVRECQENYDV-PYKKDLNFGAKSYSVSSN 172
           C+PY+   C HH  +   P C ++   TPKC + C   Y    Y  DL++G  SYSV   
Sbjct: 177 CKPYDFPACAHHEASPDYPDCPSTDYSTPKCTKSCVAGYTANTYTADLHYGQSSYSVGRT 236

Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
           + +I  EI  HGPVE A                                           
Sbjct: 237 DAAIQTEILNHGPVEAA------------------------------------------- 253

Query: 233 AFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
            FTV+ D   Y+SG         LGGHAI I+GWG +  S   YWL+ NSWN  WGD G 
Sbjct: 254 -FTVYSDFPTYRSGVYKHTSGSVLGGHAISIVGWGTESGSP--YWLVKNSWNPSWGDGGF 310

Query: 286 FKILRGKDECGIESSITAGVPKL 308
           FKILRG  +CGI + +  G+PKL
Sbjct: 311 FKILRG--DCGINNDVVGGLPKL 331



 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 25/46 (54%), Positives = 32/46 (69%), Gaps = 2/46 (4%)

Query: 69  NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
            RLP  +  + + E +P  FDSRT WP CPTI+E+RDQ +CGSCW 
Sbjct: 66  QRLP--LKVAPIAEAIPDTFDSRTNWPACPTIKEVRDQSACGSCWA 109


>gi|302823081|ref|XP_002993195.1| hypothetical protein SELMODRAFT_270024 [Selaginella moellendorffii]
 gi|300138965|gb|EFJ05715.1| hypothetical protein SELMODRAFT_270024 [Selaginella moellendorffii]
          Length = 342

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 84/284 (29%), Positives = 123/284 (43%), Gaps = 108/284 (38%)

Query: 83  DLPANFDSRTKWPNCPTIREIRDQGSCGSCW----------------------------- 113
           DLP +FD+R  WP C +I+ I DQG CGSCW                             
Sbjct: 98  DLPKHFDAREAWPQCSSIKNILDQGHCGSCWAFGAVEALTDRFCILNNENVSLSENDLVA 157

Query: 114 -------GCR---PYE-----------IAPCEHHVNGT---RPSCDASKGHTPKCVRECQ 149
                  GC    PY             + C+ + +G     P C+     TP CV++C 
Sbjct: 158 CCSSCGFGCDGGYPYAAWEYFAQTGVVTSQCDPYFDGKGCKHPGCEPEY-DTPVCVKQCV 216

Query: 150 ENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGN 209
           +N    ++   +F  ++Y+V+S+   I  EIY++GPVE ++                   
Sbjct: 217 DNEQ--WRDSKHFTVQTYAVNSDIYDIQAEIYKNGPVEVSY------------------- 255

Query: 210 ETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGED 262
                                    TV++D   YKSG       + LGGHA++ +GWG  
Sbjct: 256 -------------------------TVYEDFAHYKSGVYKHVFGEVLGGHAVKFIGWGTT 290

Query: 263 EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           +  K+ YW++ANSWN  WG++G F+I RG +ECGIES   AG+P
Sbjct: 291 DDGKD-YWIVANSWNRSWGEDGFFQISRGSNECGIESEPVAGIP 333



 Score = 38.9 bits (89), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 13/24 (54%), Positives = 20/24 (83%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVS 32
           CGFGC+GG+P  AW Y+ ++G+V+
Sbjct: 162 CGFGCDGGYPYAAWEYFAQTGVVT 185


>gi|204022090|dbj|BAG71142.1| cathepsin B-N3 [Tuberaphis styraci]
          Length = 334

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 89/331 (26%), Positives = 136/331 (41%), Gaps = 114/331 (34%)

Query: 50  PRAHLKSWMGVHPDYNLPANRLPELIGYSEVDE-------DLPANFDSRTKWPNCPTIRE 102
           P+  + S++ +     + A +   L+ +   DE        +P++FD+R KW  C TI E
Sbjct: 44  PKLSIDSFVKLLGSKGVQAAKQASLVMFKTHDEAYNSWSNRIPSSFDARKKWRKCSTIGE 103

Query: 103 IRDQGSCGSCWG----------------------CRPYEIAPCEH--------------- 125
           +RDQG+CGSCW                         P E+A C H               
Sbjct: 104 VRDQGNCGSCWAFGTSSAFADRLCIATDGEFNELLSPEELAFCCHKCGFGCSGGYPIRAW 163

Query: 126 -----HVNGTRPSCDASKGHTP-------------------------KCVRECQENYDVP 155
                H   T  + D+ +G  P                         +C + C  N ++ 
Sbjct: 164 ERFKKHGLVTGGNYDSGEGCQPYKVPPCPLDEYGNNTCSGKPAEKNHRCTQMCYGNQNLD 223

Query: 156 YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMS 215
           +K+D ++   +Y ++    +I  ++  +GP+E +F V+DD   YKSG +    N T    
Sbjct: 224 FKEDHHYTRDAYYLTYG--TIQNDVLAYGPIEASFEVYDDFPSYKSGVYTKMENATY--- 278

Query: 216 LIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANS 275
                                            LGGHA++++GWGE+      YWL+ NS
Sbjct: 279 ---------------------------------LGGHAVKLIGWGEEYGV--PYWLLVNS 303

Query: 276 WNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           WN  WGD GLFKI RG +ECG ++S T GVP
Sbjct: 304 WNDQWGDQGLFKIRRGTNECGTDNSTTGGVP 334



 Score = 40.4 bits (93), Expect = 0.94,   Method: Compositional matrix adjust.
 Identities = 17/32 (53%), Positives = 23/32 (71%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CGFGC+GG+P  AW  + K G+V+GG Y S +
Sbjct: 150 CGFGCSGGYPIRAWERFKKHGLVTGGNYDSGE 181


>gi|300122171|emb|CBK22745.2| unnamed protein product [Blastocystis hominis]
          Length = 319

 Score =  121 bits (304), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 63/191 (32%), Positives = 91/191 (47%), Gaps = 39/191 (20%)

Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEK 174
           C+ Y   PC H + G  P C       PKC   CQE Y + Y+KD    +  Y + +N  
Sbjct: 167 CKSYPFPPCSHGIEGQYPQCSTKPPVVPKCETTCQEGYPIEYEKDRYKFSNVYQLENNVD 226

Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
            I  EI E+GPV+ +F V++D + YKSG +                              
Sbjct: 227 QIKNEIMENGPVDASFQVYEDFMTYKSGIYH----------------------------- 257

Query: 235 TVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE 294
                   +  GK +  H ++I+GWGE+  + E YW   NSWN++WG+NGLF+I  G +E
Sbjct: 258 --------HVEGKFMNLHTVKIIGWGEE--NGEAYWKAVNSWNSEWGENGLFRIRLGTNE 307

Query: 295 CGIESSITAGV 305
           C IES +  G+
Sbjct: 308 CTIESQVEGGL 318



 Score = 43.9 bits (102), Expect = 0.086,   Method: Compositional matrix adjust.
 Identities = 16/32 (50%), Positives = 22/32 (68%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CGF C GG+  MAW Y  ++G+V+GG Y S +
Sbjct: 134 CGFQCQGGYSAMAWEYLRRTGVVTGGQYNSTE 165


>gi|6562772|emb|CAB62590.1| putative cathepsin B-like protease [Pisum sativum]
          Length = 174

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 69/185 (37%), Positives = 101/185 (54%), Gaps = 40/185 (21%)

Query: 119 EIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
           E  P    +  + P C+     TPKCVR+C +   V +KK  ++  K Y V+S+ ++IM+
Sbjct: 22  ECDPYFDQIGCSHPGCEPGY-QTPKCVRKCVKGNQV-WKKSKHYSVKPYKVNSDPQNIME 79

Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
           E+Y++GPVE AF+V++D   YKSG +                                  
Sbjct: 80  EVYKNGPVEVAFSVYEDFAHYKSGVY---------------------------------- 105

Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
               + +G ALGGHA+++ GWG  ++  E YWL+AN WNT+WGD+G FKI RG +ECGIE
Sbjct: 106 ---KHITGSALGGHAVKLNGWGTSDEG-EDYWLLANQWNTNWGDDGYFKIKRGTNECGIE 161

Query: 299 SSITA 303
             +TA
Sbjct: 162 EDVTA 166


>gi|168020784|ref|XP_001762922.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162685734|gb|EDQ72127.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 345

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 95/308 (30%), Positives = 131/308 (42%), Gaps = 95/308 (30%)

Query: 53  HLKSWMGVHPDYNLPANRLP---ELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSC 109
           HLK   G       PAN +    E + +   + DLP  FD+R  W +C TI +I DQG C
Sbjct: 70  HLKKMCGAK---MTPANEVEPSIERVTHKHKNLDLPTEFDARKHWSHCSTIGDILDQGHC 126

Query: 110 GSCWGCRPYEIAP---CEH-------HVNGTRPSC-----DASKGHTP------------ 142
           GSCW     E      C H         N     C     D  +G  P            
Sbjct: 127 GSCWAFGAVESLTDRFCIHLNESVSLSENDLLACCGFECGDGCEGGYPIRAWQYFKRTGV 186

Query: 143 ---KC----------VRECQENYDVP--YKKDLN---------FGAKSYSVSSNEKSIMK 178
              KC             C   YD P  +K+ ++          G  +Y VS   + +M 
Sbjct: 187 VTSKCDPYFDQKGCGHPGCYPTYDTPKCFKRCVDDELWVSSKHLGVSAYEVSMEPEELMA 246

Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
           E++ +GP+E AF VF+D   YK+G                                 V+ 
Sbjct: 247 ELFTNGPIEVAFDVFEDFAHYKTG---------------------------------VYK 273

Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
            L     G  +GGHA++++GWG  +   + YW + NSWNT+WG++G F+ILRGKDECGIE
Sbjct: 274 HLY----GGYIGGHAVKLVGWGTTDDGVD-YWSMVNSWNTNWGEDGTFRILRGKDECGIE 328

Query: 299 SSITAGVP 306
           S+  AG+P
Sbjct: 329 SNAVAGLP 336


>gi|239788404|dbj|BAH70886.1| ACYPI000014 [Acyrthosiphon pisum]
          Length = 335

 Score =  120 bits (302), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 97/316 (30%), Positives = 138/316 (43%), Gaps = 67/316 (21%)

Query: 40  QAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPT 99
           +A++N   N P+  +   +G      +  + + E       + ++P  FDSR +W  C T
Sbjct: 40  KAKQNFPENTPKEQIVRLLGSKRLLGVSKSPIKENDELYMDNSEVPEFFDSRLEWDYCET 99

Query: 100 IREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPS--CDASKGH-------------TPKC 144
           I  +R+QG+CGSCW           H   G      C A+ G                +C
Sbjct: 100 IGHVRNQGNCGSCWA----------HGTTGAFADRLCVATNGEFNELISAEELTFCCHRC 149

Query: 145 VRECQENYDVP----YKK---------DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFT 191
           V  C   Y +     +K+         D   G + Y V       +K+   H    G  T
Sbjct: 150 VFGCNGGYPLKAWQYFKRHGVVTGGDYDTTDGCQPYRVPP----CVKDDEGHNSCSGQPT 205

Query: 192 ----------VFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLG-AEGAFTVFDDL 240
                       DD I YK   +        A  L   T++ +T   G  E +F V+DD 
Sbjct: 206 ERNHKCSKKCYGDDTIDYKKNHY----KTKDAYYLKNTTMQKDTMVYGPIEASFDVYDDF 261

Query: 241 ILYKSG--------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
           + Y+SG          LGGHA++++GWG +E +   YWL+ NSW   WGD G+FKILRG 
Sbjct: 262 MNYESGVYQRTGNASYLGGHAVKMIGWGVEEGTP--YWLMVNSWGEQWGDKGMFKILRGT 319

Query: 293 DECGIESSITAGVPKL 308
           DECGIESS TAGVP +
Sbjct: 320 DECGIESSCTAGVPSV 335



 Score = 42.7 bits (99), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 16/30 (53%), Positives = 23/30 (76%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           C FGCNGG+P  AW+Y+ + G+V+GG Y +
Sbjct: 149 CVFGCNGGYPLKAWQYFKRHGVVTGGDYDT 178


>gi|327239610|gb|AEA39649.1| cathepsin B [Epinephelus coioides]
          Length = 171

 Score =  120 bits (302), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 52/91 (57%), Positives = 67/91 (73%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNGTRP C    G TP+C+ +C+  Y   YK D ++G  SYSV S+E
Sbjct: 72  GCRPYTIPPCEHHVNGTRPPCTGEGGDTPQCILQCESGYTPSYKADKHYGKSSYSVPSDE 131

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRF 204
           + I  EIY++GPVEGAFTV++D +LYK+G +
Sbjct: 132 EQIQSEIYKNGPVEGAFTVYEDFLLYKTGVY 162


>gi|115605092|gb|ABJ15785.1| cathepsin B [Bos taurus]
          Length = 118

 Score =  120 bits (302), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 55/91 (60%), Positives = 69/91 (75%), Gaps = 1/91 (1%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D +FG  SYSV++NE
Sbjct: 29  GCRPYSIPPCEHHVNGSRPPCTG-EGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVANNE 87

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRF 204
           K IM EIY++GPVEGAF+V+ D +LYKSG +
Sbjct: 88  KEIMAEIYKNGPVEGAFSVYSDFLLYKSGVY 118



 Score = 41.6 bits (96), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 17/26 (65%), Positives = 19/26 (73%)

Query: 13 CNGGFPGMAWRYWVKSGIVSGGAYGS 38
          CNGGFP  AW +W K G+VSGG Y S
Sbjct: 1  CNGGFPSGAWNFWTKKGLVSGGLYNS 26


>gi|268561866|ref|XP_002638438.1| Hypothetical protein CBG18654 [Caenorhabditis briggsae]
          Length = 396

 Score =  120 bits (301), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 91/283 (32%), Positives = 124/283 (43%), Gaps = 98/283 (34%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHV--NGTRP--------- 132
           LP  FDSR +WPNC +I+ IRDQ  CGSCW     EI      +  NGT+          
Sbjct: 85  LPTAFDSRVQWPNCNSIKLIRDQTYCGSCWAFAAAEIISDRICIQSNGTQQPIISPEDIL 144

Query: 133 SCDASK-------GHTPKCVR---------------------------ECQENYDVPYKK 158
           SC  S        G+T + ++                            C+E  D P  K
Sbjct: 145 SCCGSSCNNGCQGGYTIEAMKYWMNSGVVTGGDYQGAGCIPYSFRPCSTCKEPKDAPSCK 204

Query: 159 ---DLNFGAKS-----YSVSSNE------KSIMKEIYEHGPVEGAFTVFDDLILYKSGRF 204
                ++ AKS      + SSN       + I  EIY +GPVE A+ V+DD   YKSG +
Sbjct: 205 TTCQASYKAKSAYRLPTTTSSNAIVANAVQMIQTEIYNNGPVEVAYQVYDDFYHYKSGVY 264

Query: 205 FVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEK 264
           +                                     +  G    GHA++I+GWG ++K
Sbjct: 265 Y-------------------------------------HVYGDKPSGHAVKIIGWGTEKK 287

Query: 265 SKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPK 307
               YWL+ANSW+T +G+NG FKI RG +ECGIE ++ AG+PK
Sbjct: 288 V--DYWLVANSWSTTFGENGFFKIRRGTNECGIEENVVAGLPK 328


>gi|358341867|dbj|GAA49438.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 952

 Score =  120 bits (301), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 70/190 (36%), Positives = 93/190 (48%), Gaps = 40/190 (21%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCR +    C H   G  P C      TP+C+++C E  +V Y+KD      SY+V  ++
Sbjct: 148 GCRSFPFPKCGHRRKGRYPPCPRHIYPTPECIKQCDEP-EVNYEKDKTRANISYNVYPSD 206

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            SIMKEI  +GPVE +F ++ D + Y  G +F                            
Sbjct: 207 ISIMKEIMLNGPVEASFGIYADFLEYNGGVYF---------------------------- 238

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                    +  G  +  HAIRILGWGED+     YWLIANSWN DWG+ G  + LRG +
Sbjct: 239 ---------HCWGGPISRHAIRILGWGEDDGVP--YWLIANSWNEDWGEKGYVRFLRGHN 287

Query: 294 ECGIESSITA 303
           ECGIE  +TA
Sbjct: 288 ECGIEEEVTA 297



 Score =  104 bits (260), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 91/305 (29%), Positives = 122/305 (40%), Gaps = 66/305 (21%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPA 68
           CG GC GG+  +AW +W   GIV+G   GSK+      S    +      G +P      
Sbjct: 704 CGCGCRGGYSPIAWDFWKTHGIVTG---GSKEKPTGCRSYPFPSCEHRGKGQYPPCPHQL 760

Query: 69  NRLPELIGYSEVDE-----DLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPC 123
              PE I   +  E     D    FDS +        R      + G     R   +  C
Sbjct: 761 YPTPECIKRCDTKEIDYEKDKTRGFDSASS--EQLADRHCFHTSNFGEASAQRTLHLT-C 817

Query: 124 EHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEH 183
              +N    S D       K V     N              SY+V   E+++MKEI   
Sbjct: 818 ---LNFMHHSIDLLSSRLEKAVLRSTANI-------------SYNVYPAEQAVMKEIMLR 861

Query: 184 GPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILY 243
           GPV     V++DL+ YKSG +F                                     +
Sbjct: 862 GPVGAILHVYEDLLDYKSGVYF-------------------------------------H 884

Query: 244 KSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITA 303
             G  LG H IRILGWGE++     YWL+ANSWN DWG+ G  ++LR ++ECGI   +TA
Sbjct: 885 VWGGHLGEHGIRILGWGEEDGV--PYWLVANSWNEDWGEKGYMRVLRWRNECGIVDQVTA 942

Query: 304 GVPKL 308
           G+P L
Sbjct: 943 GLPDL 947



 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 21/34 (61%), Positives = 27/34 (79%)

Query: 81  DEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           ++ LP +FD+R  WP+CP+I EIRDQ SCGSCW 
Sbjct: 636 NQHLPESFDARANWPHCPSISEIRDQSSCGSCWA 669



 Score = 53.9 bits (128), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 23/45 (51%), Positives = 31/45 (68%)

Query: 70  RLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           R P +      +++LP +FD+RTKWP+CP+I EIRDQ SC S W 
Sbjct: 37  RRPTVKHEVSDEKELPKSFDARTKWPHCPSISEIRDQSSCESFWA 81



 Score = 39.3 bits (90), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 15/27 (55%), Positives = 18/27 (66%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           CG GC  GF  MAW +W   GIV+GG+
Sbjct: 116 CGLGCGAGFHPMAWDFWKTHGIVTGGS 142


>gi|204022088|dbj|BAG71141.1| cathepsin B-N2 [Tuberaphis styraci]
          Length = 334

 Score =  120 bits (301), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 98/363 (26%), Positives = 149/363 (41%), Gaps = 133/363 (36%)

Query: 36  YGSKQA---EKNSLSNIPRAHLKSW-MGVHPDYNLPANRLPELIG--------------- 76
           Y ++QA   E++ ++ I  A+ K+W  GV+ D  L  +   +L+G               
Sbjct: 13  YRTEQAYFLEEDYINQI-NANAKTWKAGVNFDPKLSIDSFVKLLGSKGVQAAKQASPVMF 71

Query: 77  ------YSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG---------------- 114
                 Y+     +P++FD+R KW  C TI E+RDQG+CGSCW                 
Sbjct: 72  KTHDEAYNSWSNRIPSSFDARKKWRKCSTIGEVRDQGNCGSCWAFGTSSAFADRLCIATD 131

Query: 115 ------CRPYEIAPCEH--------------------HVNGTRPSCDASKGHTP------ 142
                   P E+A C H                    H   T  + D+ +G  P      
Sbjct: 132 GEFNELLSPEELAFCCHKCGFGCSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYKVSPC 191

Query: 143 -------------------KCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEH 183
                              +C + C  N ++ +K+D ++   +Y ++    +I  ++  +
Sbjct: 192 PLDEYGNNTCSGKPAEKNHRCTQMCYGNQNLDFKEDHHYTRDAYYLTYG--TIQNDVLAY 249

Query: 184 GPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILY 243
           GP+E +F V+DD   YKSG +    N T                                
Sbjct: 250 GPIEASFEVYDDFPSYKSGVYTKMENATY------------------------------- 278

Query: 244 KSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITA 303
                LGGHA++++GWGE+      YWL+ NSWN  WGD GLFKI RG +ECG ++S T 
Sbjct: 279 -----LGGHAVKLIGWGEEYGV--PYWLLVNSWNDQWGDQGLFKIRRGTNECGTDNSTTG 331

Query: 304 GVP 306
           GVP
Sbjct: 332 GVP 334



 Score = 40.4 bits (93), Expect = 0.92,   Method: Compositional matrix adjust.
 Identities = 17/32 (53%), Positives = 23/32 (71%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CGFGC+GG+P  AW  + K G+V+GG Y S +
Sbjct: 150 CGFGCSGGYPIRAWERFKKHGLVTGGNYDSGE 181


>gi|312283137|dbj|BAJ34434.1| unnamed protein product [Thellungiella halophila]
          Length = 362

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 97/335 (28%), Positives = 134/335 (40%), Gaps = 118/335 (35%)

Query: 37  GSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDED----LPANFDSRT 92
           G K A  +  SN   A  K  +GV P            +G   V  D    LP  FD+RT
Sbjct: 61  GWKAAINDRFSNATVAEFKRLLGVKP------TPKKHFLGVPIVSHDRSLKLPKEFDART 114

Query: 93  KWPNCPTIREIRDQGSCGS-------------------------------CWGCR----- 116
            WP C +I  I DQG CGS                               C G R     
Sbjct: 115 AWPQCTSIGNILDQGHCGSCWAFGAVESLSDRFCIEFGMNISLSVNDLLACCGFRCGDGC 174

Query: 117 --PYEIAP-------------CEHHVNGT---RPSCDASKGHTPKCVRECQENYDVPYKK 158
              Y IA              C+ + + T    P C+ +   TPKC+R+C     + + +
Sbjct: 175 DGGYPIAAWQYFSYSGVVTEECDPYFDDTGCSHPGCEPAY-PTPKCMRKCVSGNQL-WSQ 232

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
             ++   +Y+V SN + IM E+Y++GPVE +                             
Sbjct: 233 SKHYSVSTYTVKSNPQDIMAEVYKNGPVEVS----------------------------- 263

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWL 271
                          FTV++D   YKSG         +GGHA++++GWG  ++  E YWL
Sbjct: 264 ---------------FTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTTDEG-EDYWL 307

Query: 272 IANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           +AN WN  WGD+G F I RG +ECGIE    AG+P
Sbjct: 308 LANQWNRSWGDDGYFMIRRGTNECGIEDEPVAGLP 342


>gi|260782761|ref|XP_002586451.1| hypothetical protein BRAFLDRAFT_247264 [Branchiostoma floridae]
 gi|229271561|gb|EEN42462.1| hypothetical protein BRAFLDRAFT_247264 [Branchiostoma floridae]
          Length = 272

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 96/299 (32%), Positives = 128/299 (42%), Gaps = 60/299 (20%)

Query: 40  QAEKNSLSNIPRAHLKSWMGVHPD----YNLPANRLPELIGYSEVDEDLPANFDSRTKWP 95
           QA  N       + LK   G   D     NLP  +      +   D ++P +FD+R +W 
Sbjct: 1   QAGWNDFGEASMSDLKVLCGTILDDPDLLNLPVKQ------HDLTDMEIPKSFDARMEWS 54

Query: 96  NCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHT------------PK 143
            C    +I DQG CGSCW     E+         +   C  ++G T             K
Sbjct: 55  TCVRSHKIHDQGHCGSCWAFASTEVL--------SDRLCIQTRGSTNIILSSEDLLSCDK 106

Query: 144 CVRECQENYDVP-----YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEG-AFTVFDDLI 197
             R C +   +       +K      +    +S     + E       EG A+  F  L 
Sbjct: 107 AGRGCSDGGRLSEAWRYMQKKGVVANRCKPYTSGATGFIPECMSKCTGEGHAYQKFYGLY 166

Query: 198 LYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALG 250
           LY            +  + IK  I  N      E AFTV+ D++ YKSG         LG
Sbjct: 167 LYT----------VSGENQIKVEIMTNGP---VEAAFTVYSDIVHYKSGVYHHTSGGKLG 213

Query: 251 GHAIRILGWG-EDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           GHA+++LGWG EDE   E+YWL+ANSW  DWGD G FKI RG DECGIES +  G  +L
Sbjct: 214 GHAVKVLGWGVEDE---EEYWLVANSWGPDWGDQGFFKIKRGSDECGIESRVLTGTARL 269


>gi|312374702|gb|EFR22199.1| hypothetical protein AND_15622 [Anopheles darlingi]
          Length = 339

 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 86/258 (33%), Positives = 121/258 (46%), Gaps = 41/258 (15%)

Query: 81  DEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEH--HVNGTRPSCDASK 138
           D D+P +FDSR +WPNC ++REIR+QG+CGSCW      +       H NGTR    A++
Sbjct: 89  DIDIPESFDSRDRWPNCDSLREIRNQGTCGSCWAVAAASVMSDRVCIHTNGTRNVAIAAE 148

Query: 139 ---GHTPKCVRECQENY----DVPYKKDLNF----------GAKSYSVSSNEKSIMKEIY 181
              G    C   C+  +       Y  D             G K Y              
Sbjct: 149 DLMGCCADCGNGCEGGFLDGTSFQYWVDAGLVSGGAYNSTEGCKPYPFKPCLYPFTDCHR 208

Query: 182 EHGP------VEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFT 235
           E  P        G    +    ++ S  + VP +E     +I++ I  N      EG F 
Sbjct: 209 EESPKCKHHCQHGVDKRYARDKVFGSVAYSVPRDE----RVIRYEIMTNGP---VEGGFD 261

Query: 236 VFDDLILYKS-------GKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKI 288
           V++D+ LYKS       G+ +G HA+RI+GWG +      YWLI+NS+  DWGD+G FKI
Sbjct: 262 VYEDVFLYKSGVYRHVYGEHVGKHAVRIIGWGRE--GGIPYWLISNSYGEDWGDHGYFKI 319

Query: 289 LRGKDECGIESSITAGVP 306
           +RG +  GIES +  G+P
Sbjct: 320 VRGINHLGIESKVITGLP 337



 Score = 41.6 bits (96), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 20/36 (55%), Positives = 26/36 (72%), Gaps = 1/36 (2%)

Query: 9   CGFGCNGGF-PGMAWRYWVKSGIVSGGAYGSKQAEK 43
           CG GC GGF  G +++YWV +G+VSGGAY S +  K
Sbjct: 157 CGNGCEGGFLDGTSFQYWVDAGLVSGGAYNSTEGCK 192


>gi|728602|emb|CAA88490.1| cathepsin B-like enzyme [Leishmania mexicana]
 gi|1586011|prf||2202319A cathepsin B-like Cys protease
          Length = 340

 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 95/297 (31%), Positives = 121/297 (40%), Gaps = 107/297 (36%)

Query: 72  PELIGYSEVDEDLPANFDSRTKWPNCPTI---REIRDQGS-------------------- 108
           P      E+ +DLP +FD+  KWP C TI   R+  + GS                    
Sbjct: 86  PRNFSVEEMQQDLPESFDASEKWPMCVTIGEIRDQSNCGSCWAIAAVEAMSDRYCTMSGI 145

Query: 109 ----------------CG-SCWG-------------------CRPYEIAPCEHHVNGTR- 131
                           CG  C+G                   C+PY   PC HH N ++ 
Sbjct: 146 PDRRISTTNLLSCCFICGFGCYGGIPAMAWLWWVWVGVTTELCQPYPFGPCSHHGNSSKY 205

Query: 132 PSCDASKGHTPKCVRECQ--ENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGA 189
           P C  +  +TPKC   C   E   V YK     G  SYS+   E+ +  E+  +GP+E A
Sbjct: 206 PPCPNTIYNTPKCNTTCDNVEMELVKYK-----GVSSYSIK-GERELDHELMNNGPLEVA 259

Query: 190 FTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKAL 249
             V+ D + YKSG                                     +  + SG  L
Sbjct: 260 MQVYADFVAYKSG-------------------------------------VYKHVSGDHL 282

Query: 250 GGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           GGHA++++GWG   K    YW IANSWNTDWGD G F I RG DECGIESS  AG P
Sbjct: 283 GGHAVKLVGWGV--KDGIPYWKIANSWNTDWGDKGYFLIQRGNDECGIESSGVAGKP 337



 Score = 39.3 bits (90), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 14/25 (56%), Positives = 18/25 (72%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVS 32
           +CGFGC GG P MAW +WV  G+ +
Sbjct: 161 ICGFGCYGGIPAMAWLWWVWVGVTT 185


>gi|187103108|ref|NP_001119614.1| cathepsin B-1418 precursor [Acyrthosiphon pisum]
 gi|163300438|tpg|DAA06126.1| TPA_inf: cathepsin B transcript 1418 [Acyrthosiphon pisum]
 gi|239788654|dbj|BAH70998.1| ACYPI000010 [Acyrthosiphon pisum]
          Length = 346

 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 76/211 (36%), Positives = 104/211 (49%), Gaps = 59/211 (27%)

Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKC-VREC-QENYDVPYKKDLNFGA 164
           G  GS  GC+PY I PC       R +C      TP C ++ C   NY   Y+ DL++  
Sbjct: 181 GDYGSEDGCQPYSIYPCGKG----RNTCIEDDPDTPDCSIKTCTNSNYSKNYRADLHYVD 236

Query: 165 KSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDN 224
             YS+S +E+ IMK++Y++GPV+ A                                   
Sbjct: 237 TVYSLSRSEEDIMKDLYKNGPVQAA----------------------------------- 261

Query: 225 TSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWN 277
                    F V+ D + YKSG       +  GGHAI+ILGWG D+ +K  YWL ANSW+
Sbjct: 262 ---------FYVYTDFMYYKSGVYSYTRGQIEGGHAIKILGWGVDDGTK--YWLCANSWS 310

Query: 278 TDWGDNGLFKILRGKDECGIESSITAGVPKL 308
             WG+NGLF+ILRG +EC IE  + AG+P +
Sbjct: 311 RSWGENGLFRILRGNNECHIEDRVIAGMPHV 341



 Score = 47.4 bits (111), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 25/65 (38%), Positives = 35/65 (53%), Gaps = 4/65 (6%)

Query: 53  HLKSWMGVHPDYNLPANRLPEL--IGYSEVDEDLPANFDSRTKWPNCPTIR-EIRDQGSC 109
           +    MGV P  N  + R   +      E +E LP NFD+R +WP C ++   I+DQ +C
Sbjct: 58  NFNQLMGVLPR-NFNSFRFAPIKKSAEDESNEALPENFDARERWPECSSLLGSIKDQSNC 116

Query: 110 GSCWG 114
           GSCW 
Sbjct: 117 GSCWA 121



 Score = 41.6 bits (96), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 17/31 (54%), Positives = 24/31 (77%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
           CG GC+GG P  AW ++++ GIV+GG YGS+
Sbjct: 156 CGNGCDGGSPESAWYFFMRHGIVTGGDYGSE 186


>gi|154340956|ref|XP_001566431.1| cysteine peptidase C (CPC) [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134063754|emb|CAM39941.1| cysteine peptidase C (CPC) [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 340

 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 96/315 (30%), Positives = 142/315 (45%), Gaps = 33/315 (10%)

Query: 18  PGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRL-PELIG 76
           P ++ R+  +  + + G + +     + +S      L+  MGV    N+    L P +  
Sbjct: 34  PLLSNRFVAEINLKAKGQWTASADNGHLVSGKSDEELRKLMGV---LNMSTAALSPRIFS 90

Query: 77  YSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDA 136
             E+ ++LP +FDS  KWP C TI EIRDQ +CGSCW     E     +           
Sbjct: 91  AEELAQELPTSFDSSDKWPKCRTISEIRDQSNCGSCWAIAAVEAMSDRYCTVAGITDLRV 150

Query: 137 SKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPV-----EGAFT 191
           S GH   C   C     +  +  +   A  + V     S + + Y   P       G + 
Sbjct: 151 STGHLLSCCFVC----GMGCQGGIPTMAWLWWVWVGLTSEVCQPYPFPPCGHHTDGGKYP 206

Query: 192 VFDDLILYKSGRFFVPGNETTAMSLIK----WTIR---DNTSQLGAEGAFTV----FDDL 240
                I           +  TA++  K    +++R   +   +L   G F V    + D 
Sbjct: 207 ACPSTIYDTPTCNSTCADSHTALTKHKGEKSYSLRGEREYMIELMTYGPFEVAFDVYADF 266

Query: 241 ILYKS-------GKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
           + YKS       G+ LGGHA++++GWG   ++   YW IANSWN+DWGDNG F I RG D
Sbjct: 267 VSYKSGVYSHTTGERLGGHAVKLVGWG--VQNGTPYWKIANSWNSDWGDNGYFLIRRGTD 324

Query: 294 ECGIESSITAGVPKL 308
           ECGIES+  AG+P L
Sbjct: 325 ECGIESTGVAGLPSL 339



 Score = 37.4 bits (85), Expect = 9.2,   Method: Compositional matrix adjust.
 Identities = 14/25 (56%), Positives = 17/25 (68%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVS 32
           +CG GC GG P MAW +WV  G+ S
Sbjct: 161 VCGMGCQGGIPTMAWLWWVWVGLTS 185


>gi|1345924|sp|P25802.3|CYSP1_OSTOS RecName: Full=Cathepsin B-like cysteine proteinase 1; Flags:
           Precursor
          Length = 341

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 63/190 (33%), Positives = 97/190 (51%), Gaps = 40/190 (21%)

Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEK 174
           CRPYEI PC HH N T          TP+C R C   Y   Y  D  +  K+Y + ++ K
Sbjct: 189 CRPYEIHPCGHHGNETYYGECVGMADTPRCKRRCLLGYPKSYPSD-RYYKKAYQLKNSVK 247

Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
           +I K+I ++GPV   +TV++D   Y+SG                                
Sbjct: 248 AIQKDIMKNGPVVATYTVYEDFAHYRSG-------------------------------- 275

Query: 235 TVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE 294
                +  +K+G+  G HA++++GWGE++ +   YW++ANSW+ DWG+NG F++ RG ++
Sbjct: 276 -----IYKHKAGRKTGLHAVKVIGWGEEKGTP--YWIVANSWHDDWGENGFFRMHRGSND 328

Query: 295 CGIESSITAG 304
           CG E  + AG
Sbjct: 329 CGFEERMAAG 338



 Score = 41.6 bits (96), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 14/31 (45%), Positives = 21/31 (67%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           +P ++D R +W NC ++  I DQ +CGSCW 
Sbjct: 91  IPESYDPRIQWANCSSLFHIPDQANCGSCWA 121


>gi|28932700|gb|AAO60044.1| midgut cysteine proteinase 1 [Rhipicephalus appendiculatus]
          Length = 332

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 87/283 (30%), Positives = 114/283 (40%), Gaps = 97/283 (34%)

Query: 85  PANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYE-----------------------IA 121
           P +F  R  W +C +IR IRDQ +CGSCW     E                       +A
Sbjct: 88  PESFTPREYWSHCSSIRVIRDQSACGSCWAFAAAESISDRICIHTNGKVQVNISAEDLLA 147

Query: 122 PCEHHVNGTRPSCDASKG----------------------HTPKCVREC----------- 148
            C    +G    C  S                          P CV  C           
Sbjct: 148 CCHTCGHGCDGRCHCSSVAILQGRRLVPEPVRTEDGCQPYSLPPCVPNCTHPEPTPKCQH 207

Query: 149 --QENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFV 206
             ++ Y+  Y++D +F    Y +     +I  +IY++GPVE AF V+ D   YKSG +  
Sbjct: 208 VCRKGYEKSYEEDKHFAKNVYRLLKKCDAIKTDIYKNGPVESAFFVYADFPSYKSGVY-- 265

Query: 207 PGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSK 266
                    +IK+                             +G HAI+ILGWG ++   
Sbjct: 266 ------QQHMIKF-----------------------------MGVHAIKILGWGTEDGV- 289

Query: 267 EKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKLD 309
             YWL+ANSWN  WGD G FKILRGKDECGIE  I AG+P  D
Sbjct: 290 -PYWLVANSWNVGWGDKGYFKILRGKDECGIEEVIDAGIPMED 331


>gi|308488594|ref|XP_003106491.1| hypothetical protein CRE_15919 [Caenorhabditis remanei]
 gi|308253841|gb|EFO97793.1| hypothetical protein CRE_15919 [Caenorhabditis remanei]
          Length = 342

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 66/207 (31%), Positives = 102/207 (49%), Gaps = 40/207 (19%)

Query: 103 IRDQGSCGSCWGCRPYEIAPCEHHV-NGTRPSCDASKGHTPKCVRECQENYDVPYKKDLN 161
           I   GS  S +GC+PY IAPC   + N T P C  +   TP C ++C+  Y V   KD +
Sbjct: 174 IPTGGSYESQFGCKPYSIAPCGKTIGNVTYPPCTNTTLPTPTCEKKCKPGYPVDLDKDRH 233

Query: 162 FGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTI 221
           +G     + + +  I  ++  +GPVE    ++DD + Y +G +                 
Sbjct: 234 YGVSVDQLPNRQIEIQSDVMLNGPVEATMEIYDDFLQYTTGIY----------------- 276

Query: 222 RDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWG 281
                               ++ +G   G  ++RILGWG  E     YWL+ANSW  +WG
Sbjct: 277 --------------------VHLAGNKQGHLSVRILGWGMFEGVP--YWLLANSWGKEWG 314

Query: 282 DNGLFKILRGKDECGIESSITAGVPKL 308
           +NG F++LRG +ECG+E++  +G+PKL
Sbjct: 315 ENGTFRVLRGVNECGLEANCISGMPKL 341



 Score = 40.4 bits (93), Expect = 0.90,   Method: Compositional matrix adjust.
 Identities = 18/31 (58%), Positives = 22/31 (70%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
           CG GC GG P  AW+YW K GI +GG+Y S+
Sbjct: 153 CGEGCAGGNPLKAWQYWQKHGIPTGGSYESQ 183


>gi|156255405|gb|ABU62925.1| cathepsin B [Fasciola hepatica]
          Length = 337

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 71/201 (35%), Positives = 96/201 (47%), Gaps = 54/201 (26%)

Query: 114 GCRPYEIAPCEHHV-NGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
           GC PY    C H V     P C      TPKC ++C   Y+  Y++D   G  SY+V   
Sbjct: 183 GCLPYPFPKCSHGVVTPGLPPCPRDIYPTPKCEKKCHAGYNKTYEQDKVKGKSSYNVGGQ 242

Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
           E  IM EI ++GPV+G                                            
Sbjct: 243 ETDIMMEIMKNGPVDGI------------------------------------------- 259

Query: 233 AFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
            F +F+D ++YKSG       + +GGHAIR++GWG +  +  KYWLIANSWN  WG+ G 
Sbjct: 260 -FYMFEDFLVYKSGIYHYTTGRLVGGHAIRVIGWGVE--NGVKYWLIANSWNEGWGEKGY 316

Query: 286 FKILRGKDECGIESSITAGVP 306
           F++ RG +ECGIE+ I AG+P
Sbjct: 317 FRMRRGNNECGIEARINAGLP 337



 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/76 (42%), Positives = 43/76 (56%), Gaps = 2/76 (2%)

Query: 39  KQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCP 98
           K A     +NI +  +K  +GV  +     N   + + YS  + DLP +FD+R KW NCP
Sbjct: 43  KAAPSTRFNNIDQ--VKQNLGVLEETPEDRNTQRQTVRYSVSENDLPESFDARQKWANCP 100

Query: 99  TIREIRDQGSCGSCWG 114
           +I EIRDQ SC SCW 
Sbjct: 101 SISEIRDQSSCSSCWA 116


>gi|21930117|gb|AAM82155.1| cysteine proteinase [Ancylostoma ceylanicum]
          Length = 348

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 64/196 (32%), Positives = 92/196 (46%), Gaps = 40/196 (20%)

Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
            C+PY   PC +H +      C      TP C R CQ  Y +P++KD  F  ++Y +  N
Sbjct: 192 ACQPYAFYPCGNHAHEPYYGPCPDELWPTPTCRRTCQLGYPIPFEKDKIFNDQTYYIFGN 251

Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
           E  I  EI   GPV   + V+ D   YK G +                            
Sbjct: 252 ETEIKYEIMTRGPVVATYKVYRDFDYYKKGVY---------------------------- 283

Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
                    +++ G+  G HA++I+GWG+   +   YWL+ANSWNTDWGDNG F+I+RG 
Sbjct: 284 ---------IHREGEVTGLHAVKIIGWGKG--NDVPYWLVANSWNTDWGDNGYFRIVRGT 332

Query: 293 DECGIESSITAGVPKL 308
           D C IE  +  G+ ++
Sbjct: 333 DNCEIERQMVGGIMRV 348



 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 20/41 (48%), Positives = 30/41 (73%)

Query: 74  LIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           ++  +E+  D+P  FD+R +WPNC +++ IRDQ SCGSCW 
Sbjct: 84  VLANTEMKVDIPDTFDARDRWPNCTSMKHIRDQSSCGSCWA 124



 Score = 40.4 bits (93), Expect = 0.99,   Method: Compositional matrix adjust.
 Identities = 17/33 (51%), Positives = 22/33 (66%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
           CGFGC GG+P  A+ Y  + G+ +GG YG K A
Sbjct: 160 CGFGCKGGYPARAFGYAWRYGLSTGGPYGEKDA 192


>gi|326490902|dbj|BAJ90118.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326508404|dbj|BAJ99469.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326514912|dbj|BAJ99817.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 345

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 87/315 (27%), Positives = 130/315 (41%), Gaps = 94/315 (29%)

Query: 46  LSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRD 105
           L+N      K  +GV P        +P          +LP  FD+R+KW  C TI +I D
Sbjct: 59  LANYTIEQFKHILGVKPTPPGLLAGVPTKTYSRSEKAELPKEFDARSKWSGCSTIGKILD 118

Query: 106 QGSCGSCWGCRPYEIAP---CEHH------------------------------------ 126
           QG CG+CW     E      C HH                                    
Sbjct: 119 QGHCGACWAFGAVECLQDRFCIHHSVNVSLSVNDLVACCGFLCGDGCDGGYPIFAWQYFV 178

Query: 127 ---------------VNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSS 171
                          V    P C+ +   TP C ++C+    V +++  +F   +Y V+S
Sbjct: 179 ENGVVTDECDPFFDQVGCQHPGCEPAY-PTPVCEKKCKVQNQV-WEEKKHFSIDAYQVNS 236

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           +   IM E+Y++GPVE +F      I+Y+    +  G                       
Sbjct: 237 DPHDIMAEVYKNGPVEVSF------IIYEDFAHYKSG----------------------- 267

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
               V+  +    +G+ +GGHA +++GWG  + + E YWL+AN WN  WGD+G FKI+RG
Sbjct: 268 ----VYKQI----TGRMVGGHAAKLIGWGTSD-AGEDYWLLANQWNRGWGDDGYFKIIRG 318

Query: 292 KDECGIESSITAGVP 306
            +ECGIE  + AG+P
Sbjct: 319 TNECGIEGDVNAGMP 333



 Score = 38.5 bits (88), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 14/25 (56%), Positives = 22/25 (88%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVS 32
           LCG GC+GG+P  AW+Y+V++G+V+
Sbjct: 160 LCGDGCDGGYPIFAWQYFVENGVVT 184


>gi|297744106|emb|CBI37076.3| unnamed protein product [Vitis vinifera]
          Length = 392

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 67/193 (34%), Positives = 99/193 (51%), Gaps = 47/193 (24%)

Query: 115 CRPY-EIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           C PY +   C H      P C+     TPKCVR+C +   + ++K   +G  +Y +SS+ 
Sbjct: 225 CDPYFDATGCSH------PGCEPGYP-TPKCVRKCTDENQL-WRKAKRYGQSAYRISSDP 276

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
             IM E+Y++GPVE AFTV++D   Y+SG                               
Sbjct: 277 YQIMAEVYKNGPVEVAFTVYEDFAHYESG------------------------------- 305

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                 +  Y +G  +GGHA++++GWG  +   E YW++AN WN +WGD+G F I RG +
Sbjct: 306 ------VYRYTTGDVMGGHAVKLIGWGTTDDG-EDYWILANQWNRNWGDDGYFMIRRGVN 358

Query: 294 ECGIESSITAGVP 306
           ECGIE  + AG+P
Sbjct: 359 ECGIEEGVVAGLP 371



 Score = 38.9 bits (89), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 14/25 (56%), Positives = 20/25 (80%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVS 32
           LCG GC+GG+P  AWRY++  G+V+
Sbjct: 198 LCGSGCDGGYPLYAWRYFIHHGVVT 222


>gi|204022104|dbj|BAG71149.1| cathepsin B-N [Astegopteryx styracophila]
          Length = 332

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 69/195 (35%), Positives = 99/195 (50%), Gaps = 44/195 (22%)

Query: 114 GCRPYEIAPC--EHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSS 171
           GC+PY ++PC  + + N T     A K H  +C R C  N D+ +KKD +F   +Y ++ 
Sbjct: 180 GCQPYRVSPCPLDEYGNNTCRGKPAEKNH--RCTRMCYGNQDLDFKKDHHFTRDAYYLTF 237

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
               I +++  +GP+E ++ V+DD   YKSG +    N T                    
Sbjct: 238 G--IIQRDVMAYGPIEASYDVYDDFPSYKSGVYVRTENATY------------------- 276

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
                            LGGHA++++GWGE+      YWL+ NSWN  WGD GLFKI RG
Sbjct: 277 -----------------LGGHAVKLIGWGEEYGV--PYWLMVNSWNDQWGDKGLFKIRRG 317

Query: 292 KDECGIESSITAGVP 306
            +ECGI++S T GVP
Sbjct: 318 TNECGIDNSTTGGVP 332



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 19/34 (55%), Positives = 24/34 (70%)

Query: 81  DEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           ++ +P  FD+R KW  C TI E+RDQG CGSCW 
Sbjct: 80  NQKIPKFFDARKKWRKCFTIGEVRDQGKCGSCWA 113



 Score = 41.2 bits (95), Expect = 0.68,   Method: Compositional matrix adjust.
 Identities = 17/30 (56%), Positives = 22/30 (73%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CGFGC+GG+P  AW  + K G+V+GG Y S
Sbjct: 148 CGFGCHGGYPIKAWERFQKHGLVTGGDYDS 177


>gi|71656032|ref|XP_816569.1| cysteine peptidase C (CPC) [Trypanosoma cruzi strain CL Brener]
 gi|70881707|gb|EAN94718.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi]
          Length = 333

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 92/264 (34%), Positives = 118/264 (44%), Gaps = 38/264 (14%)

Query: 72  PELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTR 131
           P      E+ E L   FD+   WP CPTI EIRDQ SCGSCW           +   G  
Sbjct: 80  PRQFSEEELREPLQDRFDAGEAWPKCPTITEIRDQSSCGSCWAVAAASAISDRYCTLGGV 139

Query: 132 PSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEG-AF 190
                S G    C   C    +  Y +      + Y+V      I+ E  +  P    A 
Sbjct: 140 RDLRISAGDLMSCCDVCGYGCNGGYPE---VAWEYYAV----HGIVSEYCQPYPFPSCAH 192

Query: 191 TVFDDLILYKSGRFFVPGNETTA----MSLIKWTIRDNTSQLGA---------------E 231
            V    +   SG +  P   +T     + LIK+  R NTS L +               E
Sbjct: 193 HVNSSDLSPCSGEYDTPTCNSTCTDKKVPLIKY--RGNTSYLLSGEESFKRELLLNGPFE 250

Query: 232 GAFTVFDDLILYK-------SGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNG 284
            +F+V+ D + Y        +G  LGGHA+RI+GWG  E + E YW IANSWN +WG NG
Sbjct: 251 VSFSVYADFLAYTGGVYKHVAGTFLGGHAVRIVGWG--ELNGEPYWKIANSWNREWGMNG 308

Query: 285 LFKILRGKDECGIESSITAGVPKL 308
            F I RG DECGIE S  AG P++
Sbjct: 309 YFLIARGVDECGIEGSGVAGTPRI 332



 Score = 40.4 bits (93), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 15/25 (60%), Positives = 20/25 (80%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVS 32
           +CG+GCNGG+P +AW Y+   GIVS
Sbjct: 155 VCGYGCNGGYPEVAWEYYAVHGIVS 179


>gi|30678927|ref|NP_849281.1| cathepsin B [Arabidopsis thaliana]
 gi|3859606|gb|AAC72872.1| contains similarity to cysteine proteases (Pfam: PF00112,
           E=1.3e-79, N=1) [Arabidopsis thaliana]
 gi|7268205|emb|CAB77732.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|332656653|gb|AEE82053.1| cathepsin B [Arabidopsis thaliana]
          Length = 359

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 96/335 (28%), Positives = 132/335 (39%), Gaps = 118/335 (35%)

Query: 37  GSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDED----LPANFDSRT 92
           G K A  +  SN   A  K  +GV P            +G   V  D    LP  FD+RT
Sbjct: 58  GWKAAINDRFSNATVAEFKRLLGVKP------TPKKHFLGVPIVSHDPSLKLPKAFDART 111

Query: 93  KWPNCPTIREIRDQGSCGSCW-------------------------------------GC 115
            WP C +I  I   G CGSCW                                     GC
Sbjct: 112 AWPQCTSIGNILGLGHCGSCWAFGAVESLSDRFCIQFGMNISLSVNDLLACCGFRCGDGC 171

Query: 116 RP-YEIAP-------------CEHHVNGT---RPSCDASKGHTPKCVRECQENYDVPYKK 158
              Y IA              C+ + + T    P C+ +   TPKC R+C  +  + + +
Sbjct: 172 DGGYPIAAWQYFSYSGVVTEECDPYFDNTGCSHPGCEPAY-PTPKCSRKCVSDNKL-WSE 229

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
             ++   +Y+V SN + IM E+Y++GPV                                
Sbjct: 230 SKHYSVSTYTVKSNPQDIMAEVYKNGPV-------------------------------- 257

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWL 271
                       E +FTV++D   YKSG         +GGHA++++GWG   +  E YWL
Sbjct: 258 ------------EVSFTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTSSEG-EDYWL 304

Query: 272 IANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           +AN WN  WGD+G F I RG +ECGIE    AG+P
Sbjct: 305 MANQWNRGWGDDGYFMIRRGTNECGIEDEPVAGLP 339


>gi|44968648|gb|AAS49594.1| cathepsin B [Scyliorhinus canicula]
          Length = 206

 Score =  117 bits (294), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 61/149 (40%), Positives = 80/149 (53%), Gaps = 38/149 (25%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I+PCEHHVNG+RP C      TP+C R C+  Y   Y +D ++G  SYS+ S+ 
Sbjct: 93  GCRPYSISPCEHHVNGSRPKCSGEI-ETPRCSRRCEAGYSPKYSEDKHYGLTSYSIGSDV 151

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
             IM EIY++GPVE A  VF D +LYKSG +                             
Sbjct: 152 TEIMTEIYKNGPVEAALEVFKDFLLYKSGVY----------------------------- 182

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGED 262
                    +K+G ++GGHAI+ILGWGE+
Sbjct: 183 --------QHKTGGSIGGHAIKILGWGEE 203



 Score = 43.5 bits (101), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 17/28 (60%), Positives = 20/28 (71%)

Query: 9  CGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
          CG GCNGG+P  AW +W   G+VSGG Y
Sbjct: 61 CGNGCNGGYPSGAWEFWTNDGLVSGGLY 88


>gi|187105118|ref|NP_001119619.1| cathepsin B-5880 precursor [Acyrthosiphon pisum]
 gi|163300442|tpg|DAA06127.1| TPA_inf: cathepsin B transcript 5880 [Acyrthosiphon pisum]
 gi|239790051|dbj|BAH71611.1| ACYPI000015 [Acyrthosiphon pisum]
 gi|239790053|dbj|BAH71612.1| ACYPI000015 [Acyrthosiphon pisum]
          Length = 302

 Score =  117 bits (293), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 72/201 (35%), Positives = 97/201 (48%), Gaps = 43/201 (21%)

Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVREC-QENYDVPYKKDLNFGAK 165
           G  GS  GC+PY I PC+H       +C      TP+C  +C   +Y   Y KD N    
Sbjct: 143 GEYGSNEGCQPYTIEPCQHTETAVENACSNKTLFTPECKVQCYNPDYGTRYVKD-NHQGT 201

Query: 166 SYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNT 225
            Y V +   + MKEIYE+GP+  +F ++ D + Y+SG +                     
Sbjct: 202 HYRVPA--YTAMKEIYENGPITASFYMYQDFVNYQSGVY--------------------- 238

Query: 226 SQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
                            Y SGK +   A++ILGWGE+  +   YWL ANS+NT WGDNG 
Sbjct: 239 ----------------AYNSGKYVTTQAVKILGWGEENGTP--YWLAANSFNTYWGDNGF 280

Query: 286 FKILRGKDECGIESSITAGVP 306
            KILRG +EC IE  + AG+P
Sbjct: 281 VKILRGANECYIEEFMYAGLP 301


>gi|347972088|ref|XP_313836.5| AGAP004534-PA [Anopheles gambiae str. PEST]
 gi|333469166|gb|EAA09182.5| AGAP004534-PA [Anopheles gambiae str. PEST]
          Length = 334

 Score =  117 bits (293), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 91/305 (29%), Positives = 134/305 (43%), Gaps = 51/305 (16%)

Query: 40  QAEKNSLSNIPRAHLKSWMGV---HPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
           + + N  SN+     +S  G+      + +P  R   +      D D+P +FD+R  WPN
Sbjct: 45  KPDTNFQSNVHFHAFRSLKGIGESRTGFKVPIRRYEYVY-----DVDIPESFDARNHWPN 99

Query: 97  CPTIREIRDQGSCGSCWGCRPYEIAPCEH--HVNGTRPSCDASKGHTPKCVRECQENYDV 154
           C ++R IR+QG+CGSCW      +       H NGT     A++     CV +C    + 
Sbjct: 100 CESLRAIRNQGTCGSCWAVAAASVMSDRVCIHSNGTINVALAAEDLMGCCV-DCGNGCNG 158

Query: 155 PYKKDLNF------------------GAKSYSVSSNEKSIMKEIYEHGP------VEGAF 190
            +    +F                  G K Y     E        E  P       +G  
Sbjct: 159 GFLDGTSFQYWVDAGLVSGGAYNSTDGCKPYPFKPCEYPFNDCHVEISPKCTHHCRDGVD 218

Query: 191 TVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKS----- 245
             +    L+    + VP +E      I++ I  N      E  F V++D++LYKS     
Sbjct: 219 RHYSKDKLFGKVAYSVPRDERA----IRYEIMTNGP---VEAGFDVYEDVLLYKSGVYRH 271

Query: 246 --GKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITA 303
             G+ +G HA+RI+GWG D      YWLIANS+  DWGD+G FK +RG +  GIES I  
Sbjct: 272 VYGEQIGKHAVRIIGWGRD--GGIPYWLIANSYGDDWGDHGYFKFVRGSNHLGIESKIIT 329

Query: 304 GVPKL 308
           G+P +
Sbjct: 330 GLPLI 334



 Score = 43.5 bits (101), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 21/36 (58%), Positives = 26/36 (72%), Gaps = 1/36 (2%)

Query: 9   CGFGCNGGF-PGMAWRYWVKSGIVSGGAYGSKQAEK 43
           CG GCNGGF  G +++YWV +G+VSGGAY S    K
Sbjct: 152 CGNGCNGGFLDGTSFQYWVDAGLVSGGAYNSTDGCK 187


>gi|226466816|emb|CAX69543.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 337

 Score =  117 bits (292), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 66/191 (34%), Positives = 92/191 (48%), Gaps = 39/191 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY    C+H  + + P C       P C   C+  Y +PY  D +FG  +Y V  NE
Sbjct: 181 GCLPYPFPKCDHGSSDSYPMCGYVVYTPPVCNGTCRPGYPIPYNDDKHFGKSAYQVKQNE 240

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
             I +EI  +GPVE +  ++DD + YKSG                               
Sbjct: 241 SDIRREIMLYGPVEASIFIYDDFVDYKSG------------------------------- 269

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
             V+  L    +G+ +   ++RI+GWG +  +   YWL ANSWN +WG NG FKILRG +
Sbjct: 270 --VYKHL----TGRLITIQSVRIIGWGIE--NGIPYWLCANSWNEEWGLNGFFKILRGSN 321

Query: 294 ECGIESSITAG 304
           EC IE+ + AG
Sbjct: 322 ECEIEAFVNAG 332


>gi|332374788|gb|AEE62535.1| unknown [Dendroctonus ponderosae]
          Length = 328

 Score =  117 bits (292), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 88/290 (30%), Positives = 115/290 (39%), Gaps = 100/290 (34%)

Query: 79  EVDEDLPANFDSRTKWPNCP-TIREIRDQGSCGSCWGCRPYEIAPCE--HHVNGTRP--- 132
           E+ E++P +FDSRT WP C   I  IRDQ  CGSCW     E        H N T+    
Sbjct: 76  EITEEIPESFDSRTAWPECTQIIGMIRDQSRCGSCWAFAAVEAMSDRICIHSNATKKLLV 135

Query: 133 ------SCDASKG----------------------------------------HTPKC-- 144
                 +C  + G                                        H  KC  
Sbjct: 136 SSQDLLTCGTAGGCNGGWPAVAWSDWTNGIVTGGLYGALEQGCKSYFLEGCDDHPNKCRN 195

Query: 145 ---VRECQENYDVP---YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLIL 198
                 C E  D P   YK    +G   Y +   E+ I  EI  +GPVE    V+ D   
Sbjct: 196 YVSTPACVEQCDEPSLYYKAQETYGQTPYEIQGEEQ-IQYEIMTNGPVEATMDVYVDFAQ 254

Query: 199 YKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILG 258
           Y+SG + +  +E                                       GGHA++ILG
Sbjct: 255 YQSGIYQLTTDEYE-------------------------------------GGHAVKILG 277

Query: 259 WGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           WG ++    KYWL+ANSWN  WG+NGLF+I+RG+DE GIES+I A +P  
Sbjct: 278 WGVEDGV--KYWLVANSWNERWGENGLFRIIRGRDEVGIESTIDAALPDF 325



 Score = 38.9 bits (89), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 19/38 (50%), Positives = 26/38 (68%), Gaps = 3/38 (7%)

Query: 3   TQQIRLCGF--GCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           +Q +  CG   GCNGG+P +AW  W  +GIV+GG YG+
Sbjct: 137 SQDLLTCGTAGGCNGGWPAVAWSDWT-NGIVTGGLYGA 173


>gi|204022100|dbj|BAG71147.1| cathepsin B-N1 [Tuberaphis takenouchii]
          Length = 334

 Score =  117 bits (292), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 68/195 (34%), Positives = 100/195 (51%), Gaps = 44/195 (22%)

Query: 114 GCRPYEIAPC--EHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSS 171
           GC+PY + PC  + + N T     A K H  +C R C  N ++ +K+D ++   +Y ++ 
Sbjct: 182 GCQPYRVPPCPLDEYGNNTCRGKPAEKNH--RCTRMCYGNQELDFKEDHHWTRDAYYLTY 239

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
              +I K++  +GP+E +F V+DD   YKSG +    N +                    
Sbjct: 240 T--TIQKDVMAYGPIEASFDVYDDFPNYKSGVYMKTENASY------------------- 278

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
                            LGGHA++++GWGE+      YWL+ NSWN  WGD GLFKILRG
Sbjct: 279 -----------------LGGHAVKLIGWGEEYGV--PYWLLVNSWNDQWGDQGLFKILRG 319

Query: 292 KDECGIESSITAGVP 306
            +ECGI++S T GVP
Sbjct: 320 TNECGIDNSTTGGVP 334



 Score = 53.9 bits (128), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 21/39 (53%), Positives = 27/39 (69%)

Query: 76  GYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
            Y+ +   +P+NFD+R KW  C TI E+RDQG CGSCW 
Sbjct: 77  AYNSLPNRIPSNFDARKKWRKCSTIGEVRDQGHCGSCWA 115



 Score = 42.0 bits (97), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 17/32 (53%), Positives = 24/32 (75%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CGFGC+GG+P  AW ++ K G+V+GG Y S +
Sbjct: 150 CGFGCHGGYPIKAWEWFKKHGLVTGGDYDSGE 181


>gi|29374027|gb|AAO73004.1| cathepsin B [Fasciola gigantica]
          Length = 337

 Score =  117 bits (292), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 70/201 (34%), Positives = 95/201 (47%), Gaps = 54/201 (26%)

Query: 114 GCRPYEIAPCEHHV-NGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
           GC PY    C H V     P C      TPKC ++C   Y+  Y++D   G  SY+V   
Sbjct: 183 GCLPYPFPKCSHGVVTPGLPPCPRDIYPTPKCEKKCHAGYNKTYEQDKVKGKSSYNVGEQ 242

Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
           E   M EI ++GPV+G                                            
Sbjct: 243 ETDFMMEIMKNGPVDGI------------------------------------------- 259

Query: 233 AFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
            F +F+D ++YKSG       + +GGHAIR++GWG +  +  KYWLIANSWN  WG+ G 
Sbjct: 260 -FYMFEDFLVYKSGIYHYTTGRLVGGHAIRVIGWGVE--NGVKYWLIANSWNEGWGEKGY 316

Query: 286 FKILRGKDECGIESSITAGVP 306
           F++ RG +ECGIE+ I AG+P
Sbjct: 317 FRMRRGNNECGIEARINAGLP 337



 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 32/76 (42%), Positives = 43/76 (56%), Gaps = 2/76 (2%)

Query: 39  KQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCP 98
           K A     +NI +  +K  +GV  +     N   + + YS  + DLP +FD+R KW NCP
Sbjct: 43  KAAPSTRFNNIDQ--VKQNLGVLEETPEDRNTQRQTVRYSVSENDLPESFDARQKWANCP 100

Query: 99  TIREIRDQGSCGSCWG 114
           +I EIRDQ SC SCW 
Sbjct: 101 SISEIRDQSSCSSCWA 116



 Score = 47.4 bits (111), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 16/27 (59%), Positives = 21/27 (77%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           CG+GCNGG P M+W YW + G+V+GG 
Sbjct: 151 CGYGCNGGIPAMSWDYWTREGVVTGGT 177


>gi|308466896|ref|XP_003095699.1| CRE-CPR-3 protein [Caenorhabditis remanei]
 gi|308244581|gb|EFO88533.1| CRE-CPR-3 protein [Caenorhabditis remanei]
          Length = 373

 Score =  117 bits (292), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 69/199 (34%), Positives = 97/199 (48%), Gaps = 51/199 (25%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVP-YKKDLNFGAKSYSVSSN 172
           GC PY  APC+      +  C  S   TP C   CQ +Y    Y  D ++G  +Y +++ 
Sbjct: 190 GCMPYSFAPCQ------KSPCVEST--TPTCKTTCQSSYTTANYTTDKHYGTSAYRLATT 241

Query: 173 E---KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLG 229
                +I  EIY +GPVE ++ V++D   YKSG +                         
Sbjct: 242 NNVVSTIQYEIYHNGPVEASYKVYEDFYQYKSGVYH------------------------ 277

Query: 230 AEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKIL 289
                        Y SGK +GGHA++I+GWG +  +   YWL+ANSW   +G+ G FKI 
Sbjct: 278 -------------YVSGKLVGGHAVKIIGWGTE--NDVDYWLVANSWGIKFGEGGFFKIR 322

Query: 290 RGKDECGIESSITAGVPKL 308
           RG +EC IES++ AGV KL
Sbjct: 323 RGTNECQIESNVVAGVAKL 341


>gi|291291827|gb|ADD91786.1| cysteine proteinase [Haemonchus contortus]
          Length = 253

 Score =  117 bits (292), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 75/198 (37%), Positives = 100/198 (50%), Gaps = 53/198 (26%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY   PC HH   T      ++  TPKCVR+CQ++Y   YKKD + G  +Y   + E
Sbjct: 100 GCRPYPFHPCGHHGKDTYYGECPNEATTPKCVRKCQKSYKKSYKKDRSIGKDAYEEPNAE 159

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K+  +EI ++GPV GA                                            
Sbjct: 160 KATQREIMKNGPVVGA-------------------------------------------- 175

Query: 234 FTVFDDLILYK-------SGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
           FTV++D   YK       +GKA GGHAI+I+GWG++      YWLIANSW+ DWG+NG F
Sbjct: 176 FTVYEDFSYYKKGIYKHTAGKARGGHAIKIIGWGKE--GGVPYWLIANSWHNDWGENGYF 233

Query: 287 KILRGKDECGIESSITAG 304
           +IL G + CGIE ++ AG
Sbjct: 234 RILCGSNHCGIEENVVAG 251



 Score = 47.0 bits (110), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 18/31 (58%), Positives = 24/31 (77%)

Query: 83  DLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
           D+P +  SRTKWP C +++ IRDQ +CGSCW
Sbjct: 1   DIPESPYSRTKWPKCSSLKPIRDQANCGSCW 31



 Score = 37.7 bits (86), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 16/28 (57%), Positives = 21/28 (75%)

Query: 9  CGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
          CG+GCNGG+P  A+ Y+ K G V+GG Y
Sbjct: 68 CGYGCNGGWPIQAFNYFSKQGAVTGGDY 95


>gi|341900875|gb|EGT56810.1| hypothetical protein CAEBREN_32632 [Caenorhabditis brenneri]
          Length = 287

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 68/205 (33%), Positives = 103/205 (50%), Gaps = 42/205 (20%)

Query: 107 GSCGSCWGCRPYEIAPCEHHV-NGTRPSCDASKGHTPKCVREC--QENYDVPYKKDLNFG 163
           GS  S +GC+PY IAPC   V N T P+C  +   TP C ++C  +  Y V   KD ++G
Sbjct: 121 GSYESQFGCKPYSIAPCGKTVGNVTYPACTNTTLPTPSCEKKCTSKNGYPVDIDKDRHYG 180

Query: 164 AKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRD 223
           A    + + +  I  ++  +GP+E  F V+DD + Y +G +                   
Sbjct: 181 ASVDQLPNRQIEIQSDVMLNGPIETTFEVYDDFLQYTTGIY------------------- 221

Query: 224 NTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDN 283
                             ++ +G   G  ++RILGWG  E     YWL+ANSW  +WG+N
Sbjct: 222 ------------------VHLTGNKQGHLSVRILGWGMYEGVP--YWLLANSWGKEWGEN 261

Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
           G F+ LRG +ECG+E++  +G+PKL
Sbjct: 262 GTFRALRGTNECGLEANCVSGMPKL 286


>gi|204022094|dbj|BAG71144.1| cathepsin B-N1 [Tuberaphis taiwana]
          Length = 334

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 69/195 (35%), Positives = 98/195 (50%), Gaps = 44/195 (22%)

Query: 114 GCRPYEIAPC--EHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSS 171
           GC+PY + PC  + + N T     A K H  +C R C  N D+ +K+D ++   +Y ++ 
Sbjct: 182 GCQPYRVPPCPLDEYGNNTCRGKPAEKNH--RCTRMCYGNQDLDFKEDHHYTRDAYYLTY 239

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
              +I  +I  +GP+E +F V+DD   YKSG +    N T                    
Sbjct: 240 G--TIQNDILAYGPIEASFEVYDDFPSYKSGVYTKMENATY------------------- 278

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
                            LGGHA++++GWGE+      YWL+ NSWN  WGD GLFKI RG
Sbjct: 279 -----------------LGGHAVKLIGWGEEYGV--PYWLLVNSWNDQWGDQGLFKIRRG 319

Query: 292 KDECGIESSITAGVP 306
            +ECGI++S T GVP
Sbjct: 320 TNECGIDNSTTGGVP 334



 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 35/110 (31%), Positives = 54/110 (49%), Gaps = 26/110 (23%)

Query: 30  IVSGGAYGSKQA---EKNSLSNIPRAHLKSW-MGVHPDYNLPANRLPELIG--------- 76
           +V    Y ++QA   E++ ++ I  A+ K+W  GV+ D  L  +   +L+G         
Sbjct: 7   VVLFSVYRTEQAYFLEEDYINQI-NANAKTWKAGVNFDPKLSIDSFVKLLGSKGVQAAKQ 65

Query: 77  ------------YSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
                       Y+     +P++FD+R KW  C TI E+RDQG CGSCW 
Sbjct: 66  ASPDMFKTHDEAYNSWSNRIPSSFDARKKWRKCSTIGEVRDQGKCGSCWA 115



 Score = 40.8 bits (94), Expect = 0.80,   Method: Compositional matrix adjust.
 Identities = 17/32 (53%), Positives = 23/32 (71%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CGFGC+GG+P  AW  + K G+V+GG Y S +
Sbjct: 150 CGFGCSGGYPIRAWERFKKHGLVTGGNYDSGE 181


>gi|2317912|gb|AAC24376.1| cathepsin B-like cysteine proteinase [Arabidopsis thaliana]
          Length = 357

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 91/331 (27%), Positives = 133/331 (40%), Gaps = 112/331 (33%)

Query: 37  GSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
           G K A  +  +N   A  K  +GV          +P  I   ++   LP  FD+RT W +
Sbjct: 58  GWKAAFNDRFANATVAEFKRLLGVIQTPKTAYLGVP--IVRHDLSLKLPKEFDARTAWSH 115

Query: 97  CPTIREIRDQGSCGSCWG-----------CRPYEI------------------------- 120
           C +IR I   G CGSCW            C  Y +                         
Sbjct: 116 CTSIRRIL--GHCGSCWAFGAVESLSDRFCIKYNLNVSLSANDVIACCGLLCGFGCNGGF 173

Query: 121 ---------------APCEHHVNGT---RPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
                            C+ + + T    P C+ +   TPKC R+C     + + +  ++
Sbjct: 174 PMGAWLYFKYHGVVTQECDPYFDNTGCSHPGCEPTY-PTPKCERKCVSRNQL-WGESKHY 231

Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
           G  +Y ++ + + IM E+Y++GPVE A                                 
Sbjct: 232 GVGAYRINPDPQDIMAEVYKNGPVEVA--------------------------------- 258

Query: 223 DNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANS 275
                      FTV++D   YKSG         +GGHA++++GWG  +   E YWL+AN 
Sbjct: 259 -----------FTVYEDFAHYKSGVYKYITGTKIGGHAVKLIGWGTSDDG-EDYWLLANQ 306

Query: 276 WNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           WN  WGD+G FKI RG +ECGIE S+ AG+P
Sbjct: 307 WNRSWGDDGYFKIRRGTNECGIEQSVVAGLP 337



 Score = 39.7 bits (91), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 16/25 (64%), Positives = 19/25 (76%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVS 32
           LCGFGCNGGFP  AW Y+   G+V+
Sbjct: 164 LCGFGCNGGFPMGAWLYFKYHGVVT 188


>gi|189239879|ref|XP_968767.2| PREDICTED: similar to putative cathepsin B-like proteinase
           [Tribolium castaneum]
 gi|270012755|gb|EFA09203.1| cathepsin B precursor [Tribolium castaneum]
          Length = 353

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 90/311 (28%), Positives = 128/311 (41%), Gaps = 101/311 (32%)

Query: 57  WMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCP-TIREIRDQGSCGSCW-- 113
           ++G+HPD N       E++ + E    +PA FD+R  WP C   I  IR+QG CGSCW  
Sbjct: 50  FLGIHPDPNFQL----EVLEWEEPRTVIPATFDAREYWPQCKDVIGNIRNQGKCGSCWAF 105

Query: 114 ---------------GCRPYEIAP------CE---------------------------- 124
                          G   +E +P      CE                            
Sbjct: 106 AAAEVMSDRLCVATNGSVKFEFSPEDLINCCETCGKKCKGGYSYYAWKYYTSTGLVSGGD 165

Query: 125 -HHVNGTRP--SCDASKGHTPKCVRECQEN-YDVPYKKDLNFGAKSYSVSSNEKSIMKEI 180
            +   G +P    + + G +P+C + CQ   Y   Y  D +FG  +Y +  N  +I +EI
Sbjct: 166 YNTSRGCQPYSKSNFNDGVSPECSKTCQNTKYPTSYLNDRHFGDGTYYILKNVTTIQQEI 225

Query: 181 YEHG-PVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDD 239
              G PV   F V++D  LY+ G +                                   
Sbjct: 226 LLRGGPVMAGFDVYEDFKLYREGVY----------------------------------- 250

Query: 240 LILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD-NGLFKILRGKDECGIE 298
             ++ SG  LG HA++I+GWG +  +   YWL+ANSW  DWG   G+FKI RG +EC IE
Sbjct: 251 --VHTSGALLGSHAVKIIGWGTE--NGWAYWLVANSWGKDWGALGGVFKIRRGTNECKIE 306

Query: 299 SSITAGVPKLD 309
            SI  G  + D
Sbjct: 307 QSIITGHVRKD 317


>gi|268555786|ref|XP_002635882.1| Hypothetical protein CBG01102 [Caenorhabditis briggsae]
          Length = 374

 Score =  116 bits (290), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 63/203 (31%), Positives = 100/203 (49%), Gaps = 40/203 (19%)

Query: 107 GSCGSCWGCRPYEIAPCEHHV-NGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAK 165
           GS  S +GC+PY I+PC+  + N T P C  S   TP C ++C+  Y V   KD ++G  
Sbjct: 210 GSYESQFGCKPYSISPCDTVIGNITFPGCLNSTVQTPSCEKKCKSGYPVELDKDRHYGVS 269

Query: 166 SYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNT 225
              + + +  I  ++  +GP+     V+DD + Y +G                       
Sbjct: 270 VDQLPNRQIEIQSDVMLNGPISATMEVYDDFLQYTTG----------------------- 306

Query: 226 SQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
                         + ++ +G   G  ++RILGWG  E     YWL+ANSW   WG+NG 
Sbjct: 307 --------------IYVHLTGNKQGHLSVRILGWGMYEGVP--YWLLANSWGKQWGENGT 350

Query: 286 FKILRGKDECGIESSITAGVPKL 308
           F++LRG +ECG+E++  +G+P+L
Sbjct: 351 FRVLRGVNECGLEANCVSGMPRL 373


>gi|156708120|gb|ABU93318.1| cathepsin B9 cysteine protease, partial [Monocercomonoides sp. PA]
          Length = 382

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 85/304 (27%), Positives = 134/304 (44%), Gaps = 71/304 (23%)

Query: 77  YSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAP---CEHHVNGTRP- 132
           + ++++++P +FD+RT WPNCPTI  I DQG CGSCW    +E+     C H     +P 
Sbjct: 63  FVKIEDEIPESFDARTNWPNCPTIGHIYDQGHCGSCWAMCSFEVLQDRFCIHSNGSEKPW 122

Query: 133 -------SCDA----------------------------------------SKGHTPKCV 145
                  SCD+                                        S   TP C 
Sbjct: 123 LSGQDITSCDSRSHGCNGGWTETAFEYAKKAGVPTEECVPYLMGKCHHPGCSSWQTPTCK 182

Query: 146 RECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRF- 204
           +EC    +  Y  +  + +KSYS+  N ++I  E+  +GPV   FT +DDL +Y  G + 
Sbjct: 183 KECSSLSNYNYSSNRYYASKSYSIQRNVEAIQLELMRNGPVTAVFTTYDDLAVYWRGVYN 242

Query: 205 FVPGNET--TAMSLIKWTI-RDNTSQLGAEGAFTVFDDLIL-------------YKSGKA 248
            V G+E    A+ ++ W + R++   L  E      +                 +   K 
Sbjct: 243 HVMGSEQGLHAIKIVGWGVWRESEHMLTEEEKKAEEEKRKRIEEEIKKEKREDKWHDFKQ 302

Query: 249 LGGHAIRILGWGEDEKSKEK---YWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGV 305
                 + +   E + +KE+   YW+I NSW  D+G +G+  I RG +ECGIES +  G+
Sbjct: 303 NALEKSKKVKRDETKNNKEEGIPYWIIVNSWGEDFGMDGILLIKRGVNECGIESDVYTGI 362

Query: 306 PKLD 309
           PK++
Sbjct: 363 PKIE 366


>gi|170060938|ref|XP_001866023.1| cathepsin B [Culex quinquefasciatus]
 gi|167879260|gb|EDS42643.1| cathepsin B [Culex quinquefasciatus]
          Length = 353

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 65/167 (38%), Positives = 89/167 (53%), Gaps = 40/167 (23%)

Query: 141 TPKCVRECQENYDVP-YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILY 199
            PKC R+CQ +Y V    KD  FG  +YSV ++E  IM+EI+ +GPV+ AF V+ D   Y
Sbjct: 213 APKCSRKCQSSYSVQDVSKDRRFGRVAYSVVADEHRIMEEIFVNGPVQAAFQVYLDFKTY 272

Query: 200 KSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGW 259
           KSG +                                      + +G   GGHAI+ILGW
Sbjct: 273 KSGVY-------------------------------------RHVTGPLEGGHAIKILGW 295

Query: 260 GEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           G +  +  KYWL +NSW  DWGD+G FKI+RG++  GIE+ + AG+P
Sbjct: 296 GVENGT--KYWLCSNSWGEDWGDHGFFKIVRGENHLGIETDVHAGLP 340



 Score = 64.7 bits (156), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 34/71 (47%), Positives = 44/71 (61%), Gaps = 2/71 (2%)

Query: 50  PRAHLKSW-MGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGS 108
           PR  L S+ +GV+ +  L + RL   I   + D DLP  FD+R KWP CP++REIR+QG 
Sbjct: 64  PRQPLSSYRVGVNME-ELESKRLKPGILILKEDIDLPEQFDARDKWPQCPSLREIRNQGC 122

Query: 109 CGSCWGCRPYE 119
           CGSCW     E
Sbjct: 123 CGSCWAISAAE 133



 Score = 45.8 bits (107), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 20/32 (62%), Positives = 22/32 (68%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CG GC GG  G AW YWV+ G+ SGG Y SKQ
Sbjct: 163 CGDGCQGGVLGPAWDYWVQKGVSSGGPYNSKQ 194


>gi|3088522|gb|AAD03404.1| cathepsin B-like protease precursor [Trypanosoma cruzi]
 gi|407859283|gb|EKG06969.1| cysteine peptidase C (CPC) [Trypanosoma cruzi]
          Length = 333

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 90/264 (34%), Positives = 118/264 (44%), Gaps = 38/264 (14%)

Query: 72  PELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTR 131
           P      E+   L   FD+   WP CPTI EIRDQ SCGSCW           +   G  
Sbjct: 80  PRQFSEEELRVPLQDRFDAGEAWPKCPTITEIRDQSSCGSCWAVAAASAMSDRYCTLGGV 139

Query: 132 PSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEG-AF 190
                S G    C   C    +  Y +      + Y+V      I+ E  +  P    A 
Sbjct: 140 RDLRISAGDLMSCCDVCGYGCNGGYPE---VAWEYYAV----HGIVSEYCQPYPFPSCAH 192

Query: 191 TVFDDLILYKSGRFFVPGNETTA----MSLIKWTIRDNTSQLGA---------------E 231
            V    +   SG +  P   +T     + LIK+  R NTS + +               E
Sbjct: 193 HVNSSDLSPCSGEYDTPTCNSTCTDKKIPLIKY--RGNTSYILSGEESFKRELLLNGPFE 250

Query: 232 GAFTVFDDLILYK-------SGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNG 284
            +F+V+ D + Y        +G  LGGHA+RI+GWG  E + E YW IANSWN +WG NG
Sbjct: 251 VSFSVYADFVAYTGGVYKHVTGVFLGGHAVRIVGWG--ELNGEPYWKIANSWNHEWGMNG 308

Query: 285 LFKILRGKDECGIESSITAGVPKL 308
            F I RG DECGIE S  AG+P++
Sbjct: 309 YFLIARGVDECGIEGSGVAGIPRI 332



 Score = 40.4 bits (93), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 15/25 (60%), Positives = 20/25 (80%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVS 32
           +CG+GCNGG+P +AW Y+   GIVS
Sbjct: 155 VCGYGCNGGYPEVAWEYYAVHGIVS 179


>gi|340053922|emb|CCC48215.1| cysteine peptidase C (CPC) [Trypanosoma vivax Y486]
          Length = 334

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 98/330 (29%), Positives = 133/330 (40%), Gaps = 123/330 (37%)

Query: 46  LSNIPRAHLKSWMG--VHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREI 103
           ++ + R  +K  MG  +     LP     E     E+   LP +FD+ T WP+CPTI+ I
Sbjct: 55  MARLTRQGVKRLMGAKLRDAPVLPRRHFTE----EELRAPLPESFDAATAWPDCPTIKRI 110

Query: 104 ----------------------------RDQG-----------SCGS-CWG--------- 114
                                       RD G           SCG  C G         
Sbjct: 111 ADQSSCGSCWAVAAATAMSDRFCVTGGVRDLGISAGDLLSCCTSCGDGCDGGYPDEAWLY 170

Query: 115 ----------CRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENYDVPYKKDLNFG 163
                     C+PY   PC+H    ++ PSC     HTPKC   C +   +P  +   F 
Sbjct: 171 FTESGLVSDYCQPYPFPPCKHSGGRSKNPSCHDMHFHTPKCNATCTDK-RIPVVR--YFA 227

Query: 164 AKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRD 223
           ++SYS+   E+   +E+Y  GP E A                                  
Sbjct: 228 SESYSLQ-GEEDYKRELYLRGPFEVA---------------------------------- 252

Query: 224 NTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSW 276
                     FTV++D + Y+SG         +GGHA+R++GWGE  ++   YW IANSW
Sbjct: 253 ----------FTVYEDFLAYESGVYKHVSGGPVGGHAVRVVGWGE--RNGVPYWKIANSW 300

Query: 277 NTDWGDNGLFKILRGKDECGIESSITAGVP 306
           NTDWG+NG     RGKDECGIES  +AG P
Sbjct: 301 NTDWGENGYLYFYRGKDECGIESQGSAGTP 330


>gi|170060936|ref|XP_001866022.1| cathepsin B [Culex quinquefasciatus]
 gi|167879259|gb|EDS42642.1| cathepsin B [Culex quinquefasciatus]
          Length = 341

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 71/207 (34%), Positives = 100/207 (48%), Gaps = 49/207 (23%)

Query: 101 REIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVP-YKKD 159
           R +   G   S  GC PY +  C         S D     TPKC R+CQ  Y+V     D
Sbjct: 173 RGVSSGGPYNSRQGCHPYPVDVCH--------SAD-EDADTPKCTRKCQSMYNVTNVSDD 223

Query: 160 LNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKW 219
             FG  +YSVS +E+ I +EI+ +GPV+ +F V+ D   YK+G                 
Sbjct: 224 RRFGRVAYSVSQDEERIKEEIFRNGPVQASFDVYLDFKAYKTG----------------- 266

Query: 220 TIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
                               +  +  G   GGHA++++GWG +  +K  YWL +NSW  D
Sbjct: 267 --------------------VYRHVFGPMEGGHAVKMIGWGVENGTK--YWLCSNSWGED 304

Query: 280 WGDNGLFKILRGKDECGIESSITAGVP 306
           WG+ G FKI+RG++ CGIES + AG+P
Sbjct: 305 WGERGFFKIVRGENHCGIESDVHAGLP 331



 Score = 43.1 bits (100), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 18/32 (56%), Positives = 23/32 (71%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CG GC GG  G AW++WV+ G+ SGG Y S+Q
Sbjct: 154 CGDGCQGGNLGPAWQFWVQRGVSSGGPYNSRQ 185


>gi|294954734|ref|XP_002788292.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239903555|gb|EER20088.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 317

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 66/197 (33%), Positives = 92/197 (46%), Gaps = 43/197 (21%)

Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQE-NYDVPYKKDLNFGAKSYSVSS 171
           GC PY+  PC HH+N T+ P C      TP CV +C    Y    K D ++  +S     
Sbjct: 162 GCWPYDFPPCAHHINDTKYPKCPKGSYETPNCVEQCHNPKYSTSLKNDRHYMLESSPYQY 221

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           +  +    I   GPV  ++ V++D + YKSG +                           
Sbjct: 222 SVNNAKNAIRTDGPVSASYLVYEDFLAYKSGVY--------------------------- 254

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
                      + SG  LGGHA++I+GWGE+  + E YWL+ NSWN DWGD+GLFKI  G
Sbjct: 255 ----------KHTSGSYLGGHAVKIIGWGEE--NGEAYWLVVNSWNEDWGDHGLFKIALG 302

Query: 292 KDECGIESSITAGVPKL 308
              C I+  +  G PK+
Sbjct: 303 N--CQIDDDLLGGTPKV 317



 Score = 48.1 bits (113), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 20/34 (58%), Positives = 25/34 (73%), Gaps = 1/34 (2%)

Query: 82  EDLPANFDSRTKWPNCP-TIREIRDQGSCGSCWG 114
           +DLP +FD+RT +PNC   I  IRDQ +CGSCW 
Sbjct: 58  QDLPTDFDARTAFPNCSKVIGHIRDQSACGSCWA 91


>gi|300835056|gb|ADK37857.1| putative cathepsin precursor [Sitobion avenae]
          Length = 340

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 67/193 (34%), Positives = 92/193 (47%), Gaps = 40/193 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY + PC     G             +C R C  + D+ Y  D  F    Y ++   
Sbjct: 185 GCEPYRVPPCPRDDKGNNTCAGKPIEKNHRCTRMCYGDQDLDYNDDHRFTRDFYYLTYG- 243

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            SI K++  +GP+E +F V+DD   YKSG +     E T                     
Sbjct: 244 -SIQKDVMTYGPIEASFDVYDDFPSYKSGVY-----EKT--------------------- 276

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                     ++   LGGHA++++GWG +E +   YWL+ NSWN  WGD GLFKI RG +
Sbjct: 277 ----------ENASYLGGHAVKLIGWGVEEGT--PYWLMVNSWNAQWGDKGLFKIRRGTN 324

Query: 294 ECGIESSITAGVP 306
           ECGI++S TAGVP
Sbjct: 325 ECGIDNSTTAGVP 337



 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 33/116 (28%), Positives = 51/116 (43%), Gaps = 24/116 (20%)

Query: 23  RYWVKSGIVSGGAYGSKQA---EKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSE 79
           R  +   +V    Y ++QA   EK+ +  I         GV+ D ++P +   +++G   
Sbjct: 3   RLVILLSVVLFSVYQTEQAYFLEKSYIDMINEVATTWTAGVNFDPSIPEDHFIKMLGSKG 62

Query: 80  VDE---------------------DLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           V+                       +P  FD+R KW +C TI E+RDQG CGSCW 
Sbjct: 63  VESAKQASAHEFKTNDVAYDNHFGHIPRTFDARKKWRHCRTIGEVRDQGHCGSCWA 118


>gi|357116879|ref|XP_003560204.1| PREDICTED: cathepsin B-like [Brachypodium distachyon]
          Length = 351

 Score =  115 bits (288), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 94/322 (29%), Positives = 127/322 (39%), Gaps = 110/322 (34%)

Query: 46  LSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRD 105
            +N      K  +GV P        +P    YS     LP  FD+R++W  C TI  I D
Sbjct: 61  FANYTITQFKHILGVKPTPPALLAGVPTK-SYSR-SMKLPTEFDARSQWSGCSTIGTILD 118

Query: 106 QGS---------------------------------------CGS-CWGCRPY------- 118
           QG                                        CGS C G  P        
Sbjct: 119 QGHCGSCWAFGAVECLQDRFCIHLNMNISLSVNDLLACCGFLCGSGCNGGYPISAWRYFR 178

Query: 119 -------EIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSS 171
                  E  P    V    P C+ +   TPKC ++C+   +V +K+  +F   +Y V S
Sbjct: 179 RKGVVTDECDPYFDQVGCKHPGCEPAY-RTPKCEKKCKVQNEV-WKEQKHFSVDAYRVHS 236

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           N   IM E+Y +GPVE A                                          
Sbjct: 237 NPHDIMAEVYTNGPVEVA------------------------------------------ 254

Query: 232 GAFTVFDDLILYKSGK-------ALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNG 284
             FTV++D   YKSG         +GGHA++++GWG  + + E YWL+AN WN  WGD+G
Sbjct: 255 --FTVYEDFAHYKSGVYKHITGGVMGGHAVKLIGWGTSD-AGEDYWLLANQWNRGWGDDG 311

Query: 285 LFKILRGKDECGIESSITAGVP 306
            FKI+RGK+ECGIE  + AG+P
Sbjct: 312 YFKIIRGKNECGIEEDVVAGMP 333



 Score = 39.3 bits (90), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 15/25 (60%), Positives = 20/25 (80%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVS 32
           LCG GCNGG+P  AWRY+ + G+V+
Sbjct: 160 LCGSGCNGGYPISAWRYFRRKGVVT 184


>gi|294885809|ref|XP_002771442.1| cathepsin L precursor, putative [Perkinsus marinus ATCC 50983]
 gi|239875086|gb|EER03258.1| cathepsin L precursor, putative [Perkinsus marinus ATCC 50983]
          Length = 527

 Score =  115 bits (288), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 68/201 (33%), Positives = 97/201 (48%), Gaps = 51/201 (25%)

Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQE-NYDVPYKKDLNFGAKS----Y 167
           GC PY+  PC HH+N T+ P C      TP CV +C    Y    K D ++  +S    Y
Sbjct: 372 GCWPYDFPPCAHHINDTKYPKCPKGSYETPNCVEQCHNPKYTTSLKNDRHYMLESSPYQY 431

Query: 168 SVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQ 227
           SV++ + +I  +    GP+  ++ V++D + YKSG +                       
Sbjct: 432 SVNNAKNAIRTD----GPISASYLVYEDFLAYKSGVY----------------------- 464

Query: 228 LGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFK 287
                          + SG  LGGHA++I+GWGE+  + E YWL+ NSWN DWGD GLFK
Sbjct: 465 --------------KHTSGSYLGGHAVKIIGWGEE--NGEAYWLVVNSWNEDWGDQGLFK 508

Query: 288 ILRGKDECGIESSITAGVPKL 308
           I  G   C I+  +  G PK+
Sbjct: 509 IALGN--CEIDDDLLGGTPKV 527


>gi|29374023|gb|AAO73002.1| cathepsin B [Fasciola gigantica]
          Length = 335

 Score =  115 bits (288), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 72/195 (36%), Positives = 91/195 (46%), Gaps = 44/195 (22%)

Query: 114 GCRPYEIAPCEH--HVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSS 171
           GC PY    C H     G  P C      TPKC ++CQ  Y    ++D   G  SY+V  
Sbjct: 183 GCLPYPFPKCSHLEETPGLAP-CPRELYATPKCEKQCQAGYSKTSEEDKIKGKSSYNVGD 241

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
            E  IM EI  +GPV   + +F+D  +YKSG                             
Sbjct: 242 RETDIMMEIITNGPVSTIYYIFEDFTVYKSG----------------------------- 272

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
                   +  Y SG  +GGH I  +GWG +  +  KYWL ANSWN  WG+NG F+I RG
Sbjct: 273 --------IYQYTSGSLMGGHGI--IGWGVE--NGVKYWLAANSWNEGWGENGYFRIRRG 320

Query: 292 KDECGIESSITAGVP 306
            +ECGIES I AG+P
Sbjct: 321 TNECGIESRINAGLP 335



 Score = 57.4 bits (137), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 30/76 (39%), Positives = 38/76 (50%), Gaps = 2/76 (2%)

Query: 39  KQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCP 98
           K A     +NI     K  +G   +     N     + YS  + DLP +FD+R KWPNC 
Sbjct: 43  KAARSTRFNNI--EQFKKHLGALEETPEERNTRRPTVRYSVSENDLPESFDAREKWPNCS 100

Query: 99  TIREIRDQGSCGSCWG 114
           +I EI DQ SC SCW 
Sbjct: 101 SISEIPDQSSCSSCWA 116



 Score = 47.0 bits (110), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 18/27 (66%), Positives = 21/27 (77%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           CG+GC GG+P MAW YW + GIVSGG 
Sbjct: 151 CGYGCEGGYPSMAWDYWWRHGIVSGGT 177


>gi|17384033|emb|CAD12394.1| cysteine proteinase [Leishmania infantum]
          Length = 340

 Score =  115 bits (287), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 93/297 (31%), Positives = 117/297 (39%), Gaps = 107/297 (36%)

Query: 72  PELIGYSEVDEDLPANFDSRTKWPNCPTI---REIRDQGS-------------------- 108
           P      E+ +DLP  FD+   WP C TI   R+  + GS                    
Sbjct: 86  PRNFSVDEMQQDLPEFFDAAEHWPMCVTISEIRDQSNCGSCWAIAAVEAISDRYCTLGGV 145

Query: 109 ----------------CG-SCWG-------------------CRPYEIAPCEHHVNGTR- 131
                           CG  C+G                   C+PY   PC HH N  + 
Sbjct: 146 PDRRISTSNLLSCCFICGFGCYGGIPTMAWLWWVWVGITTEVCQPYPFGPCSHHGNSDKY 205

Query: 132 PSCDASKGHTPKCVRECQ--ENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGA 189
           P C  +   TPKC   C+  E   V YK     G  SYSV   EK +M E+  +GP+E  
Sbjct: 206 PPCPNTIYDTPKCNTTCEKSEMDLVKYK-----GGTSYSVK-GEKELMIELMTNGPLEVT 259

Query: 190 FTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKAL 249
             V+ D + YKSG +                                      + SG  L
Sbjct: 260 MQVYSDFVGYKSGVY-------------------------------------KHVSGDLL 282

Query: 250 GGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           GGHA++++GWG   +    YW IANSWNTDWGD G F I RG +ECGIES   AG P
Sbjct: 283 GGHAVKLVGWGT--QGGVPYWKIANSWNTDWGDKGYFLIQRGSNECGIESGGVAGTP 337



 Score = 38.9 bits (89), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 15/25 (60%), Positives = 18/25 (72%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVS 32
           +CGFGC GG P MAW +WV  GI +
Sbjct: 161 ICGFGCYGGIPTMAWLWWVWVGITT 185


>gi|12004577|gb|AAG44098.1| cathepsin B cysteine protease [Leishmania chagasi]
          Length = 340

 Score =  115 bits (287), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 93/297 (31%), Positives = 117/297 (39%), Gaps = 107/297 (36%)

Query: 72  PELIGYSEVDEDLPANFDSRTKWPNCPTI---REIRDQGS-------------------- 108
           P      E+ +DLP  FD+   WP C TI   R+  + GS                    
Sbjct: 86  PRNFSVDEMQQDLPEFFDAAEHWPMCVTISEIRDQSNCGSCWAIAAVEAISDRYCTLGGV 145

Query: 109 ----------------CG-SCWG-------------------CRPYEIAPCEHHVNGTR- 131
                           CG  C+G                   C+PY   PC HH N  + 
Sbjct: 146 PDRRISTSNLLSCCFICGFGCYGGIPTMAWLWWVWVGITTEVCQPYPFGPCSHHGNSDKY 205

Query: 132 PSCDASKGHTPKCVRECQ--ENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGA 189
           P C  +   TPKC   C+  E   V YK     G  SYSV   EK +M E+  +GP+E  
Sbjct: 206 PPCPNTIYDTPKCNTTCEKSEMDLVKYK-----GGTSYSVK-GEKELMIELMTNGPLEVT 259

Query: 190 FTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKAL 249
             V+ D + YKSG +                                      + SG  L
Sbjct: 260 MQVYSDFVGYKSGGY-------------------------------------KHVSGDLL 282

Query: 250 GGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           GGHA++++GWG   +    YW IANSWNTDWGD G F I RG +ECGIES   AG P
Sbjct: 283 GGHAVKLVGWGT--QGGVPYWKIANSWNTDWGDKGYFLIQRGSNECGIESGGVAGTP 337



 Score = 38.9 bits (89), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 15/25 (60%), Positives = 18/25 (72%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVS 32
           +CGFGC GG P MAW +WV  GI +
Sbjct: 161 ICGFGCYGGIPTMAWLWWVWVGITT 185


>gi|146092987|ref|XP_001466605.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
           [Leishmania infantum JPCM5]
 gi|398018677|ref|XP_003862503.1| cysteine peptidase C (CPC) [Leishmania donovani]
 gi|12005276|gb|AAG44365.1| cathepsin B-like cysteine protease [Leishmania donovani]
 gi|134070968|emb|CAM69644.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
           [Leishmania infantum JPCM5]
 gi|322500733|emb|CBZ35810.1| cysteine peptidase C (CPC) [Leishmania donovani]
          Length = 340

 Score =  115 bits (287), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 93/297 (31%), Positives = 117/297 (39%), Gaps = 107/297 (36%)

Query: 72  PELIGYSEVDEDLPANFDSRTKWPNCPTI---REIRDQGS-------------------- 108
           P      E+ +DLP  FD+   WP C TI   R+  + GS                    
Sbjct: 86  PRNFSVDEMQQDLPEFFDAAEHWPMCVTISEIRDQSNCGSCWAIAAVEAISDRYCTLGGV 145

Query: 109 ----------------CG-SCWG-------------------CRPYEIAPCEHHVNGTR- 131
                           CG  C+G                   C+PY   PC HH N  + 
Sbjct: 146 PDRRISTSNLLSCCFICGFGCYGGIPTMAWLWWVWVGITTEVCQPYPFGPCSHHGNSDKY 205

Query: 132 PSCDASKGHTPKCVRECQ--ENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGA 189
           P C  +   TPKC   C+  E   V YK     G  SYSV   EK +M E+  +GP+E  
Sbjct: 206 PPCPNTIYDTPKCNTTCEKSEMDLVKYK-----GGTSYSVK-GEKELMIELMTNGPLEVT 259

Query: 190 FTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKAL 249
             V+ D + YKSG +                                      + SG  L
Sbjct: 260 MQVYSDFVGYKSGVY-------------------------------------KHVSGDLL 282

Query: 250 GGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           GGHA++++GWG   +    YW IANSWNTDWGD G F I RG +ECGIES   AG P
Sbjct: 283 GGHAVKLVGWGT--QGGVPYWKIANSWNTDWGDKGYFLIQRGSNECGIESGGVAGTP 337



 Score = 38.9 bits (89), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 15/25 (60%), Positives = 18/25 (72%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVS 32
           +CGFGC GG P MAW +WV  GI +
Sbjct: 161 ICGFGCYGGIPTMAWLWWVWVGITT 185


>gi|347972080|ref|XP_313831.5| AGAP004531-PA [Anopheles gambiae str. PEST]
 gi|333469162|gb|EAA09191.5| AGAP004531-PA [Anopheles gambiae str. PEST]
          Length = 375

 Score =  115 bits (287), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 78/302 (25%), Positives = 122/302 (40%), Gaps = 111/302 (36%)

Query: 70  RLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRP------------ 117
           +LP      + ++ LP +FD+R KW  CP++  +R+QG C S +                
Sbjct: 117 QLPLGFVLKKDEQPLPMSFDARQKWSYCPSMNMVRNQGCCDSSYAVAAVSTMTDRWCVHS 176

Query: 118 ----------YEIAPCEHHV----NGTRPS------------------------------ 133
                     Y++  C H      +G  PS                              
Sbjct: 177 EGKAQFNFGAYDVLSCCHRCGFGCDGGVPSAVWHYWVENGITSGGAFGSHEGCQSYPFDV 236

Query: 134 CDAS--KGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFT 191
           C  S     TP+C+R CQ  Y+V Y +D ++G  +Y+V  +E+ IM E++  GP      
Sbjct: 237 CKKSGDSNDTPRCLRFCQPGYNVTYPEDKHYGRVAYTVPKDEERIMYEVFNFGP------ 290

Query: 192 VFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGK---- 247
                                                 A+  FT++ D + YKSG     
Sbjct: 291 --------------------------------------AQATFTMYTDFVQYKSGVYRHT 312

Query: 248 ---ALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAG 304
               +G H+++++GWG +  +  KYWL ANSW   WGD G FKI+RG+D    E+++ AG
Sbjct: 313 FGVRVGTHSVKVMGWGVE--NDVKYWLCANSWGAQWGDGGFFKIVRGEDHLSFETNVVAG 370

Query: 305 VP 306
           +P
Sbjct: 371 LP 372



 Score = 51.6 bits (122), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 20/32 (62%), Positives = 25/32 (78%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CGFGC+GG P   W YWV++GI SGGA+GS +
Sbjct: 196 CGFGCDGGVPSAVWHYWVENGITSGGAFGSHE 227


>gi|5764077|emb|CAB53367.1| necpain [Necator americanus]
          Length = 339

 Score =  115 bits (287), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 68/199 (34%), Positives = 97/199 (48%), Gaps = 54/199 (27%)

Query: 115 CRPYEIAPCEHHVNGTRPS-CDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           C+PY   PC +H N      C      TP+C + CQ  Y  PYKKD  +  KSY + ++E
Sbjct: 184 CKPYAFHPCGNHENQVYYGVCPKGSWPTPRCEKFCQRGYIKPYKKDKFYAKKSYWLPNDE 243

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K I  +I ++GPV+ A                                            
Sbjct: 244 KEIRLDIMKNGPVQAA-------------------------------------------- 259

Query: 234 FTVFDDLILYK-------SGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
           F V++D  LYK        G   GGHA++I+GWG+D  +   YWLIANSW+ DWG++G F
Sbjct: 260 FDVYEDFKLYKRGIYKHKEGIQTGGHAVKIIGWGKDNGTD--YWLIANSWSKDWGESGFF 317

Query: 287 KILRGKDECGIESSITAGV 305
           +++RG+++C IE  ITAG+
Sbjct: 318 RMVRGENDCEIEDMITAGI 336


>gi|409905640|gb|AFV46426.1| cysteine protease C [Leishmania donovani]
          Length = 345

 Score =  115 bits (287), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 93/297 (31%), Positives = 117/297 (39%), Gaps = 107/297 (36%)

Query: 72  PELIGYSEVDEDLPANFDSRTKWPNCPTI---REIRDQGS-------------------- 108
           P      E+ +DLP  FD+   WP C TI   R+  + GS                    
Sbjct: 91  PRNFSVVEMQQDLPEFFDAAEHWPMCVTISEIRDQSNCGSCWAIAAVEAISDRYCTLGGV 150

Query: 109 ----------------CG-SCWG-------------------CRPYEIAPCEHHVNGTR- 131
                           CG  C+G                   C+PY   PC HH N  + 
Sbjct: 151 PDRRISTSNLLSCCFICGFGCYGGIPTMAWLWWVWVGITTEVCQPYPFGPCSHHGNSDKY 210

Query: 132 PSCDASKGHTPKCVRECQ--ENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGA 189
           P C  +   TPKC   C+  E   V YK     G  SYSV   EK +M E+  +GP+E  
Sbjct: 211 PPCPNTIYDTPKCNTTCEKSEMDLVKYK-----GGTSYSVK-GEKELMIELMTNGPLEVT 264

Query: 190 FTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKAL 249
             V+ D + YKSG +                                      + SG  L
Sbjct: 265 MQVYSDFVGYKSGVY-------------------------------------KHVSGDLL 287

Query: 250 GGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           GGHA++++GWG   +    YW IANSWNTDWGD G F I RG +ECGIES   AG P
Sbjct: 288 GGHAVKLVGWGT--QGGVPYWKIANSWNTDWGDKGYFLIQRGSNECGIESGGVAGTP 342



 Score = 38.9 bits (89), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 15/25 (60%), Positives = 18/25 (72%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVS 32
           +CGFGC GG P MAW +WV  GI +
Sbjct: 166 ICGFGCYGGIPTMAWLWWVWVGITT 190


>gi|291000228|ref|XP_002682681.1| predicted protein [Naegleria gruberi]
 gi|284096309|gb|EFC49937.1| predicted protein [Naegleria gruberi]
          Length = 225

 Score =  115 bits (287), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 83/267 (31%), Positives = 110/267 (41%), Gaps = 92/267 (34%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCWGCR--------------------------- 116
           LP +FDSR KWP C  I  IR+Q  CGSCW C+                           
Sbjct: 2   LPESFDSREKWPTC--IHPIRNQEQCGSCWACKNLFIQSSEVLSDRFCIASGGKVNVVLS 59

Query: 117 PYEIAPCE------------------HHVNGTRPSC---DASKGHTPKCVRECQENYDVP 155
           P ++  C                    H       C    +  G  P C + C  N    
Sbjct: 60  PQDLVSCNWYNAGCDGGILWAAWIYLKHTGIVTDQCLPYSSGNGVAPSCPKYC--NGTST 117

Query: 156 YKKDLNFGAKS-YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAM 214
               + + AK  Y V S  + IM EI  +GPV+  F+V+ D + YKSG +          
Sbjct: 118 PIDSVKYKAKDWYEVGSIAEKIMNEIATNGPVQSGFSVYQDFMSYKSGVY---------- 167

Query: 215 SLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIAN 274
                                       +++G  LGGHAI+I+GWG +  +  KYWL+AN
Sbjct: 168 ---------------------------THQTGSFLGGHAIKIVGWGVE--NNVKYWLVAN 198

Query: 275 SWNTDWGDNGLFKILRGKDECGIESSI 301
           SW  DWG NGLFKI RG +ECGIE+ +
Sbjct: 199 SWGPDWGLNGLFKIKRGDNECGIEADV 225


>gi|159179|gb|AAA29178.1| cysteine proteinase, partial [Haemonchus contortus]
          Length = 341

 Score =  114 bits (286), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 63/198 (31%), Positives = 97/198 (48%), Gaps = 53/198 (26%)

Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEK 174
           CRPY I PC HH N T       +  TP C ++CQ  Y   Y+ D  +G  ++ +  + +
Sbjct: 184 CRPYPIHPCGHHGNDTYYGECPEEASTPSCKKKCQPGYRKLYRMDKRYGTDAFQLPKSVE 243

Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
           +I KE+ ++GPV  +F                                            
Sbjct: 244 AIQKELLKNGPVTASFA------------------------------------------- 260

Query: 235 TVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFK 287
            V++D  LYKSG       +  G HA++++GWG + ++   YWLIANSW+ DWG+NG F+
Sbjct: 261 -VYEDFSLYKSGIYRHTAGELRGYHAVKMIGWGTENRTD--YWLIANSWHDDWGENGYFR 317

Query: 288 ILRGKDECGIESSITAGV 305
           I+RG ++CGIE ++ AG+
Sbjct: 318 IIRGINDCGIEENVAAGL 335



 Score = 43.1 bits (100), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 18/32 (56%), Positives = 24/32 (75%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CGFGC+GG+   AW Y+  +G+VSGG Y SK+
Sbjct: 151 CGFGCDGGWSIKAWEYFTYAGLVSGGEYRSKR 182



 Score = 42.4 bits (98), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 21/48 (43%), Positives = 28/48 (58%), Gaps = 3/48 (6%)

Query: 69  NRLPELIGYS--EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           N+ P LI     E ++D+P  +D R  W NC +   IRDQ +CGSCW 
Sbjct: 69  NQNPNLIVKDDPEPEDDIPEEYDPRKIWSNCTSFY-IRDQANCGSCWA 115


>gi|157167368|ref|XP_001653891.1| cathepsin b [Aedes aegypti]
 gi|108874250|gb|EAT38475.1| AAEL009642-PA [Aedes aegypti]
          Length = 332

 Score =  114 bits (286), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 66/202 (32%), Positives = 95/202 (47%), Gaps = 59/202 (29%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY   PC +   G  P        TP C   C E YD  Y++D  +G+ +Y + ++E
Sbjct: 183 GCKPYPFKPCLYPFVGCHPE------KTPSCTHHCTEGYDGTYRRDKYYGSAAYKLPNDE 236

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           + I  EI  +GPVE                                              
Sbjct: 237 RMIQLEIMTNGPVESG-------------------------------------------- 252

Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
           F+V+ DL LYK+G       + +G HA+R++GWG++      YWLIANS+  DWG++G F
Sbjct: 253 FSVYQDLYLYKTGVYQHVVGREVGKHAVRLIGWGKERGVP--YWLIANSYGEDWGEHGYF 310

Query: 287 KILRGKDECGIESSITAGVPKL 308
           K LRG +  GIES + AG+PK+
Sbjct: 311 KFLRGSNHLGIESVVIAGLPKV 332



 Score = 40.8 bits (94), Expect = 0.85,   Method: Compositional matrix adjust.
 Identities = 24/56 (42%), Positives = 32/56 (57%), Gaps = 4/56 (7%)

Query: 9   CGFGCNGGF-PGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPD 63
           CG GCNGGF  G +++YWV  G+VSG AY S    K       +  L  ++G HP+
Sbjct: 150 CGNGCNGGFLDGTSFQYWVDVGLVSGAAYNSTDGCKPYPF---KPCLYPFVGCHPE 202


>gi|145498570|ref|XP_001435272.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124402403|emb|CAK67875.1| unnamed protein product [Paramecium tetraurelia]
          Length = 325

 Score =  114 bits (286), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 70/197 (35%), Positives = 87/197 (44%), Gaps = 53/197 (26%)

Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEK 174
           CRPY   PC+HHV+  +         TP CV+ C       Y  D      SYSVSS  +
Sbjct: 172 CRPYTFPPCDHHVDDGKYGPCGDSQPTPACVKSCTAQSGRNYDSDKIRSIDSYSVSSKVE 231

Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
            I  EI   GPVE +                                            F
Sbjct: 232 QIQNEIMTFGPVEAS--------------------------------------------F 247

Query: 235 TVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFK 287
           TV++D + YKSG         LGGHA++I+GWG ++     YWL+ NSWN  WG+NGLFK
Sbjct: 248 TVYEDFLTYKSGVYQNVAGANLGGHAVKIIGWGVEKNVP--YWLVVNSWNEGWGENGLFK 305

Query: 288 ILRGKDECGIESSITAG 304
           ILRG +  GIE  I AG
Sbjct: 306 ILRGSNHVGIEGGIYAG 322



 Score = 58.5 bits (140), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 29/67 (43%), Positives = 42/67 (62%), Gaps = 8/67 (11%)

Query: 51  RAHLKSWMGV---HPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQG 107
            A LK+ MG     PD+     +LPE     E + ++P +FD+R +WPNC +I+E+RDQ 
Sbjct: 44  EATLKTQMGTFLDEPDFM----KLPESTVQFE-NLEIPESFDARQQWPNCESIKEVRDQS 98

Query: 108 SCGSCWG 114
           +CGSCW 
Sbjct: 99  TCGSCWA 105



 Score = 41.2 bits (95), Expect = 0.63,   Method: Compositional matrix adjust.
 Identities = 16/29 (55%), Positives = 20/29 (68%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYG 37
           CG GCNGGFP  AW Y+   G+V+G  +G
Sbjct: 139 CGMGCNGGFPSGAWNYFKNKGLVTGDLFG 167


>gi|54289256|gb|AAV31918.1| putative vitellogenic cathepsin B [Aedes aegypti]
          Length = 332

 Score =  114 bits (286), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 66/202 (32%), Positives = 95/202 (47%), Gaps = 59/202 (29%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY   PC +   G  P        TP C   C E YD  Y++D  +G+ +Y + ++E
Sbjct: 183 GCKPYPFKPCLYPFVGCHPE------KTPSCTHHCTEGYDGTYRRDKYYGSAAYKLPNDE 236

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           + I  EI  +GPVE                                              
Sbjct: 237 RMIQLEIMTNGPVESG-------------------------------------------- 252

Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
           F+V+ DL LYK+G       + +G HA+R++GWG++      YWLIANS+  DWG++G F
Sbjct: 253 FSVYQDLYLYKTGVYQHVVGREVGKHAVRLIGWGKERGVP--YWLIANSYGEDWGEHGYF 310

Query: 287 KILRGKDECGIESSITAGVPKL 308
           K LRG +  GIES + AG+PK+
Sbjct: 311 KFLRGSNHLGIESVVIAGLPKV 332



 Score = 39.7 bits (91), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 23/56 (41%), Positives = 32/56 (57%), Gaps = 4/56 (7%)

Query: 9   CGFGCNGGF-PGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPD 63
           CG GCNGGF  G +++YWV  G+VSG AY +    K       +  L  ++G HP+
Sbjct: 150 CGNGCNGGFLDGTSFQYWVDVGLVSGAAYNNTDGCKPYPF---KPCLYPFVGCHPE 202


>gi|204022098|dbj|BAG71146.1| cathepsin B-N2 [Tuberaphis sumatrana]
          Length = 334

 Score =  114 bits (286), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 67/195 (34%), Positives = 97/195 (49%), Gaps = 44/195 (22%)

Query: 114 GCRPYEIAPC--EHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSS 171
           GC+PY + PC  + + N T       K H  +C R C  N D+ +K+D ++   +Y ++ 
Sbjct: 182 GCQPYRVPPCPLDEYGNNTCSGKPTEKNH--RCTRMCYGNQDLDFKEDHHYTRDAYYLTY 239

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
              +I  ++  +GP+E +F V+DD   YKSG +    N T                    
Sbjct: 240 G--TIQNDVLAYGPIEASFEVYDDFPSYKSGVYTKMENATY------------------- 278

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
                            LGGHA++++GWGE+      YWL+ NSWN  WGD GLFKI RG
Sbjct: 279 -----------------LGGHAVKLIGWGEEYGV--PYWLLVNSWNDQWGDQGLFKIRRG 319

Query: 292 KDECGIESSITAGVP 306
            +ECGI++S T GVP
Sbjct: 320 TNECGIDNSTTGGVP 334



 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 52/104 (50%), Gaps = 26/104 (25%)

Query: 36  YGSKQA---EKNSLSNIPRAHLKSW-MGVHPDYNLPANRLPELIG--------------- 76
           Y ++QA   E++ +++I  A+ K+W  GV+ D  L  +   +L+G               
Sbjct: 13  YQTEQAYFLEEDYINHI-NANAKTWKAGVNFDPKLSIDSFVKLLGSKGVQAAKQASPDMF 71

Query: 77  ------YSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
                 Y+     +P+ FD+R KW  C TI E+RDQG CGSCW 
Sbjct: 72  KTHDEAYNNWSNRIPSYFDARKKWRKCLTIGEVRDQGHCGSCWA 115



 Score = 40.4 bits (93), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 17/33 (51%), Positives = 23/33 (69%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
           CGFGC+GG+P  AW  + K G+V+GG Y S + 
Sbjct: 150 CGFGCSGGYPIKAWERFKKHGLVTGGNYESGEG 182


>gi|261328564|emb|CBH11542.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like,
           putative [Trypanosoma brucei gambiense DAL972]
          Length = 340

 Score =  114 bits (285), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 86/298 (28%), Positives = 112/298 (37%), Gaps = 100/298 (33%)

Query: 66  LPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEH 125
           LP  R  E     E    LP++FDS   WPNCPTI +I DQ +CGSCW            
Sbjct: 80  LPKRRFTE----EEARAPLPSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAASAMSDRF 135

Query: 126 HVNGTRPSCDASKGHTPKCVREC------------------------------------- 148
              G       S G    C  +C                                     
Sbjct: 136 CTMGGVQDVHISAGDLLACCSDCGDGCNGGDPDRAWAYFSSTGLVSDYCQPYPFPHCSHH 195

Query: 149 -----------QENYDVP---YKKD------LNFGAKSYSVSSNEKSIMKEIYEHGPVEG 188
                      Q N+D P   Y  D      +N+ + +      E   M+E++  GP E 
Sbjct: 196 SKSKNGYPPCSQFNFDTPKCNYTCDDPTIPVVNYRSWTSYALQGEDDYMRELFFRGPFEV 255

Query: 189 AFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKA 248
           AF V++D I Y SG +                                      + SG+ 
Sbjct: 256 AFDVYEDFIAYNSGVYH-------------------------------------HVSGQY 278

Query: 249 LGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           LGGHA+R++GWG    +   YW IANSWNT+WG +G F I RG  ECGIE   +AG+P
Sbjct: 279 LGGHAVRLVGWG--TSNGVPYWKIANSWNTEWGMDGYFLIRRGSSECGIEDGGSAGIP 334


>gi|296863454|pdb|3HHI|A Chain A, Crystal Structure Of Cathepsin B From T. Brucei In Complex
           With Ca074
 gi|296863455|pdb|3HHI|B Chain B, Crystal Structure Of Cathepsin B From T. Brucei In Complex
           With Ca074
          Length = 325

 Score =  114 bits (285), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 92/322 (28%), Positives = 119/322 (36%), Gaps = 104/322 (32%)

Query: 46  LSNIPRAHLKSWMGVHPDYN----LPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIR 101
           + NI     K   GV    N    LP  R  E     E    LP++FDS   WPNCPTI 
Sbjct: 34  MQNITLREAKRLNGVIKKNNNASILPKRRFTE----EEARAPLPSSFDSAEAWPNCPTIP 89

Query: 102 EIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVREC------------- 148
           +I DQ +CGSCW               G       S G    C  +C             
Sbjct: 90  QIADQSACGSCWAVAAASAMSDRFCTMGGVQDVHISAGDLLACCSDCGDGCNGGDPDRAW 149

Query: 149 -----------------------------------QENYDVP---YKKD------LNFGA 164
                                              Q N+D P   Y  D      +N+ +
Sbjct: 150 AYFSSTGLVSDYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPKCDYTCDDPTIPVVNYRS 209

Query: 165 KSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDN 224
            +      E   M+E++  GP E AF V++D I Y SG +                    
Sbjct: 210 WTSYALQGEDDYMRELFFRGPFEVAFDVYEDFIAYNSGVYH------------------- 250

Query: 225 TSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNG 284
                             + SG+ LGGHA+R++GWG    +   YW IANSWNT+WG +G
Sbjct: 251 ------------------HVSGQYLGGHAVRLVGWG--TSNGVPYWKIANSWNTEWGMDG 290

Query: 285 LFKILRGKDECGIESSITAGVP 306
            F I RG  ECGIE   +AG+P
Sbjct: 291 YFLIRRGSSECGIEDGGSAGIP 312


>gi|72389769|ref|XP_845179.1| cysteine peptidase C (CPC) [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
 gi|427931064|pdb|4HWY|A Chain A, Trypanosoma Brucei Procathepsin B Solved From 40 Fs
           Free-electron Laser Pulse Data By Serial Femtosecond
           X-ray Crystallography
 gi|40557577|gb|AAR88085.1| cathepsin B-like cysteine protease [Trypanosoma brucei]
 gi|62360039|gb|AAX80461.1| cysteine peptidase C (CPC) [Trypanosoma brucei]
 gi|70801714|gb|AAZ11620.1| cysteine peptidase C (CPC) [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
          Length = 340

 Score =  114 bits (285), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 86/298 (28%), Positives = 112/298 (37%), Gaps = 100/298 (33%)

Query: 66  LPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEH 125
           LP  R  E     E    LP++FDS   WPNCPTI +I DQ +CGSCW            
Sbjct: 80  LPKRRFTE----EEARAPLPSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAASAMSDRF 135

Query: 126 HVNGTRPSCDASKGHTPKCVREC------------------------------------- 148
              G       S G    C  +C                                     
Sbjct: 136 CTMGGVQDVHISAGDLLACCSDCGDGCNGGDPDRAWAYFSSTGLVSDYCQPYPFPHCSHH 195

Query: 149 -----------QENYDVP---YKKD------LNFGAKSYSVSSNEKSIMKEIYEHGPVEG 188
                      Q N+D P   Y  D      +N+ + +      E   M+E++  GP E 
Sbjct: 196 SKSKNGYPPCSQFNFDTPKCNYTCDDPTIPVVNYRSWTSYALQGEDDYMRELFFRGPFEV 255

Query: 189 AFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKA 248
           AF V++D I Y SG +                                      + SG+ 
Sbjct: 256 AFDVYEDFIAYNSGVYH-------------------------------------HVSGQY 278

Query: 249 LGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           LGGHA+R++GWG    +   YW IANSWNT+WG +G F I RG  ECGIE   +AG+P
Sbjct: 279 LGGHAVRLVGWG--TSNGVPYWKIANSWNTEWGMDGYFLIRRGSSECGIEDGGSAGIP 334


>gi|355332948|pdb|3MOR|A Chain A, Crystal Structure Of Cathepsin B From Trypanosoma Brucei
 gi|355332949|pdb|3MOR|B Chain B, Crystal Structure Of Cathepsin B From Trypanosoma Brucei
          Length = 317

 Score =  114 bits (285), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 92/322 (28%), Positives = 119/322 (36%), Gaps = 104/322 (32%)

Query: 46  LSNIPRAHLKSWMGVHPDYN----LPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIR 101
           + NI     K   GV    N    LP  R  E     E    LP++FDS   WPNCPTI 
Sbjct: 33  MQNITLREAKRLNGVIKKNNNASILPKRRFTE----EEARAPLPSSFDSAEAWPNCPTIP 88

Query: 102 EIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVREC------------- 148
           +I DQ +CGSCW               G       S G    C  +C             
Sbjct: 89  QIADQSACGSCWAVAAASAMSDRFCTMGGVQDVHISAGDLLACCSDCGDGCNGGDPDRAW 148

Query: 149 -----------------------------------QENYDVP---YKKD------LNFGA 164
                                              Q N+D P   Y  D      +N+ +
Sbjct: 149 AYFSSTGLVSDYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPKCNYTCDDPTIPVVNYRS 208

Query: 165 KSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDN 224
            +      E   M+E++  GP E AF V++D I Y SG +                    
Sbjct: 209 WTSYALQGEDDYMRELFFRGPFEVAFDVYEDFIAYNSGVYH------------------- 249

Query: 225 TSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNG 284
                             + SG+ LGGHA+R++GWG    +   YW IANSWNT+WG +G
Sbjct: 250 ------------------HVSGQYLGGHAVRLVGWG--TSNGVPYWKIANSWNTEWGMDG 289

Query: 285 LFKILRGKDECGIESSITAGVP 306
            F I RG  ECGIE   +AG+P
Sbjct: 290 YFLIRRGSSECGIEDGGSAGIP 311


>gi|157092993|gb|ABV22151.1| cysteine proteinase [Perkinsus chesapeaki]
          Length = 396

 Score =  114 bits (285), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 72/203 (35%), Positives = 95/203 (46%), Gaps = 62/203 (30%)

Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVREC-QENYDVPYKKDLNF-GAKSYSVS 170
           GC PY+I PC H+ N T  P C  +K   P C   C  + YD P +KD +F   +S S  
Sbjct: 241 GCWPYDIPPCAHYTNSTLYPKCPKTKYDFPTCQESCPNKKYDTPMEKDRHFVEEESLSAL 300

Query: 171 SNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA 230
            +  +I KEI  +GP                                             
Sbjct: 301 RSIDAIKKEIMTNGP--------------------------------------------V 316

Query: 231 EGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDN 283
             ++ V+DD + YKSG        ALGGHA++I+GWGED      YWL+ NSWN +WGDN
Sbjct: 317 SASYLVYDDFLTYKSGVYKRTSHNALGGHAVKIIGWGED------YWLVVNSWNKNWGDN 370

Query: 284 GLFKILRGKDECGIESSITAGVP 306
           G+FKI  G  +CGIE ++ AG P
Sbjct: 371 GMFKI--GCGQCGIEDNVLAGTP 391



 Score = 40.0 bits (92), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 21/54 (38%), Positives = 31/54 (57%), Gaps = 2/54 (3%)

Query: 67  PANRLPELIGYSEVDEDLPANFDSRTKWPNCPT-IREIRDQGSCGSCWGCRPYE 119
           P N   +L    E+ +DLP +F++  ++  C + I  IRDQ +CGSCW   P E
Sbjct: 123 PENIREKLYTADEL-KDLPVSFNATEEFKECSSVIGHIRDQSACGSCWAFAPTE 175


>gi|156375635|ref|XP_001630185.1| predicted protein [Nematostella vectensis]
 gi|156217201|gb|EDO38122.1| predicted protein [Nematostella vectensis]
          Length = 311

 Score =  114 bits (285), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 89/295 (30%), Positives = 123/295 (41%), Gaps = 92/295 (31%)

Query: 59  GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPY 118
           G  P+ +LP    PE+     V E++P NFD+R +WP   +I  IR+QG CGSCW     
Sbjct: 64  GAWPEGSLP----PEI--EVRVAENIPENFDARKQWPG--SIHPIRNQGQCGSCWAFGAS 115

Query: 119 EIAPCEHHVNGTRP-----------SCDASKG-------------------HTPKC---- 144
           E+      +                 CD                        T +C    
Sbjct: 116 EVLSDRFAIASKNQIYVTLSAQQLVDCDLDNSGCSGGWPINAWNYMVKTGLLTEQCYGPY 175

Query: 145 ------VRECQENYDVPYKKDLN---FGAKS-YSV-SSNEKSIMKEIYEHGPVEGAFTVF 193
                  R      D P++  +    + AKS Y + + N ++I  +I  +GPVE  FT+F
Sbjct: 176 YAKQYTCRLTANTTDCPWQPGVKARFYHAKSAYKLPAKNVEAIQTDIMNNGPVEADFTIF 235

Query: 194 DDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHA 253
            D   Y+SG +                                     ++ +GK LGGHA
Sbjct: 236 QDFYAYRSGIY-------------------------------------VHATGKQLGGHA 258

Query: 254 IRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           I+ILGWG ++     YWL ANSW  +WG  G FKI RG DECGIE  + AG+P L
Sbjct: 259 IKILGWGTEDNV--DYWLCANSWGANWGIQGYFKIRRGTDECGIEDGLAAGLPLL 311


>gi|170030060|ref|XP_001842908.1| cathepsin B [Culex quinquefasciatus]
 gi|167865914|gb|EDS29297.1| cathepsin B [Culex quinquefasciatus]
          Length = 320

 Score =  114 bits (284), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 61/166 (36%), Positives = 82/166 (49%), Gaps = 39/166 (23%)

Query: 142 PKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKS 201
           P C R CQ  Y + Y +DL +G  +Y V  NE +IM EIY++GPV   F VF D   YKS
Sbjct: 193 PTCSRTCQAGYPLTYSQDLKYGGSAYRVMWNENAIMTEIYQNGPVVVQFEVFADFYQYKS 252

Query: 202 GRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE 261
           G +                                      + +G   G HA+R++GWG 
Sbjct: 253 GVY-------------------------------------RHVTGATEGWHAVRVIGWGV 275

Query: 262 DEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPK 307
           +  +  KYWL+ANSW   WGD G FK +RG++  GIE  + AG+PK
Sbjct: 276 E--NGVKYWLVANSWGVRWGDKGFFKFVRGENHLGIEDFVYAGLPK 319



 Score = 37.7 bits (86), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 16/30 (53%), Positives = 20/30 (66%)

Query: 11  FGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           F C+GG+ G  W+YWV SG+ S G Y S Q
Sbjct: 147 FKCDGGYVGKTWQYWVDSGLTSEGPYKSGQ 176


>gi|328718094|ref|XP_003246386.1| PREDICTED: cathepsin B [Acyrthosiphon pisum]
          Length = 340

 Score =  114 bits (284), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 67/201 (33%), Positives = 92/201 (45%), Gaps = 56/201 (27%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY + PC +   G        +    +C R C  N D+ + +D  +   SY ++   
Sbjct: 185 GCEPYRVPPCPYDAEGHNTCAGKPRESNHRCTRMCYGNQDLDFDEDHRYTRDSYYLTYG- 243

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            SI K++  +GP+E +                                            
Sbjct: 244 -SIQKDVMTYGPIEAS-------------------------------------------- 258

Query: 234 FTVFDDLILYKSG--------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
           F V+DD   YKSG          LGGHA++++GWGE+      YWL+ NSWN DWGDNGL
Sbjct: 259 FDVYDDFPSYKSGVYVKSENATYLGGHAVKLIGWGEEYGVP--YWLMVNSWNADWGDNGL 316

Query: 286 FKILRGKDECGIESSITAGVP 306
           FKI RG +ECGI++S TAGVP
Sbjct: 317 FKIRRGTNECGIDNSTTAGVP 337



 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 19/39 (48%), Positives = 26/39 (66%)

Query: 76  GYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
            Y ++   +P +FD+R KW  C TI  +RDQG+CGSCW 
Sbjct: 80  AYDKLFGRIPRHFDARRKWRRCHTIGAVRDQGNCGSCWA 118



 Score = 42.7 bits (99), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 18/32 (56%), Positives = 23/32 (71%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CGFGCNGG+P  AW  + K G+V+GG Y S +
Sbjct: 153 CGFGCNGGYPIKAWERFKKRGLVTGGDYQSGE 184


>gi|201023315|ref|NP_001128400.1| cathepsin B-16D2 precursor [Acyrthosiphon pisum]
          Length = 340

 Score =  113 bits (283), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 67/201 (33%), Positives = 92/201 (45%), Gaps = 56/201 (27%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY + PC +   G        +    +C R C  N D+ + +D  +   SY ++   
Sbjct: 185 GCEPYRVPPCPYDAEGHNTCAGKPRESNHRCTRMCYGNQDLDFDEDHRYTRDSYYLTYG- 243

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            SI K++  +GP+E +                                            
Sbjct: 244 -SIQKDVMTYGPIEAS-------------------------------------------- 258

Query: 234 FTVFDDLILYKSG--------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
           F V+DD   YKSG          LGGHA++++GWGE+      YWL+ NSWN DWGDNGL
Sbjct: 259 FDVYDDFPSYKSGVYVKSENATYLGGHAVKLIGWGEEYGVP--YWLMVNSWNADWGDNGL 316

Query: 286 FKILRGKDECGIESSITAGVP 306
           FKI RG +ECGI++S TAGVP
Sbjct: 317 FKIRRGTNECGIDNSTTAGVP 337



 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 19/39 (48%), Positives = 25/39 (64%)

Query: 76  GYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
            Y  +   +P +FD+R KW  C TI  +RDQG+CGSCW 
Sbjct: 80  AYDNLFGRIPRHFDARRKWRRCHTIGAVRDQGNCGSCWA 118



 Score = 42.7 bits (99), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 18/32 (56%), Positives = 23/32 (71%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CGFGCNGG+P  AW  + K G+V+GG Y S +
Sbjct: 153 CGFGCNGGYPIKAWERFKKRGLVTGGDYQSGE 184


>gi|157167285|ref|XP_001658487.1| cathepsin b [Aedes aegypti]
 gi|108876478|gb|EAT40703.1| AAEL007590-PA [Aedes aegypti]
          Length = 313

 Score =  113 bits (283), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 64/208 (30%), Positives = 102/208 (49%), Gaps = 48/208 (23%)

Query: 103 IRDQGSCGSCWGCRPYEIAP-CEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYK-KDL 160
           +   G  GS  GC PY + P C     G  P         P C   C   Y+V    +D 
Sbjct: 149 VSSGGPYGSNQGCHPYPMPPSCPKPSEGDYPD-------EPNCSTRCNAGYNVTEDLRDR 201

Query: 161 NFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWT 220
            FG  +YS+ ++E+ IM++I+ +GPV+  F  ++D++ Y  G +                
Sbjct: 202 RFGRVAYSIPADERKIMEDIFVNGPVQAVFQWYEDIVNYSGGVY---------------- 245

Query: 221 IRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDW 280
                                 ++SG+  GGHA++++GWG ++ +K  YWL+ANSW   W
Sbjct: 246 ---------------------RHQSGRLKGGHAVKLIGWGVEDGTK--YWLVANSWGRVW 282

Query: 281 GDNGLFKILRGKDECGIESSITAGVPKL 308
           GD+G FK++RG++ CGIE ++ AG+P  
Sbjct: 283 GDDGFFKMVRGENHCGIEENVHAGLPSF 310



 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 30/75 (40%), Positives = 37/75 (49%), Gaps = 1/75 (1%)

Query: 38  SKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNC 97
           SK    N  +  P A +    GV P   L   RL   I     D  LP +FD+R +WP C
Sbjct: 17  SKILSSNLTTTSPFAWILDLPGV-PLEKLKETRLHPAINVFAEDLVLPKSFDARQQWPQC 75

Query: 98  PTIREIRDQGSCGSC 112
            ++ EIR QG CGSC
Sbjct: 76  SSLNEIRTQGCCGSC 90


>gi|71424150|ref|XP_812694.1| cysteine peptidase C (CPC) [Trypanosoma cruzi strain CL Brener]
 gi|70877506|gb|EAN90843.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi]
          Length = 333

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 87/262 (33%), Positives = 115/262 (43%), Gaps = 34/262 (12%)

Query: 72  PELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTR 131
           P      E+   L   FD+   WP CPT+ EIRDQ SCGSCW           +   G  
Sbjct: 80  PRQFSEEELRVPLQDRFDAGEAWPECPTVTEIRDQSSCGSCWAVAAASAISDRYCTLGGV 139

Query: 132 PSCDASKGHTPKCVRECQ------------ENYDV-----PYKKDLNFGAKSYSVSSNEK 174
                S G    C   C             E Y V      Y +   F + ++ V+S++ 
Sbjct: 140 RDLRISAGDLMSCCDVCGFGCNGGYPEVAWEYYAVHGIVSEYCQPYPFPSCAHHVNSSDL 199

Query: 175 SIMKEIYEHGPVEGAFTVFD-DLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           S     Y+        T     LI Y+       GN +  +S  +   R+       E +
Sbjct: 200 SPCSGEYDTPTCNSTCTDKKIPLIKYR-------GNTSYVLSGEEPFKRELILNGPFEVS 252

Query: 234 FTVFDDLILYK-------SGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
           F+V+ D + Y        +G  LGGHA+RI+GWG  E + E YW IANSWN +WG NG F
Sbjct: 253 FSVYADFVAYTGGVYKHVAGIFLGGHAVRIVGWG--ELNGEPYWKIANSWNREWGMNGYF 310

Query: 287 KILRGKDECGIESSITAGVPKL 308
            I RG DECGIE S  AG P++
Sbjct: 311 LIARGVDECGIEGSGVAGTPRI 332



 Score = 41.6 bits (96), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 16/25 (64%), Positives = 20/25 (80%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVS 32
           +CGFGCNGG+P +AW Y+   GIVS
Sbjct: 155 VCGFGCNGGYPEVAWEYYAVHGIVS 179


>gi|281208776|gb|EFA82951.1| peptidase C1A family protein [Polysphondylium pallidum PN500]
          Length = 1308

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 78/265 (29%), Positives = 111/265 (41%), Gaps = 92/265 (34%)

Query: 83  DLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYE--------------------IAP 122
           +LP NFD+  +WP CPTI  I++Q  CGSCW     E                    +  
Sbjct: 69  NLPTNFDAAQQWPQCPTIGAIQNQAECGSCWAFGAIESISDRFCIHKNESVQLSFQDLIT 128

Query: 123 CEHHVNG--------------------------TRPSCDASKG------HTPKCVRECQE 150
           C++  NG                          T P+C  ++       +TP C  +C  
Sbjct: 129 CDNQDNGCEGGDPYTAYKYVQKNGVVTSNCQPYTIPTCPPAQQPCMNFVNTPPCSAKC-A 187

Query: 151 NYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNE 210
           N  V +++DL+     Y+V  N  +I  EI  +GPVE  F V++D + YKSG +      
Sbjct: 188 NSSVNFQQDLHHLKTVYAVKPNVAAIQNEIVTNGPVEACFEVYEDFLGYKSGVY------ 241

Query: 211 TTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYW 270
                                           +KSGK LGGH I+I+G+G    +   YW
Sbjct: 242 -------------------------------THKSGKDLGGHCIKIVGFGVSNGTP--YW 268

Query: 271 LIANSWNTDWGDNGLFKILRGKDEC 295
           +  NSW T WG+NG+F I  GK+EC
Sbjct: 269 ICNNSWTTSWGNNGIFWIEAGKNEC 293


>gi|281200411|gb|EFA74631.1| hypothetical protein PPL_11599 [Polysphondylium pallidum PN500]
          Length = 311

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 78/277 (28%), Positives = 110/277 (39%), Gaps = 87/277 (31%)

Query: 72  PELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG----------------- 114
           PE +  S+V   +P +FDSRT WP C  +  + +QG CGSCW                  
Sbjct: 71  PEEVSVSKVA--VPNSFDSRTNWPGC--VHAVLNQGQCGSCWAFAASESLSDRLCIASQG 126

Query: 115 -----CRPYEIAPCE----HHVNGTRP---------------SC---DASKGHTPKCVRE 147
                  P  +  C+       NG  P               SC    +  G  P C +E
Sbjct: 127 AINVTLSPQALVSCDIEFNQGCNGGIPQMAWEYLELHGIPTDSCFPYTSGNGTAPDCQKE 186

Query: 148 CQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVP 207
           C +       K   F  K+    S+  +I   ++ +GP+EG   V+ D + Y SG +   
Sbjct: 187 CSDGSKYQLYKGKTFTLKT---CSSVAAIQANVFAYGPIEGTMDVYQDFMSYTSGVY--- 240

Query: 208 GNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKE 267
                                            ++    K LGGHAI+I+GWG D  S  
Sbjct: 241 ---------------------------------VMTPGSKLLGGHAIKIVGWGTDSTSGL 267

Query: 268 KYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAG 304
            YW++ NSW +DWG NG F I RG + CGI+   +AG
Sbjct: 268 DYWIVQNSWGSDWGMNGFFWIQRGTNMCGIDRDASAG 304


>gi|341888224|gb|EGT44159.1| hypothetical protein CAEBREN_15022 [Caenorhabditis brenneri]
          Length = 332

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 68/206 (33%), Positives = 102/206 (49%), Gaps = 43/206 (20%)

Query: 107 GSCGSCWGCRPYEIAPCEHHV-NGTRPSCDASKGHTPKCVREC--QENYDVPYKKDLNFG 163
           GS  + +GC+PY IAPC   V N T P+C  +   TP C ++C  +  Y V   KD ++G
Sbjct: 165 GSYETQFGCKPYSIAPCGKTVGNVTYPACTNTTLPTPSCEKKCTSKNGYPVDIDKDRHYG 224

Query: 164 AKSYSVSSNEK-SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
           A S     N +  I  ++  +GP+E  F V+DD + Y +G +                  
Sbjct: 225 ASSVDQLPNRQIEIQSDVMLNGPIETTFEVYDDFLQYTTGIY------------------ 266

Query: 223 DNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
                              ++ +G   G  ++RILGWG  E     YWL+ANSW  +WG+
Sbjct: 267 -------------------VHLTGNKQGHLSVRILGWGMYEGVP--YWLLANSWGKEWGE 305

Query: 283 NGLFKILRGKDECGIESSITAGVPKL 308
           NG F+ LRG +ECG+E++  + +PKL
Sbjct: 306 NGTFRALRGTNECGLEANCVSAMPKL 331


>gi|204022102|dbj|BAG71148.1| cathepsin B-N2 [Tuberaphis takenouchii]
          Length = 334

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 66/195 (33%), Positives = 97/195 (49%), Gaps = 44/195 (22%)

Query: 114 GCRPYEIAPC--EHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSS 171
           GC+PY + PC  + + N T     A K H  +C R C  N ++ +K+D  +   +Y +  
Sbjct: 182 GCQPYRVPPCPFDEYGNNTCRGKPAEKNH--RCTRMCYGNQNLDFKEDHRYTRDAYYL-- 237

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           N + I  ++  +GP+E ++ V+DD   YKSG +    N +                    
Sbjct: 238 NYQIIQNDLMTYGPIEASYDVYDDFPNYKSGVYMKTENASY------------------- 278

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
                            LGGHA++++GWGE+      YWL+ NSWN  WGD GLFKI RG
Sbjct: 279 -----------------LGGHAVKLIGWGEEYGV--PYWLLVNSWNDQWGDQGLFKIRRG 319

Query: 292 KDECGIESSITAGVP 306
            +ECGI++S T GVP
Sbjct: 320 TNECGIDNSTTGGVP 334



 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 18/39 (46%), Positives = 28/39 (71%)

Query: 76  GYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
            Y+ +   +P+NFD+R KW  C T+ ++RDQG+CG+CW 
Sbjct: 77  AYNSLPNRIPSNFDARKKWRKCSTVGKVRDQGNCGTCWA 115



 Score = 37.7 bits (86), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 16/30 (53%), Positives = 21/30 (70%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GC+GG+P  AW  + K G+V+GG Y S
Sbjct: 150 CGSGCHGGYPIKAWERFRKHGLVTGGDYNS 179


>gi|270012756|gb|EFA09204.1| cathepsin B precursor [Tribolium castaneum]
          Length = 369

 Score =  112 bits (281), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 88/305 (28%), Positives = 119/305 (39%), Gaps = 91/305 (29%)

Query: 57  WMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCP-TIREIRDQGSCGSCWG- 114
           ++G+HPD N      PE+         +P  FD+R  WP C   I  IR+QG C S W  
Sbjct: 49  FLGIHPDPNFK----PEIKEPQATQNVIPETFDAREYWPECADIIGNIRNQGKCSSSWAF 104

Query: 115 ---------------------CRPYEIAPCEHHV-------------------------- 127
                                  P ++  C H+                           
Sbjct: 105 AAAEVMSDRLCIATNGKVKIQLSPEDLIDCCHYCGNQCKGGYTYYAWNYFMLTGLVSGGD 164

Query: 128 ----NGTRPSCDASKGH-TPKCVRECQ-ENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIY 181
                G +P  + +    TP C   CQ + Y +PY  D +FG   Y +  NE +I  EI 
Sbjct: 165 YNTSTGCQPYSELNYYRITPPCNTTCQNDKYPIPYVSDKHFGDSIYYIPQNETAIQNEIL 224

Query: 182 EHG-PVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDL 240
             G PV  AF V+ D  +Y+ G                            E   T+ + +
Sbjct: 225 SGGGPVVAAFDVYGDFKIYRDG----------------------------EQHDTILEGV 256

Query: 241 ILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD-NGLFKILRGKDECGIES 299
            +Y SG   G  A++I+GWG +  +   YWL ANSW  DWG   G FKI RG +ECG E 
Sbjct: 257 YIYTSGALFGRTAVKIIGWGTE--NGWAYWLAANSWGKDWGALGGFFKIRRGTNECGFEE 314

Query: 300 SITAG 304
           SI AG
Sbjct: 315 SIIAG 319


>gi|1848229|gb|AAB48119.1| cathepsin B-like protease [Leishmania major]
          Length = 340

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 72/195 (36%), Positives = 95/195 (48%), Gaps = 48/195 (24%)

Query: 115 CRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENYD--VPYKKDLNFGAKSYSVSS 171
           C+PY   PC HH N  + P C ++   TPKC   C+ N    V YK     G+ SYSV  
Sbjct: 188 CQPYPFDPCSHHGNSEKYPPCPSTIYDTPKCNTTCERNEMDLVKYK-----GSTSYSVK- 241

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
            EK +M E+  +GP+E    V+ D + YKSG                             
Sbjct: 242 GEKELMIELMTNGPLELTMQVYSDFVGYKSG----------------------------- 272

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
               V+  ++    G  LGGHA++++GWG  +     YW +ANSWNTDWGD G F I RG
Sbjct: 273 ----VYKHVL----GDFLGGHAVKLVGWGTQDGVP--YWKVANSWNTDWGDKGYFLIQRG 322

Query: 292 KDECGIESSITAGVP 306
            +EC IES   AG+P
Sbjct: 323 NNECKIESGGVAGIP 337


>gi|86451924|gb|ABC97357.1| cathepsin B [Streblomastix strix]
          Length = 283

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 86/265 (32%), Positives = 115/265 (43%), Gaps = 88/265 (33%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCW---------------GCRPYEIAP-----C 123
           +P  FD+R KWP+   I  +RDQG CGSCW               GC   +IAP     C
Sbjct: 63  VPDTFDAREKWPD--AILPVRDQGECGSCWAFSIAETIGDRLGVLGCSRGDIAPEDLVSC 120

Query: 124 EHHVNGTRPSCDASKGHTPKCVRECQEN-----YDVPYK----------KDLNFGAKSYS 168
           +   +G    CD   G        CQEN       +PYK          +    G+  Y 
Sbjct: 121 DIFDDG----CDG--GFIDMAWDWCQENGLTTEECIPYKAGEGVPSPCPETCEDGSAIYR 174

Query: 169 VSS------NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
                    +   I  EIYE+GPV   F V+ D + YKSG +                  
Sbjct: 175 TPIESYRYIDADDIQGEIYEYGPVSMGFIVYSDFMSYKSGVY------------------ 216

Query: 223 DNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
                              ++++G   GGHA+ I+GWG +++    YWL+ NSW TDWG+
Sbjct: 217 -------------------VHQAGYIEGGHAVLIVGWGVEDEVP--YWLVQNSWGTDWGE 255

Query: 283 NGLFKILRGKDECGIESSITAGVPK 307
           NG FKILRG D C  ES++TAG P+
Sbjct: 256 NGFFKILRGSDHCECESNVTAGYPE 280


>gi|359427491|gb|AEV46267.1| eimeripain [Eimeria tenella]
          Length = 512

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 67/202 (33%), Positives = 97/202 (48%), Gaps = 49/202 (24%)

Query: 111 SCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQE----NYDVPYKKDLNFGAKS 166
           SCW   PYEI  C HH  G  P C+      PKC ++C+E    +   P+K DL+F   +
Sbjct: 341 SCW---PYEIPFCRHHSEGPYPKCEGPLPKAPKCRKDCEEAEYTSKVKPFKDDLHFATSA 397

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           YSV   ++ I +E+ E+G + GAF V++D +LYK G +                      
Sbjct: 398 YSVEGRDQ-IKRELMENGTLTGAFLVYEDFLLYKEGVYH--------------------- 435

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
                           + +G  +GGHA++++G+G ++     YWL  NSWN  WGD G F
Sbjct: 436 ----------------HVTGMPMGGHAVKVIGFGNED--GRDYWLAVNSWNEYWGDKGTF 477

Query: 287 KILRGKDECGIESSITAGVPKL 308
           KI  G  E GI+     G PK+
Sbjct: 478 KIEMG--EAGIDKEFCGGEPKV 497



 Score = 40.8 bits (94), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 22/69 (31%), Positives = 37/69 (53%), Gaps = 5/69 (7%)

Query: 51  RAHLKSWMGVHPDYNLPANRLPELIG---YSEVDEDLPAN-FDSRTKWPNCP-TIREIRD 105
           + H+ +++  + D + P   L E +    ++E  + L  + FD+R  +P C   I  +RD
Sbjct: 200 KRHMGTYLSFYSDPDKPEVPLGEPLPVKVFAETQQVLETDKFDAREAFPQCAEVIGHVRD 259

Query: 106 QGSCGSCWG 114
           QG CGSCW 
Sbjct: 260 QGDCGSCWA 268



 Score = 38.9 bits (89), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 15/26 (57%), Positives = 20/26 (76%)

Query: 11  FGCNGGFPGMAWRYWVKSGIVSGGAY 36
           FGC+GG P MAWR++   G+V+GG Y
Sbjct: 308 FGCSGGQPRMAWRWFSNDGVVTGGDY 333


>gi|339831342|gb|AEK20867.1| cathepsin B [Eimeria tenella]
          Length = 512

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 67/202 (33%), Positives = 97/202 (48%), Gaps = 49/202 (24%)

Query: 111 SCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQE----NYDVPYKKDLNFGAKS 166
           SCW   PYEI  C HH  G  P C+      PKC ++C+E    +   P+K DL+F   +
Sbjct: 341 SCW---PYEIPFCRHHSEGPYPKCEGPLPKAPKCRKDCEEAEYTSKVKPFKDDLHFATSA 397

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           YSV   ++ I +E+ E+G + GAF V++D +LYK G +                      
Sbjct: 398 YSVEGRDQ-IKRELMENGTLTGAFLVYEDFLLYKEGVYH--------------------- 435

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
                           + +G  +GGHA++++G+G ++     YWL  NSWN  WGD G F
Sbjct: 436 ----------------HVTGMPMGGHAVKVIGFGNED--GRDYWLAVNSWNEYWGDKGTF 477

Query: 287 KILRGKDECGIESSITAGVPKL 308
           KI  G  E GI+     G PK+
Sbjct: 478 KIEMG--EAGIDKEFCGGEPKV 497



 Score = 40.8 bits (94), Expect = 0.71,   Method: Compositional matrix adjust.
 Identities = 22/69 (31%), Positives = 37/69 (53%), Gaps = 5/69 (7%)

Query: 51  RAHLKSWMGVHPDYNLPANRLPELIG---YSEVDEDLPAN-FDSRTKWPNCP-TIREIRD 105
           + H+ +++  + D + P   L E +    ++E  + L  + FD+R  +P C   I  +RD
Sbjct: 200 KRHMGTYLSFYSDPDKPEVPLGEPLPVKVFAETQQVLETDKFDAREAFPQCAEVIGHVRD 259

Query: 106 QGSCGSCWG 114
           QG CGSCW 
Sbjct: 260 QGDCGSCWA 268



 Score = 38.9 bits (89), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 15/26 (57%), Positives = 20/26 (76%)

Query: 11  FGCNGGFPGMAWRYWVKSGIVSGGAY 36
           FGC+GG P MAWR++   G+V+GG Y
Sbjct: 308 FGCSGGQPRMAWRWFSNDGVVTGGDY 333


>gi|291236586|ref|XP_002738220.1| PREDICTED: cathepsin B preproprotein-like [Saccoglossus
           kowalevskii]
          Length = 93

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 63/132 (47%), Positives = 75/132 (56%), Gaps = 39/132 (29%)

Query: 177 MKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTV 236
           M EI ++GPVEGAFTV+ D   YKSG +                                
Sbjct: 1   MAEIQKYGPVEGAFTVYADFPSYKSGVY-------------------------------- 28

Query: 237 FDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECG 296
                 +++G+ALGGHAI+ILGWG ++     YWL+ANSWN DWGD G FKILRG DECG
Sbjct: 29  -----QHETGEALGGHAIKILGWGNED--GHDYWLVANSWNEDWGDQGFFKILRGVDECG 81

Query: 297 IESSITAGVPKL 308
           IES ITAG PKL
Sbjct: 82  IESQITAGSPKL 93


>gi|161343865|tpg|DAA06113.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 335

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 64/195 (32%), Positives = 94/195 (48%), Gaps = 40/195 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY + PC     G             KC ++C  +  + YKK+      +Y +S+  
Sbjct: 181 GCQPYRVPPCVRDDEGHNSCSGQPTERNHKCSKKCYGDETINYKKNHYKTKDAYYLSNT- 239

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            ++ K+   +GP+E +F V+DD   Y+SG +    N +                      
Sbjct: 240 -TMQKDTMVYGPIEASFDVYDDFTSYESGVYQKTENAS---------------------- 276

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                          LGGHA++++GWG +E +   YWL+ NSW   WGD G+FKILRG D
Sbjct: 277 --------------YLGGHAVKMIGWGVEEGT--PYWLMVNSWGEQWGDKGMFKILRGTD 320

Query: 294 ECGIESSITAGVPKL 308
           ECG+ESS TAGVP +
Sbjct: 321 ECGVESSCTAGVPSV 335



 Score = 53.9 bits (128), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 27/75 (36%), Positives = 41/75 (54%)

Query: 40  QAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPT 99
           +A++N   N PR  +   +G      L  + + E       + ++P  FDSR +W NC T
Sbjct: 40  KAKQNFPENTPREDIVRLLGSKRLLGLNKSPIKENDILYVDNGEVPEFFDSRLEWKNCKT 99

Query: 100 IREIRDQGSCGSCWG 114
           I E+R+QG+CGSCW 
Sbjct: 100 IGEVRNQGNCGSCWA 114



 Score = 43.9 bits (102), Expect = 0.082,   Method: Compositional matrix adjust.
 Identities = 17/30 (56%), Positives = 23/30 (76%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CGFGCNGG P  AW+Y+ + G+V+GG Y +
Sbjct: 149 CGFGCNGGNPLKAWKYFKRHGVVTGGNYNT 178


>gi|389593817|ref|XP_003722157.1| cysteine peptidase C (CPC) [Leishmania major strain Friedlin]
 gi|321438655|emb|CBZ12414.1| cysteine peptidase C (CPC) [Leishmania major strain Friedlin]
          Length = 340

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 85/292 (29%), Positives = 115/292 (39%), Gaps = 97/292 (33%)

Query: 72  PELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTR 131
           P      E+ +DLP  FD+   WP C TI EIRDQ +CGSCW     E     +   G  
Sbjct: 86  PRNFSVEELQQDLPEFFDAAEHWPMCLTISEIRDQSNCGSCWAIAAVEAISDRYCTFGGV 145

Query: 132 PSCDASKGHTPKC--------------------------VRECQEN-------------- 151
           P    S  +   C                            +CQ                
Sbjct: 146 PDRRMSTSNLLSCCFICGLGCHGGIPTVAWLWWVWVGIATEDCQPYPFDPCSHHGNSEKY 205

Query: 152 -------YDVPY------KKDLNF----GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFD 194
                  YD P       + +++     G+ SYSV   EK +M E+  +GP+E    V+ 
Sbjct: 206 PPCPSTIYDTPKCNTTCERSEMDLVKYKGSTSYSV-KGEKELMIELMTNGPLELTMQVYS 264

Query: 195 DLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAI 254
           D + YKSG                                 V+  ++    G+ LGGHA+
Sbjct: 265 DFVGYKSG---------------------------------VYKHVL----GEFLGGHAV 287

Query: 255 RILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           +++GWG  +     YW +ANSWNTDWGD G F I RG +EC IES   AG+P
Sbjct: 288 KLVGWGTQDGV--PYWKVANSWNTDWGDKGYFLIQRGNNECKIESGGVAGIP 337


>gi|161343871|tpg|DAA06116.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 276

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 61/193 (31%), Positives = 91/193 (47%), Gaps = 40/193 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY + PC +   G             +C R C  + D+ + +D  +    Y ++   
Sbjct: 122 GCEPYRVPPCPNDDQGNNTCSGQPMEKNHRCTRMCYGDQDLDFDEDHRYTRDHYYLTY-- 179

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           + I K++  +GP+E +F V+DD   YKSG +    N +                      
Sbjct: 180 RGIQKDVINYGPIEASFDVYDDFPSYKSGIYVKSENAS---------------------- 217

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                          LGGH+++++GWGE+      YWL+ NSWN DWGD GLFKI RG +
Sbjct: 218 --------------YLGGHSVKLIGWGEEYGV--LYWLMVNSWNADWGDKGLFKIRRGTN 261

Query: 294 ECGIESSITAGVP 306
           ECG+++S T GVP
Sbjct: 262 ECGVDNSTTGGVP 274



 Score = 45.4 bits (106), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 18/32 (56%), Positives = 23/32 (71%)

Query: 82  EDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
           +++P  FD+R KW  C TI E+RDQG CGS W
Sbjct: 23  QEIPIKFDARKKWLRCKTIGEVRDQGHCGSDW 54



 Score = 38.1 bits (87), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 16/33 (48%), Positives = 23/33 (69%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
           CG GC+GG+P  AW+ + K G+V+GG Y S + 
Sbjct: 90  CGDGCSGGYPIRAWKRYKKHGLVTGGNYKSGEG 122


>gi|17565158|ref|NP_503384.1| Protein W07B8.1 [Caenorhabditis elegans]
 gi|351059396|emb|CCD74286.1| Protein W07B8.1 [Caenorhabditis elegans]
          Length = 335

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 63/206 (30%), Positives = 101/206 (49%), Gaps = 42/206 (20%)

Query: 107 GSCGSCWGCRPYEIAPCEHHV-NGTRPSCDASKGHTPKCVRECQEN--YDVPYKKDLNFG 163
           GS  S +GC+PY I PC   V N T P+C  +   TP C ++C     Y +   KD ++G
Sbjct: 169 GSYESQFGCKPYSIPPCGKTVGNVTYPACTNTTSPTPSCEKKCTSRIGYPIDIDKDRHYG 228

Query: 164 AKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRD 223
                + +++  I  ++  +GP++  F V+DD + Y +G +                   
Sbjct: 229 VSVDQLPNSQIEIQSDVMLNGPIQATFEVYDDFLQYTTGIY------------------- 269

Query: 224 NTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDN 283
                             ++ +G   G  ++RI+GWG  +     YWL ANSW   WG+N
Sbjct: 270 ------------------VHLTGNKQGHLSVRIIGWGVWQGVP--YWLCANSWGRQWGEN 309

Query: 284 GLFKILRGKDECGIESSITAGVPKLD 309
           G F++LRG +ECG+ES+  +G+PKL+
Sbjct: 310 GTFRVLRGTNECGLESNCVSGMPKLN 335


>gi|204022071|dbj|BAG71133.1| cathepsin B-S2 [Tuberaphis coreana]
          Length = 334

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 79/253 (31%), Positives = 114/253 (45%), Gaps = 28/253 (11%)

Query: 80  VDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW----------------GCRPYEIAPC 123
           V+ D P  FDSRT W +C  I  IRDQG+CGSCW                G +  ++   
Sbjct: 81  VENDSPQQFDSRTNWKSCKQIGHIRDQGNCGSCWSFSTTGAFADRLCVSTGGKFNQLLSP 140

Query: 124 EHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNF--GAKSYSVSSNEKSIMKEIY 181
           E      +       G  P    E      V    D N   G   Y V        + I 
Sbjct: 141 EELTFCCKDCGQGCGGGNPMKAWEYFRTQGVTTGGDYNTKEGCMPYKVPPCRNKQGENIC 200

Query: 182 EHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLI 241
           +  P+E             + +          ++ IK   +D  +    E +F  +DDL 
Sbjct: 201 DEQPMERNHQCPKTCYGKTTVQNRYKTKSEYYINSIKTIEQDIKTYGPVEASFDCYDDLS 260

Query: 242 LYKSG--------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
           +YKSG        K  GGH+I+I+GWG+++ +   YWL  NSW+  WGD+G FKI++G++
Sbjct: 261 VYKSGIYRKSPNAKYKGGHSIKIIGWGQEDGTP--YWLAVNSWSKFWGDHGTFKIIKGRN 318

Query: 294 ECGIESSITAGVP 306
           ECGIE ++TAG+P
Sbjct: 319 ECGIERAVTAGIP 331


>gi|161343867|tpg|DAA06114.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 340

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 69/201 (34%), Positives = 93/201 (46%), Gaps = 56/201 (27%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY + PC HH  G     D       +C R C  + D+ +  D  +   SY ++   
Sbjct: 185 GCEPYRVPPCRHHAEGNNSCSDKPMEKNHRCTRMCYGDQDLDFDDDHRYTRDSYYLTYG- 243

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            SI K++  +GP+E +                                            
Sbjct: 244 -SIQKDVMNYGPIEAS-------------------------------------------- 258

Query: 234 FTVFDDLILYKSG--------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
           F V+DD   YKSG          LGGHA++++GWGE+  S   YWL+ NSWNTDWGD GL
Sbjct: 259 FDVYDDFPSYKSGVYIRSDNASYLGGHAVKLIGWGEE--SGVPYWLMVNSWNTDWGDKGL 316

Query: 286 FKILRGKDECGIESSITAGVP 306
           FKI RG +ECG+++S TAGVP
Sbjct: 317 FKIQRGTNECGVDNSTTAGVP 337



 Score = 48.1 bits (113), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 19/38 (50%), Positives = 24/38 (63%)

Query: 77  YSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           Y  +   +P  FD+R KW  C TI  +RDQG+CGSCW 
Sbjct: 81  YDNLFGRIPKKFDARKKWRKCKTIGAVRDQGNCGSCWA 118



 Score = 39.7 bits (91), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 16/32 (50%), Positives = 22/32 (68%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CG+GCNGG+P  AW  +   G+V+GG Y S +
Sbjct: 153 CGYGCNGGYPIKAWERFKSHGLVTGGDYKSGE 184


>gi|226466652|emb|CAX69461.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 340

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 81/299 (27%), Positives = 120/299 (40%), Gaps = 103/299 (34%)

Query: 75  IGYSEVDEDLPANFDSRTKWPNCPTIREI------------------------------- 103
           I ++ ++ ++P +FD+R  W NC TIR+I                               
Sbjct: 80  ISHNSINMEIPRSFDARYHWINCSTIRQIHDESLCRADWAIATVDSISDRICIRSNGRIS 139

Query: 104 -----RDQGSCG---SCW--------------------------GCRPYEIAPCEHHVNG 129
                RD  SCG    C+                          GC+PY +  C +H   
Sbjct: 140 VQLSARDAISCGFSPGCFHGSEVEVLVYWITYGIVTGGSYEDQSGCQPYPLPKCSYHPES 199

Query: 130 TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGA 189
               C+ +    P+C  ECQ+ Y+  Y  D  +G + Y+V   ++ I KEI  +GPV  +
Sbjct: 200 RFLDCNNNTFEFPQCTNECQDGYNKTYDDDKFYGERIYNVYGTQEDIQKEILMNGPVIAS 259

Query: 190 FTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKAL 249
            +V  D ++YKSG  ++P   +                                   + L
Sbjct: 260 ISVNTDFLVYKSG-VYLPTPRS-----------------------------------RNL 283

Query: 250 GGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           G   +RI+GWG +   K  YWL ANSWN +WGDNG  KI RG     IES + A +PK+
Sbjct: 284 GWITLRIIGWGYE--GKIPYWLCANSWNEEWGDNGYVKIQRGVQAGYIESYVRAPIPKM 340


>gi|66810163|ref|XP_638805.1| peptidase C1A family protein [Dictyostelium discoideum AX4]
 gi|74897075|sp|Q54QD9.1|CTSB_DICDI RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Flags:
           Precursor
 gi|60467425|gb|EAL65448.1| peptidase C1A family protein [Dictyostelium discoideum AX4]
          Length = 311

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 82/293 (27%), Positives = 119/293 (40%), Gaps = 108/293 (36%)

Query: 73  ELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYE------------- 119
           ++  Y  +   +P +F+++T WPNC TI +I++Q  CGSCW     E             
Sbjct: 68  QIKSYDPLGVQIPTSFNAQTNWPNCTTISQIQNQARCGSCWAFGATESATDRLCIHNNEN 127

Query: 120 -------IAPCEHHVNG--------------------------TRPSCDASKG------H 140
                  +  C+   NG                          T P+C  ++       +
Sbjct: 128 VQLSFMDMVTCDETDNGCEGGDAFSAWNWLRKQGAVSEECLPYTIPTCPPAQQPCLNFVN 187

Query: 141 TPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYK 200
           TP C +ECQ N  + Y +D +  AK YS  S+E +IM+EI  +GPVE             
Sbjct: 188 TPSCTKECQSNSSLIYSQDKHKMAKIYSFDSDE-AIMQEIVTNGPVEAC----------- 235

Query: 201 SGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHA 253
                                            FTVF+D + YKSG       K LGGH 
Sbjct: 236 ---------------------------------FTVFEDFLAYKSGVYVHTTGKDLGGHC 262

Query: 254 IRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           ++++G+G    +   Y+   N W T WGDNG F I RG  +CGI   + AG+P
Sbjct: 263 VKLVGFGT--LNGVDYYAANNQWTTSWGDNGTFLIKRG--DCGISDDVVAGLP 311


>gi|294898091|ref|XP_002776152.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239882839|gb|EER07968.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 382

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 91/295 (30%), Positives = 121/295 (41%), Gaps = 107/295 (36%)

Query: 76  GYS-EVDEDLPANFDSRTKWPNCP-TIREIRDQGSCGSCW-------------------- 113
           GY+ E  +DLP +FD+RT +PNC   I  IRDQ +CGSCW                    
Sbjct: 133 GYAIEELQDLPTDFDARTAFPNCSKVIGHIRDQSACGSCWAFGVTEAFNDRLCIKSNGAF 192

Query: 114 ----------------GC---RPYEIAPCEHHV-----NGTRPSCDASKGHTP------- 142
                           GC    PY      H        G+RP   +     P       
Sbjct: 193 TELLSAGEMNACTLFFGCGGGDPYSAWSWVHDKGIATGEGSRPKRVSESEAIPVIAYQDI 252

Query: 143 ----KCVRECQE-NYDVPYKKDLNFGAKS----YSVSSNEKSIMKEIYEHGPVEGAFTVF 193
                CV +C+   Y    + D +F  +S    YSV+  + +I  +    GPV  +FTV+
Sbjct: 253 YPTPNCVEQCRNPKYTTTLRDDRHFMLESSPYHYSVNDAKNAIRTD----GPVSASFTVY 308

Query: 194 DDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHA 253
           +D + YKSG +                                      + SG  LGGHA
Sbjct: 309 EDFLAYKSGVY-------------------------------------KHTSGSYLGGHA 331

Query: 254 IRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           ++I+GWG  EKS + YWL  NSWN DWGD GLFKI  G   CGI+  +  G PK+
Sbjct: 332 VKIIGWG--EKSGQAYWLAVNSWNEDWGDKGLFKIALGN--CGIDDDLLGGTPKV 382


>gi|268561878|ref|XP_002638441.1| Hypothetical protein CBG18657 [Caenorhabditis briggsae]
          Length = 372

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 76/228 (33%), Positives = 101/228 (44%), Gaps = 82/228 (35%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENY-DVPYKKDLNF---------- 162
           GC+PY   PC         SC+ASK  TP C ++CQ  Y +  YK D  F          
Sbjct: 173 GCQPYTFPPCS--------SCEASKS-TPSCQKKCQTGYLEATYKNDKRFENEEQDSSYM 223

Query: 163 -------------GAKSYSVSSNEKS----------IMKEIYEHGPVEGAFTVFDDLILY 199
                        G  +Y +S+   S          I  EIY +GPVE ++ VF+D   Y
Sbjct: 224 SENFYQVLIILKGGKSAYRLSTTTSSNKISTDAIITIQTEIYNNGPVEVSYRVFEDFYQY 283

Query: 200 KSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGW 259
           KSG +                                      Y SGK  G HA++I+GW
Sbjct: 284 KSGVYH-------------------------------------YVSGKLTGAHAVKIIGW 306

Query: 260 GEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPK 307
           G +  +K  YWL+ANSW TD+G+ G FKI RG +ECGIE ++ AG+ K
Sbjct: 307 GTE--NKVDYWLVANSWGTDFGEKGFFKIRRGTNECGIEENVVAGLAK 352



 Score = 48.1 bits (113), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 25/37 (67%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEI 120
           +P +FD+R  WPNC +I+ IR+Q  CG+CW     EI
Sbjct: 76  VPISFDARDHWPNCKSIKLIRNQAYCGACWAFGAAEI 112



 Score = 37.4 bits (85), Expect = 7.7,   Method: Compositional matrix adjust.
 Identities = 14/28 (50%), Positives = 20/28 (71%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
           CG GC GG+P    ++W+ SG+V+GG Y
Sbjct: 142 CGEGCKGGYPLEGLKFWMNSGVVTGGDY 169


>gi|7507648|pir||T24819 hypothetical protein T10H4.12 - Caenorhabditis elegans
          Length = 324

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 71/214 (33%), Positives = 98/214 (45%), Gaps = 67/214 (31%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVP-YKKDLNFG--------- 163
           GC PY  APC           +  +  TP C   CQ +Y    YKKD ++G         
Sbjct: 127 GCMPYSFAPCTK---------NCPESTTPSCKTTCQSSYKTEEYKKDKHYGELVWHSFNR 177

Query: 164 -------AKSYSVSSNEK--SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAM 214
                  A +Y V++ +    I  EIY +GPVE ++ V++D   YKSG +          
Sbjct: 178 FQRFLNRASAYKVTTTKSVTEIQTEIYHYGPVEASYKVYEDFYHYKSGVYH--------- 228

Query: 215 SLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIAN 274
                                       Y SGK +GGHA++I+GWG +  +   YWLIAN
Sbjct: 229 ----------------------------YTSGKLVGGHAVKIIGWGVE--NGVDYWLIAN 258

Query: 275 SWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           SW T +G+ G FKI RG +EC IE ++ AG+ KL
Sbjct: 259 SWGTSFGEKGFFKIRRGTNECQIEGNVVAGIAKL 292



 Score = 39.7 bits (91), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 16/29 (55%), Positives = 20/29 (68%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYG 37
           CG+GC GG+   A R+W  SG V+GG YG
Sbjct: 96  CGYGCKGGYSIEALRFWASSGAVTGGDYG 124


>gi|193716207|ref|XP_001950562.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
          Length = 340

 Score =  111 bits (278), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 66/201 (32%), Positives = 90/201 (44%), Gaps = 56/201 (27%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY + PC +   G        +    +C R C  N D+ Y  D  F   SY ++ + 
Sbjct: 185 GCEPYRVPPCPYDAEGHNTCAGKPREKNHRCTRTCYGNQDLDYNDDHRFTRDSYYLTYS- 243

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            SI K++  +GP+E +                                            
Sbjct: 244 -SIQKDVMRYGPIEAS-------------------------------------------- 258

Query: 234 FTVFDDLILYKSG--------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
           F ++DD   YKSG          LGGHA++++GWGE+      YWL+ NSWN  WGDNGL
Sbjct: 259 FDMYDDFPSYKSGVYVRSENASYLGGHAVKLIGWGEEHGVL--YWLMVNSWNEGWGDNGL 316

Query: 286 FKILRGKDECGIESSITAGVP 306
           FKI RG +ECGI++S T GVP
Sbjct: 317 FKIRRGTNECGIDNSTTGGVP 337



 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 33/110 (30%), Positives = 52/110 (47%), Gaps = 26/110 (23%)

Query: 30  IVSGGAYGSKQA---EKNSLSNIPRAHLKSW-MGVHPDYNLPANRLPELIG--------- 76
           ++    Y ++QA   +K+ + NI   H  +W  GV+ D N P     +++G         
Sbjct: 10  VIFVSVYVTEQAYFLQKDFIDNI-NNHATTWKAGVNFDPNTPKEYFLKMLGSKGVQIPDK 68

Query: 77  ------------YSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
                       Y  +   +P +FD+R KW  C TI ++RDQG+CGSCW 
Sbjct: 69  HNIHMYKTHDAAYDNLFGRIPKHFDARKKWKRCHTIGKVRDQGNCGSCWA 118



 Score = 39.3 bits (90), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 16/32 (50%), Positives = 22/32 (68%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CG+GCNGG+P  AW  +   G+V+GG Y S +
Sbjct: 153 CGYGCNGGYPIKAWESFNNRGLVTGGDYQSGE 184


>gi|187107122|ref|NP_001119621.1| cathepsin B-3098 precursor [Acyrthosiphon pisum]
 gi|161343841|tpg|DAA06101.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 337

 Score =  111 bits (278), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 61/193 (31%), Positives = 92/193 (47%), Gaps = 40/193 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY + PC +  +G             KC ++C  + D+ + KD  +    Y ++   
Sbjct: 183 GCEPYRVPPCPYDKDGKNTCSGQPMESNHKCSKKCYGDEDIDFNKDHRYTRDDYYLTY-- 240

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           + I K++  +GP+E +F V+DD   YKSG +    N +                      
Sbjct: 241 RGIQKDVINYGPIETSFDVYDDFPNYKSGIYVKSENAS---------------------- 278

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                          LGGH+++++GWGE+      YWL+ NSWN DWGD GLFKI RG +
Sbjct: 279 --------------YLGGHSVKLIGWGEEYGV--LYWLMVNSWNADWGDKGLFKIRRGTN 322

Query: 294 ECGIESSITAGVP 306
           EC +++S T GVP
Sbjct: 323 ECRVDNSTTGGVP 335



 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 41/83 (49%), Gaps = 14/83 (16%)

Query: 40  QAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDED--------LPANFDSR 91
           +A  NS  N P+ H+   +G          ++P+ + Y+    D        +P  FD+R
Sbjct: 40  KAGVNSAPNTPKEHILRLLGSR------GVQIPDKVNYNMYKNDDHADNYQEIPMKFDAR 93

Query: 92  TKWPNCPTIREIRDQGSCGSCWG 114
            KW  C TI E+RDQG+CGS W 
Sbjct: 94  KKWIRCKTIGEVRDQGNCGSDWA 116



 Score = 38.9 bits (89), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 16/30 (53%), Positives = 21/30 (70%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGG+P  AW+ +   G+V+GG Y S
Sbjct: 151 CGNGCNGGYPIRAWKRFKNHGLVTGGNYKS 180


>gi|4325188|gb|AAD17297.1| cysteine proteinase [Ancylostoma ceylanicum]
          Length = 341

 Score =  111 bits (277), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 63/189 (33%), Positives = 94/189 (49%), Gaps = 41/189 (21%)

Query: 115 CRPYEIAPCEHHVN-GTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           C+PY   PC  H +      C      TPKC +  Q  Y+  Y++D +F  +SYS+ +NE
Sbjct: 187 CKPYSFYPCGQHKDVPYYGPCPGGLWPTPKCRKSSQRKYNKTYQEDKHFATRSYSLPNNE 246

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           +SI +EIY++GPV  AF V++D             + T  + + KW I+           
Sbjct: 247 RSIRQEIYKNGPVVAAFKVYEDY------------SSTGGIYVHKWGIQ----------- 283

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                           G HA +++GWG +  +   YWLIANSWNTDWG++G ++I+R  D
Sbjct: 284 ---------------TGAHADKVIGWGRENGT--DYWLIANSWNTDWGEDGYYRIVRETD 326

Query: 294 ECGIESSIT 302
            C IE  + 
Sbjct: 327 NCEIERQMV 335



 Score = 37.7 bits (86), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 15/35 (42%), Positives = 23/35 (65%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK 43
           CG+GC GG+P  A+R+  + G+V+GG Y  +   K
Sbjct: 154 CGYGCQGGWPIEAYRWMQRDGVVTGGKYRQRDVCK 188


>gi|294883442|ref|XP_002770942.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
 gi|239874068|gb|EER02758.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
          Length = 393

 Score =  111 bits (277), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 89/266 (33%), Positives = 121/266 (45%), Gaps = 49/266 (18%)

Query: 83  DLPANFDSRTKWPNCPT-IREIRDQGSCGSCWGCRPYEIAPCEHHV--NGTRPSCDASKG 139
           +LP  FD+R  + NC T I  +RDQ +CGSCW     E       +  +G       S G
Sbjct: 126 NLPDRFDAREHFKNCATVIGHVRDQSTCGSCWAFATSEAFSDRLCIRSSGEFDLVPLSAG 185

Query: 140 HTPKCVRECQ-------------------ENYDVPYKKD-----LNFGAKSYSVSSNEKS 175
           HT  C  E +                     + V  + D      NF   S+ V   E  
Sbjct: 186 HTAACCSEAEGCFSFGCDGGQPDSAWRWFSEHGVVSELDSGCWPYNFPECSHHV---ETK 242

Query: 176 IMKEIYEHGPVEGAFTVFDDLIL---YKSGRFFV--PGNETTAMSLIKWTIRDNTSQLGA 230
            M+    + P     T   +      ++S R F    G     +  IK  I DN      
Sbjct: 243 GMEPCKGNSPSPVCSTTCRNHHFKPSFESDRHFTEDEGYSLDEVDEIKKEIIDNGP---V 299

Query: 231 EGAFTVFDDLILYKS-------GKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDN 283
             AFTV++D + YKS       G  LGGHA++I+GWG D+   E+YWL+ NSWN +WGD 
Sbjct: 300 AAAFTVYEDFLYYKSGVYKHVNGSELGGHAVKIIGWGTDQ--NEQYWLVMNSWNVNWGDQ 357

Query: 284 GLFKILRGKDECGIESSITAGVPKLD 309
           G+FKI  G  ECGI+S +TAG+PK +
Sbjct: 358 GIFKIAIG--ECGIDSEVTAGIPKYE 381


>gi|224128101|ref|XP_002320244.1| predicted protein [Populus trichocarpa]
 gi|222861017|gb|EEE98559.1| predicted protein [Populus trichocarpa]
          Length = 339

 Score =  110 bits (275), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 84/284 (29%), Positives = 114/284 (40%), Gaps = 108/284 (38%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGS----------------------------------- 108
           LP  FD+RT WP+C TI  I DQG                                    
Sbjct: 83  LPIEFDARTAWPHCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHYGMNLSLSVNDLLAC 142

Query: 109 ----CGS-CWGCRPY--------------EIAPCEHHVNGTRPSCDASKGHTPKCVRECQ 149
               CG+ C G  P               E  P    +  + P C+     TPKC R+C 
Sbjct: 143 CGWMCGAGCDGGSPIDAWRYFVQSGVVTEECDPYFDDIGCSHPGCEPGF-PTPKCERKCA 201

Query: 150 ENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGN 209
           +   + + +  +F   +Y + S+  SIM E+  +GPVE A                    
Sbjct: 202 DKNKL-WAESKHFSVNAYRIDSDPHSIMAEVSSNGPVEVA-------------------- 240

Query: 210 ETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGED 262
                                   FTV++D   YKSG        A+GGHA++++GWG  
Sbjct: 241 ------------------------FTVYEDFAHYKSGVYKHITGDAMGGHAVKLIGWGTS 276

Query: 263 EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           E   E YWL+AN WN  WGD+G FKI RG +ECGIE ++ AG+P
Sbjct: 277 EDG-EDYWLLANQWNRGWGDDGYFKIKRGTNECGIEGAVVAGLP 319



 Score = 38.1 bits (87), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 15/25 (60%), Positives = 21/25 (84%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVS 32
           +CG GC+GG P  AWRY+V+SG+V+
Sbjct: 146 MCGAGCDGGSPIDAWRYFVQSGVVT 170


>gi|294894292|ref|XP_002774787.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239880404|gb|EER06603.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 414

 Score =  110 bits (274), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 70/212 (33%), Positives = 98/212 (46%), Gaps = 58/212 (27%)

Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQE-NYDVPYKKDLNFGAKS----Y 167
           GC PY+  PC HHVN ++ P C      TP C  +C    Y    + D +F  +S    Y
Sbjct: 244 GCWPYDFPPCAHHVNDSKYPKCPKDSYETPNCAEQCHNPKYTTTLRDDRHFLVESVPYEY 303

Query: 168 SVSSNEKSIMKE-----IYEHGP------VEGAFTVFDDLILYKSGRFFVPGNETTAMSL 216
           SV+  + +I  +     IY   P      V  +F V++D + Y+SG +            
Sbjct: 304 SVNDAKNAIRTDGPVGPIYFCDPSVNFDQVSASFIVYEDFLAYRSGVY------------ 351

Query: 217 IKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSW 276
                                     + SGK LGGHA++I+GWGE+  + + YWL+ NSW
Sbjct: 352 -------------------------KHTSGKELGGHAVKIIGWGEE--TGQAYWLVVNSW 384

Query: 277 NTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           N DWGDNGLFKI  G   C I+  +  G PK+
Sbjct: 385 NEDWGDNGLFKIALGN--CEIDDDLLGGTPKV 414



 Score = 51.2 bits (121), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 21/34 (61%), Positives = 25/34 (73%), Gaps = 1/34 (2%)

Query: 82  EDLPANFDSRTKWPNCP-TIREIRDQGSCGSCWG 114
           +DLP +FD+RT +PNC   IR IRDQ  CGSCW 
Sbjct: 140 QDLPTDFDARTAFPNCSKVIRHIRDQSDCGSCWA 173


>gi|224064398|ref|XP_002301456.1| predicted protein [Populus trichocarpa]
 gi|222843182|gb|EEE80729.1| predicted protein [Populus trichocarpa]
          Length = 325

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 63/188 (33%), Positives = 93/188 (49%), Gaps = 40/188 (21%)

Query: 119 EIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
           E  P    +  + P C+     TPKC R+C +   + + +  +F   +Y + S+  SIM 
Sbjct: 158 ECDPYFDDIGCSHPGCEPGF-PTPKCERKCADKNKL-WAESKHFSVNAYRIDSDPHSIMA 215

Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
           E+  +GPVE AFTV++D   YKSG +                                  
Sbjct: 216 EVSMNGPVEVAFTVYEDFAHYKSGVY---------------------------------- 241

Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
               + +G  +GGHA++++GWG  +   E YWL+AN WN  WGD+G FKI RG +ECGIE
Sbjct: 242 ---KHITGDVMGGHAVKLIGWGTSDDG-EDYWLLANQWNRGWGDDGYFKIRRGTNECGIE 297

Query: 299 SSITAGVP 306
             + AG+P
Sbjct: 298 EDVVAGLP 305



 Score = 39.7 bits (91), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 15/25 (60%), Positives = 22/25 (88%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVS 32
           +CG GC+GG+P  AWRY+V+SG+V+
Sbjct: 132 MCGDGCDGGYPIDAWRYFVQSGVVT 156


>gi|204022108|dbj|BAG71151.1| cathepsin B-N [Cerataphis jamuritsu]
          Length = 333

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 63/195 (32%), Positives = 96/195 (49%), Gaps = 44/195 (22%)

Query: 114 GCRPYEIAPC--EHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSS 171
           GC+PY + PC  + + N T       K H  +C R C  + D+ +  D ++   +Y ++ 
Sbjct: 181 GCQPYRVPPCPLDEYGNNTCHGKPMEKNH--RCTRMCYGDQDLDFNNDHHYTRDAYYLTY 238

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
              +I  ++  +GP+E +F V+DD   YKSG +                           
Sbjct: 239 G--TIQNDVLTYGPIEASFEVYDDFPSYKSGVY--------------------------- 269

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
                    +  ++   LGGHA++++GWGE+      YWL+ NSWN  WGD GLFKI RG
Sbjct: 270 ---------VKTENASYLGGHAVKLIGWGEEYGV--PYWLLVNSWNDQWGDQGLFKIRRG 318

Query: 292 KDECGIESSITAGVP 306
            +ECGI++S T GVP
Sbjct: 319 TNECGIDNSTTGGVP 333



 Score = 51.2 bits (121), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 19/33 (57%), Positives = 25/33 (75%)

Query: 82  EDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           + +P+NFD+R KW  C +I E+RDQG CGSCW 
Sbjct: 82  QRIPSNFDARKKWKKCLSIGEVRDQGHCGSCWA 114



 Score = 42.7 bits (99), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 18/33 (54%), Positives = 23/33 (69%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
           CGFGCNGG+P  AW  + K G+V+GG Y S + 
Sbjct: 149 CGFGCNGGYPIRAWERFRKHGLVTGGNYDSYEG 181


>gi|3912916|gb|AAC78691.1| thiol protease [Trichuris suis]
          Length = 348

 Score =  109 bits (273), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 64/203 (31%), Positives = 98/203 (48%), Gaps = 49/203 (24%)

Query: 113 WGCRPYE-IAPCEHHV---------NGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
           +GC+PY+   P   H+         N T          TP+C R C   Y   Y  D  +
Sbjct: 180 YGCKPYKPTGPIGRHLKRNDYAPCPNDTYYGECVGMADTPRCKRRCLLGYPKSYPSDRYY 239

Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
           G  +Y V  + K+I +EI ++GPV  +F V++D   YKSG                    
Sbjct: 240 GKSAYIVKQSVKAIQREIMKNGPVVASFAVYEDFRHYKSG-------------------- 279

Query: 223 DNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
                            +  + +G+  G HA++I+GWG++  +   +WLIANSW+ DWG+
Sbjct: 280 -----------------IYKHTAGELRGYHAVKIIGWGKENNTD--FWLIANSWHQDWGE 320

Query: 283 NGLFKILRGKDECGIESSITAGV 305
            G F+I+RGK+ECGIE+ + AG+
Sbjct: 321 KGYFRIVRGKNECGIETDVVAGI 343



 Score = 38.9 bits (89), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 15/26 (57%), Positives = 20/26 (76%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGG 34
           CG+GCNGGFP  AWR++  +G  +GG
Sbjct: 149 CGYGCNGGFPIEAWRHFTVAGNCTGG 174


>gi|226472808|emb|CAX71090.1| cathepsin B [Schistosoma japonicum]
          Length = 325

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 63/160 (39%), Positives = 79/160 (49%), Gaps = 40/160 (25%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PYE  PCEHH  G  P CD     TP C R CQ  Y+V Y+ D  +G   Y V SN+
Sbjct: 192 GCQPYEFPPCEHHTLGPLPVCDGDV-ETPPCKRTCQAGYNVSYENDKWYGKVVYRVKSNQ 250

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           ++IMKE+ +HGPVE  F V+ D   YKSG +                             
Sbjct: 251 EAIMKELMQHGPVEVDFEVYADFPNYKSGVY----------------------------- 281

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIA 273
                    + SG  LGGHA+R+LGWGE+  +   YWLIA
Sbjct: 282 --------QHVSGALLGGHAVRLLGWGEE--NNVPYWLIA 311



 Score = 57.4 bits (137), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 28/62 (45%), Positives = 38/62 (61%), Gaps = 3/62 (4%)

Query: 54  LKSWMGVHPDYNLPANRLPEL-IGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSC 112
           ++  +G  PD N    +L  L  GY     +LP +FD+R +W +CP+I EIRDQ SCGSC
Sbjct: 66  IRRMLGALPDPN--GEQLETLCTGYELTLNELPKSFDARKEWTHCPSISEIRDQSSCGSC 123

Query: 113 WG 114
           W 
Sbjct: 124 WA 125



 Score = 45.4 bits (106), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 18/30 (60%), Positives = 20/30 (66%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GCNGGFP  AW YW   GIV+G  Y +
Sbjct: 160 CGMGCNGGFPHSAWLYWKNQGIVTGDLYNT 189


>gi|124502519|gb|ABN13633.1| cysteine proteinase [Haemonchus contortus]
          Length = 342

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 62/191 (32%), Positives = 92/191 (48%), Gaps = 39/191 (20%)

Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEK 174
           CRPY I PC HH N T          TP C +EC+      Y+ D  +G  +Y V  + K
Sbjct: 185 CRPYPIHPCGHHGNDTYYGECRGTAPTPPCKKECRPGVRKVYRIDKRYGKDAYIVKQSVK 244

Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
           +I  EI  +GPV  +F V      Y+  R +  G                          
Sbjct: 245 AIQSEILRNGPVVASFAV------YEDFRHYKSG-------------------------- 272

Query: 235 TVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE 294
                +  + +G+  G HA++++GWG +  +   +WLIANSW+ DWG+ G F+I+RG ++
Sbjct: 273 -----IYKHTAGELRGYHAVKMIGWGNENNTD--FWLIANSWHNDWGEKGYFRIIRGTND 325

Query: 295 CGIESSITAGV 305
           CGIE +I AG+
Sbjct: 326 CGIEGTIAAGI 336



 Score = 41.2 bits (95), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 17/31 (54%), Positives = 23/31 (74%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
           CG GC GG+P  AW+Y++  G+VSGG Y +K
Sbjct: 152 CGDGCEGGWPIEAWKYFIYDGVVSGGEYLTK 182



 Score = 40.8 bits (94), Expect = 0.70,   Method: Compositional matrix adjust.
 Identities = 17/32 (53%), Positives = 21/32 (65%), Gaps = 1/32 (3%)

Query: 83  DLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           D+P ++D R  W NC T   IRDQ +CGSCW 
Sbjct: 86  DIPPSYDPRDVWKNCTTFY-IRDQANCGSCWA 116


>gi|56755295|gb|AAW25827.1| SJCHGC06356 protein [Schistosoma japonicum]
          Length = 279

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 80/299 (26%), Positives = 119/299 (39%), Gaps = 103/299 (34%)

Query: 75  IGYSEVDEDLPANFDSRTKWPNCPTIREI------------------------------- 103
           I ++ ++ ++P +FD+R  W NC TIR+I                               
Sbjct: 19  ISHNSINMEIPRSFDARYHWINCSTIRQIHDESLCRADWAIATVDSISDRICIRSNGRIS 78

Query: 104 -----RDQGSCG---SCW--------------------------GCRPYEIAPCEHHVNG 129
                RD  SCG    C+                          GC+PY +  C +H   
Sbjct: 79  VQLSARDAISCGFSPGCFHGSEVEVLVYWITYGIVTGGSYEDQSGCQPYPLPKCSYHPES 138

Query: 130 TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGA 189
               C+ +    P+C  ECQ+ Y+  Y  D  +G + Y+V   ++ I KEI  +GPV  +
Sbjct: 139 RFLDCNNNTFEFPQCTNECQDGYNKTYDDDKFYGERIYNVYGTQEDIQKEILMNGPVIAS 198

Query: 190 FTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKAL 249
            +V  D ++YKSG  ++P   +                                   + L
Sbjct: 199 ISVNTDFLVYKSG-VYLPTPRS-----------------------------------RNL 222

Query: 250 GGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           G   +RI+GWG +   K  YWL ANSWN +WG NG  KI RG     IES + A +PK+
Sbjct: 223 GWITLRIIGWGYE--GKIPYWLCANSWNEEWGANGYVKIQRGVQAGYIESYVRAPIPKM 279


>gi|255040223|gb|ACT99884.1| truncated cathepsin B [Opisthorchis viverrini]
          Length = 313

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 64/171 (37%), Positives = 84/171 (49%), Gaps = 40/171 (23%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCR Y    C+HHV G  P C      TP+CV++C +  ++ Y +D      SY++ ++E
Sbjct: 183 GCRSYPFPKCDHHVQGHYPPCPRQIYPTPECVQDC-DTPELGYLEDKTRANISYNIYASE 241

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            SIMKEI   GPVE  FTV++D + YKS  +F                            
Sbjct: 242 ISIMKEIMLRGPVEAVFTVYEDFLQYKSRVYF---------------------------- 273

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNG 284
                    +  G  + GHAIRILGWGE+      YWLIANSWN DWG+ G
Sbjct: 274 ---------HAWGAPMSGHAIRILGWGEE--GDVPYWLIANSWNEDWGEKG 313



 Score = 57.4 bits (137), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 22/34 (64%), Positives = 27/34 (79%)

Query: 81  DEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           D  LP NFD+R+KWP+C ++ EIRDQ SCGSCW 
Sbjct: 83  DTRLPKNFDARSKWPHCSSVSEIRDQSSCGSCWA 116



 Score = 45.4 bits (106), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 17/27 (62%), Positives = 21/27 (77%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           CGFGC GG+P +AW YW   GIV+GG+
Sbjct: 151 CGFGCRGGYPAVAWDYWRTHGIVTGGS 177


>gi|118118|sp|P19092.1|CYSP1_HAECO RecName: Full=Cathepsin B-like cysteine proteinase 1; Flags:
           Precursor
 gi|159173|gb|AAA29175.1| cysteine protease (AC-1) [Haemonchus contortus]
          Length = 342

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 62/191 (32%), Positives = 92/191 (48%), Gaps = 39/191 (20%)

Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEK 174
           CRPY I PC HH N T          TP C R+C+      Y+ D  +G  +Y V  + K
Sbjct: 185 CRPYPIHPCGHHGNDTYYGECRGTAPTPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSVK 244

Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
           +I  EI  +GPV  +F V      Y+  R +  G                          
Sbjct: 245 AIQSEILRNGPVVASFAV------YEDFRHYKSG-------------------------- 272

Query: 235 TVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE 294
                +  + +G+  G HA++++GWG +  +   +WLIANSW+ DWG+ G F+I+RG ++
Sbjct: 273 -----IYKHTAGELRGYHAVKMIGWGNENNTD--FWLIANSWHNDWGEKGYFRIIRGTND 325

Query: 295 CGIESSITAGV 305
           CGIE +I AG+
Sbjct: 326 CGIEGTIAAGI 336



 Score = 41.2 bits (95), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 17/31 (54%), Positives = 23/31 (74%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
           CG GC GG+P  AW+Y++  G+VSGG Y +K
Sbjct: 152 CGDGCEGGWPIEAWKYFIYDGVVSGGEYLTK 182


>gi|118122|sp|P25793.1|CYSP2_HAECO RecName: Full=Cathepsin B-like cysteine proteinase 2; Flags:
           Precursor
 gi|159165|gb|AAA29171.1| cathepsin B-like cysteine protease [Haemonchus contortus]
          Length = 342

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 62/191 (32%), Positives = 93/191 (48%), Gaps = 39/191 (20%)

Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEK 174
           CRPY I PC HH N T          TP C R+C+      Y+ D  +G  +Y V  + K
Sbjct: 185 CRPYPIHPCGHHGNDTYYGECRGTAPTPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSVK 244

Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
           +I  EI ++GPV  +F V      Y+  R +  G                          
Sbjct: 245 AIQSEILKNGPVVASFAV------YEDFRHYKSG-------------------------- 272

Query: 235 TVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE 294
                +  + +G+  G HA++++GWG +  +   +WLIANSW+ DWG+ G F+I+RG ++
Sbjct: 273 -----IYKHTAGELRGYHAVKMIGWGNENNTD--FWLIANSWHNDWGEKGYFRIVRGSND 325

Query: 295 CGIESSITAGV 305
           CGIE +I AG+
Sbjct: 326 CGIEGTIAAGI 336



 Score = 41.6 bits (96), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 17/31 (54%), Positives = 23/31 (74%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
           CG GC GG+P  AW+Y++  G+VSGG Y +K
Sbjct: 152 CGDGCEGGWPIEAWKYFIYDGVVSGGEYLTK 182


>gi|729283|sp|Q06544.1|CYSP3_OSTOS RecName: Full=Cathepsin B-like cysteine proteinase 3
 gi|159952|gb|AAA29436.1| cathepsin B-like cysteine protease, partial [Ostertagia ostertagi]
          Length = 174

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 67/202 (33%), Positives = 98/202 (48%), Gaps = 61/202 (30%)

Query: 115 CRPYEIAPCEHHVNGTRP---SC-DASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVS 170
           CRPYE  PC  H  G  P    C D +K  TPKC + CQ  Y   YK+D +FG  +Y + 
Sbjct: 22  CRPYEFPPCGRH--GKEPYYGECYDTAK--TPKCQKTCQRGYLKAYKEDKHFGKSAYRLP 77

Query: 171 SNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA 230
           +N K+I ++I ++GPV   F                                        
Sbjct: 78  NNVKAIQRDIMKNGPVVAGFI--------------------------------------- 98

Query: 231 EGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDN 283
                V++D   YKSG       +  GGHA++I+GWG+++ +   YWLIANSW+ DWG+ 
Sbjct: 99  -----VYEDFAHYKSGIYKHTAGRMTGGHAVKIIGWGKEKGTP--YWLIANSWHDDWGEK 151

Query: 284 GLFKILRGKDECGIESSITAGV 305
           G ++++RG + C IE  + AG+
Sbjct: 152 GFYRMIRGINNCRIEEMVFAGI 173


>gi|255647484|gb|ACU24206.1| unknown [Glycine max]
          Length = 327

 Score =  108 bits (271), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 93/322 (28%), Positives = 123/322 (38%), Gaps = 112/322 (34%)

Query: 37  GSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDE-DLPANFDSRTKWP 95
           G + A     SN      K  +GV P   +P   L      S      LP NFD+RT W 
Sbjct: 56  GWEAAINPRFSNYTVEQFKRLLGVKP---MPKKELRSTPAISHPKTLKLPKNFDARTAWS 112

Query: 96  NCPTIREIRDQGS---------------------------------------CGS-CWGC 115
            C TI  I DQG                                        CGS C G 
Sbjct: 113 QCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHFDVNISLSVNDLLACCGFLCGSGCDGG 172

Query: 116 RPY--------------EIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLN 161
            P               E  P    +  + P C+ +   TPKCV++C     V +KK  +
Sbjct: 173 YPLYAWRYLAHHGVVTEECDPYFDQIGCSHPGCEPAY-RTPKCVKKCVSGNQV-WKKSKH 230

Query: 162 FGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTI 221
           +   +Y V+S+   IM E+Y++GPVE A                                
Sbjct: 231 YSVSAYRVNSDPHDIMAEVYKNGPVEVA-------------------------------- 258

Query: 222 RDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIAN 274
                       FTV++D   YKSG         LGGHA++++GWG  +   E YWL+AN
Sbjct: 259 ------------FTVYEDFAYYKSGVYKHITGYELGGHAVKLIGWGTTDDG-EDYWLLAN 305

Query: 275 SWNTDWGDNGLFKILRGKDECG 296
            WN +WGD+G FKI RG +ECG
Sbjct: 306 QWNREWGDDGYFKIRRGTNECG 327



 Score = 37.4 bits (85), Expect = 9.4,   Method: Compositional matrix adjust.
 Identities = 14/25 (56%), Positives = 18/25 (72%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVS 32
           LCG GC+GG+P  AWRY    G+V+
Sbjct: 164 LCGSGCDGGYPLYAWRYLAHHGVVT 188


>gi|349604734|gb|AEQ00202.1| Cathepsin B-like protein, partial [Equus caballus]
          Length = 134

 Score =  108 bits (271), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 66/174 (37%), Positives = 84/174 (48%), Gaps = 54/174 (31%)

Query: 144 CVRECQENYDVPYKKDLNFGAKSYSVSSN-EKSIMKEIYEHGPVEGAFTVFDDLILYKSG 202
           C + C+  Y   YK+D ++G  SYSVS    +   +   ++GPVE A             
Sbjct: 1   CSKICEPGYSPSYKEDKHYGCSSYSVSRGARRRSWQRSSKNGPVEAA------------- 47

Query: 203 RFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIR 255
                                          FTV+ D + YKSG         +GGHA+R
Sbjct: 48  -------------------------------FTVYSDFLQYKSGVYQHVAGDMMGGHAVR 76

Query: 256 ILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKLD 309
           ILGWG +  +   YWL+ NSWNTDWGDNG FKILRG+D CGIES I AG+P  D
Sbjct: 77  ILGWGVENGTP--YWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAGIPCTD 128


>gi|52630945|gb|AAU84936.1| putative cathepsin B-S [Toxoptera citricida]
          Length = 335

 Score =  108 bits (271), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 65/193 (33%), Positives = 93/193 (48%), Gaps = 40/193 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY++ PC     G             KC R C  +    YKK       +Y +  N 
Sbjct: 181 GCQPYKVPPCVKDEEGHNSCSGQPTEPNHKCSRSCYGDKTCDYKKGHYKTKNAYYL--NI 238

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            ++ K+   +GP+E +F V+DD + Y+SG +     + T                     
Sbjct: 239 DTMQKDTIAYGPIEASFDVYDDFVNYESGVY-----QKT--------------------- 272

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                     +  K LGGHA++++GWGE++ +   YWL+ NSW   WG NG+FKILRG +
Sbjct: 273 ----------EDAKYLGGHAVKMIGWGEEDGT--PYWLMVNSWGEQWGANGMFKILRGTN 320

Query: 294 ECGIESSITAGVP 306
           ECGIE S TAGVP
Sbjct: 321 ECGIEGSPTAGVP 333



 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 23/75 (30%), Positives = 42/75 (56%)

Query: 40  QAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPT 99
           +A++N    + +  +   +G     ++P + + E       D ++P  FD+R +W +C T
Sbjct: 40  KAKQNFPEYMTKEQIVRLLGSKNLTSVPKSLIKENDSEYINDSEIPNFFDARIQWSHCKT 99

Query: 100 IREIRDQGSCGSCWG 114
           I E+R+QG+CGSCW 
Sbjct: 100 IGEVRNQGNCGSCWA 114


>gi|168026641|ref|XP_001765840.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162683017|gb|EDQ69431.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 339

 Score =  108 bits (271), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 82/265 (30%), Positives = 119/265 (44%), Gaps = 34/265 (12%)

Query: 67  PANRLP---ELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPC 123
           PAN L    E + +      LP  FD+R  W +C TI  I DQG CGSCW     E    
Sbjct: 75  PANELEPSIERVTHKHKKLVLPKEFDARKHWGHCSTIGAILDQGHCGSCWAFGAAESLTD 134

Query: 124 EHHVNGTRPSCDASKGHTPKCVRECQENYDVPYK-KDLNFGAKSYSVSSNEKSIMKEIYE 182
              ++       +       C  EC +  D  Y  +   +  ++  V+S       +I  
Sbjct: 135 RFCIHMNESVSLSENDLLACCGFECGDGCDGGYPIRAWRYFKRTGVVTSKCDPYFDQIGC 194

Query: 183 HGPVEGAFTVF----------DDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
             P  G +  +          DD +  KS    V   E +          D  ++L   G
Sbjct: 195 GHP--GCYPTYRTPKCVKHCVDDELWVKSKHLSVNAYEVSKEP------EDLMAELYTNG 246

Query: 233 ----AFTVFDDLILYKS-------GKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWG 281
               +F VF+D   YK+       G+ +GGHA++++GWG  +   + YW I NSWNT+WG
Sbjct: 247 PIEVSFEVFEDFAHYKTGVYKHVYGRYIGGHAVKLIGWGTTDDGVD-YWTIVNSWNTNWG 305

Query: 282 DNGLFKILRGKDECGIESSITAGVP 306
           ++GLF+I RG +ECGIES   AG+P
Sbjct: 306 EHGLFRIARGGNECGIESYAVAGLP 330


>gi|339239305|ref|XP_003381207.1| cathepsin B [Trichinella spiralis]
 gi|316975778|gb|EFV59177.1| cathepsin B [Trichinella spiralis]
          Length = 343

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 69/205 (33%), Positives = 101/205 (49%), Gaps = 49/205 (23%)

Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
           G  GS  GC+P+ IAP         P+  ++   TP C  +C  +Y     KD  +G   
Sbjct: 184 GPYGSKSGCKPFSIAP---------PTSSSTAAQTPLCQLKCISDYKRKLDKDRYYGESY 234

Query: 167 YSVSSNE---KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRD 223
           Y ++S+    K+I +EI +HGPV  A  +F+  + YKS                      
Sbjct: 235 YLITSSNQPVKTIQREIMDHGPVVAAMEIFESFLYYKS---------------------- 272

Query: 224 NTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDN 283
                G   A    DD        +LG HA++++GWGE ++    YWL+ NSWNT +G+ 
Sbjct: 273 -----GVYSANKRNDD-------PSLGLHAVKLIGWGEQKRIP--YWLVVNSWNTTFGEQ 318

Query: 284 GLFKILRGKDECGIES-SITAGVPK 307
           GLFKI RG +ECGIE+  +TAG+ +
Sbjct: 319 GLFKIRRGTNECGIENLHVTAGLAE 343



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 19/31 (61%), Positives = 26/31 (83%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
           CG+GCNGGFP +A++YW + G+ +GG YGSK
Sbjct: 159 CGYGCNGGFPLLAFKYWNEIGVPTGGPYGSK 189



 Score = 40.0 bits (92), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 17/34 (50%), Positives = 21/34 (61%), Gaps = 2/34 (5%)

Query: 82  EDLPA--NFDSRTKWPNCPTIREIRDQGSCGSCW 113
           E LP   +FD+R KWP C  I  I+DQ +C  CW
Sbjct: 56  ESLPLEEHFDAREKWPECKYIGFIKDQSTCSCCW 89


>gi|209863073|ref|NP_001119610.2| cathepsin B-1852 [Acyrthosiphon pisum]
          Length = 333

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 64/194 (32%), Positives = 92/194 (47%), Gaps = 44/194 (22%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY   PC  +      SC        KC ++C  N  + Y+ D  +  +S  V + +
Sbjct: 181 GCQPYMFPPCTGN-----NSCSGQSEKNHKCQKKCFGNTSISYRGDRRYVERSPYVLAYD 235

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            ++  +I  +GP+E +F V+DD I YKSG +F   N T                      
Sbjct: 236 -NMQNDIMTYGPIESSFDVYDDFISYKSGVYFKSPNAT---------------------- 272

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                          LGGH+++ +GWG +      YWL+ NSWN+ WGD G FKI RG +
Sbjct: 273 --------------YLGGHSVKCIGWGVERNV--SYWLMMNSWNSTWGDGGYFKIRRGTN 316

Query: 294 ECGIESSITAGVPK 307
           EC +E S TAGVP+
Sbjct: 317 ECQVEDSSTAGVPE 330



 Score = 42.7 bits (99), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 17/30 (56%), Positives = 22/30 (73%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GC GG+P  AWRY+ K G+V+GG + S
Sbjct: 149 CGLGCQGGYPIRAWRYYSKHGLVTGGNFNS 178


>gi|18378945|ref|NP_563647.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|332189291|gb|AEE27412.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
          Length = 379

 Score =  108 bits (270), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 91/351 (25%), Positives = 133/351 (37%), Gaps = 130/351 (37%)

Query: 37  GSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
           G K A  +  +N   A  K  +GV          +P  I   ++   LP  FD+RT W +
Sbjct: 58  GWKAAFNDRFANATVAEFKRLLGVIQTPKTAYLGVP--IVRHDLSLKLPKEFDARTAWSH 115

Query: 97  CPTIREIRD--------------------QGSCGSCWG-----------CRPYEI----- 120
           C +IR I                       G CGSCW            C  Y +     
Sbjct: 116 CTSIRRILVGYILNNVLLWSTITLWFWFLLGHCGSCWAFGAVESLSDRFCIKYNLNVSLS 175

Query: 121 -----------------------------------APCEHHVNGT---RPSCDASKGHTP 142
                                                C+ + + T    P C+ +   TP
Sbjct: 176 ANDVIACCGLLCGFGCNGGFPMGAWLYFKYHGVVTQECDPYFDNTGCSHPGCEPTY-PTP 234

Query: 143 KCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSG 202
           KC R+C     + + +  ++G  +Y ++ + + IM E+Y++GPVE A             
Sbjct: 235 KCERKCVSRNQL-WGESKHYGVGAYRINPDPQDIMAEVYKNGPVEVA------------- 280

Query: 203 RFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIR 255
                                          FTV++D   YKSG         +GGHA++
Sbjct: 281 -------------------------------FTVYEDFAHYKSGVYKYITGTKIGGHAVK 309

Query: 256 ILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           ++GWG  +   E YWL+AN WN  WGD+G FKI RG +ECGIE S+ AG+P
Sbjct: 310 LIGWGTSDDG-EDYWLLANQWNRSWGDDGYFKIRRGTNECGIEQSVVAGLP 359



 Score = 39.3 bits (90), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 16/25 (64%), Positives = 19/25 (76%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVS 32
           LCGFGCNGGFP  AW Y+   G+V+
Sbjct: 186 LCGFGCNGGFPMGAWLYFKYHGVVT 210


>gi|159175|gb|AAA29176.1| cysteine proteinase [Haemonchus contortus]
          Length = 348

 Score =  108 bits (270), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 61/198 (30%), Positives = 96/198 (48%), Gaps = 42/198 (21%)

Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFG--AKSYSVSSN 172
           C PY + PC  H N T          TP C R+CQ  +   Y+ D  +G   ++Y++  +
Sbjct: 188 CSPYPLHPCGRHGNDTFYGNCVGMAPTPPCKRKCQPGFRGMYRVDKRYGEPGRTYTLPRS 247

Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
           E  I ++I E G V   F V++D   Y+SG                              
Sbjct: 248 EVKIRRDIKERGSVVAVFAVYEDFSHYQSG------------------------------ 277

Query: 233 AFTVFDDLILYKSGKALGG-HAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
                  +  + +G+  GG HA++++GWG+D  +   YWLIANSW+ DWG+NG F+++RG
Sbjct: 278 -------IYKHTAGRFTGGYHAVKMIGWGKDNGTD--YWLIANSWHDDWGENGFFRMIRG 328

Query: 292 KDECGIESSITAGVPKLD 309
            + CGIE  + AG+  ++
Sbjct: 329 INNCGIEEQVDAGIVDVE 346



 Score = 48.5 bits (114), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 21/50 (42%), Positives = 27/50 (54%)

Query: 65  NLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           N   N  P +   ++   DLP N+D R  W NC +   IRDQ +CGSCW 
Sbjct: 70  NANQNLNPVVNDDNDTGADLPENYDPRIVWKNCSSFHTIRDQANCGSCWA 119



 Score = 38.1 bits (87), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 16/31 (51%), Positives = 21/31 (67%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
           CG GC GG+P  AW+++   G+VSGG Y  K
Sbjct: 155 CGLGCRGGWPIEAWKFFEYDGVVSGGPYLGK 185


>gi|156708108|gb|ABU93312.1| cathepsin B2 cysteine protease [Monocercomonoides sp. PA]
          Length = 281

 Score =  107 bits (268), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 76/262 (29%), Positives = 109/262 (41%), Gaps = 88/262 (33%)

Query: 79  EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW---------------GC-----RPY 118
           ++D  LP NFDSR +WP    I  +RDQ SCGSCW               GC      P 
Sbjct: 58  DLDNALPENFDSREQWPG--KILPVRDQASCGSCWAFSVAETMGDRLSIKGCDFGDMSPQ 115

Query: 119 EIAPCEHHVNG------------------TRPSC---DASKGHTPKCVRECQENYDVPYK 157
           ++  C+    G                  T   C    +  G  P C  +C     +   
Sbjct: 116 DLVSCDTTDMGCNGGYMDHAWAWTKSHGITTEKCMPYQSGSGRVPACPAKCVNGSAIVRN 175

Query: 158 KDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLI 217
           K +++         N + +M+E+YE+GP+  AFTV+ D + YKSG +             
Sbjct: 176 KSVSYKKL------NAQQMMEELYENGPISVAFTVYYDFMNYKSGVY------------- 216

Query: 218 KWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWN 277
                                   ++K+G   GGHA+  +GWG ++ +   YWL  NSW 
Sbjct: 217 ------------------------VHKTGGIAGGHAVLCVGWGVEDNTP--YWLCQNSWG 250

Query: 278 TDWGDNGLFKILRGKDECGIES 299
             WG+ G FKILRG + CGIE+
Sbjct: 251 PAWGEKGHFKILRGSNHCGIEN 272


>gi|52630925|gb|AAU84926.1| putative cathepsin B-N [Toxoptera citricida]
          Length = 340

 Score =  107 bits (268), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 66/201 (32%), Positives = 90/201 (44%), Gaps = 56/201 (27%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY + PC +  +G             +C R C  + D+ + +D  +   SY ++   
Sbjct: 185 GCEPYRVPPCPYDESGNNTCAGKPMEANHRCTRMCYGDQDLDFDEDHRYTRDSYYLTYG- 243

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            SI K++  +GPVE +                                            
Sbjct: 244 -SIQKDVLTYGPVEAS-------------------------------------------- 258

Query: 234 FTVFDDLILYKSG--------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
           F V+DD   YKSG          LGGHA +++GWGE+      YWL+ NSWN DWGDNGL
Sbjct: 259 FDVYDDFPSYKSGVYIRSENASYLGGHAAKLIGWGEEYGVP--YWLMVNSWNADWGDNGL 316

Query: 286 FKILRGKDECGIESSITAGVP 306
           FKI RG +ECGI++S T GVP
Sbjct: 317 FKIQRGTNECGIDNSTTGGVP 337



 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 35/116 (30%), Positives = 53/116 (45%), Gaps = 24/116 (20%)

Query: 23  RYWVKSGIVSGGAYGSKQA---EKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIG--- 76
           R ++   ++    Y ++QA   E++ ++ I         GV+ D   P   + +L+G   
Sbjct: 3   RVFILLSVILFSVYMTEQAYFLEEDYINKINEQATTWKAGVNFDPKTPKEHILKLLGSKG 62

Query: 77  -----------YSEVDED-------LPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
                      Y   DE+       +P  FD+R KW NC TI  IRDQG+CGSCW 
Sbjct: 63  VQIPSKLNHKMYKSEDENYDNLFGRIPRKFDARKKWRNCKTIGAIRDQGNCGSCWA 118



 Score = 44.3 bits (103), Expect = 0.070,   Method: Compositional matrix adjust.
 Identities = 18/32 (56%), Positives = 24/32 (75%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CGFGCNGG+P  AW ++ K G+V+GG Y S +
Sbjct: 153 CGFGCNGGYPIKAWEHFKKHGLVTGGDYKSGE 184


>gi|1644295|emb|CAB03627.1| cysteine proteinase [Haemonchus contortus]
          Length = 345

 Score =  107 bits (267), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 88/289 (30%), Positives = 125/289 (43%), Gaps = 79/289 (27%)

Query: 69  NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVN 128
           NR P +    + D+D+P +FD+RT W NC ++R IRDQ +   C  C     A       
Sbjct: 79  NRKPVVENADDEDDDIPESFDARTHWANCTSLRHIRDQAN---CGSCWAVSTASAL---- 131

Query: 129 GTRPSCDASKGHTP---------KCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKE 179
            +   C ASKG T           C + C       Y  D  +  +++   S +      
Sbjct: 132 -SDRICIASKGETQLHISSIDIVSCCKLC------GYGCDGGWPIEAFDYFSRQ------ 178

Query: 180 IYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSL-------------IKWTIRDNTS 226
               G V G  T  D    Y     +  GN+T    +             +K   R++T 
Sbjct: 179 ----GAVTGETTSKDGCRPYPFHPLWTYGNDTVGRRMSGRCKHSKTVGEGVKRVTRNHTR 234

Query: 227 QLG---------------AEG---------AFTVFDDLILYK-------SGKALGGHAIR 255
           + G               +EG          FTV++D   YK       +GKA G HAI+
Sbjct: 235 RTGLTARRLRITEFCQSHSEGDHGNGPVVAVFTVYEDFSYYKKGIYVHIAGKARGAHAIK 294

Query: 256 ILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAG 304
           I+GWG +  +   YWLIANSW+ DWG+ GLF+I+RG +ECGIE  + AG
Sbjct: 295 IIGWGVE--NGLPYWLIANSWHDDWGEQGLFRIVRGINECGIEQEVVAG 341


>gi|156708104|gb|ABU93310.1| cathepsin B1 cysteine protease [Monocercomonoides sp. PA]
          Length = 281

 Score =  107 bits (267), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 76/262 (29%), Positives = 109/262 (41%), Gaps = 88/262 (33%)

Query: 79  EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW---------------GC-----RPY 118
           ++D  LP NFDSR +WP    I  +RDQ SCGSCW               GC      P 
Sbjct: 58  DLDNALPENFDSREQWPG--KILPVRDQASCGSCWAFSVAETMGDRLSIKGCDYGDMAPQ 115

Query: 119 EIAPCEHHVNG------------------TRPSC---DASKGHTPKCVRECQENYDVPYK 157
           ++  C+    G                  T   C    +  G  P C  +C     +   
Sbjct: 116 DLVSCDTTDMGCNGGYMDHAWAWTKSHGVTTEKCMPYQSGSGRVPACPAKCVNGSAIVRN 175

Query: 158 KDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLI 217
           K +++         N + +M+E+YE+GP+  AFTV+ D + YKSG +             
Sbjct: 176 KSVSYK------KLNAQQMMEELYENGPISVAFTVYYDFMNYKSGVY------------- 216

Query: 218 KWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWN 277
                                   ++K+G   GGHA+  +GWG ++ +   YWL  NSW 
Sbjct: 217 ------------------------VHKTGGIAGGHAVLCVGWGVEDNTP--YWLCQNSWG 250

Query: 278 TDWGDNGLFKILRGKDECGIES 299
             WG+ G FKILRG + CGIE+
Sbjct: 251 PAWGEKGHFKILRGSNHCGIEN 272


>gi|414886872|tpg|DAA62886.1| TPA: hypothetical protein ZEAMMB73_253741 [Zea mays]
 gi|414886873|tpg|DAA62887.1| TPA: hypothetical protein ZEAMMB73_253741 [Zea mays]
          Length = 208

 Score =  107 bits (267), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 67/200 (33%), Positives = 100/200 (50%), Gaps = 61/200 (30%)

Query: 115 CRPY-EIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           C PY +   C+H      P C+ +   TPKC ++C+E   V +++  +F   +Y ++S+ 
Sbjct: 44  CDPYFDPVGCKH------PGCEPAY-PTPKCEKKCKEQNQV-WQEKKHFSIDAYRINSDP 95

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
             IM E+Y++GPVE A                                            
Sbjct: 96  HDIMAEVYKNGPVEVA-------------------------------------------- 111

Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
           FTV++D   YKSG         +GGHA++++GWG  + + E YWL+AN WN  WGD+G F
Sbjct: 112 FTVYEDFAHYKSGVYKHITGGIMGGHAVKLIGWGTSD-AGEDYWLLANQWNRGWGDDGYF 170

Query: 287 KILRGKDECGIESSITAGVP 306
           KI+RGK+ECGIE  + AG+P
Sbjct: 171 KIIRGKNECGIEEGVVAGMP 190



 Score = 38.1 bits (87), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 14/25 (56%), Positives = 22/25 (88%)

Query: 8  LCGFGCNGGFPGMAWRYWVKSGIVS 32
          +CG GC+GG+P  AWRY+V++G+V+
Sbjct: 17 MCGDGCDGGYPIEAWRYFVQNGVVT 41


>gi|290992302|ref|XP_002678773.1| predicted protein [Naegleria gruberi]
 gi|284092387|gb|EFC46029.1| predicted protein [Naegleria gruberi]
          Length = 236

 Score =  107 bits (266), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 74/237 (31%), Positives = 107/237 (45%), Gaps = 26/237 (10%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGH--- 140
           +PA FDSRTKWP+C  +  IR+Q  CGSCW     E+         +   C AS G    
Sbjct: 14  VPA-FDSRTKWPHC--VHPIRNQEQCGSCWAFSASEVL--------SDRFCIASGGKVDV 62

Query: 141 --TPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLIL 198
             +P+ +  C       Y  D  +   +++  +       +   +    G          
Sbjct: 63  VLSPQYMVSCDS---TDYGCDGGYLNNAWAFLAGTGIPSDKCAPYTSQNGDVAACPSKCQ 119

Query: 199 YKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGG 251
             S            ++ I   + D       + AF+V+ D + YKSG         LGG
Sbjct: 120 DGSSVKLYKAKNPQQLNDIPSIMEDMQQNGPVQAAFSVYRDFMSYKSGVYHHVSGSLLGG 179

Query: 252 HAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           HAI+++GWG D  + + YW+IANSW   WG NG F ILRG DECGIE ++ +G  +L
Sbjct: 180 HAIKMVGWGVDSATNKPYWIIANSWGPSWGLNGFFWILRGSDECGIEDNVWSGQAQL 236


>gi|91088083|ref|XP_968689.1| PREDICTED: similar to AGAP004533-PA [Tribolium castaneum]
          Length = 360

 Score =  107 bits (266), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 86/305 (28%), Positives = 115/305 (37%), Gaps = 100/305 (32%)

Query: 57  WMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCP-TIREIRDQGSCGSCWG- 114
           ++G+HPD N      PE+         +P  FD+R  WP C   I  IR+QG C S W  
Sbjct: 49  FLGIHPDPNFK----PEIKEPQATQNVIPETFDAREYWPECADIIGNIRNQGKCSSSWAF 104

Query: 115 ---------------------CRPYEIAPCEHHV-------------------------- 127
                                  P ++  C H+                           
Sbjct: 105 AAAEVMSDRLCIATNGKVKIQLSPEDLIDCCHYCGNQCKGGYTYYAWNYFMLTGLVSGGD 164

Query: 128 ----NGTRPSCDASKGH-TPKCVRECQ-ENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIY 181
                G +P  + +    TP C   CQ + Y +PY  D +FG   Y +  NE +I  EI 
Sbjct: 165 YNTSTGCQPYSELNYYRITPPCNTTCQNDKYPIPYVSDKHFGDSIYYIPQNETAIQNEIL 224

Query: 182 EHG-PVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDL 240
             G PV  AF V+ D  +Y+                                     D +
Sbjct: 225 SGGGPVVAAFDVYGDFKIYR-------------------------------------DGV 247

Query: 241 ILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD-NGLFKILRGKDECGIES 299
            +Y SG   G  A++I+GWG +  +   YWL ANSW  DWG   G FKI RG +ECG E 
Sbjct: 248 YIYTSGALFGRTAVKIIGWGTE--NGWAYWLAANSWGKDWGALGGFFKIRRGTNECGFEE 305

Query: 300 SITAG 304
           SI AG
Sbjct: 306 SIIAG 310


>gi|161343839|tpg|DAA06100.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 323

 Score =  107 bits (266), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 64/200 (32%), Positives = 99/200 (49%), Gaps = 45/200 (22%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDA-SKGHTPKCVREC-QENYDVPYKKDLNFGAKSYSVS- 170
           GC+PY+  PC+H+ +    +C +  +     C ++C  +NY V Y+ DL+  +  Y  S 
Sbjct: 162 GCQPYKNRPCDHYGDSRLTNCSSLRRTQMTVCRKKCVNKNYKVKYEDDLHKTSIVYMTSW 221

Query: 171 SNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA 230
           +N K I +EI  HGPV     V+++ + YK G                            
Sbjct: 222 TNVKQIQQEIMTHGPVTAFMYVYENFMGYKEG---------------------------- 253

Query: 231 EGAFTVFDDLILYKS--GKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKI 288
                      +YKS  G+ +G H ++++GWG D    E YWL  NSWN++WG++GLFKI
Sbjct: 254 -----------IYKSTTGELIGYHHVKLIGWGVDGDGTE-YWLAMNSWNSNWGNDGLFKI 301

Query: 289 LRGKDECGIESSITAGVPKL 308
           LRG + C IE  + AG+  +
Sbjct: 302 LRGYNFCSIELLVMAGIVDV 321


>gi|428180143|gb|EKX49011.1| cathepsin B-like cysteine protease [Guillardia theta CCMP2712]
          Length = 330

 Score =  106 bits (265), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 96/315 (30%), Positives = 123/315 (39%), Gaps = 114/315 (36%)

Query: 56  SWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGC 115
           S +G+  D +   + +P  +  S   +DLP +F+    WPN   +  IRDQ  CGSCW  
Sbjct: 68  SMLGLRLDRDY--SEVPVKVHSSTALKDLPESFNCYENWPN--YMHPIRDQARCGSCWAF 123

Query: 116 RPYEIAPCEHHV--NGT---------RPSCD----------------------------- 135
              E+      +  NGT           SCD                             
Sbjct: 124 AASEVLSDRFAIASNGTVNKILSPEDLVSCDKGDMGCQGGYLDKAWDYLKTNGIVTESCF 183

Query: 136 ---ASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTV 192
              A KG  P C   C +    PYKK   + A  Y   + E+ IMKEIY +GPVE     
Sbjct: 184 PYAAQKGVAPSCRISCVDGE--PYKK---YKASDYYQLTTEEDIMKEIYLNGPVEAG--- 235

Query: 193 FDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG----KA 248
                                                    F V+   + YKSG    + 
Sbjct: 236 -----------------------------------------FRVYTSFMSYKSGVYHHRI 254

Query: 249 L----GGHAIRILGWGEDEKSK-----EKYWLIANSWNTDWGDNGLFKILRGKD-----E 294
           L    GGHAI+I+GWG +   +      KYW+ ANSW  DWG NG FKI RGK+     E
Sbjct: 255 LDIMEGGHAIKIVGWGVEPPKRFWQKPTKYWICANSWTADWGMNGFFKIRRGKNRFGQSE 314

Query: 295 CGIESSITAGVPKLD 309
           CGIE  + AG PKLD
Sbjct: 315 CGIEDQVFAGHPKLD 329


>gi|308507719|ref|XP_003116043.1| hypothetical protein CRE_08645 [Caenorhabditis remanei]
 gi|308250987|gb|EFO94939.1| hypothetical protein CRE_08645 [Caenorhabditis remanei]
          Length = 356

 Score =  106 bits (265), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 72/203 (35%), Positives = 100/203 (49%), Gaps = 45/203 (22%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVP-YKKDLNFGAKSYSVSSN 172
           GC+PY  APC +        C  SK  TP C  +CQ  Y V  YK D ++G     V+  
Sbjct: 167 GCKPYSFAPCSN--------CVESKT-TPSCQSKCQSTYTVTNYKGDKHYGKNEGKVT-- 215

Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
                 E ++H     A+ +                  + A+ +I+  I  N      E 
Sbjct: 216 ------ERHKHLECTSAYRL---------------DTSSNAVPIIQNEIYQNGP---VEV 251

Query: 233 AFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
           A+TV+DD   YKSG       K  GGHA++I+GWG ++     YWL+ NSW T +GD G 
Sbjct: 252 AYTVYDDFYHYKSGVYHHVTGKDTGGHAVKIIGWGTEKGVD--YWLVTNSWGTSFGDKGF 309

Query: 286 FKILRGKDECGIESSITAGVPKL 308
           FKI RG +ECGIES++ AG+ K+
Sbjct: 310 FKIRRGTNECGIESNVVAGMAKV 332



 Score = 50.8 bits (120), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 18/37 (48%), Positives = 25/37 (67%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEI 120
           +P  FD+RT WP C +I+ +RDQ +CGSCW     E+
Sbjct: 70  IPTTFDARTNWPKCNSIKMVRDQSNCGSCWAFGAAEV 106


>gi|161343869|tpg|DAA06115.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 337

 Score =  106 bits (265), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 63/193 (32%), Positives = 91/193 (47%), Gaps = 43/193 (22%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY + P     + +     A       C R C  N  + +  D  +    Y ++   
Sbjct: 185 GCEPYRVPPSNDGNSSSSDQPLAIN---HICRRHCYGNQSIDFNDDHRYTRDYYYLTYG- 240

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            SI K++  +GP+E +F V+DD   YKSG +    N +                      
Sbjct: 241 -SIQKDVLTYGPIEASFDVYDDFPSYKSGVYVKSDNAS---------------------- 277

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                          LGGHA++++GWGE++ +   YWL+ NSWNT WGDNG FKI RG +
Sbjct: 278 --------------YLGGHAVKLIGWGEEDGT--PYWLMVNSWNTQWGDNGFFKIRRGTN 321

Query: 294 ECGIESSITAGVP 306
           ECG+++S TAGVP
Sbjct: 322 ECGVDNSTTAGVP 334



 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 33/116 (28%), Positives = 55/116 (47%), Gaps = 24/116 (20%)

Query: 23  RYWVKSGIVSGGAYGSKQA---EKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIG--- 76
           R ++   ++    Y ++QA   +++ ++NI         G++ D N P + + +L+G   
Sbjct: 3   RVFMLLSVIFVSVYATEQAYFLQEDFINNINEQATTWKAGMNFDPNTPHDDIIKLLGSRG 62

Query: 77  -----------YSEVDE-------DLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
                      Y   DE        +P +FD+R KW  C TI  +RDQG+CGSCW 
Sbjct: 63  VQNPDKVNHKLYKTHDEAYDNLFGRIPEHFDARNKWVYCDTIGRVRDQGNCGSCWA 118



 Score = 40.8 bits (94), Expect = 0.85,   Method: Compositional matrix adjust.
 Identities = 16/32 (50%), Positives = 23/32 (71%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CGFGC+GG+P  AW+ +   G+V+GG Y S +
Sbjct: 153 CGFGCHGGYPIKAWKRFSTHGLVTGGDYNSGE 184


>gi|407425570|gb|EKF39488.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi
           marinkellei]
          Length = 333

 Score =  106 bits (264), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 86/296 (29%), Positives = 115/296 (38%), Gaps = 102/296 (34%)

Query: 72  PELIGYSEVDEDLPANFDSRTKWPNCPTI------------------------------- 100
           P     +E+   L   FD+   WPNCPTI                               
Sbjct: 80  PRQFSEAELRVRLEDKFDAAEAWPNCPTITEIRDQSSCGSCWAVAAASAMSDRYCTLGGV 139

Query: 101 REIR----DQGSCGSCWG------------------------CRPYEIAPCEHHVNGTRP 132
           R++R    D  SC    G                        C+PY    C HHVN +  
Sbjct: 140 RDLRISAGDLMSCCDVCGYGCNGGFPEVAWVFYVVHGLVSEYCQPYPFPSCAHHVNSSDL 199

Query: 133 SCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTV 192
           +  +    TPKC   C E   +P  +    G  SY V S E+   +E+  +GP E AF V
Sbjct: 200 APCSGDYKTPKCNSTCTEK-KIPLIRYR--GNHSY-VLSGEEHFKRELLLNGPFEVAFEV 255

Query: 193 FDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGH 252
           + D + Y  G +                                      + +G  LGGH
Sbjct: 256 YADFMAYTGGVY-------------------------------------KHVAGDLLGGH 278

Query: 253 AIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           A+R++GWGE   + E YW IANSWN +WG NG F I RG +ECGIES+  AG P++
Sbjct: 279 AVRLVGWGE--LNGEPYWKIANSWNHEWGMNGYFLIARGVNECGIESNGVAGTPRI 332



 Score = 40.8 bits (94), Expect = 0.81,   Method: Compositional matrix adjust.
 Identities = 15/25 (60%), Positives = 21/25 (84%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVS 32
           +CG+GCNGGFP +AW ++V  G+VS
Sbjct: 155 VCGYGCNGGFPEVAWVFYVVHGLVS 179


>gi|294951797|ref|XP_002787132.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239901778|gb|EER18928.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 278

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 66/204 (32%), Positives = 89/204 (43%), Gaps = 57/204 (27%)

Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQE-NYDVPYKKDLNFGAKSYSVSS 171
           GC PY+  PC HHVN ++ P C      TP C  +C    Y    + D +F  +S     
Sbjct: 123 GCWPYDFPPCAHHVNDSKYPKCPKDSYETPNCAEQCHNPKYTTTLRDDRHFMVESSPYQY 182

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           +       I   GPV  +                                          
Sbjct: 183 SVNDAKNAIRTDGPVSAS------------------------------------------ 200

Query: 232 GAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNG 284
             FTV++D + YKSG       + LGGHA++I+GWGE+  S + YWL+ NSWN DWGD+G
Sbjct: 201 --FTVYEDFLAYKSGVYKHTSGEYLGGHAVKIIGWGEE--SGQAYWLVVNSWNEDWGDHG 256

Query: 285 LFKILRGKDECGIESSITAGVPKL 308
           LFKI  G   CGI+  +  G PK+
Sbjct: 257 LFKIALGN--CGIDDYLLGGTPKV 278



 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 20/34 (58%), Positives = 25/34 (73%), Gaps = 1/34 (2%)

Query: 82  EDLPANFDSRTKWPNCP-TIREIRDQGSCGSCWG 114
           +DLP +FD+RT +PNC   I  IRDQ +CGSCW 
Sbjct: 19  QDLPTDFDARTAFPNCSKVIGHIRDQSACGSCWA 52


>gi|323448735|gb|EGB04630.1| hypothetical protein AURANDRAFT_32318 [Aureococcus anophagefferens]
          Length = 253

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 74/217 (34%), Positives = 103/217 (47%), Gaps = 50/217 (23%)

Query: 99  TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENYDVPYK 157
            +  I D G+ G   GC  Y++ PC HHVN ++ P+C   +   PKC R+C E+ D  + 
Sbjct: 70  ALSGIVDGGNYGDKSGCWSYQLEPCAHHVNSSKYPAC-PDEVRAPKCARKC-ESEDKDWT 127

Query: 158 KDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLI 217
           K    G K YSV            + G +EG   +     +Y++G               
Sbjct: 128 KAKVKGEKGYSVC-----------QQGELEGTCAIKMAADIYQNGPI------------- 163

Query: 218 KWTIRDNTSQLGAEGAFTVFDDLILYKSG--------KALGGHAIRILGWGEDEKSKEKY 269
                         G F V  D + YKSG          LGGHAI+I+G+G ++   + Y
Sbjct: 164 -------------TGMFFVKQDFLAYKSGVYEPKLLSPPLGGHAIKIMGFGTEDG--KDY 208

Query: 270 WLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           WL+ANSWN DWGD+G FKI+RGK+ C IE  +  G P
Sbjct: 209 WLVANSWNEDWGDDGYFKIIRGKNACQIEDPVINGGP 245



 Score = 40.4 bits (93), Expect = 0.92,   Method: Compositional matrix adjust.
 Identities = 18/33 (54%), Positives = 20/33 (60%)

Query: 7  RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
          +L   GCNGG P   + YW  SGIV GG YG K
Sbjct: 51 KLGDMGCNGGIPSSVYSYWALSGIVDGGNYGDK 83


>gi|161343851|tpg|DAA06106.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 333

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 63/194 (32%), Positives = 91/194 (46%), Gaps = 44/194 (22%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY   PC  +      SC        KC ++C  N  + Y+ D  +  +S  V + +
Sbjct: 181 GCQPYMFPPCTGN-----NSCSGQSEKNHKCQKKCFGNTSISYRGDRRYVERSPYVLAYD 235

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            ++  +I  +GP+E +F V+DD I YKSG +F   N T                      
Sbjct: 236 -NMQNDIMTYGPIESSFDVYDDFISYKSGVYFKSPNAT---------------------- 272

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                          LGGH+++ +GWG +      YWL+ NSWN  WGD G FKI RG +
Sbjct: 273 --------------YLGGHSVKCIGWGVERNV--SYWLMMNSWNNTWGDGGNFKIRRGTN 316

Query: 294 ECGIESSITAGVPK 307
           EC +E S TAG+P+
Sbjct: 317 ECQVEDSSTAGMPE 330



 Score = 42.7 bits (99), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 17/30 (56%), Positives = 22/30 (73%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CG GC GG+P  AWRY+ K G+V+GG + S
Sbjct: 149 CGLGCQGGYPIRAWRYYSKHGLVTGGNFNS 178


>gi|321461662|gb|EFX72692.1| hypothetical protein DAPPUDRAFT_308155 [Daphnia pulex]
          Length = 379

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 80/267 (29%), Positives = 120/267 (44%), Gaps = 52/267 (19%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAP---CEHHVNGTRPSCDASKGH 140
           +PA FD+R +WPNCPTI EI +QGSC SCW   P ++     C H  +G+R     S G+
Sbjct: 113 IPAEFDARLRWPNCPTIGEIFEQGSCASCWAVAPTDVMSDRICIH--SGSRHIVRLSAGN 170

Query: 141 TPKCVRECQENYDVPY---------KKDLNFGAKSYSVSSNEKSIMKEIYE---HGPVEG 188
              C + C +     +         K  +  G    S    +K      Y+    G ++ 
Sbjct: 171 LLSCCKLCGKGCKGGFPGGAWMHWSKHGIVTGGSYSSDYGCQKYQFFPCYQPRTKGSIKN 230

Query: 189 AFTVFDDLIL-------------YKSGRFF------VPGNETTAMSLIKWTIRDNTSQLG 229
                D+ +L             YK   ++      +P N+  A+ L    I +N     
Sbjct: 231 KCPKTDNTLLECRETCRTSYNKSYKQDLYYGESVYRIP-NDARAIQL---EIMENGP--- 283

Query: 230 AEGAFTVFDDLILYK-------SGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
            +    +++D + YK        G+ L  HA++I GWG +  +   YWL AN W+  WG+
Sbjct: 284 VQANLRIYEDFLHYKFGVYRHVHGQGLEYHAVKIFGWGTEGGT--PYWLAANPWSKRWGN 341

Query: 283 NGLFKILRGKDECGIESSITAGVPKLD 309
            G FKILRG +   IE  + AG+PKLD
Sbjct: 342 GGFFKILRGSNHAEIEDHVMAGIPKLD 368



 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 21/32 (65%), Positives = 25/32 (78%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           +LCG GC GGFPG AW +W K GIV+GG+Y S
Sbjct: 176 KLCGKGCKGGFPGGAWMHWSKHGIVTGGSYSS 207


>gi|156708112|gb|ABU93314.1| cathepsin B5 cysteine protease [Monocercomonoides sp. PA]
          Length = 281

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 87/305 (28%), Positives = 120/305 (39%), Gaps = 110/305 (36%)

Query: 49  IPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGS 108
           I  A L++ +G       P+N +P        D  LP NFD+R +WP    I  +R+Q  
Sbjct: 36  ITTAKLRARLGAIDLNEGPSNYVP--------DTSLPDNFDAREQWPG--KILPVRNQEQ 85

Query: 109 CGSCW---------------GC-----RPYEIAPCE---HHVNGTRPSCD---------- 135
           CGSCW               GC      P ++  C+   H  NG  P             
Sbjct: 86  CGSCWAFAVAETTGNRLNILGCGRGDMSPQDLVSCDKVDHGCNGGSPLFSWEWVKHSGIT 145

Query: 136 --------ASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVE 187
                   +  G  P C ++C     +   K     AKS  +   +K +  E+Y  GP E
Sbjct: 146 TEECIPYVSGGGRVPSCPKKCTNGSAIVRTK-----AKSVGLVKGDK-MQNELYSRGPFE 199

Query: 188 GAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG- 246
            A                                            F+V++D   YKSG 
Sbjct: 200 AA--------------------------------------------FSVYEDFKSYKSGV 215

Query: 247 ------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESS 300
                 K LGGHA+ ++GWG ++ +   YWLI NSW T WG+ G FKILRGK+ECGIE++
Sbjct: 216 YHHITGKMLGGHAVMVVGWGVEDGTP--YWLIQNSWGTTWGEQGFFKILRGKNECGIETT 273

Query: 301 ITAGV 305
              G 
Sbjct: 274 CFQGT 278


>gi|395734831|ref|XP_003776483.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin B-like [Pongo abelii]
          Length = 350

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 69/204 (33%), Positives = 95/204 (46%), Gaps = 59/204 (28%)

Query: 114 GCRPY-EIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
           GCR +  + PC+HH++G         G +PKC   C+      YK D ++G  SYS+S +
Sbjct: 192 GCRLFPSLLPCKHHIHGXP---YVXTGDSPKCSMTCEPGQT--YKXDKHYGCSSYSISDS 246

Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
            K IM  IY++  VE A                                           
Sbjct: 247 TKDIMTNIYKNDXVEEA------------------------------------------- 263

Query: 233 AFTVFDDLILYK-------SGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
            F+V+ D ++YK       +G+  GGHAI ILG   +  +   YWL+AN WN DWGDNG 
Sbjct: 264 -FSVYLDFLMYKFKEYQGVTGEMXGGHAICILGCKVENSTS--YWLVANXWNRDWGDNGF 320

Query: 286 FKILRGKDECGIESSITAGVPKLD 309
           FKILRG+D  GIES + A +P  +
Sbjct: 321 FKILRGQDHYGIESEVVAEIPHTE 344



 Score = 44.3 bits (103), Expect = 0.075,   Method: Compositional matrix adjust.
 Identities = 21/46 (45%), Positives = 25/46 (54%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAH 53
           LCG GCNGG P   W +W   G+VSGG Y S    +   S +P  H
Sbjct: 159 LCGDGCNGGXPNEGWNFWTGKGLVSGGLYDSHVGCRLFPSLLPCKH 204



 Score = 43.9 bits (102), Expect = 0.081,   Method: Compositional matrix adjust.
 Identities = 21/46 (45%), Positives = 31/46 (67%), Gaps = 2/46 (4%)

Query: 69  NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           ++LP+ + ++  D +LP +FD   +WP+ P  REIRDQGS G CW 
Sbjct: 79  SKLPQRVKFAX-DINLPESFDPXEQWPDXPX-REIRDQGSYGFCWA 122


>gi|270012758|gb|EFA09206.1| cathepsin B precursor [Tribolium castaneum]
          Length = 326

 Score =  105 bits (261), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 86/319 (26%), Positives = 134/319 (42%), Gaps = 106/319 (33%)

Query: 40  QAEKNSLSNIPRAHLKSWMGVHPDYNLP----ANRLPELIGYSEVDEDLPANFDSRTKWP 95
           +AE N L    R     ++G+HPD N       +++  +I        +P +FD+R KWP
Sbjct: 38  KAETNCLDIKSRL---GFLGLHPDPNYKIQTKQHKISRII-------SIPESFDAREKWP 87

Query: 96  NCP-TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGH-----TPKCVRE-- 147
            C   I +IR+QG+CGSCW     E+         T   C +SKG      +P+ +    
Sbjct: 88  ECKDVIGKIRNQGNCGSCWAFASTEVM--------TDRLCISSKGKIKFVFSPENLLTCC 139

Query: 148 -----------------------------------CQENYDVPYK-KDLNFGAKSYSVSS 171
                                              CQ   +  ++  + +   K Y++ +
Sbjct: 140 KDCGCGCKGGYIKNAWDYYINEGIASGGDYNSSEGCQPYSESSFQYAEASECVKFYTLET 199

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           N   I  EI  +GPV   + VF+D   +KSG ++                          
Sbjct: 200 NVAQIQMEILTNGPVMAYYNVFEDFACHKSGVYY-------------------------- 233

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD-NGLFKILR 290
                      YKSGK +G H+++++GWG +E     YWLIANSW ++WG+  G FK+ R
Sbjct: 234 -----------YKSGKFVGRHSVKVIGWGTEEGIP--YWLIANSWGSEWGELGGFFKMRR 280

Query: 291 GKDECGIESSITAGVPKLD 309
           G +EC IE  +TAG   ++
Sbjct: 281 GTNECWIEQEMTAGKVHIE 299


>gi|294879717|ref|XP_002768767.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239871616|gb|EER01485.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 157

 Score =  105 bits (261), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 67/204 (32%), Positives = 88/204 (43%), Gaps = 57/204 (27%)

Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQE-NYDVPYKKDLNFGAKSYSVSS 171
           GC PY+  PC HH+N T+ P C      TP CV +C    Y    + D +F  +S     
Sbjct: 2   GCWPYDFPPCAHHINDTKYPKCPKGLYPTPNCVEQCHNPKYTTTLRDDRHFMLESSPYHY 61

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           +       I   GPV  +                                          
Sbjct: 62  SVNDAKNAIRTDGPVSAS------------------------------------------ 79

Query: 232 GAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNG 284
             FTV++D + Y+SG         LGGHA++I+GWGE  KS + YWL  NSWN DWGD+G
Sbjct: 80  --FTVYEDFLAYRSGVYKHTSGSYLGGHAVKIIGWGE--KSGQAYWLAVNSWNEDWGDHG 135

Query: 285 LFKILRGKDECGIESSITAGVPKL 308
           LFKI  G   CGI+  +  G PK+
Sbjct: 136 LFKIALG--NCGIDDDLLGGTPKV 157


>gi|328871084|gb|EGG19455.1| peptidase C1A family protein [Dictyostelium fasciculatum]
          Length = 352

 Score =  105 bits (261), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 77/282 (27%), Positives = 111/282 (39%), Gaps = 106/282 (37%)

Query: 82  EDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYE--------------------IA 121
           + +PANF+S  +W NC  I  I++Q  CGSCW     E                    + 
Sbjct: 68  QAVPANFNSAQQWSNCSYISAIQNQARCGSCWAFGAVESVSDRFCIHKGEDVLLSFQDLV 127

Query: 122 PCEHHVNG--------------------------TRPSCDASKG------HTPKCVRECQ 149
            C+   NG                          T P+C  ++        TP+CV +C 
Sbjct: 128 TCDQSDNGCQGGDAYTAMKFIQKKGIVSNDCLPYTIPTCAPAQQPCLNFVDTPQCVEKC- 186

Query: 150 ENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGN 209
            N    Y +DL+F    YS++    +I +EI  +GPVE                      
Sbjct: 187 SNASYTYAQDLHFIDGVYSMNPTVNAIQQEIMTNGPVEAC-------------------- 226

Query: 210 ETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGED 262
                                   F V++D + YKSG       K LGGH ++++GWG  
Sbjct: 227 ------------------------FEVYEDFLGYKSGVYQHTTGKDLGGHCVKMIGWGT- 261

Query: 263 EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAG 304
            ++ E YW+  NSW T WG+ G+F I  G +ECGIES + A 
Sbjct: 262 -QNNELYWICNNSWTTYWGNQGVFWIKAGVNECGIESDVVAA 302


>gi|290975216|ref|XP_002670339.1| cathepsin B-like cysteine proteinase [Naegleria gruberi]
 gi|284083897|gb|EFC37595.1| cathepsin B-like cysteine proteinase [Naegleria gruberi]
          Length = 350

 Score =  105 bits (261), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 79/271 (29%), Positives = 106/271 (39%), Gaps = 87/271 (32%)

Query: 82  EDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG----------------------CRPYE 119
            D P  FD+R +WP C  IR I++Q +CGSCW                         P  
Sbjct: 123 RDFPTQFDAREQWPQC--IRSIKNQKNCGSCWAFSASSVLADRFCIKSGGKVNVDLSPQF 180

Query: 120 IAPCEHHVNGTRPSC-DAS--------------------KGHTPKC-VRECQENYDVPYK 157
           +  C    NG      DA+                     G  P C V+ C     VP +
Sbjct: 181 MVSCSGQNNGCNGGFFDATWRFLVSVGTVSEACVPYVSFGGAVPACNVKSC----GVPGQ 236

Query: 158 KDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLI 217
           K   + A S         IM ++  +GP++ A  V+ D   YKSG +             
Sbjct: 237 KSPFYRAGSARKLEGMLDIMADLKANGPIQVAMGVYRDFYSYKSGVYH------------ 284

Query: 218 KWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWN 277
                                    + SG+ +GGHA++I+GWG D  SK  YW+ ANSW 
Sbjct: 285 -------------------------HVSGRYVGGHAVKIVGWGYDSASKLPYWICANSWG 319

Query: 278 TDWGDNGLFKILRGKDECGIESSITAGVPKL 308
            DWG  G F ILRG+ ECGI   + +G P L
Sbjct: 320 EDWGIKGYFWILRGRGECGIGKMVWSGKPAL 350


>gi|119638996|gb|ABL85239.1| cysteine proteinase 5 [Necator americanus]
          Length = 342

 Score =  104 bits (260), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 68/202 (33%), Positives = 87/202 (43%), Gaps = 54/202 (26%)

Query: 115 CRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           C+PY   PC  H N      C      TPKC + CQ  Y+V YK D  +G  +YS+    
Sbjct: 187 CKPYAFYPCGRHQNQKYFGPCPKELWPTPKCRKMCQLKYNVAYKDDKIYGNDAYSL---- 242

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
                                            P NET  M  I        +     G+
Sbjct: 243 ---------------------------------PNNETRIMQEI-------FTNGPVVGS 262

Query: 234 FTVFDDLILYKSGKAL-------GGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
           F+VF D  +YK G  +       G HA++I+GWG  +  K  YWLIANSWN DWGD G  
Sbjct: 263 FSVFADFAIYKKGVYVSNGIQQNGAHAVKIIGWGVQDGLK--YWLIANSWNNDWGDEGYV 320

Query: 287 KILRGKDECGIESSITAGVPKL 308
           + LRG + CGIES +  G  K+
Sbjct: 321 RFLRGDNHCGIESRVVTGTMKV 342



 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 21/36 (58%), Positives = 28/36 (77%)

Query: 79  EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           +++ +LP  FD+R KWPNC +IR IRDQ +CGSCW 
Sbjct: 84  DLNINLPETFDAREKWPNCTSIRTIRDQSNCGSCWA 119


>gi|187105116|ref|NP_001119618.1| cathepsin B-84 precursor [Acyrthosiphon pisum]
 gi|161343843|tpg|DAA06102.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 335

 Score =  104 bits (260), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 93/307 (30%), Positives = 136/307 (44%), Gaps = 49/307 (15%)

Query: 40  QAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPT 99
           +A++N   N P+  +   +G      +  + + E       + ++P  FDSR +W  C T
Sbjct: 40  KAKQNFPENTPKEQIVRLLGSKRLLGVSKSPIKENDELYMDNSEVPEFFDSRLEWDYCET 99

Query: 100 IREIRDQ---GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVR---ECQENYD 153
           I  +R+Q   GSC +      +    C    NG      +++  T  C R    C   Y 
Sbjct: 100 IGHVRNQGNCGSCWAHGTTGAFADRLCVA-TNGEFNELISAEELTFCCHRCGFGCNGGYP 158

Query: 154 VP----YKK---------DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFT--------- 191
           +     +K+         D   G + Y V       +K+   H    G  T         
Sbjct: 159 LKAWQYFKRHGVVTGGDYDTTDGCQPYRVPP----CVKDDEGHNSCSGQPTERNHKCSKK 214

Query: 192 -VFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLG-AEGAFTVFDDLILYKSG--- 246
              DD I YK   +        A  L   T++ +T   G  E +F V+DD + Y+SG   
Sbjct: 215 CYGDDTIDYKKNHY----KTKDAYYLKNTTMQKDTMVYGPIEASFDVYDDFMNYESGVYQ 270

Query: 247 -----KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSI 301
                  LGGHA++++GWG +E +   YWL+ NSW   WGD G+FKILRG DECGIESS 
Sbjct: 271 RTGNASYLGGHAVKMIGWGVEEGTP--YWLMVNSWGEQWGDKGMFKILRGTDECGIESSC 328

Query: 302 TAGVPKL 308
           TAGVP +
Sbjct: 329 TAGVPSV 335



 Score = 45.4 bits (106), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 17/30 (56%), Positives = 24/30 (80%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           CGFGCNGG+P  AW+Y+ + G+V+GG Y +
Sbjct: 149 CGFGCNGGYPLKAWQYFKRHGVVTGGDYDT 178


>gi|156708106|gb|ABU93311.1| cathepsin B2 cysteine protease [Monocercomonoides sp. PA]
          Length = 282

 Score =  104 bits (260), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 76/264 (28%), Positives = 107/264 (40%), Gaps = 88/264 (33%)

Query: 79  EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW---------------GC-----RPY 118
           E D  LP NFD+R +WP    I  +RDQ SCGSCW               GC      P 
Sbjct: 58  ESDNALPENFDAREQWPE--QILPVRDQASCGSCWAFSVAETMGDRLSIIGCGRGHMSPQ 115

Query: 119 EIAPCEHHVNG------------------TRPSC---DASKGHTPKCVRECQENYDVPYK 157
           ++  C+    G                  T   C    +  G  P C  +C     +   
Sbjct: 116 DLVSCDTTDMGCNGGYMDKAWAWTKSHGVTNEECMPYQSGGGRVPACPAKCVNGSTIVRT 175

Query: 158 KDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLI 217
           K  +F   + S       + +E+YE+GP+  AFTV+ D + YKSG +             
Sbjct: 176 KSQSFTHFTAS------QMQQELYENGPLSVAFTVYYDFMNYKSGVY------------- 216

Query: 218 KWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWN 277
                                   ++K+G   GGHA+  +GWG ++ +   YWL  NSW 
Sbjct: 217 ------------------------VHKTGGVAGGHAVLCIGWGVEDNTP--YWLCQNSWG 250

Query: 278 TDWGDNGLFKILRGKDECGIESSI 301
             WG+ G FKILRG + CGIE+ +
Sbjct: 251 PAWGEKGHFKILRGSNHCGIENQV 274


>gi|209863079|ref|NP_001119613.2| cathepsin B precursor [Acyrthosiphon pisum]
          Length = 323

 Score =  104 bits (260), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 63/200 (31%), Positives = 99/200 (49%), Gaps = 45/200 (22%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDA-SKGHTPKCVREC-QENYDVPYKKDLNFGAKSYSVS- 170
           GC+PY+  PC+H+ +    +C +  +     C ++C  +NY V Y+ DL+  +  Y  S 
Sbjct: 162 GCQPYKNRPCDHYGDSRLTNCSSLRRTQMTVCRKKCVNKNYKVKYEDDLHKTSIVYMTSW 221

Query: 171 SNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA 230
           +N K I +EI  +GPV     V+++ + YK G                            
Sbjct: 222 TNVKQIQQEIMTYGPVTAFMYVYENFMGYKEG---------------------------- 253

Query: 231 EGAFTVFDDLILYKS--GKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKI 288
                      +YKS  G+ +G H ++++GWG D    E YWL  NSWN++WG++GLFKI
Sbjct: 254 -----------IYKSTTGELIGYHHVKLIGWGVDGDGTE-YWLAMNSWNSNWGNDGLFKI 301

Query: 289 LRGKDECGIESSITAGVPKL 308
           LRG + C IE  + AG+  +
Sbjct: 302 LRGYNFCSIELLVMAGIVDV 321


>gi|308504721|ref|XP_003114544.1| hypothetical protein CRE_27547 [Caenorhabditis remanei]
 gi|308261929|gb|EFP05882.1| hypothetical protein CRE_27547 [Caenorhabditis remanei]
          Length = 358

 Score =  104 bits (259), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 61/199 (30%), Positives = 87/199 (43%), Gaps = 41/199 (20%)

Query: 113 WGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQEN--YDVPYKKDLNFGAKSYSVS 170
           +GC+PY I PC+        S      HTP C   C  N  + + YK+D +FG   Y+V 
Sbjct: 196 FGCKPYSIYPCDKKYPNGTTSVPCPGYHTPTCEEHCTSNITWPIAYKQDKHFGKAHYNVG 255

Query: 171 SNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA 230
                I  EI  +GPV  +F ++DD   YKSG                            
Sbjct: 256 KKMTDIQTEIMTNGPVIASFVIYDDFWDYKSG---------------------------- 287

Query: 231 EGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILR 290
                    + ++ +G   GG   +I+GWG D  S   YWL  + W TD+G+NG  + LR
Sbjct: 288 ---------IYVHTAGDQEGGMDTKIIGWGVD--SGVPYWLCVHQWGTDFGENGFVRFLR 336

Query: 291 GKDECGIESSITAGVPKLD 309
           G +E  IE  + A +P +D
Sbjct: 337 GVNEVNIEHQVLAALPDID 355


>gi|51947600|gb|AAU14266.1| cathepsin B-N [Myzus persicae]
          Length = 338

 Score =  103 bits (258), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 62/201 (30%), Positives = 89/201 (44%), Gaps = 56/201 (27%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY + PC +   G             +C R C  + D+ + +D  +    Y ++   
Sbjct: 183 GCEPYRVPPCPNDDQGNNTCAGKPMESNHRCTRMCYGDQDLDFDEDHRYTRDYYYLTYG- 241

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            SI K++  +GP+E +                                            
Sbjct: 242 -SIQKDVMTYGPIEAS-------------------------------------------- 256

Query: 234 FTVFDDLILYKSG--------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
           F V+DD   YKSG          LGGHA++++GWGE+      YWL+ NSWN DWGD+G 
Sbjct: 257 FDVYDDFPSYKSGVYVKSENASYLGGHAVKLIGWGEEYGVP--YWLMVNSWNEDWGDHGF 314

Query: 286 FKILRGKDECGIESSITAGVP 306
           FKI RG +ECG+++S TAGVP
Sbjct: 315 FKIQRGTNECGVDNSTTAGVP 335



 Score = 48.5 bits (114), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 34/108 (31%), Positives = 51/108 (47%), Gaps = 24/108 (22%)

Query: 30  IVSGGAYGSKQA---EKNSLSNIPRAHLKSW-MGVHPDYNLPANRLPELIG--------- 76
           ++    Y ++QA   EK+ + NI  A   +W  GV+ D       + +L+G         
Sbjct: 10  VIFVSVYMTEQAYFLEKDFIDNI-NAQATTWKAGVNFDPKTSKEHIMKLLGSRGVQIPNK 68

Query: 77  -----YSEVDED-----LPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
                Y   D +     +P  FD+R KW +C TI  +RDQG+CGSCW 
Sbjct: 69  NNMNLYKSEDAEYDNTYIPRFFDARRKWRHCSTIGRVRDQGNCGSCWA 116



 Score = 44.3 bits (103), Expect = 0.079,   Method: Compositional matrix adjust.
 Identities = 18/32 (56%), Positives = 24/32 (75%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CGFGCNGG+P  AW+ + K G+V+GG Y S +
Sbjct: 151 CGFGCNGGYPIKAWKRFSKKGLVTGGDYKSGE 182


>gi|204022106|dbj|BAG71150.1| cathepsin B-N [Astegopteryx spinocephala]
          Length = 332

 Score =  103 bits (258), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 68/210 (32%), Positives = 98/210 (46%), Gaps = 60/210 (28%)

Query: 107 GSCGSCWGCRPYEIAPC--EHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGA 164
           G+  S  GC+PY ++PC  + + N T     A K H  +C R C  + D  +K+D  F  
Sbjct: 173 GNYDSSEGCQPYRVSPCPLDEYGNNTCRGKPAEKNH--RCTRMCYGDQDRDFKEDHRFTR 230

Query: 165 KSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDN 224
            +Y ++    +I K++  +GP+E +                                   
Sbjct: 231 DAYYLTYG--TIQKDVMTYGPIEAS----------------------------------- 253

Query: 225 TSQLGAEGAFTVFDDLILYKSG--------KALGGHAIRILGWGEDEKSKEKYWLIANSW 276
                    + V+DD   YKSG          LGGHA++++GWGE+      YWL+ NSW
Sbjct: 254 ---------YEVYDDFPSYKSGVYVRTENATYLGGHAVKLIGWGEEYGVP--YWLMVNSW 302

Query: 277 NTDWGDNGLFKILRGKDECGIESSITAGVP 306
           N  WGD GLFKI RG +ECGI++S T GVP
Sbjct: 303 NDQWGDRGLFKIRRGTNECGIDNSTTGGVP 332



 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 19/34 (55%), Positives = 24/34 (70%)

Query: 81  DEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           ++ +P  FD+R KW  C TI E+RDQG CGSCW 
Sbjct: 80  NQRIPKFFDARKKWRKCSTIGEVRDQGKCGSCWA 113



 Score = 40.4 bits (93), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 16/32 (50%), Positives = 23/32 (71%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CG+GC+GG+P  AW  + K G+V+GG Y S +
Sbjct: 148 CGYGCHGGYPIKAWERFKKHGLVTGGNYDSSE 179


>gi|291000017|ref|XP_002682576.1| cathepsin C [Naegleria gruberi]
 gi|284096203|gb|EFC49832.1| cathepsin C [Naegleria gruberi]
          Length = 430

 Score =  103 bits (257), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 83/274 (30%), Positives = 116/274 (42%), Gaps = 69/274 (25%)

Query: 78  SEVDEDLPANFDSRTKWPNC---PTIREIRDQGSCGSCW--------GCR---------- 116
           S+  E L A+  +   W N      +  +R+Q  CGSC+        G R          
Sbjct: 179 SQDAEKLRASLPTEFDWTNVNGRDFVVPVRNQEQCGSCYAFSSSDMFGSRVRIPSNLTQV 238

Query: 117 ----PYEIAPCEHHVNG------------------TRPSCDASKGH-TPKCVRECQENYD 153
               P +I  C  +  G                  T  SCD  +GH   KC  +C  N  
Sbjct: 239 PVYSPQDIVDCSAYSQGCDGGFPFLVGKYAMDYGLTVESCDPYQGHDLGKCSNQCPVNRQ 298

Query: 154 VPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRF-FVPGNETT 212
                   +    Y  +S+E S+M EIY++GP+   F V+ DL  YK G +  V   E  
Sbjct: 299 QRLHSSNYYFVGGYYGNSHELSMMHEIYQNGPLAIGFEVYPDLRNYKHGVYKHVTAEELK 358

Query: 213 AMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLI 272
           A  L +                   D++I +     +  HA+ ++GWG +  +   YW I
Sbjct: 359 AQGLSE-------------------DEMIPHFE---VVNHAVLMVGWGVENGTP--YWKI 394

Query: 273 ANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
            NSW+T WGDNG FKILRG DECG+ES   AG+P
Sbjct: 395 KNSWSTTWGDNGYFKILRGSDECGVESDAEAGIP 428


>gi|341886633|gb|EGT42568.1| hypothetical protein CAEBREN_17563 [Caenorhabditis brenneri]
          Length = 358

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 63/199 (31%), Positives = 86/199 (43%), Gaps = 41/199 (20%)

Query: 113 WGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVP--YKKDLNFGAKSYSVS 170
           +GC+PY I PC+        S      HTP C   C  N   P  YK+D +FG   Y+V 
Sbjct: 196 FGCKPYTIYPCDKKYPNGTTSVPCPGYHTPVCEERCTSNITWPISYKQDKHFGKAHYNVG 255

Query: 171 SNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA 230
                I  EI  +GPV  +F ++DD   YKSG                            
Sbjct: 256 KKMTDIQTEIMRNGPVIASFIIYDDFWDYKSG---------------------------- 287

Query: 231 EGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILR 290
                    + ++ +G   GG   +I+GWG D  +   YWL  + W TD+G+NG  +ILR
Sbjct: 288 ---------IYVHTAGDQEGGMDTKIIGWGVD--NGVPYWLCVHQWGTDFGENGFVRILR 336

Query: 291 GKDECGIESSITAGVPKLD 309
           G +E  IE  + A  P LD
Sbjct: 337 GVNEVNIEHQVLAAQPDLD 355


>gi|197304333|dbj|BAG69285.1| cathepsin B-like cysteine protease [Raphanus sativus]
          Length = 343

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 91/335 (27%), Positives = 130/335 (38%), Gaps = 118/335 (35%)

Query: 37  GSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDED----LPANFDSRT 92
           G K A  +  SN   A  K  +GV P       +L  L+G   V  D    LP +FD+RT
Sbjct: 59  GWKAAINDRFSNATVAEFKRLLGVKPT----PKKL--LLGVPVVSHDQSLKLPKSFDART 112

Query: 93  KWPNCPT------------------IREIRDQ-----------------GSCG------- 110
            WP C +                  +  + D+                   CG       
Sbjct: 113 HWPQCTSIGKILDQGHCGSCWAFGAVESLSDRFCIQFGMNITLSVNDLLACCGFRCGDGC 172

Query: 111 ------SCW------GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKK 158
                 S W      G    E  P       + P C+ +  +TP+C+R+C     + + +
Sbjct: 173 DGGYPISAWQYFSYSGVVTEECDPYFDQTGCSHPGCEPAY-NTPQCLRKCVGRNQL-WSE 230

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
             ++   +Y V SN + IM EIY++GPVE +                             
Sbjct: 231 SKHYSINTYVVESNPQDIMAEIYKNGPVEVS----------------------------- 261

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWL 271
                          FTV++D   YKSG         +GGHA++++GWG  +   E YWL
Sbjct: 262 ---------------FTVYEDFAHYKSGVYKHITGSNIGGHAVKLIGWGTTDDG-EDYWL 305

Query: 272 IANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           +AN WN  WGD+G F I RG +ECGIE    AG+P
Sbjct: 306 LANQWNRSWGDDGYFMIRRGTNECGIEDEPVAGLP 340


>gi|268561802|ref|XP_002638421.1| C. briggsae CBR-CPR-3 protein [Caenorhabditis briggsae]
          Length = 375

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 69/206 (33%), Positives = 93/206 (45%), Gaps = 65/206 (31%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVP-YKKDLNFGAKSYSVSSN 172
           GC PY   PC+       P  + S   TP C   CQE Y    YK D +F   +Y +S+ 
Sbjct: 192 GCMPYSFPPCKK-----SPCVEFS---TPSCKTTCQEKYTTADYKNDKHFATSAYKLSTT 243

Query: 173 EK---SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLG 229
           +    +I  EIY +GPVE +                                        
Sbjct: 244 KNAVPTIQYEIYHNGPVEAS---------------------------------------- 263

Query: 230 AEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
               + VF+D   YKSG         +GGHA++I+GWG +  +   YWL+ANSW T +G+
Sbjct: 264 ----YRVFEDFYQYKSGVYHHVSGNLVGGHAVKIIGWGTE--NGVDYWLVANSWGTSFGE 317

Query: 283 NGLFKILRGKDECGIESSITAGVPKL 308
            G FKI RG +EC IES+I AG+ KL
Sbjct: 318 KGFFKIRRGTNECQIESNIVAGLAKL 343



 Score = 38.1 bits (87), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 15/28 (53%), Positives = 20/28 (71%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
           CG GC GG+   A +YW+ SG+V+GG Y
Sbjct: 161 CGKGCQGGYTIEAMKYWMNSGVVTGGDY 188


>gi|312382740|gb|EFR28091.1| hypothetical protein AND_04395 [Anopheles darlingi]
          Length = 381

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 81/306 (26%), Positives = 121/306 (39%), Gaps = 117/306 (38%)

Query: 70  RLPELIGYSEVDEDLPANFDSRTKWPNCP---TIRE---------------------IRD 105
           +LP+ I     +E  P +FD+R KW  CP   TIR                      I  
Sbjct: 121 KLPQGIVLKLQEEPFPESFDARQKWSFCPSVGTIRNQGCCASSYAVAAVATITDRWCIHS 180

Query: 106 QGSCGSCWGCRPYEIAPCEHHV----NGTRPS---------------------------- 133
           +G     +G   Y++  C H      +G  PS                            
Sbjct: 181 EGKSQFSFG--AYDVLSCCHRCGFGCDGGVPSAVWHYWVENGITSGGAYESHEGCQSYPF 238

Query: 134 --CDASKGHTPK----CVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVE 187
             C   +   P     C+R+CQ  Y+  Y +D +FG  +YSV  +E  I+ E++  GPV+
Sbjct: 239 GVCKPQEIFAPHVDLICLRQCQPGYNTTYLEDKHFGRVAYSVPRDEDRILYELFYFGPVQ 298

Query: 188 GAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGK 247
            +F                                            TV+ D I YKSG 
Sbjct: 299 ASF--------------------------------------------TVYTDFIQYKSGV 314

Query: 248 -------ALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESS 300
                   +G H+++I+GWG +  +K  +WL ANSW  +WG+NG FKI+RG+D   +ES+
Sbjct: 315 YRHTYGVRVGDHSVKIVGWGVENGTK--FWLCANSWGAEWGENGFFKIIRGEDHLSVESN 372

Query: 301 ITAGVP 306
           + AG+P
Sbjct: 373 VVAGLP 378



 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 20/32 (62%), Positives = 24/32 (75%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CGFGC+GG P   W YWV++GI SGGAY S +
Sbjct: 200 CGFGCDGGVPSAVWHYWVENGITSGGAYESHE 231


>gi|161343861|tpg|DAA06111.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 323

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 63/198 (31%), Positives = 100/198 (50%), Gaps = 41/198 (20%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDA-SKGHTPKCVREC-QENYDVPYKKDLNFGAKSYSVS- 170
           GC+PY+  PC+H+ + +  +C +  +     C  +C  +NY V Y+ DL   +  Y  S 
Sbjct: 162 GCQPYKNRPCDHYGDSSLTNCSSLRRTQMMFCRDKCVNKNYKVKYEDDLYKTSVVYMTSW 221

Query: 171 SNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA 230
           +N K I +EI  +GPV     V+++ + YK G +     ++TA                 
Sbjct: 222 TNVKQIQQEIMTYGPVTAFMYVYENFMGYKEGVY-----KSTA----------------- 259

Query: 231 EGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILR 290
                          G+ +G H ++++GWG DE   E YWL  NSWN++WG++GLFKILR
Sbjct: 260 ---------------GELIGYHHVKLIGWGVDEAGIE-YWLAMNSWNSNWGNDGLFKILR 303

Query: 291 GKDECGIESSITAGVPKL 308
           G + C IE  + AG+  +
Sbjct: 304 GYNFCSIELLVMAGLVDV 321


>gi|290989996|ref|XP_002677623.1| cathepsin B [Naegleria gruberi]
 gi|284091231|gb|EFC44879.1| cathepsin B [Naegleria gruberi]
          Length = 321

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 85/285 (29%), Positives = 111/285 (38%), Gaps = 114/285 (40%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCWG----------------------CRPYEIA 121
           LP NFDSR +W  C  I  IR+Q  CGSCW                         P ++ 
Sbjct: 86  LPTNFDSRQQWGKC--IHPIRNQEQCGSCWAFSASESLSDRFCIASNGKVDVILSPQDMV 143

Query: 122 PCEHHVNGTRPSCDASK-------------------------GHTPKCVRECQENYDVPY 156
            C+++  G    CD                            G+ P C   C    ++P 
Sbjct: 144 SCDYNDMG----CDGGNLDNAWWWMKNKGIVPDSCMPYVSGGGNVPACPSNCNGT-NIPI 198

Query: 157 KKDLNFGAKSYSVSS------NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNE 210
              L + AKS+S  S          I +EIY +GPV+G                      
Sbjct: 199 SSQLYY-AKSFSHISPWMFWERVADIQQEIYTNGPVQGG--------------------- 236

Query: 211 TTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDE 263
                                  F+V+ D + YKSG         LGGHAI+I+GWG + 
Sbjct: 237 -----------------------FSVYQDFMNYKSGVYSHKTGSFLGGHAIKIIGWGVE- 272

Query: 264 KSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
                YWL+ANSW+TDWG +G FKILRG +ECGIE  + AG   L
Sbjct: 273 -GGVDYWLVANSWSTDWGIDGTFKILRGHNECGIEDDVYAGPADL 316


>gi|343477197|emb|CCD11909.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 84/271 (30%), Positives = 110/271 (40%), Gaps = 45/271 (16%)

Query: 65  NLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCE 124
            LP  R  E     ++  DLP +FD+   WP+CPTIREI DQ +C + W           
Sbjct: 76  TLPPARFTE----EQLRTDLPESFDAAEHWPHCPTIREIADQSACRASWAVATASAISDR 131

Query: 125 HHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIY--- 181
           +   G       S      C ++C    +  Y       A  Y VS    S   + Y   
Sbjct: 132 YCTVGKGKQLRISAADLMACCKDCGGGCEGGYPD----AAWEYYVSHGIASSQCQPYPFP 187

Query: 182 --EHGPVEGAFTVFDDLILYKSGRFFVPGNETT----AMSLIKWTIRDNTSQLGAEG--- 232
             EH   +G  T           +F  P    T     + LIK+    +    G E    
Sbjct: 188 RCEHRGAQGKKTPCSKY------KFVTPQCNATCTDKTIPLIKYRGNHSYEVRGEEDYKR 241

Query: 233 ----------AFTVFDDLILYK-------SGKALGGHAIRILGWGEDEKSKEKYWLIANS 275
                      F V  D + YK       +G  LGG A+RI+GWG+   +   YW +ANS
Sbjct: 242 ELYFNGPFVVRFQVHSDFLAYKNGVYQHVAGNFLGGKAVRIVGWGKLNGT--PYWKVANS 299

Query: 276 WNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           W+TDWG NG F ILRG +EC IE    AG P
Sbjct: 300 WDTDWGMNGYFLILRGDNECNIEHLGFAGTP 330


>gi|268578113|ref|XP_002644039.1| Hypothetical protein CBG17499 [Caenorhabditis briggsae]
          Length = 355

 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 60/199 (30%), Positives = 89/199 (44%), Gaps = 41/199 (20%)

Query: 113 WGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQEN--YDVPYKKDLNFGAKSYSVS 170
           +GC+PY I PC+ +      S      HTP C   C  N  + + YK+D +FG   Y+V 
Sbjct: 193 FGCKPYSIYPCDKNYPNGTTSVPCPGYHTPPCEDHCTSNITWPIAYKQDKHFGKAHYNVG 252

Query: 171 SNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA 230
                I  EI  +GPV  +F +++D   YKSG +                          
Sbjct: 253 KKMTDIQTEIMTNGPVIASFIIYEDFWDYKSGIY-------------------------- 286

Query: 231 EGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILR 290
                      ++ +G   GG   +I+GWG D  +   YWL  + W TD+G+NG  +ILR
Sbjct: 287 -----------VHTAGDQEGGMDTKIIGWGVD--NGVPYWLCVHQWGTDFGENGFVRILR 333

Query: 291 GKDECGIESSITAGVPKLD 309
           G +E  IE  + A +P +D
Sbjct: 334 GVNEVNIEHQVLAALPDVD 352


>gi|327408413|emb|CCA30060.1| unnamed protein product [Neospora caninum Liverpool]
          Length = 463

 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 68/211 (32%), Positives = 100/211 (47%), Gaps = 65/211 (30%)

Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDAS--KGHTPKCVRECQE----NYDVPYKKDL 160
           G   +CW   PYEI  C HH     P+CD       TPKC ++C+E     + +P+ KD+
Sbjct: 268 GKGTTCW---PYEIPFCAHHAKAPFPNCDTDVRPRKTPKCRKDCEEAAYSEHVLPFDKDV 324

Query: 161 NFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWT 220
           +  + SYS+ S + ++ +++  HG V                                  
Sbjct: 325 HKASSSYSLRSRD-AVKRDMMAHGTVT--------------------------------- 350

Query: 221 IRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIA 273
                      GAF V++D + YKSG         LGGHAI+I+GWG ++   E+YW   
Sbjct: 351 -----------GAFMVYEDFLNYKSGVYKHVYGGPLGGHAIKIIGWGTEDG--EEYWHAV 397

Query: 274 NSWNTDWGDNGLFKILRGKDECGIESSITAG 304
           NSWNT WGD+G FKI  G  +CG+++ + AG
Sbjct: 398 NSWNTYWGDSGHFKIEMG--QCGVDNEMVAG 426



 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 21/45 (46%), Positives = 28/45 (62%), Gaps = 1/45 (2%)

Query: 71  LPELIGYSEVDEDLPANFDSRTKWPNCP-TIREIRDQGSCGSCWG 114
           LP    +   +E +PANFD+RT +P C   +  +RDQG CGSCW 
Sbjct: 155 LPAKTVFENANEPVPANFDARTAFPVCKDVVGHVRDQGDCGSCWA 199



 Score = 44.3 bits (103), Expect = 0.073,   Method: Compositional matrix adjust.
 Identities = 17/33 (51%), Positives = 24/33 (72%)

Query: 6   IRLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           I    FGCNGG PGMAWR++ + G+V+GG + +
Sbjct: 234 IHCASFGCNGGQPGMAWRWFERKGVVTGGDFDT 266


>gi|320167003|gb|EFW43902.1| cathepsin B [Capsaspora owczarzaki ATCC 30864]
          Length = 306

 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 80/300 (26%), Positives = 111/300 (37%), Gaps = 119/300 (39%)

Query: 69  NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG-------------- 114
            +L E      + + +P +FD+RT+WP   +I  IRDQ  CGSCW               
Sbjct: 66  TKLREFPVVDTIVDAIPTSFDARTQWP--ASIHPIRDQQQCGSCWAFGATEALSDRLAIA 123

Query: 115 --------CRPYEIAPCE---------------HHV----------------NGTRPSCD 135
                     P ++  C+               H++                NG   +C 
Sbjct: 124 SNNSINVVLSPQDLVSCDSTDYGCDGGYPINAWHYMQSLGVVTDTCYPYTSGNGDSGTCQ 183

Query: 136 ASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDD 195
            +   TP C           YK        +Y V++N  +I  EI  +GPVE A      
Sbjct: 184 ITGKKTPACATA------TFYKAK-----TAYQVANNMAAIQSEILANGPVEAA------ 226

Query: 196 LILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILY-------KSGKA 248
                                                 F+V+DD   Y       +SG  
Sbjct: 227 --------------------------------------FSVYDDFFSYTSGVYSHQSGAL 248

Query: 249 LGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
            GGHA++I+GWG D  +   YW++ANSW T WG  G F I RG DECGIE  I AG+  +
Sbjct: 249 DGGHAVKIVGWGVDGTTP--YWIVANSWGTSWGQAGFFWIKRGNDECGIEDGIVAGLAAV 306


>gi|297723949|ref|NP_001174338.1| Os05g0310500 [Oryza sativa Japonica Group]
 gi|255676228|dbj|BAH93066.1| Os05g0310500, partial [Oryza sativa Japonica Group]
          Length = 234

 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 66/200 (33%), Positives = 96/200 (48%), Gaps = 61/200 (30%)

Query: 115 CRPY-EIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           C PY +   C+H      P C+ +   TP C ++C+    V  +K  +F   +Y V+S+ 
Sbjct: 68  CDPYFDQVGCKH------PGCEPAY-PTPVCEKKCKVQNQVWLEKK-HFSVNAYRVNSDP 119

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
             IM E+Y++GPVE A                                            
Sbjct: 120 HDIMAEVYQNGPVEVA-------------------------------------------- 135

Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
           FTV++D   YKSG         +GGHA++++GWG  + + E YWL+AN WN  WGD+G F
Sbjct: 136 FTVYEDFAHYKSGVYKHITGGMMGGHAVKLIGWGTTD-AGEDYWLLANQWNRGWGDDGYF 194

Query: 287 KILRGKDECGIESSITAGVP 306
           KI+RG +ECGIE  + AG+P
Sbjct: 195 KIIRGTNECGIEEDVVAGMP 214



 Score = 41.2 bits (95), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 15/25 (60%), Positives = 23/25 (92%)

Query: 8  LCGFGCNGGFPGMAWRYWVKSGIVS 32
          +CG GC+GG+P MAWRY+V++G+V+
Sbjct: 41 MCGDGCDGGYPIMAWRYFVRNGVVT 65


>gi|62320420|dbj|BAD94873.1| cathepsin B-like cysteine proteinase like protein [Arabidopsis
           thaliana]
          Length = 183

 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 65/200 (32%), Positives = 95/200 (47%), Gaps = 61/200 (30%)

Query: 115 CRPY-EIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           C PY +   C H      P C+ +   TPKC R+C     + + +  ++G  +Y ++ + 
Sbjct: 17  CDPYFDNTGCSH------PGCEPTY-PTPKCERKCVSRNQL-WGESKHYGVGAYRINPDP 68

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           + IM E+Y++GPVE A                                            
Sbjct: 69  QDIMAEVYKNGPVEVA-------------------------------------------- 84

Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
           FTV++D   YKSG         +GGHA++++GWG  +   E YWL+AN WN  WGD+G F
Sbjct: 85  FTVYEDFAHYKSGVYKYITGTKIGGHAVKLIGWGTSDDG-EDYWLLANQWNRSWGDDGYF 143

Query: 287 KILRGKDECGIESSITAGVP 306
           KI RG +ECGIE S+ AG+P
Sbjct: 144 KIRRGTNECGIEQSVVAGLP 163


>gi|161343877|tpg|DAA06119.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 145

 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 62/186 (33%), Positives = 87/186 (46%), Gaps = 43/186 (23%)

Query: 122 PCEHHVNGTRPSCDASKGHTPKCVREC-QENYDVPYKKDLNFGAKSYSVSSNEKSIMKEI 180
           PC+H  +     C      TP+C  +C   +Y   Y KD N     Y +     + MKEI
Sbjct: 1   PCQHTESAVENPCSNKTFFTPECKVQCYNPDYGTRYVKD-NHKGTQYRIPG--YTAMKEI 57

Query: 181 YEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDL 240
           YE+GP+  +F ++ D + Y+SG +                                    
Sbjct: 58  YENGPITASFYMYQDFVNYQSGVY------------------------------------ 81

Query: 241 ILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESS 300
             + SGK +   A++ILGWGE+  +   YWL ANS+NT WGDNG  KILRG +EC IE  
Sbjct: 82  -AFNSGKYVTTQAVKILGWGEENGTP--YWLAANSFNTYWGDNGFVKILRGANECYIEEF 138

Query: 301 ITAGVP 306
           + AG+P
Sbjct: 139 MYAGLP 144


>gi|166030320|gb|ABY78827.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 86/329 (26%), Positives = 118/329 (35%), Gaps = 104/329 (31%)

Query: 39  KQAEKNSLSNIPRAHLKSWMG--VHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
           K      + NI  A  K   G  +    +LP  R  E     ++  +LP +FDS  KWPN
Sbjct: 47  KAVYNGKMQNITFAEAKRLTGAWIQKTSSLPPVRFTE----EQLRTELPESFDSAEKWPN 102

Query: 97  CPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHT--------------- 141
           CPTIREI DQ +C + W      +    +   G       S  H                
Sbjct: 103 CPTIREIADQSACRASWAVSTASVISDRYCTVGGVQQLRISAAHLLSCCKQCGGGCKGGF 162

Query: 142 -----------------------PKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSI-- 176
                                  P C     +    P  K  NF     + +  +KSI  
Sbjct: 163 PGFAWRYYVEYGIASSYCQPYPFPHCEHRGAQGNKTPCSK-YNFDTPKCNATCTDKSIPL 221

Query: 177 ------------------MKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
                              +E+Y +GP    F V+ DL  YKSG                
Sbjct: 222 VKYRGNATYLLLHGEEDYKRELYFNGPFVAVFFVYTDLFAYKSG---------------- 265

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
                            V+ ++     G  LGG A+RI+GWG+   +   YW +AN+W+T
Sbjct: 266 -----------------VYRNV----DGDILGGQAVRIVGWGKLNGT--PYWKVANTWDT 302

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
           DWG +G   ILRG +EC IE    AG P+
Sbjct: 303 DWGMDGYLLILRGNNECNIEHLGFAGTPE 331


>gi|161343821|tpg|DAA06091.1| TPA_inf: cathepsin B [Aphis gossypii]
          Length = 196

 Score =  101 bits (252), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 62/201 (30%), Positives = 89/201 (44%), Gaps = 56/201 (27%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY + PC +   G             +C R C  + ++ + +D  +    Y ++   
Sbjct: 41  GCEPYRVPPCPYDEQGNNTCAGKPMEKNHRCTRICYGDQELDFDEDHRYTRDYYYLTYG- 99

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            SI K++  +GP+E +                                            
Sbjct: 100 -SIQKDVMTYGPIEAS-------------------------------------------- 114

Query: 234 FTVFDDLILYKSG--------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
           F V+ D   YKSG          LGGHA++++GWGE  +    YWL+ NSWN DWGDNGL
Sbjct: 115 FDVYSDFPSYKSGIYERTENATYLGGHAVKLIGWGE--QYGIPYWLMVNSWNEDWGDNGL 172

Query: 286 FKILRGKDECGIESSITAGVP 306
           FKI RG +ECG+++S TAGVP
Sbjct: 173 FKIRRGTNECGVDNSTTAGVP 193



 Score = 39.3 bits (90), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 16/33 (48%), Positives = 23/33 (69%)

Query: 9  CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
          CGFGC+GG+P  AW+ +   G+V+GG Y S + 
Sbjct: 9  CGFGCHGGYPIRAWKRFKNHGLVTGGDYKSGEG 41


>gi|404250524|gb|AFR54113.1| cysteine proteinase, partial [Haemonchus contortus]
          Length = 332

 Score =  101 bits (252), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 56/178 (31%), Positives = 87/178 (48%), Gaps = 41/178 (23%)

Query: 115 CRPYEIAPCEHHVNGTRPSCDASKG-HTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           C+PY + PC +H  G   SC       TP C + CQ  Y   Y+KD ++    Y +  +E
Sbjct: 194 CKPYPLHPCGNH-GGKFWSCPRDHSFRTPACKKYCQYGYGKRYEKDKSYVKSVYILDEDE 252

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K+I +E+ ++GPV+ AF  ++D   Y  G                               
Sbjct: 253 KAIQREMMKNGPVQAAFITYEDFSFYTKG------------------------------- 281

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
                 + ++  G+  G HA++++GWG +  +K  YW +ANSW+TDWG+NG F+ILRG
Sbjct: 282 ------IYVHTRGRQRGAHAVKVVGWGVENGTK--YWNVANSWSTDWGENGYFRILRG 331



 Score = 48.1 bits (113), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 19/39 (48%), Positives = 26/39 (66%)

Query: 81  DEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYE 119
           ++D+P +FDSR  W +C +I  IRDQ +CGSCW     E
Sbjct: 92  NDDIPESFDSREVWKSCSSITYIRDQSNCGSCWAVSAAE 130


>gi|239938580|gb|ACS36089.1| cysteine proteinase [Haemonchus contortus]
          Length = 332

 Score =  101 bits (252), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 56/178 (31%), Positives = 87/178 (48%), Gaps = 41/178 (23%)

Query: 115 CRPYEIAPCEHHVNGTRPSCDASKG-HTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           C+PY + PC +H  G   SC       TP C + CQ  Y   Y+KD ++    Y +  +E
Sbjct: 194 CKPYPLHPCGNH-GGKFWSCPRDHSFRTPACKKYCQYGYGKRYEKDKSYVKSVYILDEDE 252

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K+I +E+ ++GPV+ AF  ++D   Y  G                               
Sbjct: 253 KAIQREMMKNGPVQAAFITYEDFSFYTKG------------------------------- 281

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
                 + ++  G+  G HA++++GWG +  +K  YW +ANSW+TDWG+NG F+ILRG
Sbjct: 282 ------IYVHTRGRQRGAHAVKVVGWGVENGTK--YWNVANSWSTDWGENGYFRILRG 331



 Score = 38.1 bits (87), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 17/33 (51%), Positives = 20/33 (60%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
           R CG GCNGG    AW Y  + G+V+GG Y  K
Sbjct: 159 RECGRGCNGGMDHKAWEYVKEFGVVTGGRYQEK 191


>gi|221484923|gb|EEE23213.1| cysteine proteinase, putative [Toxoplasma gondii GT1]
          Length = 569

 Score =  101 bits (251), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 67/211 (31%), Positives = 95/211 (45%), Gaps = 65/211 (30%)

Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDAS--KGHTPKCVRECQENYDV----PYKKDL 160
           G   +CW   PYE+  C HH     P CDA+     TPKC ++C+E        P+ +D 
Sbjct: 374 GKGTTCW---PYEVPFCAHHAKAPFPDCDATLVPRKTPKCRKDCEEQAYADNVHPFDQDT 430

Query: 161 NFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWT 220
           +    +YS+ S +  + +++  HGPV G                                
Sbjct: 431 HKATSAYSLRSRD-DVKRDMMTHGPVSG-------------------------------- 457

Query: 221 IRDNTSQLGAEGAFTVFDDLILYKSGK-------ALGGHAIRILGWGEDEKSKEKYWLIA 273
                       AF V++D + YKSG         +GGHAI+I+GWG +  + E+YW   
Sbjct: 458 ------------AFMVYEDFLSYKSGVYKHVSGLPVGGHAIKIIGWGTE--NGEEYWHAV 503

Query: 274 NSWNTDWGDNGLFKILRGKDECGIESSITAG 304
           NSWNT WGD G FKI  G  +CGI+  + AG
Sbjct: 504 NSWNTYWGDGGQFKIAMG--QCGIDGEMVAG 532



 Score = 45.4 bits (106), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 18/39 (46%), Positives = 25/39 (64%), Gaps = 1/39 (2%)

Query: 77  YSEVDEDLPANFDSRTKWPNCP-TIREIRDQGSCGSCWG 114
           +    E +PA+FD+RT +P C   +  +RDQG CGSCW 
Sbjct: 267 FENATEPVPAHFDARTAFPACKDVVGHVRDQGDCGSCWA 305



 Score = 44.7 bits (104), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 17/31 (54%), Positives = 23/31 (74%)

Query: 6   IRLCGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
           I    FGCNGG PGMAWR++ + G+V+GG +
Sbjct: 340 IHCASFGCNGGQPGMAWRWFERKGVVTGGDF 370


>gi|294914336|ref|XP_002778250.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239886453|gb|EER10045.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 388

 Score =  101 bits (251), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 71/209 (33%), Positives = 101/209 (48%), Gaps = 66/209 (31%)

Query: 114 GCRPYEIAPCEHHVN--GTRPSCDASKGHTPK--CVRECQENYDVP-YKKDLNFGA-KSY 167
           GC PY    C HHV+  G  P     KG++P   C   C+ ++  P ++ D +F   + Y
Sbjct: 221 GCWPYNFPECSHHVDTKGMEPC----KGNSPSPVCSTTCRNHHFKPSFESDRHFTEDEGY 276

Query: 168 SVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQ 227
           S+   ++ I +EI ++GPV  A                                      
Sbjct: 277 SLDEVDE-IKREIIDNGPVAAA-------------------------------------- 297

Query: 228 LGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDW 280
                 FTV++D   YKSG         LGGHA++I+GWG D+   E+YWL+ NSWN +W
Sbjct: 298 ------FTVYEDFPYYKSGVYKHVNGSELGGHAVKIIGWGIDQN--EQYWLVMNSWNVNW 349

Query: 281 GDNGLFKILRGKDECGIESSITAGVPKLD 309
           GD G+FKI  G  ECGI+S +TAG+PK +
Sbjct: 350 GDQGIFKIAIG--ECGIDSEVTAGIPKYE 376


>gi|403362666|gb|EJY81064.1| Cathepsin B [Oxytricha trifallax]
          Length = 309

 Score =  100 bits (250), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 77/312 (24%), Positives = 124/312 (39%), Gaps = 95/312 (30%)

Query: 40  QAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPT 99
           +  +N  +N   A +K  +G    ++       ++  +++++  +P +FDSRT+W  C  
Sbjct: 40  EVSENKFANYTEAQIKGLLGTVLSHS------SDIPAFTQINAAVPDSFDSRTQWQGC-- 91

Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
           +  IRDQ  CGSCW                         P ++  C+ +  G        
Sbjct: 92  VHPIRDQAQCGSCWAFAASESLSDRFCIASQGKVNVVLSPQDMVSCDTNNYGCDGGYLNL 151

Query: 130 ----------TRPSCD---ASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSI 176
                        SC+   ++ G  P C  +C     +   K      K  + ++  KS+
Sbjct: 152 AWQYLEKKGVASDSCEPYKSASGTAPSCPSKCANGQAIKKYKCQAGSTKQANGAAATKSL 211

Query: 177 MKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTV 236
              I + GPVE  FTV+ D   YKSG +                                
Sbjct: 212 ---IQQSGPVETGFTVYADFFNYKSGIYH------------------------------- 237

Query: 237 FDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECG 296
                 + SG A GGHA++ILGWG  ++  E YW++ANSW   WG+ G F I +G  + G
Sbjct: 238 ------HVSGGAEGGHAVKILGWG--KQGSENYWIVANSWGESWGEKGFFNIRQG--DSG 287

Query: 297 IESSITAGVPKL 308
           I+ +    +P L
Sbjct: 288 IDQATFGCIPDL 299


>gi|237836005|ref|XP_002367300.1| cysteine proteinase, putative [Toxoplasma gondii ME49]
 gi|211964964|gb|EEB00160.1| cysteine proteinase, putative [Toxoplasma gondii ME49]
 gi|221506020|gb|EEE31655.1| cysteine proteinase, putative [Toxoplasma gondii VEG]
          Length = 572

 Score =  100 bits (250), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 67/211 (31%), Positives = 95/211 (45%), Gaps = 65/211 (30%)

Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDAS--KGHTPKCVRECQENYDV----PYKKDL 160
           G   +CW   PYE+  C HH     P CDA+     TPKC ++C+E        P+ +D 
Sbjct: 377 GKGTTCW---PYEVPFCAHHAKAPFPDCDATLVPRKTPKCRKDCEEQAYADNVHPFDQDT 433

Query: 161 NFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWT 220
           +    +YS+ S +  + +++  HGPV G                                
Sbjct: 434 HKATSAYSLRSRD-DVKRDMMTHGPVSG-------------------------------- 460

Query: 221 IRDNTSQLGAEGAFTVFDDLILYKSGK-------ALGGHAIRILGWGEDEKSKEKYWLIA 273
                       AF V++D + YKSG         +GGHAI+I+GWG +  + E+YW   
Sbjct: 461 ------------AFMVYEDFLSYKSGVYKHVSGLPVGGHAIKIIGWGTE--NGEEYWHAV 506

Query: 274 NSWNTDWGDNGLFKILRGKDECGIESSITAG 304
           NSWNT WGD G FKI  G  +CGI+  + AG
Sbjct: 507 NSWNTYWGDGGQFKIAMG--QCGIDGEMVAG 535



 Score = 45.4 bits (106), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 18/39 (46%), Positives = 25/39 (64%), Gaps = 1/39 (2%)

Query: 77  YSEVDEDLPANFDSRTKWPNCP-TIREIRDQGSCGSCWG 114
           +    E +PA+FD+RT +P C   +  +RDQG CGSCW 
Sbjct: 270 FENATEPVPAHFDARTAFPACKDVVGHVRDQGDCGSCWA 308



 Score = 44.7 bits (104), Expect = 0.055,   Method: Compositional matrix adjust.
 Identities = 17/31 (54%), Positives = 23/31 (74%)

Query: 6   IRLCGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
           I    FGCNGG PGMAWR++ + G+V+GG +
Sbjct: 343 IHCASFGCNGGQPGMAWRWFERKGVVTGGDF 373


>gi|119638992|gb|ABL85238.1| cysteine proteinase 4 [Necator americanus]
          Length = 339

 Score =  100 bits (250), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 68/197 (34%), Positives = 92/197 (46%), Gaps = 57/197 (28%)

Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSV-SSNE 173
           C+PY   PC+ +     P        TPKC + CQ  Y VPY++D  FG  S+ +   NE
Sbjct: 187 CKPYPFYPCDGNYG---PCPKEGAFDTPKCRKICQFRYPVPYEEDKVFGKNSHILLQDNE 243

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
             I +EI+ +GPV                                          GA   
Sbjct: 244 ARIRQEIFINGPV------------------------------------------GAN-- 259

Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
           F VF+D I YK G       K +G HAI+++GWG +  +   YWL+ANS+N DWG+NG F
Sbjct: 260 FYVFEDFIHYKEGIYKQTYGKWIGVHAIKLIGWGTENGTD--YWLVANSYNYDWGENGTF 317

Query: 287 KILRGKDECGIESSITA 303
           +ILRG + C IES + A
Sbjct: 318 RILRGTNHCLIESQVIA 334


>gi|21700775|gb|AAL60053.1| cysteine proteinase [Toxoplasma gondii]
          Length = 569

 Score =  100 bits (250), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 67/211 (31%), Positives = 95/211 (45%), Gaps = 65/211 (30%)

Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDAS--KGHTPKCVRECQENYDV----PYKKDL 160
           G   +CW   PYE+  C HH     P CDA+     TPKC ++C+E        P+ +D 
Sbjct: 374 GKGTTCW---PYEVPFCAHHAKAPFPDCDATLVPRKTPKCRKDCEEQAYADNVHPFDQDT 430

Query: 161 NFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWT 220
           +    +YS+ S +  + +++  HGPV G                                
Sbjct: 431 HKATSAYSLRSRD-DVKRDMMTHGPVSG-------------------------------- 457

Query: 221 IRDNTSQLGAEGAFTVFDDLILYKSGK-------ALGGHAIRILGWGEDEKSKEKYWLIA 273
                       AF V++D + YKSG         +GGHAI+I+GWG +  + E+YW   
Sbjct: 458 ------------AFMVYEDFLSYKSGVYKHVSGLPVGGHAIKIIGWGTE--NGEEYWHAV 503

Query: 274 NSWNTDWGDNGLFKILRGKDECGIESSITAG 304
           NSWNT WGD G FKI  G  +CGI+  + AG
Sbjct: 504 NSWNTYWGDGGQFKIAMG--QCGIDGEMVAG 532



 Score = 45.4 bits (106), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 18/39 (46%), Positives = 25/39 (64%), Gaps = 1/39 (2%)

Query: 77  YSEVDEDLPANFDSRTKWPNCP-TIREIRDQGSCGSCWG 114
           +    E +PA+FD+RT +P C   +  +RDQG CGSCW 
Sbjct: 267 FENATEPVPAHFDARTAFPACKDVVGHVRDQGDCGSCWA 305



 Score = 44.7 bits (104), Expect = 0.057,   Method: Compositional matrix adjust.
 Identities = 17/31 (54%), Positives = 23/31 (74%)

Query: 6   IRLCGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
           I    FGCNGG PGMAWR++ + G+V+GG +
Sbjct: 340 IHCASFGCNGGQPGMAWRWFERKGVVTGGDF 370


>gi|403345965|gb|EJY72367.1| Cathepsin B [Oxytricha trifallax]
          Length = 309

 Score =  100 bits (250), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 77/312 (24%), Positives = 124/312 (39%), Gaps = 95/312 (30%)

Query: 40  QAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPT 99
           +  +N  +N   A +K  +G    ++       ++  +++++  +P +FDSRT+W  C  
Sbjct: 40  EVSENKFANYTEAQIKGLLGTVLSHS------SDIPAFTQINAAVPDSFDSRTQWQGC-- 91

Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
           +  IRDQ  CGSCW                         P ++  C+ +  G        
Sbjct: 92  VHPIRDQAQCGSCWAFAASESLSDRFCIASQGKVNVVLSPQDMVSCDTNNYGCDGGYLNL 151

Query: 130 ----------TRPSCD---ASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSI 176
                        SC+   ++ G  P C  +C     +   K      K  + ++  KS+
Sbjct: 152 AWQYLEKKGVASDSCEPYKSASGTAPSCPSKCSNGQAIKKYKCKAGSTKQANGAAATKSL 211

Query: 177 MKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTV 236
              I + GPVE  FTV+ D   YKSG +                                
Sbjct: 212 ---IQQSGPVETGFTVYADFFNYKSGIYH------------------------------- 237

Query: 237 FDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECG 296
                 + SG A GGHA++ILGWG  ++  E YW++ANSW   WG+ G F I +G  + G
Sbjct: 238 ------HVSGGAEGGHAVKILGWG--KQGSENYWIVANSWGESWGEKGFFNIRQG--DSG 287

Query: 297 IESSITAGVPKL 308
           I+ +    +P L
Sbjct: 288 IDQATFGCIPDL 299


>gi|91089437|ref|XP_966750.1| PREDICTED: similar to putative cathepsin B-like proteinase
           [Tribolium castaneum]
 gi|270012705|gb|EFA09153.1| cathepsin B precursor [Tribolium castaneum]
          Length = 324

 Score =  100 bits (250), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 93/331 (28%), Positives = 130/331 (39%), Gaps = 110/331 (33%)

Query: 41  AEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSE------VDEDLPANFDSRTKW 94
           A +N   + P  HLK   G        A   P+L+G ++      + E +P  FD RT W
Sbjct: 41  AGRNFPEDTPIEHLKRLNG--------ALITPDLVGKNQTHVINVIPEAIPETFDGRTHW 92

Query: 95  PNCPT---IREIRDQGS----------------------------------CGSCW-GC- 115
             CP+   IR   + GS                                  C +C  GC 
Sbjct: 93  SQCPSLKNIRNQGNCGSCWAFGSVEVMTDRLCIASKGKTKFEFSADDLLACCTACGKGCD 152

Query: 116 -----RPYEIAPCEHHVNG----TRPSCDASKGH------TPKCVREC-QENYDVPYKKD 159
                R +E    +  V+G    +   C   +G       TPKC  +C    Y  PY KD
Sbjct: 153 GGAPYRAFEYWVAKGIVSGGDYNSNEGCQPYEGSAFLNSVTPKCSTKCLNSKYTTPYAKD 212

Query: 160 LNFGAK-SYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
            ++G    Y  S N   I  EI  +GPV     V++D   YKSG +              
Sbjct: 213 KHYGTDFIYMTSKNVAEIQTEIMNNGPVVTHMDVYEDFYSYKSGVY-------------- 258

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
                                   + SG ++GGHA++I+GWG ++     YWLIANSW  
Sbjct: 259 -----------------------QHVSGNSMGGHAVKIIGWGTEKGV--PYWLIANSWGA 293

Query: 279 DWGD-NGLFKILRGKDECGIESSITAGVPKL 308
            W D +G +KILRGK+ C IE+ I  G P++
Sbjct: 294 KWADLDGFYKILRGKNHCKIETYIYGGTPQV 324



 Score = 42.4 bits (98), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 19/32 (59%), Positives = 22/32 (68%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CG GC+GG P  A+ YWV  GIVSGG Y S +
Sbjct: 147 CGKGCDGGAPYRAFEYWVAKGIVSGGDYNSNE 178


>gi|403377404|gb|EJY88697.1| hypothetical protein OXYTRI_00086 [Oxytricha trifallax]
          Length = 351

 Score =  100 bits (249), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 71/270 (26%), Positives = 111/270 (41%), Gaps = 84/270 (31%)

Query: 80  VDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG----------------------CRP 117
           + + +P  FD RTKWP C  +R+IRDQ +CG+CW                         P
Sbjct: 116 LKDSIPLEFDFRTKWPQC--LRKIRDQANCGACWAFTGSGMLADRICILTNGTINEELSP 173

Query: 118 YEIAPCEHHVNG------------------TRPSCDASKGHTPKCVRECQENYDVPYKKD 159
            ++  C H   G                  T+ SC   K  T KC   CQ   +  +K  
Sbjct: 174 QDMVDCSHDNFGCEGGYLMNALDYLMNEGVTKESCTPYKDKTNKCQYTCQNKTEEFHKHY 233

Query: 160 LNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKW 219
              G  +  V +NE+ I +++ ++GP+    TV++D I Y +G +               
Sbjct: 234 CKPG--TLRVLTNEEQIKRDLMQNGPLMVGLTVYEDFINYATGDY--------------- 276

Query: 220 TIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
                                  + +G+ +GGHA++++GW   +K +   WLI N WN D
Sbjct: 277 ----------------------KFVAGEIVGGHAVKLMGWRTTQKGQTS-WLIQNQWNDD 313

Query: 280 WGDNGLFKILRGKDECGIESSITAGVPKLD 309
           WG+ G   IL  ++E GI+S      P +D
Sbjct: 314 WGEQGFGYIL--ENEVGIDSIGVGCTPDID 341


>gi|328869211|gb|EGG17589.1| hypothetical protein DFA_08585 [Dictyostelium fasciculatum]
          Length = 323

 Score =  100 bits (249), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 74/274 (27%), Positives = 105/274 (38%), Gaps = 86/274 (31%)

Query: 80  VDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG----------------------CRP 117
           VD  +P+ FD+R +WP C  +  + +Q  CGSCW                         P
Sbjct: 91  VDASIPSTFDAREQWPGC--VHAVLNQEQCGSCWAFSSSEALSDRLCIASKGQVNVTLSP 148

Query: 118 YEIAPCE----HHVNGTRPSC------------------DASKGHTPKCVRECQENYDVP 155
             +  C+       NG  P                     A  G    C R+C +   + 
Sbjct: 149 QALVACDDIGNQGCNGGVPQLAWEYMEWKGLPTFECYPYTAGNGTDGTCQRQCADGSAMT 208

Query: 156 YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMS 215
           Y +   F   S +  ++   I  EI  +GPV G   V+ D + Y SG +           
Sbjct: 209 YYRAKPF---SMTTCNSVACIQNEIITYGPVVGTMMVYQDFMSYSSGVY----------- 254

Query: 216 LIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANS 275
                + D T++L                    LGGHAI I+GWG D  SK  YW++ NS
Sbjct: 255 -----VYDGTAEL--------------------LGGHAIEIVGWGTDATSKLDYWIVKNS 289

Query: 276 WNTDWGD-NGLFKILRGKDECGIESSITAGVPKL 308
           W+  WG  +G F I RG + CGI+   +A   KL
Sbjct: 290 WSAAWGGLDGYFWIQRGTNMCGIDHDASASQAKL 323


>gi|56758644|gb|AAW27462.1| unknown [Schistosoma japonicum]
          Length = 294

 Score =  100 bits (249), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 44/91 (48%), Positives = 62/91 (68%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY    CEHH  G  P+C      TP+C ++CQ+ Y  PY++D ++G +SY+V SNE
Sbjct: 187 GCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQKCQKGYKTPYEQDKHYGEESYNVISNE 246

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRF 204
           K+I KEI  +GPVE AF V++D + YKSG +
Sbjct: 247 KAIQKEIMMNGPVEAAFDVYEDFLNYKSGIY 277



 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 22/52 (42%), Positives = 34/52 (65%), Gaps = 1/52 (1%)

Query: 63  DYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           D  +   R P  + + +++ ++P+ FDSR KWP+C +I +IRDQ  CGSCW 
Sbjct: 70  DAEMKRKRRP-TVDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWA 120



 Score = 50.8 bits (120), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 20/27 (74%), Positives = 23/27 (85%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           CG GC GGFPG+AW YWVK GIV+GG+
Sbjct: 155 CGDGCQGGFPGVAWDYWVKRGIVTGGS 181


>gi|294894290|ref|XP_002774786.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239880403|gb|EER06602.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 830

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 87/313 (27%), Positives = 125/313 (39%), Gaps = 102/313 (32%)

Query: 12  GCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRL 71
           GCNGGFP  AW +    GI +GG Y +K          P             Y+ P    
Sbjct: 604 GCNGGFPNSAWSWVHDKGIATGGDYVAKDDMTKDDGCWP-------------YDFPP--- 647

Query: 72  PELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTR 131
                         A+  + TK+P CP +          SC G  P   A     +    
Sbjct: 648 -------------CAHHINDTKYPECPKV----------SCSGESPPATAETATVI---- 680

Query: 132 PSCDASKGHTPKCVRECQE-NYDVPYKKDLNFGAKS----YSVSSNEKSIMKE-----IY 181
                +   TP C  +C    Y    + D +F  +S    YSV+  + +I  +     IY
Sbjct: 681 --AYQNSYETPNCAEQCHNPKYTTTLRDDRHFMLESSPYQYSVNDAKNAIRTDGPVGPIY 738

Query: 182 EHGP------VEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFT 235
              P      V  +F+V++D + YKSG +                               
Sbjct: 739 FCDPNVNFDQVSASFSVYEDFLAYKSGVY------------------------------- 767

Query: 236 VFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDEC 295
                  + SG+ LGGHA++I+GWGE+  S + YW++ NSWN DWGD+GLFKI  G   C
Sbjct: 768 ------KHTSGEYLGGHAVKIIGWGEE--SGQAYWIVVNSWNEDWGDHGLFKIALGN--C 817

Query: 296 GIESSITAGVPKL 308
           GI+ ++  G PK+
Sbjct: 818 GIDDNLLGGTPKV 830



 Score = 48.5 bits (114), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 23/41 (56%), Positives = 29/41 (70%), Gaps = 2/41 (4%)

Query: 76  GYS-EVDEDLPANFDSRTKWPNCP-TIREIRDQGSCGSCWG 114
           GY+ E  +DLP +FD+RT +PNC   I  IRDQ +CGSCW 
Sbjct: 528 GYAIEELQDLPTDFDARTAFPNCSKVIGHIRDQSACGSCWA 568


>gi|161343879|tpg|DAA06120.1| TPA_inf: cathepsin B [Toxoptera citricida]
          Length = 340

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 64/201 (31%), Positives = 91/201 (45%), Gaps = 56/201 (27%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY + PC +  +G             +C R C  + D+ +  D      SY ++   
Sbjct: 185 GCEPYRVPPCPYDESGNNTCSGKPMEQNHRCTRMCYGDQDLDFDDDHRHTRDSYYLTIG- 243

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            SI K++  +GP+E +                                            
Sbjct: 244 -SIQKDVMTYGPIEAS-------------------------------------------- 258

Query: 234 FTVFDDLILYKSG--------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
           F V+DD + YKSG          LGGHA++++GWGE+  +   YWL+ NSWN DWGD GL
Sbjct: 259 FDVYDDFLSYKSGVYVRSENASYLGGHAVKLIGWGEEYGTP--YWLMMNSWNADWGDEGL 316

Query: 286 FKILRGKDECGIESSITAGVP 306
           FKI RG +ECG+++S TAGVP
Sbjct: 317 FKIRRGTNECGVDNSTTAGVP 337



 Score = 48.5 bits (114), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 19/38 (50%), Positives = 25/38 (65%)

Query: 77  YSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           Y  +   +P  FD+R KW +C TI  +RDQG+CGSCW 
Sbjct: 81  YDNLFGRIPKKFDARKKWRHCTTIGAVRDQGNCGSCWA 118



 Score = 42.0 bits (97), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 17/32 (53%), Positives = 23/32 (71%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CG+GCNGG+P  AW  + K G+V+GG Y S +
Sbjct: 153 CGYGCNGGYPIKAWERFKKHGLVTGGEYKSGE 184


>gi|335347289|gb|AEH42092.1| cysteine proteinase 1 [Haemonchus contortus]
          Length = 332

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 55/178 (30%), Positives = 87/178 (48%), Gaps = 41/178 (23%)

Query: 115 CRPYEIAPCEHHVNGTRPSCDASKG-HTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           C+PY + PC +H  G   SC       TP C + CQ  Y   Y+KD ++    Y +  +E
Sbjct: 194 CKPYPLHPCGNH-GGKFWSCPRDHSFRTPACKKYCQYGYGKRYEKDKSYVKSVYILDEDE 252

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K+I +E+ ++GPV+ AF  ++D   Y  G                               
Sbjct: 253 KAIQREMMKNGPVQAAFITYEDFSFYTKG------------------------------- 281

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
                 + ++  G+  G HA++++GWG +  +  KYW +ANSW+TDWG++G F+ILRG
Sbjct: 282 ------IYVHTRGRQRGAHAVKVVGWGVENGT--KYWNVANSWSTDWGEDGYFRILRG 331



 Score = 38.1 bits (87), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 17/33 (51%), Positives = 20/33 (60%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
           R CG GCNGG    AW Y  + G+V+GG Y  K
Sbjct: 159 RECGRGCNGGMDHKAWEYVKEFGVVTGGRYQEK 191


>gi|156708118|gb|ABU93317.1| cathepsin B8 cysteine protease, partial [Monocercomonoides sp. PA]
          Length = 275

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 72/268 (26%), Positives = 111/268 (41%), Gaps = 88/268 (32%)

Query: 81  DEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW---------------GC-----RPYEI 120
           +E+ PA+FD R KWP       +R+Q SCGSCW               GC      P ++
Sbjct: 54  NENAPASFDCRQKWPG--KAEPVRNQASCGSCWAHAASETMGFRMGIRGCYKGVMSPQDL 111

Query: 121 APCEHHVNG------------------TRPSC---DASKGHTPKCVRECQENYDVPYKKD 159
             CE +  G                  T   C    +  G  P C  +C+   ++     
Sbjct: 112 VSCESNNMGCEGGYADRVWNWIQKKGITTEQCLPYVSGSGRVPTCPSKCKNGSNIVRSFV 171

Query: 160 LNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKW 219
            ++G      S N K++M E+  +GPV   F VF+D + YKSG                 
Sbjct: 172 SSWG------SFNSKTVMDEVANNGPVYACFEVFEDFLNYKSG----------------- 208

Query: 220 TIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
                               +  +K+GK+ G H + ++GWG +  +   YWL+ NSW + 
Sbjct: 209 --------------------IYQHKTGKSKGWHHVMLMGWGTE--NGVPYWLLQNSWGSG 246

Query: 280 WGDNGLFKILRGKDECGIESSITAGVPK 307
           WG+ G F+I RG ++C I+    +G+PK
Sbjct: 247 WGEKGFFRIRRGTNDCHIDEIFYSGLPK 274


>gi|156708110|gb|ABU93313.1| cathepsin B4 cysteine protease [Monocercomonoides sp. PA]
          Length = 281

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 83/298 (27%), Positives = 114/298 (38%), Gaps = 100/298 (33%)

Query: 51  RAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCG 110
           RA L + +G H  Y  P       +  SE     P  FD+R +WP    I  +RDQ SCG
Sbjct: 42  RAMLGAELGPHMPYVQP-------LSLSE-----PTEFDAREQWPG--KILPVRDQASCG 87

Query: 111 SCWGCRPYE-------IAPCEHHVNGTRP--SCDAS------------------------ 137
           SCW     E       IA C       +   SCD +                        
Sbjct: 88  SCWAHSVAEAMGDAQNIAGCPRGAMSVQDLVSCDKTDSACNGGDMKKAQEYLVKTGITTE 147

Query: 138 --------KGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGA 189
                    G  P C  +C     +     + +  +S+  S     IM+ + E+GP+   
Sbjct: 148 ACVKYVSGSGRVPACPSKCDNGSQI-----IRYKLQSWK-SVEPSEIMQALMEYGPLSCG 201

Query: 190 FTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKAL 249
           F V+ D + Y+SG +                                      +KSG   
Sbjct: 202 FMVYSDFMNYRSGVY-------------------------------------QHKSGYFE 224

Query: 250 GGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPK 307
           GGHA+ + GWG +  +   YWL+ NSW   WG+ G FKILRG + C IES +T GVPK
Sbjct: 225 GGHAVLLCGWGVE--NGLPYWLVQNSWGPAWGEKGFFKILRGSNHCEIESYVTLGVPK 280


>gi|270012757|gb|EFA09205.1| cathepsin B precursor [Tribolium castaneum]
          Length = 348

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 85/273 (31%), Positives = 120/273 (43%), Gaps = 46/273 (16%)

Query: 57  WMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCP-TIREIRDQGSCGSCWGC 115
           ++G+HPD   P  ++     + ++ + +P +FD+R KWP C   I +IRDQG+CGSCW  
Sbjct: 54  FLGLHPD---PDYKIQT--KHHKIAKSIPESFDAREKWPECKDVIGKIRDQGTCGSCWAF 108

Query: 116 RPYEIAPCEHHVNGTRPSCDASKGHT-----PKCVRECQENYDVPYKKDLNFGAKSYSVS 170
              E+         T   C  +KG T     P+ +  C E  D   +    + AK++   
Sbjct: 109 ASTEVM--------TDRLCIGTKGETKFVFSPENLLTCCE--DCRLECVGGYTAKAWDYY 158

Query: 171 SNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK----------WT 220
            NE  +    Y     EG          Y      V   +     +            +T
Sbjct: 159 INEGIVSGGDYNSS--EGCQPYSKASFQYAVASKCVKACQNDKYDVKYDDDKHYGDSFYT 216

Query: 221 IRDNTSQLGAE--------GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLI 272
           +  N +Q+  E          F VF+D+I YKSG  L    + IL WG +E     YWLI
Sbjct: 217 LETNVTQIQTEILTNGPVMATFNVFEDIIYYKSGIQLSN--VSILRWGTEEGVP--YWLI 272

Query: 273 ANSWNTDWGD-NGLFKILRGKDECGIESSITAG 304
           ANSW T WGD  G  KI RG +EC IE  + AG
Sbjct: 273 ANSWGTWWGDLGGFIKIKRGTNECAIEQEMAAG 305


>gi|343475054|emb|CCD13447.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 87/302 (28%), Positives = 118/302 (39%), Gaps = 107/302 (35%)

Query: 65  NLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRP------- 117
            LP  R  E     ++  +LP +FD+  KWP+CPTIREI DQ +C + W           
Sbjct: 76  TLPPVRFTE----EQLRTELPESFDAAEKWPHCPTIREIPDQSACRASWAVATASAISDR 131

Query: 118 -------------------------------YEIAPCEHHV-NGTR---------PSCD- 135
                                          Y  A  E++V NG           P C+ 
Sbjct: 132 YCTVGNGKQLRISAADLMACCTGCGGGCEGGYPDAAWEYYVSNGITSSQCQPYPFPRCEH 191

Query: 136 -ASKGHTPKCVRECQENYDVP------YKKDLNF----GAKSYSVSSNEKSIMKEIYEHG 184
             ++G  P C +    N+D P        K +      G  SY V   E+   +E+Y +G
Sbjct: 192 RGAQGKKPPCSK---YNFDTPTCNATCTDKSVPLIKYRGNHSYEVRG-EEDYKRELYFNG 247

Query: 185 PVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYK 244
           P    F V  D + YKSG                                     +  + 
Sbjct: 248 PFVVRFQVHSDFLAYKSG-------------------------------------VYQHV 270

Query: 245 SGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAG 304
           +G  LGG A+RI+GWG  + +   YW +ANSW+TDWG NG F ILRG +EC IE    AG
Sbjct: 271 AGNFLGGKAVRIVGWG--KMNGTPYWKVANSWDTDWGMNGYFLILRGNNECNIEHLGFAG 328

Query: 305 VP 306
            P
Sbjct: 329 TP 330


>gi|166030328|gb|ABY78831.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 86/297 (28%), Positives = 116/297 (39%), Gaps = 42/297 (14%)

Query: 39  KQAEKNSLSNIPRAHLKSWMG--VHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
           K      + NI  A  K   G  +     LP  R  E     ++   LP  FD+   WP+
Sbjct: 47  KAVYNGKMQNITFAEAKRLTGAWIQKSSTLPPARFTE----EQLRTKLPETFDAAEHWPH 102

Query: 97  CPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPY 156
           CPTIREI DQ +C + W           +   G       S      C ++C +     +
Sbjct: 103 CPTIREIADQSACRASWAVSTASAISDRYCTVGGGKQLRISAADLLSCCKQCGDGCKGGF 162

Query: 157 KKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETT---- 212
                     Y ++S+        + H    GA         YK   F  P    T    
Sbjct: 163 PGFAWLYYVEYGIASS--GCQPYPFPHCEHRGAQGNKTPCSKYK---FDTPKCNATCTDK 217

Query: 213 AMSLIKWTIRDNTSQLGAEG----------------AFTVFDDLILYKS-------GKAL 249
           ++ L+K+  R N + L   G                 F V+ DL  YKS       G  L
Sbjct: 218 SIPLVKY--RGNATYLLLHGEEDYKRELYFNGPFVAVFFVYTDLFAYKSGVYRNVDGDFL 275

Query: 250 GGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           GG A+RI+GWG+   +   YW +ANSW+TDWG NG   ILRG +EC IE     G P
Sbjct: 276 GGQAVRIVGWGKLNGT--PYWKVANSWDTDWGMNGYMLILRGNNECNIEHLGFTGFP 330



 Score = 41.2 bits (95), Expect = 0.68,   Method: Compositional matrix adjust.
 Identities = 17/28 (60%), Positives = 20/28 (71%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGG 34
           + CG GC GGFPG AW Y+V+ GI S G
Sbjct: 152 KQCGDGCKGGFPGFAWLYYVEYGIASSG 179


>gi|403371460|gb|EJY85611.1| Cathepsin B [Oxytricha trifallax]
          Length = 309

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 78/312 (25%), Positives = 120/312 (38%), Gaps = 95/312 (30%)

Query: 40  QAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPT 99
           +  +N  +N   A LK  +G    +         +  +++++  LP +FDSRT+W +C  
Sbjct: 40  EVSQNKFANYTEAQLKGLLGTVLSHQ------SGISAFTQINAALPDSFDSRTQWKDC-- 91

Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
           +  IRDQ  CGSCW                         P ++  C+    G        
Sbjct: 92  VHPIRDQAQCGSCWAFAAAESLSDRFCIASQGKVNLVLSPQDMVSCDTSNFGCFGGYLDQ 151

Query: 130 ----------TRPSCDASK---GHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSI 176
                     +  SC+  K   G  P C  +C     +   K     A S   +   ++ 
Sbjct: 152 AWQYLEQQGVSSDSCEPYKSGNGDQPSCPTKCSNGQAI---KKYKCKAGSTKQAKGAEAT 208

Query: 177 MKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTV 236
              I E GPVE  FTV+ D   Y SG +                                
Sbjct: 209 KSLIQESGPVETGFTVYQDFYNYNSGVYH------------------------------- 237

Query: 237 FDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECG 296
                 + +G A GGHA++ILGWG+  +  E YW++ANSW  DWG+ G F I +G  + G
Sbjct: 238 ------HVTGDAEGGHAVKILGWGK--QGLENYWIVANSWGEDWGEKGYFNIRQG--DSG 287

Query: 297 IESSITAGVPKL 308
           I+ +    +P +
Sbjct: 288 IDEATFGCIPDV 299


>gi|166030332|gb|ABY78833.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 83/271 (30%), Positives = 111/271 (40%), Gaps = 45/271 (16%)

Query: 65  NLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCE 124
           +LP  R  E     ++  +LP +FD+   WP+CPTIREI DQ +C + W           
Sbjct: 76  SLPPVRFTE----EQLRTELPESFDAAEHWPHCPTIREIADQSACRASWAVATASAISDR 131

Query: 125 HHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIY--- 181
           +   G       S      C ++C    +  Y       A  Y VS    S   + Y   
Sbjct: 132 YCTVGKGKQLRISAADLMACCKDCGGGCEGGYPD----AAWEYYVSHGITSSQCQPYPFP 187

Query: 182 --EHGPVEGAFTVFDDLILYKSGRFFVPGNETT----AMSLIKWTIRDNTSQLGAEG--- 232
             EH   +G              +F  P    T    ++ LIK+    +    G E    
Sbjct: 188 RCEHRGAQGKKPPCSKY------KFVTPQCNATCTDKSVPLIKYRGNHSYEVRGEEDYKR 241

Query: 233 ----------AFTVFDDLILYKS-------GKALGGHAIRILGWGEDEKSKEKYWLIANS 275
                      F V  D + YKS       G  LGG A+RI+GWG+   +   YW +ANS
Sbjct: 242 ELYFNGPFVVRFQVHSDFLAYKSGVYQHVAGNFLGGKAVRIVGWGKLNGT--PYWKVANS 299

Query: 276 WNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           W+TDWG NG F ILRG +EC IE    AG P
Sbjct: 300 WDTDWGMNGYFLILRGDNECNIEHLGFAGTP 330


>gi|2944340|gb|AAC05262.1| cathepsin B-like cysteine protease GCP7 [Haemonchus contortus]
          Length = 348

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 59/192 (30%), Positives = 87/192 (45%), Gaps = 40/192 (20%)

Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEK 174
           C+PY    C  H      +C +    TP C   CQ  Y   Y+ D       Y + ++E+
Sbjct: 195 CKPYVFPQCGAHKGKAFNNCPSHPYATPACKPYCQYGYGKRYENDKIKARTWYWLPNDER 254

Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
           +I  EI + GPV   F +++D   Y+ G +                              
Sbjct: 255 TIQLEIMQKGPVHATFNIYEDFEHYEGGVY------------------------------ 284

Query: 235 TVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWG-DNGLFKILRGKD 293
                  ++ +G   GGH+I+I+GWG D+  K  YWLIANSW+TDWG D G F+++RG +
Sbjct: 285 -------IHTAGAMEGGHSIKIIGWGVDKGVK--YWLIANSWSTDWGEDGGYFRVVRGIN 335

Query: 294 ECGIESSITAGV 305
            C IE  + AG 
Sbjct: 336 NCDIEGGVLAGT 347



 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 23/46 (50%), Positives = 31/46 (67%), Gaps = 2/46 (4%)

Query: 69  NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           N LP  I     ++D+P +FDSR KW +CP++R I DQ +CGSCW 
Sbjct: 83  NVLP--IANITSNDDIPESFDSREKWKDCPSLRVIPDQSNCGSCWA 126



 Score = 41.2 bits (95), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 15/33 (45%), Positives = 24/33 (72%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
           + CG+GC+GG+   AW++   +G+V+GGAY  K
Sbjct: 160 KFCGYGCDGGYNARAWKWATIAGVVTGGAYKEK 192


>gi|343472937|emb|CCD15042.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 85/297 (28%), Positives = 116/297 (39%), Gaps = 42/297 (14%)

Query: 39  KQAEKNSLSNIPRAHLKSWMG--VHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
           K      + NI  +  K   G  +     LP  R  E     ++   LP  FD+   WP+
Sbjct: 47  KAVYNGKMQNITFSEAKRLTGARIQKSRTLPPARFTE----EQLRTKLPETFDAAEHWPH 102

Query: 97  CPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPY 156
           CPTIREI DQ  C + W           +   G       S      C ++C +     +
Sbjct: 103 CPTIREIADQSECRASWAVSTASAISDRYCTVGGGKQLRISAADLMACCKQCGDGCKGGF 162

Query: 157 KKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETT---- 212
                     Y ++S++       + H    GA         YK   F  P    T    
Sbjct: 163 PGFAWLYYVEYGITSSQ--CQPYPFPHCEHRGAQGNKTPCSKYK---FDTPKCNATCTDK 217

Query: 213 AMSLIKWTIRDNTSQLGAEG----------------AFTVFDDLILYKS-------GKAL 249
           ++ L+K+  R N + L   G                 F V+ DL  YKS       G  L
Sbjct: 218 SIPLVKY--RGNATYLLLHGEEDYKRELYFNGPFVAVFFVYTDLFAYKSGVYRNVDGDFL 275

Query: 250 GGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           GG A+RI+GWG+   +   YW +ANSW+TDWG NG   ILRG +EC IE     G P
Sbjct: 276 GGQAVRIVGWGKLNGT--PYWKVANSWDTDWGMNGYMLILRGNNECNIEHLGFTGFP 330



 Score = 39.7 bits (91), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 16/26 (61%), Positives = 19/26 (73%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVS 32
           + CG GC GGFPG AW Y+V+ GI S
Sbjct: 152 KQCGDGCKGGFPGFAWLYYVEYGITS 177


>gi|52546914|gb|AAU81590.1| cysteine proteinase, partial [Petunia x hybrida]
          Length = 122

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 54/137 (39%), Positives = 72/137 (52%), Gaps = 38/137 (27%)

Query: 170 SSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLG 229
           SS+  SIM E+Y++GPVE AFTV++D   YKSG +                         
Sbjct: 4   SSDPYSIMTEVYKNGPVEVAFTVYEDFAHYKSGVY------------------------- 38

Query: 230 AEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKIL 289
                        + +G  LGGHA++++GWG  E   E YWL+AN WN  WGD+G FKI 
Sbjct: 39  ------------KHVTGDELGGHAVKLIGWGTSEDG-EDYWLLANQWNRGWGDDGYFKIR 85

Query: 290 RGKDECGIESSITAGVP 306
           RG +EC IE  + AG+P
Sbjct: 86  RGTNECDIEDEVVAGMP 102


>gi|166030324|gb|ABY78829.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 83/329 (25%), Positives = 116/329 (35%), Gaps = 104/329 (31%)

Query: 39  KQAEKNSLSNIPRAHLKSWMG--VHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
           K      + NI  A  K   G  +    +LP  R  E     ++  +LP +FDS  KWPN
Sbjct: 47  KAVYNGKMQNITFAEAKRLTGAWIQKTSSLPPVRFTE----EQLRTELPESFDSAEKWPN 102

Query: 97  CPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHT--------------- 141
           CPTIREI DQ +C + W      +    +   G       S  H                
Sbjct: 103 CPTIREIADQSACRASWAVSTASVISDRYCTVGGVQQLRISAAHLLSCCKQCGGGCKGGF 162

Query: 142 -----------------------PKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSI-- 176
                                  P C     +    P  K  NF     + +  +KSI  
Sbjct: 163 PGFAWRYYVEYGIASSYCQPYPFPHCEHRGAQGNKTPCSK-YNFDTPKCNATCTDKSIPL 221

Query: 177 ------------------MKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
                              +E+Y +GP    F V+ DL  YKSG +              
Sbjct: 222 VKYRGNATYLLLHGEEDYKRELYFNGPFVAVFYVYTDLFAYKSGVY-------------- 267

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
                                   +  G  LGG A++++GWG+   +   YW +AN+W+T
Sbjct: 268 -----------------------RHVDGDFLGGTAVKVVGWGKLNGT--PYWKVANTWDT 302

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVPK 307
           DWG +G   ILRG +EC IE    AG P+
Sbjct: 303 DWGMDGYLLILRGNNECNIEHLGFAGTPE 331


>gi|161343875|tpg|DAA06118.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 210

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 66/188 (35%), Positives = 88/188 (46%), Gaps = 59/188 (31%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKC-VREC-QENYDVPYKKDLNFGAKSYSVSS 171
           GC+PY I P        R +C      TP C +R C   NY   Y+ DL++    YS+S 
Sbjct: 73  GCQPYSIYP----RGKGRNTCIDDDIDTPDCSIRTCTNSNYTKGYRADLHYVDTVYSLSR 128

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           +E+ IM +IY++GPV+ A                                          
Sbjct: 129 SEEDIMTDIYKNGPVQAA------------------------------------------ 146

Query: 232 GAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNG 284
             F V+ D + YKSG       +  GGHAI+ILGWG D+ +K  YWL ANSW+  WG+NG
Sbjct: 147 --FYVYTDFMYYKSGVYSYTRGQIEGGHAIKILGWGVDDNTK--YWLCANSWSRSWGENG 202

Query: 285 LFKILRGK 292
           LF+ILRG 
Sbjct: 203 LFRILRGN 210


>gi|56758130|gb|AAW27205.1| unknown [Schistosoma japonicum]
          Length = 279

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 43/89 (48%), Positives = 60/89 (67%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY    CEHH  G  P+C      TP+C + CQ+ Y  PY++D ++G +SY+V SNE
Sbjct: 187 GCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDESYNVISNE 246

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSG 202
           K+I +EI  +GPVE AF V++D + YKSG
Sbjct: 247 KAIQREIMMYGPVEAAFDVYEDFLNYKSG 275



 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 22/52 (42%), Positives = 34/52 (65%), Gaps = 1/52 (1%)

Query: 63  DYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           D  +   R P  + + +++ ++P+ FDSR KWP+C +I +IRDQ  CGSCW 
Sbjct: 70  DAEMKRKRRP-TVDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWA 120



 Score = 50.8 bits (120), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 20/27 (74%), Positives = 23/27 (85%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           CG GC GGFPG+AW YWVK GIV+GG+
Sbjct: 155 CGDGCQGGFPGVAWDYWVKRGIVTGGS 181


>gi|204022075|dbj|BAG71135.1| cathepsin B-S2 [Tuberaphis taiwana]
          Length = 334

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 63/202 (31%), Positives = 98/202 (48%), Gaps = 47/202 (23%)

Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
           G  G+  GC PY++ PC ++  G             +C + C     V  +    +  KS
Sbjct: 175 GDYGTKEGCMPYKVPPC-YNKQGKNTCGGQPMERNHQCPKTCYGKTTVQNR----YKTKS 229

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
             V ++ K+I +++  +GPVE +F V+DD  +YKSG                        
Sbjct: 230 EYVMNSIKTIEQDLKTYGPVEASFDVYDDFSVYKSG------------------------ 265

Query: 227 QLGAEGAFTVFDDLILYKSGKA--LGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNG 284
                         I  K+ KA   GGH+I+I+GWG+   +   YWL  NSW+  WG++G
Sbjct: 266 --------------IYRKTPKAKYQGGHSIKIIGWGQQNGT--PYWLAVNSWSKFWGEHG 309

Query: 285 LFKILRGKDECGIESSITAGVP 306
            FKI++G++ECGIE ++TAG+P
Sbjct: 310 TFKIIKGRNECGIERAVTAGIP 331



 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 21/34 (61%), Positives = 24/34 (70%)

Query: 80  VDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
           V+ D P  FDSRT W +C  I  IRDQG+CGSCW
Sbjct: 81  VENDSPQQFDSRTNWKSCKQIGHIRDQGNCGSCW 114


>gi|17560488|ref|NP_506310.1| Protein F32H5.1 [Caenorhabditis elegans]
 gi|3876629|emb|CAB04249.1| Protein F32H5.1 [Caenorhabditis elegans]
          Length = 356

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 58/196 (29%), Positives = 85/196 (43%), Gaps = 41/196 (20%)

Query: 113 WGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQEN--YDVPYKKDLNFGAKSYSVS 170
           +GC+PY I PC+        S      HTP C   C  N  + + YK+D +FG   Y+V 
Sbjct: 194 FGCKPYSIYPCDKKYANGTTSVPCPGYHTPTCEEHCTSNITWPIAYKQDKHFGKAHYNVG 253

Query: 171 SNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA 230
                I  EI  +GPV  +F ++DD   YK+G +                          
Sbjct: 254 KKMTDIQIEIMTNGPVIASFIIYDDFWDYKTGIY-------------------------- 287

Query: 231 EGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILR 290
                      ++ +G   GG   +I+GWG D  +   YWL  + W TD+G+NG  + LR
Sbjct: 288 -----------VHTAGDQEGGMDTKIIGWGVD--NGVPYWLCVHQWGTDFGENGFVRFLR 334

Query: 291 GKDECGIESSITAGVP 306
           G +E  IE  + A +P
Sbjct: 335 GVNEVNIEHQVLAALP 350


>gi|66506619|ref|XP_393283.2| PREDICTED: uncharacterized peptidase C1-like protein F26E4.3-like
           [Apis mellifera]
          Length = 439

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 77/248 (31%), Positives = 110/248 (44%), Gaps = 36/248 (14%)

Query: 82  EDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHV--NGTRPSCDASKG 139
           E LP  FD+RT+W     I  + DQG CG+ W     ++A     V   GT  S   S  
Sbjct: 195 ESLPREFDARTRWRR--QISGVDDQGWCGASWAISTAQVASDRFAVMSKGT-DSVLLSAQ 251

Query: 140 HTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVE--------GAFT 191
           H   C ++ Q   D  Y        + + +   +    K +YE   ++        G   
Sbjct: 252 HLLSCNKKGQRGCDGGYLDRAWLFMRKFGLVDEQCYPWKGVYEQCKLQKRTNLEAAGCRA 311

Query: 192 VFDDLI--LYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKAL 249
             + L   LYK G  +  GNET  M       R+  +    +    V+ D   Y+SG  +
Sbjct: 312 PANPLRKELYKVGPAYRLGNETDIM-------REILTSGPVQATMKVYQDFFSYESGIYM 364

Query: 250 ----------GGHAIRILGWGEDEKSKE----KYWLIANSWNTDWGDNGLFKILRGKDEC 295
                     G H++RI+GWGED  +      KYWL+ NSW  +WG+NGLF+I RG +EC
Sbjct: 365 HTPIAELYESGYHSVRIIGWGEDISTDSGLPIKYWLVVNSWGQEWGENGLFRIRRGINEC 424

Query: 296 GIESSITA 303
            IES + A
Sbjct: 425 DIESFVVA 432


>gi|300952942|gb|ADK46902.1| cathepsin B [Radopholus similis]
          Length = 356

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 65/188 (34%), Positives = 90/188 (47%), Gaps = 50/188 (26%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQE-NYDVPYKKDLNFGAKSYSVS-S 171
           GC+PY   P          + + S   TP+C ++C+   Y   YK+D +FG   Y+V  S
Sbjct: 206 GCKPYPFLP--------HTTVEYS---TPECSKKCENYQYKKAYKQDKHFGMSVYNVQFS 254

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           +   I  EI  +GPVE       ++I+Y    F+  G      ++  W            
Sbjct: 255 DPVDIQYEIMNNGPVEA------NMIVYYDFMFYKSG---VYQTVFPW------------ 293

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
                            LGGHA+RI+GWG D  +K  YWL+ANSWNTDWG++G F+I RG
Sbjct: 294 ----------------PLGGHAVRIVGWGVDGPTKVPYWLVANSWNTDWGEDGYFRIRRG 337

Query: 292 KDECGIES 299
            DE  IES
Sbjct: 338 TDESYIES 345



 Score = 37.7 bits (86), Expect = 7.0,   Method: Compositional matrix adjust.
 Identities = 16/32 (50%), Positives = 21/32 (65%), Gaps = 1/32 (3%)

Query: 84  LPANFDSRTKWPNCP-TIREIRDQGSCGSCWG 114
           LP +FDSR ++  C   I  I+DQ +CGSCW 
Sbjct: 108 LPQHFDSRKQFTKCAKVIGTIQDQSNCGSCWA 139


>gi|66805843|ref|XP_636643.1| hypothetical protein DDB_G0288563 [Dictyostelium discoideum AX4]
 gi|60465035|gb|EAL63141.1| hypothetical protein DDB_G0288563 [Dictyostelium discoideum AX4]
          Length = 314

 Score = 98.2 bits (243), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 82/339 (24%), Positives = 123/339 (36%), Gaps = 104/339 (30%)

Query: 31  VSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGY------------- 77
           V  G++  K    ++L N    + KS    H + N       ++IG              
Sbjct: 19  VCLGSFLDKPVLDDNLINSINNNKKSSWTAHRNKNFEGKTFGDIIGMMGTKKTAAPFKLT 78

Query: 78  ---SEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAP---CEHHVNGTR 131
               E+   +P +FDSR +WP+C  I  I +Q  CGSCW     E+     C    N T 
Sbjct: 79  ENGEELKGSIPTSFDSRVQWPDC--IHPILNQEQCGSCWAFSSSEVLSDRLCIASNNKTN 136

Query: 132 P---------SCD---------------------------------ASKGHTPKCVRECQ 149
           P         +CD                                 A  G    C R C 
Sbjct: 137 PGALSPQTLVACDVYGNDGCSGGIPQLAWEYMELKGLPTDSCVPYTAGNGTVYSCQRSCS 196

Query: 150 ENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGN 209
           ++ D    +   F  K+    S+ + I + I  +GP+ G   V++D + Y SG +     
Sbjct: 197 DSEDYSLYRAKPFTLKT---CSSVQCIQENILAYGPIVGTMEVYEDFMSYSSGVY----- 248

Query: 210 ETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKY 269
                                          ++      LGGHAI+I+GWG D+ S+  Y
Sbjct: 249 -------------------------------VMTPGSSLLGGHAIKIVGWGFDQTSQLNY 277

Query: 270 WLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           W++ANSW  DWG  G F I    + C I S  +A   ++
Sbjct: 278 WIVANSWGADWGQQGFFFI--SMETCSISSDASAAEARV 314


>gi|254575663|gb|ACT68328.1| cysteine proteinase [Haemonchus contortus]
          Length = 348

 Score = 98.2 bits (243), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 59/192 (30%), Positives = 86/192 (44%), Gaps = 40/192 (20%)

Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEK 174
           C+PY    C  H      +C +    TP C   CQ  Y   Y+ D       Y + ++E+
Sbjct: 195 CKPYVFPQCGAHKGKAFNNCPSHPYATPACKPYCQYGYGKRYENDKIKAKTWYWLPNDER 254

Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
           +I  EI + GPV   F +++D   Y  G +                              
Sbjct: 255 TIQLEIMKKGPVHATFNIYEDFEHYNGGVY------------------------------ 284

Query: 235 TVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWG-DNGLFKILRGKD 293
                  ++ +G   GGH+I+I+GWG D+  K  YWLIANSW+TDWG D G F+++RG +
Sbjct: 285 -------IHTAGAMEGGHSIKIIGWGVDKGVK--YWLIANSWSTDWGEDGGYFRVVRGIN 335

Query: 294 ECGIESSITAGV 305
            C IE  + AG 
Sbjct: 336 NCDIEGGVLAGT 347



 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 19/34 (55%), Positives = 27/34 (79%)

Query: 81  DEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           ++D+P +FDSR KW +CP++R I DQ +CGSCW 
Sbjct: 93  NDDIPESFDSREKWKDCPSLRVIPDQSNCGSCWA 126



 Score = 41.2 bits (95), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 15/33 (45%), Positives = 24/33 (72%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
           + CG+GC+GG+   AW++   +G+V+GGAY  K
Sbjct: 160 KFCGYGCDGGYNARAWKWATIAGVVTGGAYKEK 192


>gi|328726763|ref|XP_003249034.1| PREDICTED: cathepsin B-like cysteine proteinase-like, partial
           [Acyrthosiphon pisum]
          Length = 129

 Score = 97.8 bits (242), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 60/172 (34%), Positives = 79/172 (45%), Gaps = 56/172 (32%)

Query: 143 KCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSG 202
           +C R C  N D+ Y  D  F    Y ++    SI K++  +GP+E +             
Sbjct: 3   RCTRMCYGNQDLDYDDDHRFTRDFYYLTYG--SIQKDVLNYGPIEAS------------- 47

Query: 203 RFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG--------KALGGHAI 254
                                          F V+DD   YKSG          LGGHA+
Sbjct: 48  -------------------------------FDVYDDFPSYKSGVYQRTPNATKLGGHAV 76

Query: 255 RILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           +++GWG +E +   YWL+ NSWN  WGDNGLFKI RG DEC I+S+ TAGVP
Sbjct: 77  KLIGWGVEEGTP--YWLMVNSWNAQWGDNGLFKIRRGTDECRIDSATTAGVP 126


>gi|166030330|gb|ABY78832.1| cathepsin B-like protease [Trypanosoma congolense]
 gi|343476577|emb|CCD12360.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 337

 Score = 97.8 bits (242), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 89/303 (29%), Positives = 117/303 (38%), Gaps = 52/303 (17%)

Query: 39  KQAEKNSLSNIPRAHLKSWMG--VHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
           K      + NI  +  K   G  +     LP  R  E     ++   LP  FD+   WP+
Sbjct: 48  KAVYNGKMQNITFSEAKRLTGARIQKSSALPPARFTE----EQLRTKLPETFDAAEHWPH 103

Query: 97  CPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPY 156
           CPTIREI DQ  C + W           +   G       S  H   C ++C +      
Sbjct: 104 CPTIREIADQSECRASWAVSTASAISDRYCTVGKGKQLRISAAHLLSCCKDCGDG----C 159

Query: 157 KKDLNFGAKSYSVSSNEKSIMKEIY-----EHGPVEGAFTVFDDLILYKSGRFFVPGNET 211
           K      A  Y V     S   + Y     EH   +G  T            F  P    
Sbjct: 160 KGGFPGFAWRYYVEYGITSSSCQPYPFPRCEHQGAQGNKTPCSKY------NFDTPKCNA 213

Query: 212 T----AMSLIKWTIRDNTSQLGAEG----------------AFTVFDDLILYKS------ 245
           T    A+ LIK+  R N + L   G                 F V+ DL  YKS      
Sbjct: 214 TCTDKAIPLIKY--RGNATYLLLHGEEDYKRELYFNGPFVAVFYVYTDLFAYKSGVYRHV 271

Query: 246 -GKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAG 304
            G  LGG A++++GWG+   +   YW +ANSW+TDWG  G   ILRG +EC IE    AG
Sbjct: 272 DGDFLGGTAVKVVGWGKLNGT--PYWKLANSWDTDWGMGGYLLILRGNNECNIEHLGFAG 329

Query: 305 VPK 307
            P+
Sbjct: 330 TPE 332



 Score = 42.4 bits (98), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 17/24 (70%), Positives = 19/24 (79%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVS 32
           CG GC GGFPG AWRY+V+ GI S
Sbjct: 155 CGDGCKGGFPGFAWRYYVEYGITS 178


>gi|403365170|gb|EJY82363.1| Cathepsin B [Oxytricha trifallax]
          Length = 309

 Score = 97.8 bits (242), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 77/312 (24%), Positives = 120/312 (38%), Gaps = 95/312 (30%)

Query: 40  QAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPT 99
           +  +N  +N   A LK  +G    +         +  +++++  LP +FDSRT+W +C  
Sbjct: 40  EVSQNKFANYTEAQLKGLLGTVLSHQ------SGISAFTQINAALPDSFDSRTQWKDC-- 91

Query: 100 IREIRDQGSCGSCWGCRPYE------IAPCEHHVNGTRP-----SCDAS----------- 137
           +  IRDQ  CGSCW     E          +  VN         SCDAS           
Sbjct: 92  VHPIRDQAKCGSCWAFAAVESLSDRFCIASQGKVNLVLSPQDMLSCDASNFCCFGGYLDT 151

Query: 138 ---------------------KGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSI 176
                                 G  P C  +C     +   K     A S   +   ++ 
Sbjct: 152 AWQYLEQQGVGSDSCEPYKSGNGDQPSCPSKCSNGQAI---KKYKCKAGSTKQAKGAEAT 208

Query: 177 MKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTV 236
              I + GPVE  FT+++D + Y SG +                                
Sbjct: 209 KSLIQQSGPVETGFTIYEDFLNYNSGIYH------------------------------- 237

Query: 237 FDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECG 296
                 + +G  +GGHA++ILGWG  ++  E YW++ANSW  DWG+ G F I +G  + G
Sbjct: 238 ------HVTGGNMGGHAVKILGWG--KQGLENYWIVANSWGEDWGEKGYFNIRQG--DSG 287

Query: 297 IESSITAGVPKL 308
           I+ +    +P +
Sbjct: 288 IDEATFGCIPDV 299


>gi|294939825|ref|XP_002782575.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239894358|gb|EER14370.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 398

 Score = 97.8 bits (242), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 62/196 (31%), Positives = 85/196 (43%), Gaps = 43/196 (21%)

Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
           GC PY+  PC H     + P+C        +CV + +    V Y  D  F  +S     +
Sbjct: 245 GCWPYDFPPCAHFFKDPKYPACPKFARVNLRCVSKLRHMM-VVYFSDRYFMVESVPYHFS 303

Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
                  I   GPV   F V++D + YKSG +                            
Sbjct: 304 ADDAKNAIRTDGPVSATFYVYEDFLAYKSGVY---------------------------- 335

Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
                     + SG  LG HA++I+GWGED    E YWL+ NSWN  WGD+GLFKI  G 
Sbjct: 336 ---------KHTSGSLLGAHAVKIIGWGED--GGEAYWLVVNSWNEGWGDHGLFKIALG- 383

Query: 293 DECGIESSITAGVPKL 308
            +CGI++ +  G PK+
Sbjct: 384 -DCGIDNELLGGTPKV 398



 Score = 44.7 bits (104), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 19/47 (40%), Positives = 29/47 (61%), Gaps = 1/47 (2%)

Query: 69  NRLPELIGYSEVDEDLPANFDSRTKWPNCP-TIREIRDQGSCGSCWG 114
           +++ E +   E  +DLP +FD+RT +P C   I  +RDQ +CG CW 
Sbjct: 125 DKVVEKVYAIEELKDLPTDFDARTAFPKCSKVIGHVRDQSACGDCWA 171


>gi|156708114|gb|ABU93315.1| cathepsin B6 cysteine protease [Monocercomonoides sp. PA]
          Length = 281

 Score = 97.8 bits (242), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 75/264 (28%), Positives = 107/264 (40%), Gaps = 90/264 (34%)

Query: 83  DLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGT---------RPS 133
           ++P NFD+R +WP    I  +RDQ SCGSCW     E       + G            S
Sbjct: 62  NVPENFDAREQWPG--KIYPVRDQASCGSCWAHAASEAIGNRFSIKGCGKGMLSVQDLVS 119

Query: 134 CD--------------------------------ASKGHTPKCVRECQENYDV-PYKKDL 160
           CD                                +  G  P C  +C     +  YK + 
Sbjct: 120 CDKGDSGCNGGSGPLSSKWLVSNGVTTEECLPYVSGNGRVPACAAKCSNGSQIIRYKYE- 178

Query: 161 NFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWT 220
              A++Y+V    ++I +E+ ++GPV   FTV+ D + YKSG +                
Sbjct: 179 --KAETYTV----QNIQEELMKNGPVYFRFTVYSDFMNYKSGVY---------------- 216

Query: 221 IRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDW 280
                                 +KSG   GGHA+ ++GWG ++     YWL+ NSW   W
Sbjct: 217 ---------------------QHKSGYQEGGHAVLLIGWGVEDGVP--YWLLQNSWGPAW 253

Query: 281 GDNGLFKILRGKDECGIESSITAG 304
           G+ G FKI+RGK+ECG E    AG
Sbjct: 254 GEKGHFKIIRGKNECGCEQGFYAG 277


>gi|268572255|ref|XP_002648916.1| Hypothetical protein CBG17829 [Caenorhabditis briggsae]
          Length = 220

 Score = 97.4 bits (241), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 54/147 (36%), Positives = 75/147 (51%), Gaps = 39/147 (26%)

Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
           G  +Y V     +I  EI  +GPV G FT+++D+  YKSG +                  
Sbjct: 111 GTSAYYVGMTVSAIQTEIMTNGPVVGVFTMYEDMYKYKSGVY------------------ 152

Query: 223 DNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
                               + +G+ LGGHAI+I+GWG   ++   YWLIANSW T WG+
Sbjct: 153 -------------------RHTAGRLLGGHAIKIIGWG--TQNGIPYWLIANSWGTKWGE 191

Query: 283 NGLFKILRGKDECGIESSITAGVPKLD 309
           NG FKI RG +ECGIE+++ AG   +D
Sbjct: 192 NGFFKIRRGVNECGIENNVVAGKADVD 218


>gi|239938578|gb|ACS36088.1| cysteine proteinase [Haemonchus contortus]
          Length = 332

 Score = 97.4 bits (241), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 54/178 (30%), Positives = 87/178 (48%), Gaps = 41/178 (23%)

Query: 115 CRPYEIAPCEHHVNGTRPSCDASKG-HTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           C+PY + PC +H  G   SC       TP C + CQ  Y   Y+KD ++    Y +  +E
Sbjct: 194 CKPYPLHPCGNH-GGKFWSCPRDHSFRTPACKKYCQYGYGKRYEKDKSYVKSVYILDEDE 252

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K+I +E+ ++GPV+ A   ++D   Y+ G                               
Sbjct: 253 KAIQREMMKNGPVQAASITYEDFSFYRRG------------------------------- 281

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
                 + ++  G+  G HA++++GWG +  +  KYW +ANSW+TDWG++G F+ILRG
Sbjct: 282 ------IYVHTRGRQRGAHAVKVVGWGVENGT--KYWNVANSWSTDWGEDGYFRILRG 331



 Score = 38.1 bits (87), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 17/33 (51%), Positives = 20/33 (60%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
           R CG GCNGG    AW Y  + G+V+GG Y  K
Sbjct: 159 RECGRGCNGGMDHKAWEYVKEFGVVTGGRYQEK 191


>gi|343474132|emb|CCD14149.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 337

 Score = 97.4 bits (241), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 88/296 (29%), Positives = 116/296 (39%), Gaps = 52/296 (17%)

Query: 46  LSNIPRAHLKSWMG--VHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREI 103
           + NI  +  K   G  +     LP  R  E     ++   LP  FD+   WP+CPTIREI
Sbjct: 55  MQNITFSEAKRLTGARIQKSSALPPARFTE----EQLRTKLPETFDAAEHWPHCPTIREI 110

Query: 104 RDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFG 163
            DQ  C + W           +   G       S  H   C ++C +      K      
Sbjct: 111 ADQSECRASWAVSTASAISDRYCTVGKGKQLRISAAHLLSCCKDCGDG----CKGGFPGF 166

Query: 164 AKSYSVSSNEKSIMKEIY-----EHGPVEGAFTVFDDLILYKSGRFFVPGNETT----AM 214
           A  Y V     S   + Y     EH   +G  T            F  P    T    A+
Sbjct: 167 AWRYYVEYGITSSSCQPYPFPRCEHQGAQGNKTPCSKY------NFDTPKCNATCTDKAI 220

Query: 215 SLIKWTIRDNTSQLGAEG----------------AFTVFDDLILYKS-------GKALGG 251
            LIK+  R N + L   G                 F V+ DL  YKS       G  LGG
Sbjct: 221 PLIKY--RGNATYLLLHGEEDYKRELYFNGPFVAVFYVYTDLFAYKSGVYRHVDGDFLGG 278

Query: 252 HAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPK 307
            A++++GWG+   +   YW +ANSW+TDWG  G   ILRG +EC IE    AG P+
Sbjct: 279 TAVKVVGWGKLNGT--PYWKLANSWDTDWGMGGYLLILRGNNECNIEHLGFAGTPE 332



 Score = 42.4 bits (98), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 17/24 (70%), Positives = 19/24 (79%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVS 32
           CG GC GGFPG AWRY+V+ GI S
Sbjct: 155 CGDGCKGGFPGFAWRYYVEYGITS 178


>gi|156708116|gb|ABU93316.1| cathepsin B7 cysteine protease, partial [Monocercomonoides sp. PA]
          Length = 273

 Score = 97.4 bits (241), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 71/268 (26%), Positives = 111/268 (41%), Gaps = 88/268 (32%)

Query: 81  DEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW------------GCR--------PYEI 120
           +E+ PA+FD R KWP       +R+QGSCGSCW            G R        P ++
Sbjct: 52  NENAPASFDCRQKWPG--KAEPVRNQGSCGSCWAHAASETMGFRMGIRRCSKGVMSPQDL 109

Query: 121 APCEHHVNG------------------TRPSC---DASKGHTPKCVRECQENYDVPYKKD 159
             CE +  G                  T   C    +  G  P C  +C+   ++     
Sbjct: 110 VSCESNNMGCNGGYADRVWNWIQKKGITTEQCIPYVSGSGRVPTCPSKCKNGSNIVRSFV 169

Query: 160 LNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKW 219
            ++G      S N K++M E+  +GPV   F VF+D   Y+SG +               
Sbjct: 170 SSWG------SFNSKTVMDEVANNGPVYACFEVFEDFYNYRSGVY--------------- 208

Query: 220 TIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
                                  +K+G++ G H + ++GWG +  +   YWL+ NSW + 
Sbjct: 209 ----------------------QHKTGRSQGWHHVMLMGWGTE--NGVPYWLLQNSWGSG 244

Query: 280 WGDNGLFKILRGKDECGIESSITAGVPK 307
           WG+ G F+I RG ++C I+    +G+PK
Sbjct: 245 WGEKGFFRIRRGTNDCHIDEIFYSGLPK 272


>gi|12330246|gb|AAG52660.1| cysteine proteinase [Metagonimus yokogawai]
          Length = 179

 Score = 97.4 bits (241), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 57/150 (38%), Positives = 70/150 (46%), Gaps = 38/150 (25%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCR Y    C HH  G  P C  +   TP CV  C +  D+ Y  D      SY+V SNE
Sbjct: 66  GCRSYPFPRCSHHGKGKYPPCPKTIFDTPNCVDHCDKP-DIDYAADKTHAKSSYNVQSNE 124

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           + IMKEI  +GPVE AF V++D I YKSG +F                            
Sbjct: 125 RVIMKEIMRNGPVEAAFMVYEDFIEYKSGIYF---------------------------- 156

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDE 263
                    +  GK LGGHAIR+LGWGE++
Sbjct: 157 ---------HSHGKLLGGHAIRMLGWGEEK 177



 Score = 45.1 bits (105), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 16/27 (59%), Positives = 24/27 (88%)

Query: 9  CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
          CGFGC+GGFP  AW +W+++G+V+GG+
Sbjct: 34 CGFGCHGGFPPRAWDFWMENGLVTGGS 60


>gi|321446975|gb|EFX60976.1| hypothetical protein DAPPUDRAFT_274869 [Daphnia pulex]
          Length = 71

 Score = 97.4 bits (241), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 47/63 (74%), Positives = 52/63 (82%), Gaps = 2/63 (3%)

Query: 246 GKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGV 305
           GKA+GGHAIRILGWG +E     YWLIAN+WNTDWGDNG  K+LRGKD CGIES IT G+
Sbjct: 11  GKAVGGHAIRILGWGVEEGVP--YWLIANNWNTDWGDNGYIKLLRGKDHCGIESQITGGL 68

Query: 306 PKL 308
           PKL
Sbjct: 69  PKL 71


>gi|343470805|emb|CCD16605.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 337

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 87/296 (29%), Positives = 117/296 (39%), Gaps = 52/296 (17%)

Query: 46  LSNIPRAHLKSWMG--VHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREI 103
           + NI  +  K   G  +     LP  R  E     ++   LP  FD+   WP+CPTIREI
Sbjct: 55  MQNITFSEAKRLTGARIQKSSALPPARFTE----EQLRTKLPETFDAAEHWPHCPTIREI 110

Query: 104 RDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFG 163
            DQ  C + W           +   G       S  H   C ++C +      K      
Sbjct: 111 ADQSECRASWAVSTASAISDRYCTVGKGKQLRISAAHLLSCCKDCGDG----CKGGFPGF 166

Query: 164 AKSYSVSSNEKSIMKEIY-----EHGPVEGAFTVFDDLILYKSGRFFVPGNETT----AM 214
           A  Y V     S   + Y     EH   +G  T            F  P    T    ++
Sbjct: 167 AWRYYVEYGITSSSCQPYPFPRCEHQGAQGNKTPCSKY------NFDTPKCNATCTDKSV 220

Query: 215 SLIKWTIRDNTSQLGAEG----------------AFTVFDDLILYKS-------GKALGG 251
            LIK+  R N + L   G                 F V+ DL  YKS       G  LGG
Sbjct: 221 PLIKY--RGNATYLLLHGEEDYKRELYFNGPFVAVFYVYTDLFAYKSGVYRNVDGDFLGG 278

Query: 252 HAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPK 307
            A++++GWG+   +   YW +ANSW+TDWG +G   ILRG +EC IE    AG P+
Sbjct: 279 TAVKVVGWGKLNGT--PYWKVANSWDTDWGMDGYLLILRGNNECNIEHLGFAGTPE 332



 Score = 42.4 bits (98), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 17/24 (70%), Positives = 19/24 (79%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVS 32
           CG GC GGFPG AWRY+V+ GI S
Sbjct: 155 CGDGCKGGFPGFAWRYYVEYGITS 178


>gi|290979437|ref|XP_002672440.1| predicted protein [Naegleria gruberi]
 gi|284086017|gb|EFC39696.1| predicted protein [Naegleria gruberi]
          Length = 354

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 83/252 (32%), Positives = 115/252 (45%), Gaps = 58/252 (23%)

Query: 83  DLPANFDSRTKWPNCPTIREIRDQGSCGSCWG-CRPYEIAPCEHHVNGTRPSCDASKGHT 141
           DLP NFD+RT+W  C  I  +RDQ +CG+CW     Y +A   H +      C A+ G T
Sbjct: 131 DLPMNFDARTQWRGC--IPAVRDQQTCGACWAFSATYVLA---HRL------CIATNGKT 179

Query: 142 PKCVR-ECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYK 200
              +  E Q   D    K    G   Y+ S  E++               T  D  I Y 
Sbjct: 180 NVVLSPEYQVQCDT-MNKACQGGYLKYAWSFLERT--------------GTTVDSCIPYA 224

Query: 201 SGR-FFVPGN-------ETTAMSLIKW----------TIRDNTSQLGA-EGAFTVFDDLI 241
           SGR  F  G         T +M++ K            I+      G+ +  FT++ D +
Sbjct: 225 SGRATFSSGTCPAKCKVSTQSMTMYKAKNSRYISGVNNIKAAIMSYGSVQSGFTIYRDFM 284

Query: 242 LYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE 294
            Y+SG         LGGHA+ ++GWG +  S   YWL  NSW ++WG +G FKI +G  E
Sbjct: 285 SYRSGVYKHVSTTTLGGHAVALIGWGVE--SGTNYWLAVNSWGSNWGMSGYFKIAQG--E 340

Query: 295 CGIESSITAGVP 306
           CGIE+ + AG P
Sbjct: 341 CGIENQVYAGEP 352


>gi|253748582|gb|EET02635.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 298

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 72/272 (26%), Positives = 107/272 (39%), Gaps = 94/272 (34%)

Query: 81  DEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGC----------------------RPY 118
           D  +P +FD R ++P+C  I E+ DQGSCGSCW                         P 
Sbjct: 71  DTKVPDSFDFREEYPHC--IPEVVDQGSCGSCWAFSSVASLGDRRCFAGLDKKAVTYSPQ 128

Query: 119 EIAPCEH----------------------HVNGTRPSCDASKGHTPKCVRECQ---ENYD 153
            +  C+H                        N   P    + G    C  +C    E   
Sbjct: 129 YVVSCDHGDMACDGGWLQSVWRFLTKTGTTTNECVPYQSGTTGARGTCPTKCADGGELST 188

Query: 154 VPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTA 213
           V  KK +++G            IMK +   GP++ AFTV+ D + Y+ G +         
Sbjct: 189 VKAKKAVDYGLDC-------DLIMKALVTGGPLQTAFTVYSDFMYYEGGVY--------- 232

Query: 214 MSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIA 273
                                        + SG+  GGHA+ ++G+G DE   + YW+I 
Sbjct: 233 ----------------------------QHMSGRVEGGHAVEMVGYGTDEYDVD-YWIIR 263

Query: 274 NSWNTDWGDNGLFKILRGKDECGIESSITAGV 305
           NSW  DWG++G F+I+R  +ECGIE  +  G+
Sbjct: 264 NSWGPDWGEDGYFRIIRMTNECGIEEQVMGGI 295


>gi|204022085|dbj|BAG71140.1| cathepsin B-S [Astegopteryx spinocephala]
          Length = 335

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 67/212 (31%), Positives = 96/212 (45%), Gaps = 59/212 (27%)

Query: 103 IRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
           I   G  GS  GC PY++ PC +   G          H  KC R C  N  V  +    +
Sbjct: 172 ITTGGDYGSNEGCAPYKVPPC-YDDQGEFLCQGKPTEHNHKCPRACYGNSTVENR----Y 226

Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
             KS  V  + K+I ++I ++GPVE +                                 
Sbjct: 227 KVKSIYVLDSSKTIEQDIRKYGPVEAS--------------------------------- 253

Query: 223 DNTSQLGAEGAFTVFDDLILYKSGKA--------LGGHAIRILGWGEDEKSKEKYWLIAN 274
                      F V+DD I YKSG          +GGH+++++GWGE++     YWL+ N
Sbjct: 254 -----------FDVYDDFITYKSGIYQKTPNAFYVGGHSVKLIGWGEEDGIP--YWLLVN 300

Query: 275 SWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           SW+  WG+ G F+I++G++ECGIE S TAGVP
Sbjct: 301 SWSKFWGEQGTFRIIKGRNECGIERSATAGVP 332



 Score = 42.0 bits (97), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 16/34 (47%), Positives = 21/34 (61%)

Query: 81  DEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           + D   +FD+R  W  C  I  +RDQG+CGSCW 
Sbjct: 83  NNDTIKHFDAREDWKICKQIGHVRDQGNCGSCWA 116



 Score = 41.2 bits (95), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 17/33 (51%), Positives = 22/33 (66%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
           CG GC GG P  AW+Y+ + GI +GG YGS + 
Sbjct: 151 CGLGCQGGNPIKAWKYFKRHGITTGGDYGSNEG 183


>gi|48762476|dbj|BAD23809.1| cathepsin B-S [Tuberaphis styraci]
 gi|204022069|dbj|BAG71132.1| cathepsin B-S1 [Tuberaphis styraci]
          Length = 349

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 63/195 (32%), Positives = 94/195 (48%), Gaps = 47/195 (24%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY++ PC +   G             +C + C   Y     +D       Y ++S E
Sbjct: 182 GCMPYKVPPC-YDEQGKNTCGGKPMERNHQCPKTC---YGKTTVQDRYKTKNEYVINSIE 237

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            +I +++  +GPVE +F V+DD  +YKSG                               
Sbjct: 238 -TIEQDLMTYGPVEASFDVYDDFSVYKSG------------------------------- 265

Query: 234 FTVFDDLILYKSGKAL--GGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
                  I  K+ KA   GGH+I+I+GWGE+  +   YWL  NSW+  WGD+G FKI++G
Sbjct: 266 -------IYRKTPKAKYEGGHSIKIIGWGEENGT--PYWLAVNSWSKFWGDHGTFKIIKG 316

Query: 292 KDECGIESSITAGVP 306
           ++ECGIE ++TAG+P
Sbjct: 317 RNECGIERAVTAGIP 331



 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 19/34 (55%), Positives = 23/34 (67%)

Query: 80  VDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
           V+ + P  FDSR  W +C  I  IRDQG+CGSCW
Sbjct: 81  VENNSPKQFDSRENWKSCKQIGHIRDQGNCGSCW 114



 Score = 39.7 bits (91), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 15/33 (45%), Positives = 22/33 (66%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
           CG GC GG+P  AW+Y+   G+ +GG Y +K+ 
Sbjct: 150 CGKGCGGGYPIKAWKYFRTQGVTTGGDYDTKEG 182


>gi|198434980|ref|XP_002126076.1| PREDICTED: similar to LOC100124858 protein [Ciona intestinalis]
          Length = 541

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 54/145 (37%), Positives = 81/145 (55%), Gaps = 33/145 (22%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           Y VSSNE++IMKEI+E+GPV+    V  D  +YKSG +    + T   +++   ++DNT 
Sbjct: 420 YRVSSNEENIMKEIFENGPVQAVMRVQPDFFVYKSGVY----SSTAIDNIVVEQVKDNTY 475

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKE---KYWLIANSWNTDWGDN 283
                                    H+++I+GWGE +KSK    KYW++ NSW  +WG+ 
Sbjct: 476 -------------------------HSVKIIGWGE-KKSKTNSGKYWIVQNSWGANWGEG 509

Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
           G F+I +G +ECGIE  I A  P++
Sbjct: 510 GYFRIRKGVNECGIEEMILAAWPQI 534


>gi|156708122|gb|ABU93319.1| cathepsin B10 cysteine protease [Monocercomonoides sp. PA]
          Length = 283

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 74/262 (28%), Positives = 109/262 (41%), Gaps = 88/262 (33%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCW---------------GC-----RPYEIAPC 123
           +P +FD+R +WPN   I  +RDQ  CGSCW               GC      P ++  C
Sbjct: 64  VPESFDARDEWPN--AILPVRDQEKCGSCWAFSIAESLGDRFGILGCGKGHLSPQDLISC 121

Query: 124 EHHVNG------------------TRPSC---DASKGHTPKCVRECQENYDVPYKKDLNF 162
           + +  G                  T  SC    +  G  P C   C  N  V  +  +N 
Sbjct: 122 DSNDLGCNGGYQENSWTWVLTTGITTESCWPYRSGSGRIPSCPHRCV-NGSVLQRNTIN- 179

Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
                 + S+E  +  E+Y +GP++  + V++D   Y  G +                  
Sbjct: 180 --NYRRLDSSE--LQDELYNNGPIQVTYVVYEDFFYYSKGIY------------------ 217

Query: 223 DNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
                               + SG  +GGHA+ ++GWG ++  K  YWL+ NSW  +WG+
Sbjct: 218 -------------------KHLSGNKVGGHAVVLMGWGIEDGVK--YWLVQNSWGYEWGE 256

Query: 283 NGLFKILRGKDECGIESSITAG 304
            G F+ILRG +ECGIESS  AG
Sbjct: 257 QGYFRILRGSNECGIESSAYAG 278


>gi|204022077|dbj|BAG71136.1| cathepsin B-S1 [Tuberaphis sumatrana]
 gi|204022079|dbj|BAG71137.1| cathepsin B-S2 [Tuberaphis sumatrana]
          Length = 334

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 79/262 (30%), Positives = 114/262 (43%), Gaps = 46/262 (17%)

Query: 80  VDEDLPANFDSRTKWPNCPTIREIRDQ---GSCGSCWGCRPYEIAPCEHHVNGTRPSCDA 136
           V+ D P  FDSR  W +C  I  IRDQ   GSC S      +    C     G + +   
Sbjct: 81  VENDSPQQFDSRENWKSCKQIGHIRDQGNCGSCWSFSTTGAFADRLCVS--TGGKFNELL 138

Query: 137 SKGHTPKCVRECQENYDVPY-----------------KKDLNFGAKSYSVSSNEKSIMKE 179
           S      C ++C    +  Y                   D   G K Y V+       K 
Sbjct: 139 SPEELAFCCKDCGNGCEGGYPIKAWRYFRTQGVTTGGDYDTKEGCKPYKVAPCYNKQGKN 198

Query: 180 IYEHGPVE-------GAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
                P+E         +    D   YK+   +V       ++ IK   +D  +    E 
Sbjct: 199 TCGGKPMERNHQCPKTCYGKTTDQKRYKTKSEYV-------INSIKTIEQDIKTYGPVEA 251

Query: 233 AFTVFDDLILYKSG--------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNG 284
           +F V+DD  +YKSG        K   GH+++I+GWG++  +   YWL  NSW+  WGD+G
Sbjct: 252 SFDVYDDFSVYKSGIYRKTPNAKYQNGHSVKIIGWGQENGTP--YWLAVNSWSKFWGDHG 309

Query: 285 LFKILRGKDECGIESSITAGVP 306
            FKI++GK+ECGIE ++TAG+P
Sbjct: 310 TFKIIKGKNECGIERAVTAGIP 331



 Score = 41.2 bits (95), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 17/35 (48%), Positives = 23/35 (65%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK 43
           CG GC GG+P  AWRY+   G+ +GG Y +K+  K
Sbjct: 150 CGNGCEGGYPIKAWRYFRTQGVTTGGDYDTKEGCK 184


>gi|209863077|ref|NP_001119612.2| cathepsin B-912 precursor [Acyrthosiphon pisum]
          Length = 342

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 60/201 (29%), Positives = 87/201 (43%), Gaps = 56/201 (27%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY + PC    +G             +C R C  + ++ Y  D  F    Y ++   
Sbjct: 187 GCAPYRVPPCFSEEDGNNTCRGQPMEKHHRCTRMCYGDQEIDYDDDHRFTRDYYYLTY-- 244

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            SI K++  +GP+E +                                            
Sbjct: 245 ASIQKDVMTYGPIEASME------------------------------------------ 262

Query: 234 FTVFDDLILYKSG--------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
             V+DD   YKSG          LGGHA++++GWGE++     YWL+ NSW+  WGD GL
Sbjct: 263 --VYDDFPSYKSGVYEKSENATYLGGHAVKLIGWGEEDGVP--YWLMVNSWSEMWGDKGL 318

Query: 286 FKILRGKDECGIESSITAGVP 306
           FKI RG +EC +++S+TAGVP
Sbjct: 319 FKIRRGTNECSVDNSMTAGVP 339



 Score = 50.8 bits (120), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 37/121 (30%), Positives = 58/121 (47%), Gaps = 27/121 (22%)

Query: 20  MAWRYWVKSGIVSG-GAYGSKQA---EKNSLSNIPRAHLKSW-MGVHPDYNLPANRLPEL 74
           M  R W+ S ++   G   ++QA   E++ + +I     K+W  G++ D N P   + +L
Sbjct: 1   MGARMWISSSVILLLGVCVTEQAYFLEEDFIDSI-NEKAKTWKAGINFDPNTPKEYIVKL 59

Query: 75  IG--------------YSEVDE-------DLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
           +G              Y   DE        +P  FD+R +W  C TI ++RDQG+CGSCW
Sbjct: 60  LGSKGVQVPHKLNLKMYKTDDEAYVNLFGRIPKKFDARKEWRRCITIGQVRDQGNCGSCW 119

Query: 114 G 114
            
Sbjct: 120 A 120



 Score = 43.9 bits (102), Expect = 0.088,   Method: Compositional matrix adjust.
 Identities = 18/32 (56%), Positives = 23/32 (71%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
            LCGF C+GG+P  AW Y+ + GIV+GG Y S
Sbjct: 153 HLCGFACHGGYPIKAWSYFRRHGIVTGGDYQS 184


>gi|161343855|tpg|DAA06108.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 342

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 60/201 (29%), Positives = 87/201 (43%), Gaps = 56/201 (27%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY + PC    +G             +C R C  + ++ Y  D  F    Y ++   
Sbjct: 187 GCAPYRVPPCFSEEDGNNTCRGQPMEKHHRCTRMCYGDQEIDYDDDHRFTRDYYYLTY-- 244

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            SI K++  +GP+E +                                            
Sbjct: 245 ASIQKDVMTYGPIEASME------------------------------------------ 262

Query: 234 FTVFDDLILYKSG--------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
             V+DD   YKSG          LGGHA++++GWGE++     YWL+ NSW+  WGD GL
Sbjct: 263 --VYDDFPSYKSGVYEKSENATYLGGHAVKLIGWGEEDGVP--YWLMVNSWSEMWGDKGL 318

Query: 286 FKILRGKDECGIESSITAGVP 306
           FKI RG +EC +++S+TAGVP
Sbjct: 319 FKIRRGTNECSVDNSMTAGVP 339



 Score = 50.8 bits (120), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 37/121 (30%), Positives = 58/121 (47%), Gaps = 27/121 (22%)

Query: 20  MAWRYWVKSGIVSG-GAYGSKQA---EKNSLSNIPRAHLKSW-MGVHPDYNLPANRLPEL 74
           M  R W+ S ++   G   ++QA   E++ + +I     K+W  G++ D N P   + +L
Sbjct: 1   MGARMWISSSVILLLGVCVTEQAYFLEEDFIDSI-NEKAKTWKAGINFDPNTPKEYIVKL 59

Query: 75  IG--------------YSEVDE-------DLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
           +G              Y   DE        +P  FD+R +W  C TI ++RDQG+CGSCW
Sbjct: 60  LGSKGVQVPHKLNLKMYKTDDEAYVNLFGRIPKKFDARKEWRRCITIGQVRDQGNCGSCW 119

Query: 114 G 114
            
Sbjct: 120 A 120



 Score = 44.3 bits (103), Expect = 0.063,   Method: Compositional matrix adjust.
 Identities = 18/34 (52%), Positives = 24/34 (70%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
            LCGF C+GG+P  AW Y+ + GIV+GG Y S +
Sbjct: 153 HLCGFACHGGYPIKAWSYFRRHGIVTGGGYQSGE 186


>gi|124487938|gb|ABN12052.1| cathepsin B endopeptidase-like protein [Maconellicoccus hirsutus]
          Length = 66

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 44/61 (72%), Positives = 51/61 (83%)

Query: 249 LGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           LGGHAIRILGWG  +K+   YWL+ANSWNTDWGD+G FKI RG +ECGIE SI AG+PKL
Sbjct: 1   LGGHAIRILGWGVCKKTNAPYWLVANSWNTDWGDHGYFKIKRGSNECGIEDSINAGIPKL 60

Query: 309 D 309
           +
Sbjct: 61  N 61


>gi|268619140|gb|ACZ13346.1| cathepsin B-like cysteine proteinase [Bursaphelenchus xylophilus]
          Length = 405

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 64/188 (34%), Positives = 87/188 (46%), Gaps = 42/188 (22%)

Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTP-KCVRECQENYDVPYKKDLNFGAKSYSVSS 171
           GC+PY    C HHVN T  P CD+   +    C  ECQ++YD  Y++DL +G + Y   S
Sbjct: 168 GCQPYPFKHCAHHVNSTEYPPCDSVPEYKADTCSHECQKDYDRKYEEDLYYGKEQYGF-S 226

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           +E  I +EI  +GPV  +FTV++  + Y  G +      +T    IK             
Sbjct: 227 DEAPIQREIMTNGPVAVSFTVYESFLYYSGGIY-----RSTPGERIK------------- 268

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF-KILR 290
                             G HA+R++GWG +  +  KYW IANSWN  WG   L      
Sbjct: 269 ------------------GYHAVRVVGWGVENGT--KYWKIANSWNEQWGRERLLPHTPA 308

Query: 291 GKDECGIE 298
           G DE  IE
Sbjct: 309 GVDESDIE 316



 Score = 45.1 bits (105), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 17/37 (45%), Positives = 25/37 (67%), Gaps = 1/37 (2%)

Query: 79  EVDEDLPANFDSRTKWPNCPTI-REIRDQGSCGSCWG 114
           ++ E++P +FD+  KWP C  +   IRDQ +CGSCW 
Sbjct: 67  DLSEEIPESFDAAEKWPECAEVFNNIRDQSNCGSCWA 103


>gi|343474137|emb|CCD14154.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 337

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 88/303 (29%), Positives = 116/303 (38%), Gaps = 52/303 (17%)

Query: 39  KQAEKNSLSNIPRAHLKSWMG--VHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
           K      + NI  +  K   G  +     L   R  E     ++   LP  FD+   WP+
Sbjct: 48  KAVYNGKMQNITFSEAKRLTGARIQKSSGLQPARFTE----EQLRTKLPETFDAAEHWPH 103

Query: 97  CPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPY 156
           CPTIREI DQ  C + W           +   G       S  H   C ++C +      
Sbjct: 104 CPTIREIADQSECRASWAVSTASAISDRYCTVGKGKQLRISAAHLLSCCKDCGDG----C 159

Query: 157 KKDLNFGAKSYSVSSNEKSIMKEIY-----EHGPVEGAFTVFDDLILYKSGRFFVPGNET 211
           K      A  Y V     S   + Y     EH   +G  T            F  P    
Sbjct: 160 KGGFPGFAWRYYVEYGITSSSCQPYPFPRCEHQGAQGNKTPCSKY------NFDTPKCNA 213

Query: 212 T----AMSLIKWTIRDNTSQLGAEG----------------AFTVFDDLILYKS------ 245
           T    A+ LIK+  R N + L   G                 F V+ DL  YKS      
Sbjct: 214 TCTDKAIPLIKY--RGNATYLLLHGEEDYKRELYFNGPFVAVFYVYTDLFAYKSGVYRHV 271

Query: 246 -GKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAG 304
            G  LGG A++++GWG+   +   YW +ANSW+TDWG  G   ILRG +EC IE    AG
Sbjct: 272 DGDFLGGTAVKVVGWGKLNGT--PYWKLANSWDTDWGMGGYLLILRGNNECNIEHLGFAG 329

Query: 305 VPK 307
            P+
Sbjct: 330 TPE 332



 Score = 42.4 bits (98), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 17/24 (70%), Positives = 19/24 (79%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVS 32
           CG GC GGFPG AWRY+V+ GI S
Sbjct: 155 CGDGCKGGFPGFAWRYYVEYGITS 178


>gi|166030322|gb|ABY78828.1| cathepsin B-like protease [Trypanosoma congolense]
 gi|343471419|emb|CCD16168.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 87/300 (29%), Positives = 122/300 (40%), Gaps = 48/300 (16%)

Query: 39  KQAEKNSLSNIPRAHLKSWMG--VHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
           K      + NI  +  K   G  +  + +LP  R  E     ++  +LP +FDS  KWPN
Sbjct: 47  KAVYNGKMQNITFSEAKRLTGAWIQKNSSLPPVRFTE----EQLRTELPESFDSAEKWPN 102

Query: 97  CPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPY 156
           CPTIREI DQ +C + W           +   G       S  H   C ++C       +
Sbjct: 103 CPTIREIADQSACRASWAVSTASAISDRYCTVGGGKQLRISAAHLLSCCKQCGGGCKGGF 162

Query: 157 KKDLNFGAKSYSVSSNEKSIMKEIY-----EHGPVEGAFTVFDDLILYKSGRFFVPGNET 211
                  A  Y V     S   + Y     EH   +G  T   +       +F  P   T
Sbjct: 163 PG----FAWRYYVEYGIASSYCQPYPFPQCEHQGAQGNKTPCSNY------KFVTPQCNT 212

Query: 212 T----AMSLIKWTIRDNTSQLGAEGAFT--------------VFDDLILYKS-------G 246
           T     + LIK+  +D    L  E  F               V+ DL  YKS       G
Sbjct: 213 TCTDKTIPLIKYRGKDAYMLLPGEEEFKRELYFNGPFVAILFVYTDLFAYKSGVYRNVDG 272

Query: 247 KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
             +G  A++++GWG+   +   YW +AN+W+TDWG +G   ILRG +EC IE    AG P
Sbjct: 273 SYMGVTAVKVVGWGKLNGT--PYWKVANTWDTDWGMDGYLLILRGNNECNIEHLGFAGTP 330


>gi|395528577|ref|XP_003766405.1| PREDICTED: dipeptidyl peptidase 1-like [Sarcophilus harrisii]
          Length = 568

 Score = 94.7 bits (234), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 68/249 (27%), Positives = 98/249 (39%), Gaps = 78/249 (31%)

Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
           +  +R+Q +CGSC+                         P EI  C  +  G        
Sbjct: 352 VSPVRNQANCGSCYAFASLGMLESRIRIKTNNSQVPVLSPQEIVSCSEYSQGCEGGFPYL 411

Query: 130 -----------TRPSCDASKGHTPKCV-RECQENYDVPYKKDLNFGAKSYSVSSNEKSIM 177
                          C   + +   C  ++C   Y   Y     F         NE  + 
Sbjct: 412 IGGKYAQDFGLVEEECFPYQAYDSPCTPKKCSRYYTSEYHYVGGFYG-----GCNEALMK 466

Query: 178 KEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVF 237
            E+ ++GP+  AF V+DD I Y++G +   G            +RDN         F  F
Sbjct: 467 HELIQNGPLTVAFEVYDDFIHYRTGIYHHTG------------LRDN---------FNPF 505

Query: 238 DDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGI 297
           +          L  HA+ ++G+G DEK+ E YW++ NSW T WG+NG F+ILRG DEC I
Sbjct: 506 E----------LTNHAVLLVGYGTDEKTGEDYWIVKNSWGTSWGENGYFRILRGTDECAI 555

Query: 298 ESSITAGVP 306
           ES   A  P
Sbjct: 556 ESIAVAATP 564


>gi|21695|emb|CAA46812.1| cathepsin B [Triticum aestivum]
          Length = 310

 Score = 94.7 bits (234), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 79/291 (27%), Positives = 109/291 (37%), Gaps = 97/291 (33%)

Query: 46  LSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRD 105
            +N      K  +GV P    P   L  +      + DLP  FD+RT+W +C TI  I D
Sbjct: 62  FANYTIEQFKHILGVKPT---PPGLLAGVPIKIHPEMDLPKEFDARTQWSSCSTIGNILD 118

Query: 106 QGSCGSCWGCRPYEIAP-------------------------CEHHVNGTRP-------- 132
           QG CG+CW     E                            C    NG  P        
Sbjct: 119 QGHCGACWAFAAVEALQDRFCIHLNMSVSLSVNDLLACCGFLCGSGCNGGYPISAWRYFR 178

Query: 133 -------SCDASKGHT-------------PKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
                   CD     T             PKC R+C+   +  +K++ +F   +Y V SN
Sbjct: 179 RSGVVTEECDPYFDQTGCQHPGCEPAYPTPKCQRKCKVE-NQAWKENKHFSVNAYRVHSN 237

Query: 173 EKSIMKEIYEHGPVEGAFTVFD--DLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA 230
              IM E+Y++GPVE AFT     D   YKSG +                          
Sbjct: 238 PHDIMAEVYKNGPVEVAFTYCQILDFAHYKSGVY-------------------------- 271

Query: 231 EGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWG 281
                       + +G  +GGHA++++GWG  + + E YWL+AN WN  WG
Sbjct: 272 -----------KHITGGVMGGHAVKLIGWGTSD-AGEDYWLLANQWNRGWG 310



 Score = 40.4 bits (93), Expect = 0.99,   Method: Compositional matrix adjust.
 Identities = 16/25 (64%), Positives = 21/25 (84%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVS 32
           LCG GCNGG+P  AWRY+ +SG+V+
Sbjct: 160 LCGSGCNGGYPISAWRYFRRSGVVT 184


>gi|204022083|dbj|BAG71139.1| cathepsin B-S [Astegopteryx styracophila]
          Length = 335

 Score = 94.7 bits (234), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 66/214 (30%), Positives = 96/214 (44%), Gaps = 59/214 (27%)

Query: 101 REIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDL 160
           R I   G  GS  GC PY++ PC +   G          H  KC R C  N  V  +   
Sbjct: 170 RGITTGGDYGSNEGCAPYKVPPC-YDDQGEFLCQGKPTEHNHKCPRACYGNSTVENR--- 225

Query: 161 NFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWT 220
            +  +S  V  + K+I ++I  +GPVE +                               
Sbjct: 226 -YKVESIYVLDSFKTIEQDIRTYGPVEAS------------------------------- 253

Query: 221 IRDNTSQLGAEGAFTVFDDLILYKSG--------KALGGHAIRILGWGEDEKSKEKYWLI 272
                        F V+DD I YKSG          +GGH+++++GWGE++     YWL+
Sbjct: 254 -------------FDVYDDFITYKSGIYQKTPNALYVGGHSVKLIGWGEEDGIP--YWLL 298

Query: 273 ANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
            NSW+  WG+ G F+I++G++ECGIE S TAG+P
Sbjct: 299 VNSWSKFWGEQGTFRIIKGRNECGIERSATAGIP 332



 Score = 41.2 bits (95), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 17/33 (51%), Positives = 22/33 (66%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
           CG GC GG P  AW+Y+ + GI +GG YGS + 
Sbjct: 151 CGLGCQGGNPIKAWKYFKRRGITTGGDYGSNEG 183



 Score = 40.4 bits (93), Expect = 0.97,   Method: Compositional matrix adjust.
 Identities = 15/27 (55%), Positives = 19/27 (70%)

Query: 87  NFDSRTKWPNCPTIREIRDQGSCGSCW 113
           +FD+R  W  C  I  +RDQG+CGSCW
Sbjct: 89  HFDARENWKICKQIGHVRDQGNCGSCW 115


>gi|10803452|emb|CAB97365.2| putative cathepsin B.2 [Ostertagia ostertagi]
          Length = 194

 Score = 94.7 bits (234), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 58/166 (34%), Positives = 83/166 (50%), Gaps = 47/166 (28%)

Query: 115 CRPYEIAPCEHHVNGTRP---SC-DASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVS 170
           CRPYE  PC  H  G  P    C D++K  TPKC + CQ  Y  PYK+D +FG  +Y + 
Sbjct: 72  CRPYEFPPCGRH--GKEPYYGECYDSAK--TPKCQKTCQRGYLKPYKEDKHFGKSAYRLP 127

Query: 171 SNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA 230
           +N K+I ++I ++GPV   F V++D   YKSG                            
Sbjct: 128 NNVKAIQRDIMKNGPVVAGFIVYEDFAHYKSG---------------------------- 159

Query: 231 EGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSW 276
                    +  + +G+  GGHA++I+GWG++  +   YWLIANSW
Sbjct: 160 ---------IYKHTAGRMTGGHAVKIIGWGKEXGT--PYWLIANSW 194



 Score = 38.5 bits (88), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 15/28 (53%), Positives = 21/28 (75%)

Query: 9  CGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
          CG+GC GG+P  AW+Y+   G+V+GG Y
Sbjct: 39 CGYGCEGGWPMKAWQYFXLEGVVTGGNY 66


>gi|149392557|gb|ABR26081.1| cathepsin b-like cysteine proteinase 3 [Oryza sativa Indica Group]
          Length = 142

 Score = 94.4 bits (233), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 56/168 (33%), Positives = 82/168 (48%), Gaps = 53/168 (31%)

Query: 146 RECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFF 205
           ++C+    V  +K  +F   +Y V+S+   IM E+Y++GPVE A                
Sbjct: 1   KKCKVQNQVWLEKK-HFSVNAYRVNSDPHDIMAEVYQNGPVEVA---------------- 43

Query: 206 VPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILG 258
                                       FTV++D   YKSG         +GGHA++++G
Sbjct: 44  ----------------------------FTVYEDFAHYKSGVYKHITGGMMGGHAVKLIG 75

Query: 259 WGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           WG  + + E YWL+AN WN  WGD+G FKI+RG +ECGIE  + AG+P
Sbjct: 76  WGTTD-AGEDYWLLANQWNRGWGDDGYFKIIRGTNECGIEEDVVAGMP 122


>gi|157116531|ref|XP_001658537.1| tubulointerstitial nephritis antigen [Aedes aegypti]
 gi|108883447|gb|EAT47672.1| AAEL001232-PA [Aedes aegypti]
          Length = 462

 Score = 94.4 bits (233), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 76/275 (27%), Positives = 108/275 (39%), Gaps = 85/275 (30%)

Query: 82  EDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHV-NGTRPSCDASKGH 140
           + LP +FD+   WP    I ++RDQG CGS W      +A     + +  R +   +   
Sbjct: 183 DHLPTHFDATNYWPG--FIGKVRDQGWCGSSWAVSTASVASDRFAILSKGRETVQLAPQQ 240

Query: 141 TPKCVRECQ--------------------------------------------ENYDVPY 156
              CVR  Q                                             N ++P 
Sbjct: 241 IVSCVRRSQGCSGGHLDTAWSYLRKVGTVNEECYPYISAHNVCKIRPSDTLITANCELPM 300

Query: 157 KKDLNFGAK---SYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTA 213
           K D     K   ++S++ NE  IM EI +HGPV+    V  D   YKSG +      T+A
Sbjct: 301 KVDRTNMYKMGPAFSLN-NETDIMLEIKKHGPVQAIMRVHRDFFSYKSGIYRHSAASTSA 359

Query: 214 MSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKE--KYWL 271
                                            +  G H++R++GWGE+    E  KYW+
Sbjct: 360 --------------------------------DQRAGYHSVRLIGWGEERHGYEVTKYWI 387

Query: 272 IANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
             NSW T WG+NG F+ILRG +EC IES + A +P
Sbjct: 388 AVNSWGTWWGENGRFRILRGSNECEIESYVLASLP 422


>gi|294877489|ref|XP_002768007.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239870145|gb|EER00725.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 344

 Score = 94.4 bits (233), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 77/301 (25%), Positives = 118/301 (39%), Gaps = 91/301 (30%)

Query: 12  GCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRL 71
           GC GG    AW +    GIV+GG +            +P+  + +  G  P Y+ P    
Sbjct: 132 GCQGGIARAAWSFLKMHGIVTGGDF------------VPKGSMSAADGCWP-YSFPKC-- 176

Query: 72  PELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPC-EHHVNGT 130
                         A+    +K+  CP +R                  + P  E H  G 
Sbjct: 177 --------------AHDQEDSKYEPCPEVR------------------VPPLGERHQRGA 204

Query: 131 RPSCDASKGHTPKCVREC-QENYDVPYKKDLNFGAKSYS-VSSNEKSIMKEIYEHGPVEG 188
             S       TP C+  C  E Y  P  KD +F A++   +     +I KEI  +GP   
Sbjct: 205 GASIHQKLYDTPSCLDRCPNEKYGTPRDKDRHFTARALPYLFEGTDNIKKEIMTNGPTSA 264

Query: 189 AFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKA 248
           +F+ ++D   YKSG +                                      + SG  
Sbjct: 265 SFSTYEDFSSYKSGVY-------------------------------------KHTSGGY 287

Query: 249 LGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           LG H++ I+GWG ++     YWL+ NSWN  WGD+G FKI +G  +CGI+ ++   +P +
Sbjct: 288 LGDHSVEIIGWGTEKGVD--YWLVMNSWNEGWGDHGTFKIAQG--DCGIDDAVQGSLPAM 343

Query: 309 D 309
           +
Sbjct: 344 N 344



 Score = 37.7 bits (86), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 16/38 (42%), Positives = 23/38 (60%), Gaps = 1/38 (2%)

Query: 83  DLPANFDSRTKWPNCP-TIREIRDQGSCGSCWGCRPYE 119
           D+P++FD+R  +  C   I  + DQ +CGSCW   P E
Sbjct: 58  DIPSSFDARDAFKECKDVIGHVWDQSACGSCWAIAPVE 95


>gi|343476073|emb|CCD12715.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score = 94.4 bits (233), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 87/300 (29%), Positives = 121/300 (40%), Gaps = 48/300 (16%)

Query: 39  KQAEKNSLSNIPRAHLKSWMG--VHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
           K      + NI  +  K   G  +    +LP  R  E     ++  +LP +FDS  KWPN
Sbjct: 47  KAVYNGKMQNITFSEAKRLTGAWIQKTSSLPPVRFTE----EQLRTELPESFDSAEKWPN 102

Query: 97  CPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPY 156
           CPTIREI DQ +C + W           +   G       S  H   C ++C       +
Sbjct: 103 CPTIREIADQSACRASWAVSTASAISDRYCTVGGGKQLRISAAHLLSCCKQCGGGCKGGF 162

Query: 157 KKDLNFGAKSYSVSSNEKSIMKEIY-----EHGPVEGAFTVFDDLILYKSGRFFVPGNET 211
                  A  Y V     S   + Y     EH   +G  T   +       +F  P   T
Sbjct: 163 PG----FAWRYYVEYGIASSYCQPYPFPQCEHHGAQGNKTPCSNY------KFVTPQCNT 212

Query: 212 T----AMSLIKWTIRDNTSQLGAEGAFT--------------VFDDLILYKS-------G 246
           T     + LIK+  +D    L  E  F               V+ DL  YKS       G
Sbjct: 213 TCTDKTIPLIKYRGKDAYMLLPGEEEFKRELYFNGPFVAILFVYTDLFAYKSGVYRNVDG 272

Query: 247 KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
             +G  A++++GWG+   +   YW +AN+W+TDWG +G   ILRG +EC IE    AG P
Sbjct: 273 SYMGVTAVKVVGWGKLNGT--PYWKVANTWDTDWGMDGYLLILRGNNECNIEHLGFAGTP 330


>gi|170045773|ref|XP_001850470.1| tubulointerstitial nephritis antigen [Culex quinquefasciatus]
 gi|167868692|gb|EDS32075.1| tubulointerstitial nephritis antigen [Culex quinquefasciatus]
          Length = 463

 Score = 94.4 bits (233), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 86/291 (29%), Positives = 120/291 (41%), Gaps = 87/291 (29%)

Query: 67  PANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW------------- 113
           P  ++  +   +   E LP +FD+ T WP    I E++DQG CGS W             
Sbjct: 169 PKFKVKSMSRLTNGQEHLPTHFDATTYWPG--FIGEVKDQGWCGSSWALSTASVASDRFA 226

Query: 114 ----GCRPYEIAPCEHHVNGTRPSCDASKGHTPKC---VR-------EC----------- 148
               G    ++AP +  ++  R S   S GH       VR       EC           
Sbjct: 227 ILSKGREIVQLAP-QQIISCVRRSQGCSGGHLDTAWNYVRKVGTVNDECYPYISAQNACK 285

Query: 149 --------QENYDVPYKKDLNFGAK---SYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLI 197
                     N D+P K D     K   ++S++ NE  IM EI +HGPV+    V  D  
Sbjct: 286 IRPSDTLITANCDLPTKVDRTNMYKMGPAFSLN-NETDIMIEIKKHGPVQAILRVHRDFF 344

Query: 198 LYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRIL 257
            YKSG +                     S  G E A                G H++R++
Sbjct: 345 SYKSGIYR----------------HSAASSAGDERA----------------GYHSVRLI 372

Query: 258 GWGEDEKSKE--KYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           GWGE+    E  KYW+  NSW   WG+NG F+I+RG++EC IES + A +P
Sbjct: 373 GWGEERNGYETTKYWVAVNSWGRWWGENGRFRIVRGQNECEIESYVLASLP 423


>gi|254575665|gb|ACT68329.1| cysteine proteinase [Haemonchus contortus]
          Length = 348

 Score = 94.4 bits (233), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 58/192 (30%), Positives = 85/192 (44%), Gaps = 40/192 (20%)

Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEK 174
           C+PY    C  H      +C +    TP     CQ  Y   Y+ D       Y + ++E+
Sbjct: 195 CKPYVFPQCGAHKGKAFNNCPSHPYATPARKPYCQYGYGKRYENDKIKARTWYWLPNDER 254

Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
           +I  EI + GPV   F +++D   Y  G +                              
Sbjct: 255 TIQLEIMQKGPVHATFNIYEDFEHYNGGVY------------------------------ 284

Query: 235 TVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWG-DNGLFKILRGKD 293
                  ++ +G   GGH+I+I+GWG D+  K  YWLIANSW+TDWG D G F+++RG +
Sbjct: 285 -------IHTAGAMEGGHSIKIIGWGVDKGVK--YWLIANSWSTDWGEDGGYFRVVRGIN 335

Query: 294 ECGIESSITAGV 305
            C IE  + AG 
Sbjct: 336 NCDIEGGVLAGT 347



 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 23/46 (50%), Positives = 31/46 (67%), Gaps = 2/46 (4%)

Query: 69  NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           N LP  I     ++D+P +FDSR KW +CP++R I DQ +CGSCW 
Sbjct: 83  NVLP--IANITSNDDIPESFDSREKWKDCPSLRVIPDQSNCGSCWA 126



 Score = 41.2 bits (95), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 15/33 (45%), Positives = 24/33 (72%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
           + CG+GC+GG+   AW++   +G+V+GGAY  K
Sbjct: 160 KFCGYGCDGGYNARAWKWATIAGVVTGGAYKEK 192


>gi|348513320|ref|XP_003444190.1| PREDICTED: tubulointerstitial nephritis antigen-like [Oreochromis
           niloticus]
          Length = 499

 Score = 94.0 bits (232), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 52/149 (34%), Positives = 76/149 (51%), Gaps = 32/149 (21%)

Query: 156 YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMS 215
           Y  D+      Y +SSNEK IMKEI ++GPV+    V +D  +YK+G +     + T +S
Sbjct: 356 YHNDIYQSTPPYRLSSNEKEIMKEIMDNGPVQAIMEVHEDFFVYKTGIY-----KHTDVS 410

Query: 216 LIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEK---SKEKYWLI 272
             K                            +  G H++RI GWGED     +  KYW+ 
Sbjct: 411 FTK------------------------PPQYRKHGTHSVRITGWGEDRNVDGTSRKYWIA 446

Query: 273 ANSWNTDWGDNGLFKILRGKDECGIESSI 301
           ANSW  +WG+NG F+I+RG++EC IE+ +
Sbjct: 447 ANSWGKNWGENGYFRIVRGENECEIETFV 475


>gi|283468816|emb|CAO98753.1| putative cathepsin B [Fasciola hepatica]
          Length = 112

 Score = 94.0 bits (232), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 56/158 (35%), Positives = 78/158 (49%), Gaps = 53/158 (33%)

Query: 156 YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMS 215
           Y++D   G  SY+V   E  IM EI ++GPV+G                           
Sbjct: 1   YEQDKVKGKSSYNVGEQETDIMMEIMKNGPVDGI-------------------------- 34

Query: 216 LIKWTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEK 268
                             F +F+D ++YKSG       + +GGHAIR++GWG +  +  K
Sbjct: 35  ------------------FYMFEDFLVYKSGIYHYTTGRLVGGHAIRVIGWGVE--NGVK 74

Query: 269 YWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           YWLIANSWN  WG+ G F++ RG +ECGIE+ I AG+P
Sbjct: 75  YWLIANSWNEGWGEKGYFRMRRGNNECGIEARINAGLP 112


>gi|239793607|dbj|BAH72912.1| ACYPI000019 [Acyrthosiphon pisum]
          Length = 188

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 63/197 (31%), Positives = 94/197 (47%), Gaps = 44/197 (22%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGH---TPKCVREC-QENYDVPYKKDLNFGAKSYSV 169
           GC+PY I PC+  +N   P    +  H   TP C ++C   NY   ++ D+ +  K Y +
Sbjct: 31  GCQPYTIPPCKL-MNEKPPGHSCTTYHREETPICEKKCYNPNYYTSFRTDI-YKGKYYKL 88

Query: 170 SSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLG 229
           S      MK+I+++GP+   F ++ DL+ YKSG +                      Q  
Sbjct: 89  SP--YMAMKDIFDNGPITTQFYMYRDLVDYKSGVY----------------------QYD 124

Query: 230 AEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKIL 289
            +  F  F              H+++I GWGE+  +   YWL+ANS+ TDWG NG FKI 
Sbjct: 125 EQSDFDFFT------------VHSVKIFGWGEE--NGVPYWLVANSFGTDWGYNGTFKIS 170

Query: 290 RGKDECGIESSITAGVP 306
           RG D C  +  + AG+P
Sbjct: 171 RGNDGCFFQEKMYAGLP 187


>gi|432884030|ref|XP_004074413.1| PREDICTED: tubulointerstitial nephritis antigen-like [Oryzias
           latipes]
          Length = 474

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 55/162 (33%), Positives = 80/162 (49%), Gaps = 34/162 (20%)

Query: 143 KCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSG 202
           +  + C   Y+  Y  D+      Y +SSNEK IMKEI E+GPV+    V +D  +YK+G
Sbjct: 320 QATQRCPNTYN--YHNDIYQSTPPYKLSSNEKEIMKEIMENGPVQAIMEVHEDFFVYKNG 377

Query: 203 RFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGED 262
            +     + T +S  K                            +  G H++RI GWGED
Sbjct: 378 IY-----KHTDVSSTK------------------------PPQYRKHGTHSVRITGWGED 408

Query: 263 EK---SKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSI 301
           +    +  KYW+ ANSW  +WG+NG F+I RG +EC IE+ +
Sbjct: 409 KDYDGTPRKYWIAANSWGKNWGENGFFRIARGANECEIEAFV 450


>gi|201023321|ref|NP_001128402.1| cathepsin B-1874 precursor [Acyrthosiphon pisum]
          Length = 315

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 65/204 (31%), Positives = 96/204 (47%), Gaps = 44/204 (21%)

Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGH---TPKCVREC-QENYDVPYKKDLNF 162
           G   S  GC+PY I PC+  +N   P    +  H   TP C ++C   NY   ++ D+ +
Sbjct: 151 GDYNSNQGCQPYTIPPCKL-MNEKPPGHSCTTYHREETPICEKKCYNPNYYTSFRTDI-Y 208

Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
             K Y +S      MK+I+++GP+   F ++ DL+ YKSG +                  
Sbjct: 209 KGKYYKLS--PYMAMKDIFDNGPITTQFYMYRDLVDYKSGVY------------------ 248

Query: 223 DNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
               Q   +  F  F              H+++I GWGE+  +   YWL+ANS+ TDWG 
Sbjct: 249 ----QYDEQSDFDFFTV------------HSVKIFGWGEE--NGVPYWLVANSFGTDWGY 290

Query: 283 NGLFKILRGKDECGIESSITAGVP 306
           NG FKI RG D C  +  + AG+P
Sbjct: 291 NGTFKISRGNDGCFFQEKMYAGLP 314



 Score = 45.8 bits (107), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 18/26 (69%), Positives = 21/26 (80%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSC 109
           LP NFDSR KWPNCP+I  I +QG+C
Sbjct: 61  LPINFDSRKKWPNCPSIGHIYNQGNC 86



 Score = 38.9 bits (89), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 21/45 (46%), Positives = 25/45 (55%), Gaps = 4/45 (8%)

Query: 1   MYTQQI----RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
           M  QQI     LCG GC+GG    +W Y+ + G VSGG Y S Q 
Sbjct: 114 MSAQQIISCCYLCGHGCDGGSLFESWDYYRRHGFVSGGDYNSNQG 158


>gi|256052327|ref|XP_002569724.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 96

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 54/132 (40%), Positives = 69/132 (52%), Gaps = 43/132 (32%)

Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
           +I KEI ++GPVE  F V++D + YKSG                                
Sbjct: 2   AIQKEIMKYGPVEANFIVYEDFLNYKSG-------------------------------- 29

Query: 235 TVFDDLILYK--SGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
                  +YK  +GK    HAIRI+GWGE+  +   YWLI NSWN DWG+NG F+ILRG+
Sbjct: 30  -------IYKHITGKLFSWHAIRIIGWGEENNT--PYWLIPNSWNEDWGENGNFRILRGR 80

Query: 293 DECGIESSITAG 304
            EC IES +TAG
Sbjct: 81  HECSIESEVTAG 92


>gi|161343849|tpg|DAA06105.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 334

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 76/289 (26%), Positives = 111/289 (38%), Gaps = 99/289 (34%)

Query: 79  EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPY-------------------- 118
           E+D  +   FD+R +WP+C TI E+ + G+    W   P                     
Sbjct: 85  EIDHQIDQEFDARKRWPHCKTIGEVHNDGNSLLSWAYVPTGVFADRMCIATNGTYNQLLS 144

Query: 119 --EIAPC------------EHHV------------------NGTRPSCDASKGHTPK--- 143
             E+  C            +++V                  NG +PS     G+ P    
Sbjct: 145 TEELISCSGIKEDEFGSVNDYYVWEYLKNHGLVSGGKYNTNNGCQPSKIPPIGNLPTGLY 204

Query: 144 ---CVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFD-DLILY 199
              C + C  N  + Y +D       Y +    + I +E+  +GPV  AF VFD D  LY
Sbjct: 205 ENTCEKRCYGNNTINYNQDHVKIKNHYDIEY--EDIQREVQNYGPVSMAFKVFDNDFFLY 262

Query: 200 KSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGW 259
           KSG +     +TT    I+W                                   +++GW
Sbjct: 263 KSGVY----EKTTNSEFIQW--------------------------------QYAKLIGW 286

Query: 260 GEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           G +  +   YWL+ N W  +WG NGLFKI RG DEC IE+ + AG P+L
Sbjct: 287 GVE--NGVDYWLLVNFWGYEWGQNGLFKIKRGTDECNIETFVHAGEPQL 333


>gi|56755425|gb|AAW25892.1| unknown [Schistosoma japonicum]
          Length = 226

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 52/152 (34%), Positives = 72/152 (47%), Gaps = 38/152 (25%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+PY    CEHH  G  PSC      TP+C R+CQ+ Y  PY+ D ++G  S +V  NE
Sbjct: 109 GCQPYPFPKCEHHSIGKYPSCGDKIYKTPQCKRKCQKGYTTPYEHDKHYGGISINVIKNE 168

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            +I KEI  +GPVE    +F+D + YKSG +                             
Sbjct: 169 SAIQKEIMMYGPVEAYLLIFEDFLNYKSGIY----------------------------- 199

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWG-EDEK 264
                    Y +G  +G H +RI+GWG E+E+
Sbjct: 200 --------RYTTGSFVGEHYVRIIGWGIENER 223



 Score = 47.0 bits (110), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 19/27 (70%), Positives = 22/27 (81%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           CG GC+GGFPG AW YWV  GIV+GG+
Sbjct: 77  CGSGCDGGFPGPAWDYWVSHGIVTGGS 103


>gi|290984292|ref|XP_002674861.1| cathepsin C [Naegleria gruberi]
 gi|284088454|gb|EFC42117.1| cathepsin C [Naegleria gruberi]
          Length = 569

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 71/229 (31%), Positives = 99/229 (43%), Gaps = 31/229 (13%)

Query: 88  FDSRTKWPNCPTIRE---IRDQGSCG----SCWGCRPYEIAPCEHHVNGTRPSCDASKG- 139
            +SR +  +   +RE   ++D  SC      C G  PY +       N    SC   KG 
Sbjct: 358 IESRIRIQSRNNVREPLAVQDIVSCSPYAQKCHGGIPYAVGRHLRDFNLVPESCFPYKGS 417

Query: 140 HTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILY 199
               C  +C+ N +   K         Y   SN  ++MKEIYEHGP+  ++ ++ D   Y
Sbjct: 418 ENVACSSKCK-NPEYIVKVTKYRYVSDYYGGSNYANMMKEIYEHGPISASYLIYPDFKYY 476

Query: 200 KSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGW 259
             G +   G           T R N    G E                    H++ I GW
Sbjct: 477 SKGIYKHSGKGYPMK-----TDRINREMNGWEPT-----------------THSVVITGW 514

Query: 260 GEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           GED K+ EKYW + NSW+  WG+NG F+I RG DEC IE+   A  P++
Sbjct: 515 GEDPKTGEKYWNVLNSWSESWGENGRFRIKRGNDECAIEAEGVAFYPEV 563


>gi|48762491|dbj|BAD23815.1| cathepsin B-S1 [Tuberaphis coreana]
          Length = 334

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 61/194 (31%), Positives = 96/194 (49%), Gaps = 45/194 (23%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS-YSVSSN 172
           GC PY++ PC ++  G             +C + C     V  +    +  KS YS++S 
Sbjct: 182 GCMPYKVPPC-YNKQGKNTCGGQPMERNHQCPKTCYGKTTVQNR----YKTKSEYSINSI 236

Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
            K+I +++  +GPVE +F V+DD  +YKSG                  I   T +   EG
Sbjct: 237 -KTIEQDLKTYGPVEASFDVYDDFSVYKSG------------------IYRKTPKAKYEG 277

Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
                              H+I+I+GWG++  +   YWL  NSW+  WG++G FKI++G+
Sbjct: 278 R------------------HSIKIIGWGQENGT--TYWLAVNSWSKFWGEHGTFKIIKGR 317

Query: 293 DECGIESSITAGVP 306
           +ECGIE ++TAG+P
Sbjct: 318 NECGIERAVTAGIP 331



 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 21/34 (61%), Positives = 24/34 (70%)

Query: 80  VDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
           V+ D P  FDSRT W +C  I  IRDQG+CGSCW
Sbjct: 81  VENDSPQQFDSRTNWKSCKQIGHIRDQGNCGSCW 114



 Score = 39.3 bits (90), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 15/32 (46%), Positives = 22/32 (68%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CG GC GG+P  AW+Y+   G+ +GG Y +K+
Sbjct: 150 CGQGCGGGYPIKAWKYFRTQGVTTGGDYDTKE 181


>gi|209863086|ref|NP_001119616.2| cathepsin B-1674 precursor [Acyrthosiphon pisum]
 gi|239799412|dbj|BAH70627.1| ACYPI000012 [Acyrthosiphon pisum]
          Length = 334

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 79/289 (27%), Positives = 113/289 (39%), Gaps = 99/289 (34%)

Query: 79  EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPY-------------------- 118
           E+D  +   FD+R +WP+C TI E+ + G+    W   P                     
Sbjct: 85  EIDHQIDQEFDARKRWPHCKTIGEVHNDGNSLLSWAYVPTGVFADRMCIATNGTYNQLLS 144

Query: 119 --EIAPC--------------------EHH--VNG----TRPSCDASK----GHTP---- 142
             E+  C                    ++H  V+G    T   C  SK    G+ P    
Sbjct: 145 TEELISCSGIKEDEFGSVNDDYVWEYLKNHGLVSGGKYNTNNGCQPSKIPPIGNLPTGLY 204

Query: 143 --KCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFD-DLILY 199
              C + C  N  + Y +D       Y +    + I +E+  +GPV  AF VFD D  LY
Sbjct: 205 ENTCEKRCYGNNTINYNQDHVKIKNHYDIEY--EDIQREVQNYGPVSMAFRVFDNDFFLY 262

Query: 200 KSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGW 259
           KSG +     +TT    I+W                                   +++GW
Sbjct: 263 KSGVY----EKTTNSEFIQW--------------------------------QYAKLIGW 286

Query: 260 GEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           G +  +   YWL+ NSW  +WG NGLFKI RG DEC IE+ + AG P+L
Sbjct: 287 GVE--NGVDYWLLVNSWGYEWGQNGLFKIKRGTDECNIETFVHAGEPQL 333


>gi|204022081|dbj|BAG71138.1| cathepsin B-S1 [Tuberaphis takenouchii]
          Length = 332

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 58/193 (30%), Positives = 92/193 (47%), Gaps = 43/193 (22%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY+I PC     G             +C + C  +  V  +    +  K+  V ++ 
Sbjct: 182 GCAPYKIPPCFDQ-KGKNTCAGKPLERNHQCPKTCYGSTTVQKR----YKVKNEYVLNSP 236

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            ++ +++ ++GP+E +F +FDDL  YKSG                  I   T +      
Sbjct: 237 NTMEQDLIKYGPIEASFNLFDDLSAYKSG------------------IYQKTPK------ 272

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                        K L GH+I+I+GWG++  +   YWL  NSW+  WG+ G F+I++G++
Sbjct: 273 ------------AKFLSGHSIKIIGWGKE--NGVPYWLAVNSWSKFWGEQGTFRIIKGRN 318

Query: 294 ECGIESSITAGVP 306
           ECGIE S TAG+P
Sbjct: 319 ECGIERSATAGIP 331



 Score = 43.1 bits (100), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 25/77 (32%), Positives = 39/77 (50%), Gaps = 4/77 (5%)

Query: 40  QAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPA--NFDSRTKWPNC 97
           +AE+   +N  + ++   +G     N  +    E+  Y  + E+  +   FDSR  W +C
Sbjct: 41  KAERFFPANTSKEYIMGLLGSRGYTNYSSEV--EIKTYDPLYEENASVEQFDSRENWKSC 98

Query: 98  PTIREIRDQGSCGSCWG 114
             I  IRDQG+CGSCW 
Sbjct: 99  KQIGRIRDQGNCGSCWA 115



 Score = 38.9 bits (89), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 16/32 (50%), Positives = 22/32 (68%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CG GC GG+P  AW+Y+   G+ +GG Y SK+
Sbjct: 150 CGKGCEGGYPIKAWQYFRTQGVPTGGDYDSKE 181


>gi|307175943|gb|EFN65753.1| Uncharacterized peptidase C1-like protein F26E4.3 [Camponotus
           floridanus]
          Length = 443

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 72/249 (28%), Positives = 105/249 (42%), Gaps = 43/249 (17%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGT-RPSCDASKGHTP 142
           LP  F+SRT+WP    I +I DQG CG+ W     ++A     +      + + S  H  
Sbjct: 203 LPREFNSRTRWPR--DISDIHDQGWCGASWAVSTADVASDRFAIMSKGAETVELSAQHLL 260

Query: 143 KCVRECQENYDVPYKKDLNFGAKSYSVSSNE---------------KSIMKEIYEHGPVE 187
            C    Q+     Y        + + +   E               +S +K      P  
Sbjct: 261 SCNNRGQQGCKGGYLDRAWLFMRKFGLVDEECYPWTGRNDQCRLRKRSNLKTAGCQNPPN 320

Query: 188 GAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG- 246
              T      LYK G  +  GNET  M  I        +    +    V+ D  +Y+SG 
Sbjct: 321 SLRTE-----LYKVGPAYRLGNETDIMQEI-------LTSGPVQATMRVYQDFFVYQSGV 368

Query: 247 ---------KALGGHAIRILGWGEDEKSK---EKYWLIANSWNTDWGDNGLFKILRGKDE 294
                       G H++RI+GWGE+   +    KYWL+ANSW  +WG+NGLF+I +G +E
Sbjct: 369 YRHSRSAELHDSGYHSVRIIGWGEEPSYRGPPLKYWLVANSWGHNWGENGLFRIQKGTNE 428

Query: 295 CGIESSITA 303
           C IES + A
Sbjct: 429 CEIESYVLA 437


>gi|322788703|gb|EFZ14296.1| hypothetical protein SINV_07506 [Solenopsis invicta]
          Length = 443

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 74/249 (29%), Positives = 105/249 (42%), Gaps = 43/249 (17%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSC-DASKGHTP 142
           LP  FDSRT+W     I  I DQG CG+ W     ++A   + +        + S     
Sbjct: 203 LPREFDSRTRWSR--DISGIHDQGWCGASWAVSTADVASDRYSIMSKGAEAPELSAQQLL 260

Query: 143 KCVRECQE---------------NYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVE 187
            C    Q+                + +  K+   +  K+      ++S +K      P  
Sbjct: 261 SCNNRGQQGCRGGYLDRAWLFMRKFGLVDKECYPWSGKNDQCKLRKRSTLKAAGCRKPSH 320

Query: 188 GAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG- 246
              T      LYK G  +  GNET  M  I        +    +    V+ D  +YKSG 
Sbjct: 321 PLRTE-----LYKVGPAYRLGNETDIMQEI-------LTSGPVQATMRVYQDFFIYKSGI 368

Query: 247 ---------KALGGHAIRILGWGEDEKSK---EKYWLIANSWNTDWGDNGLFKILRGKDE 294
                       G H++RI+GWGE+   +    KYWL+ANSW  +WGDNGLFKI +G +E
Sbjct: 369 YRHSRSAELHDSGYHSVRIIGWGEERSYRGPPLKYWLVANSWGYNWGDNGLFKIQKGTNE 428

Query: 295 CGIESSITA 303
           C IES + A
Sbjct: 429 CEIESYVLA 437


>gi|290971375|ref|XP_002668483.1| predicted protein [Naegleria gruberi]
 gi|284081912|gb|EFC35739.1| predicted protein [Naegleria gruberi]
          Length = 325

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 83/253 (32%), Positives = 113/253 (44%), Gaps = 56/253 (22%)

Query: 79  EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASK 138
           E   D+P NFD+RT+W  C  +  IRDQ +CG+CW       A   ++V   R  C A+ 
Sbjct: 98  ETRMDIPMNFDARTQWRGC--VPAIRDQQTCGACW-------AFSANYVLAHR-LCIATN 147

Query: 139 GHTPKCVR-ECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLI 197
           G T   +  E Q   D    K    G   YS +               +E   T  D  I
Sbjct: 148 GQTNVVLSPEYQVQCDT-MNKACQGGYLKYSWTF--------------LENTGTPLDTCI 192

Query: 198 LYKSGR-FFVPGN-----ETTAMSLIKWTIRDNTSQLG-------------AEGAFTVFD 238
            Y SGR  F  G      +  +MS+ K+  ++     G              +  FTV+ 
Sbjct: 193 PYASGRGTFSSGTCPTQCKIASMSMSKYKAKNTRYITGINNIKTAIMTYGSVQAGFTVYR 252

Query: 239 DLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
           DL  YKSG         LGGHA+ ++G+G +  S   YWL ANSW  +WG +G FKI +G
Sbjct: 253 DLTGYKSGVYKHVVSTVLGGHAVALIGFGVEGGSN--YWLAANSWGANWGMSGYFKIAQG 310

Query: 292 KDECGIESSITAG 304
             E GIE+ + AG
Sbjct: 311 --EGGIENQVYAG 321


>gi|294955270|ref|XP_002788457.1| cysteine protease, putative [Perkinsus marinus ATCC 50983]
 gi|239903926|gb|EER20253.1| cysteine protease, putative [Perkinsus marinus ATCC 50983]
          Length = 392

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 68/242 (28%), Positives = 98/242 (40%), Gaps = 69/242 (28%)

Query: 83  DLPANFDSRTKWPNCP-----------------------TIREIRDQGSCGSCWGCRPYE 119
           D+P +FD+R  +  C                         +  I  +GS  +  GC PY 
Sbjct: 133 DIPNSFDARDAFKECKDVIGHVCCDGCTKGRPDAAWSFLNVYGIATEGSMSAADGCWPYN 192

Query: 120 IAPCEHHVNGTR-PSCDASKGHTPKCVREC-QENYDVPYKKDLNFGA--KSYSVSSNEKS 175
              C HH   ++   C      TP C+  C  +NY  P  KD +F A    Y +   + +
Sbjct: 193 FPKCGHHQQDSKYQPCPEKNYDTPPCLDRCPNKNYGTPLDKDRHFTAHFSPYQLKGTD-N 251

Query: 176 IMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFT 235
           I KEI  +GP   AF+++DD + Y+SG +                               
Sbjct: 252 IKKEIMTNGPTSAAFSMYDDFLSYESGVY------------------------------- 280

Query: 236 VFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDEC 295
                  + SG  +G H + I+GWG   K    YWL+ NSWN  WG +G FKI +G  +C
Sbjct: 281 ------KHTSGTLMGEHGVEIIGWG--TKQGVDYWLVMNSWNEGWGVHGTFKIAQG--DC 330

Query: 296 GI 297
           GI
Sbjct: 331 GI 332


>gi|294873367|ref|XP_002766594.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
 gi|239867622|gb|EEQ99311.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
          Length = 244

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 64/207 (30%), Positives = 91/207 (43%), Gaps = 60/207 (28%)

Query: 114 GCRPYEIAPCEHHVNGT--RPSCDASKGHTPKCVREC-QENYDVPYKKDLNFGAKSY-SV 169
           GC PY    C HH +G+  +P C      TP C   C    Y   + KD ++    + S 
Sbjct: 87  GCWPYSFPKCAHHQDGSDYKP-CAKEIYDTPSCSSSCPNAKYGTAFDKDRHYTESLFPSR 145

Query: 170 SSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLG 229
             +  SI KEI  +GP   A                                        
Sbjct: 146 FGSTSSIKKEIMTNGPTSAA---------------------------------------- 165

Query: 230 AEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
               F+V++D + YKSG         LGGHA+ I+GWG ++     YWL+ NSWN +WGD
Sbjct: 166 ----FSVYEDFLSYKSGVYKHTSGGFLGGHAVEIIGWGTEKGVD--YWLVMNSWNEEWGD 219

Query: 283 NGLFKILRGKDECGIESSITAGVPKLD 309
           +G FKI++G  +CGI+ +I AG P ++
Sbjct: 220 HGTFKIVQG--DCGIDDTILAGTPAMN 244


>gi|204022073|dbj|BAG71134.1| cathepsin B-S1 [Tuberaphis taiwana]
          Length = 334

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 62/208 (29%), Positives = 94/208 (45%), Gaps = 59/208 (28%)

Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
           G  G+  GC PY++ PC ++  G             +C + C     V  +    +  KS
Sbjct: 175 GDYGTKEGCMPYKVPPC-YNKQGKNTCGGQPMERNHQCPKTCYGKTTVQNR----YKTKS 229

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
             V ++ K+I ++I  +GPV                                        
Sbjct: 230 EYVINSIKTIERDIMTYGPV---------------------------------------- 249

Query: 227 QLGAEGAFTVFDDLILYKSG--------KALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
               E +F V+DDL  YKSG        K  GGH+I+I+GWG+   +   YWL  NSW+ 
Sbjct: 250 ----EASFDVYDDLSAYKSGIYRKTPKAKYQGGHSIKIIGWGQQNGTP--YWLAVNSWSK 303

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVP 306
            WG++G FKI++G++ECGIE ++TAG+P
Sbjct: 304 FWGEHGTFKIIKGRNECGIERAVTAGIP 331



 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 21/34 (61%), Positives = 24/34 (70%)

Query: 80  VDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
           V+ D P  FDSRT W +C  I  IRDQG+CGSCW
Sbjct: 81  VENDSPQQFDSRTNWKSCKQIGHIRDQGNCGSCW 114


>gi|327281715|ref|XP_003225592.1| PREDICTED: tubulointerstitial nephritis antigen-like [Anolis
           carolinensis]
          Length = 520

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 56/193 (29%), Positives = 89/193 (46%), Gaps = 36/193 (18%)

Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVP--YKKDLNFGAKSYSVSSN 172
           C P+      H  N   P+C      T +  R+       P  +  ++     +Y +SSN
Sbjct: 343 CYPFSNQETNHSPNA--PACMMHSRSTGRGKRQAIARCPNPRSHANEIYQSTPAYRLSSN 400

Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
           EK IMKE+ E+GPV+    V +D  +Y++G +                 R      G   
Sbjct: 401 EKEIMKELMENGPVQAILEVHEDFFMYRTGIY-----------------RHTAVAAGKPE 443

Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWGEDEK---SKEKYWLIANSWNTDWGDNGLFKIL 289
            +            +  G H+++I GWGE++    S +KYW+ ANSW  DWG++G F+I 
Sbjct: 444 QY------------RRHGTHSVKITGWGEEQMPDGSNQKYWIAANSWGKDWGEHGYFRIT 491

Query: 290 RGKDECGIESSIT 302
           RG++EC IE+ + 
Sbjct: 492 RGENECEIETFVV 504


>gi|290981656|ref|XP_002673546.1| predicted protein [Naegleria gruberi]
 gi|284087130|gb|EFC40802.1| predicted protein [Naegleria gruberi]
          Length = 362

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 83/253 (32%), Positives = 113/253 (44%), Gaps = 56/253 (22%)

Query: 79  EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASK 138
           E   D+P NFD+RT+W  C  +  IRDQ +CG+CW       A   ++V   R  C A+ 
Sbjct: 135 ETRIDIPMNFDARTQWKGC--VPAIRDQQTCGACW-------AFSANYVLAHR-LCIATN 184

Query: 139 GHTPKCVR-ECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLI 197
           G T   +  E Q   D    K    G   YS +               +E   T  D  I
Sbjct: 185 GQTNVVLSPEYQVQCDT-MNKACQGGYLKYSWTF--------------LENTGTPLDSCI 229

Query: 198 LYKSGR-FFVPGN-----ETTAMSLIKWTIRDNTSQLG-------------AEGAFTVFD 238
            Y SGR  F  G      +  +MS+ K+  ++     G              +  FTV+ 
Sbjct: 230 PYASGRGTFSSGTCPTQCKIASMSMSKYKAKNTVYISGINNIKTAIMTYGSVQAGFTVYR 289

Query: 239 DLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
           DL  YKSG         LGGHA+ ++G+G +  S   YWL ANSW  +WG +G FKI +G
Sbjct: 290 DLTGYKSGVYKHIENTVLGGHAVALIGFGVEGGSN--YWLAANSWGPNWGMSGYFKIAQG 347

Query: 292 KDECGIESSITAG 304
             E GIE+ + AG
Sbjct: 348 --EGGIENQVYAG 358


>gi|403332696|gb|EJY65386.1| Cathepsin B [Oxytricha trifallax]
          Length = 297

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 86/312 (27%), Positives = 119/312 (38%), Gaps = 96/312 (30%)

Query: 40  QAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPT 99
           +   N  S++ +  L +  G    Y +P+N+  +  G   +    P NFD+R +W +   
Sbjct: 39  ETTTNPFSDLTKEQLLAKCGT---YIVPSNK--QYPGSPLIS--TPDNFDARQQWGS--K 89

Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
           I  IRDQ  CG+CW                         P ++  C+ +  G        
Sbjct: 90  IHAIRDQQQCGACWAFGATEALSDRFTIASNGSVDVVFSPEDLVSCDTNDYGCNGGYMDM 149

Query: 130 ----------TRPSC---DASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSI 176
                        SC    A  G  P C  +C    D   +K  +    S   S   + I
Sbjct: 150 AWEFLDQHGVVADSCFPYSAGSGFAPACASKCA---DGSAEKKYSCVHGSIRQSQGVEQI 206

Query: 177 MKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTV 236
             EI  HGPVEGAFTV+ D   Y+SG  + P     A                       
Sbjct: 207 KSEIVAHGPVEGAFTVYTDFFNYQSG-VYTPTTSDVA----------------------- 242

Query: 237 FDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECG 296
                        GGHAI+ILG+G +  +   YWL ANSW   WG  G FKI +G  ECG
Sbjct: 243 -------------GGHAIKILGFGVENGT--PYWLCANSWGPSWGMQGFFKIKQG--ECG 285

Query: 297 IESSITAGVPKL 308
           IE  + +  P+L
Sbjct: 286 IEDQVFSCDPQL 297


>gi|291236490|ref|XP_002738176.1| PREDICTED: cathepsin C-like [Saccoglossus kowalevskii]
          Length = 438

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 73/297 (24%), Positives = 112/297 (37%), Gaps = 80/297 (26%)

Query: 54  LKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
           L +W+   P    P    P     +++  +LPA FD R        +  +R+Q SCGSC+
Sbjct: 180 LVTWLPTPPIMQ-PPKPAPITSQSAQIAANLPAEFDWRNV-GGVNYVTPVRNQASCGSCF 237

Query: 114 -------------------------------------GCR---PYEIAPCEHHVNGTRPS 133
                                                GC    PY ++           +
Sbjct: 238 AFASAGMYESRLKVMTANEVNITISPQDVVQCCNYSQGCSGGFPYLVSKYSEDFGFVEET 297

Query: 134 CDASKGHTPKCVRE--CQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFT 191
           C         CV E  C+ +Y   Y+   +F         NE  +  E+ ++GP+  AF 
Sbjct: 298 CLPYTAQDGPCVSEIKCKRHYGTKYRYVGDFYG-----GCNEALMKIELVKNGPMAVAFM 352

Query: 192 VFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGG 251
           V+DD + Y+ G +                        G +  F  F+          +  
Sbjct: 353 VYDDFMSYQGGIY---------------------HHTGLQDKFNPFE----------ITN 381

Query: 252 HAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           HA+ ++G+G D  +KEK+W++ NSW T WG+ G F+I RG DEC IES      P L
Sbjct: 382 HAVLLVGYGYDHDTKEKFWIVKNSWGTGWGEEGYFRIRRGNDECSIESIAVESTPIL 438


>gi|427783627|gb|JAA57265.1| hypothetical protein [Rhipicephalus pulchellus]
          Length = 483

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 53/145 (36%), Positives = 73/145 (50%), Gaps = 31/145 (21%)

Query: 161 NFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWT 220
           +F    Y V +NE+ IM+EIY +GPV+    V +D  LY+SG +          + I  +
Sbjct: 327 HFSTPPYRVPANEEDIMQEIYANGPVQALILVKEDFFLYRSGVY--------RHTRIAES 378

Query: 221 IRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKE--KYWLIANSWNT 278
           +R   S+ G                      H++RILGWG D       KYWL ANSW  
Sbjct: 379 LRPQYSRSG---------------------WHSVRILGWGVDRSQYRPIKYWLCANSWGH 417

Query: 279 DWGDNGLFKILRGKDECGIESSITA 303
            WG+NG F+I+RG+DE  IES + A
Sbjct: 418 GWGENGYFRIVRGEDESQIESFVLA 442


>gi|323447573|gb|EGB03489.1| hypothetical protein AURANDRAFT_72715 [Aureococcus anophagefferens]
          Length = 812

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 74/259 (28%), Positives = 106/259 (40%), Gaps = 88/259 (33%)

Query: 83  DLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHV--NGTRP-------- 132
           D+P+ F++ T+W     ++ IRDQ  CGSCW     E+    + +  N   P        
Sbjct: 339 DVPSEFNAVTQWKGL--VQPIRDQQQCGSCWAFSAAEVLSDRNAIQHNKAEPVLSPEDLV 396

Query: 133 SCD--------------------------------ASKGHTPKCVRECQENYD-VPYKKD 159
           SCD                                A  G  PKC   C++      YK  
Sbjct: 397 SCDRVDQGCNGGNLGTAWTYLKNTGIVTDACFPYTAGGGDAPKCETSCKDGSSWTKYK-- 454

Query: 160 LNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKW 219
               A +Y+V+  E ++ KEI  HGP++ AF V+   + YKSG +             KW
Sbjct: 455 ---AASAYAVNGVE-NMQKEIMTHGPIQVAFNVYKSFMSYKSGVY-----------AKKW 499

Query: 220 TIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
                  +L  EG                  GHA++I+GWG +    + YWL+ANSWNT 
Sbjct: 500 Y------ELMPEG------------------GHAVKIVGWGTE--GGKDYWLVANSWNTS 533

Query: 280 WGDNGLFKILRGKDECGIE 298
           WGD G FKI  G +   ++
Sbjct: 534 WGDEGYFKIAVGAESISLD 552


>gi|290998826|ref|XP_002681981.1| predicted protein [Naegleria gruberi]
 gi|284095607|gb|EFC49237.1| predicted protein [Naegleria gruberi]
          Length = 310

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 83/253 (32%), Positives = 113/253 (44%), Gaps = 56/253 (22%)

Query: 79  EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASK 138
           E   D+P NFD+RT+W  C  +  IRDQ +CG+CW       A   ++V   R  C A+ 
Sbjct: 83  ETRVDIPMNFDARTQWKGC--VPAIRDQQTCGACW-------AFSANYVLAHR-LCIATN 132

Query: 139 GHTPKCVR-ECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLI 197
           G T   +  E Q   D    K    G   YS +               +E   T  D  I
Sbjct: 133 GKTNVVLSPEYQVQCDT-MNKACQGGYLKYSWTF--------------LENTGTPLDTCI 177

Query: 198 LYKSGR-FFVPGN-----ETTAMSLIKWTIRDNTSQLG-------------AEGAFTVFD 238
            Y SGR  F  G      +  +MS+ K+  ++     G              +  FTV+ 
Sbjct: 178 PYASGRGTFSSGTCPTQCKIASMSMSKYKAKNTVYISGINNIKTAIMTYGSVQAGFTVYR 237

Query: 239 DLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
           DL  YKSG         LGGHA+ ++G+G +  S   YWL ANSW  +WG +G FKI +G
Sbjct: 238 DLTGYKSGVYKHVVSTVLGGHAVALIGFGVEGGSN--YWLAANSWGPNWGMSGYFKIAQG 295

Query: 292 KDECGIESSITAG 304
             E GIE+ + AG
Sbjct: 296 --EGGIENQVYAG 306


>gi|290990726|ref|XP_002677987.1| predicted protein [Naegleria gruberi]
 gi|284091597|gb|EFC45243.1| predicted protein [Naegleria gruberi]
          Length = 225

 Score = 90.9 bits (224), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 82/249 (32%), Positives = 112/249 (44%), Gaps = 56/249 (22%)

Query: 83  DLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTP 142
           D+P NFD+RT+W  C  +  IRDQ +CG+CW       A   ++V   R  C A+ G T 
Sbjct: 2   DIPMNFDARTQWRGC--VPAIRDQQTCGACW-------AFSANYVLAHRL-CIATNGQTN 51

Query: 143 KCVR-ECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKS 201
             +  E Q   D    K    G   YS +               +E   T  D  I Y S
Sbjct: 52  VVLSPEYQVQCDT-MNKACQGGYLKYSWTF--------------LENTGTPLDTCIPYAS 96

Query: 202 GR-FFVPGN-----ETTAMSLIKWTIRDNTSQLG-------------AEGAFTVFDDLIL 242
           GR  F  G      +  +MS+ K+  ++     G              +  FTV+ DL  
Sbjct: 97  GRGTFSSGTCPTQCKIASMSMSKYKAKNTRYITGINNIKTAIMTYGSVQAGFTVYRDLTG 156

Query: 243 YKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDEC 295
           YKSG         LGGHA+ ++G+G +  S   YWL ANSW  +WG +G FKI +G  E 
Sbjct: 157 YKSGVYKHVVSTVLGGHAVALIGFGVEGGSN--YWLAANSWGPNWGMSGYFKIAQG--EG 212

Query: 296 GIESSITAG 304
           GIE+ + AG
Sbjct: 213 GIENQVYAG 221


>gi|294935195|ref|XP_002781337.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239891887|gb|EER13132.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 317

 Score = 90.9 bits (224), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 61/216 (28%), Positives = 90/216 (41%), Gaps = 57/216 (26%)

Query: 103 IRDQGSCGSCWGCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVREC-QENYDVPYKKDL 160
           I  +GS  +  GC PY    C HH   ++   C      TP C+  C  E Y +P  KD 
Sbjct: 150 IATEGSMSAADGCWPYNFPKCAHHQKKSKYEPCSKKLYDTPSCLDRCPNEKYGIPLDKDR 209

Query: 161 NFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWT 220
           +F A S  +     +I KEI  +GP    F+                             
Sbjct: 210 HFTAHSPDLFEGTDNIKKEIMTNGPTSATFS----------------------------- 240

Query: 221 IRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIA 273
                          V++D + YKSG         +G H++ I+GWG ++     YWL+ 
Sbjct: 241 ---------------VYEDFVSYKSGVYKHTNGTLMGIHSVEIIGWGTEKGVD--YWLVM 283

Query: 274 NSWNTDWGDNGLFKILRGKDECGIESSITAGVPKLD 309
           NSWN  WGD+G FKI +G  +CGI+ ++    P ++
Sbjct: 284 NSWNEGWGDHGTFKIAQG--DCGIDDAVLGSPPAMN 317


>gi|28974200|gb|AAO61484.1| cathepsin B [Sterkiella histriomuscorum]
          Length = 294

 Score = 90.9 bits (224), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 88/294 (29%), Positives = 125/294 (42%), Gaps = 63/294 (21%)

Query: 40  QAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPT 99
           +   N  +N+ +  L +  G    Y +PAN+  E  G   +   +P NFD+R +W +   
Sbjct: 39  ETTTNPFNNMTKEQLLAKCGT---YIVPANK--EYPGSKIMT--VPENFDARQQWGS--K 89

Query: 100 IREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKD 159
           I  IRDQ  CGSCW     E       +NG           +P+ +  C  N        
Sbjct: 90  IHAIRDQQQCGSCWAFGATEAFSDRFAINGKDVIL------SPEDLVSCDTN-------- 135

Query: 160 LNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPG-----NETTAM 214
            ++G             M   +E+    GA T  D    Y +G  F P       + +AM
Sbjct: 136 -DYGCNG--------GYMDVAWEYLADHGAAT--DSCFPYSAGSGFAPACSDKCADGSAM 184

Query: 215 SLIKW---TIRDN----------TSQLGAEGAFTVFDDLILYKSG-------KALGGHAI 254
              K    ++R +           S    EGAFTV+ D   Y+SG          GGHAI
Sbjct: 185 QRFKCAPNSVRQSKGVAQIQSEIVSHGPVEGAFTVYTDFFNYQSGVYTPTTTDVAGGHAI 244

Query: 255 RILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           +ILG+G +  +   YWL ANSW   WG +G FKI +G  ECGIE  + +  P+L
Sbjct: 245 KILGYGVENGTP--YWLCANSWGPAWGMSGFFKIKQG--ECGIEDQVFSCDPQL 294


>gi|324512900|gb|ADY45327.1| Peptidase C1-like protein [Ascaris suum]
          Length = 450

 Score = 90.5 bits (223), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 51/144 (35%), Positives = 72/144 (50%), Gaps = 39/144 (27%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           Y VSS E+ IM EI  +GPV+  F V++D  +Y  G                        
Sbjct: 318 YRVSSREQDIMTEIITNGPVQATFLVYEDFFMYSGG------------------------ 353

Query: 227 QLGAEGAFTVFDDLILYK----SGKALGGHAIRILGWGEDEKS--KEKYWLIANSWNTDW 280
                    V+  L L++      K  G H++RI+GWGED  +  + KYWL ANSW  +W
Sbjct: 354 ---------VYQHLDLHEHKEEERKVQGYHSVRIIGWGEDYSTGPQVKYWLAANSWGNEW 404

Query: 281 GDNGLFKILRGKDECGIESSITAG 304
           G++GLF+ILRG++ C IES +   
Sbjct: 405 GEDGLFRILRGENHCEIESFVIGA 428


>gi|63115212|gb|AAY33830.1| cathepsin B, partial [Siniperca chuatsi]
          Length = 69

 Score = 90.5 bits (223), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 43/62 (69%), Positives = 48/62 (77%), Gaps = 2/62 (3%)

Query: 246 GKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGV 305
           G A+GGHAI+ILGWGE++     YWL ANSWNTDWGDNG FK LRG D C IES I AG+
Sbjct: 10  GSAVGGHAIKILGWGEEDGVP--YWLCANSWNTDWGDNGFFKFLRGSDHCRIESEIVAGI 67

Query: 306 PK 307
           PK
Sbjct: 68  PK 69


>gi|167508668|gb|ABZ81540.1| cathepsin B-like cysteine protease [Caenorhabditis brenneri]
          Length = 193

 Score = 90.5 bits (223), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 58/187 (31%), Positives = 79/187 (42%), Gaps = 41/187 (21%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVP--YKKDLNFGAKSYSVSS 171
           GC+PY I PC+        S      HTP C   C  N   P  YK+  +FG   Y+V  
Sbjct: 46  GCKPYTIYPCDKTYPNGTTSVPCPGYHTPVCEERCTSNITWPISYKQVKHFGKAHYNVGK 105

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
               I  EI  +GPV  +F ++DD   YKSG +                           
Sbjct: 106 KMTDIQTEIMRNGPVIASFIIYDDFWDYKSGIY--------------------------- 138

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
                     ++ +G   GG   +I+GWG D  +   YWL  + W TD+G+NG  +ILRG
Sbjct: 139 ----------VHTAGDQEGGMDTKIIGWGVD--NGVPYWLCVHQWGTDFGENGFMRILRG 186

Query: 292 KDECGIE 298
            +E  IE
Sbjct: 187 VNEVHIE 193


>gi|10803454|emb|CAB97366.2| putative cathepsin B.3 [Ostertagia ostertagi]
          Length = 196

 Score = 90.5 bits (223), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 51/163 (31%), Positives = 75/163 (46%), Gaps = 40/163 (24%)

Query: 115 CRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           C+PY   PC +H N T    C      TP C + CQ  Y   Y+KD  +   +Y VSS+E
Sbjct: 73  CKPYTFHPCGYHKNQTYYGECPKHTYQTPACKKYCQYGYGKRYEKDKIYAXDAYRVSSDE 132

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            +I  EI+  GPV+ +F  ++D   YKSG                               
Sbjct: 133 AAIRAEIFARGPVQASFATYEDFAHYKSG------------------------------- 161

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSW 276
                 + ++ +GK  GGHA++I+GWG +  +K   W++ANSW
Sbjct: 162 ------IYVHTAGKRRGGHAVKIIGWGVENGTKX--WIVANSW 196



 Score = 40.4 bits (93), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 17/32 (53%), Positives = 20/32 (62%)

Query: 8  LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
           CG+GCNGG+   AW Y   SG+ SGG Y  K
Sbjct: 39 FCGYGCNGGYSARAWLYARNSGVCSGGRYQEK 70


>gi|281204808|gb|EFA79003.1| hypothetical protein PPL_08471 [Polysphondylium pallidum PN500]
          Length = 322

 Score = 90.5 bits (223), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 74/249 (29%), Positives = 110/249 (44%), Gaps = 18/249 (7%)

Query: 71  LPELIGYSEVDE-DLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHV-N 128
           L  ++ Y++ D  ++PA+FD+RT+WPNC  I  +RDQGSC SCW      I      + +
Sbjct: 25  LDNVVSYTDQDRANIPASFDARTQWPNC--ISPVRDQGSCSSCWAMTSSSILADRLCIAS 82

Query: 129 GTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEI---YEHGP 185
           G       S  +   C + C+ N          FG    S+      I  E    Y+   
Sbjct: 83  GGAIKKLLSPQYMVDCAKNCKTNSQSDCNSGCKFGFLDISMEYLSNGISAESCLPYKESD 142

Query: 186 VEGAFTVFDD--LILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLI-- 241
                   D   + LY        GN   A    +  I  N   L     FT   ++   
Sbjct: 143 ATCPSQCKDGSPIQLYYGSGCISIGNLKDA----QLEIMKNGPILAVFQIFTSLYNIGSG 198

Query: 242 LYK-SGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESS 300
           LY+ +G    GHA R++GWGE+  +   YWL  NSW T++G +G FK+  G++  G ES 
Sbjct: 199 LYRGTGDPAEGHAARVIGWGEENGTP--YWLALNSWGTEFGMDGAFKVPMGENIAGFESQ 256

Query: 301 ITAGVPKLD 309
           + +  P +D
Sbjct: 257 LLSVKPNVD 265


>gi|294897889|ref|XP_002776090.1| cysteine protease Cys2, putative [Perkinsus marinus ATCC 50983]
 gi|239882699|gb|EER07906.1| cysteine protease Cys2, putative [Perkinsus marinus ATCC 50983]
          Length = 134

 Score = 90.5 bits (223), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 56/171 (32%), Positives = 81/171 (47%), Gaps = 43/171 (25%)

Query: 141 TPKCVREC-QENYDVPYKKDLNFGAKSY-SVSSNEKSIMKEIYEHGPVEGAFTVFDDLIL 198
           TP C   C    Y   + KD ++    + S   +  SI KEI  +GP   AF+V++D + 
Sbjct: 5   TPSCSSSCPNAKYGTAFDKDRHYTESLFPSRFGSTSSIKKEIMTNGPTSAAFSVYEDFLS 64

Query: 199 YKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILG 258
           YKSG +                                      + SG  LGGHA+ I+G
Sbjct: 65  YKSGVY-------------------------------------KHTSGGFLGGHAVEIIG 87

Query: 259 WGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKLD 309
           WG ++     YWL+ NSWN +WGD+G FKI++G  +CGI+  I AG P ++
Sbjct: 88  WGTEKGV--DYWLVMNSWNEEWGDHGTFKIVQG--DCGIDDMILAGTPAIN 134


>gi|321478457|gb|EFX89414.1| hypothetical protein DAPPUDRAFT_303204 [Daphnia pulex]
          Length = 442

 Score = 90.5 bits (223), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 73/286 (25%), Positives = 109/286 (38%), Gaps = 101/286 (35%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCWG--------------CRPYEIAPC------ 123
           LP +FD R +W +  T++++RDQG CG+ W                R +E+ P       
Sbjct: 185 LPMSFDGRIEWRD--TLQDVRDQGWCGASWAFSTAAVAADRLAIQSRGHEVYPLSMQNLL 242

Query: 124 ------EHHVNGTR-------------------PSCDASKGHTPKC---------VRECQ 149
                 +   NG                     P      G   KC           +CQ
Sbjct: 243 ACNNRGQQGCNGGHLDRAWNYMRRFGVVNEECYPYISGRTGQVEKCKVPRRGNLATMKCQ 302

Query: 150 ---------ENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYK 200
                    +  D P +K L     +Y ++  E  IM EI +HGPV+    V  D  LY+
Sbjct: 303 LVNAAERKSDRSDKPPRKGLFRSPPAYRIAPFEDDIMNEILQHGPVQATMRVHPDFFLYR 362

Query: 201 SGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWG 260
            G +   G  +   S                                  G H++RI+GWG
Sbjct: 363 GGVYRYSGTNSQQRS----------------------------------GYHSVRIVGWG 388

Query: 261 ED--EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAG 304
            D  +++  KYWL+ANSW   WG++G F+I+RG++E  IE  + A 
Sbjct: 389 VDSSKRNPTKYWLVANSWGRLWGEDGYFRIVRGENESDIEKFVLAA 434


>gi|290998874|ref|XP_002682005.1| predicted protein [Naegleria gruberi]
 gi|284095631|gb|EFC49261.1| predicted protein [Naegleria gruberi]
          Length = 310

 Score = 90.5 bits (223), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 83/255 (32%), Positives = 113/255 (44%), Gaps = 56/255 (21%)

Query: 79  EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASK 138
           E   D+P NFD+RT+W  C  +  IRDQ +CG+CW       A   ++V   R  C A+ 
Sbjct: 83  ETRVDIPMNFDARTQWKGC--VPAIRDQQTCGACW-------AFSANYVLAHR-LCIATN 132

Query: 139 GHTPKCVR-ECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLI 197
           G T   +  E Q   D    K    G   YS +               +E   T  D  I
Sbjct: 133 GQTNVVLSPEYQVQCDT-MNKACQGGYLKYSWTF--------------LENTGTPLDTCI 177

Query: 198 LYKSGR-FFVPGN-----ETTAMSLIKWTIRDNTSQLG-------------AEGAFTVFD 238
            Y SG   F  G      +  +MS+ K+  ++     G              +  FTV+ 
Sbjct: 178 PYASGGGTFSSGTCPTQCKIASMSMSKYKAKNTVYISGINNIKTAIMTYGSVQAGFTVYR 237

Query: 239 DLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
           DL  YKSG         LGGHA+ ++G+G +  S   YWL ANSW  +WG +G FKI +G
Sbjct: 238 DLTGYKSGVYKHLVSTVLGGHAVALIGFGVEGGSN--YWLAANSWGPNWGMSGYFKIAQG 295

Query: 292 KDECGIESSITAGVP 306
             E GIE+ + AG P
Sbjct: 296 --EGGIENQVYAGEP 308


>gi|290990464|ref|XP_002677856.1| predicted protein [Naegleria gruberi]
 gi|284091466|gb|EFC45112.1| predicted protein [Naegleria gruberi]
          Length = 231

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 74/257 (28%), Positives = 96/257 (37%), Gaps = 92/257 (35%)

Query: 86  ANFDSRTKWPNCPTIREIRDQGSCGSCWG----------------------CRPYEIAPC 123
           A FDSR KWPNC  +  IRDQG+CGSC+                         P ++  C
Sbjct: 4   AEFDSRQKWPNC--VHPIRDQGNCGSCYSFASSEVMSDRFCIFSNGSVNVVLSPQDLVTC 61

Query: 124 EHH---VNGTRPSCDASKGHTP-------------------KCVRECQENYDVPYKKDLN 161
             +    NG  P       H                     KC   C  N    +K D +
Sbjct: 62  SWYSFGCNGGIPGLVFDYIHKDGLVSDACFPYLSYDGNTHVKCPDFCYNNKTKSFKSDKH 121

Query: 162 FGAKSYSV-------SSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAM 214
           F  K Y V       +     I KEI  HGPV   F V+ D  +YKSG +          
Sbjct: 122 FADKVYHVGEFLEDKAKRVLEIQKEILTHGPVNADFMVYSDFTVYKSGVY---------- 171

Query: 215 SLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIAN 274
                                       +++G   G HA++I+GWG +  +   YWLIAN
Sbjct: 172 ---------------------------RHQTGSFEGIHAVKIIGWGTE--NGVDYWLIAN 202

Query: 275 SWNTDWGDNGLFKILRG 291
           SW T +G  G FKI+RG
Sbjct: 203 SWGTTFGLQGFFKIVRG 219


>gi|432892467|ref|XP_004075795.1| PREDICTED: dipeptidyl peptidase 1-like [Oryzias latipes]
          Length = 453

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 77/307 (25%), Positives = 119/307 (38%), Gaps = 88/307 (28%)

Query: 50  PRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLP---ANFDSRTKWPNC---PTIREI 103
           P   + +   +H     PA+R+P  +  + V  DL    A       W N      +  +
Sbjct: 181 PEHEMYTLQELHYRAGGPASRVPVRVRPAPVTADLAKVAAALPESWDWRNVGGVNFVSPV 240

Query: 104 RDQGSCGSCWGC----------------------RPYEIAPCEHHVNGTRPSCDASKGHT 141
           R+Q +CGSC+                         P ++  C  +  G    CD   G  
Sbjct: 241 RNQAACGSCYSFATMGMLEARVRVLTNNSQTPVFSPQQVVSCSEYSQG----CD---GGF 293

Query: 142 PKCVRECQENYDV------PY-KKDLNFGA-----KSYSVS----------SNEKSIMKE 179
           P  + +  +++ +      PY  KD   G      ++Y+             +E ++MKE
Sbjct: 294 PYLIGKYSQDFGIVEESCFPYIAKDSPCGVPQNCGRAYTAEYKYVGGFYGGCSEMAMMKE 353

Query: 180 IYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDD 239
           +  HGP+  AF V+ D + Y  G +                        G    F  F+ 
Sbjct: 354 LVHHGPMAVAFEVYPDFMHYAGGIY---------------------HHTGLADPFNPFE- 391

Query: 240 LILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIES 299
                    L  HA+ ++G+G   K+ EKYW++ NSW T WG+NG F+I RG DEC IES
Sbjct: 392 ---------LTNHAVLLVGYGRCHKTGEKYWIVKNSWGTSWGENGFFRIRRGSDECSIES 442

Query: 300 SITAGVP 306
              A  P
Sbjct: 443 IAVAATP 449


>gi|350408961|ref|XP_003488566.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bombus
           impatiens]
          Length = 445

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 76/254 (29%), Positives = 104/254 (40%), Gaps = 46/254 (18%)

Query: 82  EDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGT-RPSCDASKGH 140
           E LP  FD+R +WP    I +I DQG CG+ W      +A     +      S   S  H
Sbjct: 200 ESLPREFDARIRWPR--EISDIDDQGWCGASWAISTTRVASDRFALMSKGADSVLLSAQH 257

Query: 141 TPKC----VRECQENY-DVPYKKDLNFG----------AKSYSVSSNEKSIMKEIYEHGP 185
              C     + C   Y D  +     FG            +      +++ +K      P
Sbjct: 258 LLSCNNRGQQACSGGYLDRAWLYMRKFGLVDEDCYPWEGTNVQCKLRKRTDLKTAGCRPP 317

Query: 186 VEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKS 245
           V    T      LYK G  +  GNET  M  I        +    +    V+ D   Y+S
Sbjct: 318 VNPLRTE-----LYKVGPAYRLGNETDIMYEI-------LTSGPVQATMKVYQDFFSYES 365

Query: 246 G----------KALGGHAIRILGWGEDEKSKE------KYWLIANSWNTDWGDNGLFKIL 289
           G           A G H++RI+GWGED  +        KYWL+ NSW   WG++GLF+I 
Sbjct: 366 GIYKHTATTEHYAFGYHSVRIIGWGEDTSAHRYRNLPIKYWLVVNSWGQQWGESGLFRIQ 425

Query: 290 RGKDECGIESSITA 303
           RG +EC IES + A
Sbjct: 426 RGTNECDIESFVVA 439


>gi|449485032|ref|XP_002188357.2| PREDICTED: dipeptidyl peptidase 1 [Taeniopygia guttata]
          Length = 667

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 74/283 (26%), Positives = 108/283 (38%), Gaps = 83/283 (29%)

Query: 67  PANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCR---------- 116
           PA   PEL+   +    LP ++D R        +  +R+QGSCGSC+             
Sbjct: 421 PAPLTPELL---KKVSSLPESWDWRNV-NGVNYVSPVRNQGSCGSCYAFSSMAMLEARIR 476

Query: 117 ------------PYEIAPCEHHVNG-------------------TRPSCDASKGHTPKCV 145
                       P ++  C  +  G                       C         C+
Sbjct: 477 ILTNNTQKPVFSPQQVVSCSRYSQGCDGGFPYLIGGKYVQDFGVVEDDCFPYTAQDSPCL 536

Query: 146 --RECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGR 203
             R C   Y   Y     F         NE  +  E+  HGP+  AF V++D +LYK G 
Sbjct: 537 FKRSCYHYYTSEYHYVGGFYG-----GCNEALMKLELVHHGPMAVAFEVYNDFMLYKEGI 591

Query: 204 FFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDE 263
           +   G +                           DDL  ++    L  HA+ ++G+G+D 
Sbjct: 592 YHHTGLQ---------------------------DDLNPFE----LTNHAVLLVGYGKDP 620

Query: 264 KSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           +S EK+W++ NSW T WG++G F+I RG DEC IES   A  P
Sbjct: 621 ESGEKFWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATP 663


>gi|348565723|ref|XP_003468652.1| PREDICTED: dipeptidyl peptidase 1-like [Cavia porcellus]
          Length = 463

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 76/270 (28%), Positives = 102/270 (37%), Gaps = 86/270 (31%)

Query: 83  DLPANFDSRTKWPNCPTIR---EIRDQGSCGSCWG----------------------CRP 117
            LPA++D    W N   I     +R+QGSCGSC+                         P
Sbjct: 230 QLPASWD----WRNVNGINFVTPVRNQGSCGSCYSFASVGMLEARIRILTNNTQTPILSP 285

Query: 118 YEIAPCEHHVNG-------------------TRPSCDASKGHTPKCV--RECQENYDVPY 156
            EI  C  +  G                      SC   KG    C   ++C   Y   Y
Sbjct: 286 QEIVSCSQYAQGCEGGFPYLIAGKYAQDFGLVEESCFPYKGIDVPCKVKKDCVRYYTSEY 345

Query: 157 KKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSL 216
                F         NE  +  E+ +HGP+  AF V+DD + Y  G +   G        
Sbjct: 346 HYVGGFYG-----GCNEALMKLELVQHGPMAVAFEVYDDFLHYHKGIYHRTG-------- 392

Query: 217 IKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSW 276
               +RD          F  F+          L  HA+ ++G+G D  S   YW++ NSW
Sbjct: 393 ----LRD---------PFNPFE----------LTNHAVLLVGYGTDPVSGRDYWIVKNSW 429

Query: 277 NTDWGDNGLFKILRGKDECGIESSITAGVP 306
            T WG++G F+ILRG DEC IES   A  P
Sbjct: 430 GTGWGEDGYFRILRGTDECAIESIAMAATP 459


>gi|345327151|ref|XP_001507103.2| PREDICTED: tubulointerstitial nephritis antigen-like
           [Ornithorhynchus anatinus]
          Length = 327

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 59/184 (32%), Positives = 82/184 (44%), Gaps = 37/184 (20%)

Query: 122 PCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIY 181
           PC  +   +RP     +  T  C      + D  Y  D+      Y +SSNEK IMKEI 
Sbjct: 158 PCRMY---SRPMGRGKRQATGPCPNNFHHSND--YSNDIYQSTPPYRLSSNEKDIMKEIM 212

Query: 182 EHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLI 241
           E+GPV+    V +D  LYK G +                 R   +  G    F       
Sbjct: 213 ENGPVQALMEVHEDFFLYKDGIY-----------------RHTPASNGKPPQF------- 248

Query: 242 LYKSGKALGGHAIRILGWGEDEK---SKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
                +  G H+++I GWGE+ +    + K+W  ANSW   WG+ G F+ILRG +EC IE
Sbjct: 249 -----RRQGTHSVKITGWGEELQPNGRRVKFWRAANSWGPTWGEGGSFRILRGCNECDIE 303

Query: 299 SSIT 302
           S + 
Sbjct: 304 SFVV 307


>gi|340712697|ref|XP_003394892.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bombus
           terrestris]
          Length = 445

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 76/254 (29%), Positives = 104/254 (40%), Gaps = 46/254 (18%)

Query: 82  EDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGT-RPSCDASKGH 140
           E LP  FD+R +WP    I +I DQG CG+ W      +A     +      S   S  H
Sbjct: 200 ESLPREFDARIRWPR--EISDIDDQGWCGASWAISATRVASDRFALMSKGADSVLLSAQH 257

Query: 141 TPKC----VRECQENY-DVPYKKDLNFG----------AKSYSVSSNEKSIMKEIYEHGP 185
              C     + C   Y D  +     FG            +      +++ +K      P
Sbjct: 258 LLSCNNRGQQACSGGYLDRAWLYMRKFGLVDEDCYPWEGTNAQCKLRKRTDLKTAGCRPP 317

Query: 186 VEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKS 245
           V    T      LYK G  +  GNET  M  I        +    +    V+ D   Y+S
Sbjct: 318 VNPLRTE-----LYKVGPAYRLGNETDIMYEI-------LTSGPVQATMKVYQDFFSYES 365

Query: 246 G----------KALGGHAIRILGWGEDEKSKE------KYWLIANSWNTDWGDNGLFKIL 289
           G           A G H++RI+GWGED  +        KYWL+ NSW   WG++GLF+I 
Sbjct: 366 GIYKHTATTEHYAFGYHSVRIIGWGEDTSAHRHHNLPIKYWLVVNSWGQQWGESGLFRIQ 425

Query: 290 RGKDECGIESSITA 303
           RG +EC IES + A
Sbjct: 426 RGTNECDIESFVVA 439


>gi|307201161|gb|EFN81067.1| Uncharacterized peptidase C1-like protein F26E4.3 [Harpegnathos
           saltator]
          Length = 443

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 74/244 (30%), Positives = 103/244 (42%), Gaps = 33/244 (13%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGT-RPSCDASKGHTP 142
           LP  FD+RT+WP    I  I DQG CG+ W     ++A     +        + S  H  
Sbjct: 203 LPREFDARTRWPR--DISGIHDQGWCGASWAVSTADVASDRFAIMSKGAEDVELSAQHLL 260

Query: 143 KCVRECQEN-----YDVPYKKDLNFG---AKSYSVSSNEKSIMKEIYEHGPVEGAFTVFD 194
            C    Q+       D  +     FG    + Y  +            +  V G     +
Sbjct: 261 SCNNRGQQGCRGGYLDRAWLFMRKFGLVDKECYPWTGRNDQCRLRKRSNLNVAGCRKPPN 320

Query: 195 DLI--LYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG------ 246
            L   LYK G  +  GNET  M  I        +    +    V+ D  +YK+G      
Sbjct: 321 PLRQELYKVGPAYRLGNETDIMQEI-------LTSGPVQATMRVYQDFFVYKNGVYRHSR 373

Query: 247 ----KALGGHAIRILGWGEDEKSK---EKYWLIANSWNTDWGDNGLFKILRGKDECGIES 299
                  G H++RI+GWGE+   +    KYWL+ANSW   WG+NGLF+I RG +EC IES
Sbjct: 374 SAELHDSGYHSMRIIGWGEEPSYRGPPLKYWLVANSWGRHWGENGLFRIQRGTNECEIES 433

Query: 300 SITA 303
            + A
Sbjct: 434 YVLA 437


>gi|330846430|ref|XP_003295033.1| hypothetical protein DICPUDRAFT_51857 [Dictyostelium purpureum]
 gi|325074364|gb|EGC28440.1| hypothetical protein DICPUDRAFT_51857 [Dictyostelium purpureum]
          Length = 257

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 72/268 (26%), Positives = 108/268 (40%), Gaps = 84/268 (31%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHV--NG---------TRP 132
           +P +FD+RT+WPNC  I  I +Q  CGSCW     E+      +  NG            
Sbjct: 31  IPQSFDARTQWPNC--IHPILNQEQCGSCWAFSASEVLSDRLCIASNGKTGVVLSPQALV 88

Query: 133 SCD-----ASKGHTPKCVRE------------------------CQENYDVPYKKDLNFG 163
           SCD        G  P+   E                        C +N  V  ++   + 
Sbjct: 89  SCDIFGNQGCNGGIPQLAWEYMELHGIPTYGCFPYTSGNGTDGSCVKNSCVDNEQYTLYR 148

Query: 164 AKSYSVSS--NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRF-FVPGNETTAMSLIKWT 220
           AK  ++ +  + + I ++I + GP++G   V+ D + Y SG +   PG+           
Sbjct: 149 AKPLTLKTCASVECIQQDIMKFGPIQGTMEVYSDFMSYTSGVYTMTPGSSL--------- 199

Query: 221 IRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDW 280
                                       LGGHAI+I+GWG D+ S + YW++ANSW   W
Sbjct: 200 ----------------------------LGGHAIKIVGWGFDQASNQNYWIVANSWGPSW 231

Query: 281 GDNGLFKILRGKDECGIESSITAGVPKL 308
           G +G F I    D+CGI S   A   ++
Sbjct: 232 GIDGFFWI--AFDQCGINSDACAAQARI 257


>gi|410910940|ref|XP_003968948.1| PREDICTED: tubulointerstitial nephritis antigen-like [Takifugu
           rubripes]
          Length = 477

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 50/149 (33%), Positives = 76/149 (51%), Gaps = 32/149 (21%)

Query: 156 YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMS 215
           Y+ D+      Y +S+NEK IMKEI ++GPV+    V +D  +YKSG +     + T +S
Sbjct: 334 YQNDIYQSTPPYRLSTNEKEIMKEIQDNGPVQAIMEVHEDFFVYKSGIY-----KHTDVS 388

Query: 216 LIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEK---SKEKYWLI 272
             K                            +  G H+++I GWGE+     +K KYW+ 
Sbjct: 389 FTK------------------------PPQYRKHGTHSVKITGWGEERNVDGAKRKYWIA 424

Query: 273 ANSWNTDWGDNGLFKILRGKDECGIESSI 301
           ANSW  +WG+ G F+I RG++EC IE+ +
Sbjct: 425 ANSWGKNWGEEGYFRIARGENECEIEAFV 453


>gi|395526635|ref|XP_003765465.1| PREDICTED: tubulointerstitial nephritis antigen-like [Sarcophilus
           harrisii]
          Length = 467

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 50/139 (35%), Positives = 71/139 (51%), Gaps = 32/139 (23%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           Y +SS+EK IMKE+ E+GPV+    V +D  LYKSG +                 +   +
Sbjct: 343 YRLSSHEKDIMKELMENGPVQALLEVHEDFFLYKSGIY-----------------KHTPA 385

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGED---EKSKEKYWLIANSWNTDWGDN 283
            LG    +            +  G H+++I GWGE+   +  K KYW  ANSW   WG+N
Sbjct: 386 SLGKPERY------------RQHGTHSVKITGWGEEIQPDGQKVKYWTAANSWGPTWGEN 433

Query: 284 GLFKILRGKDECGIESSIT 302
           G F+I+RG +EC IES + 
Sbjct: 434 GYFRIVRGANECDIESFVV 452


>gi|4099305|gb|AAD00577.1| cysteine proteinase [Clonorchis sinensis]
          Length = 180

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 52/148 (35%), Positives = 69/148 (46%), Gaps = 38/148 (25%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCR Y    CEHHV G  P C      TP+CV++C +  DV Y +D      SY++ ++E
Sbjct: 66  GCRSYPFPKCEHHVQGHYPPCPRELYPTPECVQQC-DTPDVGYLEDKTRANMSYNIYASE 124

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            SIMKEI   GPVE  FT+++D + Y SG +F                            
Sbjct: 125 ISIMKEIMLRGPVEAIFTMYEDFLRYSSGVYF---------------------------- 156

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGE 261
                    +  G  + GHA+RILGWGE
Sbjct: 157 ---------HALGAPMSGHAVRILGWGE 175



 Score = 45.1 bits (105), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 17/27 (62%), Positives = 21/27 (77%)

Query: 9  CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
          CGFGC GG+P +AW YW   GIV+GG+
Sbjct: 34 CGFGCRGGYPAVAWDYWKTHGIVTGGS 60


>gi|134023803|gb|AAI35570.1| LOC100124858 protein [Xenopus (Silurana) tropicalis]
          Length = 484

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 73/283 (25%), Positives = 112/283 (39%), Gaps = 92/283 (32%)

Query: 81  DEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCE-------HHVNGTRP- 132
           ++ LP++F++  KWP    + E  DQG+C   W      +A          H      P 
Sbjct: 218 NDILPSHFNAAEKWPG--LVHEPLDQGNCAGSWAFSTAAVASDRISIQSMGHMTQSLSPQ 275

Query: 133 ---SCDA-----------------------------------SKGHTPKCV--------- 145
              SCD                                    + GH+  C+         
Sbjct: 276 NLLSCDTRNQHGCRGGRVDGAWWYLRRRGVVSEPCYPFTSLNTNGHSAPCMMQSRSMGRG 335

Query: 146 -RECQENYDVPY--KKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSG 202
            R+   N    Y    ++     +Y ++S+EK IMKE+YE+GPV+    V +D  +YKSG
Sbjct: 336 KRQATNNCPNQYYSSNEIYQSTPAYRLASSEKDIMKELYENGPVQAIMEVHEDFFMYKSG 395

Query: 203 RFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGED 262
            +                 R   ++   E               +  G H+++I GWGE+
Sbjct: 396 IYR----------------RTPVTEREPE-------------HHRRHGTHSVKITGWGEE 426

Query: 263 ---EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSIT 302
              +    KYWL ANSW  DWG++G F+I RG++EC IE+ I 
Sbjct: 427 RGRDGQTHKYWLAANSWGRDWGEDGYFRIARGENECEIETFIV 469


>gi|300121294|emb|CBK21674.2| unnamed protein product [Blastocystis hominis]
          Length = 561

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 53/164 (32%), Positives = 79/164 (48%), Gaps = 40/164 (24%)

Query: 146 RECQENYDV-PYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRF 204
           R+C  +Y   P +    +  + Y   S E+ +MKEIY  GP+  A    D+L+ YK G  
Sbjct: 153 RDCGHDYPCHPVQNYTKYFVEEYGYVSGEERMMKEIYARGPITCALDATDELVAYKGG-- 210

Query: 205 FVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEK 264
                                          +F+D    K+G     HAI ++GWGE++ 
Sbjct: 211 -------------------------------IFED----KTGTTSLNHAISVVGWGEEDG 235

Query: 265 SKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
             +KYW++ NSW T WG+NG F+I+RG +  GIES  T  VP++
Sbjct: 236 --KKYWIVRNSWGTYWGENGWFRIVRGTNNLGIESECTWAVPRV 277



 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 59/230 (25%), Positives = 87/230 (37%), Gaps = 45/230 (19%)

Query: 87  NFDSRTKWPNCP-TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKC- 144
           N   + KWP    +++E+ + G+ GSC G     +    H       +C   +    +C 
Sbjct: 369 NIMRKGKWPTVELSVQEVINCGNTGSCNGGWDSGVYRYAHEEGIPDQTCQVYEARNKECN 428

Query: 145 ----VRECQENYDVPYKKDLN-FGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILY 199
                 +C  + D    KD   +    Y   S +  +  EI+  GP+    +V  + + Y
Sbjct: 429 DMNRCMDCPPDRDCYAVKDYKRYKVGDYGYVSGKDKMKAEIFARGPISCYVSVSQEFLDY 488

Query: 200 KSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGW 259
             G F                                     +      LGGH I + GW
Sbjct: 489 TGGVF-------------------------------------VEHDHSMLGGHIIEVAGW 511

Query: 260 GEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKLD 309
           G  E  +E YW+  NSW   WG+NG F+I   KD   IESS T GVP +D
Sbjct: 512 GVTEDGQE-YWIGRNSWGEYWGENGWFRIQTDKDNLEIESSCTWGVPIID 560


>gi|270011021|gb|EFA07469.1| cathepsin B precursor [Tribolium castaneum]
          Length = 327

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 77/244 (31%), Positives = 105/244 (43%), Gaps = 38/244 (15%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHV-NGTRPSCDASKGHTP 142
           LP  FDS  KWP    + EI+DQG CGS W      +A     + +  R     S  H  
Sbjct: 79  LPREFDSEFKWPGW--MSEIQDQGWCGSSWAITTAAVASDRFAILSKGREKVTLSAQHLL 136

Query: 143 KCVRECQENYDVPY--------KKDLNFGAKSYSVS-SNEKSIMKEIYEHGPVEGAF--- 190
            C R  Q++ +  Y        +K      + +  S +NEK     I   G +  A    
Sbjct: 137 SCDRRGQQSCNGGYLDRAWSYIRKIGLVDEQCFPYSATNEKC---RIPRRGDLVTANCQL 193

Query: 191 -TVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG--- 246
            T  D    YK    +  GNET  M  I             +    V+ D   YK G   
Sbjct: 194 PTNVDRRSKYKVAPAYRVGNETDIMYEI-------LHSGPVQATMKVYHDFFTYKRGIYR 246

Query: 247 -------KALGGHAIRILGWGEDEKSK--EKYWLIANSWNTDWGDNGLFKILRGKDECGI 297
                     G H++RI+GWGE+   +  +KYW +ANSW  +WG+NG F+ILRG +EC I
Sbjct: 247 HSPISTNDRTGYHSVRIVGWGEEYSPEGLKKYWKVANSWGPEWGENGYFRILRGSNECEI 306

Query: 298 ESSI 301
           ES +
Sbjct: 307 ESFV 310


>gi|13469701|gb|AAK27318.1| cysteine proteinase [Clonorchis sinensis]
          Length = 179

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 54/149 (36%), Positives = 69/149 (46%), Gaps = 38/149 (25%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY    C+HH  G  P C      TPKCV+ C +   + Y+KD      SY+V  +E
Sbjct: 66  GCRPYPFPKCQHHSQGHYPPCPRRIYPTPKCVKHC-DTPKIDYQKDKTRANTSYNVHQSE 124

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
            +IMKEI  +GPVE  F V +D   YKSG +F                            
Sbjct: 125 VAIMKEILLNGPVEATFEVHEDFPEYKSGIYF---------------------------- 156

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGED 262
                    +  G ++GGHAIRILGWGE+
Sbjct: 157 ---------HAWGGSVGGHAIRILGWGEE 176


>gi|449269572|gb|EMC80333.1| Dipeptidyl-peptidase 1 [Columba livia]
          Length = 412

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 85/335 (25%), Positives = 117/335 (34%), Gaps = 97/335 (28%)

Query: 33  GGAYGSKQAEKNSLSNIPRAHLKSWMG-VHPDY-NLPANRLPELIG--YSEVDEDLPANF 88
           G   G +        N   AH KSW   ++ +Y N     L    G  YS V    PA  
Sbjct: 110 GSLSGRRYVHNFDFVNAINAHQKSWKATIYKEYENFALEELTRRSGGLYSRVPRPKPAPL 169

Query: 89  DSRT-----------KWPNCP---TIREIRDQGSCGSCWGCR------------------ 116
            +              W N      +  IR+QGSCGSC+                     
Sbjct: 170 TAELLKKVSGLPDSWDWRNVNGVNYVSPIRNQGSCGSCYAFSSMGMLEARIRILTNNTQK 229

Query: 117 ----PYEIAPCEHHVNG-------------------TRPSCDASKGHTPKCV--RECQEN 151
               P ++  C  +  G                       C         C+  R C   
Sbjct: 230 PIFSPQQVVSCSQYSQGCDGGFPYLIAGKYVQDFGVVEEDCFPYTAQDSPCLFKRSCYHY 289

Query: 152 YDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNET 211
           Y   Y     F         NE  +  E+  HGP+  AF V++D I YK G +   G   
Sbjct: 290 YTSEYHYVGGFYG-----GCNEALMKLELVLHGPMAVAFEVYNDFIHYKEGIYHHTG--- 341

Query: 212 TAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWL 271
                    +RD+         F  F+          L  HA+ ++G+G D +S EK+W+
Sbjct: 342 ---------LRDD---------FNPFE----------LTNHAVLLVGYGTDPQSGEKFWI 373

Query: 272 IANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           + NSW   WG+NG F+I RG DEC IES   +  P
Sbjct: 374 VKNSWGILWGENGYFRIRRGTDECAIESIAVSATP 408


>gi|126330441|ref|XP_001381244.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Monodelphis
           domestica]
          Length = 466

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 56/185 (30%), Positives = 78/185 (42%), Gaps = 41/185 (22%)

Query: 121 APCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEI 180
           APC  H          +  H P         Y              Y +SS+EK IMKE+
Sbjct: 305 APCMMHSRSMGRGKRQATAHCPNSRAHANHIYQA---------TPPYRLSSDEKDIMKEL 355

Query: 181 YEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDL 240
            E+GPV+    V +D  LYKSG +                 +   + LG    +      
Sbjct: 356 MENGPVQALMEVHEDFFLYKSGIY-----------------KHTPASLGKPARY------ 392

Query: 241 ILYKSGKALGGHAIRILGWGEDEK---SKEKYWLIANSWNTDWGDNGLFKILRGKDECGI 297
                 +  G H+++I GWGE+ +    + KYW  ANSW   WG+ G F+ILRG +EC I
Sbjct: 393 ------RQHGTHSVKITGWGEERQPDGQRLKYWTAANSWGPTWGEKGHFRILRGANECDI 446

Query: 298 ESSIT 302
           ES + 
Sbjct: 447 ESFVV 451


>gi|90074902|dbj|BAE87131.1| unnamed protein product [Macaca fascicularis]
          Length = 296

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 41/71 (57%), Positives = 52/71 (73%), Gaps = 1/71 (1%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCRPY I PCEHHVNG+RP C   +G TPKC + C+  Y   YK+D ++G  SYSVS++E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 236

Query: 174 KSIMKEIYEHG 184
           K IM EIY++G
Sbjct: 237 KDIMAEIYKNG 247



 Score = 77.8 bits (190), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 33/50 (66%), Positives = 39/50 (78%)

Query: 260 GEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKLD 309
            E  K+   YWL+ANSWNTDWGDNG FKILRG+D CGIES + AG+P+ D
Sbjct: 241 AEIYKNGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTD 290



 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 33/84 (39%), Positives = 45/84 (53%), Gaps = 14/84 (16%)

Query: 44  NSLSNIPRAHLKSWMGVHPDYNLPANRL-------------PELIGYSEVDEDLPANFDS 90
           + L N       +W   H  YN+  + L             P+ + ++E D  LP +FD+
Sbjct: 28  DELVNYVNKQNTTWQAGHNFYNVDVSYLKRLCGTFLGGPKPPQRVMFTE-DLKLPESFDA 86

Query: 91  RTKWPNCPTIREIRDQGSCGSCWG 114
           R +WP CPTI+EIRDQGSCGSCW 
Sbjct: 87  REQWPQCPTIKEIRDQGSCGSCWA 110



 Score = 46.6 bits (109), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 18/31 (58%), Positives = 23/31 (74%)

Query: 8   LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
           +CG GCNGG+P  AW +W + G+VSGG Y S
Sbjct: 145 MCGDGCNGGYPAGAWNFWTRKGLVSGGLYDS 175


>gi|339248603|ref|XP_003373289.1| cathepsin B [Trichinella spiralis]
 gi|316970616|gb|EFV54519.1| cathepsin B [Trichinella spiralis]
          Length = 576

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 70/249 (28%), Positives = 109/249 (43%), Gaps = 35/249 (14%)

Query: 79  EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHV-NGTRPSCDAS 137
           E+   LP +FD+R +WP+   I  +RDQG C S W      ++     + +G +     S
Sbjct: 306 EMSNFLPESFDARERWPS--FIHPVRDQGDCASSWAFSTTAVSADRLAIQSGGKFYNPLS 363

Query: 138 KGHTPKCVRECQENYDVPY--KKDLNFGAKSYSVSSNEKS------IMKEIYEHGPVEGA 189
                 C +  Q   +  Y  +       + Y+ +S + +      I +  Y  G +   
Sbjct: 364 VQQLLSCNQARQRGCNGGYLDRAWCVVSDECYTYTSGQTNQPGECHIPRTAYLDGEIRCP 423

Query: 190 FTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG--- 246
               D+ +   +  + +  NE   M+ I        +    +  F V +D  +YKSG   
Sbjct: 424 SGSADNRVYKMTPPYRISTNEREIMTEI-------MANGPVQATFLVHEDFFMYKSGVYQ 476

Query: 247 ------------KALGGHAIRILGWGEDEKS--KEKYWLIANSWNTDWGDNGLFKILRGK 292
                          G H++RILGWG D  +    KYWL ANSW  +WG+NGLF+ILRG+
Sbjct: 477 HLPYANDKGPAYARSGYHSVRILGWGVDHSTGVPIKYWLCANSWGEEWGENGLFRILRGE 536

Query: 293 DECGIESSI 301
           + C IES I
Sbjct: 537 NHCDIESFI 545


>gi|326914532|ref|XP_003203579.1| PREDICTED: dipeptidyl peptidase 1-like [Meleagris gallopavo]
          Length = 420

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 73/283 (25%), Positives = 108/283 (38%), Gaps = 83/283 (29%)

Query: 67  PANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGC----------- 115
           PA   PEL+   +   +LP ++D R        +  +R+Q SCGSC+             
Sbjct: 174 PAPLTPELL---KKVSNLPESWDWRNV-NGVNYVSPVRNQASCGSCYAFASMGMLEARIR 229

Query: 116 -----------RPYEIAPCEHHVNG-------------------TRPSCDASKGHTPKCV 145
                       P ++  C  +  G                       C         C+
Sbjct: 230 ILTNNTQKPVFSPQQVVSCSQYSQGCDGGFPYLIAGKYVQDFGVVEEDCFPYTAQDSPCL 289

Query: 146 --RECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGR 203
             R C   Y   Y     F       + NE  +  E+   GP+  AF V++D + YK G 
Sbjct: 290 FKRSCYHYYTSEYHYVGGFYG-----ACNEALMKLELVLSGPMAVAFEVYNDFMFYKEGI 344

Query: 204 FFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDE 263
           +   G            ++DN         F  F+          L  HA+ ++G+G+D 
Sbjct: 345 YHHTG------------LKDN---------FNPFE----------LTNHAVLLVGYGKDP 373

Query: 264 KSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           KS EK+W++ NSW T WG++G F+I RG DEC IES   A  P
Sbjct: 374 KSGEKFWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATP 416


>gi|291228863|ref|XP_002734398.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 451

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 74/283 (26%), Positives = 114/283 (40%), Gaps = 88/283 (31%)

Query: 79  EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIA----------------- 121
           ++ + +P +FD+R KW +   I  I DQG+C S W      +A                 
Sbjct: 174 KMKKKIPKSFDARDKWGS--MITGILDQGNCASSWAFSTVGVASDRLAIQSSGETGMTLS 231

Query: 122 PCEHHVNGTRPSCDASKGHTPK----------------------------CVRECQENYD 153
           P       TR     S GH  +                            C+   +   D
Sbjct: 232 PQHLLSCNTRGQRGCSGGHIDRAWWFMRKRGVVSNDCYPYTSGDQDKKGVCMMPGKLPSD 291

Query: 154 VPYKKD----LNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFF---V 206
            P  ++    L+     Y +++NE+ I  EI E+GPV+ +F V +D  +Y SG +    +
Sbjct: 292 CPTGRERNNELHHSTPPYRIAANEREIQVEIMENGPVQASFEVKEDFFMYGSGVYRHTPI 351

Query: 207 PGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSK 266
             N+       +W                                H++++LGWG +  + 
Sbjct: 352 ASNDAEQYHASEW--------------------------------HSVKLLGWGVE--NG 377

Query: 267 EKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKLD 309
            KYWL ANSW T WG++G FKILRG++EC IES + A   K+D
Sbjct: 378 IKYWLGANSWGTKWGEDGYFKILRGENECNIESYVVAVWGKVD 420


>gi|253743418|gb|EES99819.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 296

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 69/250 (27%), Positives = 99/250 (39%), Gaps = 76/250 (30%)

Query: 85  PANFDSRTKWPNCPTIREIRDQGSCGSCWG-----------CRP-----------YEIAP 122
           P ++D R ++P+C  I E+ DQGSCGSCW            CR              +  
Sbjct: 77  PESYDFRDEYPHC--ITEVVDQGSCGSCWAFSSIQTFADHRCRSGLDATGVSYSVQYVLD 134

Query: 123 CE---HHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS-----------YS 168
           C+   H  NG  P+      H+   V     +Y       + F  K+            +
Sbjct: 135 CDRKDHGCNGGEPTKAFDFLHSTGTVLTSCVDYTAGADNVVKFCPKTCDDGSAVENVFAA 194

Query: 169 VSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQL 228
             S   S +  +  HGPV   F V  D + YKSG +                        
Sbjct: 195 SGSKSGSAIDVLLSHGPVVATFNVAQDFMYYKSGVY------------------------ 230

Query: 229 GAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKI 288
                         ++ G  LGGHA+ ++G+G  + S   YW + NSW  DWG++G F+I
Sbjct: 231 -------------QHRWGVWLGGHAVEVVGYGVTD-SGLDYWTVRNSWGPDWGEDGYFRI 276

Query: 289 LRGKDECGIE 298
           +RG DECGIE
Sbjct: 277 VRGSDECGIE 286


>gi|114153242|gb|ABI52787.1| cathepsin B-like protein [Argas monolakensis]
          Length = 91

 Score = 87.8 bits (216), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 40/59 (67%), Positives = 47/59 (79%), Gaps = 2/59 (3%)

Query: 249 LGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPK 307
           +GGHAIRI+GWG +E     YWL+ANSWN +WGDNG FKILRG +ECGIE  I AG+PK
Sbjct: 34  MGGHAIRIIGWGVEEDVP--YWLVANSWNREWGDNGYFKILRGSNECGIEDDIVAGIPK 90


>gi|157058745|gb|ABV03130.1| cathepsin B-2744 [Sitobion avenae]
          Length = 260

 Score = 87.8 bits (216), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 56/177 (31%), Positives = 82/177 (46%), Gaps = 45/177 (25%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRE--CQENYDVPYKKDLNFGAKSYSVS- 170
           GC+PY+I PC H+ NG   +C + +       RE    +NY V Y+ DL+  +  Y  S 
Sbjct: 124 GCQPYKIRPCNHYGNGNLKNCSSLRRTQMTVCREKCVNKNYKVKYEDDLHKTSIVYMTSW 183

Query: 171 SNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA 230
           +N K I +EI  +GPV     V+++ + YK G                            
Sbjct: 184 TNVKQIQQEIMTYGPVTAFMYVYENFMGYKEG---------------------------- 215

Query: 231 EGAFTVFDDLILYKS--GKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
                      +YKS  G+ +G H ++++GWG D    E YWL  NSWN++WG NGL
Sbjct: 216 -----------IYKSTAGELIGYHHVKLIGWGVDGDGTE-YWLAMNSWNSNWGTNGL 260


>gi|3087797|emb|CAA93275.1| cysteine proteinase [Haemonchus contortus]
          Length = 330

 Score = 87.8 bits (216), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 49/167 (29%), Positives = 79/167 (47%), Gaps = 42/167 (25%)

Query: 115 CRPYEIAPCEHHVNGTRPSCDASKG-HTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           C+PY + PCE  + G   SC       TP C + CQ  Y   Y+KD ++    Y +  +E
Sbjct: 194 CKPYHLHPCE--ITGKFWSCPRDHSFRTPACKKYCQYGYGKRYEKDKSYVKSVYILDEDE 251

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K+I +E+ ++GPV+ AFT ++D   Y+ G                               
Sbjct: 252 KAIQREMMKNGPVQAAFTTYEDFSFYRKG------------------------------- 280

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDW 280
                 + ++  G+  G HA++++GWG +  +  KYW +ANSW+TDW
Sbjct: 281 ------IYVHSYGRQRGAHAVKVVGWGVENGT--KYWNVANSWSTDW 319



 Score = 38.5 bits (88), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 17/33 (51%), Positives = 20/33 (60%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
           R CG GCNGG    AW Y  + G+V+GG Y  K
Sbjct: 159 RECGRGCNGGMDHKAWEYVKEFGVVTGGRYQEK 191


>gi|195026034|ref|XP_001986167.1| GH20676 [Drosophila grimshawi]
 gi|193902167|gb|EDW01034.1| GH20676 [Drosophila grimshawi]
          Length = 432

 Score = 87.8 bits (216), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 79/294 (26%), Positives = 114/294 (38%), Gaps = 92/294 (31%)

Query: 67  PANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAP---- 122
           P  R+  +   +    DLP +F++  KW     I E+ DQG CG+ W      +A     
Sbjct: 170 PTFRVKSMTRLTNPSNDLPRSFNAVEKWST--FISEVPDQGWCGASWVLSTTSVASDRFA 227

Query: 123 -------------------------CE----------HHVNGT-----------RPSCDA 136
                                    C+           H NG            R +C  
Sbjct: 228 IQSQGKEVVQLSAQNILSCTRRQQGCDGGHLDAAWRYMHKNGVLDANCYPYIQQRDTCKV 287

Query: 137 SKGHTPKCVRE--CQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFD 194
            + H  + ++   CQ  + V        G  +YS+S  E  IM EIY  GPV+   TV+ 
Sbjct: 288 QR-HRGRSLKAYGCQPAHGVNRDNFYTVG-PAYSLS-READIMAEIYHSGPVQATMTVYR 344

Query: 195 DLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAI 254
           D   Y SG +     + TA +                              G A G H++
Sbjct: 345 DFFSYSSGVY-----QHTAAN-----------------------------RGAATGFHSV 370

Query: 255 RILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           +++GWGE E +  KYW+ ANSW   WG+ G F+ILRG +ECGIE  + A  P +
Sbjct: 371 KLVGWGE-EHNGVKYWIAANSWGPWWGERGYFRILRGSNECGIEEYVLASWPHV 423


>gi|308161503|gb|EFO63946.1| Cathepsin B precursor [Giardia lamblia P15]
          Length = 363

 Score = 87.8 bits (216), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 72/250 (28%), Positives = 99/250 (39%), Gaps = 76/250 (30%)

Query: 85  PANFDSRTKWPNCPTIREIRDQGSCGSCWG-----------CRP-----------YEIAP 122
           P ++D R ++P+C  I E+ DQGSCGSCW            CR              +  
Sbjct: 144 PESYDFREEYPHC--ITEVVDQGSCGSCWAFSSIQTFADHRCRSGLDATGVSYSVQYVLD 201

Query: 123 CE---HHVNGTRPSCDASKGHTPKCVRECQENYDV---------PYKKDLNFGAKSYSVS 170
           C+   H  NG  P    +  H    V      Y           P K D     ++   +
Sbjct: 202 CDRKDHGCNGGEPVNAFNFLHNTGTVLTSCVEYTAGDDAVVKFCPQKCDDGSAVENIVAT 261

Query: 171 SNEKS--IMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQL 228
           S  KS   +  +  HGPV   F V  D + YKSG +                        
Sbjct: 262 SGAKSGSAIDVLLAHGPVVATFNVAQDFMYYKSGVY------------------------ 297

Query: 229 GAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKI 288
                         ++ G  LGGHA+ I+G+G  + S   YW + NSW  DWG++G F+I
Sbjct: 298 -------------QHRWGVWLGGHAVEIVGYGVTD-SGLDYWTVRNSWGPDWGEDGYFRI 343

Query: 289 LRGKDECGIE 298
           +RG DECGIE
Sbjct: 344 VRGGDECGIE 353


>gi|33327024|gb|AAQ08887.1| cathepsin C [Homo sapiens]
          Length = 463

 Score = 87.8 bits (216), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 72/267 (26%), Positives = 101/267 (37%), Gaps = 82/267 (30%)

Query: 84  LPANFDSRTKWPNCPTIR---EIRDQGSCGSCWG----------------------CRPY 118
           LP ++D    W N   I     +R+Q SCGSC+                         P 
Sbjct: 231 LPTSWD----WRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQ 286

Query: 119 EIAPCEHHVNG-------------------TRPSCDASKGHTPKCVRECQENYDVPYKKD 159
           E+  C  H  G                      +C    G    C  + +E+    Y  +
Sbjct: 287 EVVSCSQHAQGCEGGFPYLIAGKYAQDFGLVEEACFPYTGTDSPC--KMKEDCFRYYSSE 344

Query: 160 LNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKW 219
            ++    Y    NE  +  E+  HGP+  AF V+DD + YK G +   G           
Sbjct: 345 YHYVGGFYG-GCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTG----------- 392

Query: 220 TIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
            +RD          F  F+          L  HA+ ++G+G D  S   YW++ NSW T 
Sbjct: 393 -LRD---------PFNPFE----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTG 432

Query: 280 WGDNGLFKILRGKDECGIESSITAGVP 306
           WG+NG F+I RG DEC IES   A  P
Sbjct: 433 WGENGYFRIRRGTDECAIESIAVAATP 459


>gi|291408920|ref|XP_002720687.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Oryctolagus
           cuniculus]
          Length = 467

 Score = 87.8 bits (216), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 83/308 (26%), Positives = 116/308 (37%), Gaps = 101/308 (32%)

Query: 64  YNLPANRLPE-LIGYSEV------DEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG-- 114
           Y L  NR P  ++  +E+       E LP  F++  KWPN   I E  DQG+C   W   
Sbjct: 176 YRLGTNRPPSSVMNMNEIYTGLGSGEVLPTAFEASEKWPN--LIHEPLDQGNCAGSWAFS 233

Query: 115 --------------------CRPYEIAPCE-HHVNGTR------------------PSCD 135
                                 P  +  C+ HH  G R                    C 
Sbjct: 234 TAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHHQQGCRGGRLDGAWWFLRRRGVVSDHCY 293

Query: 136 ASKGH-------TPKCVREC--------QENYDVP----YKKDLNFGAKSYSVSSNEKSI 176
              GH        P C+           Q     P    +  D+     +Y + SNEK I
Sbjct: 294 PFSGHEQDEAGPAPPCMMHSRAMGRGKRQATARCPNSHVHANDIYQVTPAYRLGSNEKEI 353

Query: 177 MKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTV 236
           MKE+ E+GPV+    V +D  LY+ G +       T +SL +                  
Sbjct: 354 MKELLENGPVQALMEVHEDFFLYQGGIY-----SHTPVSLER------------------ 390

Query: 237 FDDLILYKSGKALGGHAIRILGWGED---EKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                  +  +  G H+++I GWGE+   +    KYW  ANSW   WG+ G F+ILRG +
Sbjct: 391 ------PERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRILRGTN 444

Query: 294 ECGIESSI 301
           EC IES +
Sbjct: 445 ECDIESFV 452


>gi|403357104|gb|EJY78168.1| Cathepsin B [Oxytricha trifallax]
          Length = 349

 Score = 87.8 bits (216), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 71/271 (26%), Positives = 102/271 (37%), Gaps = 84/271 (30%)

Query: 78  SEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG----------------------C 115
            +++E +P +FDSR KWPNC  I  IRDQ  CGSCW                        
Sbjct: 119 QDLNETIPESFDSRDKWPNC--IHGIRDQQLCGSCWAFASSAFLSDRFCIHSEGQINEDL 176

Query: 116 RPYEIAPCEHHVNG------------------TRPSCDASKGHTPKCVRECQENYDVPYK 157
            P ++  C +   G                      C         C  +CQ N   PY 
Sbjct: 177 SPQDLVSCSYENFGCSGGQLTESVDFLIYEGIVSEKCKPYMNQDTYCKFKCQ-NDKQPYT 235

Query: 158 KDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLI 217
           K      KS  + S+ + I  E+  +GP+    +V++DL+ YK G +             
Sbjct: 236 KYF-CEQKSMLILSDIEEIQLELMTNGPMMVGLSVYEDLMNYKEGVYE------------ 282

Query: 218 KWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWN 277
                                    Y +G  +GGHAI+I+GWG  EK  E +W   N W 
Sbjct: 283 -------------------------YTTGNQVGGHAIKIIGWGHTEKG-ELFWKCQNQWG 316

Query: 278 TDWGDNGLFKILRGKDECGIESSITAGVPKL 308
            DWG  G   I  G  E G+++ +   +P +
Sbjct: 317 KDWGMGGYINIKAG--ELGMDTMVLGCMPDI 345


>gi|289724789|gb|ADD18342.1| putative cysteine proteinase TIN-ag [Glossina morsitans morsitans]
          Length = 387

 Score = 87.8 bits (216), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 78/291 (26%), Positives = 116/291 (39%), Gaps = 87/291 (29%)

Query: 67  PANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW------------- 113
           P  R+  +     + + LP +F+S  KW +   I ++ DQG CGS W             
Sbjct: 125 PTYRVKAMSRLHNIVDHLPRSFNSIDKWAS--YISDVLDQGWCGSSWVISTASVASDRFA 182

Query: 114 ----GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDV-----PY-------- 156
               G    +++P ++ ++ TR     + GH     R   +   V     PY        
Sbjct: 183 IQSRGKEVIQLSP-QNILSCTRRQQGCNGGHLDAAWRYLHKQGVVDESCYPYVGYRDACK 241

Query: 157 -----KKDLNFGAKSYS--------------VSSNEKSIMKEIYEHGPVEGAFTVFDDLI 197
                +   N G +SYS                +NE  IM EI+  GPV+   TV+ D  
Sbjct: 242 IPHNSRSLRNNGCRSYSGVDRDELYTVGPAYSLNNETDIMAEIFMSGPVQATLTVYRDFF 301

Query: 198 LYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRIL 257
            Y  G +       TA S                              G  +G H+++++
Sbjct: 302 SYSGGIY-----RHTAAS-----------------------------RGSPVGFHSVKLI 327

Query: 258 GWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           GWGE E    KYW+  NSW T WG++G F+ILRG +ECGIE  + A  P +
Sbjct: 328 GWGE-EHDGNKYWIATNSWGTWWGEHGNFRILRGSNECGIEEYVLAAWPNV 377


>gi|449283627|gb|EMC90232.1| Tubulointerstitial nephritis antigen [Columba livia]
          Length = 469

 Score = 87.8 bits (216), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 53/148 (35%), Positives = 73/148 (49%), Gaps = 40/148 (27%)

Query: 164 AKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRD 223
           A  Y VSS E +IMKEI + GPV+    V++D  LYK G +                   
Sbjct: 357 ASHYRVSSKETNIMKEIMDKGPVQAIMKVYEDFFLYKEGIY------------------- 397

Query: 224 NTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWG---EDEKSKEKYWLIANSWNTDW 280
             SQ                K+G     H++++LGWG   +    K+K+W+ ANSW   W
Sbjct: 398 RHSQ----------------KAGSKWKTHSVKLLGWGALADKNGQKQKFWIAANSWGKSW 441

Query: 281 GDNGLFKILRGKDECGIESSI--TAGVP 306
           G+NG F+ILRG++EC IE  I  T+G P
Sbjct: 442 GENGYFRILRGQNECDIEKLILATSGQP 469


>gi|123478051|ref|XP_001322190.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
           [Trichomonas vaginalis G3]
 gi|121905031|gb|EAY09967.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
           [Trichomonas vaginalis G3]
          Length = 288

 Score = 87.8 bits (216), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 80/291 (27%), Positives = 117/291 (40%), Gaps = 96/291 (32%)

Query: 60  VHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG----- 114
           + PD  +P  R P+      ++  +P +++   ++P C     + DQG CGSCW      
Sbjct: 51  LRPD-TIPLARPPK------INISIPMSYNFTERFPQCDF--GVLDQGKCGSCWSFAVSK 101

Query: 115 ------CRPY---------EIAPCEHH---------VNGTR---------PSCDASKGHT 141
                 CR Y          +  C+           VN  R          SC    G+ 
Sbjct: 102 SFSHRYCRKYNKPVLFSQSHLVACDRRNSGCGGGIEVNAWRYIDLRGLPLDSCQPYDGNI 161

Query: 142 PK--CVREC---QENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDL 196
            K  C ++C    E Y+  + +  +  A+  S+   +  IM E    GPV  +  V+ DL
Sbjct: 162 TKYNCSKKCTNESETYEAQFTEYWSV-ARYASIEEMQIGIMTE----GPVTTSLKVYSDL 216

Query: 197 ILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRI 256
           + YKSG                                     +  +  G+ LG HA+ I
Sbjct: 217 MYYKSG-------------------------------------IYTHTKGEFLGHHAVEI 239

Query: 257 LGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPK 307
           +GWG   K+   YW+I+NSWNT WG NGLF I RG +EC IE  + AG  K
Sbjct: 240 IGWGT--KNGIDYWIISNSWNTTWGMNGLFLIKRGVNECHIEDYVCAGKVK 288


>gi|126327832|ref|XP_001363345.1| PREDICTED: dipeptidyl peptidase 1-like [Monodelphis domestica]
          Length = 462

 Score = 87.4 bits (215), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 65/248 (26%), Positives = 100/248 (40%), Gaps = 76/248 (30%)

Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
           +  +R+Q SCGSC+                           +I  C  +  G        
Sbjct: 246 VSPVRNQASCGSCYAFASMAMLEARIRILTNNSKTPVLSTQQIVSCSEYSQGCDGGFPYL 305

Query: 130 -----------TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
                         +C    GH   C  +    Y   Y  D ++    Y  + NE  +  
Sbjct: 306 IAGKYVQDFGVVEENCFPYLGHDSPCSPKNCTRY---YVSDYHYVGGFYG-ACNEALMKL 361

Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
           E+ E+GP+  AF V++D I Y+ G +   G            +RD         +F  F+
Sbjct: 362 ELVENGPMAVAFEVYNDFIHYQKGVYHHTG------------LRD---------SFNPFE 400

Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
                     +  HA+ ++G+G DEK+ E YW++ NSW + WG++G F+ILRG DECGIE
Sbjct: 401 ----------ITNHAVLLVGYGTDEKTGEHYWIVKNSWGSYWGEDGYFRILRGTDECGIE 450

Query: 299 SSITAGVP 306
           S   +  P
Sbjct: 451 SIAVSATP 458


>gi|159950|gb|AAA29435.1| cathepsin B-like cysteine protease, partial [Ostertagia ostertagi]
          Length = 105

 Score = 87.4 bits (215), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 43/140 (30%), Positives = 75/140 (53%), Gaps = 39/140 (27%)

Query: 165 KSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDN 224
           K+Y + ++ K+I K+I ++GPV   +TV++D   Y+SG                      
Sbjct: 2   KAYQLKNSVKAIQKDIMKNGPVVATYTVYEDFAHYRSG---------------------- 39

Query: 225 TSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNG 284
                          +  +K+G+  G HA++++GWGE++ +   YW++ANSW+ DWG+NG
Sbjct: 40  ---------------IYKHKAGRKTGLHAVKVIGWGEEKGTP--YWIVANSWHDDWGENG 82

Query: 285 LFKILRGKDECGIESSITAG 304
            F++ RG ++CG E  + AG
Sbjct: 83  FFRMHRGSNDCGFEERMAAG 102


>gi|189238903|ref|XP_967834.2| PREDICTED: similar to tubulointerstitial nephritis antigen
           [Tribolium castaneum]
          Length = 453

 Score = 87.4 bits (215), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 77/244 (31%), Positives = 105/244 (43%), Gaps = 38/244 (15%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHV-NGTRPSCDASKGHTP 142
           LP  FDS  KWP    + EI+DQG CGS W      +A     + +  R     S  H  
Sbjct: 205 LPREFDSEFKWPG--WMSEIQDQGWCGSSWAITTAAVASDRFAILSKGREKVTLSAQHLL 262

Query: 143 KCVRECQENYDVPY--------KKDLNFGAKSYSVS-SNEKSIMKEIYEHGPVEGAF--- 190
            C R  Q++ +  Y        +K      + +  S +NEK     I   G +  A    
Sbjct: 263 SCDRRGQQSCNGGYLDRAWSYIRKIGLVDEQCFPYSATNEKC---RIPRRGDLVTANCQL 319

Query: 191 -TVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG--- 246
            T  D    YK    +  GNET  M  I             +    V+ D   YK G   
Sbjct: 320 PTNVDRRSKYKVAPAYRVGNETDIMYEI-------LHSGPVQATMKVYHDFFTYKRGIYR 372

Query: 247 -------KALGGHAIRILGWGEDEKSK--EKYWLIANSWNTDWGDNGLFKILRGKDECGI 297
                     G H++RI+GWGE+   +  +KYW +ANSW  +WG+NG F+ILRG +EC I
Sbjct: 373 HSPISTNDRTGYHSVRIVGWGEEYSPEGLKKYWKVANSWGPEWGENGYFRILRGSNECEI 432

Query: 298 ESSI 301
           ES +
Sbjct: 433 ESFV 436


>gi|290992564|ref|XP_002678904.1| predicted protein [Naegleria gruberi]
 gi|284092518|gb|EFC46160.1| predicted protein [Naegleria gruberi]
          Length = 289

 Score = 87.4 bits (215), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 76/249 (30%), Positives = 109/249 (43%), Gaps = 45/249 (18%)

Query: 60  VHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYE 119
           VHP  NLP   +P        +    ++FD+RTKW  C  +  IRDQ  CGSCW     E
Sbjct: 66  VHPINNLPKKTMP-------ANLKAASSFDARTKWGKC--VHPIRDQQQCGSCWAFSASE 116

Query: 120 IAPCEHHVNGTRPSCDASKGH-----TPKCVRECQENYDVPYKKDLNF--GAKSYSVSSN 172
           +         +   C AS G      +P+ + +C       Y  D  +   A ++   + 
Sbjct: 117 VL--------SDRFCIASNGSVDVVLSPEYMLQCDST---DYGCDGGYLNNAWAFLAGTG 165

Query: 173 EKSIMKEIYE--HGPVEGAFTVFDD---LILYKSGRFFVPGNETTAMSLIKWTIRDNTSQ 227
             S   + Y   +G V    T   D   + LYK+       +    +S I    +D  + 
Sbjct: 166 IPSDKCDPYTSGNGDVGSCPTSCTDGSAIKLYKA-----KSSSVAQLSSIDDIQKDIQAN 220

Query: 228 LGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEK-YWLIANSWNTD 279
              + AF+V+ D   YKSG          GGHAI+I+GWG     K+  YW++ANSWNT+
Sbjct: 221 GPVQAAFSVYQDFFSYKSGVYRHVSGSLAGGHAIKIVGWGVTSDGKDTPYWIVANSWNTN 280

Query: 280 WGDNGLFKI 288
           WG  G F I
Sbjct: 281 WGQEGFFWI 289


>gi|193688334|ref|XP_001945855.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
           [Acyrthosiphon pisum]
          Length = 313

 Score = 87.4 bits (215), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 64/236 (27%), Positives = 98/236 (41%), Gaps = 52/236 (22%)

Query: 73  ELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRP 132
           ELI  S + E    N + R+ W    +   +   G   S  GC+P++  P  + +   + 
Sbjct: 129 ELISCSGIKET-NGNVNERSIWEYLKS-HGVVSGGKYNSNDGCQPFKFPPIANILTHLQH 186

Query: 133 SCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTV 192
           +CD            C  N  + Y  D       Y++ +    I KE+  +GPV   F V
Sbjct: 187 TCD----------DHCYGNTSINYNHDHVRVRNYYTIRTG--YIQKEVQTYGPVAVQFKV 234

Query: 193 FDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGH 252
            DD +LYKSG +    N                                     K +   
Sbjct: 235 CDDFLLYKSGVYVKSDN------------------------------------AKVIRTQ 258

Query: 253 AIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
             +++GWG +  +   YWL+ NSW  +WG  GLFKI RG ++CG+ES + AGVP++
Sbjct: 259 YAKLIGWGVE--NGVDYWLVINSWGHEWGQKGLFKIKRGTNQCGVESVVYAGVPEI 312


>gi|339235559|ref|XP_003379334.1| dipeptidyl-peptidase 1 [Trichinella spiralis]
 gi|316978005|gb|EFV61034.1| dipeptidyl-peptidase 1 [Trichinella spiralis]
          Length = 465

 Score = 87.0 bits (214), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 71/268 (26%), Positives = 109/268 (40%), Gaps = 78/268 (29%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHV---NGTRPS------- 133
           LP  FD R    N   I ++RDQ +CGSC+      +    +H+   N  R +       
Sbjct: 232 LPEKFDWRNNNGN-NFIGDVRDQKNCGSCYAFASASMLEARYHILTQNRERVTFSPQDVV 290

Query: 134 -------------------------------CDASKGHTPKCV--RECQENYDVPYKKDL 160
                                          C A  G   +C     C+  Y   Y+   
Sbjct: 291 NCSPYSQGCDGGFSYLIAGKYAEDYGMVSERCVAYTGKQQQCRTPSTCERYYATDYEY-- 348

Query: 161 NFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWT 220
                 Y  +SNE  +M+ + ++GP+   F V DD + Y  G +    + T+A+S +KW 
Sbjct: 349 ---IGGYYGASNEILMMQALVKNGPIAVGFEVHDDFLSYSHGIY----HYTSAVSPLKWN 401

Query: 221 IRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDW 280
                           F ++           HA+ I+G+G DE +KEKYW++ NSW   +
Sbjct: 402 ---------------PFVEV----------NHAVIIVGYGTDEMTKEKYWIVKNSWGRKF 436

Query: 281 GDNGLFKILRGKDECGIESSITAGVPKL 308
           G++G F+I RG +ECGIES      P +
Sbjct: 437 GEDGYFRIRRGTNECGIESLAFQATPII 464


>gi|294914603|ref|XP_002778294.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239886508|gb|EER10089.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 365

 Score = 87.0 bits (214), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 63/207 (30%), Positives = 88/207 (42%), Gaps = 60/207 (28%)

Query: 114 GCRPYEIAPCEHHVNGT--RPSCDASKGHTPKCVREC-QENYDVPYKKDLNFGAKSY-SV 169
           GC PY    C HH   +  +P C      TP C   C    Y   + KD ++    + S 
Sbjct: 208 GCWPYNFPKCAHHQKESDYKP-CAKEIYDTPSCSSSCPNAKYGTAFDKDRHYTESLFPSR 266

Query: 170 SSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLG 229
             +  SI KEI  +GP   A                                        
Sbjct: 267 FGSTSSIKKEIMTNGPTSAA---------------------------------------- 286

Query: 230 AEGAFTVFDDLILYKSGKA-------LGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
               F+V++D + YKSG         LGGHA+ I+GWG ++     YWL+ NSWN +WGD
Sbjct: 287 ----FSVYEDFLSYKSGVYKHTSGGFLGGHAVEIIGWGTEKGVD--YWLVMNSWNEEWGD 340

Query: 283 NGLFKILRGKDECGIESSITAGVPKLD 309
           +G FKI++G  +CGI+  I AG P ++
Sbjct: 341 HGTFKIVQG--DCGIDDMILAGTPAIN 365



 Score = 38.5 bits (88), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 18/46 (39%), Positives = 26/46 (56%), Gaps = 1/46 (2%)

Query: 70  RLPELIGYSEVDEDLPANFDSRTKWPNCP-TIREIRDQGSCGSCWG 114
            L E +  +E   D+P +FD+R  +  C   I  +RDQ +CGSCW 
Sbjct: 86  ELEEKVYPAEELVDIPDSFDARDAFKECKDVIGHVRDQSACGSCWA 131


>gi|321476473|gb|EFX87434.1| hypothetical protein DAPPUDRAFT_221708 [Daphnia pulex]
          Length = 464

 Score = 87.0 bits (214), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 67/271 (24%), Positives = 104/271 (38%), Gaps = 82/271 (30%)

Query: 82  EDLPANFDSRTKWPNCPTIREI---RDQGSCGSCWG----------------------CR 116
           E LP  +D    W N   +  +   ++QGSCGSC+                         
Sbjct: 228 EFLPEEWD----WRNVSGVNYVPVVKNQGSCGSCYAFSSMGMLESRLRVATKNQVQVNLS 283

Query: 117 PYEIAPCEHHVNG-------------------TRPSCDASKGHTPKC--VRECQENYDVP 155
           P +I  C  +  G                       C    G    C   ++CQ +Y   
Sbjct: 284 PQDIVSCSAYSQGCEGGFPYLIAGKYAQDHGVVAEECYPYTGRDSACSAAKKCQRSYVAK 343

Query: 156 YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMS 215
           Y+         Y  + NE+ +   + E GP+  +F V+ D + Y  G +           
Sbjct: 344 YRY-----VGGYYGACNEELMKMSLVESGPLSVSFEVYSDFMHYAGGVYH---------- 388

Query: 216 LIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANS 275
                          +G F   ++   ++    L  HA+ ++G+G D ++KEKYW++ NS
Sbjct: 389 -------------RTDGLFNKINEFNPFE----LTNHAVLLVGYGTDSQTKEKYWIVKNS 431

Query: 276 WNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           W T WG++G F+I RG DECGIES      P
Sbjct: 432 WGTKWGEDGFFRIRRGVDECGIESIAVEVTP 462


>gi|47212965|emb|CAF93376.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 271

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 48/149 (32%), Positives = 75/149 (50%), Gaps = 32/149 (21%)

Query: 156 YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMS 215
           Y+ D+      Y +S++EK IMKEI ++GPV+    V +D  +Y SG +     + T +S
Sbjct: 137 YQNDIYQSTPPYRLSTSEKEIMKEIQDNGPVQAIMEVHEDFFMYNSGIY-----KHTDVS 191

Query: 216 LIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEK---SKEKYWLI 272
             K                            +  G H+++I GWGE+     +  KYW+ 
Sbjct: 192 FTK------------------------PPHYRKHGTHSVKITGWGEERNFDGTTRKYWIA 227

Query: 273 ANSWNTDWGDNGLFKILRGKDECGIESSI 301
           ANSW  +WG+NG F+I RG++EC IE+ +
Sbjct: 228 ANSWGKNWGENGYFRIARGENECEIEAFV 256


>gi|242014495|ref|XP_002427925.1| tubulointerstitial nephritis antigen, putative [Pediculus humanus
           corporis]
 gi|212512409|gb|EEB15187.1| tubulointerstitial nephritis antigen, putative [Pediculus humanus
           corporis]
          Length = 473

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 80/274 (29%), Positives = 104/274 (37%), Gaps = 88/274 (32%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGT-RPSCDASKGHTP 142
           LP +FD+R KWP    I    DQG CG+ W      +A   + +        D S  H  
Sbjct: 190 LPNSFDARNKWPG--WISGPADQGWCGASWAVSTASVASDRYAIMSKGLTKVDLSPQHLL 247

Query: 143 KC---VRECQ-----------------ENYDVPYK---------KDLNFGAKSY----SV 169
            C    R CQ                 ++Y  P+          K  NF A S     S+
Sbjct: 248 SCNKGQRGCQGGHLSRAWTFIRKFGLVDDYCYPWTGTPTKCKIPKRPNFDALSSICPPSL 307

Query: 170 SSN----------------EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTA 213
            SN                EK IM+EI + GPV+    V+ D   YKSG +         
Sbjct: 308 GSNLRSELYRVGPAYKIQDEKDIMEEIMQSGPVQATMKVYQDFFSYKSGVY--------- 358

Query: 214 MSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEK---SKEKYW 270
                   + NT                  +     G H+++ILGWGE+        KYW
Sbjct: 359 -------TKSNTE-----------------RESSNFGYHSVKILGWGEETNIYGQPIKYW 394

Query: 271 LIANSWNTDWGDNGLFKILRGKDECGIESSITAG 304
           L ANSW   WG+NG FKI RG +EC IE  + A 
Sbjct: 395 LAANSWGQQWGENGFFKIRRGTNECEIEEFVLAA 428


>gi|196009233|ref|XP_002114482.1| hypothetical protein TRIADDRAFT_28083 [Trichoplax adhaerens]
 gi|190583501|gb|EDV23572.1| hypothetical protein TRIADDRAFT_28083 [Trichoplax adhaerens]
          Length = 466

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 68/271 (25%), Positives = 101/271 (37%), Gaps = 89/271 (32%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCWG----------------------CRPYEIA 121
            P  FD R    N   +  +R+QG+CGSC+                         P ++ 
Sbjct: 235 FPKQFDWRNV-SNVNYVSPVRNQGACGSCYAFSSMAMYEARLRVLSKNSVKRVMSPQDVV 293

Query: 122 PCEHHVNG-------------------TRPSC-------DASKGHTPKCVRECQENYDVP 155
            C  +  G                      SC       +  K    KC R    NY   
Sbjct: 294 SCSEYAQGCAGGFPYLIAGKYGEDFGLVEESCFPYNGKDEPCKETKSKCRRHSTTNY--- 350

Query: 156 YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMS 215
                 +    +  + NE  +M+E+ ++GP+  +F V+ D   YK G +   G      S
Sbjct: 351 ------YYVGGFYGACNEYLMMRELVKNGPISISFEVYGDFKHYKGGIYQHTG---LGDS 401

Query: 216 LIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANS 275
              W I +                            HA+ ++G+G D+KS + YW++ NS
Sbjct: 402 YNPWQITN----------------------------HAVLLVGYGTDQKSGKDYWIVKNS 433

Query: 276 WNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           W T WG+NG F+ILRG DEC IE+   A  P
Sbjct: 434 WGTKWGENGFFRILRGVDECSIENEAVAVTP 464


>gi|301779281|ref|XP_002925058.1| PREDICTED: dipeptidyl peptidase 1-like [Ailuropoda melanoleuca]
 gi|281337582|gb|EFB13166.1| hypothetical protein PANDA_014484 [Ailuropoda melanoleuca]
          Length = 461

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 76/266 (28%), Positives = 104/266 (39%), Gaps = 74/266 (27%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCWG----------------------CRPYEIA 121
           LPA++D R        +  +R+Q SCGSC+                         P E+ 
Sbjct: 229 LPASWDWRNV-HGTNFVSPVRNQASCGSCYAFASMGMLEARIRILTNNTQTPILSPQEVV 287

Query: 122 PCEHHVNGTR---PSCDASK-GHTPKCVRECQENY---DVP----------YKKDLNFGA 164
            C  +  G     P   A K       V E    Y   D P          Y  D ++  
Sbjct: 288 SCSQYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYMGADFPCKPKKDCFRYYSSDYHYVG 347

Query: 165 KSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDN 224
             Y    NE  +  E+  HGP+  AF V+DD   Y++G ++  G            +RD 
Sbjct: 348 GFYG-GCNEALMKLELVHHGPIAVAFQVYDDFFHYRTGIYYHTG------------LRD- 393

Query: 225 TSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNG 284
                    F  F+          L  HA+ ++G+G D  S   YW++ NSW   WG+NG
Sbjct: 394 --------PFNPFE----------LTNHAVLLVGYGTDTASGMDYWIVKNSWGAGWGENG 435

Query: 285 LFKILRGKDECGIESSITAG--VPKL 308
            F+I RG DEC IES   A   VPKL
Sbjct: 436 YFRIRRGTDECAIESIAVAATPVPKL 461


>gi|201023319|ref|NP_001128401.1| cathepsin B-10270 precursor [Acyrthosiphon pisum]
 gi|239788119|dbj|BAH70754.1| ACYPI000021 [Acyrthosiphon pisum]
          Length = 341

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 69/219 (31%), Positives = 89/219 (40%), Gaps = 59/219 (26%)

Query: 101 REIRDQGSCGSCWGCRPYEIAPCEHHVNGTR-PS--CDASKGHTPKCVRECQE-NYDVPY 156
           R +   G  GS  GC+P+ I PC H V   R PS  C   K  TP+C   C   NY  P+
Sbjct: 171 RGLVTGGDYGSNEGCQPWLIPPCNHTVMDERSPSYMCGKYKSETPQCTLNCYNPNYSKPF 230

Query: 157 KKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSL 216
            KD++ G +     S    I  E+ +HGP                               
Sbjct: 231 LKDISKGIRIDWHCSG--MIRNELKKHGP------------------------------- 257

Query: 217 IKWTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKY 269
                        A     V++D + YKSG       K LG   ++++GWG     +  Y
Sbjct: 258 -------------ATAIMRVYEDFLTYKSGIYQHVTGKLLGQITVKVIGWGVYRGVQ--Y 302

Query: 270 WLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           WL ANSW T WGD G FKI RG +EC  E    +G P L
Sbjct: 303 WLAANSWGTSWGDKGFFKIRRGYNECLFEDYFISGRPVL 341



 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 20/33 (60%), Positives = 26/33 (78%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
           CG GCNGG+ G AW+YW+K G+V+GG YGS + 
Sbjct: 152 CGDGCNGGYSGAAWQYWMKRGLVTGGDYGSNEG 184


>gi|159108157|ref|XP_001704351.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
 gi|157432412|gb|EDO76677.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
          Length = 360

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 71/250 (28%), Positives = 99/250 (39%), Gaps = 76/250 (30%)

Query: 85  PANFDSRTKWPNCPTIREIRDQGSCGSCWG-----------CRP-----------YEIAP 122
           P ++D R ++P+C  I E+ DQG+CGSCW            CR              +  
Sbjct: 141 PESYDFRDEYPHC--ITEVVDQGNCGSCWAFSSVQTFADHRCRSGLDATGVSYSVQYVLD 198

Query: 123 CE---HHVNGTRPSCDASKGHTPKCVRECQENYDV---------PYKKDLNFGAKSYSVS 170
           C+   H  NG  P    +  H    V      Y           P K D     ++   +
Sbjct: 199 CDRKDHGCNGGEPVNAFNFLHNTGTVLASCVGYTAGDDAVVKFCPQKCDDGSAVENVVAT 258

Query: 171 SNEKS--IMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQL 228
           S  KS   +  +  HGPV   F V  D + YKSG +                        
Sbjct: 259 SGSKSGSAIDVLLAHGPVVATFNVAQDFMYYKSGVY------------------------ 294

Query: 229 GAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKI 288
                         ++ G  LGGHA+ I+G+G  + S   YW + NSW  DWG++G F+I
Sbjct: 295 -------------QHRWGLWLGGHAVEIIGYGVTD-SGLDYWTVRNSWGPDWGEDGYFRI 340

Query: 289 LRGKDECGIE 298
           +RG DECGIE
Sbjct: 341 VRGGDECGIE 350


>gi|308494436|ref|XP_003109407.1| hypothetical protein CRE_08204 [Caenorhabditis remanei]
 gi|308246820|gb|EFO90772.1| hypothetical protein CRE_08204 [Caenorhabditis remanei]
          Length = 470

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 51/137 (37%), Positives = 68/137 (49%), Gaps = 31/137 (22%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           Y VSS E+ I  E+  +GPV+  F V +D  +Y  G +                  D  +
Sbjct: 336 YKVSSREEDIQTELMTNGPVQATFVVHEDFFMYAGGVY---------------QHSDLAA 380

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKE--KYWLIANSWNTDWGDNG 284
           Q GA              S  A G H++R+LGWG D  +    KYWL ANSW T WG++G
Sbjct: 381 QKGA--------------SSVAEGYHSVRVLGWGVDHSTGRPIKYWLCANSWGTQWGEDG 426

Query: 285 LFKILRGKDECGIESSI 301
            FKILRG++ C IES +
Sbjct: 427 YFKILRGENHCEIESFV 443


>gi|383861394|ref|XP_003706171.1| PREDICTED: tubulointerstitial nephritis antigen-like [Megachile
           rotundata]
          Length = 442

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 73/252 (28%), Positives = 107/252 (42%), Gaps = 47/252 (18%)

Query: 82  EDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHV--NGTRPSCDASKG 139
           E LP  FDSRT+WP    I +I DQG CG+ W     ++A     +   GT  + + S  
Sbjct: 198 ESLPREFDSRTRWPR--DISKITDQGWCGASWAISSAQVASDRFAIMSKGT-DAVELSAQ 254

Query: 140 HTPKCVRECQE-----NYDVPYKKDLNFG----------AKSYSVSSNEKSIMKEIYEHG 184
           H   C    Q+     + D  +     FG          A + +    +++ ++      
Sbjct: 255 HLLSCNNRGQQGCSGGHLDRAWMFMRRFGLVDENCYPWKASTETCRLRKRTDLRSAGCAP 314

Query: 185 PVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYK 244
           P     T      LYK G  +   NET  M  I        +    +    V+ D   Y+
Sbjct: 315 PPNPLRTE-----LYKVGPAYRLANETDIMQEI-------LTSGPVQATMRVYQDFFSYE 362

Query: 245 SGKALGG----------HAIRILGWGED-----EKSKEKYWLIANSWNTDWGDNGLFKIL 289
           SG               H++RI+GWGE+       +  KYWL+ANSW   WG+NGLF+I 
Sbjct: 363 SGVYKHSVTAELYESDYHSVRIIGWGEEPPTYSRNTPLKYWLVANSWGQQWGENGLFRIQ 422

Query: 290 RGKDECGIESSI 301
           +G +EC IES +
Sbjct: 423 KGTNECEIESFV 434


>gi|193202653|ref|NP_492593.2| Protein F26E4.3 [Caenorhabditis elegans]
 gi|205371857|sp|P90850.3|YCF2E_CAEEL RecName: Full=Uncharacterized peptidase C1-like protein F26E4.3;
           Flags: Precursor
 gi|166157004|emb|CAB03007.2| Protein F26E4.3 [Caenorhabditis elegans]
          Length = 452

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 50/137 (36%), Positives = 69/137 (50%), Gaps = 31/137 (22%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           Y VSS E+ I  E+  +GPV+  F V +D  +Y  G +                  D  +
Sbjct: 318 YKVSSREEDIQTELMTNGPVQATFVVHEDFFMYAGGVY---------------QHSDLAA 362

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKE--KYWLIANSWNTDWGDNG 284
           Q GA              S  A G H++R+LGWG D  + +  KYWL ANSW T WG++G
Sbjct: 363 QKGA--------------SSVAEGYHSVRVLGWGVDHSTGKPIKYWLCANSWGTQWGEDG 408

Query: 285 LFKILRGKDECGIESSI 301
            FK+LRG++ C IES +
Sbjct: 409 YFKVLRGENHCEIESFV 425


>gi|341891358|gb|EGT47293.1| hypothetical protein CAEBREN_29072 [Caenorhabditis brenneri]
          Length = 349

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 51/140 (36%), Positives = 67/140 (47%), Gaps = 31/140 (22%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           Y VSS E+ I  E+  +GPV+  F V +D  +Y  G +                  D  +
Sbjct: 215 YKVSSREEDIQTELMTNGPVQATFVVHEDFFMYAGGVY---------------QHSDLAA 259

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKE--KYWLIANSWNTDWGDNG 284
           Q GA              S  A G H++R+LGWG D  +    KYWL ANSW T WG++G
Sbjct: 260 QKGA--------------SSVAEGYHSVRVLGWGVDHSTGRPIKYWLCANSWGTQWGEDG 305

Query: 285 LFKILRGKDECGIESSITAG 304
            FKILRG + C IES +   
Sbjct: 306 YFKILRGDNHCEIESFVVGA 325


>gi|268564843|ref|XP_002639246.1| Hypothetical protein CBG03805 [Caenorhabditis briggsae]
          Length = 526

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 51/137 (37%), Positives = 68/137 (49%), Gaps = 31/137 (22%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           Y VSS E+ I  E+  +GPV+  F V +D  +Y  G +                  D  +
Sbjct: 392 YKVSSREEDIQTELMTNGPVQATFVVHEDFFMYAGGVY---------------QHSDLAA 436

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKE--KYWLIANSWNTDWGDNG 284
           Q GA              S  A G H++R+LGWG D  +    KYWL ANSW T WG++G
Sbjct: 437 QKGA--------------SSVAEGYHSVRVLGWGVDHSTGRPIKYWLCANSWGTQWGEDG 482

Query: 285 LFKILRGKDECGIESSI 301
            FKILRG++ C IES +
Sbjct: 483 YFKILRGENHCEIESFV 499


>gi|403340695|gb|EJY69640.1| Cathepsin B [Oxytricha trifallax]
          Length = 247

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 76/263 (28%), Positives = 113/263 (42%), Gaps = 51/263 (19%)

Query: 67  PANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHH 126
           P   +PE   ++++   +P  FDSR +W NC  +  IRDQ  CGSCW     E       
Sbjct: 15  PVEGIPEPAQHNDI---VPKTFDSREQWGNC--VHPIRDQAQCGSCWAFGASETL----- 64

Query: 127 VNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAK------SYSVSSNEKSIMKEI 180
              +   C AS   T   +       D+      N G        ++S  +N  ++    
Sbjct: 65  ---SDRICIASDKKTDVILSP----EDLVACDGWNMGCNGGILPWAWSYLTNTGAVEDSC 117

Query: 181 YEHGPVEGAFTVF--------DDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
           + +   +GA            D    YK  +  V   + + +  IK  I  N      E 
Sbjct: 118 FPYSSDKGAVPTCAKKCQNDKDSFTKYKCKKNSVV--QASGVDKIKAEISKNGPM---ET 172

Query: 233 AFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
            FTV++D + Y+SG         LGGHA++I+G+G+       YW+ ANSW+  WG+ G 
Sbjct: 173 GFTVYEDFMNYESGVYHHTTGNQLGGHAVKIVGYGD------GYWICANSWSEKWGEKGF 226

Query: 286 FKILRGKDECGIESSITAGVPKL 308
           F I  G  ECGI+S+  A  P L
Sbjct: 227 FNI--GFGECGIDSAAYACTPDL 247


>gi|354459545|pdb|3PDF|A Chain A, Discovery Of Novel Cyanamide-Based Inhibitors Of Cathepsin
           C
          Length = 441

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 67/253 (26%), Positives = 98/253 (38%), Gaps = 77/253 (30%)

Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
           +  +R+Q SCGSC+                         P E+  C  +  G        
Sbjct: 222 VSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQYAQGCEGGFPYL 281

Query: 130 -----------TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
                         +C    G    C  + +E+    Y  + ++    Y    NE  +  
Sbjct: 282 IAGKYAQDFGLVEEACFPYTGTDSPC--KMKEDCFRYYSSEYHYVGGFYG-GCNEALMKL 338

Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
           E+  HGP+  AF V+DD + YK G +   G            +RD          F  F+
Sbjct: 339 ELVHHGPMAVAFEVYDDFLHYKKGIYHHTG------------LRD---------PFNPFE 377

Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
                     L  HA+ ++G+G D  S   YW++ NSW T WG+NG F+I RG DEC IE
Sbjct: 378 ----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIE 427

Query: 299 SSITAG--VPKLD 309
           S   A   +PKL+
Sbjct: 428 SIAVAATPIPKLE 440


>gi|308160258|gb|EFO62754.1| Cathepsin B precursor [Giardia lamblia P15]
          Length = 298

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 72/252 (28%), Positives = 113/252 (44%), Gaps = 62/252 (24%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPK 143
           +P +FD R ++P+C  I E+ DQG CGSCW                   S  AS G    
Sbjct: 74  VPDSFDFREEYPHC--IPEVVDQGGCGSCWAF-----------------SSVASVGD--- 111

Query: 144 CVRECQENYDVPYKKDLNFGAKSYSVSSNEKSI------MKEIYEHGPVEGAFTVFDDLI 197
             R C    D   KK + + +  Y VS +   +      +  ++      G  T  D+ +
Sbjct: 112 --RRCVAGLD---KKAVRY-SPQYVVSCDRGDMACDGGWLPSVWRFLVKTGTTT--DECV 163

Query: 198 LYKSGRFFVPGN------ETTAMSLIKWT------------IRDNTSQLGAEGAFTVFDD 239
            Y+SG     G       + + + + K T            ++   +    + AFTV+ D
Sbjct: 164 PYQSGSTGARGTCPTKCADGSELPIYKATKAVDYGLDCDLIMKALATGGPLQTAFTVYSD 223

Query: 240 LILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
            + Y+ G       +A GGHA+ ++G+G DE   + YW+I NSW  DWG++G F+I+R  
Sbjct: 224 FMYYQGGVYQHVYGRAEGGHAVEMVGYGTDEYDVD-YWIIRNSWGPDWGEDGYFRIIRMT 282

Query: 293 DECGIESSITAG 304
           +ECGIE  +  G
Sbjct: 283 NECGIEEQVIGG 294


>gi|10803441|emb|CAC13133.1| putative cathepsin B.7 [Ostertagia ostertagi]
          Length = 198

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 50/164 (30%), Positives = 73/164 (44%), Gaps = 39/164 (23%)

Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEK 174
           CR YEI PC +H N        S   TP C + C+  Y   Y  D  +G  +Y + ++  
Sbjct: 72  CRSYEIHPCGYHGNEPFYGHCHSMARTPPCKKRCRPGYKNSYMMDKRYGTSAYELPNSVX 131

Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
           +I ++I E+GPV   F V++D   YKSG                                
Sbjct: 132 AIQRDIMENGPVVAGFDVYEDFKYYKSG-------------------------------- 159

Query: 235 TVFDDLILYKSGKALGGHAIRILGWGED--EKSKEKYWLIANSW 276
                +  + +GK  GGHA++++GWGE+  E     YW+IANSW
Sbjct: 160 -----IYRHTAGKXTGGHAVKVIGWGEEXTENGTIPYWIIANSW 198



 Score = 40.0 bits (92), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 17/32 (53%), Positives = 23/32 (71%)

Query: 9  CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
          CG GC GG+P  AW+Y V  G+V+GG +G K+
Sbjct: 39 CGAGCEGGWPIEAWKYGVTEGVVTGGNFGRKE 70


>gi|341898422|gb|EGT54357.1| hypothetical protein CAEBREN_10381 [Caenorhabditis brenneri]
          Length = 466

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 51/137 (37%), Positives = 67/137 (48%), Gaps = 31/137 (22%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           Y VSS E+ I  E+  +GPV+  F V +D  +Y  G +                  D  +
Sbjct: 332 YKVSSREEDIQTELMTNGPVQATFVVHEDFFMYAGGVY---------------QHSDLAA 376

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKE--KYWLIANSWNTDWGDNG 284
           Q GA              S  A G H++R+LGWG D  +    KYWL ANSW T WG++G
Sbjct: 377 QKGA--------------SSVAEGYHSVRVLGWGVDHSTGRPIKYWLCANSWGTQWGEDG 422

Query: 285 LFKILRGKDECGIESSI 301
            FKILRG + C IES +
Sbjct: 423 YFKILRGDNHCEIESFV 439


>gi|62897637|dbj|BAD96758.1| cathepsin C isoform a preproprotein variant [Homo sapiens]
          Length = 463

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 71/267 (26%), Positives = 101/267 (37%), Gaps = 82/267 (30%)

Query: 84  LPANFDSRTKWPNCPTIR---EIRDQGSCGSCWG----------------------CRPY 118
           LP ++D    W N   I     +R+Q SCGSC+                         P 
Sbjct: 231 LPTSWD----WRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQ 286

Query: 119 EIAPCEHHVNG-------------------TRPSCDASKGHTPKCVRECQENYDVPYKKD 159
           E+  C  +  G                      +C    G    C  + +E+    Y  +
Sbjct: 287 EVVSCSQYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYTGTDSPC--KMKEDCFRYYSSE 344

Query: 160 LNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKW 219
            ++    Y    NE  +  E+  HGP+  AF V+DD + YK G +   G           
Sbjct: 345 YHYVGGFYG-GCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTG----------- 392

Query: 220 TIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
            +RD          F  F+          L  HA+ ++G+G D  S   YW++ NSW T 
Sbjct: 393 -LRD---------PFNPFE----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTG 432

Query: 280 WGDNGLFKILRGKDECGIESSITAGVP 306
           WG+NG F+I RG DEC IES   A  P
Sbjct: 433 WGENGYFRIRRGTDECAIESIAVAATP 459


>gi|17933071|gb|AAL48192.1| cathepsin C [Homo sapiens]
          Length = 463

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 65/248 (26%), Positives = 94/248 (37%), Gaps = 75/248 (30%)

Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
           +  +R+Q SCGSC+                         P E+  C  +  G        
Sbjct: 246 VSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQYAQGCEGGFPYL 305

Query: 130 -----------TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
                         +C    G    C  + +E+    Y  + ++    Y    NE  +  
Sbjct: 306 IAGKYAQDFGLVEEACFPYTGTDSPC--KMKEDCFRYYSSEYHYVGGFYG-GCNEALMKL 362

Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
           E+  HGP+  AF V+DD + YK G +   G            +RD          F  F+
Sbjct: 363 ELVHHGPMAVAFEVYDDFLHYKKGIYHHTG------------LRD---------PFNPFE 401

Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
                     L  HA+ ++G+G D  S   YW++ NSW T WG+NG F+I RG DEC IE
Sbjct: 402 ----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIE 451

Query: 299 SSITAGVP 306
           S   A  P
Sbjct: 452 SIAVAATP 459


>gi|119579767|gb|EAW59363.1| cathepsin C, isoform CRA_a [Homo sapiens]
          Length = 316

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 65/248 (26%), Positives = 94/248 (37%), Gaps = 75/248 (30%)

Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
           +  +R+Q SCGSC+                         P E+  C  +  G        
Sbjct: 99  VSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQYAQGCEGGFPYL 158

Query: 130 -----------TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
                         +C    G    C  + +E+    Y  + ++    Y    NE  +  
Sbjct: 159 IAGKYAQDFGLVEEACFPYTGTDSPC--KMKEDCFRYYSSEYHYVGGFYG-GCNEALMKL 215

Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
           E+  HGP+  AF V+DD + YK G +   G            +RD          F  F+
Sbjct: 216 ELVHHGPMAVAFEVYDDFLHYKKGIYHHTG------------LRD---------PFNPFE 254

Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
                     L  HA+ ++G+G D  S   YW++ NSW T WG+NG F+I RG DEC IE
Sbjct: 255 ----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIE 304

Query: 299 SSITAGVP 306
           S   A  P
Sbjct: 305 SIAVAATP 312


>gi|317373330|sp|P53634.2|CATC_HUMAN RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
           AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
           peptidase I; Short=DPP-I; Short=DPPI; AltName:
           Full=Dipeptidyl transferase; Contains: RecName:
           Full=Dipeptidyl peptidase 1 exclusion domain chain;
           AltName: Full=Dipeptidyl peptidase I exclusion domain
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           light chain; AltName: Full=Dipeptidyl peptidase I light
           chain; Flags: Precursor
 gi|17933069|gb|AAL48191.1| cathepsin C [Homo sapiens]
          Length = 463

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 71/267 (26%), Positives = 101/267 (37%), Gaps = 82/267 (30%)

Query: 84  LPANFDSRTKWPNCPTIR---EIRDQGSCGSCWG----------------------CRPY 118
           LP ++D    W N   I     +R+Q SCGSC+                         P 
Sbjct: 231 LPTSWD----WRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQ 286

Query: 119 EIAPCEHHVNG-------------------TRPSCDASKGHTPKCVRECQENYDVPYKKD 159
           E+  C  +  G                      +C    G    C  + +E+    Y  +
Sbjct: 287 EVVSCSQYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYTGTDSPC--KMKEDCFRYYSSE 344

Query: 160 LNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKW 219
            ++    Y    NE  +  E+  HGP+  AF V+DD + YK G +   G           
Sbjct: 345 YHYVGGFYG-GCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTG----------- 392

Query: 220 TIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
            +RD          F  F+          L  HA+ ++G+G D  S   YW++ NSW T 
Sbjct: 393 -LRD---------PFNPFE----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTG 432

Query: 280 WGDNGLFKILRGKDECGIESSITAGVP 306
           WG+NG F+I RG DEC IES   A  P
Sbjct: 433 WGENGYFRIRRGTDECAIESIAVAATP 459


>gi|60827947|gb|AAX36820.1| cathepsin C [synthetic construct]
 gi|61368416|gb|AAX43175.1| cathepsin C [synthetic construct]
          Length = 464

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 65/248 (26%), Positives = 94/248 (37%), Gaps = 75/248 (30%)

Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
           +  +R+Q SCGSC+                         P E+  C  +  G        
Sbjct: 246 VSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQYAQGCEGGFPYL 305

Query: 130 -----------TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
                         +C    G    C  + +E+    Y  + ++    Y    NE  +  
Sbjct: 306 IAGKYAQDFGLVEEACFPYTGTDSPC--KMKEDCFRYYSSEYHYVGGFYG-GCNEALMKL 362

Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
           E+  HGP+  AF V+DD + YK G +   G            +RD          F  F+
Sbjct: 363 ELVHHGPMAVAFEVYDDFLHYKKGIYHHTG------------LRD---------PFNPFE 401

Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
                     L  HA+ ++G+G D  S   YW++ NSW T WG+NG F+I RG DEC IE
Sbjct: 402 ----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIE 451

Query: 299 SSITAGVP 306
           S   A  P
Sbjct: 452 SIAVAATP 459


>gi|54696504|gb|AAV38624.1| cathepsin C [synthetic construct]
 gi|54696506|gb|AAV38625.1| cathepsin C [synthetic construct]
 gi|61368207|gb|AAX43130.1| cathepsin C [synthetic construct]
 gi|61368212|gb|AAX43131.1| cathepsin C [synthetic construct]
          Length = 464

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 71/267 (26%), Positives = 101/267 (37%), Gaps = 82/267 (30%)

Query: 84  LPANFDSRTKWPNCPTIR---EIRDQGSCGSCWG----------------------CRPY 118
           LP ++D    W N   I     +R+Q SCGSC+                         P 
Sbjct: 231 LPTSWD----WRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQ 286

Query: 119 EIAPCEHHVNG-------------------TRPSCDASKGHTPKCVRECQENYDVPYKKD 159
           E+  C  +  G                      +C    G    C  + +E+    Y  +
Sbjct: 287 EVVSCSQYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYTGTDSPC--KMKEDCFRYYSSE 344

Query: 160 LNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKW 219
            ++    Y    NE  +  E+  HGP+  AF V+DD + YK G +   G           
Sbjct: 345 YHYVGGFYG-GCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTG----------- 392

Query: 220 TIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
            +RD          F  F+          L  HA+ ++G+G D  S   YW++ NSW T 
Sbjct: 393 -LRD---------PFNPFE----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTG 432

Query: 280 WGDNGLFKILRGKDECGIESSITAGVP 306
           WG+NG F+I RG DEC IES   A  P
Sbjct: 433 WGENGYFRIRRGTDECAIESIAVAATP 459


>gi|1582221|prf||2118248A prepro-cathepsin C
          Length = 463

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 71/267 (26%), Positives = 101/267 (37%), Gaps = 82/267 (30%)

Query: 84  LPANFDSRTKWPNCPTIR---EIRDQGSCGSCWG----------------------CRPY 118
           LP ++D    W N   I     +R+Q SCGSC+                         P 
Sbjct: 231 LPTSWD----WRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQ 286

Query: 119 EIAPCEHHVNG-------------------TRPSCDASKGHTPKCVRECQENYDVPYKKD 159
           E+  C  +  G                      +C    G    C  + +E+    Y  +
Sbjct: 287 EVVSCSQYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYTGTDSPC--KMKEDCFRYYSSE 344

Query: 160 LNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKW 219
            ++    Y    NE  +  E+  HGP+  AF V+DD + YK G +   G           
Sbjct: 345 YHYVGGFYG-GCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTG----------- 392

Query: 220 TIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
            +RD          F  F+          L  HA+ ++G+G D  S   YW++ NSW T 
Sbjct: 393 -LRD---------PFNPFE----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTG 432

Query: 280 WGDNGLFKILRGKDECGIESSITAGVP 306
           WG+NG F+I RG DEC IES   A  P
Sbjct: 433 WGENGYFRIRRGTDECAIESIAVAATP 459


>gi|189083844|ref|NP_001805.3| dipeptidyl peptidase 1 isoform a preproprotein [Homo sapiens]
 gi|1006657|emb|CAA60671.1| cathepsin C [Homo sapiens]
 gi|1947071|gb|AAC51341.1| prepro dipeptidyl peptidase I [Homo sapiens]
 gi|60816242|gb|AAX36375.1| cathepsin C [synthetic construct]
 gi|119579768|gb|EAW59364.1| cathepsin C, isoform CRA_b [Homo sapiens]
 gi|158257666|dbj|BAF84806.1| unnamed protein product [Homo sapiens]
 gi|261858568|dbj|BAI45806.1| cathepsin C [synthetic construct]
          Length = 463

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 71/267 (26%), Positives = 101/267 (37%), Gaps = 82/267 (30%)

Query: 84  LPANFDSRTKWPNCPTIR---EIRDQGSCGSCWG----------------------CRPY 118
           LP ++D    W N   I     +R+Q SCGSC+                         P 
Sbjct: 231 LPTSWD----WRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQ 286

Query: 119 EIAPCEHHVNG-------------------TRPSCDASKGHTPKCVRECQENYDVPYKKD 159
           E+  C  +  G                      +C    G    C  + +E+    Y  +
Sbjct: 287 EVVSCSQYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYTGTDSPC--KMKEDCFRYYSSE 344

Query: 160 LNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKW 219
            ++    Y    NE  +  E+  HGP+  AF V+DD + YK G +   G           
Sbjct: 345 YHYVGGFYG-GCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTG----------- 392

Query: 220 TIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
            +RD          F  F+          L  HA+ ++G+G D  S   YW++ NSW T 
Sbjct: 393 -LRD---------PFNPFE----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTG 432

Query: 280 WGDNGLFKILRGKDECGIESSITAGVP 306
           WG+NG F+I RG DEC IES   A  P
Sbjct: 433 WGENGYFRIRRGTDECAIESIAVAATP 459


>gi|166030326|gb|ABY78830.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 64/201 (31%), Positives = 82/201 (40%), Gaps = 57/201 (28%)

Query: 114 GCRPYEIAPCEHH-VNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
           GC+PY    CEH    G +  C   K  TPKC   C +   +P  K    G  +Y +   
Sbjct: 179 GCQPYPFPHCEHRGAQGNKTPCSKYKFDTPKCNATCTDK-SIPLVKYR--GNATYLLLHG 235

Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
           E+   +E+Y +GP                   FV                          
Sbjct: 236 EEDYKRELYFNGP-------------------FV-------------------------A 251

Query: 233 AFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
            F V+ DL  YKSG         LGG A+RI+GWG+   +   YW +ANSW+TDWG NG 
Sbjct: 252 VFFVYTDLFAYKSGVYRNVDGDFLGGQAVRIVGWGKLNGTP--YWKVANSWDTDWGMNGY 309

Query: 286 FKILRGKDECGIESSITAGVP 306
             IL G +EC IE     G P
Sbjct: 310 MLILGGNNECNIEHLGFTGFP 330



 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 29/78 (37%), Positives = 40/78 (51%), Gaps = 6/78 (7%)

Query: 39  KQAEKNSLSNIPRAHLKSWMG--VHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN 96
           K      + NI  +  K   G  +  + +LP  R  E     ++  +LP +FDS  KWPN
Sbjct: 47  KAVYNGKMQNITFSEAKRLTGAWIQKNSSLPPVRFTE----EQLRTELPESFDSAEKWPN 102

Query: 97  CPTIREIRDQGSCGSCWG 114
           CPTIREI DQ +C + W 
Sbjct: 103 CPTIREIADQSACRASWA 120


>gi|157058749|gb|ABV03132.1| cathepsin B-3098 [Acyrthosiphon pisum]
          Length = 256

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 49/172 (28%), Positives = 76/172 (44%), Gaps = 40/172 (23%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC PY + PC +  +G             KC ++C  + D+ + KD  +    Y ++   
Sbjct: 125 GCEPYRVPPCPYDKDGKNTCSGQPMEPNHKCSKKCYGDEDIDFNKDHRYTRDDYYLTY-- 182

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           + I K++  +GP+E +F V+DD   YKSG +    N +                      
Sbjct: 183 RGIQKDVINYGPIEASFDVYDDFPNYKSGIYVKSENASY--------------------- 221

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
                          LGGH+++++GWGE+      YWL+ NSWN DWGD GL
Sbjct: 222 ---------------LGGHSVKLIGWGEEYGV--LYWLMVNSWNADWGDKGL 256



 Score = 46.2 bits (108), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 21/52 (40%), Positives = 30/52 (57%), Gaps = 8/52 (15%)

Query: 70  RLPELIGYSEVDED--------LPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
           ++P+ + Y+    D        +P  FD+R KW  C TI E+RDQG+CGS W
Sbjct: 6   QIPDKVNYNMYKNDDHADNYQEIPMKFDARKKWIRCKTIGEVRDQGNCGSDW 57



 Score = 38.1 bits (87), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 16/33 (48%), Positives = 22/33 (66%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA 41
           CG GCNGG+P  AW+ +   G+V+GG Y S + 
Sbjct: 93  CGNGCNGGYPIRAWKRFKNHGLVTGGNYKSGEG 125


>gi|242001446|ref|XP_002435366.1| cysteine proteinase, putative [Ixodes scapularis]
 gi|215498696|gb|EEC08190.1| cysteine proteinase, putative [Ixodes scapularis]
          Length = 238

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 52/144 (36%), Positives = 72/144 (50%), Gaps = 31/144 (21%)

Query: 162 FGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTI 221
           F    Y V +NE+ IM+EIY +GPV+    V +D  LY SG   V  +   A +L     
Sbjct: 95  FSTPPYRVPANEEDIMQEIYANGPVQALMLVKEDFFLYSSG---VYKHTRLAHNLPPEYQ 151

Query: 222 RDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGED--EKSKEKYWLIANSWNTD 279
           + +                           H++RILGWG D  +   +KYWL ANSW + 
Sbjct: 152 KSDW--------------------------HSVRILGWGVDRTQYRPQKYWLCANSWGSG 185

Query: 280 WGDNGLFKILRGKDECGIESSITA 303
           WG+NG F+I+RG+DE  IES + A
Sbjct: 186 WGENGYFRIVRGEDESQIESFVLA 209


>gi|194382330|dbj|BAG58920.1| unnamed protein product [Homo sapiens]
          Length = 446

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 65/248 (26%), Positives = 94/248 (37%), Gaps = 75/248 (30%)

Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
           +  +R+Q SCGSC+                         P E+  C  +  G        
Sbjct: 229 VSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQYAQGCEGGFPYL 288

Query: 130 -----------TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
                         +C    G    C  + +E+    Y  + ++    Y    NE  +  
Sbjct: 289 IAGKYAQDFGLVEEACFPYTGTDSPC--KMKEDCFRYYSSEYHYVGGFYG-GCNEALMKL 345

Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
           E+  HGP+  AF V+DD + YK G +   G            +RD          F  F+
Sbjct: 346 ELVHHGPMAVAFEVYDDFLHYKKGIYHHTG------------LRD---------PFNPFE 384

Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
                     L  HA+ ++G+G D  S   YW++ NSW T WG+NG F+I RG DEC IE
Sbjct: 385 ----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDECAIE 434

Query: 299 SSITAGVP 306
           S   A  P
Sbjct: 435 SIAVAATP 442


>gi|395833440|ref|XP_003789742.1| PREDICTED: tubulointerstitial nephritis antigen [Otolemur
           garnettii]
          Length = 464

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 52/145 (35%), Positives = 70/145 (48%), Gaps = 32/145 (22%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           Y +SSNE  IMKEI ++GPV+    V +D   YKSG +                 R   S
Sbjct: 343 YRISSNETEIMKEIMQNGPVQAIMQVHEDFFHYKSGIY-----------------RHVAS 385

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWGDN 283
             G    +            + L  HA+++LGWG     +  KEK+W+ ANSW   WG+N
Sbjct: 386 THGESENY------------RKLRTHAVKLLGWGTLRGAQGRKEKFWIAANSWGKSWGEN 433

Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
           G F+ILRG +E  IE  I A   +L
Sbjct: 434 GYFRILRGVNESDIEKLIIAAWGQL 458


>gi|10803437|emb|CAC13131.1| putative cathepsin B.5 [Ostertagia ostertagi]
          Length = 196

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 48/152 (31%), Positives = 66/152 (43%), Gaps = 38/152 (25%)

Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
           GC+PY I PC HH N T    C   +  TP C  +C   Y  PY  D ++G  +Y+V+  
Sbjct: 72  GCKPYPIPPCGHHKNQTYFGPCPTDEYDTPVCTNKCIAAYKTPYSDDKHYGTSAYNVAKT 131

Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
              I KEI  +GPVE A+TV++D   Y  G +                            
Sbjct: 132 VAGIQKEIMTNGPVEAAYTVYEDFYQYTGGVY---------------------------- 163

Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWGEDEK 264
                     +  G  +GGHA+RILGWG  ++
Sbjct: 164 ---------THTGGAEVGGHAVRILGWGVRQQ 186



 Score = 47.4 bits (111), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 20/37 (54%), Positives = 27/37 (72%)

Query: 7  RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK 43
          + CG GC GG+P  AW+YWVK+GI +GG+Y S+   K
Sbjct: 38 KKCGNGCEGGYPIEAWKYWVKTGICTGGSYESQSGCK 74


>gi|351709947|gb|EHB12866.1| Tubulointerstitial nephritis antigen-like protein [Heterocephalus
           glaber]
          Length = 467

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 75/285 (26%), Positives = 111/285 (38%), Gaps = 98/285 (34%)

Query: 82  EDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG----------------------CRPYE 119
           E LP  F++  KWPN   I +  DQG+C   W                         P  
Sbjct: 201 EVLPKAFEASKKWPN--MIHDPLDQGNCAGSWAFSTAAVASDRVSIHSMGHMTPVLSPQN 258

Query: 120 IAPCE-HHVNGTR------------------PSCDASKGH--------TP---------- 142
           +  C+ HH  G +                    C    GH        TP          
Sbjct: 259 LLSCDTHHQQGCQGGRLDGAWWFLRRRGVVSDHCYPFSGHEQAEAGPATPCMMHSRAMGR 318

Query: 143 ---KCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILY 199
              +  R C  ++D     ++     +Y + S+EK IMKE+ E+GPV+    V++D  LY
Sbjct: 319 GKRQATRRCPNSHDD--ANEIYQVTPAYRLGSDEKEIMKELMENGPVQALMEVYEDFFLY 376

Query: 200 KSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGW 259
           KSG +        + +L+          +G    +            +  G H+++I GW
Sbjct: 377 KSGIY--------SHTLVS---------MGRPEQY------------RRHGTHSVKITGW 407

Query: 260 GED---EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSI 301
           GE+   +    KYW  ANSW   WG+ G F+ILRG +EC IES +
Sbjct: 408 GEEMLPDGRTLKYWTAANSWGPSWGERGYFRILRGSNECDIESFV 452


>gi|201023369|ref|NP_001128426.1| cathepsin B-3483 [Acyrthosiphon pisum]
 gi|328712086|ref|XP_003244726.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
          Length = 355

 Score = 85.1 bits (209), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 62/217 (28%), Positives = 90/217 (41%), Gaps = 50/217 (23%)

Query: 101 REIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPS--------CDASKGHTPKCVREC-QEN 151
           + I   G  GS  GC+P+ + PC        PS        C      TPKC   C    
Sbjct: 180 KGIVTGGDYGSNEGCQPWLVQPCNASTTAADPSSVLGPHGVCGGDPATTPKCDLSCYNAR 239

Query: 152 YDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNET 211
           ++  Y  D+    K ++   +  S  K + +HGP      V++D + YKSG +       
Sbjct: 240 HEGKYLDDIIKAKKVFTF--DGCSARKNLRKHGPYVVTMRVYEDFLAYKSGVYH------ 291

Query: 212 TAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWL 271
                                          + +G  LG  ++R++GWG +    + +WL
Sbjct: 292 -------------------------------HVTGDYLGLLSVRMIGWGLE--GGQAFWL 318

Query: 272 IANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           +ANSW T WGD G FKI R  +EC IE+   AGVP L
Sbjct: 319 LANSWGTSWGDKGFFKIRRFVNECWIENFRYAGVPNL 355



 Score = 45.1 bits (105), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 19/32 (59%), Positives = 24/32 (75%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
           CG GC+GG+   AWRY +K GIV+GG YGS +
Sbjct: 161 CGNGCSGGYTAAAWRYILKKGIVTGGDYGSNE 192


>gi|115621283|ref|XP_782184.2| PREDICTED: tubulointerstitial nephritis antigen-like
           [Strongylocentrotus purpuratus]
          Length = 450

 Score = 85.1 bits (209), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 49/147 (33%), Positives = 72/147 (48%), Gaps = 31/147 (21%)

Query: 158 KDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLI 217
            DL      Y +++ E  IM EIY++GPV+  F V +D  +Y  G +     E TA    
Sbjct: 320 SDLYLSTPPYRIAAREVDIMTEIYQNGPVQATFNVKNDFFVYNRGVYRNVKQEFTA---- 375

Query: 218 KWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEK---SKEKYWLIAN 274
                   SQ  ++ A                G H+++I+GWG D     +  KYWL  N
Sbjct: 376 --------SQSDSDQA----------------GWHSVKIVGWGIDRSDWYNPIKYWLCTN 411

Query: 275 SWNTDWGDNGLFKILRGKDECGIESSI 301
           SW  +WG+ G+F+I+RG +EC IES +
Sbjct: 412 SWGRNWGEQGMFRIVRGVNECEIESFV 438


>gi|29840882|gb|AAP05883.1| similar to GenBank Accession Number X70968 cathepsin B in
           Schistosoma japonicum [Schistosoma japonicum]
          Length = 312

 Score = 85.1 bits (209), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 51/148 (34%), Positives = 67/148 (45%), Gaps = 38/148 (25%)

Query: 114 GCRPYEIAPCEHHVNGT-RPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
           GC+PY    C HH       SC+     TP+C + CQ +Y + Y+ D  +G  SY V+S+
Sbjct: 189 GCQPYPFPECIHHSTSINHSSCEVKYYSTPECYQTCQPDYAIQYENDKYYGKSSYYVTSD 248

Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
           E SIMKEI  +GPVE  F V+DD + YK+G +                            
Sbjct: 249 EVSIMKEILLNGPVEATFYVYDDFLNYKTGVY---------------------------- 280

Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWG 260
                     Y +G  LGGHAIRI   G
Sbjct: 281 ---------KYVTGSLLGGHAIRITWLG 299



 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 20/27 (74%), Positives = 22/27 (81%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           CGFGCNGG PGMAW YW   GIV+GG+
Sbjct: 157 CGFGCNGGIPGMAWDYWKDEGIVTGGS 183


>gi|195384166|ref|XP_002050789.1| GJ20006 [Drosophila virilis]
 gi|194145586|gb|EDW61982.1| GJ20006 [Drosophila virilis]
          Length = 432

 Score = 85.1 bits (209), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 78/294 (26%), Positives = 111/294 (37%), Gaps = 92/294 (31%)

Query: 67  PANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAP---- 122
           P  R+  +   +   +DLP  F++  KW +   I E+ DQG CGS W      +A     
Sbjct: 170 PTYRVKAMTRLTNPSDDLPRKFNAVEKWSS--YISEVPDQGWCGSSWVLSTTSVASDRFA 227

Query: 123 -------------------------CE----------HHVNGT-----------RPSCDA 136
                                    CE           H  G            R SC  
Sbjct: 228 IQSQGKEVVQLSAQNILSCTRRQQGCEGGHLDAAWRYLHKKGVLDEKCYPYTQHRDSCKI 287

Query: 137 SKGHTPKCVRE--CQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFD 194
            + H  + ++   CQ  Y V  +  L     +YS+S  E  IM EIY  GPV+    ++ 
Sbjct: 288 QR-HNSRSLKANGCQPAYGVN-RDSLYTVGPAYSLS-READIMAEIYHSGPVQATMRIYR 344

Query: 195 DLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAI 254
           D   Y  G +                 R   +  GA   F                 H++
Sbjct: 345 DFFSYSGGIY-----------------RQTAANRGAPTGF-----------------HSV 370

Query: 255 RILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           +++GWGE E    KYW+ ANSW   WG++G F+ILRG +ECGIE  + A  P +
Sbjct: 371 KLVGWGE-EHDGVKYWIAANSWGPWWGEHGYFRILRGSNECGIEEYVLASWPYV 423


>gi|294916338|ref|XP_002778359.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239886683|gb|EER10154.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 105

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 44/101 (43%), Positives = 63/101 (62%), Gaps = 15/101 (14%)

Query: 219 WTIRDNTSQLGAEG----AFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKE 267
           +++ D  + +  +G    +FTV++D + Y+SG         LGGHA++I+GWGE  KS +
Sbjct: 9   YSVNDAKNAIRTDGPVSASFTVYEDFLAYRSGVYKHTSGSYLGGHAVKIIGWGE--KSGQ 66

Query: 268 KYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
            YWL  NSWN DWGD+GLFKI  G   CGI+  +  G PK+
Sbjct: 67  AYWLAVNSWNEDWGDHGLFKIALGN--CGIDDDLLGGTPKV 105


>gi|3087803|emb|CAA93279.1| cysteine protease [Haemonchus contortus]
          Length = 325

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 49/166 (29%), Positives = 72/166 (43%), Gaps = 39/166 (23%)

Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEK 174
           CR +   PC HH N T       +  TPKC   C   Y   Y  D   G  +Y + ++ K
Sbjct: 192 CRSHPFPPCGHHGNETYYGECGGRARTPKCRTSCTPGYKNSYSDDKIRGKDAYELPNSVK 251

Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
           +I +EI ++GPV  AFTV+ D   YK G                                
Sbjct: 252 AIQREIMKNGPVVAAFTVYADFSYYKKG-------------------------------- 279

Query: 235 TVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDW 280
                +  + +G+A G HA++++GWGE+      YW++ NSW+ DW
Sbjct: 280 -----IYKHTAGRARGSHAVKVIGWGEE--GDVPYWIVKNSWHNDW 318



 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 22/46 (47%), Positives = 32/46 (69%)

Query: 69  NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           NR P +   ++  +D+P +FD+RT WPNC ++  IRDQ +CGSCW 
Sbjct: 79  NREPIVGDENDEGDDIPESFDARTHWPNCSSLTHIRDQANCGSCWA 124


>gi|268572247|ref|XP_002648914.1| Hypothetical protein CBG17827 [Caenorhabditis briggsae]
          Length = 150

 Score = 84.7 bits (208), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 56/181 (30%), Positives = 74/181 (40%), Gaps = 58/181 (32%)

Query: 103 IRDQGSCGSCWGCRPYEIAP---CEHHVNGTRP-----------------SCDAS-KGHT 141
           IR+Q +CGSCW     E+     C       +P                  CD   K  T
Sbjct: 2   IRNQTNCGSCWAFGAAEVISDRICIVTKGARQPIISPTDMLDCCGEYCGYGCDGCPKAVT 61

Query: 142 PKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKS 201
           PKC   CQ  Y+  Y KD NFG+ +Y V  N   I  EI  +GPVE +FTV++D  +YK 
Sbjct: 62  PKCALSCQSKYNTEYAKDKNFGSSAYYVGRNFSVIQTEIMTNGPVEASFTVYEDFYIYKK 121

Query: 202 GRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE 261
           G +                                      Y +G+ LGGHAI+I+GWG 
Sbjct: 122 GVY-------------------------------------QYTAGEVLGGHAIKIIGWGT 144

Query: 262 D 262
           +
Sbjct: 145 E 145


>gi|307938279|ref|NP_001182763.1| dipeptidyl peptidase 1 precursor [Canis lupus familiaris]
          Length = 459

 Score = 84.7 bits (208), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 64/244 (26%), Positives = 93/244 (38%), Gaps = 68/244 (27%)

Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNGTR---PSC 134
           +  +R+Q SCGSC+                         P EI  C  +  G     P  
Sbjct: 243 VSPVRNQASCGSCYAFASTAMLEARIRILTNNTQTPILSPQEIVSCSQYAQGCEGGFPYL 302

Query: 135 DASKGHTPKCVRE------------CQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYE 182
            A K      + E            C+ N    Y     +    +  + NE  +  E+  
Sbjct: 303 IAGKYAQDFGLVEEACFPYAGSDSPCKPNDCFRYYSSEYYYVGGFYGACNEALMKLELVR 362

Query: 183 HGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLIL 242
           HGP+  AF V+DD   Y+ G ++  G            +RD          F  F+    
Sbjct: 363 HGPMAVAFEVYDDFFHYQKGIYYHTG------------LRD---------PFNPFE---- 397

Query: 243 YKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSIT 302
                 L  HA+ ++G+G D  S   YW++ NSW + WG++G F+I RG DEC IES   
Sbjct: 398 ------LTNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFRIRRGTDECAIESIAV 451

Query: 303 AGVP 306
           A  P
Sbjct: 452 AATP 455


>gi|431838263|gb|ELK00195.1| Tubulointerstitial nephritis antigen [Pteropus alecto]
          Length = 425

 Score = 84.7 bits (208), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 52/147 (35%), Positives = 69/147 (46%), Gaps = 36/147 (24%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSG--RFFVPGNETTAMSLIKWTIRDN 224
           Y VSSNE  IMKEI  +GPV+    V +D   YKSG  R     NE +            
Sbjct: 304 YRVSSNETEIMKEIIHNGPVQAIMQVHEDFFHYKSGIYRHVTSTNEKS------------ 351

Query: 225 TSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWG 281
                              +  + L  HA+++ GWG     +  KEK+W++ANSW   WG
Sbjct: 352 -------------------EKYQKLQTHAVKLTGWGTLRGAQGRKEKFWIVANSWGNSWG 392

Query: 282 DNGLFKILRGKDECGIESSITAGVPKL 308
           +NG F+ILRG +E  IE  I A   +L
Sbjct: 393 ENGYFRILRGVNESDIEKLIIAAWGQL 419


>gi|3087799|emb|CAA93276.1| cysteine proteinase [Haemonchus contortus]
          Length = 350

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 51/168 (30%), Positives = 76/168 (45%), Gaps = 41/168 (24%)

Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGH-TPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           CRPY   PC  H +G R  C       TP C   CQ  Y   Y+KD  F   +Y + ++E
Sbjct: 193 CRPYAFHPCGLH-HGRRYDCPWDHSFSTPACKPYCQFGYGKRYEKDKFFVKSTYILDNDE 251

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K I +E+ ++GPV+ AF  ++D   YK G                               
Sbjct: 252 KVIQREMMKNGPVQAAFITYEDFSPYKGG------------------------------- 280

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWG 281
                 + ++  G+  G HA++++GWG +  +  KYW +ANSW+ DWG
Sbjct: 281 ------IYVHVKGRERGAHAVKLIGWGVENGT--KYWTVANSWHDDWG 320



 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 26/69 (37%), Positives = 37/69 (53%), Gaps = 2/69 (2%)

Query: 48  NIPRAHLKSWMGVHPDYNLPANRLPELIGYSE--VDEDLPANFDSRTKWPNCPTIREIRD 105
           N  +A  +    +  DY   A +L ++    E   +ED+P +FDSR  W NC +I  +RD
Sbjct: 56  NTSKAEERMAHLMKTDYIRNARKLYKVKKAEEQTTNEDIPESFDSRIVWKNCSSITYVRD 115

Query: 106 QGSCGSCWG 114
           Q  CGSCW 
Sbjct: 116 QSRCGSCWA 124



 Score = 38.5 bits (88), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 15/33 (45%), Positives = 22/33 (66%)

Query: 7   RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
           R+CG GC GG+  +AW +  + G+V+GG Y  K
Sbjct: 158 RMCGDGCEGGYDHLAWEWVQRFGVVTGGPYQQK 190


>gi|17933077|gb|AAL48195.1| cathepsin C [Homo sapiens]
          Length = 463

 Score = 84.3 bits (207), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 48/135 (35%), Positives = 64/135 (47%), Gaps = 31/135 (22%)

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           NE  +  E+  HGP+  AF V+DD + YK G +   G            +RD        
Sbjct: 356 NEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTG------------LRD-------- 395

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
             F  F+          L  HA+ ++G+G D  S   YW++ NSW T WG+NG F+I RG
Sbjct: 396 -PFNPFE----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIRRG 444

Query: 292 KDECGIESSITAGVP 306
            DEC IES   A  P
Sbjct: 445 TDECAIESIAVAATP 459


>gi|111054118|gb|ABH04250.1| cathepsin B precursor [Sus scrofa]
          Length = 61

 Score = 84.3 bits (207), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 39/59 (66%), Positives = 46/59 (77%), Gaps = 2/59 (3%)

Query: 243 YKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSI 301
           + +G  +GGHAIRILGWG +  +   YWL+ NSWNTDWGDNG FKILRG+D CGIES I
Sbjct: 5   HVTGDLMGGHAIRILGWGVENGTP--YWLVGNSWNTDWGDNGFFKILRGQDHCGIESEI 61


>gi|389608479|dbj|BAM17849.1| tubulointerstitial nephritis antigen [Papilio xuthus]
          Length = 429

 Score = 84.3 bits (207), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 71/258 (27%), Positives = 109/258 (42%), Gaps = 25/258 (9%)

Query: 66  LPANRLPELIGYSEVDEDLP--ANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPC 123
            P N     +G    D+D+P    FD+RT+WP    I  I DQG CGS W      +A  
Sbjct: 170 FPLNAETRRMGPLRYDKDVPYPTQFDARTRWPG--FISPIVDQGWCGSDWAVSLAGVASD 227

Query: 124 EHHV--NGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIY 181
              +  NG      + +      VR  Q  +        NF A+ + +   +    K   
Sbjct: 228 RFAIQSNGAENMVLSPQTLLSCNVRAQQGCHGGHIDVAWNF-ARGHGLVDEKCFPYKASV 286

Query: 182 EHGPVEGAFTVFDD----LILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVF 237
              P      +  D    L+  ++ R+ +       +S  K  + D       +   TV+
Sbjct: 287 TRCPFRPRGNLIQDGCMPLVKRRTSRYKL--GPPAKLSHEKDIMYDIMESGPVQAVMTVY 344

Query: 238 DDLILYKSG----------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFK 287
            D   Y+ G          +  G H++RI+GWGED    ++YW++ANSW   WG+NG F+
Sbjct: 345 QDFFHYRDGVYRRSYHGNNELKGFHSVRIIGWGEDR--GDRYWVVANSWGRQWGENGYFR 402

Query: 288 ILRGKDECGIESSITAGV 305
           I RG +E  IES +  G+
Sbjct: 403 IARGSNEADIESFVVTGL 420


>gi|22653678|sp|O97578.1|CATC_CANFA RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
           AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
           peptidase I; Short=DPP-I; Short=DPPI; AltName:
           Full=Dipeptidyl transferase; Contains: RecName:
           Full=Dipeptidyl peptidase 1 exclusion domain chain;
           AltName: Full=Dipeptidyl peptidase I exclusion domain
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           heavy chain 1; AltName: Full=Dipeptidyl peptidase I
           heavy chain 1; Contains: RecName: Full=Dipeptidyl
           peptidase 1 heavy chain 2; AltName: Full=Dipeptidyl
           peptidase I heavy chain 2; Contains: RecName:
           Full=Dipeptidyl peptidase 1 heavy chain 3; AltName:
           Full=Dipeptidyl peptidase I heavy chain 3; Contains:
           RecName: Full=Dipeptidyl peptidase 1 heavy chain 4;
           AltName: Full=Dipeptidyl peptidase I heavy chain 4;
           Contains: RecName: Full=Dipeptidyl peptidase 1 light
           chain; AltName: Full=Dipeptidyl peptidase I light chain;
           Flags: Precursor
 gi|4106126|gb|AAD02704.1| dipeptidyl peptidase I [Canis lupus familiaris]
          Length = 435

 Score = 84.3 bits (207), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 64/244 (26%), Positives = 93/244 (38%), Gaps = 68/244 (27%)

Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNGTR---PSC 134
           +  +R+Q SCGSC+                         P EI  C  +  G     P  
Sbjct: 219 VSPVRNQASCGSCYAFASTAMLEARIRILTNNTQTPILSPQEIVSCSQYAQGCEGGFPYL 278

Query: 135 DASKGHTPKCVRE------------CQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYE 182
            A K      + E            C+ N    Y     +    +  + NE  +  E+  
Sbjct: 279 IAGKYAQDFGLVEEACFPYAGSDSPCKPNDCFRYYSSEYYYVGGFYGACNEALMKLELVR 338

Query: 183 HGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLIL 242
           HGP+  AF V+DD   Y+ G ++  G            +RD          F  F+    
Sbjct: 339 HGPMAVAFEVYDDFFHYQKGIYYHTG------------LRD---------PFNPFE---- 373

Query: 243 YKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSIT 302
                 L  HA+ ++G+G D  S   YW++ NSW + WG++G F+I RG DEC IES   
Sbjct: 374 ------LTNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFRIRRGTDECAIESIAV 427

Query: 303 AGVP 306
           A  P
Sbjct: 428 AATP 431


>gi|32129433|sp|P92131.3|CATB1_GIALA RecName: Full=Cathepsin B-like CP1; AltName: Full=Cathepsin B-like
           protease B1; Flags: Precursor
          Length = 303

 Score = 84.3 bits (207), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 78/286 (27%), Positives = 119/286 (41%), Gaps = 39/286 (13%)

Query: 39  KQAEKNSLSNIPRAHLKSWMGVHPDY------NLPANRLPELIGYSEVDEDLPANFDSRT 92
           K        N+     +S M + PD       +LP   + E+    E+ + +P  FD R 
Sbjct: 32  KAGMPKRFENVTEDEFRS-MLIRPDRLRARSGSLPPISITEV---QELVDPIPPQFDFRD 87

Query: 93  KWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGT-RPSCDASKGHTPKCVRE---C 148
           ++P C  ++   DQGSCGSCW      +        G  + +   S+ H   C  E   C
Sbjct: 88  EYPQC--VKPALDQGSCGSCWAFSAIGVFGDRRCAMGIDKEAVSYSQQHLISCSLENFGC 145

Query: 149 QENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDD---LILYKSGRFF 205
                 P    L F       ++  + +    Y H        V DD   + LYK+  + 
Sbjct: 146 DGGDFQPTWSFLTFTG-----ATTAECVKYVDYGHTVASPCPAVCDDGSPIQLYKAHGY- 199

Query: 206 VPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKA--------LGGHAIRIL 257
             G  + ++  I   +         +    V+ DL  Y+SG          LG HA+ I+
Sbjct: 200 --GQVSKSVPAIMGMLVAGGP---LQTMIVVYADLSYYESGVYKHTYGTINLGFHALEIV 254

Query: 258 GWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITA 303
           G+G  +   + YW+I NSW  DWG+NG F+I+RG +EC IE  I A
Sbjct: 255 GYGTTDDGTD-YWIIKNSWGPDWGENGYFRIVRGVNECRIEDEIYA 299


>gi|291384116|ref|XP_002708690.1| PREDICTED: cathepsin C [Oryctolagus cuniculus]
          Length = 463

 Score = 84.0 bits (206), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 72/268 (26%), Positives = 101/268 (37%), Gaps = 82/268 (30%)

Query: 83  DLPANFDSRTKWPNCPTIR---EIRDQGSCGSCWG----------------------CRP 117
           DLPA++D    W N   I     +R+Q SCGSC+                         P
Sbjct: 230 DLPASWD----WRNVGGINFVSPVRNQESCGSCYSFASVGMLEARIRILTNNSQTPILSP 285

Query: 118 YEIAPCEHHVNG-------------------TRPSCDASKGHTPKCVRECQENYDVPYKK 158
            EI  C  +  G                       C    G    C  + +E+    Y  
Sbjct: 286 QEIVSCSQYAQGCNGGFPYLIAGKYAQDFGLVEEDCFPYTGTDSPC--KMKEDCFRYYSS 343

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
           + ++    Y    NE  +  E+  HGP+  AF V+DD + Y  G +   G          
Sbjct: 344 EYHYVGGFYG-GCNEALMKLELVHHGPMAVAFEVYDDFLHYHKGIYHHTG---------- 392

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
             +RD          F  F+          L  HA+ ++G+G D  +   YW++ NSW T
Sbjct: 393 --LRD---------PFNPFE----------LTNHAVLLVGYGTDPATGVDYWIVKNSWGT 431

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVP 306
            WG+NG F+I RG DEC IES   A  P
Sbjct: 432 SWGENGYFRIRRGTDECAIESIAVAATP 459


>gi|363729389|ref|XP_417207.2| PREDICTED: dipeptidyl peptidase 1 [Gallus gallus]
          Length = 460

 Score = 84.0 bits (206), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 70/283 (24%), Positives = 104/283 (36%), Gaps = 83/283 (29%)

Query: 67  PANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGC----------- 115
           PA   PEL+   +    LP ++D R        +  +R+Q SCGSC+             
Sbjct: 214 PAPLTPELL---KKVSGLPESWDWRNV-NGVNYVSPVRNQASCGSCYAFASMGMLEARIR 269

Query: 116 -----------RPYEIAPCEHHVNG-------------------TRPSCDASKGHTPKCV 145
                       P ++  C  +  G                       C         C+
Sbjct: 270 ILTNNTQKPVFSPQQVVSCSQYSQGCDGGFPYLIAGKYVQDFGVVEEDCFPYTAKDTPCL 329

Query: 146 --RECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGR 203
             R C   Y   Y     F       + NE  +  E+   GP+  AF V++D + YK G 
Sbjct: 330 FKRSCYHYYTSEYHYVGGFYG-----ACNEALMKLELVLSGPMAVAFEVYNDFMFYKEGI 384

Query: 204 FFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDE 263
           +                        G +  F  F+          L  HA+ ++G+G+D 
Sbjct: 385 Y---------------------HHTGLKDEFNPFE----------LTNHAVLLVGYGKDP 413

Query: 264 KSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           +S EK+W++ NSW T WG++G F+I RG DEC IES   A  P
Sbjct: 414 ESGEKFWIVKNSWGTSWGEDGYFRIRRGTDECAIESIAVAATP 456


>gi|312383398|gb|EFR28501.1| hypothetical protein AND_03481 [Anopheles darlingi]
          Length = 573

 Score = 84.0 bits (206), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 69/248 (27%), Positives = 106/248 (42%), Gaps = 35/248 (14%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHV-NGTRPSCDASKGHTP 142
           LP++FD+   WP    + E RDQG CGS W      +A     + +  R     +     
Sbjct: 296 LPSHFDAADHWPR--LVGEARDQGWCGSSWALSTTTMASDRFAILSKGREQVQLAPQQLL 353

Query: 143 KCVRECQE----NYDVPYKKDLNFGAKS-----YSVSSNEKSIMK-EIYEHGPVEGAFTV 192
            CVR  Q     + D  ++     G  +     Y  + N+  I   +       E    V
Sbjct: 354 ACVRRQQACSGGHLDTAWQYLRRVGVVNDECYPYIAAKNQCKINDGDTLVSANCELPANV 413

Query: 193 FDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG------ 246
            +   +Y+ G  +   NET  M+ IK        +   +    V+ D   Y++G      
Sbjct: 414 -NRTAMYRMGPAYSLNNETDIMTEIK-------ERGTVQAILRVYRDFFSYQNGIYRHSA 465

Query: 247 ------KALGGHAIRILGWGEDEKSKE--KYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
                 +    H++R++GWGE+    +  KYW+  NSW T WG+NG F+ILRG +EC IE
Sbjct: 466 AATPAEERSAYHSVRLIGWGEERVGYDMVKYWIAVNSWGTWWGENGRFRILRGTNECEIE 525

Query: 299 SSITAGVP 306
           S + A  P
Sbjct: 526 SYVLASNP 533


>gi|426221788|ref|XP_004005089.1| PREDICTED: tubulointerstitial nephritis antigen-like [Ovis aries]
          Length = 362

 Score = 84.0 bits (206), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 78/298 (26%), Positives = 114/298 (38%), Gaps = 101/298 (33%)

Query: 69  NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG-------------- 114
           N +  ++G  EV   LP  F++  KWPN   I +  DQG+C   W               
Sbjct: 86  NEIHTVLGPGEV---LPRTFEASEKWPN--LIHDPLDQGNCAGSWAFSTAAVASDRVSIH 140

Query: 115 --------CRPYEIAPCEHH----VNGTR---------------PSCDASKGH------- 140
                     P  +  C+ H     +G R                 C    GH       
Sbjct: 141 SLGHMSPVLSPQNLLSCDTHNQQGCHGGRLDGAWWFLRRRGVVSDHCYPFSGHGRDEAVP 200

Query: 141 TPKCVRE--------------CQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPV 186
            P C+                C  +Y   +  D+     +Y + SNEK IMKE+ E+GPV
Sbjct: 201 APPCMMHSRAMGRGKRQATARCPNSY--VHANDIYQVTPAYRLGSNEKEIMKELMENGPV 258

Query: 187 EGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG 246
           +    V +D  LY+SG +       T +SL +                         +  
Sbjct: 259 QALMEVHEDFFLYQSGIY-----SHTPVSLGR------------------------PERY 289

Query: 247 KALGGHAIRILGWGED---EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSI 301
           +  G H+++I GWGE+   +    KYW  ANSW   WG+ G F+I+RG +EC IES +
Sbjct: 290 RRHGTHSVKITGWGEETLPDGRTVKYWTAANSWGPAWGERGHFRIVRGANECDIESFV 347


>gi|431838501|gb|ELK00433.1| Dipeptidyl-peptidase 1 [Pteropus alecto]
          Length = 460

 Score = 84.0 bits (206), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 63/248 (25%), Positives = 90/248 (36%), Gaps = 75/248 (30%)

Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
           +  +R+Q SCGSC+                         P E+  C  +  G        
Sbjct: 243 VTPVRNQASCGSCYSFASVGMLEARIRILTNNTQSPILSPQEVVSCSQYAQGCEGGFPYL 302

Query: 130 -----------TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
                         +C    G    C  + +EN    Y  + ++    Y    NE  +  
Sbjct: 303 IAGKYAQDFGLVEETCFPYTGTDSPC--KLKENCFRYYSSEYHYVGGFYG-GCNEALMKL 359

Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
           E+  HGP+  AF V+DD + Y  G +                        G +  F  F+
Sbjct: 360 ELVHHGPMAVAFEVYDDFLHYHKGIY---------------------HHTGLKDPFNPFE 398

Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
                     L  HA+ ++G+G D  S   YW + NSW T WG+NG F+I RG DEC IE
Sbjct: 399 ----------LTNHAVLLVGYGTDPASGLNYWTVKNSWGTSWGENGYFRIRRGTDECAIE 448

Query: 299 SSITAGVP 306
           S   A  P
Sbjct: 449 SIAMAATP 456


>gi|130502070|ref|NP_001076255.1| tubulointerstitial nephritis antigen [Oryctolagus cuniculus]
 gi|818411|gb|AAC48477.1| tubulointerstitial nephritis antigen [Oryctolagus cuniculus]
          Length = 474

 Score = 84.0 bits (206), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 51/147 (34%), Positives = 71/147 (48%), Gaps = 36/147 (24%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSG--RFFVPGNETTAMSLIKWTIRDN 224
           Y VSSNE  IMKEI ++GPV+    V +D   YK+G  R  +  NE +            
Sbjct: 353 YRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHVISTNEES------------ 400

Query: 225 TSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKS---KEKYWLIANSWNTDWG 281
                              +  + L  HA+++ GWG  + +   KEK+W+ ANSW   WG
Sbjct: 401 -------------------EKYRKLQTHAVKLTGWGTLKGARGQKEKFWIAANSWGKSWG 441

Query: 282 DNGLFKILRGKDECGIESSITAGVPKL 308
           +NG F+ILRG +E  IE  I A   +L
Sbjct: 442 ENGYFRILRGVNESDIEKLIIAAWGQL 468


>gi|358254887|dbj|GAA56530.1| cathepsin C [Clonorchis sinensis]
          Length = 362

 Score = 84.0 bits (206), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 74/281 (26%), Positives = 109/281 (38%), Gaps = 82/281 (29%)

Query: 72  PELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG----------------- 114
           PEL+   E    LP  FD R + P+   +  +R+Q  CGSC+                  
Sbjct: 120 PELL---EASRYLPDEFDWRKQSPS--PVTPVRNQEVCGSCYAFASAAALEARIRLVSNF 174

Query: 115 -----CRPYEIAPCEHHVNG-------------------TRPSCDASKG-HTPKCVRE-- 147
                  P ++  C  +  G                      SCD   G    KC  +  
Sbjct: 175 TEEPILSPQDVVDCSPYSEGCDGGFPYLIAGKYAEDFGIPLESCDPYTGVKANKCPTKPG 234

Query: 148 CQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVP 207
           C+  Y   Y+         Y  + +E  +  E+   GP    F V+DD + YKSG     
Sbjct: 235 CRRYYATNYRY-----LGGYYGACSELLMRMELVHGGPFPIGFEVYDDFVHYKSG----- 284

Query: 208 GNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKE 267
                        +  +T+       F  F+          L  HA+ ++G+G DE+SK 
Sbjct: 285 -------------VYRHTNIRHPLKRFEPFE----------LTNHAVLLVGYGFDEESKL 321

Query: 268 KYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
            YW++ NSW T+WG++G F+ILRG DEC +ES      P L
Sbjct: 322 PYWIVKNSWGTEWGEDGFFRILRGSDECAVESLAVVFDPVL 362


>gi|114639716|ref|XP_508684.2| PREDICTED: dipeptidyl peptidase 1 isoform 2 [Pan troglodytes]
 gi|397526223|ref|XP_003833035.1| PREDICTED: dipeptidyl peptidase 1 [Pan paniscus]
 gi|410219182|gb|JAA06810.1| cathepsin C [Pan troglodytes]
 gi|410260226|gb|JAA18079.1| cathepsin C [Pan troglodytes]
 gi|410304128|gb|JAA30664.1| cathepsin C [Pan troglodytes]
 gi|410353831|gb|JAA43519.1| cathepsin C [Pan troglodytes]
          Length = 463

 Score = 84.0 bits (206), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 64/248 (25%), Positives = 94/248 (37%), Gaps = 75/248 (30%)

Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
           +  +R+Q SCGSC+                         P E+  C  +  G        
Sbjct: 246 VSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQYAQGCEGGFPYL 305

Query: 130 -----------TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
                         +C    G    C  + +E+    Y  + ++    Y    NE  +  
Sbjct: 306 IAGKYAQDFGLVEEACFPYTGTDSPC--KMKEDCFRYYSSEYHYVGGFYG-GCNEALMKL 362

Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
           E+  HGP+  AF V+DD + YK G +   G            +RD          F  F+
Sbjct: 363 ELVHHGPMAVAFEVYDDFLHYKKGIYHHTG------------LRD---------PFNPFE 401

Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
                     L  HA+ ++G+G D  S   YW++ NSW T WG++G F+I RG DEC IE
Sbjct: 402 ----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGEDGYFRIRRGTDECAIE 451

Query: 299 SSITAGVP 306
           S   A  P
Sbjct: 452 SIAVAATP 459


>gi|426370061|ref|XP_004051995.1| PREDICTED: dipeptidyl peptidase 1 [Gorilla gorilla gorilla]
          Length = 463

 Score = 83.6 bits (205), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 70/267 (26%), Positives = 101/267 (37%), Gaps = 82/267 (30%)

Query: 84  LPANFDSRTKWPNCPTIR---EIRDQGSCGSCWG----------------------CRPY 118
           LP ++D    W N   I     +R+Q SCGSC+                         P 
Sbjct: 231 LPTSWD----WRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQ 286

Query: 119 EIAPCEHHVNG-------------------TRPSCDASKGHTPKCVRECQENYDVPYKKD 159
           E+  C  +  G                      +C    G    C  + +E+    Y  +
Sbjct: 287 EVVSCSQYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYTGTDSPC--KMKEDCFRYYSSE 344

Query: 160 LNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKW 219
            ++    Y    NE  +  E+  HGP+  AF V+DD + YK G +   G           
Sbjct: 345 YHYVGGFYG-GCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTG----------- 392

Query: 220 TIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
            +RD          F  F+          L  HA+ ++G+G D  S   YW++ NSW T 
Sbjct: 393 -LRD---------PFNPFE----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTG 432

Query: 280 WGDNGLFKILRGKDECGIESSITAGVP 306
           WG++G F+I RG DEC IES   A  P
Sbjct: 433 WGEDGYFRIRRGTDECAIESIAVAATP 459


>gi|358421824|ref|XP_003585145.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bos taurus]
          Length = 428

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 78/296 (26%), Positives = 113/296 (38%), Gaps = 97/296 (32%)

Query: 69  NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG-------------- 114
           N +  ++G  EV   LP  F++  KWPN   I +  DQG+C   W               
Sbjct: 152 NEIHTVLGPGEV---LPRTFEASEKWPN--LIHDPLDQGNCAGSWAFSTAAVASDRVSIH 206

Query: 115 --------CRPYEIAPCE-HHVNGTR------------------PSCDASKGH------- 140
                     P  +  C+ H+  G R                    C    GH       
Sbjct: 207 SLGHMSPVLSPQNLLSCDTHNQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGHGRDEAVP 266

Query: 141 TPKCVREC--------QENYDVP----YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEG 188
            P C+           Q     P    +  D+     +Y + SNEK IMKE+ E+GPV+ 
Sbjct: 267 APPCMMHSRAMGRGKRQATARCPNSYVHANDIYQVTPAYRLGSNEKEIMKELMENGPVQA 326

Query: 189 AFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKA 248
              V +D  LY+SG +       T +SL +                         +  + 
Sbjct: 327 LMEVHEDFFLYQSGIY-----SHTPVSLGR------------------------PERYRR 357

Query: 249 LGGHAIRILGWGED---EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSI 301
            G H+++I GWGE+   +    KYW  ANSW   WG+ G F+I+RG +EC IES +
Sbjct: 358 HGTHSVKITGWGEETLPDGRTIKYWTAANSWGPAWGERGHFRIVRGANECDIESFV 413


>gi|67867504|gb|AAH98085.1| Unknown (protein for MGC:107782) [Xenopus (Silurana) tropicalis]
          Length = 458

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 69/286 (24%), Positives = 109/286 (38%), Gaps = 84/286 (29%)

Query: 66  LPANRLPELIGYSEVDEDLPANFDSRTKWPNCP---TIREIRDQGSCGSCWG-------- 114
           +P    P  +   E  + LP  +D    W N      +  +R+Q SCGSC+         
Sbjct: 208 IPMRPRPAPLPTDEKYQGLPTEWD----WRNIAGYNFVTPVRNQASCGSCYAFSSMGMLE 263

Query: 115 --------------CRPYEIAPCEHHVNGTR---PSCDASK-----------------GH 140
                           P ++  C ++  G     P   A K                   
Sbjct: 264 SRIQIRSQLSQKPILSPQQVVSCSNYSQGCEGGFPYLIAGKYVSDYGIVEESDLPYTGSD 323

Query: 141 TPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYK 200
           +P  +++ Q+ Y   Y  + ++    Y    NE  +  E+   GP+  AF V+DD + Y+
Sbjct: 324 SPCTLKDSQQKY---YTAEYHYVGGFYG-GCNEAYMKLELVLGGPLSVAFEVYDDFMHYR 379

Query: 201 SGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWG 260
           SG +                        G +  F  F           L  HA+ ++G+G
Sbjct: 380 SGVY---------------------HHTGLQDKFNPFQ----------LTNHAVLLVGYG 408

Query: 261 EDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
            D+++ EKYW++ NSW   WG+ G F+I RG DEC IES   +  P
Sbjct: 409 TDQQTGEKYWIVKNSWGESWGEKGYFRIRRGTDECAIESIAVSAEP 454


>gi|158285208|ref|XP_001687862.1| AGAP007684-PA [Anopheles gambiae str. PEST]
 gi|158285210|ref|XP_308187.4| AGAP007684-PB [Anopheles gambiae str. PEST]
 gi|157019881|gb|EDO64511.1| AGAP007684-PA [Anopheles gambiae str. PEST]
 gi|157019882|gb|EAA04576.4| AGAP007684-PB [Anopheles gambiae str. PEST]
          Length = 463

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 74/271 (27%), Positives = 112/271 (41%), Gaps = 47/271 (17%)

Query: 67  PANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW------------- 113
           P  R+  +   S     LP  FD+   W     + E RDQG CGS W             
Sbjct: 170 PRFRVKAMKRLSNKGGHLPTRFDASEHWTG--LVAEARDQGWCGSSWAFSTATMASDRFA 227

Query: 114 ----GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSV 169
               G    ++AP +  +   R     S GH     +  +    V  +      A++   
Sbjct: 228 ILSKGREMVQLAP-QQMLACVRRQQGCSGGHLDTAWQYLRRTGVVNEECYPYIAAQNVCK 286

Query: 170 SSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLG 229
            SN+ +++    E  PV+      +  ++YK G  F   NET  M+ IK        +  
Sbjct: 287 ISNDDTLITANCEL-PVK-----VNRTLMYKMGPAFSLNNETDIMAEIK-------DRGT 333

Query: 230 AEGAFTVFDDLILYKSG------------KALGGHAIRILGWGEDEKSKE--KYWLIANS 275
            +    V+ D   Y+SG            +    H++R++GWGE+    +  KYW+  NS
Sbjct: 334 VQAIMRVYRDFFSYRSGIYRHSAAATPAEERSAYHSVRLIGWGEERVGYDVVKYWIAINS 393

Query: 276 WNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           W   WG+NG F+ILRG +EC IES + A  P
Sbjct: 394 WGQWWGENGRFRILRGSNECDIESYVLASNP 424


>gi|603044|gb|AAA96832.1| cysteine protease homolog, partial [Strongyloides ratti]
          Length = 202

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 57/175 (32%), Positives = 77/175 (44%), Gaps = 42/175 (24%)

Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQEN-YDVPYKKDLNFGA 164
           G CG  +GCRPY   PC  H +      C      TP+C + CQ     + Y KD  + A
Sbjct: 65  GPCGYKYGCRPYAFHPCGVHKDQVYYGECPRKSYDTPECRKICQRGCIQLQYGKDRYYAA 124

Query: 165 KSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDN 224
            +Y V ++ K+IM+EI   GPV GA+  + D  LYK G +     E TA           
Sbjct: 125 SAYFVKNDTKAIMREIMRGGPVHGAYDTYTDFRLYKGGVY-----EHTA----------- 168

Query: 225 TSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEK---YWLIANSW 276
                                G+  GGH+I+I+GWG  +        YWL+ANSW
Sbjct: 169 ---------------------GERTGGHSIKIMGWGNYKHPNGTVIPYWLVANSW 202


>gi|126310154|ref|XP_001364630.1| PREDICTED: tubulointerstitial nephritis antigen [Monodelphis
           domestica]
          Length = 468

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 50/141 (35%), Positives = 69/141 (48%), Gaps = 32/141 (22%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           Y VSSNE  IMKEI ++GPV+    V +D   YKSG +    N           ++D + 
Sbjct: 347 YRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKSGIYRHINN-----------LKDESE 395

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWGDN 283
           +                   + L  HA+++ GWG     +  KEK+W+ ANSW   WG+N
Sbjct: 396 KY------------------RNLRTHAVKLTGWGVLRGAQGKKEKFWIAANSWGKSWGEN 437

Query: 284 GLFKILRGKDECGIESSITAG 304
           G F+ILRG +E  IE  I A 
Sbjct: 438 GYFRILRGVNESDIEKLIIAA 458


>gi|32129435|sp|P92133.2|CATB3_GIALA RecName: Full=Cathepsin B-like CP3; AltName: Full=Cathepsin B-like
           protease B3; Flags: Precursor
 gi|1763663|gb|AAB58260.1| cysteine protease [Giardia intestinalis]
 gi|11691660|emb|CAC18648.1| cathepsin B-like cysteine protease 3 [Giardia intestinalis]
          Length = 299

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 75/252 (29%), Positives = 112/252 (44%), Gaps = 63/252 (25%)

Query: 85  PANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKC 144
           P +FD R ++P+C  I E+ DQG CGSCW                   S  AS G     
Sbjct: 75  PDSFDFREEYPHC--IPEVVDQGGCGSCWAF-----------------SSVASVGD---- 111

Query: 145 VRECQENYDVPYKKDLNFGAKSYSVSSNEKSI------MKEIYEHGPVEGAFTVFDDLIL 198
            R C    D   KK + + +  Y VS +   +      +  ++      G  T  D+ + 
Sbjct: 112 -RRCFAGLD---KKAVKY-SPQYVVSCDRGDMACDGGWLPSVWRFLTKTG--TTTDECVP 164

Query: 199 YKSGRFFVPGNETTAMS-------LIKWTIR-----DNTSQLGA-------EGAFTVFDD 239
           Y+SG     G   T  +       L K T       D  + + A       + AFTV+ D
Sbjct: 165 YQSGSTGARGTCPTKCADGSDLPHLYKATKAVDYGLDAPAIMKALATGGPLQTAFTVYSD 224

Query: 240 LILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
            + Y+SG       +  GGHA+ ++G+G D+   + YW+I NSW  DWG++G F+I+R  
Sbjct: 225 FMYYESGVYQHTYGRVEGGHAVDMVGYGTDDDGVD-YWIIKNSWGPDWGEDGYFRIIRMT 283

Query: 293 DECGIESSITAG 304
           +ECGIE  +  G
Sbjct: 284 NECGIEEQVIGG 295


>gi|294876463|ref|XP_002767679.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239869446|gb|EER00397.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 348

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 58/196 (29%), Positives = 82/196 (41%), Gaps = 45/196 (22%)

Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVREC-QENYDVPYKKDLNFGAKSYSVSS 171
           GC PY    C H+   ++   C      TP C+  C  E Y  P  KD +F A++     
Sbjct: 191 GCWPYNFPRCAHYQKKSKYGPCPKKSYETPSCLDRCPNEKYGTPLDKDRHFTARAVPYWF 250

Query: 172 NE-KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA 230
           N  +SI KEI +HGP   +F  ++D   YKSG +                          
Sbjct: 251 NGIRSIKKEIMKHGPTSASFFTYEDFFSYKSGVY-------------------------- 284

Query: 231 EGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILR 290
                       Y SG  +  H + ++GWG ++     YWL  N WN +W D G FKI +
Sbjct: 285 -----------KYTSGAYVEFHTVELIGWGTEKGV--DYWLAKNDWNEEWADLGTFKIAQ 331

Query: 291 GKDECGIESSITAGVP 306
           G  +CGI + +  G P
Sbjct: 332 G--DCGI-NDLVLGAP 344


>gi|294890224|ref|XP_002773108.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239878009|gb|EER04924.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 109

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 42/102 (41%), Positives = 64/102 (62%), Gaps = 15/102 (14%)

Query: 218 KWTIRDNTSQLGAEG----AFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSK 266
           ++++ D  + +  +G    +F V++D + Y+SG       K LGGHA++I+GWGE+  + 
Sbjct: 12  EYSVNDAKNAIRTDGPVSASFIVYEDFLAYRSGVYKHTSGKELGGHAVKIIGWGEE--TG 69

Query: 267 EKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           + YWL+ NSWN DWGDNGLFKI  G   C I+  +  G PK+
Sbjct: 70  QAYWLVVNSWNEDWGDNGLFKIALGN--CEIDDDLLGGTPKV 109


>gi|448278133|gb|AGE43966.1| putative cathepsin B [Naegleria fowleri]
          Length = 349

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 68/258 (26%), Positives = 100/258 (38%), Gaps = 90/258 (34%)

Query: 97  CPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTR------------PSCDAS------- 137
           C  +  IR+Q  CGSCW     E+   +    GTR             SCD +       
Sbjct: 136 CQQLHRIRNQEQCGSCWAFSISEMV-ADRFCIGTRGKINTIMSPQWMVSCDTADNGCNGG 194

Query: 138 -------------------------KGHTPKCVRECQ--ENYDVPYKKDLNFGAKSYSVS 170
                                     G  P C   C   E+ +V Y+      ++++ V+
Sbjct: 195 EFPTAFQFVETTGLVSDGCVPYQSGNGFVPPCPNSCANGEDINVRYRTK---NSRNFDVN 251

Query: 171 SNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA 230
            + KS+   I  +GPV   F V+ D   Y+SG   V                        
Sbjct: 252 -DMKSVQASILANGPVISGFKVYRDFYNYRSGYKHV------------------------ 286

Query: 231 EGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILR 290
                         +G  +GGHAI+++GWG  + S   YW++ANSW+ +WG NG F ILR
Sbjct: 287 --------------AGGLVGGHAIKVVGWGVTQ-SNVPYWIVANSWSDEWGMNGYFWILR 331

Query: 291 GKDECGIESSITAGVPKL 308
           G +EC IE ++   +P L
Sbjct: 332 GTNECSIEENMWETIPAL 349


>gi|410959397|ref|XP_003986297.1| PREDICTED: tubulointerstitial nephritis antigen [Felis catus]
          Length = 474

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 51/145 (35%), Positives = 69/145 (47%), Gaps = 31/145 (21%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           Y VSSNE  IMKEI ++GPV+    V +D   YK+G +                 R  T 
Sbjct: 352 YRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIY-----------------RHITK 394

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWGDN 283
           +   E               + L  HA+++ GWG     +  KEK+W+ ANSW   WG+N
Sbjct: 395 KANEESG-----------KYRKLQTHAVKLTGWGTLKGAQGRKEKFWIAANSWGKSWGEN 443

Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
           G F+ILRG +E  IE  I A   +L
Sbjct: 444 GYFRILRGVNESDIEKLIIAAWGQL 468


>gi|197101281|ref|NP_001125612.1| dipeptidyl peptidase 1 precursor [Pongo abelii]
 gi|75061881|sp|Q5RB02.1|CATC_PONAB RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
           AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
           peptidase I; Short=DPP-I; Short=DPPI; AltName:
           Full=Dipeptidyl transferase; Contains: RecName:
           Full=Dipeptidyl peptidase 1 exclusion domain chain;
           AltName: Full=Dipeptidyl peptidase I exclusion domain
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           light chain; AltName: Full=Dipeptidyl peptidase I light
           chain; Flags: Precursor
 gi|55728636|emb|CAH91058.1| hypothetical protein [Pongo abelii]
          Length = 463

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 64/248 (25%), Positives = 94/248 (37%), Gaps = 75/248 (30%)

Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
           +  +R+Q SCGSC+                         P E+  C  +  G        
Sbjct: 246 VSPVRNQASCGSCYSFASMGMLEARIRILTSNSQTPILSPQEVVSCSQYAQGCEGGFPYL 305

Query: 130 -----------TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
                         +C    G    C  + +E+    Y  + ++    Y    NE  +  
Sbjct: 306 IAGKYAQDFGLVEEACFPYTGTDSPC--KMKEDCFRYYSSEYHYVGGFYG-GCNEALMKL 362

Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
           E+  HGP+  AF V+DD + YK G +   G            +RD          F  F+
Sbjct: 363 ELVHHGPMAVAFEVYDDFLHYKKGIYHHTG------------LRD---------PFNPFE 401

Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
                     L  HA+ ++G+G D  S   YW++ NSW T WG++G F+I RG DEC IE
Sbjct: 402 ----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGEDGYFRIRRGTDECAIE 451

Query: 299 SSITAGVP 306
           S   A  P
Sbjct: 452 SIAVAATP 459


>gi|311263676|ref|XP_003129789.1| PREDICTED: dipeptidyl peptidase 1-like [Sus scrofa]
          Length = 463

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 70/265 (26%), Positives = 101/265 (38%), Gaps = 78/265 (29%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCWG----------------------CRPYEIA 121
           LPA++D R        +  +R+Q SCGSC+                         P E+ 
Sbjct: 231 LPASWDWRNV-RGTNFVTPVRNQASCGSCYSFASMGMMEARIRILTNNTQTPILSPQEVV 289

Query: 122 PCEHHVNG-------------------TRPSCDASKGHTPKC-VRECQENYDVPYKKDLN 161
            C  +  G                      +C    G    C V+E    Y   Y  + +
Sbjct: 290 SCSQYAQGCAGGFPYLIAGKYAQDFGLVEEACFPYTGTDSPCTVKEGCFRY---YSSEYH 346

Query: 162 FGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTI 221
           +    Y    NE  +  E+  HGP+  AF V+DD + Y+ G +   G            +
Sbjct: 347 YVGGFYG-GCNEALMKLELVHHGPMAVAFEVYDDFLHYRKGIYHHTG------------L 393

Query: 222 RDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWG 281
           RD          F  F+          L  HA+ ++G+G D  S   YW++ NSW T WG
Sbjct: 394 RD---------PFNPFE----------LTNHAVLLVGYGTDLASGMDYWIVKNSWGTSWG 434

Query: 282 DNGLFKILRGKDECGIESSITAGVP 306
           ++G F+I RG DEC IES   A  P
Sbjct: 435 EDGYFRIRRGTDECAIESIAVAATP 459


>gi|307548878|ref|NP_001182580.1| dipeptidyl peptidase 1 precursor [Macaca mulatta]
          Length = 463

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 63/248 (25%), Positives = 96/248 (38%), Gaps = 75/248 (30%)

Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
           +  +R+Q SCGSC+                         P E+  C  +  G        
Sbjct: 246 VSPVRNQASCGSCYSFASVGMLEARIRILTNNSQTPILSPQEVVSCSQYAQGCEGGFPYL 305

Query: 130 -----------TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
                         +C    G+   C  + +E+    Y  + ++    Y    NE  +  
Sbjct: 306 TAGKYAQDFGLVEEACFPYTGNDSPC--KMKEDCFRYYSSEYHYVGGFYG-GCNEALMKL 362

Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
           E+  HGP+  AF V+DD + Y++G +   G            +RD          F  F+
Sbjct: 363 ELVYHGPLAVAFEVYDDFLHYQNGIYHHTG------------LRD---------PFNPFE 401

Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
                     L  HA+ ++G+G D  S   YW++ NSW T WG++G F+I RG DEC IE
Sbjct: 402 ----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIE 451

Query: 299 SSITAGVP 306
           S   A  P
Sbjct: 452 SIAVAATP 459


>gi|417409900|gb|JAA51439.1| Putative cysteine proteinase tin-ag, partial [Desmodus rotundus]
          Length = 346

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 76/297 (25%), Positives = 114/297 (38%), Gaps = 99/297 (33%)

Query: 69  NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCE---H 125
           N +  ++G  EV   LP  F++  KWPN   I E  DQG+C   W      +A      H
Sbjct: 70  NEIHTVLGPGEV---LPTAFEASEKWPN--LIHEPLDQGNCAGSWAFSTAAVASDRVSIH 124

Query: 126 HVNGTRP--------SCD-------------------------------------ASKGH 140
            +    P        SCD                                        G 
Sbjct: 125 SLGHMTPVLSPQNLLSCDKRNQQGCQGGHLDSAWWFLRRRGVVSDHCYPFSGQGRTETGP 184

Query: 141 TPKCVRECQE-------------NYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVE 187
            P+C+   +              N+ V +  D+     +Y + S+EK IMKE+ E+GPV+
Sbjct: 185 APRCMMHSRAMGRGKRQATARCPNHQV-HANDIYQVTPAYRLGSSEKEIMKELMENGPVQ 243

Query: 188 GAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGK 247
               V +D  LY++G +       T +SL +                         +  +
Sbjct: 244 ALMEVHEDFFLYQNGIY-----SHTPVSLGR------------------------PERYR 274

Query: 248 ALGGHAIRILGWGED---EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSI 301
             G H+++I GWGE+   +    KYW  ANSW   WG+ G F+I+RG +EC IES +
Sbjct: 275 RHGTHSVKITGWGEESLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFV 331


>gi|159109223|ref|XP_001704877.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
 gi|157432952|gb|EDO77203.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
          Length = 300

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 72/255 (28%), Positives = 115/255 (45%), Gaps = 63/255 (24%)

Query: 82  EDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHT 141
           +D+P +FD R ++P+C  I E+ DQG CGSCW                   S  A+ G  
Sbjct: 73  DDVPESFDFREEYPHC--IPEVVDQGGCGSCWAF-----------------SSVATFGD- 112

Query: 142 PKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSI------MKEIYEHGPVEGAFTVFDD 195
               R C    D   KK + +  + Y VS +   +      +  +++     G  T  D+
Sbjct: 113 ----RRCVAGLD---KKPVKYSPQ-YVVSCDHGDMACNGGWLPNVWKFLTKTG--TTTDE 162

Query: 196 LILYKSGRFFVPG-------------NETTAMSL------IKWTIRDNTSQLGAEGAFTV 236
            + YKSG   + G             +  TA S       I   ++  ++    + AF V
Sbjct: 163 CVPYKSGSTTLRGTCPTKCADGSSKVHLATATSYKDYGLDIPAMMKALSTSGPLQVAFLV 222

Query: 237 FDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKIL 289
           + D + Y+SG          GGHA+ ++G+G D+   + YW+I NSW  DWG++G F+++
Sbjct: 223 YSDFMYYESGVYQHTYGYMEGGHAVEMVGYGTDDDGVD-YWIIRNSWGPDWGEDGYFRMI 281

Query: 290 RGKDECGIESSITAG 304
           RG ++C IE    AG
Sbjct: 282 RGINDCSIEEQAYAG 296


>gi|355752523|gb|EHH56643.1| hypothetical protein EGM_06098 [Macaca fascicularis]
          Length = 463

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 63/248 (25%), Positives = 96/248 (38%), Gaps = 75/248 (30%)

Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
           +  +R+Q SCGSC+                         P E+  C  +  G        
Sbjct: 246 VSPVRNQASCGSCYSFASVGMLEARIRILTNNSQTPILSPQEVVSCSQYAQGCEGGFPYL 305

Query: 130 -----------TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
                         +C    G+   C  + +E+    Y  + ++    Y    NE  +  
Sbjct: 306 TAGKYAQDFGLVEEACFPYTGNDSPC--KMKEDCFRYYSSEYHYVGGFYG-GCNEALMKL 362

Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
           E+  HGP+  AF V+DD + Y++G +   G            +RD          F  F+
Sbjct: 363 ELVYHGPLAVAFEVYDDFLHYQNGIYHHTG------------LRD---------PFNPFE 401

Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
                     L  HA+ ++G+G D  S   YW++ NSW T WG++G F+I RG DEC IE
Sbjct: 402 ----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIE 451

Query: 299 SSITAGVP 306
           S   A  P
Sbjct: 452 SIAVAATP 459


>gi|383415299|gb|AFH30863.1| dipeptidyl peptidase 1 isoform a preproprotein [Macaca mulatta]
 gi|384944880|gb|AFI36045.1| dipeptidyl peptidase 1 isoform a preproprotein [Macaca mulatta]
          Length = 463

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 63/248 (25%), Positives = 96/248 (38%), Gaps = 75/248 (30%)

Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
           +  +R+Q SCGSC+                         P E+  C  +  G        
Sbjct: 246 VSPVRNQASCGSCYSFASVGMLEARIRILTNNSQTPILSPQEVVSCSQYAQGCEGGFPYL 305

Query: 130 -----------TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
                         +C    G+   C  + +E+    Y  + ++    Y    NE  +  
Sbjct: 306 TAGKYAQDFGLVEEACFPYTGNDSPC--KMKEDCFRYYSSEYHYVGGFYG-GCNEALMKL 362

Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
           E+  HGP+  AF V+DD + Y++G +   G            +RD          F  F+
Sbjct: 363 ELVYHGPLAVAFEVYDDFLHYQNGIYHHTG------------LRD---------PFNPFE 401

Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
                     L  HA+ ++G+G D  S   YW++ NSW T WG++G F+I RG DEC IE
Sbjct: 402 ----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIE 451

Query: 299 SSITAGVP 306
           S   A  P
Sbjct: 452 SIAVAATP 459


>gi|380808942|gb|AFE76346.1| dipeptidyl peptidase 1 isoform a preproprotein [Macaca mulatta]
          Length = 463

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 63/248 (25%), Positives = 96/248 (38%), Gaps = 75/248 (30%)

Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
           +  +R+Q SCGSC+                         P E+  C  +  G        
Sbjct: 246 VSPVRNQASCGSCYSFASVGMLEARIRILTNNSQTPILSPQEVVSCSQYAQGCEGGFPYL 305

Query: 130 -----------TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
                         +C    G+   C  + +E+    Y  + ++    Y    NE  +  
Sbjct: 306 TAGKYAQDFGLVEEACFPYTGNDSPC--KMKEDCFRYYSSEYHYVGGFYG-GCNEALMKL 362

Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
           E+  HGP+  AF V+DD + Y++G +   G            +RD          F  F+
Sbjct: 363 ELVYHGPLAVAFEVYDDFLHYQNGIYHHTG------------LRD---------PFNPFE 401

Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
                     L  HA+ ++G+G D  S   YW++ NSW T WG++G F+I RG DEC IE
Sbjct: 402 ----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIE 451

Query: 299 SSITAGVP 306
           S   A  P
Sbjct: 452 SIAVAATP 459


>gi|159108625|ref|XP_001704582.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
 gi|157432649|gb|EDO76908.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
          Length = 298

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 74/251 (29%), Positives = 110/251 (43%), Gaps = 62/251 (24%)

Query: 85  PANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKC 144
           P +FD R ++P+C  I E+ DQG CGSCW                   S  AS G     
Sbjct: 75  PDSFDFREEYPHC--IPEVVDQGGCGSCWAF-----------------SSVASVGD---- 111

Query: 145 VRECQENYDVPYKKDLNFGAKSYSVSSNEKSI------MKEIYEHGPVEGAFTVFDDLIL 198
            R C    D   KK + + +  Y VS +   +      +  ++      G  T  D+ + 
Sbjct: 112 -RRCFAGLD---KKAVKY-SPQYVVSCDRGDMACDGGWLPSVWRFLTKTG--TTTDECVP 164

Query: 199 YKSGRFFVPGNETT------------AMSLIKWTIR-DNTSQLGAEG-----AFTVFDDL 240
           Y+SG     G   T            A   + + +  D   +  A G     AFTV+ D 
Sbjct: 165 YQSGSTGARGTCPTKCADGSDLPIYKATKAVDYGLDCDLIMKALATGGPLQTAFTVYSDF 224

Query: 241 ILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
           + Y+ G       +  GGHA+ ++G+G DE   + YW+I NSW  DWG++G F+I+R  +
Sbjct: 225 MYYEGGVYQHTYGRVEGGHAVEMVGYGTDEYDVD-YWIIRNSWGPDWGEDGYFRIIRMTN 283

Query: 294 ECGIESSITAG 304
           ECGIE  +  G
Sbjct: 284 ECGIEEQVIGG 294


>gi|73973401|ref|XP_538969.2| PREDICTED: tubulointerstitial nephritis antigen [Canis lupus
           familiaris]
          Length = 476

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 50/143 (34%), Positives = 67/143 (46%), Gaps = 36/143 (25%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSG--RFFVPGNETTAMSLIKWTIRDN 224
           Y VSSNE  IMKEI ++GPV+    V +D   YK+G  R     NE +            
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHITRTNEES------------ 402

Query: 225 TSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWG 281
                              +  + L  HA+++ GWG     +  KEK+W+ ANSW   WG
Sbjct: 403 -------------------RKYQKLQTHAVKLTGWGTLKGAQGQKEKFWIAANSWGISWG 443

Query: 282 DNGLFKILRGKDECGIESSITAG 304
           +NG F+ILRG +E  IE  I A 
Sbjct: 444 ENGYFRILRGVNESDIEKLIIAA 466


>gi|147902366|ref|NP_001080511.1| cathepsin C precursor [Xenopus laevis]
 gi|33417162|gb|AAH56109.1| Ctsc protein [Xenopus laevis]
          Length = 458

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 63/250 (25%), Positives = 98/250 (39%), Gaps = 75/250 (30%)

Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHH---VNGTRPSC 134
           +  +R+QGSCGSC+                         P ++  C ++    +G  P  
Sbjct: 241 VSPVRNQGSCGSCYAFASMGMLESRIQIQSQLSQKPILSPQQVVSCSNYSQGCDGGFPYL 300

Query: 135 DASK----------------GHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
            A K                G    C    +++Y   Y  + ++    Y    NE  +  
Sbjct: 301 IAGKYLNDFGIVEESDFPYIGSDSPCT--LKDSYQRYYTAEYHYVGGFYG-GCNEAYMKL 357

Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
           E+   GP+  AF V+DD I Y+SG +                        G +  F  F 
Sbjct: 358 ELVLGGPLSVAFEVYDDFIHYRSGVY---------------------HHTGLQDKFNPFQ 396

Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
                     L  HA+ ++G+G D+++ EKYW++ NSW   WG+ G F+I RG DEC IE
Sbjct: 397 ----------LTNHAVLLVGYGTDQQTGEKYWIVKNSWGESWGEKGFFRIRRGSDECAIE 446

Query: 299 SSITAGVPKL 308
           S   +  P +
Sbjct: 447 SIAVSANPII 456


>gi|355566931|gb|EHH23310.1| hypothetical protein EGK_06753 [Macaca mulatta]
          Length = 463

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 63/248 (25%), Positives = 96/248 (38%), Gaps = 75/248 (30%)

Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
           +  +R+Q SCGSC+                         P E+  C  +  G        
Sbjct: 246 VSPVRNQASCGSCYSFASVGMLEARIRILTNNSQTPILSPQEVVSCSQYAQGCEGGFPYL 305

Query: 130 -----------TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
                         +C    G+   C  + +E+    Y  + ++    Y    NE  +  
Sbjct: 306 TAGKYAQDFGLVEEACFPYTGNDSPC--KMKEDCFRYYSSEYHYVGGFYG-GCNEALMKL 362

Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
           E+  HGP+  AF V+DD + Y++G +   G            +RD          F  F+
Sbjct: 363 ELVYHGPLAVAFEVYDDFLHYQNGIYHHTG------------LRD---------PFNPFE 401

Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
                     L  HA+ ++G+G D  S   YW++ NSW T WG++G F+I RG DEC IE
Sbjct: 402 ----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTSWGEDGYFRIHRGTDECAIE 451

Query: 299 SSITAGVP 306
           S   A  P
Sbjct: 452 SIAVAATP 459


>gi|338718488|ref|XP_001918155.2| PREDICTED: LOW QUALITY PROTEIN: tubulointerstitial nephritis
           antigen-like [Equus caballus]
          Length = 480

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 53/145 (36%), Positives = 68/145 (46%), Gaps = 32/145 (22%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           Y VSSNE  IMKEI ++GPV+    V DD   YK G +                 R  TS
Sbjct: 359 YRVSSNETEIMKEIMQNGPVQAIMQVHDDFFHYKKGIY-----------------RHVTS 401

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWGDN 283
                  +            + L  HAI++ GWG     +  KEK+W+ ANSW   WG+N
Sbjct: 402 THEEPEKY------------RKLRTHAIKLAGWGTLRGAQGRKEKFWIAANSWGKSWGEN 449

Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
           G F+ILRG +E  IE  I A   +L
Sbjct: 450 GYFRILRGVNESDIEKLIIAAWGQL 474


>gi|12658201|gb|AAK01061.1| cysteine proteinase [Metagonimus yokogawai]
          Length = 179

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 50/149 (33%), Positives = 69/149 (46%), Gaps = 38/149 (25%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GCR Y    C HH  G    C      TP C + C +  +V Y  D      SY+V ++E
Sbjct: 66  GCRSYPFPKCNHHGKGPDAPCPEKIFPTPACNKTC-DTPEVNYILDKTKAKSSYNVPNSE 124

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K+IMKEI ++GPVE AF V++D + Y+SG +F                            
Sbjct: 125 KAIMKEIMQNGPVEAAFEVYEDFLHYESGVYF---------------------------- 156

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGED 262
                    +  G+ +GGHAIR+LGWGE+
Sbjct: 157 ---------HSFGRMIGGHAIRMLGWGEE 176



 Score = 45.4 bits (106), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 16/27 (59%), Positives = 24/27 (88%)

Query: 9  CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
          CGFGC+GGFP  AW +W+++G+V+GG+
Sbjct: 34 CGFGCHGGFPPRAWDFWMENGLVTGGS 60


>gi|312082955|ref|XP_003143660.1| hypothetical protein LOAG_08080 [Loa loa]
 gi|307761175|gb|EFO20409.1| hypothetical protein LOAG_08080 [Loa loa]
          Length = 339

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 51/138 (36%), Positives = 69/138 (50%), Gaps = 38/138 (27%)

Query: 166 SYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNT 225
           SY VSS E+ IM EI  +GPV+  F V  D        FF+ G            +  + 
Sbjct: 211 SYRVSSREQDIMSEILTNGPVQATFRVHGD--------FFIAG------------VYKHL 250

Query: 226 SQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKS--KEKYWLIANSWNTDWGDN 283
             +G E                  G H++R+LGWGED  +    KYW+ ANSW T+WG+N
Sbjct: 251 PTVGEE----------------IEGYHSVRLLGWGEDYSTGIPVKYWIAANSWGTNWGEN 294

Query: 284 GLFKILRGKDECGIESSI 301
           G F+ILRG++ C IES +
Sbjct: 295 GTFRILRGENHCEIESFV 312


>gi|227499499|ref|NP_036163.3| tubulointerstitial nephritis antigen precursor [Mus musculus]
 gi|4929827|gb|AAD34171.1| tubulo-interstitial nephritis antigen [Mus musculus]
 gi|148694397|gb|EDL26344.1| tubulointerstitial nephritis antigen, isoform CRA_a [Mus musculus]
          Length = 475

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 52/147 (35%), Positives = 68/147 (46%), Gaps = 36/147 (24%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSG--RFFVPGNETTAMSLIKWTIRDN 224
           Y VSSNE  IM+EI ++GPV+    V +D   YK+G  R  V  NE              
Sbjct: 354 YRVSSNETEIMREIIQNGPVQAIMQVHEDFFYYKTGIYRHVVSTNEEP------------ 401

Query: 225 TSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWG 281
                              +  K L  HA+++ GWG        KEK+W+ ANSW   WG
Sbjct: 402 -------------------EKYKKLRTHAVKLTGWGTLRGARGKKEKFWIAANSWGKSWG 442

Query: 282 DNGLFKILRGKDECGIESSITAGVPKL 308
           +NG F+ILRG +E  IE  I A   +L
Sbjct: 443 ENGYFRILRGVNESDIEKLIIAAWGQL 469


>gi|129270160|ref|NP_001038442.2| tubulointerstitial nephritis antigen-like precursor [Danio rerio]
 gi|126632071|gb|AAI33830.1| Si:dkey-158b13.1 [Danio rerio]
          Length = 471

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 48/151 (31%), Positives = 70/151 (46%), Gaps = 36/151 (23%)

Query: 156 YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMS 215
           Y  D+      Y +S+NE  IMKEI ++GPV+    V +D  +YKSG F           
Sbjct: 329 YHNDIYQSTPPYRLSTNENEIMKEIMDNGPVQAIMEVHEDFFVYKSGIF----------- 377

Query: 216 LIKWTIRDNTSQLGAEGAFTVFDDLILYKSG--KALGGHAIRILGWGEDEK---SKEKYW 270
                                  D+  +K    +    H++RI GWGE+        KYW
Sbjct: 378 --------------------RHTDVNYHKPSQYRKHATHSVRITGWGEERDYSGRTRKYW 417

Query: 271 LIANSWNTDWGDNGLFKILRGKDECGIESSI 301
           + ANSW  +WG++G F+I RG +EC IE+ +
Sbjct: 418 IGANSWGKNWGEDGYFRIARGVNECDIETFV 448


>gi|14789619|gb|AAH10745.1| Tubulointerstitial nephritis antigen [Mus musculus]
          Length = 475

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 52/147 (35%), Positives = 68/147 (46%), Gaps = 36/147 (24%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSG--RFFVPGNETTAMSLIKWTIRDN 224
           Y VSSNE  IM+EI ++GPV+    V +D   YK+G  R  V  NE              
Sbjct: 354 YRVSSNETEIMREIIQNGPVQAIMQVHEDFFYYKTGIYRHVVSTNEEP------------ 401

Query: 225 TSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWG 281
                              +  K L  HA+++ GWG        KEK+W+ ANSW   WG
Sbjct: 402 -------------------EKYKKLRTHAVKLTGWGTLRGARGKKEKFWIAANSWGKSWG 442

Query: 282 DNGLFKILRGKDECGIESSITAGVPKL 308
           +NG F+ILRG +E  IE  I A   +L
Sbjct: 443 ENGYFRILRGVNESDIEKLIIAAWGQL 469


>gi|348508181|ref|XP_003441633.1| PREDICTED: dipeptidyl peptidase 1-like isoform 1 [Oreochromis
           niloticus]
          Length = 455

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 75/293 (25%), Positives = 111/293 (37%), Gaps = 88/293 (30%)

Query: 67  PANRLPELIGYSEVDED-------LPANFDSRTKWPNCPTIREIRDQGSCGSCWG----- 114
           PA+R+P  +  + V  D       LP  +D R        +  +R+Q SCGSC+      
Sbjct: 200 PASRIPVRVRPAPVKADVAKMASALPEQWDWRNV-DGVNFVSPVRNQESCGSCYSFATMG 258

Query: 115 -----------------CRPYEIAPCEHHVNG------------------TRPSCDASKG 139
                              P ++  C  +  G                     SC    G
Sbjct: 259 MLEARIRILTNNSDAPTLSPQQVVSCSEYSQGCDGGFPYLIGKYTQDFGIVDESCFPYVG 318

Query: 140 HTPKC--VRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLI 197
               C   ++CQ  Y   Y    N+    Y   S E ++M E+ ++GP+  AF V+ D +
Sbjct: 319 QNTPCGVPQKCQRIYAAEY----NYVGGFYGGCS-EAAMMLELVKNGPMAVAFEVYPDFM 373

Query: 198 LYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRIL 257
            YK G +                        G    F  F+          L  HA+ ++
Sbjct: 374 NYKEGIY---------------------HHTGLADPFNPFE----------LTNHAVLLV 402

Query: 258 GWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAG--VPKL 308
           G+G   K+ + YW++ NSW T WG+ G F+I RG DEC IES   A   +PKL
Sbjct: 403 GYGRCHKTGQNYWIVKNSWGTGWGEEGYFRIRRGNDECAIESIAVAANPIPKL 455


>gi|301618234|ref|XP_002938532.1| PREDICTED: tubulointerstitial nephritis antigen-like [Xenopus
           (Silurana) tropicalis]
          Length = 494

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 72/281 (25%), Positives = 110/281 (39%), Gaps = 93/281 (33%)

Query: 81  DEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCE-------HHVNGTRP- 132
           ++ LP++F++  KWP    + E  DQG+C   W      +A          H      P 
Sbjct: 233 NDILPSHFNAAEKWPG--LVHEPLDQGNCAGSWAFSTAAVASDRISIQSMGHMTQSLSPQ 290

Query: 133 ---SCDA-----------------------------------SKGHTPKCV--------- 145
              SCD                                    + GH+  C+         
Sbjct: 291 NLLSCDTRNQHGCRGGRVDGAWWYLRRRGVVSEPCYPFTSLNTNGHSAPCMMQSRSMGRG 350

Query: 146 -RECQENYDVPY--KKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSG 202
            R+   N    Y    ++     +Y ++S+EK IMKE+YE+GPV+    V +D  +YKSG
Sbjct: 351 KRQATNNCPNQYYSSNEIYQSTPAYRLASSEKDIMKELYENGPVQAIMEVHEDFFMYKSG 410

Query: 203 RF-FVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE 261
            +   P  E                                 +  +  G H+++I G G 
Sbjct: 411 IYRHTPVTEREP------------------------------EHHRRHGTHSVKITG-GR 439

Query: 262 DEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSIT 302
           D ++  KYWL ANSW  DWG++G F+I RG++EC IE+ I 
Sbjct: 440 DGQT-HKYWLAANSWGRDWGEDGYFRIARGENECEIETFIV 479


>gi|107921798|gb|ABF85680.1| cathepsin B3 [Fasciola hepatica]
          Length = 278

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 48/148 (32%), Positives = 67/148 (45%), Gaps = 38/148 (25%)

Query: 114 GCRPYEIAPCEHHV-NGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
           GC PY    C H V     P C      TPKC ++C   Y+  Y++D   G  SY+V   
Sbjct: 160 GCLPYPFPKCSHGVVTPGLPPCPRDIYPTPKCEKKCHAGYNKTYEQDKVKGKSSYNVGEQ 219

Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
           E  IM EI ++GPV+G F +F+D ++YKSG +                            
Sbjct: 220 ETDIMMEIMKNGPVDGIFYMFEDFLVYKSGIYH--------------------------- 252

Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWG 260
                     Y +G+ +GGHAIR++GWG
Sbjct: 253 ----------YTTGRLVGGHAIRVIGWG 270



 Score = 65.5 bits (158), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 33/76 (43%), Positives = 44/76 (57%), Gaps = 2/76 (2%)

Query: 39  KQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCP 98
           K A     +NI +  +K  +GV  +     N   + + YS  + DLP +FD+R KWPNCP
Sbjct: 20  KAAPSTRFNNIDQ--VKQNLGVLEETPEDRNTQRQTVRYSVSENDLPESFDARQKWPNCP 77

Query: 99  TIREIRDQGSCGSCWG 114
           +I EIRDQ SC SCW 
Sbjct: 78  SISEIRDQSSCSSCWA 93



 Score = 47.8 bits (112), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 16/27 (59%), Positives = 21/27 (77%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
           CG+GCNGG P M+W YW + G+V+GG 
Sbjct: 128 CGYGCNGGIPAMSWDYWTREGVVTGGT 154


>gi|357623033|gb|EHJ74345.1| tubulointerstitial nephritis antigen [Danaus plexippus]
          Length = 426

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 72/258 (27%), Positives = 107/258 (41%), Gaps = 29/258 (11%)

Query: 66  LPANRLPELIGYSEVDEDLP--ANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPC 123
           +P +     +G    D+D+P   +FD+R +WPN   I  + DQG CGS W      +A  
Sbjct: 166 MPLSHETRRMGPIRYDKDIPYPRDFDARRRWPN--FISPVLDQGWCGSDWAVTIATVASD 223

Query: 124 EHHV--NGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIY 181
              +  NG      + +      +R  Q           NF A+ + +   E    K   
Sbjct: 224 RFAIQSNGAERMVLSPQVLLSCNIRRQQGCRGGHIDVAWNF-ARGHGLVDEECFPYKAAT 282

Query: 182 EHGPVEGAFTVFDD----LILYKSGRFFV--PGNETTAMSLIKWTIRDNTSQLGAEGAFT 235
              P      + +D     +  ++ R+ V  PG   T   ++     D           T
Sbjct: 283 TSCPFRPKANLIEDGCRPPVRQRTSRYKVGPPGKLATENDIMY----DIMESGPVHAVMT 338

Query: 236 VFDDLILYKSG----------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
           V  D   Y  G             G H++RI+GWGED    +KYW++ANSW  DWG+NG 
Sbjct: 339 VHQDFFHYHDGIYRRSPYGDNTLQGLHSVRIVGWGEDR--GDKYWVVANSWGCDWGENGY 396

Query: 286 FKILRGKDECGIESSITA 303
           F+I RG +E GIES +  
Sbjct: 397 FRIARGSNESGIESFVVT 414


>gi|253744515|gb|EET00718.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 306

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 71/293 (24%), Positives = 116/293 (39%), Gaps = 91/293 (31%)

Query: 59  GVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG---- 114
            + P ++L A+ +P      E    +PA+FD R ++P C  I  + DQG CGSCW     
Sbjct: 54  AMFPRHDLAAS-VPAECPRGEPSGSIPASFDFREEYPQC--ITPVYDQGHCGSCWAFSAT 110

Query: 115 -------------------CRPYEIAPCEH-------------------HVNGTR---PS 133
                               + Y I+ C++                   H   T    P 
Sbjct: 111 SAFGDRRCMQGLDSAGVPYSQQYTIS-CDYLDLGCAGGLSFSVWTFLTEHGTTTLECVPY 169

Query: 134 CDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVF 193
            DA+K  +  C   C +  ++   K    G   YS   N  +IM+ +   GPV+ +  V+
Sbjct: 170 TDANKDISSPCPDACADGSEIRLVK--ADGCLDYS--GNVTAIMQALANDGPVQASMAVY 225

Query: 194 DDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHA 253
            D + Y+SG +                                      +  G  +  HA
Sbjct: 226 RDFLYYRSGVY-------------------------------------RHVYGSQISSHA 248

Query: 254 IRILGWGE-DEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGV 305
           + I+G+G  D++    YW++ NS  + WG+ G F I+RG +EC IES++ +G+
Sbjct: 249 VEIIGYGAADDEDSTPYWIVKNSLGSGWGEEGYFNIVRGSNECDIESAVYSGL 301


>gi|326916361|ref|XP_003204476.1| PREDICTED: tubulointerstitial nephritis antigen-like [Meleagris
           gallopavo]
          Length = 467

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 47/141 (33%), Positives = 65/141 (46%), Gaps = 38/141 (26%)

Query: 164 AKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRD 223
           A  Y +SS E  IM+EI   GPV+    V++D  LYK G +                   
Sbjct: 357 ASHYRISSKETDIMEEIMAKGPVQAIMKVYEDFFLYKEGIYRHS---------------- 400

Query: 224 NTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDW 280
                              YK+G     H++++LGWG        K+K+W+ ANSW   W
Sbjct: 401 -------------------YKAGSKWKTHSVKLLGWGSLPGKNGQKQKFWIAANSWGKYW 441

Query: 281 GDNGLFKILRGKDECGIESSI 301
           G+NG F+ILRG++EC IE  I
Sbjct: 442 GENGYFRILRGQNECDIEKLI 462


>gi|297465285|ref|XP_887401.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 2
           [Bos taurus]
 gi|297472148|ref|XP_002685665.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Bos taurus]
 gi|296490232|tpg|DAA32345.1| TPA: tubulointerstitial nephritis antigen-like 1-like [Bos taurus]
          Length = 534

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 49/149 (32%), Positives = 73/149 (48%), Gaps = 32/149 (21%)

Query: 156 YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMS 215
           +  D+     +Y + SNEK IMKE+ E+GPV+    V +D  LY+SG +       T +S
Sbjct: 400 HANDIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLYQSGIY-----SHTPVS 454

Query: 216 LIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGED---EKSKEKYWLI 272
           L +                         +  +  G H+++I GWGE+   +    KYW  
Sbjct: 455 LGR------------------------PERYRRHGTHSVKITGWGEETLPDGRTIKYWTA 490

Query: 273 ANSWNTDWGDNGLFKILRGKDECGIESSI 301
           ANSW   WG+ G F+I+RG +EC IES +
Sbjct: 491 ANSWGPAWGERGHFRIVRGANECDIESFV 519


>gi|348508183|ref|XP_003441634.1| PREDICTED: dipeptidyl peptidase 1-like isoform 2 [Oreochromis
           niloticus]
          Length = 461

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 75/293 (25%), Positives = 111/293 (37%), Gaps = 88/293 (30%)

Query: 67  PANRLPELIGYSEVDED-------LPANFDSRTKWPNCPTIREIRDQGSCGSCWG----- 114
           PA+R+P  +  + V  D       LP  +D R        +  +R+Q SCGSC+      
Sbjct: 206 PASRIPVRVRPAPVKADVAKMASALPEQWDWRNV-DGVNFVSPVRNQESCGSCYSFATMG 264

Query: 115 -----------------CRPYEIAPCEHHVNG------------------TRPSCDASKG 139
                              P ++  C  +  G                     SC    G
Sbjct: 265 MLEARIRILTNNSDAPTLSPQQVVSCSEYSQGCDGGFPYLIGKYTQDFGIVDESCFPYVG 324

Query: 140 HTPKC--VRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLI 197
               C   ++CQ  Y   Y    N+    Y   S E ++M E+ ++GP+  AF V+ D +
Sbjct: 325 QNTPCGVPQKCQRIYAAEY----NYVGGFYGGCS-EAAMMLELVKNGPMAVAFEVYPDFM 379

Query: 198 LYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRIL 257
            YK G +                        G    F  F+          L  HA+ ++
Sbjct: 380 NYKEGIY---------------------HHTGLADPFNPFE----------LTNHAVLLV 408

Query: 258 GWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAG--VPKL 308
           G+G   K+ + YW++ NSW T WG+ G F+I RG DEC IES   A   +PKL
Sbjct: 409 GYGRCHKTGQNYWIVKNSWGTGWGEEGYFRIRRGNDECAIESIAVAANPIPKL 461


>gi|193629592|ref|XP_001944624.1| PREDICTED: cathepsin B-like isoform 4 [Acyrthosiphon pisum]
          Length = 331

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 57/196 (29%), Positives = 83/196 (42%), Gaps = 51/196 (26%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDA-SKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
           GC+P +I P           C+  +K +   CV  C  N  + Y  D       Y     
Sbjct: 185 GCQPSKIPPV----------CNLPTKINKRTCVDYCYGNDTIKYNHD--HVKVRYYYHVK 232

Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
            K I KE+  +GPV  A  ++DD+ L+KSG +                            
Sbjct: 233 PKDIQKEVQTYGPVTAALNLYDDIFLHKSGVY---------------------------- 264

Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
                    L K+ K +    ++++GWG +  +   YWL+ NSW  +WG NGL KI RGK
Sbjct: 265 --------TLTKNAKYVRLQYVKLIGWGVE--NGVDYWLLVNSWGNEWGQNGLLKIKRGK 314

Query: 293 DECGIESSITAGVPKL 308
             C +ES + A VPK+
Sbjct: 315 YGCAVESFVYAAVPKI 330


>gi|351704465|gb|EHB07384.1| Tubulointerstitial nephritis antigen [Heterocephalus glaber]
          Length = 475

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 51/145 (35%), Positives = 71/145 (48%), Gaps = 32/145 (22%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           Y VSSNE  IMKEI ++GPV+    V +D   YK+G +            +  TI D+  
Sbjct: 354 YRVSSNETQIMKEIMKNGPVQAIMQVHEDFFYYKTGIY----------RHVTSTIEDS-- 401

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWGDN 283
                            +  + L  HA+++ GWG     +  KEK+W+ ANSW   WG+N
Sbjct: 402 -----------------EKYQKLRTHAVKLTGWGTLRGAKGRKEKFWIAANSWGKSWGEN 444

Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
           G F+ILRG +E  IE  I A   +L
Sbjct: 445 GYFRILRGVNESDIEKLIIAAWGQL 469


>gi|195154396|ref|XP_002018108.1| GL16940 [Drosophila persimilis]
 gi|194113904|gb|EDW35947.1| GL16940 [Drosophila persimilis]
          Length = 433

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 75/292 (25%), Positives = 107/292 (36%), Gaps = 89/292 (30%)

Query: 67  PANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAP---- 122
           P  R+  +   +     LPA F++  KW +   I E+ DQG CGS W      +A     
Sbjct: 172 PTYRVKAMSRLTNPTAGLPAAFNAVEKWSS--YISEVPDQGWCGSSWVLSTTSVASDRFA 229

Query: 123 -------------------------CE-----------HHVNGTRPSCDASKGHTPKC-V 145
                                    CE           H       SC     H   C +
Sbjct: 230 IQSKGKEAVQLSAQNILSCTRRQQGCEGGHLDAAWRYLHKKGVVDESCYPYTQHRDTCKI 289

Query: 146 RE---------CQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDL 196
           R          C+ + +V   +D  +        + E  IM EIY  GPV+    V+ D 
Sbjct: 290 RHNSRSLKANGCRPSANV--DRDSFYTVGPAYTLNKESDIMAEIYHSGPVQATMRVYRDF 347

Query: 197 ILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRI 256
             Y SG +                 R   +  GA   F                 H++++
Sbjct: 348 FSYSSGVY-----------------RQTAANRGAPTGF-----------------HSVKL 373

Query: 257 LGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           +GWGE E + +KYW+ ANSW   WG+ G F+ILRG +ECGIE  + A  P +
Sbjct: 374 VGWGE-EHNGDKYWIAANSWGPWWGERGYFRILRGSNECGIEDYVLASWPYV 424


>gi|402894881|ref|XP_003910570.1| PREDICTED: dipeptidyl peptidase 1 [Papio anubis]
          Length = 463

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 63/248 (25%), Positives = 95/248 (38%), Gaps = 75/248 (30%)

Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
           +  +R+Q SCGSC+                         P E+  C  +  G        
Sbjct: 246 VSPVRNQASCGSCYSFASVGMLEARIRILTNNSQTPILSPQEVVSCSQYAQGCEGGFPYL 305

Query: 130 -----------TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
                         +C    G    C  + +E+    Y  + ++    Y    NE  +  
Sbjct: 306 IAGKYAQDFGLVEEACFPYTGTDSPC--KMKEDCFRYYSSEYHYVGGFYG-GCNEALMKL 362

Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
           E+  HGP+  AF V+DD + Y++G +   G            +RD          F  F+
Sbjct: 363 ELVYHGPLSVAFEVYDDFLHYQNGIYHHTG------------LRD---------PFNPFE 401

Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
                     L  HA+ ++G+G D  S   YW++ NSW T WG++G F+I RG DEC IE
Sbjct: 402 ----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTSWGEDGYFRIRRGTDECAIE 451

Query: 299 SSITAGVP 306
           S   A  P
Sbjct: 452 SIAVAATP 459


>gi|10803450|emb|CAB97364.2| putative cathepsin B.1 [Ostertagia ostertagi]
          Length = 199

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 51/159 (32%), Positives = 72/159 (45%), Gaps = 41/159 (25%)

Query: 115 CRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           CRPYEI PC +H +      CD     TP+C R CQ  Y   Y  D ++G  +Y +  + 
Sbjct: 72  CRPYEIHPCGYHKDEPYYGECD-DLADTPRCKRRCQLGYPKSYPSDKHYGRTAYQLPMSV 130

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           +SI +EI  +GPV   FTV++D   YK G                               
Sbjct: 131 ESIQREIMRNGPVVAGFTVYEDFAHYKGG------------------------------- 159

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEK--YW 270
                 +  + SGK  GGHA++++GWG ++K  EK  YW
Sbjct: 160 ------IYKHTSGKKTGGHAVKVIGWGSEQKGSEKIPYW 192



 Score = 38.1 bits (87), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 26/101 (25%), Positives = 41/101 (40%), Gaps = 26/101 (25%)

Query: 9   CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQA-------------------EKNSLSNI 49
           CG+GC GG+   AW Y+ + G+V+GG Y +K +                   E + L++ 
Sbjct: 39  CGYGCQGGWSIRAWYYFAEQGVVTGGNYNTKGSCRPYEIHPCGYHKDEPYYGECDDLADT 98

Query: 50  PRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDS 90
           PR   +  +G    Y       P    Y      LP + +S
Sbjct: 99  PRCKRRCQLGYPKSY-------PSDKHYGRTAYQLPMSVES 132


>gi|125810908|ref|XP_001361665.1| GA15908 [Drosophila pseudoobscura pseudoobscura]
 gi|54636841|gb|EAL26244.1| GA15908 [Drosophila pseudoobscura pseudoobscura]
          Length = 433

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 75/292 (25%), Positives = 107/292 (36%), Gaps = 89/292 (30%)

Query: 67  PANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAP---- 122
           P  R+  +   +     LPA F++  KW +   I E+ DQG CGS W      +A     
Sbjct: 172 PTYRVKAMSRLTNPTAGLPAAFNAVEKWSS--YISEVPDQGWCGSSWVLSTTSVASDRFA 229

Query: 123 -------------------------CE-----------HHVNGTRPSCDASKGHTPKC-V 145
                                    CE           H       SC     H   C +
Sbjct: 230 IQSKGKEAVQLSAQNILSCTRRQQGCEGGHLDAAWRYLHKKGVVDESCYPYTQHRDTCKI 289

Query: 146 RE---------CQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDL 196
           R          C+ + +V   +D  +        + E  IM EIY  GPV+    V+ D 
Sbjct: 290 RHNSRSLKANGCRPSANV--DRDSFYTVGPAYTLNKESDIMAEIYHSGPVQATMRVYRDF 347

Query: 197 ILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRI 256
             Y SG +                 R   +  GA   F                 H++++
Sbjct: 348 FSYSSGVY-----------------RQTAANRGAPTGF-----------------HSVKL 373

Query: 257 LGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           +GWGE E + +KYW+ ANSW   WG+ G F+ILRG +ECGIE  + A  P +
Sbjct: 374 VGWGE-EHNGDKYWIAANSWGPWWGERGYFRILRGSNECGIEDYVLASWPYV 424


>gi|332030944|gb|EGI70570.1| Uncharacterized peptidase C1-like protein F26E4.3 [Acromyrmex
           echinatior]
          Length = 501

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 54/152 (35%), Positives = 76/152 (50%), Gaps = 38/152 (25%)

Query: 155 PYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAM 214
           P + +L     +Y +  NE  IM+EI   GPV+    V+ D  +YK+G +          
Sbjct: 379 PLRTELYKVGPAYRLG-NETDIMQEILTSGPVQATMRVYQDFFVYKNGIY---------- 427

Query: 215 SLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSK---EKYWL 271
                  R + S   AE          L+ SG     H++RI+GWGE+   +    KYWL
Sbjct: 428 -------RHSQS---AE----------LHDSGY----HSVRIIGWGEERSYRGPPLKYWL 463

Query: 272 IANSWNTDWGDNGLFKILRGKDECGIESSITA 303
           + NSW  +WG+NGLFKI RG +EC IES + A
Sbjct: 464 VVNSWGYNWGENGLFKIQRGTNECEIESYVLA 495


>gi|159112288|ref|XP_001706373.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
 gi|157434469|gb|EDO78699.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
          Length = 303

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 77/286 (26%), Positives = 118/286 (41%), Gaps = 39/286 (13%)

Query: 39  KQAEKNSLSNIPRAHLKSWMGVHPDY------NLPANRLPELIGYSEVDEDLPANFDSRT 92
           K        N+     +S M + PD       +LP   + E+    E+ + +P  FD R 
Sbjct: 32  KAGMPKRFENVTEDEFRS-MLIRPDRLRARSGSLPPISITEV---QELVDPIPPQFDFRD 87

Query: 93  KWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGT-RPSCDASKGHTPKCVRE---C 148
           ++P C  ++   DQGSCG CW      +        G  + +   S+ H   C  E   C
Sbjct: 88  EYPQC--VKPALDQGSCGGCWAFSAIGVFGDRRCAMGIDKEAVSYSQQHLISCSLENFGC 145

Query: 149 QENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDD---LILYKSGRFF 205
                 P    L F       ++  + +    Y H        V DD   + LYK+  + 
Sbjct: 146 DGGDFQPTWSFLTFTG-----ATTAECVKYVDYGHTVASPCPAVCDDGSPIQLYKAHGY- 199

Query: 206 VPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKA--------LGGHAIRIL 257
             G  + ++  I   +         +    V+ DL  Y+SG          LG HA+ I+
Sbjct: 200 --GQVSKSVPAIMGMLVAGGP---LQTMIVVYADLSYYESGVYKHTYGTINLGFHALEIV 254

Query: 258 GWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITA 303
           G+G  +   + YW+I NSW  DWG+NG F+I+RG +EC IE  I A
Sbjct: 255 GYGTTDDGTD-YWIIKNSWGPDWGENGYFRIVRGVNECRIEDEIYA 299


>gi|348553066|ref|XP_003462348.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cavia
           porcellus]
          Length = 475

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 53/145 (36%), Positives = 72/145 (49%), Gaps = 32/145 (22%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           Y VSSNE  IMKEI ++GPV+    V +D   YK+G +       T+ S           
Sbjct: 354 YRVSSNETQIMKEIMQNGPVQAIMKVHEDFFSYKTGIY----RHVTSTS----------- 398

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKS---KEKYWLIANSWNTDWGDN 283
                      +D   Y+    L  HA+++ GWG  + +   KEK+W+ ANSW   WG+N
Sbjct: 399 -----------EDSEKYQK---LRTHAVKLTGWGTLKGARGKKEKFWIAANSWGKSWGEN 444

Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
           G FKILRG +E  IE  I A   +L
Sbjct: 445 GYFKILRGVNESDIEKLIIAAWGQL 469


>gi|403287831|ref|XP_003935129.1| PREDICTED: dipeptidyl peptidase 1 [Saimiri boliviensis boliviensis]
          Length = 463

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 69/268 (25%), Positives = 102/268 (38%), Gaps = 82/268 (30%)

Query: 83  DLPANFDSRTKWPNCPTIR---EIRDQGSCGSCWG----------------------CRP 117
           +LP ++D    W N   I     +R+Q SCGSC+                         P
Sbjct: 230 NLPTSWD----WRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSP 285

Query: 118 YEIAPCEHHVNG-------------------TRPSCDASKGHTPKCVRECQENYDVPYKK 158
            E+  C  +  G                      +C    G    C  + +E+    Y  
Sbjct: 286 QEVVSCSKYAQGCEGGFPYLIAGKYAQDFGVVEEACFPYTGTDSPC--KMKEDCFRYYSS 343

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
           + ++    Y    NE  +  E+  HGP+  AF V+DD + Y+ G +   G          
Sbjct: 344 EYHYVGGFYG-GCNEALMKLELVHHGPMAVAFEVYDDFLHYRKGIYHHTG---------- 392

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
             +RD          F  F+          L  HA+ ++G+G D  S   YW++ NSW T
Sbjct: 393 --LRD---------PFNPFE----------LTNHAVLLVGYGTDSASGIHYWIVKNSWGT 431

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVP 306
            WG++G F+I RG DEC IES   A  P
Sbjct: 432 SWGEDGYFRIRRGTDECAIESIAVAATP 459


>gi|6009533|dbj|BAA84949.1| tubulointerstitial nephritis antigen [Homo sapiens]
          Length = 476

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 51/145 (35%), Positives = 69/145 (47%), Gaps = 32/145 (22%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           Y VSSNE  IMKEI ++GPV+    V +D   YK+G +                 R  TS
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIY-----------------RHVTS 397

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWGDN 283
                  +            + L  HA+++ GWG     +  KEK+W+ ANSW   WG+N
Sbjct: 398 TNKESEKY------------RKLQTHAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGEN 445

Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
           G F+ILRG +E  IE  I A   +L
Sbjct: 446 GYFRILRGVNESDIEKLIIAAWGQL 470


>gi|444728469|gb|ELW68926.1| Dipeptidyl peptidase 1 [Tupaia chinensis]
          Length = 462

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 63/250 (25%), Positives = 88/250 (35%), Gaps = 79/250 (31%)

Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
           +  +R+Q SCGSC+                         P E+  C  +  G        
Sbjct: 245 VSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQYAQGCEGGFPYL 304

Query: 130 -----------TRPSCDASKGHTPKCV--RECQENYDVPYKKDLNFGAKSYSVSSNEKSI 176
                         SC    G    C   ++C   Y   Y     F         NE  +
Sbjct: 305 IAGKYAQDFGLVEESCFPYTGTDAPCKMKKDCIRYYSSEYHYVGGFYG-----GCNEALM 359

Query: 177 MKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTV 236
             E+  HGP+  AF V+DD + Y+ G +                        G    F  
Sbjct: 360 KLELVHHGPMAVAFEVYDDFLHYQKGIY---------------------QHTGLRDPFNP 398

Query: 237 FDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECG 296
           F+          L  HA+ ++G+G D  S   YW++ NSW T WG++G F+I RG DEC 
Sbjct: 399 FE----------LTNHAVLLVGYGTDLASGMDYWIVKNSWGTSWGEDGFFRIRRGIDECS 448

Query: 297 IESSITAGVP 306
           IES   A  P
Sbjct: 449 IESIAMAATP 458


>gi|11691656|emb|CAC18646.1| cathepsin B-like protease 1 [Giardia intestinalis]
          Length = 303

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 77/286 (26%), Positives = 118/286 (41%), Gaps = 39/286 (13%)

Query: 39  KQAEKNSLSNIPRAHLKSWMGVHPDY------NLPANRLPELIGYSEVDEDLPANFDSRT 92
           K        N+     +S M + PD       +LP   + E+    E+ + +P  FD R 
Sbjct: 32  KAGMPKRFENVTEDEFRS-MLIRPDRLRARSGSLPPISITEV---QELVDPIPPQFDFRD 87

Query: 93  KWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGT-RPSCDASKGHTPKCVRE---C 148
           ++P C  ++   DQGSCG CW      +        G  + +   S+ H   C  E   C
Sbjct: 88  EYPQC--VKPALDQGSCGECWAFSAIGVFGDRRCAMGIDKEAVSYSQQHLISCSLENFGC 145

Query: 149 QENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDD---LILYKSGRFF 205
                 P    L F       ++  + +    Y H        V DD   + LYK+  + 
Sbjct: 146 DGGDFQPTWSFLTFTG-----ATTAECVKYVDYGHTVASPCPAVCDDGSPIQLYKAHGY- 199

Query: 206 VPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKA--------LGGHAIRIL 257
             G  + ++  I   +         +    V+ DL  Y+SG          LG HA+ I+
Sbjct: 200 --GQVSKSVPAIMGMLVAGGP---LQTMIVVYADLSYYESGVYKHTYGTINLGFHALEIV 254

Query: 258 GWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITA 303
           G+G  +   + YW+I NSW  DWG+NG F+I+RG +EC IE  I A
Sbjct: 255 GYGTTDDGTD-YWIIKNSWGPDWGENGYFRIVRGVNECRIEDEIYA 299


>gi|395856779|ref|XP_003800796.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
           [Otolemur garnettii]
          Length = 467

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 51/147 (34%), Positives = 73/147 (49%), Gaps = 32/147 (21%)

Query: 158 KDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLI 217
            D+     +Y + SNEK IMKE+ E+GPV+    V +D  LY+SG +       T +SL 
Sbjct: 335 NDIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLYQSGIY-----SHTPVSLQ 389

Query: 218 KWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGED---EKSKEKYWLIAN 274
           +            EG              +  G H+++I GWGE+   +    KYW  AN
Sbjct: 390 R-----------PEGY-------------RRHGTHSVKITGWGEETLPDGRTLKYWTAAN 425

Query: 275 SWNTDWGDNGLFKILRGKDECGIESSI 301
           SW   WG+ G F+I+RG +EC IES +
Sbjct: 426 SWGPAWGERGHFRIVRGANECDIESFV 452


>gi|47125398|gb|AAH70278.1| Tubulointerstitial nephritis antigen [Homo sapiens]
 gi|190690249|gb|ACE86899.1| tubulointerstitial nephritis antigen protein [synthetic construct]
 gi|190691623|gb|ACE87586.1| tubulointerstitial nephritis antigen protein [synthetic construct]
 gi|312150986|gb|ADQ32005.1| tubulointerstitial nephritis antigen [synthetic construct]
          Length = 476

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 51/145 (35%), Positives = 69/145 (47%), Gaps = 32/145 (22%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           Y VSSNE  IMKEI ++GPV+    V +D   YK+G +                 R  TS
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIY-----------------RHVTS 397

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWGDN 283
                  +            + L  HA+++ GWG     +  KEK+W+ ANSW   WG+N
Sbjct: 398 TNKESEKY------------RKLQTHAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGEN 445

Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
           G F+ILRG +E  IE  I A   +L
Sbjct: 446 GYFRILRGVNESDIEKLIIAAWGQL 470


>gi|300121514|emb|CBK22033.2| unnamed protein product [Blastocystis hominis]
          Length = 476

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 49/147 (33%), Positives = 71/147 (48%), Gaps = 39/147 (26%)

Query: 162 FGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTI 221
           +  + Y      ++IMKEIY HGPV  +  V DDL+ YK G                   
Sbjct: 82  YYVEEYGHVEGVENIMKEIYAHGPVTCSIDVPDDLLEYKGG------------------- 122

Query: 222 RDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWG 281
                         +++D    K+G A  GH I ++GWGE+      YW++ NSW T WG
Sbjct: 123 --------------IYED----KTGIAGDGHDISVVGWGEENGIP--YWIVRNSWGTYWG 162

Query: 282 DNGLFKILRGKDECGIESSITAGVPKL 308
           + G F+I+RGK+  GIE   T G+P++
Sbjct: 163 EEGFFRIVRGKNNLGIEEGCTYGIPRI 189



 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 31/66 (46%), Positives = 44/66 (66%)

Query: 244 KSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITA 303
           + GK LG HA+ + GWG DE+++  YW++ NSW T WG+NG F+I  G++   IE   T 
Sbjct: 410 REGKWLGKHAVEVTGWGVDEETRTPYWIVRNSWGTYWGENGWFRIAMGQNLLNIEQMCTW 469

Query: 304 GVPKLD 309
           GVP +D
Sbjct: 470 GVPVID 475


>gi|349605750|gb|AEQ00879.1| Dipeptidyl-peptidase 1-like protein, partial [Equus caballus]
          Length = 356

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 46/135 (34%), Positives = 64/135 (47%), Gaps = 31/135 (22%)

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           NE  I  E+  HGP+  AF V++D + Y  G +   G            +RD        
Sbjct: 249 NEALIKLELVHHGPMAVAFEVYNDFLHYHDGIYHHTG------------LRD-------- 288

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
             F  F+          L  HA+ ++G+G D  S + YW++ NSW T WG++G F+I RG
Sbjct: 289 -PFNPFE----------LTNHAVLLVGYGTDSASGQDYWIVKNSWGTSWGEDGYFRIRRG 337

Query: 292 KDECGIESSITAGVP 306
            DEC IES   A  P
Sbjct: 338 TDECAIESIAMAATP 352


>gi|395856781|ref|XP_003800797.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
           [Otolemur garnettii]
          Length = 436

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 51/147 (34%), Positives = 73/147 (49%), Gaps = 32/147 (21%)

Query: 158 KDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLI 217
            D+     +Y + SNEK IMKE+ E+GPV+    V +D  LY+SG +       T +SL 
Sbjct: 304 NDIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLYQSGIY-----SHTPVSLQ 358

Query: 218 KWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGED---EKSKEKYWLIAN 274
           +            EG              +  G H+++I GWGE+   +    KYW  AN
Sbjct: 359 R-----------PEGY-------------RRHGTHSVKITGWGEETLPDGRTLKYWTAAN 394

Query: 275 SWNTDWGDNGLFKILRGKDECGIESSI 301
           SW   WG+ G F+I+RG +EC IES +
Sbjct: 395 SWGPAWGERGHFRIVRGANECDIESFV 421


>gi|159115721|ref|XP_001708083.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
 gi|157436192|gb|EDO80409.1| Cathepsin B precursor [Giardia lamblia ATCC 50803]
          Length = 305

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 52/168 (30%), Positives = 74/168 (44%), Gaps = 43/168 (25%)

Query: 132 PSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFT 191
           P      G + +C   CQ+    P +   ++ A S S  SN   IM  +   GPV+  F 
Sbjct: 171 PYTSGETGKSGECPTTCQDG--TPVESAFHYKAASASRLSNYNEIMVSLLADGPVQTGFY 228

Query: 192 VFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKS-GKALG 250
           V +D + Y  G                                      I +K  G +LG
Sbjct: 229 VHEDFLYYVGG--------------------------------------IYHKVYGTSLG 250

Query: 251 GHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
           GHA+ I+G+G    +   YW++ NSW +DWG+NG F+ILRG +ECGIE
Sbjct: 251 GHAVLIVGYGS--MNNHDYWIVRNSWGSDWGENGYFRILRGTNECGIE 296


>gi|53850626|ref|NP_001005549.1| tubulointerstitial nephritis antigen precursor [Rattus norvegicus]
 gi|51858645|gb|AAH81887.1| Tubulointerstitial nephritis antigen [Rattus norvegicus]
 gi|149019129|gb|EDL77770.1| tubulointerstitial nephritis antigen [Rattus norvegicus]
          Length = 475

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 50/147 (34%), Positives = 69/147 (46%), Gaps = 36/147 (24%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSG--RFFVPGNETTAMSLIKWTIRDN 224
           Y +SSNE  IM+EI ++GPV+    V +D   YK+G  R  V  NE              
Sbjct: 354 YRISSNETEIMREIIQNGPVQAIMQVHEDFFYYKTGIYRHVVSTNEEP------------ 401

Query: 225 TSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWG 281
                              +  + L  HA+++ GWG     +  KEK+W+ ANSW   WG
Sbjct: 402 -------------------EKYRKLRTHAVKLTGWGTLRGAQGKKEKFWIAANSWGKSWG 442

Query: 282 DNGLFKILRGKDECGIESSITAGVPKL 308
           +NG F+ILRG +E  IE  I A   +L
Sbjct: 443 ENGYFRILRGVNESDIEKLIIAAWGQL 469


>gi|301775398|ref|XP_002923119.1| PREDICTED: LOW QUALITY PROTEIN: tubulointerstitial nephritis
           antigen-like [Ailuropoda melanoleuca]
          Length = 472

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 51/143 (35%), Positives = 68/143 (47%), Gaps = 36/143 (25%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSG--RFFVPGNETTAMSLIKWTIRDN 224
           Y VSSNE  IMKEI ++GPV+    V +D   YK+G  R     NE ++           
Sbjct: 351 YRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHVTRTNEESSKY--------- 401

Query: 225 TSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKS---KEKYWLIANSWNTDWG 281
                                 + L  HAI++ GWG  + +   KEK+W+ ANSW   WG
Sbjct: 402 ----------------------RKLQTHAIKLTGWGTLKGARGQKEKFWIAANSWGKSWG 439

Query: 282 DNGLFKILRGKDECGIESSITAG 304
           +NG F+ILRG +E  IE  I A 
Sbjct: 440 ENGYFRILRGVNESDIEKLIIAA 462


>gi|363732245|ref|XP_419905.3| PREDICTED: tubulointerstitial nephritis antigen [Gallus gallus]
          Length = 467

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 47/138 (34%), Positives = 64/138 (46%), Gaps = 38/138 (27%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           Y VSS E  IM+EI   GPV+    V++D  LYK G +                      
Sbjct: 360 YRVSSKETDIMEEIMAKGPVQAIMKVYEDFFLYKEGIYRHS------------------- 400

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWGDN 283
                           YK+G     H++++LGWG        K+K+W+ ANSW   WG+N
Sbjct: 401 ----------------YKAGSKWKTHSVKLLGWGSLPGKNGQKQKFWIAANSWGKYWGEN 444

Query: 284 GLFKILRGKDECGIESSI 301
           G F+ILRG++EC IE  I
Sbjct: 445 GYFRILRGQNECDIEKLI 462


>gi|354483193|ref|XP_003503779.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cricetulus
           griseus]
          Length = 475

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 50/147 (34%), Positives = 69/147 (46%), Gaps = 36/147 (24%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSG--RFFVPGNETTAMSLIKWTIRDN 224
           Y VSSNE  IM+EI  +GPV+    V +D   YK+G  R  +  NE +            
Sbjct: 354 YRVSSNETEIMREIIRNGPVQAIMQVHEDFFYYKTGIYRHVISTNEES------------ 401

Query: 225 TSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKS---KEKYWLIANSWNTDWG 281
                              +  + L  HA+++ GWG    +   KEK+W+ ANSW   WG
Sbjct: 402 -------------------EKYRKLRSHAVKLTGWGTLRGAGGKKEKFWIAANSWGKSWG 442

Query: 282 DNGLFKILRGKDECGIESSITAGVPKL 308
           +NG F+ILRG +E  IE  I A   +L
Sbjct: 443 ENGYFRILRGVNESDIEKLIIAAWGQL 469


>gi|328712819|ref|XP_001942906.2| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
           [Acyrthosiphon pisum]
 gi|328712821|ref|XP_003244911.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
           [Acyrthosiphon pisum]
          Length = 463

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 55/174 (31%), Positives = 78/174 (44%), Gaps = 37/174 (21%)

Query: 138 KGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLI 197
           K    +C    + N D   K  L+     Y V++ E+ IM EI   GPV+    V  D  
Sbjct: 301 KETMAQCPSRVRSNNDRTTKTRLHRVGPVYRVAT-EEGIMHEILTSGPVQAVMKVSRDFF 359

Query: 198 LYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRIL 257
           +YKSG +                     S L                SG   G H++RI+
Sbjct: 360 MYKSGVY-------------------KCSNLA---------------SGSRTGYHSVRIV 385

Query: 258 GWGEDEKSKE--KYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKLD 309
           GWGE+ +  +  KYW+ +NSW + WG+NG F+IL+G DEC IE  + A    +D
Sbjct: 386 GWGEEYQGGKIVKYWIASNSWGSWWGENGYFRILKGVDECEIEDFVIAAWADID 439


>gi|296216857|ref|XP_002754752.1| PREDICTED: dipeptidyl peptidase 1 [Callithrix jacchus]
          Length = 460

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 69/268 (25%), Positives = 101/268 (37%), Gaps = 82/268 (30%)

Query: 83  DLPANFDSRTKWPNCPTIR---EIRDQGSCGSCWG----------------------CRP 117
           +LP ++D    W N   I     +R+Q SCGSC+                         P
Sbjct: 227 NLPTSWD----WRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSP 282

Query: 118 YEIAPCEHHVNG-------------------TRPSCDASKGHTPKCVRECQENYDVPYKK 158
            E+  C  +  G                      +C    G    C  + +E+    Y  
Sbjct: 283 QEVVSCSQYAQGCEGGFPYLIAGKYAQDFGVVEEACFPYTGTDSPC--KMKEDCFRYYSS 340

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
           + ++    Y    NE  +  E+  HGP+  AF V+DD + Y  G +   G          
Sbjct: 341 EYHYVGGFYG-GCNEALMKLELVHHGPMAVAFEVYDDFLHYHKGIYHHTG---------- 389

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
             +RD          F  F+          L  HA+ ++G+G D  S   YW++ NSW T
Sbjct: 390 --LRD---------PFNPFE----------LTNHAVLLVGYGTDSASGIHYWIVKNSWGT 428

Query: 279 DWGDNGLFKILRGKDECGIESSITAGVP 306
            WG++G F+I RG DEC IES   A  P
Sbjct: 429 SWGEDGYFRIRRGTDECAIESIAVAATP 456


>gi|1763659|gb|AAB58258.1| cysteine protease [Giardia intestinalis]
          Length = 269

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 76/279 (27%), Positives = 118/279 (42%), Gaps = 39/279 (13%)

Query: 46  LSNIPRAHLKSWMGVHPDY------NLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPT 99
             N+     +S M + PD       +LP   + E+    E+ + +P  FD R ++P C  
Sbjct: 5   FENVTEDEFRS-MLIRPDRLRARSGSLPPISITEV---QELVDPIPPQFDFRDEYPQC-- 58

Query: 100 IREIRDQGSCGSCWGCRPYEIAPCEHHVNGT-RPSCDASKGHTPKCVRE---CQENYDVP 155
           ++   DQGSCG CW      +        G  + +   S+ H   C  E   C      P
Sbjct: 59  VKPALDQGSCGECWAFSAIGVFGDRRCAMGIDKEAVSYSQQHLISCSLENFGCDGGDFQP 118

Query: 156 YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDD---LILYKSGRFFVPGNETT 212
               L F       ++  + +    Y H        V DD   + LYK+  +   G  + 
Sbjct: 119 TWSFLTFTG-----ATTAECVKYVDYGHTVASPCPAVCDDGSPIQLYKAHGY---GQVSK 170

Query: 213 AMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKA--------LGGHAIRILGWGEDEK 264
           ++  I   +    +    +    V+ DL  Y+SG          LG HA+ I+G+G  + 
Sbjct: 171 SVPAIMGML---VAGGPLQTMIVVYADLSYYESGVYKHTYGTINLGFHALEIVGYGTTDD 227

Query: 265 SKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITA 303
             + YW+I NSW  DWG+NG F+I+RG +EC IE  I A
Sbjct: 228 GTD-YWIIKNSWGPDWGENGYFRIVRGVNECRIEDEIYA 265


>gi|308157829|gb|EFO60849.1| Cathepsin B precursor [Giardia lamblia P15]
          Length = 300

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 67/249 (26%), Positives = 108/249 (43%), Gaps = 51/249 (20%)

Query: 82  EDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHT 141
           +D+P +FD R ++P+C  I E+ DQG CGSCW             + G          ++
Sbjct: 73  DDVPESFDFREEYPHC--IPEVVDQGGCGSCWAFSSVATFGDRRCIAGLDKK---PVKYS 127

Query: 142 PKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKS 201
           P+ V  C            N       + +  K + K            T  D+ + Y+S
Sbjct: 128 PQYVVSCDHG---------NMACNGGWLPNAWKFLTK----------TGTTTDECVPYQS 168

Query: 202 GRFFVPG-------------NETTAMSL------IKWTIRDNTSQLGAEGAFTVFDDLIL 242
           G   + G             + TTA S       I   ++  ++    + AF V+ D + 
Sbjct: 169 GSTTLRGTCPTKCADGSSKVHLTTATSYKDYGLDIPAMMKALSTTGPLQVAFLVYSDFMY 228

Query: 243 YKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDEC 295
           Y+SG          GGHA+ ++G+G D+   + YW+I NSW  DWG++G F+++RG ++C
Sbjct: 229 YESGVYQHTYGYMEGGHAVEMVGYGTDDDGVD-YWIIRNSWGPDWGEDGYFRMIRGINDC 287

Query: 296 GIESSITAG 304
            IE    AG
Sbjct: 288 SIEEQAYAG 296


>gi|10803435|emb|CAC13130.1| putative cathepsin B.4 [Ostertagia ostertagi]
          Length = 194

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 40/103 (38%), Positives = 55/103 (53%), Gaps = 1/103 (0%)

Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEK 174
           CRPYEI PC HH              TP+C R+CQ  Y   YKKD  +G K+Y + ++ K
Sbjct: 72  CRPYEITPCGHHGREPYYGECYDDAQTPRCKRKCQSGYKTTYKKDKRYGRKAYQLPNSVK 131

Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRF-FVPGNETTAMSL 216
           +I +EI  HGPV   +TV++D   Y  G +    G ET   ++
Sbjct: 132 AIQREIMMHGPVVAGYTVYEDFSYYTKGIYKHTAGRETGGHAV 174



 Score = 41.6 bits (96), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 15/31 (48%), Positives = 25/31 (80%)

Query: 9  CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
          CG+GC+GG+P  AW+++ + G+V+GG YG +
Sbjct: 39 CGYGCDGGWPIKAWQFFAREGVVTGGNYGRQ 69


>gi|45708820|gb|AAH67941.1| LOC407938 protein, partial [Xenopus (Silurana) tropicalis]
          Length = 470

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 68/279 (24%), Positives = 107/279 (38%), Gaps = 84/279 (30%)

Query: 66  LPANRLPELIGYSEVDEDLPANFDSRTKWPNCP---TIREIRDQGSCGSCWG-------- 114
           +P    P  +   E  + LP  +D    W N      +  +R+Q SCGSC+         
Sbjct: 208 IPMRPRPAPLPTDEKYQGLPTEWD----WRNIAGYNFVTPVRNQASCGSCYAFSSMGMLE 263

Query: 115 --------------CRPYEIAPCEHHVNGTR---PSCDASK-----------------GH 140
                           P ++  C ++  G     P   A K                   
Sbjct: 264 SRIQIRSQLSQKPILSPQQVVSCSNYSQGCEGGFPYLIAGKYVSDYGIVEESDLPYTGSD 323

Query: 141 TPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYK 200
           +P  +++ Q+ Y   Y  + ++    Y    NE  +  E+   GP+  AF V+DD + Y+
Sbjct: 324 SPCTLKDSQQKY---YTAEYHYVGGFYG-GCNEAYMKLELVLGGPLSVAFEVYDDFMHYR 379

Query: 201 SGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWG 260
           SG +                        G +  F  F           L  HA+ ++G+G
Sbjct: 380 SGVY---------------------HHTGLQDKFNPFQ----------LTNHAVLLVGYG 408

Query: 261 EDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIES 299
            D+++ EKYW++ NSW   WG+ G F+I RG DEC IES
Sbjct: 409 TDQQTGEKYWIVKNSWGESWGEKGYFRIRRGTDECAIES 447


>gi|332210919|ref|XP_003254561.1| PREDICTED: LOW QUALITY PROTEIN: dipeptidyl peptidase 1 [Nomascus
           leucogenys]
          Length = 463

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 63/248 (25%), Positives = 94/248 (37%), Gaps = 75/248 (30%)

Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
           +  +R+Q SCGSC+                         P E+  C  +  G        
Sbjct: 246 VSPVRNQASCGSCYSFASVGMLEARIRILTNNSQTPILSPQEVVSCSQYAQGCEGGFPYL 305

Query: 130 -----------TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
                         +C    G    C  + +E+    Y  + ++    Y    NE  +  
Sbjct: 306 TAGKYAQDFGLVEEACFPYTGTDSPC--KMKEDCFRYYSSEYHYVGGFYG-GCNEALMKL 362

Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
           E+  HGP+  AF V+DD + Y+ G +   G            +RD          F  F+
Sbjct: 363 ELVHHGPMAVAFEVYDDFLHYEKGIYHHTG------------LRD---------PFNPFE 401

Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
                     L  HA+ ++G+G D  S   YW++ NSW T WG++G F+I RG DEC IE
Sbjct: 402 ----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGEDGYFRIRRGTDECAIE 451

Query: 299 SSITAGVP 306
           S   A  P
Sbjct: 452 SIAVAATP 459


>gi|332824268|ref|XP_518550.3| PREDICTED: tubulointerstitial nephritis antigen [Pan troglodytes]
          Length = 476

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 51/145 (35%), Positives = 69/145 (47%), Gaps = 32/145 (22%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           Y VSSNE  IMKEI ++GPV+    V +D   YK+G +                 R  TS
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIY-----------------RHVTS 397

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWGDN 283
                  +            + L  HA+++ GWG     +  KEK+W+ ANSW   WG+N
Sbjct: 398 TNKESEKY------------RKLQTHAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGEN 445

Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
           G F+ILRG +E  IE  I A   +L
Sbjct: 446 GYFRILRGVNESDIEKLIIAAWGQL 470


>gi|426353589|ref|XP_004044272.1| PREDICTED: tubulointerstitial nephritis antigen [Gorilla gorilla
           gorilla]
          Length = 476

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 51/145 (35%), Positives = 69/145 (47%), Gaps = 32/145 (22%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           Y VSSNE  IMKEI ++GPV+    V +D   YK+G +                 R  TS
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIY-----------------RHVTS 397

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWGDN 283
                  +            + L  HA+++ GWG     +  KEK+W+ ANSW   WG+N
Sbjct: 398 TNKESEKY------------RKLQTHAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGEN 445

Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
           G F+ILRG +E  IE  I A   +L
Sbjct: 446 GYFRILRGVNESDIEKLIIAAWGQL 470


>gi|75812938|ref|NP_001028789.1| dipeptidyl peptidase 1 precursor [Bos taurus]
 gi|115312125|sp|Q3ZCJ8.1|CATC_BOVIN RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
           AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
           peptidase I; Short=DPP-I; Short=DPPI; AltName:
           Full=Dipeptidyl transferase; Contains: RecName:
           Full=Dipeptidyl peptidase 1 exclusion domain chain;
           AltName: Full=Dipeptidyl peptidase I exclusion domain
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           light chain; AltName: Full=Dipeptidyl peptidase I light
           chain; Flags: Precursor
 gi|73587261|gb|AAI02116.1| Cathepsin C [Bos taurus]
          Length = 463

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 64/248 (25%), Positives = 91/248 (36%), Gaps = 75/248 (30%)

Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
           +  +R+QGSCGSC+                         P E+  C  +  G        
Sbjct: 246 VTPVRNQGSCGSCYSFASMGMMEARIRILTNNTQTPILSPQEVVSCSQYAQGCEGGFPYL 305

Query: 130 -----------TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
                          C    G    C    +E     Y  + ++    Y    NE  +  
Sbjct: 306 IAGKYAQDFGLVEEDCFPYTGTDSPC--RLKEGCFRYYSSEYHYVGGFYG-GCNEALMKL 362

Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
           E+   GP+  AF V+DD + Y+ G +   G            +RD          F  F+
Sbjct: 363 ELVHQGPMAVAFEVYDDFLHYRKGVYHHTG------------LRD---------PFNPFE 401

Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
                     L  HA+ ++G+G D  S   YW++ NSW T WG+NG F+I RG DEC IE
Sbjct: 402 ----------LTNHAVLLVGYGTDAASGLDYWIVKNSWGTSWGENGYFRIRRGTDECAIE 451

Query: 299 SSITAGVP 306
           S   A  P
Sbjct: 452 SIALAATP 459


>gi|355724275|gb|AES08176.1| tubulointerstitial nephritis antigen-like 1 [Mustela putorius furo]
          Length = 454

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 49/149 (32%), Positives = 73/149 (48%), Gaps = 32/149 (21%)

Query: 156 YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMS 215
           +  D+     +Y + SNEK IMKE+ E+GPV+    V +D  LY+SG +       T +S
Sbjct: 320 HANDIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLYQSGIY-----SHTPVS 374

Query: 216 LIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGED---EKSKEKYWLI 272
           L +                         +  +  G H+++I GWGE+   +    KYW  
Sbjct: 375 LGR------------------------PERYRRHGTHSVKITGWGEETLPDGRTLKYWTA 410

Query: 273 ANSWNTDWGDNGLFKILRGKDECGIESSI 301
           ANSW   WG+ G F+I+RG +EC IES +
Sbjct: 411 ANSWGPAWGERGHFRIVRGANECDIESFV 439


>gi|195121981|ref|XP_002005491.1| GI19039 [Drosophila mojavensis]
 gi|193910559|gb|EDW09426.1| GI19039 [Drosophila mojavensis]
          Length = 432

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 75/292 (25%), Positives = 101/292 (34%), Gaps = 89/292 (30%)

Query: 67  PANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW------------- 113
           P  R+  +   S     LP  F++  +W +   I E+ DQG CGS W             
Sbjct: 170 PTYRVKAMTRLSNPSSGLPRKFNAVERWSS--YISEVPDQGWCGSSWVLSTTSVASDRFA 227

Query: 114 ---------GCRPYEIAPCEHHVNGT----------------------------RPSCDA 136
                       P  I  C     G                             R SC  
Sbjct: 228 IQSQGKEVVQLSPQNILSCTRRQQGCEGGHLDAAWRYLHKKGVVDETCYPYTQRRDSCKI 287

Query: 137 SKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDL 196
                      C+  Y V  +  L     +YS+   E  IM EIY  GPV+    V+ D 
Sbjct: 288 RHNSRSLKANGCRPAYGVN-RDSLYTVGPAYSLKG-ETDIMAEIYHSGPVQATMRVYRDF 345

Query: 197 ILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRI 256
             Y  G +                 R   +  GA   F                 H+++I
Sbjct: 346 FSYSGGVY-----------------RQTAANRGAPTGF-----------------HSVKI 371

Query: 257 LGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           +GWGE E    KYW+ ANSW   WG++G F+ILRG +ECGIE  + A  P +
Sbjct: 372 VGWGE-EHDGVKYWIAANSWGPWWGEHGYFRILRGSNECGIEEYVLASWPNV 422


>gi|496968|gb|AAA96831.1| cysteine protease homologue, partial [Ancylostoma caninum]
          Length = 197

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 44/136 (32%), Positives = 66/136 (48%), Gaps = 39/136 (28%)

Query: 141 TPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYK 200
           TPKC + CQ  Y   Y++D +F  ++Y + +NE+SI +EIY++GPV  AF V+ D   YK
Sbjct: 101 TPKCRKTCQRKYYKSYQEDKHFATRAYYLPNNERSIRQEIYKNGPVVAAFRVYQDFSYYK 160

Query: 201 SGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWG 260
            G                                     + ++K G   G HA++++GWG
Sbjct: 161 KG-------------------------------------IYVHKWGGQTGAHAVKVVGWG 183

Query: 261 EDEKSKEKYWLIANSW 276
            +  +   YWLIANSW
Sbjct: 184 RENAT--DYWLIANSW 197


>gi|224586907|ref|NP_055279.3| tubulointerstitial nephritis antigen [Homo sapiens]
 gi|317373501|sp|Q9UJW2.3|TINAG_HUMAN RecName: Full=Tubulointerstitial nephritis antigen; Short=TIN-Ag
 gi|119624842|gb|EAX04437.1| tubulointerstitial nephritis antigen [Homo sapiens]
 gi|189066513|dbj|BAG35763.1| unnamed protein product [Homo sapiens]
          Length = 476

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 51/145 (35%), Positives = 69/145 (47%), Gaps = 32/145 (22%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           Y VSSNE  IMKEI ++GPV+    V +D   YK+G +                 R  TS
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIY-----------------RHVTS 397

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWGDN 283
                  +            + L  HA+++ GWG     +  KEK+W+ ANSW   WG+N
Sbjct: 398 TNKESEKY------------RKLQTHAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGEN 445

Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
           G F+ILRG +E  IE  I A   +L
Sbjct: 446 GYFRILRGVNESDIEKLIIAAWGQL 470


>gi|344293788|ref|XP_003418602.1| PREDICTED: dipeptidyl peptidase 1 [Loxodonta africana]
          Length = 463

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 76/268 (28%), Positives = 106/268 (39%), Gaps = 78/268 (29%)

Query: 84  LPANFDSRTKWPNCPTIR---EIRDQGSCGSCWG----------------------CRPY 118
           LPA++D    W N   I     +R+Q SCGSC+                         P 
Sbjct: 231 LPASWD----WRNVHGINFVSPVRNQASCGSCYSFASMGMLEARLRILTNNSQTPVLSPQ 286

Query: 119 EIAPCEHHVNGTR---PSCDASKGHT------PKCVRECQENYDVPYKKD-LNFGAKSYS 168
           E+  C  +  G     P   A K           C      +     KKD   + +  Y 
Sbjct: 287 EVVSCSQYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYTATDSPCKVKKDCFRYYSSEYH 346

Query: 169 V------SSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
                    NE  +  E+  HGPV  +F V+DD I Y  G +   G            +R
Sbjct: 347 YVGGFYGGCNEALMKLELVNHGPVVVSFEVYDDFIHYHKGIYHHTG------------LR 394

Query: 223 DNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
           D          F  F+          L  HA+ ++G+G D  S   YW++ NSW+  WG+
Sbjct: 395 D---------PFNPFE----------LTNHAVLLVGYGTDSASGLDYWIVKNSWSATWGE 435

Query: 283 NGLFKILRGKDECGIES-SITAG-VPKL 308
           +G F+I RG DECGIES ++TA  +PKL
Sbjct: 436 DGYFRIRRGTDECGIESIALTATPIPKL 463


>gi|428168267|gb|EKX37214.1| hypothetical protein GUITHDRAFT_78289 [Guillardia theta CCMP2712]
          Length = 224

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 54/161 (33%), Positives = 71/161 (44%), Gaps = 38/161 (23%)

Query: 139 GHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLIL 198
           G  P C   C    D   K   + G     +  N + I  EI  +GPV  AF V+ D + 
Sbjct: 100 GGGPACSDVCSLGPDYSVKAS-SLGV----IQDNVRQIQSEILSNGPVFAAFWVYSDFMA 154

Query: 199 YKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILG 258
           Y +G  +    E  A                                GK  GGHA+ ++G
Sbjct: 155 Y-TGGVYSASKEALAQ-------------------------------GKT-GGHAVMMVG 181

Query: 259 WGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIES 299
           WG D+++ + YWL+ NSW+  WGD G FKI RG DECGIES
Sbjct: 182 WGTDKETGQDYWLLQNSWSEKWGDKGRFKIKRGVDECGIES 222


>gi|397517574|ref|XP_003828984.1| PREDICTED: tubulointerstitial nephritis antigen [Pan paniscus]
          Length = 476

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 51/145 (35%), Positives = 69/145 (47%), Gaps = 32/145 (22%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           Y VSSNE  IMKEI ++GPV+    V +D   YK+G +                 R  TS
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIY-----------------RHVTS 397

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWGDN 283
                  +            + L  HA+++ GWG     +  KEK+W+ ANSW   WG+N
Sbjct: 398 TNKESEKY------------RKLQTHAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGEN 445

Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
           G F+ILRG +E  IE  I A   +L
Sbjct: 446 GYFRILRGVNESDIEKLIIAAWGQL 470


>gi|185135783|ref|NP_001117966.1| prepro-cathepsin C precursor [Oncorhynchus mykiss]
 gi|51038277|gb|AAT94060.1| prepro-cathepsin C [Oncorhynchus mykiss]
          Length = 457

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 72/295 (24%), Positives = 116/295 (39%), Gaps = 92/295 (31%)

Query: 67  PANRLPELIGYSEVDEDLP---ANFDSRTKWPNCPTIR---EIRDQGSCGSCWGC----- 115
           PA+ +P  +G + V   L    A    R  W +   +     +R+Q SCGSC+       
Sbjct: 202 PASHIPRRVGPAPVTSTLAKMAAGLPERWDWRDVNGVNYLSPVRNQASCGSCYSFALMGM 261

Query: 116 -----------------RPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENY------ 152
                             P ++  C  +  G    CD   G  P  + +  +++      
Sbjct: 262 LEARVRLQTNNTETPIFSPQQVVSCSQYSQG----CD---GGFPYLIGKYVQDFGIVEES 314

Query: 153 -----------DVP------YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDD 195
                      DVP      Y  D ++    Y   S E ++M E+ ++GP+  AF V+ D
Sbjct: 315 CYPYAGTDSPCDVPDGCLRHYTSDYSYVGGFYGGCS-ESAMMLELVKNGPMGVAFEVYPD 373

Query: 196 LILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIR 255
            + YK G +                        G   ++  F+          L  HA+ 
Sbjct: 374 FMHYKEGIY---------------------HHTGLHDSYNPFE----------LTNHAVL 402

Query: 256 ILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAG--VPKL 308
           ++G+G+   + +K+W++ NSW T WG+ G FK+ RG DEC IES   A   +PKL
Sbjct: 403 LVGYGQCHVTGQKFWVVKNSWGTKWGEEGFFKVRRGSDECAIESIAVAAKPIPKL 457


>gi|32129434|sp|P92132.2|CATB2_GIALA RecName: Full=Cathepsin B-like CP2; AltName: Full=Cathepsin B-like
           protease B2; Flags: Precursor
 gi|11691658|emb|CAC18647.1| cathepsin B-like protease 2 [Giardia intestinalis]
          Length = 300

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 72/255 (28%), Positives = 114/255 (44%), Gaps = 63/255 (24%)

Query: 82  EDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHT 141
           +D+P +FD R ++P+C  I E+ DQG CGSCW                   S  A+ G  
Sbjct: 73  DDVPESFDFREEYPHC--IPEVVDQGGCGSCWAF-----------------SSVATFGD- 112

Query: 142 PKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSI------MKEIYEHGPVEGAFTVFDD 195
               R C    D   KK + +  + Y VS +   +      +  +++     G  T  D+
Sbjct: 113 ----RRCVAGLD---KKPVKYSPQ-YVVSCDHGDMACNGGWLPNVWKFLTKTG--TTTDE 162

Query: 196 LILYKSGRFFVPG-------------NETTAMSL------IKWTIRDNTSQLGAEGAFTV 236
            + YKSG   + G             +  TA S       I   ++  ++    + AF V
Sbjct: 163 CVPYKSGSTTLRGTCPTKCADGSSKVHLATATSYKDYGLDIPAMMKALSTSGPLQVAFLV 222

Query: 237 FDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKIL 289
             D + Y+SG          GGHA+ ++G+G D+   + YW+I NSW  DWG++G F+++
Sbjct: 223 HSDFMYYESGVYQHTYGYMEGGHAVEMVGYGTDDDGVD-YWIIKNSWGPDWGEDGYFRMI 281

Query: 290 RGKDECGIESSITAG 304
           RG ++C IE    AG
Sbjct: 282 RGINDCSIEEQAYAG 296


>gi|197100841|ref|NP_001126804.1| tubulointerstitial nephritis antigen [Pongo abelii]
 gi|55732702|emb|CAH93049.1| hypothetical protein [Pongo abelii]
          Length = 476

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 51/145 (35%), Positives = 69/145 (47%), Gaps = 32/145 (22%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           Y VSSNE  IMKEI ++GPV+    V +D   YK+G +                 R  TS
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIY-----------------RHVTS 397

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWGDN 283
                  +            + L  HA+++ GWG     +  KEK+W+ ANSW   WG+N
Sbjct: 398 TNKESEKY------------RKLQTHAVKLTGWGTLRGAQGQKEKFWVAANSWGKSWGEN 445

Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
           G F+ILRG +E  IE  I A   +L
Sbjct: 446 GYFRILRGVNESDIEKLIIAAWGQL 470


>gi|351712812|gb|EHB15731.1| Dipeptidyl-peptidase 1 [Heterocephalus glaber]
          Length = 462

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 64/248 (25%), Positives = 93/248 (37%), Gaps = 75/248 (30%)

Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
           +  +R+QG CGSC+                         P E+  C  +  G        
Sbjct: 245 VSPVRNQGYCGSCYSFASMGMLEARIRILTNNTQTPILSPQEVVSCSQYAQGCEGGFPYL 304

Query: 130 -----------TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
                         SC    G    C  + +E+    Y  + ++    Y    NE  +  
Sbjct: 305 IAGKYAQDFGFVEESCFPYTGTDAPC--KMKEDCMRYYTSEYHYVGGFYG-GCNEALMKL 361

Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
           E+ +HGP+  AF V DD + Y  G +   G            +RD          F  F+
Sbjct: 362 ELVQHGPMAVAFEVCDDFMHYHKGIYHHTG------------LRD---------PFNPFE 400

Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
                     L  HA+ ++G+G D  +   YW++ NSW T WG+ G F+ILRG DEC IE
Sbjct: 401 ----------LTNHAVLLVGYGTDSANGMDYWIVKNSWGTSWGEKGYFRILRGTDECAIE 450

Query: 299 SSITAGVP 306
           S   A  P
Sbjct: 451 SIAMAATP 458


>gi|296471940|tpg|DAA14055.1| TPA: dipeptidyl peptidase 1 [Bos taurus]
 gi|440894445|gb|ELR46895.1| Dipeptidyl peptidase 1 [Bos grunniens mutus]
          Length = 463

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 64/248 (25%), Positives = 91/248 (36%), Gaps = 75/248 (30%)

Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
           +  +R+QGSCGSC+                         P E+  C  +  G        
Sbjct: 246 VTPVRNQGSCGSCYSFASMGMMEARIRILTNNTQTPILSPQEVVSCSQYAQGCEGGFPYL 305

Query: 130 -----------TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
                          C    G    C    +E     Y  + ++    Y    NE  +  
Sbjct: 306 IAGKYAQDFGLVEEDCFPYTGTDSPC--RLKEGCFRYYSSEYHYVGGFYG-GCNEALMKL 362

Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
           E+   GP+  AF V+DD + Y+ G +   G            +RD          F  F+
Sbjct: 363 ELVHQGPMAVAFEVYDDFLHYRKGVYHHTG------------LRD---------PFNPFE 401

Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
                     L  HA+ ++G+G D  S   YW++ NSW T WG+NG F+I RG DEC IE
Sbjct: 402 ----------LTNHAVLLVGYGTDAASGLDYWIVKNSWGTSWGENGYFRIRRGTDECAIE 451

Query: 299 SSITAGVP 306
           S   A  P
Sbjct: 452 SIALAATP 459


>gi|348570708|ref|XP_003471139.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cavia
           porcellus]
          Length = 468

 Score = 81.6 bits (200), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 48/146 (32%), Positives = 71/146 (48%), Gaps = 32/146 (21%)

Query: 159 DLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIK 218
           D+     +Y + S+EK IMKE+ E+GPV+    V +D  LYK G +       T +S+ +
Sbjct: 337 DIYQVTPAYRLGSDEKEIMKELMENGPVQALMEVHEDFFLYKGGIY-----SHTPLSMAR 391

Query: 219 WTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGED---EKSKEKYWLIANS 275
                                    +  +  G H+++I GWGE+   +    KYW  ANS
Sbjct: 392 ------------------------PEQYRRHGTHSVKITGWGEETLPDGRTLKYWTAANS 427

Query: 276 WNTDWGDNGLFKILRGKDECGIESSI 301
           W   WG+ G F+ILRG +EC IES +
Sbjct: 428 WGPSWGERGHFRILRGSNECDIESFV 453


>gi|363742306|ref|XP_428202.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Gallus
           gallus]
          Length = 464

 Score = 81.6 bits (200), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 47/150 (31%), Positives = 73/150 (48%), Gaps = 32/150 (21%)

Query: 156 YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMS 215
           +  D+     +Y ++ +EK IMKE+ E+GPV+    V +D  LYKSG             
Sbjct: 330 HANDIYQSTPAYRLAPSEKEIMKELMENGPVQAILEVHEDFFLYKSG------------- 376

Query: 216 LIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSK---EKYWLI 272
                I  +T+    +G              +  G H+++I GWGE++      +KYW  
Sbjct: 377 -----IYRHTAVAEGKG-----------PKHQQHGTHSVKITGWGEEQLPDGQVQKYWTA 420

Query: 273 ANSWNTDWGDNGLFKILRGKDECGIESSIT 302
           ANSW   WG++G F+I RG +EC +ES + 
Sbjct: 421 ANSWGRAWGEDGHFRIARGVNECEVESFVV 450


>gi|30038325|dbj|BAC75711.1| cathepsin C [Bos taurus]
          Length = 458

 Score = 81.6 bits (200), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 64/248 (25%), Positives = 91/248 (36%), Gaps = 75/248 (30%)

Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
           +  +R+QGSCGSC+                         P E+  C  +  G        
Sbjct: 241 VTPVRNQGSCGSCYSFASMGMMEARIRILTNNTQTPILSPQEVVSCSQYAQGCEGGFPYL 300

Query: 130 -----------TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
                          C    G    C    +E     Y  + ++    Y    NE  +  
Sbjct: 301 IAGKYAQDFGLVEEDCFPYTGTDSPC--RLKEGCFRYYSSEYHYVGGFYG-GCNEALMKL 357

Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
           E+   GP+  AF V+DD + Y+ G +   G            +RD          F  F+
Sbjct: 358 ELVHQGPMAVAFEVYDDFLHYRKGVYHHTG------------LRD---------PFNPFE 396

Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
                     L  HA+ ++G+G D  S   YW++ NSW T WG+NG F+I RG DEC IE
Sbjct: 397 ----------LTNHAVLLVGYGTDAASGLDYWIVKNSWGTSWGENGYFRIRRGTDECAIE 446

Query: 299 SSITAGVP 306
           S   A  P
Sbjct: 447 SIALAATP 454


>gi|432108509|gb|ELK33225.1| Dipeptidyl peptidase 1 [Myotis davidii]
          Length = 466

 Score = 81.6 bits (200), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 61/248 (24%), Positives = 91/248 (36%), Gaps = 75/248 (30%)

Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
           +  +R+Q SCGSC+                         P E+  C  +  G        
Sbjct: 249 VTPVRNQASCGSCYSFASMGMLEARIRILTNNTQSPILSPQEVVSCSQYAQGCEGGFPYL 308

Query: 130 -----------TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
                         +C    G    C  + +E+    Y  + ++    Y    NE  +  
Sbjct: 309 IAGKYAQDFGLVEEACFPYTGTDSPC--KMKEDCIRYYTSEYHYVGGFYG-GCNEALMKL 365

Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
           E+  HGP+  AF V+DD + Y  G +                        G +  F  F+
Sbjct: 366 ELVHHGPMAVAFEVYDDFLHYNQGIY---------------------HHTGLKDPFNPFE 404

Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
                     L  HA+ ++G+G D K+   YW++ NSW T WG+ G F+I RG DEC IE
Sbjct: 405 ----------LTNHAVLLVGYGTDPKTGLDYWIVKNSWGTSWGEQGYFRIRRGTDECAIE 454

Query: 299 SSITAGVP 306
           S   A  P
Sbjct: 455 SIAMAATP 462


>gi|355724272|gb|AES08175.1| tubulointerstitial nephritis antigen [Mustela putorius furo]
          Length = 476

 Score = 81.6 bits (200), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 48/141 (34%), Positives = 67/141 (47%), Gaps = 32/141 (22%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           Y VSSNE  IMKEI ++GPV+    V +D   YK+G                  I  + +
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTG------------------IYRHVT 396

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWGDN 283
           +   E +             +    HA+++ GWG     +  KEK+W+ ANSW   WG+N
Sbjct: 397 RTNEEAS-----------KYRKFQTHAVKLTGWGTLKGAQGQKEKFWIAANSWGKSWGEN 445

Query: 284 GLFKILRGKDECGIESSITAG 304
           G F+ILRG +E  IE  I A 
Sbjct: 446 GYFRILRGVNESDIEKLIIAA 466


>gi|62510425|sp|Q60HG6.1|CATC_MACFA RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
           AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
           peptidase I; Short=DPP-I; Short=DPPI; AltName:
           Full=Dipeptidyl transferase; Contains: RecName:
           Full=Dipeptidyl peptidase 1 exclusion domain chain;
           AltName: Full=Dipeptidyl peptidase I exclusion domain
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           light chain; AltName: Full=Dipeptidyl peptidase I light
           chain; Flags: Precursor
 gi|52782205|dbj|BAD51949.1| cathepsin C [Macaca fascicularis]
          Length = 463

 Score = 81.6 bits (200), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 46/135 (34%), Positives = 65/135 (48%), Gaps = 31/135 (22%)

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           NE  +  E+  HGP+  AF V+DD + Y++G +   G            +RD        
Sbjct: 356 NEALMKLELVYHGPLAVAFEVYDDFLHYQNGIYHHTG------------LRD-------- 395

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
             F  F+          L  HA+ ++G+G D  S   YW++ NSW T WG++G F+I RG
Sbjct: 396 -PFNPFE----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTSWGEDGYFRIRRG 444

Query: 292 KDECGIESSITAGVP 306
            DEC IES   A  P
Sbjct: 445 TDECAIESIAVAATP 459


>gi|335290878|ref|XP_003127800.2| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Sus scrofa]
          Length = 362

 Score = 81.6 bits (200), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 77/296 (26%), Positives = 112/296 (37%), Gaps = 97/296 (32%)

Query: 69  NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG-------------- 114
           N +  ++G  EV   LP  F++  KWPN   I +  DQG+C   W               
Sbjct: 86  NEIHTVLGPGEV---LPRAFEASEKWPN--LIHDPLDQGNCAGSWAFSTAAVASDRVSIH 140

Query: 115 --------CRPYEIAPCEHH----VNGTR---------------PSCDASKGH------- 140
                     P  +  C+ H      G R                 C    GH       
Sbjct: 141 SLGHMTPVLSPQNLLSCDTHNQQGCQGGRLDGAWWFLRRRGVVSDHCYPFSGHERNEAGP 200

Query: 141 TPKCVREC--------QENYDVP----YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEG 188
            P+C+           Q     P    +  D+     +Y + SNEK IMKE+ E+GPV+ 
Sbjct: 201 APRCMMHSRAMGRGKRQATARCPNSYVHANDIYQVTPAYRLGSNEKDIMKELMENGPVQA 260

Query: 189 AFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKA 248
              V +D  LY+SG +       T +S  +                         +  + 
Sbjct: 261 LMEVHEDFFLYQSGIY-----SHTPVSHGR------------------------PERYRR 291

Query: 249 LGGHAIRILGWGED---EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSI 301
            G H+++I GWGE+   +    KYW  ANSW   WG+ G F+I+RG +EC IES +
Sbjct: 292 HGTHSVKITGWGEETLPDGRMLKYWTAANSWGPGWGERGHFRIVRGANECDIESFV 347


>gi|194213370|ref|XP_001492720.2| PREDICTED: dipeptidyl peptidase 1-like [Equus caballus]
          Length = 478

 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 46/135 (34%), Positives = 64/135 (47%), Gaps = 31/135 (22%)

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           NE  I  E+  HGP+  AF V++D + Y  G +   G            +RD        
Sbjct: 371 NEALIKLELVHHGPMAVAFEVYNDFLHYHDGIYHHTG------------LRD-------- 410

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
             F  F+          L  HA+ ++G+G D  S + YW++ NSW T WG++G F+I RG
Sbjct: 411 -PFNPFE----------LTNHAVLLVGYGTDSASGQDYWIVKNSWGTSWGEDGYFRIRRG 459

Query: 292 KDECGIESSITAGVP 306
            DEC IES   A  P
Sbjct: 460 TDECAIESIAMAATP 474


>gi|47550737|ref|NP_999887.1| dipeptidyl peptidase 1 precursor [Danio rerio]
 gi|39794586|gb|AAH64286.1| Cathepsin C [Danio rerio]
          Length = 455

 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 72/291 (24%), Positives = 113/291 (38%), Gaps = 91/291 (31%)

Query: 67  PANRLPELIGYSEVDED------LPANFDSRTKWPNCPTIREIRDQGSCGSCWGC----- 115
           PA+R+P  +    V  D      LP ++D R        +  +R+Q  CGSC+       
Sbjct: 201 PASRIPRRVRPVTVAADSKAASGLPQHWDWRNV-NGVNFVSPVRNQAQCGSCYSFATMGM 259

Query: 116 -----------------RPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENY------ 152
                             P ++  C  +  G    CD   G  P  + +  +++      
Sbjct: 260 LEARVRIQTNNTQQPVFSPQQVVSCSQYSQG----CD---GGFPYLIGKYIQDFGIVEED 312

Query: 153 -------DVP----------YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDD 195
                  D P          Y  D ++    Y   S E ++M E+ ++GP+  A  V+ D
Sbjct: 313 CFPYTGSDSPCNLPAKCTKYYASDYHYVGGFYGGCS-ESAMMLELVKNGPMGVALEVYPD 371

Query: 196 LILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIR 255
            + YK G +   G            +RD  +                      L  HA+ 
Sbjct: 372 FMNYKEGIYHHTG------------LRDANNPF-------------------ELTNHAVL 400

Query: 256 ILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           ++G+G+  K+ EKYW++ NSW + WG+NG F+I RG DEC IES   A  P
Sbjct: 401 LVGYGQCHKTGEKYWIVKNSWGSGWGENGFFRIRRGTDECAIESIAVAATP 451


>gi|157058739|gb|ABV03127.1| cathepsin B-2744 [Acyrthosiphon pisum]
          Length = 260

 Score = 81.3 bits (199), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 52/177 (29%), Positives = 84/177 (47%), Gaps = 45/177 (25%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDA-SKGHTPKCVREC-QENYDVPYKKDLNFGAKSYSVS- 170
           GC+PY+  PC+H+ +    +C +  +     C ++C  +NY V Y+ DL+  +  Y  S 
Sbjct: 124 GCQPYKNRPCDHYGDSRLTNCSSLRRTQMTVCRKKCVNKNYKVKYEDDLHKTSIVYMTSW 183

Query: 171 SNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA 230
           +N K I +EI  +GPV     V+++ + YK G                            
Sbjct: 184 TNVKQIQQEIMTYGPVTAFMYVYENFMGYKEG---------------------------- 215

Query: 231 EGAFTVFDDLILYKS--GKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
                      +YKS  G+ +G H ++++GWG D    E YWL  NSWN++WG++GL
Sbjct: 216 -----------IYKSTTGELIGYHHVKLIGWGVDGDGTE-YWLAMNSWNSNWGNDGL 260


>gi|256074073|ref|XP_002573351.1| dipeptidyl-peptidase I (C01 family) [Schistosoma mansoni]
 gi|360043488|emb|CCD78901.1| putative cathepsin C [Schistosoma mansoni]
          Length = 455

 Score = 81.3 bits (199), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 76/282 (26%), Positives = 106/282 (37%), Gaps = 92/282 (32%)

Query: 67  PANRLPELIGYSEVDEDLPANFDSRTKWPNCPT-----IREIRDQGSCGSCWG------- 114
           P+  L  L G      +LP  FD    W + P      +  IR+QG CGSC+        
Sbjct: 208 PSKELISLTG------NLPLEFD----WTSPPDGSRSPVTPIRNQGICGSCYAFASAAAL 257

Query: 115 ---------------CRPYEIAPCEHH---VNGTRP----------------SCDASKGH 140
                            P  +  C  +    NG  P                +CD   G 
Sbjct: 258 EARIRLVSNFSEQPILSPQAVVDCSPYSEGCNGGFPFLIAGKYGEDFGFVSENCDPYTGE 317

Query: 141 -TPKCV--RECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLI 197
            T KC   + C   Y   Y          Y  ++NEK +  E+  +GP    F V++D  
Sbjct: 318 DTGKCTVSKNCTRYYTTDYSY-----IGGYYGATNEKLMQLELISNGPFPVGFEVYEDFQ 372

Query: 198 LYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRIL 257
            YK G                  I  +T+       F  F+          L  HA+ ++
Sbjct: 373 FYKEG------------------IYHHTTVQNDHYNFNPFE----------LTNHAVLLV 404

Query: 258 GWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIES 299
           G+G D+ S E YW + NSW  +WG+ G F+ILRG DECG+ES
Sbjct: 405 GYGVDKLSGEPYWKVKNSWGVEWGEQGYFRILRGTDECGVES 446


>gi|350596935|ref|XP_001927698.4| PREDICTED: tubulointerstitial nephritis antigen, partial [Sus
           scrofa]
          Length = 368

 Score = 81.3 bits (199), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 50/147 (34%), Positives = 68/147 (46%), Gaps = 36/147 (24%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSG--RFFVPGNETTAMSLIKWTIRDN 224
           Y VSSNE  IM+EI ++GPV+    V +D   YK+G  R     NE +            
Sbjct: 247 YRVSSNETEIMREIMQNGPVQAIMQVHEDFFHYKTGIYRHVTSTNEES------------ 294

Query: 225 TSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWG 281
                                 + L  HA+++ GWG     +  KEK+W+ ANSW   WG
Sbjct: 295 -------------------DKYRKLRTHAVKLTGWGTLKGAQGRKEKFWIAANSWGKSWG 335

Query: 282 DNGLFKILRGKDECGIESSITAGVPKL 308
           +NG F+ILRG +E  IE  I A   +L
Sbjct: 336 ENGYFRILRGVNESDIEKLIIAAWGQL 362


>gi|344250687|gb|EGW06791.1| Dipeptidyl-peptidase 1 [Cricetulus griseus]
          Length = 483

 Score = 81.3 bits (199), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 46/135 (34%), Positives = 65/135 (48%), Gaps = 31/135 (22%)

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           NE  +  E+ +HGP+  AF V DD + Y SG +   G            +RD        
Sbjct: 376 NEALMKLELVQHGPMAVAFEVQDDFLHYHSGIYHHTG------------LRD-------- 415

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
             F  F+          L  HA+ ++G+G D  +   YW + NSW T+WG++G F+I RG
Sbjct: 416 -PFNPFE----------LTNHAVLLVGYGRDPDTGTDYWTVKNSWGTEWGESGYFRIRRG 464

Query: 292 KDECGIESSITAGVP 306
            DEC IES   A +P
Sbjct: 465 TDECAIESIAVAAIP 479


>gi|343459017|gb|AEM37667.1| cathepsin C subunit [Epinephelus bruneus]
          Length = 106

 Score = 81.3 bits (199), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 45/136 (33%), Positives = 69/136 (50%), Gaps = 33/136 (24%)

Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
           ++M E+ ++GP+  AF V+ D ++YK G +                        G   +F
Sbjct: 2   AMMLELVKNGPMAVAFEVYPDFMIYKEGIY---------------------HHTGLADSF 40

Query: 235 TVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE 294
             F+          L  HA+ ++G+G   K+ +KYW++ NSW TDWG++G F+I RG DE
Sbjct: 41  NPFE----------LTNHAVLLVGYGRCHKTGQKYWIVKNSWGTDWGEDGYFRIRRGSDE 90

Query: 295 CGIESSITAG--VPKL 308
           C IES   A   +PKL
Sbjct: 91  CSIESIAVAANPIPKL 106


>gi|395815757|ref|XP_003781389.1| PREDICTED: dipeptidyl peptidase 1 [Otolemur garnettii]
          Length = 575

 Score = 81.3 bits (199), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 66/266 (24%), Positives = 95/266 (35%), Gaps = 80/266 (30%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCWG----------------------CRPYEIA 121
           LPA++D R        +  +R+Q SCGSC+                         P E+ 
Sbjct: 343 LPASWDWRNVH-GVNYVSPVRNQESCGSCYSFASVGMLEARIRILTNNTQTPILSPQEVV 401

Query: 122 PCEHHVNG-------------------TRPSCDASKGHTPKCVRE--CQENYDVPYKKDL 160
            C  +  G                      +C    G    C  +  C+  Y   Y    
Sbjct: 402 SCSQYAQGCEGGFPYLVAGKHAQDFGLVEEACFPYTGTDAPCTMKEGCRRYYSSEYHYVG 461

Query: 161 NFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWT 220
            F         NE  +  E+  HGP+  AF V+DD + Y  G +                
Sbjct: 462 GFYG-----GCNEALMKLELVHHGPMAVAFEVYDDFLHYHRGIY---------------- 500

Query: 221 IRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDW 280
                   G    F  F+          L  HA+ ++G+G D  +  +YW++ NSW T W
Sbjct: 501 -----HHTGLTDPFNPFE----------LTNHAVLLVGYGTDSATGIQYWIVKNSWGTGW 545

Query: 281 GDNGLFKILRGKDECGIESSITAGVP 306
           G++G F+I RG DEC IES   A  P
Sbjct: 546 GEDGYFRIRRGTDECAIESIAVAATP 571


>gi|327282776|ref|XP_003226118.1| PREDICTED: tubulointerstitial nephritis antigen-like [Anolis
           carolinensis]
          Length = 476

 Score = 81.3 bits (199), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 46/140 (32%), Positives = 71/140 (50%), Gaps = 32/140 (22%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           Y +SS +  IMKEI E+GPV+    V+DD        FF+     + +    W++   T 
Sbjct: 361 YRISSQDADIMKEIKENGPVQAVMQVYDD--------FFL---YKSGIYKHIWSLEGKTQ 409

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWG---EDEKSKEKYWLIANSWNTDWGDN 283
               +                    H+I+I+GWG   + E  ++K+W+ ANSW   WG+N
Sbjct: 410 NRHQKKP------------------HSIKIVGWGTLRDAEGQRQKFWIAANSWGNSWGEN 451

Query: 284 GLFKILRGKDECGIESSITA 303
           G F+ILRG++EC IE ++ A
Sbjct: 452 GYFRILRGQNECDIEKTVIA 471


>gi|296198446|ref|XP_002746707.1| PREDICTED: tubulointerstitial nephritis antigen [Callithrix
           jacchus]
          Length = 476

 Score = 81.3 bits (199), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 51/145 (35%), Positives = 69/145 (47%), Gaps = 32/145 (22%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           Y VSS+E  IMKEI ++GPV+    V +D   YK+G +                 R  TS
Sbjct: 355 YRVSSSETEIMKEIMQNGPVQAIMKVHEDFFHYKTGIY-----------------RHVTS 397

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWGDN 283
                  F            + L  HA+++ GWG     +  KEK+W+ ANSW   WG+N
Sbjct: 398 TNKESEKF------------QKLQTHAVKLTGWGTLRGAQGRKEKFWIAANSWGKSWGEN 445

Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
           G F+ILRG +E  IE  I A   +L
Sbjct: 446 GYFRILRGVNESDIEKLIIAAWGQL 470


>gi|431891156|gb|ELK02033.1| Tubulointerstitial nephritis antigen-like protein [Pteropus alecto]
          Length = 467

 Score = 81.3 bits (199), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 49/149 (32%), Positives = 72/149 (48%), Gaps = 32/149 (21%)

Query: 156 YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMS 215
           +  D+     +Y + SNEK IMKE+ E+GPV+    V +D  LY+ G +       T +S
Sbjct: 333 HANDIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLYQGGIY-----SHTPVS 387

Query: 216 LIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGED---EKSKEKYWLI 272
           L K                         +  +  G H+++I GWGE+   +    KYW  
Sbjct: 388 LGK------------------------PERYRRHGTHSVKITGWGEETLPDGRTLKYWTA 423

Query: 273 ANSWNTDWGDNGLFKILRGKDECGIESSI 301
           ANSW   WG+ G F+I+RG +EC IES +
Sbjct: 424 ANSWGPAWGERGHFRIVRGTNECDIESFV 452


>gi|354498051|ref|XP_003511129.1| PREDICTED: LOW QUALITY PROTEIN: dipeptidyl peptidase 1-like
           [Cricetulus griseus]
          Length = 470

 Score = 81.3 bits (199), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 46/135 (34%), Positives = 65/135 (48%), Gaps = 31/135 (22%)

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           NE  +  E+ +HGP+  AF V DD + Y SG +   G            +RD        
Sbjct: 363 NEALMKLELVQHGPMAVAFEVQDDFLHYHSGIYHHTG------------LRD-------- 402

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
             F  F+          L  HA+ ++G+G D  +   YW + NSW T+WG++G F+I RG
Sbjct: 403 -PFNPFE----------LTNHAVLLVGYGRDPDTGTDYWTVKNSWGTEWGESGYFRIRRG 451

Query: 292 KDECGIESSITAGVP 306
            DEC IES   A +P
Sbjct: 452 TDECAIESIAVAAIP 466


>gi|26340150|dbj|BAC33738.1| unnamed protein product [Mus musculus]
          Length = 462

 Score = 80.9 bits (198), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 44/135 (32%), Positives = 64/135 (47%), Gaps = 31/135 (22%)

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           NE  +  E+ +HGP+  AF V DD + Y SG +                        G  
Sbjct: 355 NEALMKLELVKHGPMAVAFEVHDDFLHYHSGIY---------------------HHTGLS 393

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
             F  F+          L  HA+ ++G+G D  +  +YW+I NSW ++WG++G F+I RG
Sbjct: 394 DPFNPFE----------LTNHAVLLVGYGRDPVTGIEYWIIKNSWGSNWGESGYFRIRRG 443

Query: 292 KDECGIESSITAGVP 306
            DEC IES   A +P
Sbjct: 444 TDECAIESIAVAAIP 458


>gi|160707990|ref|NP_034112.3| dipeptidyl peptidase 1 preproprotein [Mus musculus]
 gi|3023454|sp|P97821.1|CATC_MOUSE RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
           AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
           peptidase I; Short=DPP-I; Short=DPPI; AltName:
           Full=Dipeptidyl transferase; Contains: RecName:
           Full=Dipeptidyl peptidase 1 exclusion domain chain;
           AltName: Full=Dipeptidyl peptidase I exclusion domain
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           light chain; AltName: Full=Dipeptidyl peptidase I light
           chain; Flags: Precursor
 gi|1881656|gb|AAB49457.1| preprodipeptidyl peptidase I [Mus musculus]
 gi|7609786|gb|AAB58400.3| dipeptidyl peptidase I precursor [Mus musculus]
 gi|45219895|gb|AAH67063.1| Cathepsin C [Mus musculus]
 gi|74147157|dbj|BAE27487.1| unnamed protein product [Mus musculus]
 gi|74178079|dbj|BAE29829.1| unnamed protein product [Mus musculus]
 gi|148674849|gb|EDL06796.1| cathepsin C, isoform CRA_b [Mus musculus]
          Length = 462

 Score = 80.9 bits (198), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 44/135 (32%), Positives = 64/135 (47%), Gaps = 31/135 (22%)

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           NE  +  E+ +HGP+  AF V DD + Y SG +                        G  
Sbjct: 355 NEALMKLELVKHGPMAVAFEVHDDFLHYHSGIY---------------------HHTGLS 393

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
             F  F+          L  HA+ ++G+G D  +  +YW+I NSW ++WG++G F+I RG
Sbjct: 394 DPFNPFE----------LTNHAVLLVGYGRDPVTGIEYWIIKNSWGSNWGESGYFRIRRG 443

Query: 292 KDECGIESSITAGVP 306
            DEC IES   A +P
Sbjct: 444 TDECAIESIAVAAIP 458


>gi|407196042|gb|AFT64209.1| putative cathepsin C3, partial [Eimeria tenella]
          Length = 595

 Score = 80.9 bits (198), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 60/195 (30%), Positives = 85/195 (43%), Gaps = 38/195 (19%)

Query: 120 IAPCEHHV-NGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
           +APC  H+ N  R   +++ G  P+         D  Y ++ N+    Y    NE+ IM+
Sbjct: 407 VAPCLMHLGNFLRSPAESAPGCAPE---------DRWYAQEYNYVGGFYE-GCNEEKIME 456

Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
           EIY HGPV  A    D L LY+ G F V  ++   +                       D
Sbjct: 457 EIYNHGPVVAALDAPDALFLYEDGFFDVKPSDHGKLC----------------------D 494

Query: 239 DLILYKSGKALGGHAIRILGWGEDEK-----SKEKYWLIANSWNTDWGDNGLFKILRGKD 293
                 +G     HAI I+GWGED       +  K+W++ N+W  DWG NG  K+ RG++
Sbjct: 495 SPNKGLTGWEYTNHAIAIVGWGEDPPRMPGMTTRKFWVVRNTWGNDWGRNGYIKMKRGEN 554

Query: 294 ECGIESSITAGVPKL 308
              IES   A  P L
Sbjct: 555 LAAIESQAVAIDPDL 569


>gi|328722316|ref|XP_003247542.1| PREDICTED: cathepsin B-like isoform 2 [Acyrthosiphon pisum]
 gi|328722318|ref|XP_003247543.1| PREDICTED: cathepsin B-like isoform 3 [Acyrthosiphon pisum]
          Length = 276

 Score = 80.9 bits (198), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 58/196 (29%), Positives = 84/196 (42%), Gaps = 51/196 (26%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDA-SKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSN 172
           GC+P +I P           C+  +K +   CV  C  N  + Y  D       Y     
Sbjct: 130 GCQPSKIPPV----------CNLPTKINKRTCVDYCYGNDTIKYNHD--HVKVRYYYHVK 177

Query: 173 EKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEG 232
            K I KE+  +GPV  A  ++DD+ L+KS                              G
Sbjct: 178 PKDIQKEVQTYGPVTAALNLYDDIFLHKS------------------------------G 207

Query: 233 AFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
            +T      L K+ K +    ++++GWG +  +   YWL+ NSW  +WG NGL KI RGK
Sbjct: 208 VYT------LTKNAKYVRLQYVKLIGWGVE--NGVDYWLLVNSWGNEWGQNGLLKIKRGK 259

Query: 293 DECGIESSITAGVPKL 308
             C +ES + A VPK+
Sbjct: 260 YGCAVESFVYAAVPKI 275


>gi|449498128|ref|XP_002193225.2| PREDICTED: tubulointerstitial nephritis antigen [Taeniopygia
           guttata]
          Length = 469

 Score = 80.9 bits (198), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 78/285 (27%), Positives = 116/285 (40%), Gaps = 44/285 (15%)

Query: 54  LKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
            K  +G  P  +   N + E+ G S  +E  PA F +  +WP    I +  DQ +CG+ W
Sbjct: 193 FKKRLGTFPPSHSLLN-MREVPGKSLPEEKFPAIFSAIYEWPE--WIHDPLDQRNCGASW 249

Query: 114 GCRPYEIAP--CEHHVNGTRP---------SCDASKGH------TPKCVRECQENYDVPY 156
                 +A      H  G            SCD    H           R  + +  V Y
Sbjct: 250 AFSTASVAADRIAIHSKGQITDNLSAQNLISCDTRNQHGCNGGSIDGAWRYLKTHGVVSY 309

Query: 157 KKDLNFGAKSYSVSSNEKSIMKEIY----EHGPVEGAFTVFDDLILYKSGRFFVPGNETT 212
               +F  K    S+  +  +   Y     +GP   AF   + L    S  + V   ET 
Sbjct: 310 ACYPSFWNKHLGPSAENQCYVSNEYGKNHTNGPCPNAFEKSNRLYRCAS-HYRVSSKETD 368

Query: 213 AMSLIKWTIRDNTSQLGAEGAFTVFDDLILY---------KSGKALGGHAIRILGWG--- 260
            M  IK        +   +    V++D  LY         K+G     H++++LGWG   
Sbjct: 369 IMKEIK-------DRGPVQAIMKVYEDFFLYKEGIYQHSQKAGSKWKTHSVKLLGWGALP 421

Query: 261 EDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGV 305
           +    K+K+W+ ANSW   WG+NG F+ILRG++EC IE  I A +
Sbjct: 422 DKNGQKQKFWIAANSWGKSWGENGYFRILRGQNECDIEKLILATL 466


>gi|74199074|dbj|BAE30750.1| unnamed protein product [Mus musculus]
          Length = 447

 Score = 80.9 bits (198), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 44/135 (32%), Positives = 64/135 (47%), Gaps = 31/135 (22%)

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           NE  +  E+ +HGP+  AF V DD + Y SG +                        G  
Sbjct: 340 NEALMKLELVKHGPMAVAFEVHDDFLHYHSGIY---------------------HHTGLS 378

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
             F  F+          L  HA+ ++G+G D  +  +YW+I NSW ++WG++G F+I RG
Sbjct: 379 DPFNPFE----------LTNHAVLLVGYGRDPVTGIEYWIIKNSWGSNWGESGYFRIRRG 428

Query: 292 KDECGIESSITAGVP 306
            DEC IES   A +P
Sbjct: 429 TDECAIESIAVAAIP 443


>gi|74191569|dbj|BAE30359.1| unnamed protein product [Mus musculus]
          Length = 462

 Score = 80.9 bits (198), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 44/135 (32%), Positives = 64/135 (47%), Gaps = 31/135 (22%)

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           NE  +  E+ +HGP+  AF V DD + Y SG +                        G  
Sbjct: 355 NEALMKLELVKHGPMAVAFEVHDDFLHYHSGIY---------------------HHTGLS 393

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
             F  F+          L  HA+ ++G+G D  +  +YW+I NSW ++WG++G F+I RG
Sbjct: 394 DPFNPFE----------LTNHAVLLVGYGRDPVTGIEYWIIKNSWGSNWGESGYFRIRRG 443

Query: 292 KDECGIESSITAGVP 306
            DEC IES   A +P
Sbjct: 444 TDECAIESIAVAAIP 458


>gi|256086900|ref|XP_002579622.1| cathepsin B (C01 family) [Schistosoma mansoni]
          Length = 204

 Score = 80.9 bits (198), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 61/236 (25%), Positives = 94/236 (39%), Gaps = 86/236 (36%)

Query: 73  ELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRP 132
           + I +  V+  +P  FD+R  W NC TI++I D+  C + W                   
Sbjct: 55  QTISHRNVNMVIPHTFDARDHWVNCSTIKQIHDECCCRADW------------------- 95

Query: 133 SCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTV 192
                                           K Y+V ++++ I KEI  +GPV  +  V
Sbjct: 96  -----------------------------VSEKIYNVYADQEDIQKEILMNGPVIASILV 126

Query: 193 FDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGH 252
             D ++YKSG +F P  +++                                    LG  
Sbjct: 127 KVDFLVYKSGVYF-PTPKSSN-----------------------------------LGWI 150

Query: 253 AIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
            +RI+GWG + K+   YWL ANSW+ +WG+NG  K+ RG     IES + A +PK+
Sbjct: 151 NLRIIGWGYEGKTP--YWLCANSWSKEWGENGYVKVRRGVQAGYIESYVRAPIPKI 204


>gi|74204274|dbj|BAE39895.1| unnamed protein product [Mus musculus]
          Length = 462

 Score = 80.9 bits (198), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 44/135 (32%), Positives = 64/135 (47%), Gaps = 31/135 (22%)

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           NE  +  E+ +HGP+  AF V DD + Y SG +                        G  
Sbjct: 355 NEALMKLELVKHGPMAVAFEVHDDFLHYHSGIY---------------------HHTGLS 393

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
             F  F+          L  HA+ ++G+G D  +  +YW+I NSW ++WG++G F+I RG
Sbjct: 394 DPFNPFE----------LTNHAVLLVGYGRDPVTGIEYWIIKNSWGSNWGESGYFRIRRG 443

Query: 292 KDECGIESSITAGVP 306
            DEC IES   A +P
Sbjct: 444 TDECAIESIAVAAIP 458


>gi|78042562|ref|NP_001030279.1| tubulointerstitial nephritis antigen [Bos taurus]
 gi|108861910|sp|Q3SZI1.1|TINAG_BOVIN RecName: Full=Tubulointerstitial nephritis antigen; Short=TIN-Ag
 gi|74354008|gb|AAI02844.1| Tubulointerstitial nephritis antigen [Bos taurus]
 gi|296474572|tpg|DAA16687.1| TPA: tubulointerstitial nephritis antigen [Bos taurus]
          Length = 476

 Score = 80.9 bits (198), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 49/147 (33%), Positives = 68/147 (46%), Gaps = 36/147 (24%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSG--RFFVPGNETTAMSLIKWTIRDN 224
           Y VSSNE  IM+EI ++GPV+    V +D   YK+G  R     NE +            
Sbjct: 355 YRVSSNETEIMREIMQNGPVQAIMQVHEDFFNYKTGIYRHITSTNEDS------------ 402

Query: 225 TSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWG 281
                              +  +    HA+++ GWG     +  KEK+W+ ANSW   WG
Sbjct: 403 -------------------EKYRKFRTHAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWG 443

Query: 282 DNGLFKILRGKDECGIESSITAGVPKL 308
           +NG F+ILRG +E  IE  I A   +L
Sbjct: 444 ENGYFRILRGVNESDIEKLIIAAWGQL 470


>gi|440907441|gb|ELR57591.1| Tubulointerstitial nephritis antigen [Bos grunniens mutus]
          Length = 476

 Score = 80.9 bits (198), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 49/147 (33%), Positives = 68/147 (46%), Gaps = 36/147 (24%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSG--RFFVPGNETTAMSLIKWTIRDN 224
           Y VSSNE  IM+EI ++GPV+    V +D   YK+G  R     NE +            
Sbjct: 355 YRVSSNETEIMREIMQNGPVQAIMQVHEDFFNYKTGIYRHITSTNEDS------------ 402

Query: 225 TSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWG 281
                              +  +    HA+++ GWG     +  KEK+W+ ANSW   WG
Sbjct: 403 -------------------EKYRKFRTHAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWG 443

Query: 282 DNGLFKILRGKDECGIESSITAGVPKL 308
           +NG F+ILRG +E  IE  I A   +L
Sbjct: 444 ENGYFRILRGVNESDIEKLIIAAWGQL 470


>gi|12832450|dbj|BAB22112.1| unnamed protein product [Mus musculus]
          Length = 461

 Score = 80.9 bits (198), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 44/135 (32%), Positives = 64/135 (47%), Gaps = 31/135 (22%)

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           NE  +  E+ +HGP+  AF V DD + Y SG +                        G  
Sbjct: 354 NEALMKLELVKHGPMAVAFEVHDDFLHYHSGIY---------------------HHTGLS 392

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
             F  F+          L  HA+ ++G+G D  +  +YW+I NSW ++WG++G F+I RG
Sbjct: 393 DPFNPFE----------LTNHAVLLVGYGRDPVTGIEYWIIKNSWGSNWGESGYFRIRRG 442

Query: 292 KDECGIESSITAGVP 306
            DEC IES   A +P
Sbjct: 443 TDECAIESIAVAAIP 457


>gi|353228747|emb|CCD74918.1| cathepsin B (C01 family) [Schistosoma mansoni]
          Length = 229

 Score = 80.9 bits (198), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 61/236 (25%), Positives = 94/236 (39%), Gaps = 86/236 (36%)

Query: 73  ELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRP 132
           + I +  V+  +P  FD+R  W NC TI++I D+  C + W                   
Sbjct: 80  QTISHRNVNMVIPHTFDARDHWVNCSTIKQIHDECCCRADW------------------- 120

Query: 133 SCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTV 192
                                           K Y+V ++++ I KEI  +GPV  +  V
Sbjct: 121 -----------------------------VSEKIYNVYADQEDIQKEILMNGPVIASILV 151

Query: 193 FDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGH 252
             D ++YKSG +F P  +++                                    LG  
Sbjct: 152 KVDFLVYKSGVYF-PTPKSSN-----------------------------------LGWI 175

Query: 253 AIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
            +RI+GWG + K+   YWL ANSW+ +WG+NG  K+ RG     IES + A +PK+
Sbjct: 176 NLRIIGWGYEGKTP--YWLCANSWSKEWGENGYVKVRRGVQAGYIESYVRAPIPKI 229


>gi|193610664|ref|XP_001948185.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
          Length = 324

 Score = 80.9 bits (198), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 59/195 (30%), Positives = 82/195 (42%), Gaps = 49/195 (25%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
           GC+P +I P  +            K +   C   C  N  + Y  D      SY+     
Sbjct: 179 GCQPSKIPPIFNL---------PKKIYNRTCDNFCYGNSLIDYNHD--HVKVSYTYHVLY 227

Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
           K+I +E+  +GPV   F+++DDL LY SG +                            A
Sbjct: 228 KNIQREVQTYGPVSAYFSLYDDLFLYTSGVY----------------------------A 259

Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
            T     + Y+S K        ++GWG +  +   YWL+ NSW  +WG NGLFKI RG D
Sbjct: 260 RTEKSKFVRYQSAK--------LIGWGVE--NGVDYWLLVNSWGNEWGQNGLFKIKRGTD 309

Query: 294 ECGIESSITAGVPKL 308
           EC       AGVPK+
Sbjct: 310 ECQFGRHTYAGVPKM 324


>gi|74212565|dbj|BAE31022.1| unnamed protein product [Mus musculus]
          Length = 191

 Score = 80.9 bits (198), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 44/135 (32%), Positives = 64/135 (47%), Gaps = 31/135 (22%)

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           NE  +  E+ +HGP+  AF V DD + Y SG +                        G  
Sbjct: 84  NEALMELELVKHGPMAVAFEVHDDFLHYHSGIY---------------------HHTGLS 122

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
             F  F+          L  HA+ ++G+G D  +  +YW+I NSW ++WG++G F+I RG
Sbjct: 123 DPFNPFE----------LTNHAVLLVGYGRDPVTGIEYWIIKNSWGSNWGESGYFRIRRG 172

Query: 292 KDECGIESSITAGVP 306
            DEC IES   A +P
Sbjct: 173 TDECAIESIAVAAIP 187


>gi|253742295|gb|EES99137.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 315

 Score = 80.5 bits (197), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 70/293 (23%), Positives = 109/293 (37%), Gaps = 98/293 (33%)

Query: 61  HPDYNLPANRLPELIGYSE--VDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGC--- 115
           H  + L AN    L G +E  ++ D P +FD R ++P C  +    DQG CGSCW     
Sbjct: 56  HLVHFLDANAHSHLAGRTEKNINYDYPESFDFREEYPQC--LLPTYDQGHCGSCWAFASS 113

Query: 116 -------------------RPYEIAPCEHHVNGTR----------------------PSC 134
                               P  +  C     G                        P  
Sbjct: 114 RAFGDTRCMQGLDPVPVLYSPQYLVSCSLQNMGCTGGTMEDVGDFLRDTGIATDTCVPYV 173

Query: 135 DASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFD 194
           D    H   C   C +   +   + ++F         N +++M+ I  +GP+  +  +++
Sbjct: 174 D-EDAHWEPCPVSCVDGSPIRTVQLMDF----VRYDGNLEAMMEAIAMNGPIHASMMIYE 228

Query: 195 DLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAI 254
           D + Y+SG +                                     +Y SG   G HAI
Sbjct: 229 DFMYYQSGIYH-----------------------------------FIYGSG--CGMHAI 251

Query: 255 RILGWGED--------EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIES 299
            ++G+G D        E+ +  YW+  NSW  DWG+NG F+I+RG +ECGIE+
Sbjct: 252 ELVGYGTDISGDSEAGEEVRVDYWIARNSWGEDWGENGYFRIVRGNNECGIEN 304


>gi|110456454|gb|ABG74712.1| cathepsin B preproprotein-like protein [Diaphorina citri]
          Length = 125

 Score = 80.5 bits (197), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 52/165 (31%), Positives = 75/165 (45%), Gaps = 55/165 (33%)

Query: 151 NYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNE 210
           +Y+  Y+ DL  G K++ V     + M++IYEHGP+   F+                   
Sbjct: 6   SYESTYRFDLKKGKKAHMVP--RCNAMRQIYEHGPLVAIFS------------------- 44

Query: 211 TTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDE 263
                                    V+ D + YKSG        ++G HA+R+LGWG + 
Sbjct: 45  -------------------------VYADFLQYKSGVYQHNFGDSIGLHAVRVLGWGVE- 78

Query: 264 KSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
            +   YWL+ANSWN  WGD+G FKILRG++E  IE     G P+ 
Sbjct: 79  -NDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNVGYPQF 122


>gi|395730851|ref|XP_003775799.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Pongo
           abelii]
          Length = 362

 Score = 80.5 bits (197), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 75/283 (26%), Positives = 104/283 (36%), Gaps = 94/283 (33%)

Query: 82  EDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCE---HHVNGTRP------ 132
           E LP  F++  KWPN   I E  DQG+C   W      +A      H +    P      
Sbjct: 96  EVLPTAFEASEKWPN--LIHEPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQN 153

Query: 133 --SCDASK-------------------------------------GHTPKCVREC----- 148
             SCD  +                                     G TP C+        
Sbjct: 154 LLSCDTHQQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGRERDEAGPTPPCMMHSRAMGR 213

Query: 149 ---QENYDVPY----KKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKS 201
              Q     P       D+      Y + SN+K IMKE+ E+GPV+    V +D  LYK 
Sbjct: 214 GKRQATASCPNSHVNNNDIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKG 273

Query: 202 GRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE 261
           G +       T +SL +                         +  +  G H+++I GWGE
Sbjct: 274 GIY-----SHTPVSLGR------------------------PERYRRHGTHSVKITGWGE 304

Query: 262 D---EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSI 301
           +   +    KYW  ANSW   WG+ G F+I+RG +EC IES +
Sbjct: 305 ETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIESFV 347


>gi|345488309|ref|XP_001605531.2| PREDICTED: uncharacterized peptidase C1-like protein F26E4.3-like
           [Nasonia vitripennis]
          Length = 481

 Score = 80.5 bits (197), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 68/248 (27%), Positives = 102/248 (41%), Gaps = 40/248 (16%)

Query: 83  DLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGT-RPSCDASKGHT 141
           DLP  FDSR +W N   I  ++DQG CG+ W     ++A     +          S  H 
Sbjct: 233 DLPREFDSRIQWGN--DITPVQDQGWCGASWAISTVDVASDRFAIMSKGIEKVQLSGQHL 290

Query: 142 PKCVRECQENYDVPYKKDLNFGAKSYSVSSNE-------KSIMKEIYEHGPVEGA----- 189
             C    Q      Y        + + V   +       +S    I   G +  A     
Sbjct: 291 ISCNNRGQRGCKGGYLDRAWLFMRKFGVVDEDCYPWLSGRSDKCRIPRRGKLSDAGCQRR 350

Query: 190 --FTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG- 246
             + + +++  YK G  +  GNET  M  I        +    +    V  D   Y+SG 
Sbjct: 351 NSYNLRNEM--YKVGPAYRLGNETDIMQEI-------LTSGPVQATMRVHRDFFHYESGI 401

Query: 247 ---------KALGGHAIRILGWGEDEKSKE----KYWLIANSWNTDWGDNGLFKILRGKD 293
                    +  G H++RI+GWGE+         K+W +ANSW  DWG++G F+I+RG +
Sbjct: 402 YVHSRPFDTRQSGYHSVRIVGWGEEPSPYNGKPIKFWRVANSWGRDWGEDGYFRIVRGNN 461

Query: 294 ECGIESSI 301
           EC IES +
Sbjct: 462 ECEIESFV 469


>gi|182509202|ref|NP_001116812.1| tubulointerstitial nephritis antigen precursor [Bombyx mori]
 gi|81303350|gb|ABB71105.1| TIN-ag-RP [Bombyx mori]
          Length = 404

 Score = 80.5 bits (197), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 72/250 (28%), Positives = 108/250 (43%), Gaps = 37/250 (14%)

Query: 74  LIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPS 133
           +I YS+ D   P  FD+R +W     I  I DQ  CGS W      I        G R S
Sbjct: 176 VISYSK-DGQYPDEFDARREWYG--YISPIADQDWCGSDWAVSIASIV-------GDRFS 225

Query: 134 CDASKGHTPKCVRECQENYDVPYKKDLNFGAK--SYSVSSNEKSIMKEIYEHGPVEGAFT 191
             +      +   +   +  +  ++  N G    ++        + ++ +   P EGA T
Sbjct: 226 IQSFGTENVRMSSQTLLSCHLKGQRGCNGGNLDIAFDFVKTHGLVSEQCF---PYEGAVT 282

Query: 192 ---VFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG-- 246
              + +D   Y+ G  F    E   M        D  +   A G  TV+ D   Y+ G  
Sbjct: 283 QCRIGNDCRRYRVGVPFSISKEEDIM-------YDIMTSGPALGIMTVYQDFFHYREGIY 335

Query: 247 --------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
                      G H++RI+GWGED  +++KYW++ANSW T WG+ G F+I RG    GIE
Sbjct: 336 RHTRHGDQLMRGLHSVRIVGWGED--AEDKYWIVANSWGTSWGEKGYFRIARGHSGTGIE 393

Query: 299 SSITAGVPKL 308
           SS+   +P +
Sbjct: 394 SSVLTVLPYV 403


>gi|2330009|gb|AAB66719.1| cysteine protease [Giardia muris]
          Length = 301

 Score = 80.5 bits (197), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 70/239 (29%), Positives = 106/239 (44%), Gaps = 44/239 (18%)

Query: 82  EDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG-----------CRPYEIAPCEHHVNGT 130
           ++LP ++D R +  +C  + E+ DQ SCGSCW            C     +   H+    
Sbjct: 75  KELPKDYDPRVERAHC--LPEVADQASCGSCWAFSAVATFADRRCAYGLDSKQVHYSEQY 132

Query: 131 RPSCD----ASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPV 186
             SCD    A  G     V +      VP    L + +    ++ + +S +    +  PV
Sbjct: 133 VVSCDFGDGACNGGWLSNVWKFLTKTGVPKLDCLKYFS---GMTGDRESCITHCTDGSPV 189

Query: 187 EGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG 246
           E          LY++      G +   M  ++  + D   Q+    AF V+ D   Y SG
Sbjct: 190 E----------LYQASHVINYGMDLDRM--MEALVYDGPLQV----AFVVYSDFGYYSSG 233

Query: 247 -------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
                     GGHA+ ++G+G DE S  KYW+I NSW  DWG+ G F+I+R  +ECGIE
Sbjct: 234 VYQHVNGMMEGGHAVEMVGYGIDE-SGLKYWIIRNSWGPDWGEGGYFRIIRRVNECGIE 291


>gi|145481831|ref|XP_001426938.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124394016|emb|CAK59540.1| unnamed protein product [Paramecium tetraurelia]
          Length = 332

 Score = 80.5 bits (197), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 58/204 (28%), Positives = 82/204 (40%), Gaps = 59/204 (28%)

Query: 115 CRPYEIAPCEH-HVNGTRPSCDAS----KGHTPKCVRECQENYDVPYKKD-LNFGAKSYS 168
           C+PY   PC H + +G    C+         TP C ++C   +   Y  D +      Y 
Sbjct: 174 CKPYSFPPCSHGNDSGKYSKCENDFFMLTEVTPSCTKKCHPQFSRTYDVDKIRSRENPYK 233

Query: 169 VSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQL 228
           +  +++ I  EIY +GPV+  F                                      
Sbjct: 234 LIKDQEQIKNEIYLNGPVQAVF-------------------------------------- 255

Query: 229 GAEGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWG 281
                 TVFDD + YKSG       +  G HA++I+GWG +  +   YW   NSWN  WG
Sbjct: 256 ------TVFDDFLNYKSGVYQQTTGQRRGKHAVKIIGWGTE--NGVPYWEAINSWNDGWG 307

Query: 282 DNGLFKILRGKDECGIESSITAGV 305
            NG FKILRG +   IE  + A +
Sbjct: 308 INGKFKILRGFNHLDIEGEVYASI 331



 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 22/43 (51%), Positives = 28/43 (65%)

Query: 72  PELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
           P    Y E  E+LP +F ++ KWP CP+I  I DQG+CGSCW 
Sbjct: 59  PVEYKYHEKLENLPPSFSAQEKWPGCPSIELIPDQGNCGSCWA 101


>gi|344287520|ref|XP_003415501.1| PREDICTED: tubulointerstitial nephritis antigen isoform 2
           [Loxodonta africana]
          Length = 437

 Score = 80.5 bits (197), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 77/296 (26%), Positives = 111/296 (37%), Gaps = 97/296 (32%)

Query: 69  NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG-------------- 114
           N +  ++G  EV   LP  F++  KWPN   I E  DQG C   W               
Sbjct: 161 NEIHTVLGPGEV---LPMAFEASKKWPN--LIHEPLDQGDCAGSWAFSTAAVASDRVSIH 215

Query: 115 --------CRPYEIAPCE-HHVNGTR------------------PSCDASKGH------- 140
                     P  +  C+ H+  G R                    C    GH       
Sbjct: 216 SLGHMTPILSPQNLLSCDTHNQQGCRGGRLDGAWWFLRRRGVVSDHCYPFSGHERDKAGP 275

Query: 141 TPKCVREC--------QENYDVP----YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEG 188
            P C+           Q     P    +  D+     +Y + +NEK IMKE+ E+GPV+ 
Sbjct: 276 VPPCMMHSRAMGRGKRQATSRCPNSHVHGNDIYQVTPAYRLGTNEKEIMKELMENGPVQA 335

Query: 189 AFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKA 248
              V +D  LY+ G +       T +S      ++   Q                   + 
Sbjct: 336 LMEVHEDFFLYQGGIY-----SHTPVS------QERPEQY------------------RR 366

Query: 249 LGGHAIRILGWGED---EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSI 301
            G H+++I GWGE+   +    KYW  ANSW   WG+ G F+I+RG +EC IES +
Sbjct: 367 HGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFV 422


>gi|403268748|ref|XP_003926429.1| PREDICTED: tubulointerstitial nephritis antigen [Saimiri
           boliviensis boliviensis]
          Length = 476

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 51/145 (35%), Positives = 68/145 (46%), Gaps = 32/145 (22%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           Y VSS+E  IMKEI ++GPV+    V +D   YK+G +                 R  TS
Sbjct: 355 YRVSSSETEIMKEIMQNGPVQAIMKVHEDFFHYKTGIY-----------------RHVTS 397

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWGDN 283
                  F              L  HA+++ GWG     +  KEK+W+ ANSW   WG+N
Sbjct: 398 TNKESEKFL------------KLQTHAVKLTGWGTLRGAQGRKEKFWIAANSWGKSWGEN 445

Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
           G F+ILRG +E  IE  I A   +L
Sbjct: 446 GYFRILRGVNESDIEKLIIAAWGQL 470


>gi|410909768|ref|XP_003968362.1| PREDICTED: dipeptidyl peptidase 1-like [Takifugu rubripes]
          Length = 455

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 70/291 (24%), Positives = 110/291 (37%), Gaps = 86/291 (29%)

Query: 68  ANRLPELIGYSEVDEDLP---ANFDSRTKWPNCP---TIREIRDQGSCGSCWG------- 114
           A+R+P  +  + VD +L    A       W N      +  +R+QGSCGSC+        
Sbjct: 201 ASRIPIRVHPTNVDPELAKKAAALPELWDWRNVEGVNFVSPVRNQGSCGSCYCFATMGML 260

Query: 115 ---------------CRPYEIAPCEHHVNG------------------TRPSCDASKGHT 141
                            P ++  C  +  G                     SC    G  
Sbjct: 261 EARLRILTNNSQSPVLSPQQVVSCSEYSQGCDGGFPYLTGKYVQDFGIVDESCFPYMGKD 320

Query: 142 PKC--VRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILY 199
             C   + C+  Y   YK    F         +E ++M E+ ++GP+  A  V+ D + Y
Sbjct: 321 SPCGISQSCRRGYAAEYKYVGGFYG-----GCSEAAMMVELVKNGPMAVALEVYSDFMSY 375

Query: 200 KSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGW 259
           K G +   G            + D+ +                      L  HA+ ++G+
Sbjct: 376 KGGIYHHTG------------LTDHVNPF-------------------ELTNHAVLLVGY 404

Query: 260 GEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAG--VPKL 308
           G    + +KYW++ NSW + WG++G F+I RG DEC IES   A   +PKL
Sbjct: 405 GRCHMTGQKYWIVKNSWGSSWGEDGYFRIRRGSDECAIESIAVAASPIPKL 455


>gi|390348202|ref|XP_001201161.2| PREDICTED: dipeptidyl peptidase 1-like [Strongylocentrotus
           purpuratus]
          Length = 458

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 66/273 (24%), Positives = 98/273 (35%), Gaps = 78/273 (28%)

Query: 78  SEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCR--------------------- 116
           S+    LP +FD R        +  +R+Q  CGSC+                        
Sbjct: 217 SKAAFSLPESFDWR-DLNGQNFVSPVRNQAQCGSCFSFAALAMLEARLRIATNNTVQKVF 275

Query: 117 -PYEIAPCEHHVNG-------------------TRPSCDASKGHTPKCVRE---CQENYD 153
            P ++  C  +  G                      SC   +G    C +E   C+  Y 
Sbjct: 276 APQDVVDCSEYAQGCEGGFPYLIAGKYAEDFGVVEESCYPYQGVDSACSKEQPGCRRYYA 335

Query: 154 VPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTA 213
             Y+    F       + NE+ +   +  +GP+   F V+ D + YK G +         
Sbjct: 336 TNYQYIGGFYG-----ACNEELMRLALVNNGPIAVGFQVYGDFMSYKGGVY--------- 381

Query: 214 MSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIA 273
                          G + +   FD          L  HA+ ++G+G DE S   +W + 
Sbjct: 382 ------------HHTGVKNSMLKFDPF-------ELTNHAVLVVGYGVDEASGMSFWTVK 422

Query: 274 NSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           NSW T WG+ G F+ILRG DECGIES      P
Sbjct: 423 NSWGTGWGEGGYFRILRGTDECGIESMAMQSFP 455


>gi|294891865|ref|XP_002773777.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239878981|gb|EER05593.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 156

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 49/171 (28%), Positives = 82/171 (47%), Gaps = 42/171 (24%)

Query: 132 PSCDASKGHTPKCVREC-QENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAF 190
           P C +     P C  EC  E+Y    ++DL+       + ++ + I +EI+++G V G  
Sbjct: 21  PKCPSEALSQPACQTECINESYKTSLQQDLHRAKSWGRLPTSPQKIKQEIFDNGTVLGVI 80

Query: 191 TVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALG 250
           ++++D  LYKSG +                                     ++ +G  +G
Sbjct: 81  SMYEDFRLYKSGVY-------------------------------------VHTTGGLVG 103

Query: 251 GHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSI 301
            H+++I+GWG +  S + YWL  NSWN +WGD+G+ K+  G  E GIE+SI
Sbjct: 104 VHSLKIIGWGVE--SGQDYWLAVNSWNEEWGDHGMIKLAVG--ETGIENSI 150


>gi|300176830|emb|CBK25399.2| unnamed protein product [Blastocystis hominis]
          Length = 563

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 82/296 (27%), Positives = 125/296 (42%), Gaps = 93/296 (31%)

Query: 67  PANRLPELIGYSEV-----DEDLPANFDSR-TKWPNCPTIREIRDQGS-CGSCWG----- 114
           P  ++PEL+  ++      DE LP ++D R     N  T+ + +     CGSCW      
Sbjct: 21  PNAKVPELVKTAQPYTFLGDEVLPKSYDPRDIDGRNYVTVTKNQHIPQYCGSCWSFASVS 80

Query: 115 ------------------CRPYEIAPCEHHVNGTR---PSCDASKGH---TPK--CVREC 148
                               P  I  C+H+ NG +   P       H    P+  C+R  
Sbjct: 81  SVSDRLKLMTKGKWPVHDLSPQVILNCDHNSNGCQGGHPLTAFKYMHDHGVPEEGCMRYM 140

Query: 149 QENY---DVPYKKDLN-----FGAKSYS--------VSSNEKSIMKEIYEHGPVEGAFTV 192
            +N    D+   +D +     F  K+Y+          + EK++MKEIY  GP+  +  V
Sbjct: 141 AKNMECTDINICRDCDSEKGCFAVKNYTKYYVDEYGSVAGEKNMMKEIYARGPITCSIAV 200

Query: 193 FDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGH 252
            DDL+ YK G +                 RD T      GA T+               H
Sbjct: 201 PDDLMEYKGGIY-----------------RDTT------GAKTL--------------DH 223

Query: 253 AIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           AI ++GWGE++   +KYW+  NSW T WG+ G F+I+RG++  GIE+     VP++
Sbjct: 224 AISVVGWGEEDG--QKYWIARNSWGTFWGEKGWFRIVRGENNLGIEADCQWAVPRV 277



 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 56/232 (24%), Positives = 88/232 (37%), Gaps = 49/232 (21%)

Query: 87  NFDSRTKWPNCP-TIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRP--SCDASKGHTPK 143
           N   + KWP    + +E+ +  + G+C G    ++   E+  N   P  +C   +    +
Sbjct: 371 NLMRKGKWPTVELSAQEVINCSNAGTCDGGSDADVF--EYAFNEGIPDQTCQVYEAIDKE 428

Query: 144 C-----VRECQENYDV-PYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLI 197
           C       +C    D  P K    +    Y     E  I  EI+  GPV  +  V ++ +
Sbjct: 429 CNDMARCMDCPPGEDCYPVKDYKRYKVSEYGEVKGEMEIKAEIFARGPVSCSMIVTEEFL 488

Query: 198 LYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRIL 257
            Y+ G F                                 DD      G  +G HA+ + 
Sbjct: 489 AYQGGIFV--------------------------------DD-----RGHIVGYHAVEVA 511

Query: 258 GWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKLD 309
           GWGE E    KYW+  NSW   WG++G F+++ G  +  I      GVP +D
Sbjct: 512 GWGETEDGT-KYWIARNSWGPYWGEHGWFRMIVGVSKGLITGYCNWGVPVID 562


>gi|410972493|ref|XP_003992693.1| PREDICTED: dipeptidyl peptidase 1 isoform 1 [Felis catus]
          Length = 463

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 66/264 (25%), Positives = 101/264 (38%), Gaps = 76/264 (28%)

Query: 84  LPANFDSRTKWPNCPTIREIRDQGSCGSCWG----------------------CRPYEIA 121
           LPA++D R        +  +R+Q SCGSC+                         P E+ 
Sbjct: 231 LPASWDWRNVH-GTNFVTPVRNQASCGSCYSFASMGMLEARIRILTNNTQTPILSPQEVV 289

Query: 122 PCEHHVNG-------------------TRPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
            C  +  G                      +C    G    C  + +E+    Y  + ++
Sbjct: 290 SCSQYAQGCDGGFPYLIAGKYAQDFGLVEEACFPYTGTDSPC--KPKEDCVRYYSSEYHY 347

Query: 163 GAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIR 222
               Y    NE  +  E+  HGP+  AF V++D + Y+ G ++  G            +R
Sbjct: 348 VGGFYG-GCNEALMKLELVHHGPMAVAFEVYNDFLHYRKGIYYHTG------------LR 394

Query: 223 DNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
           D          F  F+          L  HA+ ++G+G D  S   YW++ NSW   WG+
Sbjct: 395 D---------PFNPFE----------LTNHAVLLVGYGTDPVSGMDYWIVKNSWGIGWGE 435

Query: 283 NGLFKILRGKDECGIESSITAGVP 306
           +G F+I RG DEC IES   A  P
Sbjct: 436 DGYFRIRRGTDECAIESIAVAATP 459


>gi|332210168|ref|XP_003254178.1| PREDICTED: tubulointerstitial nephritis antigen [Nomascus
           leucogenys]
          Length = 476

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 50/145 (34%), Positives = 69/145 (47%), Gaps = 32/145 (22%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           Y VSS+E  IMKEI ++GPV+    V +D   YK+G +                 R  TS
Sbjct: 355 YRVSSSETEIMKEIMQNGPVQAIMQVREDFFHYKTGIY-----------------RHVTS 397

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWGDN 283
                  +            + L  HA+++ GWG     +  KEK+W+ ANSW   WG+N
Sbjct: 398 ANKESEKY------------RKLQTHAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGEN 445

Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
           G F+ILRG +E  IE  I A   +L
Sbjct: 446 GYFRILRGVNESDIEKLIIAAWGQL 470


>gi|260826514|ref|XP_002608210.1| hypothetical protein BRAFLDRAFT_125840 [Branchiostoma floridae]
 gi|229293561|gb|EEN64220.1| hypothetical protein BRAFLDRAFT_125840 [Branchiostoma floridae]
          Length = 470

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 67/267 (25%), Positives = 94/267 (35%), Gaps = 80/267 (29%)

Query: 83  DLPANFDSRTKWPNCPTIREIRDQGSCGSCWG----------------------CRPYEI 120
            LP +FD R K      +  IRDQG CGSC+                         P EI
Sbjct: 236 QLPESFDWR-KVMGLNFVSPIRDQGQCGSCYAFASMGMLEARLRVLTNNTQQFVLSPQEI 294

Query: 121 APCEHHVNG-------------------TRPSCDASKGHTPKC--VRECQENYDVPYKKD 159
             C  +  G                       C   +G    C     C   Y   Y+  
Sbjct: 295 VSCGKYSQGCEGGFPYLIAGKYAEDFGVVLEECYPYEGKDSSCKDTSRCGRGYATNYRYV 354

Query: 160 LNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKW 219
             F         NE+ +  E+ ++GP+  AF V+ D + YK G +               
Sbjct: 355 GGFYG-----GCNEELMQLELVKNGPMAVAFEVYSDFMHYKGGVY--------------- 394

Query: 220 TIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
                    G    F  F+          +  HA+ ++G+G D ++  K+W + NSW   
Sbjct: 395 ------EHTGLSDPFNPFE----------ITNHAVLLVGYGRDPETGAKFWTVKNSWGEK 438

Query: 280 WGDNGLFKILRGKDECGIESSITAGVP 306
           WG+ G F+I RG DEC IES   A  P
Sbjct: 439 WGEEGFFRIRRGTDECAIESIAVAADP 465


>gi|323448265|gb|EGB04166.1| hypothetical protein AURANDRAFT_32974 [Aureococcus anophagefferens]
          Length = 298

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 47/139 (33%), Positives = 65/139 (46%), Gaps = 39/139 (28%)

Query: 164 AKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRD 223
           A     +S E +IM  I E GPVE AFTV++D   Y  G +                   
Sbjct: 184 AGDVQTASGEAAIMAMIAEGGPVETAFTVYEDFENYAGGIYH------------------ 225

Query: 224 NTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDN 283
                              + +G+  GGHA++ +GWG +  +  KYW +ANSWN  WG+ 
Sbjct: 226 -------------------HVTGEEAGGHAVKFVGWGVENGT--KYWKVANSWNPYWGEA 264

Query: 284 GLFKILRGKDECGIESSIT 302
           G F+ILRG +E GIE  +T
Sbjct: 265 GYFRILRGSNEGGIEDQVT 283



 Score = 39.3 bits (90), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 19/50 (38%), Positives = 25/50 (50%), Gaps = 1/50 (2%)

Query: 73  ELIGYSEVDEDLPANFDSRTKWPNCPT-IREIRDQGSCGSCWGCRPYEIA 121
           +++ Y       P  FDS  +WP C   I +IRDQ +CG CW     E A
Sbjct: 13  DVVDYVPRGGAAPEAFDSAARWPECAKLIGDIRDQSNCGCCWAFAGAEAA 62


>gi|1584943|prf||2123443A cathepsin C
          Length = 482

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 75/282 (26%), Positives = 104/282 (36%), Gaps = 92/282 (32%)

Query: 67  PANRLPELIGYSEVDEDLPANFDSRTKWPNCPT-----IREIRDQGSCGSCWG------- 114
           P+  L  L G      +LP  FD    W + P      +  IR+QG CGSC+        
Sbjct: 207 PSKELISLTG------NLPLEFD----WTSPPDGSRSPVTPIRNQGICGSCYASPSAAAL 256

Query: 115 ---------------CRPYEIAPCEHH---VNGTRPSCDASK-----------------G 139
                            P  +  C  +    NG  P   A K                  
Sbjct: 257 EARIRLVSNFSEQPILSPQTVVDCSPYSEGCNGGFPFLIAGKYGEDFGLPQKIVIPYTGE 316

Query: 140 HTPKCV--RECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLI 197
            T KC   + C   Y   Y          Y  ++NEK +  E+  +GP    F V++D  
Sbjct: 317 DTGKCTVSKNCTRYYTTDYSY-----IGGYYGATNEKLMQLELISNGPFPVGFEVYEDFQ 371

Query: 198 LYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRIL 257
            YK G                  I  +T+       F  F+          L  HA+ ++
Sbjct: 372 FYKEG------------------IYHHTTVQTDHYNFNPFE----------LTNHAVLLV 403

Query: 258 GWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIES 299
           G+G D+ S E YW + NSW  +WG+ G F+ILRG DECG+ES
Sbjct: 404 GYGVDKLSGEPYWKVKNSWGVEWGEQGYFRILRGTDECGVES 445


>gi|426250116|ref|XP_004018784.1| PREDICTED: tubulointerstitial nephritis antigen [Ovis aries]
          Length = 476

 Score = 79.7 bits (195), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 49/147 (33%), Positives = 67/147 (45%), Gaps = 36/147 (24%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSG--RFFVPGNETTAMSLIKWTIRDN 224
           Y VSSNE  IM+EI ++GPV+    V +D   YK+G  R     NE +            
Sbjct: 355 YRVSSNETEIMREIMQNGPVQAIMQVHEDFFNYKTGIYRHITSTNEDS------------ 402

Query: 225 TSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWG 281
                              +  +    HA+++ GWG        KEK+W+ ANSW   WG
Sbjct: 403 -------------------EKYRKFRTHAVKLTGWGTLRGAHGQKEKFWIAANSWGKSWG 443

Query: 282 DNGLFKILRGKDECGIESSITAGVPKL 308
           +NG F+ILRG +E  IE  I A   +L
Sbjct: 444 ENGYFRILRGVNESDIEKLIIAAWGQL 470


>gi|73696355|gb|AAZ80953.1| cathepsin C [Macaca mulatta]
          Length = 118

 Score = 79.7 bits (195), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 46/135 (34%), Positives = 65/135 (48%), Gaps = 31/135 (22%)

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           NE  +  E+  HGP+  AF V+DD + Y++G +   G            +RD        
Sbjct: 11  NEALMKLELVYHGPLAVAFEVYDDFLHYQNGIYHHTG------------LRD-------- 50

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
             F  F+          L  HA+ ++G+G D  S   YW++ NSW T WG++G F+I RG
Sbjct: 51  -PFNPFE----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTSWGEDGYFRIRRG 99

Query: 292 KDECGIESSITAGVP 306
            DEC IES   A  P
Sbjct: 100 TDECAIESIAVAATP 114


>gi|290988628|ref|XP_002677000.1| predicted protein [Naegleria gruberi]
 gi|284090605|gb|EFC44256.1| predicted protein [Naegleria gruberi]
          Length = 158

 Score = 79.7 bits (195), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 50/171 (29%), Positives = 74/171 (43%), Gaps = 42/171 (24%)

Query: 139 GHTPKC-VRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLI 197
           G  P C ++ C     V  +K   +  KS         +M ++  +GP++    V+ D  
Sbjct: 29  GAVPACNIKSCA----VSGEKSPFYKVKSARKLKGMVDMMADLKANGPLQATMIVYKDFF 84

Query: 198 LYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRIL 257
            YKSG +                                      + SG+ +G HAI+I+
Sbjct: 85  SYKSGVYH-------------------------------------HVSGRMVGAHAIKIV 107

Query: 258 GWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           GWG D  SK  YW+ ANSW  DWG +G F I RG+ ECG+  ++ +G P L
Sbjct: 108 GWGVDSASKLPYWICANSWGEDWGLDGYFWIARGRGECGLGKTVWSGKPAL 158


>gi|157058747|gb|ABV03131.1| cathepsin B-2744 [Myzus persicae]
          Length = 261

 Score = 79.7 bits (195), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 52/174 (29%), Positives = 83/174 (47%), Gaps = 41/174 (23%)

Query: 114 GCRPYEIAPCEHHVNGTRPSCDA-SKGHTPKCVREC-QENYDVPYKKDLNFGAKSYSVS- 170
           GC+PY+  PC+H+ + +  +C +  +     C  +C  +NY V Y+ DL   +  Y  S 
Sbjct: 126 GCQPYKNRPCDHYGDSSLTNCSSLRRTQMMFCRDKCVNKNYKVKYEDDLYKTSVVYMTSW 185

Query: 171 SNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA 230
           +N K I +EI  +GPV     V+++ + YK G +     ++TA                 
Sbjct: 186 TNVKQIQQEIMTYGPVTAFMYVYENFMGYKEGVY-----KSTA----------------- 223

Query: 231 EGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNG 284
                          G+ +G H ++++GWG DE   E YWL  NSWN++WG NG
Sbjct: 224 ---------------GELIGYHHVKLIGWGVDEAGIE-YWLAMNSWNSNWGTNG 261


>gi|345794363|ref|XP_535330.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Canis lupus
           familiaris]
          Length = 467

 Score = 79.7 bits (195), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 47/149 (31%), Positives = 72/149 (48%), Gaps = 32/149 (21%)

Query: 156 YKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMS 215
           +  D+     +Y + +NEK IMKE+ E+GPV+    V +D  LY+ G +       T +S
Sbjct: 333 HANDIYQVTPAYRLGTNEKEIMKELMENGPVQALMEVHEDFFLYQGGIY-----SHTPVS 387

Query: 216 LIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGED---EKSKEKYWLI 272
           L +                         +  +  G H+++I GWGE+   +    KYW  
Sbjct: 388 LGR------------------------PERYRRHGTHSVKITGWGEETLPDGRTLKYWTA 423

Query: 273 ANSWNTDWGDNGLFKILRGKDECGIESSI 301
           ANSW   WG+ G F+I+RG +EC IES +
Sbjct: 424 ANSWGPAWGERGHFRIVRGANECDIESFV 452


>gi|403364285|gb|EJY81901.1| Cathepsin H [Oxytricha trifallax]
          Length = 363

 Score = 79.7 bits (195), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 69/266 (25%), Positives = 105/266 (39%), Gaps = 84/266 (31%)

Query: 73  ELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRP 132
           E +  S + +DLPAN+D    W     +  ++DQGSCGSCW      +   E H      
Sbjct: 124 EAVDLSHIVKDLPANWD----WREHNGVTPVKDQGSCGSCWTFST--VGTLEAHF---LI 174

Query: 133 SCDASKGHTPKCVRECQENYD---------------------------VPY--------- 156
               S+  + + + +C   YD                            PY         
Sbjct: 175 KYQQSRNLSEQQLVDCAGAYDNYGCNGGLPSHAFQYISDNGGIATEAAYPYFAKDRPCTI 234

Query: 157 ---KKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTA 213
              +K +     S +++ +E  +   I++HGPV  A+ V DD + Y SG +         
Sbjct: 235 QQSQKSVGVVGGSVNLTKSEDELAIAIFQHGPVSIAYEVIDDFMDYHSGVY--------- 285

Query: 214 MSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIA 273
                 T +D                    K+G     HA+  +G+G +  +   YWL+ 
Sbjct: 286 ------TTKD-------------------CKNGPDDVNHAVVAVGFGTE--NGVDYWLVK 318

Query: 274 NSWNTDWGDNGLFKILRGKDECGIES 299
           NSW+T WGDNG FKI RG + CGI +
Sbjct: 319 NSWSTKWGDNGYFKIQRGVNMCGINN 344


>gi|2499875|sp|Q26563.1|CATC_SCHMA RecName: Full=Cathepsin C; Flags: Precursor
 gi|1262412|emb|CAA83543.1| cathepsin C [Schistosoma mansoni]
          Length = 454

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 75/282 (26%), Positives = 104/282 (36%), Gaps = 92/282 (32%)

Query: 67  PANRLPELIGYSEVDEDLPANFDSRTKWPNCP-----TIREIRDQGSCGSCWG------- 114
           P+  L  L G      +LP  FD    W + P      +  IR+QG CGSC+        
Sbjct: 207 PSKELISLTG------NLPLEFD----WTSPPDGSRSPVTPIRNQGICGSCYASPSAAAL 256

Query: 115 ---------------CRPYEIAPCEHH---VNGTRPSCDASK-----------------G 139
                            P  +  C  +    NG  P   A K                  
Sbjct: 257 EARIRLVSNFSEQPILSPQTVVDCSPYSEGCNGGFPFLIAGKYGEDFGLPQKIVIPYTGE 316

Query: 140 HTPKCV--RECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLI 197
            T KC   + C   Y   Y          Y  ++NEK +  E+  +GP    F V++D  
Sbjct: 317 DTGKCTVSKNCTRYYTTDYSY-----IGGYYGATNEKLMQLELISNGPFPVGFEVYEDFQ 371

Query: 198 LYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRIL 257
            YK G                  I  +T+       F  F+          L  HA+ ++
Sbjct: 372 FYKEG------------------IYHHTTVQTDHYNFNPFE----------LTNHAVLLV 403

Query: 258 GWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIES 299
           G+G D+ S E YW + NSW  +WG+ G F+ILRG DECG+ES
Sbjct: 404 GYGVDKLSGEPYWKVKNSWGVEWGEQGYFRILRGTDECGVES 445


>gi|402867308|ref|XP_003897801.1| PREDICTED: tubulointerstitial nephritis antigen [Papio anubis]
          Length = 475

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 49/141 (34%), Positives = 67/141 (47%), Gaps = 32/141 (22%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           Y VSS+E  IMKEI ++GPV+    V +D   YK+G +                 R  TS
Sbjct: 354 YRVSSSETEIMKEIMQNGPVQAIMQVREDFFHYKTGIY-----------------RHVTS 396

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWGDN 283
                  +            + L  HA+++ GWG     +  KEK+W+ ANSW   WG+N
Sbjct: 397 TNKESEKY------------RKLQTHAVKLTGWGTLRGAQGRKEKFWIAANSWGKSWGEN 444

Query: 284 GLFKILRGKDECGIESSITAG 304
           G F+ILRG +E  IE  I A 
Sbjct: 445 GYFRILRGVNESDIEKLIIAA 465


>gi|328872536|gb|EGG20903.1| hypothetical protein DFA_00770 [Dictyostelium fasciculatum]
          Length = 313

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 72/272 (26%), Positives = 105/272 (38%), Gaps = 87/272 (31%)

Query: 82  EDLPANFDSRTKWPNCPTIREIRDQGS-CGSCWG----------------------CRPY 118
            +LPA+FDSR KW +C     +RDQG  C SCW                         P 
Sbjct: 31  SNLPASFDSRQKWSDC--FSPVRDQGQKCSSCWAMTATGVLADRLCVASGGKVKKVLSPQ 88

Query: 119 EIAPCEHHVN----GTR---------------PSCDASKG-HTPKCVRECQENYDVPYKK 158
           E+  C+ + N    G R                 C++ K      C   C +     +  
Sbjct: 89  ELIDCDRNGNLGCGGGRLDTPLAYFRDNGVVTEKCESYKATQASSCSNTCDDG--TSFSN 146

Query: 159 DLNFGAKS-YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLI 217
              + +K  Y +SS E++   +IY +GP+   F ++ D+  YKSG +    + T      
Sbjct: 147 TTKYHSKDCYRLSSIEQA-KADIYLNGPIIAVFDLYTDIYNYKSGVYIKSDSAT------ 199

Query: 218 KWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWN 277
                                    YK       HA R++GWG ++  +  YWL ANSW 
Sbjct: 200 -------------------------YKET-----HAGRVIGWGVEDGVQ--YWLAANSWG 227

Query: 278 TDWGDNGLFKILRGKDECGIESSITAGVPKLD 309
           T WG  GLFKI  G +E G E++  +     D
Sbjct: 228 TGWGQQGLFKIRSGTNEVGFEANFFSTTADFD 259


>gi|355748654|gb|EHH53137.1| hypothetical protein EGM_13709 [Macaca fascicularis]
          Length = 475

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 49/141 (34%), Positives = 67/141 (47%), Gaps = 32/141 (22%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           Y VSS+E  IMKEI ++GPV+    V +D   YK+G +                 R  TS
Sbjct: 354 YRVSSSETEIMKEIMQNGPVQAIMQVREDFFHYKTGIY-----------------RHVTS 396

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWGDN 283
                  +            + L  HA+++ GWG     +  KEK+W+ ANSW   WG+N
Sbjct: 397 TNKESEKY------------RKLQTHAVKLTGWGTLRGAQGRKEKFWIAANSWGKSWGEN 444

Query: 284 GLFKILRGKDECGIESSITAG 304
           G F+ILRG +E  IE  I A 
Sbjct: 445 GYFRILRGVNESDIEKLIIAA 465


>gi|344264196|ref|XP_003404179.1| PREDICTED: tubulointerstitial nephritis antigen [Loxodonta
           africana]
          Length = 476

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 48/141 (34%), Positives = 69/141 (48%), Gaps = 32/141 (22%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           Y VSSNE  IMKEI ++GPV+    V +D   YK+G             + +  IR +  
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTG-------------IYRHVIRTSEE 401

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSK---EKYWLIANSWNTDWGDN 283
                            +  + L  HA+++ GWG  + +K   EK+W+ ANSW   WG++
Sbjct: 402 S----------------EKYQKLRTHAVKLTGWGMMKGAKGRKEKFWVAANSWGKSWGED 445

Query: 284 GLFKILRGKDECGIESSITAG 304
           G F+ILRG +E  IE  I A 
Sbjct: 446 GYFRILRGVNESDIEKLIIAA 466


>gi|355561807|gb|EHH18439.1| hypothetical protein EGK_15031 [Macaca mulatta]
          Length = 475

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 49/141 (34%), Positives = 67/141 (47%), Gaps = 32/141 (22%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           Y VSS+E  IMKEI ++GPV+    V +D   YK+G +                 R  TS
Sbjct: 354 YRVSSSETEIMKEIMQNGPVQAIMQVREDFFHYKTGIY-----------------RHVTS 396

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWGDN 283
                  +            + L  HA+++ GWG     +  KEK+W+ ANSW   WG+N
Sbjct: 397 TNKESEKY------------RKLQTHAVKLTGWGTLRGAQGRKEKFWIAANSWGKSWGEN 444

Query: 284 GLFKILRGKDECGIESSITAG 304
           G F+ILRG +E  IE  I A 
Sbjct: 445 GYFRILRGVNESDIEKLIIAA 465


>gi|255209|gb|AAB23200.1| preprocathepsin C, dipeptidylaminopeptidase I [rats, kidney,
           Peptide, 462 aa]
          Length = 462

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 43/135 (31%), Positives = 63/135 (46%), Gaps = 31/135 (22%)

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           NE  +  E+ +HGP+  AF V DD + Y SG +                        G  
Sbjct: 355 NEALMKLELVKHGPMAVAFEVHDDFLHYHSGIY---------------------HHTGLS 393

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
             F  F+          L  HA+ ++G+G+D  +   YW++ NSW + WG++G F+I RG
Sbjct: 394 DPFNPFE----------LTNHAVLLVGYGKDPVTGLDYWIVKNSWGSQWGESGYFRIRRG 443

Query: 292 KDECGIESSITAGVP 306
            DEC IES   A +P
Sbjct: 444 TDECAIESIAMAAIP 458


>gi|6449324|gb|AAF08932.1|AF195117_1 tubulointerstitial nephritis antigen isoform TIN2 [Homo sapiens]
          Length = 333

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 48/145 (33%), Positives = 68/145 (46%), Gaps = 32/145 (22%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           Y VSSNE  IMKEI ++GPV+    V +D   YK+G                  I  + +
Sbjct: 212 YRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTG------------------IYRHVT 253

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGED---EKSKEKYWLIANSWNTDWGDN 283
               E            +  + L  HA+++ GWG     +  KEK+W+ AN W   WG+N
Sbjct: 254 STNKES-----------EKYRKLQTHAVKLTGWGTRRGAQGQKEKFWIAANFWGKSWGEN 302

Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
           G F+ILRG +E  IE  + A   +L
Sbjct: 303 GYFRILRGVNESDIEKLVIAAWGQL 327


>gi|297291062|ref|XP_002803846.1| PREDICTED: tubulointerstitial nephritis antigen-like [Macaca
           mulatta]
          Length = 463

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 49/141 (34%), Positives = 67/141 (47%), Gaps = 32/141 (22%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           Y VSS+E  IMKEI ++GPV+    V +D   YK+G +                 R  TS
Sbjct: 342 YRVSSSETEIMKEIMQNGPVQAIMQVREDFFHYKTGIY-----------------RHVTS 384

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWGDN 283
                  +            + L  HA+++ GWG     +  KEK+W+ ANSW   WG+N
Sbjct: 385 TNKESEKY------------RKLQTHAVKLTGWGTLRGAQGRKEKFWIAANSWGKSWGEN 432

Query: 284 GLFKILRGKDECGIESSITAG 304
           G F+ILRG +E  IE  I A 
Sbjct: 433 GYFRILRGVNESDIEKLIIAA 453


>gi|24987409|pdb|1JQP|A Chain A, Dipeptidyl Peptidase I (Cathepsin C), A Tetrameric
           Cysteine Protease Of The Papain Family
          Length = 438

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 43/135 (31%), Positives = 63/135 (46%), Gaps = 31/135 (22%)

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           NE  +  E+ +HGP+  AF V DD + Y SG +                        G  
Sbjct: 331 NEALMKLELVKHGPMAVAFEVHDDFLHYHSGIY---------------------HHTGLS 369

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
             F  F+          L  HA+ ++G+G+D  +   YW++ NSW + WG++G F+I RG
Sbjct: 370 DPFNPFE----------LTNHAVLLVGYGKDPVTGLDYWIVKNSWGSQWGESGYFRIRRG 419

Query: 292 KDECGIESSITAGVP 306
            DEC IES   A +P
Sbjct: 420 TDECAIESIAMAAIP 434


>gi|296207307|ref|XP_002750588.1| PREDICTED: tubulointerstitial nephritis antigen-like [Callithrix
           jacchus]
          Length = 467

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 54/185 (29%), Positives = 76/185 (41%), Gaps = 41/185 (22%)

Query: 120 IAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKE 179
           + PC  H   T      +  H P         Y V           +Y + SN+  IMKE
Sbjct: 306 VPPCMMHSRATGRGKRQATAHCPNGHVNNNNIYQV---------TPAYRLGSNDTEIMKE 356

Query: 180 IYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDD 239
           + E+GPV+    V +D  LYK G +                       LG    +     
Sbjct: 357 LMENGPVQALMEVHEDFFLYKGGIY-----------------SHTPVNLGRPERY----- 394

Query: 240 LILYKSGKALGGHAIRILGWGED---EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECG 296
                  +  G H+++I GWGE+   +  K KYW  ANSW   WG+ G F+I+RG +EC 
Sbjct: 395 -------RRHGTHSVKITGWGEETWPDGRKLKYWTAANSWGPAWGERGHFRIVRGVNECD 447

Query: 297 IESSI 301
           IES +
Sbjct: 448 IESFV 452


>gi|6449322|gb|AAF08931.1| tubulointerstitial nephritis antigen isoform TIN-ag [Homo sapiens]
          Length = 476

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 49/145 (33%), Positives = 68/145 (46%), Gaps = 32/145 (22%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           Y VSSNE  IMKEI ++GPV+    V +D   YK+G +                 R  TS
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIY-----------------RHVTS 397

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWGDN 283
                  +            + L  HA+++ GWG     +  KEK+W+ AN W   WG+N
Sbjct: 398 TNKESEKY------------RKLQTHAVKLTGWGTLRGAQGQKEKFWIAANFWGKSWGEN 445

Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
           G F+ILRG +E  IE  + A   +L
Sbjct: 446 GYFRILRGVNESDIEKLVIAAWGQL 470


>gi|8393218|ref|NP_058793.1| dipeptidyl peptidase 1 precursor [Rattus norvegicus]
 gi|114152780|sp|P80067.3|CATC_RAT RecName: Full=Dipeptidyl peptidase 1; AltName: Full=Cathepsin C;
           AltName: Full=Cathepsin J; AltName: Full=Dipeptidyl
           peptidase I; Short=DPP-I; Short=DPPI; AltName:
           Full=Dipeptidyl transferase; Contains: RecName:
           Full=Dipeptidyl peptidase 1 exclusion domain chain;
           AltName: Full=Dipeptidyl peptidase I exclusion domain
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           heavy chain; AltName: Full=Dipeptidyl peptidase I heavy
           chain; Contains: RecName: Full=Dipeptidyl peptidase 1
           light chain; AltName: Full=Dipeptidyl peptidase I light
           chain; Flags: Precursor
 gi|220686|dbj|BAA14400.1| cathepsin C precursor [Rattus norvegicus]
 gi|149069035|gb|EDM18587.1| cathepsin C, isoform CRA_a [Rattus norvegicus]
          Length = 462

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 43/135 (31%), Positives = 63/135 (46%), Gaps = 31/135 (22%)

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           NE  +  E+ +HGP+  AF V DD + Y SG +                        G  
Sbjct: 355 NEALMKLELVKHGPMAVAFEVHDDFLHYHSGIY---------------------HHTGLS 393

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
             F  F+          L  HA+ ++G+G+D  +   YW++ NSW + WG++G F+I RG
Sbjct: 394 DPFNPFE----------LTNHAVLLVGYGKDPVTGLDYWIVKNSWGSQWGESGYFRIRRG 443

Query: 292 KDECGIESSITAGVP 306
            DEC IES   A +P
Sbjct: 444 TDECAIESIAMAAIP 458


>gi|149635146|ref|XP_001512140.1| PREDICTED: dipeptidyl peptidase 1-like [Ornithorhynchus anatinus]
          Length = 469

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 44/135 (32%), Positives = 64/135 (47%), Gaps = 31/135 (22%)

Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
           NE  +  E+  HGP+  AF V++D + Y+ G +   G            +RD        
Sbjct: 362 NEALMKLELVRHGPMAVAFEVYNDFLHYREGVYHHTG------------LRD-------- 401

Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
             F  F+          L  HA+ ++G+G D  +   YW++ NSW T WG++G F+I RG
Sbjct: 402 -PFNPFE----------LTNHAVLLVGYGTDPATGLDYWIVKNSWGTAWGEDGYFRIRRG 450

Query: 292 KDECGIESSITAGVP 306
            DEC IES   A  P
Sbjct: 451 SDECAIESIAVAATP 465


>gi|403293249|ref|XP_003937633.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
           [Saimiri boliviensis boliviensis]
          Length = 467

 Score = 79.0 bits (193), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 48/140 (34%), Positives = 68/140 (48%), Gaps = 34/140 (24%)

Query: 166 SYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRF-FVPGNETTAMSLIKWTIRDN 224
           +Y + SN+  IMKE+ E+GPV+    V +D  LYK G +   P N               
Sbjct: 343 AYRLGSNDTEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVN--------------- 387

Query: 225 TSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEK---SKEKYWLIANSWNTDWG 281
              LG    +            +  G H+++I GWGE+ +    K KYW  ANSW   WG
Sbjct: 388 ---LGRPERY------------RRHGTHSVKITGWGEETRPDGRKLKYWTAANSWGPAWG 432

Query: 282 DNGLFKILRGKDECGIESSI 301
           + G F+I+RG +EC IES +
Sbjct: 433 ERGHFRIVRGVNECDIESFV 452


>gi|195346663|ref|XP_002039877.1| GM15657 [Drosophila sechellia]
 gi|194135226|gb|EDW56742.1| GM15657 [Drosophila sechellia]
          Length = 431

 Score = 79.0 bits (193), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 77/293 (26%), Positives = 113/293 (38%), Gaps = 91/293 (31%)

Query: 67  PANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAP---- 122
           P  R+  +       + LP++F++  KW +   I E+ DQG CG+ W      +A     
Sbjct: 170 PTYRVKAMTRLRNPTDGLPSSFNALDKWSS--YISEVPDQGWCGASWVLSTTSVASDRFA 227

Query: 123 -------------------------CE-----------HHVNGTRPSCDASKGHTPKC-V 145
                                    CE           H       +C     H   C +
Sbjct: 228 IQSKGKEAVQLSAQNILSCTRRQQGCEGGHLDAAWRYLHKKGVVDENCYPYTQHRDTCKI 287

Query: 146 RE---------CQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDL 196
           R          CQ   +V  +  L     +YS++  E  IM EI+  GPV+    V  D 
Sbjct: 288 RHNSRSLRANGCQTPVNVD-RDTLYTVGPAYSLN-READIMAEIFHSGPVQATMRVNRDF 345

Query: 197 ILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGG-HAIR 255
             Y  G +     ET A                               + KAL G H+++
Sbjct: 346 FAYSGGVY----RETAA-------------------------------NRKALTGFHSVK 370

Query: 256 ILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
           ++GWGE E + EKYW+ ANSW + WG++G F+ILRG +ECGIE  + A  P +
Sbjct: 371 LVGWGE-EHNGEKYWIAANSWGSWWGEHGYFRILRGSNECGIEDYVLASWPYV 422


>gi|417401428|gb|JAA47600.1| Putative cysteine proteinase tin-ag [Desmodus rotundus]
          Length = 466

 Score = 79.0 bits (193), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 49/154 (31%), Positives = 76/154 (49%), Gaps = 33/154 (21%)

Query: 151 NYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNE 210
           N+ V +  D+     +Y + S+EK IMKE+ E+GPV+    V +D  LY++G +      
Sbjct: 328 NHQV-HANDIYQVTPAYRLGSSEKEIMKELMENGPVQALMEVHEDFFLYQNGIY-----S 381

Query: 211 TTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGED---EKSKE 267
            T +SL +                         +  +  G H+++I GWGE+   +    
Sbjct: 382 HTPVSLGR------------------------PERYRRHGTHSVKITGWGEESLPDGRTL 417

Query: 268 KYWLIANSWNTDWGDNGLFKILRGKDECGIESSI 301
           KYW  ANSW   WG+ G F+I+RG +EC IES +
Sbjct: 418 KYWTAANSWGPAWGERGHFRIVRGANECDIESFV 451


>gi|403293251|ref|XP_003937634.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
           [Saimiri boliviensis boliviensis]
          Length = 436

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 46/139 (33%), Positives = 66/139 (47%), Gaps = 32/139 (23%)

Query: 166 SYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNT 225
           +Y + SN+  IMKE+ E+GPV+    V +D  LYK G +                     
Sbjct: 312 AYRLGSNDTEIMKELMENGPVQALMEVHEDFFLYKGGIY-----------------SHTP 354

Query: 226 SQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEK---SKEKYWLIANSWNTDWGD 282
             LG    +            +  G H+++I GWGE+ +    K KYW  ANSW   WG+
Sbjct: 355 VNLGRPERY------------RRHGTHSVKITGWGEETRPDGRKLKYWTAANSWGPAWGE 402

Query: 283 NGLFKILRGKDECGIESSI 301
            G F+I+RG +EC IES +
Sbjct: 403 RGHFRIVRGVNECDIESFV 421


>gi|308162940|gb|EFO65307.1| Cathepsin B precursor [Giardia lamblia P15]
          Length = 303

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 71/266 (26%), Positives = 116/266 (43%), Gaps = 34/266 (12%)

Query: 58  MGVHPDY------NLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGS 111
           M + PD       +LP + + E+    E  + +P+ FD R ++P C  +  + DQGSCG 
Sbjct: 50  MLIRPDILGAGSGSLPPSSVTEI---QEPADPIPSQFDFRDEYPQC--VTPVMDQGSCGG 104

Query: 112 CWGCRPYEIAPCEHHVNGT-RPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVS 170
           CW      +      V G  +     S+ +   C  E        +    +F   + + +
Sbjct: 105 CWAFSAIGVFGDRRCVAGIDKEGVPYSQQYLISCSTENHGCDGGDFWPTWSF--LTLTGA 162

Query: 171 SNEKSIMKEIYEHGPVEGAFTVFDD---LILYKS-GRFFVPGNETTAMSLIKWTIRDNTS 226
           +  + +    Y +        V DD   + LYK+ G   V  N    M ++        +
Sbjct: 163 TTAECVKYIDYPNIVASPCPAVCDDGSQIQLYKAHGYGQVSKNVQAIMHML-------AT 215

Query: 227 QLGAEGAFTVFDDLILYKSGK--------ALGGHAIRILGWGEDEKSKEKYWLIANSWNT 278
               +    V+ DL  Y+SG         +LG HA+ ++G+G  +   + YW+I NSW  
Sbjct: 216 GGPVQTMIVVYSDLSYYESGVYKHTYGTISLGLHALEMVGYGTTDDGTD-YWIIRNSWGA 274

Query: 279 DWGDNGLFKILRGKDECGIESSITAG 304
           DWG+NG F+I+RG +EC IE  I A 
Sbjct: 275 DWGENGYFRIVRGVNECRIEDEIYAA 300


>gi|195488613|ref|XP_002092389.1| GE11695 [Drosophila yakuba]
 gi|194178490|gb|EDW92101.1| GE11695 [Drosophila yakuba]
          Length = 431

 Score = 78.6 bits (192), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 76/290 (26%), Positives = 109/290 (37%), Gaps = 89/290 (30%)

Query: 67  PANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAP---- 122
           P  R+  +       + LP++F++  KW +   I E+ DQG CG+ W      +A     
Sbjct: 170 PTYRVKAMTRLKNPTDGLPSSFNALDKWSS--YISEVPDQGWCGASWVLSTTSVASDRFA 227

Query: 123 -------------------------CE-----------HHVNGTRPSC-------DASK- 138
                                    CE           H       SC       D  K 
Sbjct: 228 IQSKGKEAVQLSAQNILSCTRRQQGCEGGHLDAAWRYLHKKGVVDESCYPYTQQRDTCKI 287

Query: 139 GHTPKCVRE--CQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDL 196
            H  + +R   CQ  Y+V        G  +YS++  E  IM EI+  GPV+    V  D 
Sbjct: 288 RHNSRSLRANGCQTPYNVDRDTFYTVGP-AYSLN-READIMAEIFHSGPVQATMRVNRDF 345

Query: 197 ILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRI 256
             Y  G +     +T A  +                                 G H++++
Sbjct: 346 FAYAGGVY----RQTAANRMA------------------------------PTGFHSVKL 371

Query: 257 LGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
           +GWGE E + EKYW+ ANSW   WG+ G F+ILRG +ECGIE  + A  P
Sbjct: 372 VGWGE-EHNGEKYWIAANSWGPWWGERGYFRILRGSNECGIEEYVLASWP 420


>gi|12060418|dbj|BAB20596.1| ARG1 [Mus musculus]
 gi|71059879|emb|CAJ18483.1| Lcn7 [Mus musculus]
          Length = 415

 Score = 78.6 bits (192), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 76/296 (25%), Positives = 113/296 (38%), Gaps = 97/296 (32%)

Query: 69  NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG-------------- 114
           N +  ++G  EV   LP  F++  KWPN   I E  DQG+C   W               
Sbjct: 139 NEIYTVLGQGEV---LPTAFEASEKWPN--LIHEPLDQGNCAGSWAFSTAAVASDRVSIH 193

Query: 115 --------CRPYEIAPCE-HHVNGTR------------------PSCDASKGH------- 140
                     P  +  C+ HH  G R                   +C    G        
Sbjct: 194 SLGHMTPILSPQNLLSCDTHHQQGCRGGRLDGAWWFLRRRGVVSDNCYPFSGREQNEASP 253

Query: 141 TPKCVREC--------QENYDVPYKK----DLNFGAKSYSVSSNEKSIMKEIYEHGPVEG 188
           TP+C+           Q     P  +    D+     +Y + S+EK IMKE+ E+GPV+ 
Sbjct: 254 TPRCMMHSRAMGRGKRQATSRCPNGQVDSNDIYQVTPAYRLGSDEKEIMKELMENGPVQA 313

Query: 189 AFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKA 248
              V +D  LY+ G +       T +S  +                         +  + 
Sbjct: 314 LMEVHEDFFLYQRGIY-----SHTPVSQGR------------------------PEQYRR 344

Query: 249 LGGHAIRILGWGED---EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSI 301
            G H+++I GWGE+   +    KYW  ANSW   WG+ G F+I+RG +EC IE+ +
Sbjct: 345 HGTHSVKITGWGEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGTNECDIETFV 400


>gi|355557764|gb|EHH14544.1| hypothetical protein EGK_00488 [Macaca mulatta]
 gi|355745087|gb|EHH49712.1| hypothetical protein EGM_00421 [Macaca fascicularis]
 gi|384948750|gb|AFI37980.1| tubulointerstitial nephritis antigen-like isoform 1 precursor
           [Macaca mulatta]
 gi|384948752|gb|AFI37981.1| tubulointerstitial nephritis antigen-like isoform 1 precursor
           [Macaca mulatta]
 gi|387540550|gb|AFJ70902.1| tubulointerstitial nephritis antigen-like isoform 1 precursor
           [Macaca mulatta]
          Length = 467

 Score = 78.6 bits (192), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 47/138 (34%), Positives = 68/138 (49%), Gaps = 32/138 (23%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           Y + SN+K IMKE+ E+GPV+    V +D  LYK G +       T +SL +        
Sbjct: 344 YRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIY-----SHTPVSLGR-------- 390

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGED---EKSKEKYWLIANSWNTDWGDN 283
                            +  +  G H+++I GWGE+   +    KYW  ANSW   WG+ 
Sbjct: 391 ----------------PERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGER 434

Query: 284 GLFKILRGKDECGIESSI 301
           G F+I+RG +EC IES +
Sbjct: 435 GHFRIVRGVNECDIESFV 452


>gi|11545918|ref|NP_071447.1| tubulointerstitial nephritis antigen-like isoform 1 precursor [Homo
           sapiens]
 gi|61213628|sp|Q9GZM7.1|TINAL_HUMAN RecName: Full=Tubulointerstitial nephritis antigen-like; AltName:
           Full=Glucocorticoid-inducible protein 5; AltName:
           Full=Oxidized LDL-responsive gene 2 protein;
           Short=OLRG-2; AltName: Full=Tubulointerstitial nephritis
           antigen-related protein; Short=TIN Ag-related protein;
           Short=TIN-Ag-RP; Flags: Precursor
 gi|11602840|gb|AAG38876.1|AF236150_1 tubulointerstitial nephritis antigen-related protein precursor
           [Homo sapiens]
 gi|11275667|gb|AAG33699.1| oxidized-LDL responsive gene 2 [Homo sapiens]
 gi|11527793|dbj|BAB18636.1| glucocorticoid-inducible protein [Homo sapiens]
 gi|11527809|dbj|BAB18727.1| glucocorticoid-inducible protein [Homo sapiens]
 gi|11761715|gb|AAG40154.1| tubulointerstitial nephritis antigen-related protein [Homo sapiens]
 gi|22761462|dbj|BAC11596.1| unnamed protein product [Homo sapiens]
 gi|37181967|gb|AAQ88787.1| LCN7 [Homo sapiens]
 gi|40353044|gb|AAH64633.1| Tubulointerstitial nephritis antigen-like 1 [Homo sapiens]
 gi|119628009|gb|EAX07604.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_b [Homo
           sapiens]
 gi|119628010|gb|EAX07605.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_b [Homo
           sapiens]
 gi|119628011|gb|EAX07606.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_b [Homo
           sapiens]
 gi|158258977|dbj|BAF85459.1| unnamed protein product [Homo sapiens]
 gi|261858502|dbj|BAI45773.1| tubulointerstitial nephritis antigen-like 1 [synthetic construct]
 gi|410265400|gb|JAA20666.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
 gi|410307560|gb|JAA32380.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
 gi|410307562|gb|JAA32381.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
 gi|410307564|gb|JAA32382.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
 gi|410335249|gb|JAA36571.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
          Length = 467

 Score = 78.6 bits (192), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 47/138 (34%), Positives = 68/138 (49%), Gaps = 32/138 (23%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           Y + SN+K IMKE+ E+GPV+    V +D  LYK G +       T +SL +        
Sbjct: 344 YRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIY-----SHTPVSLGR-------- 390

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGED---EKSKEKYWLIANSWNTDWGDN 283
                            +  +  G H+++I GWGE+   +    KYW  ANSW   WG+ 
Sbjct: 391 ----------------PERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGER 434

Query: 284 GLFKILRGKDECGIESSI 301
           G F+I+RG +EC IES +
Sbjct: 435 GHFRIVRGVNECDIESFV 452


>gi|253748399|gb|EET02549.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 303

 Score = 78.6 bits (192), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 71/297 (23%), Positives = 110/297 (37%), Gaps = 96/297 (32%)

Query: 58  MGVHPDY------NLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGS 111
           M ++PD       ++P+  L E+   ++  + LPA FD R ++P+C  +  + DQGSCG 
Sbjct: 50  MLINPDRLKARSGSMPSAPLKEI---NDPTDPLPAQFDFRDEYPHC--VSPVFDQGSCGG 104

Query: 112 CW-------------------------------------GCRPYEIAPC---EHHVNGTR 131
           CW                                     GC   +  P          T 
Sbjct: 105 CWAFSAIGMFGSRRCAVGIDKAAVLYSQQHLISCSTENFGCSGGDFFPTWSFLTQTGATT 164

Query: 132 PSC----DASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVE 187
             C    D        C   C +   + + K   +G  S SV     +IM+ +   GPV+
Sbjct: 165 AECVKYVDYGSSVAAACPTTCDDGSQIQFYKAHGYGQVSKSV----PAIMQMLVSGGPVQ 220

Query: 188 GAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGK 247
               V+ DL+ Y  G +                 R     +                   
Sbjct: 221 TMIVVYADLLYYAGGVY-----------------RHTYGPISN----------------- 246

Query: 248 ALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAG 304
             G HA+ ++G+G  +   + YW I NSW +DWG++G F+I+RG +EC IE  I A 
Sbjct: 247 --GLHALEMVGYGTTDDGTD-YWTIKNSWGSDWGEDGYFRIVRGVNECRIEDEIYAA 300


>gi|297282815|ref|XP_002802331.1| PREDICTED: tubulointerstitial nephritis antigen-like [Macaca
           mulatta]
          Length = 322

 Score = 78.6 bits (192), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 55/183 (30%), Positives = 81/183 (44%), Gaps = 41/183 (22%)

Query: 122 PCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIY 181
           PC  H   +R      +  T +C      N D+     +      Y + SN+K IMKE+ 
Sbjct: 163 PCMMH---SRAMGRGKRQATARCPNSHVNNNDIYQVTPV------YRLGSNDKEIMKELM 213

Query: 182 EHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLI 241
           E+GPV+    V +D  LYK G +       T +SL +                       
Sbjct: 214 ENGPVQALMEVHEDFFLYKGGIY-----SHTPVSLGR----------------------- 245

Query: 242 LYKSGKALGGHAIRILGWGED---EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
             +  +  G H+++I GWGE+   +    KYW  ANSW   WG+ G F+I+RG +EC IE
Sbjct: 246 -PERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIE 304

Query: 299 SSI 301
           S +
Sbjct: 305 SFV 307


>gi|28804799|dbj|BAC57943.1| cathepsin C [Marsupenaeus japonicus]
          Length = 449

 Score = 78.6 bits (192), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 59/197 (29%), Positives = 86/197 (43%), Gaps = 38/197 (19%)

Query: 112 CWGCRPYEIA-PCEHHVNGTRPSCDASKGHTPKCVRE-CQENYDVPYKKDLNFGAKSYSV 169
           C G  P+ IA      V     +C   +G    C R  C ++Y   Y+         Y  
Sbjct: 287 CEGGFPFLIAGRYAQDVGVVLENCYPYEGKDDTCTRSSCTKHYTAYYRY-----VGGYYG 341

Query: 170 SSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLG 229
           + NE+ +   + + GP+     V+DD + YKSG +   G            +RD+ + L 
Sbjct: 342 ACNEEEMKIALIKGGPLIVGLEVYDDFLHYKSGIYHHTG------------LRDSFNPL- 388

Query: 230 AEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKIL 289
                              L  HA+ ++G+GEDE + EKYW + NSW   WG++G F+I 
Sbjct: 389 ------------------ELTNHAVLLVGYGEDETTGEKYWSVKNSWGEGWGEDGYFRIR 430

Query: 290 RGKDECGIESSITAGVP 306
           RG DEC IES     VP
Sbjct: 431 RGVDECAIESMAVEAVP 447


>gi|270132817|ref|NP_075965.2| tubulointerstitial nephritis antigen-like precursor [Mus musculus]
 gi|270132824|ref|NP_001161805.1| tubulointerstitial nephritis antigen-like precursor [Mus musculus]
 gi|61213616|sp|Q99JR5.1|TINAL_MOUSE RecName: Full=Tubulointerstitial nephritis antigen-like; AltName:
           Full=Adrenocortical zonation factor 1; Short=AZ-1;
           AltName: Full=Androgen-regulated gene 1 protein;
           AltName: Full=Tubulointerstitial nephritis
           antigen-related protein; Short=TARP; Flags: Precursor
 gi|13543125|gb|AAH05738.1| Tinagl1 protein [Mus musculus]
 gi|17391278|gb|AAH18539.1| Tinagl1 protein [Mus musculus]
 gi|30314458|dbj|BAC76038.1| tubulointersititial nephritis antigen-related protein [Mus
           musculus]
 gi|148698197|gb|EDL30144.1| tubulointerstitial nephritis antigen-like, isoform CRA_a [Mus
           musculus]
 gi|148698198|gb|EDL30145.1| tubulointerstitial nephritis antigen-like, isoform CRA_a [Mus
           musculus]
          Length = 466

 Score = 78.6 bits (192), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 76/296 (25%), Positives = 113/296 (38%), Gaps = 97/296 (32%)

Query: 69  NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG-------------- 114
           N +  ++G  EV   LP  F++  KWPN   I E  DQG+C   W               
Sbjct: 190 NEIYTVLGQGEV---LPTAFEASEKWPN--LIHEPLDQGNCAGSWAFSTAAVASDRVSIH 244

Query: 115 --------CRPYEIAPCE-HHVNGTR------------------PSCDASKGH------- 140
                     P  +  C+ HH  G R                   +C    G        
Sbjct: 245 SLGHMTPILSPQNLLSCDTHHQQGCRGGRLDGAWWFLRRRGVVSDNCYPFSGREQNEASP 304

Query: 141 TPKCVREC--------QENYDVPYKK----DLNFGAKSYSVSSNEKSIMKEIYEHGPVEG 188
           TP+C+           Q     P  +    D+     +Y + S+EK IMKE+ E+GPV+ 
Sbjct: 305 TPRCMMHSRAMGRGKRQATSRCPNGQVDSNDIYQVTPAYRLGSDEKEIMKELMENGPVQA 364

Query: 189 AFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKA 248
              V +D  LY+ G +       T +S  +                         +  + 
Sbjct: 365 LMEVHEDFFLYQRGIY-----SHTPVSQGR------------------------PEQYRR 395

Query: 249 LGGHAIRILGWGED---EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSI 301
            G H+++I GWGE+   +    KYW  ANSW   WG+ G F+I+RG +EC IE+ +
Sbjct: 396 HGTHSVKITGWGEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGTNECDIETFV 451


>gi|332808277|ref|XP_524645.3| PREDICTED: LOW QUALITY PROTEIN: tubulointerstitial nephritis
           antigen-like 1 [Pan troglodytes]
          Length = 472

 Score = 78.6 bits (192), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 47/138 (34%), Positives = 68/138 (49%), Gaps = 32/138 (23%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           Y + SN+K IMKE+ E+GPV+    V +D  LYK G +       T +SL +        
Sbjct: 349 YRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIY-----SHTPVSLGR-------- 395

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGED---EKSKEKYWLIANSWNTDWGDN 283
                            +  +  G H+++I GWGE+   +    KYW  ANSW   WG+ 
Sbjct: 396 ----------------PERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGER 439

Query: 284 GLFKILRGKDECGIESSI 301
           G F+I+RG +EC IES +
Sbjct: 440 GHFRIVRGVNECDIESFV 457


>gi|426328832|ref|XP_004025452.1| PREDICTED: tubulointerstitial nephritis antigen-like [Gorilla
           gorilla gorilla]
          Length = 462

 Score = 78.6 bits (192), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 47/138 (34%), Positives = 68/138 (49%), Gaps = 32/138 (23%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           Y + SN+K IMKE+ E+GPV+    V +D  LYK G +       T +SL +        
Sbjct: 339 YRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIY-----SHTPVSLGR-------- 385

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGED---EKSKEKYWLIANSWNTDWGDN 283
                            +  +  G H+++I GWGE+   +    KYW  ANSW   WG+ 
Sbjct: 386 ----------------PERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGER 429

Query: 284 GLFKILRGKDECGIESSI 301
           G F+I+RG +EC IES +
Sbjct: 430 GHFRIVRGVNECDIESFV 447


>gi|397515889|ref|XP_003828174.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1 [Pan
           paniscus]
          Length = 467

 Score = 78.6 bits (192), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 47/138 (34%), Positives = 68/138 (49%), Gaps = 32/138 (23%)

Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
           Y + SN+K IMKE+ E+GPV+    V +D  LYK G +       T +SL +        
Sbjct: 344 YRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIY-----SHTPVSLGR-------- 390

Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGED---EKSKEKYWLIANSWNTDWGDN 283
                            +  +  G H+++I GWGE+   +    KYW  ANSW   WG+ 
Sbjct: 391 ----------------PERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGER 434

Query: 284 GLFKILRGKDECGIESSI 301
           G F+I+RG +EC IES +
Sbjct: 435 GHFRIVRGVNECDIESFV 452


>gi|14290553|gb|AAH09048.1| TINAGL1 protein [Homo sapiens]
          Length = 218

 Score = 78.6 bits (192), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 54/183 (29%), Positives = 77/183 (42%), Gaps = 41/183 (22%)

Query: 122 PCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIY 181
           PC  H          +  H P       + Y V            Y + SN+K IMKE+ 
Sbjct: 59  PCMMHSRAMGRGKRQATAHCPNSYVNNNDIYQV---------TPVYRLGSNDKEIMKELM 109

Query: 182 EHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLI 241
           E+GPV+    V +D  LYK G +       T +SL +                       
Sbjct: 110 ENGPVQALMEVHEDFFLYKGGIY-----SHTPVSLGR----------------------- 141

Query: 242 LYKSGKALGGHAIRILGWGED---EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
             +  +  G H+++I GWGE+   +    KYW  ANSW   WG+ G F+I+RG +EC IE
Sbjct: 142 -PERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNECDIE 200

Query: 299 SSI 301
           S +
Sbjct: 201 SFV 203


>gi|341891034|gb|EGT46969.1| hypothetical protein CAEBREN_30419 [Caenorhabditis brenneri]
          Length = 422

 Score = 78.6 bits (192), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 84/335 (25%), Positives = 123/335 (36%), Gaps = 72/335 (21%)

Query: 18  PGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGY 77
           P   W+       V   +YG K     +       H++ +     + +     L EL  Y
Sbjct: 79  PETTWKAKFNKFGVKNRSYGFKYTRNQTAVEEYMEHIRKFF----ESDAMKRHLEELDNY 134

Query: 78  SEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEH--HVNGTRPSCD 135
                DLP  FD+R KWPNCP+I  + +QG CGSC+      +A      H NGT  +  
Sbjct: 135 KS--SDLPKAFDARQKWPNCPSISNVPNQGGCGSCFAVAAAGVASDRACIHSNGTFKAL- 191

Query: 136 ASKGHTPKCVRECQENYD-----------------------VPYKKDLNFGA----KSYS 168
            S+     C   C   Y                         PY  DL+ G      ++ 
Sbjct: 192 LSEEDIIGCCSVCGNCYGGDPLKALTYWVNQGLVTGGRDGCRPYSFDLSCGVPCSPATFF 251

Query: 169 VSSNEKSIMKE---IYEHGPVE--GAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRD 223
            +  +++ M+    IY     E    F  F   +  +S      G E   +  I     D
Sbjct: 252 EAEEKRTCMRRCQNIYYQQRYEEDKHFATFAYSLYPRSMTVSPDGKERVKVPTIIGHFND 311

Query: 224 -NTSQLGAEG-----------------AFTVFDDLILYKSG------------KALGGHA 253
            NT +L                     AF V ++ + Y SG            + +  H 
Sbjct: 312 KNTEKLNVTEYRNVIKKEILLYGPTTMAFPVPEEFLHYSSGVFRPFPLDGFDDRIVYWHV 371

Query: 254 IRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKI 288
           +R++GWG+ E     YWL  NS+ + WGDNGLFKI
Sbjct: 372 VRLIGWGQSEDGTH-YWLAVNSFGSHWGDNGLFKI 405


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.317    0.137    0.447 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,609,389,746
Number of Sequences: 23463169
Number of extensions: 251284525
Number of successful extensions: 508762
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 4034
Number of HSP's successfully gapped in prelim test: 1988
Number of HSP's that attempted gapping in prelim test: 491662
Number of HSP's gapped (non-prelim): 15870
length of query: 309
length of database: 8,064,228,071
effective HSP length: 142
effective length of query: 167
effective length of database: 9,027,425,369
effective search space: 1507580036623
effective search space used: 1507580036623
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 76 (33.9 bits)