BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy15346
         (280 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|256086863|ref|XP_002579605.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
 gi|353228447|emb|CCD74618.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 271

 Score =  129 bits (324), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 77/242 (31%), Positives = 113/242 (46%), Gaps = 58/242 (23%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  GI    W +    G+VTGG++ ++TGCQP  FP CNH + + S P C++   P P+C
Sbjct: 84  CRGGIPGMAWDYWKYEGIVTGGSNETHTGCQPYPFPECNHHSSSKSYPPCESYYFPTPEC 143

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           H  C  D+YG+ + +DK+  K  Y V  E   I +EI+ NGPV    Y+Y D  +YKSG 
Sbjct: 144 HETC-QDDYGKPYKKDKFYGKSSYNVASEEISIMKEILLNGPVEGGFYVYEDFLNYKSG- 201

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          ++ + +G Y        +    ++I+GWG                
Sbjct: 202 ---------------VYKHITGSY--------LGGHAIRIIGWGI--------------- 223

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                             ++N  PYW   +++  Q+GD+G  KILRG NE  IES+V   
Sbjct: 224 ------------------QQNHIPYWLCANSWNNQWGDQGYFKILRGTNECGIESMVTAG 265

Query: 242 LP 243
           LP
Sbjct: 266 LP 267


>gi|242001640|ref|XP_002435463.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
 gi|215498799|gb|EEC08293.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
          Length = 223

 Score =  123 bits (309), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 77/243 (31%), Positives = 109/243 (44%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           CS G+S++ W +    GLV+GG +++  GC+P S  PC H++   S PEC     P PKC
Sbjct: 40  CSGGVSAAAWQYWKDAGLVSGGLYNTTDGCKPYSLAPCEHSS-QGSLPEC-VGTLPTPKC 97

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C  + Y R +  DKY  K  Y +N     I+ EI +NGPV A    Y+D  SYKSG 
Sbjct: 98  KRQC-REGYERSYDDDKYFAKNVYSINGSEKQIRTEIFQNGPVEAEFTAYADFLSYKSGV 156

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         S +I+    ++I+GWG E+  P          
Sbjct: 157 YQH------------------------HSRDIIGRHAIRILGWGSEDNNP---------- 182

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW + +++ E +GD G  K+LRG NE  IES VN  
Sbjct: 183 ------------------------YWLLANSWNEDWGDHGYFKMLRGVNECDIESFVNAG 218

Query: 242 LPK 244
           +PK
Sbjct: 219 IPK 221


>gi|240992699|ref|XP_002404474.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
 gi|215491571|gb|EEC01212.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
          Length = 337

 Score =  120 bits (300), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 74/244 (30%), Positives = 115/244 (47%), Gaps = 61/244 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  ++ W +  + GLVTGG + ++ GC+P S  PC H +   S P C T   P PKC
Sbjct: 154 CNGGYPAAAWEYWKESGLVTGGLYGTSDGCKPYSLAPCEH-HTKGSLPNC-TGTVPTPKC 211

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    YG+ +  DK+  ++ Y ++ +   IQ EI KNGPV A+  +Y+D  SYKSG 
Sbjct: 212 VHLCRK-GYGKDYQDDKHFGRKVYSISSDEKQIQTEIFKNGPVEADFTVYADFLSYKSG- 269

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          ++ ++SG        +++    ++I+GWG ENG P          
Sbjct: 270 ---------------VYQHQSG--------DVLGGHAIRILGWGTENGTP---------- 296

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW + +++ E +GD G  KILRG++E  IE  +N  
Sbjct: 297 ------------------------YWLVANSWNEDWGDHGYFKILRGKDECGIEDDINAG 332

Query: 242 LPKD 245
           +PK+
Sbjct: 333 IPKN 336


>gi|156255405|gb|ABU62925.1| cathepsin B [Fasciola hepatica]
          Length = 337

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 76/242 (31%), Positives = 108/242 (44%), Gaps = 59/242 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ GI + +W +  + G+VTGG   + TGC P  FP C+H   T   P C     P PKC
Sbjct: 155 CNGGIPAMSWDYWTREGVVTGGTLENPTGCLPYPFPKCSHGVVTPGLPPCPRDIYPTPKC 214

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C +  Y + + QDK + K  Y V  +  DI  EIMKNGPV    Y++ D   YKSG 
Sbjct: 215 EKKC-HAGYNKTYEQDKVKGKSSYNVGGQETDIMMEIMKNGPVDGIFYMFEDFLVYKSG- 272

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ Y +G         +V    ++++GWG ENG            
Sbjct: 273 ---------------IYHYTTG--------RLVGGHAIRVIGWGVENG------------ 297

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                      VK           YW I +++ E +G+KG  ++ RG NE  IE+ +N  
Sbjct: 298 -----------VK-----------YWLIANSWNEGWGEKGYFRMRRGNNECGIEARINAG 335

Query: 242 LP 243
           LP
Sbjct: 336 LP 337


>gi|29374027|gb|AAO73004.1| cathepsin B [Fasciola gigantica]
          Length = 337

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 75/242 (30%), Positives = 108/242 (44%), Gaps = 59/242 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ GI + +W +  + G+VTGG   + TGC P  FP C+H   T   P C     P PKC
Sbjct: 155 CNGGIPAMSWDYWTREGVVTGGTLENPTGCLPYPFPKCSHGVVTPGLPPCPRDIYPTPKC 214

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C +  Y + + QDK + K  Y V ++  D   EIMKNGPV    Y++ D   YKSG 
Sbjct: 215 EKKC-HAGYNKTYEQDKVKGKSSYNVGEQETDFMMEIMKNGPVDGIFYMFEDFLVYKSG- 272

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ Y +G         +V    ++++GWG ENG            
Sbjct: 273 ---------------IYHYTTG--------RLVGGHAIRVIGWGVENG------------ 297

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                      VK           YW I +++ E +G+KG  ++ RG NE  IE+ +N  
Sbjct: 298 -----------VK-----------YWLIANSWNEGWGEKGYFRMRRGNNECGIEARINAG 335

Query: 242 LP 243
           LP
Sbjct: 336 LP 337


>gi|240992702|ref|XP_002404475.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
 gi|215491572|gb|EEC01213.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
          Length = 337

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 75/244 (30%), Positives = 110/244 (45%), Gaps = 61/244 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  ++ W +  + GLVTGG + +N GC+P S  PC H +   S P C T   P PKC
Sbjct: 154 CNGGTPAAAWEYWKESGLVTGGLYGTNDGCKPYSLAPCEH-HTKGSLPNC-TGTVPTPKC 211

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    YG+ +  DK+  K+ Y ++ +   IQ EI KNGPV A+  + +D  SYKSG 
Sbjct: 212 VHLCRK-GYGKDYQDDKHFGKKVYSISSDEKQIQTEIFKNGPVEADFIVLADFLSYKSGV 270

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         S +++    ++I+GWG ENG P          
Sbjct: 271 YQH------------------------HSDDVIGGHAIRILGWGTENGTP---------- 296

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW   +++ E +GD G  KILRG++E  IE  +N  
Sbjct: 297 ------------------------YWLAANSWNEDWGDHGYFKILRGKDECGIEEDINAG 332

Query: 242 LPKD 245
           +PK+
Sbjct: 333 IPKN 336


>gi|56753605|gb|AAW25005.1| SJCHGC02852 protein [Schistosoma japonicum]
          Length = 346

 Score =  117 bits (294), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 72/243 (29%), Positives = 108/243 (44%), Gaps = 57/243 (23%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ GI    W +    G+VTGG++ ++TGCQP  FP C H + + +   C+      P+C
Sbjct: 161 CNGGIPGMAWDYWKDEGIVTGGSNETHTGCQPYPFPECIHHSTSINHSSCEVKYYSTPEC 220

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +  C  D Y   +  DKY  K  Y+V  +   I +EI+ NGPV A  Y++ D  +YK+G 
Sbjct: 221 YQTCQPD-YAIQYENDKYYGKSSYYVTSDEVSIMKEILLNGPVEATFYVFDDFLNYKTG- 278

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          ++ Y +G         ++    ++I+GWG                
Sbjct: 279 ---------------VYKYVTG--------SLLGGHAIRIIGWGVST------------- 302

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                               N  PYW   +++ +Q+GDKG  KILRG NE  IES+V   
Sbjct: 303 -------------------LNHTPYWLCANSWNKQWGDKGYFKILRGSNECGIESMVTAG 343

Query: 242 LPK 244
           LPK
Sbjct: 344 LPK 346


>gi|126681075|gb|ABO26563.1| cathepsin B-like cysteine protease form 1 [Ixodes ricinus]
          Length = 337

 Score =  117 bits (292), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 74/244 (30%), Positives = 108/244 (44%), Gaps = 61/244 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G  ++ W +  + GLV+ G + +  GC+P S  PC H +   S P C T   P PKC
Sbjct: 154 CDGGYPAAAWEYWKESGLVSDGLYGTPDGCKPYSLAPCEH-HTKGSLPNC-TGTVPTPKC 211

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    YG+ +  DK+  K+ Y ++     IQ EI KNGPV A+  +Y+D  SYKSG 
Sbjct: 212 VHLCRK-GYGKDYQHDKHFGKKVYSISSNEKQIQTEIFKNGPVEADFTVYADFLSYKSGV 270

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         S +++    ++I+GWG ENG P          
Sbjct: 271 YQH------------------------HSGDVLGGHAIRILGWGTENGTP---------- 296

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW + +++ E +GD G  KILRG++E  IE  +N  
Sbjct: 297 ------------------------YWLVANSWNEDWGDHGYFKILRGKDECGIEDDINAG 332

Query: 242 LPKD 245
           +PKD
Sbjct: 333 IPKD 336


>gi|187105118|ref|NP_001119619.1| cathepsin B-5880 precursor [Acyrthosiphon pisum]
 gi|163300442|tpg|DAA06127.1| TPA_inf: cathepsin B transcript 5880 [Acyrthosiphon pisum]
 gi|239790051|dbj|BAH71611.1| ACYPI000015 [Acyrthosiphon pisum]
 gi|239790053|dbj|BAH71612.1| ACYPI000015 [Acyrthosiphon pisum]
          Length = 302

 Score =  116 bits (290), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 73/242 (30%), Positives = 104/242 (42%), Gaps = 62/242 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           CS G    +W +  + GLV+GG + SN GCQP +  PC H   T  E  C       P+C
Sbjct: 122 CSGGQHFESWDFYRRHGLVSGGEYGSNEGCQPYTIEPCQHTE-TAVENACSNKTLFTPEC 180

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C N +YG  + +D ++   Y           +EI +NGP+ A+ Y+Y D  +Y+SG 
Sbjct: 181 KVQCYNPDYGTRYVKDNHQGTHY---RVPAYTAMKEIYENGPITASFYMYQDFVNYQSG- 236

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          +++Y SG Y  + +        VKI+GWGEENG P          
Sbjct: 237 ---------------VYAYNSGKYVTTQA--------VKILGWGEENGTP---------- 263

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW   ++F   +GD G +KILRG NE  IE  +   
Sbjct: 264 ------------------------YWLAANSFNTYWGDNGFVKILRGANECYIEEFMYAG 299

Query: 242 LP 243
           LP
Sbjct: 300 LP 301


>gi|22535408|emb|CAC87118.1| cathepsin B-like protease [Nilaparvata lugens]
          Length = 347

 Score =  114 bits (284), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 78/244 (31%), Positives = 102/244 (41%), Gaps = 61/244 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLAT-PQPK 60
           C  G   + WV++ + GLVTGG +HS+ GCQP    PC H +   S+P C    T P P 
Sbjct: 161 CEGGFPDAAWVFIKRHGLVTGGDYHSHDGCQPYPIAPCEH-HMEGSKPNCSASPTEPTPA 219

Query: 61  CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           C T CT+ +    + +D+ + K  Y V       Q EI KNGP+V               
Sbjct: 220 CETTCTHGS-SLAYQKDRQKGKSAYLVPVGEKQTQLEIFKNGPIV--------------- 263

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
                   A   +Y D F YKSGVY     +       VK++GWGE+NG PYW +     
Sbjct: 264 --------AAFKVYEDFFMYKSGVYKRHPESPFRGRHAVKVIGWGEQNGLPYWLV----- 310

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                               +N   Y          +GDKG  KI RG NE   E  +  
Sbjct: 311 --------------------QNSWDY---------DWGDKGLFKIARG-NECDFEKSMTA 340

Query: 241 ALPK 244
            LPK
Sbjct: 341 GLPK 344


>gi|196009263|ref|XP_002114497.1| expressed hypothetical protein [Trichoplax adhaerens]
 gi|190583516|gb|EDV23587.1| expressed hypothetical protein [Trichoplax adhaerens]
          Length = 333

 Score =  113 bits (283), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 79/243 (32%), Positives = 105/243 (43%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G     W +    G+VTGG +HS+ GCQP   P C H      +   K L  P PKC
Sbjct: 151 CNGGFLPQAWHYWVNNGIVTGGQYHSHKGCQPYEIPKCEHHVKGPFKACGKEL--PTPKC 208

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C    Y + F QDK+  K+ Y + + +  IQ+EIM NGPV A   +Y+         
Sbjct: 209 SQKC-QPGYNKTFNQDKHFGKKSYSITNNIQQIQKEIMMNGPVEAAFTVYA--------- 258

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                         D  SYKSGVY  +    +  +A VKI+GWG EN  PYW I   +  
Sbjct: 259 --------------DFPSYKSGVYQHTTGGPLGGHA-VKILGWGTENNTPYWLIANSW-- 301

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                  P W          GDKG  KI+RG++E  IES +   
Sbjct: 302 ----------------------NPTW----------GDKGYFKIIRGKDECGIESSIVAG 329

Query: 242 LPK 244
           +PK
Sbjct: 330 MPK 332


>gi|346470617|gb|AEO35153.1| hypothetical protein [Amblyomma maculatum]
          Length = 335

 Score =  113 bits (283), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 74/244 (30%), Positives = 111/244 (45%), Gaps = 61/244 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  ++ W +    G+VTGG + ++ GCQP  FPPC H +     P C T   P P+C
Sbjct: 153 CNGGYPAAAWEFYKTDGIVTGGLYGTDDGCQPYYFPPCEH-HTVGPLPNC-TGIKPTPQC 210

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y + + +DK+  K+ Y ++ +   I+ EI KNGPV A               
Sbjct: 211 VRDCRK-GYEKSYSEDKHYAKKVYTLSADETQIKTEIFKNGPVEA--------------- 254

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                   +  +Y+D  SYKSGVY   +   +  +A ++I+GWG ENG P          
Sbjct: 255 --------DFTVYADFVSYKSGVYQRHSDDALGGHA-IRILGWGTENGVP---------- 295

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW + +++ E +GDKG  KILRG +E  IE  +N  
Sbjct: 296 ------------------------YWLVANSWNEDWGDKGYFKILRGNDECGIEDDINAG 331

Query: 242 LPKD 245
           +PK+
Sbjct: 332 IPKE 335


>gi|341891084|gb|EGT47019.1| CBN-CPR-4 protein [Caenorhabditis brenneri]
          Length = 335

 Score =  113 bits (283), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 73/243 (30%), Positives = 101/243 (41%), Gaps = 58/243 (23%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   + W ++ K G  TGG++ +  GC+P S  PC      T+ P C T     P C
Sbjct: 150 CEGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNTTWPACPTDGYDTPAC 209

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +CTN NY   +  DK+     Y V  +VA IQ EI+ +GPV A   +Y D + YKSG 
Sbjct: 210 VNKCTNSNYNVAYKDDKHFGSTAYAVGKKVAQIQAEIIAHGPVEAAFTVYEDFYQYKSGV 269

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y                        V  + E +    ++I+GWG +NG PYW +   + V
Sbjct: 270 Y------------------------VHTTGEELGGHAIRILGWGTDNGTPYWLVANSWNV 305

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
           +               WGE                    G  +I+RG NE  IE  V G 
Sbjct: 306 N---------------WGE-------------------NGYFRIIRGTNECGIEHAVVGG 331

Query: 242 LPK 244
           +PK
Sbjct: 332 VPK 334


>gi|300122171|emb|CBK22745.2| unnamed protein product [Blastocystis hominis]
          Length = 319

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 72/241 (29%), Positives = 105/241 (43%), Gaps = 60/241 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G S+  W ++ + G+VTGG ++S   C+   FPPC+H       P+C T     PKC
Sbjct: 138 CQGGYSAMAWEYLRRTGVVTGGQYNSTEWCKSYPFPPCSHG-IEGQYPQCSTKPPVVPKC 196

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
            T C  + Y   + +D+Y+F   Y + + V  I+ EIM+NGPV A+  +Y D  +YKSG 
Sbjct: 197 ETTC-QEGYPIEYEKDRYKFSNVYQLENNVDQIKNEIMENGPVDASFQVYEDFMTYKSGI 255

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                           + +   TVKI+GWGEENG            
Sbjct: 256 YHH------------------------VEGKFMNLHTVKIIGWGEENG------------ 279

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW  V+++  ++G+ G  +I  G NE  IES V G 
Sbjct: 280 ----------------------EAYWKAVNSWNSEWGENGLFRIRLGTNECTIESQVEGG 317

Query: 242 L 242
           L
Sbjct: 318 L 318


>gi|225713216|gb|ACO12454.1| Cathepsin B precursor [Lepeophtheirus salmonis]
 gi|290561811|gb|ADD38303.1| Cathepsin B [Lepeophtheirus salmonis]
          Length = 333

 Score =  112 bits (281), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 74/244 (30%), Positives = 106/244 (43%), Gaps = 62/244 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPK 60
           C+ G   + W +  K+GLV+GG + S+ GCQP +  PC +HAN T   P C       PK
Sbjct: 151 CNGGFPGAAWSFWKKKGLVSGGLYGSHKGCQPYAIAPCEHHANGT--RPPCSG-GGRTPK 207

Query: 61  CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           CHT C N++Y   + +DK   +  Y V  +   IQ EIM NGPV A   +YSD  +YKSG
Sbjct: 208 CHTFCENEDYSLPYEKDKSFGRSSYSVKSDPKQIQLEIMNNGPVEAAFSVYSDFLNYKSG 267

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
            Y +                            ++    ++I+GWG ENG P         
Sbjct: 268 VYRH------------------------VKGSLLGGHAIRILGWGVENGTP--------- 294

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                                    YW + +++   +GD GT KIL+G +   IE  +  
Sbjct: 295 -------------------------YWLVANSWNTDWGDNGTFKILKGSDHCGIEGSIVA 329

Query: 241 ALPK 244
            LP+
Sbjct: 330 GLPQ 333


>gi|268558600|ref|XP_002637291.1| C. briggsae CBR-CPR-4 protein [Caenorhabditis briggsae]
          Length = 335

 Score =  112 bits (281), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 74/243 (30%), Positives = 104/243 (42%), Gaps = 58/243 (23%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   + W ++ K G  TGG++ S  GC+P S  PC      T+ P+C       P C
Sbjct: 150 CEGGYPINAWKYLVKSGFCTGGSYVSQFGCKPYSLAPCGETVGNTTWPDCPQDGYNTPSC 209

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +CTN+NY   +  DK+     Y V  +VA IQ EI+ +GPV A   +Y D        
Sbjct: 210 VNKCTNNNYNIAYKDDKHFGSTAYAVGKKVAQIQAEILAHGPVEAAFTVYED-------- 261

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                           + YKSGVY  +   E+  +A ++I+GWG +NG PYW +   + V
Sbjct: 262 ---------------FYQYKSGVYVHTTGQELGGHA-IRILGWGTDNGTPYWLVANSWNV 305

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
           +               WGE                    G  +I+RG NE  IE  V G 
Sbjct: 306 N---------------WGE-------------------NGYFRIIRGTNECGIEHAVVGG 331

Query: 242 LPK 244
           +PK
Sbjct: 332 VPK 334


>gi|340501578|gb|EGR28345.1| hypothetical protein IMG5_177790 [Ichthyophthirius multifiliis]
          Length = 356

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 77/244 (31%), Positives = 105/244 (43%), Gaps = 60/244 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   +   +  K GLVTG     N  CQ  SFPPC H   +T  P CK    P P+C
Sbjct: 165 CNGGYPEAAMQYFVKTGLVTGDLFGDNNFCQAYSFPPCAHHVASTKYPPCKG-EVPTPEC 223

Query: 62  HTRCTNDN-YGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
             +C +D+   R + +D Y+ ++ Y V+ +   I  EIM NGPV     +Y D  +YKSG
Sbjct: 224 KKKCDDDSKVKRPYNEDLYKGQKSYSVSSDPKAIMTEIMNNGPVEVAFTVYEDFVTYKSG 283

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
            Y +                         + E +    VK++GWG EN  PY        
Sbjct: 284 VYQH------------------------VTGEQLGGHAVKMIGWGVENDTPY-------- 311

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                                     W IV+++ E +GD+GT KILRG NE  IE  V  
Sbjct: 312 --------------------------WLIVNSWNETWGDQGTFKILRGSNECGIEDEVVT 345

Query: 241 ALPK 244
           ALP+
Sbjct: 346 ALPQ 349


>gi|341888137|gb|EGT44072.1| hypothetical protein CAEBREN_10156 [Caenorhabditis brenneri]
          Length = 344

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 76/243 (31%), Positives = 99/243 (40%), Gaps = 59/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W W  K GLVTGG++ S  GC+P S  PC       + P+C     P PKC
Sbjct: 154 CEGGYPIQAWKWWGKHGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWPKCPEDTEPTPKC 213

Query: 62  HTRCT-NDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
              CT N  Y   + QDK+     Y V  +V  IQ EI+KNGP+     +Y         
Sbjct: 214 VDACTSNHTYPTAYLQDKHFGATAYAVGKKVEQIQTEILKNGPIEVAFTVY--------- 264

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
                          D + Y +GVY  +A A +  +A VKI+GWG +NG PYW +   + 
Sbjct: 265 --------------EDFYQYTTGVYVHTAGASLGGHA-VKILGWGVDNGTPYWLVANSWN 309

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                          I WGE                   KG  +I+RG NE  IE     
Sbjct: 310 ---------------INWGE-------------------KGYFRIIRGLNECGIEHSAVA 335

Query: 241 ALP 243
            +P
Sbjct: 336 GIP 338


>gi|341900876|gb|EGT56811.1| hypothetical protein CAEBREN_29569 [Caenorhabditis brenneri]
          Length = 344

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 76/243 (31%), Positives = 99/243 (40%), Gaps = 59/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W W  K GLVTGG++ S  GC+P S  PC       + P+C     P PKC
Sbjct: 154 CEGGYPIQAWKWWGKHGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWPKCPEDTEPTPKC 213

Query: 62  HTRCT-NDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
              CT N  Y   + QDK+     Y V  +V  IQ EI+KNGP+     +Y         
Sbjct: 214 VDACTSNHTYPTAYLQDKHFGATAYAVGKKVEQIQTEILKNGPIEVAFTVY--------- 264

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
                          D + Y +GVY  +A A +  +A VKI+GWG +NG PYW +   + 
Sbjct: 265 --------------EDFYQYTTGVYVHTAGASLGGHA-VKILGWGVDNGTPYWLVANSWN 309

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                          I WGE                   KG  +I+RG NE  IE     
Sbjct: 310 ---------------INWGE-------------------KGYFRIIRGLNECGIEHSAVA 335

Query: 241 ALP 243
            +P
Sbjct: 336 GIP 338


>gi|55793941|gb|AAV65881.1| cathepsin B1 isotype 1 precursor [Trichobilharzia regenti]
          Length = 342

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 78/242 (32%), Positives = 110/242 (45%), Gaps = 60/242 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   + W +  + G+VTG +  ++TGCQP  FP C H + T   PEC       PKC
Sbjct: 159 CQGGFPGAAWDYWVEDGIVTGSSKENHTGCQPYPFPKCEH-HTTGKYPECGEKIYKTPKC 217

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           H +C    Y   + +DKY  +  Y V +    I++EIM +GPV A   ++SD  +YKSG 
Sbjct: 218 HQKC-QKGYKTPYKKDKYYGRMSYNVLNNENAIKKEIMMHGPVEAAFTVHSDFLNYKSG- 275

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ Y +G       AEI  +A V+I+GWG E   PY         
Sbjct: 276 ---------------IYKYMTG-------AEIGGHA-VRIIGWGVEKKTPY--------- 303

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                    W I +++ E +G+KG  +ILRG++E  IES V G 
Sbjct: 304 -------------------------WLIANSWNEDWGEKGYFRILRGKDECGIESEVTGG 338

Query: 242 LP 243
           LP
Sbjct: 339 LP 340


>gi|55793945|gb|AAV65883.1| cathepsin B1 isotype 3 precursor [Trichobilharzia regenti]
          Length = 342

 Score =  112 bits (279), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 78/242 (32%), Positives = 110/242 (45%), Gaps = 60/242 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   + W +  + G+VTG +  ++TGCQP  FP C H + T   PEC       PKC
Sbjct: 159 CQGGFPGAAWDYWVEDGIVTGSSKENHTGCQPYPFPKCEH-HTTGKYPECGEKIYKTPKC 217

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           H +C    Y   + +DKY  +  Y V +    I++EIM +GPV A   ++SD  +YKSG 
Sbjct: 218 HQKC-QKGYKTPYKKDKYYGRMSYNVLNNENAIKKEIMMHGPVEAAFTVHSDFLNYKSG- 275

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ Y +G       AEI  +A V+I+GWG E   PY         
Sbjct: 276 ---------------IYKYMTG-------AEIGGHA-VRIIGWGVEKKTPY--------- 303

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                    W I +++ E +G+KG  +ILRG++E  IES V G 
Sbjct: 304 -------------------------WLIANSWNEDWGEKGYFRILRGKDECGIESEVTGG 338

Query: 242 LP 243
           LP
Sbjct: 339 LP 340


>gi|55793943|gb|AAV65882.1| cathepsin B1 isotype 2 precursor [Trichobilharzia regenti]
          Length = 342

 Score =  112 bits (279), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 78/242 (32%), Positives = 110/242 (45%), Gaps = 60/242 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   + W +  + G+VTG +  ++TGCQP  FP C H + T   PEC       PKC
Sbjct: 159 CQGGFPGAAWDYWVEDGIVTGSSKENHTGCQPYPFPKCEH-HTTGKYPECGEKIYKTPKC 217

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           H +C    Y   + +DKY  +  Y V +    I++EIM +GPV A   ++SD  +YKSG 
Sbjct: 218 HQKC-QKGYKTPYGKDKYYGRMSYNVLNNENAIKKEIMMHGPVEAAFTVHSDFLNYKSG- 275

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ Y +G       AEI  +A V+I+GWG E   PY         
Sbjct: 276 ---------------IYKYMTG-------AEIGGHA-VRIIGWGVEKKTPY--------- 303

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                    W I +++ E +G+KG  +ILRG++E  IES V G 
Sbjct: 304 -------------------------WLIANSWNEDWGEKGYFRILRGKDECGIESEVTGG 338

Query: 242 LP 243
           LP
Sbjct: 339 LP 340


>gi|194246059|gb|ACF35521.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
           variabilis]
          Length = 217

 Score =  112 bits (279), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 74/244 (30%), Positives = 111/244 (45%), Gaps = 61/244 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S+ W +    G+VTGG + +  GCQP  FPPC H +     P C T   P P+C
Sbjct: 32  CNGGYPSAAWQFYKDEGIVTGGLYGTEDGCQPYYFPPCEH-HTVGPLPNC-TGIKPTPEC 89

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C  + Y + + +DK+  K+ Y ++ +   I+ EI KNGPV A+  +           
Sbjct: 90  AKTC-REGYEKSYTRDKHFGKKVYSISSDETQIKTEICKNGPVEADFNV----------- 137

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                       Y+D  SYKSGVY    S E++    ++I+GWG E+G P          
Sbjct: 138 ------------YADFPSYKSGVYQ-RHSKEMLGGHAIRILGWGTEDGVP---------- 174

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW + +++ E +GDKG  KI RG +E  IE+ +N  
Sbjct: 175 ------------------------YWLVANSWNEDWGDKGYFKIRRGNDECGIENDINAG 210

Query: 242 LPKD 245
           +PK+
Sbjct: 211 IPKE 214


>gi|55793951|gb|AAV65886.1| cathepsin B1 isotype 6 precursor [Trichobilharzia regenti]
          Length = 342

 Score =  111 bits (277), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 78/244 (31%), Positives = 110/244 (45%), Gaps = 60/244 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   S W +  + G+VTG +  ++TGCQP  FP C H N T   P C       PKC
Sbjct: 159 CLGGFPGSAWDYWVEEGVVTGSSGENHTGCQPYPFPKCEH-NTTGKYPACGQKIYETPKC 217

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C    Y   + +DK+  K  Y V +    I++EIM +GPV +   +YSD  +YKSG 
Sbjct: 218 QKKC-QKGYKTPYKKDKHYGKVAYNVPNNEDSIKKEIMMHGPVGSFFTVYSDFLNYKSG- 275

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                      +Y  +   + GV+            TV+IVGWG E G PY         
Sbjct: 276 -----------IYKHMKGTEIGVH------------TVRIVGWGVEKGTPY--------- 303

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                    W I +++ E +G+KG  +ILRG++E  IESLV G 
Sbjct: 304 -------------------------WLIANSWNEGWGEKGYFRILRGKDECDIESLVIGG 338

Query: 242 LPKD 245
           LP++
Sbjct: 339 LPRN 342


>gi|308488550|ref|XP_003106469.1| hypothetical protein CRE_16049 [Caenorhabditis remanei]
 gi|308253819|gb|EFO97771.1| hypothetical protein CRE_16049 [Caenorhabditis remanei]
          Length = 205

 Score =  111 bits (277), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 75/243 (30%), Positives = 101/243 (41%), Gaps = 59/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W W  K GLVTGG++ S  GC+P S  PC       + P+C     P PKC
Sbjct: 14  CEGGYPIQAWKWWVKHGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWPKCPEDTEPTPKC 73

Query: 62  HTRCTNDN-YGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
              CT++N Y  G+ QDK+     Y V  +V  IQ EI+ +GP+     +          
Sbjct: 74  VEACTSNNTYPTGYLQDKHFGATAYAVGKKVEQIQTEILAHGPIEVAFTV---------- 123

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
                        Y D + Y +GVY  +A   +  +A VKI+GWG +NG PYW +   + 
Sbjct: 124 -------------YEDFYQYTTGVYVHTAGKSLGGHA-VKILGWGVDNGTPYWLVANSWN 169

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
           V+               WGE                   KG  +I+RG NE  IE     
Sbjct: 170 VN---------------WGE-------------------KGYFRIIRGLNECGIEHSAVA 195

Query: 241 ALP 243
            LP
Sbjct: 196 GLP 198


>gi|268555788|ref|XP_002635883.1| C. briggsae CBR-CPR-5 protein [Caenorhabditis briggsae]
          Length = 345

 Score =  111 bits (277), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 76/243 (31%), Positives = 102/243 (41%), Gaps = 59/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W W  K GLVTGG++ S  GC+P S  PC       + P+C     P PKC
Sbjct: 155 CEGGYPIQAWKWWVKHGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWPKCPDDTEPTPKC 214

Query: 62  HTRCTNDN-YGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
              CT++N Y   + QDK+     Y V  +V  IQ EI+KNGPV     +Y         
Sbjct: 215 VEACTSNNTYPTPYLQDKHFGATAYAVGKKVEQIQTEILKNGPVEVAFTVY--------- 265

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
                          D + Y +GVY  ++ A +  +A VKI+GWG +NG PYW +   + 
Sbjct: 266 --------------EDFYQYTTGVYVHTSGASLGGHA-VKILGWGVDNGTPYWLVANSWN 310

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
           V+               WGE                   KG  +I+RG NE  IE     
Sbjct: 311 VN---------------WGE-------------------KGYFRIIRGLNECGIEHSAVA 336

Query: 241 ALP 243
            +P
Sbjct: 337 GIP 339


>gi|55793947|gb|AAV65884.1| cathepsin B1 isotype 4 precursor [Trichobilharzia regenti]
          Length = 342

 Score =  110 bits (276), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 76/244 (31%), Positives = 110/244 (45%), Gaps = 60/244 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   + W +  + G+VTG +  ++TGCQP  FP C H + T   PEC       PKC
Sbjct: 159 CQGGFPGAAWDYWVEDGIVTGSSKENHTGCQPYPFPKCEH-HTTGKYPECGEKIYKTPKC 217

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           H +C    Y   + +DKY  +  Y V +    I++EIM +GPV     ++SD  +YKSG 
Sbjct: 218 HQKC-QKGYKTPYKKDKYYGRMSYNVLNNENAIKKEIMMHGPVEVAFTVHSDFLNYKSG- 275

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ Y +G       AEI  +A V+I+GWG E   PY         
Sbjct: 276 ---------------IYKYMTG-------AEIGEHA-VRIIGWGVEKKTPY--------- 303

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                    W I +++ E +G+KG  ++LRG++E  IES V   
Sbjct: 304 -------------------------WLIANSWNEDWGEKGYFRMLRGKDECGIESAVTSG 338

Query: 242 LPKD 245
           LP+D
Sbjct: 339 LPRD 342


>gi|308511959|ref|XP_003118162.1| CRE-CPR-6 protein [Caenorhabditis remanei]
 gi|308238808|gb|EFO82760.1| CRE-CPR-6 protein [Caenorhabditis remanei]
          Length = 387

 Score =  110 bits (275), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 73/245 (29%), Positives = 102/245 (41%), Gaps = 58/245 (23%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   + W +  K G+VTG  + +N+GC+P  FPPC H +  T    C     P PKC
Sbjct: 174 CNGGDPLAAWRYWVKDGIVTGSNYTANSGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKC 233

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C  D   + + +DK+     Y V D+V  IQ+E+M +GP+     +Y D  +Y  G 
Sbjct: 234 EKKCIADYTDKTYSEDKFYGASAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGV 293

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y                        V    ++     VK+VGWG ENG PYWT    +  
Sbjct: 294 Y------------------------VHTGGKLGGGHAVKLVGWGIENGIPYWTCANSWNT 329

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                           WGE+                   G  +ILRG +E  IES V G 
Sbjct: 330 D---------------WGED-------------------GFFRILRGVDECGIESGVVGG 355

Query: 242 LPKDN 246
           +PK N
Sbjct: 356 VPKLN 360


>gi|55793949|gb|AAV65885.1| cathepsin B1 isotype 5 precursor [Trichobilharzia regenti]
          Length = 342

 Score =  110 bits (275), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 72/244 (29%), Positives = 109/244 (44%), Gaps = 60/244 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +  + G+VTGG+  ++TGCQP  FP C H +     PEC  +   +PKC
Sbjct: 159 CQMGFPGIAWDYWVQEGIVTGGSKENHTGCQPYPFPKCEH-HTKGRYPECGEIIYMKPKC 217

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           H +C    Y   + +DKY  K  Y +      I++EIM +GPV A+  ++SD  +YKSG 
Sbjct: 218 HQKCQK-GYKTPYEKDKYYGKVSYNLLKNEDSIKKEIMMHGPVEASFRVHSDFLNYKSG- 275

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ + +G+         +    V+I+GWG E   PY         
Sbjct: 276 ---------------IYKHMTGID--------IGSHVVRIIGWGVEKETPY--------- 303

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                    W I +++ E +G+KG  ++LRG++E  IES V   
Sbjct: 304 -------------------------WLIANSWNEDWGEKGYFRMLRGKDECGIESAVTSG 338

Query: 242 LPKD 245
           LP+D
Sbjct: 339 LPRD 342


>gi|333408990|gb|AEF32260.1| cathepsin B [Cristaria plicata]
          Length = 347

 Score =  110 bits (274), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 79/243 (32%), Positives = 102/243 (41%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G  +  W +  + GLVTGG ++S+ GCQP   P C+H      +P C       PKC
Sbjct: 164 CQGGFPAEAWRYYEREGLVTGGLYNSSQGCQPYMIPACDHHVVGHLQP-CPKEEAKTPKC 222

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C   NY   +  DK+  K  Y V D V  I  EIM NGPV A   +Y D        
Sbjct: 223 SKKC-EANYNVTYKDDKHYGKNSYSV-DSVEKIMTEIMTNGPVEAAFTVYED-------- 272

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                            SYKSGVY      E+  +A VKI+GWGE+NG PYW +   +  
Sbjct: 273 ---------------FLSYKSGVYQHRTGQELGGHA-VKILGWGEDNGTPYWIVANSW-- 314

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                  P W          G++G   ILRG++E  IES +   
Sbjct: 315 ----------------------NPDW----------GNQGFFNILRGKDECGIESQIVAG 342

Query: 242 LPK 244
           LPK
Sbjct: 343 LPK 345


>gi|194246067|gb|ACF35525.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
           variabilis]
          Length = 192

 Score =  110 bits (274), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 73/244 (29%), Positives = 109/244 (44%), Gaps = 61/244 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S+ W +     +VTGG + +  GCQP  FPPC H +     P C T   P P+C
Sbjct: 7   CNGGYPSAAWQFYKDEDIVTGGLYGTEDGCQPYYFPPCEH-HTVGPLPNC-TGIKPTPEC 64

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C  + Y + + +DK+  K+ Y ++ +   I+ EI KNGPV                 
Sbjct: 65  AKTC-REGYQKSYTRDKHFGKKVYSISSDETQIKTEIYKNGPVE---------------- 107

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                  A+  +Y+D  SYKSGVY    S E++    ++I+GWG E+G P          
Sbjct: 108 -------ADFSVYADFPSYKSGVYQ-RHSEEMLGGHAIRILGWGTEDGVP---------- 149

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW + +++ E +GDKG  KI RG +E  IE  +N  
Sbjct: 150 ------------------------YWLVANSWNEDWGDKGYFKIRRGNDECGIEDDINAG 185

Query: 242 LPKD 245
           +PK+
Sbjct: 186 IPKE 189


>gi|25146613|ref|NP_741818.1| Protein CPR-6, isoform a [Caenorhabditis elegans]
 gi|1169087|sp|P43510.1|CPR6_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 6; AltName:
           Full=Cysteine protease-related 6; Flags: Precursor
 gi|671715|gb|AAA98787.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|695294|gb|AAA98789.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|351058213|emb|CCD65628.1| Protein CPR-6, isoform a [Caenorhabditis elegans]
          Length = 379

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 70/245 (28%), Positives = 103/245 (42%), Gaps = 58/245 (23%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   + W +  K G+VTG  + +N GC+P  FPPC H +  T    C     P PKC
Sbjct: 174 CNGGDPLAAWRYWVKDGIVTGSNYTANNGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKC 233

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C +D   + + +DK+     Y V D+V  IQ+E+M +GP+     +Y D  +Y  G 
Sbjct: 234 EKKCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGV 293

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y                        V    ++     VK++GWG ++G PYWT+   +  
Sbjct: 294 Y------------------------VHTGGKLGGGHAVKLIGWGIDDGIPYWTVANSWNT 329

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                           WGE+                   G  +ILRG +E  IES V G 
Sbjct: 330 D---------------WGED-------------------GFFRILRGVDECGIESGVVGG 355

Query: 242 LPKDN 246
           +PK N
Sbjct: 356 IPKLN 360


>gi|71984043|ref|NP_001024426.1| Protein CPR-6, isoform b [Caenorhabditis elegans]
 gi|351058214|emb|CCD65629.1| Protein CPR-6, isoform b [Caenorhabditis elegans]
          Length = 378

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 70/245 (28%), Positives = 103/245 (42%), Gaps = 58/245 (23%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   + W +  K G+VTG  + +N GC+P  FPPC H +  T    C     P PKC
Sbjct: 173 CNGGDPLAAWRYWVKDGIVTGSNYTANNGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKC 232

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C +D   + + +DK+     Y V D+V  IQ+E+M +GP+     +Y D  +Y  G 
Sbjct: 233 EKKCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGV 292

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y                        V    ++     VK++GWG ++G PYWT+   +  
Sbjct: 293 Y------------------------VHTGGKLGGGHAVKLIGWGIDDGIPYWTVANSWNT 328

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                           WGE+                   G  +ILRG +E  IES V G 
Sbjct: 329 D---------------WGED-------------------GFFRILRGVDECGIESGVVGG 354

Query: 242 LPKDN 246
           +PK N
Sbjct: 355 IPKLN 359


>gi|193209594|ref|NP_001123113.1| Protein CPR-6, isoform c [Caenorhabditis elegans]
 gi|351058222|emb|CCD65637.1| Protein CPR-6, isoform c [Caenorhabditis elegans]
          Length = 369

 Score =  109 bits (273), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 70/245 (28%), Positives = 103/245 (42%), Gaps = 58/245 (23%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   + W +  K G+VTG  + +N GC+P  FPPC H +  T    C     P PKC
Sbjct: 164 CNGGDPLAAWRYWVKDGIVTGSNYTANNGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKC 223

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C +D   + + +DK+     Y V D+V  IQ+E+M +GP+     +Y D  +Y  G 
Sbjct: 224 EKKCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGV 283

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y                        V    ++     VK++GWG ++G PYWT+   +  
Sbjct: 284 Y------------------------VHTGGKLGGGHAVKLIGWGIDDGIPYWTVANSWNT 319

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                           WGE+                   G  +ILRG +E  IES V G 
Sbjct: 320 D---------------WGED-------------------GFFRILRGVDECGIESGVVGG 345

Query: 242 LPKDN 246
           +PK N
Sbjct: 346 IPKLN 350


>gi|341887135|gb|EGT43070.1| hypothetical protein CAEBREN_13756 [Caenorhabditis brenneri]
          Length = 398

 Score =  109 bits (273), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 71/245 (28%), Positives = 102/245 (41%), Gaps = 58/245 (23%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   + W +  K G+VTG    +N+GC+P  FPPC H +  T    C     P PKC
Sbjct: 189 CNGGDPLAAWRYWVKDGIVTGSNFTANSGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKC 248

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             RC  +   + + +DK+     Y V D+V  IQ+E+M +GP+     +Y D  +Y  G 
Sbjct: 249 EKRCNAEYTDKTYSEDKFYGSSAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGV 308

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y                        V    ++     VK++GWG E+G PYWT+   +  
Sbjct: 309 Y------------------------VHTGGKLGGGHAVKLIGWGIEDGIPYWTVANSWNT 344

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                           WGE+                   G  +ILRG +E  IES V G 
Sbjct: 345 D---------------WGED-------------------GFFRILRGVDECGIESGVVGG 370

Query: 242 LPKDN 246
           +PK N
Sbjct: 371 IPKLN 375


>gi|341888136|gb|EGT44071.1| hypothetical protein CAEBREN_13576 [Caenorhabditis brenneri]
          Length = 337

 Score =  109 bits (272), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 80/244 (32%), Positives = 103/244 (42%), Gaps = 61/244 (25%)

Query: 2   CSSGISSSTW-VWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 60
           C  G     W  WVH  GLVTGG++ S  GC+P S  PC       + P+C       P+
Sbjct: 147 CEGGYPIQAWRYWVHN-GLVTGGSYESQYGCKPYSIAPCGQTVNGVTWPKCAADEVATPE 205

Query: 61  CHTRCTN-DNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
           C  +CT+  +Y   + QDK+     Y +   VA IQ EIM+NGPV     +YSD      
Sbjct: 206 CVKQCTSKSDYAVPYDQDKHYGSSAYAIRQNVAQIQTEIMRNGPVEVGFLVYSD------ 259

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
                             + YKSG+Y   A  E+  +A VKI+GWG ENG PYW     +
Sbjct: 260 -----------------FYQYKSGIYKHVAGRELGGHA-VKILGWGVENGTPYWLAANSW 301

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
            V+               WGE                   KG  +I RG NE  IES V 
Sbjct: 302 NVN---------------WGE-------------------KGYFRIRRGTNECGIESSVV 327

Query: 240 GALP 243
             +P
Sbjct: 328 AGIP 331


>gi|17565164|ref|NP_503383.1| Protein CPR-5 [Caenorhabditis elegans]
 gi|1169086|sp|P43509.1|CPR5_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 5; AltName:
           Full=Cysteine protease-related 5; Flags: Precursor
 gi|671713|gb|AAA98786.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|675502|gb|AAA98784.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|351059399|emb|CCD74289.1| Protein CPR-5 [Caenorhabditis elegans]
          Length = 344

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 74/243 (30%), Positives = 99/243 (40%), Gaps = 59/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W W  K GLVTGG++ +  GC+P S  PC         P C     P PKC
Sbjct: 154 CEGGYPIQAWKWWVKHGLVTGGSYETQFGCKPYSIAPCGETVNGVKWPACPEDTEPTPKC 213

Query: 62  HTRCTN-DNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
              CT+ +NY   + QDK+     Y V  +V  IQ EI+ NGP+     +Y         
Sbjct: 214 VDSCTSKNNYATPYLQDKHFGSTAYAVGKKVEQIQTEILTNGPIEVAFTVY--------- 264

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
                          D + Y +GVY  +A A +  +A VKI+GWG +NG PYW +   + 
Sbjct: 265 --------------EDFYQYTTGVYVHTAGASLGGHA-VKILGWGVDNGTPYWLVANSWN 309

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
           V+               WGE                   KG  +I+RG NE  IE     
Sbjct: 310 VA---------------WGE-------------------KGYFRIIRGLNECGIEHSAVA 335

Query: 241 ALP 243
            +P
Sbjct: 336 GIP 338


>gi|268579855|ref|XP_002644910.1| C. briggsae CBR-CPR-6 protein [Caenorhabditis briggsae]
          Length = 376

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 71/245 (28%), Positives = 102/245 (41%), Gaps = 58/245 (23%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   + W +  K G+VTG  + +N+GC+P  FPPC H +  T    C     P PKC
Sbjct: 175 CNGGDPLAAWRYWVKDGIVTGSNYTANSGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKC 234

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C  D   + + +DK+     Y V D+V  IQ+E+M +GP+     +Y D  +Y  G 
Sbjct: 235 EKKCIADYTDKTYSEDKFYGHSAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGV 294

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y                        V    ++     VK++GWG E+G PYWT    +  
Sbjct: 295 Y------------------------VHTGGKLGGGHAVKLIGWGIEDGIPYWTCANSWNT 330

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                           WGE+                   G  +ILRG +E  IES V G 
Sbjct: 331 D---------------WGED-------------------GFFRILRGVDECGIESGVVGG 356

Query: 242 LPKDN 246
           +PK N
Sbjct: 357 IPKLN 361


>gi|156365510|ref|XP_001626688.1| predicted protein [Nematostella vectensis]
 gi|156213574|gb|EDO34588.1| predicted protein [Nematostella vectensis]
          Length = 259

 Score =  108 bits (270), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 76/243 (31%), Positives = 108/243 (44%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   S W     +GLVTGG + S+ GCQP     C+H      +P CK   +P PKC
Sbjct: 73  CNGGYPESAWDHWKSKGLVTGGQYDSHKGCQPYKIAACDHHVVGKLKP-CKG-DSPTPKC 130

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C    Y   +  DK+  +  Y V  + A+IQ+EIM NGPV     +Y          
Sbjct: 131 ERKCEA-GYNVSYSDDKHFGQSAYSVRSDPAEIQKEIMTNGPVEGAFTVY---------- 179

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                        +D  +YKSGVY  ++ + +  +A +KI+GWGEENG P          
Sbjct: 180 -------------ADFPTYKSGVYQHTSGSALGGHA-IKILGWGEENGTP---------- 215

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW + +++   +GD+G  KI RG +E  IES + G 
Sbjct: 216 ------------------------YWLVANSWNSDWGDEGFFKIKRGNDECGIESGIVGG 251

Query: 242 LPK 244
           LPK
Sbjct: 252 LPK 254


>gi|427787723|gb|JAA59313.1| Putative cathepsin b-like cysteine protease form 2 [Rhipicephalus
           pulchellus]
          Length = 338

 Score =  108 bits (270), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 74/244 (30%), Positives = 113/244 (46%), Gaps = 60/244 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G+ S  W++  ++G+VTGG + +  GCQP S     +       P    L +P P C
Sbjct: 152 CKGGVPSYAWMFYKEKGIVTGGLYGTEDGCQPYSIHTTRYTTTGLLPPPINDL-SPMPPC 210

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C   +YG+ + +DK+  ++ Y ++ + A I+ EI KNGPV A+  +Y+D +      
Sbjct: 211 KREC-RKSYGKKYSEDKHYGEKVYTLSGDEAQIKTEIFKNGPVEADFAVYADFY------ 263

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                            SYKSGVY   +     ++A ++I+GWG ENG PYW       +
Sbjct: 264 -----------------SYKSGVYQAHSRVRCGSHA-IRILGWGTENGVPYW-------L 298

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
           +A++                     WT      E +GDKG  KI RG NE  IE  +N  
Sbjct: 299 AANS---------------------WT------EHWGDKGYFKIRRGNNECGIEEDINAG 331

Query: 242 LPKD 245
           +PK+
Sbjct: 332 IPKE 335


>gi|195729973|gb|ACG50797.1| cathepsin B2 [Trichobilharzia szidati]
          Length = 344

 Score =  108 bits (270), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 73/243 (30%), Positives = 104/243 (42%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   S W +  + G+VTG  +++  GCQP  FPPC H +     P C       PKC
Sbjct: 161 CNGGFPHSAWSYWKRSGIVTGDLYNTTDGCQPYEFPPCEH-HVVGPRPSCGG-DVETPKC 218

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
            T C    Y   + +DK+  K  Y V+     I +E+M +GPV  +  +Y+D  +YKSG 
Sbjct: 219 KTTC-QPGYNIPYNKDKWYGKTVYRVHSNQEAIMKEVMDHGPVEVDFEVYADFPNYKSGV 277

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         S  ++    V+++GWGEENG PY         
Sbjct: 278 YQH------------------------VSGGLLGGHAVRLLGWGEENGVPY--------- 304

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                    W I +++   +GD G  KI+RGRNE  IES VN  
Sbjct: 305 -------------------------WLIANSWNSDWGDNGYFKIIRGRNECGIESDVNAG 339

Query: 242 LPK 244
           +PK
Sbjct: 340 IPK 342


>gi|308500570|ref|XP_003112470.1| CRE-CPR-4 protein [Caenorhabditis remanei]
 gi|308267038|gb|EFP10991.1| CRE-CPR-4 protein [Caenorhabditis remanei]
          Length = 335

 Score =  108 bits (269), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 71/243 (29%), Positives = 101/243 (41%), Gaps = 58/243 (23%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   + W ++ K G  TGG++ +  GC+P S  PC       + P+C       P C
Sbjct: 150 CDGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNVTWPDCPDDGYNTPAC 209

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +CTN  Y   +  DK+     Y V  +VA IQ EI+ +GPV A   +Y D        
Sbjct: 210 VNKCTNTKYNTAYKDDKHFGSTAYAVGKKVAQIQAEIIAHGPVEAAFTVYED-------- 261

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                           + YKSGVY  +   E+  +A ++I+GWG +NG PYW +   + V
Sbjct: 262 ---------------FYQYKSGVYVHTTGQELGGHA-IRILGWGTDNGTPYWLVANSWNV 305

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
           +               WGE                    G  +I+RG NE  IE  V G 
Sbjct: 306 N---------------WGE-------------------NGYFRIIRGTNECGIEHAVVGG 331

Query: 242 LPK 244
           +PK
Sbjct: 332 VPK 334


>gi|51038793|gb|AAT94175.1| cathepsin B [Paralichthys olivaceus]
 gi|121053785|gb|ABM47001.1| cathepsin B [Paralichthys olivaceus]
          Length = 330

 Score =  108 bits (269), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 74/243 (30%), Positives = 101/243 (41%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  SS W +  K GLV+GG ++S+ GC+P +  PC H +   S P C       P+C
Sbjct: 148 CNGGYPSSAWDFWTKEGLVSGGLYNSHIGCRPYTISPCEH-HVNGSRPPCTGEGGDTPEC 206

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
            +RC    Y   + QDK+  K  Y V   V  IQ EI KNGPV     +Y D   YKSG 
Sbjct: 207 ISRC-EAGYSPSYKQDKHYGKSSYSVEGSVEQIQAEISKNGPVEGAFTVYEDFVMYKSGV 265

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         S  ++    +K++GWGEE+G P          
Sbjct: 266 YQH------------------------VSGSVLGGHAIKVLGWGEEDGIP---------- 291

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW   +++   +GD G  KILRG N   IES +   
Sbjct: 292 ------------------------YWLCANSWNTDWGDNGFFKILRGSNHCGIESEIVAG 327

Query: 242 LPK 244
           +PK
Sbjct: 328 IPK 330


>gi|225708580|gb|ACO10136.1| Cathepsin B precursor [Osmerus mordax]
          Length = 329

 Score =  107 bits (268), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 78/243 (32%), Positives = 107/243 (44%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G  S+ W +  K GLVTGG + SN GC+P S PPC H +   + P C+      PKC
Sbjct: 147 CFGGYPSAAWEYWAKSGLVTGGLYGSNKGCRPYSIPPCEH-HVNGTRPPCQGEGD-TPKC 204

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
            T+C  D Y   + +DKY  K+ Y V  +   I  E+ KNGPV A   +Y D   YKSG 
Sbjct: 205 QTKCI-DGYTPAYEKDKYFGKKTYSVPSKQEQIMTELYKNGPVEAAFSVYEDFLLYKSGV 263

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y         +L  D+     G +A            +KI+GWG+EN  P          
Sbjct: 264 Y--------QHLTGDML----GGHA------------IKILGWGKENNTP---------- 289

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW   +++   +G++G  KILRG +E  IES V   
Sbjct: 290 ------------------------YWLAANSWNTDWGNQGFFKILRGGDECGIESEVVAG 325

Query: 242 LPK 244
           +P+
Sbjct: 326 IPQ 328


>gi|256077361|ref|XP_002574974.1| SmCB2 peptidase (C01 family) [Schistosoma mansoni]
 gi|18181863|emb|CAC85211.2| cathepsin B endopeptidase [Schistosoma mansoni]
 gi|353231645|emb|CCD79000.1| SmCB2 peptidase (C01 family) [Schistosoma mansoni]
          Length = 347

 Score =  107 bits (267), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 70/243 (28%), Positives = 105/243 (43%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   S W++   +G+VTG  +++  GCQP  FPPC H +     P C       P C
Sbjct: 163 CNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPYEFPPCEH-HVIGPLPSCDG-DVETPSC 220

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
            T C    Y   + +DK+  ++ Y ++     I  E+M+NGPV  +  +Y+D  +YKSG 
Sbjct: 221 KTNC-QPGYNIPYEKDKWYGEKVYRIHSNPEAIMLELMRNGPVEVDFEVYADFPNYKSGV 279

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         S  ++    V+++GWGEEN  PY         
Sbjct: 280 YQH------------------------VSGALLGGHAVRLLGWGEENNVPY--------- 306

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                    W I +++   +GDKG  KI+RG+NE  IES VN  
Sbjct: 307 -------------------------WLIANSWNSDWGDKGYFKIVRGKNECGIESDVNAG 341

Query: 242 LPK 244
           +PK
Sbjct: 342 IPK 344


>gi|17559068|ref|NP_504682.1| Protein CPR-4 [Caenorhabditis elegans]
 gi|1169085|sp|P43508.1|CPR4_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 4; AltName:
           Full=Cysteine protease-related 4; Flags: Precursor
 gi|675500|gb|AAA98785.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|695293|gb|AAA98783.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|351063163|emb|CCD71204.1| Protein CPR-4 [Caenorhabditis elegans]
          Length = 335

 Score =  107 bits (267), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 70/243 (28%), Positives = 101/243 (41%), Gaps = 58/243 (23%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   + W ++ K G  TGG++ +  GC+P S  PC       + P C       P C
Sbjct: 150 CEGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNVTWPSCPDDGYDTPAC 209

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +CTN NY   +  DK+     Y V  +V+ IQ EI+ +GPV A   +Y D        
Sbjct: 210 VNKCTNKNYNVAYTADKHFGSTAYAVGKKVSQIQAEIIAHGPVEAAFTVYED-------- 261

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                           + YK+GVY  +   E+  +A ++I+GWG +NG PYW +   + V
Sbjct: 262 ---------------FYQYKTGVYVHTTGQELGGHA-IRILGWGTDNGTPYWLVANSWNV 305

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
           +               WGE                    G  +I+RG NE  IE  V G 
Sbjct: 306 N---------------WGE-------------------NGYFRIIRGTNECGIEHAVVGG 331

Query: 242 LPK 244
           +PK
Sbjct: 332 VPK 334


>gi|417399216|gb|JAA46636.1| Putative cathepsin b [Desmodus rotundus]
          Length = 340

 Score =  107 bits (266), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 76/242 (31%), Positives = 103/242 (42%), Gaps = 60/242 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S  W +  K+GLV+GG + S+ GC+P S PPC H +   S P C       PKC
Sbjct: 150 CNGGFPSGAWNFWKKQGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPCSGEGGDTPKC 208

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + +DK+     Y V  +  +I  EI KNGPV A   +YSD   YKSG 
Sbjct: 209 SKIC-EPGYSPSYKEDKHFGCDTYSVPSDEKEIMVEIYKNGPVEAAFSVYSDFLLYKSGV 267

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         + E+V    V+I+GWG ENG PYW        
Sbjct: 268 YQH------------------------VTGEMVGGHAVRILGWGVENGTPYW-------- 295

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                        L+G             +++   +GD G  KILRGR+   IES +   
Sbjct: 296 -------------LVG-------------NSWNTDWGDNGFFKILRGRDHCGIESEIVAG 329

Query: 242 LP 243
           +P
Sbjct: 330 IP 331


>gi|107921798|gb|ABF85680.1| cathepsin B3 [Fasciola hepatica]
          Length = 278

 Score =  107 bits (266), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 60/172 (34%), Positives = 84/172 (48%), Gaps = 25/172 (14%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ GI + +W +  + G+VTGG   + TGC P  FP C+H   T   P C     P PKC
Sbjct: 132 CNGGIPAMSWDYWTREGVVTGGTLENPTGCLPYPFPKCSHGVVTPGLPPCPRDIYPTPKC 191

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C +  Y + + QDK + K  Y V ++  DI  EIMKNGPV    Y++ D   YKSG 
Sbjct: 192 EKKC-HAGYNKTYEQDKVKGKSSYNVGEQETDIMMEIMKNGPVDGIFYMFEDFLVYKSG- 249

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYW 173
                          I+ Y +G         +V    ++++GWG ENG  YW
Sbjct: 250 ---------------IYHYTTG--------RLVGGHAIRVIGWGVENGVNYW 278


>gi|268555790|ref|XP_002635884.1| Hypothetical protein CBG01104 [Caenorhabditis briggsae]
          Length = 337

 Score =  107 bits (266), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 77/243 (31%), Positives = 108/243 (44%), Gaps = 59/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +  K+GLV+GG++ S  GC+P S  PC       + P+C       P+C
Sbjct: 147 CDGGYPLQAWRYWVKQGLVSGGSYESQYGCKPYSIAPCGQTVNGVTWPKCPAQEEATPEC 206

Query: 62  HTRCTN-DNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
            + CT+  +Y   + +DK+     Y V  + A IQ EI+++GPV A   +YSD + YKSG
Sbjct: 207 ASHCTSKSSYSVAYEKDKHYGLSAYPVGRKEAQIQTEILQHGPVEAGFLVYSDFYRYKSG 266

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
                           I+++ SG        E+  +A VKI+GWG ENG  YW +   + 
Sbjct: 267 ----------------IYTHVSG-------QELGGHA-VKILGWGVENGTKYWLVANSWN 302

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                          I WGE                   KG  +ILRGRNE  IES V  
Sbjct: 303 ---------------INWGE-------------------KGYFRILRGRNECGIESAVVA 328

Query: 241 ALP 243
            +P
Sbjct: 329 GIP 331


>gi|56756436|gb|AAW26391.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  107 bits (266), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 76/244 (31%), Positives = 105/244 (43%), Gaps = 62/244 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +  KRG+VTGG+  ++TGCQP  FP C H       P C T     P+C
Sbjct: 159 CKGGFPGQAWDYWVKRGIVTGGSEENHTGCQPYPFPKCEHLT-KGKYPACGTKIYKTPQC 217

Query: 62  HTRCTNDNYGRGFFQDK-YRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
              C    Y   + QDK Y  +RY  +++E A IQ+EIM  GPV A   +Y D  +YKSG
Sbjct: 218 KQTCQK-GYKTPYEQDKHYGDQRYNVISNEKA-IQREIMMYGPVEAAFDVYEDFLNYKSG 275

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
            Y +                         +  IV    ++I+GWG E G+PY        
Sbjct: 276 IYRH------------------------VTGSIVGGHAIRIIGWGVEKGKPY-------- 303

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                                     W I +++ E +G+KG  +++RGR+E  IES V  
Sbjct: 304 --------------------------WLIANSWNEDWGEKGLFRMVRGRDECSIESHVVA 337

Query: 241 ALPK 244
            L K
Sbjct: 338 GLIK 341


>gi|344195776|gb|AEM98130.1| cathepsin B [Cynoglossus semilaevis]
          Length = 332

 Score =  106 bits (265), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 75/243 (30%), Positives = 103/243 (42%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S+ W +    GLV+GG + S+ GC+P S  PC H +   S P C       P+C
Sbjct: 148 CNGGYPSAAWEFWTTDGLVSGGLYDSHIGCRPYSIAPCEH-HVNGSRPPCTGEGGDTPQC 206

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C    Y  G+ QDK+  K  Y V+D   +IQ EI KNGPV     +Y D   YK+G 
Sbjct: 207 TKKC-EAGYTPGYTQDKHYGKLSYSVDDSEKEIQLEIYKNGPVEGAFTVYEDFLLYKTGV 265

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                      V+ SA  V    +K++GWGEENG P          
Sbjct: 266 YQH----------------------VTGSA--VGGHAIKVLGWGEENGTP---------- 291

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW   +++   +GD G  KILRG +   IES +   
Sbjct: 292 ------------------------YWLCANSWNTDWGDNGFFKILRGSDHCGIESEIVAG 327

Query: 242 LPK 244
           +PK
Sbjct: 328 IPK 330


>gi|49036808|gb|AAT48985.1| cathepsin B-like proteinase [Triatoma vitticeps]
          Length = 332

 Score =  106 bits (265), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 69/243 (28%), Positives = 107/243 (44%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G + + W + HK G+V+GG + S  GCQP S  PC H+    S P C+ +    PKC
Sbjct: 150 CLGGSAENAWEYWHKFGIVSGGNYGSKQGCQPYSIAPCEHS-IPGSRPACEGVRD-TPKC 207

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C    YG  +  D    +  Y + ++   IQ EI+KNGP+VA++ +Y D+FSYK+G 
Sbjct: 208 KKQCEK-GYGIPYGDDLCYGQPGYTIENDAQKIQAEILKNGPIVASILVYEDLFSYKAGV 266

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         + E++    +KI+GWG EN  P          
Sbjct: 267 YQH------------------------VAGEVLGGHVIKILGWGVENDTP---------- 292

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW + +++   +G+ G  KILRG +E  IE  +   
Sbjct: 293 ------------------------YWLVANSWNTDWGNNGFFKILRGSDECGIEDQIVAG 328

Query: 242 LPK 244
           +P+
Sbjct: 329 IPR 331


>gi|355697726|gb|EHH28274.1| Cathepsin B [Macaca mulatta]
          Length = 339

 Score =  106 bits (265), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 74/245 (30%), Positives = 107/245 (43%), Gaps = 61/245 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  +  W ++ ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGYPAGAWNFLTRKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+     Y V++   DI  EI KNGPV          FS     
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPV-------EGAFS----- 254

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                      +YSD   YKSGVY                                    
Sbjct: 255 -----------VYSDFLLYKSGVYQ----------------------------------- 268

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
             + E++    ++++GWG ENG PYW + +++   +GD G  KILRG++   IES V   
Sbjct: 269 HVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 328

Query: 242 LPKDN 246
           +P+ +
Sbjct: 329 IPRTD 333


>gi|201023319|ref|NP_001128401.1| cathepsin B-10270 precursor [Acyrthosiphon pisum]
 gi|239788119|dbj|BAH70754.1| ACYPI000021 [Acyrthosiphon pisum]
          Length = 341

 Score =  106 bits (265), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 73/244 (29%), Positives = 100/244 (40%), Gaps = 62/244 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPE--CKTLATPQP 59
           C+ G S + W +  KRGLVTGG + SN GCQP   PPCNH       P   C    +  P
Sbjct: 156 CNGGYSGAAWQYWMKRGLVTGGDYGSNEGCQPWLIPPCNHTVMDERSPSYMCGKYKSETP 215

Query: 60  KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
           +C   C N NY + F +D  +  R  W    +  I+ E+ K+GP  A M +Y D  +YKS
Sbjct: 216 QCTLNCYNPNYSKPFLKDISKGIRIDWHCSGM--IRNELKKHGPATAIMRVYEDFLTYKS 273

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
           G Y +                         + +++   TVK++GWG            VY
Sbjct: 274 GIYQH------------------------VTGKLLGQITVKVIGWG------------VY 297

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
                                  G  YW   +++G  +GDKG  KI RG NE + E    
Sbjct: 298 ----------------------RGVQYWLAANSWGTSWGDKGFFKIRRGYNECLFEDYFI 335

Query: 240 GALP 243
              P
Sbjct: 336 SGRP 339


>gi|181178|gb|AAA52125.1| lysosomal proteinase cathepsin B, partial [Homo sapiens]
          Length = 209

 Score =  106 bits (264), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 74/245 (30%), Positives = 106/245 (43%), Gaps = 61/245 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 20  CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 77

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+     Y V++   DI  EI KNGPV          FS     
Sbjct: 78  SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPV-------EGAFS----- 124

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                      +YSD   YKSGVY                                    
Sbjct: 125 -----------VYSDFLLYKSGVYQ----------------------------------- 138

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
             + E++    ++++GWG ENG PYW + +++   +GD G  KILRG++   IES V   
Sbjct: 139 HVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 198

Query: 242 LPKDN 246
           +P+ +
Sbjct: 199 IPRTD 203


>gi|225717770|gb|ACO14731.1| Cathepsin B precursor [Caligus clemensi]
          Length = 331

 Score =  106 bits (264), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 70/243 (28%), Positives = 99/243 (40%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   + W +   +GLV+GG + S+ GCQP    PC H    T +P  +   TP  KC
Sbjct: 149 CNGGFPGAAWRFWENKGLVSGGLYGSHKGCQPYLIEPCEHHVNGTRKPCAEGGRTP--KC 206

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           H  C N NY   + +D    +  Y +  +   IQ +IM NGPV A   +YSD  SYKSG 
Sbjct: 207 HKTCDNKNYPISYEKDLSFGRSSYSIRSDPKQIQMDIMTNGPVEAAFSVYSDFMSYKSGV 266

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                            ++    ++I+GWG E G P          
Sbjct: 267 YRH------------------------VKGSLLGGHAIRILGWGMEKGTP---------- 292

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW + +++   +GD GT KILRG +   IE  V   
Sbjct: 293 ------------------------YWLVANSWNTDWGDNGTFKILRGSDHCGIEDSVVAG 328

Query: 242 LPK 244
           LP+
Sbjct: 329 LPR 331


>gi|193783549|dbj|BAG53460.1| unnamed protein product [Homo sapiens]
          Length = 276

 Score =  106 bits (264), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 74/245 (30%), Positives = 106/245 (43%), Gaps = 61/245 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 87  CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 144

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+     Y V++   DI  EI KNGPV          FS     
Sbjct: 145 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPV-------EGAFS----- 191

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                      +YSD   YKSGVY                                    
Sbjct: 192 -----------VYSDFLLYKSGVYQ----------------------------------- 205

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
             + E++    ++++GWG ENG PYW + +++   +GD G  KILRG++   IES V   
Sbjct: 206 HVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 265

Query: 242 LPKDN 246
           +P+ +
Sbjct: 266 IPRTD 270


>gi|154089579|gb|ABS57370.1| cathepsin B2 [Trichobilharzia regenti]
          Length = 344

 Score =  106 bits (264), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 72/243 (29%), Positives = 104/243 (42%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   S W +  + G+VTG  ++   GCQP  FPPC H +     P C+      PKC
Sbjct: 161 CNGGFPHSAWSYWKRSGIVTGDLYNPTDGCQPYEFPPCEH-HVVGPRPSCEG-DVETPKC 218

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
            T C    Y   + +DK+  K  Y V+     I +E+ ++GPV  +  +Y+D  +YKSG 
Sbjct: 219 KTTC-QPGYNIPYNKDKWYGKTVYRVHSNQEAIMKEVKEHGPVEVDFEVYADFPNYKSGV 277

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         S  ++    V+++GWGEENG PY         
Sbjct: 278 YQH------------------------VSGGLLGGHAVRLLGWGEENGVPY--------- 304

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                    W I +++   +GD G  KI+RGRNE  IES VN  
Sbjct: 305 -------------------------WLIANSWNSDWGDNGYFKIIRGRNECGIESDVNAG 339

Query: 242 LPK 244
           +PK
Sbjct: 340 IPK 342


>gi|393909827|gb|EJD75608.1| cysteine endopeptidase [Loa loa]
          Length = 383

 Score =  106 bits (264), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 74/243 (30%), Positives = 103/243 (42%), Gaps = 59/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   + W +   RG+VTG  + +++GC+P  FPPC H N  T    CK    P PKC
Sbjct: 192 CFGGEPMAAWKYWVLRGIVTGSEYTNHSGCRPYPFPPCEHHNNKTHYEPCKHDLYPTPKC 251

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C + NYG+ +  DKY  ++ Y V   V  IQ+EIM  GPV A+  +Y+D   Y  G 
Sbjct: 252 VKKC-DKNYGKSYKADKYYGEQVYNVESNVESIQKEIMTLGPVEASFEVYTDFLYYTGG- 309

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                      +Y  +     G +A            VK++GWG + G PYW     +  
Sbjct: 310 -----------IYKHVAGSMGGGHA------------VKVLGWGIDQGVPYWLAANSWNT 346

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                           WGE+                   G  +ILRG NE  IES +   
Sbjct: 347 D---------------WGED-------------------GYFRILRGVNECGIESGIIAG 372

Query: 242 LPK 244
           +PK
Sbjct: 373 IPK 375


>gi|194387364|dbj|BAG60046.1| unnamed protein product [Homo sapiens]
          Length = 245

 Score =  106 bits (264), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 74/245 (30%), Positives = 106/245 (43%), Gaps = 61/245 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 56  CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 113

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+     Y V++   DI  EI KNGPV          FS     
Sbjct: 114 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPV-------EGAFS----- 160

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                      +YSD   YKSGVY                                    
Sbjct: 161 -----------VYSDFLLYKSGVYQ----------------------------------- 174

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
             + E++    ++++GWG ENG PYW + +++   +GD G  KILRG++   IES V   
Sbjct: 175 HVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 234

Query: 242 LPKDN 246
           +P+ +
Sbjct: 235 IPRTD 239


>gi|34979797|gb|AAQ83887.1| cathepsin B [Branchiostoma belcheri tsingtauense]
          Length = 332

 Score =  106 bits (264), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 77/243 (31%), Positives = 106/243 (43%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   + W +  + GLVTGG + S+ GCQP    PC H +   S P C  L  P P+C
Sbjct: 149 CNGGFPEAAWEYWKRDGLVTGGPYGSHQGCQPYEIKPCEH-HINGSRPACGKL-EPTPRC 206

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C +  Y   F +DK+  K  Y V+ +V  IQ EIM NGPV A   +Y+D F +    
Sbjct: 207 KKSCES-GYNVTFAKDKHYAKTAYSVSSKVQQIQMEIMTNGPVEAAFTVYAD-FPH---- 260

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                             YKSGVY   + AE+  +A VK++GWG E   PY         
Sbjct: 261 ------------------YKSGVYQHESGAELGGHA-VKMIGWGTEGSTPY--------- 292

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                    W I +++   +G+ G  KILRG++E  IE  +   
Sbjct: 293 -------------------------WLIANSWNTDWGNMGFFKILRGQDECGIERDIVAG 327

Query: 242 LPK 244
            PK
Sbjct: 328 EPK 330


>gi|312091331|ref|XP_003146940.1| cathepsin B [Loa loa]
          Length = 249

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 74/243 (30%), Positives = 102/243 (41%), Gaps = 59/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   + W +   RG+VTG  + +++GC+P  FPPC H N  T    CK    P PKC
Sbjct: 58  CFGGEPMAAWKYWVLRGIVTGSEYTNHSGCRPYPFPPCEHHNNKTHYEPCKHDLYPTPKC 117

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C + NYG+ +  DKY  +  Y V   V  IQ+EIM  GPV A+  +Y+D   Y  G 
Sbjct: 118 VKKC-DKNYGKSYKADKYYGQSVYNVESNVESIQKEIMTLGPVEASFEVYTDFLYYTGG- 175

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                      +Y  +     G +A            VK++GWG + G PYW     +  
Sbjct: 176 -----------IYKHVAGSMGGGHA------------VKVLGWGIDQGVPYWLAANSWNT 212

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                           WGE+                   G  +ILRG NE  IES +   
Sbjct: 213 D---------------WGED-------------------GYFRILRGVNECGIESGIIAG 238

Query: 242 LPK 244
           +PK
Sbjct: 239 IPK 241


>gi|24158605|pdb|1GMY|A Chain A, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
 gi|24158606|pdb|1GMY|B Chain B, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
 gi|24158607|pdb|1GMY|C Chain C, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
          Length = 261

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 74/245 (30%), Positives = 106/245 (43%), Gaps = 61/245 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 72  CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 129

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+     Y V++   DI  EI KNGPV          FS     
Sbjct: 130 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPV-------EGAFS----- 176

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                      +YSD   YKSGVY                                    
Sbjct: 177 -----------VYSDFLLYKSGVYQ----------------------------------- 190

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
             + E++    ++++GWG ENG PYW + +++   +GD G  KILRG++   IES V   
Sbjct: 191 HVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 250

Query: 242 LPKDN 246
           +P+ +
Sbjct: 251 IPRTD 255


>gi|346472613|gb|AEO36151.1| hypothetical protein [Amblyomma maculatum]
          Length = 373

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 76/244 (31%), Positives = 101/244 (41%), Gaps = 62/244 (25%)

Query: 2   CSSGISSSTWV-WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 60
           C+ G   S W  WVHK G+VTGG + S+ GC P     C+H    T  P C     P P+
Sbjct: 190 CNGGFPGSAWSYWVHK-GIVTGGNYDSDEGCMPYPIKACDHHVNGTLGP-CDKTIPPTPR 247

Query: 61  CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           C  R     Y   F  DK+  +  Y V  +   IQ EIM NGPV A+  +          
Sbjct: 248 C-VRMCRKGYDVDFMDDKHYGRHAYSVPAKAKQIQAEIMMNGPVEADFTV---------- 296

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
                        Y D   YKSGVY     + +  +A ++++GWG ENG P         
Sbjct: 297 -------------YEDFLHYKSGVYQRHTDSALGGHA-IRLLGWGVENGVP--------- 333

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                                    YW   +++  ++GDKG  KILRG +E  IES +  
Sbjct: 334 -------------------------YWLAANSWNTEWGDKGFFKILRGSDECGIESDIVA 368

Query: 241 ALPK 244
            LPK
Sbjct: 369 GLPK 372


>gi|410916585|ref|XP_003971767.1| PREDICTED: cathepsin B-like [Takifugu rubripes]
          Length = 328

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 77/242 (31%), Positives = 101/242 (41%), Gaps = 61/242 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           CS G  SS W +  K+GLVTGG   S  GC+P S  PC H    T  P   T  T  PKC
Sbjct: 146 CSGGYPSSAWEFWTKKGLVTGGLCGSEVGCRPYSIAPCEHHVNGTRPPCQGTQET--PKC 203

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C  D Y   + +DK+  KR Y +  +   I  E+ KNGPV A   +Y+D   YK+G 
Sbjct: 204 EKKCI-DGYLTSYLKDKHFGKRSYSLPSQQEQIMTELYKNGPVEAAFTVYADFLLYKTGV 262

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         + E++    +KI+GWGEE+G PYW     +  
Sbjct: 263 YQH------------------------VTGEVLGGHAIKILGWGEESGTPYWLAANSW-- 296

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                               NG             +GDKG  KI RG +E  IES +   
Sbjct: 297 --------------------NG------------DWGDKGFFKIKRGNDECGIESEMVAG 324

Query: 242 LP 243
            P
Sbjct: 325 TP 326


>gi|1777779|gb|AAB40605.1| cathepsin B-like cysteine proteinase [Ascaris suum]
 gi|324515014|gb|ADY46062.1| Cathepsin B cysteine proteinase 6 [Ascaris suum]
          Length = 398

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 71/245 (28%), Positives = 97/245 (39%), Gaps = 58/245 (23%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   + W +  K G+VTG       GC+P  FPPC H +  T    CK    P PKC
Sbjct: 190 CDGGDPMAAWKYWVKEGIVTGSNFTMKQGCKPYPFPPCEHHSNKTHYQPCKHDLYPTPKC 249

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C +    + + +DK+  +  Y V D+V  IQ+EI+ +GPV     +Y D   Y  G 
Sbjct: 250 EKKCLDIYTEKTYAEDKFFGETAYGVEDDVTSIQKEILTHGPVEVAFEVYEDFLMYDGGI 309

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y                        V    +I     VK++GWG E G PYW +   +  
Sbjct: 310 Y------------------------VHTGGKIGGGHAVKMLGWGVEQGVPYWLVANSWNT 345

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                           WGE+                   G  +I+RG +E  IES V G 
Sbjct: 346 D---------------WGED-------------------GFFRIIRGIDECGIESSVVGG 371

Query: 242 LPKDN 246
           LPK N
Sbjct: 372 LPKLN 376


>gi|225711544|gb|ACO11618.1| Cathepsin B precursor [Caligus rogercresseyi]
          Length = 332

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 70/243 (28%), Positives = 101/243 (41%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   + W +   +GLV+GG + S++GCQP    PC H    T +P  +   TP  KC
Sbjct: 150 CNGGFPGAAWKYWTSKGLVSGGLYGSHSGCQPYDIEPCEHHVNGTRQPCAEGGRTP--KC 207

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           H  C N+NY   + +D    +  Y +  +   IQ EIM NGPV A   +YSD  + KSG 
Sbjct: 208 HRTCENENYSVPYDKDLSFGRSSYSIRSDPKQIQLEIMDNGPVEAAFSVYSDFMNDKSGV 267

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                            ++    ++I+GWG E G P          
Sbjct: 268 YRH------------------------VKGSLLGGHAIRILGWGVEKGTP---------- 293

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW + +++   +GDKGT KILRG +   IE  V   
Sbjct: 294 ------------------------YWLVANSWNTDWGDKGTFKILRGSDHCGIEGSVVTG 329

Query: 242 LPK 244
           LP+
Sbjct: 330 LPR 332


>gi|181192|gb|AAA52129.1| preprocathepsin B [Homo sapiens]
 gi|193787271|dbj|BAG52477.1| unnamed protein product [Homo sapiens]
          Length = 339

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 74/245 (30%), Positives = 106/245 (43%), Gaps = 61/245 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+     Y V++   DI  EI KNGPV          FS     
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPV-------EGAFS----- 254

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                      +YSD   YKSGVY                                    
Sbjct: 255 -----------VYSDFLLYKSGVYQ----------------------------------- 268

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
             + E++    ++++GWG ENG PYW + +++   +GD G  KILRG++   IES V   
Sbjct: 269 HVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 328

Query: 242 LPKDN 246
           +P+ +
Sbjct: 329 IPRTD 333


>gi|91078958|ref|XP_974220.1| PREDICTED: similar to cathepsin b [Tribolium castaneum]
 gi|270004841|gb|EFA01289.1| cathepsin B precursor [Tribolium castaneum]
          Length = 334

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 69/245 (28%), Positives = 100/245 (40%), Gaps = 60/245 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   + W +   +G+V+GG+  SN GC+P    PC H +   + P C       P C
Sbjct: 149 CNGGFPGAAWHYWVNKGIVSGGSFGSNQGCRPYEIAPCEH-HVNGTRPPCTGDDNKTPSC 207

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C    Y   + +DK   K  Y ++ EV  IQ+EIM NGPV     +Y D+ SYK G 
Sbjct: 208 KQQCEK-GYNVPYKKDKNFGKEAYSISSEVQQIQKEIMTNGPVEGAFEVYEDLLSYKKGV 266

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                           E +    ++I+GWG E G PY         
Sbjct: 267 YQH------------------------VKGEALGGHAIRILGWGTEKGTPY--------- 293

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                    W I +++   +GD GT KILRG +   IES +   
Sbjct: 294 -------------------------WLIANSWNSDWGDNGTFKILRGEDHCGIESSIVAG 328

Query: 242 LPKDN 246
           +PKD+
Sbjct: 329 IPKDS 333


>gi|4503139|ref|NP_001899.1| cathepsin B preproprotein [Homo sapiens]
 gi|22538431|ref|NP_680090.1| cathepsin B preproprotein [Homo sapiens]
 gi|22538433|ref|NP_680091.1| cathepsin B preproprotein [Homo sapiens]
 gi|22538435|ref|NP_680092.1| cathepsin B preproprotein [Homo sapiens]
 gi|22538437|ref|NP_680093.1| cathepsin B preproprotein [Homo sapiens]
 gi|68067549|sp|P07858.3|CATB_HUMAN RecName: Full=Cathepsin B; AltName: Full=APP secretase; Short=APPS;
           AltName: Full=Cathepsin B1; Contains: RecName:
           Full=Cathepsin B light chain; Contains: RecName:
           Full=Cathepsin B heavy chain; Flags: Precursor
 gi|291888|gb|AAC37547.1| cathepsin B [Homo sapiens]
 gi|63102437|gb|AAH95408.1| Cathepsin B [Homo sapiens]
 gi|119586034|gb|EAW65630.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|119586036|gb|EAW65632.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|119586037|gb|EAW65633.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|119586038|gb|EAW65634.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|119586039|gb|EAW65635.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|119586040|gb|EAW65636.1| cathepsin B, isoform CRA_a [Homo sapiens]
 gi|168277954|dbj|BAG10955.1| cathepsin B precursor [synthetic construct]
 gi|193786804|dbj|BAG52127.1| unnamed protein product [Homo sapiens]
          Length = 339

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 74/245 (30%), Positives = 106/245 (43%), Gaps = 61/245 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+     Y V++   DI  EI KNGPV          FS     
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPV-------EGAFS----- 254

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                      +YSD   YKSGVY                                    
Sbjct: 255 -----------VYSDFLLYKSGVYQ----------------------------------- 268

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
             + E++    ++++GWG ENG PYW + +++   +GD G  KILRG++   IES V   
Sbjct: 269 HVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 328

Query: 242 LPKDN 246
           +P+ +
Sbjct: 329 IPRTD 333


>gi|426358853|ref|XP_004046705.1| PREDICTED: cathepsin B isoform 1 [Gorilla gorilla gorilla]
 gi|426358855|ref|XP_004046706.1| PREDICTED: cathepsin B isoform 2 [Gorilla gorilla gorilla]
 gi|426358857|ref|XP_004046707.1| PREDICTED: cathepsin B isoform 3 [Gorilla gorilla gorilla]
          Length = 339

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 74/245 (30%), Positives = 106/245 (43%), Gaps = 61/245 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+     Y V++   DI  EI KNGPV          FS     
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPV-------EGAFS----- 254

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                      +YSD   YKSGVY                                    
Sbjct: 255 -----------VYSDFLLYKSGVYQ----------------------------------- 268

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
             + E++    ++++GWG ENG PYW + +++   +GD G  KILRG++   IES V   
Sbjct: 269 HVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 328

Query: 242 LPKDN 246
           +P+ +
Sbjct: 329 IPRTD 333


>gi|30583753|gb|AAP36125.1| Homo sapiens cathepsin B [synthetic construct]
 gi|61370555|gb|AAX43516.1| cathepsin B [synthetic construct]
          Length = 340

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 74/245 (30%), Positives = 106/245 (43%), Gaps = 61/245 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+     Y V++   DI  EI KNGPV          FS     
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPV-------EGAFS----- 254

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                      +YSD   YKSGVY                                    
Sbjct: 255 -----------VYSDFLLYKSGVYQ----------------------------------- 268

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
             + E++    ++++GWG ENG PYW + +++   +GD G  KILRG++   IES V   
Sbjct: 269 HVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 328

Query: 242 LPKDN 246
           +P+ +
Sbjct: 329 IPRTD 333


>gi|118365170|ref|XP_001015806.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89297573|gb|EAR95561.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 340

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 68/242 (28%), Positives = 108/242 (44%), Gaps = 59/242 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G  S+ W ++ ++G+ TGG +  +T C+P  FPPC+H      +P C  +  P P+C
Sbjct: 158 CKGGYPSAAWGYMKRQGVSTGGLYGDDTSCKPYIFPPCDHHVTGQYQP-CGPI-QPTPQC 215

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C ++     + +D +   + Y +   V  IQ+EIM +GPV A+  + +D  +YKSG 
Sbjct: 216 VKECNSEYTQNTYEKDLHFASQTYSIKQNVQAIQREIMAHGPVQASFKVAADFLTYKSGV 275

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y   P +           Y+ G              +VKI+GWG+E   PY         
Sbjct: 276 YIRNPKL----------KYEGG-------------HSVKIIGWGKEGNTPY--------- 303

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                    W I +++ E +G+KG  ++LRGRNE  IE+ +   
Sbjct: 304 -------------------------WLIANSWNEDWGEKGLFRMLRGRNECGIEAQIVAG 338

Query: 242 LP 243
           LP
Sbjct: 339 LP 340


>gi|324507953|gb|ADY43363.1| Cathepsin B cysteine proteinase 6 [Ascaris suum]
          Length = 352

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 71/245 (28%), Positives = 97/245 (39%), Gaps = 58/245 (23%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   + W +  K G+VTG       GC+P  FPPC H +  T    CK    P PKC
Sbjct: 149 CDGGDPMAAWKYWVKEGIVTGSNFTMKQGCKPYPFPPCEHHSNKTHYQPCKHDLYPTPKC 208

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C +    + + +DK+  +  Y V D+V  IQ+EI+ +GPV     +Y D   Y  G 
Sbjct: 209 EKKCLDIYTEKTYAEDKFFGETAYGVEDDVTSIQKEILTHGPVEVAFEVYEDFLMYDGGI 268

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y                        V    +I     VK++GWG E G PYW +   +  
Sbjct: 269 Y------------------------VHTGGKIGGGHAVKMLGWGVEQGVPYWLVANSWNT 304

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                           WGE+                   G  +I+RG +E  IES V G 
Sbjct: 305 D---------------WGED-------------------GFFRIIRGIDECGIESSVVGG 330

Query: 242 LPKDN 246
           LPK N
Sbjct: 331 LPKLN 335


>gi|332862712|ref|XP_003317964.1| PREDICTED: cathepsin B isoform 1 [Pan troglodytes]
 gi|332862714|ref|XP_003317965.1| PREDICTED: cathepsin B isoform 2 [Pan troglodytes]
 gi|332862716|ref|XP_003317966.1| PREDICTED: cathepsin B isoform 3 [Pan troglodytes]
 gi|332862718|ref|XP_519607.3| PREDICTED: cathepsin B isoform 5 [Pan troglodytes]
 gi|410057614|ref|XP_003954244.1| PREDICTED: cathepsin B [Pan troglodytes]
 gi|410262606|gb|JAA19269.1| cathepsin B [Pan troglodytes]
 gi|410262608|gb|JAA19270.1| cathepsin B [Pan troglodytes]
 gi|410359820|gb|JAA44654.1| cathepsin B [Pan troglodytes]
 gi|410359822|gb|JAA44655.1| cathepsin B [Pan troglodytes]
 gi|410359824|gb|JAA44656.1| cathepsin B [Pan troglodytes]
 gi|410359826|gb|JAA44657.1| cathepsin B [Pan troglodytes]
 gi|410359828|gb|JAA44658.1| cathepsin B [Pan troglodytes]
          Length = 339

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 74/245 (30%), Positives = 106/245 (43%), Gaps = 61/245 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+     Y V++   DI  EI KNGPV          FS     
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPV-------EGAFS----- 254

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                      +YSD   YKSGVY                                    
Sbjct: 255 -----------VYSDFLLYKSGVYQ----------------------------------- 268

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
             + E++    ++++GWG ENG PYW + +++   +GD G  KILRG++   IES V   
Sbjct: 269 HVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 328

Query: 242 LPKDN 246
           +P+ +
Sbjct: 329 IPRTD 333


>gi|226821413|gb|ACO82382.1| cathepsin B [Lutjanus argentimaculatus]
          Length = 330

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 75/243 (30%), Positives = 104/243 (42%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S+ W +  K GLV+GG + S+ GC+P + PPC H +   S P C       P+C
Sbjct: 148 CNGGYPSAAWDFWTKEGLVSGGLYDSHVGCRPYTIPPCEH-HVNGSRPPCTGEGGDTPQC 206

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
            ++C    Y   + +DK+  K  Y V  + A+IQ EI KNGPV     +Y D   YKSG 
Sbjct: 207 LSQC-EAGYTPSYREDKHYGKTSYSVLSDEAEIQYEIYKNGPVEGAFTVYEDFVLYKSGV 265

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                      VS SA  V    +K++GWGEENG P          
Sbjct: 266 YQH----------------------VSGSA--VGGHAIKVLGWGEENGVP---------- 291

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW   +++   +GD G  K LRG +   IES +   
Sbjct: 292 ------------------------YWLCANSWNTDWGDNGFFKFLRGSDHCGIESEIVAG 327

Query: 242 LPK 244
           +PK
Sbjct: 328 IPK 330


>gi|87246247|gb|ABD35300.1| cathepsin B-like cysteine protease [Triatoma infestans]
          Length = 333

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 72/243 (29%), Positives = 106/243 (43%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   S W + HK G+V+GG + S  GCQP S  PC H+ + +S P C  + T  PKC
Sbjct: 151 CLGGSPESAWEYWHKFGIVSGGNYGSKQGCQPYSIAPCEHSIHGSS-PACGGV-TDTPKC 208

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C    Y   + +  Y  +  Y + ++   IQ EI+KNGP+VA+  +Y D+FSYK G 
Sbjct: 209 KKQCEK-GYSIPYDKAFYYGQPGYAIPNDAQKIQAEILKNGPIVASFLVYEDLFSYKEGV 267

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         + E +    +KI GWG ENG P          
Sbjct: 268 YQH------------------------VAGEFLGGHVIKIFGWGIENGTP---------- 293

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW + +++   +G+ G  KI RG++E  IE  V+  
Sbjct: 294 ------------------------YWLVANSWNTDWGNNGFFKIPRGKDECGIEIDVSAG 329

Query: 242 LPK 244
           LP+
Sbjct: 330 LPR 332


>gi|16307393|gb|AAH10240.1| Cathepsin B [Homo sapiens]
          Length = 339

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 74/245 (30%), Positives = 106/245 (43%), Gaps = 61/245 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+     Y V++   DI  EI KNGPV          FS     
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPV-------EGAFS----- 254

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                      +YSD   YKSGVY                                    
Sbjct: 255 -----------VYSDFLLYKSGVYQ----------------------------------- 268

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
             + E++    ++++GWG ENG PYW + +++   +GD G  KILRG++   IES V   
Sbjct: 269 HVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 328

Query: 242 LPKDN 246
           +P+ +
Sbjct: 329 IPRTD 333


>gi|397467300|ref|XP_003805362.1| PREDICTED: cathepsin B [Pan paniscus]
          Length = 339

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 74/245 (30%), Positives = 106/245 (43%), Gaps = 61/245 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+     Y V++   DI  EI KNGPV          FS     
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPV-------EGAFS----- 254

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                      +YSD   YKSGVY                                    
Sbjct: 255 -----------VYSDFLLYKSGVYQ----------------------------------- 268

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
             + E++    ++++GWG ENG PYW + +++   +GD G  KILRG++   IES V   
Sbjct: 269 HVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 328

Query: 242 LPKDN 246
           +P+ +
Sbjct: 329 IPRTD 333


>gi|344281458|ref|XP_003412496.1| PREDICTED: cathepsin B-like [Loxodonta africana]
          Length = 340

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 72/245 (29%), Positives = 103/245 (42%), Gaps = 60/245 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  +  W +  K+GLV+GG + S+ GC+P S PPC H +   S P CK      PKC
Sbjct: 150 CNGGFPAGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPCKGEGGETPKC 208

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + +DK+     Y V     +I  EI KNGPV          FS     
Sbjct: 209 SKTC-EPGYSPSYKEDKHYGYSSYGVPSSEQEIMAEIYKNGPV-------EGAFS----- 255

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                      +Y+D   YKSGVY      E+  +A                        
Sbjct: 256 -----------VYTDFLVYKSGVYQHVTGEEVGGHA------------------------ 280

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                      ++++GWG ENG PYW   +++   +GD G  KILRG++   IES +   
Sbjct: 281 -----------IRILGWGVENGTPYWLAANSWNTDWGDNGFFKILRGQDHCGIESEIVAG 329

Query: 242 LPKDN 246
           +P+ +
Sbjct: 330 IPRTD 334


>gi|60816353|gb|AAX36379.1| cathepsin B [synthetic construct]
 gi|61358313|gb|AAX41546.1| cathepsin B [synthetic construct]
          Length = 339

 Score =  105 bits (261), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 74/245 (30%), Positives = 106/245 (43%), Gaps = 61/245 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+     Y V++   DI  EI KNGPV          FS     
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPV-------EGAFS----- 254

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                      +YSD   YKSGVY                                    
Sbjct: 255 -----------VYSDFLLYKSGVYQ----------------------------------- 268

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
             + E++    ++++GWG ENG PYW + +++   +GD G  KILRG++   IES V   
Sbjct: 269 HVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 328

Query: 242 LPKDN 246
           +P+ +
Sbjct: 329 IPRTD 333


>gi|197098184|ref|NP_001126573.1| cathepsin B precursor [Pongo abelii]
 gi|75061687|sp|Q5R6D1.1|CATB_PONAB RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
           light chain; Contains: RecName: Full=Cathepsin B heavy
           chain; Flags: Precursor
 gi|55731764|emb|CAH92586.1| hypothetical protein [Pongo abelii]
 gi|55731953|emb|CAH92685.1| hypothetical protein [Pongo abelii]
          Length = 339

 Score =  105 bits (261), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 74/245 (30%), Positives = 106/245 (43%), Gaps = 61/245 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+     Y V++   DI  EI KNGPV          FS     
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSERDIMAEIYKNGPV-------EGAFS----- 254

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                      +YSD   YKSGVY                                    
Sbjct: 255 -----------VYSDFLLYKSGVYQ----------------------------------- 268

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
             + E++    ++++GWG ENG PYW + +++   +GD G  KILRG++   IES V   
Sbjct: 269 HVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 328

Query: 242 LPKDN 246
           +P+ +
Sbjct: 329 IPRTD 333


>gi|268578113|ref|XP_002644039.1| Hypothetical protein CBG17499 [Caenorhabditis briggsae]
          Length = 355

 Score =  105 bits (261), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 77/234 (32%), Positives = 98/234 (41%), Gaps = 64/234 (27%)

Query: 13  WVHKRGLVTGGAHHSNTGCQPVSFPPC--NHANYTTSEPECKTLATPQPKCHTRCT-NDN 69
           W    GL TGG +    GC+P S  PC  N+ N TTS P C    TP   C   CT N  
Sbjct: 177 WWQTHGLCTGGNYDDQFGCKPYSIYPCDKNYPNGTTSVP-CPGYHTP--PCEDHCTSNIT 233

Query: 70  YGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVA 129
           +   + QDK+  K +Y V  ++ DIQ EIM NGPV+A+  +Y D + YKSG Y       
Sbjct: 234 WPIAYKQDKHFGKAHYNVGKKMTDIQTEIMTNGPVIASFIIYEDFWDYKSGIY------- 286

Query: 130 NMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVA 189
                            V  + +       KI+GWG +NG PYW  V             
Sbjct: 287 -----------------VHTAGDQEGGMDTKIIGWGVDNGVPYWLCVH------------ 317

Query: 190 YATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
                                  +G  FG+ G ++ILRG NE  IE  V  ALP
Sbjct: 318 ----------------------QWGTDFGENGFVRILRGVNEVNIEHQVLAALP 349


>gi|75076082|sp|Q4R5M2.1|CATB_MACFA RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
           light chain; Contains: RecName: Full=Cathepsin B heavy
           chain; Flags: Precursor
 gi|67970521|dbj|BAE01603.1| unnamed protein product [Macaca fascicularis]
 gi|355779504|gb|EHH63980.1| Cathepsin B [Macaca fascicularis]
 gi|383411999|gb|AFH29213.1| cathepsin B preproprotein [Macaca mulatta]
 gi|384942194|gb|AFI34702.1| cathepsin B preproprotein [Macaca mulatta]
          Length = 339

 Score =  105 bits (261), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 74/245 (30%), Positives = 106/245 (43%), Gaps = 61/245 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGYPAGAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+     Y V++   DI  EI KNGPV          FS     
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPV-------EGAFS----- 254

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                      +YSD   YKSGVY                                    
Sbjct: 255 -----------VYSDFLLYKSGVYQ----------------------------------- 268

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
             + E++    ++++GWG ENG PYW + +++   +GD G  KILRG++   IES V   
Sbjct: 269 HVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 328

Query: 242 LPKDN 246
           +P+ +
Sbjct: 329 IPRTD 333


>gi|402877481|ref|XP_003902454.1| PREDICTED: cathepsin B [Papio anubis]
          Length = 339

 Score =  105 bits (261), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 74/245 (30%), Positives = 106/245 (43%), Gaps = 61/245 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGYPAGAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+     Y V++   DI  EI KNGPV          FS     
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPV-------EGAFS----- 254

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                      +YSD   YKSGVY                                    
Sbjct: 255 -----------VYSDFLLYKSGVYQ----------------------------------- 268

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
             + E++    ++++GWG ENG PYW + +++   +GD G  KILRG++   IES V   
Sbjct: 269 HVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 328

Query: 242 LPKDN 246
           +P+ +
Sbjct: 329 IPRTD 333


>gi|302564570|ref|NP_001181828.1| cathepsin B precursor [Macaca mulatta]
          Length = 339

 Score =  105 bits (261), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 74/245 (30%), Positives = 106/245 (43%), Gaps = 61/245 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGYPAGAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+     Y V++   DI  EI KNGPV          FS     
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPV-------EGAFS----- 254

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                      +YSD   YKSGVY                                    
Sbjct: 255 -----------VYSDFLLYKSGVYQ----------------------------------- 268

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
             + E++    ++++GWG ENG PYW + +++   +GD G  KILRG++   IES V   
Sbjct: 269 HVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 328

Query: 242 LPKDN 246
           +P+ +
Sbjct: 329 IPRTD 333


>gi|332244666|ref|XP_003271495.1| PREDICTED: cathepsin B [Nomascus leucogenys]
          Length = 351

 Score =  105 bits (261), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 74/245 (30%), Positives = 106/245 (43%), Gaps = 61/245 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 162 CNGGYPAEAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 219

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+     Y V++   DI  EI KNGPV          FS     
Sbjct: 220 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPV-------EGAFS----- 266

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                      +YSD   YKSGVY                                    
Sbjct: 267 -----------VYSDFLLYKSGVYQ----------------------------------- 280

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
             + E++    ++++GWG ENG PYW + +++   +GD G  KILRG++   IES V   
Sbjct: 281 HITGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 340

Query: 242 LPKDN 246
           +P+ +
Sbjct: 341 IPRTD 345


>gi|348587350|ref|XP_003479431.1| PREDICTED: cathepsin B-like [Cavia porcellus]
          Length = 340

 Score =  104 bits (260), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 73/251 (29%), Positives = 105/251 (41%), Gaps = 61/251 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   + P+C       PKC
Sbjct: 150 CNGGYPTEAWKYWTRKGLVSGGLYGSHVGCRPYSIPPCEH-HVNGTRPKCTGEGGDTPKC 208

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + +DKY     Y V     +I  EI KNGPV A   ++SD  +YKSG 
Sbjct: 209 SKTC-EPGYSPSYKEDKYYGYSSYSVPSTEKEIMAEIYKNGPVEAAFSVFSDFLTYKSGV 267

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         + E++    ++I+GWG+ENG PYW +   + V
Sbjct: 268 YKH------------------------VAGEVLGGHAIRILGWGKENGVPYWLVGNSWNV 303

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                              +GD G  KILRG +   IES V   
Sbjct: 304 ----------------------------------DWGDNGFFKILRGEDHCGIESEVVAG 329

Query: 242 LPK-DNYGVEF 251
           +P+ D Y   F
Sbjct: 330 IPRTDQYWGRF 340


>gi|410912140|ref|XP_003969548.1| PREDICTED: cathepsin B-like [Takifugu rubripes]
          Length = 246

 Score =  104 bits (260), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 74/242 (30%), Positives = 103/242 (42%), Gaps = 60/242 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S+ W +  K GLV+GG + S+ GC+P + PPC H +   S P C       P+C
Sbjct: 64  CNGGYPSAAWDFWTKDGLVSGGLYDSHIGCRPYTIPPCEH-HVNGSRPSCSGEGGETPQC 122

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             RC    Y   + QDK+  K  Y V+ +  DI+ EI KNGPV     +Y D   YK+G 
Sbjct: 123 VYRC-EAGYTPSYKQDKHYGKTSYSVSSDEDDIKHEIYKNGPVEGAFTVYEDFVLYKTGV 181

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                      V+ SA  +    +KI+GWGEENG P          
Sbjct: 182 YQH----------------------VTGSA--LGGHAIKILGWGEENGIP---------- 207

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW   +++   +G+ G  KILRG N   IES +   
Sbjct: 208 ------------------------YWLCANSWNTDWGNNGFFKILRGSNHCGIESEIVAG 243

Query: 242 LP 243
           +P
Sbjct: 244 IP 245


>gi|146165818|ref|XP_001015807.2| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|146145394|gb|EAR95562.2| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 338

 Score =  104 bits (260), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 68/242 (28%), Positives = 103/242 (42%), Gaps = 59/242 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G  S+ W ++   G+ TGG +  ++ C+P  FPPC+H +     P C  +  P PKC
Sbjct: 156 CQGGYPSAAWKYMKATGVSTGGLYGDDSSCKPYVFPPCDH-HVVGQYPPCGPIK-PTPKC 213

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C +    + + QD +   + Y + +    IQ+EIM                      
Sbjct: 214 VKQCNSQYTEKTYQQDLHHPSKVYQLPNNAEAIQREIM---------------------- 251

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
             +GPV A+  + SD  +YKSGVY      +     +VKI+GWG E G PY         
Sbjct: 252 -AHGPVQASFRVASDFLTYKSGVYIRDPKLKYEGGHSVKIIGWGVEQGTPY--------- 301

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                    W I +++ E +G+ G  K+LRG+NE  IE+ V   
Sbjct: 302 -------------------------WLIANSWNEDWGENGLFKMLRGKNECGIEAEVVAG 336

Query: 242 LP 243
           LP
Sbjct: 337 LP 338


>gi|308488328|ref|XP_003106358.1| hypothetical protein CRE_16047 [Caenorhabditis remanei]
 gi|308253708|gb|EFO97660.1| hypothetical protein CRE_16047 [Caenorhabditis remanei]
          Length = 343

 Score =  104 bits (260), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 74/243 (30%), Positives = 102/243 (41%), Gaps = 59/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +  K GLVTGG++ S  GC+P S  PC       + P+C       PKC
Sbjct: 153 CEGGYPIQAWKYWVKNGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWPKCPNSDADTPKC 212

Query: 62  HTRCT-NDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
              CT N +Y   + +DK+     Y V+ +V  IQ EI+KNGPV     +Y+D       
Sbjct: 213 VDHCTSNSSYPIPYEKDKHYGATAYAVSRKVDQIQSEILKNGPVEVGFTVYAD------- 265

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
                            + YKSGVY   A  E+  +A VK++GWG +NG P         
Sbjct: 266 ----------------FYQYKSGVYVHVAGPELGGHA-VKLLGWGVDNGTP--------- 299

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                                    YW   +++   +G+ G  +ILRG NE  IES V  
Sbjct: 300 -------------------------YWLAANSWNTNWGENGYFRILRGVNECGIESQVVA 334

Query: 241 ALP 243
            +P
Sbjct: 335 GMP 337


>gi|260786791|ref|XP_002588440.1| hypothetical protein BRAFLDRAFT_199166 [Branchiostoma floridae]
 gi|229273602|gb|EEN44451.1| hypothetical protein BRAFLDRAFT_199166 [Branchiostoma floridae]
          Length = 332

 Score =  104 bits (260), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 76/243 (31%), Positives = 104/243 (42%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   + W +  + GLVTGG + S  GCQP    PC H +   S P C  +  P P+C
Sbjct: 149 CHGGFPEAAWEYWKQDGLVTGGPYGSMQGCQPYEIAPCEH-HINGSRPACGKI-EPTPRC 206

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C +  Y   F +DK+  K  Y V+ +V  IQ EIM NGPV A   +Y+D F +    
Sbjct: 207 KKTCES-GYNVTFNKDKHYAKSAYSVSSKVQQIQMEIMTNGPVEAAFTVYAD-FPH---- 260

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                             YKSGVY   + AE+  +A VK++GWG E   PY         
Sbjct: 261 ------------------YKSGVYQHESGAELGGHA-VKMIGWGMEGSTPY--------- 292

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                    W I +++   +GD G  KILRG++E  IE  +   
Sbjct: 293 -------------------------WLIANSWNSDWGDMGFFKILRGQDECGIERDIVAG 327

Query: 242 LPK 244
            P+
Sbjct: 328 EPR 330


>gi|56753443|gb|AAW24925.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  104 bits (260), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 74/242 (30%), Positives = 105/242 (43%), Gaps = 62/242 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +  KRG+VTGG+  ++TGCQP  FP C H +     P C T     P+C
Sbjct: 159 CQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEH-HTKGKYPACGTKIYKTPQC 217

Query: 62  HTRCTNDNYGRGFFQDK-YRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
             +C    Y   + QDK Y  +RY  +++E A IQ+EIM  GPV A   +Y D  +YKSG
Sbjct: 218 KQKCQK-GYKTPYEQDKNYGDQRYNVISNEKA-IQREIMMYGPVEAAFDVYEDFLNYKSG 275

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
            Y +                         +  IV    ++I+GWG E G+PY        
Sbjct: 276 IYRH------------------------VAGSIVGGHAIRIIGWGVEKGKPY-------- 303

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                                     W I +++ E +G+ G  +++RGR+E  IES V  
Sbjct: 304 --------------------------WLIANSWNEDWGENGLFRMVRGRDECSIESHVVA 337

Query: 241 AL 242
            L
Sbjct: 338 GL 339


>gi|294894292|ref|XP_002774787.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239880404|gb|EER06603.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 414

 Score =  104 bits (260), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 70/249 (28%), Positives = 103/249 (41%), Gaps = 51/249 (20%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHS------NTGCQPVSFPPCNHANYTTSEPECKTLA 55
           C  GI S  W WVH +G+ TGG + +      + GC P  FPPC H    +  P+C   +
Sbjct: 210 CDGGIPSLAWSWVHNKGIATGGDYLAEDDMTKDDGCWPYDFPPCAHHVNDSKYPKCPKDS 269

Query: 56  TPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIF 115
              P C  +C N  Y      D++           V D +  I  +GPV   +Y      
Sbjct: 270 YETPNCAEQCHNPKYTTTLRDDRHFLVESVPYEYSVNDAKNAIRTDGPV-GPIYFCDPSV 328

Query: 116 SYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
           ++         V A+  +Y D  +Y+SGVY  ++  E+  +A                  
Sbjct: 329 NFDQ-------VSASFIVYEDFLAYRSGVYKHTSGKELGGHA------------------ 363

Query: 176 VRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
                            VK+IGWGEE G+ YW +V+++ E +GD G  KI  G  E  I+
Sbjct: 364 -----------------VKIIGWGEETGQAYWLVVNSWNEDWGDNGLFKIALGNCE--ID 404

Query: 236 SLVNGALPK 244
             + G  PK
Sbjct: 405 DDLLGGTPK 413


>gi|256052331|ref|XP_002569726.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
 gi|353228435|emb|CCD74606.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 319

 Score =  104 bits (259), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 70/234 (29%), Positives = 101/234 (43%), Gaps = 60/234 (25%)

Query: 5   GISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTR 64
           G  +  W +  K G+VTG +  ++T CQP  FP C H +     P C       P C   
Sbjct: 139 GFPALAWDYWVKEGIVTGSSKENHTSCQPYPFPKCEH-HTKGKYPACFEEIYKTPNCENT 197

Query: 65  CTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGN 124
           C   +Y   + QDK+R K  Y V ++   IQ+EIMK GPV AN  +Y D  +YKSG Y +
Sbjct: 198 CQK-SYKTPYAQDKHRGKSRYNVKNDEKAIQKEIMKYGPVEANFIVYEDFLNYKSGIYKH 256

Query: 125 GPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSAS 184
                                    + ++V++  ++I+GWG EN  PY            
Sbjct: 257 ------------------------ITGKLVSWHAIRIIGWGVENNTPY------------ 280

Query: 185 AEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                 W I +++ E +G+ G  +ILRGR+E  IES V
Sbjct: 281 ----------------------WLIPNSWNEDWGENGNFRILRGRHECSIESEV 312


>gi|37788265|gb|AAO64472.1| cathepsin B precursor [Fundulus heteroclitus]
          Length = 330

 Score =  104 bits (259), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 76/243 (31%), Positives = 105/243 (43%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  ++ W +  ++GLVTGG ++S+ GC+P +  PC H +   S P C       P+C
Sbjct: 148 CNGGYPANAWEFWTEQGLVTGGLYNSHIGCRPYTIEPCEH-HVNGSRPPCTGEGGDTPEC 206

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
            T+C    Y   + +DK+  K  Y V  E   IQ EI KNGPV     +Y D  SYKSG 
Sbjct: 207 VTQC-EAGYTPSYQKDKHYGKTSYGVPSEEEQIQSEIYKNGPVEGAFIVYEDFPSYKSGV 265

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                      V+ SA  +    +K++GWGEENG P          
Sbjct: 266 YQH----------------------VTGSA--LGGHAIKMIGWGEENGVP---------- 291

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW   +++   +GD G  KILRG N   IES V   
Sbjct: 292 ------------------------YWLCANSWNTDWGDNGFFKILRGSNHCGIESEVVAG 327

Query: 242 LPK 244
           +PK
Sbjct: 328 IPK 330


>gi|432946172|ref|XP_004083803.1| PREDICTED: cathepsin B-like [Oryzias latipes]
          Length = 330

 Score =  104 bits (259), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 73/241 (30%), Positives = 105/241 (43%), Gaps = 62/241 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G  S+ W +   +GLVTGG   S  GC+P +  PC H +   S P C+      PKC
Sbjct: 148 CFGGFPSAAWEFWTNKGLVTGGLFDSKVGCRPYTLAPCEH-HVNGSRPPCQG-EVETPKC 205

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
            T+C N+ Y   + +DK+  +R Y +  +   I  E+ KNGPV A   +Y+D   YK+G 
Sbjct: 206 VTQC-NNGYSLSYPKDKHFGQRSYSIPSQQEQIMTELYKNGPVEAAFSVYADFLLYKNGV 264

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         + +++    VKI+GWGEENG P          
Sbjct: 265 YQH------------------------VTGDMLGGHAVKILGWGEENGTP---------- 290

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES-LVNG 240
                                   YW + +++   +GDKG  KI RG +E  IES +V G
Sbjct: 291 ------------------------YWLVANSWNSDWGDKGFFKIKRGNDECGIESEMVAG 326

Query: 241 A 241
           A
Sbjct: 327 A 327


>gi|256052329|ref|XP_002569725.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
 gi|353228436|emb|CCD74607.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 345

 Score =  104 bits (259), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 70/237 (29%), Positives = 101/237 (42%), Gaps = 60/237 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  GI    W +  K G+VTG +  ++TGC+P  FP C H +     P C +     P+C
Sbjct: 163 CEGGILGPAWDFWVKEGIVTGSSKENHTGCEPYPFPKCEH-HTKGKYPPCGSKIYKTPRC 221

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+R K  Y V ++   IQ+EIMK GPV A+  +Y D  +YKSG 
Sbjct: 222 KQTCQK-KYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEASFTVYEDFLNYKSGI 280

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         + E +    ++I+GWG EN  PY         
Sbjct: 281 YKH------------------------ITGEALGGHAIRIIGWGVENKTPY--------- 307

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                    W I +++ E +G+ G  +I+RGR+E  IES V
Sbjct: 308 -------------------------WLIANSWNEDWGENGYFRIVRGRDECFIESEV 339


>gi|22531389|emb|CAD44625.1| cathepsin B1 isotype 2 [Schistosoma mansoni]
          Length = 340

 Score =  103 bits (258), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 70/237 (29%), Positives = 101/237 (42%), Gaps = 60/237 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  GI    W +  K G+VTG +  ++TGC+P  FP C H +     P C +     P+C
Sbjct: 158 CEGGILGPAWDFWVKEGIVTGSSKENHTGCEPYPFPKCEH-HTKGKYPPCGSKIYKTPRC 216

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+R K  Y V ++   IQ+EIMK GPV A+  +Y D  +YKSG 
Sbjct: 217 KQTCQK-KYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEASFTVYEDFLNYKSGI 275

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         + E +    ++I+GWG EN  PY         
Sbjct: 276 YKH------------------------ITGEALGGHAIRIIGWGVENKTPY--------- 302

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                    W I +++ E +G+ G  +I+RGR+E  IES V
Sbjct: 303 -------------------------WLIANSWNEDWGENGYFRIVRGRDECFIESEV 334


>gi|339236191|ref|XP_003379650.1| cathepsin B [Trichinella spiralis]
 gi|316977649|gb|EFV60721.1| cathepsin B [Trichinella spiralis]
          Length = 356

 Score =  103 bits (258), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 68/235 (28%), Positives = 101/235 (42%), Gaps = 60/235 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +  K GLVTGG + ++ GC+P  F PCNH +  T  P C     P P C
Sbjct: 171 CQGGDPHQAWSFWVKYGLVTGGNYTTHDGCRPYPFAPCNHHSNGTYGP-CSHDLEPTPVC 229

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + +DKY   + Y ++++ +D+Q+E+M NGP+     +Y D   YK+G 
Sbjct: 230 KKAC-QSTYKIQYNKDKYYGLKAYSLHNKASDLQKELMMNGPMEVAFEVYEDFLLYKTGV 288

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         +  ++    V+++GWGEENG P          
Sbjct: 289 YQH------------------------HTGSVLGGHAVRLLGWGEENGVP---------- 314

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
                                   YW + +++  ++GDKG  KI RGRNE  IES
Sbjct: 315 ------------------------YWLLANSWNTEWGDKGFFKIYRGRNECGIES 345


>gi|194766882|ref|XP_001965553.1| GF22391 [Drosophila ananassae]
 gi|190619544|gb|EDV35068.1| GF22391 [Drosophila ananassae]
          Length = 342

 Score =  103 bits (258), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 73/243 (30%), Positives = 106/243 (43%), Gaps = 59/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   + W +  ++G+V+GG + S TGC+P    PC H    T  P C    +  PKC
Sbjct: 157 CNGGFPGAAWSYWTRKGIVSGGRYGSKTGCRPYEIAPCEHHVNGTRAP-CNH-DSKTPKC 214

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C    Y   + +DK+   + Y V   V DIQ+EIM NGPV     +Y D        
Sbjct: 215 QHQC-EAGYNVEYSKDKHFGSKSYSVRRNVRDIQEEIMTNGPVEGAFTVYED-------- 265

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          +  YKSGVY      E+  +A ++I+GWG                
Sbjct: 266 ---------------LILYKSGVYQHEHGKELGGHA-IRILGWGV--------------- 294

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                           WG+E   PYW I +++ + +GDKG  +ILRG +   IES ++  
Sbjct: 295 ----------------WGKEE-VPYWLIANSWNDDWGDKGFFRILRGEDHCGIESSISAG 337

Query: 242 LPK 244
           LPK
Sbjct: 338 LPK 340


>gi|193603738|ref|XP_001943652.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
           [Acyrthosiphon pisum]
          Length = 337

 Score =  103 bits (258), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 76/236 (32%), Positives = 100/236 (42%), Gaps = 66/236 (27%)

Query: 11  WVWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPKCHT-RCTND 68
           W ++ K GL TGG + SN GCQP S  PC  +AN  + E E        P+C+  +CTN+
Sbjct: 165 WKYIKKNGLCTGGEYGSNEGCQPYSIVPCPRNANSCSKENE------DTPQCYKDQCTNN 218

Query: 69  NYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVV 128
           NY      D Y   + Y V  +   I  E+ KNGPVVA M +Y D   YK G        
Sbjct: 219 NYETPLVSDLYYAYKVYSVKPKPEIIMSEVFKNGPVVAAMKVYDDFLCYKGG-------- 270

Query: 129 ANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIV 188
                   I+ Y +G         +     VKI+GWGE+                     
Sbjct: 271 --------IYQYTTG--------GLKGDHAVKIMGWGED--------------------- 293

Query: 189 AYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
                        +G  YW   +T+G  +G  G  KI RGRNE  IE+ + G LPK
Sbjct: 294 -------------DGIDYWLCANTWGNSWGMGGMFKIRRGRNECGIENRITGGLPK 336


>gi|403307501|ref|XP_003944231.1| PREDICTED: cathepsin B [Saimiri boliviensis boliviensis]
          Length = 351

 Score =  103 bits (258), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 74/245 (30%), Positives = 106/245 (43%), Gaps = 61/245 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 162 CNGGYPAEAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 219

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+     Y V++   DI  EI KNGPV          FS     
Sbjct: 220 SKSC-EPGYTPTYKQDKHYGYNSYSVSNSERDIMAEIYKNGPV-------EGAFS----- 266

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                      +YSD   YKSGVY                                    
Sbjct: 267 -----------VYSDFLLYKSGVYQ----------------------------------- 280

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
             + E++    ++++GWG ENG PYW + +++   +GD G  KILRG++   IES V   
Sbjct: 281 HVTGEMMGGHAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEVVAG 340

Query: 242 LPKDN 246
           +P+ +
Sbjct: 341 IPRTD 345


>gi|226466816|emb|CAX69543.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 337

 Score =  103 bits (258), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 71/239 (29%), Positives = 105/239 (43%), Gaps = 60/239 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G S S W +  + GLVTG ++ +N+GC P  FP C+H + + S P C  +    P C
Sbjct: 153 CNFGYSESAWYYWVENGLVTGESNGNNSGCLPYPFPKCDHGS-SDSYPMCGYVVYTPPVC 211

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +  C    Y   +  DK+  K  Y V    +DI++EIM  GPV A++++Y D   YKSG 
Sbjct: 212 NGTC-RPGYPIPYNDDKHFGKSAYQVKQNESDIRREIMLYGPVEASIFIYDDFVDYKSGV 270

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         +  ++   +V+I+GWG ENG P          
Sbjct: 271 YKH------------------------LTGRLITIQSVRIIGWGIENGIP---------- 296

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                                   YW   +++ E++G  G  KILRG NE  IE+ VN 
Sbjct: 297 ------------------------YWLCANSWNEEWGLNGFFKILRGSNECEIEAFVNA 331


>gi|343961899|dbj|BAK62537.1| cathepsin B precursor [Pan troglodytes]
          Length = 195

 Score =  103 bits (257), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 73/245 (29%), Positives = 105/245 (42%), Gaps = 61/245 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 6   CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 63

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+     Y V++    I  EI KNGPV          FS     
Sbjct: 64  SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKGIMAEIYKNGPV-------EGAFS----- 110

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                      +YSD   YKSGVY                                    
Sbjct: 111 -----------VYSDFLLYKSGVYQ----------------------------------- 124

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
             + E++    ++++GWG ENG PYW + +++   +GD G  KILRG++   IES V   
Sbjct: 125 HVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 184

Query: 242 LPKDN 246
           +P+ +
Sbjct: 185 IPRTD 189


>gi|161343861|tpg|DAA06111.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 323

 Score =  103 bits (257), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 74/245 (30%), Positives = 99/245 (40%), Gaps = 64/245 (26%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK- 60
           C  G +   W +   +G+VTGG + SN GCQP    PC+H    +S   C +L   Q   
Sbjct: 134 CDGGSAYKAWEFTMGKGIVTGGPYDSNEGCQPYKNRPCDHYG-DSSLTNCSSLRRTQMMF 192

Query: 61  CHTRCTNDNYGRGFFQDKYRFKRYY---WVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
           C  +C N NY   +  D Y+    Y   W N  V  IQQEIM  GPV A MY+Y +   Y
Sbjct: 193 CRDKCVNKNYKVKYEDDLYKTSVVYMTSWTN--VKQIQQEIMTYGPVTAFMYVYENFMGY 250

Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
           K G Y                         S + E++ Y  VK++GWG            
Sbjct: 251 KEGVYK------------------------STAGELIGYHHVKLIGWGV----------- 275

Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
                                 +E G  YW  ++++   +G+ G  KILRG N   IE L
Sbjct: 276 ----------------------DEAGIEYWLAMNSWNSNWGNDGLFKILRGYNFCSIELL 313

Query: 238 VNGAL 242
           V   L
Sbjct: 314 VMAGL 318


>gi|158261501|dbj|BAF82928.1| unnamed protein product [Homo sapiens]
          Length = 339

 Score =  103 bits (257), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 73/245 (29%), Positives = 105/245 (42%), Gaps = 61/245 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+     Y V++   DI  EI KNGP           FS     
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPA-------EGAFS----- 254

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                      +YSD   YKSGVY                                    
Sbjct: 255 -----------VYSDFLLYKSGVYQ----------------------------------- 268

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
             + E++    ++++GWG ENG PYW + +++   +GD G  KILRG++   IES V   
Sbjct: 269 HVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 328

Query: 242 LPKDN 246
           +P+ +
Sbjct: 329 IPRTD 333


>gi|999909|pdb|1HUC|B Chain B, The Refined 2.15 Angstroms X-Ray Crystal Structure Of
           Human Liver Cathepsin B: The Structural Basis For Its
           Specificity
 gi|999911|pdb|1HUC|D Chain D, The Refined 2.15 Angstroms X-Ray Crystal Structure Of
           Human Liver Cathepsin B: The Structural Basis For Its
           Specificity
 gi|1421164|pdb|1CSB|B Chain B, Crystal Structure Of Cathepsin B Inhibited With Ca030 At
           2.1 Angstroms Resolution: A Basis For The Design Of
           Specific Epoxysuccinyl Inhibitors
 gi|1421167|pdb|1CSB|E Chain E, Crystal Structure Of Cathepsin B Inhibited With Ca030 At
           2.1 Angstroms Resolution: A Basis For The Design Of
           Specific Epoxysuccinyl Inhibitors
 gi|122920711|pdb|2IPP|B Chain B, Crystal Structure Of The Tetragonal Form Of Human Liver
           Cathepsin B
          Length = 205

 Score =  103 bits (257), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 72/245 (29%), Positives = 104/245 (42%), Gaps = 61/245 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 22  CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 79

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+     Y V++   DI  EI KNGPV     +YSD   YKSG 
Sbjct: 80  SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGV 138

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         + E++    ++I+GWG ENG P          
Sbjct: 139 YQH------------------------VTGEMMGGHAIRILGWGVENGTP---------- 164

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW + +++   +GD G  KILRG++   IES V   
Sbjct: 165 ------------------------YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 200

Query: 242 LPKDN 246
           +P+ +
Sbjct: 201 IPRTD 205


>gi|431918315|gb|ELK17542.1| Cathepsin B [Pteropus alecto]
          Length = 359

 Score =  103 bits (257), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 74/242 (30%), Positives = 103/242 (42%), Gaps = 60/242 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S  W +  K+GLV+GG + S+ GC+P S PPC H +   S P C       PKC
Sbjct: 173 CNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPCTGEGGSTPKC 231

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
            +R     Y   + +DK+     Y V     +I  EI KNGPV A   +YSD   YKSG 
Sbjct: 232 -SRICEAGYTPSYKEDKHFGCSSYSVPSSETEIMAEIYKNGPVEAAFSVYSDFLLYKSGV 290

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         + E++    V+I+GWG E+G PYW        
Sbjct: 291 YQH------------------------VTGEMMGGHAVRILGWGVEDGTPYW-------- 318

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                        L+G             +++   +GD G  KILRG++   IES +   
Sbjct: 319 -------------LVG-------------NSWNTDWGDSGFFKILRGQDHCGIESEIVAG 352

Query: 242 LP 243
           LP
Sbjct: 353 LP 354


>gi|256090368|ref|XP_002581167.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
 gi|22531387|emb|CAD44624.1| cathepsin B1 isotype 1 [Schistosoma mansoni]
 gi|353228442|emb|CCD74613.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 340

 Score =  103 bits (256), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 70/237 (29%), Positives = 100/237 (42%), Gaps = 60/237 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  GI    W +  K G+VTG +  ++TGC+P  FP C H +     P C +     P+C
Sbjct: 158 CEGGILGPAWDYWVKEGIVTGSSKENHTGCEPYPFPKCEH-HTKGKYPPCGSKIYKTPRC 216

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+R K  Y V ++   IQ+EIMK GPV A   +Y D  +YKSG 
Sbjct: 217 KQTCQK-KYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEAGFTVYEDFLNYKSGI 275

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         + E +    ++I+GWG EN  PY         
Sbjct: 276 YKH------------------------ITGETLGGHAIRIIGWGVENKTPY--------- 302

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                    W I +++ E +G+ G  +I+RGR+E  IES V
Sbjct: 303 -------------------------WLIANSWNEDWGENGYFRIVRGRDECSIESEV 334


>gi|189096178|pdb|3CBJ|A Chain A, Chagasin-cathepsin B Complex
 gi|189096180|pdb|3CBK|A Chain A, Chagasin-Cathepsin B
          Length = 266

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 73/245 (29%), Positives = 106/245 (43%), Gaps = 61/245 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC  A+   + P C T     PKC
Sbjct: 77  CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCE-AHVNGARPPC-TGEGDTPKC 134

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+     Y V++   DI  EI KNGPV          FS     
Sbjct: 135 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPV-------EGAFS----- 181

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                      +YSD   YKSGVY                                    
Sbjct: 182 -----------VYSDFLLYKSGVYQ----------------------------------- 195

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
             + E++    ++++GWG ENG PYW + +++   +GD G  KILRG++   IES V   
Sbjct: 196 HVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 255

Query: 242 LPKDN 246
           +P+ +
Sbjct: 256 IPRTD 260


>gi|296221607|ref|XP_002756833.1| PREDICTED: cathepsin B, partial [Callithrix jacchus]
          Length = 330

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 73/245 (29%), Positives = 106/245 (43%), Gaps = 61/245 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 141 CNGGYPAEAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 198

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+     Y V++   DI  EI KNGPV          FS     
Sbjct: 199 SKSC-EPGYSPTYKQDKHYGYDSYSVSNNERDIMAEIYKNGPV-------EGAFS----- 245

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                      +Y+D   YKSGVY                                    
Sbjct: 246 -----------VYADFLLYKSGVYQ----------------------------------- 259

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
             + E++    ++++GWG ENG PYW + +++   +GD G  KILRG++   IES V   
Sbjct: 260 HVTGEMMGGHAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEVVAG 319

Query: 242 LPKDN 246
           +P+ +
Sbjct: 320 IPRTD 324


>gi|262368170|pdb|3K9M|A Chain A, Cathepsin B In Complex With Stefin A
 gi|262368172|pdb|3K9M|B Chain B, Cathepsin B In Complex With Stefin A
          Length = 254

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 72/245 (29%), Positives = 104/245 (42%), Gaps = 61/245 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 71  CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 128

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+     Y V++   DI  EI KNGPV     +YSD   YKSG 
Sbjct: 129 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGV 187

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         + E++    ++I+GWG ENG P          
Sbjct: 188 YQH------------------------VTGEMMGGHAIRILGWGVENGTP---------- 213

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW + +++   +GD G  KILRG++   IES V   
Sbjct: 214 ------------------------YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 249

Query: 242 LPKDN 246
           +P+ +
Sbjct: 250 IPRTD 254


>gi|45361295|ref|NP_989225.1| cathepsin B precursor [Xenopus (Silurana) tropicalis]
 gi|38969948|gb|AAH63365.1| hypothetical protein MGC75969 [Xenopus (Silurana) tropicalis]
          Length = 333

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 75/244 (30%), Positives = 103/244 (42%), Gaps = 60/244 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S  W +  + GLV+GG + S+ GC+P S PPC H +   S P CK      PKC
Sbjct: 150 CNGGYPSGAWKFWTETGLVSGGLYDSHLGCRPYSIPPCEH-HVNGSRPACKGEEGDTPKC 208

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C  D Y   +  DK+     Y V     +I  EI KNGPV     +Y+D        
Sbjct: 209 VKQC-EDGYAPVYGSDKHFGATSYGVPSSEKEIMAEIYKNGPVEGAFLVYADF------- 260

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
               P+            YKSGVY      E+  +A +KI+GWG ENG P          
Sbjct: 261 ----PM------------YKSGVYQHETGEELGGHA-IKILGWGVENGTP---------- 293

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW   +++   +GD G  KILRG++   IES +   
Sbjct: 294 ------------------------YWLCANSWNTDWGDNGFFKILRGKDHCGIESEIVAG 329

Query: 242 LPKD 245
           +PK+
Sbjct: 330 IPKN 333


>gi|345790427|ref|XP_543203.3| PREDICTED: cathepsin B [Canis lupus familiaris]
          Length = 339

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 75/242 (30%), Positives = 105/242 (43%), Gaps = 61/242 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  +  W +  K+GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGFPAEAWNFWTKQGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + +DK+     Y V+D   +I  EI KNGPV A   +YSD   YKSG 
Sbjct: 208 SKIC-EPGYSPSYKEDKHYGCSSYSVSDNEKEIMAEIYKNGPVEAAFTVYSDFLLYKSGV 266

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         + E++    V+I+GWG E+G PYW        
Sbjct: 267 YQH------------------------VTGEMMGGHAVRILGWGVEDGTPYW-------- 294

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                        L+G             +++   +GD G  KILRGR+   IES +   
Sbjct: 295 -------------LVG-------------NSWNTDWGDNGFFKILRGRDHCGIESEIVAG 328

Query: 242 LP 243
           +P
Sbjct: 329 IP 330


>gi|333361087|pdb|3AI8|B Chain B, Cathepsin B In Complex With The Nitroxoline
 gi|333361088|pdb|3AI8|A Chain A, Cathepsin B In Complex With The Nitroxoline
          Length = 256

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 72/245 (29%), Positives = 104/245 (42%), Gaps = 61/245 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 73  CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 130

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+     Y V++   DI  EI KNGPV     +YSD   YKSG 
Sbjct: 131 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGV 189

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         + E++    ++I+GWG ENG P          
Sbjct: 190 YQH------------------------VTGEMMGGHAIRILGWGVENGTP---------- 215

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW + +++   +GD G  KILRG++   IES V   
Sbjct: 216 ------------------------YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 251

Query: 242 LPKDN 246
           +P+ +
Sbjct: 252 IPRTD 256


>gi|56752811|gb|AAW24617.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 75/233 (32%), Positives = 103/233 (44%), Gaps = 63/233 (27%)

Query: 13  WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGR 72
           WV KRG+VTGG+  ++TGCQP  FP C H       P C T     P+C   C    Y  
Sbjct: 171 WV-KRGIVTGGSEENHTGCQPYPFPKCEHLT-KGKYPACGTKIYKTPQCKQTCQK-GYKT 227

Query: 73  GFFQDK-YRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANM 131
            + QDK Y  +RY  +++E A IQ+EIM  GPV A   +Y D  +YKSG Y +       
Sbjct: 228 PYEQDKHYGDQRYNVISNEKA-IQREIMMYGPVEAAFDVYEDFLNYKSGIYRH------- 279

Query: 132 YLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYA 191
                             +  IV    ++I+GWG E G+PY                   
Sbjct: 280 -----------------VTGSIVGGHAIRIIGWGVEKGKPY------------------- 303

Query: 192 TVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
                          W I +++ E +G+KG  +++RGR+E  IES V   L K
Sbjct: 304 ---------------WLIANSWNEDWGEKGLFRMVRGRDECSIESHVVAGLIK 341


>gi|157833437|pdb|1PBH|A Chain A, Crystal Structure Of Human Recombinant Procathepsin B At
           3.2 Angstrom Resolution
 gi|157835646|pdb|2PBH|A Chain A, Crystal Structure Of Human Procathepsin B At 3.3 Angstrom
           Resolution
 gi|157836863|pdb|3PBH|A Chain A, Refined Crystal Structure Of Human Procathepsin B At 2.5
           Angstrom Resolution
          Length = 317

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 72/245 (29%), Positives = 104/245 (42%), Gaps = 61/245 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 134 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 191

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+     Y V++   DI  EI KNGPV     +YSD   YKSG 
Sbjct: 192 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGV 250

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         + E++    ++I+GWG ENG P          
Sbjct: 251 YQH------------------------VTGEMMGGHAIRILGWGVENGTP---------- 276

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW + +++   +GD G  KILRG++   IES V   
Sbjct: 277 ------------------------YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 312

Query: 242 LPKDN 246
           +P+ +
Sbjct: 313 IPRTD 317


>gi|73586701|gb|AAI02998.1| CTSB protein [Bos taurus]
          Length = 335

 Score =  102 bits (255), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 75/242 (30%), Positives = 104/242 (42%), Gaps = 61/242 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S  W +  K+GLV+GG ++S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + +DK+     Y V +   +I  EI KNGPV     +YSD   YKSG 
Sbjct: 208 SKTC-EPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGV 266

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         S EI+    ++I+GWG ENG PYW        
Sbjct: 267 YQH------------------------VSGEIMGGHAIRILGWGVENGTPYW-------- 294

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                        L+G             +++   +GD G  KILRG++   IES +   
Sbjct: 295 -------------LVG-------------NSWNTDWGDNGFFKILRGQDHCGIESEIVAG 328

Query: 242 LP 243
           +P
Sbjct: 329 MP 330


>gi|226471004|emb|CAX70583.1| Cysteine PRotease related protein [Schistosoma japonicum]
          Length = 304

 Score =  102 bits (255), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 75/242 (30%), Positives = 102/242 (42%), Gaps = 62/242 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +  KRG+VTGG+  ++TGCQP  FP C H       P C T     P+C
Sbjct: 121 CKGGFPGQAWDYWVKRGIVTGGSEENHTGCQPYPFPKCEHLT-KGKYPACGTKIYKTPQC 179

Query: 62  HTRCTNDNYGRGFFQDK-YRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
              C    Y   + QDK Y  +RY  +++E A IQ+EIM  GPV A   +Y D  +YKSG
Sbjct: 180 KQTCQK-GYKTPYEQDKHYGDQRYNVISNEKA-IQREIMMYGPVEAAFDVYEDFLNYKSG 237

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
            Y +                         +  IV    ++I+GWG E   PY        
Sbjct: 238 IYRH------------------------VTGSIVGGHAIRIIGWGVEKRTPY-------- 265

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                                     W I +++ E +G+KG  +I+RGR+E  IES V  
Sbjct: 266 --------------------------WLIANSWNEDWGEKGLFRIVRGRDECSIESHVVA 299

Query: 241 AL 242
            L
Sbjct: 300 GL 301


>gi|449667614|ref|XP_002166962.2| PREDICTED: cathepsin B-like [Hydra magnipapillata]
          Length = 330

 Score =  102 bits (255), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 73/243 (30%), Positives = 102/243 (41%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   S W +   +G+VTGG ++S+ GCQP + P C+H    +  P   +L  P PKC
Sbjct: 148 CNGGYPQSAWEFFKTKGIVTGGPYNSHKGCQPYAIPACDHHVPHSKNPCNGSL--PTPKC 205

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   +  DK+     Y +N++  +I +EIM NGPV A   +++D  +YKSG 
Sbjct: 206 EKVC-EKGYNITYKNDKHYGVTSYSINNDQNEIMREIMTNGPVEAAFTVFADFPNYKSGV 264

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         S E +    +KI+GWG EN  PYW +   +  
Sbjct: 265 YQH------------------------VSGEELGGHAIKILGWGVENNTPYWLVANSW-- 298

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                  P W          GD G  KILRG +E  IE  V   
Sbjct: 299 ----------------------NPSW----------GDNGFFKILRGSDECGIEDEVVAG 326

Query: 242 LPK 244
           LPK
Sbjct: 327 LPK 329


>gi|17560488|ref|NP_506310.1| Protein F32H5.1 [Caenorhabditis elegans]
 gi|3876629|emb|CAB04249.1| Protein F32H5.1 [Caenorhabditis elegans]
          Length = 356

 Score =  102 bits (255), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 75/234 (32%), Positives = 99/234 (42%), Gaps = 64/234 (27%)

Query: 13  WVHKRGLVTGGAHHSNTGCQPVSFPPCN--HANYTTSEPECKTLATPQPKCHTRCT-NDN 69
           W    GL TGG ++   GC+P S  PC+  +AN TTS P C    TP   C   CT N  
Sbjct: 178 WWQTHGLCTGGNYNDQFGCKPYSIYPCDKKYANGTTSVP-CPGYHTP--TCEEHCTSNIT 234

Query: 70  YGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVA 129
           +   + QDK+  K +Y V  ++ DIQ EIM NGPV+A+  +Y D + YK+G Y       
Sbjct: 235 WPIAYKQDKHFGKAHYNVGKKMTDIQIEIMTNGPVIASFIIYDDFWDYKTGIY------- 287

Query: 130 NMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVA 189
                            V  + +       KI+GWG +NG PYW  V             
Sbjct: 288 -----------------VHTAGDQEGGMDTKIIGWGVDNGVPYWLCVH------------ 318

Query: 190 YATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
                                  +G  FG+ G ++ LRG NE  IE  V  ALP
Sbjct: 319 ----------------------QWGTDFGENGFVRFLRGVNEVNIEHQVLAALP 350


>gi|201023321|ref|NP_001128402.1| cathepsin B-1874 precursor [Acyrthosiphon pisum]
          Length = 315

 Score =  102 bits (255), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 68/243 (27%), Positives = 97/243 (39%), Gaps = 59/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQ-PK 60
           C  G    +W +  + G V+GG ++SN GCQP + PPC   N       C T    + P 
Sbjct: 130 CDGGSLFESWDYYRRHGFVSGGDYNSNQGCQPYTIPPCKLMNEKPPGHSCTTYHREETPI 189

Query: 61  CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           C  +C N NY   F  D Y+  +YY ++  +A   ++I  NGP+    Y+Y D+  YKSG
Sbjct: 190 CEKKCYNPNYYTSFRTDIYK-GKYYKLSPYMA--MKDIFDNGPITTQFYMYRDLVDYKSG 246

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
            Y                      Y   +  +     +VKI GWGEENG P         
Sbjct: 247 VY---------------------QYDEQSDFDFFTVHSVKIFGWGEENGVP--------- 276

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                                    YW + ++FG  +G  GT KI RG +    +  +  
Sbjct: 277 -------------------------YWLVANSFGTDWGYNGTFKISRGNDGCFFQEKMYA 311

Query: 241 ALP 243
            LP
Sbjct: 312 GLP 314


>gi|203648|gb|AAA40993.1| cathepsin (EC 3.4.22.1), partial [Rattus norvegicus]
          Length = 271

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 70/243 (28%), Positives = 106/243 (43%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S  W +  ++GLV+GG ++S+ GC P + PPC H +   S P C T     PKC
Sbjct: 82  CNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH-HVNGSRPPC-TGEGDTPKC 139

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +  C    Y   + +DK+     Y V+D   +I  EI KNGPV     ++SD  +YKSG 
Sbjct: 140 NKMC-EAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSDFLTYKSG- 197

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          ++ +++G        +++    ++I+GWG ENG PYW +   + V
Sbjct: 198 ---------------VYKHEAG--------DVMGGHAIRILGWGIENGVPYWLVANSWNV 234

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                              +GD G  KILRG N   IES +   
Sbjct: 235 ----------------------------------DWGDNGFFKILRGENHCGIESEIVAG 260

Query: 242 LPK 244
           +P+
Sbjct: 261 IPR 263


>gi|25988674|gb|AAN76202.1| lysosomal cysteine proteinase cathepsin B/green fluorescent protein
           EGFP fusion protein [synthetic construct]
          Length = 578

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 70/243 (28%), Positives = 106/243 (43%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S  W +  ++GLV+GG ++S+ GC P + PPC H +   S P C T     PKC
Sbjct: 150 CNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +  C    Y   + +DK+     Y V+D   +I  EI KNGPV     ++SD  +YKSG 
Sbjct: 208 NKMC-EAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSDFLTYKSG- 265

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          ++ +++G        +++    ++I+GWG ENG PYW +   + V
Sbjct: 266 ---------------VYKHEAG--------DVMGGHAIRILGWGIENGVPYWLVANSWNV 302

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                              +GD G  KILRG N   IES +   
Sbjct: 303 ----------------------------------DWGDNGFFKILRGENHCGIESEIVAG 328

Query: 242 LPK 244
           +P+
Sbjct: 329 IPR 331


>gi|341886633|gb|EGT42568.1| hypothetical protein CAEBREN_17563 [Caenorhabditis brenneri]
          Length = 358

 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 75/234 (32%), Positives = 99/234 (42%), Gaps = 64/234 (27%)

Query: 13  WVHKRGLVTGGAHHSNTGCQPVSFPPCN--HANYTTSEPECKTLATPQPKCHTRCT-NDN 69
           W    GL TGG +    GC+P +  PC+  + N TTS P C    TP   C  RCT N  
Sbjct: 180 WWQTHGLCTGGNYDDQFGCKPYTIYPCDKKYPNGTTSVP-CPGYHTP--VCEERCTSNIT 236

Query: 70  YGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVA 129
           +   + QDK+  K +Y V  ++ DIQ EIM+NGPV+A+  +Y D + YKSG Y       
Sbjct: 237 WPISYKQDKHFGKAHYNVGKKMTDIQTEIMRNGPVIASFIIYDDFWDYKSGIY------- 289

Query: 130 NMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVA 189
                            V  + +       KI+GWG +NG PYW  V             
Sbjct: 290 -----------------VHTAGDQEGGMDTKIIGWGVDNGVPYWLCVH------------ 320

Query: 190 YATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
                                  +G  FG+ G ++ILRG NE  IE  V  A P
Sbjct: 321 ----------------------QWGTDFGENGFVRILRGVNEVNIEHQVLAAQP 352


>gi|56752997|gb|AAW24710.1| unknown [Schistosoma japonicum]
          Length = 342

 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 74/231 (32%), Positives = 102/231 (44%), Gaps = 63/231 (27%)

Query: 13  WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGR 72
           WV KRG+VTGG+  ++TGCQP  FP C H       P C T     P+C   C    Y  
Sbjct: 171 WV-KRGIVTGGSEENHTGCQPYPFPKCEHLT-KGKYPACGTKIYKTPQCKQTCQK-GYKT 227

Query: 73  GFFQDK-YRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANM 131
            + QDK Y  +RY  +++E A IQ+EIM  GPV A   +Y D  +YKSG Y +       
Sbjct: 228 PYEQDKHYGDQRYNVISNEKA-IQREIMMYGPVEAAFDVYEDFLNYKSGIYRH------- 279

Query: 132 YLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYA 191
                             +  IV    ++I+GWG E G+PY                   
Sbjct: 280 -----------------VTGSIVGGHAIRIIGWGVEKGKPY------------------- 303

Query: 192 TVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 242
                          W I +++ E +G+KG  +++RGR+E  IES V   L
Sbjct: 304 ---------------WLIANSWNEDWGEKGLFRMVRGRDECSIESHVVAGL 339


>gi|226468762|emb|CAX76409.1| cathepsin B [Schistosoma japonicum]
 gi|257206178|emb|CAX82740.1| cathepsin B [Schistosoma japonicum]
          Length = 348

 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 70/243 (28%), Positives = 101/243 (41%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   S W++   +G+VTG  +++  GCQP  FPPC H N     P C       P C
Sbjct: 164 CNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPYEFPPCEH-NTLGPLPVCDG-DVETPPC 221

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   +  DK+  K  Y V      I +E+M++GPV  +  +Y+D  +YKSG 
Sbjct: 222 KRTC-QAGYNVSYENDKWYGKVVYRVKSNQEAIMKELMQHGPVEVDFEVYADFPNYKSGV 280

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         S  ++    V+++GWGEEN  P          
Sbjct: 281 YQH------------------------VSGALLGGHAVRLLGWGEENNVP---------- 306

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW I +++   +GD G  KI+RG+NE  IES VN  
Sbjct: 307 ------------------------YWLIANSWNTDWGDNGYFKIIRGKNECGIESDVNAG 342

Query: 242 LPK 244
           +PK
Sbjct: 343 IPK 345


>gi|1705630|sp|P00787.2|CATB_RAT RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; AltName:
           Full=RSG-2; Contains: RecName: Full=Cathepsin B light
           chain; Contains: RecName: Full=Cathepsin B heavy chain;
           Flags: Precursor
 gi|1524328|emb|CAA57792.1| cathepsin b [Rattus norvegicus]
          Length = 339

 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 70/243 (28%), Positives = 106/243 (43%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S  W +  ++GLV+GG ++S+ GC P + PPC H +   S P C T     PKC
Sbjct: 150 CNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +  C    Y   + +DK+     Y V+D   +I  EI KNGPV     ++SD  +YKSG 
Sbjct: 208 NKMC-EAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSDFLTYKSG- 265

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          ++ +++G        +++    ++I+GWG ENG PYW +   + V
Sbjct: 266 ---------------VYKHEAG--------DVMGGHAIRILGWGIENGVPYWLVANSWNV 302

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                              +GD G  KILRG N   IES +   
Sbjct: 303 ----------------------------------DWGDNGFFKILRGENHCGIESEIVAG 328

Query: 242 LPK 244
           +P+
Sbjct: 329 IPR 331


>gi|82830420|ref|NP_072119.2| cathepsin B preproprotein [Rattus norvegicus]
 gi|47939014|gb|AAH72490.1| Cathepsin B [Rattus norvegicus]
 gi|149030258|gb|EDL85314.1| rCG52258, isoform CRA_a [Rattus norvegicus]
          Length = 339

 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 70/243 (28%), Positives = 106/243 (43%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S  W +  ++GLV+GG ++S+ GC P + PPC H +   S P C T     PKC
Sbjct: 150 CNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +  C    Y   + +DK+     Y V+D   +I  EI KNGPV     ++SD  +YKSG 
Sbjct: 208 NKMC-EAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSDFLTYKSG- 265

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          ++ +++G        +++    ++I+GWG ENG PYW +   + V
Sbjct: 266 ---------------VYKHEAG--------DVMGGHAIRILGWGIENGVPYWLVANSWNV 302

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                              +GD G  KILRG N   IES +   
Sbjct: 303 ----------------------------------DWGDNGFFKILRGENHCGIESEIVAG 328

Query: 242 LPK 244
           +P+
Sbjct: 329 IPR 331


>gi|347972086|ref|XP_313835.5| AGAP004533-PA [Anopheles gambiae str. PEST]
 gi|333469165|gb|EAA09183.5| AGAP004533-PA [Anopheles gambiae str. PEST]
          Length = 337

 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 67/243 (27%), Positives = 101/243 (41%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   + W +  ++GLV+GG   SN GCQP +  PC H +   + P C+      PKC
Sbjct: 154 CNGGFPGAAWSYWVRKGLVSGGPFGSNLGCQPYAIAPCEH-HVNGTRPSCEGEGGKTPKC 212

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C  ++Y   + +DK      Y +    A IQ+EIM NGPV     +Y D+  YK G 
Sbjct: 213 VKKC-QESYNVPYQKDKRFGASSYSIARHEAQIQKEIMTNGPVEGAFTVYEDLLHYKEGV 271

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         + +++    ++I+GWG ENG  Y         
Sbjct: 272 YQH------------------------VTGKMLGGHAIRILGWGVENGTKY--------- 298

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                    W I +++   +GD G  KILRG +   IES ++  
Sbjct: 299 -------------------------WLIANSWNSDWGDNGFFKILRGEDHLGIESSISAG 333

Query: 242 LPK 244
           LPK
Sbjct: 334 LPK 336


>gi|427785213|gb|JAA58058.1| Putative cathepsin l culex quinquefasciatus cathepsin l
           [Rhipicephalus pulchellus]
          Length = 346

 Score =  102 bits (253), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 75/243 (30%), Positives = 98/243 (40%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   S W +  K G+VTGG + S+ GC P     C+H    T  P C     P P+C
Sbjct: 163 CNGGFPGSAWSFWVKTGIVTGGNYDSDDGCMPYPIKACDHHVNGTLGP-CDKKIPPTPRC 221

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   +  DK+  K  Y V  E   IQ EIM NGPV A+  +           
Sbjct: 222 VHMCRK-GYDVDYHDDKHYGKSSYSVPSEEKQIQAEIMTNGPVEADFTV----------- 269

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                       YSD   YKSGVY       +  +A ++++GWG ENG P          
Sbjct: 270 ------------YSDFVHYKSGVYQRHTDEALGGHA-IRLLGWGVENGVP---------- 306

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW   +++  ++GDKG  KILRG +E  IE  V   
Sbjct: 307 ------------------------YWLAANSWNTEWGDKGFFKILRGSDECGIEDDVVAG 342

Query: 242 LPK 244
           LPK
Sbjct: 343 LPK 345


>gi|294876463|ref|XP_002767679.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239869446|gb|EER00397.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 348

 Score =  102 bits (253), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 70/240 (29%), Positives = 101/240 (42%), Gaps = 57/240 (23%)

Query: 2   CSSGISSSTWVWVHKRGLVTGG------AHHSNTGCQPVSFPPCNHANYTTSEPECKTLA 55
           C  G++   WV+++K G+ TGG      +  +  GC P +FP C H    +    C   +
Sbjct: 157 CKGGVARDAWVFLNKHGIATGGDFVPKSSMEAVDGCWPYNFPRCAHYQKKSKYGPCPKKS 216

Query: 56  TPQPKCHTRCTNDNYGRGFFQDKYRFKRY--YWVNDEVADIQQEIMKNGPVVANMYLYSD 113
              P C  RC N+ YG    +D++   R   YW N  +  I++EIMK+GP  A+ + Y D
Sbjct: 217 YETPSCLDRCPNEKYGTPLDKDRHFTARAVPYWFNG-IRSIKKEIMKHGPTSASFFTYED 275

Query: 114 IFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYW 173
            FSYKSG                ++ Y SG Y        V + TV+++GWG E G  YW
Sbjct: 276 FFSYKSG----------------VYKYTSGAY--------VEFHTVELIGWGTEKGVDYW 311

Query: 174 TIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 233
                                   W EE     W  + TF    GD G   ++ G   A+
Sbjct: 312 LAKN-------------------DWNEE-----WADLGTFKIAQGDCGINDLVLGAPAAL 347


>gi|161343855|tpg|DAA06108.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 342

 Score =  102 bits (253), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 74/246 (30%), Positives = 105/246 (42%), Gaps = 69/246 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP-K 60
           C  G     W +  + G+VTGG + S  GC P   PPC       SE +       QP +
Sbjct: 159 CHGGYPIKAWSYFRRHGIVTGGGYQSGEGCAPYRVPPC------FSEEDGNNTCRGQPME 212

Query: 61  CHTRCTNDNYGRG--FFQDKYRFKR-YYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
            H RCT   YG     + D +RF R YY++    A IQ+++M  GP+ A+M +Y      
Sbjct: 213 KHHRCTRMCYGDQEIDYDDDHRFTRDYYYLT--YASIQKDVMTYGPIEASMEVYD----- 265

Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
                             D  SYKSGVY  S +A  +    VK++GWGEE+G PY     
Sbjct: 266 ------------------DFPSYKSGVYEKSENATYLGGHAVKLIGWGEEDGVPY----- 302

Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
                                        W +V+++ E +GDKG  KI RG NE  +++ 
Sbjct: 303 -----------------------------WLMVNSWSEMWGDKGLFKIRRGTNECSVDNS 333

Query: 238 VNGALP 243
           +   +P
Sbjct: 334 MTAGVP 339


>gi|223646922|gb|ACN10219.1| Cathepsin B precursor [Salmo salar]
 gi|223647940|gb|ACN10728.1| Cathepsin B precursor [Salmo salar]
 gi|223672785|gb|ACN12574.1| Cathepsin B precursor [Salmo salar]
          Length = 330

 Score =  102 bits (253), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 72/242 (29%), Positives = 101/242 (41%), Gaps = 60/242 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S+ W +    GLVTGG + S+ GC+P S PPC H +   + P C       P+C
Sbjct: 148 CNGGYPSAAWDFWTTEGLVTGGLYDSHVGCRPYSIPPCEH-HVNGTRPPCTGEEGDTPQC 206

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C    Y  G+ QDK+  K  Y +  E   I  E++KNGPV     +Y D   YKSG 
Sbjct: 207 SNQCET-GYTPGYKQDKHFGKNSYSLPSEEQQIMAELLKNGPVEGAFTVYEDFLLYKSGV 265

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                      VS SA  V    +K++GWGEE G P          
Sbjct: 266 YQH----------------------VSGSA--VGGHAIKVLGWGEEGGTP---------- 291

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW   +++   +G+ G  KILRG++   IES +   
Sbjct: 292 ------------------------YWLAANSWNTDWGENGFFKILRGKDHCGIESEMVAG 327

Query: 242 LP 243
           +P
Sbjct: 328 VP 329


>gi|204022100|dbj|BAG71147.1| cathepsin B-N1 [Tuberaphis takenouchii]
          Length = 334

 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 74/242 (30%), Positives = 101/242 (41%), Gaps = 61/242 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W W  K GLVTGG + S  GCQP   PPC    Y  +   C+    P  K 
Sbjct: 154 CHGGYPIKAWEWFKKHGLVTGGDYDSGEGCQPYRVPPCPLDEYGNNT--CR--GKPAEKN 209

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           H RCT   YG      K   + ++W  D        I K            D+ +Y    
Sbjct: 210 H-RCTRMCYGNQELDFK---EDHHWTRDAYYLTYTTIQK------------DVMAY---- 249

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
              GP+ A+  +Y D  +YKSGVY  + +A  +    VK++GWGEE G PY         
Sbjct: 250 ---GPIEASFDVYDDFPNYKSGVYMKTENASYLGGHAVKLIGWGEEYGVPY--------- 297

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                    W +V+++ +Q+GD+G  KILRG NE  I++   G 
Sbjct: 298 -------------------------WLLVNSWNDQWGDQGLFKILRGTNECGIDNSTTGG 332

Query: 242 LP 243
           +P
Sbjct: 333 VP 334


>gi|148229459|ref|NP_001079570.1| cathepsin B precursor [Xenopus laevis]
 gi|28277314|gb|AAH44689.1| MGC53360 protein [Xenopus laevis]
          Length = 333

 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 74/244 (30%), Positives = 103/244 (42%), Gaps = 60/244 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S  W +  + GLV+GG + S+ GC+P S PPC H +   S P CK      PKC
Sbjct: 150 CNGGYPSGAWQFWTETGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPACKGEEGDTPKC 208

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C  + Y   +  DK+     Y V     +I  EI KNGPV     +Y+D        
Sbjct: 209 VKQC-EEGYSPAYGTDKHFGTTSYGVPTSEKEIMAEIYKNGPVEGAFLVYADF------- 260

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
               P+            YKSGVY      E+  +A +KI+GWG ENG P          
Sbjct: 261 ----PL------------YKSGVYQHETGEELGGHA-IKILGWGVENGTP---------- 293

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW   +++   +GD G  KILRG++   IES +   
Sbjct: 294 ------------------------YWLCANSWNTDWGDNGFFKILRGKDHCGIESEIVAG 329

Query: 242 LPKD 245
           +PK+
Sbjct: 330 VPKN 333


>gi|348534156|ref|XP_003454569.1| PREDICTED: cathepsin B-like [Oreochromis niloticus]
          Length = 330

 Score =  101 bits (252), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 73/243 (30%), Positives = 101/243 (41%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S+ W +    GLV+GG + S+ GC+P +  PC H +   S P C       P+C
Sbjct: 148 CNGGYPSAAWDFWASEGLVSGGLYESHIGCRPYTIAPCEH-HVNGSRPPCTGEGGDTPEC 206

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C +  Y   + QDK+  K  Y V  +   IQ EI KNGPV     +Y D   YK+G 
Sbjct: 207 VRQCES-GYTPSYIQDKHYGKTSYSVPSDEQQIQTEIYKNGPVEGAFTVYEDFLLYKTGV 265

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                      VS SA  V    +K++GWGEENG P          
Sbjct: 266 YQH----------------------VSGSA--VGGHAIKVLGWGEENGTP---------- 291

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW   +++   +GD G  KILRG +   IES +   
Sbjct: 292 ------------------------YWLCANSWNTDWGDNGYFKILRGSDHCGIESEIVAG 327

Query: 242 LPK 244
           +PK
Sbjct: 328 IPK 330


>gi|308504233|ref|XP_003114300.1| hypothetical protein CRE_27039 [Caenorhabditis remanei]
 gi|308261685|gb|EFP05638.1| hypothetical protein CRE_27039 [Caenorhabditis remanei]
          Length = 351

 Score =  101 bits (252), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 73/243 (30%), Positives = 104/243 (42%), Gaps = 59/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G     W    K+G VTGG++   TGC+P  +PPC H    T    C +   P  KC
Sbjct: 167 CNGGYPIEAWRHYVKKGYVTGGSYQEKTGCKPYPYPPCEHHVNGTHYKPCPSNMYPTDKC 226

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QD +  +  Y V+ +V +IQ+EIM +GPV     +Y D F + SG 
Sbjct: 227 ERSC-QAGYALTYTQDLHFGQSAYAVSKKVTEIQKEIMTHGPVEVAFSVYED-FEHYSG- 283

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                                GVY  +A A +  +A VK++GWG +NG P          
Sbjct: 284 ---------------------GVYVHTAGASLGGHA-VKMLGWGVDNGTP---------- 311

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW   +++ E +G+ G  +I+RG NE  IES V G 
Sbjct: 312 ------------------------YWLCANSWNEDWGENGYFRIIRGVNECGIESGVVGG 347

Query: 242 LPK 244
           +PK
Sbjct: 348 IPK 350


>gi|195478432|ref|XP_002100515.1| GE16138 [Drosophila yakuba]
 gi|194188039|gb|EDX01623.1| GE16138 [Drosophila yakuba]
          Length = 340

 Score =  101 bits (252), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 72/243 (29%), Positives = 106/243 (43%), Gaps = 59/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   + W +  ++G+V+GG + SN GC+P    PC H    T  P     ATP  KC
Sbjct: 156 CNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPYEISPCEHHVNGTRPPCAHGGATP--KC 213

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C + +Y   + +DK+   + Y V   V DIQ+EIM NGPV     +Y D        
Sbjct: 214 SHVCQS-SYTVDYAKDKHFGSKSYSVRRNVRDIQEEIMTNGPVEGAFTVYED-------- 264

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          +  YK GVY      E+  +A ++I+GWG                
Sbjct: 265 ---------------LILYKDGVYQHEHGKELGGHA-IRILGWGV--------------- 293

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                           WG+E   PYW I +++   +GD+G  +ILRG++   IES ++  
Sbjct: 294 ----------------WGDEK-IPYWLIGNSWNTDWGDQGFFRILRGQDHCGIESSISAG 336

Query: 242 LPK 244
           LPK
Sbjct: 337 LPK 339


>gi|118358706|ref|XP_001012594.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89294361|gb|EAR92349.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 346

 Score =  101 bits (252), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 73/227 (32%), Positives = 99/227 (43%), Gaps = 60/227 (26%)

Query: 18  GLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRC-TNDNYGRGFFQ 76
           GLVTG  + +N+ CQ  +F PC H   +   P C T   P P C   C +N  +   + +
Sbjct: 177 GLVTGDLYGNNSWCQAYTFAPCAHHVTSDIYPPC-TGELPTPPCINSCDSNSTHTIPYSK 235

Query: 77  DKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSD 136
           D +R  + Y +  +   I  EI KNGP+   + +Y D                       
Sbjct: 236 DIHRGSKAYGIAKDEKAIMAEIYKNGPIEVALTVYED----------------------- 272

Query: 137 IFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLI 196
             +YK+GVY      E+  +A VK+VGWG ENG PYW                       
Sbjct: 273 FLTYKTGVYQHVTGDELGGHA-VKMVGWGVENGTPYW----------------------- 308

Query: 197 GWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
                      TIV+++ E +GDKGT KILRG+NE  IES    ALP
Sbjct: 309 -----------TIVNSWNESWGDKGTFKILRGKNECGIESSCVTALP 344


>gi|312271211|gb|ADQ57303.1| cathepsin B-like cysteine proteinase 1 [Angiostrongylus
           cantonensis]
          Length = 394

 Score =  101 bits (252), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 68/243 (27%), Positives = 95/243 (39%), Gaps = 58/243 (23%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +    G+VTG    +N GC+P  FPPC H +  T    C+    P PKC
Sbjct: 190 CEGGDPMFAWQYWVDHGIVTGSNFTANQGCKPYPFPPCEHHSNKTRFDPCRHDLYPTPKC 249

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C      + +  D++  +  Y V ++VA IQ+EI+ +GPV     +Y D   Y  G 
Sbjct: 250 SKKCVPSYKEKNYDDDRFYGRTAYGVKNDVAAIQKEILTHGPVEVAFEVYEDFLHYAGGI 309

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y                        V    ++     VK++GWG + G PYW I   +  
Sbjct: 310 Y------------------------VHTGGKLGGGHAVKLIGWGIDQGTPYWLIANSWNT 345

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                           WGEE                   G  +ILRG +E  IES V G 
Sbjct: 346 D---------------WGEE-------------------GFFRILRGVDECGIESGVVGG 371

Query: 242 LPK 244
           +PK
Sbjct: 372 IPK 374


>gi|209863077|ref|NP_001119612.2| cathepsin B-912 precursor [Acyrthosiphon pisum]
          Length = 342

 Score =  101 bits (252), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 74/246 (30%), Positives = 105/246 (42%), Gaps = 69/246 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP-K 60
           C  G     W +  + G+VTGG + S  GC P   PPC       SE +       QP +
Sbjct: 159 CHGGYPIKAWSYFRRHGIVTGGDYQSGEGCAPYRVPPC------FSEEDGNNTCRGQPME 212

Query: 61  CHTRCTNDNYGRG--FFQDKYRFKR-YYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
            H RCT   YG     + D +RF R YY++    A IQ+++M  GP+ A+M +Y      
Sbjct: 213 KHHRCTRMCYGDQEIDYDDDHRFTRDYYYLT--YASIQKDVMTYGPIEASMEVYD----- 265

Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
                             D  SYKSGVY  S +A  +    VK++GWGEE+G PY     
Sbjct: 266 ------------------DFPSYKSGVYEKSENATYLGGHAVKLIGWGEEDGVPY----- 302

Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
                                        W +V+++ E +GDKG  KI RG NE  +++ 
Sbjct: 303 -----------------------------WLMVNSWSEMWGDKGLFKIRRGTNECSVDNS 333

Query: 238 VNGALP 243
           +   +P
Sbjct: 334 MTAGVP 339


>gi|351695295|gb|EHA98213.1| Cathepsin B [Heterocephalus glaber]
          Length = 340

 Score =  101 bits (252), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 70/251 (27%), Positives = 105/251 (41%), Gaps = 61/251 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S+ W +  K+GLV+GG + S+ GC+P S PPC H +   + P+C       PKC
Sbjct: 150 CNGGYPSAAWKYWTKKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGTRPQCTGEGGDTPKC 208

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + +DK+     Y V+    +I  EI KNGPV     ++SD   YK+G 
Sbjct: 209 SKTC-EPGYSPSYKEDKHFGYDSYSVSSNEKEIMAEIYKNGPVEGAFTVFSDFLMYKTGV 267

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         + E++    ++I+GWG+ENG PYW +   + V
Sbjct: 268 YKH------------------------LAGEMLGGHAIRILGWGKENGVPYWLVGNSWNV 303

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                              +GD G  KI+RG +   IES +   
Sbjct: 304 ----------------------------------DWGDSGFFKIVRGEDHCGIESEIVAG 329

Query: 242 LPK-DNYGVEF 251
           +P+ D Y   F
Sbjct: 330 IPRTDQYWGRF 340


>gi|74179506|dbj|BAE44111.1| cathepsin B preproprotein [Cyprinus carpio]
          Length = 330

 Score =  101 bits (252), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 71/243 (29%), Positives = 97/243 (39%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S+ W +    GLVTGG ++S+ GC+P +  PC H +   S P C       P C
Sbjct: 148 CNGGYPSAAWDFWSSDGLVTGGLYNSHIGCRPYTIEPCEH-HVNGSRPPCTGEGGDTPNC 206

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+  K  Y V     DI +E+ KNGPV     +Y D  SYKSG 
Sbjct: 207 DMSC-EPGYSPSYKQDKHFGKTSYSVPSNQKDIMKELYKNGPVEGAFTVYEDFLSYKSGV 265

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         S   +    +KI+GWGEENG P          
Sbjct: 266 YQH------------------------VSGPALGGHAIKILGWGEENGVP---------- 291

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW   +++   +GD G  KILRG +   IES +   
Sbjct: 292 ------------------------YWLAANSWNTDWGDNGYFKILRGEDHCGIESEIVAG 327

Query: 242 LPK 244
           +P+
Sbjct: 328 IPQ 330


>gi|343197337|pdb|3QSD|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
           Complex With Ca074 Inhibitor
 gi|343197588|pdb|3S3Q|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
           Complex With K11017 Inhibitor
 gi|343197589|pdb|3S3R|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
           Complex With K11777 Inhibitor
 gi|343197590|pdb|3S3R|B Chain B, Structure Of Cathepsin B1 From Schistosoma Mansoni In
           Complex With K11777 Inhibitor
 gi|343197591|pdb|3S3R|C Chain C, Structure Of Cathepsin B1 From Schistosoma Mansoni In
           Complex With K11777 Inhibitor
          Length = 254

 Score =  101 bits (252), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 70/237 (29%), Positives = 100/237 (42%), Gaps = 60/237 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  GI    W +  K G+VTG +  ++ GC+P  FP C H +     P C +     P+C
Sbjct: 72  CEGGILGPAWDYWVKEGIVTGSSKENHAGCEPYPFPKCEH-HTKGKYPPCGSKIYKTPRC 130

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+R K  Y V ++   IQ+EIMK GPV A   +Y D  +YKSG 
Sbjct: 131 KQTC-QKKYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEAGFTVYEDFLNYKSG- 188

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                      +Y  I            + E +    ++I+GWG EN  PY         
Sbjct: 189 -----------IYKHI------------TGETLGGHAIRIIGWGVENKAPY--------- 216

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                    W I +++ E +G+ G  +I+RGR+E  IES V
Sbjct: 217 -------------------------WLIANSWNEDWGENGYFRIVRGRDECSIESEV 248


>gi|116177489|gb|ABJ80691.1| cathepsin B [Hippoglossus hippoglossus]
          Length = 330

 Score =  101 bits (252), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 71/243 (29%), Positives = 102/243 (41%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S+ W +  K GLV+GG ++S+ GC+P + PPC H +   S P C       PKC
Sbjct: 148 CNGGYPSAAWDFWTKEGLVSGGLYNSHIGCRPYTIPPCEH-HVNGSRPHCSGEGGDTPKC 206

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + +DK+  K  Y V   V  IQ EI +NGPV     +Y D        
Sbjct: 207 VHSC-EAGYSPTYTKDKHYGKSSYSVEASVEQIQAEISQNGPVEGAFIVYED-------- 257

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                             YKSGVY  +  + +  +A +K++GWGEE+G P          
Sbjct: 258 ---------------FVMYKSGVYQHTTGSALGGHA-IKVLGWGEEDGVP---------- 291

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW   +++   +G+ G  KILRG +   IES +   
Sbjct: 292 ------------------------YWLCANSWNTDWGENGFFKILRGSDHCGIESEIVAG 327

Query: 242 LPK 244
           +PK
Sbjct: 328 IPK 330


>gi|321452279|gb|EFX63703.1| hypothetical protein DAPPUDRAFT_306608 [Daphnia pulex]
          Length = 340

 Score =  101 bits (251), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 73/243 (30%), Positives = 101/243 (41%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   + W    K+G+VTGG  +S+ GCQP   P C H + T   P C       PKC
Sbjct: 158 CNGGFPGAAWSHWVKKGIVTGGNFNSSQGCQPYIIPACEH-HTTGDRPPCSE-GGGTPKC 215

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C  D Y   + QD +     Y V+  + DIQ EIM NGPV   + +Y D  +YKSG 
Sbjct: 216 LKTC-EDGYTVDYTQDLHYGASSYSVHKRMEDIQLEIMNNGPVEGALTVYEDFPTYKSG- 273

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                      +Y  +     G +A            ++I+GWG E G PY         
Sbjct: 274 -----------VYQHVHGKALGGHA------------IRILGWGVEEGVPY--------- 301

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                    W I +++   +GD G IK+LRG++   IES +   
Sbjct: 302 -------------------------WLIANSWNTDWGDNGYIKLLRGKDHCGIESQITAG 336

Query: 242 LPK 244
           LPK
Sbjct: 337 LPK 339


>gi|154761391|gb|ABS85545.1| cathepsin B preproprotein [Biomphalaria glabrata]
          Length = 333

 Score =  101 bits (251), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 70/242 (28%), Positives = 107/242 (44%), Gaps = 62/242 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  ++ W W    G+V+GG + +N GC P S P C+H  +TT + +      P PKC
Sbjct: 154 CNGGYPAAAWEWYVDTGVVSGGQYGTNEGCMPYSLPHCDH--HTTGKYQPCPAVVPTPKC 211

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C    Y + +  DK R K+ Y V   V  I QE++ NGPV A   +YSD  SYK+G 
Sbjct: 212 EKKCLT-GYPKSYSNDKTRGKKSYGVRG-VQSIMQELVDNGPVTAAFDVYSDFLSYKTG- 268

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          ++ + +G Y    +        VKI+G+G E+G            
Sbjct: 269 ---------------VYRHTTGSYEGGHA--------VKIIGYGTESG------------ 293

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                 + YW + +++ E +GDKG  KI +G++E  IES +   
Sbjct: 294 ----------------------QDYWLVANSWNEDWGDKGFFKIAKGKDECGIESSIVAG 331

Query: 242 LP 243
            P
Sbjct: 332 DP 333


>gi|1311050|pdb|1CPJ|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B-Inhibitor Complex: Implications For
           Structure- Based Inhibitor Design
 gi|1311051|pdb|1CPJ|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B-Inhibitor Complex: Implications For
           Structure- Based Inhibitor Design
 gi|1421561|pdb|1THE|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B- Inhibitor Complex: Implications For
           Structure-Based Inhibitor Design
 gi|1421562|pdb|1THE|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B- Inhibitor Complex: Implications For
           Structure-Based Inhibitor Design
          Length = 260

 Score =  101 bits (251), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 68/243 (27%), Positives = 107/243 (44%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S  W +  ++GLV+GG ++S+ GC P + PPC H +   + P C T     PKC
Sbjct: 77  CNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH-HVNGARPPC-TGEGDTPKC 134

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +  C    Y   + +DK+     Y V+D   +I  EI KNGPV     ++SD  +YKSG 
Sbjct: 135 NKMC-EAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSDFLTYKSG- 192

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          ++ +++G        +++    ++I+GWG ENG P          
Sbjct: 193 ---------------VYKHEAG--------DVMGGHAIRILGWGIENGVP---------- 219

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW + +++   +GD G  KILRG N   IES +   
Sbjct: 220 ------------------------YWLVANSWNADWGDNGFFKILRGENHCGIESEIVAG 255

Query: 242 LPK 244
           +P+
Sbjct: 256 IPR 258


>gi|444525951|gb|ELV14228.1| Cathepsin B [Tupaia chinensis]
          Length = 339

 Score =  101 bits (251), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 70/239 (29%), Positives = 104/239 (43%), Gaps = 61/239 (25%)

Query: 8   SSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 67
           S+ W +  K+GLV+GG + S+ GC+P S PPC H +   S P C T     PKC   C  
Sbjct: 156 SAAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKCSKSC-E 212

Query: 68  DNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPV 127
             Y   + +DK+    Y + +  V  I++EIM                      Y NGPV
Sbjct: 213 PGYSSSYKEDKH----YGYSSYSVPGIEKEIMAE-------------------IYKNGPV 249

Query: 128 VANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEI 187
                +YSD   YKSGVY                                      + E+
Sbjct: 250 EGAFSVYSDFLLYKSGVYQ-----------------------------------HVTGEM 274

Query: 188 VAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDN 246
           +    ++++GWG ENG PYW + +++   +GD G  KILRG++   IES +   +P+ +
Sbjct: 275 MGGHAIRILGWGTENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEIVAGIPRTD 333


>gi|118153|sp|P25792.1|CYSP_SCHMA RecName: Full=Cathepsin B-like cysteine proteinase; AltName:
           Full=Antigen Sm31; Flags: Precursor
 gi|160950|gb|AAA29865.1| cathepsin B [Schistosoma mansoni]
          Length = 340

 Score =  101 bits (251), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 69/237 (29%), Positives = 100/237 (42%), Gaps = 60/237 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  GI    W +  K G+VT  +  ++TGC+P  FP C H +     P C +     P+C
Sbjct: 158 CEGGILGPAWDYWVKEGIVTASSKENHTGCEPYPFPKCEH-HTKGKYPPCGSKIYNTPRC 216

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+R K  Y V ++   IQ+EIMK GPV A+  +Y D  +YKSG 
Sbjct: 217 KQTCQR-KYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEASFTVYEDFLNYKSGI 275

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         + E +    ++I+GWG EN  PY         
Sbjct: 276 YKH------------------------ITGEALGGHAIRIIGWGVENKTPY--------- 302

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                    W I +++ E +G+ G  +I+RGR+E  IES V
Sbjct: 303 -------------------------WLIANSWNEDWGENGYFRIVRGRDECSIESEV 334


>gi|1127275|pdb|1CTE|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B-Inhibitor Complex: Implications For
           Structure- Based Inhibitor Design
 gi|1127276|pdb|1CTE|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
           Cathepsin B-Inhibitor Complex: Implications For
           Structure- Based Inhibitor Design
          Length = 254

 Score =  101 bits (251), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 68/243 (27%), Positives = 107/243 (44%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S  W +  ++GLV+GG ++S+ GC P + PPC H +   + P C T     PKC
Sbjct: 71  CNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH-HVNGARPPC-TGEGDTPKC 128

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +  C    Y   + +DK+     Y V+D   +I  EI KNGPV     ++SD  +YKSG 
Sbjct: 129 NKMC-EAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSDFLTYKSG- 186

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          ++ +++G        +++    ++I+GWG ENG P          
Sbjct: 187 ---------------VYKHEAG--------DVMGGHAIRILGWGIENGVP---------- 213

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW + +++   +GD G  KILRG N   IES +   
Sbjct: 214 ------------------------YWLVANSWNADWGDNGFFKILRGENHCGIESEIVAG 249

Query: 242 LPK 244
           +P+
Sbjct: 250 IPR 252


>gi|325302582|dbj|BAJ83491.1| cathepsin B-like peptidase [Echinococcus multilocularis]
          Length = 338

 Score =  100 bits (250), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 72/243 (29%), Positives = 104/243 (42%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G+  + W +    G+V+GG + S+ GC+P   PPC H + + + P+CK   +  PKC
Sbjct: 154 CNGGLPENAWRYWAIDGIVSGGLYGSHVGCRPYEIPPCEH-HTSGNRPDCKG-NSKTPKC 211

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C     G+ +  DK+     Y V     DI  EI+  GPV A+  +Y+D  +YKSG 
Sbjct: 212 QRQCVESFDGK-YQADKHFASNVYNVRASEEDIMNEILVYGPVEADFIVYADFLTYKSGV 270

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                K G     A         VKI+GWGEENG P          
Sbjct: 271 YQH---------------VKGGFLGGHA---------VKILGWGEENGVP---------- 296

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW   +++   +GD G  KILRG N   IE+ +N  
Sbjct: 297 ------------------------YWLCANSWNTDWGDGGFFKILRGYNHCKIEADINAG 332

Query: 242 LPK 244
           +PK
Sbjct: 333 IPK 335


>gi|6681079|ref|NP_031824.1| cathepsin B preproprotein [Mus musculus]
 gi|115712|sp|P10605.2|CATB_MOUSE RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Contains:
           RecName: Full=Cathepsin B light chain; Contains:
           RecName: Full=Cathepsin B heavy chain; Flags: Precursor
 gi|239907|gb|AAB20536.1| preprocathepsin B [Mus sp.]
 gi|309152|gb|AAA37375.1| cathepsin B [Mus musculus]
 gi|13879360|gb|AAH06656.1| Cathepsin B [Mus musculus]
 gi|26350521|dbj|BAC38900.1| unnamed protein product [Mus musculus]
 gi|74180941|dbj|BAE27751.1| unnamed protein product [Mus musculus]
 gi|74191261|dbj|BAE39458.1| unnamed protein product [Mus musculus]
 gi|74198944|dbj|BAE30691.1| unnamed protein product [Mus musculus]
 gi|74208073|dbj|BAE29144.1| unnamed protein product [Mus musculus]
 gi|148704123|gb|EDL36070.1| cathepsin B, isoform CRA_a [Mus musculus]
          Length = 339

 Score =  100 bits (250), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 72/251 (28%), Positives = 109/251 (43%), Gaps = 62/251 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S  W +  K+GLV+GG ++S+ GC P + PPC H +   S P C T     P+C
Sbjct: 150 CNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEH-HVNGSRPPC-TGEGDTPRC 207

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +  C    Y   + +DK+     Y V++ V +I  EI KNGPV     ++SD  +YKSG 
Sbjct: 208 NKSC-EAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNGPVEGAFTVFSDFLTYKSG- 265

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          ++ +++G        +++    ++I+GWG ENG PYW     + +
Sbjct: 266 ---------------VYKHEAG--------DMMGGHAIRILGWGVENGVPYWLAANSWNL 302

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                              +GD G  KILRG N   IES +   
Sbjct: 303 ----------------------------------DWGDNGFFKILRGENHCGIESEIVAG 328

Query: 242 LPK-DNYGVEF 251
           +P+ D Y   F
Sbjct: 329 IPRTDQYWGRF 339


>gi|148222779|ref|NP_001080410.1| uncharacterized protein LOC380102 precursor [Xenopus laevis]
 gi|28302291|gb|AAH46667.1| Cg10992 protein [Xenopus laevis]
          Length = 333

 Score =  100 bits (250), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 74/244 (30%), Positives = 102/244 (41%), Gaps = 60/244 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S  W +  + GLV+GG + S+ GC+P S PPC H +   S P CK      PKC
Sbjct: 150 CNGGYPSGAWRFWTETGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPSCKGEEGDTPKC 208

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C  + Y   +  DK+     Y V     +I  +I KNGPV     +Y+D        
Sbjct: 209 MKTC-EEGYTPAYGSDKHFGATSYGVPSSEKEIMADIYKNGPVEGAFVVYADF------- 260

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
               P+            YKSGVY      E+  +A +KI+GWG ENG P          
Sbjct: 261 ----PL------------YKSGVYQHETGEELGGHA-IKILGWGVENGTP---------- 293

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW   +++   +GD G  KILRG++   IES V   
Sbjct: 294 ------------------------YWLCANSWNTDWGDNGFFKILRGKDHCGIESEVVAG 329

Query: 242 LPKD 245
           +PK+
Sbjct: 330 IPKN 333


>gi|1942645|pdb|1MIR|A Chain A, Rat Procathepsin B
 gi|1942646|pdb|1MIR|B Chain B, Rat Procathepsin B
          Length = 322

 Score =  100 bits (250), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 68/243 (27%), Positives = 107/243 (44%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S  W +  ++GLV+GG ++S+ GC P + PPC H +   + P C T     PKC
Sbjct: 133 CNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH-HVNGARPPC-TGEGDTPKC 190

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +  C    Y   + +DK+     Y V+D   +I  EI KNGPV     ++SD  +YKSG 
Sbjct: 191 NKMC-EAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSDFLTYKSG- 248

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          ++ +++G        +++    ++I+GWG ENG P          
Sbjct: 249 ---------------VYKHEAG--------DVMGGHAIRILGWGIENGVP---------- 275

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW + +++   +GD G  KILRG N   IES +   
Sbjct: 276 ------------------------YWLVANSWNADWGDNGFFKILRGENHCGIESEIVAG 311

Query: 242 LPK 244
           +P+
Sbjct: 312 IPR 314


>gi|56759504|gb|AAW27892.1| unknown [Schistosoma japonicum]
          Length = 279

 Score =  100 bits (250), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 72/243 (29%), Positives = 100/243 (41%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +  KRG+VTGG+  ++TGCQP  FP C H +     P C T     P+C
Sbjct: 96  CQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEH-HTKGKYPACGTKIYKTPQC 154

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+  +  Y V +    IQ++IM  GPV A   +Y D  +YKSG 
Sbjct: 155 KQTCQK-GYKTPYEQDKHYGEESYNVQNNEKVIQRDIMMYGPVEAAFDVYEDFLNYKSGI 213

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         +  IV    ++I+GWG E   PY         
Sbjct: 214 YRH------------------------VTGSIVGGHAIRIIGWGVEKRTPY--------- 240

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                    W I +++ E +G+KG  +I+RGR+E  IES V   
Sbjct: 241 -------------------------WLIANSWNEDWGEKGLFRIVRGRDECSIESNVVAG 275

Query: 242 LPK 244
           L K
Sbjct: 276 LIK 278


>gi|56758040|gb|AAW27160.1| unknown [Schistosoma japonicum]
          Length = 216

 Score =  100 bits (250), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 72/243 (29%), Positives = 98/243 (40%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +   +G+VTGG+  ++TGCQP  FP C H +     P C T     P+C
Sbjct: 33  CQGGFPGQAWDYWVTQGIVTGGSKENHTGCQPYPFPKCEH-HTKGKYPACGTKIYKTPQC 91

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+     Y V      IQ+EIM NGPV A   +Y D  +YKSG 
Sbjct: 92  KQTC-QKGYKTPYEQDKHYGDESYNVISNEKAIQKEIMMNGPVEAAFDVYEDFLNYKSGI 150

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         +  IV    ++I+GWG E   P          
Sbjct: 151 YRH------------------------VTGSIVGGHAIRIIGWGVEKRTP---------- 176

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW I +++ E +G+KG  +I+RGR+E  IES V   
Sbjct: 177 ------------------------YWLIANSWNEDWGEKGLFRIVRGRDECSIESHVVAG 212

Query: 242 LPK 244
           L K
Sbjct: 213 LIK 215


>gi|327281751|ref|XP_003225610.1| PREDICTED: cathepsin B-like [Anolis carolinensis]
          Length = 330

 Score =  100 bits (249), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 71/243 (29%), Positives = 98/243 (40%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S  W +  ++GLV+GG + S+ GC+P S PPC H    T  P C       P+C
Sbjct: 140 CNGGYPSGAWKYWTEKGLVSGGLYDSHVGCRPYSIPPCEHHTNGT-RPPCSGEGGETPEC 198

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C  D Y   + QDK+     Y +     +I  EI KNGPV     +YSD   YKSG 
Sbjct: 199 VKKC-EDGYTPAYKQDKHYGVTSYGIPRSEKEIMAEIYKNGPVEGAFVVYSDFLMYKSGV 257

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         S E V    ++I+GWG +NG PYW     +  
Sbjct: 258 YQH------------------------VSGEEVGGHAIRILGWGVDNGTPYWLAANSWNT 293

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                           WGE+                   G  +ILRG++   IES +   
Sbjct: 294 D---------------WGED-------------------GFFRILRGQDHCGIESEIVAG 319

Query: 242 LPK 244
           +PK
Sbjct: 320 IPK 322


>gi|226472810|emb|CAX71091.1| cathepsin B [Schistosoma japonicum]
          Length = 348

 Score =  100 bits (249), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 69/243 (28%), Positives = 101/243 (41%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   S W++   +G+VTG  +++  GCQP  FPPC H +     P C       P C
Sbjct: 164 CNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPYEFPPCEH-HTLGPLPVCDG-DVETPPC 221

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   +  DK+  K  Y V      I +E+M++GPV  +  +Y+D  +YKSG 
Sbjct: 222 KRTC-QAGYNVSYENDKWYGKVVYRVKSNQEAIMKELMQHGPVEVDFEVYADFPNYKSGV 280

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         S  ++    V+++GWGEEN  P          
Sbjct: 281 YQH------------------------VSGALLGGHAVRLLGWGEENNVP---------- 306

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW I +++   +GD G  KI+RG+NE  IES VN  
Sbjct: 307 ------------------------YWLIANSWNTDWGDNGYFKIIRGKNECGIESDVNAG 342

Query: 242 LPK 244
           +PK
Sbjct: 343 IPK 345


>gi|27526823|emb|CAD32937.1| pro-cathepsin B2 [Fasciola hepatica]
          Length = 337

 Score =  100 bits (249), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 73/243 (30%), Positives = 98/243 (40%), Gaps = 59/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   + W +  + G+VTGG   + TGC P  FP C H    +    C     P P C
Sbjct: 145 CQGGSPPAAWDYWWRNGIVTGGTLENPTGCLPYPFPQCRHPGSRSQLNPCPRYTYPTPSC 204

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +  C    Y + + +DK   K  Y V+     I +EIMKNGPV A   +Y+D   YKSG 
Sbjct: 205 YPYC-QAGYDKTYEKDKVYGKTSYNVDRHEYTIMEEIMKNGPVEAGFIVYTDFAVYKSG- 262

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ + SG YA            ++I+GWG ENG  YW     + V
Sbjct: 263 ---------------IYHHVSGRYA--------GKHAIRIIGWGVENGVKYWLTANSWNV 299

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                          GWGE                    G  +ILRG +E  IES+V   
Sbjct: 300 ---------------GWGE-------------------NGYFRILRGTDECRIESIVVAG 325

Query: 242 LPK 244
           +P+
Sbjct: 326 MPR 328


>gi|226471002|emb|CAX70582.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  100 bits (249), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 71/230 (30%), Positives = 96/230 (41%), Gaps = 61/230 (26%)

Query: 13  WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGR 72
           WV KRG+VTGG+  ++TGCQP  FP C H       P C T     P+C   C    Y  
Sbjct: 171 WV-KRGIVTGGSEENHTGCQPYPFPKCEHLT-KGKYPACGTKIYKTPQCKQTCQK-GYKT 227

Query: 73  GFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMY 132
            + QDK+     Y V      IQ+EIM  GPV A   +Y D  +YKSG Y +        
Sbjct: 228 PYKQDKHYGDESYNVISNEKAIQKEIMMYGPVEAAFDVYEDFLNYKSGIYRH-------- 279

Query: 133 LYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYAT 192
                            +  IV    ++I+GWG E G+PY                    
Sbjct: 280 ----------------VTGSIVGGHAIRIIGWGVEKGKPY-------------------- 303

Query: 193 VKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 242
                         W I +++ E +G+KG  +++RGR+E  IES V   L
Sbjct: 304 --------------WLIANSWNEDWGEKGLFRMVRGRDECSIESHVVAGL 339


>gi|226472800|emb|CAX71086.1| cathepsin B [Schistosoma japonicum]
 gi|226472804|emb|CAX71088.1| cathepsin B [Schistosoma japonicum]
          Length = 348

 Score =  100 bits (249), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 69/243 (28%), Positives = 101/243 (41%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   S W++   +G+VTG  +++  GCQP  FPPC H +     P C       P C
Sbjct: 164 CNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPYEFPPCEH-HTLGPLPVCDG-DVETPPC 221

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   +  DK+  K  Y V      I +E+M++GPV  +  +Y+D  +YKSG 
Sbjct: 222 KRTC-QAGYNVSYENDKWYGKVVYRVKSNQEAIMKELMQHGPVEVDFEVYADFPNYKSGV 280

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         S  ++    V+++GWGEEN  P          
Sbjct: 281 YQH------------------------VSGALLGGHAVRLLGWGEENNVP---------- 306

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW I +++   +GD G  KI+RG+NE  IES VN  
Sbjct: 307 ------------------------YWLIANSWNTDWGDNGYFKIIRGKNECGIESDVNAG 342

Query: 242 LPK 244
           +PK
Sbjct: 343 IPK 345


>gi|30995341|gb|AAO59414.2| cathepsin B endopeptidase [Schistosoma japonicum]
 gi|226472794|emb|CAX71083.1| cathepsin B [Schistosoma japonicum]
 gi|226472796|emb|CAX71084.1| cathepsin B [Schistosoma japonicum]
 gi|226472798|emb|CAX71085.1| cathepsin B [Schistosoma japonicum]
 gi|226472802|emb|CAX71087.1| cathepsin B [Schistosoma japonicum]
 gi|226472806|emb|CAX71089.1| cathepsin B [Schistosoma japonicum]
          Length = 348

 Score =  100 bits (249), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 69/243 (28%), Positives = 101/243 (41%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   S W++   +G+VTG  +++  GCQP  FPPC H +     P C       P C
Sbjct: 164 CNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPYEFPPCEH-HTLGPLPVCDG-DVETPPC 221

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   +  DK+  K  Y V      I +E+M++GPV  +  +Y+D  +YKSG 
Sbjct: 222 KRTC-QAGYNVSYENDKWYGKVVYRVKSNQEAIMKELMQHGPVEVDFEVYADFPNYKSGV 280

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         S  ++    V+++GWGEEN  P          
Sbjct: 281 YQH------------------------VSGALLGGHAVRLLGWGEENNVP---------- 306

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW I +++   +GD G  KI+RG+NE  IES VN  
Sbjct: 307 ------------------------YWLIANSWNTDWGDNGYFKIIRGKNECGIESDVNAG 342

Query: 242 LPK 244
           +PK
Sbjct: 343 IPK 345


>gi|45822203|emb|CAE47498.1| cathepsin B-like proteinase [Diabrotica virgifera virgifera]
          Length = 328

 Score =  100 bits (249), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 66/243 (27%), Positives = 100/243 (41%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   + W +  ++GLV+GG + +  GC+P   PPC H +   S P C       PKC
Sbjct: 146 CNGGYPGAAWHYWVRKGLVSGGQYGTKQGCRPYEIPPCEH-HTNGSRPACDASEGNTPKC 204

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C + NY   +  D +   + Y ++ +V  IQ EI++NGPV     +Y+D  +YK+G 
Sbjct: 205 AKSCES-NYKINYSNDLHFGSKAYSISSDVKQIQAEILQNGPVEGAFSVYADFVNYKTGV 263

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                           + +    ++I GWG EN  PY         
Sbjct: 264 YQH------------------------IKGQFLGGHAIRIFGWGVENNTPY--------- 290

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                    W I +++   +GD GT KILRG +   IES +   
Sbjct: 291 -------------------------WLIANSWNTDWGDSGTFKILRGSDHCGIESGIVAG 325

Query: 242 LPK 244
           LPK
Sbjct: 326 LPK 328


>gi|195393194|ref|XP_002055239.1| GJ19262 [Drosophila virilis]
 gi|194149749|gb|EDW65440.1| GJ19262 [Drosophila virilis]
          Length = 338

 Score =  100 bits (248), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 71/244 (29%), Positives = 109/244 (44%), Gaps = 62/244 (25%)

Query: 2   CSSGISSSTWV-WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 60
           C+ G   + W  W HK G+V+GG++ S  GC+P    PC H +   + P C + +TP  +
Sbjct: 155 CNGGFPGAAWSYWTHK-GIVSGGSYGSKEGCRPYEVEPCEH-HVNGTRPPCHSGSTP--R 210

Query: 61  CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           C  +C +  Y   + +DK+   + Y VN    DIQ+EIM NGPV     +Y D+  YK+G
Sbjct: 211 CMHKCES-GYSVDYAKDKHFGAKAYSVNRNPLDIQREIMTNGPVEGAFTVYEDLILYKTG 269

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
                       +Y  +   + G +A            ++I+GWG               
Sbjct: 270 ------------VYQHVHGRQLGGHA------------IRILGWGV-------------- 291

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                            WG +N  PYW I +++   +GD G  +ILRG +   IES ++ 
Sbjct: 292 -----------------WG-DNKVPYWLIGNSWNTDWGDNGFFRILRGEDHCGIESAISA 333

Query: 241 ALPK 244
            LPK
Sbjct: 334 GLPK 337


>gi|426220597|ref|XP_004004501.1| PREDICTED: cathepsin B [Ovis aries]
          Length = 335

 Score =  100 bits (248), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 74/242 (30%), Positives = 102/242 (42%), Gaps = 61/242 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S  W +  K+GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   +  DK+     Y V+    +I  EI KNGPV     +YSD   YKSG 
Sbjct: 208 SKIC-EPGYSPSYKDDKHFGCSSYSVSSNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGV 266

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         S E++    ++I+GWG EN  PYW        
Sbjct: 267 YQH------------------------VSGEMMGGHAIRILGWGVENDTPYW-------- 294

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                        L+G             +++   +GDKG  KILRG++   IES +   
Sbjct: 295 -------------LVG-------------NSWNTDWGDKGFFKILRGQDHCGIESEIVAG 328

Query: 242 LP 243
           +P
Sbjct: 329 MP 330


>gi|379067374|gb|AFC90100.1| cathepsin B [Capra hircus]
          Length = 335

 Score =  100 bits (248), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 74/242 (30%), Positives = 102/242 (42%), Gaps = 61/242 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S  W +  K+GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   +  DK+     Y V+    +I  EI KNGPV     +YSD   YKSG 
Sbjct: 208 SKIC-EPGYSPSYKDDKHFGCSSYSVSSNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGV 266

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         S E++    ++I+GWG EN  PYW        
Sbjct: 267 YQH------------------------VSGEMMGGHAIRILGWGVENDTPYW-------- 294

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                        L+G             +++   +GDKG  KILRG++   IES +   
Sbjct: 295 -------------LVG-------------NSWNTDWGDKGFFKILRGQDHCGIESEIVAG 328

Query: 242 LP 243
           +P
Sbjct: 329 MP 330


>gi|241998314|ref|XP_002433800.1| longipain, putative [Ixodes scapularis]
 gi|215495559|gb|EEC05200.1| longipain, putative [Ixodes scapularis]
          Length = 339

 Score =  100 bits (248), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 71/243 (29%), Positives = 100/243 (41%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   + W +  ++G+VTGG + ++ GC P   P C+H    T  P  +    P PKC
Sbjct: 157 CNGGFPGAAWSYWVEKGIVTGGNYDTDEGCMPYPVPSCDHHVNGTLGPCGQD--PPTPKC 214

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             R     Y   F  DK+  K  Y V+     IQ EIMKNGPV     +Y+D   YKSG 
Sbjct: 215 -VRLCRKGYNIDFKDDKHYGKSSYSVSSNETQIQMEIMKNGPVEGAFTVYADFPLYKSGV 273

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y                         S S + +    ++I+GWG ENG P          
Sbjct: 274 YK------------------------SHSTDALGGHAIRILGWGVENGVP---------- 299

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   +W + +++  ++GDKG  KILRG NE  IE  +   
Sbjct: 300 ------------------------FWLVANSWNTEWGDKGYFKILRGSNECGIEEDIVAG 335

Query: 242 LPK 244
           +PK
Sbjct: 336 IPK 338


>gi|145498570|ref|XP_001435272.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124402403|emb|CAK67875.1| unnamed protein product [Paramecium tetraurelia]
          Length = 325

 Score =  100 bits (248), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 73/234 (31%), Positives = 97/234 (41%), Gaps = 60/234 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S  W +   +GLVTG     N+ C+P +FPPC+H         C   + P P C
Sbjct: 143 CNGGFPSGAWNYFKNKGLVTGDLFGDNSWCRPYTFPPCDHHVDDGKYGPCGD-SQPTPAC 201

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              CT  + GR +  DK R    Y V+ +V  IQ EIM  GPV A+  +Y D  +YKSG 
Sbjct: 202 VKSCTAQS-GRNYDSDKIRSIDSYSVSSKVEQIQNEIMTFGPVEASFTVYEDFLTYKSGV 260

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y N                        A A +  +A VKI+GWG E   PY         
Sbjct: 261 YQN-----------------------VAGANLGGHA-VKIIGWGVEKNVPY--------- 287

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
                                    W +V+++ E +G+ G  KILRG N   IE
Sbjct: 288 -------------------------WLVVNSWNEGWGENGLFKILRGSNHVGIE 316


>gi|125981197|ref|XP_001354605.1| GA10694 [Drosophila pseudoobscura pseudoobscura]
 gi|54642915|gb|EAL31659.1| GA10694 [Drosophila pseudoobscura pseudoobscura]
          Length = 338

 Score =  100 bits (248), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 71/243 (29%), Positives = 105/243 (43%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   + W +  ++G+V+GG + S  GC+P    PC H +   + P C   +TP   C
Sbjct: 155 CNGGFPGAAWSYWTRKGIVSGGPYGSTQGCRPYEIAPCEH-HVNGTRPPCSHGSTPS--C 211

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C   +Y   + +DK    + Y V   VA+IQQEIM NGPV     +Y D        
Sbjct: 212 QHKC-QASYSVEYAKDKNFGSKSYSVRRNVAEIQQEIMTNGPVEGAFTVYED-------- 262

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          +  YKSGVY      E+  +A ++I+GWG                
Sbjct: 263 ---------------LILYKSGVYQHEHGKELGGHA-IRILGWGV--------------- 291

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                           WGE    PYW I +++   +GD G  +ILRG++   IES ++  
Sbjct: 292 ----------------WGESK-VPYWLIGNSWNTDWGDNGFFRILRGQDHCGIESSISAG 334

Query: 242 LPK 244
           LPK
Sbjct: 335 LPK 337


>gi|226471008|emb|CAX70585.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  100 bits (248), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 73/231 (31%), Positives = 101/231 (43%), Gaps = 63/231 (27%)

Query: 13  WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGR 72
           WV KRG+VTGG+  ++TGCQP  FP C H       P C T     P+C   C    Y  
Sbjct: 171 WV-KRGIVTGGSEENHTGCQPYPFPKCEHLT-KGKYPACGTKIYKTPQCKQTCQK-GYKT 227

Query: 73  GFFQDK-YRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANM 131
            + QDK Y  +RY  +++E A IQ+EIM  GPV A   +Y D  +YKSG Y +       
Sbjct: 228 PYEQDKHYGDQRYNVISNEKA-IQREIMMYGPVEAAFDVYEDFLNYKSGIYRH------- 279

Query: 132 YLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYA 191
                             +  IV    ++I+GWG E G+PY                   
Sbjct: 280 -----------------VAGSIVGGHAIRIIGWGVEKGKPY------------------- 303

Query: 192 TVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 242
                          W I +++ E +G+ G  +++RGR+E  IES V   L
Sbjct: 304 ---------------WLIANSWNEDWGENGLFRMVRGRDECSIESHVVAGL 339


>gi|56758658|gb|AAW27469.1| unknown [Schistosoma japonicum]
          Length = 181

 Score =  100 bits (248), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 71/232 (30%), Positives = 97/232 (41%), Gaps = 60/232 (25%)

Query: 13  WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGR 72
           ++ KRG+VTGG+  ++TGCQP  FP C H       P C T     P+C  +C    Y  
Sbjct: 9   YLVKRGIVTGGSKENHTGCQPYPFPKCEHLT-KGKYPACGTKIYKTPQCKQKC-QKGYKT 66

Query: 73  GFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMY 132
            + QDK    + Y V      IQ+EIM NGPV A   +Y D  +YKSG Y +        
Sbjct: 67  PYEQDKNYGDQRYNVISNAKAIQKEIMMNGPVEAAFDVYEDFLNYKSGIYRH-------- 118

Query: 133 LYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYAT 192
                            +  IV    ++I+GWG E   PY                    
Sbjct: 119 ----------------VTGSIVGGHAIRIIGWGVEKRTPY-------------------- 142

Query: 193 VKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
                         W I +++ E +G+KG  +I+RGR+E  IES V   L K
Sbjct: 143 --------------WLIANSWNEDWGEKGLFRIVRGRDECSIESNVVAGLIK 180


>gi|226471006|emb|CAX70584.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score =  100 bits (248), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 73/231 (31%), Positives = 101/231 (43%), Gaps = 63/231 (27%)

Query: 13  WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGR 72
           WV KRG+VTGG+  ++TGCQP  FP C H       P C T     P+C   C    Y  
Sbjct: 171 WV-KRGIVTGGSEENHTGCQPYPFPKCEHLT-KGKYPACGTKIYKTPQCKQTCQK-GYKT 227

Query: 73  GFFQDK-YRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANM 131
            + QDK Y  +RY  +++E A IQ+EIM  GPV A   +Y D  +YKSG Y +       
Sbjct: 228 PYEQDKHYGDQRYNVISNEKA-IQREIMMYGPVEAAFDVYEDFLNYKSGIYRH------- 279

Query: 132 YLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYA 191
                             +  IV    ++I+GWG E G+PY                   
Sbjct: 280 -----------------VAGSIVGGHAIRIIGWGVEKGKPY------------------- 303

Query: 192 TVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 242
                          W I +++ E +G+ G  +++RGR+E  IES V   L
Sbjct: 304 ---------------WLIANSWNEDWGENGLFRMVRGRDECSIESHVVAGL 339


>gi|194895314|ref|XP_001978227.1| GG19486 [Drosophila erecta]
 gi|190649876|gb|EDV47154.1| GG19486 [Drosophila erecta]
          Length = 340

 Score =  100 bits (248), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 71/243 (29%), Positives = 104/243 (42%), Gaps = 59/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   + W +  ++G+V+GG + SN GC+P    PC H +   + P C       PKC
Sbjct: 156 CNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPYEIAPCEH-HVNGTRPPCGH-GGGTPKC 213

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C +  Y   + +DK+   + Y V   V DIQ+EIM NGPV     +Y          
Sbjct: 214 SHVCES-GYTVDYAKDKHFGSKSYSVKRNVRDIQEEIMTNGPVEGAFTVY---------- 262

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                         D+  YK GVY      E+  +A ++I+GWG                
Sbjct: 263 -------------EDLILYKDGVYQHQHGKELGGHA-IRILGWGV--------------- 293

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                           WGEE   PYW I +++   +GD G  +ILRG++   IES ++  
Sbjct: 294 ----------------WGEEK-IPYWLIGNSWNTDWGDNGFFRILRGQDHCGIESSISAG 336

Query: 242 LPK 244
           LPK
Sbjct: 337 LPK 339


>gi|31872149|gb|AAP59456.1| cathepsin B precursor [Araneus ventricosus]
          Length = 334

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 73/243 (30%), Positives = 103/243 (42%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   S W +   +GLVTGG ++S+ GCQP +   C H       P    + TPQ  C
Sbjct: 152 CNGGFPGSAWEYWVDKGLVTGGLYNSHVGCQPYTIASCEHHTKGKLPPCGDIVDTPQ--C 209

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   +  DKY  K+ Y ++++   I+ EI  NGPV A   +Y+         
Sbjct: 210 VHMC-EKGYNVSYRADKYFGKKSYSIDEQEDQIKTEISTNGPVEAAFTVYA--------- 259

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                         D  +YKSGVY      E+  +A V+I+GWG E+G P          
Sbjct: 260 --------------DFVTYKSGVYRHVTGEEMGGHA-VRILGWGTESGTP---------- 294

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW + +++   +GDKG  KILRG +E  IES +   
Sbjct: 295 ------------------------YWLVANSWNTDWGDKGYFKILRGSDECGIESSIVAG 330

Query: 242 LPK 244
           LPK
Sbjct: 331 LPK 333


>gi|195729971|gb|ACG50796.1| cathepsin B1 [Trichobilharzia szidati]
          Length = 342

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 70/244 (28%), Positives = 102/244 (41%), Gaps = 60/244 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   + W +  + G+VTG +  ++TGCQP  FP C H +     P C       PKC
Sbjct: 159 CQGGFPGAAWDYWVEEGIVTGSSKENHTGCQPYPFPKCEH-HTKGKYPACGEKIYKTPKC 217

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C    Y   + +DKY  K  Y V  +   I++EIM +GPV A   +YSD  +YKSG 
Sbjct: 218 QQKC-QKGYKTPYKKDKYYGKLSYNVLSKEDAIKKEIMMHGPVEAAFTVYSDFLNYKSG- 275

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ +  G         ++    V+I+GWG E   PY         
Sbjct: 276 ---------------IYKHMKGT--------VIGGHAVRIIGWGVEKKTPY--------- 303

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                    W I +++ E +G+KG  +ILRG++   IES V   
Sbjct: 304 -------------------------WLIANSWNEDWGEKGYFRILRGKDVCGIESAVTAG 338

Query: 242 LPKD 245
           LP +
Sbjct: 339 LPHN 342


>gi|410956528|ref|XP_003984894.1| PREDICTED: cathepsin B [Felis catus]
          Length = 339

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 74/242 (30%), Positives = 104/242 (42%), Gaps = 61/242 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  +  W +  K+GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGFPAEAWNFWTKQGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + +DK+     Y V++   +I  EI KNGPV A   ++SD   YKSG 
Sbjct: 208 SKIC-EPGYTPSYKEDKHYGCNSYSVSNSEKEIMAEIYKNGPVEAAFSVFSDFLQYKSGV 266

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         + E++    V+I+GWG EN  PYW        
Sbjct: 267 YQH------------------------VTGEMMGGHAVRILGWGVENDTPYW-------- 294

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                        L+G             +++   +GD G  KILRGR+   IES V   
Sbjct: 295 -------------LVG-------------NSWNTDWGDHGFFKILRGRDHCGIESEVVAG 328

Query: 242 LP 243
           +P
Sbjct: 329 IP 330


>gi|185135431|ref|NP_001117776.1| procathepsin B precursor [Oncorhynchus mykiss]
 gi|14582897|gb|AAK69705.1|AF358667_1 procathepsin B [Oncorhynchus mykiss]
          Length = 330

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 70/243 (28%), Positives = 103/243 (42%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G  S+ W +  + GLVTGG + SN GC+P S  PC H +   + P C T     PKC
Sbjct: 148 CMGGFPSAAWDYWAESGLVTGGLYGSNIGCRPYSIAPCEH-HVNGTRPPC-TGEGDTPKC 205

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
            + C N  Y   + +DK   K+ Y V  +   I  E+ KNGPV A   +Y D   YK+G 
Sbjct: 206 VSEC-NAGYTPSYKKDKRFGKQTYSVPPKEQQIMTELYKNGPVEAAFSVYEDFLLYKTGV 264

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         + +++    +KI+GWG+EN  P          
Sbjct: 265 YQH------------------------VTGQMLGGHAIKILGWGKENNTP---------- 290

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW + +++   +GD G  KILRG++E  IES +   
Sbjct: 291 ------------------------YWLVANSWNTDWGDNGFFKILRGKDECGIESEIVAG 326

Query: 242 LPK 244
           +P+
Sbjct: 327 IPR 329


>gi|1169189|sp|P43157.1|CYSP_SCHJA RecName: Full=Cathepsin B-like cysteine proteinase; AltName:
           Full=Antigen Sj31; Flags: Precursor
 gi|11167|emb|CAA50305.1| cathepsin B [Schistosoma japonicum]
          Length = 342

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 71/243 (29%), Positives = 99/243 (40%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +  KRG+VTGG+  ++TGCQP  FP C H +     P C T     P+C
Sbjct: 159 CQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEH-HTKGKYPACGTKIYKTPQC 217

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+     Y V +    IQ++IM  GPV A   +Y D  +YKSG 
Sbjct: 218 KQTCQK-GYKTPYEQDKHYGDESYNVQNNEKVIQRDIMMYGPVEAAFDVYEDFLNYKSGI 276

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         +  IV    ++I+GWG E   PY         
Sbjct: 277 YRH------------------------VTGSIVGGHAIRIIGWGVEKRTPY--------- 303

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                    W I +++ E +G+KG  +++RGR+E  IES V   
Sbjct: 304 -------------------------WLIANSWNEDWGEKGLFRMVRGRDECSIESDVVAG 338

Query: 242 LPK 244
           L K
Sbjct: 339 LIK 341


>gi|308504721|ref|XP_003114544.1| hypothetical protein CRE_27547 [Caenorhabditis remanei]
 gi|308261929|gb|EFP05882.1| hypothetical protein CRE_27547 [Caenorhabditis remanei]
          Length = 358

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 74/234 (31%), Positives = 97/234 (41%), Gaps = 64/234 (27%)

Query: 13  WVHKRGLVTGGAHHSNTGCQPVSFPPCN--HANYTTSEPECKTLATPQPKCHTRCT-NDN 69
           W    GL TGG +    GC+P S  PC+  + N TTS P C    TP   C   CT N  
Sbjct: 180 WWQTHGLCTGGNYEDQFGCKPYSIYPCDKKYPNGTTSVP-CPGYHTP--TCEEHCTSNIT 236

Query: 70  YGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVA 129
           +   + QDK+  K +Y V  ++ DIQ EIM NGPV+A+  +Y D + YKSG Y       
Sbjct: 237 WPIAYKQDKHFGKAHYNVGKKMTDIQTEIMTNGPVIASFVIYDDFWDYKSGIY------- 289

Query: 130 NMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVA 189
                            V  + +       KI+GWG ++G PYW  V             
Sbjct: 290 -----------------VHTAGDQEGGMDTKIIGWGVDSGVPYWLCVH------------ 320

Query: 190 YATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
                                  +G  FG+ G ++ LRG NE  IE  V  ALP
Sbjct: 321 ----------------------QWGTDFGENGFVRFLRGVNEVNIEHQVLAALP 352


>gi|340380665|ref|XP_003388842.1| PREDICTED: cathepsin B-like [Amphimedon queenslandica]
          Length = 333

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 76/246 (30%), Positives = 103/246 (41%), Gaps = 67/246 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNH---ANYTTSEPECKTLATPQ 58
           C+ G     W +  K GLVTGG + S+ GCQP   P CNH     Y     E KT     
Sbjct: 149 CAGGFLQQAWEYWVKDGLVTGGQYGSDEGCQPYLIPKCNHHEPGPYENCTGEGKT----- 203

Query: 59  PKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
           P+C   C +  Y   +  D +  ++ Y V+ EV  IQ EIM NGPV     +YSD  +YK
Sbjct: 204 PQCERTCRS-GYTTSYEADLHYGEKAYAVHREVEAIQTEIMTNGPVEGAFTVYSDFPTYK 262

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
           SG            +Y  +  +  G +A            ++I+GWG ENG PYW I   
Sbjct: 263 SG------------VYQHVVGHALGGHA------------IRILGWGTENGVPYWLIANS 298

Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
           +                         P W          GDKG  K++RG+++  IES +
Sbjct: 299 W------------------------NPSW----------GDKGYFKMIRGKDDCGIESNI 324

Query: 239 NGALPK 244
               PK
Sbjct: 325 VAGTPK 330


>gi|195058549|ref|XP_001995463.1| GH17748 [Drosophila grimshawi]
 gi|193896249|gb|EDV95115.1| GH17748 [Drosophila grimshawi]
          Length = 340

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 68/243 (27%), Positives = 106/243 (43%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   + W +   RG+V+GG+++S  GC+P    PC H +     P C + +TP   C
Sbjct: 157 CNGGFPGAAWSYWTTRGIVSGGSYNSTEGCRPYEVEPCEH-HVDGPRPPCHSGSTPH--C 213

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C   NY   + +DK+     Y +N    +IQ+EIM NGPV     +Y D+  YK+G 
Sbjct: 214 KHQC-QPNYSVDYEKDKHFGASSYSINRNPRNIQREIMTNGPVEGAFTVYEDLILYKTG- 271

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                      +Y  +   + G +A            ++I+GWG                
Sbjct: 272 -----------VYQHVHGKQLGGHA------------IRIIGWGV--------------- 293

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                           WGE    PYW I +++   +GD G  +ILRG++   IES ++  
Sbjct: 294 ----------------WGESK-VPYWLIANSWNTDWGDNGFFRILRGKDHCGIESQISAG 336

Query: 242 LPK 244
           LPK
Sbjct: 337 LPK 339


>gi|312374701|gb|EFR22198.1| hypothetical protein AND_15621 [Anopheles darlingi]
          Length = 335

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 69/244 (28%), Positives = 100/244 (40%), Gaps = 62/244 (25%)

Query: 2   CSSGISSSTWV-WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 60
           C+ G   + W  WVHK GLV+GG   SN GCQP +  PC H +   + P C+      PK
Sbjct: 152 CNGGFPGAAWSYWVHK-GLVSGGPFGSNLGCQPYAIAPCEH-HVNGTRPSCEGEGGKTPK 209

Query: 61  CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           C  +C  D+Y   + +DK    + Y +      I++EIM NGPV     +Y D+  YK G
Sbjct: 210 CVKKC-QDSYTVPYAKDKRYGSKSYSIPRHEDQIRKEIMTNGPVEGAFTVYEDLLHYKEG 268

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
            Y +                         + +++    ++I+GWG EN   Y        
Sbjct: 269 VYQH------------------------VTGKMLGGHAIRILGWGVENNTKY-------- 296

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                                     W I +++   +GD G  KILRG +   IES +  
Sbjct: 297 --------------------------WLIANSWNSDWGDNGFFKILRGEDHLGIESSIAA 330

Query: 241 ALPK 244
            LPK
Sbjct: 331 GLPK 334


>gi|294954734|ref|XP_002788292.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239903555|gb|EER20088.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 317

 Score = 99.4 bits (246), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 74/249 (29%), Positives = 105/249 (42%), Gaps = 66/249 (26%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSN------TGCQPVSFPPCNHANYTTSEPECKTLA 55
           C  G   S W WVH  G+ TGG + +        GC P  FPPC H    T  P+C   +
Sbjct: 128 CDGGYPDSAWSWVHDEGIATGGDYVARGNLTKGDGCWPYDFPPCAHHINDTKYPKCPKGS 187

Query: 56  TPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIF 115
              P C  +C N  Y      D     R+Y            ++++ P     Y YS + 
Sbjct: 188 YETPNCVEQCHNPKYSTSLKND-----RHY------------MLESSP-----YQYS-VN 224

Query: 116 SYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
           + K+    +GPV A+  +Y D  +YKSGVY  ++ + +  +A                  
Sbjct: 225 NAKNAIRTDGPVSASYLVYEDFLAYKSGVYKHTSGSYLGGHA------------------ 266

Query: 176 VRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
                            VK+IGWGEENG  YW +V+++ E +GD G  KI  G N  I +
Sbjct: 267 -----------------VKIIGWGEENGEAYWLVVNSWNEDWGDHGLFKIALG-NCQIDD 308

Query: 236 SLVNGALPK 244
            L+ G  PK
Sbjct: 309 DLL-GGTPK 316


>gi|157167366|ref|XP_001653890.1| cathepsin b [Aedes aegypti]
 gi|54289254|gb|AAV31917.1| lysosomal cathepsin B [Aedes aegypti]
 gi|108874249|gb|EAT38474.1| AAEL009637-PA [Aedes aegypti]
          Length = 340

 Score = 99.4 bits (246), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 66/243 (27%), Positives = 100/243 (41%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   + W +  ++GLV+GG   S+ GCQP +  PC H +   S P C+      PKC
Sbjct: 157 CNGGFPGAAWSYWVRKGLVSGGPFGSDQGCQPYAIAPCEH-HVNGSRPSCEGEGGKTPKC 215

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C   +Y   + +DK   K  Y + +    IQ+EIM NGPV     +Y D+ +YK G 
Sbjct: 216 VKKC-QASYNVPYAKDKMYGKSSYSIANHEKQIQKEIMTNGPVEGAFTVYEDLLNYKEGV 274

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                           +++    ++I+GWG E+G  Y         
Sbjct: 275 YHH------------------------VHGKMLGGHAIRILGWGVEDGTKY--------- 301

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                    W I +++   +GD G  KILRG +   IES +   
Sbjct: 302 -------------------------WLIANSWNSDWGDNGFFKILRGEDHLGIESSIAAG 336

Query: 242 LPK 244
           LPK
Sbjct: 337 LPK 339


>gi|56756587|gb|AAW26466.1| unknown [Schistosoma japonicum]
          Length = 216

 Score = 99.4 bits (246), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 71/243 (29%), Positives = 99/243 (40%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +   +G+VTGG+  ++TGCQP  FP C H +     P C T     P+C
Sbjct: 33  CQGGFPGVAWDYWVTQGIVTGGSKENHTGCQPYPFPKCEH-HTKGKYPACGTKIYKTPQC 91

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C    Y   + QDK+     Y V      IQ+EIM NGPV A   +Y D  +YKSG 
Sbjct: 92  KQKC-QKGYKTPYKQDKHYGDESYNVISNEKAIQKEIMMNGPVEAAFDVYEDFLNYKSGI 150

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         +  IV    ++I+GWG +   P          
Sbjct: 151 YRH------------------------VTGSIVGGHAIRIIGWGVKKRTP---------- 176

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW I +++ E +G+KG  +I+RGR+E  IES V   
Sbjct: 177 ------------------------YWLIANSWNEDWGEKGLFRIVRGRDECSIESNVVAG 212

Query: 242 LPK 244
           L K
Sbjct: 213 LIK 215


>gi|50657025|emb|CAH04630.1| cathepsin B [Suberites domuncula]
          Length = 331

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 69/243 (28%), Positives = 101/243 (41%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   + W +    GLVTGG ++S  GCQP     C+H      +P C +     P+C
Sbjct: 145 CNGGYLGAAWRYFEHTGLVTGGQYNSKEGCQPYLIASCDHHVVGKKQP-CASKEEHTPRC 203

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   F +DK+     Y V   V  IQ EIM NGPV     +Y          
Sbjct: 204 SKTC-EAGYDVSFEKDKHFGASAYSVRSSVEAIQTEIMTNGPVEGAFTVY---------- 252

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                        +D  +YKSGVY  ++ A +  +A ++I+GWG ENG P          
Sbjct: 253 -------------ADFPTYKSGVYQHTSGAMLGGHA-IRILGWGTENGTP---------- 288

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW + +++ E +G  G  KI+RG+++  IES +   
Sbjct: 289 ------------------------YWLVANSWNEDWGAMGYFKIIRGKDDCGIESQITAG 324

Query: 242 LPK 244
           +PK
Sbjct: 325 MPK 327


>gi|48425700|pdb|1SP4|B Chain B, Crystal Structure Of Ns-134 In Complex With Bovine
           Cathepsin B: A Two Headed Epoxysuccinyl Inhibitor
           Extends Along The Whole Active Site Cleft
          Length = 205

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 73/236 (30%), Positives = 102/236 (43%), Gaps = 61/236 (25%)

Query: 8   SSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 67
           S  W +  K+GLV+GG ++S+ GC+P S PPC H +   S P C T     PKC+  C  
Sbjct: 29  SGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKCNKTC-E 85

Query: 68  DNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPV 127
             Y   + +DK+     Y V +   +I  EI KNGPV     +YSD   YKSG Y +   
Sbjct: 86  PGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQH--- 142

Query: 128 VANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEI 187
                                 S EI+    ++I+GWG ENG PYW              
Sbjct: 143 ---------------------VSGEIMGGHAIRILGWGVENGTPYW-------------- 167

Query: 188 VAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
                  L+G             +++   +GD G  KILRG++   IES +   +P
Sbjct: 168 -------LVG-------------NSWNTDWGDNGFFKILRGQDHCGIESEIVAGMP 203


>gi|195437434|ref|XP_002066645.1| GK24603 [Drosophila willistoni]
 gi|194162730|gb|EDW77631.1| GK24603 [Drosophila willistoni]
          Length = 341

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 69/243 (28%), Positives = 100/243 (41%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   + W +  ++GLV+GG + S  GCQP +  PC+H+    S P C        +C
Sbjct: 158 CQGGYPGAAWAYWARKGLVSGGDYGSQQGCQPYTIEPCDHSG-NGSRPVCTVGGGV--RC 214

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C   +Y   F +DK    + Y ++++V +IQ+EIM NGPV A + +Y D  SYK+G 
Sbjct: 215 QHLC-EPSYKVDFQRDKNFASKVYSISNDVLEIQKEIMTNGPVQAILTVYEDFLSYKTGV 273

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y                             E V    V+I+GWG                
Sbjct: 274 Y------------------------YHLEGEKVGPHAVRILGWGV--------------- 294

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                           WG +   PYW + +++G  +GD G   I RG N   IE  +   
Sbjct: 295 ----------------WGTKK-VPYWLVANSWGSDWGDNGFFHIFRGENHCDIEGYIMAG 337

Query: 242 LPK 244
           LPK
Sbjct: 338 LPK 340


>gi|432852559|ref|XP_004067308.1| PREDICTED: cathepsin B-like [Oryzias latipes]
          Length = 330

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 70/243 (28%), Positives = 99/243 (40%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  ++ W +  K GLVTGG + S+ GC+P + PPC H +   + P C       P+C
Sbjct: 148 CNGGYPTAAWDFWTKEGLVTGGLYDSHVGCRPYTIPPCEH-HVNGTRPPCTGEGGDTPQC 206

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C +  Y   + +DK+  K  Y V      IQ EI KNGPV     +Y D   YKSG 
Sbjct: 207 INQCES-GYTPSYKKDKHYGKTSYSVEANENQIQTEIYKNGPVEGAFMVYEDFPMYKSGV 265

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         S  ++    +KI+GWG E+G P          
Sbjct: 266 YQH------------------------VSGSLIGGHAIKILGWGVEDGVP---------- 291

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW   +++   +GD G  KILRG +   IES V   
Sbjct: 292 ------------------------YWLCANSWNTDWGDNGYFKILRGSDHCGIESEVVAG 327

Query: 242 LPK 244
           +PK
Sbjct: 328 IPK 330


>gi|327322926|gb|AEA48884.1| cathepsin B [Oplegnathus fasciatus]
          Length = 330

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 74/243 (30%), Positives = 100/243 (41%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S+ W +  K GLV+GG + S+ GC+P +  PC H +   S P C       P+C
Sbjct: 148 CNGGYPSAAWDFWTKEGLVSGGLYDSHIGCRPYTIAPCEH-HVNGSRPSCTGEGGDTPQC 206

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
            T+C    Y   + +DK+  K  Y V  +   IQ EI KNGPV     +Y D   YKSG 
Sbjct: 207 ITKC-EAGYTPSYKEDKHFGKTSYTVLSDEEQIQSEIFKNGPVEGAFIVYEDFVLYKSGV 265

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                      VS SA  V    +KI+GWG E+G P          
Sbjct: 266 YQH----------------------VSGSA--VGGHAIKILGWGVEDGVP---------- 291

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW   +++   +GD G  K LRG +   IES V   
Sbjct: 292 ------------------------YWLCANSWNTDWGDNGFFKFLRGSDHCGIESEVVAG 327

Query: 242 LPK 244
           +PK
Sbjct: 328 IPK 330


>gi|395507317|ref|XP_003757972.1| PREDICTED: cathepsin B [Sarcophilus harrisii]
          Length = 342

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 65/243 (26%), Positives = 100/243 (41%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  +  W +  K+GLV+GG + S+ GC+P S PPC H +   S P C       PKC
Sbjct: 152 CNGGFPAGAWKYWIKKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPACTGEGGDTPKC 210

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           + +C    Y   +  DK+     Y V     +I  EI KNGPV     +Y+D   YKSG 
Sbjct: 211 NKKC-EAGYSPDYKDDKHYGTTAYNVPSSEKEIMAEIYKNGPVEGAFIVYADFLQYKSGV 269

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         + +++    ++++GWG E+G P          
Sbjct: 270 YQH------------------------VTGDMLGGHAIRVLGWGVEDGVP---------- 295

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW   +++   +GD G  KILRG++   IES +   
Sbjct: 296 ------------------------YWLAANSWNTDWGDNGFFKILRGKDHCGIESEMVAG 331

Query: 242 LPK 244
           +P+
Sbjct: 332 IPR 334


>gi|187103108|ref|NP_001119614.1| cathepsin B-1418 precursor [Acyrthosiphon pisum]
 gi|163300438|tpg|DAA06126.1| TPA_inf: cathepsin B transcript 1418 [Acyrthosiphon pisum]
 gi|239788654|dbj|BAH70998.1| ACYPI000010 [Acyrthosiphon pisum]
          Length = 346

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 71/243 (29%), Positives = 98/243 (40%), Gaps = 64/243 (26%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   S W +  + G+VTGG + S  GCQP S  PC     T  E +  T     P C
Sbjct: 160 CDGGSPESAWYFFMRHGIVTGGDYGSEDGCQPYSIYPCGKGRNTCIEDDPDT-----PDC 214

Query: 62  HTR-CTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
             + CTN NY + +  D +     Y ++    DI +++ KNGPV A  Y+Y+D   YKSG
Sbjct: 215 SIKTCTNSNYSKNYRADLHYVDTVYSLSRSEEDIMKDLYKNGPVQAAFYVYTDFMYYKSG 274

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
                           ++SY  G        +I     +KI+GWG ++G  YW     ++
Sbjct: 275 ----------------VYSYTRG--------QIEGGHAIKILGWGVDDGTKYWLCANSWS 310

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
            S               WGE                    G  +ILRG NE  IE  V  
Sbjct: 311 RS---------------WGE-------------------NGLFRILRGNNECHIEDRVIA 336

Query: 241 ALP 243
            +P
Sbjct: 337 GMP 339


>gi|984958|gb|AAC46877.1| cathepsin B-like proteinase [Ancylostoma caninum]
          Length = 343

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 69/242 (28%), Positives = 102/242 (42%), Gaps = 60/242 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     + W+ + G+VTGG +     C+P +F PC H         C     P PKC
Sbjct: 159 CQGGWPIEAYKWMQRDGVVTGGKYRQKKVCKPYAFYPCGHHQNDPYYGPCPGGLWPTPKC 218

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y + + +DK+   R Y++ +   +I+QEI                       
Sbjct: 219 RKTCQR-KYNKSYQEDKHFATRAYYLPNNERNIRQEI----------------------- 254

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y NGPVVA   +Y D   YK G+Y               +  WG + G            
Sbjct: 255 YKNGPVVAAFRVYQDFSYYKKGIY---------------VHKWGGQTG------------ 287

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES-LVNG 240
                  A+A VK++GWG EN   YW I +++   +G+ G  +I+RG NE  IE+ +V G
Sbjct: 288 -------AHA-VKVVGWGRENATDYWLIANSWNTDWGESGYFRIVRGTNECGIEAQMVGG 339

Query: 241 AL 242
           A+
Sbjct: 340 AM 341


>gi|167538317|ref|XP_001750823.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163770644|gb|EDQ84327.1| predicted protein [Monosiga brevicollis MX1]
          Length = 341

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 69/243 (28%), Positives = 99/243 (40%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           CS G  S+ W W    G+VTGG ++S+ GCQP S P C+H + +   P C     P P C
Sbjct: 159 CSGGYPSAAWSWFKTTGIVTGGNYNSSQGCQPYSLPNCDH-HVSGQYPACSGEG-PTPAC 216

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   +  DK+     Y V  E   I  EIM NGPV     +Y D+ +YKSG 
Sbjct: 217 KKSC-EAGYNNTYSNDKHFGATAYSVAGEADKIATEIMTNGPVEGAFTVYEDLLTYKSGV 275

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         + +++    +KI+GWG E+G  YW        
Sbjct: 276 YQH------------------------TTGQVLGGHAIKIIGWGVESGVDYW-------- 303

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                           W          + +++   +GD G  KI +G +E  IES +   
Sbjct: 304 ----------------W----------VANSWNNDWGDNGFFKIKKGVDECGIESQIVAG 337

Query: 242 LPK 244
           +PK
Sbjct: 338 MPK 340


>gi|294951797|ref|XP_002787132.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239901778|gb|EER18928.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 278

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 69/249 (27%), Positives = 100/249 (40%), Gaps = 66/249 (26%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAH------HSNTGCQPVSFPPCNHANYTTSEPECKTLA 55
           C+ G  +S W WVH +G+ TGG +        + GC P  FPPC H    +  P+C   +
Sbjct: 89  CNGGFPNSAWSWVHDKGIATGGDYVAEDDMTKDDGCWPYDFPPCAHHVNDSKYPKCPKDS 148

Query: 56  TPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIF 115
              P C  +C N  Y      D++                        V ++ Y YS + 
Sbjct: 149 YETPNCAEQCHNPKYTTTLRDDRHFM----------------------VESSPYQYS-VN 185

Query: 116 SYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
             K+    +GPV A+  +Y D  +YKSGVY                              
Sbjct: 186 DAKNAIRTDGPVSASFTVYEDFLAYKSGVYK----------------------------- 216

Query: 176 VRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
                   S E +    VK+IGWGEE+G+ YW +V+++ E +GD G  KI  G     I+
Sbjct: 217 ------HTSGEYLGGHAVKIIGWGEESGQAYWLVVNSWNEDWGDHGLFKIALGN--CGID 268

Query: 236 SLVNGALPK 244
             + G  PK
Sbjct: 269 DYLLGGTPK 277


>gi|9955277|pdb|1QDQ|A Chain A, X-Ray Crystal Structure Of Bovine Cathepsin B-Ca074
           Complex
          Length = 253

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 71/236 (30%), Positives = 99/236 (41%), Gaps = 61/236 (25%)

Query: 8   SSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 67
           S  W +  K+GLV+GG ++S+ GC+P S PPC H +   S P C T     PKC   C  
Sbjct: 77  SGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKCSKTC-E 133

Query: 68  DNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPV 127
             Y   + +DK+     Y V +   +I  EI KNGPV     +YSD   YKSG Y +   
Sbjct: 134 PGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQH--- 190

Query: 128 VANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEI 187
                                 S EI+    ++I+GWG ENG P                
Sbjct: 191 ---------------------VSGEIMGGHAIRILGWGVENGTP---------------- 213

Query: 188 VAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
                             YW + +++   +GD G  KILRG++   IES +   +P
Sbjct: 214 ------------------YWLVANSWNTDWGDNGFFKILRGQDHCGIESEIVAGMP 251


>gi|17565162|ref|NP_503382.1| Protein W07B8.4 [Caenorhabditis elegans]
 gi|351059398|emb|CCD74288.1| Protein W07B8.4 [Caenorhabditis elegans]
          Length = 335

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 73/246 (29%), Positives = 103/246 (41%), Gaps = 59/246 (23%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +  K GLVTGG+  S  GC+P S  PC       + PEC    +  PKC
Sbjct: 145 CEGGYPIQAWRYWVKNGLVTGGSFESQYGCKPYSIAPCGETIDGVTWPECPMKISDTPKC 204

Query: 62  HTRCT-NDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
              CT N++Y   + QDK+     Y +      IQ EI+ +GPV     +Y         
Sbjct: 205 EHHCTGNNSYPIPYDQDKHFGASAYAIGRSAKQIQTEILAHGPVEVGFIVY--------- 255

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
                          D + YK+G+Y   A  E+  +A VK++GWG +NG PYW       
Sbjct: 256 --------------EDFYLYKTGIYTHVAGGELGGHA-VKMLGWGVDNGTPYW------- 293

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
           ++A++                     W  V      +G+KG  +ILRG +E  IES    
Sbjct: 294 LAANS---------------------WNTV------WGEKGYFRILRGVDECGIESAAVA 326

Query: 241 ALPKDN 246
            +P  N
Sbjct: 327 GMPDLN 332


>gi|213514196|ref|NP_001133994.1| Cathepsin B precursor [Salmo salar]
 gi|209156086|gb|ACI34275.1| Cathepsin B precursor [Salmo salar]
          Length = 330

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 72/242 (29%), Positives = 103/242 (42%), Gaps = 60/242 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S+   +  K GLV+GG + S+ GC+P S PPC H +   + P CK      P+C
Sbjct: 148 CNGGYPSAACDFWTKEGLVSGGLYDSHIGCRPYSIPPCEH-HVNGTRPPCKGEEGDTPQC 206

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C    Y  G+ QDK+  KR Y V  +  +I +E+ KNGPV     +Y D   YKSG 
Sbjct: 207 TNQC-EPGYTPGYKQDKHFGKRSYSVPSDEKEIMKELYKNGPVEGAFTVYEDFLLYKSGV 265

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                      VS SA  V    +K++GWGEE G P          
Sbjct: 266 YRH----------------------VSGSA--VGGHAIKVLGWGEEGGIP---------- 291

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW   +++   +G+ G  KI+RG +   IES +   
Sbjct: 292 ------------------------YWLAANSWNTDWGENGFFKIVRGEDHCGIESEMVAG 327

Query: 242 LP 243
           +P
Sbjct: 328 IP 329


>gi|442754445|gb|JAA69382.1| Putative cathepsin b precursor [Ixodes ricinus]
          Length = 340

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 71/243 (29%), Positives = 98/243 (40%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  ++ W +   +G+VTGG + ++ GC P   P C+H    T  P  +    P PKC
Sbjct: 158 CNGGFPAAAWSYWVDKGIVTGGNYDTDEGCMPYPVPSCDHHVNGTLGPCGQD--PPTPKC 215

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             R     Y   F  DK+  K  Y V      IQ EIMKNGPV     +Y+D   YKSG 
Sbjct: 216 -VRLCRKGYNVDFKDDKHYGKSSYSVPSNETQIQMEIMKNGPVEGAFTVYADFPLYKSGV 274

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y                         S S + +    ++I+GWG EN  P          
Sbjct: 275 YK------------------------SHSTDALGGHAIRILGWGVENDVP---------- 300

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW + +++  ++GDKG  KILRG NE  IE  +   
Sbjct: 301 ------------------------YWLVANSWNTEWGDKGYFKILRGSNECGIEEDIVAG 336

Query: 242 LPK 244
           +PK
Sbjct: 337 IPK 339


>gi|28373366|pdb|1ITO|A Chain A, Crystal Structure Analysis Of Bovine Spleen Cathepsin B-
           E64c Complex
 gi|88192750|pdb|2DC6|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-ca073 Complex
 gi|88192751|pdb|2DC7|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-ca042 Complex
 gi|88192752|pdb|2DC8|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-ca059 Complex
 gi|88192753|pdb|2DC9|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-Ca074me Complex
 gi|88192754|pdb|2DCA|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-ca075 Complex
 gi|88192755|pdb|2DCB|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-Ca076 Complex
 gi|88192756|pdb|2DCC|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-Ca077 Complex
 gi|88192757|pdb|2DCD|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
           Cathepsin B-Ca078 Complex
          Length = 256

 Score = 98.6 bits (244), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 73/236 (30%), Positives = 101/236 (42%), Gaps = 61/236 (25%)

Query: 8   SSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 67
           S  W +  K+GLV+GG ++S+ GC+P S PPC H +   S P C T     PKC   C  
Sbjct: 77  SGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKCSKTC-E 133

Query: 68  DNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPV 127
             Y   + +DK+     Y V +   +I  EI KNGPV     +YSD   YKSG Y +   
Sbjct: 134 PGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQH--- 190

Query: 128 VANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEI 187
                                 S EI+    ++I+GWG ENG PYW              
Sbjct: 191 ---------------------VSGEIMGGHAIRILGWGVENGTPYW-------------- 215

Query: 188 VAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
                  L+G             +++   +GD G  KILRG++   IES +   +P
Sbjct: 216 -------LVG-------------NSWNTDWGDNGFFKILRGQDHCGIESEIVAGMP 251


>gi|255040225|gb|ACT99885.1| cathepsin B2 [Opisthorchis viverrini]
          Length = 337

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 69/243 (28%), Positives = 94/243 (38%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   + W +    G+VTGG+     GC+   FP C+H   +   P C       PKC
Sbjct: 149 CQGGYPPAAWDFWQAYGIVTGGSKEDPMGCRSYPFPKCSHHG-SKKYPPCPHRIYDTPKC 207

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C   N    +  DK R    Y V      I +EIM NGPV A   +Y D F YK G 
Sbjct: 208 VPKCDTPNID--YETDKTRANITYNVQRSQMAIMKEIMINGPVEAAFEVYEDFFGYKQGV 265

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y                          ++ E +    ++I+GWGEENG PY         
Sbjct: 266 Y------------------------FHSTGEFIGGHAIRILGWGEENGTPY--------- 292

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                    W I +++ E +G+ G  K+LRG+NE  IE  V   
Sbjct: 293 -------------------------WLIANSWNEGWGEDGYFKMLRGKNECGIEDEVTAG 327

Query: 242 LPK 244
           LP+
Sbjct: 328 LPE 330


>gi|161343839|tpg|DAA06100.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 323

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 72/246 (29%), Positives = 98/246 (39%), Gaps = 66/246 (26%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPK 60
           C  G +   W     +G+VTGG   SN GCQP    PC+H  Y  S    C +L   Q  
Sbjct: 134 CDGGSAFKAWELTMNKGIVTGGNFDSNEGCQPYKNRPCDH--YGDSRLTNCSSLRRTQMT 191

Query: 61  -CHTRCTNDNYGRGFFQDKYRFKRYY---WVNDEVADIQQEIMKNGPVVANMYLYSDIFS 116
            C  +C N NY   +  D ++    Y   W N  V  IQQEIM +GPV A MY+Y +   
Sbjct: 192 VCRKKCVNKNYKVKYEDDLHKTSIVYMTSWTN--VKQIQQEIMTHGPVTAFMYVYENFMG 249

Query: 117 YKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
           YK G Y                         S + E++ Y  VK++GWG +         
Sbjct: 250 YKEGIYK------------------------STTGELIGYHHVKLIGWGVDG-------- 277

Query: 177 RVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
                                    +G  YW  ++++   +G+ G  KILRG N   IE 
Sbjct: 278 -------------------------DGTEYWLAMNSWNSNWGNDGLFKILRGYNFCSIEL 312

Query: 237 LVNGAL 242
           LV   +
Sbjct: 313 LVMAGI 318


>gi|195438776|ref|XP_002067308.1| GK16352 [Drosophila willistoni]
 gi|194163393|gb|EDW78294.1| GK16352 [Drosophila willistoni]
          Length = 340

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 68/243 (27%), Positives = 107/243 (44%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   + W +  ++G+V+GG   S  GC+P    PC H +   + P C + +TP  +C
Sbjct: 157 CNGGFPGAAWSYWTRKGIVSGGNFGSQQGCRPYEIEPCEH-HVNGTRPPCSSGSTP--RC 213

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C + +Y   + +DK    + Y + + V DIQ+EIM NGPV     +Y          
Sbjct: 214 QHVCES-SYKVDYKKDKNFGSKSYSIKNNVLDIQKEIMNNGPVEGAFTVY---------- 262

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                         D+  YKSGVY      E+  +A ++I+GWG                
Sbjct: 263 -------------EDLILYKSGVYEHVHGKELGGHA-IRILGWGV--------------- 293

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                           WG+E   PYW I +++   +GD G  +I+RG++   IES ++  
Sbjct: 294 ----------------WGDEK-IPYWLIANSWNTDWGDNGFFRIVRGKDHCGIESSISAG 336

Query: 242 LPK 244
           LPK
Sbjct: 337 LPK 339


>gi|49036806|gb|AAT48984.1| cathepsin B-like proteinase [Triatoma sordida]
          Length = 331

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 69/243 (28%), Positives = 99/243 (40%), Gaps = 62/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G  +S W +    G+V+GG + S  GCQP S  PC H +   S P C       P C
Sbjct: 150 CDGGFPASAWDYWQNEGIVSGGNYGSKQGCQPYSIAPCEH-HVPGSRPACSG-GGDTPDC 207

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C ++  G  + QD Y  +  Y + DE   IQ EI+KNGPV A   +Y D+ +YK G 
Sbjct: 208 RNQC-DEGSGISYDQDHYYGETVYTL-DEAKQIQAEILKNGPVEAAFTVYEDLLNYKEGV 265

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         + E +    +KI+GWG EN  P          
Sbjct: 266 YQH------------------------VAGEALGGHAIKILGWGVENDTP---------- 291

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW + +++   +G+ G  KILRG +E  IE  +   
Sbjct: 292 ------------------------YWLVANSWNTDWGNNGFFKILRGSDECGIEDQIVAG 327

Query: 242 LPK 244
           LP+
Sbjct: 328 LPR 330


>gi|353228456|emb|CCD74627.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 333

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 71/239 (29%), Positives = 102/239 (42%), Gaps = 64/239 (26%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G S  +W +  K GLVTG      TGC P  FP C+H + + S P+C  +    P C
Sbjct: 152 CQIGFSEFSWDYWLKNGLVTGDP----TGCLPYPFPKCDHRS-SNSYPKCGYITYTAPPC 206

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C +  Y   +  DK+  +  Y +    +DI++EIM NGPV A ++++SD  +YKSG 
Sbjct: 207 TKTCRS-GYPIPYKADKHYGRVIYSLRPNESDIRKEIMMNGPVEAGIFVHSDFLNYKSGV 265

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         + ++V   +V+I+GWG EN  P          
Sbjct: 266 YRH------------------------ITGQLVTIHSVRIIGWGIENDIP---------- 291

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                                   YW   +++ E +G  G  KILRG NE  IES VN 
Sbjct: 292 ------------------------YWLCANSWNEDWGLNGYFKILRGSNECEIESFVNA 326


>gi|256090674|ref|XP_002581308.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 250

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 71/239 (29%), Positives = 101/239 (42%), Gaps = 64/239 (26%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G S  +W +  K GLVTG      TGC P  FP C+H + + S P+C  +    P C
Sbjct: 69  CQIGFSEFSWDYWLKNGLVTGDP----TGCLPYPFPKCDHRS-SNSYPKCGYITYTAPPC 123

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   +  DK+  +  Y +    +DI++EIM NGPV A ++++SD  +YKSG 
Sbjct: 124 TKTC-RSGYPIPYKADKHYGRVIYSLRPNESDIRKEIMMNGPVEAGIFVHSDFLNYKSGV 182

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         + ++V   +V+I+GWG EN  P          
Sbjct: 183 YRH------------------------ITGQLVTIHSVRIIGWGIENDIP---------- 208

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                                   YW   +++ E +G  G  KILRG NE  IES VN 
Sbjct: 209 ------------------------YWLCANSWNEDWGLNGYFKILRGSNECEIESFVNA 243


>gi|341904470|gb|EGT60303.1| hypothetical protein CAEBREN_20420 [Caenorhabditis brenneri]
          Length = 351

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 72/243 (29%), Positives = 101/243 (41%), Gaps = 59/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G     W    K G VTGG++   TGC+P  +PPC H    T    C +   P  KC
Sbjct: 167 CNGGYPIEAWRHYVKNGYVTGGSYQEKTGCKPYPYPPCEHHVNGTHYKPCPSDMYPTDKC 226

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QD +  +  Y V+ +  +IQ+EIM NGPV     +Y+D F   SG 
Sbjct: 227 ERSC-QAGYSLTYKQDLHFGQSAYAVSKKATEIQKEIMTNGPVEVAFTVYAD-FEVYSG- 283

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                                GVY  +A A +  +A VK++GWG +NG P          
Sbjct: 284 ---------------------GVYVHTAGASLGGHA-VKMLGWGVDNGTP---------- 311

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW   +++ E +G+ G  +I+RG NE  IE  V G 
Sbjct: 312 ------------------------YWLCANSWNEDWGENGYFRIIRGVNECGIEHGVVGG 347

Query: 242 LPK 244
           +PK
Sbjct: 348 IPK 350


>gi|147906534|ref|NP_001090927.1| cathepsin B precursor [Sus scrofa]
 gi|187470655|sp|A1E295.1|CATB_PIG RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
           light chain; Contains: RecName: Full=Cathepsin B heavy
           chain; Flags: Precursor
 gi|118490058|gb|ABK96810.1| cathepsin B [Sus scrofa]
          Length = 335

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 71/242 (29%), Positives = 103/242 (42%), Gaps = 61/242 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S  W +  K+GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + +DK+     Y ++    +I  EI KNGPV     +YSD   YKSG 
Sbjct: 208 SKIC-EPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYKNGPVEGAFTVYSDFLQYKSGV 266

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         + +++    ++I+GWG ENG PYW        
Sbjct: 267 YQH------------------------VTGDLMGGHAIRILGWGVENGTPYW-------- 294

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                        L+G             +++   +GD G  KILRG++   IES +   
Sbjct: 295 -------------LVG-------------NSWNTDWGDNGFFKILRGQDHCGIESEIVAG 328

Query: 242 LP 243
           +P
Sbjct: 329 IP 330


>gi|442616292|ref|NP_001259536.1| cathepsin B1, isoform B [Drosophila melanogaster]
 gi|440216755|gb|AGB95378.1| cathepsin B1, isoform B [Drosophila melanogaster]
          Length = 330

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 70/243 (28%), Positives = 104/243 (42%), Gaps = 59/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   + W +  ++G+V+GG + SN GC+P    PC H +   + P C       PKC
Sbjct: 146 CNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPYEISPCEH-HVNGTRPPCAH-GGRTPKC 203

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C +  Y   + +DK+   + Y V   V +IQ+EIM NGPV     +Y          
Sbjct: 204 SHVCQS-GYTVDYAKDKHFGSKSYSVRRNVREIQEEIMTNGPVEGAFTVY---------- 252

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                         D+  YK GVY      E+  +A ++I+GWG                
Sbjct: 253 -------------EDLILYKDGVYQHEHGKELGGHA-IRILGWGV--------------- 283

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                           WGEE   PYW I +++   +GD G  +ILRG++   IES ++  
Sbjct: 284 ----------------WGEEK-IPYWLIGNSWNTDWGDHGFFRILRGQDHCGIESSISAG 326

Query: 242 LPK 244
           LPK
Sbjct: 327 LPK 329


>gi|91078964|ref|XP_974298.1| PREDICTED: similar to putative cathepsin B-like like proteinase
           [Tribolium castaneum]
 gi|270004838|gb|EFA01286.1| cathepsin B precursor [Tribolium castaneum]
          Length = 335

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 69/242 (28%), Positives = 103/242 (42%), Gaps = 62/242 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  +  W +  + G+VTGG + +  GC+  + PPC H +     P C  +  P P+C
Sbjct: 154 CNGGWPAEAWAYWAETGIVTGGKYETKDGCKAYTVPPCEH-HTEGDLPACGDI-VPTPQC 211

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C  D      ++   R    Y  + + + IQ EIM NGPV A+  +Y D  +YKSG 
Sbjct: 212 KKEC--DAGVDIEYKSDLRKGSAYQTSSDESQIQTEIMTNGPVEADFDVYEDFLNYKSG- 268

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          ++   +G YA   +        +KI+GWG E+G P          
Sbjct: 269 ---------------VYQQTTGNYAGGHA--------IKILGWGVEDGTP---------- 295

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW   +++ E +GDKG  KILRG+NE  IES + G 
Sbjct: 296 ------------------------YWLAANSWNEDWGDKGYFKILRGQNECGIESDIIGG 331

Query: 242 LP 243
           +P
Sbjct: 332 IP 333


>gi|354471594|ref|XP_003498026.1| PREDICTED: cathepsin B-like [Cricetulus griseus]
 gi|344254255|gb|EGW10359.1| Cathepsin B [Cricetulus griseus]
          Length = 339

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 69/245 (28%), Positives = 106/245 (43%), Gaps = 61/245 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S  W +  K+GLV+GG ++S+ GC P + PPC H +   S P+C T     PKC
Sbjct: 150 CNGGYPSGAWNFWIKKGLVSGGLYNSHVGCLPYTIPPCEH-HVNGSRPQC-TGEGDTPKC 207

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + +DK+     Y V++   +I  EI KNGPV     ++SD  +YKSG 
Sbjct: 208 TKSC-EAGYSPSYKEDKHYGYTSYSVSNNEKEIMAEIYKNGPVEGAFTVFSDFLTYKSG- 265

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          ++ +++G        +I+    ++I+GWG EN  PYW +   + V
Sbjct: 266 ---------------VYKHEAG--------DIMGGHAIRILGWGVENSVPYWLVANSWNV 302

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                              +GD G  KILRG +   IES +   
Sbjct: 303 ----------------------------------DWGDNGLFKILRGEDHCGIESEIVAG 328

Query: 242 LPKDN 246
           +P+ +
Sbjct: 329 IPRTD 333


>gi|121309133|dbj|BAF43801.1| Longipain [Haemaphysalis longicornis]
          Length = 341

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 74/244 (30%), Positives = 100/244 (40%), Gaps = 62/244 (25%)

Query: 2   CSSGISSSTW-VWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 60
           C+ G   + W  WVHK G+VTGG + S+ GC P     C+H    T  P C     P P+
Sbjct: 158 CNGGFPGAAWSYWVHK-GIVTGGNYDSDEGCMPYPIKACDHHVNGTLGP-CDKSIPPTPR 215

Query: 61  CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           C  R     Y   F  DK+  K+ Y V   V  IQ EIM NGPV A+  +Y         
Sbjct: 216 C-VRMCRKGYNVDFADDKHYGKKSYSVPSNVTQIQVEIMTNGPVEADFTVY--------- 265

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
                         +D   YKSGVY       +  +A ++++GWG E G P         
Sbjct: 266 --------------ADFPLYKSGVYQRHTDQALGGHA-IRLLGWGVEKGVP--------- 301

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                                    YW   +++  ++GDKG  KILRG +E  IE  V  
Sbjct: 302 -------------------------YWLAANSWNTEWGDKGFFKILRGSDECGIEDDVVA 336

Query: 241 ALPK 244
            +P+
Sbjct: 337 GIPR 340


>gi|27806671|ref|NP_776456.1| cathepsin B precursor [Bos taurus]
 gi|115312124|sp|P07688.5|CATB_BOVIN RecName: Full=Cathepsin B; AltName: Full=BCSB; Contains: RecName:
           Full=Cathepsin B light chain; Contains: RecName:
           Full=Cathepsin B heavy chain; Flags: Precursor
 gi|289402|gb|AAA03064.1| cathepsin B [Bos taurus]
 gi|809479|gb|AAA80198.1| cathepsin B [Bos taurus]
 gi|296484950|tpg|DAA27065.1| TPA: cathepsin B precursor [Bos taurus]
          Length = 335

 Score = 98.2 bits (243), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 73/236 (30%), Positives = 101/236 (42%), Gaps = 61/236 (25%)

Query: 8   SSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 67
           S  W +  K+GLV+GG ++S+ GC+P S PPC H +   S P C T     PKC   C  
Sbjct: 156 SGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKCSKTC-E 212

Query: 68  DNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPV 127
             Y   + +DK+     Y V +   +I  EI KNGPV     +YSD   YKSG Y +   
Sbjct: 213 PGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQH--- 269

Query: 128 VANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEI 187
                                 S EI+    ++I+GWG ENG PYW              
Sbjct: 270 ---------------------VSGEIMGGHAIRILGWGVENGTPYW-------------- 294

Query: 188 VAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
                  L+G             +++   +GD G  KILRG++   IES +   +P
Sbjct: 295 -------LVG-------------NSWNTDWGDNGFFKILRGQDHCGIESEIVAGMP 330


>gi|171948776|gb|ACB59245.1| cathepsin B [Sus scrofa]
          Length = 335

 Score = 98.2 bits (243), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 71/242 (29%), Positives = 103/242 (42%), Gaps = 61/242 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S  W +  K+GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + +DK+     Y ++    +I  EI KNGPV     +YSD   YKSG 
Sbjct: 208 SKIC-EPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYKNGPVEGAFTVYSDFLQYKSGV 266

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         + +++    ++I+GWG ENG PYW        
Sbjct: 267 YQH------------------------VTGDLMGGHAIRILGWGVENGTPYW-------- 294

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                        L+G             +++   +GD G  KILRG++   IES +   
Sbjct: 295 -------------LVG-------------NSWNTDWGDNGFFKILRGQDHCGIESEIVAG 328

Query: 242 LP 243
           +P
Sbjct: 329 IP 330


>gi|149698064|ref|XP_001498242.1| PREDICTED: cathepsin B [Equus caballus]
          Length = 340

 Score = 98.2 bits (243), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 72/242 (29%), Positives = 103/242 (42%), Gaps = 60/242 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  +  W +  K+GLV+GG + S+ GC+P S PPC H +   S P C       PKC
Sbjct: 150 CNGGFPAEAWNFWTKQGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPCTGEGGDTPKC 208

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + +DK+     Y V+    +I  EI KNGPV A   +YSD   YKSG 
Sbjct: 209 SKIC-EPGYSPSYKEDKHYGCSSYSVSSSEKEIMAEIFKNGPVEAAFTVYSDFLQYKSGV 267

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         + +++    V+I+GWG ENG PYW        
Sbjct: 268 YQH------------------------VAGDMMGGHAVRILGWGVENGTPYW-------- 295

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                        L+G             +++   +GD G  KILRG++   IES +   
Sbjct: 296 -------------LVG-------------NSWNTDWGDNGFFKILRGQDHCGIESEIVAG 329

Query: 242 LP 243
           +P
Sbjct: 330 IP 331


>gi|291385792|ref|XP_002709482.1| PREDICTED: cathepsin B [Oryctolagus cuniculus]
          Length = 339

 Score = 98.2 bits (243), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 70/239 (29%), Positives = 104/239 (43%), Gaps = 61/239 (25%)

Query: 8   SSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 67
           S  W +  K+GLV+GG + S+ GC+P S PPC H +   S P C T     P+C   C  
Sbjct: 156 SGAWNFWTKKGLVSGGLYDSHVGCKPYSIPPCEH-HVNGSRPAC-TGEGDTPRCSKTC-E 212

Query: 68  DNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPV 127
             Y   + +DK+     Y V+ +  +I+ EI KNGPV     +YSD   YKSG Y +   
Sbjct: 213 PGYSPSYKEDKHYGYSSYSVSSDENEIKAEIYKNGPVEGAFTVYSDFLMYKSGVYQH--- 269

Query: 128 VANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEI 187
                                 + +I+    ++I+GWGEENG P                
Sbjct: 270 ---------------------TTGDIMGGHAIRILGWGEENGVP---------------- 292

Query: 188 VAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDN 246
                             YW + +++   +GDKG  KILRG++   IES +   +P+ +
Sbjct: 293 ------------------YWLVANSWNTDWGDKGFFKILRGQDHCGIESEIVAGIPRTD 333


>gi|440913587|gb|ELR63025.1| Cathepsin B [Bos grunniens mutus]
          Length = 335

 Score = 98.2 bits (243), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 73/236 (30%), Positives = 101/236 (42%), Gaps = 61/236 (25%)

Query: 8   SSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 67
           S  W +  K+GLV+GG ++S+ GC+P S PPC H +   S P C T     PKC   C  
Sbjct: 156 SGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKCSKTC-E 212

Query: 68  DNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPV 127
             Y   + +DK+     Y V +   +I  EI KNGPV     +YSD   YKSG Y +   
Sbjct: 213 PGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQH--- 269

Query: 128 VANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEI 187
                                 S EI+    ++I+GWG ENG PYW              
Sbjct: 270 ---------------------VSGEIMGGHAIRILGWGVENGTPYW-------------- 294

Query: 188 VAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
                  L+G             +++   +GD G  KILRG++   IES +   +P
Sbjct: 295 -------LVG-------------NSWNTDWGDNGFFKILRGQDHCGIESEIVAGMP 330


>gi|17565158|ref|NP_503384.1| Protein W07B8.1 [Caenorhabditis elegans]
 gi|351059396|emb|CCD74286.1| Protein W07B8.1 [Caenorhabditis elegans]
          Length = 335

 Score = 98.2 bits (243), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 62/246 (25%), Positives = 97/246 (39%), Gaps = 59/246 (23%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W ++ K G+ TGG++ S  GC+P S PPC       + P C    +P P C
Sbjct: 148 CEGGNPFKAWQYIQKHGIPTGGSYESQFGCKPYSIPPCGKTVGNVTYPACTNTTSPTPSC 207

Query: 62  HTRCTND-NYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
             +CT+   Y     +D++       + +   +IQ ++M NGP+ A   +Y D   Y +G
Sbjct: 208 EKKCTSRIGYPIDIDKDRHYGVSVDQLPNSQIEIQSDVMLNGPIQATFEVYDDFLQYTTG 267

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
            Y                        V  +     + +V+I+GWG   G P         
Sbjct: 268 IY------------------------VHLTGNKQGHLSVRIIGWGVWQGVP--------- 294

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                                    YW   +++G Q+G+ GT ++LRG NE  +ES    
Sbjct: 295 -------------------------YWLCANSWGRQWGENGTFRVLRGTNECGLESNCVS 329

Query: 241 ALPKDN 246
            +PK N
Sbjct: 330 GMPKLN 335


>gi|18921171|ref|NP_572920.1| cathepsin B1, isoform A [Drosophila melanogaster]
 gi|7292926|gb|AAF48317.1| cathepsin B1, isoform A [Drosophila melanogaster]
 gi|16767940|gb|AAL28188.1| GH06546p [Drosophila melanogaster]
 gi|220944992|gb|ACL85039.1| CG10992-PA [synthetic construct]
 gi|220954816|gb|ACL89951.1| CG10992-PA [synthetic construct]
          Length = 340

 Score = 98.2 bits (243), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 70/243 (28%), Positives = 104/243 (42%), Gaps = 59/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   + W +  ++G+V+GG + SN GC+P    PC H +   + P C       PKC
Sbjct: 156 CNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPYEISPCEH-HVNGTRPPCAH-GGRTPKC 213

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C +  Y   + +DK+   + Y V   V +IQ+EIM NGPV     +Y          
Sbjct: 214 SHVCQS-GYTVDYAKDKHFGSKSYSVRRNVREIQEEIMTNGPVEGAFTVY---------- 262

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                         D+  YK GVY      E+  +A ++I+GWG                
Sbjct: 263 -------------EDLILYKDGVYQHEHGKELGGHA-IRILGWGV--------------- 293

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                           WGEE   PYW I +++   +GD G  +ILRG++   IES ++  
Sbjct: 294 ----------------WGEEK-IPYWLIGNSWNTDWGDHGFFRILRGQDHCGIESSISAG 336

Query: 242 LPK 244
           LPK
Sbjct: 337 LPK 339


>gi|326916753|ref|XP_003204669.1| PREDICTED: cathepsin B-like [Meleagris gallopavo]
          Length = 340

 Score = 98.2 bits (243), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 69/243 (28%), Positives = 96/243 (39%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S  W +  +RGLV+GG + S+ GC+P + PPC H +   S P C       P+C
Sbjct: 150 CNGGYPSGAWRYWTERGLVSGGLYDSHVGCRPYTIPPCEH-HVNGSRPPCTGEGGETPRC 208

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + +DK+     Y V     +I  EI KNGPV     +Y D   YKSG 
Sbjct: 209 SRHC-EPGYSPSYKEDKHYGITSYGVPRSEKEIMAEIYKNGPVEGAFIVYEDFLMYKSGV 267

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         S E V    ++I+GWG ENG P          
Sbjct: 268 YQH------------------------VSGEQVGGHAIRILGWGVENGTP---------- 293

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW   +++   +GD G  KILRG +   IES +   
Sbjct: 294 ------------------------YWLAANSWNTDWGDNGFFKILRGEDHCGIESEIVAG 329

Query: 242 LPK 244
           +P+
Sbjct: 330 VPR 332


>gi|195352458|ref|XP_002042729.1| GM17589 [Drosophila sechellia]
 gi|194126760|gb|EDW48803.1| GM17589 [Drosophila sechellia]
          Length = 340

 Score = 98.2 bits (243), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 69/243 (28%), Positives = 105/243 (43%), Gaps = 59/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   + W +  ++G+V+GG + SN GC+P    PC H +   + P C    +  PKC
Sbjct: 156 CNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPYEISPCEH-HVNGTRPPCAN-GSGTPKC 213

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C + +Y   + +DK+   + Y V   V +IQ+EIM NGPV     +Y D        
Sbjct: 214 SHVCQS-SYTVDYAKDKHFGSKSYSVKRNVREIQEEIMTNGPVEGAFTVYED-------- 264

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          +  YK GVY      E+  +A ++I+GWG                
Sbjct: 265 ---------------LILYKDGVYQHEHGKELGGHA-IRILGWGV--------------- 293

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                           WG E   PYW I +++   +GD G  +ILRG++   IES ++  
Sbjct: 294 ----------------WGNEK-IPYWLIGNSWNTDWGDHGFFRILRGQDHCGIESSISAG 336

Query: 242 LPK 244
           LPK
Sbjct: 337 LPK 339


>gi|160333103|ref|NP_001103948.1| capthepsin B, b precursor [Danio rerio]
 gi|133777414|gb|AAI15255.1| Ctsbb protein [Danio rerio]
          Length = 326

 Score = 97.8 bits (242), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 74/243 (30%), Positives = 102/243 (41%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           CS G  +  W +  + GLVTGG ++S+ GC+P S  PC H +   + P C       PKC
Sbjct: 144 CSGGFPAEAWDYWRRSGLVTGGLYNSDVGCRPYSIAPCEH-HVNGTRPPCSG-EQDTPKC 201

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+   + Y V  +   I  E+  NGPV A   +Y D        
Sbjct: 202 TGVCI-PKYSVPYKQDKHFGSKVYNVPSDQQQIMTELYTNGPVEAAFTVYEDF------- 253

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
               P+            YKSGVY     + +  +A VKI+GWGEENG P          
Sbjct: 254 ----PL------------YKSGVYQHLTGSALGGHA-VKILGWGEENGTP---------- 286

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   +W + +++   +GD G  KILRG +E  IES +   
Sbjct: 287 ------------------------FWLVANSWNSDWGDNGYFKILRGHDECGIESEMVAG 322

Query: 242 LPK 244
           LPK
Sbjct: 323 LPK 325


>gi|195566634|ref|XP_002106884.1| GD15875 [Drosophila simulans]
 gi|194204277|gb|EDX17853.1| GD15875 [Drosophila simulans]
          Length = 340

 Score = 97.8 bits (242), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 70/243 (28%), Positives = 104/243 (42%), Gaps = 59/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   + W +  ++G+V+GG + SN GC+P    PC H    T  P      TP  KC
Sbjct: 156 CNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPYEISPCEHHVNGTRPPCAHGGGTP--KC 213

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C + +Y   + +DK+   + Y V   V +IQ+EIM NGPV     +Y D        
Sbjct: 214 SHVCQS-SYTVDYAKDKHFGSKSYSVKRNVREIQEEIMTNGPVEGAFTVYED-------- 264

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          +  YK GVY      E+  +A ++I+GWG                
Sbjct: 265 ---------------LILYKDGVYQHEHGKELGGHA-IRILGWGV--------------- 293

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                           WG+E   PYW I +++   +GD G  +ILRG++   IES ++  
Sbjct: 294 ----------------WGDEK-IPYWLIGNSWNTDWGDHGFFRILRGQDHCGIESSISAG 336

Query: 242 LPK 244
           LPK
Sbjct: 337 LPK 339


>gi|38147393|gb|AAR12009.1| cathepsin B-like proteinase [Triatoma infestans]
          Length = 332

 Score = 97.8 bits (242), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 67/243 (27%), Positives = 100/243 (41%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G  +S W +    G+V+GG + S  GCQP S  PC H +     P C    +  P C
Sbjct: 150 CDGGYPASAWDYWQNVGIVSGGNYGSKQGCQPYSIAPCEH-HVPGPRPACSGEGS-TPDC 207

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C +   G  + +D Y  +  Y + DE   IQ EI+KNGPV A   +Y D+ +YK G 
Sbjct: 208 RNQC-DKRSGISYDKDLYYGESAYSLEDEAKQIQAEILKNGPVEAAFTVYEDLVNYKEGV 266

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         +  ++    +KI+GWG EN  P          
Sbjct: 267 YQH------------------------VAGSVLGGHAIKILGWGVENDTP---------- 292

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW + +++   +G+ G  KILRG++E  IE  V+  
Sbjct: 293 ------------------------YWLVANSWNTDWGNNGFFKILRGKDECGIEIDVSAG 328

Query: 242 LPK 244
           LP+
Sbjct: 329 LPR 331


>gi|3087803|emb|CAA93279.1| cysteine protease [Haemonchus contortus]
          Length = 325

 Score = 97.4 bits (241), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 65/174 (37%), Positives = 83/174 (47%), Gaps = 26/174 (14%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +V + G VTGG   + + C+   FPPC H    T   EC   A   PKC
Sbjct: 163 CQGGWPIEAWEYVAREGAVTGGRLLAKSCCRSHPFPPCGHHGNETYYGECGGRAR-TPKC 221

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
            T CT   Y   +  DK R K  Y + + V  IQ+EIMKNGPVVA   +Y+D FSY    
Sbjct: 222 RTSCT-PGYKNSYSDDKIRGKDAYELPNSVKAIQREIMKNGPVVAAFTVYAD-FSY---- 275

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
                             YK G+Y  +A     ++A VK++GWGEE   PYW +
Sbjct: 276 ------------------YKKGIYKHTAGRARGSHA-VKVIGWGEEGDVPYWIV 310


>gi|161343829|tpg|DAA06095.1| TPA_inf: cathepsin B [Aphis gossypii]
          Length = 280

 Score = 97.4 bits (241), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 58/171 (33%), Positives = 77/171 (45%), Gaps = 25/171 (14%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQ-PK 60
           C  G    +W +  + G V+GG ++SN GCQP   PPC   N  +    C T    + P 
Sbjct: 132 CDGGSQFESWDFYRRHGFVSGGDYNSNQGCQPYMIPPCKLINEKSPRHSCTTYNREETPA 191

Query: 61  CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           C  +C N NY   F  D Y+ K YY V   +A   +EI  NGP+    Y+Y D+  YKSG
Sbjct: 192 CEIKCNNPNYYSSFKTDIYKGK-YYQVYPFMA--MKEIFDNGPITTQFYMYRDLIDYKSG 248

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRP 171
                           ++ Y  G Y      +       KI+GWGEENG P
Sbjct: 249 ----------------VYQYDEGFY-----GDFFTVQGXKIIGWGEENGDP 278


>gi|496317|dbj|BAA04103.1| Sarcophaga pro-cathepsin B [Sarcophaga peregrina]
          Length = 344

 Score = 97.4 bits (241), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 65/243 (26%), Positives = 100/243 (41%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   + W +  ++G+V+GG + S+ GC+P    PC H +   + P C       P C
Sbjct: 161 CNGGFPGAAWAYWTRKGIVSGGPYGSSQGCRPYEIAPCEH-HVNGTRPPCDGEHGKTPSC 219

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C   +Y   +  DK+   + Y V   V DIQ+EIM+NGPV     +Y D+  YK G 
Sbjct: 220 RHECQK-SYDVDYKTDKHFGSKSYSVKRNVKDIQKEIMQNGPVEGAFTVYEDLILYKDG- 277

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                      +Y  +   + G +A            ++I+GWG EN  PY         
Sbjct: 278 -----------VYQHVHGRELGGHA------------IRILGWGVENKTPY--------- 305

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                    W I +++   +G+ G  K+LRG +   IES +   
Sbjct: 306 -------------------------WLIANSWNTDWGNNGFFKMLRGEDHCGIESAIAAG 340

Query: 242 LPK 244
           LPK
Sbjct: 341 LPK 343


>gi|48762476|dbj|BAD23809.1| cathepsin B-S [Tuberaphis styraci]
 gi|204022069|dbj|BAG71132.1| cathepsin B-S1 [Tuberaphis styraci]
          Length = 349

 Score = 97.4 bits (241), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 70/249 (28%), Positives = 102/249 (40%), Gaps = 64/249 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +   +G+ TGG + +  GC P   PPC       +         P  + 
Sbjct: 154 CGGGYPIKAWKYFRTQGVTTGGDYDTKEGCMPYKVPPCYDEQGKNT-----CGGKPMERN 208

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           H +C    YG+   QD+Y+ K  Y +N  +  I+Q++M  GPV A+  +Y D FS     
Sbjct: 209 H-QCPKTCYGKTTVQDRYKTKNEYVIN-SIETIEQDLMTYGPVEASFDVYDD-FSV---- 261

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                             YKSG+Y  +  A+     ++KI+GWGEENG PYW  V  ++ 
Sbjct: 262 ------------------YKSGIYRKTPKAKYEGGHSIKIIGWGEENGTPYWLAVNSWS- 302

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   +W          GD GT KI++GRNE  IE  V   
Sbjct: 303 -----------------------KFW----------GDHGTFKIIKGRNECGIERAVTAG 329

Query: 242 LPKDNYGVE 250
           +P  + G +
Sbjct: 330 IPSTSRGPQ 338


>gi|118358710|ref|XP_001012596.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89294363|gb|EAR92351.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 346

 Score = 97.4 bits (241), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 72/227 (31%), Positives = 96/227 (42%), Gaps = 60/227 (26%)

Query: 18  GLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRC-TNDNYGRGFFQ 76
           GLVTG  + +N+ CQ  S  PC H   +   P C T   P P C   C +N  Y   + +
Sbjct: 177 GLVTGDLYGNNSWCQAYSLAPCAHHVTSDVYPPC-TGELPTPPCVKSCDSNSTYTIPYPK 235

Query: 77  DKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSD 136
           D ++  + Y ++     I  EI  NGP+     +Y D                       
Sbjct: 236 DLHKGSKAYSIDQNEQAIMTEIQTNGPIEVAFTVYED----------------------- 272

Query: 137 IFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLI 196
             +YKSGVY     +E+  +A VK+VGWG ENG PY                        
Sbjct: 273 FLTYKSGVYQHVTGSELGGHA-VKMVGWGVENGTPY------------------------ 307

Query: 197 GWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
                     W IV+++ E +GDKGT KILRG+NE  IES    ALP
Sbjct: 308 ----------WIIVNSWNESWGDKGTFKILRGQNECGIESECVTALP 344


>gi|268557308|ref|XP_002636643.1| Hypothetical protein CBG23351 [Caenorhabditis briggsae]
          Length = 351

 Score = 97.4 bits (241), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 72/243 (29%), Positives = 103/243 (42%), Gaps = 59/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G     W    K+G VTGG++   +GC+P  +PPC H    T    C +   P  KC
Sbjct: 167 CNGGYPIEAWRHYVKKGYVTGGSYQEKSGCKPYPYPPCEHHVNGTHYKPCPSNMYPTDKC 226

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QD +  +  Y V+ + A+IQ+EIM +GPV     +Y D F + SG 
Sbjct: 227 EHSC-QAGYPLTYTQDLHFGQSAYAVSKKPAEIQKEIMTHGPVEVAFTVYED-FEHYSG- 283

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                                GVY  +A A +  +A VK++GWG +NG P          
Sbjct: 284 ---------------------GVYVHTAGASLGGHA-VKMLGWGVDNGTP---------- 311

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW   +++ E +G+ G  +I+RG NE  IES V G 
Sbjct: 312 ------------------------YWLCANSWNEDWGENGYFRIIRGVNECGIESGVVGG 347

Query: 242 LPK 244
            PK
Sbjct: 348 TPK 350


>gi|50540542|ref|NP_998501.1| cathepsin B, a precursor [Danio rerio]
 gi|34784038|gb|AAH56688.1| Cathepsin B, a [Danio rerio]
 gi|37681773|gb|AAQ97764.1| cathepsin B [Danio rerio]
 gi|41351445|gb|AAH65589.1| Cathepsin B, a [Danio rerio]
          Length = 330

 Score = 97.4 bits (241), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 70/242 (28%), Positives = 94/242 (38%), Gaps = 60/242 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S+ W +    GLVTGG ++S+ GC+P +  PC H +   S P C       P C
Sbjct: 148 CNGGYPSAAWDFWATEGLVTGGLYNSHIGCRPYTIEPCEH-HVNGSRPPCSGEGGDTPNC 206

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C    Y   + QDK+  K  Y V      I  E+ KNGPV     +Y D   YKSG 
Sbjct: 207 DMKC-EPGYSPSYKQDKHFGKTSYSVPSNQNSIMAELFKNGPVEGAFTVYEDFLLYKSGV 265

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         S   V    +KI+GWGEENG P          
Sbjct: 266 YQH------------------------MSGSPVGGHAIKILGWGEENGVP---------- 291

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW   +++   +GD G  KILRG +   IES +   
Sbjct: 292 ------------------------YWLAANSWNTDWGDNGYFKILRGEDHCGIESEIVAG 327

Query: 242 LP 243
           +P
Sbjct: 328 IP 329


>gi|56756907|gb|AAW26625.1| unknown [Schistosoma japonicum]
          Length = 342

 Score = 97.4 bits (241), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 70/243 (28%), Positives = 104/243 (42%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H         C       P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +  C    Y   + QDK+     Y V    + IQ++IM +GPV A + +Y D  +YKSG 
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 275

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ Y +G Y        ++   V+++GWG EN             
Sbjct: 276 ---------------IYRYTTGKY--------ISGHAVRLIGWGVEN------------- 299

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                G  YW   +T+ E +G+KG  +I+RGRNE +IES +   
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAG 338

Query: 242 LPK 244
           L K
Sbjct: 339 LIK 341


>gi|313229093|emb|CBY18245.1| unnamed protein product [Oikopleura dioica]
          Length = 355

 Score = 97.4 bits (241), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 75/240 (31%), Positives = 100/240 (41%), Gaps = 60/240 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   +   +   RGLVTGG + +   CQP +   C H +     P C T     PKC
Sbjct: 163 CNGGYPLAAMEYFVTRGLVTGGLYGTKDTCQPYTLEACEH-HVPGDRPPC-TEGGGTPKC 220

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C  D   + +  DK    + Y V ++V  IQQEIM  GPV A   +YS         
Sbjct: 221 SHQCIPDYTTKAYKDDKVHGHKAYSVPNDVGKIQQEIMHYGPVEAAFTVYS--------- 271

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                         D  SYKSGVY  ++ +E+  +A +KI+GWG E G            
Sbjct: 272 --------------DFPSYKSGVYRHTSGSELGGHA-IKIIGWGTEGG------------ 304

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW I +++   +GDKGT KILRG NE  IE  V  A
Sbjct: 305 ----------------------DDYWLINNSWNSDWGDKGTFKILRGSNECGIEGEVVAA 342


>gi|209863079|ref|NP_001119613.2| cathepsin B precursor [Acyrthosiphon pisum]
          Length = 323

 Score = 97.4 bits (241), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 72/246 (29%), Positives = 97/246 (39%), Gaps = 66/246 (26%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPK 60
           C  G +   W     +G+VTGG   SN GCQP    PC+H  Y  S    C +L   Q  
Sbjct: 134 CDGGSAFKAWELTMNKGIVTGGNFDSNEGCQPYKNRPCDH--YGDSRLTNCSSLRRTQMT 191

Query: 61  -CHTRCTNDNYGRGFFQDKYRFKRYY---WVNDEVADIQQEIMKNGPVVANMYLYSDIFS 116
            C  +C N NY   +  D ++    Y   W N  V  IQQEIM  GPV A MY+Y +   
Sbjct: 192 VCRKKCVNKNYKVKYEDDLHKTSIVYMTSWTN--VKQIQQEIMTYGPVTAFMYVYENFMG 249

Query: 117 YKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
           YK G Y                         S + E++ Y  VK++GWG +         
Sbjct: 250 YKEGIYK------------------------STTGELIGYHHVKLIGWGVDG-------- 277

Query: 177 RVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
                                    +G  YW  ++++   +G+ G  KILRG N   IE 
Sbjct: 278 -------------------------DGTEYWLAMNSWNSNWGNDGLFKILRGYNFCSIEL 312

Query: 237 LVNGAL 242
           LV   +
Sbjct: 313 LVMAGI 318


>gi|170586854|ref|XP_001898194.1| cathepsin B-like cysteine proteinase [Brugia malayi]
 gi|158594589|gb|EDP33173.1| cathepsin B-like cysteine proteinase, putative [Brugia malayi]
          Length = 384

 Score = 97.4 bits (241), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 74/246 (30%), Positives = 110/246 (44%), Gaps = 58/246 (23%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPK 60
           C  G   + W +    G+VTG  + +++GC+P  FPPC +H+N T  EP CK    P PK
Sbjct: 190 CFGGEPMAAWKYWVLSGIVTGSDYTNHSGCRPYPFPPCEHHSNKTHYEP-CKHDLYPTPK 248

Query: 61  CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           C+ +C + NY + +  DKY  ++ Y V ++V  IQ+EIM  GPV A+  +Y+D   Y SG
Sbjct: 249 CYKQC-DKNYTKSYKADKYYGEQAYNVENDVESIQKEIMTLGPVEASFEVYTDFLHYTSG 307

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
                       +Y  +     G +A            VKI+GWG + G  YW     + 
Sbjct: 308 ------------IYKHVAGSVGGGHA------------VKILGWGIDQGVSYWLAANSWN 343

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                            WGE+           F       G  +ILRG +E  IES +  
Sbjct: 344 ND---------------WGED----------VFS------GYFRILRGADECGIESGIVA 372

Query: 241 ALPKDN 246
            +P+ +
Sbjct: 373 GIPRKD 378


>gi|74213457|dbj|BAE35542.1| unnamed protein product [Mus musculus]
          Length = 339

 Score = 97.4 bits (241), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 71/251 (28%), Positives = 108/251 (43%), Gaps = 62/251 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S  W +  K+GLV+GG ++S+ GC P + PPC H +   S P C T      +C
Sbjct: 150 CNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEH-HVNGSRPPC-TGEGDTHRC 207

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +  C    Y   + +DK+     Y V++ V +I  EI KNGPV     ++SD  +YKSG 
Sbjct: 208 NKSC-EAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNGPVEGAFTVFSDFLTYKSG- 265

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          ++ +++G        +++    ++I+GWG ENG PYW     + +
Sbjct: 266 ---------------VYKHEAG--------DMMGGHAIRILGWGVENGVPYWLAANSWNL 302

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                              +GD G  KILRG N   IES +   
Sbjct: 303 ----------------------------------DWGDNGFFKILRGENHCGIESEIVAG 328

Query: 242 LPK-DNYGVEF 251
           +P+ D Y   F
Sbjct: 329 IPRTDQYWGRF 339


>gi|226473756|emb|CAX71563.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 97.4 bits (241), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 70/243 (28%), Positives = 104/243 (42%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H         C       P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +  C    Y   + QDK+     Y V    + IQ++IM +GPV A + +Y D  +YKSG 
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 275

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ Y +G Y        ++   V+++GWG EN             
Sbjct: 276 ---------------IYRYTTGKY--------ISGHAVRLIGWGVEN------------- 299

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                G  YW   +T+ E +G+KG  +I+RGRNE +IES +   
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAG 338

Query: 242 LPK 244
           L K
Sbjct: 339 LIK 341


>gi|40557606|gb|AAR88096.1| cathepsin B-like cysteine protease [Callosobruchus maculatus]
          Length = 330

 Score = 97.4 bits (241), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 71/244 (29%), Positives = 103/244 (42%), Gaps = 67/244 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  GI S T+      G V+GG ++S  GC     P CN        P CKTL    P C
Sbjct: 153 CHGGIPSFTFTEWKDSGFVSGGEYNSTNGCMSYPLPRCN--------PSCKTLYDA-PTC 203

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVA-DIQQEIMKNGPVVANMYLYSDIFSYKSG 120
              C   +  + + +DK+  K+ Y +  +V   IQ EI+KNGPVV               
Sbjct: 204 KKECDKGSPLK-YEEDKHYAKQAYRIMSKVERQIQLEIIKNGPVV--------------- 247

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
                   A+  +Y+D   Y SGVY     ++++    V+I+GWG ENG           
Sbjct: 248 --------ASFTVYADFIHYLSGVYKFDGESKLLGGHAVRIIGWGIENGT---------- 289

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                                   PYW + +++ E++GD+G  KI RG+NE  IE  +  
Sbjct: 290 -----------------------YPYWLVSNSWNERWGDQGLFKIWRGKNECGIEEEITA 326

Query: 241 ALPK 244
            LP+
Sbjct: 327 GLPR 330


>gi|56757271|gb|AAW26807.1| unknown [Schistosoma japonicum]
          Length = 342

 Score = 97.4 bits (241), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 67/237 (28%), Positives = 104/237 (43%), Gaps = 60/237 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G++  +W +  K G+VTGG+  ++TGC+P  FP C+H         C       P+C
Sbjct: 159 CDGGVTGYSWDYWVKHGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+  +  Y V    + IQ+EIM  GPV A +++Y D  +YKSG 
Sbjct: 218 KQTCQK-GYNTSYEQDKHYGEFSYNVIGVESVIQKEIMMYGPVEAYLHIYEDFLNYKSG- 275

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ Y +G        + ++   V+++GWG EN             
Sbjct: 276 ---------------IYRYTTG--------QFISGHAVRLIGWGVEN------------- 299

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                G  YW   +T+ E +G+KG  +I+RGR+E +IES +
Sbjct: 300 ---------------------GTSYWLAANTWNEDWGEKGYFRIVRGRDECLIESFI 335


>gi|56759588|gb|AAW28820.1| Parcxpwnx02 [Periplaneta americana]
          Length = 343

 Score = 97.4 bits (241), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 69/243 (28%), Positives = 108/243 (44%), Gaps = 62/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   + W +    G+V+GG+++S+ GCQP +  PC H    T +P C    T  P+C
Sbjct: 162 CNGGEPGAAWDYWVSTGIVSGGSYNSHQGCQPYAIEPCEHHVNGTRKP-CGEGDT--PRC 218

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             RC  + Y   + +D++  K  Y V   V  IQ+E++ NGP  A + +Y D   Y++G 
Sbjct: 219 VKRC-EEGYDVPYGKDRHFGKSAYAVPGSVKAIQKELLLNGPAEAALTVYDDFLHYRTGV 277

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                      VS  A  +    V+++GWG E+G P          
Sbjct: 278 YQH----------------------VSGGA--LGGHAVRLLGWGVEDGTP---------- 303

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW + +++   +GD G  +ILRG++E  IES +NG 
Sbjct: 304 ------------------------YWLLANSWNYDWGDNGYFRILRGQDECGIESDINGG 339

Query: 242 LPK 244
           LPK
Sbjct: 340 LPK 342


>gi|56754499|gb|AAW25437.1| unknown [Schistosoma japonicum]
          Length = 342

 Score = 97.4 bits (241), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 70/243 (28%), Positives = 104/243 (42%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H         C       P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +  C    Y   + QDK+     Y V    + IQ++IM +GPV A + +Y D  +YKSG 
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 275

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ Y +G Y        ++   V+++GWG EN             
Sbjct: 276 ---------------IYRYTTGKY--------ISGHAVRLIGWGVEN------------- 299

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                G  YW   +T+ E +G+KG  +I+RGRNE +IES +   
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAG 338

Query: 242 LPK 244
           L K
Sbjct: 339 LIK 341


>gi|204022102|dbj|BAG71148.1| cathepsin B-N2 [Tuberaphis takenouchii]
          Length = 334

 Score = 97.4 bits (241), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 75/245 (30%), Positives = 107/245 (43%), Gaps = 67/245 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W    K GLVTGG ++S  GCQP   PPC    Y  +   C+    P  K 
Sbjct: 154 CHGGYPIKAWERFRKHGLVTGGDYNSGEGCQPYRVPPCPFDEYGNNT--CR--GKPAEKN 209

Query: 62  HTRCTNDNYGRGF--FQDKYRFKR-YYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
           H RCT   YG     F++ +R+ R  Y++N ++  IQ ++M  GP+ A         SY 
Sbjct: 210 H-RCTRMCYGNQNLDFKEDHRYTRDAYYLNYQI--IQNDLMTYGPIEA---------SYD 257

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
                         +Y D  +YKSGVY  + +A  +    VK++GWGEE G PY      
Sbjct: 258 --------------VYDDFPNYKSGVYMKTENASYLGGHAVKLIGWGEEYGVPY------ 297

Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                       W +V+++ +Q+GD+G  KI RG NE  I++  
Sbjct: 298 ----------------------------WLLVNSWNDQWGDQGLFKIRRGTNECGIDNST 329

Query: 239 NGALP 243
            G +P
Sbjct: 330 TGGVP 334


>gi|29374025|gb|AAO73003.1| cathepsin B [Fasciola gigantica]
          Length = 339

 Score = 97.1 bits (240), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 64/243 (26%), Positives = 97/243 (39%), Gaps = 59/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +  + G+VTGG   + TGCQP  F  C+H   +     C     P P C
Sbjct: 155 CRGGYPPKAWDYWMREGIVTGGTWENRTGCQPWMFTKCDHVGDSRKYSRCPHYTYPTPPC 214

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y + + QDK+     Y V +  + I QEIMKNGPV     ++ D   Y+SG 
Sbjct: 215 ARACQT-GYNKTYEQDKFYGNSSYNVGEHESYIMQEIMKNGPVEVTFAIFQDFGVYRSGI 273

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         + + +    V+++GWG EN             
Sbjct: 274 YHH------------------------VAGKFIGRHAVRMIGWGVEN------------- 296

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                G  YW + +++ E++G+ G  +++RGRNE  IES V   
Sbjct: 297 ---------------------GVNYWLMANSWNEEWGENGYFRMVRGRNECGIESEVVAG 335

Query: 242 LPK 244
           +P+
Sbjct: 336 MPR 338


>gi|5764077|emb|CAB53367.1| necpain [Necator americanus]
          Length = 339

 Score = 97.1 bits (240), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 62/241 (25%), Positives = 102/241 (42%), Gaps = 59/241 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           CS G     W WV K G+ TGG + +   C+P +F PC +         C   + P P+C
Sbjct: 155 CSGGWPFQAWEWVRKYGVCTGGDYRAKGVCKPYAFHPCGNHENQVYYGVCPKGSWPTPRC 214

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y + + +DK+  K+ YW+ ++  +I+ +IMKNGPV A   +Y D   YK G 
Sbjct: 215 EKFCQR-GYIKPYKKDKFYAKKSYWLPNDEKEIRLDIMKNGPVQAAFDVYEDFKLYKRG- 272

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ +K G+     +        VKI+GWG++NG  Y         
Sbjct: 273 ---------------IYKHKEGIQTGGHA--------VKIIGWGKDNGTDY--------- 300

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                    W I +++ + +G+ G  +++RG N+  IE ++   
Sbjct: 301 -------------------------WLIANSWSKDWGESGFFRMVRGENDCEIEDMITAG 335

Query: 242 L 242
           +
Sbjct: 336 I 336


>gi|161671340|gb|ABX75522.1| cathepsin b [Lycosa singoriensis]
          Length = 247

 Score = 97.1 bits (240), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 70/243 (28%), Positives = 96/243 (39%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   S W +   +G+ TGG  +S+ GCQP   P C H + T   P C  +    PKC
Sbjct: 65  CDGGFPPSAWEFWVDKGIATGGLWNSHIGCQPYEIPACEH-HTTGDRPPCSDIVD-TPKC 122

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   +  DK+  K+ Y +      IQ EI KNGPV     +YSD  +YKSG 
Sbjct: 123 VHLCEK-GYNTSYRDDKHFGKKSYSIESLEQQIQTEIFKNGPVEGAFSVYSDFINYKSGV 181

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         S E +    ++++GWG EN  P          
Sbjct: 182 YQH------------------------HSGESLGGHAIRVLGWGYENDVP---------- 207

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW   +++   +GDKG  KILRG +E  IES +   
Sbjct: 208 ------------------------YWLCANSWNTDWGDKGYFKILRGSDECGIESSIVAG 243

Query: 242 LPK 244
           +PK
Sbjct: 244 IPK 246


>gi|289743429|gb|ADD20462.1| putative cathepsin B-like cysteine proteinase precursor [Glossina
           morsitans morsitans]
          Length = 340

 Score = 97.1 bits (240), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 66/243 (27%), Positives = 102/243 (41%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   + W +  ++G+V+GG + S+ GC+P    PC H +   + P C+      P+C
Sbjct: 157 CNGGFPGAAWGYWVRKGIVSGGPYGSSQGCRPYEIAPCEH-HVNGTRPPCEKEYGKTPRC 215

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C   +Y   +  DK+   R Y ++  V DIQ EIM NGPV     +Y          
Sbjct: 216 QHKC-QASYKVDYKTDKHFGSRAYSISKNVRDIQGEIMTNGPVEGAFTVY---------- 264

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                         D+  YK GVY      E+  +A ++I+GWG E   PY         
Sbjct: 265 -------------EDLILYKDGVYEHVHGKELGGHA-IRIIGWGVEKDTPY--------- 301

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                    W I +++   +G+ G  KILRG++   IES ++  
Sbjct: 302 -------------------------WLIANSWNTDWGNNGFFKILRGKDHCGIESSISAG 336

Query: 242 LPK 244
           LPK
Sbjct: 337 LPK 339


>gi|226473762|emb|CAX71566.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
 gi|226474170|emb|CAX71571.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 97.1 bits (240), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 70/243 (28%), Positives = 104/243 (42%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H         C       P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +  C    Y   + QDK+     Y V    + IQ++IM +GPV A + +Y D  +YKSG 
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYIEIYEDFLNYKSG- 275

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ Y +G Y        ++   V+++GWG EN             
Sbjct: 276 ---------------IYRYTTGKY--------ISGHAVRLIGWGVEN------------- 299

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                G  YW   +T+ E +G+KG  +I+RGRNE +IES +   
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAG 338

Query: 242 LPK 244
           L K
Sbjct: 339 LIK 341


>gi|226474184|emb|CAX71578.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 97.1 bits (240), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 70/243 (28%), Positives = 104/243 (42%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H         C       P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +  C    Y   + QDK+     Y V    + IQ++IM +GPV A + +Y D  +YKSG 
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMVHGPVEAYLEIYEDFLNYKSG- 275

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ Y +G Y        ++   V+++GWG EN             
Sbjct: 276 ---------------IYRYTTGKY--------ISGHAVRLIGWGVEN------------- 299

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                G  YW   +T+ E +G+KG  +I+RGRNE +IES +   
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAG 338

Query: 242 LPK 244
           L K
Sbjct: 339 LIK 341


>gi|4204370|gb|AAD11445.1| cathepsin B protease, partial [Fasciola hepatica]
          Length = 247

 Score = 97.1 bits (240), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 64/243 (26%), Positives = 97/243 (39%), Gaps = 59/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +  + G+VTGG   + TGCQP  F  C+H   +     C     P P C
Sbjct: 63  CRGGYPPKAWDYWMREGIVTGGTWENRTGCQPWMFTKCDHVGDSRKYSRCPHYTYPTPPC 122

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y + + QDK+     Y V +  + I QEIMKNGPV     ++ D   Y+SG 
Sbjct: 123 ARAC-QTGYNKTYEQDKFYGNSSYNVGEHESYIMQEIMKNGPVEVTFAIFQDFGVYRSGI 181

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         + + +    V+++GWG EN             
Sbjct: 182 YHH------------------------VAGKFIGRHAVRMIGWGVEN------------- 204

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                G  YW + +++ E++G+ G  +++RGRNE  IES V   
Sbjct: 205 ---------------------GVNYWLMANSWNEEWGENGYFRMVRGRNECGIESEVVAG 243

Query: 242 LPK 244
           +P+
Sbjct: 244 MPR 246


>gi|45822211|emb|CAE47502.1| cathepsin B-like proteinase [Diabrotica virgifera virgifera]
          Length = 331

 Score = 97.1 bits (240), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 74/242 (30%), Positives = 103/242 (42%), Gaps = 65/242 (26%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G  +  W +    G+ TGG + S  GCQP S  PC H +   ++ +C TL    P C
Sbjct: 149 CEGGYPTMAWSYWIDTGITTGGLYGSKQGCQPYSLQPCEH-HTEGNKVQCSTLDYDTPSC 207

Query: 62  HTRCTND--NYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
             +C +   NY           + +Y     VA+IQ+EI+ NGPV A   +YSD  +YKS
Sbjct: 208 KHKCDDSALNYKSELTFGSGSVRNFY----SVANIQKEILTNGPVEAAFDVYSDFVNYKS 263

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
           G Y +   VA  YL         G +A            V+I+GWGEE+G P        
Sbjct: 264 GVYQH---VAGEYL---------GGHA------------VRILGWGEESGVP-------- 291

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
                                     YW + +++ E +GDKG  KI RG NE+  E  + 
Sbjct: 292 --------------------------YWLVANSWNEDWGDKGLFKIRRGNNESGFEDSIV 325

Query: 240 GA 241
            A
Sbjct: 326 AA 327


>gi|171474007|gb|AAX31052.2| SJCHGC09761 protein [Schistosoma japonicum]
          Length = 342

 Score = 97.1 bits (240), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 70/243 (28%), Positives = 103/243 (42%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H         C       P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+     Y V    + IQ++IM +GPV A + +Y D  +YKSG 
Sbjct: 218 KQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 275

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ Y +G Y        ++   V+++GWG EN             
Sbjct: 276 ---------------IYRYTTGKY--------ISGHAVRLIGWGVEN------------- 299

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                G  YW   +T+ E +G+KG  +I+RGRNE +IES +   
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAG 338

Query: 242 LPK 244
           L K
Sbjct: 339 LIK 341


>gi|56756475|gb|AAW26410.1| unknown [Schistosoma japonicum]
          Length = 342

 Score = 96.7 bits (239), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 70/243 (28%), Positives = 103/243 (42%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H         C       P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+     Y V    + IQ++IM +GPV A + +Y D  +YKSG 
Sbjct: 218 KQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 275

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ Y +G Y        ++   V+++GWG EN             
Sbjct: 276 ---------------IYRYTTGKY--------ISGHAVRLIGWGVEN------------- 299

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                G  YW   +T+ E +G+KG  +I+RGRNE +IES +   
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAG 338

Query: 242 LPK 244
           L K
Sbjct: 339 LIK 341


>gi|56758864|gb|AAW27572.1| unknown [Schistosoma japonicum]
          Length = 342

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 70/243 (28%), Positives = 104/243 (42%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H         C       P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +  C    Y   + QDK+     Y V    + IQ++IM +GPV A + +Y D  +YKSG 
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLGIESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 275

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ Y +G Y        ++   V+++GWG EN             
Sbjct: 276 ---------------IYRYTTGKY--------ISGHAVRLIGWGVEN------------- 299

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                G  YW   +T+ E +G+KG  +I+RGRNE +IES +   
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAG 338

Query: 242 LPK 244
           L K
Sbjct: 339 LIK 341


>gi|226474172|emb|CAX71572.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 69/243 (28%), Positives = 104/243 (42%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H         C       P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +  C    Y   + QDK+     Y V    + IQ++IM +GPV A + +Y D  +YKSG 
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 275

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ Y +G        + ++   V+++GWG EN             
Sbjct: 276 ---------------IYRYTTG--------QFISGHAVRLIGWGVEN------------- 299

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                G  YW   +T+ E +G+KG  +I+RGRNE +IES +   
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAG 338

Query: 242 LPK 244
           L K
Sbjct: 339 LIK 341


>gi|56752925|gb|AAW24674.1| unknown [Schistosoma japonicum]
          Length = 342

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 68/237 (28%), Positives = 102/237 (43%), Gaps = 60/237 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G++  +W +  K G+VTGG+  ++TGC+P  FP C+H         C       P+C
Sbjct: 159 CDGGVTGYSWDYWVKHGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+     Y V    + IQ+EIM  GPV A + +Y D  +YKSG 
Sbjct: 218 KQTCQK-GYNTSYEQDKHYGGFSYSVIGVESAIQKEIMMYGPVEAYLQIYEDFLNYKSG- 275

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ Y +G Y        ++   V+++GWG EN             
Sbjct: 276 ---------------IYRYTTGKY--------ISGHAVRLIGWGVEN------------- 299

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                G  YW   +T+ E +G+KG  +I+RGR+E +IES +
Sbjct: 300 ---------------------GTSYWLAANTWNEDWGEKGYFRIVRGRDECLIESFI 335


>gi|226474174|emb|CAX71573.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 69/243 (28%), Positives = 104/243 (42%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H         C       P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +  C    Y   + QDK+     Y V    + IQ++IM +GPV A + +Y D  +YKSG 
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 275

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ Y +G        + ++   V+++GWG EN             
Sbjct: 276 ---------------IYRYTTG--------QFISGHAVRLIGWGVEN------------- 299

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                G  YW   +T+ E +G+KG  +I+RGRNE +IES +   
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAG 338

Query: 242 LPK 244
           L K
Sbjct: 339 LIK 341


>gi|301776581|ref|XP_002923704.1| PREDICTED: cathepsin B-like [Ailuropoda melanoleuca]
 gi|281347694|gb|EFB23278.1| hypothetical protein PANDA_012896 [Ailuropoda melanoleuca]
          Length = 339

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 75/242 (30%), Positives = 104/242 (42%), Gaps = 61/242 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  +  W +  K+GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGFPAEAWNFWTKQGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + +DK+     Y V+    +I  EI KNGPV A   +YSD   YKSG 
Sbjct: 208 SKFC-EPGYTPSYKEDKHYGCSSYSVSSSEKEIMAEIYKNGPVEAAFTVYSDFLLYKSGV 266

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         + E++    V+I+GWG ENG PYW        
Sbjct: 267 YQH------------------------VTGEMMGGHAVRILGWGVENGTPYW-------- 294

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                        L+G             +++   +GD G  KILRGR+   IES +   
Sbjct: 295 -------------LVG-------------NSWNTDWGDNGFFKILRGRDHCGIESEIVAG 328

Query: 242 LP 243
           +P
Sbjct: 329 IP 330


>gi|56758716|gb|AAW27498.1| unknown [Schistosoma japonicum]
          Length = 342

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 68/237 (28%), Positives = 102/237 (43%), Gaps = 60/237 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G++  +W +  K G+VTGG+  ++TGC+P  FP C+H         C       P+C
Sbjct: 159 CDGGVTGYSWDYWVKHGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+     Y V    + IQ+EIM  GPV A + +Y D  +YKSG 
Sbjct: 218 KQTCQK-GYNTSYEQDKHYGGFSYSVIGVESAIQKEIMMYGPVEAYLEIYEDFLNYKSG- 275

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ Y +G Y        ++   V+++GWG EN             
Sbjct: 276 ---------------IYRYTTGKY--------ISGHAVRLIGWGVEN------------- 299

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                G  YW   +T+ E +G+KG  +I+RGR+E +IES +
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRDECLIESFI 335


>gi|389611087|dbj|BAM19154.1| cathepsin B [Papilio polytes]
          Length = 334

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 69/242 (28%), Positives = 99/242 (40%), Gaps = 61/242 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G+ +  W +    GLV+GG+++S  GC+P   PPC H       P   +  T  PKC
Sbjct: 151 CNGGMPTLAWEYWKHFGLVSGGSYNSTQGCRPYEIPPCEHHVPGNRLP--CSGDTKTPKC 208

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C  DNY   + QDK+  K  Y V      I+ E+ KNGPV     +Y+D+ SYKSG 
Sbjct: 209 IKKC-EDNYNVAYKQDKHYGKHIYSVRGGEDHIKAELYKNGPVEGAFTVYADLLSYKSGV 267

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         + + +    +KI+GWG ENG  Y         
Sbjct: 268 YKH------------------------VAGDALGGHAIKIMGWGVENGNKY--------- 294

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                    W I +++   +GD G  KILRG +   IES +   
Sbjct: 295 -------------------------WLIANSWNSDWGDNGFFKILRGEDHCGIESSIVAG 329

Query: 242 LP 243
            P
Sbjct: 330 EP 331


>gi|204022071|dbj|BAG71133.1| cathepsin B-S2 [Tuberaphis coreana]
          Length = 334

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 69/242 (28%), Positives = 103/242 (42%), Gaps = 64/242 (26%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +   +G+ TGG +++  GC P   PPC +      E  C     P  + 
Sbjct: 154 CGGGNPMKAWEYFRTQGVTTGGDYNTKEGCMPYKVPPCRNKQ---GENICD--EQPMERN 208

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           H +C    YG+   Q++Y+ K  Y++N  +  I+Q                DI +Y    
Sbjct: 209 H-QCPKTCYGKTTVQNRYKTKSEYYINS-IKTIEQ----------------DIKTY---- 246

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
              GPV A+   Y D+  YKSG+Y  S +A+     ++KI+GWG+E+G PYW  V  ++ 
Sbjct: 247 ---GPVEASFDCYDDLSVYKSGIYRKSPNAKYKGGHSIKIIGWGQEDGTPYWLAVNSWS- 302

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   +W          GD GT KI++GRNE  IE  V   
Sbjct: 303 -----------------------KFW----------GDHGTFKIIKGRNECGIERAVTAG 329

Query: 242 LP 243
           +P
Sbjct: 330 IP 331


>gi|121073189|gb|ABM47071.1| cathepsin B2 [Clonorchis sinensis]
 gi|358341868|dbj|GAA36574.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 343

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 70/242 (28%), Positives = 92/242 (38%), Gaps = 61/242 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +    G+VTGG+    TGC+P  FP C H +     P C     P PKC
Sbjct: 155 CDGGFPPMAWDFWKTHGIVTGGSKEEPTGCRPYPFPKCQHHS-QGHYPPCPRRIYPTPKC 213

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C  D     + +DK R    Y V+     I +EI+ NGPV A   ++ D   YKSG 
Sbjct: 214 VKHC--DTPKIDYQKDKTRANTSYNVHQSEVAIMKEILLNGPVEATFEVHEDFPEYKSGI 271

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y                          A    V    ++I+GWGEENG PY         
Sbjct: 272 Y------------------------FHAWGGSVGGHAIRILGWGEENGVPY--------- 298

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                    W I +++ E +G+KG ++ LRG NE  IE      
Sbjct: 299 -------------------------WLIANSWNEDWGEKGYLRFLRGHNECGIEEEATAG 333

Query: 242 LP 243
           LP
Sbjct: 334 LP 335


>gi|56756380|gb|AAW26363.1| unknown [Schistosoma japonicum]
          Length = 342

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 69/243 (28%), Positives = 104/243 (42%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H         C       P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +  C    Y   + QDK+     Y V    + IQ++IM +GPV A + +Y D  +YKSG 
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 275

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ Y +G        + ++   V+++GWG EN             
Sbjct: 276 ---------------IYRYTTG--------QFISGHAVRLIGWGVEN------------- 299

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                G  YW   +T+ E +G+KG  +I+RGRNE +IES +   
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAG 338

Query: 242 LPK 244
           L K
Sbjct: 339 LIK 341


>gi|56752787|gb|AAW24605.1| unknown [Schistosoma japonicum]
          Length = 309

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 69/243 (28%), Positives = 104/243 (42%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H         C       P+C
Sbjct: 126 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 184

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +  C    Y   + QDK+     Y V    + IQ++IM +GPV A + +Y D  +YKSG 
Sbjct: 185 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 242

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ Y +G        + ++   V+++GWG EN             
Sbjct: 243 ---------------IYRYTTG--------QFISGHAVRLIGWGVEN------------- 266

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                G  YW   +T+ E +G+KG  +I+RGRNE +IES +   
Sbjct: 267 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAG 305

Query: 242 LPK 244
           L K
Sbjct: 306 LIK 308


>gi|226473758|emb|CAX71564.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 69/243 (28%), Positives = 104/243 (42%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H         C       P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +  C    Y   + QDK+     Y V    + IQ++IM +GPV A + +Y D  +YKSG 
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 275

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ Y +G        + ++   V+++GWG EN             
Sbjct: 276 ---------------IYRYTTG--------QFISGHAVRLIGWGVEN------------- 299

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                G  YW   +T+ E +G+KG  +I+RGRNE +IES +   
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAG 338

Query: 242 LPK 244
           L K
Sbjct: 339 LIK 341


>gi|56759488|gb|AAW27884.1| unknown [Schistosoma japonicum]
          Length = 342

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 70/243 (28%), Positives = 103/243 (42%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H         C       P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+     Y V    + IQ++IM +GPV A + +Y D  +YKSG 
Sbjct: 218 KQTCQK-GYNTSYEQDKHYGGFSYNVLGIESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 275

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ Y +G Y        ++   V+++GWG EN             
Sbjct: 276 ---------------IYRYTTGKY--------ISGHAVRLIGWGVEN------------- 299

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                G  YW   +T+ E +G+KG  +I+RGRNE +IES +   
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAG 338

Query: 242 LPK 244
           L K
Sbjct: 339 LIK 341


>gi|118364222|ref|XP_001015333.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89297100|gb|EAR95088.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 341

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 75/242 (30%), Positives = 99/242 (40%), Gaps = 62/242 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  +S   +  K GLVTG  +++   CQ  SF PC H   T   P C T   P PKC
Sbjct: 160 CNGGYPASAMSYYVKTGLVTGDLYNTTGWCQAYSFAPCAHHVDTPLYPAC-TGELPTPKC 218

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C   + G G     ++  + Y V      I  EI  NGPV A   +Y D  +YKSG 
Sbjct: 219 AKTC---DSGSGQTYTVHKGSKAYSVGKTQEAIMTEIQTNGPVEAAFTVYEDFLNYKSG- 274

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                      +Y  +     G +A            +KIVGWG EN  PY         
Sbjct: 275 -----------VYKHVTGKALGGHA------------IKIVGWGVENNTPY--------- 302

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                    W +V+++ + +GD GT KILRG+NE  IE+ V  A
Sbjct: 303 -------------------------WIVVNSWNQTWGDNGTFKILRGKNECGIEAQVVTA 337

Query: 242 LP 243
           LP
Sbjct: 338 LP 339


>gi|309202|gb|AAA37494.1| mouse preprocathepsin B [Mus musculus]
          Length = 339

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 71/251 (28%), Positives = 107/251 (42%), Gaps = 62/251 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S  W +  K+GLV+GG + S+ GC P + PPC H +   S P C T     P+C
Sbjct: 150 CNGGYPSGAWNFWTKKGLVSGGVYDSHIGCLPYTIPPCEH-HVNGSRPPC-TGEGDTPRC 207

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +  C    Y   + +DK+     Y V++ V +I  EI KNGPV     ++SD  +YKSG 
Sbjct: 208 NKSC-EAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNGPVEGAFTVFSDFLTYKSG- 265

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          ++ +++G        +++    ++I+ WG ENG PYW     + +
Sbjct: 266 ---------------VYKHEAG--------DMMGGHAIRILVWGVENGVPYWLAANSWNL 302

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                              +GD G  KILRG N   IES +   
Sbjct: 303 ----------------------------------DWGDNGFFKILRGENHCGIESEIVAG 328

Query: 242 LPK-DNYGVEF 251
           +P+ D Y   F
Sbjct: 329 IPRTDQYWGRF 339


>gi|74221319|dbj|BAE42140.1| unnamed protein product [Mus musculus]
          Length = 339

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 70/251 (27%), Positives = 107/251 (42%), Gaps = 62/251 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S  W +  K+GLV+GG ++S+ GC P + PPC H +   S P C T     P+C
Sbjct: 150 CNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEH-HVNGSRPPC-TGEGDTPRC 207

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +  C    Y   + +DK+     Y V++ V +I  EI KN PV     ++SD  +YKSG 
Sbjct: 208 NKSC-EAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNDPVEGAFTVFSDFLTYKSG- 265

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          ++ +++G        +++    ++I+GWG  NG PYW     + +
Sbjct: 266 ---------------VYKHEAG--------DMMGGHAIRILGWGVGNGVPYWLAANSWNL 302

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                              +GD G  KILRG N   IES +   
Sbjct: 303 ----------------------------------DWGDNGFFKILRGENHCGIESEIVAG 328

Query: 242 LPK-DNYGVEF 251
           +P+ D Y   F
Sbjct: 329 IPRTDQYWGRF 339


>gi|221107055|ref|XP_002166984.1| PREDICTED: cathepsin B-like [Hydra magnipapillata]
          Length = 330

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 70/237 (29%), Positives = 99/237 (41%), Gaps = 61/237 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G     W +    G VTGG ++S+ GCQP   P C H    + +P C+  + P PKC
Sbjct: 148 CNGGRLGPAWNFFKYAGAVTGGQYNSSEGCQPYEIPSCEHHTSGSKKP-CEG-SEPTPKC 205

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C  + Y   +  DK++   +Y + ++   I+ EI  NGPV A   +YSD  +YKSG 
Sbjct: 206 KRSC-REGYNVSYSDDKHKVSSHYSIANDEEQIKNEIYLNGPVEAAFTVYSDFPNYKSG- 263

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          ++ Y +G          +    +KI+GWG EN  PYW +   +  
Sbjct: 264 ---------------VYKYTTG--------NALGGHAIKILGWGVENNVPYWLVANSW-- 298

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                  P W          GDKG  KILRG NE  IE+ V
Sbjct: 299 ----------------------NPDW----------GDKGFFKILRGSNECGIEASV 323


>gi|14141821|gb|AAK07477.2|AF329480_1 probable cathepsin B-like cysteine proteinase precursor [Glossina
           morsitans morsitans]
 gi|289743431|gb|ADD20463.1| putative cathepsin B-like cysteine proteinase precursor [Glossina
           morsitans morsitans]
          Length = 340

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 64/243 (26%), Positives = 103/243 (42%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   + W +  ++G+V+GG + S+ GC+P    PC H +   + P C+      P+C
Sbjct: 157 CNGGFPGAAWSYWVRKGIVSGGPYGSSQGCRPYEIAPCEH-HVNGTRPPCEKEYGKTPRC 215

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C   +Y   +  DK+   R Y ++  V DIQ+EIM +GPV     +Y          
Sbjct: 216 QHKC-QASYKVDYKTDKHFGSRAYSISKNVHDIQEEIMTHGPVEGAFTVY---------- 264

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                         D+  YK GVY      E+  +A ++I+GWG E   P          
Sbjct: 265 -------------EDLILYKDGVYEHVHGKELGGHA-IRIIGWGVEKDIP---------- 300

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW + +++   +G+ G  KILRG++   IES ++  
Sbjct: 301 ------------------------YWLVANSWNTDWGNNGFFKILRGKDHCGIESSISAG 336

Query: 242 LPK 244
           LPK
Sbjct: 337 LPK 339


>gi|330434688|gb|AEC22812.1| cathepsin B [Macrobrachium nipponense]
          Length = 331

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 69/244 (28%), Positives = 104/244 (42%), Gaps = 63/244 (25%)

Query: 2   CSSGISSSTW-VWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 60
           C+ G   + +  WVH  G+V+GGA +S  GCQP    PC H + +   P+C    +  PK
Sbjct: 148 CNGGFPGAAFQYWVHS-GIVSGGAFNSTQGCQPYEIAPCEH-HVSGPRPKCAEGGS-TPK 204

Query: 61  CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           CH  C + NY   +  D +   ++Y V+ +   I+ +IM NGPV     +Y         
Sbjct: 205 CHKNCES-NYVVDYESDLHHGSKHYSVDKDETQIKYDIMTNGPVEGAFTVY--------- 254

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
                          D   YKSGVY  +    +  +A ++++GWGEE+G P         
Sbjct: 255 --------------VDFLHYKSGVYQHTHGLPLGGHA-IRVLGWGEEDGTP--------- 290

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                                    YW   +++   +GD G  KILRG +   IES ++ 
Sbjct: 291 -------------------------YWLCANSWNTDWGDNGYFKILRGSDHCGIESEISA 325

Query: 241 ALPK 244
            LPK
Sbjct: 326 GLPK 329


>gi|194246069|gb|ACF35526.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
           variabilis]
          Length = 277

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 71/244 (29%), Positives = 102/244 (41%), Gaps = 63/244 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G  S+ W +    G+VTGG + ++ GCQP  FPPC H +     P C T   P PKC
Sbjct: 94  CFGGYPSAAWDYYKDEGIVTGGLYGTDDGCQPYYFPPCEH-HTKGPLPNC-TDTKPTPKC 151

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y + + +DKY  K  Y ++ +   I+ EI                       
Sbjct: 152 LQVCRK-GYEKSYSEDKYFAKTVYSLHSDETQIKTEI----------------------- 187

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y NGPV A+  +Y+D  +YKSGVY    S E+          W   +    W + R    
Sbjct: 188 YKNGPVEADFSVYTDFLAYKSGVYQ-RHSYEL----------WEARHQNLGWALKR---- 232

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                 R  W + +++ + +GDKG  KI RG NE  IE+ +N  
Sbjct: 233 ----------------------RSVWLVANSWNQDWGDKGYFKIRRGNNECGIENDINAG 270

Query: 242 LPKD 245
           +PK+
Sbjct: 271 IPKE 274


>gi|338815385|gb|AEJ08755.1| cathepsin B [Crassostrea ariakensis]
          Length = 341

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 74/243 (30%), Positives = 102/243 (41%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G  S+ W +  + GLVTGG ++S+ GCQP +   C+H      +P C     P PKC
Sbjct: 158 CEGGFPSAAWSYYKRDGLVTGGQYNSHQGCQPYTIKACDHHVVGKLQP-CSKDIGPTPKC 216

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + +DK+     Y V+  V  I  EIM NGPV               G 
Sbjct: 217 KHTC-EAGYNVTYEKDKHYGMSAYSVHG-VEKIMTEIMTNGPV--------------EGA 260

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           +          +Y+D   YKSGVY  +    +  +A +KI+GWG ENG  YW +   +  
Sbjct: 261 F---------TVYADFPQYKSGVYKHTTGQPLGGHA-IKILGWGTENGDDYWLVANSW-- 308

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                  P W          GD+G  KILRG++E  IES ++  
Sbjct: 309 ----------------------NPDW----------GDQGFFKILRGQDECGIESQISAG 336

Query: 242 LPK 244
            PK
Sbjct: 337 EPK 339


>gi|193716207|ref|XP_001950562.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
          Length = 340

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 75/249 (30%), Positives = 105/249 (42%), Gaps = 69/249 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G     W   + RGLVTGG + S  GC+P   PPC +     +E        P+ K 
Sbjct: 157 CNGGYPIKAWESFNNRGLVTGGDYQSGEGCEPYRVPPCPY----DAEGHNTCAGKPREKN 212

Query: 62  HTRCTNDNYGRGF--FQDKYRFKR--YYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
           H RCT   YG     + D +RF R  YY      + IQ+++M+ GP+ A+  +Y D    
Sbjct: 213 H-RCTRTCYGNQDLDYNDDHRFTRDSYYLT---YSSIQKDVMRYGPIEASFDMYDDF--- 265

Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
                   P            SYKSGVY  S +A  +    VK++GWGEE+G  Y     
Sbjct: 266 --------P------------SYKSGVYVRSENASYLGGHAVKLIGWGEEHGVLY----- 300

Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
                                        W +V+++ E +GD G  KI RG NE  I++ 
Sbjct: 301 -----------------------------WLMVNSWNEGWGDNGLFKIRRGTNECGIDNS 331

Query: 238 VNGALPKDN 246
             G +P  N
Sbjct: 332 TTGGVPVAN 340


>gi|204022083|dbj|BAG71139.1| cathepsin B-S [Astegopteryx styracophila]
          Length = 335

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 68/243 (27%), Positives = 104/243 (42%), Gaps = 66/243 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +  +RG+ TGG + SN GC P   PPC        + + + L   +P  
Sbjct: 155 CQGGNPIKAWKYFKRRGITTGGDYGSNEGCAPYKVPPC-------YDDQGEFLCQGKPTE 207

Query: 62  HT-RCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           H  +C    YG    +++Y+ +  Y V D    I+Q                DI +Y   
Sbjct: 208 HNHKCPRACYGNSTVENRYKVESIY-VLDSFKTIEQ----------------DIRTY--- 247

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
               GPV A+  +Y D  +YKSG+Y  + +A  V   +VK++GWGEE+G PYW +V  ++
Sbjct: 248 ----GPVEASFDVYDDFITYKSGIYQKTPNALYVGGHSVKLIGWGEEDGIPYWLLVNSWS 303

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                                    +W          G++GT +I++GRNE  IE     
Sbjct: 304 ------------------------KFW----------GEQGTFRIIKGRNECGIERSATA 329

Query: 241 ALP 243
            +P
Sbjct: 330 GIP 332


>gi|170028916|ref|XP_001842340.1| cathepsin B [Culex quinquefasciatus]
 gi|167879390|gb|EDS42773.1| cathepsin B [Culex quinquefasciatus]
          Length = 339

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 71/232 (30%), Positives = 98/232 (42%), Gaps = 66/232 (28%)

Query: 13  WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGR 72
           WV   GLV+G  ++S+ GC+P  F PC++  +     E K      PKC   C N  Y R
Sbjct: 173 WV-DAGLVSGAPYNSSEGCKPYPFEPCSYP-FVGCHHEKK-----NPKCLHHCIN-GYDR 224

Query: 73  GFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMY 132
            + +DK+     Y + ++   IQ EIM NGPV     ++ D + Y SG            
Sbjct: 225 KYRKDKFFGATAYKIPNDARMIQLEIMTNGPVATGFEVFEDFYFYHSG------------ 272

Query: 133 LYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYAT 192
           +Y  +   K G++A            ++IVGWG ENG PYW I   Y             
Sbjct: 273 VYKHVVGKKVGMHA------------IRIVGWGTENGTPYWLIANSY------------- 307

Query: 193 VKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
                                G+ +GDKG  K+LRG N   IES V   LP+
Sbjct: 308 ---------------------GDTWGDKGFFKMLRGSNHLGIESTVIAGLPQ 338


>gi|126116630|gb|ABN79675.1| cathepsin B3 [Clonorchis sinensis]
          Length = 337

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 71/243 (29%), Positives = 96/243 (39%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   + W +    G+VTGG+  + TGC+   FP C+H   +   P C       P C
Sbjct: 149 CQGGFPPTAWDFWQTEGIVTGGSKENPTGCRSYPFPRCSHHG-SKKYPPCSHRIYDTPNC 207

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C  D     +  DK R    Y V  +   I +EIM NGPV A   +Y D   YKSG 
Sbjct: 208 VQKC--DTPDTDYATDKTRANITYNVKAKQNAIMKEIMINGPVEAAFQVYEDFLGYKSG- 264

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                    +Y +SD                ++    ++I+GWGEENG  YW I   +  
Sbjct: 265 ---------VYFHSD--------------GTLLGGHAIRILGWGEENGVAYWLIANSWN- 300

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                          GWGE+                   G  K+LRG+NE  IE  V   
Sbjct: 301 --------------DGWGED-------------------GYFKMLRGKNECGIEDEVTAG 327

Query: 242 LPK 244
           LP+
Sbjct: 328 LPE 330


>gi|226474168|emb|CAX71570.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 68/243 (27%), Positives = 103/243 (42%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H         C       P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +  C    Y   + QDK+     Y V    + IQ++IM +GP  A + +Y D  +YKSG 
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPAEAYLEIYEDFLNYKSG- 275

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ Y +G        + ++   V+++GWG EN             
Sbjct: 276 ---------------IYRYTTG--------QFISGHAVRLIGWGVEN------------- 299

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                G  YW   +T+ E +G+KG  +I+RGRNE +IES +   
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAG 338

Query: 242 LPK 244
           L K
Sbjct: 339 LIK 341


>gi|27882093|gb|AAH44517.1| Zgc:55862 [Danio rerio]
          Length = 330

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 69/242 (28%), Positives = 95/242 (39%), Gaps = 60/242 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S+ W +    GLVTGG ++S+ GC+P +  PC H +   S P C       P C
Sbjct: 148 CNGGYPSAAWDFWTTDGLVTGGLYNSHIGCRPYTIEPCEH-HVNGSRPPCTGEGGDTPNC 206

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C    Y   + +DK+  K  Y V      I  E+ KNGPV A   +Y D   YKSG 
Sbjct: 207 DMKC-EPGYSPLYKEDKHFGKTSYSVPSNQNGIMAELFKNGPVEAAFTVYEDFLLYKSGV 265

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         S   +    +KI+GWGEENG P          
Sbjct: 266 YQH------------------------MSGSALGGHAIKILGWGEENGVP---------- 291

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW   +++   +GD G  KILRG +   IES +   
Sbjct: 292 ------------------------YWLAANSWNTDWGDNGYFKILRGEDHCGIESEIVAG 327

Query: 242 LP 243
           +P
Sbjct: 328 IP 329


>gi|167541036|gb|ABZ82028.1| cathepsin B endopeptidase [Clonorchis sinensis]
          Length = 228

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 71/243 (29%), Positives = 96/243 (39%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   + W +    G+VTGG+  + TGC+   FP C+H   +   P C       P C
Sbjct: 40  CQGGFPPTAWDFWQTEGIVTGGSKENPTGCRSYPFPRCSHHG-SKKYPPCSHRIYDTPNC 98

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C  D     +  DK R    Y V  +   I +EIM NGPV A   +Y D   YKSG 
Sbjct: 99  VQKC--DTPDTDYATDKTRANITYNVKAKQNAIMKEIMINGPVEAAFQVYEDFLGYKSG- 155

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                    +Y +SD                ++    ++I+GWGEENG  YW I   +  
Sbjct: 156 ---------VYFHSD--------------GTLLGGHAIRILGWGEENGVAYWLIANSWN- 191

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                          GWGE+                   G  K+LRG+NE  IE  V   
Sbjct: 192 --------------DGWGED-------------------GYFKMLRGKNECGIEDEVTAG 218

Query: 242 LPK 244
           LP+
Sbjct: 219 LPE 221


>gi|29374023|gb|AAO73002.1| cathepsin B [Fasciola gigantica]
          Length = 335

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 72/242 (29%), Positives = 94/242 (38%), Gaps = 61/242 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G  S  W +  + G+V+GG   + TGC P  FP C+H   T     C       PKC
Sbjct: 155 CEGGYPSMAWDYWWRHGIVSGGTLENPTGCLPYPFPKCSHLEETPGLAPCPRELYATPKC 214

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C    Y +   +DK + K  Y V D   DI  EI+ NGPV    Y++ D   YKSG 
Sbjct: 215 EKQC-QAGYSKTSEEDKIKGKSSYNVGDRETDIMMEIITNGPVSTIYYIFEDFTVYKSG- 272

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ Y SG                 I+GWG ENG            
Sbjct: 273 ---------------IYQYTSGSLMGGHG----------IIGWGVENG------------ 295

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                      VK           YW   +++ E +G+ G  +I RG NE  IES +N  
Sbjct: 296 -----------VK-----------YWLAANSWNEGWGENGYFRIRRGTNECGIESRINAG 333

Query: 242 LP 243
           LP
Sbjct: 334 LP 335


>gi|384597848|gb|AFI23675.1| cathepsin B, partial [Brugia malayi]
          Length = 319

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 68/232 (29%), Positives = 103/232 (44%), Gaps = 61/232 (26%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPK 60
           C  G   + W +    G+VTG  + +++GC+P  FPPC +H+N T  EP CK    P PK
Sbjct: 146 CFGGEPMAAWKYWVLSGIVTGSDYTNHSGCRPYPFPPCEHHSNKTHYEP-CKHDLYPTPK 204

Query: 61  CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           C+ +C + NY + +  DKY  ++ Y V ++V  IQ+EIM  GPV A+  +Y+        
Sbjct: 205 CYKQC-DKNYTKSYKADKYYGEQAYNVENDVESIQKEIMTLGPVEASFEVYT-------- 255

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
                          D   Y SG+Y   A +          VG G               
Sbjct: 256 ---------------DFLHYTSGIYKHVAGS----------VGGGH-------------- 276

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 232
                       VK++GWG + G  YW   +++   +G+ G  +ILRG +E 
Sbjct: 277 -----------AVKILGWGIDQGVSYWLAANSWNNDWGEDGYFRILRGADEC 317


>gi|401758196|gb|AFQ01133.1| cathepsin B [Chilo suppressalis]
          Length = 350

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 69/243 (28%), Positives = 102/243 (41%), Gaps = 78/243 (32%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSE--------PECKT 53
           C  G+ +  W++  K G+V+GG + S  GCQP + PPCNH  +   E        P+CK 
Sbjct: 153 CEGGVLTRAWIYYKKIGIVSGGGYKSKQGCQPYTIPPCNHLVWGEIEQCKNIPMTPKCKN 212

Query: 54  L-ATPQ--------PKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPV 104
           +   P+        P+C  +C N NY   + +DK+R K  Y V       + EI K    
Sbjct: 213 IPVIPEQCKYIPITPECEKKC-NKNYKVCYSKDKHRGKSVYRVK------KSEIFK---- 261

Query: 105 VANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGW 164
                   +I+ Y       GPV +   +Y D  +YK G+Y  +                
Sbjct: 262 --------EIYEY-------GPVTSYFTVYEDFLNYKEGIYNYT---------------- 290

Query: 165 GEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIK 224
                              S + +   +VK+IGWGEE G  YW   ++F   +GDKG  K
Sbjct: 291 -------------------SGQKLGLHSVKIIGWGEERGIKYWLAANSFNTDWGDKGFFK 331

Query: 225 ILR 227
           I+R
Sbjct: 332 IIR 334


>gi|126303983|ref|XP_001381634.1| PREDICTED: cathepsin B-like [Monodelphis domestica]
          Length = 337

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 71/247 (28%), Positives = 101/247 (40%), Gaps = 61/247 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  +  W +  K+GLV+GG + S+ GC+P S PPC H +   S P C       P C
Sbjct: 151 CNGGFPAGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPACTGEEGDTPTC 209

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C  + Y   +  DK      Y V     +I  EI KNGPV     +Y D   YKSG 
Sbjct: 210 RKKC-EEGYSTQYKDDKNYGSTSYSVPSSEQEIMAEIYKNGPVEGAFSVYEDFLHYKSGV 268

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         + E++    ++I+GWG ENG  YW       +
Sbjct: 269 YQH------------------------VAGEMLGGHAIRILGWGVENGIRYW-------L 297

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
           +A++                     W I       +GD G  K LRG+N   IES +   
Sbjct: 298 AANS---------------------WNI------DWGDNGFFKFLRGKNHCGIESEIIAG 330

Query: 242 LPK-DNY 247
           +P+ D Y
Sbjct: 331 IPRTDQY 337


>gi|308390275|gb|ADO32581.1| cathepsin B [Marsupenaeus japonicus]
          Length = 332

 Score = 94.7 bits (234), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 70/246 (28%), Positives = 102/246 (41%), Gaps = 63/246 (25%)

Query: 2   CSSGISSSTW-VWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 60
           C+ G   + +  WVH  G+V+GG+ +S  GCQP    PC H +     P+C       PK
Sbjct: 149 CNGGFPGAAFKYWVHS-GIVSGGSFNSTQGCQPYEIAPCEH-HVPGPRPKCSE-GGGTPK 205

Query: 61  CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           C  RC N  Y   +  D +   + Y +  +   I+ EIMKNGPV     +Y D   YKSG
Sbjct: 206 CVKRCEN-GYTVDYESDLHHGGKAYSIMKDEDQIKYEIMKNGPVEGAFTVYVDFLHYKSG 264

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
                           ++ ++ G+         +    ++I+GWGEENG P         
Sbjct: 265 ----------------VYQHRHGL--------PLGGHAIRILGWGEENGTP--------- 291

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                                    YW   +++   +GD G  KILRG +   IES ++ 
Sbjct: 292 -------------------------YWLCANSWNTDWGDNGLFKILRGSDHCGIESEISA 326

Query: 241 ALPKDN 246
            LPK N
Sbjct: 327 GLPKLN 332


>gi|187105116|ref|NP_001119618.1| cathepsin B-84 precursor [Acyrthosiphon pisum]
 gi|161343843|tpg|DAA06102.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 335

 Score = 94.7 bits (234), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 67/247 (27%), Positives = 103/247 (41%), Gaps = 71/247 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP-- 59
           C+ G     W +  + G+VTGG + +  GCQP   PPC        + E     + QP  
Sbjct: 153 CNGGYPLKAWQYFKRHGVVTGGDYDTTDGCQPYRVPPC------VKDDEGHNSCSGQPTE 206

Query: 60  ---KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFS 116
              KC  +C  D+    + ++ Y+ K  Y+            +KN  +  +  +Y     
Sbjct: 207 RNHKCSKKCYGDD-TIDYKKNHYKTKDAYY------------LKNTTMQKDTMVY----- 248

Query: 117 YKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
                   GP+ A+  +Y D  +Y+SGVY  + +A  +    VK++GWG E G PY    
Sbjct: 249 --------GPIEASFDVYDDFMNYESGVYQRTGNASYLGGHAVKMIGWGVEEGTPY---- 296

Query: 177 RVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
                                         W +V+++GEQ+GDKG  KILRG +E  IES
Sbjct: 297 ------------------------------WLMVNSWGEQWGDKGMFKILRGTDECGIES 326

Query: 237 LVNGALP 243
                +P
Sbjct: 327 SCTAGVP 333


>gi|204022085|dbj|BAG71140.1| cathepsin B-S [Astegopteryx spinocephala]
          Length = 335

 Score = 94.7 bits (234), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 67/243 (27%), Positives = 102/243 (41%), Gaps = 66/243 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +  + G+ TGG + SN GC P   PPC        + + + L   +P  
Sbjct: 155 CQGGNPIKAWKYFKRHGITTGGDYGSNEGCAPYKVPPC-------YDDQGEFLCQGKPTE 207

Query: 62  HT-RCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           H  +C    YG    +++Y+ K  Y V D    I+Q+I K GPV A+  +Y D       
Sbjct: 208 HNHKCPRACYGNSTVENRYKVKSIY-VLDSSKTIEQDIRKYGPVEASFDVYDDF------ 260

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
                             +YKSG+Y  + +A  V   +VK++GWGEE+G PYW +V  ++
Sbjct: 261 -----------------ITYKSGIYQKTPNAFYVGGHSVKLIGWGEEDGIPYWLLVNSWS 303

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                                    +W          G++GT +I++GRNE  IE     
Sbjct: 304 ------------------------KFW----------GEQGTFRIIKGRNECGIERSATA 329

Query: 241 ALP 243
            +P
Sbjct: 330 GVP 332


>gi|28971815|dbj|BAC65419.1| cathepsin B [Pandalus borealis]
          Length = 328

 Score = 94.7 bits (234), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 67/229 (29%), Positives = 98/229 (42%), Gaps = 62/229 (27%)

Query: 13  WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGR 72
           WV K G V+GG H+SN GCQP S   C H +     P C+    P+  C   C ++ YG+
Sbjct: 157 WVTK-GFVSGGRHNSNEGCQPYSVEECEH-HIEGPRPPCEG-DMPELVCSETC-HEEYGK 212

Query: 73  GFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMY 132
            + +D       Y +  +V  IQ+EIM NGPV A   +Y D  SYKSG            
Sbjct: 213 TYEEDLEYGLEAYVLPQDVTQIQEEIMTNGPVTAAFAVYDDFLSYKSG------------ 260

Query: 133 LYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYAT 192
               ++ +++G+        +  Y  V+++GWGEE G P                     
Sbjct: 261 ----VYQHETGL--------LDGYHAVRVIGWGEEEGTP--------------------- 287

Query: 193 VKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                        YW + +++   +GD G  KILRG +E   E  +  A
Sbjct: 288 -------------YWLVANSWNTDWGDNGLFKILRGSDECEFEGDMAAA 323


>gi|405971658|gb|EKC36483.1| Cathepsin B [Crassostrea gigas]
          Length = 341

 Score = 94.7 bits (234), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 74/243 (30%), Positives = 103/243 (42%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G  S+ W +  K GLVTGG ++S+ GC P +   C+H      +P  K++  P PKC
Sbjct: 158 CEGGFPSAAWSYYKKDGLVTGGQYNSHQGCLPYTIKACDHHVVGKLQPCSKSIG-PTPKC 216

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + +DK+     Y V+  V  I  EIM NGPV               G 
Sbjct: 217 KHTC-EAGYNVTYEKDKHYGSSAYSVHG-VEKIMTEIMTNGPV--------------EGA 260

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           +          +Y+D   YKSGVY  +    +  +A +KI+GWG ENG  YW +   +  
Sbjct: 261 F---------TVYADFPQYKSGVYKHTTGQPLGGHA-IKILGWGTENGDDYWLVANSW-- 308

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                  P W          GD+G  KILRG++E  IES ++  
Sbjct: 309 ----------------------NPDW----------GDQGFFKILRGQDECGIESQISAG 336

Query: 242 LPK 244
            PK
Sbjct: 337 EPK 339


>gi|282400164|ref|NP_001164205.1| cathepsin B precursor [Tribolium castaneum]
 gi|270004839|gb|EFA01287.1| cathepsin B precursor [Tribolium castaneum]
          Length = 335

 Score = 94.7 bits (234), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 69/243 (28%), Positives = 98/243 (40%), Gaps = 62/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G+ +  W+     G+VTGG +    GC+  SF PC H +     P C     P P C
Sbjct: 154 CNGGMPAMAWLHWTVNGIVTGGNYEDTNGCKAYSFAPCEH-HVDGDLPPCGP-TKPTPDC 211

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C  D+     +Q+       Y ++     IQ EIM NGPV A+  +Y D  SYKSG 
Sbjct: 212 KKEC--DSGSSLTYQNDLTHGSNYGIDPYPKQIQTEIMTNGPVEASFSVYEDFLSYKSG- 268

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          ++ +  G YA   +        +KI+GWG EN  P          
Sbjct: 269 ---------------VYQHLEGEYAGGHA--------IKILGWGVENDTP---------- 295

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW + +++ E +GDKG  KILRG NE  IE  +   
Sbjct: 296 ------------------------YWLVANSWNEDWGDKGYFKILRGSNECGIEGSIVAG 331

Query: 242 LPK 244
           +P+
Sbjct: 332 IPE 334


>gi|239788404|dbj|BAH70886.1| ACYPI000014 [Acyrthosiphon pisum]
          Length = 335

 Score = 94.7 bits (234), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 67/247 (27%), Positives = 103/247 (41%), Gaps = 71/247 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP-- 59
           C+ G     W +  + G+VTGG + +  GCQP   PPC        + E     + QP  
Sbjct: 153 CNGGYPLKAWQYFKRHGVVTGGDYDTTDGCQPYRVPPC------VKDDEGHNSCSGQPTE 206

Query: 60  ---KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFS 116
              KC  +C  D+    + ++ Y+ K  Y+            +KN  +  +  +Y     
Sbjct: 207 RNHKCSKKCYGDD-TIDYKKNHYKTKDAYY------------LKNTTMQKDTMVY----- 248

Query: 117 YKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
                   GP+ A+  +Y D  +Y+SGVY  + +A  +    VK++GWG E G PY    
Sbjct: 249 --------GPIEASFDVYDDFMNYESGVYQRTGNASYLGGHAVKMIGWGVEEGTPY---- 296

Query: 177 RVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
                                         W +V+++GEQ+GDKG  KILRG +E  IES
Sbjct: 297 ------------------------------WLMVNSWGEQWGDKGMFKILRGTDECGIES 326

Query: 237 LVNGALP 243
                +P
Sbjct: 327 SCTAGVP 333


>gi|449267314|gb|EMC78276.1| Cathepsin B [Columba livia]
          Length = 340

 Score = 94.7 bits (234), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 66/242 (27%), Positives = 95/242 (39%), Gaps = 60/242 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C       P+C
Sbjct: 150 CNGGYPSGAWRYWTEKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPCTGEGGETPRC 208

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + +DK+     Y V     +I  EI KNGPV     +Y D   YKSG 
Sbjct: 209 SRHC-EPGYSPSYKEDKHYGITSYGVPRSEKEIMAEIYKNGPVEGAFIVYEDFLMYKSGV 267

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         + E V    ++++GWG +NG P          
Sbjct: 268 YQH------------------------VTGEQVGGHAIRLLGWGVDNGTP---------- 293

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW   +++   +GD G  KILRG +   IES +   
Sbjct: 294 ------------------------YWLAANSWNTDWGDNGFFKILRGEDHCGIESEIVAG 329

Query: 242 LP 243
           +P
Sbjct: 330 IP 331


>gi|402594312|gb|EJW88238.1| cathepsin B5 [Wuchereria bancrofti]
          Length = 407

 Score = 94.7 bits (234), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 71/243 (29%), Positives = 103/243 (42%), Gaps = 56/243 (23%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   + W +    G+VTG  + +++GC+P  FPPC H N  T    CK    P PKC
Sbjct: 205 CFGGEPMAAWKYWVLSGIVTGSDYTNHSGCRPYPFPPCEHHNNKTHYEPCKHDLYPTPKC 264

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C + NY + +  DKY  ++ Y V ++V  IQ+EIM  GPV A+  +Y+D   Y  G 
Sbjct: 265 DRQC-DKNYKKPYKADKYYGEQAYNVENDVELIQKEIMTLGPVEASFEVYTDFLHYIGG- 322

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                      +Y  +     G +A            VKI+GWG + G  YW     +  
Sbjct: 323 -----------IYKHVAGSVGGGHA------------VKILGWGIDQGVSYWLAANSWNT 359

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                           WGE+           F       G  +ILRG +E  IES +   
Sbjct: 360 D---------------WGED----------VFS------GYFRILRGVDECGIESGIVAG 388

Query: 242 LPK 244
           +P+
Sbjct: 389 IPR 391


>gi|349956183|dbj|GAA30948.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 337

 Score = 94.7 bits (234), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 71/243 (29%), Positives = 95/243 (39%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +    G+VTGG+  + TGC+   FP C+H   +   P C       P C
Sbjct: 149 CQGGFPPIAWDFWQTEGIVTGGSKENPTGCRSYPFPRCSHHG-SKKYPPCSHRIYDTPNC 207

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C  D     +  DK R    Y V  +   I +EIM NGPV A   +Y D   YKSG 
Sbjct: 208 VQKC--DTPDTDYATDKTRANITYNVKAKQNAIMKEIMINGPVEAAFQVYEDFLGYKSG- 264

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                    +Y +SD                ++    ++I+GWGEENG  YW I   +  
Sbjct: 265 ---------VYFHSD--------------GTLLGGHAIRILGWGEENGVAYWLIANSWND 301

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                          GWGE+                   G  K+LRG+NE  IE  V   
Sbjct: 302 ---------------GWGED-------------------GCFKMLRGKNECGIEDEVTAG 327

Query: 242 LPK 244
           LP+
Sbjct: 328 LPE 330


>gi|226474176|emb|CAX71574.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 94.7 bits (234), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 69/243 (28%), Positives = 103/243 (42%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H         C       P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +  C    Y   + QDK+     Y V    + IQ++IM +GPV A + +Y D  +YKSG 
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 275

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ Y +G Y        ++   V+++GWG EN             
Sbjct: 276 ---------------IYRYTTGKY--------ISGHAVRLIGWGVEN------------- 299

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                G  YW   +T+ E +G+KG  +I+RGRNE  I+S +   
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECSIDSEIAAG 338

Query: 242 LPK 244
           L K
Sbjct: 339 LIK 341


>gi|167508668|gb|ABZ81540.1| cathepsin B-like cysteine protease [Caenorhabditis brenneri]
          Length = 193

 Score = 94.4 bits (233), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 71/226 (31%), Positives = 95/226 (42%), Gaps = 64/226 (28%)

Query: 13  WVHKRGLVTGGAHHSNTGCQPVSFPPCN--HANYTTSEPECKTLATPQPKCHTRCT-NDN 69
           W    GL TGG +    GC+P +  PC+  + N TTS P C    TP   C  RCT N  
Sbjct: 29  WWQTHGLCTGGNYDDQFGCKPYTIYPCDKTYPNGTTSVP-CPGYHTPV--CEERCTSNIT 85

Query: 70  YGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVA 129
           +   + Q K+  K +Y V  ++ DIQ EIM+NGPV+A+  +Y D + YKSG Y       
Sbjct: 86  WPISYKQVKHFGKAHYNVGKKMTDIQTEIMRNGPVIASFIIYDDFWDYKSGIY------- 138

Query: 130 NMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVA 189
                            V  + +       KI+GWG +NG PYW  V             
Sbjct: 139 -----------------VHTAGDQEGGMDTKIIGWGVDNGVPYWLCVH------------ 169

Query: 190 YATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
                                  +G  FG+ G ++ILRG NE  IE
Sbjct: 170 ----------------------QWGTDFGENGFMRILRGVNEVHIE 193


>gi|3929733|emb|CAA77178.1| cathepsin B [Homo sapiens]
          Length = 195

 Score = 94.4 bits (233), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 59/176 (33%), Positives = 82/176 (46%), Gaps = 27/176 (15%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 44  CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 101

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+     Y V++   DI  EI KNGPV     +YSD   YKSG 
Sbjct: 102 SKIC-EPGYSPTYKQDKHYGYDSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGV 160

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
           Y +                         + E++    ++I+GWG ENG PYW +  
Sbjct: 161 YQH------------------------VTGEMMGGHAIRILGWGVENGTPYWLVAN 192


>gi|195130519|ref|XP_002009699.1| GI15503 [Drosophila mojavensis]
 gi|193908149|gb|EDW07016.1| GI15503 [Drosophila mojavensis]
          Length = 342

 Score = 94.4 bits (233), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 69/243 (28%), Positives = 106/243 (43%), Gaps = 62/243 (25%)

Query: 2   CSSGISSSTWV-WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 60
           C+ G   + W  W HK G+V+GG+++SN GC+P    PC H +   + P CK   TP   
Sbjct: 159 CNGGFPGAAWSYWTHK-GIVSGGSYNSNEGCRPYEIEPCEH-HVNGTRPPCKNGRTPS-- 214

Query: 61  CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           C  +C + +Y   + +DK+   + Y +     +IQ+EIM NGPV     +Y         
Sbjct: 215 CKHQCES-SYSVDYAKDKHFGSKSYSIRRNPREIQREIMTNGPVEGAFTVY--------- 264

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
                          D+  YKSGVY      E+  +A ++I+GWG               
Sbjct: 265 --------------EDLILYKSGVYKHVHGKELGGHA-IRILGWGV-------------- 295

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                            WG+    PYW I +++   +GD G  +I+RG +   IES ++ 
Sbjct: 296 -----------------WGDSK-VPYWLIGNSWNTDWGDNGFFRIVRGEDHCGIESAISA 337

Query: 241 ALP 243
            LP
Sbjct: 338 GLP 340


>gi|226474180|emb|CAX71576.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 94.4 bits (233), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 69/243 (28%), Positives = 102/243 (41%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H         C       P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYETPQC 217

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+     Y V    + IQ++IM +GPV A + +Y D  +YKSG 
Sbjct: 218 KQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 275

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ Y +G        + ++   V+++GWG EN             
Sbjct: 276 ---------------IYRYTTG--------QFISGHAVRLIGWGVEN------------- 299

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                G  YW   +T+ E +G+KG  +I+RGRNE  IES +   
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAG 338

Query: 242 LPK 244
           L K
Sbjct: 339 LIK 341


>gi|56756410|gb|AAW26378.1| unknown [Schistosoma japonicum]
          Length = 342

 Score = 94.4 bits (233), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 69/243 (28%), Positives = 102/243 (41%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H         C       P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+     Y V    + IQ++IM +GPV A + +Y D  +YKSG 
Sbjct: 218 KQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 275

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ Y +G        + ++   V+++GWG EN             
Sbjct: 276 ---------------IYRYTTG--------QFISGHAVRLIGWGVEN------------- 299

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                G  YW   +T+ E +G+KG  +I+RGRNE  IES +   
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAG 338

Query: 242 LPK 244
           L K
Sbjct: 339 LIK 341


>gi|48762485|dbj|BAD23812.1| cathepsin B-N1 [Tuberaphis styraci]
          Length = 340

 Score = 94.4 bits (233), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 75/245 (30%), Positives = 102/245 (41%), Gaps = 67/245 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           CS G     W    K GLVTGG + S  GCQP   PPC    Y  +   C+    P  K 
Sbjct: 157 CSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPCPLDEYGNNT--CR--GKPAEKN 212

Query: 62  HTRCTNDNYGR---GFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
           H RCT   YG     F +D +  +  Y++      IQ                +DI +Y 
Sbjct: 213 H-RCTRMCYGNQDLDFKEDHHYTRDAYYLT--YGTIQ----------------NDILAY- 252

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
                 GP+ A+  +Y D  SYKSGVY    +A  +    VK++GWGEE G PY      
Sbjct: 253 ------GPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWGEEYGVPY------ 300

Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                       W +V+++ +Q+GD+G  KI RG NE  I++  
Sbjct: 301 ----------------------------WLLVNSWNDQWGDQGLFKIRRGTNECGIDNST 332

Query: 239 NGALP 243
            G +P
Sbjct: 333 TGGVP 337


>gi|157058747|gb|ABV03131.1| cathepsin B-2744 [Myzus persicae]
          Length = 261

 Score = 94.4 bits (233), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 61/177 (34%), Positives = 80/177 (45%), Gaps = 32/177 (18%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK- 60
           C  G +   W +   +G+VTGG + SN GCQP    PC+H    +S   C +L   Q   
Sbjct: 98  CDGGSAYKAWEFTMGKGIVTGGPYDSNEGCQPYKNRPCDHYG-DSSLTNCSSLRRTQMMF 156

Query: 61  CHTRCTNDNYGRGFFQDKYRFKRYY---WVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
           C  +C N NY   +  D Y+    Y   W N  V  IQQEIM  GPV A MY+Y +   Y
Sbjct: 157 CRDKCVNKNYKVKYEDDLYKTSVVYMTSWTN--VKQIQQEIMTYGPVTAFMYVYENFMGY 214

Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWG-EENGRPYW 173
           K G Y                         S + E++ Y  VK++GWG +E G  YW
Sbjct: 215 KEGVYK------------------------STAGELIGYHHVKLIGWGVDEAGIEYW 247


>gi|48762493|dbj|BAD23816.1| cathepsin B-N1 [Tuberaphis coreana]
          Length = 340

 Score = 94.0 bits (232), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 75/245 (30%), Positives = 102/245 (41%), Gaps = 67/245 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           CS G     W    K GLVTGG + S  GCQP   PPC    Y  +   C+    P  K 
Sbjct: 157 CSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPCPLDEYGNNT--CR--GKPAEKN 212

Query: 62  HTRCTNDNYGR---GFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
           H RCT   YG     F +D +  +  Y++      IQ                +DI +Y 
Sbjct: 213 H-RCTRMCYGNQDLDFKEDHHYTRDAYYLT--YGTIQ----------------NDILAY- 252

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
                 GP+ A+  +Y D  SYKSGVY    +A  +    VK++GWGEE G PY      
Sbjct: 253 ------GPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWGEEYGVPY------ 300

Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                       W +V+++ +Q+GD+G  KI RG NE  I++  
Sbjct: 301 ----------------------------WLLVNSWNDQWGDQGLFKIRRGTNECGIDNST 332

Query: 239 NGALP 243
            G +P
Sbjct: 333 TGGVP 337


>gi|395842321|ref|XP_003793966.1| PREDICTED: cathepsin B [Otolemur garnettii]
          Length = 339

 Score = 94.0 bits (232), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 68/239 (28%), Positives = 102/239 (42%), Gaps = 61/239 (25%)

Query: 8   SSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 67
           +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC   C  
Sbjct: 156 AEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPAC-TGEGDTPKCSKTC-E 212

Query: 68  DNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPV 127
             Y   + +DK+     Y +     +I  EI KNGPV          FS           
Sbjct: 213 PGYSPTYKEDKHFGYTSYSLPTNEWEIMAEIYKNGPV-------EGAFS----------- 254

Query: 128 VANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEI 187
                +YSD   YKSGVY                                      + ++
Sbjct: 255 -----VYSDFLLYKSGVYQ-----------------------------------HLTGDM 274

Query: 188 VAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDN 246
           +    ++++GWGEENG PYW + +++   +GD G  +ILRG++   IES V   +P+ +
Sbjct: 275 MGGHAIRILGWGEENGVPYWLVANSWNTDWGDGGFFRILRGQDHCGIESEVVAGIPRTD 333


>gi|380791571|gb|AFE67661.1| cathepsin B preproprotein, partial [Macaca mulatta]
          Length = 311

 Score = 94.0 bits (232), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 59/178 (33%), Positives = 83/178 (46%), Gaps = 27/178 (15%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGYPAGAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+     Y V++   DI  EI KNGPV     +YSD   YKSG 
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGV 266

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
           Y +                         + E++    ++I+GWG ENG PYW +   +
Sbjct: 267 YQH------------------------VTGEMMGGHAIRILGWGVENGTPYWLVANSW 300


>gi|157058741|gb|ABV03128.1| cathepsin B-2744 [Aulacorthum solani]
          Length = 255

 Score = 94.0 bits (232), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 60/177 (33%), Positives = 80/177 (45%), Gaps = 32/177 (18%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK- 60
           C  G +   W     +G+VTGG + SN GCQP    PC+H    +S   C +L   Q   
Sbjct: 96  CDGGSAFKAWELTMSKGIVTGGNYDSNEGCQPYKNRPCDHYG-DSSLTNCSSLRRTQMTV 154

Query: 61  CHTRCTNDNYGRGFFQDKYRFKRYY---WVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
           C  +C N NY   +  D ++    Y   W N  V  IQQEIM  GPV A MY+Y +   Y
Sbjct: 155 CREKCVNKNYKVKYEDDLHKTSIVYMTSWTN--VKQIQQEIMTYGPVTALMYVYENFMGY 212

Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWG-EENGRPYW 173
           K G Y                         S + E++ Y  VK++GWG +E+G  YW
Sbjct: 213 KKGIYK------------------------STAGELIGYHHVKLIGWGVDEDGTEYW 245


>gi|91089437|ref|XP_966750.1| PREDICTED: similar to putative cathepsin B-like proteinase
           [Tribolium castaneum]
 gi|270012705|gb|EFA09153.1| cathepsin B precursor [Tribolium castaneum]
          Length = 324

 Score = 94.0 bits (232), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 74/233 (31%), Positives = 98/233 (42%), Gaps = 73/233 (31%)

Query: 13  WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGR 72
           WV K G+V+GG ++SN GCQP          Y  S      L +  PKC T+C N  Y  
Sbjct: 163 WVAK-GIVSGGDYNSNEGCQP----------YEGSA----FLNSVTPKCSTKCLNSKYTT 207

Query: 73  GFFQDKYRFKRY-YWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANM 131
            + +DK+    + Y  +  VA+IQ EIM NGPVV +M +Y D +SYKSG Y +       
Sbjct: 208 PYAKDKHYGTDFIYMTSKNVAEIQTEIMNNGPVVTHMDVYEDFYSYKSGVYQH------- 260

Query: 132 YLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYA 191
                             S   +    VKI+GWG E G PYW I                
Sbjct: 261 -----------------VSGNSMGGHAVKIIGWGTEKGVPYWLIAN-------------- 289

Query: 192 TVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
                 WG +     W  +  F          KILRG+N   IE+ + G  P+
Sbjct: 290 -----SWGAK-----WADLDGF---------YKILRGKNHCKIETYIYGGTPQ 323


>gi|204022094|dbj|BAG71144.1| cathepsin B-N1 [Tuberaphis taiwana]
          Length = 334

 Score = 94.0 bits (232), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 75/245 (30%), Positives = 102/245 (41%), Gaps = 67/245 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           CS G     W    K GLVTGG + S  GCQP   PPC    Y  +   C+    P  K 
Sbjct: 154 CSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPCPLDEYGNNT--CR--GKPAEKN 209

Query: 62  HTRCTNDNYGR---GFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
           H RCT   YG     F +D +  +  Y++      IQ                +DI +Y 
Sbjct: 210 H-RCTRMCYGNQDLDFKEDHHYTRDAYYLT--YGTIQ----------------NDILAY- 249

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
                 GP+ A+  +Y D  SYKSGVY    +A  +    VK++GWGEE G PY      
Sbjct: 250 ------GPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWGEEYGVPY------ 297

Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                       W +V+++ +Q+GD+G  KI RG NE  I++  
Sbjct: 298 ----------------------------WLLVNSWNDQWGDQGLFKIRRGTNECGIDNST 329

Query: 239 NGALP 243
            G +P
Sbjct: 330 TGGVP 334


>gi|46812327|gb|AAT02230.1| cathepsin B-like proteinase [Triatoma dimidiata]
          Length = 332

 Score = 94.0 bits (232), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 71/243 (29%), Positives = 101/243 (41%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   S W +    G+V+GG + S  GCQP S  PC H +   S P C+        C
Sbjct: 150 CFGGDPGSAWEYWRDVGIVSGGNYGSKEGCQPYSIAPCEH-HIPGSRPPCRGEGH-TADC 207

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C    Y   + +D +  +  Y    +V +IQ EI+KNGPV A  ++Y D        
Sbjct: 208 RKQCEK-GYSIPYDKDLHYAEFVYSTERDVKEIQTEILKNGPVEAAFFVYED-------- 258

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          + +YK GVY   A A +  +A +KI+GWG ENG PY         
Sbjct: 259 ---------------LLTYKEGVYKHVAGAPVGGHA-IKILGWGVENGTPY--------- 293

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                    W I +++   +G+ G  KILRG +E  IE  V+  
Sbjct: 294 -------------------------WLIANSWNTDWGNNGFFKILRGSDECGIEIDVSAG 328

Query: 242 LPK 244
           LP+
Sbjct: 329 LPR 331


>gi|294894290|ref|XP_002774786.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239880403|gb|EER06602.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 830

 Score = 94.0 bits (232), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 73/270 (27%), Positives = 104/270 (38%), Gaps = 72/270 (26%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAH------HSNTGCQPVSFPPCNHANYTTSEPECKTLA 55
           C+ G  +S W WVH +G+ TGG +        + GC P  FPPC H    T  PEC  ++
Sbjct: 605 CNGGFPNSAWSWVHDKGIATGGDYVAKDDMTKDDGCWPYDFPPCAHHINDTKYPECPKVS 664

Query: 56  TP---------------------QPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADI 94
                                   P C  +C N  Y      D++           V D 
Sbjct: 665 CSGESPPATAETATVIAYQNSYETPNCAEQCHNPKYTTTLRDDRHFMLESSPYQYSVNDA 724

Query: 95  QQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIV 154
           +  I  +GPV   +Y      ++         V A+  +Y D  +YKSGVY         
Sbjct: 725 KNAIRTDGPV-GPIYFCDPNVNFDQ-------VSASFSVYEDFLAYKSGVYK-------- 768

Query: 155 AYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFG 214
                                        S E +    VK+IGWGEE+G+ YW +V+++ 
Sbjct: 769 ---------------------------HTSGEYLGGHAVKIIGWGEESGQAYWIVVNSWN 801

Query: 215 EQFGDKGTIKILRGRNEAIIESLVNGALPK 244
           E +GD G  KI  G N  I ++L+ G  PK
Sbjct: 802 EDWGDHGLFKIALG-NCGIDDNLLGGT-PK 829


>gi|204022092|dbj|BAG71143.1| cathepsin B-N2 [Tuberaphis coreana]
          Length = 334

 Score = 94.0 bits (232), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 75/245 (30%), Positives = 102/245 (41%), Gaps = 67/245 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           CS G     W    K GLVTGG + S  GCQP   PPC    Y  +   C+    P  K 
Sbjct: 154 CSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPCPLDEYGNNT--CR--GKPAEKN 209

Query: 62  HTRCTNDNYGR---GFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
           H RCT   YG     F +D +  +  Y++      IQ                +DI +Y 
Sbjct: 210 H-RCTRMCYGNQDLDFKEDHHYTRDAYYLT--YGTIQ----------------NDILAY- 249

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
                 GP+ A+  +Y D  SYKSGVY    +A  +    VK++GWGEE G PY      
Sbjct: 250 ------GPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWGEEYGVPY------ 297

Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                       W +V+++ +Q+GD+G  KI RG NE  I++  
Sbjct: 298 ----------------------------WLLVNSWNDQWGDQGLFKIRRGTNECGIDNST 329

Query: 239 NGALP 243
            G +P
Sbjct: 330 TGGVP 334


>gi|226473754|emb|CAX71562.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 329

 Score = 94.0 bits (232), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 69/243 (28%), Positives = 102/243 (41%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H         C       P+C
Sbjct: 146 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 204

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+     Y V    + IQ++IM +GPV A + +Y D  +YKSG 
Sbjct: 205 KQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 262

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ Y +G        + ++   V+++GWG EN             
Sbjct: 263 ---------------IYRYTTG--------QFISGHAVRLIGWGVEN------------- 286

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                G  YW   +T+ E +G+KG  +I+RGRNE  IES +   
Sbjct: 287 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAG 325

Query: 242 LPK 244
           L K
Sbjct: 326 LIK 328


>gi|56752809|gb|AAW24616.1| unknown [Schistosoma japonicum]
          Length = 342

 Score = 94.0 bits (232), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 69/243 (28%), Positives = 102/243 (41%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H         C       P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+     Y V    + IQ++IM +GPV A + +Y D  +YKSG 
Sbjct: 218 KQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 275

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ Y +G        + ++   V+++GWG EN             
Sbjct: 276 ---------------IYRYTTG--------QFISGHAVRLIGWGVEN------------- 299

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                G  YW   +T+ E +G+KG  +I+RGRNE  IES +   
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAG 338

Query: 242 LPK 244
           L K
Sbjct: 339 LIK 341


>gi|56757646|gb|AAW26973.1| unknown [Schistosoma japonicum]
          Length = 342

 Score = 94.0 bits (232), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 69/243 (28%), Positives = 102/243 (41%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H         C       P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +  C    Y   + QDK+     Y V    +  Q++IM +GPV A + +Y D  +YKSG 
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSGESVFQKDIMMHGPVEAYLEIYEDFLNYKSG- 275

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ Y +G Y        ++   V+++GWG EN             
Sbjct: 276 ---------------IYRYTTGKY--------ISGHAVRLIGWGVEN------------- 299

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                G  YW   +T+ E +G+KG  +I+RGRNE  IES +   
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAG 338

Query: 242 LPK 244
           L K
Sbjct: 339 LIK 341


>gi|268566089|ref|XP_002647469.1| Hypothetical protein CBG06541 [Caenorhabditis briggsae]
          Length = 280

 Score = 94.0 bits (232), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 72/243 (29%), Positives = 98/243 (40%), Gaps = 69/243 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     + W + RG+VTGG     +GC+P  F PCN    +   PE KT     P C
Sbjct: 106 CEGGYPIQAFRWWNSRGVVTGG-DFRGSGCRPYPFAPCN----SYKCPEEKT-----PTC 155

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + +DK      Y V   VA IQ EIM NGPVV    +Y D++ YKSG 
Sbjct: 156 SLSC-QFGYSTAYAKDKRFGVSAYAVARNVAAIQTEIMTNGPVVGAFTMYEDMYKYKSGV 214

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         +  ++    +KI+GWG +NG P          
Sbjct: 215 YRH------------------------TAGRLLGGHAIKIIGWGTQNGIP---------- 240

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW I +++G  +G+ G +K+ RG NE  IES V   
Sbjct: 241 ------------------------YWLIANSWGADWGENGFLKMRRGVNECGIESAVVAG 276

Query: 242 LPK 244
           +PK
Sbjct: 277 MPK 279



 Score = 51.6 bits (122), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 27/73 (36%), Positives = 42/73 (57%), Gaps = 2/73 (2%)

Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
           NGPV A+  +Y D + YK GVY  +A  ++V    +KI+GWG E+G  YW I   +    
Sbjct: 3   NGPVEASFTVYEDFYIYKKGVYQYTA-GQVVGVHAIKIMGWGTEHGTDYWLIANSWGAQC 61

Query: 184 SAEIVAYATVKLI 196
            +   A++T ++I
Sbjct: 62  GS-CWAFSTAEVI 73


>gi|204022106|dbj|BAG71150.1| cathepsin B-N [Astegopteryx spinocephala]
          Length = 332

 Score = 93.6 bits (231), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 78/247 (31%), Positives = 105/247 (42%), Gaps = 71/247 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W    K GLVTGG + S+ GCQP    PC    Y  +   C+    P  K 
Sbjct: 152 CHGGYPIKAWERFKKHGLVTGGNYDSSEGCQPYRVSPCPLDEYGNNT--CR--GKPAEKN 207

Query: 62  HTRCTNDNYG---RGFFQDKYRFKR--YYWVNDEVADIQQEIMKNGPVVANMYLYSDIFS 116
           H RCT   YG   R F +D +RF R  YY        IQ+++M  GP+ A         S
Sbjct: 208 H-RCTRMCYGDQDRDFKED-HRFTRDAYYLT---YGTIQKDVMTYGPIEA---------S 253

Query: 117 YKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
           Y+              +Y D  SYKSGVY  + +A  +    VK++GWGEE G PY    
Sbjct: 254 YE--------------VYDDFPSYKSGVYVRTENATYLGGHAVKLIGWGEEYGVPY---- 295

Query: 177 RVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
                                         W +V+++ +Q+GD+G  KI RG NE  I++
Sbjct: 296 ------------------------------WLMVNSWNDQWGDRGLFKIRRGTNECGIDN 325

Query: 237 LVNGALP 243
              G +P
Sbjct: 326 STTGGVP 332


>gi|226474164|emb|CAX71568.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
 gi|226474166|emb|CAX71569.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 93.6 bits (231), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 69/243 (28%), Positives = 102/243 (41%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H         C       P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+     Y V    + IQ++IM +GPV A + +Y D  +YKSG 
Sbjct: 218 KQICQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 275

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ Y +G        + ++   V+++GWG EN             
Sbjct: 276 ---------------IYRYTTG--------QFISGHAVRLIGWGVEN------------- 299

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                G  YW   +T+ E +G+KG  +I+RGRNE  IES +   
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAG 338

Query: 242 LPK 244
           L K
Sbjct: 339 LIK 341


>gi|226474160|emb|CAX71567.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 93.6 bits (231), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 69/243 (28%), Positives = 102/243 (41%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H         C       P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+     Y V    + IQ++IM +GPV A + +Y D  +YKSG 
Sbjct: 218 KQICQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 275

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ Y +G        + ++   V+++GWG EN             
Sbjct: 276 ---------------IYRYTTG--------QFISGHAVRLIGWGVEN------------- 299

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                G  YW   +T+ E +G+KG  +I+RGRNE  IES +   
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAG 338

Query: 242 LPK 244
           L K
Sbjct: 339 LIK 341


>gi|56756114|gb|AAW26235.1| unknown [Schistosoma japonicum]
          Length = 342

 Score = 93.6 bits (231), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 69/243 (28%), Positives = 102/243 (41%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G    +W +    G+VTGG+  ++TGC+P  FP C+H         C       P+C
Sbjct: 159 CDGGYFLPSWDYWVSHGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYETPQC 217

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+     Y V    + IQ++IM +GPV A + +Y D  +YKSG 
Sbjct: 218 KQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 275

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ Y +G Y        ++   V+++GWG EN             
Sbjct: 276 ---------------IYRYTTGKY--------ISGHAVRLIGWGVEN------------- 299

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                G  YW   +T+ E +G+KG  +I+RGRNE +IES +   
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAG 338

Query: 242 LPK 244
           L K
Sbjct: 339 LIK 341


>gi|226469952|emb|CAX70257.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 93.6 bits (231), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 71/238 (29%), Positives = 100/238 (42%), Gaps = 62/238 (26%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +    G+VTGG+  ++TGCQP  FP C H +     P C       P+C
Sbjct: 159 CDGGFPGPAWDYWVSHGIVTGGSKENHTGCQPYPFPKCEHHS-IGKYPSCGDKMYKTPQC 217

Query: 62  HTRCTNDNYGRGFFQDK-YRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
             +C    Y   +  DK Y       + +E+A IQ+EIM  GPV A + ++ D  +YKSG
Sbjct: 218 KRKCQK-GYTTPYEHDKHYGGIAINVIKNELA-IQKEIMMYGPVEAYLLIFEDFLNYKSG 275

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
                           I+ Y +G +        V    V+I+GWG EN            
Sbjct: 276 ----------------IYKYTTGSF--------VGEHYVRIIGWGIEN------------ 299

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                 G  YW   +T+ E +G+KG  +I+RGRNE  IES+V
Sbjct: 300 ----------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESVV 335


>gi|226469950|emb|CAX70256.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 93.6 bits (231), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 69/237 (29%), Positives = 97/237 (40%), Gaps = 60/237 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +    G+VTGG+  ++TGCQP  FP C H +     P C       P+C
Sbjct: 159 CDGGFPGPAWDYWVSHGIVTGGSKENHTGCQPYPFPKCEHHS-IGKYPSCGDKIYKTPQC 217

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C    Y   +  DK+       V    + IQ+EIM  GPV A + ++ D  +YKSG 
Sbjct: 218 KRKCQK-GYTTPYEHDKHYGGISINVIKNESAIQKEIMMYGPVEAYLLIFEDFLNYKSG- 275

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ Y +G +        V    V+I+GWG EN             
Sbjct: 276 ---------------IYRYTTGSF--------VGEHYVRIIGWGIEN------------- 299

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                G  YW   +T+ E +G+KG  +I+RGRNE  IES+V
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESVV 335


>gi|161343863|tpg|DAA06112.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 340

 Score = 93.6 bits (231), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 69/242 (28%), Positives = 100/242 (41%), Gaps = 60/242 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   + W +   +G+V+GG + S  GC P    PC H    T  P CK      P C
Sbjct: 158 CNGGFPGAAWHYWKTKGIVSGGPYGSKMGCIPYEIAPCEHHVNGTRGP-CKE-GGKTPAC 215

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C  D Y   + QD +R K  Y + ++V  I+QEI  NGPV     +Y          
Sbjct: 216 VKKC-EDGYKVPYAQDLHRGKSAYSLGNDVDQIRQEIYTNGPVEGAFTVY---------- 264

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                         D  +Y++GVY   A   +  +A ++I+GWG +NG            
Sbjct: 265 -------------EDFIAYRAGVYKHVAGKALGGHA-IRILGWGVQNG------------ 298

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
               EI                 PYW + +++   +G  G  KILRG +E  IE  +N  
Sbjct: 299 ----EI-----------------PYWLVANSWNSDWGSDGFFKILRGSDECGIEGQINAG 337

Query: 242 LP 243
           LP
Sbjct: 338 LP 339


>gi|161343865|tpg|DAA06113.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 335

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 69/250 (27%), Positives = 102/250 (40%), Gaps = 77/250 (30%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP-- 59
           C+ G     W +  + G+VTGG +++  GCQP   PPC        + E     + QP  
Sbjct: 153 CNGGNPLKAWKYFKRHGVVTGGNYNTTDGCQPYRVPPC------VRDDEGHNSCSGQPTE 206

Query: 60  ---KCHTRCTND---NYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSD 113
              KC  +C  D   NY +  ++ K     YY                   ++N  +  D
Sbjct: 207 RNHKCSKKCYGDETINYKKNHYKTK---DAYY-------------------LSNTTMQKD 244

Query: 114 IFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYW 173
              Y       GP+ A+  +Y D  SY+SGVY  + +A  +    VK++GWG E G PY 
Sbjct: 245 TMVY-------GPIEASFDVYDDFTSYESGVYQKTENASYLGGHAVKMIGWGVEEGTPY- 296

Query: 174 TIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 233
                                            W +V+++GEQ+GDKG  KILRG +E  
Sbjct: 297 ---------------------------------WLMVNSWGEQWGDKGMFKILRGTDECG 323

Query: 234 IESLVNGALP 243
           +ES     +P
Sbjct: 324 VESSCTAGVP 333


>gi|204022098|dbj|BAG71146.1| cathepsin B-N2 [Tuberaphis sumatrana]
          Length = 334

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 74/245 (30%), Positives = 101/245 (41%), Gaps = 67/245 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           CS G     W    K GLVTGG + S  GCQP   PPC    Y  +    K    P  K 
Sbjct: 154 CSGGYPIKAWERFKKHGLVTGGNYESGEGCQPYRVPPCPLDEYGNNTCSGK----PTEKN 209

Query: 62  HTRCTNDNYGR---GFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
           H RCT   YG     F +D +  +  Y++      IQ                +D+ +Y 
Sbjct: 210 H-RCTRMCYGNQDLDFKEDHHYTRDAYYLT--YGTIQ----------------NDVLAY- 249

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
                 GP+ A+  +Y D  SYKSGVY    +A  +    VK++GWGEE G PY      
Sbjct: 250 ------GPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWGEEYGVPY------ 297

Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                       W +V+++ +Q+GD+G  KI RG NE  I++  
Sbjct: 298 ----------------------------WLLVNSWNDQWGDQGLFKIRRGTNECGIDNST 329

Query: 239 NGALP 243
            G +P
Sbjct: 330 TGGVP 334


>gi|56755451|gb|AAW25905.1| unknown [Schistosoma japonicum]
          Length = 342

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 69/243 (28%), Positives = 102/243 (41%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H         C       P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+     Y V    + IQ++IM +GPV A + +Y D  +YKSG 
Sbjct: 218 KQICQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 275

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ Y +G        + ++   V+++GWG EN             
Sbjct: 276 ---------------IYRYTTG--------QFISGHAVRLIGWGVEN------------- 299

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                G  YW   +T+ E +G+KG  +I+RGRNE  IES +   
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAG 338

Query: 242 LPK 244
           L K
Sbjct: 339 LIK 341


>gi|1345924|sp|P25802.3|CYSP1_OSTOS RecName: Full=Cathepsin B-like cysteine proteinase 1; Flags:
           Precursor
          Length = 341

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 58/175 (33%), Positives = 85/175 (48%), Gaps = 27/175 (15%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   S + +    G+VTGG +++   C+P    PC H    T   EC  +A   P+C
Sbjct: 160 CEGGWPISAFRFHADEGVVTGGDYNTKGSCRPYEIHPCGHHGNETYYGECVGMAD-TPRC 218

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             RC    Y + +  D+Y +K+ Y + + V  IQ++IMKNGPVVA   +Y D   Y+SG 
Sbjct: 219 KRRCLL-GYPKSYPSDRY-YKKAYQLKNSVKAIQKDIMKNGPVVATYTVYEDFAHYRSG- 275

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
                      +Y      K+G++A            VK++GWGEE G PYW + 
Sbjct: 276 -----------IYKHKAGRKTGLHA------------VKVIGWGEEKGTPYWIVA 307


>gi|52630945|gb|AAU84936.1| putative cathepsin B-S [Toxoptera citricida]
          Length = 335

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 68/249 (27%), Positives = 102/249 (40%), Gaps = 75/249 (30%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPC----NHANYTTSEPECKTLATP 57
           C+ G     W +  + G+VTGG +++  GCQP   PPC       N  + +P       P
Sbjct: 153 CNGGNPLKAWQYFKRHGVVTGGNYNTTDGCQPYKVPPCVKDEEGHNSCSGQP-----TEP 207

Query: 58  QPKCHTRCTND---NYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDI 114
             KC   C  D   +Y +G     Y+ K  Y++N +                   +  D 
Sbjct: 208 NHKCSRSCYGDKTCDYKKG----HYKTKNAYYLNIDT------------------MQKDT 245

Query: 115 FSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWT 174
            +Y       GP+ A+  +Y D  +Y+SGVY  +  A+ +    VK++GWGEE+G PY  
Sbjct: 246 IAY-------GPIEASFDVYDDFVNYESGVYQKTEDAKYLGGHAVKMIGWGEEDGTPY-- 296

Query: 175 IVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 234
                                           W +V+++GEQ+G  G  KILRG NE  I
Sbjct: 297 --------------------------------WLMVNSWGEQWGANGMFKILRGTNECGI 324

Query: 235 ESLVNGALP 243
           E      +P
Sbjct: 325 EGSPTAGVP 333


>gi|443692853|gb|ELT94358.1| hypothetical protein CAPTEDRAFT_221292 [Capitella teleta]
          Length = 374

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 69/243 (28%), Positives = 99/243 (40%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   + W +    GLV+GG + ++ GC+P S  PC H    T  P C     P PKC
Sbjct: 191 CNGGFPPAAWEYFRDTGLVSGGQYGTHQGCRPYSIAPCEHHVNGTRLP-CSGEG-PTPKC 248

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   +  DK      Y V+++   I  EIM NGPV     +Y          
Sbjct: 249 ERTC-EKGYKVKYEDDKNFGYTAYSVDNDEKQIMTEIMTNGPVEGAFTVY---------- 297

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                        +D  +YKSGVY   +  E+  +A ++++GWG E+G P          
Sbjct: 298 -------------ADFPTYKSGVYQHVSGGELGGHA-IRVLGWGVEDGTP---------- 333

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW + +++   +GD G  KILRG+NE  IE  +   
Sbjct: 334 ------------------------YWLVANSWNSDWGDNGFFKILRGQNECGIEGEIVAG 369

Query: 242 LPK 244
           LPK
Sbjct: 370 LPK 372


>gi|211853248|emb|CAP17587.1| cathepsin-like protein 4 [Crateromorpha meyeri]
          Length = 325

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 73/240 (30%), Positives = 103/240 (42%), Gaps = 66/240 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTGG-----AHHSNTGCQPVSFPPCNHANYTTSEPECKTLAT 56
           C  G   + W +  + GLVTGG     A  S+T CQP   P C H +   S+P C +   
Sbjct: 147 CEGGFLGAAWNYWKQEGLVTGGLYNPSATESDT-CQPYPLPSCEH-HINGSKPACPSKIA 204

Query: 57  PQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFS 116
             P+C   C +  Y   + QD +  +  Y V   VA+IQ EIM NGPV A   +Y+    
Sbjct: 205 KTPECVHTC-HAGYPTSYEQDLHYGESAYSVRRRVAEIQTEIMTNGPVEAAFTVYA---- 259

Query: 117 YKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
                              D  +YKSGVY   +  ++  +A VK++GWGEE+G PY    
Sbjct: 260 -------------------DFPAYKSGVYKRHSLRQLGGHA-VKMIGWGEEDGIPY---- 295

Query: 177 RVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
                                         W I +++   +GD G  KI+RG++E  IES
Sbjct: 296 ------------------------------WLIANSWNSDWGDHGYFKIVRGQDECGIES 325


>gi|226469948|emb|CAX70255.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 69/237 (29%), Positives = 96/237 (40%), Gaps = 60/237 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +    G+VTGG+  ++TGCQP  FP C H +     P C       P+C
Sbjct: 159 CDGGFPGPAWDYWVSHGIVTGGSKENHTGCQPYPFPKCEHHS-IGKYPSCGDKIYKTPQC 217

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C    Y   +  DK+       V    + IQ EIM  GPV A + ++ D  +YKSG 
Sbjct: 218 KRKCQK-GYTTPYEHDKHYGGISINVIKNESAIQNEIMMYGPVEAYLLIFEDFLNYKSG- 275

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ Y +G +        V    V+I+GWG EN             
Sbjct: 276 ---------------IYRYTTGSF--------VGEHYVRIIGWGIEN------------- 299

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                G  YW   +T+ E +G+KG  +I+RGRNE  IES+V
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESVV 335


>gi|56754307|gb|AAW25341.1| unknown [Schistosoma japonicum]
          Length = 309

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 67/243 (27%), Positives = 103/243 (42%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G++  +W +    G+VTGG+  ++TGC+P  FP C+H         C       P+C
Sbjct: 126 CDGGVTGYSWDYWVSHGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 184

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+     Y V    + IQ++IM +G V A + +Y D  +YKSG 
Sbjct: 185 KQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGTVEAYLEIYEDFLNYKSG- 242

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ Y +G        + ++   V+++GWG EN             
Sbjct: 243 ---------------IYRYTTG--------QFISGHAVRLIGWGVEN------------- 266

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                G  YW   +T+ E +G+KG  +I+RGRNE +IES +   
Sbjct: 267 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAG 305

Query: 242 LPK 244
           L K
Sbjct: 306 LIK 308


>gi|332376204|gb|AEE63242.1| unknown [Dendroctonus ponderosae]
          Length = 338

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 65/242 (26%), Positives = 94/242 (38%), Gaps = 60/242 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G  +  W +    G+VTGGA++S+ GC+  S  PC H     S P+C +L    P+C
Sbjct: 156 CDGGYVAEPWDYWRTDGIVTGGAYNSSQGCKDYSLEPCEHHVEVGSRPQCSSLNFDTPEC 215

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C   +     + +   F +          +Q EI+KNGP+ A   +Y+         
Sbjct: 216 VRSCYESSLD---YTESLTFGQQVSTFTNEKQMQLEILKNGPIEAAFTVYN--------- 263

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                         D  SYKSGVY  +A  E V    +K++GWG E G  Y         
Sbjct: 264 --------------DFLSYKSGVYQATAQDESVGGHAIKVLGWGVEEGTKY--------- 300

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                    W I +++   +GD G  K LRG +   IES    +
Sbjct: 301 -------------------------WLIANSWNTDWGDNGYFKFLRGVDHCGIESETAAS 335

Query: 242 LP 243
           LP
Sbjct: 336 LP 337


>gi|345308|pir||S31909 cathepsin B-like cysteine proteinase (EC 3.4.22.-) - fluke
           (Schistosoma japonicum)
          Length = 316

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 68/237 (28%), Positives = 97/237 (40%), Gaps = 60/237 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +    G+VTGG+  ++TGCQP  FP C H +     P C       P+C
Sbjct: 133 CDGGFPGPAWDYWVSHGIVTGGSKENHTGCQPYPFPKCEHHS-KGKYPSCGDKMYKTPQC 191

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C    Y   +  DK+       V    + IQ+EIM  GPV A + ++ D  +YKSG 
Sbjct: 192 KRKCQK-GYKTPYEHDKHYGGISINVIKNESAIQKEIMMYGPVEAYLLIFEDFLNYKSG- 249

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ Y +G +        V    V+I+GWG EN             
Sbjct: 250 ---------------IYRYTTGSF--------VGEHYVRIIGWGIEN------------- 273

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                G  YW   +T+ E +G+KG  +I+RGRNE  +ES+V
Sbjct: 274 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECSVESVV 309


>gi|328697984|ref|XP_003240502.1| PREDICTED: cathepsin B [Acyrthosiphon pisum]
          Length = 339

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 72/248 (29%), Positives = 100/248 (40%), Gaps = 67/248 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G     W +    GLVTGG + S  GC+P   PPC      TS         P  K 
Sbjct: 156 CNGGYPIKAWKYFSSHGLVTGGNYKSGEGCEPYRVPPCPRNEDGTSS----CAGQPIEKN 211

Query: 62  HTRCTNDNYGRGF--FQDKYRFKR-YYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
           H RCT   YG     + D +RF R YY++      IQ+++M  GP+ A+  +Y D     
Sbjct: 212 H-RCTRMCYGNQDLDYNDDHRFTRDYYYLT--YGSIQKDVMNYGPIEASFDVYDDF---- 264

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
                              +SYKSGVY  + +A  +    VK++GWG E G PY      
Sbjct: 265 -------------------YSYKSGVYQRTPNATKLGGHAVKLIGWGVEEGIPY------ 299

Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                       W +V+++  Q+GD G  KI RG +E  I+S  
Sbjct: 300 ----------------------------WLMVNSWSAQWGDNGLFKIRRGTDECGIDSAT 331

Query: 239 NGALPKDN 246
              +P  N
Sbjct: 332 TAGVPVTN 339


>gi|241154720|ref|XP_002407359.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
 gi|215494103|gb|EEC03744.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
          Length = 337

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 66/243 (27%), Positives = 103/243 (42%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G  ++ W    +RG+V+GG + +  GC+P S  PC + +     P C  +    P+C
Sbjct: 154 CKGGFPAAAWEHWKERGIVSGGLYGTPDGCKPYSLAPCEY-HTKCRIPNCIPIVH-TPEC 211

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y + + +DK+  ++ Y ++ +   IQ EI  NGPV A+ ++Y D   YKSG 
Sbjct: 212 VHHCRK-GYDKDYQEDKHFGQKVYSISRDEKQIQTEIFTNGPVEADFHVYGDFLCYKSG- 269

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                      +Y    +   G++A            ++I+GWG ENG P          
Sbjct: 270 -----------VYQRHSNDGRGMHA------------IRILGWGTENGTP---------- 296

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW   +++ E +GDKG  KILR  NE  IE  +   
Sbjct: 297 ------------------------YWLAANSWNENWGDKGYFKILRRTNECGIEEHIYAG 332

Query: 242 LPK 244
           +PK
Sbjct: 333 IPK 335


>gi|392920988|ref|NP_506011.2| Protein F57F5.1 [Caenorhabditis elegans]
 gi|206994319|emb|CAB00098.2| Protein F57F5.1 [Caenorhabditis elegans]
          Length = 351

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 72/243 (29%), Positives = 103/243 (42%), Gaps = 59/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G     W    K+G VTGG++   TGC+P  +PPC H    T    C +   P  KC
Sbjct: 167 CNGGYPIEAWRHYVKKGYVTGGSYQDKTGCKPYPYPPCEHHVNGTHYKPCPSNMYPTDKC 226

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QD +  +  Y V+ + A+IQ+EIM +GPV     +Y D F + SG 
Sbjct: 227 ERSC-QAGYALTYQQDLHFGQSAYAVSKKAAEIQKEIMTHGPVEVAFTVYED-FEHYSG- 283

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                                GVY  +A A +  +A VK++GWG +NG P          
Sbjct: 284 ---------------------GVYVHTAGASLGGHA-VKMLGWGVDNGTP---------- 311

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW   +++ E +G+ G  +I+RG NE  IE  V G 
Sbjct: 312 ------------------------YWLCANSWNEDWGENGYFRIIRGVNECGIEGGVVGG 347

Query: 242 LPK 244
           +PK
Sbjct: 348 IPK 350


>gi|226473760|emb|CAX71565.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 69/243 (28%), Positives = 103/243 (42%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H         C       P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +  C    Y   + QDK+     Y V    + IQ++IM +GPV A + +Y D  +YKSG 
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 275

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ Y +G Y        ++   V+++G G EN             
Sbjct: 276 ---------------IYRYTTGKY--------ISGHAVRLIGCGVEN------------- 299

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                G  YW   +T+ E +G+KG  +I+RGRNE +IES +   
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAG 338

Query: 242 LPK 244
           L K
Sbjct: 339 LIK 341


>gi|226474178|emb|CAX71575.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 67/237 (28%), Positives = 101/237 (42%), Gaps = 60/237 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G    +W +   RG+VTGG+  ++TGC+P  FP C+H         C       P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +  C    Y   + QDK+     Y V    + IQ++IM +GPV A + +Y D  +YKSG 
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 275

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ Y +G        + ++   V+++GWG EN             
Sbjct: 276 ---------------IYRYTTG--------QFISGHAVRLIGWGVEN------------- 299

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                G  YW   +T+ E +G+KG  +I+RGRNE  IES +
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEI 335


>gi|204022096|dbj|BAG71145.1| cathepsin B-N1 [Tuberaphis sumatrana]
          Length = 334

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 74/245 (30%), Positives = 100/245 (40%), Gaps = 67/245 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           CS G     W    K GLVTGG + S  GCQP   PPC    Y  +    K    P  K 
Sbjct: 154 CSGGNPIKAWERFQKHGLVTGGNYDSGEGCQPYKVPPCPLDEYGNNTCSGK----PAEKN 209

Query: 62  HTRCTNDNYGR---GFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
           H RCT   YG     F +D +  +  Y++      IQ                 D+ +Y 
Sbjct: 210 H-RCTRMCYGNQNLDFKEDHHYTRDAYYLT--YGTIQY----------------DVLAY- 249

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
                 GP+ A+  +Y D  SYKSGVY    +A  +    VK++GWGEE G PY      
Sbjct: 250 ------GPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWGEEYGVPY------ 297

Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                       W +V+++ +Q+GD+G  KI RG NE  I++  
Sbjct: 298 ----------------------------WLLVNSWNDQWGDQGLFKIRRGTNECGIDNST 329

Query: 239 NGALP 243
            G +P
Sbjct: 330 TGGVP 334


>gi|10803443|emb|CAC13134.1| putative cathepsin B.8 [Ostertagia ostertagi]
          Length = 197

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 55/172 (31%), Positives = 74/172 (43%), Gaps = 24/172 (13%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   S W +  K G+VTG  H +N GC+P  FP C H +  T    CK    P PKC
Sbjct: 43  CNGGDPLSAWKFWVKEGIVTGSNHSTNAGCKPYPFPACEHHSNKTHYDPCKHDLFPTPKC 102

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C      R + +DKY  +  Y V + +  IQ+EI+  GPV     +Y D  +Y  G 
Sbjct: 103 EKSCQATFGERTYKEDKYFGRSAYGVKNHMEAIQKEIITYGPVEVAFEVYEDFLNYAGGI 162

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYW 173
           Y                        V     +     VK++GWG +NG PYW
Sbjct: 163 Y------------------------VHQGGALGGGHAVKMIGWGIDNGVPYW 190


>gi|390994431|gb|AFM37365.1| cathepsin B2 [Dictyocaulus viviparus]
          Length = 346

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 67/243 (27%), Positives = 97/243 (39%), Gaps = 59/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   + W +  ++G+V+GG++ S +GC+P  FPPC H    T    C     P   C
Sbjct: 162 CDGGFPYAAWNYWVEKGIVSGGSYTSKSGCKPYPFPPCEHHTNGTHYHPCPKDLYPTNTC 221

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C +  Y   +  DK    + Y V   V  IQ+EIM +GPV     +Y D   Y  G 
Sbjct: 222 EHKCQS-GYATAYTNDKRYGAKAYTVAARVKAIQKEIMLHGPVEVAYDVYEDFEHYLKG- 279

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ + +G Y        +    VK++GWG ENG P          
Sbjct: 280 ---------------IYKHTAGSY--------LGGHAVKMIGWGTENGIP---------- 306

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW   +++   +G+ G  +ILRG +E  IES V   
Sbjct: 307 ------------------------YWICSNSWNSDWGENGFFRILRGTDECGIESGVVAG 342

Query: 242 LPK 244
           LPK
Sbjct: 343 LPK 345


>gi|390357905|ref|XP_003729132.1| PREDICTED: cathepsin B-like [Strongylocentrotus purpuratus]
          Length = 354

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 68/243 (27%), Positives = 100/243 (41%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   S W +    G+VTGG  +S+ GCQP     C+H    T  P C+    P P+C
Sbjct: 173 CNGGFPGSAWEYYKDTGIVTGGQWNSSQGCQPYQIKSCDHHVNGTKGP-CQGEG-PTPEC 230

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C   +Y   + QDK+       +++     Q EIM NGPV A+  +Y D  +YKSG 
Sbjct: 231 KHKC-EASYSTPYEQDKHYALSVNSISNNPEATQTEIMTNGPVEADFTVYEDFPTYKSGV 289

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         +  ++    +KI+GWG E G            
Sbjct: 290 YQH------------------------TTGGVLGGHAIKILGWGVEEG------------ 313

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW + +++  ++GD G  KILRG NE  IES +N  
Sbjct: 314 ----------------------TKYWLVANSWNNEWGDNGFFKILRGSNECGIESDINFG 351

Query: 242 LPK 244
           +PK
Sbjct: 352 IPK 354


>gi|198429088|ref|XP_002120307.1| PREDICTED: similar to cathepsin B [Ciona intestinalis]
          Length = 364

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 70/243 (28%), Positives = 100/243 (41%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   S W + +  GLVTGG + S TGC P    PC H +     P+C       P C
Sbjct: 182 CNGGFPGSAWKYWNSDGLVTGGLYGSKTGCLPYQIKPCEH-HVPGDRPKCSE-GGGTPSC 239

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
            ++C   N    + QDK+     Y V  +   IQ EIM +GPV     +Y+D  +YKSG 
Sbjct: 240 VSKCKG-NTTIHYNQDKHYGLSSYAVGSDPTQIQTEIMTHGPVEGAFTVYADFPTYKSGV 298

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         +  ++    ++I+GWG ENG            
Sbjct: 299 YKH------------------------VTGGVLGGHAIRILGWGSENG------------ 322

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                 VA                YW + +++   +GDKG  KILRG +E  IES V   
Sbjct: 323 ------VA----------------YWLVANSWNTDWGDKGYFKILRGSDECGIESSVVAG 360

Query: 242 LPK 244
           +P+
Sbjct: 361 IPQ 363


>gi|204022108|dbj|BAG71151.1| cathepsin B-N [Cerataphis jamuritsu]
          Length = 333

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 73/245 (29%), Positives = 101/245 (41%), Gaps = 67/245 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G     W    K GLVTGG + S  GCQP   PPC    Y  +    K    P  K 
Sbjct: 153 CNGGYPIRAWERFRKHGLVTGGNYDSYEGCQPYRVPPCPLDEYGNNTCHGK----PMEKN 208

Query: 62  HTRCTNDNYGR---GFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
           H RCT   YG     F  D +  +  Y++      IQ                +D+ +Y 
Sbjct: 209 H-RCTRMCYGDQDLDFNNDHHYTRDAYYLT--YGTIQ----------------NDVLTY- 248

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
                 GP+ A+  +Y D  SYKSGVY  + +A  +    VK++GWGEE G PY      
Sbjct: 249 ------GPIEASFEVYDDFPSYKSGVYVKTENASYLGGHAVKLIGWGEEYGVPY------ 296

Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                       W +V+++ +Q+GD+G  KI RG NE  I++  
Sbjct: 297 ----------------------------WLLVNSWNDQWGDQGLFKIRRGTNECGIDNST 328

Query: 239 NGALP 243
            G +P
Sbjct: 329 TGGVP 333


>gi|47217183|emb|CAG11019.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 351

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 77/264 (29%), Positives = 103/264 (39%), Gaps = 81/264 (30%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTG---------------------CQPVSFPPCN 40
           C+ G  SS W +    GLV+GG + S+ G                     C+P + PPC 
Sbjct: 148 CNGGYPSSAWNFWVSDGLVSGGLYDSHIGRIQVSLCVLLLAVDRDFVSPGCRPYTIPPCE 207

Query: 41  HANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMK 100
           H +   S P C       P+C  RC    Y   + QDK+  K  Y V+ E  +I+QEI K
Sbjct: 208 H-HVNGSRPSCSGEGGDTPECIFRC-EAGYSPSYKQDKHFGKTSYSVSSEEDEIKQEIYK 265

Query: 101 NGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVK 160
           NGPV     +Y D   YKSG Y +                      VS SA  +    +K
Sbjct: 266 NGPVEGAFTVYEDFVLYKSGVYQH----------------------VSGSA--LGGHAIK 301

Query: 161 IVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDK 220
           ++GWGEENG P                                  YW   +++   +GD 
Sbjct: 302 MLGWGEENGVP----------------------------------YWLCANSWNTDWGDN 327

Query: 221 GTIKILRGRNEAIIESLVNGALPK 244
           G  KILRG +   IES +    PK
Sbjct: 328 GFFKILRGADHCGIESEIVAGNPK 351


>gi|226474182|emb|CAX71577.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 342

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 68/243 (27%), Positives = 101/243 (41%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G    +W +   RG+VTGG+  ++T C+P  FP C+H         C       P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTSCRPYPFPKCDHF-VKGKYRACGDKLYETPQC 217

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + QDK+     Y V    + IQ++IM +GPV A + +Y D  +YKSG 
Sbjct: 218 KQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 275

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ Y +G        + ++   V+++GWG EN             
Sbjct: 276 ---------------IYRYTTG--------QFISGHAVRLIGWGVEN------------- 299

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                G  YW   +T+ E +G+KG  +I+RGRNE  IES +   
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAG 338

Query: 242 LPK 244
           L K
Sbjct: 339 LIK 341


>gi|313233819|emb|CBY09988.1| unnamed protein product [Oikopleura dioica]
          Length = 356

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 70/242 (28%), Positives = 99/242 (40%), Gaps = 61/242 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   + W +  + GLV+GG +H  TGCQP +  PC H +     P C       PKC
Sbjct: 165 CNGGFPQAAWEYWVQNGLVSGGLYHG-TGCQPYAIEPCEH-HTEGDRPPCTGEEGTTPKC 222

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C  D Y   F QDK+     Y +      I  EI KNGPV     +Y D  +YKSG 
Sbjct: 223 SHKCV-DGYTGNFAQDKHYGSVAYRIPANEKAIMNEIYKNGPVEGAFIVYEDFPTYKSG- 280

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          ++S+ +G          +    ++++GWGEENG  YW        
Sbjct: 281 ---------------VYSHHTG--------SALGGHAIRVLGWGEENGEKYW-------- 309

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                        L G             +++   +G+ G  KI RG NE  IES + G 
Sbjct: 310 -------------LCG-------------NSWNTDWGNNGFFKIKRGVNECGIESEMVGG 343

Query: 242 LP 243
           +P
Sbjct: 344 IP 345


>gi|187097096|ref|NP_001119608.1| cathepsin B-348 precursor [Acyrthosiphon pisum]
 gi|161343833|tpg|DAA06097.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 342

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 68/242 (28%), Positives = 100/242 (41%), Gaps = 60/242 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   + W +   +G+V+GG + SN GC P    PC H    T  P CK      P C
Sbjct: 160 CNGGFPGAAWNYWKTKGIVSGGPYGSNMGCIPYEIAPCEHHVNGTRGP-CKE-GGKTPTC 217

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C  + Y   + QD +  K  Y + ++V  I+QEI  NGPV     +Y          
Sbjct: 218 VKKC-EEGYKVPYAQDLHHGKSAYSIRNDVDQIRQEIYTNGPVEGAFTVY---------- 266

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                         D  +Y++GVY   A   +  +A ++I+GWG +NG            
Sbjct: 267 -------------EDFIAYRAGVYKHVAGKALGGHA-IRILGWGVQNG------------ 300

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
               EI                 PYW + +++   +G  G  KILRG +E  IE  +N  
Sbjct: 301 ----EI-----------------PYWLVANSWNTDWGSDGFFKILRGSDECGIEGQINAG 339

Query: 242 LP 243
           LP
Sbjct: 340 LP 341


>gi|355681635|gb|AER96808.1| cathepsin B [Mustela putorius furo]
          Length = 338

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 73/242 (30%), Positives = 102/242 (42%), Gaps = 61/242 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  +  W +    GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGFPAEAWNFWTXXGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + +DK+     Y V+    +I  EI KNGPV A   +YSD   YKSG 
Sbjct: 208 SKIC-EPGYTPSYKEDKHYGCSSYSVSSSEKEIMAEIYKNGPVEAAFSVYSDFLMYKSGV 266

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         + E++    V+I+GWG ENG PYW        
Sbjct: 267 YQH------------------------VTGEMMGGHAVRILGWGVENGTPYW-------- 294

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                        L+G             +++   +GD G  KILRG++   IES +   
Sbjct: 295 -------------LVG-------------NSWNTDWGDNGFFKILRGQDHCGIESEIVAG 328

Query: 242 LP 243
           +P
Sbjct: 329 IP 330


>gi|112983908|ref|NP_001036850.1| cathepsin B precursor [Bombyx mori]
 gi|13548667|dbj|BAB40804.1| cathepsin B [Bombyx mori]
          Length = 337

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 71/237 (29%), Positives = 98/237 (41%), Gaps = 61/237 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           CS G+    W +    GLV+GG+++S+ GC+P   PPC H       P C    T  PKC
Sbjct: 152 CSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMP-CSG-DTKTPKC 209

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C +  Y   + QDK   K  Y V+ +   I+ E+ KNGPV     +YS         
Sbjct: 210 TKKCES-GYDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYS--------- 259

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                         D+ SYKSGVY  +    +  +A VKI+GWG EN   Y         
Sbjct: 260 --------------DLLSYKSGVYKHTQGDALGGHA-VKILGWGVENDNKY--------- 295

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                    W I +++   +GD G  KILRG +   IES +
Sbjct: 296 -------------------------WLIANSWNSDWGDNGFFKILRGEDHCGIESSI 327


>gi|118424551|gb|ABK90823.1| cathepsin B-like cysteine proteinase [Spodoptera exigua]
          Length = 341

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 69/245 (28%), Positives = 103/245 (42%), Gaps = 67/245 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNH---ANYTTSEPECKTLATPQ 58
           C+ G+ +  W +    GLV+GG+++S+ GC+P   PPC H    N      + KT     
Sbjct: 156 CNGGMPTLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCNGDSKT----- 210

Query: 59  PKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
           PKCH  C + +Y   + +DK   K  Y V+ +   I+ E+ KNGPV     +YSD     
Sbjct: 211 PKCHKTCES-SYNVDYHKDKRYGKHVYSVSSKEDHIKAELYKNGPVEGAFTVYSD----- 264

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
                             + +YK+GVY  +    +  +A +KI+GWG ENG  Y      
Sbjct: 265 ------------------LLNYKNGVYKHTVGNALGGHA-IKILGWGVENGNKY------ 299

Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                       W I +++   +GD G  KILRG +   IES +
Sbjct: 300 ----------------------------WLIANSWNSDWGDNGFFKILRGEDHCGIESSI 331

Query: 239 NGALP 243
               P
Sbjct: 332 VAGEP 336


>gi|308488594|ref|XP_003106491.1| hypothetical protein CRE_15919 [Caenorhabditis remanei]
 gi|308253841|gb|EFO97793.1| hypothetical protein CRE_15919 [Caenorhabditis remanei]
          Length = 342

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 59/243 (24%), Positives = 95/243 (39%), Gaps = 59/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G     W +  K G+ TGG++ S  GC+P S  PC       + P C     P P C
Sbjct: 157 CAGGNPLKAWQYWQKHGIPTGGSYESQFGCKPYSIAPCGKTIGNVTYPPCTNTTLPTPTC 216

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C    Y     +D++       + +   +IQ ++M NGPV A M +Y D   Y +G 
Sbjct: 217 EKKC-KPGYPVDLDKDRHYGVSVDQLPNRQIEIQSDVMLNGPVEATMEIYDDFLQYTTGI 275

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y                        V  +     + +V+I+GWG   G P          
Sbjct: 276 Y------------------------VHLAGNKQGHLSVRILGWGMFEGVP---------- 301

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW + +++G+++G+ GT ++LRG NE  +E+     
Sbjct: 302 ------------------------YWLLANSWGKEWGENGTFRVLRGVNECGLEANCISG 337

Query: 242 LPK 244
           +PK
Sbjct: 338 MPK 340


>gi|256090364|ref|XP_002581165.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
 gi|353228444|emb|CCD74615.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 303

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 66/228 (28%), Positives = 94/228 (41%), Gaps = 74/228 (32%)

Query: 14  VHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPKCHTRCTNDNYGR 72
           V   G+VTG +  +NTGC+P  FP C H  +T  + P C +     P+C T C    Y  
Sbjct: 145 VDLEGIVTGSSKENNTGCEPYPFPKCEH--FTKGQYPPCGSKIYKTPRCKTTC-QKRYKT 201

Query: 73  GFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMY 132
            + QDK+R             IQ+EIMK GPV A+  +Y D  +YKSG Y +        
Sbjct: 202 SYAQDKHRA------------IQKEIMKYGPVEASFTVYEDFLNYKSGIYKH-------- 241

Query: 133 LYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYAT 192
                            + E +    ++I+GWG EN  PY                    
Sbjct: 242 ----------------ITGETLGGHAIRIIGWGVENKTPY-------------------- 265

Query: 193 VKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                         W I +++ E +G+ G  +I+RGR+E  IES V  
Sbjct: 266 --------------WLIANSWNEDWGENGYFRIVRGRDECSIESEVTA 299


>gi|392922404|ref|NP_507186.3| Protein CPR-2 [Caenorhabditis elegans]
 gi|206994217|emb|CAB04322.3| Protein CPR-2 [Caenorhabditis elegans]
          Length = 326

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 75/243 (30%), Positives = 96/243 (39%), Gaps = 69/243 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     + W  +RG+VTGG  +  TGC+P    PCN  N       C  L TP   C
Sbjct: 153 CDGGFPYRAFQWWARRGVVTGG-DYLGTGCKPYPIRPCNSDN-------CVNLQTP--PC 202

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   +  DK      Y V   VA IQ +I  NGPVVA   +Y D   YKSG 
Sbjct: 203 RLSC-QPGYRTTYTNDKNYGNSAYPVPRTVAAIQADIYYNGPVVAAFIVYEDFEKYKSG- 260

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                      +Y  I     G +A            VK++GWG E G P          
Sbjct: 261 -----------IYRHIAGRSKGGHA------------VKLIGWGTERGTP---------- 287

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW  V+++G Q+G+ GT +ILRG +E  IES +   
Sbjct: 288 ------------------------YWLAVNSWGSQWGESGTFRILRGVDECGIESRIVAG 323

Query: 242 LPK 244
           LP+
Sbjct: 324 LPR 326


>gi|46195455|ref|NP_990702.1| cathepsin B precursor [Gallus gallus]
 gi|1168790|sp|P43233.1|CATB_CHICK RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Contains:
           RecName: Full=Cathepsin B light chain; Contains:
           RecName: Full=Cathepsin B heavy chain; Flags: Precursor
 gi|603203|gb|AAA87075.1| cathepsin B [Gallus gallus]
          Length = 340

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 67/243 (27%), Positives = 94/243 (38%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S  W +  +RGLV+GG + S+ GC+  + PPC H +   S P C       P+C
Sbjct: 150 CNGGYPSGAWRYWTERGLVSGGLYDSHVGCRAYTIPPCEH-HVNGSRPPCTGEGGETPRC 208

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + +DK+     Y V     +I  EI KNGPV     +Y D   YKSG 
Sbjct: 209 SRHC-EPGYSPSYKEDKHYGITSYGVPRSEKEIMAEIYKNGPVEGAFIVYEDFLMYKSGV 267

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         S E V    ++I+GWG ENG P          
Sbjct: 268 YQH------------------------VSGEQVGGHAIRILGWGVENGTP---------- 293

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW   +++   +G  G  KILRG +   IES +   
Sbjct: 294 ------------------------YWLAANSWNTDWGITGFFKILRGEDHCGIESEIVAG 329

Query: 242 LPK 244
           +P+
Sbjct: 330 VPR 332


>gi|204022104|dbj|BAG71149.1| cathepsin B-N [Astegopteryx styracophila]
          Length = 332

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 74/245 (30%), Positives = 102/245 (41%), Gaps = 67/245 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W    K GLVTGG + S  GCQP    PC    Y  +   C+    P  K 
Sbjct: 152 CHGGYPIKAWERFQKHGLVTGGDYDSGEGCQPYRVSPCPLDEYGNNT--CR--GKPAEKN 207

Query: 62  HTRCTNDNYGR---GFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
           H RCT   YG     F +D +  +  Y++   +  IQ+++M  GP+ A         SY 
Sbjct: 208 H-RCTRMCYGNQDLDFKKDHHFTRDAYYLTFGI--IQRDVMAYGPIEA---------SYD 255

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
                         +Y D  SYKSGVY  + +A  +    VK++GWGEE G PY      
Sbjct: 256 --------------VYDDFPSYKSGVYVRTENATYLGGHAVKLIGWGEEYGVPY------ 295

Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                       W +V+++ +Q+GDKG  KI RG NE  I++  
Sbjct: 296 ----------------------------WLMVNSWNDQWGDKGLFKIRRGTNECGIDNST 327

Query: 239 NGALP 243
            G +P
Sbjct: 328 TGGVP 332


>gi|76576341|gb|ABA53864.1| cathepsin B-like cysteine protease 2 [Parelaphostrongylus tenuis]
          Length = 344

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 69/237 (29%), Positives = 91/237 (38%), Gaps = 60/237 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   S W +  + G+VTGG + +   C+P   PPC H    T    C  +A   P C
Sbjct: 163 CDGGYPISAWEYFVETGVVTGGLYGTKDSCRPYEIPPCGHHRNETFYGNCTQIAD-TPDC 221

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
            T C    Y   +  DK   K  Y +   V  IQ+EIM  GPV A   +Y          
Sbjct: 222 VTTC-QAGYPISYDDDKTFGKDSYTIESSVTAIQKEIMTYGPVTAAFIVYE--------- 271

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                         D F Y  G+Y              K V  GEE G            
Sbjct: 272 --------------DFFHYHRGIY--------------KHVSGGEEGGH----------- 292

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                      V+++GWGEE G  YW + +++   +G+ G  +ILRG NE  IE  V
Sbjct: 293 ----------AVRILGWGEEKGTAYWLVANSWNTDWGENGYFRILRGSNECGIEENV 339


>gi|300176937|emb|CBK25506.2| unnamed protein product [Blastocystis hominis]
          Length = 320

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 70/243 (28%), Positives = 101/243 (41%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S  W W H  G+ TGG + S   C    FP C+H +     P C     P P+C
Sbjct: 138 CNGGWPSMAWSWFHSTGVTTGGEYGSKDWCNAYEFPKCDH-HVEGKYPPCGE-TQPTPEC 195

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C  + Y   + +DK+ F   Y V   V  I+ E+M NGP+  +  +Y D  +YKSG 
Sbjct: 196 VEKC-QEGYPVEYKKDKHFFGEAYHVPSNVEAIKTELMTNGPIEVDFSVYEDFMTYKSGI 254

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +   VA  YL         G +A            VK+VGWG E+G            
Sbjct: 255 YQH---VAGKYL---------GGHA------------VKLVGWGVEDG------------ 278

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW I +++ E +G+ G  +I+ G+NE  IES     
Sbjct: 279 ----------------------VEYWKIANSWNEDWGENGYFRIIAGKNECGIESDGVAG 316

Query: 242 LPK 244
           +P+
Sbjct: 317 IPE 319


>gi|118429531|gb|ABK91813.1| cathepsin B-like cysteine proteinase precursor [Clonorchis
           sinensis]
 gi|358331549|dbj|GAA37857.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 343

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 71/242 (29%), Positives = 97/242 (40%), Gaps = 61/242 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           CS G  +  W +    G+VTGG+    +GC+   FP C H +     P C     P P+C
Sbjct: 155 CSGGYPAVAWDYWGAHGIVTGGSKEDPSGCRSYPFPKCEH-HVQGHYPPCPHQYYPTPEC 213

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C  D  G  + +DK R    Y +      I +EIM  GPV A       +F+     
Sbjct: 214 VQHC--DTPGIDYVKDKTRANMSYNIYSSEILIMKEIMLRGPVEA-------VFT----- 259

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                      +Y D   YK GVY  S  A +  +A ++I+GWGEE   PY         
Sbjct: 260 -----------VYEDFLQYKFGVYFHSWGAPLSEHA-IRILGWGEEGDVPY--------- 298

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                    W I +++ E +G+KG +K LRG NE  IE  V   
Sbjct: 299 -------------------------WLIANSWNEDWGEKGYMKFLRGLNECGIEDDVTAG 333

Query: 242 LP 243
           LP
Sbjct: 334 LP 335


>gi|226466652|emb|CAX69461.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
           japonicum]
          Length = 340

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 65/243 (26%), Positives = 101/243 (41%), Gaps = 59/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G      V+    G+VTGG++   +GCQP   P C++ +  +   +C       P+C
Sbjct: 156 CFHGSEVEVLVYWITYGIVTGGSYEDQSGCQPYPLPKCSY-HPESRFLDCNNNTFEFPQC 214

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C  D Y + +  DK+  +R Y V     DIQ+EI+ NGPV+A+              
Sbjct: 215 TNEC-QDGYNKTYDDDKFYGERIYNVYGTQEDIQKEILMNGPVIAS-------------- 259

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                    + + +D   YKSGVY  +  +  + + T++I+GWG E   P          
Sbjct: 260 ---------ISVNTDFLVYKSGVYLPTPRSRNLGWITLRIIGWGYEGKIP---------- 300

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW   +++ E++GD G +KI RG     IES V   
Sbjct: 301 ------------------------YWLCANSWNEEWGDNGYVKIQRGVQAGYIESYVRAP 336

Query: 242 LPK 244
           +PK
Sbjct: 337 IPK 339


>gi|204022090|dbj|BAG71142.1| cathepsin B-N3 [Tuberaphis styraci]
          Length = 334

 Score = 91.3 bits (225), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 73/245 (29%), Positives = 100/245 (40%), Gaps = 67/245 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           CS G     W    K GLVTGG + S  GCQP   PPC    Y  +    K    P  K 
Sbjct: 154 CSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYKVPPCPLDEYGNNTCSGK----PAEKN 209

Query: 62  HTRCTNDNYGR---GFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
           H RCT   YG     F +D +  +  Y++      IQ                +D+ +Y 
Sbjct: 210 H-RCTQMCYGNQNLDFKEDHHYTRDAYYLT--YGTIQ----------------NDVLAY- 249

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
                 GP+ A+  +Y D  SYKSGVY    +A  +    VK++GWGEE G PY      
Sbjct: 250 ------GPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWGEEYGVPY------ 297

Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                       W +V+++ +Q+GD+G  KI RG NE   ++  
Sbjct: 298 ----------------------------WLLVNSWNDQWGDQGLFKIRRGTNECGTDNST 329

Query: 239 NGALP 243
            G +P
Sbjct: 330 TGGVP 334


>gi|357613937|gb|EHJ68797.1| cathepsin B-like cysteine proteinase [Danaus plexippus]
          Length = 334

 Score = 91.3 bits (225), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 70/242 (28%), Positives = 94/242 (38%), Gaps = 61/242 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ GI S  W +    G+V+GG ++S+ GC P   PPC H       P C    T  PKC
Sbjct: 151 CNGGIPSFAWEYWKHFGIVSGGNYNSSQGCLPYEIPPCEHHVPGNRIP-CNG-ETSTPKC 208

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           H  C  + Y   +  DK   K  Y V      I+ EI KNGPV     +Y+D+ +YKSG 
Sbjct: 209 HRSCRKE-YTNSYKSDKKYGKHVYSVGGGEEHIKAEIFKNGPVEGAFTVYADLLTYKSGV 267

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                           E +    +KI+GWG ENG  Y         
Sbjct: 268 YKH------------------------TEGEALGGHAIKIMGWGVENGNKY--------- 294

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                    W I +++   +GD G  KILRG +   IES +   
Sbjct: 295 -------------------------WLIANSWNSDWGDNGFFKILRGEDHCGIESSIVAG 329

Query: 242 LP 243
            P
Sbjct: 330 EP 331


>gi|157167368|ref|XP_001653891.1| cathepsin b [Aedes aegypti]
 gi|108874250|gb|EAT38475.1| AAEL009642-PA [Aedes aegypti]
          Length = 332

 Score = 91.3 bits (225), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 73/241 (30%), Positives = 104/241 (43%), Gaps = 69/241 (28%)

Query: 4   SGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHT 63
            G S   WV V   GLV+G A++S  GC+P  F PC +  +    PE KT     P C  
Sbjct: 160 DGTSFQYWVDV---GLVSGAAYNSTDGCKPYPFKPCLYP-FVGCHPE-KT-----PSCTH 209

Query: 64  RCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYG 123
            CT + Y   + +DKY     Y + ++   IQ EIM NGPV +   +Y D++ YK+G   
Sbjct: 210 HCT-EGYDGTYRRDKYYGSAAYKLPNDERMIQLEIMTNGPVESGFSVYQDLYLYKTG--- 265

Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
                    +Y  +   + G +A            V+++GWG+E G PY           
Sbjct: 266 ---------VYQHVVGREVGKHA------------VRLIGWGKERGVPY----------- 293

Query: 184 SAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
                                  W I +++GE +G+ G  K LRG N   IES+V   LP
Sbjct: 294 -----------------------WLIANSYGEDWGEHGYFKFLRGSNHLGIESVVIAGLP 330

Query: 244 K 244
           K
Sbjct: 331 K 331


>gi|300835056|gb|ADK37857.1| putative cathepsin precursor [Sitobion avenae]
          Length = 340

 Score = 91.3 bits (225), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 72/248 (29%), Positives = 101/248 (40%), Gaps = 67/248 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +  K GLVTGG + S  GC+P   PPC   +   +         P  K 
Sbjct: 157 CHGGYPIKAWKYFSKHGLVTGGNYKSGEGCEPYRVPPCPRDDKGNN----TCAGKPIEKN 212

Query: 62  HTRCTNDNYGRGF--FQDKYRFKR-YYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
           H RCT   YG     + D +RF R +Y++      IQ+++M  GP+ A+  +Y D     
Sbjct: 213 H-RCTRMCYGDQDLDYNDDHRFTRDFYYLT--YGSIQKDVMTYGPIEASFDVYDDF---- 265

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
                  P            SYKSGVY  + +A  +    VK++GWG E G PY      
Sbjct: 266 -------P------------SYKSGVYEKTENASYLGGHAVKLIGWGVEEGTPY------ 300

Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                       W +V+++  Q+GDKG  KI RG NE  I++  
Sbjct: 301 ----------------------------WLMVNSWNAQWGDKGLFKIRRGTNECGIDNST 332

Query: 239 NGALPKDN 246
              +P  N
Sbjct: 333 TAGVPVTN 340


>gi|227293|prf||1701299A cathepsin B
          Length = 339

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 69/251 (27%), Positives = 105/251 (41%), Gaps = 62/251 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S  W +  K+GLV+GG + S+ GC P + PPC H +   S P C      + +C
Sbjct: 150 CNGGYPSGAWNFWTKKGLVSGGYYDSHIGCLPYTIPPCEH-HVNGSRPPCTGEGDTR-RC 207

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +  C    Y   + +DK+     Y V++ V  I  EI KNGPV     ++SD  +YKSG 
Sbjct: 208 NKSC-EAGYSPSYKEDKHFGYTSYSVSNSVKKIMAEIYKNGPVEGAFTVFSDFLTYKSG- 265

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          ++ +++G        +++    ++I+ WG ENG PYW     + +
Sbjct: 266 ---------------VYKHEAG--------DMMGGHAIRILVWGVENGVPYWAAANSWNL 302

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                              +GD G  KILRG N   IES +   
Sbjct: 303 ----------------------------------DWGDNGFFKILRGENHCGIESEIVAG 328

Query: 242 LPK-DNYGVEF 251
           +P+ D Y   F
Sbjct: 329 IPRTDQYWGRF 339


>gi|294952611|ref|XP_002787376.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239902348|gb|EER19172.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 203

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 68/237 (28%), Positives = 100/237 (42%), Gaps = 66/237 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTG------GAHHSNTGCQPVSFPPCNHANYTTSE-PECKTL 54
           C+ G       ++   G+VTG      G      GC P  F  CNH     SE P+CK +
Sbjct: 12  CNGGTFVEAMSFLEDYGVVTGNDFKPQGQLSEADGCWPYPFQKCNHVPTENSEYPKCKDV 71

Query: 55  A-TPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSD 113
           A  P P C T CTN  Y +   +D +R K +  V ++   I+QEI  NGPV +   +Y D
Sbjct: 72  AHQPLPPCRTTCTNKAYKKSLKKDVHRAKSWRKVFNDAQSIKQEIFDNGPVFSAFKMYED 131

Query: 114 IFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYW 173
            F Y                      YKSGVY V  + E++++  VKI+GW         
Sbjct: 132 -FRY----------------------YKSGVY-VPTTKEVLSFHLVKIIGW--------- 158

Query: 174 TIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 230
                                    G ++ + YW  ++++ E++GD G IK+  G+N
Sbjct: 159 -------------------------GADSVQEYWLAMNSWNEEWGDHGLIKMAFGKN 190


>gi|729283|sp|Q06544.1|CYSP3_OSTOS RecName: Full=Cathepsin B-like cysteine proteinase 3
 gi|159952|gb|AAA29436.1| cathepsin B-like cysteine protease, partial [Ostertagia ostertagi]
          Length = 174

 Score = 90.9 bits (224), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 64/229 (27%), Positives = 92/229 (40%), Gaps = 60/229 (26%)

Query: 10  TWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDN 69
            W +    G+VTGG +     C+P  FPPC          EC   A   PKC   C    
Sbjct: 1   AWQYFALEGVVTGGNYRKQGCCRPYEFPPCGRHGKEPYYGECYDTAK-TPKCQKTCQR-G 58

Query: 70  YGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVA 129
           Y + + +DK+  K  Y + + V  IQ++IMKNGPVVA   +Y D   YKSG Y +     
Sbjct: 59  YLKAYKEDKHFGKSAYRLPNNVKAIQRDIMKNGPVVAGFIVYEDFAHYKSGIYKH----- 113

Query: 130 NMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVA 189
                               +  +     VKI+GWG+E G P                  
Sbjct: 114 -------------------TAGRMTGGHAVKIIGWGKEKGTP------------------ 136

Query: 190 YATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                           YW I +++ + +G+KG  +++RG N   IE +V
Sbjct: 137 ----------------YWLIANSWHDDWGEKGFYRMIRGINNCRIEEMV 169


>gi|321461662|gb|EFX72692.1| hypothetical protein DAPPUDRAFT_308155 [Daphnia pulex]
          Length = 379

 Score = 90.9 bits (224), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 69/244 (28%), Positives = 96/244 (39%), Gaps = 60/244 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTS-EPECKTLATPQPK 60
           C  G     W+   K G+VTGG++ S+ GCQ   F PC       S + +C        +
Sbjct: 182 CKGGFPGGAWMHWSKHGIVTGGSYSSDYGCQKYQFFPCYQPRTKGSIKNKCPKTDNTLLE 241

Query: 61  CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           C   C   +Y + + QD Y  +  Y + ++   IQ EIM+NGPV AN             
Sbjct: 242 CRETCRT-SYNKSYKQDLYYGESVYRIPNDARAIQLEIMENGPVQAN------------- 287

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
                     + +Y D   YK GVY      + + Y  VKI GWG E G PYW     ++
Sbjct: 288 ----------LRIYEDFLHYKFGVYR-HVHGQGLEYHAVKIFGWGTEGGTPYWLAANPWS 336

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                                             +++G+ G  KILRG N A IE  V  
Sbjct: 337 ----------------------------------KRWGNGGFFKILRGSNHAEIEDHVMA 362

Query: 241 ALPK 244
            +PK
Sbjct: 363 GIPK 366


>gi|341900875|gb|EGT56810.1| hypothetical protein CAEBREN_32632 [Caenorhabditis brenneri]
          Length = 287

 Score = 90.9 bits (224), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 59/244 (24%), Positives = 95/244 (38%), Gaps = 59/244 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G +   W +  K GL TGG++ S  GC+P S  PC       + P C     P P C
Sbjct: 100 CGGGNAFKAWQYWGKHGLPTGGSYESQFGCKPYSIAPCGKTVGNVTYPACTNTTLPTPSC 159

Query: 62  HTRCTNDN-YGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
             +CT+ N Y     +D++       + +   +IQ ++M NGP+     +Y D   Y +G
Sbjct: 160 EKKCTSKNGYPVDIDKDRHYGASVDQLPNRQIEIQSDVMLNGPIETTFEVYDDFLQYTTG 219

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
            Y                        V  +     + +V+I+GWG   G P         
Sbjct: 220 IY------------------------VHLTGNKQGHLSVRILGWGMYEGVP--------- 246

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                                    YW + +++G+++G+ GT + LRG NE  +E+    
Sbjct: 247 -------------------------YWLLANSWGKEWGENGTFRALRGTNECGLEANCVS 281

Query: 241 ALPK 244
            +PK
Sbjct: 282 GMPK 285


>gi|157058745|gb|ABV03130.1| cathepsin B-2744 [Sitobion avenae]
          Length = 260

 Score = 90.9 bits (224), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 59/177 (33%), Positives = 77/177 (43%), Gaps = 32/177 (18%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK- 60
           C  G +   W     +G+VTGG   SN GCQP    PCNH     +   C +L   Q   
Sbjct: 96  CDGGSAFKAWELTMSKGIVTGGNFDSNEGCQPYKIRPCNHYG-NGNLKNCSSLRRTQMTV 154

Query: 61  CHTRCTNDNYGRGFFQDKYRFKRYY---WVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
           C  +C N NY   +  D ++    Y   W N  V  IQQEIM  GPV A MY+Y +   Y
Sbjct: 155 CREKCVNKNYKVKYEDDLHKTSIVYMTSWTN--VKQIQQEIMTYGPVTAFMYVYENFMGY 212

Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWG-EENGRPYW 173
           K G Y                         S + E++ Y  VK++GWG + +G  YW
Sbjct: 213 KEGIYK------------------------STAGELIGYHHVKLIGWGVDGDGTEYW 245


>gi|984960|gb|AAC46878.1| cathepsin B proteinase, partial [Ancylostoma caninum]
          Length = 340

 Score = 90.9 bits (224), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 68/232 (29%), Positives = 98/232 (42%), Gaps = 62/232 (26%)

Query: 13  WVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPKCHTRCTNDNYG 71
           W+ +  +VTGG +     C+P +F PC NH N     P C     P PKC   C    Y 
Sbjct: 167 WMQRSVVVTGGKYRQKDVCKPYAFYPCGNHTNERYYGP-CPRGLWPTPKCRKACQR-KYN 224

Query: 72  RGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANM 131
           + + +DKY   R Y++      I++EI                       Y NGPVVA  
Sbjct: 225 KSYNEDKYFATRSYYLPSNERSIREEI-----------------------YKNGPVVAAF 261

Query: 132 YLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYA 191
            +Y D   Y+ G+Y               +  WG + G                   A+A
Sbjct: 262 KVYQDFSYYRGGIY---------------VHKWGGQTG-------------------AHA 287

Query: 192 TVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE-SLVNGAL 242
            VK++GWG ENG  YW I +++   +G+ G  +I RG NE  IE  +V+G +
Sbjct: 288 -VKVVGWGRENGTDYWLIANSWNTDWGENGYFRIARGSNECGIEGQMVSGVM 338


>gi|339242313|ref|XP_003377082.1| Gut-specific cysteine proteinase [Trichinella spiralis]
 gi|316974149|gb|EFV57673.1| Gut-specific cysteine proteinase [Trichinella spiralis]
          Length = 517

 Score = 90.9 bits (224), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 70/244 (28%), Positives = 100/244 (40%), Gaps = 69/244 (28%)

Query: 1   VCSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 60
           + + G+  S + +  K G+ TGG +   + CQP S  PC+  +YT S P CK        
Sbjct: 338 ILACGMIPSPFNYWKKMGIATGGPYGDKSCCQPYSIAPCSKCSYTASTPSCK-------- 389

Query: 61  CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
               C  D Y      DK+    +Y V+    +I  EI  +GPVVA   +Y D F+Y   
Sbjct: 390 --YDCQAD-YDIPISDDKFYASEHYHVSSNQYEIMNEIYTHGPVVAGFIVYED-FTY--- 442

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
                              Y SG+Y  +    +  +A ++I+GWGEENG PY        
Sbjct: 443 -------------------YISGIYQQTTYVAMGGHA-IRIIGWGEENGIPY-------- 474

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                                     W I +++   FG+KG  +I RG NE  IES V  
Sbjct: 475 --------------------------WLIANSWNTTFGEKGFFRIRRGTNECRIESEVYT 508

Query: 241 ALPK 244
            +PK
Sbjct: 509 GIPK 512



 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 46/124 (37%), Positives = 61/124 (49%), Gaps = 11/124 (8%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C SG   + +++  + GLVTGG +     C P S  PC     T   P    LA   PKC
Sbjct: 70  CRSGKIEAAFIYWQRSGLVTGGPYGEKACCLPYSISPC-----TMCRP--YMLA---PKC 119

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C   +Y     +DKY  K +Y+VN +  DI QEI + GPVVA   +Y D   Y SG+
Sbjct: 120 QRTC-QASYNLSLKRDKYYGKSHYYVNQDEFDIMQEIYQRGPVVAGFKVYHDFLYYISGQ 178

Query: 122 YGNG 125
           +  G
Sbjct: 179 FICG 182


>gi|294883442|ref|XP_002770942.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
 gi|239874068|gb|EER02758.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
          Length = 393

 Score = 90.9 bits (224), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 57/181 (31%), Positives = 86/181 (47%), Gaps = 29/181 (16%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   S W W  + G+V+      ++GC P +FP C+H   T     CK   +P P C
Sbjct: 202 CDGGQPDSAWRWFSEHGVVS----ELDSGCWPYNFPECSHHVETKGMEPCKG-NSPSPVC 256

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
            T C N ++   F  D++  +   +  DEV +I++EI+ NGPV A   +Y D        
Sbjct: 257 STTCRNHHFKPSFESDRHFTEDEGYSLDEVDEIKKEIIDNGPVAAAFTVYED-------- 308

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                     +LY     YKSGVY     +E+  +A VKI+GWG +    YW ++  + V
Sbjct: 309 ----------FLY-----YKSGVYKHVNGSELGGHA-VKIIGWGTDQNEQYWLVMNSWNV 352

Query: 182 S 182
           +
Sbjct: 353 N 353


>gi|56462338|gb|AAV91452.1| cysteine peptidase 2 cathepsin-B-like [Lonomia obliqua]
          Length = 338

 Score = 90.9 bits (224), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 71/237 (29%), Positives = 100/237 (42%), Gaps = 61/237 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G+ +  W +    G+V+GG+++S  GC P   PPC H       P C    T  PKC
Sbjct: 153 CNGGMPTLAWEYWKHAGIVSGGSYNSTQGCIPYEVPPCEHHVPGNRLP-CNG-DTKTPKC 210

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   F +DK+  K  Y V+    +I+ E+ KNGPV               G 
Sbjct: 211 QKTC-EAGYNVPFKKDKHYGKHVYSVSGNEDNIKAELFKNGPV--------------EGA 255

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           +          +YSD+ SYKSGVY  +  + +  +A VKI+GWG ENG  Y         
Sbjct: 256 F---------TVYSDLLSYKSGVYQHTDGSALGGHA-VKILGWGVENGSKY--------- 296

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                    W I +++   +GD G  KILRG +   IES +
Sbjct: 297 -------------------------WLIANSWNSDWGDNGFFKILRGEDHCGIESSI 328


>gi|51947600|gb|AAU14266.1| cathepsin B-N [Myzus persicae]
          Length = 338

 Score = 90.9 bits (224), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 71/248 (28%), Positives = 104/248 (41%), Gaps = 67/248 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G     W    K+GLVTGG + S  GC+P   PPC + +   +     T A    + 
Sbjct: 155 CNGGYPIKAWKRFSKKGLVTGGDYKSGEGCEPYRVPPCPNDDQGNN-----TCAGKPMES 209

Query: 62  HTRCTNDNYGRGF--FQDKYRFKR-YYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
           + RCT   YG     F + +R+ R YY++      IQ+++M  GP+ A+  +Y D     
Sbjct: 210 NHRCTRMCYGDQDLDFDEDHRYTRDYYYLT--YGSIQKDVMTYGPIEASFDVYDDF---- 263

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
                  P            SYKSGVY  S +A  +    VK++GWGEE G PY      
Sbjct: 264 -------P------------SYKSGVYVKSENASYLGGHAVKLIGWGEEYGVPY------ 298

Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                       W +V+++ E +GD G  KI RG NE  +++  
Sbjct: 299 ----------------------------WLMVNSWNEDWGDHGFFKIQRGTNECGVDNST 330

Query: 239 NGALPKDN 246
              +P  N
Sbjct: 331 TAGVPVTN 338


>gi|183988832|gb|ACC66065.1| cathepsin B [Antheraea assama]
          Length = 287

 Score = 90.5 bits (223), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 70/235 (29%), Positives = 99/235 (42%), Gaps = 61/235 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G+ +  W +    GLV+GG ++S+ GC+P   PPC H       P C    T  PKC
Sbjct: 113 CNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMP-CNG-DTKTPKC 170

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C + +Y   F +DK   K  Y V+    +I+ E+ KNGPV     +YS         
Sbjct: 171 EKTCES-SYTVPFKKDKRYGKHVYSVSGHEDNIKAELFKNGPVEGAFTVYS--------- 220

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                         D+ SYKSGVY  +    +  +A +KI+GWG ENG  Y         
Sbjct: 221 --------------DLLSYKSGVYQHTHGNALGGHA-IKILGWGVENGSKY--------- 256

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
                                    W I +++   +GD G +KILRG +   IES
Sbjct: 257 -------------------------WLIANSWNSDWGDNGFLKILRGEDHCGIES 286


>gi|204022077|dbj|BAG71136.1| cathepsin B-S1 [Tuberaphis sumatrana]
 gi|204022079|dbj|BAG71137.1| cathepsin B-S2 [Tuberaphis sumatrana]
          Length = 334

 Score = 90.5 bits (223), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 67/242 (27%), Positives = 103/242 (42%), Gaps = 64/242 (26%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +   +G+ TGG + +  GC+P    PC +     +         P  + 
Sbjct: 154 CEGGYPIKAWRYFRTQGVTTGGDYDTKEGCKPYKVAPCYNKQGKNT-----CGGKPMERN 208

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           H +C    YG+   Q +Y+ K  Y +N  +  I+Q                DI +Y    
Sbjct: 209 H-QCPKTCYGKTTDQKRYKTKSEYVINS-IKTIEQ----------------DIKTY---- 246

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
              GPV A+  +Y D   YKSG+Y  + +A+              +NG            
Sbjct: 247 ---GPVEASFDVYDDFSVYKSGIYRKTPNAKY-------------QNGH----------- 279

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                     +VK+IGWG+ENG PYW  V+++ + +GD GT KI++G+NE  IE  V   
Sbjct: 280 ----------SVKIIGWGQENGTPYWLAVNSWSKFWGDHGTFKIIKGKNECGIERAVTAG 329

Query: 242 LP 243
           +P
Sbjct: 330 IP 331


>gi|7537454|gb|AAF35867.2| cathepsin B-like cysteine proteinase [Helicoverpa armigera]
          Length = 338

 Score = 90.5 bits (223), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 69/245 (28%), Positives = 102/245 (41%), Gaps = 67/245 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNH---ANYTTSEPECKTLATPQ 58
           C+ G+ +  W +    GLV+GG+++S+ GC+P   PPC H    N      + KT     
Sbjct: 153 CNGGMPTLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCNGDSKT----- 207

Query: 59  PKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
           PKC   C + NY   + +DK   K  + V+ +   I+ E+ KNGPV     +YSD     
Sbjct: 208 PKCEKTCES-NYNVDYRKDKRYGKHVFSVSSKEDHIRAELFKNGPVEGAFTVYSD----- 261

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
                             + +YK+GVY  +    +  +A VKI+GWG ENG  Y      
Sbjct: 262 ------------------LLNYKTGVYKHTIGDALGGHA-VKILGWGVENGNKY------ 296

Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                       W I +++   +GD G  KILRG +   IES +
Sbjct: 297 ----------------------------WLIANSWNSDWGDNGFFKILRGEDHCGIESSI 328

Query: 239 NGALP 243
               P
Sbjct: 329 VAGEP 333


>gi|119887749|gb|ABM05925.1| cathepsin B-like cysteine proteinase [Helicoverpa assulta]
          Length = 338

 Score = 90.1 bits (222), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 69/245 (28%), Positives = 102/245 (41%), Gaps = 67/245 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNH---ANYTTSEPECKTLATPQ 58
           C+ G+ +  W +    GLV+GG+++S+ GC+P   PPC H    N      + KT     
Sbjct: 153 CNGGMPTLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCNGDSKT----- 207

Query: 59  PKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
           PKC   C + NY   + +DK   K  + V+ +   I+ E+ KNGPV     +YSD     
Sbjct: 208 PKCEKTCES-NYNVDYRKDKRYGKHVFSVSSKEDHIRAELFKNGPVEGAFTVYSD----- 261

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
                             + +YK+GVY  +    +  +A VKI+GWG ENG  Y      
Sbjct: 262 ------------------LLNYKTGVYKHTIGDALGGHA-VKILGWGVENGNKY------ 296

Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                       W I +++   +GD G  KILRG +   IES +
Sbjct: 297 ----------------------------WLIANSWNSDWGDNGFFKILRGEDHCGIESSI 328

Query: 239 NGALP 243
               P
Sbjct: 329 VAGEP 333


>gi|132566367|gb|ABO34080.1| cathepsin B5 [Clonorchis sinensis]
          Length = 343

 Score = 90.1 bits (222), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 69/242 (28%), Positives = 96/242 (39%), Gaps = 61/242 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G  +  W +    G+VTGG+    +GC+   FP C H +     P C     P P+C
Sbjct: 155 CRGGYPAVAWDYWKTHGIVTGGSKEDPSGCRSYPFPKCEH-HVQGHYPPCPRELYPTPEC 213

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C  D    G+ +DK R    Y +      I +EIM  GPV A       IF+     
Sbjct: 214 VQQC--DTPDVGYLEDKTRANMSYNIYASEISIMKEIMLRGPVEA-------IFT----- 259

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                      +Y D   Y SGVY  +  A +  +A V+I+GWGE    PY         
Sbjct: 260 -----------MYEDFLRYSSGVYFHALGAPMSGHA-VRILGWGELGNVPY--------- 298

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                    W I +++ E +G++G +K LRG NE  IE  V   
Sbjct: 299 -------------------------WLIANSWNEDWGEEGYMKFLRGYNECGIEDDVTAG 333

Query: 242 LP 243
           LP
Sbjct: 334 LP 335


>gi|268572243|ref|XP_002648913.1| Hypothetical protein CBG17826 [Caenorhabditis briggsae]
          Length = 323

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 70/243 (28%), Positives = 96/243 (39%), Gaps = 71/243 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     + W + RG+VTGG     +GC+P  F PC       S PE KT     P C
Sbjct: 151 CKGGYPIQAFRWWNSRGVVTGG-DFRGSGCRPYPFAPC------ISCPEEKT-----PTC 198

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + +DK      Y V   VA IQ EIM NGPVV    +Y D++ YKSG 
Sbjct: 199 SLSC-QFGYSTAYAKDKRFGVSAYAVARNVAAIQTEIMTNGPVVGAFTMYEDMYKYKSGV 257

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         +  ++    +KI+GWG +NG PY         
Sbjct: 258 YRH------------------------TAGRLLGGHAIKIIGWGTQNGIPY--------- 284

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                    W I +++G  +G+ G +K+ RG NE  IE  V   
Sbjct: 285 -------------------------WLIANSWGANWGENGFLKMRRGVNECGIERAVVAG 319

Query: 242 LPK 244
           +P+
Sbjct: 320 MPR 322


>gi|239938584|gb|ACS36091.1| cysteine proteinase [Haemonchus contortus]
          Length = 346

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 61/159 (38%), Positives = 75/159 (47%), Gaps = 26/159 (16%)

Query: 18  GLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQD 77
           G VTGG + S  GC+P  F PC H    T   EC   A   PKC  RC   +Y + ++ D
Sbjct: 179 GAVTGGDYGSKDGCRPYPFHPCGHHGNDTYYGECPKGAK-TPKCRRRCQR-SYKKAYYMD 236

Query: 78  KYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDI 137
           K   +  Y V   V  IQ+EIMKNGPVV    +Y D FSY                    
Sbjct: 237 KSYGEDAYEVPHSVKAIQREIMKNGPVVGAFTVYED-FSY-------------------- 275

Query: 138 FSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
             YK G+Y  +A      +A +KI+GWG EN  PYW I 
Sbjct: 276 --YKKGIYKHTAGQARGGHA-IKIIGWGVENDVPYWLIA 311


>gi|239938582|gb|ACS36090.1| cysteine proteinase [Haemonchus contortus]
          Length = 346

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 61/158 (38%), Positives = 75/158 (47%), Gaps = 26/158 (16%)

Query: 18  GLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQD 77
           G VTGG + S  GC+P  F PC H    T   EC   A   PKC  RC   +Y + ++ D
Sbjct: 179 GAVTGGDYGSKDGCRPYPFHPCGHHGNDTYYGECPKGAK-TPKCRRRCQR-SYKKAYYMD 236

Query: 78  KYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDI 137
           K   +  Y V   V  IQ+EIMKNGPVV    +Y D FSY                    
Sbjct: 237 KSYGEDAYEVPHSVKAIQREIMKNGPVVGAFTVYED-FSY-------------------- 275

Query: 138 FSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
             YK G+Y  +A      +A +KI+GWG EN  PYW I
Sbjct: 276 --YKKGIYKHTAGQARGGHA-IKIIGWGVENDVPYWLI 310


>gi|268557292|ref|XP_002636635.1| C. briggsae CBR-CPR-1 protein [Caenorhabditis briggsae]
          Length = 330

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 70/237 (29%), Positives = 93/237 (39%), Gaps = 69/237 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G       W   +G+VTGG +H   GC+P    PC   N     PE KT     P C
Sbjct: 156 CEGGYPIQALRWWDSKGVVTGGDYH-GAGCKPYPIAPCTSGNC----PESKT-----PAC 205

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + +DK+     Y V   VA IQ EIM NGPV A   +Y D        
Sbjct: 206 SLSC-QSGYSTAYAKDKHFGASAYAVARSVAAIQTEIMTNGPVEAAFTVYED-------- 256

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                           + YKSGVY  +A   +  +A +KI+GWG E+G P          
Sbjct: 257 ---------------FYKYKSGVYKHTAGKALGGHA-IKIIGWGTESGSP---------- 290

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                   YW + +++G  +G+ G  KILRG ++  IE  V
Sbjct: 291 ------------------------YWLVANSWGTNWGESGFFKILRGDDQCGIEGAV 323


>gi|187104114|ref|NP_001119617.1| cathepsin B-16A precursor [Acyrthosiphon pisum]
 gi|161343835|tpg|DAA06098.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 340

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 71/248 (28%), Positives = 100/248 (40%), Gaps = 67/248 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G     W +    G+VTGG + S  GC+P   PPC        E +      P  K 
Sbjct: 157 CNGGYPIKAWKYFSSHGIVTGGNYKSGEGCEPYRVPPCPQ----DEEGKSSCAGKPIEKN 212

Query: 62  HTRCTNDNYGRGF--FQDKYRFKR-YYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
           H RCT   YG     + D +RF R YY++      IQ+++M  GP+ A+  +Y D     
Sbjct: 213 H-RCTRMCYGNQDLDYNDDHRFTRDYYYLT--YGSIQKDVMNYGPIEASFDVYDDF---- 265

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
                  P            SYKSGVY  + +A  +    VK++GWG E G PY      
Sbjct: 266 -------P------------SYKSGVYQRTPNATKLGGHAVKLIGWGVEEGTPY------ 300

Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                       W +V+++  Q+GD G  KI RG +E  I+S  
Sbjct: 301 ----------------------------WLMVNSWNAQWGDNGLFKIRRGTDECGIDSAA 332

Query: 239 NGALPKDN 246
              +P  N
Sbjct: 333 TAGVPVTN 340


>gi|54289256|gb|AAV31918.1| putative vitellogenic cathepsin B [Aedes aegypti]
          Length = 332

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 72/241 (29%), Positives = 104/241 (43%), Gaps = 69/241 (28%)

Query: 4   SGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHT 63
            G S   WV V   GLV+G A+++  GC+P  F PC +  +    PE KT     P C  
Sbjct: 160 DGTSFQYWVDV---GLVSGAAYNNTDGCKPYPFKPCLYP-FVGCHPE-KT-----PSCTH 209

Query: 64  RCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYG 123
            CT + Y   + +DKY     Y + ++   IQ EIM NGPV +   +Y D++ YK+G   
Sbjct: 210 HCT-EGYDGTYRRDKYYGSAAYKLPNDERMIQLEIMTNGPVESGFSVYQDLYLYKTG--- 265

Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
                    +Y  +   + G +A            V+++GWG+E G PY           
Sbjct: 266 ---------VYQHVVGREVGKHA------------VRLIGWGKERGVPY----------- 293

Query: 184 SAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
                                  W I +++GE +G+ G  K LRG N   IES+V   LP
Sbjct: 294 -----------------------WLIANSYGEDWGEHGYFKFLRGSNHLGIESVVIAGLP 330

Query: 244 K 244
           K
Sbjct: 331 K 331


>gi|10803452|emb|CAB97365.2| putative cathepsin B.2 [Ostertagia ostertagi]
          Length = 194

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 58/175 (33%), Positives = 77/175 (44%), Gaps = 26/175 (14%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +    G+VTGG +     C+P  FPPC          EC   A   PKC
Sbjct: 43  CEGGWPMKAWQYFXLEGVVTGGNYRKQGCCRPYEFPPCGRHGKEPYYGECYDSAK-TPKC 101

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y + + +DK+  K  Y + + V  IQ++IMKNGPVVA   +Y D   YKSG 
Sbjct: 102 QKTCQR-GYLKPYKEDKHFGKSAYRLPNNVKAIQRDIMKNGPVVAGFIVYEDFAHYKSG- 159

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
                          I+ + +G         +     VKI+GWG+E G PYW I 
Sbjct: 160 ---------------IYKHTAG--------RMTGGHAVKIIGWGKEXGTPYWLIA 191


>gi|323147412|gb|ADX32985.1| cathepsin B [Pinctada fucata]
          Length = 366

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 71/243 (29%), Positives = 98/243 (40%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S  W +  + GLVTGG ++S+ GCQP +   C+H      +P C       P C
Sbjct: 183 CNGGFLSGAWEYYKRDGLVTGGQYNSHQGCQPYTVKACDHHVVGKLQP-CSKKEEHTPVC 241

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C +  Y   + +DK+     Y V   V  I  EIM NGPV     +Y+         
Sbjct: 242 KHECES-GYNVSYTKDKHYGATAYSVRG-VQQIMTEIMTNGPVEGAFTVYA--------- 290

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                         D   YKSGVY  +  + +  +A +KI+GWG E G  YW +   +  
Sbjct: 291 --------------DFPQYKSGVYKHTTGSPLGGHA-IKIMGWGTEGGDDYWLVANSW-- 333

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                  P W          G++GT KILRGR+E  IES +   
Sbjct: 334 ----------------------NPDW----------GNQGTFKILRGRDECGIESQIAAG 361

Query: 242 LPK 244
            PK
Sbjct: 362 EPK 364


>gi|350535627|ref|NP_001233013.1| uncharacterized protein LOC100164982 precursor [Acyrthosiphon
           pisum]
 gi|239789514|dbj|BAH71377.1| ACYPI005957 [Acyrthosiphon pisum]
          Length = 339

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 70/248 (28%), Positives = 101/248 (40%), Gaps = 67/248 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G     W +    GLVTGG + S  GC+P   PPC        + +      P+ K 
Sbjct: 156 CNGGYPIKAWKYFSTHGLVTGGNYKSGKGCEPYRVPPCPR----NEDGKSSCAGKPKEKN 211

Query: 62  HTRCTNDNYGRGF--FQDKYRFKR-YYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
           H RCT   YG     + D +RF R +Y++      IQ+                D+ +Y 
Sbjct: 212 H-RCTRMCYGNQDLDYDDDHRFTRDFYYLT--YGSIQK----------------DVLNY- 251

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
                 GP+ A+  +Y D  SYKSGVY  + +A  +    VK++GWG E G PY      
Sbjct: 252 ------GPIEASFDVYDDFPSYKSGVYQRTPNATKLGGHAVKLIGWGVEEGTPY------ 299

Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                       W +V+++  Q+GD G  KI RG +E  I+S  
Sbjct: 300 ----------------------------WLMVNSWNAQWGDNGLFKIRRGTDECRIDSAT 331

Query: 239 NGALPKDN 246
              +P  N
Sbjct: 332 TAGVPVTN 339


>gi|170787211|gb|ACB38229.1| cathepsin B [Meretrix meretrix]
          Length = 337

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 71/242 (29%), Positives = 105/242 (43%), Gaps = 63/242 (26%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G     W ++ + G+VTGG ++S+ GC P     C+H      +P CK    P P+C
Sbjct: 155 CNGGFLEGAWNYLKRDGIVTGGPYNSHQGCLPYEIKACDHHVVGKLQP-CKGDG-PTPRC 212

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C +  Y   + +D++  K  + V + V  I  EIM NGPV A   +YSD  +YKSG 
Sbjct: 213 KKECES-GYNNTYSKDEHHAKTVHAV-EGVEQIMTEIMTNGPVEAAFTVYSDFPTYKSG- 269

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          ++ +KSG         +  +A +K +GWG E+G+ YW +   +  
Sbjct: 270 ---------------VYEHKSG-------GPLGGHA-IKTLGWGNEDGKDYWLVANSW-- 304

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES-LVNG 240
                                  P W          GD G  KILRGR+E  IES +V G
Sbjct: 305 ----------------------NPDW----------GDNGFFKILRGRDECGIESNIVAG 332

Query: 241 AL 242
            +
Sbjct: 333 MM 334


>gi|170028910|ref|XP_001842337.1| cathepsin L [Culex quinquefasciatus]
 gi|167879387|gb|EDS42770.1| cathepsin L [Culex quinquefasciatus]
          Length = 334

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 68/246 (27%), Positives = 101/246 (41%), Gaps = 67/246 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEP---ECKTLATPQ 58
           C+ G   + W +  ++GLV+GG + S+ GCQP +  PC H    T  P   E KT     
Sbjct: 152 CNGGFPGAAWSYWVRKGLVSGGPYGSDQGCQPYAISPCEHHVNGTRGPCNGEGKT----- 206

Query: 59  PKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
           PKC  +C   +Y   + +DK+  K  Y +      IQ+E+  NGPV     +Y       
Sbjct: 207 PKCVKKC-QASYNVPYAKDKFFGKSSYSIASHEQQIQKELFTNGPVEGAFTVY------- 258

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
                            D+ +YK GVY  +A   +  +A ++I+GWG E           
Sbjct: 259 ----------------EDLLNYKEGVYQHTAGKMLGGHA-IRILGWGVE----------- 290

Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                  N   +W I +++   +GD G  KILRG +   IES +
Sbjct: 291 -----------------------NDTKFWLIANSWNSDWGDNGYFKILRGSDHLGIESSI 327

Query: 239 NGALPK 244
              LPK
Sbjct: 328 AAGLPK 333


>gi|28932700|gb|AAO60044.1| midgut cysteine proteinase 1 [Rhipicephalus appendiculatus]
          Length = 332

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 66/217 (30%), Positives = 94/217 (43%), Gaps = 68/217 (31%)

Query: 30  GCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVND 89
           GCQP S PPC         P C T   P PKC   C    Y + + +DK+  K  Y +  
Sbjct: 183 GCQPYSLPPC--------VPNC-THPEPTPKCQHVC-RKGYEKSYEEDKHFAKNVYRLLK 232

Query: 90  EVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSA 149
           +   I+ +I KNGPV +  ++Y+D  SYKSG Y       +M  +        GV+A   
Sbjct: 233 KCDAIKTDIYKNGPVESAFFVYADFPSYKSGVYQQ-----HMIKF-------MGVHA--- 277

Query: 150 SAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTI 209
                    +KI+GWG E+G PYW +   + V               GW           
Sbjct: 278 ---------IKILGWGTEDGVPYWLVANSWNV---------------GW----------- 302

Query: 210 VSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDN 246
                   GDKG  KILRG++E  IE +++  +P ++
Sbjct: 303 --------GDKGYFKILRGKDECGIEEVIDAGIPMED 331


>gi|389608541|dbj|BAM17880.1| cathepsin B [Papilio xuthus]
          Length = 334

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 68/245 (27%), Positives = 99/245 (40%), Gaps = 61/245 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G+ +  W +    GLV+GG+++S+ GC+P   PPC H       P   +  T  PKC
Sbjct: 151 CNGGMPTLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRLP--CSGDTKTPKC 208

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C +  Y   + QDK+  K  Y V      I+ E+ KNGPV     +Y+D+ SYKSG 
Sbjct: 209 VKECES-GYKVPYKQDKHYGKHVYSVRGGEDHIKAELYKNGPVEGAFTVYADLLSYKSGV 267

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         + + +    +KI+GWG ENG  Y         
Sbjct: 268 YKH------------------------VTGDALGGHAIKIMGWGVENGNKY--------- 294

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                    W I +++   +GD G  KILRG +   IES +   
Sbjct: 295 -------------------------WLIANSWNSDWGDNGFFKILRGEDHCGIESSIVAG 329

Query: 242 LPKDN 246
            P  N
Sbjct: 330 EPLFN 334


>gi|294891881|ref|XP_002773785.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239878989|gb|EER05601.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 455

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 66/220 (30%), Positives = 91/220 (41%), Gaps = 66/220 (30%)

Query: 13  WVHKRGLVTGGAHH------SNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPKCHTRC 65
           ++   GLVTGG +       ++ GC P  FP CNH     S+ P C  +    P C T C
Sbjct: 231 FMKNHGLVTGGEYKPPEELGNDDGCWPYPFPKCNHVPGLESKYPRCAQVRD-LPACATTC 289

Query: 66  TNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNG 125
            N  YG    +D +R K +  +      I+QEI                       + NG
Sbjct: 290 PNKAYGTSMQKDTHRAKSWGRLPIGPEKIKQEI-----------------------FDNG 326

Query: 126 PVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASA 185
           PV A M LY D   YKSGVY                                   V  + 
Sbjct: 327 PVAAMMTLYEDFRFYKSGVY-----------------------------------VHKTG 351

Query: 186 EIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKI 225
           +++A  T+KLIGWG E+G+ YW  V+ + E++GD G IK+
Sbjct: 352 QMLAAHTLKLIGWGVESGQEYWLAVNAWNEEWGDHGMIKL 391


>gi|107921773|gb|ABF85678.1| cathepsin B1 [Fasciola hepatica]
          Length = 278

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 58/172 (33%), Positives = 76/172 (44%), Gaps = 25/172 (14%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   + W +  + G+VTGG   + TGC P  FP C H    +    C     P P C
Sbjct: 132 CQGGSPPAAWDYWWRNGIVTGGTLENPTGCLPYPFPQCRHPGSRSQLNPCPGYIYPTPSC 191

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +  C    Y + + +DK   K  Y V+     I QEIMKNGPV A   +Y+D   YKSG 
Sbjct: 192 YPYC-QAGYDKTYEEDKVYGKTSYNVDRHEYTIMQEIMKNGPVEAGFIVYTDFAVYKSG- 249

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYW 173
                          I+ + SG YA            ++I+GWG ENG  YW
Sbjct: 250 ---------------IYHHVSGRYA--------GKHAIRIIGWGVENGVNYW 278


>gi|29840882|gb|AAP05883.1| similar to GenBank Accession Number X70968 cathepsin B in
           Schistosoma japonicum [Schistosoma japonicum]
          Length = 312

 Score = 89.4 bits (220), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 45/121 (37%), Positives = 66/121 (54%), Gaps = 1/121 (0%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ GI    W +    G+VTGG++ ++TGCQP  FP C H + + +   C+      P+C
Sbjct: 161 CNGGIPGMAWDYWKDEGIVTGGSNETHTGCQPYPFPECIHHSTSINHSSCEVKYYSTPEC 220

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +  C  D Y   +  DKY  K  Y+V  +   I +EI+ NGPV A  Y+Y D  +YK+G 
Sbjct: 221 YQTCQPD-YAIQYENDKYYGKSSYYVTSDEVSIMKEILLNGPVEATFYVYDDFLNYKTGV 279

Query: 122 Y 122
           Y
Sbjct: 280 Y 280


>gi|268555420|ref|XP_002635699.1| Hypothetical protein CBG22436 [Caenorhabditis briggsae]
          Length = 317

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 72/242 (29%), Positives = 96/242 (39%), Gaps = 70/242 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C        + W +K+G+VTGG  +  +GC+P  F PC     T SE          P+C
Sbjct: 145 CKGASPLQAFRWWNKKGVVTGG-DYRGSGCKPYPFAPCTALPCTKSE---------TPRC 194

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y + + +DKY     Y V  +VA IQ EI                       
Sbjct: 195 SLNC-QPAYSKAYSKDKYFGTPAYIVGMDVAAIQTEIT---------------------- 231

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
             NGPV A   +Y D   Y+SGVY    + ++V    VKI+GWG +NG PYW +      
Sbjct: 232 --NGPVEAAFIVYDDFNHYRSGVYR-HVAGKLVGGHAVKIIGWGIQNGAPYWLMAN---- 284

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                           WG     PYW          G+ G  K+LRG +E  IES +   
Sbjct: 285 ---------------SWG-----PYW----------GENGFFKMLRGVDECGIESTIVAG 314

Query: 242 LP 243
            P
Sbjct: 315 KP 316


>gi|201023315|ref|NP_001128400.1| cathepsin B-16D2 precursor [Acyrthosiphon pisum]
          Length = 340

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 73/249 (29%), Positives = 100/249 (40%), Gaps = 69/249 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G     W    KRGLVTGG + S  GC+P   PPC +     +E        P+   
Sbjct: 157 CNGGYPIKAWERFKKRGLVTGGDYQSGEGCEPYRVPPCPY----DAEGHNTCAGKPRESN 212

Query: 62  HTRCTNDNYGRGF--FQDKYRFKR--YYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
           H RCT   YG     F + +R+ R  YY        IQ+++M  GP+ A+  +Y D    
Sbjct: 213 H-RCTRMCYGNQDLDFDEDHRYTRDSYYLT---YGSIQKDVMTYGPIEASFDVYDDF--- 265

Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
                   P            SYKSGVY  S +A  +    VK++GWGEE G PY     
Sbjct: 266 --------P------------SYKSGVYVKSENATYLGGHAVKLIGWGEEYGVPY----- 300

Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
                                        W +V+++   +GD G  KI RG NE  I++ 
Sbjct: 301 -----------------------------WLMVNSWNADWGDNGLFKIRRGTNECGIDNS 331

Query: 238 VNGALPKDN 246
               +P  N
Sbjct: 332 TTAGVPVTN 340


>gi|204022081|dbj|BAG71138.1| cathepsin B-S1 [Tuberaphis takenouchii]
          Length = 332

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 61/242 (25%), Positives = 99/242 (40%), Gaps = 64/242 (26%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +   +G+ TGG + S  GC P   PPC       +         P  + 
Sbjct: 154 CEGGYPIKAWQYFRTQGVPTGGDYDSKEGCAPYKIPPCFDQKGKNT-----CAGKPLERN 208

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           H +C    YG    Q +Y+ K  Y +N     ++Q+++K GP+ A+  L+ D        
Sbjct: 209 H-QCPKTCYGSTTVQKRYKVKNEYVLNSP-NTMEQDLIKYGPIEASFNLFDD-------- 258

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          + +YKSG+Y  +  A+ ++  ++KI+GWG+ENG PYW  V  ++ 
Sbjct: 259 ---------------LSAYKSGIYQKTPKAKFLSGHSIKIIGWGKENGVPYWLAVNSWS- 302

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   +W          G++GT +I++GRNE  IE      
Sbjct: 303 -----------------------KFW----------GEQGTFRIIKGRNECGIERSATAG 329

Query: 242 LP 243
           +P
Sbjct: 330 IP 331


>gi|308504375|ref|XP_003114371.1| CRE-CPR-1 protein [Caenorhabditis remanei]
 gi|308261756|gb|EFP05709.1| CRE-CPR-1 protein [Caenorhabditis remanei]
          Length = 366

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 69/237 (29%), Positives = 94/237 (39%), Gaps = 69/237 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G       W   +G+VTGG +H   GC+P    PC   N     PE KT     P C
Sbjct: 192 CEGGYPIQALRWWDSKGVVTGGDYH-GAGCKPYPIAPCTSGNC----PESKT-----PSC 241

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + +DK+     Y V  +VA IQ EIM NGPV A   +Y D        
Sbjct: 242 SLSC-QSGYTTAYAKDKHFGTSAYAVARKVASIQTEIMTNGPVEAAFTVYED-------- 292

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                           + YKSGVY  +A   +  +A +KI+GWG E+G P          
Sbjct: 293 ---------------FYKYKSGVYKHTAGKALGGHA-IKIIGWGTESGSP---------- 326

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                   YW + +++G  +G+ G  +I RG ++  IES V
Sbjct: 327 ------------------------YWLVANSWGNSWGESGFFRIFRGDDQCGIESAV 359


>gi|328718094|ref|XP_003246386.1| PREDICTED: cathepsin B [Acyrthosiphon pisum]
          Length = 340

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 73/249 (29%), Positives = 100/249 (40%), Gaps = 69/249 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G     W    KRGLVTGG + S  GC+P   PPC +     +E        P+   
Sbjct: 157 CNGGYPIKAWERFKKRGLVTGGDYQSGEGCEPYRVPPCPY----DAEGHNTCAGKPRESN 212

Query: 62  HTRCTNDNYGRGF--FQDKYRFKR--YYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
           H RCT   YG     F + +R+ R  YY        IQ+++M  GP+ A+  +Y D    
Sbjct: 213 H-RCTRMCYGNQDLDFDEDHRYTRDSYYLT---YGSIQKDVMTYGPIEASFDVYDDF--- 265

Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
                   P            SYKSGVY  S +A  +    VK++GWGEE G PY     
Sbjct: 266 --------P------------SYKSGVYVKSENATYLGGHAVKLIGWGEEYGVPY----- 300

Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
                                        W +V+++   +GD G  KI RG NE  I++ 
Sbjct: 301 -----------------------------WLMVNSWNADWGDNGLFKIRRGTNECGIDNS 331

Query: 238 VNGALPKDN 246
               +P  N
Sbjct: 332 TTAGVPVTN 340


>gi|312271213|gb|ADQ57304.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
          Length = 347

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 67/243 (27%), Positives = 94/243 (38%), Gaps = 59/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   + W +    G+VTG  + S +GC+P  +PPC H        +C     P   C
Sbjct: 164 CDGGDPYAAWSYWVSNGIVTGSNYTSKSGCKPYPYPPCEHHIPEHHYKKCPKDIYPTNTC 223

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C  D Y   +  DK+     Y V  +VA IQ+EIM NGPV     +Y D   Y SG 
Sbjct: 224 EYKC-QDGYSISYNSDKHYGASVYAVAQDVASIQKEIMTNGPVEVAFDVYEDFEHYSSG- 281

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ + +G Y        +    VK++GWG EN             
Sbjct: 282 ---------------IYKHTTGDY--------LGGHAVKMLGWGTEN------------- 305

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                G  YW   +++   +G+ G  +ILRG +E  IES V   
Sbjct: 306 ---------------------GTDYWICANSWNSDWGENGFFRILRGVDECQIESSVVAG 344

Query: 242 LPK 244
            PK
Sbjct: 345 EPK 347


>gi|341904369|gb|EGT60202.1| hypothetical protein CAEBREN_08101 [Caenorhabditis brenneri]
          Length = 330

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 70/237 (29%), Positives = 95/237 (40%), Gaps = 69/237 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G       W   +G+VTGG +H   GC+P    PC     + S PE KT     P C
Sbjct: 156 CEGGYPIQALRWWDSKGVVTGGDYH-GAGCKPYPIAPCT----SGSCPESKT-----PAC 205

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + +DK+     Y V  +VA IQ EIM NGPV A   +Y D        
Sbjct: 206 SLSC-QSGYTTAYAKDKHFGTSAYAVAKKVASIQTEIMTNGPVEAAFTVYED-------- 256

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                           + YKSGVY  +A   +  +A +KI+GWG E+G PY         
Sbjct: 257 ---------------FYKYKSGVYKHTAGKALGGHA-IKIIGWGTESGSPY--------- 291

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                    W + +++G  +G+ G  KI RG ++  IES V
Sbjct: 292 -------------------------WLVANSWGTSWGESGFFKIFRGDDQCGIESAV 323


>gi|294916952|ref|XP_002778399.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
 gi|239886773|gb|EER10194.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
          Length = 228

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 66/224 (29%), Positives = 93/224 (41%), Gaps = 66/224 (29%)

Query: 13  WVHKRGLVTGGAHH------SNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPKCHTRC 65
           ++   GLVTGG +       ++ GC P  FP CNH     S+ P C  +    P C T C
Sbjct: 50  FMKNHGLVTGGEYKPPEKLGNDDGCWPYPFPKCNHVPGLESKYPRCAQVRD-LPACATTC 108

Query: 66  TNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNG 125
            N  YG    +D +R K +  +      I+QEI                       + NG
Sbjct: 109 PNKAYGTSMQKDTHRAKSWGRLPIGPEKIKQEI-----------------------FDNG 145

Query: 126 PVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASA 185
           PV A M LY D   YKSGVY                                   V  + 
Sbjct: 146 PVAAMMTLYEDFRYYKSGVY-----------------------------------VHKTG 170

Query: 186 EIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 229
           +++A  T+KLIGWG E+G+ YW  ++ + E++GD G IK+  G+
Sbjct: 171 QLLAAHTLKLIGWGVESGQEYWLAMNAWNEEWGDHGMIKLAVGK 214


>gi|204022073|dbj|BAG71134.1| cathepsin B-S1 [Tuberaphis taiwana]
          Length = 334

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 63/233 (27%), Positives = 100/233 (42%), Gaps = 64/233 (27%)

Query: 11  WVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNY 70
           W +   +G+ TGG + +  GC P   PPC +      +  C     P  + H +C    Y
Sbjct: 163 WKYFRTQGVTTGGDYGTKEGCMPYKVPPCYNKQ---GKNTCG--GQPMERNH-QCPKTCY 216

Query: 71  GRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVAN 130
           G+   Q++Y+ K  Y +N  +  I+++IM  GPV A+     D+                
Sbjct: 217 GKTTVQNRYKTKSEYVINS-IKTIERDIMTYGPVEASF----DV---------------- 255

Query: 131 MYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAY 190
              Y D+ +YKSG+Y  +  A+     ++KI+GWG++NG PYW  V  ++          
Sbjct: 256 ---YDDLSAYKSGIYRKTPKAKYQGGHSIKIIGWGQQNGTPYWLAVNSWS---------- 302

Query: 191 ATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
                          +W          G+ GT KI++GRNE  IE  V   +P
Sbjct: 303 --------------KFW----------GEHGTFKIIKGRNECGIERAVTAGIP 331


>gi|325302580|dbj|BAJ83490.1| cathepsin B-like peptidase [Echinococcus multilocularis]
          Length = 351

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 63/244 (25%), Positives = 103/244 (42%), Gaps = 61/244 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S  W +    GLV+GG + +   C+    PPC H +   + P C+  A P PKC
Sbjct: 169 CNGGFPSQAWNFWKHEGLVSGGLYGTKGVCRAYEIPPCEH-HVNGTRPPCEGDA-PTPKC 226

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C  + Y   + +DK+   + Y V+     I+ E++ +GPV A+  +Y+D  +YKSG 
Sbjct: 227 KNVC-QEEYKVPYKKDKHYAVKVYSVHSNEDAIKHELITHGPVEADFEVYADFPTYKSGV 285

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         S  ++    +K++GWGEE+G P          
Sbjct: 286 YQH------------------------VSGALLGGHAIKLMGWGEEDGVP---------- 311

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW   +++   +G+ G  KILRG+N   IES +   
Sbjct: 312 ------------------------YWLCANSWNTDWGEGGFFKILRGKNHCGIESDIVAG 347

Query: 242 LPKD 245
           +P++
Sbjct: 348 IPQN 351


>gi|298370749|gb|ADI80349.1| cathepsin B [Litopenaeus vannamei]
          Length = 331

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 67/246 (27%), Positives = 101/246 (41%), Gaps = 63/246 (25%)

Query: 2   CSSGISSSTW-VWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 60
           C+ G   + +  WVH  G+V+GG+ +S  GCQP    PC H +     P+C +     PK
Sbjct: 148 CNGGFPGAAFKYWVHS-GIVSGGSFNSTQGCQPYEIAPCEH-HVPGPRPKC-SEGGGTPK 204

Query: 61  CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           C   C    Y   +  D +   + Y +  +   I+ EIMKNGPV     +Y D   YKSG
Sbjct: 205 CAKTCEK-GYIVDYESDLHHGGKAYSIMKDEDQIKYEIMKNGPVEGAFTVYVDFLHYKSG 263

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
                           ++ ++ G+         +    ++++GWGEENG P         
Sbjct: 264 ----------------VYQHRHGL--------PLGGHAIRVLGWGEENGTP--------- 290

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                                    YW   +++   +GD G  KILRG +   IES ++ 
Sbjct: 291 -------------------------YWLCANSWNTDWGDNGLFKILRGSDHCGIESEISA 325

Query: 241 ALPKDN 246
            LPK N
Sbjct: 326 GLPKLN 331


>gi|340380685|ref|XP_003388852.1| PREDICTED: cathepsin B-like [Amphimedon queenslandica]
          Length = 341

 Score = 88.6 bits (218), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 70/247 (28%), Positives = 103/247 (41%), Gaps = 65/247 (26%)

Query: 2   CSSGISSSTWV-WVHK---RGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATP 57
           C  G  ++ W  W  K    G+VTGG + SN GCQP + P C+H      E    + +TP
Sbjct: 153 CDGGYPAAAWRHWADKLLYEGIVTGGQYDSNAGCQPYTIPKCDHHEPGPYENCSGSQSTP 212

Query: 58  QPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
              C   C + +Y + +  DK+  K  Y ++ +V+ IQ EIM NGPV     +Y+D  +Y
Sbjct: 213 S--CKRSCIS-SYDKSYRSDKHYGKNSYSISSDVSSIQTEIMTNGPVEGAFSVYADFPTY 269

Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
            SG Y +                         +   +    +KI+GWG ENG PYW +  
Sbjct: 270 TSGVYQH------------------------TTGSFLGGHAIKILGWGTENGVPYWLVAN 305

Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
            +                         P W          GD G  KI+RG++E  IES 
Sbjct: 306 SW------------------------NPSW----------GDSGFFKIIRGKDECGIESS 331

Query: 238 VNGALPK 244
           +   +P+
Sbjct: 332 IVAGMPE 338


>gi|341888224|gb|EGT44159.1| hypothetical protein CAEBREN_15022 [Caenorhabditis brenneri]
          Length = 332

 Score = 88.6 bits (218), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 60/245 (24%), Positives = 96/245 (39%), Gaps = 60/245 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G +   W +  K GL TGG++ +  GC+P S  PC       + P C     P P C
Sbjct: 144 CGGGNAFKAWQYWGKHGLPTGGSYETQFGCKPYSIAPCGKTVGNVTYPACTNTTLPTPSC 203

Query: 62  HTRCTNDN-YGRGFFQDK-YRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
             +CT+ N Y     +D+ Y       + +   +IQ ++M NGP+     +Y D   Y +
Sbjct: 204 EKKCTSKNGYPVDIDKDRHYGASSVDQLPNRQIEIQSDVMLNGPIETTFEVYDDFLQYTT 263

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
           G Y                        V  +     + +V+I+GWG   G P        
Sbjct: 264 GIY------------------------VHLTGNKQGHLSVRILGWGMYEGVP-------- 291

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
                                     YW + +++G+++G+ GT + LRG NE  +E+   
Sbjct: 292 --------------------------YWLLANSWGKEWGENGTFRALRGTNECGLEANCV 325

Query: 240 GALPK 244
            A+PK
Sbjct: 326 SAMPK 330


>gi|341878049|gb|EGT33984.1| CBN-CPR-1 protein [Caenorhabditis brenneri]
          Length = 330

 Score = 88.6 bits (218), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 70/237 (29%), Positives = 95/237 (40%), Gaps = 69/237 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G       W   +G+VTGG +H   GC+P    PC     + S PE KT     P C
Sbjct: 156 CEGGYPIQALRWWDSKGVVTGGDYH-GAGCKPYPIAPCT----SGSCPESKT-----PAC 205

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + +DK+     Y V  +VA IQ EIM NGPV A   +Y D        
Sbjct: 206 SLSC-QPGYTTAYAKDKHFGTSAYAVAKKVASIQTEIMTNGPVEAAFTVYED-------- 256

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                           + YKSGVY  +A   +  +A +KI+GWG E+G PY         
Sbjct: 257 ---------------FYKYKSGVYKHTAGKALGGHA-IKIIGWGTESGSPY--------- 291

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                    W + +++G  +G+ G  KI RG ++  IES V
Sbjct: 292 -------------------------WLVANSWGTSWGESGFFKIFRGDDQCGIESAV 323


>gi|157058739|gb|ABV03127.1| cathepsin B-2744 [Acyrthosiphon pisum]
          Length = 260

 Score = 88.6 bits (218), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 60/178 (33%), Positives = 78/178 (43%), Gaps = 34/178 (19%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPK 60
           C  G +   W     +G+VTGG   SN GCQP    PC+H  Y  S    C +L   Q  
Sbjct: 96  CDGGSAFKAWELTMNKGIVTGGNFDSNEGCQPYKNRPCDH--YGDSRLTNCSSLRRTQMT 153

Query: 61  -CHTRCTNDNYGRGFFQDKYRFKRYY---WVNDEVADIQQEIMKNGPVVANMYLYSDIFS 116
            C  +C N NY   +  D ++    Y   W N  V  IQQEIM  GPV A MY+Y +   
Sbjct: 154 VCRKKCVNKNYKVKYEDDLHKTSIVYMTSWTN--VKQIQQEIMTYGPVTAFMYVYENFMG 211

Query: 117 YKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWG-EENGRPYW 173
           YK G Y                         S + E++ Y  VK++GWG + +G  YW
Sbjct: 212 YKEGIYK------------------------STTGELIGYHHVKLIGWGVDGDGTEYW 245


>gi|170028912|ref|XP_001842338.1| oryzain gamma chain [Culex quinquefasciatus]
 gi|167879388|gb|EDS42771.1| oryzain gamma chain [Culex quinquefasciatus]
          Length = 333

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 69/243 (28%), Positives = 96/243 (39%), Gaps = 62/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   + W    ++GLV+GG   S+ GC+P +  PC H       P CK   T  PKC
Sbjct: 152 CDGGAPGAGWKHWIEKGLVSGGPFGSDQGCRPYTIEPCVHVENGAQSP-CKDSIT--PKC 208

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C    Y   + +DK   K  Y + ++   I++EI  NGPV A   ++ D  SYK G 
Sbjct: 209 IKKCL-PGYNVPYAKDKSFGKSTYSIANDERQIRKEIFTNGPVEATFTVFDDFASYKHG- 266

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ + SG         +     V+I+GWG EN             
Sbjct: 267 ---------------IYQHTSG--------NLAGEHAVRILGWGVEN------------- 290

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                G  YW   +++   +GD G  KILRG N   IES +   
Sbjct: 291 ---------------------GTKYWLAANSWNSDWGDNGYFKILRGSNHVDIESAIVAG 329

Query: 242 LPK 244
           LPK
Sbjct: 330 LPK 332


>gi|56755295|gb|AAW25827.1| SJCHGC06356 protein [Schistosoma japonicum]
          Length = 279

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 64/243 (26%), Positives = 100/243 (41%), Gaps = 59/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G      V+    G+VTGG++   +GCQP   P C++ +  +   +C       P+C
Sbjct: 95  CFHGSEVEVLVYWITYGIVTGGSYEDQSGCQPYPLPKCSY-HPESRFLDCNNNTFEFPQC 153

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C  D Y + +  DK+  +R Y V     DIQ+EI+ NGPV+A+              
Sbjct: 154 TNEC-QDGYNKTYDDDKFYGERIYNVYGTQEDIQKEILMNGPVIAS-------------- 198

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                    + + +D   YKSGVY  +  +  + + T++I+GWG E   P          
Sbjct: 199 ---------ISVNTDFLVYKSGVYLPTPRSRNLGWITLRIIGWGYEGKIP---------- 239

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW   +++ E++G  G +KI RG     IES V   
Sbjct: 240 ------------------------YWLCANSWNEEWGANGYVKIQRGVQAGYIESYVRAP 275

Query: 242 LPK 244
           +PK
Sbjct: 276 IPK 278


>gi|204022088|dbj|BAG71141.1| cathepsin B-N2 [Tuberaphis styraci]
          Length = 334

 Score = 88.2 bits (217), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 72/245 (29%), Positives = 99/245 (40%), Gaps = 67/245 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           CS G     W    K GLVTGG + S  GCQP    PC    Y  +    K    P  K 
Sbjct: 154 CSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYKVSPCPLDEYGNNTCSGK----PAEKN 209

Query: 62  HTRCTNDNYGR---GFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
           H RCT   YG     F +D +  +  Y++      IQ                +D+ +Y 
Sbjct: 210 H-RCTQMCYGNQNLDFKEDHHYTRDAYYLT--YGTIQ----------------NDVLAY- 249

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
                 GP+ A+  +Y D  SYKSGVY    +A  +    VK++GWGEE G PY      
Sbjct: 250 ------GPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWGEEYGVPY------ 297

Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                       W +V+++ +Q+GD+G  KI RG NE   ++  
Sbjct: 298 ----------------------------WLLVNSWNDQWGDQGLFKIRRGTNECGTDNST 329

Query: 239 NGALP 243
            G +P
Sbjct: 330 TGGVP 334


>gi|239938574|gb|ACS36086.1| cysteine proteinase [Haemonchus contortus]
          Length = 253

 Score = 88.2 bits (217), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 72/237 (30%), Positives = 100/237 (42%), Gaps = 60/237 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G     + +  K+G VTGG + + +GC+P  F PC H    T   EC   AT  PKC
Sbjct: 72  CNGGWPIQAFNYFSKQGAVTGGDYKATSGCRPYPFHPCGHHGKDTYYGECPNEAT-TPKC 130

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C      + + +D+   K  Y V +    IQ+EIMKNGPVV    +Y D FSY    
Sbjct: 131 VRKCQKSYK-KSYKKDRSIGKDAYEVPNSEKAIQREIMKNGPVVGAFTVYED-FSY---- 184

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                             YK G+Y  +A      +A +KI+GWG+ENG PY         
Sbjct: 185 ------------------YKKGIYKHTAGKARGGHA-IKIIGWGKENGVPY--------- 216

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                    W I +++   +G+ G  +ILRG N   IE  V
Sbjct: 217 -------------------------WLIANSWHNDWGENGYFRILRGSNHCGIEENV 248


>gi|201023369|ref|NP_001128426.1| cathepsin B-3483 [Acyrthosiphon pisum]
 gi|328712086|ref|XP_003244726.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
          Length = 355

 Score = 88.2 bits (217), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 63/250 (25%), Positives = 103/250 (41%), Gaps = 69/250 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPE--------CKT 53
           CS G +++ W ++ K+G+VTGG + SN GCQP    PCN A+ T ++P         C  
Sbjct: 165 CSGGYTAAAWRYILKKGIVTGGDYGSNEGCQPWLVQPCN-ASTTAADPSSVLGPHGVCGG 223

Query: 54  LATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSD 113
                PKC   C N  +   +  D  + K+ +  +   A  ++ + K+GP V  M +Y D
Sbjct: 224 DPATTPKCDLSCYNARHEGKYLDDIIKAKKVFTFDGCSA--RKNLRKHGPYVVTMRVYED 281

Query: 114 IFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYW 173
             +YKSG Y +                         + + +   +V+++GWG E G    
Sbjct: 282 FLAYKSGVYHH------------------------VTGDYLGLLSVRMIGWGLEGG---- 313

Query: 174 TIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 233
                                         + +W + +++G  +GDKG  KI R  NE  
Sbjct: 314 ------------------------------QAFWLLANSWGTSWGDKGFFKIRRFVNECW 343

Query: 234 IESLVNGALP 243
           IE+     +P
Sbjct: 344 IENFRYAGVP 353


>gi|146217390|gb|ABQ10737.1| cathepsin B [Penaeus monodon]
          Length = 331

 Score = 88.2 bits (217), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 66/246 (26%), Positives = 100/246 (40%), Gaps = 63/246 (25%)

Query: 2   CSSGISSSTW-VWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 60
           C+ G   + +  WVH  G+V+GG+ +S  GCQP    PC H + +   P+C       PK
Sbjct: 148 CNGGFPGAAFKYWVHS-GIVSGGSFNSTQGCQPYEIAPCEH-HVSGPRPKCSE-GGGTPK 204

Query: 61  CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           C   C    Y   +  D +   + Y +  +   I+ EIM NGPV     +Y D   YKSG
Sbjct: 205 CAKTCEK-GYIVDYESDLHHGGKAYSIMKDEDQIKYEIMNNGPVEGAFTVYVDFLHYKSG 263

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
                           ++ ++ G+         +    ++++GWGEENG P         
Sbjct: 264 ----------------VYQHRHGL--------PLGGHAIRVLGWGEENGTP--------- 290

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                                    YW   +++   +GD G  KILRG +   IES ++ 
Sbjct: 291 -------------------------YWLCANSWNTDWGDNGLFKILRGSDHCGIESEISA 325

Query: 241 ALPKDN 246
            LPK N
Sbjct: 326 GLPKVN 331


>gi|38373697|gb|AAR19103.1| cathepsin B [Uronema marinum]
          Length = 350

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 76/251 (30%), Positives = 102/251 (40%), Gaps = 70/251 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAH-----HSNTGCQPVSFPPCNHANYTTSEPECKTLAT 56
           C+ G ++  W +  K GLV+G  +     +S T CQP SFPPC+H +       C  L  
Sbjct: 158 CNGGYTAGAWNYYVKTGLVSGNLYTDDNQNSKTECQPYSFPPCSH-HVQGEYQACTDL-- 214

Query: 57  PQ---PKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSD 113
           PQ   PKC+T C +      + QD ++    Y V      I+ EI + G   A+  +YSD
Sbjct: 215 PQFNTPKCYTECNSQYTQNSYEQDLHKGVSSYSVPKSEEQIKAEIYQYGSTTASFNVYSD 274

Query: 114 IFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYW 173
             +Y SG Y N                 SG Y        +    +K++GWG ENG PYW
Sbjct: 275 FLTYSSGVYQN----------------TSGSY--------MGGHAIKMLGWGVENGTPYW 310

Query: 174 TIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 233
                +  S               WGE                    G  KILRG NE  
Sbjct: 311 LCANSWNSS---------------WGE-------------------NGFFKILRGSNECG 336

Query: 234 IES-LVNGALP 243
           IES +V G +P
Sbjct: 337 IESGMVAGFVP 347


>gi|268570495|ref|XP_002648548.1| Hypothetical protein CBG24861 [Caenorhabditis briggsae]
          Length = 323

 Score = 87.8 bits (216), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 68/232 (29%), Positives = 93/232 (40%), Gaps = 71/232 (30%)

Query: 13  WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGR 72
           W + RG+VTGG     +GC+P  F PC       S PE KT     P C   C    Y  
Sbjct: 162 WWNSRGVVTGG-DFRGSGCRPYPFAPC------ISCPEEKT-----PTCSLSC-QFGYST 208

Query: 73  GFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMY 132
            + +DK      Y V   VA IQ EIM NGPVV    +Y D++ YKSG Y +        
Sbjct: 209 AYAKDKRFGVSAYAVARNVAAIQTEIMTNGPVVGAFTMYEDMYKYKSGVYRH-------- 260

Query: 133 LYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYAT 192
                            +  ++    +KI+GWG +NG PY                    
Sbjct: 261 ----------------TAGRLLGGHAIKIIGWGTQNGIPY-------------------- 284

Query: 193 VKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
                         W I +++G  +G+ G +K+ RG NE  IE  V   +P+
Sbjct: 285 --------------WLIANSWGANWGENGFLKMRRGVNECGIERAVVAGMPR 322


>gi|118429529|gb|ABK91812.1| cathepsin B precursor [Clonorchis sinensis]
          Length = 342

 Score = 87.8 bits (216), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 67/242 (27%), Positives = 94/242 (38%), Gaps = 61/242 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G S   W      G+VTGG+    TGC+   FP C H       P C     P P+C
Sbjct: 155 CRGGYSPIAWDLWKTHGIVTGGSKEKPTGCRSYPFPSCEHRG-KGQYPPCPHQLYPTPEC 213

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             RC  D     + +DK R    Y V      + +EIM  GPV A +++Y          
Sbjct: 214 IKRC--DTKEIDYEKDKTRANISYNVYPAEQAVMKEIMLRGPVGAILHVYE--------- 262

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                         D+  YKSGVY       +  +  ++I+GWGEE+G P          
Sbjct: 263 --------------DLLDYKSGVYFHVWGGHLGEHG-IRILGWGEEDGVP---------- 297

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW + +++ E +G+KG +++LR RNE  I   V   
Sbjct: 298 ------------------------YWLVANSWNEDWGEKGYMRVLRWRNECGIVDQVTAG 333

Query: 242 LP 243
           LP
Sbjct: 334 LP 335


>gi|52630925|gb|AAU84926.1| putative cathepsin B-N [Toxoptera citricida]
          Length = 340

 Score = 87.8 bits (216), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 73/249 (29%), Positives = 99/249 (39%), Gaps = 69/249 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G     W    K GLVTGG + S  GC+P   PPC +      E    T A    + 
Sbjct: 157 CNGGYPIKAWEHFKKHGLVTGGDYKSGEGCEPYRVPPCPY-----DESGNNTCAGKPMEA 211

Query: 62  HTRCTNDNYGRGF--FQDKYRFKR--YYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
           + RCT   YG     F + +R+ R  YY        IQ+                D+ +Y
Sbjct: 212 NHRCTRMCYGDQDLDFDEDHRYTRDSYYLT---YGSIQK----------------DVLTY 252

Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
                  GPV A+  +Y D  SYKSGVY  S +A  +     K++GWGEE G PY     
Sbjct: 253 -------GPVEASFDVYDDFPSYKSGVYIRSENASYLGGHAAKLIGWGEEYGVPY----- 300

Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
                                        W +V+++   +GD G  KI RG NE  I++ 
Sbjct: 301 -----------------------------WLMVNSWNADWGDNGLFKIQRGTNECGIDNS 331

Query: 238 VNGALPKDN 246
             G +P  N
Sbjct: 332 TTGGVPITN 340


>gi|48762491|dbj|BAD23815.1| cathepsin B-S1 [Tuberaphis coreana]
          Length = 334

 Score = 87.4 bits (215), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 64/242 (26%), Positives = 98/242 (40%), Gaps = 64/242 (26%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +   +G+ TGG + +  GC P   PPC +     +         P  + 
Sbjct: 154 CGGGYPIKAWKYFRTQGVTTGGDYDTKEGCMPYKVPPCYNKQGKNT-----CGGQPMERN 208

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           H +C    YG+   Q++Y+ K  Y +N  +  I+Q                D+ +Y    
Sbjct: 209 H-QCPKTCYGKTTVQNRYKTKSEYSINS-IKTIEQ----------------DLKTY---- 246

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
              GPV A+  +Y D   YKSG+Y  +  A+     ++KI+GWG+ENG  YW  V  ++ 
Sbjct: 247 ---GPVEASFDVYDDFSVYKSGIYRKTPKAKYEGRHSIKIIGWGQENGTTYWLAVNSWS- 302

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   +W          G+ GT KI++GRNE  IE  V   
Sbjct: 303 -----------------------KFW----------GEHGTFKIIKGRNECGIERAVTAG 329

Query: 242 LP 243
           +P
Sbjct: 330 IP 331


>gi|3929817|emb|CAA77181.1| cathepsin B [Mus musculus]
          Length = 194

 Score = 87.4 bits (215), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 54/172 (31%), Positives = 84/172 (48%), Gaps = 27/172 (15%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S  W +  K+GLV+GG + S+ GC P + PPC H    +  P      T  P+C
Sbjct: 46  CNGGYPSGAWNFWTKKGLVSGGVYDSHIGCLPYTIPPCEHHVNGSRPPMHGEGDT--PRC 103

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +  C    Y   + +DK+     Y V++ V +I  EI KNGPV     ++SD  +YKSG 
Sbjct: 104 NKSC-EAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNGPVEGAFTVFSDFLTYKSG- 161

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYW 173
                          ++ +++G        +++    ++I+GWG ENG PYW
Sbjct: 162 ---------------VYKHEAG--------DMMGGHAIRILGWGVENGVPYW 190


>gi|161343871|tpg|DAA06116.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 276

 Score = 87.4 bits (215), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 73/247 (29%), Positives = 100/247 (40%), Gaps = 71/247 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPC---NHANYTTSEPECKTLATPQ 58
           CS G     W    K GLVTGG + S  GC+P   PPC   +  N T S         P 
Sbjct: 94  CSGGYPIRAWKRYKKHGLVTGGNYKSGEGCEPYRVPPCPNDDQGNNTCS-------GQPM 146

Query: 59  PKCHTRCTNDNYGRGF--FQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFS 116
            K H RCT   YG     F + +R+ R     D      + I K            D+ +
Sbjct: 147 EKNH-RCTRMCYGDQDLDFDEDHRYTR-----DHYYLTYRGIQK------------DVIN 188

Query: 117 YKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
           Y       GP+ A+  +Y D  SYKSG+Y  S +A  +   +VK++GWGEE G  Y    
Sbjct: 189 Y-------GPIEASFDVYDDFPSYKSGIYVKSENASYLGGHSVKLIGWGEEYGVLY---- 237

Query: 177 RVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
                                         W +V+++   +GDKG  KI RG NE  +++
Sbjct: 238 ------------------------------WLMVNSWNADWGDKGLFKIRRGTNECGVDN 267

Query: 237 LVNGALP 243
              G +P
Sbjct: 268 STTGGVP 274


>gi|299471123|emb|CBN78981.1| cathepsin B-like proteinase [Ectocarpus siliculosus]
          Length = 557

 Score = 87.4 bits (215), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 60/242 (24%), Positives = 100/242 (41%), Gaps = 61/242 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHH---SNTGCQPVSFPPCNH--ANYTTSEPECKTLAT 56
           C+ G   S W W  K G+VTGG +    + T C+P  F PC H      +  P C     
Sbjct: 367 CNGGQPGSAWKWFTKTGVVTGGDYADIGTGTTCKPYEFMPCAHHVDPGASGYPACPDGEY 426

Query: 57  PQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFS 116
           P P+C + C+  N+  G + +  +  R  +    + +IQ+++MK G V A   ++SD  +
Sbjct: 427 PTPECLSECSETNFSGGSYGEDKKMAREAYSLAGIENIQRDMMKYGSVTAAFSVFSDFLT 486

Query: 117 YKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
           Y  G                +++++SG +        +    VK++GWG +         
Sbjct: 487 YSGG----------------VYTHESGSF--------MGGHAVKMIGWGTD--------- 513

Query: 177 RVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
                                  E +G  YW I +++   +G+ G  +ILRG NE  IE 
Sbjct: 514 -----------------------EVSGEDYWLIANSWNPSWGEGGLFRILRGVNECGIEG 550

Query: 237 LV 238
            +
Sbjct: 551 QI 552


>gi|44965401|gb|AAS49537.1| cathepsin B [Latimeria chalumnae]
          Length = 225

 Score = 87.4 bits (215), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 55/168 (32%), Positives = 76/168 (45%), Gaps = 26/168 (15%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S  W +  + GLV+GG   S+ GC+P + PPC H +   S P C       PKC
Sbjct: 83  CNGGYPSGAWNFWTETGLVSGGLFKSHIGCRPYTIPPCEH-HVNGSRPSCTGEEGDTPKC 141

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C    Y   +F+DK+     Y V+   ADIQ EI KNGPV     +Y D   YKSG 
Sbjct: 142 VMQC-EAGYTPSYFKDKHFGSTSYAVSSNEADIQIEIYKNGPVEGAFTVYEDFLQYKSGV 200

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENG 169
           Y +                         + + V    ++I+GWG E+G
Sbjct: 201 YKH------------------------VTGDAVGGHAIRILGWGVESG 224


>gi|268555786|ref|XP_002635882.1| Hypothetical protein CBG01102 [Caenorhabditis briggsae]
          Length = 374

 Score = 87.4 bits (215), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 58/243 (23%), Positives = 96/243 (39%), Gaps = 59/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G     W +  K GL TGG++ S  GC+P S  PC+      + P C       P C
Sbjct: 189 CAGGNVFKAWQYWQKHGLPTGGSYESQFGCKPYSISPCDTVIGNITFPGCLNSTVQTPSC 248

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C +  Y     +D++       + +   +IQ ++M NGP+ A M +Y D   Y +G 
Sbjct: 249 EKKCKS-GYPVELDKDRHYGVSVDQLPNRQIEIQSDVMLNGPISATMEVYDDFLQYTTGI 307

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y                        V  +     + +V+I+GWG   G P          
Sbjct: 308 Y------------------------VHLTGNKQGHLSVRILGWGMYEGVP---------- 333

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW + +++G+Q+G+ GT ++LRG NE  +E+     
Sbjct: 334 ------------------------YWLLANSWGKQWGENGTFRVLRGVNECGLEANCVSG 369

Query: 242 LPK 244
           +P+
Sbjct: 370 MPR 372


>gi|339239305|ref|XP_003381207.1| cathepsin B [Trichinella spiralis]
 gi|316975778|gb|EFV59177.1| cathepsin B [Trichinella spiralis]
          Length = 343

 Score = 87.0 bits (214), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 71/239 (29%), Positives = 100/239 (41%), Gaps = 69/239 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G     + + ++ G+ TGG + S +GC+P S  P          P   + A   P C
Sbjct: 163 CNGGFPLLAFKYWNEIGVPTGGPYGSKSGCKPFSIAP----------PTSSSTAAQTPLC 212

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWV---NDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
             +C +D Y R   +D+Y  + YY +   N  V  IQ+EIM +GPVVA M ++     YK
Sbjct: 213 QLKCISD-YKRKLDKDRYYGESYYLITSSNQPVKTIQREIMDHGPVVAAMEIFESFLYYK 271

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
           SG Y       +  L         G++A            VK++GWGE+   PYW +V  
Sbjct: 272 SGVYSANKRNDDPSL---------GLHA------------VKLIGWGEQKRIPYWLVVN- 309

Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
                              W            +TFGEQ    G  KI RG NE  IE+L
Sbjct: 310 ------------------SWN-----------TTFGEQ----GLFKIRRGTNECGIENL 335


>gi|294939825|ref|XP_002782575.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239894358|gb|EER14370.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 398

 Score = 87.0 bits (214), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 67/249 (26%), Positives = 96/249 (38%), Gaps = 68/249 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAH------HSNTGCQPVSFPPCNHANYTTSEPECKTLA 55
           C  G   S W WVH  G+ TGG +        + GC P  FPPC H       P C   A
Sbjct: 211 CRGGFPYSAWSWVHDEGIATGGDYVPRDNMTEDDGCWPYDFPPCAHFFKDPKYPACPKFA 270

Query: 56  TPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIF 115
               +C ++  +      +F D     RY+ V         +  KN              
Sbjct: 271 RVNLRCVSKLRH--MMVVYFSD-----RYFMVESVPYHFSADDAKNAIRT---------- 313

Query: 116 SYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
                   +GPV A  Y+Y D  +YKSGVY  ++ + + A+A                  
Sbjct: 314 --------DGPVSATFYVYEDFLAYKSGVYKHTSGSLLGAHA------------------ 347

Query: 176 VRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
                            VK+IGWGE+ G  YW +V+++ E +GD G  KI  G  +  I+
Sbjct: 348 -----------------VKIIGWGEDGGEAYWLVVNSWNEGWGDHGLFKIALG--DCGID 388

Query: 236 SLVNGALPK 244
           + + G  PK
Sbjct: 389 NELLGGTPK 397


>gi|76576339|gb|ABA53863.1| cathepsin B-like cysteine protease 1 [Parelaphostrongylus tenuis]
          Length = 346

 Score = 87.0 bits (214), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 66/243 (27%), Positives = 95/243 (39%), Gaps = 59/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G +   W +    G+VTG  + + +GC+P  +PPC H        +C     P   C
Sbjct: 163 CEGGDTYKAWNYWTTDGIVTGSNYTTKSGCKPYPYPPCEHYIDAGRYKKCPKDLYPTNTC 222

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C  DNY   + +DK+     Y +  + + IQQEIM +GPV     +Y D   Y SG 
Sbjct: 223 EYKC-QDNYTISYDEDKHYGAYPYVLVGDASFIQQEIMNHGPVEVTFDVYEDFEHYSSG- 280

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ + +G Y        V    VK++GWG EN             
Sbjct: 281 ---------------IYKHMAGEY--------VGVHAVKMLGWGTEN------------- 304

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                G  YW   +++   +G+ G  +ILRG NE  IES V   
Sbjct: 305 ---------------------GVDYWICANSWNSDWGENGFFRILRGENECGIESNVVAG 343

Query: 242 LPK 244
            PK
Sbjct: 344 KPK 346


>gi|145481831|ref|XP_001426938.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124394016|emb|CAK59540.1| unnamed protein product [Paramecium tetraurelia]
          Length = 332

 Score = 87.0 bits (214), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 68/247 (27%), Positives = 101/247 (40%), Gaps = 66/247 (26%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHAN----YTTSEPECKTLATP 57
           C  G     W ++   G+VTGG ++  + C+P SFPPC+H N    Y+  E +   L   
Sbjct: 145 CDGGYPYGAWKYLRVDGIVTGGTYNDFSLCKPYSFPPCSHGNDSGKYSKCENDFFMLTEV 204

Query: 58  QPKCHTRCTNDNYGRGFFQDKYRFKR--YYWVNDEVADIQQEIMKNGPVVANMYLYSDIF 115
            P C  +C +  + R +  DK R +   Y  + D+   I+ EI  NGPV A   ++ D  
Sbjct: 205 TPSCTKKC-HPQFSRTYDVDKIRSRENPYKLIKDQ-EQIKNEIYLNGPVQAVFTVFDDFL 262

Query: 116 SYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
           +YKSG            +Y      + G +A            VKI+GWG ENG P    
Sbjct: 263 NYKSG------------VYQQTTGQRRGKHA------------VKIIGWGTENGVP---- 294

Query: 176 VRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
                                         YW  ++++ + +G  G  KILRG N   IE
Sbjct: 295 ------------------------------YWEAINSWNDGWGINGKFKILRGFNHLDIE 324

Query: 236 SLVNGAL 242
             V  ++
Sbjct: 325 GEVYASI 331


>gi|308512693|gb|ADO33000.1| cathepsin B [Biston betularia]
          Length = 217

 Score = 87.0 bits (214), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 67/242 (27%), Positives = 95/242 (39%), Gaps = 61/242 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G+ +  W +    GLV+GG ++S+ GC P   PPC H       P C    T  PKC
Sbjct: 32  CNGGMPTLAWEYWKHMGLVSGGNYNSSQGCSPYVIPPCEHHVPGNRLP-CNG-DTKTPKC 89

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C N  Y   + +DK   K  Y V      I+ E+ KNGPV A   +Y+D+ +YKSG 
Sbjct: 90  SKTCEN-GYNVLYKKDKRYGKHVYAVRGGEDHIKAELFKNGPVEAAFTVYADLLAYKSGV 148

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                           + +    +KI+GWG ENG  Y         
Sbjct: 149 YKH------------------------VEGDALGGHAIKIIGWGVENGNKY--------- 175

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                    W I +++   +G+ G  KILRG +   IES +   
Sbjct: 176 -------------------------WLIANSWNTDWGNNGFFKILRGEDHCGIESSIVAG 210

Query: 242 LP 243
            P
Sbjct: 211 EP 212


>gi|268561878|ref|XP_002638441.1| Hypothetical protein CBG18657 [Caenorhabditis briggsae]
          Length = 372

 Score = 87.0 bits (214), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 72/250 (28%), Positives = 104/250 (41%), Gaps = 67/250 (26%)

Query: 18  GLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQD 77
           G+VTGG  ++ TGCQP +FPPC+    + S P C      Q KC T         G+ + 
Sbjct: 162 GVVTGG-DYNGTGCQPYTFPPCSSCEASKSTPSC------QKKCQT---------GYLEA 205

Query: 78  KYRFKRYYWVNDEVADIQQE-------IMKNGP--------VVANMYLYSDIFSYKSGKY 122
            Y+  + +   ++ +    E       I+K G           +N      I + ++  Y
Sbjct: 206 TYKNDKRFENEEQDSSYMSENFYQVLIILKGGKSAYRLSTTTSSNKISTDAIITIQTEIY 265

Query: 123 GNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVS 182
            NGPV  +  ++ D + YKSGVY    S ++     VKI+GWG E               
Sbjct: 266 NNGPVEVSYRVFEDFYQYKSGVYHY-VSGKLTGAHAVKIIGWGTE--------------- 309

Query: 183 ASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 242
                              N   YW + +++G  FG+KG  KI RG NE  IE  V   L
Sbjct: 310 -------------------NKVDYWLVANSWGTDFGEKGFFKIRRGTNECGIEENVVAGL 350

Query: 243 PKDNYGVEFG 252
            K N G +FG
Sbjct: 351 AK-NGGTKFG 359


>gi|306992171|gb|ADN19566.1| cathepsin B-like proteinase [Spodoptera frugiperda]
          Length = 341

 Score = 87.0 bits (214), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 69/245 (28%), Positives = 102/245 (41%), Gaps = 67/245 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNH---ANYTTSEPECKTLATPQ 58
           C+ G+ +  W +    GLV+GG+++S  GC+P   PPC H    N      + KT     
Sbjct: 156 CNGGMPTLAWEYWKHFGLVSGGSYNSGQGCRPYEIPPCEHHVPGNRVPCNGDSKT----- 210

Query: 59  PKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
           PKCH  C   +Y   + +DK   K  Y V+ +   I+ E+ KNGPV     +YSD     
Sbjct: 211 PKCHKTC-EASYSVDYHKDKRYGKHVYSVSSKEDHIKAELFKNGPVEGAFTVYSD----- 264

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
                             + +YK+GVY  +    +  +A +KI+GWG ENG  Y    R+
Sbjct: 265 ------------------LLNYKNGVYKHTVGNALGGHA-IKILGWGVENGNKY----RL 301

Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                         I +++   +GD G  KILRG +   IES +
Sbjct: 302 ------------------------------IANSWNSDWGDNGFFKILRGEDHCGIESSI 331

Query: 239 NGALP 243
               P
Sbjct: 332 VAGEP 336


>gi|347972088|ref|XP_313836.5| AGAP004534-PA [Anopheles gambiae str. PEST]
 gi|333469166|gb|EAA09182.5| AGAP004534-PA [Anopheles gambiae str. PEST]
          Length = 334

 Score = 87.0 bits (214), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 70/240 (29%), Positives = 96/240 (40%), Gaps = 69/240 (28%)

Query: 4   SGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHT 63
            G S   WV     GLV+GGA++S  GC+P  F PC +       P         PKC  
Sbjct: 162 DGTSFQYWV---DAGLVSGGAYNSTDGCKPYPFKPCEY-------PFNDCHVEISPKCTH 211

Query: 64  RCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYG 123
            C  D   R + +DK   K  Y V  +   I+ EIM NGPV A   +Y D+  YKSG   
Sbjct: 212 HC-RDGVDRHYSKDKLFGKVAYSVPRDERAIRYEIMTNGPVEAGFDVYEDVLLYKSG--- 267

Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
                    +Y  ++  + G +A            V+I+GWG + G PY           
Sbjct: 268 ---------VYRHVYGEQIGKHA------------VRIIGWGRDGGIPY----------- 295

Query: 184 SAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
                                  W I +++G+ +GD G  K +RG N   IES +   LP
Sbjct: 296 -----------------------WLIANSYGDDWGDHGYFKFVRGSNHLGIESKIITGLP 332


>gi|1181143|emb|CAA93278.1| cysteine proteinase [Haemonchus contortus]
          Length = 341

 Score = 86.7 bits (213), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 61/175 (34%), Positives = 83/175 (47%), Gaps = 26/175 (14%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G     + +  K+G VTGG + + +GC+P  F PC H    T   EC   AT  PKC
Sbjct: 160 CNGGWPIQAFNYFSKQGAVTGGDYKATSGCRPYPFHPCGHHGKDTYYGECPNEAT-TPKC 218

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C      + + +D+   K  Y V +    IQ+EIMKNGPVV    +Y D FSY    
Sbjct: 219 VRKCQKSYK-KSYKKDRSIGKDAYEVPNSEKAIQREIMKNGPVVGAFTVYED-FSY---- 272

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
                             YK G+Y  +A      +A +KI+GWG+E G PYW I 
Sbjct: 273 ------------------YKKGIYKHTAGKARGGHA-IKIIGWGKEGGVPYWLIA 308


>gi|335347291|gb|AEH42093.1| cysteine proteinase 6 [Haemonchus contortus]
          Length = 346

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 65/222 (29%), Positives = 96/222 (43%), Gaps = 60/222 (27%)

Query: 17  RGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQ 76
           +G+VTGG + S TGC+P  F PC H    T   EC    +  P+C  +C    Y   + +
Sbjct: 178 QGVVTGGDYGSKTGCRPYPFHPCGHHGNETYYGECPKEES-TPECVKQCQK-GYKNSYRR 235

Query: 77  DKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSD 136
           DK   + YY V + V  IQ+EIM++GPVV++  +Y D FSY                   
Sbjct: 236 DKTWGEDYYEVENSVKAIQREIMRSGPVVSSFTVYDD-FSY------------------- 275

Query: 137 IFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLI 196
              Y  G+Y  +A     ++A                                   +K+I
Sbjct: 276 ---YVKGIYKHTAGKARGSHA-----------------------------------IKII 297

Query: 197 GWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
           GWG E   PYW I +++   +G+KG  +++RG N   IE  V
Sbjct: 298 GWGTEKNVPYWIIANSWHNDWGEKGFFRMVRGTNHCGIEEDV 339


>gi|3087799|emb|CAA93276.1| cysteine proteinase [Haemonchus contortus]
          Length = 350

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 56/177 (31%), Positives = 81/177 (45%), Gaps = 32/177 (18%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCN-HANYTTSEPECKTLATP--Q 58
           C  G     W WV + G+VTGG +     C+P +F PC  H       P   + +TP  +
Sbjct: 164 CEGGYDHLAWEWVQRFGVVTGGPYQQKGVCRPYAFHPCGLHHGRRYDCPWDHSFSTPACK 223

Query: 59  PKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
           P C        YG+ + +DK+  K  Y ++++   IQ+E+MKNGPV A    Y D   YK
Sbjct: 224 PYCQF-----GYGKRYEKDKFFVKSTYILDNDEKVIQREMMKNGPVQAAFITYEDFSPYK 278

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
            G            +Y  +   + G +A            VK++GWG ENG  YWT+
Sbjct: 279 GG------------IYVHVKGRERGAHA------------VKLIGWGVENGTKYWTV 311


>gi|358331547|dbj|GAA35870.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 508

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 68/241 (28%), Positives = 95/241 (39%), Gaps = 61/241 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G  +  W +    G+VTGG+    +GC+   FP C H +     P C     P P+C
Sbjct: 155 CRGGYPAVAWDYWKTHGIVTGGSKEDPSGCRSYPFPKCEH-HVQGHYPPCPRELYPTPEC 213

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C  D    G+ +DK R    Y +      I +EIM  GPV A       IF+     
Sbjct: 214 VQQC--DTPDVGYLEDKTRANMSYNIYASEISIMKEIMLRGPVEA-------IFT----- 259

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                      +Y D   Y SGVY  +  A +  +A V+I+GWGE    PY         
Sbjct: 260 -----------MYEDFLRYSSGVYFHALGAPMSGHA-VRILGWGELGNVPY--------- 298

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                    W I +++ E +G++G +K LRG NE  IE  V   
Sbjct: 299 -------------------------WLIANSWNEDWGEEGYMKFLRGYNECGIEDDVTAV 333

Query: 242 L 242
           L
Sbjct: 334 L 334


>gi|294914603|ref|XP_002778294.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239886508|gb|EER10089.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 365

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 62/252 (24%), Positives = 101/252 (40%), Gaps = 67/252 (26%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGA------HHSNTGCQPVSFPPCNHANYTTSEPECKTLA 55
           CS G   ++W ++H  G+V+GG         +  GC P +FP C H    +    C    
Sbjct: 174 CSGGNPITSWTFLHTNGIVSGGGFVPEKNMKAADGCWPYNFPKCAHHQKESDYKPCAKEI 233

Query: 56  TPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVN-DEVADIQQEIMKNGPVVANMYLYSDI 114
              P C + C N  YG  F +D++  +  +       + I++EIM NGP  A   +Y D 
Sbjct: 234 YDTPSCSSSCPNAKYGTAFDKDRHYTESLFPSRFGSTSSIKKEIMTNGPTSAAFSVYEDF 293

Query: 115 FSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWT 174
            SYKSG                ++ + SG +        +    V+I+GWG E G  Y  
Sbjct: 294 LSYKSG----------------VYKHTSGGF--------LGGHAVEIIGWGTEKGVDY-- 327

Query: 175 IVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 234
                                           W +++++ E++GD GT KI++G  +  I
Sbjct: 328 --------------------------------WLVMNSWNEEWGDHGTFKIVQG--DCGI 353

Query: 235 ESLVNGALPKDN 246
           + ++    P  N
Sbjct: 354 DDMILAGTPAIN 365


>gi|239793607|dbj|BAH72912.1| ACYPI000019 [Acyrthosiphon pisum]
          Length = 188

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 67/237 (28%), Positives = 88/237 (37%), Gaps = 70/237 (29%)

Query: 17  RGLVTGGAHHSNT-------GCQPVSFPPCNHANYTTSEPECKTLATPQ-PKCHTRCTND 68
           RG++TG              GCQP + PPC   N       C T    + P C  +C N 
Sbjct: 11  RGIITGDMGLCQVEIITPTQGCQPYTIPPCKLMNEKPPGHSCTTYHREETPICEKKCYNP 70

Query: 69  NYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVV 128
           NY   F  D Y+ K Y               K  P +A      DIF        NGP+ 
Sbjct: 71  NYYTSFRTDIYKGKYY---------------KLSPYMAM----KDIFD-------NGPIT 104

Query: 129 ANMYLYSDIFSYKSGVYAVSASAEIVAYA--TVKIVGWGEENGRPYWTIVRVYAVSASAE 186
              Y+Y D+  YKSGVY     ++   +   +VKI GWGEENG P               
Sbjct: 105 TQFYMYRDLVDYKSGVYQYDEQSDFDFFTVHSVKIFGWGEENGVP--------------- 149

Query: 187 IVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
                              YW + ++FG  +G  GT KI RG +    +  +   LP
Sbjct: 150 -------------------YWLVANSFGTDWGYNGTFKISRGNDGCFFQEKMYAGLP 187


>gi|161343867|tpg|DAA06114.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 340

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 71/249 (28%), Positives = 96/249 (38%), Gaps = 69/249 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G     W      GLVTGG + S  GC+P   PPC H     +E        P  K 
Sbjct: 157 CNGGYPIKAWERFKSHGLVTGGDYKSGEGCEPYRVPPCRHH----AEGNNSCSDKPMEKN 212

Query: 62  HTRCTNDNYGRGFFQ----DKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
           H RCT   YG          +Y    YY        IQ+++M  GP+ A+     D+   
Sbjct: 213 H-RCTRMCYGDQDLDFDDDHRYTRDSYYLT---YGSIQKDVMNYGPIEASF----DV--- 261

Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
                           Y D  SYKSGVY  S +A  +    VK++GWGEE+G PY     
Sbjct: 262 ----------------YDDFPSYKSGVYIRSDNASYLGGHAVKLIGWGEESGVPY----- 300

Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
                                        W +V+++   +GDKG  KI RG NE  +++ 
Sbjct: 301 -----------------------------WLMVNSWNTDWGDKGLFKIQRGTNECGVDNS 331

Query: 238 VNGALPKDN 246
               +P  N
Sbjct: 332 TTAGVPVTN 340


>gi|358341561|dbj|GAA37330.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 347

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 65/235 (27%), Positives = 91/235 (38%), Gaps = 61/235 (25%)

Query: 11  WVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPKCHTRCTNDN 69
           W +    G+VTGG +  +  C P  FPPC H     SE P C       P+C + C    
Sbjct: 165 WDYWRDNGIVTGGEYKDSHTCLPYPFPPCRHHGAKGSEYPPCPEKMYSTPQCVSEC-QKG 223

Query: 70  YGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVA 129
           Y   +  DK R    Y +   V  IQ+EI   GPV A M +Y+D  +Y  G Y +     
Sbjct: 224 YATKYEDDKIRASTSYNLYRSVTTIQKEIWMRGPVEATMNVYTDFANYAGGVYKH----- 278

Query: 130 NMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVRVYAVSASAEIV 188
                               + E++    ++++GWG EE+G PYW     +         
Sbjct: 279 -------------------TTGELLGGHAIRLLGWGVEEDGTPYWLAANSW--------- 310

Query: 189 AYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
                           P W          G+KG  +ILRG +   IES V+  LP
Sbjct: 311 ---------------NPSW----------GEKGFFRILRGSDHCGIESDVSAGLP 340


>gi|320166129|gb|EFW43028.1| cathepsin B [Capsaspora owczarzaki ATCC 30864]
          Length = 332

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 65/243 (26%), Positives = 94/243 (38%), Gaps = 60/243 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G    +W +   +G+VTG  +++   C+P  FP C H   +   P+C +     PKC
Sbjct: 148 CDGGQLGPSWDYYKNKGIVTGYLYNTTGYCKPYDFPACAHHEASPDYPDCPSTDYSTPKC 207

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C        +  D +  +  Y V    A IQ EI+ +GPV A   +YSD  +Y+SG 
Sbjct: 208 TKSCVAGYTANTYTADLHYGQSSYSVGRTDAAIQTEILNHGPVEAAFTVYSDFPTYRSGV 267

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         S  ++    + IVGWG E+G PYW +      
Sbjct: 268 YKH------------------------TSGSVLGGHAISIVGWGTESGSPYWLV------ 297

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                             + +  P W          GD G  KILRG  +  I + V G 
Sbjct: 298 ------------------KNSWNPSW----------GDGGFFKILRG--DCGINNDVVGG 327

Query: 242 LPK 244
           LPK
Sbjct: 328 LPK 330


>gi|161343875|tpg|DAA06118.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 210

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 56/182 (30%), Positives = 80/182 (43%), Gaps = 30/182 (16%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   + W +  + G+VTGG + S  GCQP S  P      T  + +  T     P C
Sbjct: 45  CDGGSPEAAWYFFMRHGIVTGGDYESGDGCQPYSIYPRGKGRNTCIDDDIDT-----PDC 99

Query: 62  HTR-CTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
             R CTN NY +G+  D +     Y ++    DI  +I KNGPV A  Y+Y+D   YKSG
Sbjct: 100 SIRTCTNSNYTKGYRADLHYVDTVYSLSRSEEDIMTDIYKNGPVQAAFYVYTDFMYYKSG 159

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
                           ++SY  G        +I     +KI+GWG ++   YW     ++
Sbjct: 160 ----------------VYSYTRG--------QIEGGHAIKILGWGVDDNTKYWLCANSWS 195

Query: 181 VS 182
            S
Sbjct: 196 RS 197


>gi|358341867|dbj|GAA49438.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 952

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 71/259 (27%), Positives = 105/259 (40%), Gaps = 62/259 (23%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C +G     W +    G+VTGG+    +GC+   FP C H       P C     P P+C
Sbjct: 120 CGAGFHPMAWDFWKTHGIVTGGSKEEPSGCRSFPFPKCGHRR-KGRYPPCPRHIYPTPEC 178

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C  D     + +DK R    Y V      I +EIM NGPV A+              
Sbjct: 179 IKQC--DEPEVNYEKDKTRANISYNVYPSDISIMKEIMLNGPVEAS-------------- 222

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           +G         +Y+D   Y  GVY       I  +A ++I+GWGE++G PY         
Sbjct: 223 FG---------IYADFLEYNGGVYFHCWGGPISRHA-IRILGWGEDDGVPY--------- 263

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                    W I +++ E +G+KG ++ LRG NE  IE  V  A
Sbjct: 264 -------------------------WLIANSWNEDWGEKGYVRFLRGHNECGIEEEVT-A 297

Query: 242 LPKDNYGVEFGEESGERLS 260
           +P D +  +  ++S  R +
Sbjct: 298 VPIDWFLRQMIKQSTLRCT 316



 Score = 77.8 bits (190), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 67/276 (24%), Positives = 102/276 (36%), Gaps = 72/276 (26%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G S   W +    G+VTGG+    TGC+   FP C H       P C     P P+C
Sbjct: 708 CRGGYSPIAWDFWKTHGIVTGGSKEKPTGCRSYPFPSCEHRG-KGQYPPCPHQLYPTPEC 766

Query: 62  HTRCTNDNYGRGFFQDKYR----------FKRYYWVNDEVADIQQEIMKNGPVVANMYLY 111
             RC  D     + +DK R            R+ +      +   +   +   +  M+  
Sbjct: 767 IKRC--DTKEIDYEKDKTRGFDSASSEQLADRHCFHTSNFGEASAQRTLHLTCLNFMHHS 824

Query: 112 SDIFSYKSGK------------------------YGNGPVVANMYLYSDIFSYKSGVYAV 147
            D+ S +  K                           GPV A +++Y D+  YKSGVY  
Sbjct: 825 IDLLSSRLEKAVLRSTANISYNVYPAEQAVMKEIMLRGPVGAILHVYEDLLDYKSGVYFH 884

Query: 148 SASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYW 207
                +  +  ++I+GWGEE+G P                                  YW
Sbjct: 885 VWGGHLGEHG-IRILGWGEEDGVP----------------------------------YW 909

Query: 208 TIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
            + +++ E +G+KG +++LR RNE  I   V   LP
Sbjct: 910 LVANSWNEDWGEKGYMRVLRWRNECGIVDQVTAGLP 945


>gi|157092993|gb|ABV22151.1| cysteine proteinase [Perkinsus chesapeaki]
          Length = 396

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 58/182 (31%), Positives = 81/182 (44%), Gaps = 35/182 (19%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHS------NTGCQPVSFPPCNHANYTTSEPECKTLA 55
           C  G S   W W+H  G+VTGG + +      + GC P   PPC H   +T  P+C    
Sbjct: 207 CHGGSSLDAWQWLHTTGVVTGGDYSAEKDMTESDGCWPYDIPPCAHYTNSTLYPKCPKTK 266

Query: 56  TPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVAD-IQQEIMKNGPVVANMYLYSDI 114
              P C   C N  Y     +D++  +          D I++EIM NGPV A+ YL    
Sbjct: 267 YDFPTCQESCPNKKYDTPMEKDRHFVEEESLSALRSIDAIKKEIMTNGPVSAS-YL---- 321

Query: 115 FSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWT 174
                             +Y D  +YKSGVY  ++   +  +A VKI+GWGE+    YW 
Sbjct: 322 ------------------VYDDFLTYKSGVYKRTSHNALGGHA-VKIIGWGED----YWL 358

Query: 175 IV 176
           +V
Sbjct: 359 VV 360


>gi|226472808|emb|CAX71090.1| cathepsin B [Schistosoma japonicum]
          Length = 325

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 53/174 (30%), Positives = 78/174 (44%), Gaps = 27/174 (15%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   S W++   +G+VTG  +++  GCQP  FPPC H +     P C       P C
Sbjct: 164 CNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPYEFPPCEH-HTLGPLPVCDG-DVETPPC 221

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   +  DK+  K  Y V      I +E+M++GPV  +  +Y+D  +YKSG 
Sbjct: 222 KRTC-QAGYNVSYENDKWYGKVVYRVKSNQEAIMKELMQHGPVEVDFEVYADFPNYKSGV 280

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
           Y +                         S  ++    V+++GWGEEN  PYW I
Sbjct: 281 YQH------------------------VSGALLGGHAVRLLGWGEENNVPYWLI 310


>gi|193688336|ref|XP_001945899.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
           [Acyrthosiphon pisum]
          Length = 308

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 67/239 (28%), Positives = 96/239 (40%), Gaps = 76/239 (31%)

Query: 9   STWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTND 68
           S W ++   G+V+GG ++SN GCQP  FPP   AN             P+      C + 
Sbjct: 141 SIWEYLKSHGVVSGGKYNSNDGCQPFKFPPI--AN------------IPKHLHKHTCDDH 186

Query: 69  NYGR---GFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNG 125
            YG     +  D  R + YY +     DIQ+E+   GPVV   ++  D            
Sbjct: 187 CYGNSTINYNHDHVRVRNYYTI--RTRDIQKEVQTYGPVVVR-FMVCD------------ 231

Query: 126 PVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASA 185
                     D F YKSGVYA S  A+ +     K++GWG ENG  Y             
Sbjct: 232 ----------DFFLYKSGVYAKSDKAKGIRTQYAKLIGWGVENGVDY------------- 268

Query: 186 EIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
                                W +++++G ++G KG  KI  G N+  +ES V   LP+
Sbjct: 269 ---------------------WLVINSWGHEWGQKGLFKIKSGTNQCGVESFVYAGLPE 306


>gi|239938576|gb|ACS36087.1| cysteine proteinase [Haemonchus contortus]
          Length = 253

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 71/237 (29%), Positives = 99/237 (41%), Gaps = 60/237 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G     + +  K+G VTGG + + +GC+P  F PC H    T   EC   AT  PKC
Sbjct: 72  CNGGWPIQAFNYFSKQGAVTGGDYKATSGCRPYPFHPCGHHGKDTYYGECPNEAT-TPKC 130

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C      + + +D+   K  Y V +    IQ+EIMKNGPVV    +Y D FSY    
Sbjct: 131 VRKCQKSYK-KSYKKDRSIGKDAYEVPNSEKAIQREIMKNGPVVGAFTVYED-FSY---- 184

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                             YK G+Y  +A      +A +KI+GWG+E G PY         
Sbjct: 185 ------------------YKKGIYKHTAGKARGGHA-IKIIGWGKEGGVPY--------- 216

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                    W I +++   +G+ G  +ILRG N   IE  V
Sbjct: 217 -------------------------WLIANSWHNDWGENGYFRILRGSNHCGIEENV 248


>gi|329669000|gb|AEB96388.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
          Length = 232

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 65/236 (27%), Positives = 92/236 (38%), Gaps = 59/236 (25%)

Query: 9   STWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTND 68
           + W +    G+VTG  + S +GC+P  +PPC H        +C     P   C  +C  D
Sbjct: 56  AAWSYWVSNGIVTGSNYTSKSGCKPYPYPPCEHHIPEHHYKKCPKDIYPTNTCEYKC-QD 114

Query: 69  NYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVV 128
            Y   +  DK+     Y V  +VA IQ+EIM NGPV     +Y D   Y SG        
Sbjct: 115 GYSISYNSDKHYGASVYAVAQDVASIQKEIMTNGPVEVAFDVYEDFEHYSSG-------- 166

Query: 129 ANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIV 188
                   I+ + +G Y        +    VK++GWG EN                    
Sbjct: 167 --------IYKHTTGDY--------LGGHAVKMLGWGTEN-------------------- 190

Query: 189 AYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
                         G  YW   +++   +G+ G  +ILRG +E  IES V    PK
Sbjct: 191 --------------GTDYWICANSWNSDWGENGFFRILRGVDECEIESGVVAGEPK 232


>gi|161343821|tpg|DAA06091.1| TPA_inf: cathepsin B [Aphis gossypii]
          Length = 196

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 68/245 (27%), Positives = 99/245 (40%), Gaps = 67/245 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W      GLVTGG + S  GC+P   PPC +     +         P  K 
Sbjct: 13  CHGGYPIRAWKRFKNHGLVTGGDYKSGEGCEPYRVPPCPYDEQGNN----TCAGKPMEKN 68

Query: 62  HTRCTNDNYGRG--FFQDKYRFKR-YYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
           H RCT   YG     F + +R+ R YY++      IQ+++M  GP+ A+     D+    
Sbjct: 69  H-RCTRICYGDQELDFDEDHRYTRDYYYLT--YGSIQKDVMTYGPIEASF----DV---- 117

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
                          YSD  SYKSG+Y  + +A  +    VK++GWGE+ G PY      
Sbjct: 118 ---------------YSDFPSYKSGIYERTENATYLGGHAVKLIGWGEQYGIPY------ 156

Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                       W +V+++ E +GD G  KI RG NE  +++  
Sbjct: 157 ----------------------------WLMVNSWNEDWGDNGLFKIRRGTNECGVDNST 188

Query: 239 NGALP 243
              +P
Sbjct: 189 TAGVP 193


>gi|300952942|gb|ADK46902.1| cathepsin B [Radopholus similis]
          Length = 356

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 66/236 (27%), Positives = 97/236 (41%), Gaps = 69/236 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G     +    + G+VTG  + +N GC+P  F P     Y+T            P+C
Sbjct: 178 CNGGYPDEAFEHYAQSGVVTGSGNSANQGCKPYPFLPHTTVEYST------------PEC 225

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVN-DEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
             +C N  Y + + QDK+     Y V   +  DIQ EIM NGPV ANM +Y D   YKSG
Sbjct: 226 SKKCENYQYKKAYKQDKHFGMSVYNVQFSDPVDIQYEIMNNGPVEANMIVYYDFMFYKSG 285

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
                       +Y  +F +  G +A            V+IVGWG               
Sbjct: 286 ------------VYQTVFPWPLGGHA------------VRIVGWG--------------- 306

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
           V    ++                 PYW + +++   +G+ G  +I RG +E+ IES
Sbjct: 307 VDGPTKV-----------------PYWLVANSWNTDWGEDGYFRIRRGTDESYIES 345


>gi|239792046|dbj|BAH72408.1| ACYPI000003 [Acyrthosiphon pisum]
          Length = 182

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 65/227 (28%), Positives = 94/227 (41%), Gaps = 60/227 (26%)

Query: 17  RGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQ 76
           +G+V+GG + SN GC P    PC H    T  P CK      P C  +C  + Y   + Q
Sbjct: 15  KGIVSGGPYGSNMGCIPYEIAPCEHHVNGTRGP-CKE-GGKTPTCVKKC-EEGYKVPYAQ 71

Query: 77  DKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSD 136
           D +  K  Y + ++V  I+QEI  NGPV     +Y                        D
Sbjct: 72  DLHHGKSAYSIRNDVDQIRQEIYTNGPVEGAFTVY-----------------------ED 108

Query: 137 IFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLI 196
             +Y++GVY   A   +  +A ++I+GWG +NG                EI         
Sbjct: 109 FIAYRAGVYKHVAGKALGGHA-IRILGWGVQNG----------------EI--------- 142

Query: 197 GWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
                   PYW + +++   +G  G  KILRG +E  IE  +N  LP
Sbjct: 143 --------PYWLVANSWNTDWGSDGFFKILRGSDECGIEGQINAGLP 181


>gi|144952804|gb|ABP04056.1| cathepsin B-4 [Clonorchis sinensis]
          Length = 347

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 65/235 (27%), Positives = 91/235 (38%), Gaps = 61/235 (25%)

Query: 11  WVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPKCHTRCTNDN 69
           W +    G+VTGG +  +  C P  FPPC H     SE P C       P+C + C    
Sbjct: 165 WDYWRDNGIVTGGEYKDSHTCLPYPFPPCRHHGAKGSEYPPCPEKMYSTPQCVSEC-QKG 223

Query: 70  YGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVA 129
           Y   +  DK R    Y +   V  IQ+EI   GPV A M +Y+D  +Y  G Y +     
Sbjct: 224 YATKYEDDKIRASTSYNLYRSVTAIQKEIWMRGPVEATMNVYTDFANYAGGVYKH----- 278

Query: 130 NMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVRVYAVSASAEIV 188
                               + E++    ++++GWG EE+G PYW     +         
Sbjct: 279 -------------------TTGELLGGHAIRLLGWGVEEDGTPYWLAANSW--------- 310

Query: 189 AYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
                           P W          G+KG  +ILRG +   IES V+  LP
Sbjct: 311 ---------------NPSW----------GEKGFFRILRGSDHCGIESDVSAGLP 340


>gi|269146930|gb|ACZ28411.1| cathepsin b [Simulium nigrimanum]
          Length = 168

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 61/224 (27%), Positives = 87/224 (38%), Gaps = 60/224 (26%)

Query: 21  TGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYR 80
           +GG   SN GC P    PC H +   + P C       PKC   C   +Y   + QDK  
Sbjct: 5   SGGPFGSNQGCHPYKIAPCEH-HVNGTRPACNGEEGKTPKCIKHC-QASYTVAYEQDKSY 62

Query: 81  FKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSY 140
             + Y V   VA IQ+EIM NGPV     +Y D+  YK G Y +                
Sbjct: 63  GAKSYSVPHHVAQIQKEIMTNGPVEGAFTVYEDLVQYKDGVYQH---------------- 106

Query: 141 KSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGE 200
                    + +++    ++I+GWG EN  PY                            
Sbjct: 107 --------VTGKMLGGHAIRILGWGVENDVPY---------------------------- 130

Query: 201 ENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
                 W I +++   +G+ G  KILRG +   IES ++  +PK
Sbjct: 131 ------WLIANSWNTDWGNNGFFKILRGSDHCGIESQISAGIPK 168


>gi|329668994|gb|AEB96385.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
          Length = 316

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 69/237 (29%), Positives = 96/237 (40%), Gaps = 60/237 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   S W +  + G+VTGG + +   C+P   PPC      T    C T     P C
Sbjct: 135 CDGGWPVSAWQYFVETGVVTGGLYGTKDACRPYEIPPCGIHKNETFYSNC-TQEIDTPDC 193

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
            T C    Y   +  DK   K  Y V++ V  IQ+EIM  GPVVA   +Y D F YK+G 
Sbjct: 194 KTTC-QAGYPISYDDDKTYGKTAYSVSNSVHAIQKEIMTYGPVVAAFTVYDDFFHYKTG- 251

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ + SG       AE   +A V+I+GWG++ G P          
Sbjct: 252 ---------------IYKHVSG-------AEAGGHA-VRILGWGQQGGVP---------- 278

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                   YW + +++   +G+ G  +ILRG +E  IE  V
Sbjct: 279 ------------------------YWLVANSWNTDWGENGYFRILRGSDECGIEDGV 311


>gi|294929081|ref|XP_002779258.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239888294|gb|EER11053.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 288

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 62/223 (27%), Positives = 89/223 (39%), Gaps = 76/223 (34%)

Query: 13  WVHKRGLVTG------GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCT 66
           ++   G+VTG      G   S  GC P  FP C HA Y++            P C T+CT
Sbjct: 123 FLKNHGIVTGDEFKPAGQLSSADGCWPYPFPKCKHAGYSS------------PACQTKCT 170

Query: 67  NDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGP 126
           N  Y     QD +R K +  +     +I+QEI                       + NGP
Sbjct: 171 NKAYKTSLQQDLHRAKSFGRLPAIPQNIKQEI-----------------------FTNGP 207

Query: 127 VVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAE 186
           V+  + +Y DI  YK+GVY                                   V  +  
Sbjct: 208 VIGMLSIYEDIRVYKAGVY-----------------------------------VHQTGS 232

Query: 187 IVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 229
                T+K+IGWG E+G+ YW  V+++ E++GD G IK+  GR
Sbjct: 233 FQGIHTLKIIGWGVESGQDYWLAVNSWNEEWGDHGMIKLAVGR 275


>gi|209863086|ref|NP_001119616.2| cathepsin B-1674 precursor [Acyrthosiphon pisum]
 gi|239799412|dbj|BAH70627.1| ACYPI000012 [Acyrthosiphon pisum]
          Length = 334

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 63/234 (26%), Positives = 96/234 (41%), Gaps = 69/234 (29%)

Query: 11  WVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNY 70
           W ++   GLV+GG +++N GCQP   PP    N  T   E          C  RC  +N 
Sbjct: 168 WEYLKNHGLVSGGKYNTNNGCQPSKIPPI--GNLPTGLYE--------NTCEKRCYGNN- 216

Query: 71  GRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVAN 130
              + QD  + K +Y +  E  DIQ+E+   GPV     ++ +                 
Sbjct: 217 TINYNQDHVKIKNHYDI--EYEDIQREVQNYGPVSMAFRVFDN----------------- 257

Query: 131 MYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAY 190
                D F YKSGVY  + ++E + +   K++GWG ENG  Y                  
Sbjct: 258 -----DFFLYKSGVYEKTTNSEFIQWQYAKLIGWGVENGVDY------------------ 294

Query: 191 ATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
                           W +V+++G ++G  G  KI RG +E  IE+ V+   P+
Sbjct: 295 ----------------WLLVNSWGYEWGQNGLFKIKRGTDECNIETFVHAGEPQ 332


>gi|161343873|tpg|DAA06117.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 254

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 47/122 (38%), Positives = 62/122 (50%), Gaps = 4/122 (3%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQ-PK 60
           C  G    +W +  + G V+GG ++SN GCQP + PPC   N       C T    + P 
Sbjct: 132 CDGGSLFESWDFYRRHGFVSGGEYNSNQGCQPYTIPPCKLINEKPPGHSCTTFNREETPT 191

Query: 61  CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           C  +C N NY   F  D YR K YY V+  +A   +EI  NGP+    Y+Y D+  YKSG
Sbjct: 192 CEKKCNNPNYYTSFRADIYRGK-YYKVSPYMA--MKEIFDNGPITTQFYMYRDLVDYKSG 248

Query: 121 KY 122
            Y
Sbjct: 249 VY 250


>gi|183988834|gb|ACC66066.1| cathepsin B [Samia ricini]
          Length = 283

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 67/233 (28%), Positives = 95/233 (40%), Gaps = 61/233 (26%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G+ +  W +    GLV+GG ++S+ GC+P   PPC H       P C    T  PKC
Sbjct: 112 CNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMP-CNG-DTKTPKC 169

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C + +Y   F +DK   K  Y V+     I+ E+ KNGPV A   +Y          
Sbjct: 170 QKNCES-SYNVPFKKDKRYGKHVYSVSGHEDHIKAELFKNGPVEAAFTVY---------- 218

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                        SD+ SYK+GVY  +    +  +A +KI+GWG EN   Y         
Sbjct: 219 -------------SDLLSYKNGVYKHTEGNALGGHA-IKIIGWGVENNNKY--------- 255

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 234
                                    W I +++   +GD G  KILRG +   I
Sbjct: 256 -------------------------WLIANSWNSDWGDNGFFKILRGEDHCGI 283


>gi|161343849|tpg|DAA06105.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 334

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 64/234 (27%), Positives = 95/234 (40%), Gaps = 69/234 (29%)

Query: 11  WVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNY 70
           W ++   GLV+GG +++N GCQP   PP    N  T   E          C  RC  +N 
Sbjct: 168 WEYLKNHGLVSGGKYNTNNGCQPSKIPPI--GNLPTGLYE--------NTCEKRCYGNN- 216

Query: 71  GRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVAN 130
              + QD  + K +Y +  E  DIQ+E+   GPV     ++ +                 
Sbjct: 217 TINYNQDHVKIKNHYDI--EYEDIQREVQNYGPVSMAFKVFDN----------------- 257

Query: 131 MYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAY 190
                D F YKSGVY  + ++E + +   K++GWG ENG  YW +V              
Sbjct: 258 -----DFFLYKSGVYEKTTNSEFIQWQYAKLIGWGVENGVDYWLLVN------------- 299

Query: 191 ATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
                          +W      G ++G  G  KI RG +E  IE+ V+   P+
Sbjct: 300 ---------------FW------GYEWGQNGLFKIKRGTDECNIETFVHAGEPQ 332


>gi|107921791|gb|ABF85679.1| cathepsin B2 [Fasciola hepatica]
          Length = 278

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 51/172 (29%), Positives = 73/172 (42%), Gaps = 25/172 (14%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +  + G+VTGG   + TGCQP  F  C+H   +     C     P+P C
Sbjct: 132 CRGGYPPKAWDYWMREGIVTGGTWENRTGCQPWMFTKCDHVGDSRKYSRCPHYTYPKPPC 191

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y + + QDK+     Y V +  + I QEIMKNGPV     ++ D   Y+SG 
Sbjct: 192 ARACQT-GYNKTYEQDKFYGNSSYNVGEHESYIMQEIMKNGPVEVTFAIFQDFGVYRSGI 250

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYW 173
           Y +                         + + +    V+++GWG ENG  YW
Sbjct: 251 YHH------------------------VAGKFIGRHAVRMIGWGVENGVNYW 278


>gi|91089435|ref|XP_966663.1| PREDICTED: similar to AGAP004533-PA [Tribolium castaneum]
 gi|270012706|gb|EFA09154.1| cathepsin B precursor [Tribolium castaneum]
          Length = 320

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 69/243 (28%), Positives = 97/243 (39%), Gaps = 71/243 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G SS  W +    G+V+GG  +++ GC P S      A   ++ P C +        
Sbjct: 148 CGGGYSSRAWQYWVTDGIVSGGDFNTSQGCHPYSV----QAFRDSTTPNCSSF------- 196

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              CTN  Y + + +DK    R Y +   +  IQ EIM +GPV A+  +Y D +SY++G 
Sbjct: 197 ---CTNPKYQKNYSEDKRYGARSYRIAKNIEQIQAEIMTSGPVQASYVVYDDFYSYQNG- 252

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                      +Y  +    SG +            +VKI+GWG ENG  YW +      
Sbjct: 253 -----------VYQHVLGNVSGRH------------SVKILGWGRENGTDYWLVAN---- 285

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                           WG + GR         G      G  K LRG N   IES + G 
Sbjct: 286 ---------------SWGRDWGR--------LG------GFFKFLRGENHCDIESNILGG 316

Query: 242 LPK 244
            PK
Sbjct: 317 DPK 319


>gi|326427908|gb|EGD73478.1| cathepsin B [Salpingoeca sp. ATCC 50818]
          Length = 341

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 68/243 (27%), Positives = 96/243 (39%), Gaps = 62/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  ++ W +   +G+VTGG + SN GCQP S   C H      +P C  +  P P C
Sbjct: 160 CNGGYPAAAWEYWKNQGIVTGGQYDSNQGCQPYSLAKCEHHTTGPYKP-CGDI-VPTPAC 217

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   +  DK+     Y V   V  I  EIM NGPV A   +YSD  SYKSG 
Sbjct: 218 KRSC-RQGYNVTYPNDKHFGASSYGVRG-VDQIATEIMTNGPVEAAFTVYSDFLSYKSGV 275

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         S + +    +KI+GWG ++G            
Sbjct: 276 YQH------------------------TSGQPLGGHAIKIIGWGVQDG------------ 299

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW + +++ + +G+ G   I +G +E  IES V   
Sbjct: 300 ----------------------TDYWIVANSWNDSWGNDGFFWIKKGTDECGIESQVVAG 337

Query: 242 LPK 244
           LPK
Sbjct: 338 LPK 340


>gi|300176938|emb|CBK25507.2| unnamed protein product [Blastocystis hominis]
          Length = 320

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 67/243 (27%), Positives = 99/243 (40%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W W    G+ TGG + S   C   SFP C H       P  ++  TP+  C
Sbjct: 138 CDGGWLDMAWRWFQSTGVTTGGEYGSKDWCNAYSFPKCEHHAEGKYPPCGESQETPE--C 195

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C  + Y   + +DK+ F   Y+V   +  I+ E+M NGP+  + ++Y D  +YKSG 
Sbjct: 196 VKQC-QEGYPVEYEKDKHFFGEAYYVQGGIDAIKTELMTNGPLEVSFFVYEDFLTYKSGI 254

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +   VA  YL         G +A            VK+VGWG E+G            
Sbjct: 255 YQH---VAGKYL---------GGHA------------VKLVGWGVEDG------------ 278

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW I +++ E +G+ G  +I+ G+ E  IE    G 
Sbjct: 279 ----------------------IEYWKIANSWNEDWGENGYFRIVAGKGECGIEVGPIGG 316

Query: 242 LPK 244
           +PK
Sbjct: 317 IPK 319


>gi|44968648|gb|AAS49594.1| cathepsin B [Scyliorhinus canicula]
          Length = 206

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 56/168 (33%), Positives = 83/168 (49%), Gaps = 27/168 (16%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S  W +    GLV+GG ++S+ GC+P S  PC H +   S P+C +     P+C
Sbjct: 65  CNGGYPSGAWEFWTNDGLVSGGLYYSHIGCRPYSISPCEH-HVNGSRPKC-SGEIETPRC 122

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             RC    Y   + +DK+     Y +  +V +I  EI KNGPV A + ++ D   YKSG 
Sbjct: 123 SRRC-EAGYSPKYSEDKHYGLTSYSIGSDVTEIMTEIYKNGPVEAALEVFKDFLLYKSG- 180

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENG 169
                          ++ +K+G         I  +A +KI+GWGEENG
Sbjct: 181 ---------------VYQHKTG-------GSIGGHA-IKILGWGEENG 205


>gi|204022075|dbj|BAG71135.1| cathepsin B-S2 [Tuberaphis taiwana]
          Length = 334

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 63/233 (27%), Positives = 98/233 (42%), Gaps = 64/233 (27%)

Query: 11  WVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNY 70
           W +   +G+ TGG + +  GC P   PPC +      +  C     P  + H +C    Y
Sbjct: 163 WKYFRTQGVTTGGDYGTKEGCMPYKVPPCYNKQ---GKNTCG--GQPMERNH-QCPKTCY 216

Query: 71  GRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVAN 130
           G+   Q++Y+ K  Y +N  +  I+Q                D+ +Y       GPV A+
Sbjct: 217 GKTTVQNRYKTKSEYVMNS-IKTIEQ----------------DLKTY-------GPVEAS 252

Query: 131 MYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAY 190
             +Y D   YKSG+Y  +  A+     ++KI+GWG++NG PYW  V  ++          
Sbjct: 253 FDVYDDFSVYKSGIYRKTPKAKYQGGHSIKIIGWGQQNGTPYWLAVNSWS---------- 302

Query: 191 ATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
                          +W          G+ GT KI++GRNE  IE  V   +P
Sbjct: 303 --------------KFW----------GEHGTFKIIKGRNECGIERAVTAGIP 331


>gi|157058743|gb|ABV03129.1| cathepsin B-2744 [Pterocomma populeum]
          Length = 244

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 57/176 (32%), Positives = 77/176 (43%), Gaps = 32/176 (18%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK- 60
           C  G +   W +    G+VTGG  +SN GCQP    PC+H    +S   C +    Q   
Sbjct: 96  CHGGSAFKAWEFTMGNGIVTGGNFNSNEGCQPYKNRPCDHYG-DSSMTNCSSFRRTQMSI 154

Query: 61  CHTRCTNDNYGRGFFQDKYRFKRYY---WVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
           C  +C N NY   +  D ++    Y   W N  V  IQQEIM  GPV A MY+Y +   Y
Sbjct: 155 CREKCVNKNYKVKYEDDLHKTSVVYMTSWTN--VTQIQQEIMTYGPVTALMYVYENFMGY 212

Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWG-EENGRPY 172
           K G Y                         S   ++V Y  VK++GWG +++G  Y
Sbjct: 213 KEGIYK------------------------STVGDLVGYHHVKLIGWGVDDDGNEY 244


>gi|118122|sp|P25793.1|CYSP2_HAECO RecName: Full=Cathepsin B-like cysteine proteinase 2; Flags:
           Precursor
 gi|159165|gb|AAA29171.1| cathepsin B-like cysteine protease [Haemonchus contortus]
          Length = 342

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 65/241 (26%), Positives = 95/241 (39%), Gaps = 60/241 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +    G+V+GG + +   C+P    PC H    T   EC+  A P P C
Sbjct: 156 CEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPYPIHPCGHHGNDTYYGECRGTA-PTPPC 214

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C      + +  DK   K  Y V   V  IQ EI+KNGPVVA+  +Y D   YKSG 
Sbjct: 215 KRKC-RPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILKNGPVVASFAVYEDFRHYKSG- 272

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ + +G        E+  Y  VK++GWG E              
Sbjct: 273 ---------------IYKHTAG--------ELRGYHAVKMIGWGNE-------------- 295

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                               N   +W I +++   +G+KG  +I+RG N+  IE  +   
Sbjct: 296 --------------------NNTDFWLIANSWHNDWGEKGYFRIVRGSNDCGIEGTIAAG 335

Query: 242 L 242
           +
Sbjct: 336 I 336


>gi|294935195|ref|XP_002781337.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
 gi|239891887|gb|EER13132.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
          Length = 317

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 60/245 (24%), Positives = 96/245 (39%), Gaps = 60/245 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G+  + W ++   G+ T G+  +  GC P +FP C H    +    C       P C
Sbjct: 133 CKGGMILNAWSFLKTHGIATEGSMSAADGCWPYNFPKCAHHQKKSKYEPCSKKLYDTPSC 192

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             RC N+ YG    +D++       + +   +I++EIM NGP  A   +Y D  SYKSG 
Sbjct: 193 LDRCPNEKYGIPLDKDRHFTAHSPDLFEGTDNIKKEIMTNGPTSATFSVYEDFVSYKSGV 252

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         +  ++   +V+I+GWG E G  Y         
Sbjct: 253 YKH------------------------TNGTLMGIHSVEIIGWGTEKGVDY--------- 279

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                    W +++++ E +GD GT KI +G  +  I+  V G+
Sbjct: 280 -------------------------WLVMNSWNEGWGDHGTFKIAQG--DCGIDDAVLGS 312

Query: 242 LPKDN 246
            P  N
Sbjct: 313 PPAMN 317


>gi|254746338|emb|CAX16634.1| putative C1A cysteine protease precursor [Manduca sexta]
          Length = 337

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 66/242 (27%), Positives = 93/242 (38%), Gaps = 61/242 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ GI S  W +    G+V+GG ++S  GC+P   PPC H       P C    T  PKC
Sbjct: 152 CNGGIPSLAWEYWKHFGIVSGGNYNSTQGCRPYEIPPCEHHVPGNRMP-CSG-DTKTPKC 209

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C N  Y   + +DK   K  Y V+     I+ E+ KNGPV     +Y+D+ +YKSG 
Sbjct: 210 QKNCEN-GYNVMYKKDKRYGKHVYSVSAGEDHIRAELYKNGPVEGAFTVYADLLAYKSGV 268

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                           + +    +KI+GWG EN             
Sbjct: 269 YKH------------------------IQGDALGGHAIKILGWGVEN------------- 291

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW + +++   +GD G  KILRG N   IE  +   
Sbjct: 292 ---------------------DNKYWLVANSWNTDWGDNGFFKILRGENHCGIEGSIIAG 330

Query: 242 LP 243
            P
Sbjct: 331 EP 332


>gi|32566081|ref|NP_506002.2| Protein CPR-1 [Caenorhabditis elegans]
 gi|32172429|sp|P25807.2|CPR1_CAEEL RecName: Full=Gut-specific cysteine proteinase; Flags: Precursor
 gi|1395200|gb|AAB88058.1| gut-specific cysteine protease-1 [Caenorhabditis elegans]
 gi|24817276|emb|CAB01410.2| Protein CPR-1 [Caenorhabditis elegans]
          Length = 329

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 72/237 (30%), Positives = 91/237 (38%), Gaps = 69/237 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G       W   +G+VTGG +H   GC+P    PC   N     PE KT     P C
Sbjct: 155 CEGGYPIQALRWWDSKGVVTGGDYH-GAGCKPYPIAPCTSGNC----PESKT-----PSC 204

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + +DK+     Y V    A IQ EI  NGPV A   +Y D + YKSG 
Sbjct: 205 SMSC-QSGYSTAYAKDKHFGVSAYAVPKNAASIQAEIYANGPVEAAFSVYEDFYKYKSGV 263

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +    A  YL         G +A            +KI+GWG E+G PYW +   + V
Sbjct: 264 YKH---TAGKYL---------GGHA------------IKIIGWGTESGSPYWLVANSWGV 299

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
           +               WGE                    G  KI RG ++  IES V
Sbjct: 300 N---------------WGES-------------------GFFKIYRGDDQCGIESAV 322


>gi|294873367|ref|XP_002766594.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
 gi|239867622|gb|EEQ99311.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
          Length = 244

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 63/252 (25%), Positives = 96/252 (38%), Gaps = 67/252 (26%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAH------HSNTGCQPVSFPPCNHANYTTSEPECKTLA 55
           CS G   ++W ++H  G+V+GG         +  GC P SFP C H    +    C    
Sbjct: 53  CSGGNPITSWTFLHTNGIVSGGGFVPEKNMKAADGCWPYSFPKCAHHQDGSDYKPCAKEI 112

Query: 56  TPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVN-DEVADIQQEIMKNGPVVANMYLYSDI 114
              P C + C N  YG  F +D++  +  +       + I++EIM NGP  A   +Y D 
Sbjct: 113 YDTPSCSSSCPNAKYGTAFDKDRHYTESLFPSRFGSTSSIKKEIMTNGPTSAAFSVYEDF 172

Query: 115 FSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWT 174
            SYKSG Y +                         S   +    V+I+GWG E G  YW 
Sbjct: 173 LSYKSGVYKH------------------------TSGGFLGGHAVEIIGWGTEKGVDYWL 208

Query: 175 IVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 234
           ++                                   ++ E++GD GT KI++G  +  I
Sbjct: 209 VMN----------------------------------SWNEEWGDHGTFKIVQG--DCGI 232

Query: 235 ESLVNGALPKDN 246
           +  +    P  N
Sbjct: 233 DDTILAGTPAMN 244


>gi|193629592|ref|XP_001944624.1| PREDICTED: cathepsin B-like isoform 4 [Acyrthosiphon pisum]
          Length = 331

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 68/239 (28%), Positives = 100/239 (41%), Gaps = 79/239 (33%)

Query: 11  WVWVHKRGLVTGGA-HHSNTGCQPVSFPP-CNHANYTTSEPECKTLATPQPKCHTRCTND 68
           W +    GLV+GG+ +++N GCQP   PP CN                P       C + 
Sbjct: 165 WEYFKTHGLVSGGSIYNTNDGCQPSKIPPVCN---------------LPTKINKRTCVDY 209

Query: 69  NYGRG---FFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNG 125
            YG     +  D  + + YY V  +  DIQ+E+                 +Y       G
Sbjct: 210 CYGNDTIKYNHDHVKVRYYYHVKPK--DIQKEVQ----------------TY-------G 244

Query: 126 PVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASA 185
           PV A + LY DIF +KSGVY ++ +A+ V    VK++GWG ENG  Y             
Sbjct: 245 PVTAALNLYDDIFLHKSGVYTLTKNAKYVRLQYVKLIGWGVENGVDY------------- 291

Query: 186 EIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
                                W +V+++G ++G  G +KI RG+    +ES V  A+PK
Sbjct: 292 ---------------------WLLVNSWGNEWGQNGLLKIKRGKYGCAVESFVYAAVPK 329


>gi|294891885|ref|XP_002773787.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239878991|gb|EER05603.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 234

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 60/200 (30%), Positives = 82/200 (41%), Gaps = 60/200 (30%)

Query: 27  SNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYY 85
           ++ GC P  FP CNH     S+ P C  +    P C T C N  YG    +D +R K + 
Sbjct: 38  NDDGCWPYPFPKCNHVPGLESKYPRCAQVRD-LPACATTCPNKAYGTSMQKDTHRAKSWG 96

Query: 86  WVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVY 145
            +      I+QEI                       + NGPV A M LY D   YKSGVY
Sbjct: 97  RLPIGPEKIKQEI-----------------------FDNGPVAAMMTLYEDFRFYKSGVY 133

Query: 146 AVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRP 205
                                              V  + +++A  T+KLIGWG E+G+ 
Sbjct: 134 -----------------------------------VHKTGQMLAAHTLKLIGWGVESGQE 158

Query: 206 YWTIVSTFGEQFGDKGTIKI 225
           YW  V+ + E++GD G IK+
Sbjct: 159 YWLAVNAWNEEWGDHGMIKL 178


>gi|193688334|ref|XP_001945855.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
           [Acyrthosiphon pisum]
          Length = 313

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 60/236 (25%), Positives = 92/236 (38%), Gaps = 71/236 (30%)

Query: 9   STWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTND 68
           S W ++   G+V+GG ++SN GCQP  FPP               L   Q  C   C   
Sbjct: 147 SIWEYLKSHGVVSGGKYNSNDGCQPFKFPPI-----------ANILTHLQHTCDDHCYG- 194

Query: 69  NYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVV 128
           N    +  D  R + YY                   +   Y+  ++ +Y       GPV 
Sbjct: 195 NTSINYNHDHVRVRNYY------------------TIRTGYIQKEVQTY-------GPVA 229

Query: 129 ANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIV 188
               +  D   YKSGVY  S +A+++     K++GWG ENG  Y                
Sbjct: 230 VQFKVCDDFLLYKSGVYVKSDNAKVIRTQYAKLIGWGVENGVDY---------------- 273

Query: 189 AYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
                             W +++++G ++G KG  KI RG N+  +ES+V   +P+
Sbjct: 274 ------------------WLVINSWGHEWGQKGLFKIKRGTNQCGVESVVYAGVPE 311


>gi|294945206|ref|XP_002784584.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239897729|gb|EER16380.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 298

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 63/237 (26%), Positives = 92/237 (38%), Gaps = 66/237 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNT------GCQPVSFPPCNHA-NYTTSEPECK-T 53
           C+ G       ++   G+VTG             GC P  F  CNH     T  P+CK  
Sbjct: 108 CNGGRLVEAMSFLRDHGVVTGNDFKPQDQLREADGCWPYPFQKCNHVPTEGTGYPKCKDV 167

Query: 54  LATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSD 113
           +  P P C T CTN  Y +   +D +R K +  V ++   I+QEI               
Sbjct: 168 VQQPVPPCRTTCTNKAYKKSLEKDVHRAKSWRKVLNDAQSIKQEI--------------- 212

Query: 114 IFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYW 173
                   + NGPV +   +Y D   YKSGVY                            
Sbjct: 213 --------FDNGPVFSAFEMYKDFRYYKSGVY---------------------------- 236

Query: 174 TIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 230
                  V  + E+     +K+IGWG ++ R YW  ++ + E++GD G IK+  G+N
Sbjct: 237 -------VPTTKEVDCLHVIKIIGWGADSVREYWLAMNAWNEEWGDHGLIKMAFGKN 286


>gi|356984175|gb|AET43950.1| cathepsin B, partial [Reishia clavigera]
          Length = 209

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 71/243 (29%), Positives = 98/243 (40%), Gaps = 62/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S+ W      G+VTGG ++S  GCQP     C+H      +P CK      P+C
Sbjct: 28  CNGGYPSAAWEVFDHDGVVTGGQYNSKQGCQPYLIAACDHHVVGKLKP-CKGDGK-TPRC 85

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C    Y   F  DK+  +R Y V+  V DI +E++  GPV A   +Y          
Sbjct: 86  EKKCEA-GYNVTFKDDKHYGQRSYSVS-SVNDIMEELVTRGPVEAAFTVY---------- 133

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                        SD   Y SGVY  +  + +  +A VKI+G+G ENG  YW +   +  
Sbjct: 134 -------------SDFLQYHSGVYRHTTGSALGGHA-VKILGYGVENGDKYWLVANSW-- 177

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                  P W          GD+G  KILRG +E  IE  +   
Sbjct: 178 ----------------------NPDW----------GDQGFFKILRGVDECGIEGQIVAG 205

Query: 242 LPK 244
            PK
Sbjct: 206 EPK 208


>gi|86279343|gb|ABC88767.1| putative cathepsin B-like proteinase [Tenebrio molitor]
          Length = 321

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 66/243 (27%), Positives = 94/243 (38%), Gaps = 73/243 (30%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   S   +    G+V+GG  +SN GC+P          YT    +        P C
Sbjct: 151 CGGGYMMSALDFYINEGIVSGGDVNSNEGCRP----------YTADAHD----QGQTPAC 196

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C N  Y   +  DK+     Y V+  +  IQ E+M NGP++ N  ++ D ++Y SG 
Sbjct: 197 TKSCRN-GYSTSYSADKHYGSNDYVVSSVIDQIQYEVMTNGPIIVNFEVFQDFYNYVSGV 255

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         S E V +  VKIVGWG ENG PY         
Sbjct: 256 YRH------------------------VSGESVGFHVVKIVGWGVENGVPY--------- 282

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                    W I +++G  +GD G  K+LRG+NE  IE+     
Sbjct: 283 -------------------------WLIANSWGSSWGDHGFFKMLRGQNECGIENYPYAV 317

Query: 242 LPK 244
           +P+
Sbjct: 318 MPR 320


>gi|118118|sp|P19092.1|CYSP1_HAECO RecName: Full=Cathepsin B-like cysteine proteinase 1; Flags:
           Precursor
 gi|159173|gb|AAA29175.1| cysteine protease (AC-1) [Haemonchus contortus]
          Length = 342

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 64/241 (26%), Positives = 95/241 (39%), Gaps = 60/241 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +    G+V+GG + +   C+P    PC H    T   EC+  A P P C
Sbjct: 156 CEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPYPIHPCGHHGNDTYYGECRGTA-PTPPC 214

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C      + +  DK   K  Y V   V  IQ EI++NGPVVA+  +Y D   YKSG 
Sbjct: 215 KRKC-RPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILRNGPVVASFAVYEDFRHYKSG- 272

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ + +G        E+  Y  VK++GWG E              
Sbjct: 273 ---------------IYKHTAG--------ELRGYHAVKMIGWGNE-------------- 295

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                               N   +W I +++   +G+KG  +I+RG N+  IE  +   
Sbjct: 296 --------------------NNTDFWLIANSWHNDWGEKGYFRIIRGTNDCGIEGTIAAG 335

Query: 242 L 242
           +
Sbjct: 336 I 336


>gi|341888694|gb|EGT44629.1| hypothetical protein CAEBREN_31940 [Caenorhabditis brenneri]
          Length = 374

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 69/234 (29%), Positives = 91/234 (38%), Gaps = 68/234 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G S     +    G VTGG  ++  GC P SF PC        +  C    TP   C
Sbjct: 167 CQGGYSIEALRFWKSSGAVTGG-DYNGAGCMPYSFAPCK-------KDSCAQGTTPS--C 216

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
            T C +      + +DK+     Y + + VA IQ EI  NGPV A+  +Y D + YKSG 
Sbjct: 217 KTTCQSSYKTAEYTKDKHFGTTAYKITNSVAAIQTEIYHNGPVEASFKVYEDFYKYKSG- 275

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          ++ Y SG        ++V    VKI+GWG ENG  Y         
Sbjct: 276 ---------------VYQYTSG--------KLVGGHAVKIIGWGTENGVDY--------- 303

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
                                    W I +++G  FGD G  K+ RG NE  IE
Sbjct: 304 -------------------------WLIANSWGTTFGDSGFFKMRRGTNEVGIE 332


>gi|294877489|ref|XP_002768007.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239870145|gb|EER00725.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 344

 Score = 84.3 bits (207), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 69/272 (25%), Positives = 106/272 (38%), Gaps = 87/272 (31%)

Query: 2   CSSGISSSTWVWVHKRGLVTGG------AHHSNTGCQPVSFPPCNHANYTTSEPECKTLA 55
           C  GI+ + W ++   G+VTGG      +  +  GC P SFP C H    +    C  + 
Sbjct: 133 CQGGIARAAWSFLKMHGIVTGGDFVPKGSMSAADGCWPYSFPKCAHDQEDSKYEPCPEVR 192

Query: 56  TP--------------------QPKCHTRCTNDNYGRGFFQDKYRFKRYY-WVNDEVADI 94
            P                     P C  RC N+ YG    +D++   R   ++ +   +I
Sbjct: 193 VPPLGERHQRGAGASIHQKLYDTPSCLDRCPNEKYGTPRDKDRHFTARALPYLFEGTDNI 252

Query: 95  QQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIV 154
           ++EIM NGP  A+   Y D  SYKSG                ++ + SG Y        +
Sbjct: 253 KKEIMTNGPTSASFSTYEDFSSYKSG----------------VYKHTSGGY--------L 288

Query: 155 AYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFG 214
              +V+I+GWG E G  Y                                  W +++++ 
Sbjct: 289 GDHSVEIIGWGTEKGVDY----------------------------------WLVMNSWN 314

Query: 215 EQFGDKGTIKILRGRNEAIIESLVNGALPKDN 246
           E +GD GT KI +G  +  I+  V G+LP  N
Sbjct: 315 EGWGDHGTFKIAQG--DCGIDDAVQGSLPAMN 344


>gi|157058767|gb|ABV03141.1| cathepsin B-348 [Sitobion avenae]
          Length = 252

 Score = 84.3 bits (207), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 56/179 (31%), Positives = 83/179 (46%), Gaps = 28/179 (15%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   + W +   +G+V+GG + SN GC P    PC H    T  P CK      PKC
Sbjct: 97  CNGGFPGAAWHYWKTKGIVSGGPYGSNMGCIPYEIAPCEHHVNGTRGP-CKE-GGKTPKC 154

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C  D Y   + QD +R K  Y ++++V  I+QEI  NGPV     +Y          
Sbjct: 155 VKKC-EDGYKVPYEQDLHRGKSAYSLSNDVDQIRQEIYTNGPVEGAFTVY---------- 203

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGR-PYWTIVRVY 179
                         D  +Y++GVY   A   +  +A ++I+GWG +NG  PYW +   +
Sbjct: 204 -------------EDFIAYRAGVYKHVAGKALGGHA-IRILGWGVQNGEIPYWLVANSW 248


>gi|56758644|gb|AAW27462.1| unknown [Schistosoma japonicum]
          Length = 294

 Score = 84.3 bits (207), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 48/121 (39%), Positives = 62/121 (51%), Gaps = 2/121 (1%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +  KRG+VTGG+  ++TGCQP  FP C H +     P C T     P+C
Sbjct: 159 CQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEH-HTKGKYPACGTKIYKTPQC 217

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C    Y   + QDK+  +  Y V      IQ+EIM NGPV A   +Y D  +YKSG 
Sbjct: 218 KQKCQK-GYKTPYEQDKHYGEESYNVISNEKAIQKEIMMNGPVEAAFDVYEDFLNYKSGI 276

Query: 122 Y 122
           Y
Sbjct: 277 Y 277


>gi|328722316|ref|XP_003247542.1| PREDICTED: cathepsin B-like isoform 2 [Acyrthosiphon pisum]
 gi|328722318|ref|XP_003247543.1| PREDICTED: cathepsin B-like isoform 3 [Acyrthosiphon pisum]
          Length = 276

 Score = 84.0 bits (206), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 68/239 (28%), Positives = 100/239 (41%), Gaps = 79/239 (33%)

Query: 11  WVWVHKRGLVTGGA-HHSNTGCQPVSFPP-CNHANYTTSEPECKTLATPQPKCHTRCTND 68
           W +    GLV+GG+ +++N GCQP   PP CN                P       C + 
Sbjct: 110 WEYFKTHGLVSGGSIYNTNDGCQPSKIPPVCN---------------LPTKINKRTCVDY 154

Query: 69  NYGRG---FFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNG 125
            YG     +  D  + + YY V  +  DIQ+E+                 +Y       G
Sbjct: 155 CYGNDTIKYNHDHVKVRYYYHVKPK--DIQKEVQ----------------TY-------G 189

Query: 126 PVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASA 185
           PV A + LY DIF +KSGVY ++ +A+ V    VK++GWG ENG  Y             
Sbjct: 190 PVTAALNLYDDIFLHKSGVYTLTKNAKYVRLQYVKLIGWGVENGVDY------------- 236

Query: 186 EIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
                                W +V+++G ++G  G +KI RG+    +ES V  A+PK
Sbjct: 237 ---------------------WLLVNSWGNEWGQNGLLKIKRGKYGCAVESFVYAAVPK 274


>gi|312374702|gb|EFR22199.1| hypothetical protein AND_15622 [Anopheles darlingi]
          Length = 339

 Score = 84.0 bits (206), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 73/240 (30%), Positives = 94/240 (39%), Gaps = 69/240 (28%)

Query: 4   SGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHT 63
            G S   WV     GLV+GGA++S  GC+P  F PC +        E        PKC  
Sbjct: 167 DGTSFQYWV---DAGLVSGGAYNSTEGCKPYPFKPCLYPFTDCHREE-------SPKCKH 216

Query: 64  RCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYG 123
            C +    + + +DK      Y V  +   I+ EIM NGPV     +Y D+F YKSG Y 
Sbjct: 217 HCQH-GVDKRYARDKVFGSVAYSVPRDERVIRYEIMTNGPVEGGFDVYEDVFLYKSGVYR 275

Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
           +                   VY      E V    V+I+GWG E G PY           
Sbjct: 276 H-------------------VY-----GEHVGKHAVRIIGWGREGGIPY----------- 300

Query: 184 SAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
                                  W I +++GE +GD G  KI+RG N   IES V   LP
Sbjct: 301 -----------------------WLISNSYGEDWGDHGYFKIVRGINHLGIESKVITGLP 337


>gi|161343877|tpg|DAA06119.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 145

 Score = 84.0 bits (206), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 56/206 (27%), Positives = 83/206 (40%), Gaps = 62/206 (30%)

Query: 38  PCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQE 97
           PC H       P C       P+C  +C N +YG  + +D ++  +Y           +E
Sbjct: 1   PCQHTESAVENP-CSNKTFFTPECKVQCYNPDYGTRYVKDNHKGTQY---RIPGYTAMKE 56

Query: 98  IMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYA 157
           I +NGP+ A+ Y+Y D  +Y+SG                ++++ SG Y  + +       
Sbjct: 57  IYENGPITASFYMYQDFVNYQSG----------------VYAFNSGKYVTTQA------- 93

Query: 158 TVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQF 217
            VKI+GWGEENG P                                  YW   ++F   +
Sbjct: 94  -VKILGWGEENGTP----------------------------------YWLAANSFNTYW 118

Query: 218 GDKGTIKILRGRNEAIIESLVNGALP 243
           GD G +KILRG NE  IE  +   LP
Sbjct: 119 GDNGFVKILRGANECYIEEFMYAGLP 144


>gi|254575663|gb|ACT68328.1| cysteine proteinase [Haemonchus contortus]
          Length = 348

 Score = 83.6 bits (205), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 55/202 (27%), Positives = 81/202 (40%), Gaps = 41/202 (20%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G ++  W W    G+VTGGA+     C+P  FP C  A+   +   C +     P C
Sbjct: 166 CDGGYNARAWKWATIAGVVTGGAYKEKGNCKPYVFPQCG-AHKGKAFNNCPSHPYATPAC 224

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    YG+ +  DK + K +YW+ ++   IQ EIMK GPV A   +Y D   Y  G 
Sbjct: 225 KPYCQY-GYGKRYENDKIKAKTWYWLPNDERTIQLEIMKKGPVHATFNIYEDFEHYNGGV 283

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y                        +  +  +    ++KI+GWG + G  YW I   ++ 
Sbjct: 284 Y------------------------IHTAGAMEGGHSIKIIGWGVDKGVKYWLIANSWST 319

Query: 182 SASAEIVAYATVKLIGWGEENG 203
                           WGE+ G
Sbjct: 320 D---------------WGEDGG 326


>gi|161343879|tpg|DAA06120.1| TPA_inf: cathepsin B [Toxoptera citricida]
          Length = 340

 Score = 83.6 bits (205), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 71/252 (28%), Positives = 99/252 (39%), Gaps = 75/252 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNH---ANYTTSEPECKTLATPQ 58
           C+ G     W    K GLVTGG + S  GC+P   PPC +    N T S         P 
Sbjct: 157 CNGGYPIKAWERFKKHGLVTGGEYKSGEGCEPYRVPPCPYDESGNNTCS-------GKPM 209

Query: 59  PKCHTRCTNDNYGRGFFQDK--YRFKR--YYWVNDEVADIQQEIMKNGPVVANMYLYSDI 114
            + H RCT   YG         +R  R  YY     +  IQ+++M  GP+ A+     D+
Sbjct: 210 EQNH-RCTRMCYGDQDLDFDDDHRHTRDSYYLT---IGSIQKDVMTYGPIEASF----DV 261

Query: 115 FSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWT 174
                              Y D  SYKSGVY  S +A  +    VK++GWGEE G P   
Sbjct: 262 -------------------YDDFLSYKSGVYVRSENASYLGGHAVKLIGWGEEYGTP--- 299

Query: 175 IVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 234
                                          YW +++++   +GD+G  KI RG NE  +
Sbjct: 300 -------------------------------YWLMMNSWNADWGDEGLFKIRRGTNECGV 328

Query: 235 ESLVNGALPKDN 246
           ++     +P  N
Sbjct: 329 DNSTTAGVPVTN 340


>gi|195165479|ref|XP_002023566.1| GL19846 [Drosophila persimilis]
 gi|194105700|gb|EDW27743.1| GL19846 [Drosophila persimilis]
          Length = 329

 Score = 83.6 bits (205), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 56/176 (31%), Positives = 82/176 (46%), Gaps = 30/176 (17%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   + W +  ++G+V+GG + S  GC+P    PC H +   + P C   +TP   C
Sbjct: 155 CNGGFPGAAWSYWTRKGIVSGGPYGSTQGCRPYEIAPCEH-HVNGTRPPCSHGSTPS--C 211

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C   +Y   + +DK    + Y V   VA+IQQEIM NGPV     +Y D        
Sbjct: 212 QHKC-QASYSVEYAKDKNFGSKSYSVRRNVAEIQQEIMTNGPVEGAFTVYED-------- 262

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWG--EENGRPYWTI 175
                          +  YKSGVY      E+  +A ++I+GWG   E+  PYW I
Sbjct: 263 ---------------LILYKSGVYQHEHGKELGGHA-IRILGWGVWGESKVPYWLI 302


>gi|119638965|gb|ABL85237.1| cysteine proteinase 3 [Necator americanus]
          Length = 360

 Score = 83.6 bits (205), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 67/244 (27%), Positives = 99/244 (40%), Gaps = 60/244 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G +   W +    G+ TGG + +   C+P +F PC   +Y     +C   + P PKC
Sbjct: 159 CGGGHTIRAWEYFKNTGVCTGGLYGTKDSCKPYAFYPCKDESYG----KCPKDSFPTPKC 214

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y + +  DKY     Y +      I+ EIM+NGPV A+  +Y D F +    
Sbjct: 215 RKICQY-KYSKKYADDKYYANSAYRIPQNETWIKLEIMRNGPVTASFRIYPD-FGF---- 268

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                             Y+ GVY  S   E+  +A +KI+GWG E              
Sbjct: 269 ------------------YEKGVYVTSGGRELGGHA-IKIIGWGTE-------------- 295

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGD-KGTIKILRGRNEAIIESLVNG 240
                       K+ G       PYW I +++G  +G+  G  +ILRG+N   IE  V  
Sbjct: 296 ------------KVNG----TDLPYWLIANSWGTDWGENNGYFRILRGQNHCQIEQKVIA 339

Query: 241 ALPK 244
            + K
Sbjct: 340 GMIK 343


>gi|166030312|gb|ABY78823.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 335

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 62/234 (26%), Positives = 89/234 (38%), Gaps = 70/234 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   + W +    GL       +++ CQP  FP C+H      +P C       PKC
Sbjct: 158 CDGGYPDAAWRYYVSHGL-------ASSYCQPYPFPHCDHHGGKGKKPPCSKYDFHTPKC 210

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +T CT+    +     KYR    Y V+ E                          YK   
Sbjct: 211 NTTCTD----KAIPLIKYRGNHSYEVHGEE------------------------DYKREL 242

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y NGP V    +YSD F+YK+GVY                                    
Sbjct: 243 YFNGPFVVAFQVYSDFFAYKTGVYR----------------------------------- 267

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
             S +++    V+++GWG+ NG PYW I +++   +G  G   ILRG++E  IE
Sbjct: 268 HVSGDVLGGHAVRIVGWGKLNGTPYWKIANSWDTDWGMNGHFLILRGKDECGIE 321


>gi|159179|gb|AAA29178.1| cysteine proteinase, partial [Haemonchus contortus]
          Length = 341

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 65/241 (26%), Positives = 95/241 (39%), Gaps = 60/241 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G S   W +    GLV+GG + S   C+P    PC H    T   EC   A+  P C
Sbjct: 155 CDGGWSIKAWEYFTYAGLVSGGEYRSKRCCRPYPIHPCGHHGNDTYYGECPEEAS-TPSC 213

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C    Y + +  DK      + +   V  IQ+E++KNGPV A+  +Y D   YKSG 
Sbjct: 214 KKKC-QPGYRKLYRMDKRYGTDAFQLPKSVEAIQKELLKNGPVTASFAVYEDFSLYKSG- 271

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ + +G        E+  Y  VK++GWG EN   Y         
Sbjct: 272 ---------------IYRHTAG--------ELRGYHAVKMIGWGTENRTDY--------- 299

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                    W I +++ + +G+ G  +I+RG N+  IE  V   
Sbjct: 300 -------------------------WLIANSWHDDWGENGYFRIIRGINDCGIEENVAAG 334

Query: 242 L 242
           L
Sbjct: 335 L 335


>gi|294885809|ref|XP_002771442.1| cathepsin L precursor, putative [Perkinsus marinus ATCC 50983]
 gi|239875086|gb|EER03258.1| cathepsin L precursor, putative [Perkinsus marinus ATCC 50983]
          Length = 527

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 65/228 (28%), Positives = 96/228 (42%), Gaps = 65/228 (28%)

Query: 17  RGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQ 76
           RG +T G      GC P  FPPC H    T  P+C   +   P C  +C N  Y      
Sbjct: 364 RGNLTKG-----DGCWPYDFPPCAHHINDTKYPKCPKGSYETPNCVEQCHNPKYTTSLKN 418

Query: 77  DKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSD 136
           D     R+Y            ++++ P     Y YS + + K+    +GP+ A+  +Y D
Sbjct: 419 D-----RHY------------MLESSP-----YQYS-VNNAKNAIRTDGPISASYLVYED 455

Query: 137 IFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLI 196
             +YKSGVY  ++ + +  +A                                   VK+I
Sbjct: 456 FLAYKSGVYKHTSGSYLGGHA-----------------------------------VKII 480

Query: 197 GWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
           GWGEENG  YW +V+++ E +GD+G  KI  G  E  I+  + G  PK
Sbjct: 481 GWGEENGEAYWLVVNSWNEDWGDQGLFKIALGNCE--IDDDLLGGTPK 526


>gi|7507648|pir||T24819 hypothetical protein T10H4.12 - Caenorhabditis elegans
          Length = 324

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 74/263 (28%), Positives = 99/263 (37%), Gaps = 67/263 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G S     +    G VTGG +  + GC P SF PC      ++ P CKT      K 
Sbjct: 100 CKGGYSIEALRFWASSGAVTGGDYGGH-GCMPYSFAPCTKNCPESTTPSCKTTCQSSYKT 158

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYY--------WVNDEVADIQQEIMKNGPVVANMYLYSD 113
                + +YG   +    RF+R+              V +IQ EI   GPV A+  +Y D
Sbjct: 159 EEYKKDKHYGELVWHSFNRFQRFLNRASAYKVTTTKSVTEIQTEIYHYGPVEASYKVYED 218

Query: 114 IFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYW 173
            + YKSG                ++ Y SG        ++V    VKI+GWG ENG  Y 
Sbjct: 219 FYHYKSG----------------VYHYTSG--------KLVGGHAVKIIGWGVENGVDY- 253

Query: 174 TIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 233
                                            W I +++G  FG+KG  KI RG NE  
Sbjct: 254 ---------------------------------WLIANSWGTSFGEKGFFKIRRGTNECQ 280

Query: 234 IESLVNGALPKDNYGVEFGEESG 256
           IE  V   + K     E  E+ G
Sbjct: 281 IEGNVVAGIAKLGTHSETYEDDG 303


>gi|121073168|gb|ABM47070.1| cathepsin B1 [Clonorchis sinensis]
 gi|358341105|dbj|GAA29748.2| cathepsin B [Clonorchis sinensis]
          Length = 339

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 62/243 (25%), Positives = 97/243 (39%), Gaps = 61/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G  +  W +  + GLVTG  +++   C+P SFPPC H      +P      TPQ  C
Sbjct: 157 CQGGYPAQAWEYWVRNGLVTGDLYNTTDTCRPYSFPPCEHHVVGPRKPCTGDPTTPQ--C 214

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C  + Y + +  DK+   + Y ++ +   I +++M  GP+  +  +Y+D  SY SG 
Sbjct: 215 VKKCQPE-YPKTYENDKWYGLKAYSIHSDQEAIMRDLMTYGPLEVDFEVYADFPSYSSGV 273

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         +  ++    V++VGWG E+G            
Sbjct: 274 YRH------------------------VAGGLLGGHAVRLVGWGVEDG------------ 297

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW I +++   +GD G  KI RG NE  IES  N  
Sbjct: 298 ----------------------ADYWLIANSWNTDWGDGGYFKIRRGVNECGIESDANAG 335

Query: 242 LPK 244
            PK
Sbjct: 336 HPK 338


>gi|161343851|tpg|DAA06106.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 333

 Score = 82.8 bits (203), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 67/243 (27%), Positives = 96/243 (39%), Gaps = 65/243 (26%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +  K GLVTGG  +S  GCQP  FPPC      T    C   +    KC
Sbjct: 153 CQGGYPIRAWRYYSKHGLVTGGNFNSFEGCQPYMFPPC------TGNNSCSGQSEKNHKC 206

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C  +          YR  R Y            + ++  V+A   + +DI +Y    
Sbjct: 207 QKKCFGNT------SISYRGDRRY------------VERSPYVLAYDNMQNDIMTY---- 244

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
              GP+ ++  +Y D  SYKSGVY  S +A  +   +VK +GWG E             V
Sbjct: 245 ---GPIESSFDVYDDFISYKSGVYFKSPNATYLGGHSVKCIGWGVERN-----------V 290

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
           S                       YW +++++   +GD G  KI RG NE  +E      
Sbjct: 291 S-----------------------YWLMMNSWNNTWGDGGNFKIRRGTNECQVEDSSTAG 327

Query: 242 LPK 244
           +P+
Sbjct: 328 MPE 330


>gi|124502519|gb|ABN13633.1| cysteine proteinase [Haemonchus contortus]
          Length = 342

 Score = 82.8 bits (203), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 64/241 (26%), Positives = 94/241 (39%), Gaps = 60/241 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +    G+V+GG + +   C+P    PC H    T   EC+  A P P C
Sbjct: 156 CEGGWPIEAWKYFIYDGVVSGGEYLTKGVCRPYPIHPCGHHGNDTYYGECRGTA-PTPPC 214

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C      + +  DK   K  Y V   V  IQ EI++NGPVVA+  +Y D   YKSG 
Sbjct: 215 KKEC-RPGVRKVYRIDKRYGKDAYIVKQSVKAIQSEILRNGPVVASFAVYEDFRHYKSG- 272

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ + +G        E+  Y  VK++GWG E              
Sbjct: 273 ---------------IYKHTAG--------ELRGYHAVKMIGWGNE-------------- 295

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                               N   +W I +++   +G+KG  +I+RG N+  IE  +   
Sbjct: 296 --------------------NNTDFWLIANSWHNDWGEKGYFRIIRGTNDCGIEGTIAAG 335

Query: 242 L 242
           +
Sbjct: 336 I 336


>gi|294955270|ref|XP_002788457.1| cysteine protease, putative [Perkinsus marinus ATCC 50983]
 gi|239903926|gb|EER20253.1| cysteine protease, putative [Perkinsus marinus ATCC 50983]
          Length = 392

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 49/178 (27%), Positives = 76/178 (42%), Gaps = 29/178 (16%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   + W +++  G+ T G+  +  GC P +FP C H    +    C       P C
Sbjct: 159 CTKGRPDAAWSFLNVYGIATEGSMSAADGCWPYNFPKCGHHQQDSKYQPCPEKNYDTPPC 218

Query: 62  HTRCTNDNYGRGFFQDKY---RFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
             RC N NYG    +D++    F  Y     +  +I++EIM NGP  A   +Y D  SY+
Sbjct: 219 LDRCPNKNYGTPLDKDRHFTAHFSPYQLKGTD--NIKKEIMTNGPTSAAFSMYDDFLSYE 276

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
           SG Y +                         S  ++    V+I+GWG + G  YW ++
Sbjct: 277 SGVYKH------------------------TSGTLMGEHGVEIIGWGTKQGVDYWLVM 310


>gi|3912916|gb|AAC78691.1| thiol protease [Trichuris suis]
          Length = 348

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 69/250 (27%), Positives = 97/250 (38%), Gaps = 76/250 (30%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPV-------------SFPPCNHANYTTSE 48
           C+ G     W      G  TGG      GC+P               + PC +  Y    
Sbjct: 153 CNGGFPIEAWRHFTVAGNCTGGKTIDKYGCKPYKPTGPIGRHLKRNDYAPCPNDTYYG-- 210

Query: 49  PECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANM 108
            EC  +A   P+C  RC    Y + +  D+Y  K  Y V   V  IQ+EIMKNGPVVA+ 
Sbjct: 211 -ECVGMAD-TPRCKRRCLL-GYPKSYPSDRYYGKSAYIVKQSVKAIQREIMKNGPVVASF 267

Query: 109 YLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEEN 168
            +Y D   YKSG Y +                         + E+  Y  VKI+GWG+E 
Sbjct: 268 AVYEDFRHYKSGIYKH------------------------TAGELRGYHAVKIIGWGKE- 302

Query: 169 GRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 228
                                            N   +W I +++ + +G+KG  +I+RG
Sbjct: 303 ---------------------------------NNTDFWLIANSWHQDWGEKGYFRIVRG 329

Query: 229 RNEAIIESLV 238
           +NE  IE+ V
Sbjct: 330 KNECGIETDV 339


>gi|239938578|gb|ACS36088.1| cysteine proteinase [Haemonchus contortus]
          Length = 332

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 57/175 (32%), Positives = 82/175 (46%), Gaps = 28/175 (16%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPK 60
           C+ G+    W +V + G+VTGG +     C+P    PC NH     S P   +  TP   
Sbjct: 165 CNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYPLHPCGNHGGKFWSCPRDHSFRTPA-- 222

Query: 61  CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           C   C    YG+ + +DK   K  Y ++++   IQ+E+MKNGPV A    Y D FS+   
Sbjct: 223 CKKYCQY-GYGKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAASITYED-FSF--- 277

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
                              Y+ G+Y  +   +  A+A VK+VGWG ENG  YW +
Sbjct: 278 -------------------YRRGIYVHTRGRQRGAHA-VKVVGWGVENGTKYWNV 312


>gi|166030308|gb|ABY78821.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 64/242 (26%), Positives = 91/242 (37%), Gaps = 69/242 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   S W +    GL       +++ CQP  FP C H      +P C       PKC
Sbjct: 158 CKGGAPDSAWEYYVSHGL-------ASSYCQPYPFPHCGHHGGKGKKPPCSKYHFHTPKC 210

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +T CT+    +     KYR    Y + +   D ++E+  NGP V +  +YSD  +YK+G 
Sbjct: 211 NTTCTD----KAIPLIKYRGNNSYMLLNGEDDYKRELYFNGPFVVDFGVYSDFLAYKTGV 266

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         S +++    V+IVGWG+ NG PY         
Sbjct: 267 YRH------------------------VSGDVLGGHAVRIVGWGKLNGTPY--------- 293

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                    W I +++   +G  G   ILRG NE  IES     
Sbjct: 294 -------------------------WKIANSWDTDWGMNGHFLILRGNNECGIESTGYAG 328

Query: 242 LP 243
           LP
Sbjct: 329 LP 330


>gi|335347289|gb|AEH42092.1| cysteine proteinase 1 [Haemonchus contortus]
          Length = 332

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 57/175 (32%), Positives = 81/175 (46%), Gaps = 28/175 (16%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPK 60
           C+ G+    W +V + G+VTGG +     C+P    PC NH     S P   +  TP   
Sbjct: 165 CNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYPLHPCGNHGGKFWSCPRDHSFRTPA-- 222

Query: 61  CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           C   C    YG+ + +DK   K  Y ++++   IQ+E+MKNGPV A    Y D FS+   
Sbjct: 223 CKKYCQY-GYGKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAAFITYED-FSF--- 277

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
                              Y  G+Y  +   +  A+A VK+VGWG ENG  YW +
Sbjct: 278 -------------------YTKGIYVHTRGRQRGAHA-VKVVGWGVENGTKYWNV 312


>gi|239938580|gb|ACS36089.1| cysteine proteinase [Haemonchus contortus]
          Length = 332

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 57/175 (32%), Positives = 81/175 (46%), Gaps = 28/175 (16%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPK 60
           C+ G+    W +V + G+VTGG +     C+P    PC NH     S P   +  TP   
Sbjct: 165 CNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYPLHPCGNHGGKFWSCPRDHSFRTPA-- 222

Query: 61  CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           C   C    YG+ + +DK   K  Y ++++   IQ+E+MKNGPV A    Y D FS+   
Sbjct: 223 CKKYCQY-GYGKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAAFITYED-FSF--- 277

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
                              Y  G+Y  +   +  A+A VK+VGWG ENG  YW +
Sbjct: 278 -------------------YTKGIYVHTRGRQRGAHA-VKVVGWGVENGTKYWNV 312


>gi|404250524|gb|AFR54113.1| cysteine proteinase, partial [Haemonchus contortus]
          Length = 332

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 57/175 (32%), Positives = 81/175 (46%), Gaps = 28/175 (16%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPK 60
           C+ G+    W +V + G+VTGG +     C+P    PC NH     S P   +  TP   
Sbjct: 165 CNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYPLHPCGNHGGKFWSCPRDHSFRTP--A 222

Query: 61  CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           C   C    YG+ + +DK   K  Y ++++   IQ+E+MKNGPV A    Y D FS+   
Sbjct: 223 CKKYCQY-GYGKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAAFITYED-FSF--- 277

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
                              Y  G+Y  +   +  A+A VK+VGWG ENG  YW +
Sbjct: 278 -------------------YTKGIYVHTRGRQRGAHA-VKVVGWGVENGTKYWNV 312


>gi|170060936|ref|XP_001866022.1| cathepsin B [Culex quinquefasciatus]
 gi|167879259|gb|EDS42642.1| cathepsin B [Culex quinquefasciatus]
          Length = 341

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 66/243 (27%), Positives = 97/243 (39%), Gaps = 70/243 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +  +RG+ +GG ++S  GC P     C+ A+     P+C          
Sbjct: 158 CQGGNLGPAWQFWVQRGVSSGGPYNSRQGCHPYPVDVCHSADEDADTPKC---------- 207

Query: 62  HTRCTNDNYGRGFFQDKYRFKRY-YWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
            TR     Y      D  RF R  Y V+ +   I++EI +NGPV A+  +Y D  +YK+G
Sbjct: 208 -TRKCQSMYNVTNVSDDRRFGRVAYSVSQDEERIKEEIFRNGPVQASFDVYLDFKAYKTG 266

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
                       +Y  +F    G +A            VK++GWG EN            
Sbjct: 267 ------------VYRHVFGPMEGGHA------------VKMIGWGVEN------------ 290

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                                 G  YW   +++GE +G++G  KI+RG N   IES V+ 
Sbjct: 291 ----------------------GTKYWLCSNSWGEDWGERGFFKIVRGENHCGIESDVHA 328

Query: 241 ALP 243
            LP
Sbjct: 329 GLP 331


>gi|390994433|gb|AFM37366.1| cathepsin B3 [Dictyocaulus viviparus]
          Length = 342

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 66/242 (27%), Positives = 97/242 (40%), Gaps = 63/242 (26%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPK 60
           C  G   + W +    G+VTGG + +   C+P    PC NH N T     C  ++TP   
Sbjct: 162 CDGGFPDAAWEYFVSTGVVTGGLYGTKNACRPYEISPCGNHPNETFYR-NCTGVSTPS-- 218

Query: 61  CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           C T C    Y   +  DK R ++ Y + + V+ IQ++I+K+GP+VA   +Y D   YK G
Sbjct: 219 CKTSC-QKGYPVSYKDDKTRGRKSYNLANSVSAIQKDILKHGPLVATFSVYEDFMYYKKG 277

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
                           I+ Y  G Y    +        V+I+GWG EN   YW I   + 
Sbjct: 278 ----------------IYRYTHGGYEGGHA--------VRILGWGVENNVKYWIIANSWN 313

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                            WGE+                   G  +++RG N+  IE  V+ 
Sbjct: 314 TD---------------WGED-------------------GFFRMVRGINDCGIEESVSA 339

Query: 241 AL 242
            L
Sbjct: 340 GL 341


>gi|407080581|gb|AFS89610.1| procathepsin B precursor [Phenacoccus solenopsis]
          Length = 309

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 67/245 (27%), Positives = 101/245 (41%), Gaps = 67/245 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C   ++   WV   K G+V+GG++ S  GCQP   PPC H +       C T   P P C
Sbjct: 123 CDHHLAWDHWV---KHGIVSGGSYGSKEGCQPYHLPPCEH-HRAGPRRNC-TKYGPTPSC 177

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWV---NDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
              C  D Y   +  D +  K++Y +   N+++  I+ EI  NGPV A M  Y D ++Y+
Sbjct: 178 ARVCQPD-YKISYEDDLHFGKQWYALAPHNEKI--IRTEIFHNGPVEATMAAYEDFYTYE 234

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
           SG                I+ +  G +        V    VKI+GWG +           
Sbjct: 235 SG----------------IYHHIEGTF--------VCDHAVKIIGWGTD----------- 259

Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                ++   PYW + ++F   +G+ G  KI RG NE  IE+ +
Sbjct: 260 ---------------------KKTNTPYWLVANSFNTDWGEYGFFKIKRGVNECGIENKI 298

Query: 239 NGALP 243
              +P
Sbjct: 299 TAGIP 303


>gi|166030328|gb|ABY78831.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 60/236 (25%), Positives = 91/236 (38%), Gaps = 69/236 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W++  + G+       +++GCQP  FP C H     ++  C       PKC
Sbjct: 158 CKGGFPGFAWLYYVEYGI-------ASSGCQPYPFPHCEHRGAQGNKTPCSKYKFDTPKC 210

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +  CT+    +     KYR    Y +     D ++E+  NGP VA  ++Y+D+F+YKSG 
Sbjct: 211 NATCTD----KSIPLVKYRGNATYLLLHGEEDYKRELYFNGPFVAVFFVYTDLFAYKSGV 266

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y N                           + +    V+IVGWG+ NG P          
Sbjct: 267 YRN------------------------VDGDFLGGQAVRIVGWGKLNGTP---------- 292

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
                                   YW + +++   +G  G + ILRG NE  IE L
Sbjct: 293 ------------------------YWKVANSWDTDWGMNGYMLILRGNNECNIEHL 324


>gi|268561802|ref|XP_002638421.1| C. briggsae CBR-CPR-3 protein [Caenorhabditis briggsae]
          Length = 375

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 70/246 (28%), Positives = 94/246 (38%), Gaps = 71/246 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTT-SEPECKTLATPQPK 60
           C  G +     +    G+VTGG  ++  GC P SFPPC  +     S P CKT       
Sbjct: 165 CQGGYTIEAMKYWMNSGVVTGG-DYNGAGCMPYSFPPCKKSPCVEFSTPSCKT------T 217

Query: 61  CHTRCTNDNY--GRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
           C  + T  +Y   + F    Y+        + V  IQ EI  NGPV A+  ++ D + YK
Sbjct: 218 CQEKYTTADYKNDKHFATSAYKLST---TKNAVPTIQYEIYHNGPVEASYRVFEDFYQYK 274

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
           SG Y +                         S  +V    VKI+GWG ENG  Y      
Sbjct: 275 SGVYHH------------------------VSGNLVGGHAVKIIGWGTENGVDY------ 304

Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                       W + +++G  FG+KG  KI RG NE  IES +
Sbjct: 305 ----------------------------WLVANSWGTSFGEKGFFKIRRGTNECQIESNI 336

Query: 239 NGALPK 244
              L K
Sbjct: 337 VAGLAK 342


>gi|189239879|ref|XP_968767.2| PREDICTED: similar to putative cathepsin B-like proteinase
           [Tribolium castaneum]
 gi|270012755|gb|EFA09203.1| cathepsin B precursor [Tribolium castaneum]
          Length = 353

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 68/245 (27%), Positives = 98/245 (40%), Gaps = 71/245 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G S   W +    GLV+GG ++++ GCQP S      +N+              P+C
Sbjct: 143 CKGGYSYYAWKYYTSTGLVSGGDYNTSRGCQPYS-----KSNFNDGV---------SPEC 188

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C N  Y   +  D++     Y++   V  IQQEI+  G                   
Sbjct: 189 SKTCQNTKYPTSYLNDRHFGDGTYYILKNVTTIQQEILLRG------------------- 229

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
              GPV+A   +Y D   Y+ GVY  ++ A + ++A VKI+GWG ENG  YW +      
Sbjct: 230 ---GPVMAGFDVYEDFKLYREGVYVHTSGALLGSHA-VKIIGWGTENGWAYWLVAN---- 281

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE-SLVNG 240
                           WG++ G                 G  KI RG NE  IE S++ G
Sbjct: 282 ---------------SWGKDWG--------------ALGGVFKIRRGTNECKIEQSIITG 312

Query: 241 ALPKD 245
            + KD
Sbjct: 313 HVRKD 317


>gi|2944340|gb|AAC05262.1| cathepsin B-like cysteine protease GCP7 [Haemonchus contortus]
          Length = 348

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 53/202 (26%), Positives = 82/202 (40%), Gaps = 41/202 (20%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G ++  W W    G+VTGGA+     C+P  FP C  A+   +   C +     P C
Sbjct: 166 CDGGYNARAWKWATIAGVVTGGAYKEKGNCKPYVFPQCG-AHKGKAFNNCPSHPYATPAC 224

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    YG+ +  DK + + +YW+ ++   IQ EIM+ GPV A   +Y D   Y+ G 
Sbjct: 225 KPYCQY-GYGKRYENDKIKARTWYWLPNDERTIQLEIMQKGPVHATFNIYEDFEHYEGGV 283

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y                        +  +  +    ++KI+GWG + G  YW I   ++ 
Sbjct: 284 Y------------------------IHTAGAMEGGHSIKIIGWGVDKGVKYWLIANSWST 319

Query: 182 SASAEIVAYATVKLIGWGEENG 203
                           WGE+ G
Sbjct: 320 D---------------WGEDGG 326


>gi|3087797|emb|CAA93275.1| cysteine proteinase [Haemonchus contortus]
          Length = 330

 Score = 81.6 bits (200), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 56/174 (32%), Positives = 80/174 (45%), Gaps = 27/174 (15%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G+    W +V + G+VTGG +     C+P    PC       S P   +  TP   C
Sbjct: 165 CNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYHLHPCEITGKFWSCPRDHSFRTPA--C 222

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    YG+ + +DK   K  Y ++++   IQ+E+MKNGPV A    Y D FS+    
Sbjct: 223 KKYCQY-GYGKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAAFTTYED-FSF---- 276

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
                             Y+ G+Y  S   +  A+A VK+VGWG ENG  YW +
Sbjct: 277 ------------------YRKGIYVHSYGRQRGAHA-VKVVGWGVENGTKYWNV 311


>gi|291291827|gb|ADD91786.1| cysteine proteinase [Haemonchus contortus]
          Length = 253

 Score = 81.3 bits (199), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 60/175 (34%), Positives = 77/175 (44%), Gaps = 26/175 (14%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G     + +  K+G VTGG + + +GC+P  F PC H    T   EC   AT  PKC
Sbjct: 72  CNGGWPIQAFNYFSKQGAVTGGDYKATSGCRPYPFHPCGHHGKDTYYGECPNEAT-TPKC 130

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C                  Y   N E A  Q+EIMKNGPVV    +Y D FSY    
Sbjct: 131 VRKCQKSYKKSYKKDRSIGKDAYEEPNAEKA-TQREIMKNGPVVGAFTVYED-FSY---- 184

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
                             YK G+Y  +A      +A +KI+GWG+E G PYW I 
Sbjct: 185 ------------------YKKGIYKHTAGKARGGHA-IKIIGWGKEGGVPYWLIA 220


>gi|209863073|ref|NP_001119610.2| cathepsin B-1852 [Acyrthosiphon pisum]
          Length = 333

 Score = 81.3 bits (199), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 67/243 (27%), Positives = 96/243 (39%), Gaps = 65/243 (26%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +  K GLVTGG  +S  GCQP  FPPC      T    C   +    KC
Sbjct: 153 CQGGYPIRAWRYYSKHGLVTGGNFNSFEGCQPYMFPPC------TGNNSCSGQSEKNHKC 206

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C  +          YR  R Y            + ++  V+A   + +DI +Y    
Sbjct: 207 QKKCFGNT------SISYRGDRRY------------VERSPYVLAYDNMQNDIMTY---- 244

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
              GP+ ++  +Y D  SYKSGVY  S +A  +   +VK +GWG E             V
Sbjct: 245 ---GPIESSFDVYDDFISYKSGVYFKSPNATYLGGHSVKCIGWGVERN-----------V 290

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
           S                       YW +++++   +GD G  KI RG NE  +E      
Sbjct: 291 S-----------------------YWLMMNSWNSTWGDGGYFKIRRGTNECQVEDSSTAG 327

Query: 242 LPK 244
           +P+
Sbjct: 328 VPE 330


>gi|48762483|dbj|BAD23811.1| cathepsin B-S [Tuberaphis takenouchii]
          Length = 155

 Score = 81.3 bits (199), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 49/179 (27%), Positives = 80/179 (44%), Gaps = 30/179 (16%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +   +G+ TGG + S  GC P   PPC       +         P  + 
Sbjct: 5   CEGGYPIKAWQYFRTQGVPTGGDYDSKEGCAPYKIPPCFDQKGKNT-----CAGKPLERN 59

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           H +C    YG    Q +Y+ K  Y +N     ++Q+++K GP+ A+  L+ D        
Sbjct: 60  H-QCPKTCYGSTTVQKRYKVKNEYVLNSPNT-MEQDLIKYGPIEASFNLFDD-------- 109

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
                          + +YKSG+Y  +  A+ ++  ++KI+GWG+ENG PYW  V  ++
Sbjct: 110 ---------------LSAYKSGIYQKTPKAKFLSGHSIKIIGWGKENGVPYWLAVNSWS 153


>gi|160688716|gb|ABX45136.1| cathepsin B-like cysteine protease 2 [Callosobruchus maculatus]
          Length = 260

 Score = 81.3 bits (199), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 61/216 (28%), Positives = 89/216 (41%), Gaps = 67/216 (31%)

Query: 30  GCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVND 89
           GC     P CN        P CKTL    P C   C   +  + + +DK+  K+ Y +  
Sbjct: 111 GCMSYPLPRCN--------PSCKTLYD-APTCKKECDKGSPLK-YEEDKHYAKQAYRIMS 160

Query: 90  EVA-DIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVS 148
           +V   IQ EI+KNGPVV                       A+  +Y+D   Y SGVY   
Sbjct: 161 KVERQIQLEIIKNGPVV-----------------------ASFTVYADFIHYLSGVYKFD 197

Query: 149 ASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWT 208
             ++++    V+I+GWG ENG                                   PYW 
Sbjct: 198 GESKLLGGHAVRIIGWGIENGT---------------------------------YPYWL 224

Query: 209 IVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
           + +++ E++GD+G  KI RG+NE  IE  +   LP+
Sbjct: 225 VSNSWNERWGDQGLFKIWRGKNECGIEEEITAGLPR 260


>gi|326515156|dbj|BAK03491.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 471

 Score = 81.3 bits (199), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 65/243 (26%), Positives = 94/243 (38%), Gaps = 63/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G  S+ W +    GLVTGG  +SN GC P     C+H      +P C  +  P P C
Sbjct: 286 CEGGYPSAAWDYFQSTGLVTGGDWNSNQGCYPYQLQACDHHVTGKYQP-CGDI-QPTPAC 343

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C N+     +  DK+     Y V  +   I  EI  NGPV A+  +Y+D  SYKSG 
Sbjct: 344 ANSCQNN---ATWSSDKHFGASSYSVGTDQQSIMTEIYTNGPVEASYDVYADFVSYKSG- 399

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          ++ + +G Y        +    VKI+GWG +   P          
Sbjct: 400 ---------------VYQHVTGDY--------LGGHAVKIIGWGVDGSTP---------- 426

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW + +++   +G+ G   ILRG +E  IE  +   
Sbjct: 427 ------------------------YWIVANSWNNDWGNNGFFNILRGSDECGIEDGIVAG 462

Query: 242 LPK 244
           +PK
Sbjct: 463 IPK 465


>gi|308466896|ref|XP_003095699.1| CRE-CPR-3 protein [Caenorhabditis remanei]
 gi|308244581|gb|EFO88533.1| CRE-CPR-3 protein [Caenorhabditis remanei]
          Length = 373

 Score = 81.3 bits (199), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 72/254 (28%), Positives = 100/254 (39%), Gaps = 67/254 (26%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYT-TSEPECKTLATPQPK 60
           C  G S     +    G VTGG ++ N GC P SF PC  +    ++ P CKT       
Sbjct: 163 CQGGYSIEAMRFWKSNGAVTGGDYNGN-GCMPYSFAPCQKSPCVESTTPTCKTTCQSSYT 221

Query: 61  CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
                T+ +YG   +       R    N+ V+ IQ EI  NGPV A+  +Y D + YKSG
Sbjct: 222 TANYTTDKHYGTSAY-------RLATTNNVVSTIQYEIYHNGPVEASYKVYEDFYQYKSG 274

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
                           ++ Y SG        ++V    VKI+GWG EN   Y        
Sbjct: 275 ----------------VYHYVSG--------KLVGGHAVKIIGWGTENDVDY-------- 302

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                                     W + +++G +FG+ G  KI RG NE  IES V  
Sbjct: 303 --------------------------WLVANSWGIKFGEGGFFKIRRGTNECQIESNVVA 336

Query: 241 ALPKDNYGVEFGEE 254
            + K     E G++
Sbjct: 337 GVAKLGTHAEKGDD 350


>gi|44965462|gb|AAS49538.1| cathepsin B [Protopterus dolloi]
          Length = 225

 Score = 80.9 bits (198), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 52/167 (31%), Positives = 75/167 (44%), Gaps = 26/167 (15%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S  W +  ++GLV+GG + S  GC+P + PPC H +   S P C       PKC
Sbjct: 83  CNGGYPSGAWQYWTEKGLVSGGLYGSGIGCRPYTIPPCEH-HVNGSRPSCSGEGGDTPKC 141

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C +  Y   + +DK   +  Y V      I +EI K+GPV     +Y D   YKSG 
Sbjct: 142 VQKC-DSGYTPAYEKDKIYGQSAYSVPSSPESIMEEIYKDGPVEGAFTVYEDFLLYKSG- 199

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEEN 168
                          ++ + +G        E V    +KI+GWG EN
Sbjct: 200 ---------------VYQHHTG--------EAVGGHAIKILGWGIEN 223


>gi|161343869|tpg|DAA06115.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 337

 Score = 80.9 bits (198), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 68/246 (27%), Positives = 101/246 (41%), Gaps = 66/246 (26%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W      GLVTGG ++S  GC+P   PP N  N ++S+         +  C
Sbjct: 157 CHGGYPIKAWKRFSTHGLVTGGDYNSGEGCEPYRVPPSNDGNSSSSDQPLAINHICRRHC 216

Query: 62  HTRCTNDNYGRGFFQDKYRFKR-YYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           +   + D      F D +R+ R YY++      IQ+                D+ +Y   
Sbjct: 217 YGNQSID------FNDDHRYTRDYYYLT--YGSIQK----------------DVLTY--- 249

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
               GP+ A+  +Y D  SYKSGVY  S +A  +    VK++GWGEE+G PY        
Sbjct: 250 ----GPIEASFDVYDDFPSYKSGVYVKSDNASYLGGHAVKLIGWGEEDGTPY-------- 297

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                                     W +V+++  Q+GD G  KI RG NE  +++    
Sbjct: 298 --------------------------WLMVNSWNTQWGDNGFFKIRRGTNECGVDNSTTA 331

Query: 241 ALPKDN 246
            +P  N
Sbjct: 332 GVPVTN 337


>gi|166030322|gb|ABY78828.1| cathepsin B-like protease [Trypanosoma congolense]
 gi|343471419|emb|CCD16168.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score = 80.9 bits (198), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 61/207 (29%), Positives = 86/207 (41%), Gaps = 62/207 (29%)

Query: 31  CQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDE 90
           CQP  FP C H     ++  C       P+C+T CT+    +     KYR K  Y +   
Sbjct: 180 CQPYPFPQCEHQGAQGNKTPCSNYKFVTPQCNTTCTD----KTIPLIKYRGKDAYMLLPG 235

Query: 91  VADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSAS 150
             + ++E+  NGP VA +++Y+D+F+YKSG Y N   V   Y+         GV A    
Sbjct: 236 EEEFKRELYFNGPFVAILFVYTDLFAYKSGVYRN---VDGSYM---------GVTA---- 279

Query: 151 AEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIV 210
                   VK+VGWG+ NG P                                  YW + 
Sbjct: 280 --------VKVVGWGKLNGTP----------------------------------YWKVA 297

Query: 211 STFGEQFGDKGTIKILRGRNEAIIESL 237
           +T+   +G  G + ILRG NE  IE L
Sbjct: 298 NTWDTDWGMDGYLLILRGNNECNIEHL 324


>gi|389593817|ref|XP_003722157.1| cysteine peptidase C (CPC) [Leishmania major strain Friedlin]
 gi|321438655|emb|CBZ12414.1| cysteine peptidase C (CPC) [Leishmania major strain Friedlin]
          Length = 340

 Score = 80.9 bits (198), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 63/235 (26%), Positives = 86/235 (36%), Gaps = 70/235 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  GI +  W+W    G+ T         CQP  F PC+H   +   P C +     PKC
Sbjct: 166 CHGGIPTVAWLWWVWVGIAT-------EDCQPYPFDPCSHHGNSEKYPPCPSTIYDTPKC 218

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +T C            KY+    Y V  E  ++  E+M NGP+   M +YSD   YKSG 
Sbjct: 219 NTTCERSEMDL----VKYKGSTSYSVKGE-KELMIELMTNGPLELTMQVYSDFVGYKSGV 273

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                           E +    VK+VGWG ++G P          
Sbjct: 274 YKH------------------------VLGEFLGGHAVKLVGWGTQDGVP---------- 299

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
                                   YW + +++   +GDKG   I RG NE  IES
Sbjct: 300 ------------------------YWKVANSWNTDWGDKGYFLIQRGNNECKIES 330


>gi|237836005|ref|XP_002367300.1| cysteine proteinase, putative [Toxoplasma gondii ME49]
 gi|211964964|gb|EEB00160.1| cysteine proteinase, putative [Toxoplasma gondii ME49]
 gi|221506020|gb|EEE31655.1| cysteine proteinase, putative [Toxoplasma gondii VEG]
          Length = 572

 Score = 80.9 bits (198), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 54/183 (29%), Positives = 77/183 (42%), Gaps = 34/183 (18%)

Query: 2   CSSGISSSTWVWVHKRGLVTGG---AHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQ 58
           C+ G     W W  ++G+VTGG   A    T C P   P C H +     P+C     P+
Sbjct: 350 CNGGQPGMAWRWFERKGVVTGGDFDALGKGTTCWPYEVPFCAH-HAKAPFPDCDATLVPR 408

Query: 59  --PKCHTRCTNDNYGRG---FFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSD 113
             PKC   C    Y      F QD ++    Y +     D+++++M +GPV     +Y D
Sbjct: 409 KTPKCRKDCEEQAYADNVHPFDQDTHKATSAYSLRSR-DDVKRDMMTHGPVSGAFMVYED 467

Query: 114 IFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYW 173
             SYKSG                ++ + SG+         V    +KI+GWG ENG  YW
Sbjct: 468 FLSYKSG----------------VYKHVSGL--------PVGGHAIKIIGWGTENGEEYW 503

Query: 174 TIV 176
             V
Sbjct: 504 HAV 506


>gi|21700775|gb|AAL60053.1| cysteine proteinase [Toxoplasma gondii]
          Length = 569

 Score = 80.9 bits (198), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 54/183 (29%), Positives = 77/183 (42%), Gaps = 34/183 (18%)

Query: 2   CSSGISSSTWVWVHKRGLVTGG---AHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQ 58
           C+ G     W W  ++G+VTGG   A    T C P   P C H +     P+C     P+
Sbjct: 347 CNGGQPGMAWRWFERKGVVTGGDFDALGKGTTCWPYEVPFCAH-HAKAPFPDCDATLVPR 405

Query: 59  --PKCHTRCTNDNYGRG---FFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSD 113
             PKC   C    Y      F QD ++    Y +     D+++++M +GPV     +Y D
Sbjct: 406 KTPKCRKDCEEQAYADNVHPFDQDTHKATSAYSLRSR-DDVKRDMMTHGPVSGAFMVYED 464

Query: 114 IFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYW 173
             SYKSG                ++ + SG+         V    +KI+GWG ENG  YW
Sbjct: 465 FLSYKSG----------------VYKHVSGL--------PVGGHAIKIIGWGTENGEEYW 500

Query: 174 TIV 176
             V
Sbjct: 501 HAV 503


>gi|343476073|emb|CCD12715.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score = 80.5 bits (197), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 61/207 (29%), Positives = 86/207 (41%), Gaps = 62/207 (29%)

Query: 31  CQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDE 90
           CQP  FP C H     ++  C       P+C+T CT+    +     KYR K  Y +   
Sbjct: 180 CQPYPFPQCEHHGAQGNKTPCSNYKFVTPQCNTTCTD----KTIPLIKYRGKDAYMLLPG 235

Query: 91  VADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSAS 150
             + ++E+  NGP VA +++Y+D+F+YKSG Y N   V   Y+         GV A    
Sbjct: 236 EEEFKRELYFNGPFVAILFVYTDLFAYKSGVYRN---VDGSYM---------GVTA---- 279

Query: 151 AEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIV 210
                   VK+VGWG+ NG P                                  YW + 
Sbjct: 280 --------VKVVGWGKLNGTP----------------------------------YWKVA 297

Query: 211 STFGEQFGDKGTIKILRGRNEAIIESL 237
           +T+   +G  G + ILRG NE  IE L
Sbjct: 298 NTWDTDWGMDGYLLILRGNNECNIEHL 324


>gi|221484923|gb|EEE23213.1| cysteine proteinase, putative [Toxoplasma gondii GT1]
          Length = 569

 Score = 80.5 bits (197), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 54/183 (29%), Positives = 77/183 (42%), Gaps = 34/183 (18%)

Query: 2   CSSGISSSTWVWVHKRGLVTGG---AHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQ 58
           C+ G     W W  ++G+VTGG   A    T C P   P C H +     P+C     P+
Sbjct: 347 CNGGQPGMAWRWFERKGVVTGGDFDALGKGTTCWPYEVPFCAH-HAKAPFPDCDATLVPR 405

Query: 59  --PKCHTRCTNDNYGRG---FFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSD 113
             PKC   C    Y      F QD ++    Y +     D+++++M +GPV     +Y D
Sbjct: 406 KTPKCRKDCEEQAYADNVHPFDQDTHKATSAYSLRSR-DDVKRDMMTHGPVSGAFMVYED 464

Query: 114 IFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYW 173
             SYKSG                ++ + SG+         V    +KI+GWG ENG  YW
Sbjct: 465 FLSYKSG----------------VYKHVSGL--------PVGGHAIKIIGWGTENGEEYW 500

Query: 174 TIV 176
             V
Sbjct: 501 HAV 503


>gi|1008858|gb|AAA79004.1| cathepsin B-like thiol protease [Aedes aegypti]
          Length = 342

 Score = 80.5 bits (197), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 64/245 (26%), Positives = 98/245 (40%), Gaps = 68/245 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +  ++G+ +GG ++S  GC P     C+      S  E  T     PKC
Sbjct: 156 CKGGYLGPAWQFWVEQGVSSGGPYNSRQGCHPYPIDVCD-----ASGEEADT-----PKC 205

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             RC +       +QD+   +  Y + ++   I +EI  NGPV A    Y D+ +YKSG 
Sbjct: 206 SKRCQSGYNVTDVWQDRRYGRVAYSIPNDEQKIMEEIYINGPVQAAFMTYQDLHAYKSG- 264

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                      +Y  ++ + +G +A            VK++GWG ENG            
Sbjct: 265 -----------VYRHVWGHMAGGHA------------VKLMGWGVENG------------ 289

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW + +++G+ +GD G  KI+RG N   IE  V+  
Sbjct: 290 ----------------------LKYWLVANSWGDDWGDNGFFKIVRGENHCGIEKDVHAG 327

Query: 242 LPKDN 246
           LP  N
Sbjct: 328 LPSFN 332


>gi|268619140|gb|ACZ13346.1| cathepsin B-like cysteine proteinase [Bursaphelenchus xylophilus]
          Length = 405

 Score = 80.5 bits (197), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 57/177 (32%), Positives = 77/177 (43%), Gaps = 30/177 (16%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK- 60
           C+ G+    +    + G  TG     + GCQP  F  C H   +T  P C ++  P+ K 
Sbjct: 140 CNGGLEEVAFEKFIENGFPTGSEVDKHQGCQPYPFKHCAHHVNSTEYPPCDSV--PEYKA 197

Query: 61  --CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
             C   C  D Y R + +D Y  K  Y  +DE A IQ+EIM NGPV  +  +Y       
Sbjct: 198 DTCSHECQKD-YDRKYEEDLYYGKEQYGFSDE-APIQREIMTNGPVAVSFTVYES----- 250

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
                        +LY     Y  G+Y  +    I  Y  V++VGWG ENG  YW I
Sbjct: 251 -------------FLY-----YSGGIYRSTPGERIKGYHAVRVVGWGVENGTKYWKI 289


>gi|157167283|ref|XP_001658486.1| cathepsin b [Aedes aegypti]
 gi|108876477|gb|EAT40702.1| AAEL007599-PA [Aedes aegypti]
          Length = 342

 Score = 80.5 bits (197), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 64/245 (26%), Positives = 98/245 (40%), Gaps = 68/245 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +  ++G+ +GG ++S  GC P     C+      S  E  T     PKC
Sbjct: 156 CKGGYLGPAWQFWVEQGVSSGGPYNSRQGCHPYPIDVCD-----ASGEEADT-----PKC 205

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             RC +       +QD+   +  Y + ++   I +EI  NGPV A    Y D+ +YKSG 
Sbjct: 206 SKRCQSGYNVTDVWQDRRYGRVAYSIPNDEQKIMEEIYINGPVQAAFMTYQDLHAYKSG- 264

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                      +Y  ++ + +G +A            VK++GWG ENG            
Sbjct: 265 -----------VYRHVWGHMAGGHA------------VKLMGWGVENG------------ 289

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW + +++G+ +GD G  KI+RG N   IE  V+  
Sbjct: 290 ----------------------LKYWLVANSWGDDWGDNGFFKIVRGENHCGIEKDVHAG 327

Query: 242 LPKDN 246
           LP  N
Sbjct: 328 LPSFN 332


>gi|342181301|emb|CCC90780.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 335

 Score = 80.5 bits (197), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 61/235 (25%), Positives = 91/235 (38%), Gaps = 70/235 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   ++W +    GL       +++ CQP  FP C H      +P C       PKC
Sbjct: 158 CDGGYPGTSWEYYVSHGL-------ASSYCQPYPFPHCGHHGGKGKKPPCSKYHFHTPKC 210

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +T CT+    +     KYR    Y V+ E  D ++E+                       
Sbjct: 211 NTTCTD----KAIPLIKYRGNHSYEVHGE-DDYKREL----------------------- 242

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y NGP V   ++YSD  +YK+GVY                                    
Sbjct: 243 YFNGPFVVVFWVYSDFLAYKTGVYR----------------------------------- 267

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
             S + +    V+++GWG+ NG PYW I +++   +G  G +  LRG NE  IE+
Sbjct: 268 HVSGDFLGGHAVRIVGWGKLNGTPYWKIANSWDTDWGMNGHLLFLRGNNECGIEA 322


>gi|166030310|gb|ABY78822.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 335

 Score = 80.5 bits (197), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 61/235 (25%), Positives = 91/235 (38%), Gaps = 70/235 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   ++W +    GL       +++ CQP  FP C H      +P C       PKC
Sbjct: 158 CDGGYPGTSWEYYVSHGL-------ASSYCQPYPFPHCGHHGGKGKKPPCSKYHFHTPKC 210

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +T CT+    +     KYR    Y V+ E  D ++E+                       
Sbjct: 211 NTTCTD----KAIPLIKYRGNHSYEVHGE-DDYKREL----------------------- 242

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y NGP V   ++YSD  +YK+GVY                                    
Sbjct: 243 YFNGPFVVVFWVYSDFLAYKTGVYR----------------------------------- 267

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
             S + +    V+++GWG+ NG PYW I +++   +G  G +  LRG NE  IE+
Sbjct: 268 HVSGDFLGGHAVRIVGWGKLNGTPYWKIANSWDTDWGMNGHLLFLRGNNECGIEA 322


>gi|294914336|ref|XP_002778250.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239886453|gb|EER10045.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 388

 Score = 80.5 bits (197), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 61/217 (28%), Positives = 91/217 (41%), Gaps = 61/217 (28%)

Query: 28  NTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWV 87
           ++GC P +FP C+H   T     CK   +P P C T C N ++   F  D++  +   + 
Sbjct: 219 DSGCWPYNFPECSHHVDTKGMEPCKG-NSPSPVCSTTCRNHHFKPSFESDRHFTEDEGYS 277

Query: 88  NDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAV 147
            DEV +I++EI+ NGPV A   +Y D F Y                      YKSGVY  
Sbjct: 278 LDEVDEIKREIIDNGPVAAAFTVYED-FPY----------------------YKSGVYKH 314

Query: 148 SASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYW 207
              +E+  +A                                   VK+IGWG +    YW
Sbjct: 315 VNGSELGGHA-----------------------------------VKIIGWGIDQNEQYW 339

Query: 208 TIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
            +++++   +GD+G  KI  G  E  I+S V   +PK
Sbjct: 340 LVMNSWNVNWGDQGIFKIAIG--ECGIDSEVTAGIPK 374


>gi|1848229|gb|AAB48119.1| cathepsin B-like protease [Leishmania major]
          Length = 340

 Score = 80.5 bits (197), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 62/235 (26%), Positives = 87/235 (37%), Gaps = 70/235 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  GI +  W+W    G+ T         CQP  F PC+H   +   P C +     PKC
Sbjct: 166 CHGGIPTVAWLWWVWVGIAT-------EDCQPYPFDPCSHHGNSEKYPPCPSTIYDTPKC 218

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +T C  +         KY+    Y V  E  ++  E+M NGP+   M +YSD   YKSG 
Sbjct: 219 NTTCERNEMDL----VKYKGSTSYSVKGE-KELMIELMTNGPLELTMQVYSDFVGYKSGV 273

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                           + +    VK+VGWG ++G P          
Sbjct: 274 YKH------------------------VLGDFLGGHAVKLVGWGTQDGVP---------- 299

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
                                   YW + +++   +GDKG   I RG NE  IES
Sbjct: 300 ------------------------YWKVANSWNTDWGDKGYFLIQRGNNECKIES 330


>gi|187107122|ref|NP_001119621.1| cathepsin B-3098 precursor [Acyrthosiphon pisum]
 gi|161343841|tpg|DAA06101.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 337

 Score = 80.1 bits (196), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 65/247 (26%), Positives = 98/247 (39%), Gaps = 71/247 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP-- 59
           C+ G     W      GLVTGG + S  GC+P   PPC +      + + K   + QP  
Sbjct: 155 CNGGYPIRAWKRFKNHGLVTGGNYKSGEGCEPYRVPPCPY------DKDGKNTCSGQPME 208

Query: 60  ---KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFS 116
              KC  +C  D      F   +R+ R     D+     + I K            D+ +
Sbjct: 209 SNHKCSKKCYGDE--DIDFNKDHRYTR-----DDYYLTYRGIQK------------DVIN 249

Query: 117 YKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
           Y       GP+  +  +Y D  +YKSG+Y  S +A  +   +VK++GWGEE G  Y    
Sbjct: 250 Y-------GPIETSFDVYDDFPNYKSGIYVKSENASYLGGHSVKLIGWGEEYGVLY---- 298

Query: 177 RVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
                                         W +V+++   +GDKG  KI RG NE  +++
Sbjct: 299 ------------------------------WLMVNSWNADWGDKGLFKIRRGTNECRVDN 328

Query: 237 LVNGALP 243
              G +P
Sbjct: 329 STTGGVP 335


>gi|4325188|gb|AAD17297.1| cysteine proteinase [Ancylostoma ceylanicum]
          Length = 341

 Score = 80.1 bits (196), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 52/175 (29%), Positives = 74/175 (42%), Gaps = 26/175 (14%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     + W+ + G+VTGG +     C+P SF PC           C     P PKC
Sbjct: 158 CQGGWPIEAYRWMQRDGVVTGGKYRQRDVCKPYSFYPCGQHKDVPYYGPCPGGLWPTPKC 217

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             + +   Y + + +DK+   R Y + +    I+QEI KNGPVVA   +Y D        
Sbjct: 218 R-KSSQRKYNKTYQEDKHFATRSYSLPNNERSIRQEIYKNGPVVAAFKVYED-------- 268

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
                           +S   G+Y      +  A+A  K++GWG ENG  YW I 
Sbjct: 269 ----------------YSSTGGIYVHKWGIQTGAHAD-KVIGWGRENGTDYWLIA 306


>gi|343474530|emb|CCD13852.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 335

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 64/242 (26%), Positives = 97/242 (40%), Gaps = 70/242 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +    G+ +G        CQP  FP C+H   +T+ P+C  L    P C
Sbjct: 158 CLGGDPDMAWAYFSSEGIASGR-------CQPYPFPRCSHYTNSTTYPQCSALHLWTPTC 210

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +  CT+    +     KYR  + Y ++ E  D ++E+   GP  A   ++SD        
Sbjct: 211 NPACTDSTISK----KKYRGLKSYSLSGE-EDFRRELYFRGPFQAVFDVWSD-------- 257

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          +F+YK GVY     A I A+A V+IVGWG ++G P          
Sbjct: 258 ---------------LFAYKHGVYKHVGGAFIGAHA-VRIVGWGNQSGVP---------- 291

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW I +++  ++GD+G   +LRG NE  IE   +  
Sbjct: 292 ------------------------YWKIANSWNAEWGDRGYFFMLRGDNECGIEDSGSAG 327

Query: 242 LP 243
           +P
Sbjct: 328 VP 329


>gi|12004577|gb|AAG44098.1| cathepsin B cysteine protease [Leishmania chagasi]
          Length = 340

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 64/235 (27%), Positives = 86/235 (36%), Gaps = 70/235 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  GI +  W+W    G+ T         CQP  F PC+H   +   P C       PKC
Sbjct: 166 CYGGIPTMAWLWWVWVGITT-------EVCQPYPFGPCSHHGNSDKYPPCPNTIYDTPKC 218

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +T C            KY+    Y V  E  ++  E+M NGP+   M +YSD   YKSG 
Sbjct: 219 NTTCEKSEMDL----VKYKGGTSYSVKGE-KELMIELMTNGPLEVTMQVYSDFVGYKSGG 273

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         S +++    VK+VGWG + G P          
Sbjct: 274 YKH------------------------VSGDLLGGHAVKLVGWGTQGGVP---------- 299

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
                                   YW I +++   +GDKG   I RG NE  IES
Sbjct: 300 ------------------------YWKIANSWNTDWGDKGYFLIQRGSNECGIES 330


>gi|409905640|gb|AFV46426.1| cysteine protease C [Leishmania donovani]
          Length = 345

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 64/235 (27%), Positives = 86/235 (36%), Gaps = 70/235 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  GI +  W+W    G+ T         CQP  F PC+H   +   P C       PKC
Sbjct: 171 CYGGIPTMAWLWWVWVGITT-------EVCQPYPFGPCSHHGNSDKYPPCPNTIYDTPKC 223

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +T C            KY+    Y V  E  ++  E+M NGP+   M +YSD   YKSG 
Sbjct: 224 NTTCEKSEMDL----VKYKGGTSYSVKGE-KELMIELMTNGPLEVTMQVYSDFVGYKSGV 278

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         S +++    VK+VGWG + G P          
Sbjct: 279 YKH------------------------VSGDLLGGHAVKLVGWGTQGGVP---------- 304

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
                                   YW I +++   +GDKG   I RG NE  IES
Sbjct: 305 ------------------------YWKIANSWNTDWGDKGYFLIQRGSNECGIES 335


>gi|343470805|emb|CCD16605.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 337

 Score = 79.7 bits (195), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 61/243 (25%), Positives = 91/243 (37%), Gaps = 69/243 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +  + G+       +++ CQP  FP C H     ++  C       PKC
Sbjct: 159 CKGGFPGFAWRYYVEYGI-------TSSSCQPYPFPRCEHQGAQGNKTPCSKYNFDTPKC 211

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +  CT+    +     KYR    Y +     D ++E+  NGP VA  Y+Y+D+F+YKSG 
Sbjct: 212 NATCTD----KSVPLIKYRGNATYLLLHGEEDYKRELYFNGPFVAVFYVYTDLFAYKSGV 267

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y N                           + +    VK+VGWG+ NG P          
Sbjct: 268 YRN------------------------VDGDFLGGTAVKVVGWGKLNGTP---------- 293

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW + +++   +G  G + ILRG NE  IE L    
Sbjct: 294 ------------------------YWKVANSWDTDWGMDGYLLILRGNNECNIEHLGFAG 329

Query: 242 LPK 244
            P+
Sbjct: 330 TPE 332


>gi|166030318|gb|ABY78826.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 335

 Score = 79.7 bits (195), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 65/243 (26%), Positives = 97/243 (39%), Gaps = 72/243 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +    G+ +G        CQP  FP C+H   +T+ P+C  L    P C
Sbjct: 158 CLGGDPDMAWAYFSSEGIASGR-------CQPYPFPRCSHYTNSTTYPQCSALHLWTPTC 210

Query: 62  HTRCTNDNYGRGFFQDKYR-FKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           +  CT+    +     KYR  K Y +  +E  D ++E+   GP  A   ++SD       
Sbjct: 211 NPACTDSTISK----KKYRGLKSYSFSGEE--DFRRELYFRGPFQAVFDVWSD------- 257

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
                           +F+YK GVY     A I A+A V+IVGWG ++G P         
Sbjct: 258 ----------------LFAYKHGVYKHVGGAFIGAHA-VRIVGWGNQSGVP--------- 291

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                                    YW I +++  ++GD+G   +LRG NE  IE   + 
Sbjct: 292 -------------------------YWKIANSWNAEWGDRGYFFMLRGDNECGIEDSGSA 326

Query: 241 ALP 243
            +P
Sbjct: 327 GVP 329


>gi|146092987|ref|XP_001466605.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
           [Leishmania infantum JPCM5]
 gi|398018677|ref|XP_003862503.1| cysteine peptidase C (CPC) [Leishmania donovani]
 gi|12005276|gb|AAG44365.1| cathepsin B-like cysteine protease [Leishmania donovani]
 gi|134070968|emb|CAM69644.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
           [Leishmania infantum JPCM5]
 gi|322500733|emb|CBZ35810.1| cysteine peptidase C (CPC) [Leishmania donovani]
          Length = 340

 Score = 79.7 bits (195), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 64/235 (27%), Positives = 86/235 (36%), Gaps = 70/235 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  GI +  W+W    G+ T         CQP  F PC+H   +   P C       PKC
Sbjct: 166 CYGGIPTMAWLWWVWVGITT-------EVCQPYPFGPCSHHGNSDKYPPCPNTIYDTPKC 218

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +T C            KY+    Y V  E  ++  E+M NGP+   M +YSD   YKSG 
Sbjct: 219 NTTCEKSEMDL----VKYKGGTSYSVKGE-KELMIELMTNGPLEVTMQVYSDFVGYKSGV 273

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         S +++    VK+VGWG + G P          
Sbjct: 274 YKH------------------------VSGDLLGGHAVKLVGWGTQGGVP---------- 299

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
                                   YW I +++   +GDKG   I RG NE  IES
Sbjct: 300 ------------------------YWKIANSWNTDWGDKGYFLIQRGSNECGIES 330


>gi|343476048|emb|CCD12737.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score = 79.7 bits (195), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 64/242 (26%), Positives = 88/242 (36%), Gaps = 69/242 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   S W +    GL       +++ CQP  FP C H      +P C       PKC
Sbjct: 158 CDGGYPDSAWEYYVSHGL-------ASSYCQPYPFPHCGHHGGKGKKPPCSKYDFHTPKC 210

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +T CT+    +     KYR    Y +     D ++E+  NGP V    +YSD  +YK+G 
Sbjct: 211 NTTCTD----KAIPLIKYRGNDSYVLLHGEDDFKRELYFNGPFVVAFQVYSDFLAYKTGV 266

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         S + +    V+IVGWG+ NG PY         
Sbjct: 267 YRH------------------------VSGDFLGGHAVRIVGWGKLNGTPY--------- 293

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                    W I +++   +G  G   ILRG NE  IES     
Sbjct: 294 -------------------------WKIANSWDTDWGMNGHFLILRGNNECGIESTGYAG 328

Query: 242 LP 243
           LP
Sbjct: 329 LP 330


>gi|157058769|gb|ABV03142.1| cathepsin B-348 [Myzus persicae]
          Length = 246

 Score = 79.7 bits (195), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 54/176 (30%), Positives = 79/176 (44%), Gaps = 28/176 (15%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   + W +   +G+V+GG + S  GC P    PC H    T  P CK      P C
Sbjct: 93  CNGGFPGAAWHYWKTKGIVSGGPYGSKMGCIPYEIAPCEHHVNGTRGP-CKE-GGKTPAC 150

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C  D Y   + QD +R K  Y + ++V  I+QEI  NGPV     +Y          
Sbjct: 151 VKKC-EDGYKVPYAQDLHRGKSAYSLGNDVDQIRQEIYTNGPVEGAFTVY---------- 199

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGR-PYWTIV 176
                         D  +Y++GVY   A   +  +A ++I+GWG +NG  PYW + 
Sbjct: 200 -------------EDFIAYRAGVYKHVAGKALGGHA-IRILGWGVQNGEIPYWLVA 241


>gi|17384033|emb|CAD12394.1| cysteine proteinase [Leishmania infantum]
          Length = 340

 Score = 79.7 bits (195), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 64/235 (27%), Positives = 86/235 (36%), Gaps = 70/235 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  GI +  W+W    G+ T         CQP  F PC+H   +   P C       PKC
Sbjct: 166 CYGGIPTMAWLWWVWVGITT-------EVCQPYPFGPCSHHGNSDKYPPCPNTIYDTPKC 218

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +T C            KY+    Y V  E  ++  E+M NGP+   M +YSD   YKSG 
Sbjct: 219 NTTCEKSEMDL----VKYKGGTSYSVKGE-KELMIELMTNGPLEVTMQVYSDFVGYKSGV 273

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         S +++    VK+VGWG + G P          
Sbjct: 274 YKH------------------------VSGDLLGGHAVKLVGWGTQGGVP---------- 299

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
                                   YW I +++   +GDKG   I RG NE  IES
Sbjct: 300 ------------------------YWKIANSWNTDWGDKGYFLIQRGSNECGIES 330


>gi|166030320|gb|ABY78827.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score = 79.7 bits (195), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 58/214 (27%), Positives = 82/214 (38%), Gaps = 62/214 (28%)

Query: 31  CQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDE 90
           CQP  FP C H     ++  C       PKC+  CT+    +     KYR    Y +   
Sbjct: 180 CQPYPFPHCEHRGAQGNKTPCSKYNFDTPKCNATCTD----KSIPLVKYRGNATYLLLHG 235

Query: 91  VADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSAS 150
             D ++E+  NGP VA  ++Y+D+F+YKSG Y N                          
Sbjct: 236 EEDYKRELYFNGPFVAVFFVYTDLFAYKSGVYRN------------------------VD 271

Query: 151 AEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIV 210
            +I+    V+IVGWG+ NG P                                  YW + 
Sbjct: 272 GDILGGQAVRIVGWGKLNGTP----------------------------------YWKVA 297

Query: 211 STFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
           +T+   +G  G + ILRG NE  IE L     P+
Sbjct: 298 NTWDTDWGMDGYLLILRGNNECNIEHLGFAGTPE 331


>gi|119638992|gb|ABL85238.1| cysteine proteinase 4 [Necator americanus]
          Length = 339

 Score = 79.7 bits (195), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 68/238 (28%), Positives = 92/238 (38%), Gaps = 64/238 (26%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     + ++   G+ +GG +     C+P  F PC+  NY    P  K  A   PKC
Sbjct: 158 CEGGYPIQAYFYLENTGVCSGGEYREKNVCKPYPFYPCD-GNYG---PCPKEGAFDTPKC 213

Query: 62  HTRCTNDNYGRGFFQDKYRFKR-YYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
              C    Y   + +DK   K  +  + D  A I+QEI  NGPV AN Y++ D   YK G
Sbjct: 214 RKIC-QFRYPVPYEEDKVFGKNSHILLQDNEARIRQEIFINGPVGANFYVFEDFIHYKEG 272

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
                       +Y   +    GV+A            +K++GWG ENG  YW +   Y 
Sbjct: 273 ------------IYKQTYGKWIGVHA------------IKLIGWGTENGTDYWLVANSYN 308

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                            WGE                    GT +ILRG N  +IES V
Sbjct: 309 YD---------------WGE-------------------NGTFRILRGTNHCLIESQV 332


>gi|115605092|gb|ABJ15785.1| cathepsin B [Bos taurus]
          Length = 118

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 48/121 (39%), Positives = 63/121 (52%), Gaps = 3/121 (2%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S  W +  K+GLV+GG ++S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 1   CNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 58

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   + +DK+     Y V +   +I  EI KNGPV     +YSD   YKSG 
Sbjct: 59  SKTC-EPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGV 117

Query: 122 Y 122
           Y
Sbjct: 118 Y 118


>gi|166030326|gb|ABY78830.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 45/149 (30%), Positives = 67/149 (44%), Gaps = 28/149 (18%)

Query: 27  SNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYW 86
           +++GCQP  FP C H     ++  C       PKC+  CT+    +     KYR    Y 
Sbjct: 176 ASSGCQPYPFPHCEHRGAQGNKTPCSKYKFDTPKCNATCTD----KSIPLVKYRGNATYL 231

Query: 87  VNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYA 146
           +     D ++E+  NGP VA  ++Y+D+F+YKSG Y N                      
Sbjct: 232 LLHGEEDYKRELYFNGPFVAVFFVYTDLFAYKSGVYRN---------------------- 269

Query: 147 VSASAEIVAYATVKIVGWGEENGRPYWTI 175
                + +    V+IVGWG+ NG PYW +
Sbjct: 270 --VDGDFLGGQAVRIVGWGKLNGTPYWKV 296


>gi|12658201|gb|AAK01061.1| cysteine proteinase [Metagonimus yokogawai]
          Length = 179

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 50/168 (29%), Positives = 71/168 (42%), Gaps = 27/168 (16%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +  + GLVTGG+  + +GC+   FP CNH       P C     P P C
Sbjct: 38  CHGGFPPRAWDFWMENGLVTGGSKENPSGCRSYPFPKCNHHGKGPDAP-CPEKIFPTPAC 96

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +  C  D     +  DK + K  Y V +    I +EIM+NGPV A   +Y D   Y+SG 
Sbjct: 97  NKTC--DTPEVNYILDKTKAKSSYNVPNSEKAIMKEIMQNGPVEAAFEVYEDFLHYESGV 154

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENG 169
           Y                          +   ++    ++++GWGEENG
Sbjct: 155 Y------------------------FHSFGRMIGGHAIRMLGWGEENG 178


>gi|343472937|emb|CCD15042.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 60/242 (24%), Positives = 91/242 (37%), Gaps = 69/242 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W++  + G+       +++ CQP  FP C H     ++  C       PKC
Sbjct: 158 CKGGFPGFAWLYYVEYGI-------TSSQCQPYPFPHCEHRGAQGNKTPCSKYKFDTPKC 210

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +  CT+    +     KYR    Y +     D ++E+  NGP VA  ++Y+D+F+YKSG 
Sbjct: 211 NATCTD----KSIPLVKYRGNATYLLLHGEEDYKRELYFNGPFVAVFFVYTDLFAYKSGV 266

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y N                           + +    V+IVGWG+ NG P          
Sbjct: 267 YRN------------------------VDGDFLGGQAVRIVGWGKLNGTP---------- 292

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW + +++   +G  G + ILRG NE  IE L    
Sbjct: 293 ------------------------YWKVANSWDTDWGMNGYMLILRGNNECNIEHLGFTG 328

Query: 242 LP 243
            P
Sbjct: 329 FP 330


>gi|308507719|ref|XP_003116043.1| hypothetical protein CRE_08645 [Caenorhabditis remanei]
 gi|308250987|gb|EFO94939.1| hypothetical protein CRE_08645 [Caenorhabditis remanei]
          Length = 356

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 66/233 (28%), Positives = 92/233 (39%), Gaps = 63/233 (27%)

Query: 18  GLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCT------NDNYG 71
           G VTGG +  + GC+P SF PC++   + + P C      Q KC +  T      + +YG
Sbjct: 156 GAVTGGDYKGD-GCKPYSFAPCSNCVESKTTPSC------QSKCQSTYTVTNYKGDKHYG 208

Query: 72  RGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANM 131
           +   +   R K     +    D     +   P++ N              Y NGPV    
Sbjct: 209 KNEGKVTERHKHLECTSAYRLDTSSNAV---PIIQNEI------------YQNGPVEVAY 253

Query: 132 YLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYA 191
            +Y D + YKSGVY      +   +A                                  
Sbjct: 254 TVYDDFYHYKSGVYHHVTGKDTGGHA---------------------------------- 279

Query: 192 TVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
            VK+IGWG E G  YW + +++G  FGDKG  KI RG NE  IES V   + K
Sbjct: 280 -VKIIGWGTEKGVDYWLVTNSWGTSFGDKGFFKIRRGTNECGIESNVVAGMAK 331


>gi|255040223|gb|ACT99884.1| truncated cathepsin B [Opisthorchis viverrini]
          Length = 313

 Score = 79.0 bits (193), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 54/175 (30%), Positives = 76/175 (43%), Gaps = 27/175 (15%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G  +  W +    G+VTGG+    +GC+   FP C+H +     P C     P P+C
Sbjct: 155 CRGGYPAVAWDYWRTHGIVTGGSKEDPSGCRSYPFPKCDH-HVQGHYPPCPRQIYPTPEC 213

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C  D    G+ +DK R    Y +      I +EIM  GPV A       +F+     
Sbjct: 214 VQDC--DTPELGYLEDKTRANISYNIYASEISIMKEIMLRGPVEA-------VFT----- 259

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
                      +Y D   YKS VY  +  A +  +A ++I+GWGEE   PYW I 
Sbjct: 260 -----------VYEDFLQYKSRVYFHAWGAPMSGHA-IRILGWGEEGDVPYWLIA 302


>gi|166030314|gb|ABY78824.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 335

 Score = 79.0 bits (193), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 63/242 (26%), Positives = 92/242 (38%), Gaps = 70/242 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   + W +    GL       +++ CQP  FP C H      +P C       PKC
Sbjct: 158 CDGGYPGTAWEYYVSHGL-------ASSYCQPYPFPHCGHHGGKGKKPPCSKYDFHTPKC 210

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +T CT+    +     KYR    Y ++ E  D ++E+  NGP V    +YSD  +YK+G 
Sbjct: 211 NTTCTD----KAIPLIKYRGNHSYGLDGE-DDYKRELYFNGPFVVAFQVYSDFLAYKTGV 265

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         S +++    V+IVGWG+ NG PY         
Sbjct: 266 YRH------------------------VSGDVLGGHAVRIVGWGKLNGTPY--------- 292

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                    W I +++   +G  G   ILRG++E  IES     
Sbjct: 293 -------------------------WKIANSWDTDWGMNGHFLILRGKDECGIESEGYAG 327

Query: 242 LP 243
           LP
Sbjct: 328 LP 329


>gi|3087801|emb|CAA93277.1| cysteine proteinase [Haemonchus contortus]
          Length = 344

 Score = 79.0 bits (193), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 56/175 (32%), Positives = 83/175 (47%), Gaps = 27/175 (15%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     + +  ++G VTGG + +   C+P  F PC H    T   EC    +  P+C
Sbjct: 160 CDGGYVIDAFKFFAEQGAVTGGDYGAKDCCRPYPFHPCGHHGNETYYGECPEDGS-TPEC 218

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVN-DEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
             +C  + Y   + +D+ R +  Y +    V  IQ+EIM+NGPVVA   ++ D FS+   
Sbjct: 219 VRKC-QEGYETEYHEDRVRGEDAYRLPIGSVKAIQKEIMRNGPVVAAFIVFDD-FSF--- 273

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
                              Y+ G+YA  A +    +A VKI+GWG E+G PYW I
Sbjct: 274 -------------------YRKGIYAHVAGSPRGGHA-VKIIGWGTEHGVPYWII 308


>gi|340053922|emb|CCC48215.1| cysteine peptidase C (CPC) [Trypanosoma vivax Y486]
          Length = 334

 Score = 79.0 bits (193), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 61/248 (24%), Positives = 90/248 (36%), Gaps = 82/248 (33%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W++  + GLV+         CQP  FPPC H+   +  P C  +    PKC
Sbjct: 159 CDGGYPDEAWLYFTESGLVS-------DYCQPYPFPPCKHSGGRSKNPSCHDMHFHTPKC 211

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFS----- 116
           +  CT+                                K  PVV   Y  S+ +S     
Sbjct: 212 NATCTD--------------------------------KRIPVV--RYFASESYSLQGEE 237

Query: 117 -YKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
            YK   Y  GP      +Y D  +Y+SGVY   +   +  +A                  
Sbjct: 238 DYKRELYLRGPFEVAFTVYEDFLAYESGVYKHVSGGPVGGHA------------------ 279

Query: 176 VRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
                            V+++GWGE NG PYW I +++   +G+ G +   RG++E  IE
Sbjct: 280 -----------------VRVVGWGERNGVPYWKIANSWNTDWGENGYLYFYRGKDECGIE 322

Query: 236 SLVNGALP 243
           S  +   P
Sbjct: 323 SQGSAGTP 330


>gi|254575665|gb|ACT68329.1| cysteine proteinase [Haemonchus contortus]
          Length = 348

 Score = 79.0 bits (193), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 54/205 (26%), Positives = 81/205 (39%), Gaps = 47/205 (22%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCN-HANYTTSEPECKTLATP--Q 58
           C  G ++  W W    G+VTGGA+     C+P  FP C  H     +       ATP  +
Sbjct: 166 CDGGYNARAWKWATIAGVVTGGAYKEKGNCKPYVFPQCGAHKGKAFNNCPSHPYATPARK 225

Query: 59  PKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
           P C        YG+ +  DK + + +YW+ ++   IQ EIM+ GPV A   +Y D   Y 
Sbjct: 226 PYCQY-----GYGKRYENDKIKARTWYWLPNDERTIQLEIMQKGPVHATFNIYEDFEHYN 280

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
            G Y                        +  +  +    ++KI+GWG + G  YW I   
Sbjct: 281 GGVY------------------------IHTAGAMEGGHSIKIIGWGVDKGVKYWLIANS 316

Query: 179 YAVSASAEIVAYATVKLIGWGEENG 203
           ++                 WGE+ G
Sbjct: 317 WSTD---------------WGEDGG 326


>gi|157058763|gb|ABV03139.1| cathepsin B-348 [Acyrthosiphon pisum]
          Length = 248

 Score = 79.0 bits (193), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 53/176 (30%), Positives = 79/176 (44%), Gaps = 28/176 (15%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   + W +   +G+V+GG + SN GC P    PC H    T  P CK      P C
Sbjct: 95  CNGGFPGAAWNYWKTKGIVSGGPYGSNMGCIPYEIAPCEHHVNGTRGP-CKE-GGKTPTC 152

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C  + Y   + QD +  K  Y + ++V  I+QEI  NGPV     +Y          
Sbjct: 153 VKKC-EEGYKVPYAQDLHHGKSAYSIRNDVDQIRQEIYTNGPVEGAFTVY---------- 201

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGR-PYWTIV 176
                         D  +Y++GVY   A   +  +A ++I+GWG +NG  PYW + 
Sbjct: 202 -------------EDFIAYRAGVYKHVAGKALGGHA-IRILGWGVQNGEIPYWLVA 243


>gi|10803450|emb|CAB97364.2| putative cathepsin B.1 [Ostertagia ostertagi]
          Length = 199

 Score = 78.6 bits (192), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 58/179 (32%), Positives = 80/179 (44%), Gaps = 36/179 (20%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEP---ECKTLATPQ 58
           C  G S   W +  ++G+VTGG +++   C+P    PC    Y   EP   EC  LA   
Sbjct: 43  CQGGWSIRAWYYFAEQGVVTGGNYNTKGSCRPYEIHPCG---YHKDEPYYGECDDLAD-T 98

Query: 59  PKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
           P+C  RC    Y + +  DK+  +  Y +   V  IQ+EIM+NGPVVA   +Y D   YK
Sbjct: 99  PRCKRRC-QLGYPKSYPSDKHYGRTAYQLPMSVESIQREIMRNGPVVAGFTVYEDFAHYK 157

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGR----PYW 173
            G            +Y      K+G +A            VK++GWG E       PYW
Sbjct: 158 GG------------IYKHTSGKKTGGHA------------VKVIGWGSEQKGSEKIPYW 192


>gi|91088083|ref|XP_968689.1| PREDICTED: similar to AGAP004533-PA [Tribolium castaneum]
          Length = 360

 Score = 78.6 bits (192), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 54/172 (31%), Positives = 72/172 (41%), Gaps = 38/172 (22%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G +   W +    GLV+GG ++++TGCQP S       NY              P C
Sbjct: 142 CKGGYTYYAWNYFMLTGLVSGGDYNTSTGCQPYS-----ELNYYRI----------TPPC 186

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +T C ND Y   +  DK+     Y++      IQ EI+                      
Sbjct: 187 NTTCQNDKYPIPYVSDKHFGDSIYYIPQNETAIQNEILS--------------------- 225

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYW 173
            G GPVVA   +Y D   Y+ GVY +  S  +     VKI+GWG ENG  YW
Sbjct: 226 -GGGPVVAAFDVYGDFKIYRDGVY-IYTSGALFGRTAVKIIGWGTENGWAYW 275


>gi|10803437|emb|CAC13131.1| putative cathepsin B.5 [Ostertagia ostertagi]
          Length = 196

 Score = 78.6 bits (192), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 54/170 (31%), Positives = 71/170 (41%), Gaps = 25/170 (14%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +  K G+ TGG++ S +GC+P   PPC H    T    C T     P C
Sbjct: 44  CEGGYPIEAWKYWVKTGICTGGSYESQSGCKPYPIPPCGHHKNQTYFGPCPTDEYDTPVC 103

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C    Y   +  DK+     Y V   VA IQ+EIM NGPV                 
Sbjct: 104 TNKCIA-AYKTPYSDDKHYGTSAYNVAKTVAGIQKEIMTNGPV----------------- 145

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRP 171
                  A   +Y D + Y  GVY  +  AE+  +A V+I+GWG     P
Sbjct: 146 ------EAAYTVYEDFYQYTGGVYTHTGGAEVGGHA-VRILGWGVRQQDP 188


>gi|294952601|ref|XP_002787371.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239902343|gb|EER19167.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 744

 Score = 78.6 bits (192), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 49/138 (35%), Positives = 64/138 (46%), Gaps = 26/138 (18%)

Query: 30  GCQPVSFPPCNHANYTTSE-PECKTLA-TPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWV 87
           GC P  F  CNH     +E P+CK  A  P P C T CTN  Y R   +D +R K +  V
Sbjct: 494 GCWPYPFQKCNHVPTEKTEYPKCKDAAHPPLPPCRTTCTNKAYKRSLKKDVHRAKGWRKV 553

Query: 88  NDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAV 147
            +    ++QEI  NGPV +   +Y D F Y                      YKSGVY V
Sbjct: 554 LNNAQSVKQEIFDNGPVFSAFKMYED-FRY----------------------YKSGVY-V 589

Query: 148 SASAEIVAYATVKIVGWG 165
             + E  ++  +KI+GWG
Sbjct: 590 PTTEEFHSFHLIKIIGWG 607


>gi|390994429|gb|AFM37364.1| cathepsin B1 [Dictyocaulus viviparus]
          Length = 350

 Score = 78.6 bits (192), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 60/242 (24%), Positives = 89/242 (36%), Gaps = 58/242 (23%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +    G+ TGG +     C+P +F PC H        EC     P P+C
Sbjct: 165 CRGGYPIEAWRYFMLHGVCTGGHYAEKDVCKPYAFHPCGHHRNEIYYGECPKEIFPTPQC 224

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   +  DK   K  Y + +    IQ+EIM NGPV A   +Y D   Y+SG 
Sbjct: 225 TQSC-QAGYASDYEDDKIYGKSAYALPNNEKAIQREIMTNGPVQAAFMVYEDFSRYRSG- 282

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                      +Y      + G +A            VK++GWG                
Sbjct: 283 -----------IYVHTAGRREGGHA------------VKLIGWGV--------------- 304

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                             +++G  YW   +++   +G+ G  +I+RG +   IES V   
Sbjct: 305 ------------------DDDGNKYWLAANSWNSDWGENGYFRIVRGVDHCGIESAVVAG 346

Query: 242 LP 243
           +P
Sbjct: 347 MP 348


>gi|17559066|ref|NP_506790.1| Protein CPR-3 [Caenorhabditis elegans]
 gi|1169083|sp|P43507.1|CPR3_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 3; AltName:
           Full=Cysteine protease-related 3; Flags: Precursor
 gi|675494|gb|AAA98788.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|675496|gb|AAA98782.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
 gi|14530554|emb|CAB61032.2| Protein CPR-3 [Caenorhabditis elegans]
          Length = 370

 Score = 78.6 bits (192), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 79/257 (30%), Positives = 100/257 (38%), Gaps = 71/257 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G S     +    G VTGG +  + GC P SF PC     T + PE  T     P C
Sbjct: 162 CKGGYSIEALRFWASSGAVTGGDYGGH-GCMPYSFAPC-----TKNCPESTT-----PSC 210

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVN--DEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
            T C +      + +DK+     Y V     V +IQ EI   GPV A         SYK 
Sbjct: 211 KTTCQSSYKTEEYKKDKHYGASAYKVTTTKSVTEIQTEIYHYGPVEA---------SYK- 260

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
                        +Y D + YKSGVY  + S ++V    VKI+GWG ENG  Y       
Sbjct: 261 -------------VYEDFYHYKSGVYHYT-SGKLVGGHAVKIIGWGVENGVDY------- 299

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
                                      W I +++G  FG+KG  KI RG NE  IE  V 
Sbjct: 300 ---------------------------WLIANSWGTSFGEKGFFKIRRGTNECQIEGNVV 332

Query: 240 GALPKDNYGVEFGEESG 256
             + K     E  E+ G
Sbjct: 333 AGIAKLGTHSETYEDDG 349


>gi|343474137|emb|CCD14154.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 337

 Score = 78.2 bits (191), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 60/243 (24%), Positives = 91/243 (37%), Gaps = 69/243 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +  + G+       +++ CQP  FP C H     ++  C       PKC
Sbjct: 159 CKGGFPGFAWRYYVEYGI-------TSSSCQPYPFPRCEHQGAQGNKTPCSKYNFDTPKC 211

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +  CT+    +     KYR    Y +     D ++E+  NGP VA  Y+Y+D+F+YKSG 
Sbjct: 212 NATCTD----KAIPLIKYRGNATYLLLHGEEDYKRELYFNGPFVAVFYVYTDLFAYKSGV 267

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                           + +    VK+VGWG+ NG P          
Sbjct: 268 YRH------------------------VDGDFLGGTAVKVVGWGKLNGTP---------- 293

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW + +++   +G  G + ILRG NE  IE L    
Sbjct: 294 ------------------------YWKLANSWDTDWGMGGYLLILRGNNECNIEHLGFAG 329

Query: 242 LPK 244
            P+
Sbjct: 330 TPE 332


>gi|281208776|gb|EFA82951.1| peptidase C1A family protein [Polysphondylium pallidum PN500]
          Length = 1308

 Score = 78.2 bits (191), Expect = 4e-12,   Method: Composition-based stats.
 Identities = 54/172 (31%), Positives = 74/172 (43%), Gaps = 39/172 (22%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   + + +V K G+VT       + CQP + P C  A     +  C       P C
Sbjct: 136 CEGGDPYTAYKYVQKNGVVT-------SNCQPYTIPTCPPA-----QQPCMNFVN-TPPC 182

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C N +    F QD +  K  Y V   VA IQ EI+ NGPV A   +Y D   YKSG 
Sbjct: 183 SAKCANSSVN--FQQDLHHLKTVYAVKPNVAAIQNEIVTNGPVEACFEVYEDFLGYKSG- 239

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYW 173
                          ++++KSG        + +    +KIVG+G  NG PYW
Sbjct: 240 ---------------VYTHKSG--------KDLGGHCIKIVGFGVSNGTPYW 268


>gi|343474132|emb|CCD14149.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 337

 Score = 78.2 bits (191), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 60/243 (24%), Positives = 91/243 (37%), Gaps = 69/243 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +  + G+       +++ CQP  FP C H     ++  C       PKC
Sbjct: 159 CKGGFPGFAWRYYVEYGI-------TSSSCQPYPFPRCEHQGAQGNKTPCSKYNFDTPKC 211

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +  CT+    +     KYR    Y +     D ++E+  NGP VA  Y+Y+D+F+YKSG 
Sbjct: 212 NATCTD----KAIPLIKYRGNATYLLLHGEEDYKRELYFNGPFVAVFYVYTDLFAYKSGV 267

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                           + +    VK+VGWG+ NG P          
Sbjct: 268 YRH------------------------VDGDFLGGTAVKVVGWGKLNGTP---------- 293

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW + +++   +G  G + ILRG NE  IE L    
Sbjct: 294 ------------------------YWKLANSWDTDWGMGGYLLILRGNNECNIEHLGFAG 329

Query: 242 LPK 244
            P+
Sbjct: 330 TPE 332


>gi|166030324|gb|ABY78829.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score = 78.2 bits (191), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 57/214 (26%), Positives = 81/214 (37%), Gaps = 62/214 (28%)

Query: 31  CQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDE 90
           CQP  FP C H     ++  C       PKC+  CT+    +     KYR    Y +   
Sbjct: 180 CQPYPFPHCEHRGAQGNKTPCSKYNFDTPKCNATCTD----KSIPLVKYRGNATYLLLHG 235

Query: 91  VADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSAS 150
             D ++E+  NGP VA  Y+Y+D+F+YKSG Y +                          
Sbjct: 236 EEDYKRELYFNGPFVAVFYVYTDLFAYKSGVYRH------------------------VD 271

Query: 151 AEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIV 210
            + +    VK+VGWG+ NG P                                  YW + 
Sbjct: 272 GDFLGGTAVKVVGWGKLNGTP----------------------------------YWKVA 297

Query: 211 STFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
           +T+   +G  G + ILRG NE  IE L     P+
Sbjct: 298 NTWDTDWGMDGYLLILRGNNECNIEHLGFAGTPE 331


>gi|166030330|gb|ABY78832.1| cathepsin B-like protease [Trypanosoma congolense]
 gi|343476577|emb|CCD12360.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 337

 Score = 78.2 bits (191), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 60/243 (24%), Positives = 91/243 (37%), Gaps = 69/243 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +  + G+       +++ CQP  FP C H     ++  C       PKC
Sbjct: 159 CKGGFPGFAWRYYVEYGI-------TSSSCQPYPFPRCEHQGAQGNKTPCSKYNFDTPKC 211

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +  CT+    +     KYR    Y +     D ++E+  NGP VA  Y+Y+D+F+YKSG 
Sbjct: 212 NATCTD----KAIPLIKYRGNATYLLLHGEEDYKRELYFNGPFVAVFYVYTDLFAYKSGV 267

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                           + +    VK+VGWG+ NG P          
Sbjct: 268 YRH------------------------VDGDFLGGTAVKVVGWGKLNGTP---------- 293

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW + +++   +G  G + ILRG NE  IE L    
Sbjct: 294 ------------------------YWKLANSWDTDWGMGGYLLILRGNNECNIEHLGFAG 329

Query: 242 LPK 244
            P+
Sbjct: 330 TPE 332


>gi|339242629|ref|XP_003377240.1| Gut-specific cysteine proteinase [Trichinella spiralis]
 gi|316973974|gb|EFV57515.1| Gut-specific cysteine proteinase [Trichinella spiralis]
          Length = 325

 Score = 78.2 bits (191), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 64/243 (26%), Positives = 95/243 (39%), Gaps = 71/243 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     +++   RG+ TGG + S  GC+P S         + SE E +T     P C
Sbjct: 153 CDGGYPDKAFIYWATRGIPTGGPYGSTKGCKPYSIG-------SNSEDEAET-----PLC 200

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C N+ Y     QD++  ++ YWVN     I QE+ KNGPVV    +Y D        
Sbjct: 201 TRQCINE-YPYNLSQDRHFGEKPYWVNSNEEQIMQELYKNGPVVVAFNVYEDF------- 252

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                    MY    ++ ++ G        + +    VK++GWG EN + YW I   +  
Sbjct: 253 ---------MYYIKGVYEHRFG--------KFLGGHAVKLIGWGIENSKKYWLISNSWNT 295

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
           +               WGE                    G  KI+RG+N   IES V   
Sbjct: 296 T---------------WGE-------------------NGFFKIIRGKNCCAIESYVVAG 321

Query: 242 LPK 244
           + +
Sbjct: 322 MAR 324


>gi|294888035|ref|XP_002772321.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
 gi|239876433|gb|EER04137.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
          Length = 200

 Score = 77.8 bits (190), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 56/184 (30%), Positives = 81/184 (44%), Gaps = 47/184 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAH------HSNTGCQPVSFPPCNHANYTTSEPECKTLA 55
           C  G   S W WVH +G+ TGG +        + GC P  FPPC H    T  P+C    
Sbjct: 47  CGGGDPYSAWSWVHDKGIATGGDYVAKDDMTKDDGCWPYDFPPCAHHINDTKYPKC---- 102

Query: 56  TPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIF 115
              PK    C+ D+  R F  +   +  +Y VN    D +  I  +GPV A+  +Y D  
Sbjct: 103 ---PK--VSCSGDD--RHFMLESSPY--HYSVN----DAKNAIRTDGPVSASFTVYEDFL 149

Query: 116 SYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
           +Y+SG                ++ + SG Y        +    VKI+GWGE++G+ YW  
Sbjct: 150 AYRSG----------------VYKHTSGSY--------LGGHAVKIIGWGEKSGQAYWLA 185

Query: 176 VRVY 179
           V  +
Sbjct: 186 VNSW 189


>gi|21930117|gb|AAM82155.1| cysteine proteinase [Ancylostoma ceylanicum]
          Length = 348

 Score = 77.8 bits (190), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 65/242 (26%), Positives = 93/242 (38%), Gaps = 61/242 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPK 60
           C  G  +  + +  + GL TGG +     CQP +F PC NHA+     P C     P P 
Sbjct: 164 CKGGYPARAFGYAWRYGLSTGGPYGEKDACQPYAFYPCGNHAHEPYYGP-CPDELWPTPT 222

Query: 61  CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           C   C    Y   F +DK    + Y++     +I+ EIM  GPVVA   +Y D F Y   
Sbjct: 223 CRRTC-QLGYPIPFEKDKIFNDQTYYIFGNETEIKYEIMTRGPVVATYKVYRD-FDY--- 277

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
                              YK GVY +    E+     VKI+GWG+ N  P         
Sbjct: 278 -------------------YKKGVY-IHREGEVTGLHAVKIIGWGKGNDVP--------- 308

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                                    YW + +++   +GD G  +I+RG +   IE  + G
Sbjct: 309 -------------------------YWLVANSWNTDWGDNGYFRIVRGTDNCEIERQMVG 343

Query: 241 AL 242
            +
Sbjct: 344 GI 345


>gi|193610664|ref|XP_001948185.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
          Length = 324

 Score = 77.8 bits (190), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 54/183 (29%), Positives = 77/183 (42%), Gaps = 44/183 (24%)

Query: 1   VCSSGISSST------WVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTL 54
           +  SGI S+       W +  K+GLV+GG +++N GCQP   PP                
Sbjct: 144 ISCSGIKSNAMADDQAWKFFKKQGLVSGGKYNTNDGCQPSKIPP--------------IF 189

Query: 55  ATPQPKCHTRCTNDNYGRGFFQDKY-RFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSD 113
             P+   +  C N  YG       +   K  Y  +    +IQ+E+   GPV A   LY D
Sbjct: 190 NLPKKIYNRTCDNFCYGNSLIDYNHDHVKVSYTYHVLYKNIQREVQTYGPVSAYFSLYDD 249

Query: 114 IFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYW 173
                                  +F Y SGVYA +  ++ V Y + K++GWG ENG  YW
Sbjct: 250 -----------------------LFLYTSGVYARTEKSKFVRYQSAKLIGWGVENGVDYW 286

Query: 174 TIV 176
            +V
Sbjct: 287 LLV 289


>gi|270012756|gb|EFA09204.1| cathepsin B precursor [Tribolium castaneum]
          Length = 369

 Score = 77.4 bits (189), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 66/243 (27%), Positives = 96/243 (39%), Gaps = 65/243 (26%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G +   W +    GLV+GG ++++TGCQP S    N+   T             P C
Sbjct: 142 CKGGYTYYAWNYFMLTGLVSGGDYNTSTGCQPYS--ELNYYRIT-------------PPC 186

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +T C ND Y   +  DK+     Y++      IQ EI+  G                   
Sbjct: 187 NTTCQNDKYPIPYVSDKHFGDSIYYIPQNETAIQNEILSGG------------------- 227

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
              GPVVA   +Y D   Y+                          +G  + TI+    +
Sbjct: 228 ---GPVVAAFDVYGDFKIYR--------------------------DGEQHDTILEGVYI 258

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGT-IKILRGRNEA-IIESLVN 239
             S  +     VK+IGWG ENG  YW   +++G+ +G  G   KI RG NE    ES++ 
Sbjct: 259 YTSGALFGRTAVKIIGWGTENGWAYWLAANSWGKDWGALGGFFKIRRGTNECGFEESIIA 318

Query: 240 GAL 242
           G +
Sbjct: 319 GQV 321


>gi|157058765|gb|ABV03140.1| cathepsin B-348 [Aulacorthum solani]
          Length = 237

 Score = 77.4 bits (189), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 52/168 (30%), Positives = 77/168 (45%), Gaps = 27/168 (16%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   + W +   +G+V+GG + SN GC P    PC H    T  P CK      PKC
Sbjct: 97  CNGGFPGAAWNYWKTKGIVSGGPYGSNMGCIPYEVAPCEHHVNGTRGP-CKE-GGKTPKC 154

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C  D Y   + QD +  K  Y ++++V  I+QEI  NGPV     +Y          
Sbjct: 155 VKKC-EDGYKVPYAQDLHHGKSAYSLSNDVDQIRQEIYTNGPVEGAFTVY---------- 203

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENG 169
                         D  +Y++GVY   A   +  +A ++I+GWG +NG
Sbjct: 204 -------------EDFIAYRAGVYKHVAGKALGGHA-IRILGWGVQNG 237


>gi|294891865|ref|XP_002773777.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239878981|gb|EER05593.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 156

 Score = 77.4 bits (189), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 53/198 (26%), Positives = 81/198 (40%), Gaps = 59/198 (29%)

Query: 34  VSFPPCNHANYTTSE-PECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVA 92
           + F   NHA+   S+ P+C + A  QP C T C N++Y     QD +R K +  +     
Sbjct: 5   IQFIXXNHASSAASQYPKCPSEALSQPACQTECINESYKTSLQQDLHRAKSWGRLPTSPQ 64

Query: 93  DIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAE 152
            I+QEI                       + NG V+  + +Y D   YKSGVY       
Sbjct: 65  KIKQEI-----------------------FDNGTVLGVISMYEDFRLYKSGVY------- 94

Query: 153 IVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVST 212
                                       V  +  +V   ++K+IGWG E+G+ YW  V++
Sbjct: 95  ----------------------------VHTTGGLVGVHSLKIIGWGVESGQDYWLAVNS 126

Query: 213 FGEQFGDKGTIKILRGRN 230
           + E++GD G IK+  G  
Sbjct: 127 WNEEWGDHGMIKLAVGET 144


>gi|56758130|gb|AAW27205.1| unknown [Schistosoma japonicum]
          Length = 279

 Score = 77.4 bits (189), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 46/119 (38%), Positives = 58/119 (48%), Gaps = 2/119 (1%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +  KRG+VTGG+  ++TGCQP  FP C H +     P C T     P+C
Sbjct: 159 CQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEH-HTKGKYPACGTKIYKTPQC 217

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
              C    Y   + QDK+     Y V      IQ+EIM  GPV A   +Y D  +YKSG
Sbjct: 218 KQTCQK-GYKTPYEQDKHYGDESYNVISNEKAIQREIMMYGPVEAAFDVYEDFLNYKSG 275


>gi|193606095|ref|XP_001951499.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
           [Acyrthosiphon pisum]
          Length = 330

 Score = 77.4 bits (189), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 62/237 (26%), Positives = 89/237 (37%), Gaps = 69/237 (29%)

Query: 8   SSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 67
              W ++   GLV+GG +++N GCQP   PP    N  T              C  RC  
Sbjct: 161 DDVWEYLKSHGLVSGGKYNTNDGCQPSKIPPI--GNIPTH--------LYNHTCEERCYG 210

Query: 68  DNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPV 127
           +N    ++ D  +   YY +     DIQ+E+                 +Y       GPV
Sbjct: 211 NNTIH-YYHDHVKVSHYYNIKSN-EDIQKEVQ----------------TY-------GPV 245

Query: 128 VANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEI 187
                +Y D F YKSGVY  +  +  V     K++GWG ENG  Y               
Sbjct: 246 SVKFRVYDDFFLYKSGVYVKTEKSLYVRRHFAKLIGWGVENGVDY--------------- 290

Query: 188 VAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
                              W +V+++G ++G  G  KI RG NE  +E  V    P+
Sbjct: 291 -------------------WLLVNSWGNEWGQNGLFKIKRGTNEVHVEDYVYAGEPE 328


>gi|91078960|ref|XP_974244.1| PREDICTED: similar to putative cathepsin B-like proteinase
           [Tribolium castaneum]
 gi|270004840|gb|EFA01288.1| cathepsin B precursor [Tribolium castaneum]
          Length = 319

 Score = 77.0 bits (188), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 66/243 (27%), Positives = 100/243 (41%), Gaps = 73/243 (30%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           CS G   + + +  K+G+V+GG  +SN GC+P            T++   K +    P C
Sbjct: 149 CSGGYMMAAFDFYIKQGVVSGGDLNSNEGCRPY-----------TADAHDKGVT---PSC 194

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    Y   +  DK+   + Y V+  V++IQ EIM NGP++ +  +Y D ++Y SG 
Sbjct: 195 TKSCRK-GYPTSYSSDKHYGSKDYIVDAGVSNIQYEIMTNGPIIVSFKVYQDFYNYGSG- 252

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          ++ + SG Y             VKIVGWG E  + Y         
Sbjct: 253 ---------------VYHHVSGNY--------TGNHIVKIVGWGTEKEQDY--------- 280

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                    W I +++G  +G+ G  KILRG+NE  IE+     
Sbjct: 281 -------------------------WLIANSWGSSWGEHGFFKILRGKNECGIENNPYAV 315

Query: 242 LPK 244
           LPK
Sbjct: 316 LPK 318


>gi|294879717|ref|XP_002768767.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239871616|gb|EER01485.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 157

 Score = 77.0 bits (188), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 57/211 (27%), Positives = 87/211 (41%), Gaps = 59/211 (27%)

Query: 30  GCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVND 89
           GC P  FPPC H    T  P+C     P P C  +C N  Y      D++          
Sbjct: 2   GCWPYDFPPCAHHINDTKYPKCPKGLYPTPNCVEQCHNPKYTTTLRDDRHF--------- 52

Query: 90  EVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSA 149
                   ++++ P     Y YS +   K+    +GPV A+  +Y D  +Y+SGVY  ++
Sbjct: 53  --------MLESSP-----YHYS-VNDAKNAIRTDGPVSASFTVYEDFLAYRSGVYKHTS 98

Query: 150 SAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTI 209
            + +  +A                                   VK+IGWGE++G+ YW  
Sbjct: 99  GSYLGGHA-----------------------------------VKIIGWGEKSGQAYWLA 123

Query: 210 VSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
           V+++ E +GD G  KI  G N  I + L+ G
Sbjct: 124 VNSWNEDWGDHGLFKIALG-NCGIDDDLLGG 153


>gi|119638954|gb|ABL85236.1| cysteine proteinase 2 [Necator americanus]
          Length = 347

 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 61/239 (25%), Positives = 97/239 (40%), Gaps = 60/239 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+SG+    + +  ++G+ +GG + +   C+P  F PC +  +      C     P P C
Sbjct: 164 CTSGVPRQAFNYAIRKGVCSGGPYGTKGVCKPYPFYPCGYHAHLPYYGPCPDGMWPTPTC 223

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C +D      +   Y   R +     V   +++I +            +IF+     
Sbjct: 224 EKACQSD------YTVPYNDDRIFGSKTIVLTGEEKIKR------------EIFN----- 260

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
             NGP+VA   +Y D   YK+G+Y               + G G   G            
Sbjct: 261 --NGPLVATYTVYEDFAYYKNGIY---------------MTGLGRATG------------ 291

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                  A+A VK+IGWGEENG  YW I +++   +G+ G  ++LRG N   IE    G
Sbjct: 292 -------AHA-VKIIGWGEENGVKYWLIANSWNTDWGENGFFRMLRGTNLCDIELSATG 342


>gi|327239610|gb|AEA39649.1| cathepsin B [Epinephelus coioides]
          Length = 171

 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 43/121 (35%), Positives = 61/121 (50%), Gaps = 2/121 (1%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S+ W +    GLV+GG + S+ GC+P + PPC H +   + P C       P+C
Sbjct: 44  CNGGYPSAAWDFWTDVGLVSGGLYDSHVGCRPYTIPPCEH-HVNGTRPPCTGEGGDTPQC 102

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C +  Y   +  DK+  K  Y V  +   IQ EI KNGPV     +Y D   YK+G 
Sbjct: 103 ILQCES-GYTPSYKADKHYGKSSYSVPSDEEQIQSEIYKNGPVEGAFTVYEDFLLYKTGV 161

Query: 122 Y 122
           Y
Sbjct: 162 Y 162


>gi|728602|emb|CAA88490.1| cathepsin B-like enzyme [Leishmania mexicana]
 gi|1586011|prf||2202319A cathepsin B-like Cys protease
          Length = 340

 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 63/235 (26%), Positives = 90/235 (38%), Gaps = 70/235 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  GI +  W+W    G+ T         CQP  F PC+H   ++  P C       PKC
Sbjct: 166 CYGGIPAMAWLWWVWVGVTT-------ELCQPYPFGPCSHHGNSSKYPPCPNTIYNTPKC 218

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +T C  DN        KY+    Y +  E  ++  E+M NGP+   M +Y+D  +YKSG 
Sbjct: 219 NTTC--DNVEMELV--KYKGVSSYSIKGE-RELDHELMNNGPLEVAMQVYADFVAYKSGV 273

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         S + +    VK+VGWG ++G P          
Sbjct: 274 YKH------------------------VSGDHLGGHAVKLVGWGVKDGIP---------- 299

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
                                   YW I +++   +GDKG   I RG +E  IES
Sbjct: 300 ------------------------YWKIANSWNTDWGDKGYFLIQRGNDECGIES 330


>gi|239790303|dbj|BAH71722.1| ACYPI001175 [Acyrthosiphon pisum]
          Length = 330

 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 63/237 (26%), Positives = 88/237 (37%), Gaps = 69/237 (29%)

Query: 8   SSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 67
              W ++   GLV+GG +++N GCQP   PP    N  T              C  RC  
Sbjct: 161 DDVWEYLKSHGLVSGGKYNTNDGCQPSKIPPI--GNIPTH--------LYNHTCEERCYG 210

Query: 68  DNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPV 127
           +N    ++ D  +   YY +     DIQ+E+                 +Y       GPV
Sbjct: 211 NNTIH-YYHDHVKVSHYYNIKSN-EDIQKEVQ----------------TY-------GPV 245

Query: 128 VANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEI 187
                +Y D F YKSGVY  +  +  V     K++GWG ENG  YW +V           
Sbjct: 246 SVKFRVYDDFFLYKSGVYVKTEKSLYVRRHFAKLIGWGVENGVDYWLLVN---------- 295

Query: 188 VAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
                             +W      G ++G  G  KI RG NE  +E  V    P+
Sbjct: 296 ------------------FW------GNEWGQNGLFKIKRGTNEVHVEDYVYAGEPE 328


>gi|328701234|ref|XP_001948885.2| PREDICTED: cathepsin B-like cysteine proteinase 5-like
           [Acyrthosiphon pisum]
          Length = 326

 Score = 76.3 bits (186), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 63/236 (26%), Positives = 94/236 (39%), Gaps = 78/236 (33%)

Query: 11  WVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTR-CTNDN 69
           W +    GLV+GG +++N GCQP   P               T+   Q K + R C    
Sbjct: 164 WEYFKTHGLVSGGKYNTNEGCQPSKVP---------------TVYNSQTKIYKRTCVEYC 208

Query: 70  YGR---GFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGP 126
           YG+    +  D  +   +Y++   + DIQ+E+   GPV       S  F           
Sbjct: 209 YGKDTINYNHDHVKVSNHYFI--RIKDIQKEVQTYGPV-------SVFFD---------- 249

Query: 127 VVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAE 186
                 L+ D+F YKSGVYA +  ++   Y   K++GWG ENG  Y              
Sbjct: 250 ------LHDDLFLYKSGVYAKTEKSKDKRYHHAKLIGWGVENGVDY-------------- 289

Query: 187 IVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 242
                               W +V+++G ++G  G  KI RG +E  +ES V   L
Sbjct: 290 --------------------WLLVNSWGYEWGQNGLFKIKRGTDECSVESHVYAGL 325


>gi|10803454|emb|CAB97366.2| putative cathepsin B.3 [Ostertagia ostertagi]
          Length = 196

 Score = 76.3 bits (186), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 51/175 (29%), Positives = 73/175 (41%), Gaps = 25/175 (14%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G S+  W++    G+ +GG +     C+P +F PC +    T   EC       P C
Sbjct: 44  CNGGYSARAWLYARNSGVCSGGRYQEKGVCKPYTFHPCGYHKNQTYYGECPKHTYQTPAC 103

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C    YG+ + +DK      Y V+ + A I+ EI   GPV A+   Y          
Sbjct: 104 KKYCQY-GYGKRYEKDKIYAXDAYRVSSDEAAIRAEIFARGPVQASFATY---------- 152

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
                         D   YKSG+Y  +A      +A VKI+GWG ENG   W + 
Sbjct: 153 -------------EDFAHYKSGIYVHTAGKRRGGHA-VKIIGWGVENGTKXWIVA 193


>gi|242014495|ref|XP_002427925.1| tubulointerstitial nephritis antigen, putative [Pediculus humanus
           corporis]
 gi|212512409|gb|EEB15187.1| tubulointerstitial nephritis antigen, putative [Pediculus humanus
           corporis]
          Length = 473

 Score = 76.3 bits (186), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 72/247 (29%), Positives = 97/247 (39%), Gaps = 71/247 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G  S  W ++ K GLV       +  C P          +T +  +CK    P    
Sbjct: 256 CQGGHLSRAWTFIRKFGLV-------DDYCYP----------WTGTPTKCKIPKRPNFDA 298

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
            +     + G     + YR    Y + DE  DI +EIM++GPV A M +Y D FSYKSG 
Sbjct: 299 LSSICPPSLGSNLRSELYRVGPAYKIQDE-KDIMEEIMQSGPVQATMKVYQDFFSYKSGV 357

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y      +N    S  F Y S                VKI+GWGEE          +Y  
Sbjct: 358 Y----TKSNTERESSNFGYHS----------------VKILGWGEE--------TNIY-- 387

Query: 182 SASAEIVAYATVKLIGWGEENGRP--YWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
                                G+P  YW   +++G+Q+G+ G  KI RG NE  IE  V 
Sbjct: 388 ---------------------GQPIKYWLAANSWGQQWGENGFFKIRRGTNECEIEEFVL 426

Query: 240 GALPKDN 246
            A  + N
Sbjct: 427 AAWAETN 433


>gi|56755425|gb|AAW25892.1| unknown [Schistosoma japonicum]
          Length = 226

 Score = 76.3 bits (186), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 52/169 (30%), Positives = 73/169 (43%), Gaps = 26/169 (15%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +    G+VTGG+  ++TGCQP  FP C H +     P C       P+C
Sbjct: 81  CDGGFPGPAWDYWVSHGIVTGGSKENHTGCQPYPFPKCEHHS-IGKYPSCGDKIYKTPQC 139

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C    Y   +  DK+       V    + IQ+EIM  GPV A + ++ D  +YKSG 
Sbjct: 140 KRKC-QKGYTTPYEHDKHYGGISINVIKNESAIQKEIMMYGPVEAYLLIFEDFLNYKSG- 197

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGR 170
                          I+ Y +G +        V    V+I+GWG EN R
Sbjct: 198 ---------------IYRYTTGSF--------VGEHYVRIIGWGIENER 223


>gi|197725747|gb|ACH73069.1| cathepsin B precursor [Epinephelus coioides]
          Length = 333

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 43/121 (35%), Positives = 61/121 (50%), Gaps = 2/121 (1%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S+ W +    GLV+GG + S+ GC+P + PPC H +   + P C       P+C
Sbjct: 148 CNGGYPSAAWDFWTDVGLVSGGLYDSHVGCRPYTIPPCEH-HVNGTRPPCTGEGGDTPQC 206

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C +  Y   +  DK+  K  Y V  +   IQ EI KNGPV     +Y D   YK+G 
Sbjct: 207 ILQCES-GYTPSYKADKHYGKSSYSVPSDEEQIQSEIYKNGPVEGAFTVYEDFLLYKTGV 265

Query: 122 Y 122
           Y
Sbjct: 266 Y 266


>gi|157167281|ref|XP_001658485.1| cathepsin b [Aedes aegypti]
 gi|108876476|gb|EAT40701.1| AAEL007585-PA [Aedes aegypti]
          Length = 386

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 62/242 (25%), Positives = 96/242 (39%), Gaps = 68/242 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +  ++GL +GG  +S  GC P          Y   E          PKC
Sbjct: 194 CRGGTLGPAWQFWVEKGLSSGGPLNSRQGCHP----------YPIGECRIPGEDEDTPKC 243

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C +       +QD++  +  Y + ++   I +EI  NGPV A  + Y D+ +YKSG 
Sbjct: 244 SNKCRSGYNVTDVWQDRHYGRVAYSLPNDERKIMEEIFINGPVQAAFHTYLDLHAYKSG- 302

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                      +Y  ++   SG +A            VK++GWG ENG            
Sbjct: 303 -----------IYRHVWGPLSGGHA------------VKLLGWGVENG------------ 327

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                      VK           YW + +++G ++G+ G  KI+RG N   IE  ++  
Sbjct: 328 -----------VK-----------YWLVANSWGREWGENGFFKIVRGENHCGIEENIHAG 365

Query: 242 LP 243
           LP
Sbjct: 366 LP 367


>gi|48762481|dbj|BAD23810.1| cathepsin B-S [Tuberaphis taiwana]
          Length = 182

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 49/166 (29%), Positives = 77/166 (46%), Gaps = 30/166 (18%)

Query: 11  WVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNY 70
           W +   +G+ TGG + +  GC P   PPC +      +  C     P  + H +C    Y
Sbjct: 47  WKYFRTQGVTTGGDYDTKEGCMPYKVPPCYNKQ---GKNTCG--GQPMERNH-QCPKTCY 100

Query: 71  GRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVAN 130
           G+   Q++Y+ K  Y +N  +  I+Q                D+ +Y       GPV A+
Sbjct: 101 GKTTVQNRYKTKSEYVMN-SIKTIEQ----------------DLKTY-------GPVEAS 136

Query: 131 MYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
             +Y D   YKSG+Y  +  A+     ++KI+GWG++NG PYW  V
Sbjct: 137 FDVYDDFSVYKSGIYRKTPKAKYQGGHSIKIIGWGQQNGTPYWLAV 182


>gi|401415968|ref|XP_003872479.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
           [Leishmania mexicana MHOM/GT/2001/U1103]
 gi|322488703|emb|CBZ23950.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
           [Leishmania mexicana MHOM/GT/2001/U1103]
          Length = 340

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 63/235 (26%), Positives = 90/235 (38%), Gaps = 70/235 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  GI +  W+W    G+ T         CQP  F PC+H   ++  P C       PKC
Sbjct: 166 CYGGIPAMAWLWWVWVGVTT-------ELCQPYPFGPCSHHGNSSKYPPCPNTIYNTPKC 218

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +T C  DN        KY+    Y +  E  ++  E+M NGP+   M +Y+D  +YKSG 
Sbjct: 219 NTTC--DNVEMELV--KYKGVSSYSIKGE-RELMVELMNNGPLEVAMQVYADFVAYKSGV 273

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         S + +    VK+VGWG ++G P          
Sbjct: 274 YKH------------------------VSGDHLGGHAVKLVGWGVKDGIP---------- 299

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
                                   YW I +++   +GDKG   I RG +E  IES
Sbjct: 300 ------------------------YWKIANSWNTDWGDKGYFLIQRGNDECGIES 330


>gi|159177|gb|AAA29177.1| cysteine proteinase [Haemonchus contortus]
          Length = 342

 Score = 75.5 bits (184), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 64/241 (26%), Positives = 97/241 (40%), Gaps = 58/241 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G S   W +    G+V+GG + +   C+P    PC H    T   EC   A   P C
Sbjct: 154 CGGGWSIRAWEYFVYEGVVSGGEYLTKGVCRPYPIHPCGHHGNDTYYGECPREAA-TPPC 212

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C    Y + F  DK + K  Y V  +   IQ+EI+++GPVVA+  +Y D FS     
Sbjct: 213 KKKC-QPGYKKIFRMDKRQGKVAYGVEPKEEAIQREILRHGPVVASFAVYED-FSL---- 266

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                             YK+GVY  +A A +  Y  VK++GWG ++             
Sbjct: 267 ------------------YKTGVYKHTAGA-LRGYHAVKMMGWGVDS------------- 294

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                              +    YW I +++   +G+ G  + +RG N+  IE  V   
Sbjct: 295 -------------------KTKAKYWLIANSWHNDWGENGYFRFIRGINDCEIEDTVAAG 335

Query: 242 L 242
           +
Sbjct: 336 I 336


>gi|157058749|gb|ABV03132.1| cathepsin B-3098 [Acyrthosiphon pisum]
          Length = 256

 Score = 75.5 bits (184), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 54/179 (30%), Positives = 78/179 (43%), Gaps = 35/179 (19%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPC----NHANYTTSEPECKTLATP 57
           C+ G     W      GLVTGG + S  GC+P   PPC    +  N  + +P       P
Sbjct: 97  CNGGYPIRAWKRFKNHGLVTGGNYKSGEGCEPYRVPPCPYDKDGKNTCSGQP-----MEP 151

Query: 58  QPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
             KC  +C  D      F   +R+ R     D+     + I K            D+ +Y
Sbjct: 152 NHKCSKKCYGDE--DIDFNKDHRYTR-----DDYYLTYRGIQK------------DVINY 192

Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
                  GP+ A+  +Y D  +YKSG+Y  S +A  +   +VK++GWGEE G  YW +V
Sbjct: 193 -------GPIEASFDVYDDFPNYKSGIYVKSENASYLGGHSVKLIGWGEEYGVLYWLMV 244


>gi|323448735|gb|EGB04630.1| hypothetical protein AURANDRAFT_32318 [Aureococcus anophagefferens]
          Length = 253

 Score = 75.5 bits (184), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 59/240 (24%), Positives = 102/240 (42%), Gaps = 54/240 (22%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ GI SS + +    G+V GG +   +GC      PC H   ++  P C       PKC
Sbjct: 57  CNGGIPSSVYSYWALSGIVDGGNYGDKSGCWSYQLEPCAHHVNSSKYPACPD-EVRAPKC 115

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C +++  + + + K + ++ Y V       QQ  ++       + + +DI       
Sbjct: 116 ARKCESED--KDWTKAKVKGEKGYSV------CQQGELEG---TCAIKMAADI------- 157

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y NGP+    ++  D  +YKSGVY     +  +    +KI+G+G E+G+           
Sbjct: 158 YQNGPITGMFFVKQDFLAYKSGVYEPKLLSPPLGGHAIKIMGFGTEDGK----------- 206

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES-LVNG 240
                                   YW + +++ E +GD G  KI+RG+N   IE  ++NG
Sbjct: 207 -----------------------DYWLVANSWNEDWGDDGYFKIIRGKNACQIEDPVING 243


>gi|13469701|gb|AAK27318.1| cysteine proteinase [Clonorchis sinensis]
          Length = 179

 Score = 75.1 bits (183), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 52/168 (30%), Positives = 67/168 (39%), Gaps = 27/168 (16%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +    G+VTGG+     GC+P  FP C H +     P C     P PKC
Sbjct: 38  CDGGFPPMAWDFWKTHGIVTGGSKEEPAGCRPYPFPKCQHHS-QGHYPPCPRRIYPTPKC 96

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C  D     + +DK R    Y V+     I +EI+ NGPV A   ++ D   YKSG 
Sbjct: 97  VKHC--DTPKIDYQKDKTRANTSYNVHQSEVAIMKEILLNGPVEATFEVHEDFPEYKSGI 154

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENG 169
           Y                          A    V    ++I+GWGEENG
Sbjct: 155 Y------------------------FHAWGGSVGGHAIRILGWGEENG 178


>gi|166030316|gb|ABY78825.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score = 74.7 bits (182), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 59/242 (24%), Positives = 87/242 (35%), Gaps = 69/242 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   + W +    GL       +++ CQP  FP C H      +P C       PKC
Sbjct: 158 CDGGYPDAAWRYYVSHGL-------ASSYCQPYPFPHCGHHGGKGKKPPCSKYDFHTPKC 210

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +T CT+    +     +YR    Y +     D ++E+                       
Sbjct: 211 NTTCTD----KAIPLIEYRGNDSYVLLHGEDDFKREL----------------------- 243

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y NGP V    ++SD  +YK+GVY                                    
Sbjct: 244 YFNGPFVVAFQVFSDFLAYKTGVYR----------------------------------- 268

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
             S + +    V+++GWG+ NG PYW I +++   +G  G    LRG NE  IE      
Sbjct: 269 HVSGDFLGGHAVRIVGWGKLNGTPYWKIANSWDTDWGMNGHFLFLRGNNECGIEFEGYAG 328

Query: 242 LP 243
           LP
Sbjct: 329 LP 330


>gi|157131748|ref|XP_001662318.1| cathepsin b [Aedes aegypti]
 gi|108871395|gb|EAT35620.1| AAEL012216-PA [Aedes aegypti]
          Length = 386

 Score = 74.7 bits (182), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 61/242 (25%), Positives = 96/242 (39%), Gaps = 68/242 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +  ++GL +GG  +S  GC P          Y   E          PKC
Sbjct: 194 CRGGTLGPAWQFWVEKGLSSGGPLNSRQGCHP----------YPIGECRIPGEDEDTPKC 243

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C +       +QD++  +  Y + ++   I +EI  NGPV A  + Y D+ +YKSG 
Sbjct: 244 SNKCRSGYNVTDVWQDRHYGRVAYSLPNDERKIMEEIFINGPVQAAFHTYLDLHAYKSG- 302

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                      +Y  ++   SG +A            VK++GWG ENG            
Sbjct: 303 -----------IYRHVWGPLSGGHA------------VKLLGWGVENG------------ 327

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                      VK           YW + +++G ++G+ G  K++RG N   IE  ++  
Sbjct: 328 -----------VK-----------YWLVANSWGREWGENGFFKMVRGENHCGIEENIHAG 365

Query: 242 LP 243
           LP
Sbjct: 366 LP 367


>gi|157111449|ref|XP_001651570.1| cathepsin b [Aedes aegypti]
 gi|108868331|gb|EAT32556.1| AAEL015312-PA [Aedes aegypti]
          Length = 386

 Score = 74.7 bits (182), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 61/242 (25%), Positives = 96/242 (39%), Gaps = 68/242 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +  ++GL +GG  +S  GC P          Y   E          PKC
Sbjct: 194 CRGGTLGPAWQFWVEKGLSSGGPLNSRQGCHP----------YPIGECRIPGEDEDTPKC 243

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C +       +QD++  +  Y + ++   I +EI  NGPV A  + Y D+ +YKSG 
Sbjct: 244 SNKCRSGYNVTDVWQDRHYGRVAYSLPNDERKIMEEIFINGPVQAAFHTYLDLHAYKSG- 302

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                      +Y  ++   SG +A            VK++GWG ENG            
Sbjct: 303 -----------IYRHVWGPLSGGHA------------VKLLGWGVENG------------ 327

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                      VK           YW + +++G ++G+ G  K++RG N   IE  ++  
Sbjct: 328 -----------VK-----------YWLVANSWGREWGENGFFKMVRGENHCGIEENIHAG 365

Query: 242 LP 243
           LP
Sbjct: 366 LP 367


>gi|268566077|ref|XP_002647467.1| Hypothetical protein CBG06539 [Caenorhabditis briggsae]
          Length = 332

 Score = 74.7 bits (182), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 67/237 (28%), Positives = 95/237 (40%), Gaps = 71/237 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G S     W    G+VTGG +  + GC+P  F  CN A        C    TP+  C
Sbjct: 157 CDGGYSIQALRWWVFDGVVTGGDYQGD-GCKPYQF--CNSAG-------CPDAVTPE--C 204

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C +  Y   + +DK      Y+V   V  IQ +IM NGPV A+  +Y D + YKSG 
Sbjct: 205 ALSCQS-KYNTEYAKDKNFGTSAYYVGMTVNAIQTDIMTNGPVEASFKVYEDFYKYKSG- 262

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          ++ Y +G        +++    +KI+GWG ENG  Y         
Sbjct: 263 ---------------VYKYIAG--------KMLGGHAIKIIGWGTENGTAY--------- 290

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                    W I +++G ++G+ G  KI RG NE  IE+ V
Sbjct: 291 -------------------------WLIANSWGTKWGENGFFKIRRGVNECGIENNV 322


>gi|328726600|ref|XP_003248962.1| PREDICTED: cathepsin B-like cysteine proteinase-like [Acyrthosiphon
           pisum]
          Length = 169

 Score = 74.7 bits (182), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 62/220 (28%), Positives = 87/220 (39%), Gaps = 67/220 (30%)

Query: 30  GCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGF--FQDKYRFKR-YYW 86
           GC+P   PPC      TS         P  K H RCT   YG     + D +RF R YY+
Sbjct: 14  GCEPYRVPPCPRNEDGTSS----CAGQPIEKNH-RCTRMCYGNQDLDYNDDHRFTRDYYY 68

Query: 87  VNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYA 146
           +      IQ+++M  GP+ A+  +Y D                        +SYKSGVY 
Sbjct: 69  LT--YGSIQKDVMNYGPIEASFDVYDDF-----------------------YSYKSGVYQ 103

Query: 147 VSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPY 206
            + +A  +    VK++GWG E G PY                                  
Sbjct: 104 RTPNATKLGGHAVKLIGWGVEEGIPY---------------------------------- 129

Query: 207 WTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDN 246
           W +V+++  Q+GD G  KI RG +E  I+S     +P  N
Sbjct: 130 WLMVNSWSAQWGDNGLFKIRRGTDECGIDSATTAGVPVTN 169


>gi|19526442|gb|AAL89717.1|AF483623_1 cathepsin B [Apriona germari]
          Length = 324

 Score = 74.7 bits (182), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 64/242 (26%), Positives = 94/242 (38%), Gaps = 75/242 (30%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G  +  + +    G+ +GG + S  GC+P          YT +      ++   P+C
Sbjct: 153 CRGGFLNEPYKYWVTNGIPSGGDYGSKLGCKP----------YTAA------VSGETPQC 196

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C +  Y + + +D       Y VN  V  IQ+EI+ NGPV A M +Y D +SY +G 
Sbjct: 197 QKACVS-GYEKSWEKDLRHATSAYQVNGGVLQIQREILDNGPVTAYMEVYEDFYSYGTG- 254

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          I+ + SG +        V    VKI+GWG EN  P          
Sbjct: 255 ---------------IYQHTSGSF--------VGGHAVKIIGWGSENDVP---------- 281

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW   +++G  FG+ G  +ILRG N A IES +   
Sbjct: 282 ------------------------YWIAANSWGTGFGEDGFFRILRGSNCAGIESYIVAG 317

Query: 242 LP 243
            P
Sbjct: 318 YP 319


>gi|5031250|gb|AAD38132.1|AF127592_1 vitellogenic cathepsin-B like protease [Aedes aegypti]
          Length = 386

 Score = 74.7 bits (182), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 61/242 (25%), Positives = 96/242 (39%), Gaps = 68/242 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +  ++GL +GG  +S  GC P          Y   E          PKC
Sbjct: 194 CRGGTLGPAWQFWVEKGLSSGGPLNSRQGCHP----------YPIGECRIPGEDEDTPKC 243

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C +       +QD++  +  Y + ++   I +EI  NGPV A  + Y D+ +YKSG 
Sbjct: 244 SNKCRSGYNVTDVWQDRHIGRVAYSLPNDERKIMEEIFINGPVQAAFHTYLDLHAYKSG- 302

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                      +Y  ++   SG +A            VK++GWG ENG            
Sbjct: 303 -----------IYRHVWGPLSGGHA------------VKLLGWGVENG------------ 327

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                      VK           YW + +++G ++G+ G  K++RG N   IE  ++  
Sbjct: 328 -----------VK-----------YWLVANSWGREWGENGFFKMVRGENHCGIEENIHAG 365

Query: 242 LP 243
           LP
Sbjct: 366 LP 367


>gi|294898471|ref|XP_002776250.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239883121|gb|EER08066.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 219

 Score = 74.3 bits (181), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 42/117 (35%), Positives = 58/117 (49%), Gaps = 7/117 (5%)

Query: 13  WVHKRGLVTGGAHH------SNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPKCHTRC 65
           ++   G+VTG          S  GC P   P CNHA+   S+ P+C + A  QP C T C
Sbjct: 96  FMKNHGIVTGNEFKPADQLASADGCWPYPLPKCNHASSAASQYPKCPSEALSQPACQTEC 155

Query: 66  TNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKY 122
            N++Y     QD +R K +  +      I+QEI  NG V+  + +Y D   YKSG Y
Sbjct: 156 INESYKTSLQQDLHRAKSWGRLPTSPQKIKQEIFDNGTVLGVISMYEDFRLYKSGVY 212


>gi|294891889|ref|XP_002773789.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239878993|gb|EER05605.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 422

 Score = 74.3 bits (181), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 57/204 (27%), Positives = 81/204 (39%), Gaps = 58/204 (28%)

Query: 27  SNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYY 85
           ++ GC P  FP CNH     S+ P C  +    P C T C N  YG    +D +R K + 
Sbjct: 262 NDDGCWPYPFPKCNHVPGLESKYPRCAQVRD-LPACATTCPNKAYGTSMQKDTHRAKSWG 320

Query: 86  WVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVY 145
            +      I+QEI  NGP+                        A M LY D F  +  VY
Sbjct: 321 RLPIGPEKIKQEIFDNGPLRX--------------------XAAMMTLYED-FDLQVCVY 359

Query: 146 AVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRP 205
                                              V  + +++A  T+KLIGWG E+G+ 
Sbjct: 360 -----------------------------------VHKTGQMLAAHTLKLIGWGVESGQE 384

Query: 206 YWTIVSTFGEQFGDKGTIKILRGR 229
           YW  V+ + E++GD G IK+  G+
Sbjct: 385 YWLAVNAWNEEWGDHGMIKLAVGK 408


>gi|339831342|gb|AEK20867.1| cathepsin B [Eimeria tenella]
          Length = 512

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 67/248 (27%), Positives = 90/248 (36%), Gaps = 66/248 (26%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAH---HSNTGCQPVSFPPCNHANYTTSEPECKTLATPQ 58
           CS G     W W    G+VTGG +   H+   C P   P C H +     P+C+      
Sbjct: 310 CSGGQPRMAWRWFSNDGVVTGGDYNELHTGKSCWPYEIPFCRH-HSEGPYPKCEGPLPKA 368

Query: 59  PKCHTRCTNDNYGRGF--FQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFS 116
           PKC   C    Y      F+D   F    +  +    I++E+M+NG +     +Y D   
Sbjct: 369 PKCRKDCEEAEYTSKVKPFKDDLHFATSAYSVEGRDQIKRELMENGTLTGAFLVYEDFLL 428

Query: 117 YKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
           YK G            +Y  +     G +A            VK++G+G E+GR YW  V
Sbjct: 429 YKEG------------VYHHVTGMPMGGHA------------VKVIGFGNEDGRDYWLAV 464

Query: 177 RVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
                                W E     YW          GDKGT KI  G  EA I+ 
Sbjct: 465 N-------------------SWNE-----YW----------GDKGTFKIEMG--EAGIDK 488

Query: 237 LVNGALPK 244
              G  PK
Sbjct: 489 EFCGGEPK 496


>gi|359427491|gb|AEV46267.1| eimeripain [Eimeria tenella]
          Length = 512

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 67/248 (27%), Positives = 90/248 (36%), Gaps = 66/248 (26%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAH---HSNTGCQPVSFPPCNHANYTTSEPECKTLATPQ 58
           CS G     W W    G+VTGG +   H+   C P   P C H +     P+C+      
Sbjct: 310 CSGGQPRMAWRWFSNDGVVTGGDYNELHTGKSCWPYEIPFCRH-HSEGPYPKCEGPLPKA 368

Query: 59  PKCHTRCTNDNYGRGF--FQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFS 116
           PKC   C    Y      F+D   F    +  +    I++E+M+NG +     +Y D   
Sbjct: 369 PKCRKDCEEAEYTSKVKPFKDDLHFATSAYSVEGRDQIKRELMENGTLTGAFLVYEDFLL 428

Query: 117 YKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
           YK G            +Y  +     G +A            VK++G+G E+GR YW  V
Sbjct: 429 YKEG------------VYHHVTGMPMGGHA------------VKVIGFGNEDGRDYWLAV 464

Query: 177 RVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
                                W E     YW          GDKGT KI  G  EA I+ 
Sbjct: 465 N-------------------SWNE-----YW----------GDKGTFKIEMG--EAGIDK 488

Query: 237 LVNGALPK 244
              G  PK
Sbjct: 489 EFCGGEPK 496


>gi|347972080|ref|XP_313831.5| AGAP004531-PA [Anopheles gambiae str. PEST]
 gi|333469162|gb|EAA09191.5| AGAP004531-PA [Anopheles gambiae str. PEST]
          Length = 375

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 60/242 (24%), Positives = 90/242 (37%), Gaps = 69/242 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G+ S+ W +  + G+ +GGA  S+ GCQ   F  C  +  +   P C     P    
Sbjct: 200 CDGGVPSAVWHYWVENGITSGGAFGSHEGCQSYPFDVCKKSGDSNDTPRCLRFCQP---- 255

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
                   Y   + +DK+  +  Y V  +   I  E+   GP  A   +Y+D   YKSG 
Sbjct: 256 -------GYNVTYPEDKHYGRVAYTVPKDEERIMYEVFNFGPAQATFTMYTDFVQYKSG- 307

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                      +Y   F  + G +            +VK++GWG EN             
Sbjct: 308 -----------VYRHTFGVRVGTH------------SVKVMGWGVEN------------- 331

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                      VK           YW   +++G Q+GD G  KI+RG +    E+ V   
Sbjct: 332 ----------DVK-----------YWLCANSWGAQWGDGGFFKIVRGEDHLSFETNVVAG 370

Query: 242 LP 243
           LP
Sbjct: 371 LP 372


>gi|4099305|gb|AAD00577.1| cysteine proteinase [Clonorchis sinensis]
          Length = 180

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 52/170 (30%), Positives = 72/170 (42%), Gaps = 27/170 (15%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G  +  W +    G+VTGG+    +GC+   FP C H +     P C     P P+C
Sbjct: 38  CRGGYPAVAWDYWKTHGIVTGGSKEDPSGCRSYPFPKCEH-HVQGHYPPCPRELYPTPEC 96

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C  D    G+ +DK R    Y +      I +EIM  GPV A       IF+     
Sbjct: 97  VQQC--DTPDVGYLEDKTRANMSYNIYASEISIMKEIMLRGPVEA-------IFT----- 142

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRP 171
                      +Y D   Y SGVY  +  A +  +A V+I+GWGE    P
Sbjct: 143 -----------MYEDFLRYSSGVYFHALGAPMSGHA-VRILGWGELGNVP 180


>gi|170045773|ref|XP_001850470.1| tubulointerstitial nephritis antigen [Culex quinquefasciatus]
 gi|167868692|gb|EDS32075.1| tubulointerstitial nephritis antigen [Culex quinquefasciatus]
          Length = 463

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 73/261 (27%), Positives = 104/261 (39%), Gaps = 80/261 (30%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECK-----TLAT 56
           CS G   + W +V K G V       N  C P          Y +++  CK     TL T
Sbjct: 252 CSGGHLDTAWNYVRKVGTV-------NDECYP----------YISAQNACKIRPSDTLIT 294

Query: 57  PQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFS 116
                 T+    N         Y+    + +N+E  DI  EI K+GPV A + ++ D FS
Sbjct: 295 ANCDLPTKVDRTNM--------YKMGPAFSLNNET-DIMIEIKKHGPVQAILRVHRDFFS 345

Query: 117 YKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
           YKSG Y +                     A SA  E   Y +V+++GWGEE         
Sbjct: 346 YKSGIYRHSA-------------------ASSAGDERAGYHSVRLIGWGEERN------- 379

Query: 177 RVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
                        Y T K           YW  V+++G  +G+ G  +I+RG+NE  IES
Sbjct: 380 ------------GYETTK-----------YWVAVNSWGRWWGENGRFRIVRGQNECEIES 416

Query: 237 LVNGALPKDNYGVEFGEESGE 257
            V  +LP  +  V+   + GE
Sbjct: 417 YVLASLPYVHQQVKPMRQVGE 437


>gi|741376|prf||2007265A cathepsin B
          Length = 153

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 59/208 (28%), Positives = 82/208 (39%), Gaps = 61/208 (29%)

Query: 39  CNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEI 98
           C H +   S P C T     PKC   C    Y   + QDK+     Y V++   DI  EI
Sbjct: 1   CEH-HVNGSRPPC-TGEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEI 57

Query: 99  MKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYAT 158
            KNGPV          FS                +YSD   YKSGVY             
Sbjct: 58  YKNGPVEG-------AFS----------------VYSDFLLYKSGVYQ------------ 82

Query: 159 VKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFG 218
                                    + E++    ++++GWG ENG PYW + +++   +G
Sbjct: 83  -----------------------HVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWG 119

Query: 219 DKGTIKILRGRNEAIIESLVNGALPKDN 246
           D G  KILRG++   IES V   +P+ +
Sbjct: 120 DNGFFKILRGQDHCGIESEVVAGIPRTD 147


>gi|194384502|dbj|BAG59411.1| unnamed protein product [Homo sapiens]
          Length = 273

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 60/208 (28%), Positives = 82/208 (39%), Gaps = 62/208 (29%)

Query: 39  CNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEI 98
           C H N   S P C T     PKC   C    Y   + QDK+     Y V++   DI  EI
Sbjct: 122 CIHVN--GSRPPC-TGEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEI 177

Query: 99  MKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYAT 158
            KNGPV          FS                +YSD   YKSGVY             
Sbjct: 178 YKNGPV-------EGAFS----------------VYSDFLLYKSGVYQ------------ 202

Query: 159 VKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFG 218
                                    + E++    ++++GWG ENG PYW + +++   +G
Sbjct: 203 -----------------------HVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWG 239

Query: 219 DKGTIKILRGRNEAIIESLVNGALPKDN 246
           D G  KILRG++   IES V   +P+ +
Sbjct: 240 DNGFFKILRGQDHCGIESEVVAGIPRTD 267


>gi|339241013|ref|XP_003376432.1| Gut-specific cysteine proteinase [Trichinella spiralis]
 gi|316974853|gb|EFV58323.1| Gut-specific cysteine proteinase [Trichinella spiralis]
          Length = 551

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 61/246 (24%), Positives = 101/246 (41%), Gaps = 76/246 (30%)

Query: 2   CSSGISSSTW-VWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 60
           C+ G    T+  WV+  G+ TGG + SN  C+P   PPC++ + T +           PK
Sbjct: 359 CNGGYPQRTFKYWVYS-GMPTGGPYGSNDTCKPYPIPPCSNCSETRT-----------PK 406

Query: 61  CHTRCTNDNYGRGFFQDKYRFKRYY--WVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
           C   C +  Y     +D++    YY  W+ ++   + ++I   GP+VA M +Y D   YK
Sbjct: 407 CSKSCIS-TYPLSLNEDRHYGSTYYQFWLGEK--SMMKDISLYGPIVAGMSVYEDFLHYK 463

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
            G                +++ +SG++        +    V+I+GWGE++  P       
Sbjct: 464 EG----------------VYTQESGIF--------LGGHAVRIIGWGEQDNIP------- 492

Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                      YW + +++   FG+ G  KI RG +E  IES V
Sbjct: 493 ---------------------------YWLVANSWNTTFGEDGLFKIRRGFDECGIESYV 525

Query: 239 NGALPK 244
           +    K
Sbjct: 526 SAGRAK 531


>gi|170060938|ref|XP_001866023.1| cathepsin B [Culex quinquefasciatus]
 gi|167879260|gb|EDS42643.1| cathepsin B [Culex quinquefasciatus]
          Length = 353

 Score = 72.0 bits (175), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 67/246 (27%), Positives = 97/246 (39%), Gaps = 76/246 (30%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G+    W +  ++G+ +GG ++S  GC    F  C+  +     P          KC
Sbjct: 167 CQGGVLGPAWDYWVQKGVSSGGPYNSKQGCHSYPFDTCHSPDEDDDAP----------KC 216

Query: 62  HTRCTNDNYGRGFFQDKYRFKR--YYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
             +C +    +   +D+ RF R  Y  V DE   I +EI  NGPV A   +Y D  +YKS
Sbjct: 217 SRKCQSSYSVQDVSKDR-RFGRVAYSVVADE-HRIMEEIFVNGPVQAAFQVYLDFKTYKS 274

Query: 120 GKYGN--GPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
           G Y +  GP+               G +A            +KI+GWG EN         
Sbjct: 275 GVYRHVTGPL--------------EGGHA------------IKILGWGVEN--------- 299

Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
                                    G  YW   +++GE +GD G  KI+RG N   IE+ 
Sbjct: 300 -------------------------GTKYWLCSNSWGEDWGDHGFFKIVRGENHLGIETD 334

Query: 238 VNGALP 243
           V+  LP
Sbjct: 335 VHAGLP 340


>gi|154340956|ref|XP_001566431.1| cysteine peptidase C (CPC) [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134063754|emb|CAM39941.1| cysteine peptidase C (CPC) [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 340

 Score = 72.0 bits (175), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 61/242 (25%), Positives = 91/242 (37%), Gaps = 70/242 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  GI +  W+W    GL       ++  CQP  FPPC H       P C +     P C
Sbjct: 166 CQGGIPTMAWLWWVWVGL-------TSEVCQPYPFPPCGHHTDGGKYPACPSTIYDTPTC 218

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           ++ C + +        K++ ++ Y +  E  +   E+M  GP      +Y+D  SYKSG 
Sbjct: 219 NSTCADSHTA----LTKHKGEKSYSLRGE-REYMIELMTYGPFEVAFDVYADFVSYKSG- 272

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                          ++S+ +G        E +    VK+VGWG +NG P          
Sbjct: 273 ---------------VYSHTTG--------ERLGGHAVKLVGWGVQNGTP---------- 299

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW I +++   +GD G   I RG +E  IES     
Sbjct: 300 ------------------------YWKIANSWNSDWGDNGYFLIRRGTDECGIESTGVAG 335

Query: 242 LP 243
           LP
Sbjct: 336 LP 337


>gi|428174191|gb|EKX43088.1| hypothetical protein GUITHDRAFT_73372 [Guillardia theta CCMP2712]
          Length = 255

 Score = 71.6 bits (174), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 67/241 (27%), Positives = 91/241 (37%), Gaps = 76/241 (31%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  +  W +    GL T   +       P  FPPC H    T    C   + P PKC
Sbjct: 85  CNGGFPTGAWRFFKMHGLTTESKY-------PYVFPPCEHHINKTHYKPCGP-SQPTPKC 136

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             R +         + +Y  K  Y V+   A IQ EIM NGPV A   +Y          
Sbjct: 137 -VRASEK-------KPRYHGKSVYSVSP--AKIQAEIMTNGPVEAAFTVY---------- 176

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                         D  +Y+SGVY   +  E+  +A +KI+GWG E G            
Sbjct: 177 -------------QDFLAYQSGVYRHVSGPELGGHA-IKIMGWGVEAG------------ 210

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW + +++ E +GDKGT KI RG +E  IES V   
Sbjct: 211 ----------------------NKYWLVANSWNEDWGDKGTFKIARGDDECGIESSVVAG 248

Query: 242 L 242
           +
Sbjct: 249 M 249


>gi|126330441|ref|XP_001381244.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Monodelphis
           domestica]
          Length = 466

 Score = 71.6 bits (174), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 65/243 (26%), Positives = 97/243 (39%), Gaps = 62/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W ++ +RGLV+   +  + G +  + P      ++ S    K  AT     
Sbjct: 268 CRGGRLDGAWWFLRRRGLVSNHCYPFSAGNRDATAPAAPCMMHSRSMGRGKRQAT----- 322

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C N    R      Y+    Y ++ +  DI +E+M+NGPV A M ++ D F YKSG 
Sbjct: 323 -AHCPNS---RAHANHIYQATPPYRLSSDEKDIMKELMENGPVQALMEVHEDFFLYKSGI 378

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYAT--VKIVGWGEENGRPYWTIVRVY 179
           Y + P                   ++   A    + T  VKI GWGEE            
Sbjct: 379 YKHTPA------------------SLGKPARYRQHGTHSVKITGWGEER----------- 409

Query: 180 AVSASAEIVAYATVKLIGWGEENGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
                               + +G+   YWT  +++G  +G+KG  +ILRG NE  IES 
Sbjct: 410 --------------------QPDGQRLKYWTAANSWGPTWGEKGHFRILRGANECDIESF 449

Query: 238 VNG 240
           V G
Sbjct: 450 VVG 452


>gi|270011021|gb|EFA07469.1| cathepsin B precursor [Tribolium castaneum]
          Length = 327

 Score = 71.6 bits (174), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 66/240 (27%), Positives = 93/240 (38%), Gaps = 72/240 (30%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G     W ++ K GLV       +  C P S       N     P    L T   + 
Sbjct: 146 CNGGYLDRAWSYIRKIGLV-------DEQCFPYS-----ATNEKCRIPRRGDLVTANCQL 193

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
            T     N  R   + KY+    Y V +E  DI  EI+ +GPV A M +Y D F+YK G 
Sbjct: 194 PT-----NVDR---RSKYKVAPAYRVGNET-DIMYEILHSGPVQATMKVYHDFFTYKRGI 244

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y + P+  N                     +   Y +V+IVGWGEE              
Sbjct: 245 YRHSPISTN---------------------DRTGYHSVRIVGWGEE-------------- 269

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                           +  E  + YW + +++G ++G+ G  +ILRG NE  IES V G 
Sbjct: 270 ----------------YSPEGLKKYWKVANSWGPEWGENGYFRILRGSNECEIESFVLGT 313


>gi|166030332|gb|ABY78833.1| cathepsin B-like protease [Trypanosoma congolense]
          Length = 336

 Score = 71.6 bits (174), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 61/236 (25%), Positives = 87/236 (36%), Gaps = 70/236 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   + W +    G+       +++ CQP  FP C H      +P C       P+C
Sbjct: 159 CEGGYPDAAWEYYVSHGI-------TSSQCQPYPFPRCEHRGAQGKKPPCSKYKFVTPQC 211

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +  CT+    +     KYR    Y V  E  D ++E+  NGP V    ++SD  +YKSG 
Sbjct: 212 NATCTD----KSVPLIKYRGNHSYEVRGE-EDYKRELYFNGPFVVRFQVHSDFLAYKSGV 266

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +   VA  +L                         V+IVGWG+ NG P          
Sbjct: 267 YQH---VAGNFLGGK---------------------AVRIVGWGKLNGTP---------- 292

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
                                   YW + +++   +G  G   ILRG NE  IE L
Sbjct: 293 ------------------------YWKVANSWDTDWGMNGYFLILRGDNECNIEHL 324


>gi|189238903|ref|XP_967834.2| PREDICTED: similar to tubulointerstitial nephritis antigen
           [Tribolium castaneum]
          Length = 453

 Score = 71.2 bits (173), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 66/240 (27%), Positives = 93/240 (38%), Gaps = 72/240 (30%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G     W ++ K GLV       +  C P S       N     P    L T   + 
Sbjct: 272 CNGGYLDRAWSYIRKIGLV-------DEQCFPYS-----ATNEKCRIPRRGDLVTANCQL 319

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
            T     N  R   + KY+    Y V +E  DI  EI+ +GPV A M +Y D F+YK G 
Sbjct: 320 PT-----NVDR---RSKYKVAPAYRVGNET-DIMYEILHSGPVQATMKVYHDFFTYKRGI 370

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y + P+  N                     +   Y +V+IVGWGEE              
Sbjct: 371 YRHSPISTN---------------------DRTGYHSVRIVGWGEE-------------- 395

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                           +  E  + YW + +++G ++G+ G  +ILRG NE  IES V G 
Sbjct: 396 ----------------YSPEGLKKYWKVANSWGPEWGENGYFRILRGSNECEIESFVLGT 439


>gi|12330246|gb|AAG52660.1| cysteine proteinase [Metagonimus yokogawai]
          Length = 179

 Score = 71.2 bits (173), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 49/168 (29%), Positives = 69/168 (41%), Gaps = 27/168 (16%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +  + GLVTGG+  + +GC+   FP C+H       P C       P C
Sbjct: 38  CHGGFPPRAWDFWMENGLVTGGSKENPSGCRSYPFPRCSHHG-KGKYPPCPKTIFDTPNC 96

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C  D     +  DK   K  Y V      I +EIM+NGPV A   +Y D   YKSG 
Sbjct: 97  VDHC--DKPDIDYAADKTHAKSSYNVQSNERVIMKEIMRNGPVEAAFMVYEDFIEYKSG- 153

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENG 169
                    +Y +S                +++    ++++GWGEE G
Sbjct: 154 ---------IYFHS--------------HGKLLGGHAIRMLGWGEEKG 178


>gi|268560898|ref|XP_002638183.1| Hypothetical protein CBG22612 [Caenorhabditis briggsae]
          Length = 721

 Score = 70.9 bits (172), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 66/246 (26%), Positives = 98/246 (39%), Gaps = 73/246 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G       +   +G+VTGG    + GC P S+  C+  +   + P+CK       +C
Sbjct: 148 CQGGFVLEAMKFWKSKGVVTGGDFQGD-GCIPYSYGSCSDCHTAQTTPKCKN------EC 200

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVN--DEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
             + T + Y     +DKY     Y ++  + V  IQ EI++NGPV A   +Y D + YKS
Sbjct: 201 QVKYTKNEYK----EDKYYGSSAYRLSTSNAVRTIQSEILRNGPVEATYQVYEDFYYYKS 256

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVRV 178
           G                ++ Y SG +        +    VKI+GWG EEN          
Sbjct: 257 G----------------VYEYISGRH--------MGGHAVKIIGWGVEENVN-------- 284

Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                      YW I +++G  FG+ G  K+ RG NE  IE+ V
Sbjct: 285 ---------------------------YWLIANSWGTGFGENGFFKMRRGNNECGIENYV 317

Query: 239 NGALPK 244
              + K
Sbjct: 318 VAGMAK 323


>gi|157058751|gb|ABV03133.1| cathepsin B-3098 [Aulacorthum solani]
          Length = 215

 Score = 70.9 bits (172), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 53/171 (30%), Positives = 76/171 (44%), Gaps = 33/171 (19%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W    K GLVTGG + S  GC+P   PPC +  Y  +     T +    + 
Sbjct: 75  CYGGYPIRAWKSFKKHGLVTGGNYKSGEGCEPYRVPPCPYDEYGNN-----TCSGQPMES 129

Query: 62  HTRCTNDNYGR---GFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
           + RCT   YG     F QD    + +Y++      IQ+                D+ +Y 
Sbjct: 130 NHRCTRMCYGNQDLDFDQDHRYTRDHYYLT--YRGIQK----------------DVINY- 170

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENG 169
                 GP+ A+  +Y D  SYKSG+Y  S +A  +   +VK++GWGEE G
Sbjct: 171 ------GPIEASFDVYDDFPSYKSGIYVKSENASYLGGHSVKLIGWGEEYG 215


>gi|119638996|gb|ABL85239.1| cysteine proteinase 5 [Necator americanus]
          Length = 342

 Score = 70.9 bits (172), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 62/238 (26%), Positives = 88/238 (36%), Gaps = 61/238 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPK 60
           C  G   + + +    G+ TGG       C+P +F PC  H N     P C     P PK
Sbjct: 158 CDGGRPFAAFFFAIDNGVCTGGPFREPNVCKPYAFYPCGRHQNQKYFGP-CPKELWPTPK 216

Query: 61  CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           C   C    Y   +  DK      Y + +    I QEI  NGPVV +  +++D   YK G
Sbjct: 217 CRKMC-QLKYNVAYKDDKIYGNDAYSLPNNETRIMQEIFTNGPVVGSFSVFADFAIYKKG 275

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
            Y +  +  N            G +A            VKI+GWG ++G           
Sbjct: 276 VYVSNGIQQN------------GAHA------------VKIIGWGVQDG----------- 300

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                    YW I +++   +GD+G ++ LRG N   IES V
Sbjct: 301 -----------------------LKYWLIANSWNNDWGDEGYVRFLRGDNHCGIESRV 335


>gi|294889976|ref|XP_002773021.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239877724|gb|EER04837.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 342

 Score = 70.9 bits (172), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 64/246 (26%), Positives = 97/246 (39%), Gaps = 76/246 (30%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHH------SNTGCQPVSFPPCNHA-NYTTSEPECKT- 53
           C  G  +   +++   G+VTGG +       ++ GC P  FP CNH        P C + 
Sbjct: 114 CRRGSVAEGLIFMKNHGIVTGGEYKPPKKLGNDDGCWPYPFPKCNHVPGMKVKYPRCGSK 173

Query: 54  ---LATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYL 110
              LA P     + C   +  R    D +R K +  +      I+QEI  NGPV A M +
Sbjct: 174 VGRLAAP-----SHCDGLHCRRA--GDVHRAKSWGRLPISPEKIKQEIFDNGPVAAIMTI 226

Query: 111 YSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGR 170
           + D   YKSG                ++ YK+G         +V   T+K++GWG E G 
Sbjct: 227 HEDFRLYKSG----------------VYEYKTGA--------MVGAHTLKLIGWGVEAG- 261

Query: 171 PYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 230
                                            + YW  V+++ E++GD+G IK+  G+N
Sbjct: 262 ---------------------------------QEYWLAVNSWNEEWGDQGKIKLAVGKN 288

Query: 231 EAIIES 236
               ES
Sbjct: 289 ALDEES 294


>gi|312382740|gb|EFR28091.1| hypothetical protein AND_04395 [Anopheles darlingi]
          Length = 381

 Score = 70.5 bits (171), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 61/242 (25%), Positives = 91/242 (37%), Gaps = 67/242 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G+ S+ W +  + G+ +GGA+ S+ GCQ   F  C          +   L   QP  
Sbjct: 204 CDGGVPSAVWHYWVENGITSGGAYESHEGCQSYPFGVCKPQEIFAPHVDLICLRQCQP-- 261

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
                   Y   + +DK+  +  Y V  +   I  E+   GPV A+  +Y+D   YKSG 
Sbjct: 262 -------GYNTTYLEDKHFGRVAYSVPRDEDRILYELFYFGPVQASFTVYTDFIQYKSGV 314

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                    Y V      V   +VKIVGWG EN             
Sbjct: 315 YRH-------------------TYGVR-----VGDHSVKIVGWGVEN------------- 337

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                G  +W   +++G ++G+ G  KI+RG +   +ES V   
Sbjct: 338 ---------------------GTKFWLCANSWGAEWGENGFFKIIRGEDHLSVESNVVAG 376

Query: 242 LP 243
           LP
Sbjct: 377 LP 378


>gi|291000017|ref|XP_002682576.1| cathepsin C [Naegleria gruberi]
 gi|284096203|gb|EFC49832.1| cathepsin C [Naegleria gruberi]
          Length = 430

 Score = 70.5 bits (171), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 42/122 (34%), Positives = 62/122 (50%), Gaps = 19/122 (15%)

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y NGP+     +Y D+ +YK GVY    + E+      K  G  E+   P++        
Sbjct: 326 YQNGPLAIGFEVYPDLRNYKHGVYKHVTAEEL------KAQGLSEDEMIPHF-------- 371

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
               E+V +A V ++GWG ENG PYW I +++   +GD G  KILRG +E  +ES     
Sbjct: 372 ----EVVNHA-VLMVGWGVENGTPYWKIKNSWSTTWGDNGYFKILRGSDECGVESDAEAG 426

Query: 242 LP 243
           +P
Sbjct: 427 IP 428


>gi|294952605|ref|XP_002787373.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239902345|gb|EER19169.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 185

 Score = 70.5 bits (171), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 53/187 (28%), Positives = 79/187 (42%), Gaps = 60/187 (32%)

Query: 53  TLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYS 112
            +  P P C T CTN  Y +   +D +R K +  V ++   I+QEI  NGPV+++  +Y 
Sbjct: 53  VVQQPVPPCRTTCTNKAYKKSLEKDVHRAKSWRKVLNDAQSIKQEIFDNGPVLSSFKMYE 112

Query: 113 DIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPY 172
           D F Y                      YKSGVY V  + E     ++KI+GWG  +GR  
Sbjct: 113 D-FRY----------------------YKSGVY-VPTTKESSTSHSIKIIGWGGASGR-- 146

Query: 173 WTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN-- 230
                                            YW  V+++ E++GD G IK+  G+N  
Sbjct: 147 --------------------------------EYWLAVNSWNEEWGDHGLIKMAFGKNRL 174

Query: 231 EAIIESL 237
           E I+ S+
Sbjct: 175 EKIVLSI 181


>gi|56754337|gb|AAW25356.1| SJCHGC00056 protein [Schistosoma japonicum]
          Length = 342

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 59/216 (27%), Positives = 83/216 (38%), Gaps = 60/216 (27%)

Query: 27  SNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYW 86
           ++TGCQP  FP C H       P C T     P+C   C    Y   F QDK   +    
Sbjct: 184 NHTGCQPYPFPKCEHLT-KGKYPACGTKIYKTPQCKQTCQK-GYKTPFEQDKPFGEGSSN 241

Query: 87  VNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYA 146
           V +     Q++IM        MY               GPV A   +Y D  + KSG+ +
Sbjct: 242 VQNNEKVFQRDIM--------MY---------------GPVEAAFDVYEDFLNSKSGI-S 277

Query: 147 VSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPY 206
              +  IV    ++I+GWG E G PY                                  
Sbjct: 278 RHVTGSIVGGHPIRIIGWGVEKGNPY---------------------------------- 303

Query: 207 WTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 242
           W I +++ E +G+ G  +++RGR+E  IES V   L
Sbjct: 304 WLIANSWNEDWGENGLFRMVRGRDECSIESHVVAGL 339


>gi|357511627|ref|XP_003626102.1| Cathepsin L-like proteinase [Medicago truncatula]
 gi|355501117|gb|AES82320.1| Cathepsin L-like proteinase [Medicago truncatula]
          Length = 351

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 66/244 (27%), Positives = 91/244 (37%), Gaps = 77/244 (31%)

Query: 2   CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
           C+ G   S W+++   G+VT     +    GC   S P C        EP  +T     P
Sbjct: 163 CAGGTPFSAWIYLAHHGVVTEECDPYFDQIGC---SHPGC--------EPTYRT-----P 206

Query: 60  KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
           KC  +C N N  + +   K+   + Y VN +  DI  E+ KNGPV     +Y D   YKS
Sbjct: 207 KCVKKCVNGN--QLWETSKHYSVKAYTVNSDPQDIMAEVYKNGPVEVAFTVYEDFAHYKS 264

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
           G            +Y  I  +  G +A            VK+VGWG  +           
Sbjct: 265 G------------VYKHITGFALGGHA------------VKLVGWGTSH----------- 289

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
                                  G  YW + + +   +GD G  KI RG NE  IE+ V 
Sbjct: 290 ----------------------EGEDYWLLANQWNTNWGDDGYFKIKRGTNECGIENAVT 327

Query: 240 GALP 243
             LP
Sbjct: 328 AGLP 331


>gi|87240981|gb|ABD32839.1| Peptidase C1A, papain; Somatotropin hormone; Peptidase C1,
           propeptide [Medicago truncatula]
          Length = 356

 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 66/244 (27%), Positives = 91/244 (37%), Gaps = 77/244 (31%)

Query: 2   CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
           C+ G   S W+++   G+VT     +    GC   S P C        EP  +T     P
Sbjct: 168 CAGGTPFSAWIYLAHHGVVTEECDPYFDQIGC---SHPGC--------EPTYRT-----P 211

Query: 60  KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
           KC  +C N N  + +   K+   + Y VN +  DI  E+ KNGPV     +Y D   YKS
Sbjct: 212 KCVKKCVNGN--QLWETSKHYSVKAYTVNSDPQDIMAEVYKNGPVEVAFTVYEDFAHYKS 269

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
           G            +Y  I  +  G +A            VK+VGWG  +           
Sbjct: 270 G------------VYKHITGFALGGHA------------VKLVGWGTSH----------- 294

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
                                  G  YW + + +   +GD G  KI RG NE  IE+ V 
Sbjct: 295 ----------------------EGEDYWLLANQWNTNWGDDGYFKIKRGTNECGIENAVT 332

Query: 240 GALP 243
             LP
Sbjct: 333 AGLP 336


>gi|343475054|emb|CCD13447.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 55/217 (25%), Positives = 78/217 (35%), Gaps = 63/217 (29%)

Query: 27  SNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYW 86
           +++ CQP  FP C H      +P C       P C+  CT+    +     KYR    Y 
Sbjct: 177 TSSQCQPYPFPRCEHRGAQGKKPPCSKYNFDTPTCNATCTD----KSVPLIKYRGNHSYE 232

Query: 87  VNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYA 146
           V  E  D ++E+  NGP V    ++SD  +YKSG Y +                      
Sbjct: 233 VRGE-EDYKRELYFNGPFVVRFQVHSDFLAYKSGVYQH---------------------- 269

Query: 147 VSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPY 206
              +   +    V+IVGWG+ NG P                                  Y
Sbjct: 270 --VAGNFLGGKAVRIVGWGKMNGTP----------------------------------Y 293

Query: 207 WTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
           W + +++   +G  G   ILRG NE  IE L     P
Sbjct: 294 WKVANSWDTDWGMNGYFLILRGNNECNIEHLGFAGTP 330


>gi|194352768|emb|CAQ00112.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
 gi|326488519|dbj|BAJ93928.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326508126|dbj|BAJ99330.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 355

 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 60/244 (24%), Positives = 90/244 (36%), Gaps = 77/244 (31%)

Query: 2   CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
           C+ G   S W +  + G+VT     +   TGCQ                P C+  A P P
Sbjct: 169 CNGGYPISAWRYFRRSGVVTEECDPYFDQTGCQ---------------HPGCEP-AYPTP 212

Query: 60  KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
           KCH +C  +N  + + ++K+     Y V+    DI  E+ KNGPV     +Y D   YKS
Sbjct: 213 KCHRKCKVEN--QVWKKNKHFSVNAYRVHSNPHDIMAEVYKNGPVEVAFTVYEDFAHYKS 270

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
           G Y +                         +  ++    VK++GWG  +           
Sbjct: 271 GVYKH------------------------ITGGVMGGHAVKLIGWGTSDA---------- 296

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
                                  G  YW + + +   +GD G  KI+RG+NE  IE  V 
Sbjct: 297 -----------------------GEDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEDVT 333

Query: 240 GALP 243
             +P
Sbjct: 334 AGMP 337


>gi|189502866|gb|ACE06814.1| unknown [Schistosoma japonicum]
          Length = 121

 Score = 68.6 bits (166), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 41/121 (33%), Positives = 58/121 (47%), Gaps = 35/121 (28%)

Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
           +GPV  +  +Y+D  +YKSGVY   + A +  +A                          
Sbjct: 33  HGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHA-------------------------- 66

Query: 184 SAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
                    V+L+GWGEEN  PYW I +++   +GD G  KI+RG+NE  IES VN  +P
Sbjct: 67  ---------VRLLGWGEENNVPYWLIANSWNTDWGDNGYFKIIRGKNECGIESDVNAGIP 117

Query: 244 K 244
           K
Sbjct: 118 K 118


>gi|66506619|ref|XP_393283.2| PREDICTED: uncharacterized peptidase C1-like protein F26E4.3-like
           [Apis mellifera]
          Length = 439

 Score = 68.6 bits (166), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 67/250 (26%), Positives = 101/250 (40%), Gaps = 80/250 (32%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVS--FPPCNHANYTTSEPE-CKTLATPQ 58
           C  G     W+++ K GLV       +  C P    +  C     T  E   C+  A P 
Sbjct: 264 CDGGYLDRAWLFMRKFGLV-------DEQCYPWKGVYEQCKLQKRTNLEAAGCRAPANPL 316

Query: 59  PKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
            K                + Y+    Y + +E  DI +EI+ +GPV A M +Y D FSY+
Sbjct: 317 RK----------------ELYKVGPAYRLGNET-DIMREILTSGPVQATMKVYQDFFSYE 359

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
           SG Y + P+            Y+SG            Y +V+I+GWGE+           
Sbjct: 360 SGIYMHTPIAE---------LYESG------------YHSVRIIGWGED----------- 387

Query: 179 YAVSASAEIVAYATVKLIGWGEENGRP--YWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
             +S                  ++G P  YW +V+++G+++G+ G  +I RG NE  IES
Sbjct: 388 --IST-----------------DSGLPIKYWLVVNSWGQEWGENGLFRIRRGINECDIES 428

Query: 237 LVNGALPKDN 246
            V     K N
Sbjct: 429 FVVAVWAKTN 438


>gi|157167285|ref|XP_001658487.1| cathepsin b [Aedes aegypti]
 gi|108876478|gb|EAT40703.1| AAEL007590-PA [Aedes aegypti]
          Length = 313

 Score = 68.6 bits (166), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 61/236 (25%), Positives = 90/236 (38%), Gaps = 71/236 (30%)

Query: 11  WVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNY 70
           W +  K+G+ +GG + SN GC P   PP          P+       +P C TRC   N 
Sbjct: 141 WSYWVKQGVSSGGPYGSNQGCHPYPMPPSCPKPSEGDYPD-------EPNCSTRC---NA 190

Query: 71  GRGFFQD--KYRFKRY-YWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPV 127
           G    +D    RF R  Y +  +   I ++I  NGPV A    Y DI +Y  G       
Sbjct: 191 GYNVTEDLRDRRFGRVAYSIPADERKIMEDIFVNGPVQAVFQWYEDIVNYSGG------- 243

Query: 128 VANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEI 187
                    ++ ++SG         +     VK++GWG E+G                  
Sbjct: 244 ---------VYRHQSG--------RLKGGHAVKLIGWGVEDG------------------ 268

Query: 188 VAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
                             YW + +++G  +GD G  K++RG N   IE  V+  LP
Sbjct: 269 ----------------TKYWLVANSWGRVWGDDGFFKMVRGENHCGIEENVHAGLP 308


>gi|327408413|emb|CCA30060.1| unnamed protein product [Neospora caninum Liverpool]
          Length = 463

 Score = 68.6 bits (166), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 49/180 (27%), Positives = 72/180 (40%), Gaps = 28/180 (15%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHS---NTGCQPVSFPPCNHANYTTSEPECKTLATPQ 58
           C+ G     W W  ++G+VTGG   +    T C P   P C H +     P C T   P+
Sbjct: 241 CNGGQPGMAWRWFERKGVVTGGDFDTLGKGTTCWPYEIPFCAH-HAKAPFPNCDTDVRPR 299

Query: 59  --PKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFS 116
             PKC   C    Y                 ++ V    +++ K     ++ Y      +
Sbjct: 300 KTPKCRKDCEEAAY-----------------SEHVLPFDKDVHK----ASSSYSLRSRDA 338

Query: 117 YKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
            K     +G V     +Y D  +YKSGVY       +  +A +KI+GWG E+G  YW  V
Sbjct: 339 VKRDMMAHGTVTGAFMVYEDFLNYKSGVYKHVYGGPLGGHA-IKIIGWGTEDGEEYWHAV 397


>gi|609175|emb|CAA57522.1| cathepsin B-like cysteine proteinase [Nicotiana rustica]
          Length = 356

 Score = 68.6 bits (166), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 65/253 (25%), Positives = 91/253 (35%), Gaps = 78/253 (30%)

Query: 2   CSSGISSSTWVWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
           C  G     W +  ++G+VT     +  N GC   S P C        EP     A P P
Sbjct: 168 CDGGYPLQAWKYFVRKGVVTDECDPYFDNEGC---SHPGC--------EP-----AYPTP 211

Query: 60  KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
           KCH +C   N    + + K+     Y ++ +   I  E+ KNGPV  +  +Y D   YKS
Sbjct: 212 KCHRKCVKQNLL--WSRSKHFGVNAYMISSDPHSIMTEVYKNGPVEVSFTVYEDFAHYKS 269

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
           G Y +                         + +I+    VK++GWG              
Sbjct: 270 GVYKH------------------------VTGDIMGGHAVKLIGWGT------------- 292

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
                                E+G  YW + + +   +GD G  KI RG NE  IE  V 
Sbjct: 293 --------------------SEDGEDYWLLANQWNRGWGDDGYFKIRRGTNECEIEDEVV 332

Query: 240 GALPK-DNYGVEF 251
             LP   N  VE 
Sbjct: 333 AGLPSARNLNVEL 345


>gi|48762489|dbj|BAD23814.1| cathepsin B-N [Tuberaphis takenouchii]
          Length = 163

 Score = 68.6 bits (166), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 45/124 (36%), Positives = 60/124 (48%), Gaps = 10/124 (8%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W W  K GLVTGG + S  GCQP   PPC    Y  +   C+    P  K 
Sbjct: 38  CHGGYPIKAWEWFKKHGLVTGGDYDSGEGCQPYRVPPCPLDEYGNN--TCR--GKPAEKN 93

Query: 62  HTRCTNDNYGR---GFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
           H RCT   YG     F +D +  +  Y++      IQ+++M  GP+ A+  +Y D  +YK
Sbjct: 94  H-RCTRMCYGNQELDFKEDHHWTRDAYYLT--YTTIQKDVMAYGPIEASFDVYDDFPNYK 150

Query: 119 SGKY 122
           SG Y
Sbjct: 151 SGVY 154


>gi|47212965|emb|CAF93376.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 271

 Score = 68.2 bits (165), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 59/239 (24%), Positives = 94/239 (39%), Gaps = 55/239 (23%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G     W ++ +RG+VT         C P   P    A  +    + +++   + + 
Sbjct: 75  CAGGRLDGAWWYLRRRGVVT-------EDCYPYRPPQQTPAELSRCMMQSRSVGRGKRQA 127

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             RC N N    +  D Y+    Y ++    +I +EI  NGPV A M ++ D F Y SG 
Sbjct: 128 TQRCPNTN---NYQNDIYQSTPPYRLSTSEKEIMKEIQDNGPVQAIMEVHEDFFMYNSG- 183

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                    +Y ++D+   K   Y    +       +VKI GWGEE              
Sbjct: 184 ---------IYKHTDVSFTKPPHYRKHGT------HSVKITGWGEERN------------ 216

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                             +   R YW   +++G+ +G+ G  +I RG NE  IE+ V G
Sbjct: 217 -----------------FDGTTRKYWIAANSWGKNWGENGYFRIARGENECEIEAFVIG 258


>gi|496968|gb|AAA96831.1| cysteine protease homologue, partial [Ancylostoma caninum]
          Length = 197

 Score = 68.2 bits (165), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 56/177 (31%), Positives = 74/177 (41%), Gaps = 28/177 (15%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPV--SFPPCNHANYTTSEPECKTLATPQP 59
           C  G S   + W+ +        +     C+PV  S    NH N     P C     P P
Sbjct: 44  CQGGWSIEAYKWMQRERCCYRWENTDRRVCKPVRPSIRVGNHPNDPYYGP-CPGGLWPTP 102

Query: 60  KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
           KC   C    Y + + +DK+   R Y++ +    I+QEI KNGPVVA   +Y D FSY  
Sbjct: 103 KCRKTCQRKYY-KSYQEDKHFATRAYYLPNNERSIRQEIYKNGPVVAAFRVYQD-FSY-- 158

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
                               YK G+Y      +  A+A VK+VGWG EN   YW I 
Sbjct: 159 --------------------YKKGIYVHKWGGQTGAHA-VKVVGWGRENATDYWLIA 194


>gi|12330244|gb|AAG52659.1| cysteine proteinase [Metagonimus yokogawai]
          Length = 183

 Score = 68.2 bits (165), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 47/167 (28%), Positives = 76/167 (45%), Gaps = 28/167 (16%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNH--ANYTTSEPECKTLATPQP 59
           C  G     W +    G+VTGG +   + C P  FPP +H  +  T  E   +TL  P P
Sbjct: 38  CVGGWIGDAWDYWRDNGIVTGGDYQDKSTCLPYPFPPSHHLVSKGTPFEIYPQTLY-PTP 96

Query: 60  KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
            C ++C  + Y   + +DK      Y ++    +IQ+EI+ NGPV A M +Y+D  +YK+
Sbjct: 97  PCVSKC-QEGYPGEYEKDKIFALSSYKIDRNATEIQKEILINGPVEAGMNVYADFPNYKT 155

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGE 166
           G Y +                         + EI+    ++++GWG+
Sbjct: 156 GVYQH------------------------TTGEILGGHAIRLLGWGK 178


>gi|410910940|ref|XP_003968948.1| PREDICTED: tubulointerstitial nephritis antigen-like [Takifugu
           rubripes]
          Length = 477

 Score = 67.8 bits (164), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 61/239 (25%), Positives = 96/239 (40%), Gaps = 55/239 (23%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G     W ++ +RG+VT         C P   P    A       + +++   + + 
Sbjct: 272 CTGGRIDGAWWFLRRRGVVT-------EDCYPYRPPQQTPAELGRCMMQSRSVGRGKRQA 324

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             RC N N    +  D Y+    Y ++    +I +EI  NGPV A M ++ D F YKSG 
Sbjct: 325 TQRCPNTN---NYQNDIYQSTPPYRLSTNEKEIMKEIQDNGPVQAIMEVHEDFFVYKSG- 380

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                    +Y ++D+   K   Y    +       +VKI GWGEE             V
Sbjct: 381 ---------IYKHTDVSFTKPPQYRKHGT------HSVKITGWGEERN-----------V 414

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
             +                   R YW   +++G+ +G++G  +I RG NE  IE+ V G
Sbjct: 415 DGAK------------------RKYWIAANSWGKNWGEEGYFRIARGENECEIEAFVIG 455


>gi|345327151|ref|XP_001507103.2| PREDICTED: tubulointerstitial nephritis antigen-like
           [Ornithorhynchus anatinus]
          Length = 327

 Score = 67.8 bits (164), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 61/246 (24%), Positives = 97/246 (39%), Gaps = 68/246 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G     W ++ +RGLV+         C P++      +  + +EP C+  + P  + 
Sbjct: 124 CNGGRLDRAWSFLRRRGLVS-------DKCYPLA------SQNSIAEP-CRMYSRPMGRG 169

Query: 62  HTRCT-----NDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFS 116
             + T     N ++   +  D Y+    Y ++    DI +EIM+NGPV A M ++ D F 
Sbjct: 170 KRQATGPCPNNFHHSNDYSNDIYQSTPPYRLSSNEKDIMKEIMENGPVQALMEVHEDFFL 229

Query: 117 YKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
           YK G                I+ +                 +VKI GWGEE         
Sbjct: 230 YKDG----------------IYRHTPASNGKPPQFRRQGTHSVKITGWGEEL-------- 265

Query: 177 RVYAVSASAEIVAYATVKLIGWGEENGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAII 234
                                  + NGR   +W   +++G  +G+ G+ +ILRG NE  I
Sbjct: 266 -----------------------QPNGRRVKFWRAANSWGPTWGEGGSFRILRGCNECDI 302

Query: 235 ESLVNG 240
           ES V G
Sbjct: 303 ESFVVG 308


>gi|129270160|ref|NP_001038442.2| tubulointerstitial nephritis antigen-like precursor [Danio rerio]
 gi|126632071|gb|AAI33830.1| Si:dkey-158b13.1 [Danio rerio]
          Length = 471

 Score = 67.8 bits (164), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 60/239 (25%), Positives = 98/239 (41%), Gaps = 55/239 (23%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G     W ++ +RG+VT         C P S P  +         + + +   + + 
Sbjct: 267 CAGGRIDGAWWFMRRRGVVT-------QDCYPFSPPEQSAVEVARCMMQSRAVGRGKRQA 319

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C N +    +  D Y+    Y ++    +I +EIM NGPV A M ++ D F YKSG 
Sbjct: 320 TAHCPNSH---SYHNDIYQSTPPYRLSTNENEIMKEIMDNGPVQAIMEVHEDFFVYKSG- 375

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                    ++ ++D+  +K   Y   A+       +V+I GWGEE         R Y+ 
Sbjct: 376 ---------IFRHTDVNYHKPSQYRKHAT------HSVRITGWGEE---------RDYSG 411

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                                 R YW   +++G+ +G+ G  +I RG NE  IE+ V G
Sbjct: 412 RT--------------------RKYWIGANSWGKNWGEDGYFRIARGVNECDIETFVIG 450


>gi|10803441|emb|CAC13133.1| putative cathepsin B.7 [Ostertagia ostertagi]
          Length = 198

 Score = 67.8 bits (164), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 59/182 (32%), Positives = 77/182 (42%), Gaps = 36/182 (19%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEP---ECKTLATPQ 58
           C  G     W +    G+VTGG       C+     PC    Y  +EP    C ++A   
Sbjct: 43  CEGGWPIEAWKYGVTEGVVTGGNFGRKECCRSYEIHPCG---YHGNEPFYGHCHSMAR-T 98

Query: 59  PKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
           P C  RC    Y   +  DK      Y + + V  IQ++IM+NGPVVA   +Y D F Y 
Sbjct: 99  PPCKKRC-RPGYKNSYMMDKRYGTSAYELPNSVXAIQRDIMENGPVVAGFDVYED-FKY- 155

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGE---ENGR-PYWT 174
                                YKSG+Y  +A      +A VK++GWGE   ENG  PYW 
Sbjct: 156 ---------------------YKSGIYRHTAGKXTGGHA-VKVIGWGEEXTENGTIPYWI 193

Query: 175 IV 176
           I 
Sbjct: 194 IA 195


>gi|2317912|gb|AAC24376.1| cathepsin B-like cysteine proteinase [Arabidopsis thaliana]
          Length = 357

 Score = 67.4 bits (163), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 65/246 (26%), Positives = 89/246 (36%), Gaps = 77/246 (31%)

Query: 2   CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
           C+ G     W++    G+VT     +  NTGC   S P C        EP       P P
Sbjct: 169 CNGGFPMGAWLYFKYHGVVTQECDPYFDNTGC---SHPGC--------EP-----TYPTP 212

Query: 60  KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
           KC  +C + N   G  + K+     Y +N +  DI  E+                     
Sbjct: 213 KCERKCVSRNQLWG--ESKHYGVGAYRINPDPQDIMAEV--------------------- 249

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
             Y NGPV     +Y D   YKSGVY      +I  +A VK++GWG              
Sbjct: 250 --YKNGPVEVAFTVYEDFAHYKSGVYKYITGTKIGGHA-VKLIGWGT------------- 293

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
                                ++G  YW + + +   +GD G  KI RG NE  IE  V 
Sbjct: 294 --------------------SDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEQSVV 333

Query: 240 GALPKD 245
             LP +
Sbjct: 334 AGLPSE 339


>gi|308494436|ref|XP_003109407.1| hypothetical protein CRE_08204 [Caenorhabditis remanei]
 gi|308246820|gb|EFO90772.1| hypothetical protein CRE_08204 [Caenorhabditis remanei]
          Length = 470

 Score = 67.4 bits (163), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 55/173 (31%), Positives = 75/173 (43%), Gaps = 52/173 (30%)

Query: 76  QDKYRFKRY--YWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYL 133
           QD   FK    Y V+    DIQ E+M NGPV A   ++ D F Y  G          +Y 
Sbjct: 325 QDSTAFKMTPPYKVSSREEDIQTELMTNGPVQATFVVHEDFFMYAGG----------VYQ 374

Query: 134 YSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATV 193
           +SD+ + K       AS+    Y +V+++GWG                      V ++T 
Sbjct: 375 HSDLAAQK------GASSVAEGYHSVRVLGWG----------------------VDHST- 405

Query: 194 KLIGWGEENGRP--YWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
                    GRP  YW   +++G Q+G+ G  KILRG N   IES V GA  K
Sbjct: 406 ---------GRPIKYWLCANSWGTQWGEDGYFKILRGENHCEIESFVIGAWGK 449


>gi|170030060|ref|XP_001842908.1| cathepsin B [Culex quinquefasciatus]
 gi|167865914|gb|EDS29297.1| cathepsin B [Culex quinquefasciatus]
          Length = 320

 Score = 67.4 bits (163), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 66/244 (27%), Positives = 94/244 (38%), Gaps = 74/244 (30%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G    TW +    GL + G + S  GC    F      +Y  ++P         P C
Sbjct: 149 CDGGYVGKTWQYWVDSGLTSEGPYKSGQGCNSYPF-----GSYCVNDP--------LPTC 195

Query: 62  HTRCTNDNYGRGFFQD-KYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
              C    Y   + QD KY    Y  + +E A I  EI +NGPVV    +++D + YKSG
Sbjct: 196 SRTC-QAGYPLTYSQDLKYGGSAYRVMWNENA-IMTEIYQNGPVVVQFEVFADFYQYKSG 253

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
            Y +                      V+ + E   +  V+++GWG ENG           
Sbjct: 254 VYRH----------------------VTGATE--GWHAVRVIGWGVENG----------- 278

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                       VK           YW + +++G ++GDKG  K +RG N   IE  V  
Sbjct: 279 ------------VK-----------YWLVANSWGVRWGDKGFFKFVRGENHLGIEDFVYA 315

Query: 241 ALPK 244
            LPK
Sbjct: 316 GLPK 319


>gi|38639325|gb|AAR25800.1| cathepsin B-like cysteine proteinase [Solanum tuberosum]
          Length = 354

 Score = 67.4 bits (163), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 73/257 (28%), Positives = 93/257 (36%), Gaps = 82/257 (31%)

Query: 2   CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
           C  G   + W +  + G+VT     +   TGC               S P C+ L  P P
Sbjct: 168 CDGGYPIAAWRYFKRSGVVTEECDPYFDTTGC---------------SHPGCEPL-YPTP 211

Query: 60  KCHTRCTNDNYGRGFFQDKYRFKRYYWVND-EVADIQQEIMKNGPVVANMYLYSDIFSYK 118
           KCH +C   N         +R  ++Y VN   V+   Q IM                   
Sbjct: 212 KCHRKCVKGNV-------LWRKSKHYGVNAYRVSHDPQSIMAE----------------- 247

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVR 177
              Y NGPV  +  +Y D   YKSGVY       +  +A VK++GWG  E G  YW IV 
Sbjct: 248 --VYKNGPVEVSFTVYEDFAHYKSGVYKHVTGGNMGGHA-VKLIGWGTSEQGEDYWLIVN 304

Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
            +                 GWGE+                   G  KI RG NE  IE  
Sbjct: 305 SWNR---------------GWGED-------------------GYFKIRRGTNECGIEHS 330

Query: 238 VNGALPK-DNYGVEFGE 253
           V   LP   N  VE G+
Sbjct: 331 VVAGLPSARNLNVELGD 347


>gi|283468816|emb|CAO98753.1| putative cathepsin B [Fasciola hepatica]
          Length = 112

 Score = 67.4 bits (163), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 50/168 (29%), Positives = 71/168 (42%), Gaps = 58/168 (34%)

Query: 76  QDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYS 135
           QDK + K  Y V ++  DI  EIMKNGPV    Y++ D   YKSG               
Sbjct: 3   QDKVKGKSSYNVGEQETDIMMEIMKNGPVDGIFYMFEDFLVYKSG--------------- 47

Query: 136 DIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKL 195
            I+ Y +G         +V    ++++GWG ENG                       VK 
Sbjct: 48  -IYHYTTG--------RLVGGHAIRVIGWGVENG-----------------------VK- 74

Query: 196 IGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
                     YW I +++ E +G+KG  ++ RG NE  IE+ +N  LP
Sbjct: 75  ----------YWLIANSWNEGWGEKGYFRMRRGNNECGIEARINAGLP 112


>gi|343477197|emb|CCD11909.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 336

 Score = 67.4 bits (163), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 59/242 (24%), Positives = 83/242 (34%), Gaps = 70/242 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   + W +    G+       +++ CQP  FP C H      +  C       P+C
Sbjct: 159 CEGGYPDAAWEYYVSHGI-------ASSQCQPYPFPRCEHRGAQGKKTPCSKYKFVTPQC 211

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +  CT+    +     KYR    Y V  E                          YK   
Sbjct: 212 NATCTD----KTIPLIKYRGNHSYEVRGEE------------------------DYKREL 243

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y NGP V    ++SD  +YK+GVY    +   +    V+IVGWG+ NG P          
Sbjct: 244 YFNGPFVVRFQVHSDFLAYKNGVYQ-HVAGNFLGGKAVRIVGWGKLNGTP---------- 292

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                   YW + +++   +G  G   ILRG NE  IE L    
Sbjct: 293 ------------------------YWKVANSWDTDWGMNGYFLILRGDNECNIEHLGFAG 328

Query: 242 LP 243
            P
Sbjct: 329 TP 330


>gi|417409900|gb|JAA51439.1| Putative cysteine proteinase tin-ag, partial [Desmodus rotundus]
          Length = 346

 Score = 67.4 bits (163), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 63/243 (25%), Positives = 97/243 (39%), Gaps = 62/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPEC----KTLATP 57
           C  G   S W ++ +RG+V+         C P S         T   P C    + +   
Sbjct: 149 CQGGHLDSAWWFLRRRGVVS-------DHCYPFSG---QGRTETGPAPRCMMHSRAMGRG 198

Query: 58  QPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
           + +   RC N         D Y+    Y +     +I +E+M+NGPV A M ++ D F Y
Sbjct: 199 KRQATARCPNHQV---HANDIYQVTPAYRLGSSEKEIMKELMENGPVQALMEVHEDFFLY 255

Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
           ++G Y + PV     L       + G +            +VKI GWGEE+         
Sbjct: 256 QNGIYSHTPVS----LGRPERYRRHGTH------------SVKITGWGEES--------- 290

Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
                    +    T+K           YWT  +++G  +G++G  +I+RG NE  IES 
Sbjct: 291 ---------LPDGRTLK-----------YWTAANSWGPAWGERGHFRIVRGANECDIESF 330

Query: 238 VNG 240
           V G
Sbjct: 331 VLG 333


>gi|18378945|ref|NP_563647.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|332189291|gb|AEE27412.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
          Length = 379

 Score = 67.4 bits (163), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 65/246 (26%), Positives = 89/246 (36%), Gaps = 77/246 (31%)

Query: 2   CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
           C+ G     W++    G+VT     +  NTGC   S P C        EP       P P
Sbjct: 191 CNGGFPMGAWLYFKYHGVVTQECDPYFDNTGC---SHPGC--------EP-----TYPTP 234

Query: 60  KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
           KC  +C + N   G  + K+     Y +N +  DI  E+                     
Sbjct: 235 KCERKCVSRNQLWG--ESKHYGVGAYRINPDPQDIMAEV--------------------- 271

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
             Y NGPV     +Y D   YKSGVY      +I  +A VK++GWG              
Sbjct: 272 --YKNGPVEVAFTVYEDFAHYKSGVYKYITGTKIGGHA-VKLIGWGT------------- 315

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
                                ++G  YW + + +   +GD G  KI RG NE  IE  V 
Sbjct: 316 --------------------SDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEQSVV 355

Query: 240 GALPKD 245
             LP +
Sbjct: 356 AGLPSE 361


>gi|341891358|gb|EGT47293.1| hypothetical protein CAEBREN_29072 [Caenorhabditis brenneri]
          Length = 349

 Score = 67.4 bits (163), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 59/188 (31%), Positives = 80/188 (42%), Gaps = 57/188 (30%)

Query: 66  TNDNYGRGF-----FQDKYRFKRY--YWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
           + DN  RG       QD   FK    Y V+    DIQ E+M NGPV A   ++ D F Y 
Sbjct: 189 SRDNDRRGLRCPSGSQDSTAFKMTPPYKVSSREEDIQTELMTNGPVQATFVVHEDFFMYA 248

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
            G          +Y +SD+ + K       AS+    Y +V+++GWG             
Sbjct: 249 GG----------VYQHSDLAAQK------GASSVAEGYHSVRVLGWG------------- 279

Query: 179 YAVSASAEIVAYATVKLIGWGEENGRP--YWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
                    V ++T          GRP  YW   +++G Q+G+ G  KILRG N   IES
Sbjct: 280 ---------VDHST----------GRPIKYWLCANSWGTQWGEDGYFKILRGDNHCEIES 320

Query: 237 LVNGALPK 244
            V GA  K
Sbjct: 321 FVVGAWGK 328


>gi|294937366|ref|XP_002782055.1| cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
 gi|239893340|gb|EER13850.1| cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
          Length = 159

 Score = 67.4 bits (163), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 52/205 (25%), Positives = 78/205 (38%), Gaps = 62/205 (30%)

Query: 27  SNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYY 85
           S  GC P  FP CNH     S  P C  ++      H   T+ +    + +D +R K + 
Sbjct: 4   SADGCWPYPFPKCNHVRSAASRYPACPAVSPSAVGAHQMETSYSL---YIRDLHRAKSFG 60

Query: 86  WVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVY 145
            +     +I+QEI  NGPV+  + +Y DI  YK+G Y                       
Sbjct: 61  RLPAIPQNIKQEIFTNGPVIGMLSIYEDIRVYKAGVY----------------------- 97

Query: 146 AVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRP 205
            V  +       T+KI+GWG E+G                                  + 
Sbjct: 98  -VHQTGSFQGIHTLKIIGWGVESG----------------------------------QD 122

Query: 206 YWTIVSTFGEQFGDKGTIKILRGRN 230
           YW  V+++ E++GD G IK+  GR 
Sbjct: 123 YWLAVNSWNEEWGDHGMIKLAVGRT 147


>gi|348513320|ref|XP_003444190.1| PREDICTED: tubulointerstitial nephritis antigen-like [Oreochromis
           niloticus]
          Length = 499

 Score = 67.0 bits (162), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 58/240 (24%), Positives = 98/240 (40%), Gaps = 55/240 (22%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G     W ++ +RG+VT         C P   P    A       + +++   + + 
Sbjct: 294 CAGGRIDGAWWYLRRRGVVT-------EDCYPYQPPHQTPAEVGRCMMQSRSVGRGKRQA 346

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             RC N    + +  D Y+    Y ++    +I +EIM NGPV A M ++ D F YK+G 
Sbjct: 347 TQRCPNT---QNYHNDIYQSTPPYRLSSNEKEIMKEIMDNGPVQAIMEVHEDFFVYKTG- 402

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                    +Y ++D+   K   Y    +       +V+I GWGE+             V
Sbjct: 403 ---------IYKHTDVSFTKPPQYRKHGT------HSVRITGWGEDRN-----------V 436

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
             ++                  R YW   +++G+ +G+ G  +I+RG NE  IE+ V G 
Sbjct: 437 DGTS------------------RKYWIAANSWGKNWGENGYFRIVRGENECEIETFVIGV 478


>gi|14582576|gb|AAK69541.1|AF283476_1 cathepsin B-like cysteine proteinase [Ipomoea batatas]
          Length = 352

 Score = 67.0 bits (162), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 65/260 (25%), Positives = 92/260 (35%), Gaps = 77/260 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
           C  G   + W +  + G+VT     +   TGC   S P C        EP     A P P
Sbjct: 164 CDGGYPIAAWQYFKRTGVVTSECDPYFDQTGC---SHPGC--------EP-----AYPTP 207

Query: 60  KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
            C  +C   N    + + K+     Y VN +   I  E+                     
Sbjct: 208 ACEKKCVKKNLL--WSESKHFSVNAYRVNSDQHSIMTEV--------------------- 244

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
             Y NGP   +  +Y D   YKSGVY     +E+  +A VK++GWG              
Sbjct: 245 --YTNGPAEVSFTVYEDFAHYKSGVYKHVTGSEMGGHA-VKLIGWGT------------- 288

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
                                E+G  YW + + +   +GD G  KI+RG NE  IE +  
Sbjct: 289 --------------------SEDGEDYWLLANQWNRSWGDDGYFKIIRGTNECGIEDVTA 328

Query: 240 GALPKDNYGVEFGEESGERL 259
           G     N  +E G    + L
Sbjct: 329 GMPSTKNLDIESGVRDDDSL 348


>gi|432884030|ref|XP_004074413.1| PREDICTED: tubulointerstitial nephritis antigen-like [Oryzias
           latipes]
          Length = 474

 Score = 67.0 bits (162), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 61/243 (25%), Positives = 98/243 (40%), Gaps = 63/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G     W ++ +RG+VT         C P   P    A       + + +   + + 
Sbjct: 269 CAGGRIDGAWWYLRRRGVVT-------ENCYPYQPPQQAPAEVGRCMMQSRAVGRGKRQA 321

Query: 62  HTRCTND-NYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
             RC N  NY    +Q    +K    ++    +I +EIM+NGPV A M ++ D F YK+G
Sbjct: 322 TQRCPNTYNYHNDIYQSTPPYK----LSSNEKEIMKEIMENGPVQAIMEVHEDFFVYKNG 377

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEE---NGRPYWTIVR 177
                     +Y ++D+ S K   Y    +       +V+I GWGE+   +G P      
Sbjct: 378 ----------IYKHTDVSSTKPPQYRKHGT------HSVRITGWGEDKDYDGTP------ 415

Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
                                     R YW   +++G+ +G+ G  +I RG NE  IE+ 
Sbjct: 416 --------------------------RKYWIAANSWGKNWGENGFFRIARGANECEIEAF 449

Query: 238 VNG 240
           V G
Sbjct: 450 VIG 452


>gi|157058733|gb|ABV03124.1| cathepsin B-16a [Acyrthosiphon pisum]
          Length = 274

 Score = 67.0 bits (162), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 44/128 (34%), Positives = 61/128 (47%), Gaps = 10/128 (7%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G     W +    G+VTGG + S  GC+P   PPC        E +      P  K 
Sbjct: 153 CNGGYPIKAWKYFSSHGIVTGGNYKSGEGCEPYRVPPCPQ----DEEGKSSCAGKPIEKN 208

Query: 62  HTRCTNDNYGRG--FFQDKYRFKR-YYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
           H RCT   YG     + + +RF R YY++      IQ+++M  GP+ A+  +Y D  SYK
Sbjct: 209 H-RCTRMCYGNQDLDYNEDHRFTRDYYYLT--YGSIQKDVMNYGPIEASFDVYDDFPSYK 265

Query: 119 SGKYGNGP 126
           SG Y   P
Sbjct: 266 SGVYQRTP 273


>gi|358421824|ref|XP_003585145.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bos taurus]
          Length = 428

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 65/243 (26%), Positives = 98/243 (40%), Gaps = 62/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAH----HSNTGCQPVSFPPCNHANYTTSEPECKTLATP 57
           C  G     W ++ +RG+V+   +    H     + V  PPC    ++ +    K  AT 
Sbjct: 231 CRGGRLDGAWWFLRRRGVVSDHCYPFSGHGRD--EAVPAPPC--MMHSRAMGRGKRQAT- 285

Query: 58  QPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
                 RC N         D Y+    Y +     +I +E+M+NGPV A M ++ D F Y
Sbjct: 286 -----ARCPNSYV---HANDIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLY 337

Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
           +SG Y + PV     L       + G +            +VKI GWGEE          
Sbjct: 338 QSGIYSHTPVS----LGRPERYRRHGTH------------SVKITGWGEET--------- 372

Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
                    +    T+K           YWT  +++G  +G++G  +I+RG NE  IES 
Sbjct: 373 ---------LPDGRTIK-----------YWTAANSWGPAWGERGHFRIVRGANECDIESF 412

Query: 238 VNG 240
           V G
Sbjct: 413 VLG 415


>gi|431891156|gb|ELK02033.1| Tubulointerstitial nephritis antigen-like protein [Pteropus alecto]
          Length = 467

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 63/243 (25%), Positives = 96/243 (39%), Gaps = 62/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPEC----KTLATP 57
           C  G     W ++ +RG+V+         C P S       N    EP C    + +   
Sbjct: 270 CRGGRLDGAWWFLRRRGVVS-------DHCYPFSG---QERNEAGPEPRCMMHSRAMGRG 319

Query: 58  QPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
           + +   RC N +       D Y+    Y +     +I +E+M+NGPV A M ++ D F Y
Sbjct: 320 KRQAIARCPNHHV---HANDIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLY 376

Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
           + G Y + PV     L       + G +            +VKI GWGEE          
Sbjct: 377 QGGIYSHTPVS----LGKPERYRRHGTH------------SVKITGWGEET--------- 411

Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
                    +    T+K           YWT  +++G  +G++G  +I+RG NE  IES 
Sbjct: 412 ---------LPDGRTLK-----------YWTAANSWGPAWGERGHFRIVRGTNECDIESF 451

Query: 238 VNG 240
           V G
Sbjct: 452 VLG 454


>gi|341898422|gb|EGT54357.1| hypothetical protein CAEBREN_10381 [Caenorhabditis brenneri]
          Length = 466

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 53/173 (30%), Positives = 73/173 (42%), Gaps = 52/173 (30%)

Query: 76  QDKYRFKRY--YWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYL 133
           QD   FK    Y V+    DIQ E+M NGPV A   ++ D F Y  G          +Y 
Sbjct: 321 QDSTAFKMTPPYKVSSREEDIQTELMTNGPVQATFVVHEDFFMYAGG----------VYQ 370

Query: 134 YSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATV 193
           +SD+ + K       AS+    Y +V+++GWG ++                         
Sbjct: 371 HSDLAAQK------GASSVAEGYHSVRVLGWGVDH------------------------- 399

Query: 194 KLIGWGEENGRP--YWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
                    GRP  YW   +++G Q+G+ G  KILRG N   IES V GA  K
Sbjct: 400 -------STGRPIKYWLCANSWGTQWGEDGYFKILRGDNHCEIESFVIGAWGK 445


>gi|268564843|ref|XP_002639246.1| Hypothetical protein CBG03805 [Caenorhabditis briggsae]
          Length = 526

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 54/170 (31%), Positives = 74/170 (43%), Gaps = 52/170 (30%)

Query: 76  QDKYRFKRY--YWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYL 133
           QD   FK    Y V+    DIQ E+M NGPV A   ++ D F Y  G          +Y 
Sbjct: 381 QDSTAFKMTPPYKVSSREEDIQTELMTNGPVQATFVVHEDFFMYAGG----------VYQ 430

Query: 134 YSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATV 193
           +SD+ + K       AS+    Y +V+++GWG                      V ++T 
Sbjct: 431 HSDLAAQK------GASSVAEGYHSVRVLGWG----------------------VDHST- 461

Query: 194 KLIGWGEENGRP--YWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                    GRP  YW   +++G Q+G+ G  KILRG N   IES V GA
Sbjct: 462 ---------GRPIKYWLCANSWGTQWGEDGYFKILRGENHCEIESFVIGA 502


>gi|159175|gb|AAA29176.1| cysteine proteinase [Haemonchus contortus]
          Length = 348

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 58/244 (23%), Positives = 93/244 (38%), Gaps = 63/244 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +    G+V+GG +     C P    PC      T    C  +A P P C
Sbjct: 159 CRGGWPIEAWKFFEYDGVVSGGPYLGKGCCSPYPLHPCGRHGNDTFYGNCVGMA-PTPPC 217

Query: 62  HTRCTNDNYGRGFFQDKYRFK---RYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
             +C      RG ++   R+    R Y +      I+++I + G VVA       +F+  
Sbjct: 218 KRKCQPGF--RGMYRVDKRYGEPGRTYTLPRSEVKIRRDIKERGSVVA-------VFA-- 266

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
                         +Y D   Y+SG+Y  +A          +  G               
Sbjct: 267 --------------VYEDFSHYQSGIYKHTAG---------RFTG--------------- 288

Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                      Y  VK+IGWG++NG  YW I +++ + +G+ G  +++RG N   IE  V
Sbjct: 289 ----------GYHAVKMIGWGKDNGTDYWLIANSWHDDWGENGFFRMIRGINNCGIEEQV 338

Query: 239 NGAL 242
           +  +
Sbjct: 339 DAGI 342


>gi|268561866|ref|XP_002638438.1| Hypothetical protein CBG18654 [Caenorhabditis briggsae]
          Length = 396

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 66/243 (27%), Positives = 87/243 (35%), Gaps = 69/243 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G +     +    G+VTGG +    GC P SF PC+    T  EP+        P C
Sbjct: 155 CQGGYTIEAMKYWMNSGVVTGGDYQ-GAGCIPYSFRPCS----TCKEPK------DAPSC 203

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
            T C      +  ++          V + V  IQ EI  NGPV     +Y D + YKSG 
Sbjct: 204 KTTCQASYKAKSAYRLPTTTSSNAIVANAVQMIQTEIYNNGPVEVAYQVYDDFYHYKSGV 263

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y         ++Y D    K   +A            VKI+GWG E    YW +   ++ 
Sbjct: 264 Y--------YHVYGD----KPSGHA------------VKIIGWGTEKKVDYWLVANSWST 299

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
           +                                  FG+ G  KI RG NE  IE  V   
Sbjct: 300 T----------------------------------FGENGFFKIRRGTNECGIEENVVAG 325

Query: 242 LPK 244
           LPK
Sbjct: 326 LPK 328


>gi|326492684|dbj|BAJ90198.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 355

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 59/244 (24%), Positives = 89/244 (36%), Gaps = 77/244 (31%)

Query: 2   CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
           C+ G   S W +  + G+VT     +   TGCQ                P C+  A P P
Sbjct: 169 CNGGYPISAWRYFRRSGVVTEECDPYFDQTGCQ---------------HPGCEP-AYPTP 212

Query: 60  KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
           KCH +C  +N  + + ++K+     Y V+    DI  E+ KNGPV     +Y D   YKS
Sbjct: 213 KCHRKCKVEN--QVWKKNKHSSVNAYRVHSNPHDIMAEVYKNGPVEVAFTVYEDFAHYKS 270

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
           G Y +                         +  ++    VK++GWG  +           
Sbjct: 271 GVYKH------------------------ITGGVMGGHAVKLIGWGTSDA---------- 296

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
                                  G  YW + + +   +G  G  KI+RG+NE  IE  V 
Sbjct: 297 -----------------------GEDYWLLANQWNRGWGGDGYFKIIRGKNECGIEEDVT 333

Query: 240 GALP 243
             +P
Sbjct: 334 AGMP 337


>gi|294898091|ref|XP_002776152.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239882839|gb|EER07968.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 382

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 61/243 (25%), Positives = 95/243 (39%), Gaps = 71/243 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   S W WVH +G+ TG         +  + P   + +             P P C
Sbjct: 210 CGGGDPYSAWSWVHDKGIATGEGSRPKRVSESEAIPVIAYQDIY-----------PTPNC 258

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C N  Y      D++                  ++++ P     Y YS +   K+  
Sbjct: 259 VEQCRNPKYTTTLRDDRHF-----------------MLESSP-----YHYS-VNDAKNAI 295

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
             +GPV A+  +Y D  +YKSGVY  ++ + +  +A VKI+GWGE++G            
Sbjct: 296 RTDGPVSASFTVYEDFLAYKSGVYKHTSGSYLGGHA-VKIIGWGEKSG------------ 342

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                                 + YW  V+++ E +GDKG  KI  G N  I + L+ G 
Sbjct: 343 ----------------------QAYWLAVNSWNEDWGDKGLFKIALG-NCGIDDDLLGGT 379

Query: 242 LPK 244
            PK
Sbjct: 380 -PK 381


>gi|417401428|gb|JAA47600.1| Putative cysteine proteinase tin-ag [Desmodus rotundus]
          Length = 466

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 63/243 (25%), Positives = 97/243 (39%), Gaps = 62/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPEC----KTLATP 57
           C  G   S W ++ +RG+V+         C P S         T   P C    + +   
Sbjct: 269 CQGGHLDSAWWFLRRRGVVS-------DHCYPFSG---QGRTETGPAPRCMMHSRAMGRG 318

Query: 58  QPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
           + +   RC N         D Y+    Y +     +I +E+M+NGPV A M ++ D F Y
Sbjct: 319 KRQATARCPNHQV---HANDIYQVTPAYRLGSSEKEIMKELMENGPVQALMEVHEDFFLY 375

Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
           ++G Y + PV     L       + G +            +VKI GWGEE+         
Sbjct: 376 QNGIYSHTPVS----LGRPERYRRHGTH------------SVKITGWGEES--------- 410

Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
                    +    T+K           YWT  +++G  +G++G  +I+RG NE  IES 
Sbjct: 411 ---------LPDGRTLK-----------YWTAANSWGPAWGERGHFRIVRGANECDIESF 450

Query: 238 VNG 240
           V G
Sbjct: 451 VLG 453


>gi|363742306|ref|XP_428202.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Gallus
           gallus]
          Length = 464

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 59/240 (24%), Positives = 92/240 (38%), Gaps = 58/240 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAH-HSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 60
           CS G     W ++ +RG+VT   +  ++   QP + P   H+  T       T   P P+
Sbjct: 269 CSGGRLDGAWWYLRRRGVVTDECYPFTSQDSQPAAQPCMMHSRSTGRGKRQATARCPNPQ 328

Query: 61  CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
            H              D Y+    Y +     +I +E+M+NGPV A + ++ D F YKSG
Sbjct: 329 THA------------NDIYQSTPAYRLAPSEKEIMKELMENGPVQAILEVHEDFFLYKSG 376

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
                           I+ + +         +     +VKI GWGEE   P   + +   
Sbjct: 377 ----------------IYRHTAVAEGKGPKHQQHGTHSVKITGWGEEQ-LPDGQVQK--- 416

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                                    YWT  +++G  +G+ G  +I RG NE  +ES V G
Sbjct: 417 -------------------------YWTAANSWGRAWGEDGHFRIARGVNECEVESFVVG 451


>gi|297843026|ref|XP_002889394.1| hypothetical protein ARALYDRAFT_887367 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335236|gb|EFH65653.1| hypothetical protein ARALYDRAFT_887367 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 359

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 65/237 (27%), Positives = 88/237 (37%), Gaps = 77/237 (32%)

Query: 11  WVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTND 68
           W++    G+VT     +  NTGC   S P C        EP       P PKC  +C ++
Sbjct: 180 WLYFKYHGVVTEECDPYFDNTGC---SHPGC--------EP-----GYPTPKCVRKCVSE 223

Query: 69  NYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVV 128
           N   G  + K+     Y +N +  DI  E+ KNGPV     +Y D   YKSG        
Sbjct: 224 NQLWG--ESKHYGVSAYRINHDPQDIMAEVYKNGPVEVAFTVYEDFAHYKSG-------- 273

Query: 129 ANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIV 188
               +Y  I   K G +A            VK++GWG                       
Sbjct: 274 ----VYKHITGTKIGGHA------------VKLIGWGT---------------------- 295

Query: 189 AYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKD 245
                       ++G  YW + + +   +GD G  KI RG NE  IE  V   LP D
Sbjct: 296 -----------SDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEHGVVAGLPSD 341


>gi|110456454|gb|ABG74712.1| cathepsin B preproprotein-like protein [Diaphorina citri]
          Length = 125

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 40/123 (32%), Positives = 56/123 (45%), Gaps = 35/123 (28%)

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +GP+VA   +Y+D   YKSGVY  +    I  +A                        
Sbjct: 34  YEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHA------------------------ 69

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                      V+++GWG EN  PYW + +++ + +GD GT KILRG NEA IE   N  
Sbjct: 70  -----------VRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNVG 118

Query: 242 LPK 244
            P+
Sbjct: 119 YPQ 121


>gi|48762487|dbj|BAD23813.1| cathepsin B-N [Tuberaphis taiwana]
          Length = 163

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 46/124 (37%), Positives = 59/124 (47%), Gaps = 10/124 (8%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           CS G     W    K GLVTGG + S  GCQP   PPC    Y  +   C+    P  K 
Sbjct: 38  CSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPCPLDEYGNN--TCR--GKPAEKN 93

Query: 62  HTRCTNDNYGR---GFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
           H RCT   YG     F +D +  +  Y++      IQ +I+  GP+ A+  +Y D  SYK
Sbjct: 94  H-RCTRMCYGNQDLDFKEDHHYTRDAYYLT--YGTIQNDILAYGPIEASFEVYDDFPSYK 150

Query: 119 SGKY 122
           SG Y
Sbjct: 151 SGVY 154


>gi|348570708|ref|XP_003471139.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cavia
           porcellus]
          Length = 468

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 63/246 (25%), Positives = 99/246 (40%), Gaps = 68/246 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVT------GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLA 55
           C  G     W ++ +RG+V+       G   +  G      PPC   +        + + 
Sbjct: 271 CRGGHLDGAWWFLRRRGVVSDHCYPFSGREQAEAG----PAPPCMMHS--------RAMG 318

Query: 56  TPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIF 115
             + +   RC N +       D Y+    Y +  +  +I +E+M+NGPV A M ++ D F
Sbjct: 319 RGKRQATRRCPNSHTD---ANDIYQVTPAYRLGSDEKEIMKELMENGPVQALMEVHEDFF 375

Query: 116 SYKSGKYGNGPV-VANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWT 174
            YK G Y + P+ +A    Y      + G +            +VKI GWGEE       
Sbjct: 376 LYKGGIYSHTPLSMARPEQYR-----RHGTH------------SVKITGWGEET------ 412

Query: 175 IVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 234
                       +    T+K           YWT  +++G  +G++G  +ILRG NE  I
Sbjct: 413 ------------LPDGRTLK-----------YWTAANSWGPSWGERGHFRILRGSNECDI 449

Query: 235 ESLVNG 240
           ES V G
Sbjct: 450 ESFVLG 455


>gi|222424744|dbj|BAH20325.1| AT1G02305 [Arabidopsis thaliana]
          Length = 293

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 70/264 (26%), Positives = 94/264 (35%), Gaps = 77/264 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
           C+ G   + W +    G+VT     +  NTGC   S P C        EP     A P P
Sbjct: 105 CNGGYPIAAWRYFKHHGVVTEECDPYFDNTGC---SHPGC--------EP-----AYPTP 148

Query: 60  KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
           KC  +C + N  + + + K+     Y V     DI  E+                     
Sbjct: 149 KCARKCVSGN--QLWRESKHYGVSAYKVRSHPDDIMAEV--------------------- 185

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
             Y NGPV     +Y D   YKSGVY       I  +A VK++GWG              
Sbjct: 186 --YKNGPVEVAFTVYEDFAHYKSGVYKHITGTNIGGHA-VKLIGWGT------------- 229

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
                                ++G  YW + + +   +GD G  KI RG NE  IE  V 
Sbjct: 230 --------------------SDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEHGVV 269

Query: 240 GALPKDNYGVEFGEESGERLSEEF 263
             LP D   V+    S + L   F
Sbjct: 270 AGLPSDRNVVKGITTSDDLLVSSF 293


>gi|426221788|ref|XP_004005089.1| PREDICTED: tubulointerstitial nephritis antigen-like [Ovis aries]
          Length = 362

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 68/246 (27%), Positives = 100/246 (40%), Gaps = 68/246 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAH----HSNTGCQPVSFPPCNHANYTTSEPECKTLATP 57
           C  G     W ++ +RG+V+   +    H     + V  PPC    ++ +    K  AT 
Sbjct: 165 CHGGRLDGAWWFLRRRGVVSDHCYPFSGHGRD--EAVPAPPC--MMHSRAMGRGKRQAT- 219

Query: 58  QPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
                 RC N         D Y+    Y +     +I +E+M+NGPV A M ++ D F Y
Sbjct: 220 -----ARCPNSYV---HANDIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLY 271

Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEE---NGRPYWT 174
           +SG Y + PV     L       + G +            +VKI GWGEE   +GR    
Sbjct: 272 QSGIYSHTPVS----LGRPERYRRHGTH------------SVKITGWGEETLPDGR---- 311

Query: 175 IVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 234
                            TVK           YWT  +++G  +G++G  +I+RG NE  I
Sbjct: 312 -----------------TVK-----------YWTAANSWGPAWGERGHFRIVRGANECDI 343

Query: 235 ESLVNG 240
           ES V G
Sbjct: 344 ESFVLG 349


>gi|297465285|ref|XP_887401.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 2
           [Bos taurus]
 gi|297472148|ref|XP_002685665.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Bos taurus]
 gi|296490232|tpg|DAA32345.1| TPA: tubulointerstitial nephritis antigen-like 1-like [Bos taurus]
          Length = 534

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 65/243 (26%), Positives = 98/243 (40%), Gaps = 62/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAH----HSNTGCQPVSFPPCNHANYTTSEPECKTLATP 57
           C  G     W ++ +RG+V+   +    H     + V  PPC    ++ +    K  AT 
Sbjct: 337 CRGGRLDGAWWFLRRRGVVSDHCYPFSGHGRD--EAVPAPPC--MMHSRAMGRGKRQAT- 391

Query: 58  QPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
                 RC N         D Y+    Y +     +I +E+M+NGPV A M ++ D F Y
Sbjct: 392 -----ARCPNSYV---HANDIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLY 443

Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
           +SG Y + PV     L       + G +            +VKI GWGEE          
Sbjct: 444 QSGIYSHTPVS----LGRPERYRRHGTH------------SVKITGWGEET--------- 478

Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
                    +    T+K           YWT  +++G  +G++G  +I+RG NE  IES 
Sbjct: 479 ---------LPDGRTIK-----------YWTAANSWGPAWGERGHFRIVRGANECDIESF 518

Query: 238 VNG 240
           V G
Sbjct: 519 VLG 521


>gi|18378947|ref|NP_563648.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|16226808|gb|AAL16267.1|AF428337_1 At1g02300/T6A9_10 [Arabidopsis thaliana]
 gi|14532526|gb|AAK63991.1| At1g02300/T6A9_10 [Arabidopsis thaliana]
 gi|25090140|gb|AAN72238.1| At1g02300/T6A9_10 [Arabidopsis thaliana]
 gi|332189292|gb|AEE27413.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
          Length = 362

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 70/264 (26%), Positives = 94/264 (35%), Gaps = 77/264 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
           C+ G   + W +    G+VT     +  NTGC   S P C        EP     A P P
Sbjct: 174 CNGGYPIAAWRYFKHHGVVTEECDPYFDNTGC---SHPGC--------EP-----AYPTP 217

Query: 60  KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
           KC  +C + N  + + + K+     Y V     DI  E+                     
Sbjct: 218 KCARKCVSGN--QLWRESKHYGVSAYKVRSHPDDIMAEV--------------------- 254

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
             Y NGPV     +Y D   YKSGVY       I  +A VK++GWG              
Sbjct: 255 --YKNGPVEVAFTVYEDFAHYKSGVYKHITGTNIGGHA-VKLIGWGT------------- 298

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
                                ++G  YW + + +   +GD G  KI RG NE  IE  V 
Sbjct: 299 --------------------SDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEHGVV 338

Query: 240 GALPKDNYGVEFGEESGERLSEEF 263
             LP D   V+    S + L   F
Sbjct: 339 AGLPSDRNVVKGITTSDDLLVSSF 362


>gi|157116531|ref|XP_001658537.1| tubulointerstitial nephritis antigen [Aedes aegypti]
 gi|108883447|gb|EAT47672.1| AAEL001232-PA [Aedes aegypti]
          Length = 462

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 53/168 (31%), Positives = 74/168 (44%), Gaps = 52/168 (30%)

Query: 77  DKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSD 136
           + Y+    + +N+E  DI  EI K+GPV A M ++ D FSYKSG Y +            
Sbjct: 306 NMYKMGPAFSLNNE-TDIMLEIKKHGPVQAIMRVHRDFFSYKSGIYRHS----------- 353

Query: 137 IFSYKSGVYAVSASAEIVA-YATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKL 195
                    A S SA+  A Y +V+++GWGEE                      Y   K 
Sbjct: 354 ---------AASTSADQRAGYHSVRLIGWGEERH-------------------GYEVTK- 384

Query: 196 IGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
                     YW  V+++G  +G+ G  +ILRG NE  IES V  +LP
Sbjct: 385 ----------YWIAVNSWGTWWGENGRFRILRGSNECEIESYVLASLP 422


>gi|357116869|ref|XP_003560199.1| PREDICTED: cathepsin B-like [Brachypodium distachyon]
          Length = 350

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 64/254 (25%), Positives = 91/254 (35%), Gaps = 79/254 (31%)

Query: 2   CSSGISSSTWVWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
           C  G   S W ++ + G+VT     +    GC+    P C        EP     A P P
Sbjct: 166 CDGGYPISAWQYLVENGVVTDECDPYFDQVGCK---HPGC--------EP-----AYPTP 209

Query: 60  KCHTRCTNDNYGRGFFQDKYRFK-RYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
            C  +C   N     +Q+K  F    Y VN +  DI  E+ KNGPV     +Y D   YK
Sbjct: 210 ACEKKCKVQNQ---VWQEKKHFSINAYRVNSDPHDIMAEVYKNGPVEVAFTVYEDFAHYK 266

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
           SG Y +                         + E++    VK++GWG             
Sbjct: 267 SGVYEH------------------------ITGEMMGGHAVKLIGWGT------------ 290

Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                  +G+ YW + + +   +GD G  KI+RG+NE  IE  V
Sbjct: 291 ---------------------SADGKDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEDV 329

Query: 239 NGALPKDNYGVEFG 252
              +P     V  G
Sbjct: 330 VAGMPSTKNTVRTG 343


>gi|94958151|gb|ABF47216.1| cathepsin B [Nicotiana benthamiana]
          Length = 356

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 60/244 (24%), Positives = 88/244 (36%), Gaps = 77/244 (31%)

Query: 2   CSSGISSSTWVWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
           C  G     W +  ++G+VT     +  N GC   S P C        EP     A P P
Sbjct: 168 CDGGYPLQAWKYFVRKGVVTDECDPYFDNEGC---SHPGC--------EP-----AYPTP 211

Query: 60  KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
           KCH +C   N    + + K+     Y ++ +   I  E+ KNGPV  +  +Y D   YKS
Sbjct: 212 KCHRKCVKQNLL--WSKSKHFGVNAYMISSDPHSIMTELYKNGPVEVSFTVYEDFAHYKS 269

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
           G Y +                         + +++    VK++GWG              
Sbjct: 270 GVYKH------------------------VTGDVMGGHAVKLIGWGT------------- 292

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
                                E+G  YW + + +   +GD G  KI RG +E  IE  V 
Sbjct: 293 --------------------SEDGEDYWLLANQWNRGWGDDGYFKIRRGTDECEIEDEVV 332

Query: 240 GALP 243
             LP
Sbjct: 333 AGLP 336


>gi|389608479|dbj|BAM17849.1| tubulointerstitial nephritis antigen [Papilio xuthus]
          Length = 429

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 38/119 (31%), Positives = 55/119 (46%), Gaps = 32/119 (26%)

Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
           +GPV A M +Y D F Y+ GVY  S                                   
Sbjct: 334 SGPVQAVMTVYQDFFHYRDGVYRRS--------------------------------YHG 361

Query: 184 SAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 242
           + E+  + +V++IGWGE+ G  YW + +++G Q+G+ G  +I RG NEA IES V   L
Sbjct: 362 NNELKGFHSVRIIGWGEDRGDRYWVVANSWGRQWGENGYFRIARGSNEADIESFVVTGL 420


>gi|358341865|dbj|GAA49436.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
          Length = 515

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 39/127 (30%), Positives = 58/127 (45%), Gaps = 3/127 (2%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G  +  W +    G+VTGG+  + +GC+   FP C+H +     P C +     P+C
Sbjct: 150 CDGGFPAQAWNYWSTDGIVTGGSKENPSGCRSYPFPSCSH-DERGRHPLCPSEIYHTPRC 208

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C  D     +  +  +    Y V D   +I  EIM NGPV A   +Y D   Y+ G 
Sbjct: 209 TKKCDTDKL--HYSAELTKANSSYNVLDSDREIMMEIMNNGPVEAVFDVYEDFLQYEKGI 266

Query: 122 YGNGPVV 128
           Y N  V+
Sbjct: 267 YFNAWVL 273


>gi|328871084|gb|EGG19455.1| peptidase C1A family protein [Dictyostelium fasciculatum]
          Length = 352

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 50/172 (29%), Positives = 74/172 (43%), Gaps = 39/172 (22%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G + +   ++ K+G+V+         C P + P C  A     +P    + TPQ  C
Sbjct: 136 CQGGDAYTAMKFIQKKGIVS-------NDCLPYTIPTCAPAQ----QPCLNFVDTPQ--C 182

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C+N +Y   + QD +     Y +N  V  IQQEIM NGPV A   +Y          
Sbjct: 183 VEKCSNASYT--YAQDLHFIDGVYSMNPTVNAIQQEIMTNGPVEACFEVYE--------- 231

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYW 173
                         D   YKSGVY  +   ++  +  VK++GWG +N   YW
Sbjct: 232 --------------DFLGYKSGVYQHTTGKDLGGHC-VKMIGWGTQNNELYW 268


>gi|193202653|ref|NP_492593.2| Protein F26E4.3 [Caenorhabditis elegans]
 gi|205371857|sp|P90850.3|YCF2E_CAEEL RecName: Full=Uncharacterized peptidase C1-like protein F26E4.3;
           Flags: Precursor
 gi|166157004|emb|CAB03007.2| Protein F26E4.3 [Caenorhabditis elegans]
          Length = 452

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 53/171 (30%), Positives = 74/171 (43%), Gaps = 48/171 (28%)

Query: 76  QDKYRFKRY--YWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYL 133
           QD   FK    Y V+    DIQ E+M NGPV A   ++ D F Y  G          +Y 
Sbjct: 307 QDSTAFKMTPPYKVSSREEDIQTELMTNGPVQATFVVHEDFFMYAGG----------VYQ 356

Query: 134 YSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATV 193
           +SD+ + K       AS+    Y +V+++GWG                      V ++T 
Sbjct: 357 HSDLAAQK------GASSVAEGYHSVRVLGWG----------------------VDHSTG 388

Query: 194 KLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
           K I         YW   +++G Q+G+ G  K+LRG N   IES V GA  K
Sbjct: 389 KPI--------KYWLCANSWGTQWGEDGYFKVLRGENHCEIESFVIGAWGK 431


>gi|414886872|tpg|DAA62886.1| TPA: hypothetical protein ZEAMMB73_253741 [Zea mays]
 gi|414886873|tpg|DAA62887.1| TPA: hypothetical protein ZEAMMB73_253741 [Zea mays]
          Length = 208

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 62/244 (25%), Positives = 86/244 (35%), Gaps = 77/244 (31%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFP-PCNHANYTTSEPECKTLATPQPK 60
           C  G     W +  + G+VT         C P   P  C H       P C+  A P PK
Sbjct: 22  CDGGYPIEAWRYFVQNGVVT-------DECDPYFDPVGCKH-------PGCEP-AYPTPK 66

Query: 61  CHTRCTNDNYGRGFFQDKYRFK-RYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
           C  +C   N     +Q+K  F    Y +N +  DI  E+ KNGPV     +Y D   YKS
Sbjct: 67  CEKKCKEQNQ---VWQEKKHFSIDAYRINSDPHDIMAEVYKNGPVEVAFTVYEDFAHYKS 123

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
           G Y +                         +  I+    VK++GWG  +           
Sbjct: 124 GVYKH------------------------ITGGIMGGHAVKLIGWGTSDA---------- 149

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
                                  G  YW + + +   +GD G  KI+RG+NE  IE  V 
Sbjct: 150 -----------------------GEDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEGVV 186

Query: 240 GALP 243
             +P
Sbjct: 187 AGMP 190


>gi|395856781|ref|XP_003800797.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
           [Otolemur garnettii]
          Length = 436

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 65/246 (26%), Positives = 98/246 (39%), Gaps = 68/246 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATP---- 57
           C  G     W ++ +RG+V+         C P S       +     P C   + P    
Sbjct: 239 CHGGRLDGAWWFLRRRGVVS-------DHCYPFSG---QERDKAGPAPLCMMHSRPMGRG 288

Query: 58  QPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
           + +   RC N+        D Y+    Y +     +I +E+M+NGPV A M ++ D F Y
Sbjct: 289 KRQATARCPNNQVQA---NDIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLY 345

Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEE---NGRPYWT 174
           +SG Y + PV     L       + G +            +VKI GWGEE   +GR    
Sbjct: 346 QSGIYSHTPVS----LQRPEGYRRHGTH------------SVKITGWGEETLPDGR---- 385

Query: 175 IVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 234
                            T+K           YWT  +++G  +G++G  +I+RG NE  I
Sbjct: 386 -----------------TLK-----------YWTAANSWGPAWGERGHFRIVRGANECDI 417

Query: 235 ESLVNG 240
           ES V G
Sbjct: 418 ESFVLG 423


>gi|297282815|ref|XP_002802331.1| PREDICTED: tubulointerstitial nephritis antigen-like [Macaca
           mulatta]
          Length = 322

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 65/251 (25%), Positives = 97/251 (38%), Gaps = 78/251 (31%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVS---------FPPCNHANYTTSEPECK 52
           C  G     W ++ +RG+V+         C P S          PPC            +
Sbjct: 125 CRGGRLDGAWWFLRRRGVVS-------DHCYPFSGRERDEAGPAPPCMM--------HSR 169

Query: 53  TLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYS 112
            +   + +   RC N +       D Y+    Y +     +I +E+M+NGPV A M ++ 
Sbjct: 170 AMGRGKRQATARCPNSHVNN---NDIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHE 226

Query: 113 DIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEE---NG 169
           D F YK G Y + PV     L       + G +            +VKI GWGEE   +G
Sbjct: 227 DFFLYKGGIYSHTPVS----LGRPERYRRHGTH------------SVKITGWGEETLPDG 270

Query: 170 RPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 229
           R                     T+K           YWT  +++G  +G++G  +I+RG 
Sbjct: 271 R---------------------TLK-----------YWTAANSWGPAWGERGHFRIVRGV 298

Query: 230 NEAIIESLVNG 240
           NE  IES V G
Sbjct: 299 NECDIESFVLG 309


>gi|603044|gb|AAA96832.1| cysteine protease homolog, partial [Strongyloides ratti]
          Length = 202

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 51/180 (28%), Positives = 72/180 (40%), Gaps = 29/180 (16%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G +   W  V + G+ TGG      GC+P +F PC          EC   +   P+C
Sbjct: 44  CRGGANIRAWKHVMRNGVCTGGPCGYKYGCRPYAFHPCGVHKDQVYYGECPRKSYDTPEC 103

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C        + +D+Y     Y+V ++   I +EIM+ GPV               G 
Sbjct: 104 RKICQRGCIQLQYGKDRYYAASAYFVKNDTKAIMREIMRGGPV--------------HGA 149

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWG---EENGR--PYWTIV 176
           Y           Y+D   YK GVY  +A  E     ++KI+GWG     NG   PYW + 
Sbjct: 150 YDT---------YTDFRLYKGGVYEHTA-GERTGGHSIKIMGWGNYKHPNGTVIPYWLVA 199


>gi|383861394|ref|XP_003706171.1| PREDICTED: tubulointerstitial nephritis antigen-like [Megachile
           rotundata]
          Length = 442

 Score = 65.5 bits (158), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 48/153 (31%), Positives = 69/153 (45%), Gaps = 48/153 (31%)

Query: 92  ADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASA 151
            DI QEI+ +GPV A M +Y D FSY+SG Y +  V A +Y  SD               
Sbjct: 336 TDIMQEILTSGPVQATMRVYQDFFSYESGVYKHS-VTAELY-ESD--------------- 378

Query: 152 EIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVS 211
               Y +V+I+GWGEE           Y+ +   +                   YW + +
Sbjct: 379 ----YHSVRIIGWGEEPP--------TYSRNTPLK-------------------YWLVAN 407

Query: 212 TFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
           ++G+Q+G+ G  +I +G NE  IES V G   K
Sbjct: 408 SWGQQWGENGLFRIQKGTNECEIESFVLGVWAK 440


>gi|414886870|tpg|DAA62884.1| TPA: cathepsin B-like cysteine proteinase 3 [Zea mays]
          Length = 347

 Score = 65.5 bits (158), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 62/244 (25%), Positives = 86/244 (35%), Gaps = 77/244 (31%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFP-PCNHANYTTSEPECKTLATPQPK 60
           C  G     W +  + G+VT         C P   P  C H       P C+  A P PK
Sbjct: 161 CDGGYPIEAWRYFVQNGVVT-------DECDPYFDPVGCKH-------PGCEP-AYPTPK 205

Query: 61  CHTRCTNDNYGRGFFQDKYRFK-RYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
           C  +C   N     +Q+K  F    Y +N +  DI  E+ KNGPV     +Y D   YKS
Sbjct: 206 CEKKCKEQNQ---VWQEKKHFSIDAYRINSDPHDIMAEVYKNGPVEVAFTVYEDFAHYKS 262

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
           G Y +                         +  I+    VK++GWG  +           
Sbjct: 263 GVYKH------------------------ITGGIMGGHAVKLIGWGTSDA---------- 288

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
                                  G  YW + + +   +GD G  KI+RG+NE  IE  V 
Sbjct: 289 -----------------------GEDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEGVV 325

Query: 240 GALP 243
             +P
Sbjct: 326 AGMP 329


>gi|395856779|ref|XP_003800796.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
           [Otolemur garnettii]
          Length = 467

 Score = 65.5 bits (158), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 63/243 (25%), Positives = 96/243 (39%), Gaps = 62/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATP---- 57
           C  G     W ++ +RG+V+         C P S       +     P C   + P    
Sbjct: 270 CHGGRLDGAWWFLRRRGVVS-------DHCYPFSG---QERDKAGPAPLCMMHSRPMGRG 319

Query: 58  QPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
           + +   RC N+        D Y+    Y +     +I +E+M+NGPV A M ++ D F Y
Sbjct: 320 KRQATARCPNNQVQA---NDIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLY 376

Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
           +SG Y + PV     L       + G +            +VKI GWGEE          
Sbjct: 377 QSGIYSHTPVS----LQRPEGYRRHGTH------------SVKITGWGEET--------- 411

Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
                    +    T+K           YWT  +++G  +G++G  +I+RG NE  IES 
Sbjct: 412 ---------LPDGRTLK-----------YWTAANSWGPAWGERGHFRIVRGANECDIESF 451

Query: 238 VNG 240
           V G
Sbjct: 452 VLG 454


>gi|291408920|ref|XP_002720687.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Oryctolagus
           cuniculus]
          Length = 467

 Score = 65.1 bits (157), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 61/243 (25%), Positives = 96/243 (39%), Gaps = 62/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAH----HSNTGCQPVSFPPCNHANYTTSEPECKTLATP 57
           C  G     W ++ +RG+V+   +    H      P   PPC   +        + +   
Sbjct: 270 CRGGRLDGAWWFLRRRGVVSDHCYPFSGHEQDEAGPA--PPCMMHS--------RAMGRG 319

Query: 58  QPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
           + +   RC N +       D Y+    Y +     +I +E+++NGPV A M ++ D F Y
Sbjct: 320 KRQATARCPNSHV---HANDIYQVTPAYRLGSNEKEIMKELLENGPVQALMEVHEDFFLY 376

Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
           + G Y + PV     L       + G +            +VKI GWGEE          
Sbjct: 377 QGGIYSHTPVS----LERPERYRRHGTH------------SVKITGWGEET--------- 411

Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
                    +    T+K           YWT  +++G  +G++G  +ILRG NE  IES 
Sbjct: 412 ---------LPDGRTLK-----------YWTAANSWGPAWGERGHFRILRGTNECDIESF 451

Query: 238 VNG 240
           V G
Sbjct: 452 VLG 454


>gi|159111216|ref|XP_001705840.1| Hypothetical protein GL50803_113303 [Giardia lamblia ATCC 50803]
 gi|157433930|gb|EDO78166.1| hypothetical protein GL50803_113303 [Giardia lamblia ATCC 50803]
          Length = 804

 Score = 65.1 bits (157), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 44/139 (31%), Positives = 61/139 (43%), Gaps = 35/139 (25%)

Query: 109 YLYSDIFSYKSGKYGNGPVVANMYLYSDIFSY-KSGVYAVSASAEIVAYATVKIVGWGEE 167
           Y  S + +     Y NGP+  +MYL +D  S  K G+Y+   + ++         G G  
Sbjct: 184 YRLSGVDAMMRDIYQNGPIAVSMYLANDFPSKDKKGIYSSGPNTKL---------GGGH- 233

Query: 168 NGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILR 227
                                    V ++GWGEENG PYW   +T+G  +GD+G  KI R
Sbjct: 234 ------------------------AVMIVGWGEENGVPYWDCANTYGTNWGDQGYFKIKR 269

Query: 228 GRNEAIIESLVNGALPKDN 246
           G NE  IE+    ALP D 
Sbjct: 270 GSNELKIETWPGSALPIDT 288


>gi|159120206|ref|XP_001710319.1| Cathepsin B-like cysteine proteinase 3 precursor [Giardia lamblia
           ATCC 50803]
 gi|157438437|gb|EDO82645.1| Cathepsin B-like cysteine proteinase 3 precursor [Giardia lamblia
           ATCC 50803]
          Length = 804

 Score = 65.1 bits (157), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 44/139 (31%), Positives = 61/139 (43%), Gaps = 35/139 (25%)

Query: 109 YLYSDIFSYKSGKYGNGPVVANMYLYSDIFSY-KSGVYAVSASAEIVAYATVKIVGWGEE 167
           Y  S + +     Y NGP+  +MYL +D  S  K G+Y+   + ++         G G  
Sbjct: 184 YRLSGVDAMMRDIYQNGPIAVSMYLANDFPSKDKKGIYSSGPNTKL---------GGGH- 233

Query: 168 NGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILR 227
                                    V ++GWGEENG PYW   +T+G  +GD+G  KI R
Sbjct: 234 ------------------------AVMIVGWGEENGVPYWDCANTYGTNWGDQGYFKIKR 269

Query: 228 GRNEAIIESLVNGALPKDN 246
           G NE  IE+    ALP D 
Sbjct: 270 GSNELKIETWPGSALPIDT 288


>gi|226497010|ref|NP_001150152.1| LOC100283781 precursor [Zea mays]
 gi|195637168|gb|ACG38052.1| cathepsin B-like cysteine proteinase 3 precursor [Zea mays]
          Length = 347

 Score = 65.1 bits (157), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 62/244 (25%), Positives = 86/244 (35%), Gaps = 77/244 (31%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFP-PCNHANYTTSEPECKTLATPQPK 60
           C  G     W +  + G+VT         C P   P  C H       P C+  A P PK
Sbjct: 161 CDGGYPIEAWRYFVQNGVVT-------DECDPYFDPVGCKH-------PGCEP-AYPTPK 205

Query: 61  CHTRCTNDNYGRGFFQDKYRFK-RYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
           C  +C   N     +Q+K  F    Y +N +  DI  E+ KNGPV     +Y D   YKS
Sbjct: 206 CEKKCKEQNQ---VWQEKKHFSIDAYRINSDPHDIMAEVYKNGPVEVAFTVYEDFAHYKS 262

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
           G Y +                         +  I+    VK++GWG  +           
Sbjct: 263 GVYKH------------------------ITGGIMGGHAVKLIGWGTSDA---------- 288

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
                                  G  YW + + +   +GD G  KI+RG+NE  IE  V 
Sbjct: 289 -----------------------GEDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEGVV 325

Query: 240 GALP 243
             +P
Sbjct: 326 AGMP 329


>gi|508264|gb|AAA96833.1| cysteine protease, partial [Caenorhabditis elegans]
          Length = 198

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 48/177 (27%), Positives = 75/177 (42%), Gaps = 25/177 (14%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G     W    K+G VTGG++   TGC+P  +PPC H    T    C +   P  + 
Sbjct: 44  CNGGYPIEAWRHYVKKGYVTGGSYQDKTGCKPYPYPPCEHHVNGTHYKPCPSNMYPTGQN 103

Query: 62  HTRCTNDNYGRGFFQD-KYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
                  +    + +D  +R   +   + E A I + I  +G +   + ++ D F + SG
Sbjct: 104 ANALGKLDIALTYHKDLHFRTILHTPASKEAAGIPKGIKTHGQLRGGITVFED-FEHYSG 162

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
                                 GVY  +A A +  +A VK++GWG +NG PYW I  
Sbjct: 163 ----------------------GVYVHTAGASLGGHA-VKMLGWGVDNGTPYWLIAN 196


>gi|162813|gb|AAA30434.1| cathepsin B, partial [Bos taurus]
          Length = 122

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 38/122 (31%), Positives = 53/122 (43%), Gaps = 35/122 (28%)

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y NGPV     +YSD   YKSGVY                                    
Sbjct: 31  YKNGPVEGAFSVYSDFLLYKSGVYQ----------------------------------- 55

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
             S EI+    ++++GWG ENG PYW + +++   +GD G  KILRG++   IES +   
Sbjct: 56  HVSGEIMGGHAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAG 115

Query: 242 LP 243
           +P
Sbjct: 116 MP 117


>gi|297843028|ref|XP_002889395.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335237|gb|EFH65654.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 360

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 66/246 (26%), Positives = 88/246 (35%), Gaps = 77/246 (31%)

Query: 2   CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
           C+ G   + W +    G+VT     +  NTGC   S P C        EP     A P P
Sbjct: 172 CNGGYPIAAWRYFKHHGVVTEECDPYFDNTGC---SHPGC--------EP-----AYPTP 215

Query: 60  KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
           KC  +C + N  + + + K+     Y V     DI  E+                     
Sbjct: 216 KCARKCVSGN--QLWRESKHYGVSAYKVRSHPDDIMAEV--------------------- 252

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
             Y NGPV     +Y D   YKSGVY       I  +A VK++GWG              
Sbjct: 253 --YKNGPVEVAFTVYEDFAHYKSGVYKHITGTNIGGHA-VKLIGWGT------------- 296

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
                                ++G  YW + + +   +GD G  KI RG NE  IE  V 
Sbjct: 297 --------------------SDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEHGVV 336

Query: 240 GALPKD 245
             LP D
Sbjct: 337 AGLPSD 342


>gi|402853710|ref|XP_003891533.1| PREDICTED: tubulointerstitial nephritis antigen-like [Papio anubis]
          Length = 362

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 66/248 (26%), Positives = 97/248 (39%), Gaps = 72/248 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVS---------FPPCNHANYTTSEPECK 52
           C  G     W ++ +RG+V+         C P S          PPC    ++ +    K
Sbjct: 165 CRGGRLDGAWWFLRRRGVVS-------DHCYPFSGRERDEAGPAPPC--MMHSRAMGRGK 215

Query: 53  TLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYS 112
             AT       RC N +       D Y+    Y +     +I +E+M+NGPV A M ++ 
Sbjct: 216 RQAT------ARCPNSHVNN---NDIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHE 266

Query: 113 DIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPY 172
           D F YK G Y + PV     L       + G +            +VKI GWGEE     
Sbjct: 267 DFFLYKGGIYSHTPVS----LGRPERYRRHGTH------------SVKITGWGEET---- 306

Query: 173 WTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 232
                         +    T+K           YWT  +++G  +G++G  +I+RG NE 
Sbjct: 307 --------------LPDGRTLK-----------YWTAANSWGPAWGERGHFRIVRGVNEC 341

Query: 233 IIESLVNG 240
            IES V G
Sbjct: 342 DIESFVLG 349


>gi|158285208|ref|XP_001687862.1| AGAP007684-PA [Anopheles gambiae str. PEST]
 gi|158285210|ref|XP_308187.4| AGAP007684-PB [Anopheles gambiae str. PEST]
 gi|157019881|gb|EDO64511.1| AGAP007684-PA [Anopheles gambiae str. PEST]
 gi|157019882|gb|EAA04576.4| AGAP007684-PB [Anopheles gambiae str. PEST]
          Length = 463

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 52/179 (29%), Positives = 77/179 (43%), Gaps = 50/179 (27%)

Query: 79  YRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIF 138
           Y+    + +N+E  DI  EI   G V A M +Y D FSY+SG Y +              
Sbjct: 310 YKMGPAFSLNNET-DIMAEIKDRGTVQAIMRVYRDFFSYRSGIYRHSA------------ 356

Query: 139 SYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGW 198
                  A + + E  AY +V+++GWGEE                    V Y  VK    
Sbjct: 357 -------AATPAEERSAYHSVRLIGWGEER-------------------VGYDVVK---- 386

Query: 199 GEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDNYGVEFGEESGE 257
                  YW  ++++G+ +G+ G  +ILRG NE  IES V  + P  +  V+   + GE
Sbjct: 387 -------YWIAINSWGQWWGENGRFRILRGSNECDIESYVLASNPYVHEHVQAIRKVGE 438


>gi|21699|emb|CAA46811.1| cathepsin B [Triticum aestivum]
          Length = 353

 Score = 64.7 bits (156), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 61/244 (25%), Positives = 89/244 (36%), Gaps = 75/244 (30%)

Query: 2   CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
           C+ G   S W +  + G+VT     +   TGCQ                P C+  A P P
Sbjct: 165 CNGGYPISAWRYFRRSGVVTEECDPYFDQTGCQ---------------HPGCEP-AYPTP 208

Query: 60  KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
           KC  +C  +N  + + ++K+     Y V+    DI  E+ KNGPV    + Y  I     
Sbjct: 209 KCQRKCKVEN--QAWKENKHFSVNAYRVHSNPHDIMAEVYKNGPVEV-AFTYCQIL---- 261

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
                           D   YKSGVY    +  ++    VK++GWG  +           
Sbjct: 262 ----------------DFAHYKSGVYK-HITGGVMGGHAVKLIGWGTSDA---------- 294

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
                                  G  YW + + +   +GD G  KI+RG NE  IE  V 
Sbjct: 295 -----------------------GEDYWLLANQWNRGWGDDGYFKIIRGENECGIEGDVT 331

Query: 240 GALP 243
             +P
Sbjct: 332 AGMP 335


>gi|156708110|gb|ABU93313.1| cathepsin B4 cysteine protease [Monocercomonoides sp. PA]
          Length = 281

 Score = 64.7 bits (156), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 51/186 (27%), Positives = 73/186 (39%), Gaps = 65/186 (34%)

Query: 59  PKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
           P C ++C N +          R+K   W + E ++I Q +M+ GP+     +YSD  +Y+
Sbjct: 160 PACPSKCDNGS-------QIIRYKLQSWKSVEPSEIMQALMEYGPLSCGFMVYSDFMNYR 212

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
           SG                ++ +KSG +    +        V + GWG ENG PYW +   
Sbjct: 213 SG----------------VYQHKSGYFEGGHA--------VLLCGWGVENGLPYWLVQN- 247

Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                              WG     P W          G+KG  KILRG N   IES V
Sbjct: 248 ------------------SWG-----PAW----------GEKGFFKILRGSNHCEIESYV 274

Query: 239 NGALPK 244
              +PK
Sbjct: 275 TLGVPK 280


>gi|324512900|gb|ADY45327.1| Peptidase C1-like protein [Ascaris suum]
          Length = 450

 Score = 64.3 bits (155), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 49/180 (27%), Positives = 73/180 (40%), Gaps = 46/180 (25%)

Query: 79  YRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIF 138
           YR    Y V+    DI  EI+ NGPV A   +Y D F Y  G          +Y + D+ 
Sbjct: 312 YRMTPPYRVSSREQDIMTEIITNGPVQATFLVYEDFFMYSGG----------VYQHLDLH 361

Query: 139 SYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGW 198
            +K          ++  Y +V+I+GWGE+                      Y+T   +  
Sbjct: 362 EHK------EEERKVQGYHSVRIIGWGED----------------------YSTGPQV-- 391

Query: 199 GEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDNYGVEFGEESGER 258
                  YW   +++G ++G+ G  +ILRG N   IES V GA  K      F  +  +R
Sbjct: 392 ------KYWLAANSWGNEWGEDGLFRILRGENHCEIESFVIGAWGKGAKKRRFKVQKLQR 445


>gi|308157698|gb|EFO60800.1| Cathepsin B-like cysteine proteinase 3 precursor [Giardia lamblia
           P15]
          Length = 627

 Score = 64.3 bits (155), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 44/139 (31%), Positives = 59/139 (42%), Gaps = 35/139 (25%)

Query: 109 YLYSDIFSYKSGKYGNGPVVANMYLYSDIFSY-KSGVYAVSASAEIVAYATVKIVGWGEE 167
           Y  S + +     Y NGP+  +MYL +D  S  K G+Y+   + ++     V IVG    
Sbjct: 184 YRLSGVDAMMRDIYQNGPIAVSMYLANDFPSKDKKGIYSSGPNTKLRGGHAVMIVG---- 239

Query: 168 NGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILR 227
                                         WGEENG PYW   +T+G  +GD+G  KI R
Sbjct: 240 ------------------------------WGEENGVPYWDCANTYGTNWGDQGYFKIKR 269

Query: 228 GRNEAIIESLVNGALPKDN 246
           G NE  IE+    ALP D 
Sbjct: 270 GSNELKIETWPGSALPIDT 288


>gi|14290553|gb|AAH09048.1| TINAGL1 protein [Homo sapiens]
          Length = 218

 Score = 64.3 bits (155), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 66/248 (26%), Positives = 97/248 (39%), Gaps = 72/248 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVS---------FPPCNHANYTTSEPECK 52
           C  G     W ++ +RG+V+         C P S          PPC    ++ +    K
Sbjct: 21  CRGGRLDGAWWFLRRRGVVS-------DHCYPFSGRERDEAGPAPPC--MMHSRAMGRGK 71

Query: 53  TLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYS 112
             AT    C     N+N       D Y+    Y +     +I +E+M+NGPV A M ++ 
Sbjct: 72  RQATAH--CPNSYVNNN-------DIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHE 122

Query: 113 DIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPY 172
           D F YK G Y + PV     L       + G +            +VKI GWGEE     
Sbjct: 123 DFFLYKGGIYSHTPVS----LGRPERYRRHGTH------------SVKITGWGEET---- 162

Query: 173 WTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 232
                         +    T+K           YWT  +++G  +G++G  +I+RG NE 
Sbjct: 163 --------------LPDGRTLK-----------YWTAANSWGPAWGERGHFRIVRGVNEC 197

Query: 233 IIESLVNG 240
            IES V G
Sbjct: 198 DIESFVLG 205


>gi|48762495|dbj|BAD23817.1| cathepsin B-S [Tuberaphis styraci]
          Length = 99

 Score = 63.9 bits (154), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 41/120 (34%), Positives = 59/120 (49%), Gaps = 25/120 (20%)

Query: 57  PQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFS 116
           P  + H +C    YG+   QD+Y+ K  Y +N  +  I+Q++M  GPV A+  +Y D FS
Sbjct: 5   PMERNH-QCPKTCYGKTTVQDRYKTKNEYVINS-IETIEQDLMTYGPVEASFDVYDD-FS 61

Query: 117 YKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
                                  YKSG+Y  +  A+     ++KI+GWGEENG PYW  V
Sbjct: 62  ----------------------VYKSGIYRKTPKAKYEGGHSIKIIGWGEENGTPYWLAV 99


>gi|6165885|gb|AAF04727.1|AF101239_1 cathepsin B-like cysteine proteinase [Ipomoea batatas]
          Length = 352

 Score = 63.9 bits (154), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 64/260 (24%), Positives = 91/260 (35%), Gaps = 77/260 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
           C  G   + W +  + G+VT     +   TGC   S P C        EP     A P P
Sbjct: 164 CDGGYPIAAWQYFKRTGVVTSECDPYFDQTGC---SHPGC--------EP-----AYPTP 207

Query: 60  KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
            C  +C   N    + + K+     Y VN +   I  E+                     
Sbjct: 208 ACEKKCVKKNLL--WSESKHFSVNAYRVNSDQHSIMTEV--------------------- 244

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
             Y NGP   +  +Y D   YKSGVY     +E+  +A VK++GWG              
Sbjct: 245 --YTNGPAEVSFTVYEDFAHYKSGVYKHVTGSEMGGHA-VKLIGWGT------------- 288

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
                                E+G  YW + + +   +G  G  KI+RG NE  IE +  
Sbjct: 289 --------------------SEDGEDYWLLANQWNRSWGGDGYFKIIRGTNECGIEDVTA 328

Query: 240 GALPKDNYGVEFGEESGERL 259
           G     N  +E G    + L
Sbjct: 329 GTPSTKNLDIESGVRDDDSL 348


>gi|345488309|ref|XP_001605531.2| PREDICTED: uncharacterized peptidase C1-like protein F26E4.3-like
           [Nasonia vitripennis]
          Length = 481

 Score = 63.9 bits (154), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 46/152 (30%), Positives = 67/152 (44%), Gaps = 53/152 (34%)

Query: 92  ADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASA 151
            DI QEI+ +GPV A M ++ D F Y+SG          +Y++S  F  +          
Sbjct: 372 TDIMQEILTSGPVQATMRVHRDFFHYESG----------IYVHSRPFDTRQS-------- 413

Query: 152 EIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRP--YWTI 209
               Y +V+IVGWGEE   PY                             NG+P  +W +
Sbjct: 414 ---GYHSVRIVGWGEEPS-PY-----------------------------NGKPIKFWRV 440

Query: 210 VSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
            +++G  +G+ G  +I+RG NE  IES V G 
Sbjct: 441 ANSWGRDWGEDGYFRIVRGNNECEIESFVLGV 472


>gi|308161545|gb|EFO63987.1| Cathepsin B-like cysteine proteinase [Giardia lamblia P15]
          Length = 804

 Score = 63.9 bits (154), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 44/139 (31%), Positives = 59/139 (42%), Gaps = 35/139 (25%)

Query: 109 YLYSDIFSYKSGKYGNGPVVANMYLYSDIFSY-KSGVYAVSASAEIVAYATVKIVGWGEE 167
           Y  S + +     Y NGP+  +MYL +D  S  K G+Y+   + ++     V IVG    
Sbjct: 184 YRLSGVDAMMRDIYQNGPIAVSMYLANDFPSKDKKGIYSSGPNTKLRGGHAVMIVG---- 239

Query: 168 NGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILR 227
                                         WGEENG PYW   +T+G  +GD+G  KI R
Sbjct: 240 ------------------------------WGEENGVPYWDCANTYGTNWGDQGYFKIKR 269

Query: 228 GRNEAIIESLVNGALPKDN 246
           G NE  IE+    ALP D 
Sbjct: 270 GSNELKIETWPGSALPIDT 288


>gi|355557764|gb|EHH14544.1| hypothetical protein EGK_00488 [Macaca mulatta]
 gi|355745087|gb|EHH49712.1| hypothetical protein EGM_00421 [Macaca fascicularis]
 gi|384948750|gb|AFI37980.1| tubulointerstitial nephritis antigen-like isoform 1 precursor
           [Macaca mulatta]
 gi|384948752|gb|AFI37981.1| tubulointerstitial nephritis antigen-like isoform 1 precursor
           [Macaca mulatta]
 gi|387540550|gb|AFJ70902.1| tubulointerstitial nephritis antigen-like isoform 1 precursor
           [Macaca mulatta]
          Length = 467

 Score = 63.9 bits (154), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 66/248 (26%), Positives = 97/248 (39%), Gaps = 72/248 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVS---------FPPCNHANYTTSEPECK 52
           C  G     W ++ +RG+V+         C P S          PPC    ++ +    K
Sbjct: 270 CRGGRLDGAWWFLRRRGVVS-------DHCYPFSGRERDEAGPAPPC--MMHSRAMGRGK 320

Query: 53  TLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYS 112
             AT       RC N +       D Y+    Y +     +I +E+M+NGPV A M ++ 
Sbjct: 321 RQAT------ARCPNSHVNN---NDIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHE 371

Query: 113 DIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPY 172
           D F YK G Y + PV     L       + G +            +VKI GWGEE     
Sbjct: 372 DFFLYKGGIYSHTPVS----LGRPERYRRHGTH------------SVKITGWGEET---- 411

Query: 173 WTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 232
                         +    T+K           YWT  +++G  +G++G  +I+RG NE 
Sbjct: 412 --------------LPDGRTLK-----------YWTAANSWGPAWGERGHFRIVRGVNEC 446

Query: 233 IIESLVNG 240
            IES V G
Sbjct: 447 DIESFVLG 454


>gi|195154396|ref|XP_002018108.1| GL16940 [Drosophila persimilis]
 gi|194113904|gb|EDW35947.1| GL16940 [Drosophila persimilis]
          Length = 433

 Score = 63.9 bits (154), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 65/243 (26%), Positives = 89/243 (36%), Gaps = 76/243 (31%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   + W ++HK+G+V       +  C P          YT     CK     +   
Sbjct: 255 CEGGHLDAAWRYLHKKGVV-------DESCYP----------YTQHRDTCKIRHNSRSLK 297

Query: 62  HTRCT-NDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
              C  + N  R  F   Y     Y +N E +DI  EI  +GPV A M +Y D FSY SG
Sbjct: 298 ANGCRPSANVDRDSF---YTVGPAYTLNKE-SDIMAEIYHSGPVQATMRVYRDFFSYSSG 353

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
            Y      AN                         + +VK+VGWGEE+            
Sbjct: 354 VYRQ--TAANR-------------------GAPTGFHSVKLVGWGEEH------------ 380

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                                NG  YW   +++G  +G++G  +ILRG NE  IE  V  
Sbjct: 381 ---------------------NGDKYWIAANSWGPWWGERGYFRILRGSNECGIEDYVLA 419

Query: 241 ALP 243
           + P
Sbjct: 420 SWP 422


>gi|125810908|ref|XP_001361665.1| GA15908 [Drosophila pseudoobscura pseudoobscura]
 gi|54636841|gb|EAL26244.1| GA15908 [Drosophila pseudoobscura pseudoobscura]
          Length = 433

 Score = 63.9 bits (154), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 65/243 (26%), Positives = 89/243 (36%), Gaps = 76/243 (31%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   + W ++HK+G+V       +  C P          YT     CK     +   
Sbjct: 255 CEGGHLDAAWRYLHKKGVV-------DESCYP----------YTQHRDTCKIRHNSRSLK 297

Query: 62  HTRCT-NDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
              C  + N  R  F   Y     Y +N E +DI  EI  +GPV A M +Y D FSY SG
Sbjct: 298 ANGCRPSANVDRDSF---YTVGPAYTLNKE-SDIMAEIYHSGPVQATMRVYRDFFSYSSG 353

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
            Y      AN                         + +VK+VGWGEE+            
Sbjct: 354 VYRQ--TAANR-------------------GAPTGFHSVKLVGWGEEH------------ 380

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                                NG  YW   +++G  +G++G  +ILRG NE  IE  V  
Sbjct: 381 ---------------------NGDKYWIAANSWGPWWGERGYFRILRGSNECGIEDYVLA 419

Query: 241 ALP 243
           + P
Sbjct: 420 SWP 422


>gi|170030062|ref|XP_001842909.1| cathepsin B-like thiol protease [Culex quinquefasciatus]
 gi|167865915|gb|EDS29298.1| cathepsin B-like thiol protease [Culex quinquefasciatus]
          Length = 288

 Score = 63.9 bits (154), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 62/245 (25%), Positives = 94/245 (38%), Gaps = 74/245 (30%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G    T+ +  K GL +GG +HS  GC+P  F                       KC
Sbjct: 115 CDGGYVHKTFDYWVKYGLTSGGPYHSGQGCKPYPFGGATQD------------VNIVLKC 162

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWV--NDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
             +C    Y   + QD       Y +   DE A ++ EI +NGP+V +  +Y D F Y+S
Sbjct: 163 DRQC-QAGYPLTYSQDLKHGASSYILPWGDENA-MKAEIYQNGPIVTSFDVYGDFFQYRS 220

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
           G                ++ + +G Y  S +        V+++GWG ENG          
Sbjct: 221 G----------------VYRHVTGAYKGSHA--------VRVIGWGVENG---------- 246

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
                        VK           YW   +++ E++G+ G  KI+RG N   +E +  
Sbjct: 247 -------------VK-----------YWLCANSWNERWGENGFFKIVRGENHVGVEDISY 282

Query: 240 GALPK 244
             LPK
Sbjct: 283 AGLPK 287


>gi|10803435|emb|CAC13130.1| putative cathepsin B.4 [Ostertagia ostertagi]
          Length = 194

 Score = 63.5 bits (153), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 39/121 (32%), Positives = 55/121 (45%), Gaps = 2/121 (1%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +  + G+VTGG +     C+P    PC H        EC   A   P+C
Sbjct: 43  CDGGWPIKAWQFFAREGVVTGGNYGRQGCCRPYEITPCGHHGREPYYGECYDDAQ-TPRC 101

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C    Y   + +DK   ++ Y + + V  IQ+EIM +GPVVA   +Y D   Y  G 
Sbjct: 102 KRKC-QSGYKTTYKKDKRYGRKAYQLPNSVKAIQREIMMHGPVVAGYTVYEDFSYYTKGI 160

Query: 122 Y 122
           Y
Sbjct: 161 Y 161


>gi|332374788|gb|AEE62535.1| unknown [Dendroctonus ponderosae]
          Length = 328

 Score = 63.5 bits (153), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 70/246 (28%), Positives = 104/246 (42%), Gaps = 75/246 (30%)

Query: 2   CSSGISSSTWV-WVHKRGLVTGGAHHS-NTGCQPVSFPPCN-HANYTTSEPECKTLATPQ 58
           C+ G  +  W  W +  G+VTGG + +   GC+      C+ H N      +C+   +  
Sbjct: 149 CNGGWPAVAWSDWTN--GIVTGGLYGALEQGCKSYFLEGCDDHPN------KCRNYVS-T 199

Query: 59  PKCHTRCTNDN-YGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
           P C  +C   + Y +   Q+ Y    Y    +E   IQ EIM NGPV A M +Y D   Y
Sbjct: 200 PACVEQCDEPSLYYKA--QETYGQTPYEIQGEE--QIQYEIMTNGPVEATMDVYVDFAQY 255

Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
           +SG Y          L +D   Y+ G               VKI+GWG E+G        
Sbjct: 256 QSGIY---------QLTTD--EYEGG-------------HAVKILGWGVEDG-------- 283

Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
                          VK           YW + +++ E++G+ G  +I+RGR+E  IES 
Sbjct: 284 ---------------VK-----------YWLVANSWNERWGENGLFRIIRGRDEVGIEST 317

Query: 238 VNGALP 243
           ++ ALP
Sbjct: 318 IDAALP 323


>gi|395730851|ref|XP_003775799.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Pongo
           abelii]
          Length = 362

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 64/248 (25%), Positives = 96/248 (38%), Gaps = 72/248 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVS---------FPPCNHANYTTSEPECK 52
           C  G     W ++ +RG+V+         C P S          PPC   +      + +
Sbjct: 165 CRGGRLDGAWWFLRRRGVVS-------DHCYPFSGRERDEAGPTPPCMMHSRAMGRGKRQ 217

Query: 53  TLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYS 112
             A+    C     N+N       D Y+    Y +     +I +E+M+NGPV A M ++ 
Sbjct: 218 ATAS----CPNSHVNNN-------DIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHE 266

Query: 113 DIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPY 172
           D F YK G Y + PV     L       + G +            +VKI GWGEE     
Sbjct: 267 DFFLYKGGIYSHTPVS----LGRPERYRRHGTH------------SVKITGWGEET---- 306

Query: 173 WTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 232
                         +    T+K           YWT  +++G  +G++G  +I+RG NE 
Sbjct: 307 --------------LPDGRTLK-----------YWTAANSWGPAWGERGHFRIVRGVNEC 341

Query: 233 IIESLVNG 240
            IES V G
Sbjct: 342 DIESFVLG 349


>gi|407425570|gb|EKF39488.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi
           marinkellei]
          Length = 333

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 59/236 (25%), Positives = 89/236 (37%), Gaps = 73/236 (30%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPK 60
           C+ G     WV+    GLV+         CQP  FP C +H N +   P      TP  K
Sbjct: 160 CNGGFPEVAWVFYVVHGLVS-------EYCQPYPFPSCAHHVNSSDLAPCSGDYKTP--K 210

Query: 61  CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           C++ CT     +     +YR    Y ++ E    ++E++ NGP       +   F     
Sbjct: 211 CNSTCTE----KKIPLIRYRGNHSYVLSGE-EHFKRELLLNGP-------FEVAFE---- 254

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
                       +Y+D  +Y  GVY                                   
Sbjct: 255 ------------VYADFMAYTGGVYK---------------------------------- 268

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
              + +++    V+L+GWGE NG PYW I +++  ++G  G   I RG NE  IES
Sbjct: 269 -HVAGDLLGGHAVRLVGWGELNGEPYWKIANSWNHEWGMNGYFLIARGVNECGIES 323


>gi|145541902|ref|XP_001456639.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124424451|emb|CAK89242.1| unnamed protein product [Paramecium tetraurelia]
          Length = 487

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 38/122 (31%), Positives = 57/122 (46%), Gaps = 25/122 (20%)

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           + NGPV+ N     D   Y SG+Y   A  +           W   + RP W  V     
Sbjct: 370 FNNGPVIMNFEPGQDFMYYSSGIYHSVAQHD-----------WSSSD-RPEWEKVD---- 413

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                     +V   GWGEENG  +W + +++GEQ+G++G  ++ RG +E+ IES+   A
Sbjct: 414 ---------HSVLCYGWGEENGVKFWLLQNSWGEQWGEQGNFRMKRGTDESAIESMAEAA 464

Query: 242 LP 243
            P
Sbjct: 465 DP 466


>gi|350408961|ref|XP_003488566.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bombus
           impatiens]
          Length = 445

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 46/149 (30%), Positives = 66/149 (44%), Gaps = 51/149 (34%)

Query: 92  ADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASA 151
            DI  EI+ +GPV A M +Y D F                       SY+SG+Y  +A+ 
Sbjct: 338 TDIMYEILTSGPVQATMKVYQDFF-----------------------SYESGIYKHTATT 374

Query: 152 EIVA--YATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTI 209
           E  A  Y +V+I+GWGE+                SA       +K           YW +
Sbjct: 375 EHYAFGYHSVRIIGWGED---------------TSAHRYRNLPIK-----------YWLV 408

Query: 210 VSTFGEQFGDKGTIKILRGRNEAIIESLV 238
           V+++G+Q+G+ G  +I RG NE  IES V
Sbjct: 409 VNSWGQQWGESGLFRIQRGTNECDIESFV 437


>gi|355724275|gb|AES08176.1| tubulointerstitial nephritis antigen-like 1 [Mustela putorius furo]
          Length = 454

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 58/201 (28%), Positives = 83/201 (41%), Gaps = 64/201 (31%)

Query: 55  ATPQPKC--HTRCTNDNYGRGFFQ-------------DKYRFKRYYWVNDEVADIQQEIM 99
           A P P+C  H+R      GRG  Q             D Y+    Y +     +I +E+M
Sbjct: 290 AGPAPRCMMHSR----AMGRGKRQATARCPSSHAHANDIYQVTPAYRLGSNEKEIMKELM 345

Query: 100 KNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATV 159
           +NGPV A M ++ D F Y+SG Y + PV     L       + G +            +V
Sbjct: 346 ENGPVQALMEVHEDFFLYQSGIYSHTPVS----LGRPERYRRHGTH------------SV 389

Query: 160 KIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGD 219
           KI GWGEE                   +    T+K           YWT  +++G  +G+
Sbjct: 390 KITGWGEET------------------LPDGRTLK-----------YWTAANSWGPAWGE 420

Query: 220 KGTIKILRGRNEAIIESLVNG 240
           +G  +I+RG NE  IES V G
Sbjct: 421 RGHFRIVRGANECDIESFVLG 441


>gi|312383398|gb|EFR28501.1| hypothetical protein AND_03481 [Anopheles darlingi]
          Length = 573

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 50/165 (30%), Positives = 71/165 (43%), Gaps = 50/165 (30%)

Query: 79  YRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIF 138
           YR    Y +N+E  DI  EI + G V A + +Y D FSY++G Y +              
Sbjct: 419 YRMGPAYSLNNET-DIMTEIKERGTVQAILRVYRDFFSYQNGIYRHSA------------ 465

Query: 139 SYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGW 198
                  A + + E  AY +V+++GWGEE                    V Y  VK    
Sbjct: 466 -------AATPAEERSAYHSVRLIGWGEER-------------------VGYDMVK---- 495

Query: 199 GEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
                  YW  V+++G  +G+ G  +ILRG NE  IES V  + P
Sbjct: 496 -------YWIAVNSWGTWWGENGRFRILRGTNECEIESYVLASNP 533


>gi|297814171|ref|XP_002874969.1| hypothetical protein ARALYDRAFT_490415 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297320806|gb|EFH51228.1| hypothetical protein ARALYDRAFT_490415 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 359

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 64/244 (26%), Positives = 89/244 (36%), Gaps = 77/244 (31%)

Query: 2   CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
           C  G   + W +    G+VT     +  NTGC               S P C+  A P P
Sbjct: 171 CDGGYPIAAWQYFSYSGVVTEECDPYFDNTGC---------------SHPGCEP-AYPTP 214

Query: 60  KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
           +C  +C +DN  + + + K+     Y VN    DI  E+ KNGPV  +  +Y D F++  
Sbjct: 215 RCLRKCVSDN--KLWSESKHYSVSTYTVNSSPQDIMAEVYKNGPVEVSFTVYED-FAH-- 269

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
                               YKSGVY     + I  +A VK++GWG  N           
Sbjct: 270 --------------------YKSGVYKHITGSNIGGHA-VKLIGWGTSN----------- 297

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
                                  G  YW + + +   +GD G   I RG NE  IE    
Sbjct: 298 ----------------------EGEDYWLMANQWNRGWGDDGYFMIRRGTNECGIEDEPV 335

Query: 240 GALP 243
             LP
Sbjct: 336 AGLP 339


>gi|328726763|ref|XP_003249034.1| PREDICTED: cathepsin B-like cysteine proteinase-like, partial
           [Acyrthosiphon pisum]
          Length = 129

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 41/122 (33%), Positives = 55/122 (45%), Gaps = 34/122 (27%)

Query: 125 GPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSAS 184
           GP+ A+  +Y D  SYKSGVY  + +A        K+ G                     
Sbjct: 42  GPIEASFDVYDDFPSYKSGVYQRTPNA-------TKLGG--------------------- 73

Query: 185 AEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
                   VKLIGWG E G PYW +V+++  Q+GD G  KI RG +E  I+S     +P 
Sbjct: 74  ------HAVKLIGWGVEEGTPYWLMVNSWNAQWGDNGLFKIRRGTDECRIDSATTAGVPV 127

Query: 245 DN 246
            N
Sbjct: 128 TN 129


>gi|339248603|ref|XP_003373289.1| cathepsin B [Trichinella spiralis]
 gi|316970616|gb|EFV54519.1| cathepsin B [Trichinella spiralis]
          Length = 576

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 48/163 (29%), Positives = 69/163 (42%), Gaps = 46/163 (28%)

Query: 79  YRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIF 138
           Y+    Y ++    +I  EIM NGPV A   ++ D F YKSG Y + P   +        
Sbjct: 432 YKMTPPYRISTNEREIMTEIMANGPVQATFLVHEDFFMYKSGVYQHLPYAND-------- 483

Query: 139 SYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGW 198
             K   YA S       Y +V+I+GWG                      V ++T   I  
Sbjct: 484 --KGPAYARS------GYHSVRILGWG----------------------VDHSTGVPIK- 512

Query: 199 GEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                  YW   +++GE++G+ G  +ILRG N   IES + GA
Sbjct: 513 -------YWLCANSWGEEWGENGLFRILRGENHCDIESFIIGA 548


>gi|324713036|ref|NP_001191344.1| tubulointerstitial nephritis antigen-like isoform 3 [Homo sapiens]
 gi|119628008|gb|EAX07603.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_a [Homo
           sapiens]
          Length = 362

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 66/248 (26%), Positives = 97/248 (39%), Gaps = 72/248 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVS---------FPPCNHANYTTSEPECK 52
           C  G     W ++ +RG+V+         C P S          PPC    ++ +    K
Sbjct: 165 CRGGRLDGAWWFLRRRGVVS-------DHCYPFSGRERDEAGPAPPC--MMHSRAMGRGK 215

Query: 53  TLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYS 112
             AT    C     N+N       D Y+    Y +     +I +E+M+NGPV A M ++ 
Sbjct: 216 RQATAH--CPNSYVNNN-------DIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHE 266

Query: 113 DIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPY 172
           D F YK G Y + PV     L       + G +            +VKI GWGEE     
Sbjct: 267 DFFLYKGGIYSHTPVS----LGRPERYRRHGTH------------SVKITGWGEET---- 306

Query: 173 WTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 232
                         +    T+K           YWT  +++G  +G++G  +I+RG NE 
Sbjct: 307 --------------LPDGRTLK-----------YWTAANSWGPAWGERGHFRIVRGVNEC 341

Query: 233 IIESLVNG 240
            IES V G
Sbjct: 342 DIESFVLG 349


>gi|356505709|ref|XP_003521632.1| PREDICTED: cathepsin B-like [Glycine max]
          Length = 357

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 64/244 (26%), Positives = 90/244 (36%), Gaps = 77/244 (31%)

Query: 2   CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
           C  G     W ++   G+VT     +    GC   S P C        EP  +T     P
Sbjct: 169 CDGGYPLYAWRYLAHHGVVTEECDPYFDQIGC---SHPGC--------EPAYRT-----P 212

Query: 60  KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
           KC  +C + N  + + + K+     Y VN +  DI  E+ KNGPV     +Y D   YKS
Sbjct: 213 KCVKKCVSGN--QVWKKSKHYSVSAYRVNSDPHDIMAEVYKNGPVEVAFTVYEDFAYYKS 270

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
           G            +Y  I  Y+ G +A            VK++GWG              
Sbjct: 271 G------------VYKHITGYELGGHA------------VKLIGWGT------------- 293

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
                                ++G  YW + + +  ++GD G  KI RG NE  IE  V 
Sbjct: 294 --------------------TDDGEDYWLLANQWNREWGDDGYFKIRRGTNECGIEEDVT 333

Query: 240 GALP 243
             LP
Sbjct: 334 AGLP 337


>gi|330805199|ref|XP_003290573.1| hypothetical protein DICPUDRAFT_155103 [Dictyostelium purpureum]
 gi|325079281|gb|EGC32888.1| hypothetical protein DICPUDRAFT_155103 [Dictyostelium purpureum]
          Length = 313

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 62/242 (25%), Positives = 89/242 (36%), Gaps = 73/242 (30%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   S W ++ K+G+VT         C+P + P C  A     +P    + TP   C
Sbjct: 144 CEGGDDVSAWNFLKKQGVVT-------QECKPYTIPTCPPAQ----QPCLNFVNTPN--C 190

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C + N    + QDK++  + Y +N  V  I QEI  NGPV A   +Y D   YKSG 
Sbjct: 191 VKQCES-NSTLIYSQDKHKMAKIYSIN-SVEAIMQEISTNGPVEACFSVYEDFLGYKSGV 248

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         + + +    VKI G+G  NG  YW++   +  
Sbjct: 249 YQH------------------------TTGKFLGGHCVKIFGYGTLNGVNYWSVANSWTT 284

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
           S                                  +GD G   I RG +E  IE  V   
Sbjct: 285 S----------------------------------WGDNGIFLIKRGSDECGIEDEVVAG 310

Query: 242 LP 243
           +P
Sbjct: 311 IP 312


>gi|62320420|dbj|BAD94873.1| cathepsin B-like cysteine proteinase like protein [Arabidopsis
           thaliana]
          Length = 183

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 63/237 (26%), Positives = 86/237 (36%), Gaps = 77/237 (32%)

Query: 11  WVWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTND 68
           W++    G+VT     +  NTGC   S P C        EP       P PKC  +C + 
Sbjct: 4   WLYFKYHGVVTQECDPYFDNTGC---SHPGC--------EP-----TYPTPKCERKCVSR 47

Query: 69  NYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVV 128
           N   G  + K+     Y +N +  DI  E+                       Y NGPV 
Sbjct: 48  NQLWG--ESKHYGVGAYRINPDPQDIMAEV-----------------------YKNGPVE 82

Query: 129 ANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIV 188
               +Y D   YKSGVY      +I  +A VK++GWG                       
Sbjct: 83  VAFTVYEDFAHYKSGVYKYITGTKIGGHA-VKLIGWGT---------------------- 119

Query: 189 AYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKD 245
                       ++G  YW + + +   +GD G  KI RG NE  IE  V   LP +
Sbjct: 120 -----------SDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEQSVVAGLPSE 165


>gi|156708120|gb|ABU93318.1| cathepsin B9 cysteine protease, partial [Monocercomonoides sp. PA]
          Length = 382

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 69/272 (25%), Positives = 102/272 (37%), Gaps = 74/272 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G + + + +  K G+ T         C P     C+H       P C +  TP   C
Sbjct: 138 CNGGWTETAFEYAKKAGVPT-------EECVPYLMGKCHH-------PGCSSWQTPT--C 181

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C++ +         Y  K Y  +   V  IQ E+M+NGPV A    Y D+  Y  G 
Sbjct: 182 KKECSSLSNYNYSSNRYYASKSYS-IQRNVEAIQLELMRNGPVTAVFTTYDDLAVYWRG- 239

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWG---------------- 165
                      +Y+ +   + G++A            +KIVGWG                
Sbjct: 240 -----------VYNHVMGSEQGLHA------------IKIVGWGVWRESEHMLTEEEKKA 276

Query: 166 -------------EENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVST 212
                        +E     W   +  A+  S ++    T       +E G PYW IV++
Sbjct: 277 EEEKRKRIEEEIKKEKREDKWHDFKQNALEKSKKVKRDETKN----NKEEGIPYWIIVNS 332

Query: 213 FGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
           +GE FG  G + I RG NE  IES V   +PK
Sbjct: 333 WGEDFGMDGILLIKRGVNECGIESDVYTGIPK 364


>gi|297665716|ref|XP_002811185.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 3
           [Pongo abelii]
          Length = 436

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 64/248 (25%), Positives = 96/248 (38%), Gaps = 72/248 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVS---------FPPCNHANYTTSEPECK 52
           C  G     W ++ +RG+V+         C P S          PPC   +      + +
Sbjct: 239 CRGGRLDGAWWFLRRRGVVS-------DHCYPFSGRERDEAGPTPPCMMHSRAMGRGKRQ 291

Query: 53  TLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYS 112
             A+    C     N+N       D Y+    Y +     +I +E+M+NGPV A M ++ 
Sbjct: 292 ATAS----CPNSHVNNN-------DIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHE 340

Query: 113 DIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPY 172
           D F YK G Y + PV     L       + G +            +VKI GWGEE     
Sbjct: 341 DFFLYKGGIYSHTPVS----LGRPERYRRHGTH------------SVKITGWGEET---- 380

Query: 173 WTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 232
                         +    T+K           YWT  +++G  +G++G  +I+RG NE 
Sbjct: 381 --------------LPDGRTLK-----------YWTAANSWGPAWGERGHFRIVRGVNEC 415

Query: 233 IIESLVNG 240
            IES V G
Sbjct: 416 DIESFVLG 423


>gi|296207307|ref|XP_002750588.1| PREDICTED: tubulointerstitial nephritis antigen-like [Callithrix
           jacchus]
          Length = 467

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 64/245 (26%), Positives = 95/245 (38%), Gaps = 66/245 (26%)

Query: 2   CSSGISSSTWVWVHKRGLVTG------GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLA 55
           C  G     W ++ +RG+V+       G      G  PV  PPC   +  T   + +  A
Sbjct: 270 CRGGHLDGAWWFLRRRGVVSDHCYPFLGRERDKAG--PV--PPCMMHSRATGRGKRQATA 325

Query: 56  TPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIF 115
                C     N+N       + Y+    Y +     +I +E+M+NGPV A M ++ D F
Sbjct: 326 ----HCPNGHVNNN-------NIYQVTPAYRLGSNDTEIMKELMENGPVQALMEVHEDFF 374

Query: 116 SYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
            YK G Y + PV     L       + G +            +VKI GWGEE        
Sbjct: 375 LYKGGIYSHTPV----NLGRPERYRRHGTH------------SVKITGWGEET------- 411

Query: 176 VRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
                                 W +     YWT  +++G  +G++G  +I+RG NE  IE
Sbjct: 412 ----------------------WPDGRKLKYWTAANSWGPAWGERGHFRIVRGVNECDIE 449

Query: 236 SLVNG 240
           S V G
Sbjct: 450 SFVLG 454


>gi|332254562|ref|XP_003276398.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 3
           [Nomascus leucogenys]
          Length = 362

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 65/248 (26%), Positives = 97/248 (39%), Gaps = 72/248 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVS---------FPPCNHANYTTSEPECK 52
           C  G     W ++ +RG+V+         C P S          PPC    ++ +    K
Sbjct: 165 CRGGRLDGAWWFLRRRGVVS-------DHCYPFSGRERDEAGPAPPC--MMHSRAMGRGK 215

Query: 53  TLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYS 112
             AT    C     N+N       D Y+    Y +     ++ +E+M+NGPV A M ++ 
Sbjct: 216 RQATAH--CPNSHVNNN-------DIYQVTPVYRLGSNDKEVMKELMENGPVQALMEVHE 266

Query: 113 DIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPY 172
           D F YK G Y + PV     L       + G +            +VKI GWGEE     
Sbjct: 267 DFFLYKGGIYSHTPVS----LGRPERYRRHGTH------------SVKITGWGEET---- 306

Query: 173 WTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 232
                         +    T+K           YWT  +++G  +G++G  +I+RG NE 
Sbjct: 307 --------------LPDGRTLK-----------YWTAANSWGPAWGERGHFRIVRGVNEC 341

Query: 233 IIESLVNG 240
            IES V G
Sbjct: 342 DIESFVLG 349


>gi|340712697|ref|XP_003394892.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bombus
           terrestris]
          Length = 445

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 46/149 (30%), Positives = 66/149 (44%), Gaps = 51/149 (34%)

Query: 92  ADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASA 151
            DI  EI+ +GPV A M +Y D F                       SY+SG+Y  +A+ 
Sbjct: 338 TDIMYEILTSGPVQATMKVYQDFF-----------------------SYESGIYKHTATT 374

Query: 152 EIVA--YATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTI 209
           E  A  Y +V+I+GWGE+                SA       +K           YW +
Sbjct: 375 EHYAFGYHSVRIIGWGED---------------TSAHRHHNLPIK-----------YWLV 408

Query: 210 VSTFGEQFGDKGTIKILRGRNEAIIESLV 238
           V+++G+Q+G+ G  +I RG NE  IES V
Sbjct: 409 VNSWGQQWGESGLFRIQRGTNECDIESFV 437


>gi|395833440|ref|XP_003789742.1| PREDICTED: tubulointerstitial nephritis antigen [Otolemur
           garnettii]
          Length = 464

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 61/247 (24%), Positives = 98/247 (39%), Gaps = 71/247 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGA-------HHSNTGCQPVSFPPCNHANYTTSEPECKTL 54
           C+SG     W ++ KRGLV+          H +N+GC          A  + S+   K  
Sbjct: 272 CNSGSIDRAWWYLRKRGLVSHACYPLFKDQHATNSGC----------AMASRSDGRGKRH 321

Query: 55  ATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDI 114
           AT +P     C N+          Y+    Y ++    +I +EIM+NGPV A M ++ D 
Sbjct: 322 AT-KP-----CPNNIEKSNRI---YQCSPPYRISSNETEIMKEIMQNGPVQAIMQVHEDF 372

Query: 115 FSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWT 174
           F YKSG                I+ + +  +  S +   +    VK++GWG   G     
Sbjct: 373 FHYKSG----------------IYRHVASTHGESENYRKLRTHAVKLLGWGTLRG----- 411

Query: 175 IVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 234
                                    +     +W   +++G+ +G+ G  +ILRG NE+ I
Sbjct: 412 ------------------------AQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDI 447

Query: 235 ESLVNGA 241
           E L+  A
Sbjct: 448 EKLIIAA 454


>gi|14042811|dbj|BAB55403.1| unnamed protein product [Homo sapiens]
          Length = 218

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 65/248 (26%), Positives = 97/248 (39%), Gaps = 72/248 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVS---------FPPCNHANYTTSEPECK 52
           C  G     W ++ +RG+V+         C P S          PPC    ++ +    K
Sbjct: 21  CRGGRLDGAWWFLRRRGVVS-------DHCYPFSGRERDEAGPAPPC--MMHSRAMGRGK 71

Query: 53  TLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYS 112
             AT    C     N+N       D Y+    Y +     +I +E+M+NGPV A M ++ 
Sbjct: 72  RQATAH--CPNSYVNNN-------DIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHE 122

Query: 113 DIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPY 172
           D F Y+ G Y + PV     L       + G +            +VKI GWGEE     
Sbjct: 123 DFFLYEGGIYSHTPVS----LGRPERYRRHGTH------------SVKITGWGEET---- 162

Query: 173 WTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 232
                         +    T+K           YWT  +++G  +G++G  +I+RG NE 
Sbjct: 163 --------------LPDGRTLK-----------YWTAANSWGPAWGERGHFRIVRGVNEC 197

Query: 233 IIESLVNG 240
            IES V G
Sbjct: 198 DIESFVLG 205


>gi|297665714|ref|XP_002811184.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 2
           [Pongo abelii]
          Length = 467

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 64/248 (25%), Positives = 96/248 (38%), Gaps = 72/248 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVS---------FPPCNHANYTTSEPECK 52
           C  G     W ++ +RG+V+         C P S          PPC   +      + +
Sbjct: 270 CRGGRLDGAWWFLRRRGVVS-------DHCYPFSGRERDEAGPTPPCMMHSRAMGRGKRQ 322

Query: 53  TLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYS 112
             A+    C     N+N       D Y+    Y +     +I +E+M+NGPV A M ++ 
Sbjct: 323 ATAS----CPNSHVNNN-------DIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHE 371

Query: 113 DIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPY 172
           D F YK G Y + PV     L       + G +            +VKI GWGEE     
Sbjct: 372 DFFLYKGGIYSHTPVS----LGRPERYRRHGTH------------SVKITGWGEET---- 411

Query: 173 WTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 232
                         +    T+K           YWT  +++G  +G++G  +I+RG NE 
Sbjct: 412 --------------LPDGRTLK-----------YWTAANSWGPAWGERGHFRIVRGVNEC 446

Query: 233 IIESLVNG 240
            IES V G
Sbjct: 447 DIESFVLG 454


>gi|255548165|ref|XP_002515139.1| cathepsin B, putative [Ricinus communis]
 gi|223545619|gb|EEF47123.1| cathepsin B, putative [Ricinus communis]
          Length = 376

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 58/242 (23%), Positives = 84/242 (34%), Gaps = 73/242 (30%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +    G+VT         C P         N   S P C+    P PKC
Sbjct: 186 CDGGYPMYAWRYFVHHGVVT-------EECDPY------FDNIGCSHPGCEP-GFPTPKC 231

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C + N  + + Q K+     Y ++ +  D+  E+ KNGPV  +  +Y D   YKSG 
Sbjct: 232 VRKCIDKN--QLWRQSKHYSVNAYRISSDPHDVMAEVYKNGPVEVSFTVYEDFAHYKSGV 289

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +                         + E++    VK++GWG                
Sbjct: 290 YKH------------------------ITGEVMGGHAVKLIGWGT--------------- 310

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                              +NG  YW + + +   +GD G  KI RG NE  IE      
Sbjct: 311 ------------------SDNGEDYWLLANQWNRGWGDDGYFKIRRGTNECGIEDDAVAG 352

Query: 242 LP 243
           LP
Sbjct: 353 LP 354


>gi|351709947|gb|EHB12866.1| Tubulointerstitial nephritis antigen-like protein [Heterocephalus
           glaber]
          Length = 467

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 63/242 (26%), Positives = 96/242 (39%), Gaps = 60/242 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W ++ +RG+V+   +  +   Q  + P      ++ +    K  AT     
Sbjct: 270 CQGGRLDGAWWFLRRRGVVSDHCYPFSGHEQAEAGPATPCMMHSRAMGRGKRQAT----- 324

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             RC N +       + Y+    Y +  +  +I +E+M+NGPV A M +Y D F YKSG 
Sbjct: 325 -RRCPNSHDDA---NEIYQVTPAYRLGSDEKEIMKELMENGPVQALMEVYEDFFLYKSG- 379

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEE---NGRPYWTIVRV 178
                          I+S+                 +VKI GWGEE   +GR        
Sbjct: 380 ---------------IYSHTLVSMGRPEQYRRHGTHSVKITGWGEEMLPDGR-------- 416

Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                        T+K           YWT  +++G  +G++G  +ILRG NE  IES V
Sbjct: 417 -------------TLK-----------YWTAANSWGPSWGERGYFRILRGSNECDIESFV 452

Query: 239 NG 240
            G
Sbjct: 453 LG 454


>gi|194882138|ref|XP_001975170.1| GG20712 [Drosophila erecta]
 gi|190658357|gb|EDV55570.1| GG20712 [Drosophila erecta]
          Length = 431

 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 66/244 (27%), Positives = 89/244 (36%), Gaps = 78/244 (31%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   + W ++HK+G+V       +  C P          YT     CK     +   
Sbjct: 253 CDGGHLDAAWRYLHKKGVV-------DESCYP----------YTQHRDTCKIRHNSRSLR 295

Query: 62  HTRC-TNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
              C T  N  R  F   Y     Y +N E ADI  EI  +GPV A M +  D FSY  G
Sbjct: 296 ANGCETPVNVDRDTF---YTVGPAYSLNRE-ADIMAEIFNSGPVQATMRVNRDFFSYSRG 351

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEI-VAYATVKIVGWGEENGRPYWTIVRVY 179
            Y                         +A+ E    + +VK+VGWGEE+           
Sbjct: 352 VYRQ----------------------TAANREAPTGFHSVKLVGWGEEH----------- 378

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
                                 NG  YW   +++G  +G+KG  +ILRG NE  IE  V 
Sbjct: 379 ----------------------NGEKYWIAANSWGSWWGEKGYFRILRGSNECGIEEYVL 416

Query: 240 GALP 243
            + P
Sbjct: 417 ASWP 420


>gi|324711034|ref|NP_001191343.1| tubulointerstitial nephritis antigen-like isoform 2 precursor [Homo
           sapiens]
 gi|194391000|dbj|BAG60618.1| unnamed protein product [Homo sapiens]
          Length = 436

 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 66/248 (26%), Positives = 97/248 (39%), Gaps = 72/248 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVS---------FPPCNHANYTTSEPECK 52
           C  G     W ++ +RG+V+         C P S          PPC    ++ +    K
Sbjct: 239 CRGGRLDGAWWFLRRRGVVS-------DHCYPFSGRERDEAGPAPPC--MMHSRAMGRGK 289

Query: 53  TLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYS 112
             AT    C     N+N       D Y+    Y +     +I +E+M+NGPV A M ++ 
Sbjct: 290 RQATAH--CPNSYVNNN-------DIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHE 340

Query: 113 DIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPY 172
           D F YK G Y + PV     L       + G +            +VKI GWGEE     
Sbjct: 341 DFFLYKGGIYSHTPVS----LGRPERYRRHGTH------------SVKITGWGEET---- 380

Query: 173 WTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 232
                         +    T+K           YWT  +++G  +G++G  +I+RG NE 
Sbjct: 381 --------------LPDGRTLK-----------YWTAANSWGPAWGERGHFRIVRGVNEC 415

Query: 233 IIESLVNG 240
            IES V G
Sbjct: 416 DIESFVLG 423


>gi|397515891|ref|XP_003828175.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2 [Pan
           paniscus]
          Length = 436

 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 66/248 (26%), Positives = 97/248 (39%), Gaps = 72/248 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVS---------FPPCNHANYTTSEPECK 52
           C  G     W ++ +RG+V+         C P S          PPC    ++ +    K
Sbjct: 239 CRGGRLDGAWWFLRRRGVVS-------DHCYPFSGRERDEAGPAPPC--MMHSRAMGRGK 289

Query: 53  TLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYS 112
             AT    C     N+N       D Y+    Y +     +I +E+M+NGPV A M ++ 
Sbjct: 290 RQATAH--CPNSYVNNN-------DIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHE 340

Query: 113 DIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPY 172
           D F YK G Y + PV     L       + G +            +VKI GWGEE     
Sbjct: 341 DFFLYKGGIYSHTPVS----LGRPERYRRHGTH------------SVKITGWGEET---- 380

Query: 173 WTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 232
                         +    T+K           YWT  +++G  +G++G  +I+RG NE 
Sbjct: 381 --------------LPDGRTLK-----------YWTAANSWGPAWGERGHFRIVRGVNEC 415

Query: 233 IIESLVNG 240
            IES V G
Sbjct: 416 DIESFVLG 423


>gi|256052327|ref|XP_002569724.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
          Length = 96

 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 42/145 (28%), Positives = 61/145 (42%), Gaps = 58/145 (40%)

Query: 94  IQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEI 153
           IQ+EIMK GPV AN  +Y D  +YKSG Y +                         + ++
Sbjct: 3   IQKEIMKYGPVEANFIVYEDFLNYKSGIYKH------------------------ITGKL 38

Query: 154 VAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTF 213
            ++  ++I+GWGEEN  P                                  YW I +++
Sbjct: 39  FSWHAIRIIGWGEENNTP----------------------------------YWLIPNSW 64

Query: 214 GEQFGDKGTIKILRGRNEAIIESLV 238
            E +G+ G  +ILRGR+E  IES V
Sbjct: 65  NEDWGENGNFRILRGRHECSIESEV 89


>gi|426328832|ref|XP_004025452.1| PREDICTED: tubulointerstitial nephritis antigen-like [Gorilla
           gorilla gorilla]
          Length = 462

 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 68/251 (27%), Positives = 99/251 (39%), Gaps = 78/251 (31%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVS---------FPPCNHANYTTSEPECK 52
           C  G     W ++ +RG+V+         C P S          PPC    ++ +    K
Sbjct: 265 CRGGRLDGAWWFLRRRGVVS-------DHCYPFSGRERDEAGPAPPC--MMHSQAMGRGK 315

Query: 53  TLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYS 112
             AT    C     N+N       D Y+    Y +     +I +E+M+NGPV A M ++ 
Sbjct: 316 RQATAH--CPNSYVNNN-------DIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHE 366

Query: 113 DIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEE---NG 169
           D F YK G Y + PV     L       + G +            +VKI GWGEE   +G
Sbjct: 367 DFFLYKGGIYSHTPVS----LGRPERYRRHGTH------------SVKITGWGEETLPDG 410

Query: 170 RPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 229
           R                     T+K           YWT  +++G  +G++G  +I+RG 
Sbjct: 411 R---------------------TLK-----------YWTAANSWGPAWGERGHFRIVRGV 438

Query: 230 NEAIIESLVNG 240
           NE  IES V G
Sbjct: 439 NECDIESFVLG 449


>gi|195426329|ref|XP_002061289.1| GK20838 [Drosophila willistoni]
 gi|194157374|gb|EDW72275.1| GK20838 [Drosophila willistoni]
          Length = 432

 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 62/243 (25%), Positives = 89/243 (36%), Gaps = 77/243 (31%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   + W ++HK+G++       +  C P          YT S   CK   +   K 
Sbjct: 255 CEGGHLDAAWRYLHKKGVL-------DESCYP----------YTQSRGTCKVRHSGSLKA 297

Query: 62  HTRCTNDNYGRGFFQDK-YRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           H          G  +D  Y     Y ++ E ADI+ EI  +GPV A M +Y D FSY  G
Sbjct: 298 H----GCRPAPGVDRDSLYTVGPAYSLSRE-ADIKAEIFHSGPVQATMRVYRDFFSYSGG 352

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
            Y                         +       + +VK+VGWGEE+            
Sbjct: 353 IYRQ---------------------TAANRGAPTGFHSVKLVGWGEEH------------ 379

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                                NG  YW   +++G  +G++G  +ILRG NE  IE  V  
Sbjct: 380 ---------------------NGDKYWIAANSWGPWWGERGYFRILRGSNECGIEDYVLA 418

Query: 241 ALP 243
           + P
Sbjct: 419 SWP 421


>gi|40643250|emb|CAC83720.1| cathepsin B [Hordeum vulgare subsp. vulgare]
 gi|326494236|dbj|BAJ90387.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326499864|dbj|BAJ90767.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 344

 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 61/244 (25%), Positives = 85/244 (34%), Gaps = 77/244 (31%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQP-VSFPPCNHANYTTSEPECKTLATPQPK 60
           C  G   S W +  + G+VT         C P      C H       P C+  A P P 
Sbjct: 164 CDGGYPISAWQYFVQNGVVT-------EECDPYFDQVGCKH-------PGCEP-AYPTPV 208

Query: 61  CHTRCTNDNYGRGFFQDKYRFK-RYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
           C  +C   N     +Q+K  F    Y VN +  DI  E+ KNGPV     +Y D   YKS
Sbjct: 209 CEKKCKVQNQ---VWQEKKHFSIDAYQVNSDPHDIMAEVYKNGPVEVAFTVYEDFAHYKS 265

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
           G Y +                         +  ++    VK++GWG  +           
Sbjct: 266 GVYKH------------------------ITGGVMGGHAVKLIGWGTSDA---------- 291

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
                                  G  YW + + +   +GD G  KI+RG+NE  IE  V 
Sbjct: 292 -----------------------GEDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEDVT 328

Query: 240 GALP 243
             +P
Sbjct: 329 AGMP 332


>gi|332808277|ref|XP_524645.3| PREDICTED: LOW QUALITY PROTEIN: tubulointerstitial nephritis
           antigen-like 1 [Pan troglodytes]
          Length = 472

 Score = 62.0 bits (149), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 66/248 (26%), Positives = 97/248 (39%), Gaps = 72/248 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVS---------FPPCNHANYTTSEPECK 52
           C  G     W ++ +RG+V+         C P S          PPC    ++ +    K
Sbjct: 275 CRGGRLDGAWWFLRRRGVVS-------DHCYPFSGRERDEAGPAPPC--MMHSRAMGRGK 325

Query: 53  TLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYS 112
             AT    C     N+N       D Y+    Y +     +I +E+M+NGPV A M ++ 
Sbjct: 326 RQATAH--CPNSYVNNN-------DIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHE 376

Query: 113 DIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPY 172
           D F YK G Y + PV     L       + G +            +VKI GWGEE     
Sbjct: 377 DFFLYKGGIYSHTPVS----LGRPERYRRHGTH------------SVKITGWGEET---- 416

Query: 173 WTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 232
                         +    T+K           YWT  +++G  +G++G  +I+RG NE 
Sbjct: 417 --------------LPDGRTLK-----------YWTAANSWGPAWGERGHFRIVRGVNEC 451

Query: 233 IIESLVNG 240
            IES V G
Sbjct: 452 DIESFVLG 459


>gi|395526635|ref|XP_003765465.1| PREDICTED: tubulointerstitial nephritis antigen-like [Sarcophilus
           harrisii]
          Length = 467

 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 61/242 (25%), Positives = 92/242 (38%), Gaps = 60/242 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFP--PCN-HANYTTSEPECKTLATPQ 58
           C  G     W ++ +RGLV+   +  + G    + P  PC  H+ +        T   P 
Sbjct: 269 CRGGRLDGAWWFLRRRGLVSNNCYPFSEGDHNGAAPAAPCMMHSRHMGRGKRQATAHCPN 328

Query: 59  PKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
            + H                Y+    Y ++    DI +E+M+NGPV A + ++ D F YK
Sbjct: 329 SRTHA------------NHIYQATPPYRLSSHEKDIMKELMENGPVQALLEVHEDFFLYK 376

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
           SG Y + P              K   Y    +       +VKI GWGEE           
Sbjct: 377 SGIYKHTPASLG----------KPERYRQHGT------HSVKITGWGEE----------- 409

Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
             +    + V                 YWT  +++G  +G+ G  +I+RG NE  IES V
Sbjct: 410 --IQPDGQKVK----------------YWTAANSWGPTWGENGYFRIVRGANECDIESFV 451

Query: 239 NG 240
            G
Sbjct: 452 VG 453


>gi|11545918|ref|NP_071447.1| tubulointerstitial nephritis antigen-like isoform 1 precursor [Homo
           sapiens]
 gi|61213628|sp|Q9GZM7.1|TINAL_HUMAN RecName: Full=Tubulointerstitial nephritis antigen-like; AltName:
           Full=Glucocorticoid-inducible protein 5; AltName:
           Full=Oxidized LDL-responsive gene 2 protein;
           Short=OLRG-2; AltName: Full=Tubulointerstitial nephritis
           antigen-related protein; Short=TIN Ag-related protein;
           Short=TIN-Ag-RP; Flags: Precursor
 gi|11602840|gb|AAG38876.1|AF236150_1 tubulointerstitial nephritis antigen-related protein precursor
           [Homo sapiens]
 gi|11275667|gb|AAG33699.1| oxidized-LDL responsive gene 2 [Homo sapiens]
 gi|11527793|dbj|BAB18636.1| glucocorticoid-inducible protein [Homo sapiens]
 gi|11527809|dbj|BAB18727.1| glucocorticoid-inducible protein [Homo sapiens]
 gi|11761715|gb|AAG40154.1| tubulointerstitial nephritis antigen-related protein [Homo sapiens]
 gi|22761462|dbj|BAC11596.1| unnamed protein product [Homo sapiens]
 gi|37181967|gb|AAQ88787.1| LCN7 [Homo sapiens]
 gi|40353044|gb|AAH64633.1| Tubulointerstitial nephritis antigen-like 1 [Homo sapiens]
 gi|119628009|gb|EAX07604.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_b [Homo
           sapiens]
 gi|119628010|gb|EAX07605.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_b [Homo
           sapiens]
 gi|119628011|gb|EAX07606.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_b [Homo
           sapiens]
 gi|158258977|dbj|BAF85459.1| unnamed protein product [Homo sapiens]
 gi|261858502|dbj|BAI45773.1| tubulointerstitial nephritis antigen-like 1 [synthetic construct]
 gi|410265400|gb|JAA20666.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
 gi|410307560|gb|JAA32380.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
 gi|410307562|gb|JAA32381.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
 gi|410307564|gb|JAA32382.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
 gi|410335249|gb|JAA36571.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
          Length = 467

 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 66/248 (26%), Positives = 97/248 (39%), Gaps = 72/248 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVS---------FPPCNHANYTTSEPECK 52
           C  G     W ++ +RG+V+         C P S          PPC    ++ +    K
Sbjct: 270 CRGGRLDGAWWFLRRRGVVS-------DHCYPFSGRERDEAGPAPPC--MMHSRAMGRGK 320

Query: 53  TLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYS 112
             AT    C     N+N       D Y+    Y +     +I +E+M+NGPV A M ++ 
Sbjct: 321 RQATAH--CPNSYVNNN-------DIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHE 371

Query: 113 DIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPY 172
           D F YK G Y + PV     L       + G +            +VKI GWGEE     
Sbjct: 372 DFFLYKGGIYSHTPVS----LGRPERYRRHGTH------------SVKITGWGEET---- 411

Query: 173 WTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 232
                         +    T+K           YWT  +++G  +G++G  +I+RG NE 
Sbjct: 412 --------------LPDGRTLK-----------YWTAANSWGPAWGERGHFRIVRGVNEC 446

Query: 233 IIESLVNG 240
            IES V G
Sbjct: 447 DIESFVLG 454


>gi|397515889|ref|XP_003828174.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1 [Pan
           paniscus]
          Length = 467

 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 66/248 (26%), Positives = 97/248 (39%), Gaps = 72/248 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVS---------FPPCNHANYTTSEPECK 52
           C  G     W ++ +RG+V+         C P S          PPC    ++ +    K
Sbjct: 270 CRGGRLDGAWWFLRRRGVVS-------DHCYPFSGRERDEAGPAPPC--MMHSRAMGRGK 320

Query: 53  TLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYS 112
             AT    C     N+N       D Y+    Y +     +I +E+M+NGPV A M ++ 
Sbjct: 321 RQATAH--CPNSYVNNN-------DIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHE 371

Query: 113 DIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPY 172
           D F YK G Y + PV     L       + G +            +VKI GWGEE     
Sbjct: 372 DFFLYKGGIYSHTPVS----LGRPERYRRHGTH------------SVKITGWGEET---- 411

Query: 173 WTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 232
                         +    T+K           YWT  +++G  +G++G  +I+RG NE 
Sbjct: 412 --------------LPDGRTLK-----------YWTAANSWGPAWGERGHFRIVRGVNEC 446

Query: 233 IIESLVNG 240
            IES V G
Sbjct: 447 DIESFVLG 454


>gi|291228863|ref|XP_002734398.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 451

 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 63/237 (26%), Positives = 94/237 (39%), Gaps = 70/237 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           CS G     W ++ KRG+V+         C P          YT+ + + K +     K 
Sbjct: 246 CSGGHIDRAWWFMRKRGVVS-------NDCYP----------YTSGDQDKKGVCMMPGKL 288

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
            + C     GR    + +     Y +     +IQ EIM+NGPV A+  +  D F Y SG 
Sbjct: 289 PSDCPT---GRERNNELHHSTPPYRIAANEREIQVEIMENGPVQASFEVKEDFFMYGSGV 345

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y + P+ +N     D   Y +             + +VK++GWG ENG  YW        
Sbjct: 346 YRHTPIASN-----DAEQYHAS-----------EWHSVKLLGWGVENGIKYW-------- 381

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                         +G             +++G ++G+ G  KILRG NE  IES V
Sbjct: 382 --------------LG------------ANSWGTKWGEDGYFKILRGENECNIESYV 412


>gi|332254560|ref|XP_003276397.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
           [Nomascus leucogenys]
          Length = 436

 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 65/248 (26%), Positives = 97/248 (39%), Gaps = 72/248 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVS---------FPPCNHANYTTSEPECK 52
           C  G     W ++ +RG+V+         C P S          PPC    ++ +    K
Sbjct: 239 CRGGRLDGAWWFLRRRGVVS-------DHCYPFSGRERDEAGPAPPC--MMHSRAMGRGK 289

Query: 53  TLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYS 112
             AT    C     N+N       D Y+    Y +     ++ +E+M+NGPV A M ++ 
Sbjct: 290 RQATAH--CPNSHVNNN-------DIYQVTPVYRLGSNDKEVMKELMENGPVQALMEVHE 340

Query: 113 DIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPY 172
           D F YK G Y + PV     L       + G +            +VKI GWGEE     
Sbjct: 341 DFFLYKGGIYSHTPVS----LGRPERYRRHGTH------------SVKITGWGEET---- 380

Query: 173 WTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 232
                         +    T+K           YWT  +++G  +G++G  +I+RG NE 
Sbjct: 381 --------------LPDGRTLK-----------YWTAANSWGPAWGERGHFRIVRGVNEC 415

Query: 233 IIESLVNG 240
            IES V G
Sbjct: 416 DIESFVLG 423


>gi|332030944|gb|EGI70570.1| Uncharacterized peptidase C1-like protein F26E4.3 [Acromyrmex
           echinatior]
          Length = 501

 Score = 61.6 bits (148), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 49/150 (32%), Positives = 64/150 (42%), Gaps = 58/150 (38%)

Query: 93  DIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAE 152
           DI QEI+ +GPV A M +Y D                        F YK+G+Y  S SAE
Sbjct: 398 DIMQEILTSGPVQATMRVYQD-----------------------FFVYKNGIYRHSQSAE 434

Query: 153 I--VAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRP--YWT 208
           +    Y +V+I+GWGEE         R Y                       G P  YW 
Sbjct: 435 LHDSGYHSVRIIGWGEE---------RSY----------------------RGPPLKYWL 463

Query: 209 IVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
           +V+++G  +G+ G  KI RG NE  IES V
Sbjct: 464 VVNSWGYNWGENGLFKIQRGTNECEIESYV 493


>gi|332254558|ref|XP_003276396.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
           [Nomascus leucogenys]
          Length = 467

 Score = 61.6 bits (148), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 65/248 (26%), Positives = 97/248 (39%), Gaps = 72/248 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVS---------FPPCNHANYTTSEPECK 52
           C  G     W ++ +RG+V+         C P S          PPC    ++ +    K
Sbjct: 270 CRGGRLDGAWWFLRRRGVVS-------DHCYPFSGRERDEAGPAPPC--MMHSRAMGRGK 320

Query: 53  TLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYS 112
             AT    C     N+N       D Y+    Y +     ++ +E+M+NGPV A M ++ 
Sbjct: 321 RQATAH--CPNSHVNNN-------DIYQVTPVYRLGSNDKEVMKELMENGPVQALMEVHE 371

Query: 113 DIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPY 172
           D F YK G Y + PV     L       + G +            +VKI GWGEE     
Sbjct: 372 DFFLYKGGIYSHTPVS----LGRPERYRRHGTH------------SVKITGWGEET---- 411

Query: 173 WTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 232
                         +    T+K           YWT  +++G  +G++G  +I+RG NE 
Sbjct: 412 --------------LPDGRTLK-----------YWTAANSWGPAWGERGHFRIVRGVNEC 446

Query: 233 IIESLVNG 240
            IES V G
Sbjct: 447 DIESFVLG 454


>gi|335290878|ref|XP_003127800.2| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Sus scrofa]
          Length = 362

 Score = 61.6 bits (148), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 63/248 (25%), Positives = 98/248 (39%), Gaps = 72/248 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPEC----KTLATP 57
           C  G     W ++ +RG+V+         C P S    +  N     P C    + +   
Sbjct: 165 CQGGRLDGAWWFLRRRGVVS-------DHCYPFSG---HERNEAGPAPRCMMHSRAMGRG 214

Query: 58  QPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
           + +   RC N         D Y+    Y +     DI +E+M+NGPV A M ++      
Sbjct: 215 KRQATARCPNSYV---HANDIYQVTPAYRLGSNEKDIMKELMENGPVQALMEVHE----- 266

Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
                             D F Y+SG+Y+ +  +                +GRP     R
Sbjct: 267 ------------------DFFLYQSGIYSHTPVS----------------HGRP--ERYR 290

Query: 178 VYAVSASAEIVAYATVKLIGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEA 232
            +            +VK+ GWGEE   +GR   YWT  +++G  +G++G  +I+RG NE 
Sbjct: 291 RHGTH---------SVKITGWGEETLPDGRMLKYWTAANSWGPGWGERGHFRIVRGANEC 341

Query: 233 IIESLVNG 240
            IES V G
Sbjct: 342 DIESFVLG 349


>gi|322788703|gb|EFZ14296.1| hypothetical protein SINV_07506 [Solenopsis invicta]
          Length = 443

 Score = 61.6 bits (148), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 49/151 (32%), Positives = 63/151 (41%), Gaps = 58/151 (38%)

Query: 92  ADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASA 151
            DI QEI+ +GPV A M +Y D F                        YKSG+Y  S SA
Sbjct: 339 TDIMQEILTSGPVQATMRVYQDFF-----------------------IYKSGIYRHSRSA 375

Query: 152 EI--VAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRP--YW 207
           E+    Y +V+I+GWGEE         R Y                       G P  YW
Sbjct: 376 ELHDSGYHSVRIIGWGEE---------RSY----------------------RGPPLKYW 404

Query: 208 TIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
            + +++G  +GD G  KI +G NE  IES V
Sbjct: 405 LVANSWGYNWGDNGLFKIQKGTNECEIESYV 435


>gi|290982673|ref|XP_002674054.1| predicted protein [Naegleria gruberi]
 gi|284087642|gb|EFC41310.1| predicted protein [Naegleria gruberi]
          Length = 673

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 30/66 (45%), Positives = 43/66 (65%), Gaps = 1/66 (1%)

Query: 109 YLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWG-EE 167
           Y+YS I +Y++    NGPV A+  +YSD +SYKSG+Y  +A +  V    VK++GW  + 
Sbjct: 214 YIYSPITNYQTEIMTNGPVEADFDVYSDFYSYKSGIYQKTAGSTYVGGHAVKVLGWASDS 273

Query: 168 NGRPYW 173
           NG PYW
Sbjct: 274 NGTPYW 279


>gi|449446774|ref|XP_004141146.1| PREDICTED: cathepsin B-like [Cucumis sativus]
          Length = 348

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 61/244 (25%), Positives = 88/244 (36%), Gaps = 77/244 (31%)

Query: 2   CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
           C  G   S W +  + G+VT     +   TGC   S P C        EP     A P P
Sbjct: 169 CDGGYPISAWRYFVRHGVVTEQCDPYFDTTGC---SHPGC--------EP-----AYPTP 212

Query: 60  KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
           +C   C + N  + + + K+     Y V  +  DI  E+ KNGPV  +  +Y D   YKS
Sbjct: 213 RCVRHCVDKN--QIWRKTKHYGVSAYRVKRDPNDIMAEVYKNGPVEVSFTVYEDFAHYKS 270

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
           G Y +                         + +++    VK++GWG              
Sbjct: 271 GVYKH------------------------ITGDVMGGHAVKLIGWGT------------- 293

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
                                ++G  YW + + +   +GD G  KI RG NE  IE  V 
Sbjct: 294 --------------------TDDGEDYWLLANQWNRGWGDDGYFKIRRGTNECGIEEDVV 333

Query: 240 GALP 243
             LP
Sbjct: 334 AGLP 337


>gi|1644295|emb|CAB03627.1| cysteine proteinase [Haemonchus contortus]
          Length = 345

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 40/123 (32%), Positives = 56/123 (45%), Gaps = 35/123 (28%)

Query: 116 SYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
           S+  G +GNGPVVA   +Y D   YK G+Y   A     A+A +KI+GWG ENG PY   
Sbjct: 251 SHSEGDHGNGPVVAVFTVYEDFSYYKKGIYVHIAGKARGAHA-IKIIGWGVENGLPY--- 306

Query: 176 VRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
                                          W I +++ + +G++G  +I+RG NE  IE
Sbjct: 307 -------------------------------WLIANSWHDDWGEQGLFRIVRGINECGIE 335

Query: 236 SLV 238
             V
Sbjct: 336 QEV 338


>gi|345794363|ref|XP_535330.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Canis lupus
           familiaris]
          Length = 467

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 57/201 (28%), Positives = 82/201 (40%), Gaps = 64/201 (31%)

Query: 55  ATPQPKC--HTRCTNDNYGRGFFQ-------------DKYRFKRYYWVNDEVADIQQEIM 99
           A P P+C  H+R      GRG  Q             D Y+    Y +     +I +E+M
Sbjct: 303 AGPAPRCMMHSR----AMGRGKRQATARCPSSHVHANDIYQVTPAYRLGTNEKEIMKELM 358

Query: 100 KNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATV 159
           +NGPV A M ++ D F Y+ G Y + PV     L       + G +            +V
Sbjct: 359 ENGPVQALMEVHEDFFLYQGGIYSHTPVS----LGRPERYRRHGTH------------SV 402

Query: 160 KIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGD 219
           KI GWGEE                   +    T+K           YWT  +++G  +G+
Sbjct: 403 KITGWGEET------------------LPDGRTLK-----------YWTAANSWGPAWGE 433

Query: 220 KGTIKILRGRNEAIIESLVNG 240
           +G  +I+RG NE  IES V G
Sbjct: 434 RGHFRIVRGANECDIESFVLG 454


>gi|449489527|ref|XP_004158338.1| PREDICTED: cathepsin B-like [Cucumis sativus]
          Length = 349

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 61/244 (25%), Positives = 88/244 (36%), Gaps = 77/244 (31%)

Query: 2   CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
           C  G   S W +  + G+VT     +   TGC   S P C        EP     A P P
Sbjct: 170 CDGGYPISAWRYFVRHGVVTEQCDPYFDTTGC---SHPGC--------EP-----AYPTP 213

Query: 60  KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
           +C   C + N  + + + K+     Y V  +  DI  E+ KNGPV  +  +Y D   YKS
Sbjct: 214 RCVRHCVDKN--QIWRKTKHYGVSAYRVKRDPNDIMAEVYKNGPVEVSFTVYEDFAHYKS 271

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
           G Y +                         + +++    VK++GWG              
Sbjct: 272 GVYKH------------------------ITGDVMGGHAVKLIGWGT------------- 294

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
                                ++G  YW + + +   +GD G  KI RG NE  IE  V 
Sbjct: 295 --------------------TDDGEDYWLLANQWNRGWGDDGYFKIRRGTNECGIEEDVV 334

Query: 240 GALP 243
             LP
Sbjct: 335 AGLP 338


>gi|10803439|emb|CAC13132.1| putative cathepsin B.6 [Ostertagia ostertagi]
          Length = 197

 Score = 61.2 bits (147), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 44/162 (27%), Positives = 60/162 (37%), Gaps = 25/162 (15%)

Query: 8   SSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 67
           S  W +  + G+ +GG +     C+P    PC      T   EC       P C   C  
Sbjct: 50  SQAWEFAXRNGVCSGGWYGEKGVCKPYPLHPCGKHXNQTYYGECPDHXYXTPACKKYCQY 109

Query: 68  DNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPV 127
             Y + +  DK      Y V  + A I+ EIM  GPV A   +Y D   Y  G Y     
Sbjct: 110 -GYDKRYXNDKVXVTSAYQVXSDEAAIRAEIMSRGPVQAAFTVYGDFMLYTXGIY----- 163

Query: 128 VANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENG 169
                              V  + +++    VKI+GWG ENG
Sbjct: 164 -------------------VHTAGKLMGGHGVKIIGWGVENG 186


>gi|327281715|ref|XP_003225592.1| PREDICTED: tubulointerstitial nephritis antigen-like [Anolis
           carolinensis]
          Length = 520

 Score = 61.2 bits (147), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 57/243 (23%), Positives = 99/243 (40%), Gaps = 62/243 (25%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPEC----KTLATP 57
           C+ G     W ++ +RG+VT         C P S    NH   + + P C    ++    
Sbjct: 321 CNGGRIDGAWWFLRRRGVVT-------DECYPFSNQETNH---SPNAPACMMHSRSTGRG 370

Query: 58  QPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
           + +   RC N    R    + Y+    Y ++    +I +E+M+NGPV A + ++ D F Y
Sbjct: 371 KRQAIARCPNP---RSHANEIYQSTPAYRLSSNEKEIMKELMENGPVQAILEVHEDFFMY 427

Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
           ++G          +Y ++ + + K   Y    +       +VKI GWGEE          
Sbjct: 428 RTG----------IYRHTAVAAGKPEQYRRHGT------HSVKITGWGEEQ--------- 462

Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
                                 + + + YW   +++G+ +G+ G  +I RG NE  IE+ 
Sbjct: 463 --------------------MPDGSNQKYWIAANSWGKDWGEHGYFRITRGENECEIETF 502

Query: 238 VNG 240
           V G
Sbjct: 503 VVG 505


>gi|156708114|gb|ABU93315.1| cathepsin B6 cysteine protease [Monocercomonoides sp. PA]
          Length = 281

 Score = 61.2 bits (147), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 52/177 (29%), Positives = 70/177 (39%), Gaps = 65/177 (36%)

Query: 59  PKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
           P C  +C+N   G    + KY     Y     V +IQ+E+MKNGPV     +YSD  +YK
Sbjct: 160 PACAAKCSN---GSQIIRYKYEKAETY----TVQNIQEELMKNGPVYFRFTVYSDFMNYK 212

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
           SG            +Y     Y+ G +A            V ++GWG E+G PYW +   
Sbjct: 213 SG------------VYQHKSGYQEGGHA------------VLLIGWGVEDGVPYWLLQN- 247

Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
                              WG     P W          G+KG  KI+RG+NE   E
Sbjct: 248 ------------------SWG-----PAW----------GEKGHFKIIRGKNECGCE 271


>gi|326490902|dbj|BAJ90118.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326508404|dbj|BAJ99469.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326514912|dbj|BAJ99817.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 345

 Score = 61.2 bits (147), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 59/245 (24%), Positives = 82/245 (33%), Gaps = 79/245 (32%)

Query: 2   CSSGISSSTWVWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
           C  G     W +  + G+VT          GCQ                P C+  A P P
Sbjct: 165 CDGGYPIFAWQYFVENGVVTDECDPFFDQVGCQ---------------HPGCEP-AYPTP 208

Query: 60  KCHTRCTNDNYGRGFFQDKYRFK-RYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
            C  +C   N     +++K  F    Y VN +  DI  E+ KNGPV  +  +Y D   YK
Sbjct: 209 VCEKKCKVQNQ---VWEEKKHFSIDAYQVNSDPHDIMAEVYKNGPVEVSFIIYEDFAHYK 265

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
           SG Y                           +  +V     K++GWG  +          
Sbjct: 266 SGVYKQ------------------------ITGRMVGGHAAKLIGWGTSDA--------- 292

Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                   G  YW + + +   +GD G  KI+RG NE  IE  V
Sbjct: 293 ------------------------GEDYWLLANQWNRGWGDDGYFKIIRGTNECGIEGDV 328

Query: 239 NGALP 243
           N  +P
Sbjct: 329 NAGMP 333


>gi|224285427|gb|ACN40436.1| unknown [Picea sitchensis]
          Length = 350

 Score = 61.2 bits (147), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 62/245 (25%), Positives = 88/245 (35%), Gaps = 80/245 (32%)

Query: 2   CSSGISSSTWVWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
           C  G   S W +    G+VT     +  + GCQ                P C+ L  P P
Sbjct: 164 CDGGYPISAWQYFISTGVVTAECDPYFDDAGCQ---------------HPGCEPL-YPTP 207

Query: 60  KCHTRCTNDNYGRGFFQDKYRFK-RYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
           +C  +C ++N   G   +  RF    Y ++ +  DI  E+  NGPV  +  +Y D   YK
Sbjct: 208 QCVKQCKDENQKWG---NSKRFSATAYRISSKPYDIMAEVYTNGPVEVSFSVYEDFAHYK 264

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
           SG                ++ Y  G Y        +    VK+VGWG E+G  YW +   
Sbjct: 265 SG----------------VYKYTKGDY--------MGGHAVKLVGWGTEDGTDYWLVANS 300

Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
           +  +               WGE+                   G  KI RG NE  IE  V
Sbjct: 301 WNTA---------------WGED-------------------GYFKIARGSNECGIEGDV 326

Query: 239 NGALP 243
              +P
Sbjct: 327 VAGMP 331


>gi|307201161|gb|EFN81067.1| Uncharacterized peptidase C1-like protein F26E4.3 [Harpegnathos
           saltator]
          Length = 443

 Score = 61.2 bits (147), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 49/165 (29%), Positives = 70/165 (42%), Gaps = 55/165 (33%)

Query: 76  QDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYS 135
           Q+ Y+    Y + +E  DI QEI+ +GPV A M +Y D                      
Sbjct: 324 QELYKVGPAYRLGNET-DIMQEILTSGPVQATMRVYQD---------------------- 360

Query: 136 DIFSYKSGVYAVSASAEI--VAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATV 193
             F YK+GVY  S SAE+    Y +++I+GWGEE                     +Y   
Sbjct: 361 -FFVYKNGVYRHSRSAELHDSGYHSMRIIGWGEE--------------------PSYRGP 399

Query: 194 KLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
            L          YW + +++G  +G+ G  +I RG NE  IES V
Sbjct: 400 PL---------KYWLVANSWGRHWGENGLFRIQRGTNECEIESYV 435


>gi|157058735|gb|ABV03125.1| cathepsin B-16 [Aulacorthum solani]
          Length = 246

 Score = 61.2 bits (147), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 40/116 (34%), Positives = 55/116 (47%), Gaps = 10/116 (8%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +    GLVTGG + S  GC+P   PPC H +   +    K    P  K 
Sbjct: 137 CHGGYPIKAWKYFSTHGLVTGGNYKSGEGCEPYRVPPCQHHHQGNNSCSDK----PMEKN 192

Query: 62  HTRCTNDNYGRG--FFQDKYRFKR-YYWVNDEVADIQQEIMKNGPVVANMYLYSDI 114
           H RCT   YG     + D +RF R YY++      IQ+++M  GP+ A+  +Y D 
Sbjct: 193 H-RCTRMCYGDQDLDYNDDHRFTRDYYYLT--YGSIQKDVMNYGPIEASFDVYDDF 245


>gi|116784401|gb|ABK23329.1| unknown [Picea sitchensis]
          Length = 350

 Score = 61.2 bits (147), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 62/245 (25%), Positives = 88/245 (35%), Gaps = 80/245 (32%)

Query: 2   CSSGISSSTWVWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
           C  G   S W +    G+VT     +  + GCQ                P C+ L  P P
Sbjct: 164 CDGGYPISAWQYFISTGVVTAECDPYFDDAGCQ---------------HPGCEPL-YPTP 207

Query: 60  KCHTRCTNDNYGRGFFQDKYRFK-RYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
           +C  +C ++N   G   +  RF    Y ++ +  DI  E+  NGPV  +  +Y D   YK
Sbjct: 208 QCVKQCKDENQKWG---NSKRFSATAYRISSKPYDIMAEVYTNGPVEVSFSVYEDFAHYK 264

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
           SG                ++ Y  G Y        +    VK+VGWG E+G  YW +   
Sbjct: 265 SG----------------VYKYTKGDY--------MGGHAVKLVGWGTEDGTDYWLVANS 300

Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
           +  +               WGE+                   G  KI RG NE  IE  V
Sbjct: 301 WNTA---------------WGED-------------------GYFKIARGSNECGIEGDV 326

Query: 239 NGALP 243
              +P
Sbjct: 327 VAGMP 331


>gi|90074902|dbj|BAE87131.1| unnamed protein product [Macaca fascicularis]
          Length = 296

 Score = 61.2 bits (147), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 41/111 (36%), Positives = 57/111 (51%), Gaps = 6/111 (5%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  +  W +  ++GLV+GG + S+ GC+P S PPC H +   S P C T     PKC
Sbjct: 150 CNGGYPAGAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGP---VVANMY 109
              C    Y   + QDK+     Y V++   DI  EI KNG    +VAN +
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGTPYWLVANSW 257



 Score = 44.3 bits (103), Expect = 0.059,   Method: Compositional matrix adjust.
 Identities = 18/46 (39%), Positives = 29/46 (63%)

Query: 201 ENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDN 246
           +NG PYW + +++   +GD G  KILRG++   IES V   +P+ +
Sbjct: 245 KNGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTD 290


>gi|302823081|ref|XP_002993195.1| hypothetical protein SELMODRAFT_270024 [Selaginella moellendorffii]
 gi|300138965|gb|EFJ05715.1| hypothetical protein SELMODRAFT_270024 [Selaginella moellendorffii]
          Length = 342

 Score = 61.2 bits (147), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 61/248 (24%), Positives = 91/248 (36%), Gaps = 76/248 (30%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQP-VSFPPCNHANYTTSEPECKTLATPQPK 60
           C  G   + W +  + G+VT       + C P      C H      EPE  T     P 
Sbjct: 166 CDGGYPYAAWEYFAQTGVVT-------SQCDPYFDGKGCKHPG---CEPEYDT-----PV 210

Query: 61  CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           C  +C ++   R     K+   + Y VN ++ DIQ EI KNGPV  +  +Y D   YKSG
Sbjct: 211 CVKQCVDNEQWR---DSKHFTVQTYAVNSDIYDIQAEIYKNGPVEVSYTVYEDFAHYKSG 267

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
                       +Y  +F             E++    VK +GWG               
Sbjct: 268 ------------VYKHVF------------GEVLGGHAVKFIGWGT-------------- 289

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                               ++G+ YW + +++   +G+ G  +I RG NE  IES    
Sbjct: 290 -------------------TDDGKDYWIVANSWNRSWGEDGFFQISRGSNECGIESEPVA 330

Query: 241 ALPKDNYG 248
            +P    G
Sbjct: 331 GIPLKKTG 338


>gi|270132817|ref|NP_075965.2| tubulointerstitial nephritis antigen-like precursor [Mus musculus]
 gi|270132824|ref|NP_001161805.1| tubulointerstitial nephritis antigen-like precursor [Mus musculus]
 gi|61213616|sp|Q99JR5.1|TINAL_MOUSE RecName: Full=Tubulointerstitial nephritis antigen-like; AltName:
           Full=Adrenocortical zonation factor 1; Short=AZ-1;
           AltName: Full=Androgen-regulated gene 1 protein;
           AltName: Full=Tubulointerstitial nephritis
           antigen-related protein; Short=TARP; Flags: Precursor
 gi|13543125|gb|AAH05738.1| Tinagl1 protein [Mus musculus]
 gi|17391278|gb|AAH18539.1| Tinagl1 protein [Mus musculus]
 gi|30314458|dbj|BAC76038.1| tubulointersititial nephritis antigen-related protein [Mus
           musculus]
 gi|148698197|gb|EDL30144.1| tubulointerstitial nephritis antigen-like, isoform CRA_a [Mus
           musculus]
 gi|148698198|gb|EDL30145.1| tubulointerstitial nephritis antigen-like, isoform CRA_a [Mus
           musculus]
          Length = 466

 Score = 60.8 bits (146), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 63/248 (25%), Positives = 103/248 (41%), Gaps = 72/248 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPEC----KTLATP 57
           C  G     W ++ +RG+V+         C P S    N A+ T   P C    + +   
Sbjct: 269 CRGGRLDGAWWFLRRRGVVS-------DNCYPFSGREQNEASPT---PRCMMHSRAMGRG 318

Query: 58  QPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
           + +  +RC N   G+    D Y+    Y +  +  +I +E+M+NGPV A M ++      
Sbjct: 319 KRQATSRCPN---GQVDSNDIYQVTPAYRLGSDEKEIMKELMENGPVQALMEVHE----- 370

Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
                             D F Y+ G+Y+ +  ++                GRP     R
Sbjct: 371 ------------------DFFLYQRGIYSHTPVSQ----------------GRP--EQYR 394

Query: 178 VYAVSASAEIVAYATVKLIGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEA 232
            +            +VK+ GWGEE   +GR   YWT  +++G  +G++G  +I+RG NE 
Sbjct: 395 RHGTH---------SVKITGWGEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGTNEC 445

Query: 233 IIESLVNG 240
            IE+ V G
Sbjct: 446 DIETFVLG 453


>gi|349604734|gb|AEQ00202.1| Cathepsin B-like protein, partial [Equus caballus]
          Length = 134

 Score = 60.8 bits (146), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 41/120 (34%), Positives = 55/120 (45%), Gaps = 35/120 (29%)

Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
           NGPV A   +YSD   YKSGVY   A   +  +A V+I+GWG ENG PYW          
Sbjct: 41  NGPVEAAFTVYSDFLQYKSGVYQHVAGDMMGGHA-VRILGWGVENGTPYW---------- 89

Query: 184 SAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
                      L+G             +++   +GD G  KILRG++   IES +   +P
Sbjct: 90  -----------LVG-------------NSWNTDWGDNGFFKILRGQDHCGIESEIVAGIP 125


>gi|194753202|ref|XP_001958906.1| GF12327 [Drosophila ananassae]
 gi|190620204|gb|EDV35728.1| GF12327 [Drosophila ananassae]
          Length = 431

 Score = 60.8 bits (146), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 62/243 (25%), Positives = 86/243 (35%), Gaps = 76/243 (31%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   + W ++HK+G+V       +  C P          YT     CK     +   
Sbjct: 252 CDGGHLDAAWRFLHKKGVV-------DDSCYP----------YTQQRDTCKIRHNSRSLK 294

Query: 62  HTRCT-NDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
              C  + N  R  F   Y     Y +N E  DI  EI  +GPV A M +Y D FSY  G
Sbjct: 295 ANGCRPSPNVDRDSF---YTVGPAYTLNRE-GDIMAEIYHSGPVQATMRVYRDFFSYSGG 350

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
            Y                         +       + +VK+VGWGEE+            
Sbjct: 351 IYRQ---------------------TAANRGAPQGFHSVKLVGWGEEH------------ 377

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                                NG  YW   +++G  +G++G  +ILRG NE  IE  V  
Sbjct: 378 ---------------------NGDKYWIAANSWGPWWGERGYFRILRGSNECGIEEYVLA 416

Query: 241 ALP 243
           + P
Sbjct: 417 SWP 419


>gi|116779190|gb|ABK21175.1| unknown [Picea sitchensis]
 gi|148907952|gb|ABR17096.1| unknown [Picea sitchensis]
 gi|224284884|gb|ACN40172.1| unknown [Picea sitchensis]
          Length = 350

 Score = 60.5 bits (145), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 60/245 (24%), Positives = 86/245 (35%), Gaps = 80/245 (32%)

Query: 2   CSSGISSSTWVWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
           C  G   S W +    G+VT     +    GCQ                P C+ L  P P
Sbjct: 164 CDGGYPLSAWQYFISTGVVTAECDPYFDEAGCQ---------------HPGCEPL-YPTP 207

Query: 60  KCHTRCTNDNYGRGFFQDKYRFK-RYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
           +C  +C ++N   G   +  RF    Y +  +  DI  E+   GPV  +  +Y D   YK
Sbjct: 208 QCVKQCKDENQNWG---NSKRFSATAYRITSKPYDIMAEVYTKGPVEVDFLVYEDFAHYK 264

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
           SG                ++ Y +G        + +    VK++GWG ENG  YW +   
Sbjct: 265 SG----------------VYKYITG--------DFLGGHAVKLIGWGTENGTDYWLVANS 300

Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
           +  +               WGE+                   G  KI RG NE  IE  V
Sbjct: 301 WNTA---------------WGED-------------------GYFKIARGSNECSIEEDV 326

Query: 239 NGALP 243
              +P
Sbjct: 327 VAGMP 331


>gi|195488613|ref|XP_002092389.1| GE11695 [Drosophila yakuba]
 gi|194178490|gb|EDW92101.1| GE11695 [Drosophila yakuba]
          Length = 431

 Score = 60.5 bits (145), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 64/245 (26%), Positives = 90/245 (36%), Gaps = 80/245 (32%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   + W ++HK+G+V       +  C P          YT     CK     +   
Sbjct: 253 CEGGHLDAAWRYLHKKGVV-------DESCYP----------YTQQRDTCKIRHNSRSLR 295

Query: 62  HTRC-TNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
              C T  N  R  F   Y     Y +N E ADI  EI  +GPV A              
Sbjct: 296 ANGCQTPYNVDRDTF---YTVGPAYSLNRE-ADIMAEIFHSGPVQA-------------- 337

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEI--VAYATVKIVGWGEENGRPYWTIVRV 178
                     M +  D F+Y  GVY  +A+  +    + +VK+VGWGEE+          
Sbjct: 338 ---------TMRVNRDFFAYAGGVYRQTAANRMAPTGFHSVKLVGWGEEH---------- 378

Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                  NG  YW   +++G  +G++G  +ILRG NE  IE  V
Sbjct: 379 -----------------------NGEKYWIAANSWGPWWGERGYFRILRGSNECGIEEYV 415

Query: 239 NGALP 243
             + P
Sbjct: 416 LASWP 420


>gi|30678927|ref|NP_849281.1| cathepsin B [Arabidopsis thaliana]
 gi|3859606|gb|AAC72872.1| contains similarity to cysteine proteases (Pfam: PF00112,
           E=1.3e-79, N=1) [Arabidopsis thaliana]
 gi|7268205|emb|CAB77732.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|332656653|gb|AEE82053.1| cathepsin B [Arabidopsis thaliana]
          Length = 359

 Score = 60.5 bits (145), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 63/244 (25%), Positives = 88/244 (36%), Gaps = 77/244 (31%)

Query: 2   CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
           C  G   + W +    G+VT     +  NTGC               S P C+  A P P
Sbjct: 171 CDGGYPIAAWQYFSYSGVVTEECDPYFDNTGC---------------SHPGCEP-AYPTP 214

Query: 60  KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
           KC  +C +DN  + + + K+     Y V     DI  E+ KNGPV  +  +Y D F++  
Sbjct: 215 KCSRKCVSDN--KLWSESKHYSVSTYTVKSNPQDIMAEVYKNGPVEVSFTVYED-FAH-- 269

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
                               YKSGVY     + I  +A VK++GWG  +           
Sbjct: 270 --------------------YKSGVYKHITGSNIGGHA-VKLIGWGTSS----------- 297

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
                                  G  YW + + +   +GD G   I RG NE  IE    
Sbjct: 298 ----------------------EGEDYWLMANQWNRGWGDDGYFMIRRGTNECGIEDEPV 335

Query: 240 GALP 243
             LP
Sbjct: 336 AGLP 339


>gi|157058753|gb|ABV03134.1| cathepsin B-84 [Acyrthosiphon pisum]
          Length = 230

 Score = 60.5 bits (145), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 36/126 (28%), Positives = 61/126 (48%), Gaps = 14/126 (11%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP-- 59
           C+ G     W +  + G+VTGG + +  GCQP   PPC        + E     + QP  
Sbjct: 113 CNGGYPLKAWQYFKRHGVVTGGDYDTTDGCQPYRVPPC------VKDDEGHNSCSGQPTE 166

Query: 60  ---KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFS 116
              KC  +C  D+    + ++ Y+ K  Y++ +    +Q++ M  GP+ A+  +Y D  +
Sbjct: 167 RNHKCSKKCYGDD-TIDYKKNHYKTKDAYYLKNTT--MQKDTMVYGPIEASFDVYDDFMN 223

Query: 117 YKSGKY 122
           Y+SG Y
Sbjct: 224 YESGVY 229


>gi|18411686|ref|NP_567215.1| cathepsin B [Arabidopsis thaliana]
 gi|13877861|gb|AAK44008.1|AF370193_1 putative cathepsin B cysteine protease [Arabidopsis thaliana]
 gi|17473834|gb|AAL38343.1| unknown protein [Arabidopsis thaliana]
 gi|21281113|gb|AAM45063.1| putative cathepsin B cysteine protease [Arabidopsis thaliana]
 gi|21554165|gb|AAM63244.1| cathepsin B-like cysteine protease, putative [Arabidopsis thaliana]
 gi|24417490|gb|AAN60355.1| unknown [Arabidopsis thaliana]
 gi|24899725|gb|AAN65077.1| unknown protein [Arabidopsis thaliana]
 gi|51968702|dbj|BAD43043.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51969104|dbj|BAD43244.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51969220|dbj|BAD43302.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970472|dbj|BAD43928.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970630|dbj|BAD44007.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970704|dbj|BAD44044.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970802|dbj|BAD44093.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51970974|dbj|BAD44179.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51971008|dbj|BAD44196.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|51971116|dbj|BAD44250.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|62320144|dbj|BAD94342.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|110740287|dbj|BAF02040.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
 gi|332656652|gb|AEE82052.1| cathepsin B [Arabidopsis thaliana]
          Length = 359

 Score = 60.5 bits (145), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 63/244 (25%), Positives = 88/244 (36%), Gaps = 77/244 (31%)

Query: 2   CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
           C  G   + W +    G+VT     +  NTGC               S P C+  A P P
Sbjct: 171 CDGGYPIAAWQYFSYSGVVTEECDPYFDNTGC---------------SHPGCEP-AYPTP 214

Query: 60  KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
           KC  +C +DN  + + + K+     Y V     DI  E+ KNGPV  +  +Y D F++  
Sbjct: 215 KCSRKCVSDN--KLWSESKHYSVSTYTVKSNPQDIMAEVYKNGPVEVSFTVYED-FAH-- 269

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
                               YKSGVY     + I  +A VK++GWG  +           
Sbjct: 270 --------------------YKSGVYKHITGSNIGGHA-VKLIGWGTSS----------- 297

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
                                  G  YW + + +   +GD G   I RG NE  IE    
Sbjct: 298 ----------------------EGEDYWLMANQWNRGWGDDGYFMIRRGTNECGIEDEPV 335

Query: 240 GALP 243
             LP
Sbjct: 336 AGLP 339


>gi|195026034|ref|XP_001986167.1| GH20676 [Drosophila grimshawi]
 gi|193902167|gb|EDW01034.1| GH20676 [Drosophila grimshawi]
          Length = 432

 Score = 60.5 bits (145), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 43/152 (28%), Positives = 59/152 (38%), Gaps = 54/152 (35%)

Query: 92  ADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASA 151
           ADI  EI  +GPV A M +Y D FSY SG Y                      +  +   
Sbjct: 324 ADIMAEIYHSGPVQATMTVYRDFFSYSSGVYQ---------------------HTAANRG 362

Query: 152 EIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVS 211
               + +VK+VGWGEE+                                 NG  YW   +
Sbjct: 363 AATGFHSVKLVGWGEEH---------------------------------NGVKYWIAAN 389

Query: 212 TFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
           ++G  +G++G  +ILRG NE  IE  V  + P
Sbjct: 390 SWGPWWGERGYFRILRGSNECGIEEYVLASWP 421


>gi|12060418|dbj|BAB20596.1| ARG1 [Mus musculus]
 gi|71059879|emb|CAJ18483.1| Lcn7 [Mus musculus]
          Length = 415

 Score = 60.5 bits (145), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 63/248 (25%), Positives = 103/248 (41%), Gaps = 72/248 (29%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPEC----KTLATP 57
           C  G     W ++ +RG+V+         C P S    N A+ T   P C    + +   
Sbjct: 218 CRGGRLDGAWWFLRRRGVVS-------DNCYPFSGREQNEASPT---PRCMMHSRAMGRG 267

Query: 58  QPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
           + +  +RC N   G+    D Y+    Y +  +  +I +E+M+NGPV A M ++      
Sbjct: 268 KRQATSRCPN---GQVDSNDIYQVTPAYRLGSDEKEIMKELMENGPVQALMEVHE----- 319

Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
                             D F Y+ G+Y+ +  ++                GRP     R
Sbjct: 320 ------------------DFFLYQRGIYSHTPVSQ----------------GRP--EQYR 343

Query: 178 VYAVSASAEIVAYATVKLIGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEA 232
            +            +VK+ GWGEE   +GR   YWT  +++G  +G++G  +I+RG NE 
Sbjct: 344 RHGTH---------SVKITGWGEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGTNEC 394

Query: 233 IIESLVNG 240
            IE+ V G
Sbjct: 395 DIETFVLG 402


>gi|321446975|gb|EFX60976.1| hypothetical protein DAPPUDRAFT_274869 [Daphnia pulex]
          Length = 71

 Score = 60.5 bits (145), Expect = 9e-07,   Method: Composition-based stats.
 Identities = 27/70 (38%), Positives = 41/70 (58%)

Query: 175 IVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 234
           +VR    +   + V    ++++GWG E G PYW I + +   +GD G IK+LRG++   I
Sbjct: 1   MVRRLPTNVHGKAVGGHAIRILGWGVEEGVPYWLIANNWNTDWGDNGYIKLLRGKDHCGI 60

Query: 235 ESLVNGALPK 244
           ES + G LPK
Sbjct: 61  ESQITGGLPK 70


>gi|289724789|gb|ADD18342.1| putative cysteine proteinase TIN-ag [Glossina morsitans morsitans]
          Length = 387

 Score = 60.5 bits (145), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 60/246 (24%), Positives = 92/246 (37%), Gaps = 82/246 (33%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G   + W ++HK+G+V       +  C P          Y      CK    P    
Sbjct: 208 CNGGHLDAAWRYLHKQGVV-------DESCYP----------YVGYRDACKI---PH--- 244

Query: 62  HTRCTNDNYGRGFF----QDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
           ++R   +N  R +      + Y     Y +N+E  DI  EI  +GPV A + +Y D FSY
Sbjct: 245 NSRSLRNNGCRSYSGVDRDELYTVGPAYSLNNET-DIMAEIFMSGPVQATLTVYRDFFSY 303

Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
             G Y                      +  ++    V + +VK++GWGEE+         
Sbjct: 304 SGGIY---------------------RHTAASRGSPVGFHSVKLIGWGEEH--------- 333

Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
                                   +G  YW   +++G  +G+ G  +ILRG NE  IE  
Sbjct: 334 ------------------------DGNKYWIATNSWGTWWGEHGNFRILRGSNECGIEEY 369

Query: 238 VNGALP 243
           V  A P
Sbjct: 370 VLAAWP 375


>gi|324105223|gb|ADY18374.1| cathepsin B [Glycera tridactyla]
          Length = 117

 Score = 60.5 bits (145), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 36/118 (30%), Positives = 58/118 (49%), Gaps = 3/118 (2%)

Query: 5   GISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTR 64
           G   S W +    G+VTGG ++++ GC+P + P C H +   + P C +   P P+C  +
Sbjct: 1   GFPRSAWEYFKVTGIVTGGQYNTHEGCRPYTIPKCEH-HVNGTLPPCSSTIKPTPRCERK 59

Query: 65  CTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKY 122
           C    Y   + + K+     Y V  + A I+QEI KNG   +  +  +D    +SG Y
Sbjct: 60  C-ESGYSTDYQKXKHHGVTVYNVESDEAQIRQEIYKNG-QRSCFHRLADFPQLQSGVY 115


>gi|403293251|ref|XP_003937634.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
           [Saimiri boliviensis boliviensis]
          Length = 436

 Score = 60.1 bits (144), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 67/250 (26%), Positives = 98/250 (39%), Gaps = 76/250 (30%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVS---------FPPCNHANYTTSEPECK 52
           C  G     W ++ +RG+V+         C P S          PPC    ++ +    K
Sbjct: 239 CRGGRLDGAWWFLRRRGVVS-------DHCYPFSGRERDKAGPAPPC--MMHSRAMGRGK 289

Query: 53  TLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYS 112
             AT    C     N+N       + Y+    Y +     +I +E+M+NGPV A M ++ 
Sbjct: 290 RQATAH--CPNGHVNNN-------NIYQVTPAYRLGSNDTEIMKELMENGPVQALMEVHE 340

Query: 113 DIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPY 172
           D F YK G Y + PV     L       + G +            +VKI GWGEE  RP 
Sbjct: 341 DFFLYKGGIYSHTPV----NLGRPERYRRHGTH------------SVKITGWGEET-RP- 382

Query: 173 WTIVRVYAVSASAEIVAYATVKLIGWGEENGR--PYWTIVSTFGEQFGDKGTIKILRGRN 230
                                        +GR   YWT  +++G  +G++G  +I+RG N
Sbjct: 383 -----------------------------DGRKLKYWTAANSWGPAWGERGHFRIVRGVN 413

Query: 231 EAIIESLVNG 240
           E  IES V G
Sbjct: 414 ECDIESFVLG 423


>gi|149030260|gb|EDL85316.1| rCG52258, isoform CRA_c [Rattus norvegicus]
          Length = 130

 Score = 60.1 bits (144), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 45/175 (25%), Positives = 67/175 (38%), Gaps = 58/175 (33%)

Query: 70  YGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVA 129
           Y   + +DK+     Y V+D   +I  EI KNGPV     ++SD  +YKSG Y +     
Sbjct: 6   YSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKH----- 60

Query: 130 NMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVA 189
                               + +++    ++I+GWG ENG PYW +   + V        
Sbjct: 61  -------------------EAGDVMGGHAIRILGWGIENGVPYWLVANSWNV-------- 93

Query: 190 YATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
                                      +GD G  KILRG N   IES +   +P+
Sbjct: 94  --------------------------DWGDNGFFKILRGENHCGIESEIVAGIPR 122


>gi|145540170|ref|XP_001455775.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124423583|emb|CAK88378.1| unnamed protein product [Paramecium tetraurelia]
          Length = 500

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 37/122 (30%), Positives = 57/122 (46%), Gaps = 25/122 (20%)

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y NGPV+ N     D   Y+SG+Y   A  +           W  +  RP W  V     
Sbjct: 382 YTNGPVIMNFEPSYDFMYYESGIYHSVAEHD-----------WSTQE-RPEWEKVD---- 425

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                     +V   GWGEE+G  +W + +++G Q+G+ G+ ++ RG +E+ IES+   A
Sbjct: 426 ---------HSVLCYGWGEEDGVKFWLLQNSWGSQWGENGSFRMKRGVDESAIESMAEAA 476

Query: 242 LP 243
            P
Sbjct: 477 DP 478


>gi|159950|gb|AAA29435.1| cathepsin B-like cysteine protease, partial [Ostertagia ostertagi]
          Length = 105

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 34/95 (35%), Positives = 48/95 (50%), Gaps = 24/95 (25%)

Query: 82  KRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYK 141
           K+ Y + + V  IQ++IMKNGPVVA   +Y D   Y+SG            +Y      K
Sbjct: 1   KKAYQLKNSVKAIQKDIMKNGPVVATYTVYEDFAHYRSG------------IYKHKAGRK 48

Query: 142 SGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
           +G++A            VK++GWGEE G PYW + 
Sbjct: 49  TGLHA------------VKVIGWGEEKGTPYWIVA 71


>gi|302764096|ref|XP_002965469.1| hypothetical protein SELMODRAFT_143272 [Selaginella moellendorffii]
 gi|300166283|gb|EFJ32889.1| hypothetical protein SELMODRAFT_143272 [Selaginella moellendorffii]
          Length = 331

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 62/248 (25%), Positives = 91/248 (36%), Gaps = 76/248 (30%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQP-VSFPPCNHANYTTSEPECKTLATPQPK 60
           C  G   + W +  + G+VT       + C P      C H      EPE  T     P 
Sbjct: 155 CEGGYPYAAWEYFAQTGVVT-------SQCDPYFDGKGCKHPG---CEPEYDT-----PV 199

Query: 61  CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           C  +C ++   R     K+   + Y VN ++ DIQ EI KNGPV  +  +Y D   YKSG
Sbjct: 200 CVKQCVDNEQWR---DSKHFTVQTYAVNSDIYDIQAEIYKNGPVEVSYTVYEDFAHYKSG 256

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
                       +Y  +F    G +A            VK +GWG               
Sbjct: 257 ------------VYKHVFGQVLGGHA------------VKFIGWGT-------------- 278

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                               ++G+ YW + +++   +G+ G  +I RG NE  IES    
Sbjct: 279 -------------------TDDGKDYWIVANSWNRSWGEDGFFQISRGSNECGIESEPVA 319

Query: 241 ALPKDNYG 248
            +P    G
Sbjct: 320 GIPLKKTG 327


>gi|403293249|ref|XP_003937633.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
           [Saimiri boliviensis boliviensis]
          Length = 467

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 67/250 (26%), Positives = 98/250 (39%), Gaps = 76/250 (30%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVS---------FPPCNHANYTTSEPECK 52
           C  G     W ++ +RG+V+         C P S          PPC    ++ +    K
Sbjct: 270 CRGGRLDGAWWFLRRRGVVS-------DHCYPFSGRERDKAGPAPPC--MMHSRAMGRGK 320

Query: 53  TLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYS 112
             AT    C     N+N       + Y+    Y +     +I +E+M+NGPV A M ++ 
Sbjct: 321 RQATAH--CPNGHVNNN-------NIYQVTPAYRLGSNDTEIMKELMENGPVQALMEVHE 371

Query: 113 DIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPY 172
           D F YK G Y + PV     L       + G +            +VKI GWGEE  RP 
Sbjct: 372 DFFLYKGGIYSHTPV----NLGRPERYRRHGTH------------SVKITGWGEET-RP- 413

Query: 173 WTIVRVYAVSASAEIVAYATVKLIGWGEENGR--PYWTIVSTFGEQFGDKGTIKILRGRN 230
                                        +GR   YWT  +++G  +G++G  +I+RG N
Sbjct: 414 -----------------------------DGRKLKYWTAANSWGPAWGERGHFRIVRGVN 444

Query: 231 EAIIESLVNG 240
           E  IES V G
Sbjct: 445 ECDIESFVLG 454


>gi|71656032|ref|XP_816569.1| cysteine peptidase C (CPC) [Trypanosoma cruzi strain CL Brener]
 gi|70881707|gb|EAN94718.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi]
          Length = 333

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 49/175 (28%), Positives = 73/175 (41%), Gaps = 37/175 (21%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G     W +    G+V+         CQP  FP C H   ++    C       P C
Sbjct: 160 CNGGYPEVAWEYYAVHGIVS-------EYCQPYPFPSCAHHVNSSDLSPCSG-EYDTPTC 211

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           ++ CT+    +     KYR    Y ++ E    ++E++ NGP   +  +Y+D  +Y  G 
Sbjct: 212 NSTCTD----KKVPLIKYRGNTSYLLSGE-ESFKRELLLNGPFEVSFSVYADFLAYTGGV 266

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
           Y +   VA  +L         G +A            V+IVGWGE NG PYW I 
Sbjct: 267 YKH---VAGTFL---------GGHA------------VRIVGWGELNGEPYWKIA 297


>gi|290973351|ref|XP_002669412.1| predicted protein [Naegleria gruberi]
 gi|284082959|gb|EFC36668.1| predicted protein [Naegleria gruberi]
          Length = 488

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 38/115 (33%), Positives = 51/115 (44%), Gaps = 26/115 (22%)

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y  GP+     +Y D F+YK GVY  S + +      +   GW E N             
Sbjct: 390 YHGGPLAIAFEVYDDFFNYKGGVYTHSTALK----TKIAEPGWEETN------------- 432

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
                      V L+GWGEENG PYW + +++G  +G  G  KI RG +E   ES
Sbjct: 433 ---------HAVLLVGWGEENGVPYWLVKNSWGTSWGINGFFKIKRGTDECDCES 478



 Score = 41.2 bits (95), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 34/140 (24%), Positives = 54/140 (38%), Gaps = 19/140 (13%)

Query: 43  NYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNG 102
           +Y  +E  C     P     + C  D   +  +   Y +   ++      ++  E+   G
Sbjct: 338 DYGLAEESCD----PYKGVDSVCKKDQCPKRAYGTNYAYTGGFYGATNAKNMMYELYHGG 393

Query: 103 PVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIV 162
           P+     +Y D F+YK G          +Y +S     K          E   +A V +V
Sbjct: 394 PLAIAFEVYDDFFNYKGG----------VYTHSTALKTK----IAEPGWEETNHA-VLLV 438

Query: 163 GWGEENGRPYWTIVRVYAVS 182
           GWGEENG PYW +   +  S
Sbjct: 439 GWGEENGVPYWLVKNSWGTS 458


>gi|356572872|ref|XP_003554589.1| PREDICTED: cathepsin B-like [Glycine max]
          Length = 356

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 63/244 (25%), Positives = 88/244 (36%), Gaps = 77/244 (31%)

Query: 2   CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
           C  G     W ++   G+VT     +    GC   S P C        EP  +T     P
Sbjct: 168 CDGGYPLYAWQYLAHHGVVTEECDPYFDQIGC---SHPGC--------EPAYRT-----P 211

Query: 60  KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
           KC  +C + N  + + + K+     Y V+ +  DI  E+                     
Sbjct: 212 KCVKKCVSGN--QVWKKSKHYSVNAYRVSSDPHDIMTEV--------------------- 248

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
             Y NGPV     +Y D   YKSGVY      E+  +A VK++GWG              
Sbjct: 249 --YKNGPVEVAFTVYEDFAHYKSGVYKHITGYELGGHA-VKLIGWGT------------- 292

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
                                E+G  YW + + +  ++GD G  KI RG NE  IE  V 
Sbjct: 293 --------------------TEDGEDYWLLANQWNREWGDDGYFKIRRGTNECGIEEDVT 332

Query: 240 GALP 243
             LP
Sbjct: 333 AGLP 336


>gi|328702238|ref|XP_001943280.2| PREDICTED: cathepsin B-like cysteine proteinase 5-like
           [Acyrthosiphon pisum]
          Length = 328

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 46/171 (26%), Positives = 72/171 (42%), Gaps = 44/171 (25%)

Query: 10  TWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDN 69
            W ++   GLV+GG ++++ GCQP   PP             + +   + K +T C +  
Sbjct: 161 VWEYLKSHGLVSGGKYNTSDGCQPSKIPPIE-----------EYMEYSEIKNYT-CNDHC 208

Query: 70  YGR---GFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGP 126
           YG     +  D  +   YY V  E  DIQ+E+   GPV    Y+  DIF+         P
Sbjct: 209 YGNKTINYNDDHVKVSNYYQVQYE--DIQEEVQNYGPVSVEFYIRDDIFT---------P 257

Query: 127 VVANMYLYSDIFSYKSGVYAVSASAEIVAY-ATVKIVGWGEENGRPYWTIV 176
            ++                 ++   +   Y   VK++GWG ENG  YW +V
Sbjct: 258 FLS-----------------INPRFQRRKYKGYVKLIGWGVENGEDYWLLV 291


>gi|224064400|ref|XP_002301457.1| predicted protein [Populus trichocarpa]
 gi|222843183|gb|EEE80730.1| predicted protein [Populus trichocarpa]
          Length = 357

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 61/244 (25%), Positives = 86/244 (35%), Gaps = 77/244 (31%)

Query: 2   CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
           C+ G   S W +    G+VT     +  + GC               S P C+    P P
Sbjct: 169 CNGGYPISAWRYFVHHGVVTEECDPYFDDIGC---------------SHPGCEP-GYPTP 212

Query: 60  KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
           KC  +C N N  + + + K+   + Y ++ +   I  EI                     
Sbjct: 213 KCARKCVNKN--QLWKKSKHYGVKPYRIDSDPESIMAEI--------------------- 249

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
             Y NGPV     +Y D   YKSGVY       +  +A VK++GWG              
Sbjct: 250 --YKNGPVEVAFTVYEDFAHYKSGVYKHITGGMMGGHA-VKLIGWGT------------- 293

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
                                E+G  YW + + +   +GD G  KI RG NE  IE  V 
Sbjct: 294 --------------------SEDGEAYWLLANQWNRGWGDDGYFKIRRGTNECGIEGDVV 333

Query: 240 GALP 243
             LP
Sbjct: 334 AGLP 337


>gi|262217337|gb|ACY38050.1| cathepsin B [Dactylis glomerata]
          Length = 348

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 63/258 (24%), Positives = 89/258 (34%), Gaps = 82/258 (31%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQP-VSFPPCNHANYTTSEPECKTLATPQPK 60
           C  G     W +  + G+VT         C P      C H       P C+  A   PK
Sbjct: 162 CDGGYPIKAWQYFVQSGVVT-------EECDPYFDQVGCKH-------PGCEP-AYDTPK 206

Query: 61  CHTRCTNDNYGRGFFQDKYRFK-RYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
           C  +C   N     +++K  F    Y VN +  DI  E+ KNGPV     +Y D   YKS
Sbjct: 207 CEKKCKVQNQ---VWEEKKHFSINAYRVNSDPHDIMAEVYKNGPVEVAFTVYEDFAHYKS 263

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
           G Y +                         +  ++    VK++GWG  +           
Sbjct: 264 GVYKH------------------------VTGGVMGGHAVKLIGWGTSDA---------- 289

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
                                  G  YW + + +   +GD G  KI+RG+NE  IE  V 
Sbjct: 290 -----------------------GEDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEEVV 326

Query: 240 GALPKD-----NYGVEFG 252
             +P       N+G  FG
Sbjct: 327 AGMPSTKNMAGNHGSAFG 344


>gi|326916361|ref|XP_003204476.1| PREDICTED: tubulointerstitial nephritis antigen-like [Meleagris
           gallopavo]
          Length = 467

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 47/166 (28%), Positives = 70/166 (42%), Gaps = 55/166 (33%)

Query: 79  YRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIF 138
           YR   +Y ++ +  DI +EIM  GPV A M +Y D F YK G Y +              
Sbjct: 354 YRCASHYRISSKETDIMEEIMAKGPVQAIMKVYEDFFLYKEGIYRH-------------- 399

Query: 139 SYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGW 198
           SYK+G    + S        VK++GWG   G+                            
Sbjct: 400 SYKAGSKWKTHS--------VKLLGWGSLPGK---------------------------- 423

Query: 199 GEENG--RPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 242
              NG  + +W   +++G+ +G+ G  +ILRG+NE  IE L+   L
Sbjct: 424 ---NGQKQKFWIAANSWGKYWGENGYFRILRGQNECDIEKLILTTL 466


>gi|86451908|gb|ABC97349.1| cathepsin B [Streblomastix strix]
          Length = 312

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 65/245 (26%), Positives = 93/245 (37%), Gaps = 80/245 (32%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G  S+ + ++   G++          C P     C H       P C T   P PKC
Sbjct: 144 CNGGWMSTAFGFMQSNGIL-------GEDCIPYQMGKCKH-------PGCSTW--PTPKC 187

Query: 62  H-TRC-TNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
           + T+C  ND        + +     Y V    ADIQ+EI +NGPV A+  +Y D+  Y+S
Sbjct: 188 NKTKCYPNDTKS----TELWHAASSYSVRSNEADIQKEIYENGPVTASFAVYEDLSVYQS 243

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
           G            +Y  +     G++A            +K+VGWG  +G  YWTIV  +
Sbjct: 244 G------------VYQHVTGGFEGLHA------------IKVVGWGILDGVKYWTIVNSW 279

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
           A                                  E +G  G + I RG +E  IES V 
Sbjct: 280 A----------------------------------EDWGFDGLLLIRRGVDECGIESDVV 305

Query: 240 GALPK 244
              PK
Sbjct: 306 AGQPK 310


>gi|428168267|gb|EKX37214.1| hypothetical protein GUITHDRAFT_78289 [Guillardia theta CCMP2712]
          Length = 224

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 45/156 (28%), Positives = 62/156 (39%), Gaps = 62/156 (39%)

Query: 87  VNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYA 146
           + D V  IQ EI+ NGPV A  ++YSD  +Y  G Y                        
Sbjct: 125 IQDNVRQIQSEILSNGPVFAAFWVYSDFMAYTGGVY------------------------ 160

Query: 147 VSASAEIVAYA-----TVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEE 201
            SAS E +A        V +VGWG +                                +E
Sbjct: 161 -SASKEALAQGKTGGHAVMMVGWGTD--------------------------------KE 187

Query: 202 NGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
            G+ YW + +++ E++GDKG  KI RG +E  IESL
Sbjct: 188 TGQDYWLLQNSWSEKWGDKGRFKIKRGVDECGIESL 223


>gi|363732245|ref|XP_419905.3| PREDICTED: tubulointerstitial nephritis antigen [Gallus gallus]
          Length = 467

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 48/166 (28%), Positives = 70/166 (42%), Gaps = 55/166 (33%)

Query: 79  YRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIF 138
           YR   +Y V+ +  DI +EIM  GPV A M +Y D F YK G Y +              
Sbjct: 354 YRCGSHYRVSSKETDIMEEIMAKGPVQAIMKVYEDFFLYKEGIYRH-------------- 399

Query: 139 SYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGW 198
           SYK+G    + S        VK++GWG   G+                            
Sbjct: 400 SYKAGSKWKTHS--------VKLLGWGSLPGK---------------------------- 423

Query: 199 GEENG--RPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 242
              NG  + +W   +++G+ +G+ G  +ILRG+NE  IE L+   L
Sbjct: 424 ---NGQKQKFWIAANSWGKYWGENGYFRILRGQNECDIEKLILTTL 466


>gi|134023803|gb|AAI35570.1| LOC100124858 protein [Xenopus (Silurana) tropicalis]
          Length = 484

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 56/239 (23%), Positives = 89/239 (37%), Gaps = 56/239 (23%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W ++ +RG+V+         C P +    N  +      + +++   + + 
Sbjct: 288 CRGGRVDGAWWYLRRRGVVS-------EPCYPFTSLNTN-GHSAPCMMQSRSMGRGKRQA 339

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C N  Y      + Y+    Y +     DI +E+ +NGPV A M ++ D F YKSG 
Sbjct: 340 TNNCPNQYYSS---NEIYQSTPAYRLASSEKDIMKELYENGPVQAIMEVHEDFFMYKSGI 396

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y   PV             + G +            +VKI GWGEE GR           
Sbjct: 397 YRRTPVTER----EPEHHRRHGTH------------SVKITGWGEERGR----------- 429

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                             +     YW   +++G  +G+ G  +I RG NE  IE+ + G
Sbjct: 430 ------------------DGQTHKYWLAANSWGRDWGEDGYFRIARGENECEIETFIVG 470


>gi|15723280|gb|AAL06328.1| cathepsin B-like protease [Trypanosoma cruzi]
          Length = 208

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 51/176 (28%), Positives = 76/176 (43%), Gaps = 39/176 (22%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPK 60
           C+ G     W +    G+V+         CQP  FP C +H N +   P      TP   
Sbjct: 65  CNGGYPEVAWEYYAVHGIVS-------EYCQPYPFPSCAHHVNSSDLSPCSGEYDTPT-- 115

Query: 61  CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           C++ CT+    +     KYR    Y ++ E    ++E++ NGP   +  +Y+D  +Y  G
Sbjct: 116 CNSTCTD----KKVPLIKYRGNTSYLLSGE-ESFKRELLLNGPFEVSFSVYADFLAYTGG 170

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
            Y +   VA ++L         G +A            V+IVGWGE NG PYW I 
Sbjct: 171 VYKH---VAGIFL---------GGHA------------VRIVGWGELNGEPYWKIA 202


>gi|130502070|ref|NP_001076255.1| tubulointerstitial nephritis antigen [Oryctolagus cuniculus]
 gi|818411|gb|AAC48477.1| tubulointerstitial nephritis antigen [Oryctolagus cuniculus]
          Length = 474

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 64/246 (26%), Positives = 98/246 (39%), Gaps = 69/246 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+SG     W ++ KRGLV+         C P+ F   N +N T +     + A  + K 
Sbjct: 282 CNSGSIDRAWWYLRKRGLVSHA-------CYPL-FKDQNISNNTCAM---TSKADGRGKR 330

Query: 62  H-TRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           H TR   +N  +      Y+    Y V+    +I +EIM+NGPV A M ++ D F YK+G
Sbjct: 331 HATRPCPNNIEKS--NRIYQCSPPYRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTG 388

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
            Y                      + +S + E   Y  ++                    
Sbjct: 389 IYR---------------------HVISTNEESEKYRKLQT------------------- 408

Query: 181 VSASAEIVAYATVKLIGWGEENG-----RPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
                       VKL GWG   G       +W   +++G+ +G+ G  +ILRG NE+ IE
Sbjct: 409 ----------HAVKLTGWGTLKGARGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIE 458

Query: 236 SLVNGA 241
            L+  A
Sbjct: 459 KLIIAA 464


>gi|21693|emb|CAA46810.1| cathepsin B [Triticum aestivum]
          Length = 305

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 59/245 (24%), Positives = 85/245 (34%), Gaps = 79/245 (32%)

Query: 2   CSSGISSSTWVWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
           C  G   S W +  + G+VT     +    GC+    P C        EP     A P P
Sbjct: 125 CDGGYPISAWQYFVQNGVVTDECDPYFDQVGCK---HPGC--------EP-----AYPTP 168

Query: 60  KCHTRCTNDNYGRGFFQDKYRFK-RYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
            C  +C   N     +++K  F    Y VN +  DI  E+  NGPV     +Y D   YK
Sbjct: 169 VCEKKCKVQNQ---VWEEKKHFSINAYQVNSDPHDIMAEVYNNGPVEVAFTVYEDFAHYK 225

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
           SG Y +                         +  ++    VK++GWG  +          
Sbjct: 226 SGVYKH------------------------ITGGVMGGHAVKLIGWGTSDA--------- 252

Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                   G  YW + + +   +GD G  KI+RG+NE  IE  V
Sbjct: 253 ------------------------GEDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEDV 288

Query: 239 NGALP 243
              +P
Sbjct: 289 TAGMP 293


>gi|3088522|gb|AAD03404.1| cathepsin B-like protease precursor [Trypanosoma cruzi]
 gi|407859283|gb|EKG06969.1| cysteine peptidase C (CPC) [Trypanosoma cruzi]
          Length = 333

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 45/174 (25%), Positives = 72/174 (41%), Gaps = 37/174 (21%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G     W +    G+V+         CQP  FP C H   ++    C       P C
Sbjct: 160 CNGGYPEVAWEYYAVHGIVS-------EYCQPYPFPSCAHHVNSSDLSPCSG-EYDTPTC 211

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           ++ CT+    +     KYR    Y ++ E    ++E++ NGP   +  +Y+D  +Y  G 
Sbjct: 212 NSTCTD----KKIPLIKYRGNTSYILSGE-ESFKRELLLNGPFEVSFSVYADFVAYTGG- 265

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
                          ++ + +GV+        +    V+IVGWGE NG PYW I
Sbjct: 266 ---------------VYKHVTGVF--------LGGHAVRIVGWGELNGEPYWKI 296


>gi|268572255|ref|XP_002648916.1| Hypothetical protein CBG17829 [Caenorhabditis briggsae]
          Length = 220

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 42/152 (27%), Positives = 61/152 (40%), Gaps = 58/152 (38%)

Query: 85  YWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGV 144
           Y+V   V+ IQ EIM NGPVV    +Y D++ YKSG Y +                    
Sbjct: 115 YYVGMTVSAIQTEIMTNGPVVGVFTMYEDMYKYKSGVYRH-------------------- 154

Query: 145 YAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGR 204
                +  ++    +KI+GWG +NG PY                                
Sbjct: 155 ----TAGRLLGGHAIKIIGWGTQNGIPY-------------------------------- 178

Query: 205 PYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
             W I +++G ++G+ G  KI RG NE  IE+
Sbjct: 179 --WLIANSWGTKWGENGFFKIRRGVNECGIEN 208


>gi|161343857|tpg|DAA06109.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 163

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 51/223 (22%), Positives = 82/223 (36%), Gaps = 69/223 (30%)

Query: 30  GCQPVSFPPCNHANYTTSEPE--------CKTLATPQPKCHTRCTNDNYGRGFFQDKYRF 81
           G QP    PCN A+ T ++P         C       PKC   C N  +   +  D  + 
Sbjct: 1   GRQPWLVQPCN-ASTTAADPSSVLGPHGVCGGDPATTPKCDLSCYNARHEGKYLDDIIKA 59

Query: 82  KRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYK 141
           K+ +  +   A  ++ + K+GP V  M +Y D  +YKSG Y +                 
Sbjct: 60  KKVFTFDGCSA--RKNLRKHGPYVVTMRVYEDFLAYKSGVYHH----------------- 100

Query: 142 SGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEE 201
                   + + +   +V+++GWG E G                                
Sbjct: 101 -------VTGDYLGLLSVRMIGWGLEGG-------------------------------- 121

Query: 202 NGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
             + +W   +++G  +GDKG  KI R  NE  IE+     +PK
Sbjct: 122 --QAFWLFANSWGTSWGDKGFFKIRRFVNERWIENFRYAGVPK 162


>gi|253747613|gb|EET02212.1| Hypothetical protein GL50581_498 [Giardia intestinalis ATCC 50581]
          Length = 807

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 38/135 (28%), Positives = 54/135 (40%), Gaps = 33/135 (24%)

Query: 109 YLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEEN 168
           Y  S + +     Y NGP+  +MYL +D                                
Sbjct: 184 YRLSGVDAMMRDIYQNGPIAVSMYLANDF------------------------------- 212

Query: 169 GRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 228
             P      +Y    + ++     V ++GWGEENG PYW   +T+G  +GD G  +I RG
Sbjct: 213 --PPKDKKSIYVSGPNTKLSGGHAVMIVGWGEENGVPYWDCANTYGTNWGDHGYFRIKRG 270

Query: 229 RNEAIIESLVNGALP 243
            NE  IE+    ALP
Sbjct: 271 SNELKIETWPGAALP 285


>gi|71424150|ref|XP_812694.1| cysteine peptidase C (CPC) [Trypanosoma cruzi strain CL Brener]
 gi|70877506|gb|EAN90843.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi]
          Length = 333

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 49/174 (28%), Positives = 74/174 (42%), Gaps = 37/174 (21%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G     W +    G+V+         CQP  FP C H   ++    C       P C
Sbjct: 160 CNGGYPEVAWEYYAVHGIVS-------EYCQPYPFPSCAHHVNSSDLSPCSG-EYDTPTC 211

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           ++ CT+    +     KYR    Y ++ E    ++E++ NGP   +  +Y+D  +Y  G 
Sbjct: 212 NSTCTD----KKIPLIKYRGNTSYVLSGE-EPFKRELILNGPFEVSFSVYADFVAYTGGV 266

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
           Y +   VA ++L         G +A            V+IVGWGE NG PYW I
Sbjct: 267 YKH---VAGIFL---------GGHA------------VRIVGWGELNGEPYWKI 296


>gi|294890224|ref|XP_002773108.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239878009|gb|EER04924.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 109

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 37/121 (30%), Positives = 55/121 (45%), Gaps = 37/121 (30%)

Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
           +GPV A+  +Y D  +Y+SGVY  ++  E+  +A                          
Sbjct: 25  DGPVSASFIVYEDFLAYRSGVYKHTSGKELGGHA-------------------------- 58

Query: 184 SAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
                    VK+IGWGEE G+ YW +V+++ E +GD G  KI  G  E  I+  + G  P
Sbjct: 59  ---------VKIIGWGEETGQAYWLVVNSWNEDWGDNGLFKIALGNCE--IDDDLLGGTP 107

Query: 244 K 244
           K
Sbjct: 108 K 108


>gi|195346663|ref|XP_002039877.1| GM15657 [Drosophila sechellia]
 gi|194135226|gb|EDW56742.1| GM15657 [Drosophila sechellia]
          Length = 431

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 61/243 (25%), Positives = 85/243 (34%), Gaps = 76/243 (31%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   + W ++HK+G+V       +  C P          YT     CK     +   
Sbjct: 253 CEGGHLDAAWRYLHKKGVV-------DENCYP----------YTQHRDTCKIRHNSRSLR 295

Query: 62  HTRC-TNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
              C T  N  R      Y     Y +N E ADI  EI  +GPV A M +  D F+Y  G
Sbjct: 296 ANGCQTPVNVDRDTL---YTVGPAYSLNRE-ADIMAEIFHSGPVQATMRVNRDFFAYSGG 351

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
            Y                         +    +  + +VK+VGWGEE+            
Sbjct: 352 VYRE---------------------TAANRKALTGFHSVKLVGWGEEH------------ 378

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                                NG  YW   +++G  +G+ G  +ILRG NE  IE  V  
Sbjct: 379 ---------------------NGEKYWIAANSWGSWWGEHGYFRILRGSNECGIEDYVLA 417

Query: 241 ALP 243
           + P
Sbjct: 418 SWP 420


>gi|307175943|gb|EFN65753.1| Uncharacterized peptidase C1-like protein F26E4.3 [Camponotus
           floridanus]
          Length = 443

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 45/149 (30%), Positives = 62/149 (41%), Gaps = 54/149 (36%)

Query: 92  ADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASA 151
            DI QEI+ +GPV A M +Y D                        F Y+SGVY  S SA
Sbjct: 339 TDIMQEILTSGPVQATMRVYQD-----------------------FFVYQSGVYRHSRSA 375

Query: 152 EI--VAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTI 209
           E+    Y +V+I+GWGEE                     +Y    L          YW +
Sbjct: 376 ELHDSGYHSVRIIGWGEEP--------------------SYRGPPL---------KYWLV 406

Query: 210 VSTFGEQFGDKGTIKILRGRNEAIIESLV 238
            +++G  +G+ G  +I +G NE  IES V
Sbjct: 407 ANSWGHNWGENGLFRIQKGTNECEIESYV 435


>gi|294897889|ref|XP_002776090.1| cysteine protease Cys2, putative [Perkinsus marinus ATCC 50983]
 gi|239882699|gb|EER07906.1| cysteine protease Cys2, putative [Perkinsus marinus ATCC 50983]
          Length = 134

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 45/189 (23%), Positives = 72/189 (38%), Gaps = 61/189 (32%)

Query: 59  PKCHTRCTNDNYGRGFFQDKYRFKRYYWVN-DEVADIQQEIMKNGPVVANMYLYSDIFSY 117
           P C + C N  YG  F +D++  +  +       + I++EIM NGP  A   +Y D  SY
Sbjct: 6   PSCSSSCPNAKYGTAFDKDRHYTESLFPSRFGSTSSIKKEIMTNGPTSAAFSVYEDFLSY 65

Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
           KSG Y +                         S   +    V+I+GWG E G        
Sbjct: 66  KSGVYKH------------------------TSGGFLGGHAVEIIGWGTEKG-------- 93

Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
                                       YW +++++ E++GD GT KI++G  +  I+ +
Sbjct: 94  --------------------------VDYWLVMNSWNEEWGDHGTFKIVQG--DCGIDDM 125

Query: 238 VNGALPKDN 246
           +    P  N
Sbjct: 126 ILAGTPAIN 134


>gi|15723276|gb|AAL06326.1| cathepsin B-like protease [Trypanosoma cruzi]
          Length = 208

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 51/176 (28%), Positives = 75/176 (42%), Gaps = 39/176 (22%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPK 60
           C+ G     W +    G+V+         CQP  FP C +H N +   P      TP   
Sbjct: 65  CNGGYPEVAWEYYAVHGIVS-------EYCQPYPFPSCAHHVNSSDLSPCSGEYDTPT-- 115

Query: 61  CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           C++ CT+    +     KYR    Y ++ E    ++E++ NGP   +  +Y+D  +Y  G
Sbjct: 116 CNSTCTD----KKVPLIKYRGNTSYLLSGE-ESFKRELLLNGPFEVSFSVYADFLAYTGG 170

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
            Y +   VA  +L         G +A            V+IVGWGE NG PYW I 
Sbjct: 171 VYKH---VAGTFL---------GGHA------------VRIVGWGELNGEPYWKIA 202


>gi|312283137|dbj|BAJ34434.1| unnamed protein product [Thellungiella halophila]
          Length = 362

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 64/244 (26%), Positives = 88/244 (36%), Gaps = 77/244 (31%)

Query: 2   CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
           C  G   + W +    G+VT     +  +TGC   S P C        EP     A P P
Sbjct: 174 CDGGYPIAAWQYFSYSGVVTEECDPYFDDTGC---SHPGC--------EP-----AYPTP 217

Query: 60  KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
           KC  +C + N  + + Q K+     Y V     DI  E+ KNGPV  +  +Y D F++  
Sbjct: 218 KCMRKCVSGN--QLWSQSKHYSVSTYTVKSNPQDIMAEVYKNGPVEVSFTVYED-FAH-- 272

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
                               YKSGVY     + I  +A VK++GWG              
Sbjct: 273 --------------------YKSGVYKHITGSNIGGHA-VKLIGWGT------------- 298

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
                                + G  YW + + +   +GD G   I RG NE  IE    
Sbjct: 299 --------------------TDEGEDYWLLANQWNRSWGDDGYFMIRRGTNECGIEDEPV 338

Query: 240 GALP 243
             LP
Sbjct: 339 AGLP 342


>gi|431838263|gb|ELK00195.1| Tubulointerstitial nephritis antigen [Pteropus alecto]
          Length = 425

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 57/241 (23%), Positives = 92/241 (38%), Gaps = 59/241 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           CSSG     W ++ KRGLV+              +P     N T +     + +  + K 
Sbjct: 233 CSSGSIDRAWWYLRKRGLVSHAC-----------YPFLKDQNTTNNACAMASRSDGRGKR 281

Query: 62  H-TRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           H T+   +N  +      Y+    Y V+    +I +EI+ NGPV A M ++ D F YKSG
Sbjct: 282 HATKPCPNNIEKS--NRIYQCSPPYRVSSNETEIMKEIIHNGPVQAIMQVHEDFFHYKSG 339

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
                           I+ + +     S   + +    VK+ GWG   G           
Sbjct: 340 ----------------IYRHVTSTNEKSEKYQKLQTHAVKLTGWGTLRG----------- 372

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                              +     +W + +++G  +G+ G  +ILRG NE+ IE L+  
Sbjct: 373 ------------------AQGRKEKFWIVANSWGNSWGENGYFRILRGVNESDIEKLIIA 414

Query: 241 A 241
           A
Sbjct: 415 A 415


>gi|354472325|ref|XP_003498390.1| PREDICTED: tubulointerstitial nephritis antigen [Cricetulus
           griseus]
 gi|344245030|gb|EGW01134.1| Tubulointerstitial nephritis antigen-like [Cricetulus griseus]
          Length = 465

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 61/245 (24%), Positives = 101/245 (41%), Gaps = 66/245 (26%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTT-SEPECKTLATPQPK 60
           C  G     W ++ +RG+V+         C P      N A  ++      + +   + +
Sbjct: 269 CRGGRLDGAWWFLRRRGVVS-------DNCYPFVGREQNEAGTSSRCMMHSRAMGRGKRQ 321

Query: 61  CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
             +RC N   G+    D Y+    Y +  +  +I +E+M+NGPV A M ++         
Sbjct: 322 ATSRCPN---GQVDSNDIYQVTPAYRLGSDEKEIMKELMENGPVQALMEVHE-------- 370

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
                          D F Y+SG+Y+ +  ++                GRP     R + 
Sbjct: 371 ---------------DFFLYQSGIYSHTPISQ----------------GRP--EQYRRHG 397

Query: 181 VSASAEIVAYATVKLIGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
                      +VK+ GWGEE   +GR   YWT  +++G  +G++G  +I+RG NE  IE
Sbjct: 398 TH---------SVKITGWGEEKLPDGRTIKYWTAANSWGPWWGERGHFRIVRGTNECDIE 448

Query: 236 SLVNG 240
           S V G
Sbjct: 449 SFVLG 453


>gi|270012757|gb|EFA09205.1| cathepsin B precursor [Tribolium castaneum]
          Length = 348

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 62/235 (26%), Positives = 87/235 (37%), Gaps = 82/235 (34%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G ++  W +    G+V+GG ++S+ GCQP S     +A  +              KC
Sbjct: 146 CVGGYTAKAWDYYINEGIVSGGDYNSSEGCQPYSKASFQYAVAS--------------KC 191

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C ND Y   +  DK+    +Y +   V  IQ EI+ NGPV+A   +           
Sbjct: 192 VKACQNDKYDVKYDDDKHYGDSFYTLETNVTQIQTEILTNGPVMATFNV----------- 240

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
                       + DI  YKSG          +  + V I+ WG E G P          
Sbjct: 241 ------------FEDIIYYKSG----------IQLSNVSILRWGTEEGVP---------- 268

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGT-IKILRGRNEAIIE 235
                                   YW I +++G  +GD G  IKI RG NE  IE
Sbjct: 269 ------------------------YWLIANSWGTWWGDLGGFIKIKRGTNECAIE 299


>gi|290989996|ref|XP_002677623.1| cathepsin B [Naegleria gruberi]
 gi|284091231|gb|EFC44879.1| cathepsin B [Naegleria gruberi]
          Length = 321

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 47/183 (25%), Positives = 68/183 (37%), Gaps = 61/183 (33%)

Query: 59  PKCHTRCTNDNY---GRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIF 115
           P C + C   N     + ++   +     +   + VADIQQEI  NGPV     +Y D  
Sbjct: 185 PACPSNCNGTNIPISSQLYYAKSFSHISPWMFWERVADIQQEIYTNGPVQGGFSVYQDFM 244

Query: 116 SYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
           +YKSG                ++S+K+G +        +    +KI+GWG E G  YW +
Sbjct: 245 NYKSG----------------VYSHKTGSF--------LGGHAIKIIGWGVEGGVDYWLV 280

Query: 176 VRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
              ++                                    +G  GT KILRG NE  IE
Sbjct: 281 ANSWST----------------------------------DWGIDGTFKILRGHNECGIE 306

Query: 236 SLV 238
             V
Sbjct: 307 DDV 309


>gi|344264196|ref|XP_003404179.1| PREDICTED: tubulointerstitial nephritis antigen [Loxodonta
           africana]
          Length = 476

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 62/252 (24%), Positives = 95/252 (37%), Gaps = 81/252 (32%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAH-------HSNTGCQPVSFPPCNHANYTTSEPECKTL 54
           C+SG     W ++ KRGLV+   +        +N GC          A  + S+   K  
Sbjct: 284 CNSGSVDRAWWYLRKRGLVSHACYPLFKDQNANNNGC----------AMASRSDGRGKRH 333

Query: 55  ATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDI 114
           AT +P     C N+          Y+    Y V+    +I +EIM+NGPV A M ++ D 
Sbjct: 334 AT-KP-----CPNNIEKSNVI---YQCSPPYRVSSNETEIMKEIMQNGPVQAIMQVHEDF 384

Query: 115 FSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWT 174
           F YK+G Y +                                                  
Sbjct: 385 FHYKTGIYRH-------------------------------------------------- 394

Query: 175 IVRVYAVSASAEIVAYATVKLIGWGEENG-----RPYWTIVSTFGEQFGDKGTIKILRGR 229
           ++R    S   + +    VKL GWG   G       +W   +++G+ +G+ G  +ILRG 
Sbjct: 395 VIRTSEESEKYQKLRTHAVKLTGWGMMKGAKGRKEKFWVAANSWGKSWGEDGYFRILRGV 454

Query: 230 NEAIIESLVNGA 241
           NE+ IE L+  A
Sbjct: 455 NESDIEKLIIAA 466


>gi|296198446|ref|XP_002746707.1| PREDICTED: tubulointerstitial nephritis antigen [Callithrix
           jacchus]
          Length = 476

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 57/241 (23%), Positives = 93/241 (38%), Gaps = 59/241 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+SG     W ++ KRGLV+              +P     N T S     + +  + K 
Sbjct: 284 CNSGSIDRAWWYLRKRGLVSHAC-----------YPLFKDQNATNSGCAMASRSDGRGKR 332

Query: 62  H-TRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           H T+   +N  +      Y+    Y V+    +I +EIM+NGPV A M ++ D F YK+G
Sbjct: 333 HATKPCPNNIEKS--NRIYQCSPPYRVSSSETEIMKEIMQNGPVQAIMKVHEDFFHYKTG 390

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
                           I+ + +     S   + +    VK+ GWG   G           
Sbjct: 391 ----------------IYRHVTSTNKESEKFQKLQTHAVKLTGWGTLRG----------- 423

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                              +     +W   +++G+ +G+ G  +ILRG NE+ IE L+  
Sbjct: 424 ------------------AQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIA 465

Query: 241 A 241
           A
Sbjct: 466 A 466


>gi|239799410|dbj|BAH70626.1| ACYPI000012 [Acyrthosiphon pisum]
          Length = 265

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 38/111 (34%), Positives = 54/111 (48%), Gaps = 14/111 (12%)

Query: 11  WVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNY 70
           W ++   GLV+GG +++N GCQP   PP    N  T   E          C  RC  +N 
Sbjct: 168 WEYLKNHGLVSGGKYNTNNGCQPSKIPPI--GNLPTGSYE--------NTCEKRCYGNN- 216

Query: 71  GRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLY-SDIFSYKSG 120
              + QD  + K +Y +  E  DIQ+E+   GPV     ++ +D F YKSG
Sbjct: 217 TINYNQDHVKIKNHYDI--EYEDIQREVQNYGPVSMAFRVFDNDFFLYKSG 265


>gi|115621283|ref|XP_782184.2| PREDICTED: tubulointerstitial nephritis antigen-like
           [Strongylocentrotus purpuratus]
          Length = 450

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 48/169 (28%), Positives = 67/169 (39%), Gaps = 44/169 (26%)

Query: 72  RGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANM 131
           RG   D Y     Y +     DI  EI +NGPV A   + +D F Y  G Y N       
Sbjct: 316 RGVTSDLYLSTPPYRIAAREVDIMTEIYQNGPVQATFNVKNDFFVYNRGVYRN------- 368

Query: 132 YLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYA 191
                    K    A  + ++   + +VKIVGWG +  R  W                Y 
Sbjct: 369 --------VKQEFTASQSDSDQAGWHSVKIVGWGID--RSDW----------------YN 402

Query: 192 TVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
            +K           YW   +++G  +G++G  +I+RG NE  IES V G
Sbjct: 403 PIK-----------YWLCTNSWGRNWGEQGMFRIVRGVNECEIESFVLG 440


>gi|270012758|gb|EFA09206.1| cathepsin B precursor [Tribolium castaneum]
          Length = 326

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 45/167 (26%), Positives = 68/167 (40%), Gaps = 58/167 (34%)

Query: 9   STWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTND 68
           + W +    G+ +GG ++S+ GCQP S     +A  +    EC                 
Sbjct: 153 NAWDYYINEGIASGGDYNSSEGCQPYSESSFQYAEAS----ECV---------------- 192

Query: 69  NYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVV 128
                         ++Y +   VA IQ EI+ NGPV+A   ++ D   +KSG        
Sbjct: 193 --------------KFYTLETNVAQIQMEILTNGPVMAYYNVFEDFACHKSG-------- 230

Query: 129 ANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
                   ++ YKSG +        V   +VK++GWG E G PYW I
Sbjct: 231 --------VYYYKSGKF--------VGRHSVKVIGWGTEEGIPYWLI 261


>gi|290990464|ref|XP_002677856.1| predicted protein [Naegleria gruberi]
 gi|284091466|gb|EFC45112.1| predicted protein [Naegleria gruberi]
          Length = 231

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 56/227 (24%), Positives = 85/227 (37%), Gaps = 75/227 (33%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ GI    + ++HK GLV+              FP  ++   T              KC
Sbjct: 68  CNGGIPGLVFDYIHKDGLVSDAC-----------FPYLSYDGNT------------HVKC 104

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C N N  + F  DK+   + Y V + + D  + +++         +  +I ++    
Sbjct: 105 PDFCYN-NKTKSFKSDKHFADKVYHVGEFLEDKAKRVLE---------IQKEILTH---- 150

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
              GPV A+  +YSD   YKSGVY                                    
Sbjct: 151 ---GPVNADFMVYSDFTVYKSGVYR----------------------------------- 172

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 228
             +        VK+IGWG ENG  YW I +++G  FG +G  KI+RG
Sbjct: 173 HQTGSFEGIHAVKIIGWGTENGVDYWLIANSWGTTFGLQGFFKIVRG 219


>gi|123469339|ref|XP_001317882.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
           [Trichomonas vaginalis G3]
 gi|121900627|gb|EAY05659.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
           [Trichomonas vaginalis G3]
          Length = 241

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 25/52 (48%), Positives = 35/52 (67%)

Query: 192 TVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
            V+LIGWG+ENG  YW +++  G+ +G  GT+ I  G NE +IES + GA P
Sbjct: 187 AVELIGWGKENGVEYWILLNQHGKNWGINGTMHIKMGSNEGLIESFIYGATP 238


>gi|15723274|gb|AAL06325.1| cathepsin B-like protease [Trypanosoma cruzi]
 gi|15723278|gb|AAL06327.1| cathepsin B-like protease [Trypanosoma cruzi]
          Length = 208

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 51/175 (29%), Positives = 76/175 (43%), Gaps = 39/175 (22%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPK 60
           C+ G     W +    G+V+         CQP  FP C +H N +   P      TP   
Sbjct: 65  CNGGYPEVAWEYYAVHGIVS-------EYCQPYPFPSCAHHVNSSDLSPCSGEYDTPT-- 115

Query: 61  CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           C++ CT+    +     KYR    Y ++ E    ++E++ NGP   +  +Y+D  +Y  G
Sbjct: 116 CNSTCTD----KKIPLIKYRGNTSYVLSGE-EPFKRELILNGPFEVSFSVYADFVAYTGG 170

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
            Y +   VA ++L         G +A            V+IVGWGE NG PYW I
Sbjct: 171 VYKH---VAGIFL---------GGHA------------VRIVGWGELNGEPYWKI 201


>gi|357116879|ref|XP_003560204.1| PREDICTED: cathepsin B-like [Brachypodium distachyon]
          Length = 351

 Score = 57.8 bits (138), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 65/262 (24%), Positives = 91/262 (34%), Gaps = 90/262 (34%)

Query: 2   CSSGISSSTWVWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
           C+ G   S W +  ++G+VT     +    GC+    P C        EP  +T     P
Sbjct: 165 CNGGYPISAWRYFRRKGVVTDECDPYFDQVGCK---HPGC--------EPAYRT-----P 208

Query: 60  KCHTRCTNDNY----GRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIF 115
           KC  +C   N      + F  D YR      V+    DI  E+                 
Sbjct: 209 KCEKKCKVQNEVWKEQKHFSVDAYR------VHSNPHDIMAEV----------------- 245

Query: 116 SYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
                 Y NGPV     +Y D   YKSGVY    +  ++    VK++GWG  +       
Sbjct: 246 ------YTNGPVEVAFTVYEDFAHYKSGVYK-HITGGVMGGHAVKLIGWGTSDA------ 292

Query: 176 VRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
                                      G  YW + + +   +GD G  KI+RG+NE  IE
Sbjct: 293 ---------------------------GEDYWLLANQWNRGWGDDGYFKIIRGKNECGIE 325

Query: 236 SLVNGALPKD-----NYGVEFG 252
             V   +P       NY   FG
Sbjct: 326 EDVVAGMPSTKNMARNYDDAFG 347


>gi|16758354|ref|NP_446034.1| tubulointerstitial nephritis antigen-like precursor [Rattus
           norvegicus]
 gi|61213054|sp|Q9EQT5.1|TINAL_RAT RecName: Full=Tubulointerstitial nephritis antigen-like; AltName:
           Full=Glucocorticoid-inducible protein 5; Flags:
           Precursor
 gi|11527795|dbj|BAB18637.1| glucocorticoid-inducible protein [Rattus norvegicus]
          Length = 467

 Score = 57.8 bits (138), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 67/246 (27%), Positives = 99/246 (40%), Gaps = 67/246 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W ++ +RG+V+                      Y  S  E    A+P P+C
Sbjct: 269 CRGGRLDGAWWFLRRRGVVSDNC-------------------YPFSGREQNDEASPTPRC 309

Query: 62  --HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
             H+R      GRG  Q   R       N +V     +I +  PV     L SD      
Sbjct: 310 MMHSR----AMGRGKRQATSRCP-----NSQVD--SNDIYQVTPVYR---LASDEKEIMK 355

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
               NGPV A M ++ D F Y+ G+Y+ +  ++                GRP     R +
Sbjct: 356 ELMENGPVQALMEVHEDFFLYQRGIYSHTPVSQ----------------GRP--EQYRRH 397

Query: 180 AVSASAEIVAYATVKLIGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAII 234
                       +VK+ GWGEE   +GR   YWT  +++G  +G++G  +I+RG NE  I
Sbjct: 398 GTH---------SVKITGWGEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGINECDI 448

Query: 235 ESLVNG 240
           E+ V G
Sbjct: 449 ETFVLG 454


>gi|351704465|gb|EHB07384.1| Tubulointerstitial nephritis antigen [Heterocephalus glaber]
          Length = 475

 Score = 57.4 bits (137), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 61/246 (24%), Positives = 92/246 (37%), Gaps = 70/246 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAH------HSNTGCQPVSFPPCNHANYTTSEPECKTLA 55
           CS G     W ++ KRGLV+   +      +S  GC          A  + S+   K  A
Sbjct: 284 CSGGSIDRAWWYLRKRGLVSHACYPLFKDQNSTNGC----------AMASRSDGRGKRHA 333

Query: 56  TPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIF 115
           T      T C N+          Y+    Y V+     I +EIMKNGPV A M ++ D F
Sbjct: 334 T------TPCPNNIEKSNRI---YQCSPPYRVSSNETQIMKEIMKNGPVQAIMQVHEDFF 384

Query: 116 SYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
            YK+G                I+ + +     S   + +    VK+ GWG   G      
Sbjct: 385 YYKTG----------------IYRHVTSTIEDSEKYQKLRTHAVKLTGWGTLRG------ 422

Query: 176 VRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
                                   +     +W   +++G+ +G+ G  +ILRG NE+ IE
Sbjct: 423 -----------------------AKGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIE 459

Query: 236 SLVNGA 241
            L+  A
Sbjct: 460 KLIIAA 465


>gi|410959397|ref|XP_003986297.1| PREDICTED: tubulointerstitial nephritis antigen [Felis catus]
          Length = 474

 Score = 57.4 bits (137), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 60/242 (24%), Positives = 91/242 (37%), Gaps = 60/242 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPV--SFPPCNHANYTTSEPECKTLATPQP 59
           C+SG     W ++ KRGLV+         C P+  +    NH     S  + +       
Sbjct: 281 CNSGSIDRAWWFLRKRGLVSHA-------CYPLFKNQNATNHGCAMASRSDGRGKRHATK 333

Query: 60  KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
            C       N         Y+    Y V+    +I +EIM+NGPV A M ++ D F YK+
Sbjct: 334 PCPNNIEKSN-------RIYQCSPPYRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKT 386

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
           G Y +    AN          +SG Y        +    VK+ GWG   G          
Sbjct: 387 GIYRHITKKANE---------ESGKY------RKLQTHAVKLTGWGTLKG---------- 421

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
                               +     +W   +++G+ +G+ G  +ILRG NE+ IE L+ 
Sbjct: 422 -------------------AQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLII 462

Query: 240 GA 241
            A
Sbjct: 463 AA 464


>gi|215687149|dbj|BAG90919.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 403

 Score = 57.4 bits (137), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 62/258 (24%), Positives = 88/258 (34%), Gaps = 82/258 (31%)

Query: 2   CSSGISSSTWVWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
           C  G     W +  + G+VT     +    GC+                P C+  A P P
Sbjct: 215 CDGGYPIMAWRYFVRNGVVTDECDPYFDQVGCK---------------HPGCEP-AYPTP 258

Query: 60  KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
            C  +C   N  + + + K+     Y VN +  DI  E+ +NGPV     +Y D   YKS
Sbjct: 259 VCEKKCKVQN--QVWLEKKHFSVNAYRVNSDPHDIMAEVYQNGPVEVAFTVYEDFAHYKS 316

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
           G            +Y  I     G +A            VK++GWG  +           
Sbjct: 317 G------------VYKHITGGMMGGHA------------VKLIGWGTTDA---------- 342

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
                                  G  YW + + +   +GD G  KI+RG NE  IE  V 
Sbjct: 343 -----------------------GEDYWLLANQWNRGWGDDGYFKIIRGTNECGIEEDVV 379

Query: 240 GALPKD-----NYGVEFG 252
             +P       NY   FG
Sbjct: 380 AGMPSTKNMVRNYDSAFG 397


>gi|123483120|ref|XP_001323959.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
           [Trichomonas vaginalis G3]
 gi|121906833|gb|EAY11736.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
           [Trichomonas vaginalis G3]
          Length = 255

 Score = 57.4 bits (137), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 24/52 (46%), Positives = 37/52 (71%)

Query: 192 TVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
           TV++IGWG+E G PYW I++ +G  +G+ G ++I  GR++A +ES V  A P
Sbjct: 200 TVEIIGWGQEKGIPYWIILNQYGRLWGENGMMRIRMGRDDARVESYVLAAEP 251


>gi|350596935|ref|XP_001927698.4| PREDICTED: tubulointerstitial nephritis antigen, partial [Sus
           scrofa]
          Length = 368

 Score = 57.4 bits (137), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 61/246 (24%), Positives = 96/246 (39%), Gaps = 69/246 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+SG     W ++ KRGLV+              +P     N T +     + +  + K 
Sbjct: 176 CNSGSIDRAWWYLRKRGLVSHAC-----------YPLFKDQNATNNGCAMASRSDGRGKR 224

Query: 62  H-TRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           H T+   +N+ +      Y+    Y V+    +I +EIM+NGPV A M ++ D F YK+G
Sbjct: 225 HATKPCPNNFEKS--NRIYQCSPPYRVSSNETEIMREIMQNGPVQAIMQVHEDFFHYKTG 282

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
            Y                      +  S + E   Y                   +R +A
Sbjct: 283 IY---------------------RHVTSTNEESDKYRK-----------------LRTHA 304

Query: 181 VSASAEIVAYATVKLIGWGEENG-----RPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
                       VKL GWG   G       +W   +++G+ +G+ G  +ILRG NE+ IE
Sbjct: 305 ------------VKLTGWGTLKGAQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIE 352

Query: 236 SLVNGA 241
            L+  A
Sbjct: 353 KLIIAA 358


>gi|348553066|ref|XP_003462348.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cavia
           porcellus]
          Length = 475

 Score = 57.4 bits (137), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 59/246 (23%), Positives = 89/246 (36%), Gaps = 70/246 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAH------HSNTGCQPVSFPPCNHANYTTSEPECKTLA 55
           C  G     W ++ KRGLV+   +      ++  GC   S        + T         
Sbjct: 284 CGGGSVDRAWWYLRKRGLVSHACYPLFKDQNATNGCAMASRSDGRGKRHAT--------- 334

Query: 56  TPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIF 115
           TP P  H   +N           Y+    Y V+     I +EIM+NGPV A M ++ D F
Sbjct: 335 TPCPN-HIEKSNR---------IYQCSPPYRVSSNETQIMKEIMQNGPVQAIMKVHEDFF 384

Query: 116 SYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
           SYK+G                I+ + +     S   + +    VK+ GWG   G      
Sbjct: 385 SYKTG----------------IYRHVTSTSEDSEKYQKLRTHAVKLTGWGTLKG------ 422

Query: 176 VRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
                                         +W   +++G+ +G+ G  KILRG NE+ IE
Sbjct: 423 -----------------------ARGKKEKFWIAANSWGKSWGENGYFKILRGVNESDIE 459

Query: 236 SLVNGA 241
            L+  A
Sbjct: 460 KLIIAA 465


>gi|59895951|gb|AAX11351.1| cathepsin B-like cysteine protease [Oryza sativa Japonica Group]
 gi|125551767|gb|EAY97476.1| hypothetical protein OsI_19406 [Oryza sativa Indica Group]
 gi|215694023|dbj|BAG89222.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215712372|dbj|BAG94499.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215765382|dbj|BAG87079.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222631058|gb|EEE63190.1| hypothetical protein OsJ_17999 [Oryza sativa Japonica Group]
          Length = 358

 Score = 57.4 bits (137), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 62/258 (24%), Positives = 88/258 (34%), Gaps = 82/258 (31%)

Query: 2   CSSGISSSTWVWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
           C  G     W +  + G+VT     +    GC+                P C+  A P P
Sbjct: 170 CDGGYPIMAWRYFVRNGVVTDECDPYFDQVGCK---------------HPGCEP-AYPTP 213

Query: 60  KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
            C  +C   N  + + + K+     Y VN +  DI  E+ +NGPV     +Y D   YKS
Sbjct: 214 VCEKKCKVQN--QVWLEKKHFSVNAYRVNSDPHDIMAEVYQNGPVEVAFTVYEDFAHYKS 271

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
           G            +Y  I     G +A            VK++GWG  +           
Sbjct: 272 G------------VYKHITGGMMGGHA------------VKLIGWGTTDA---------- 297

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
                                  G  YW + + +   +GD G  KI+RG NE  IE  V 
Sbjct: 298 -----------------------GEDYWLLANQWNRGWGDDGYFKIIRGTNECGIEEDVV 334

Query: 240 GALPKD-----NYGVEFG 252
             +P       NY   FG
Sbjct: 335 AGMPSTKNMVRNYDSAFG 352


>gi|66810163|ref|XP_638805.1| peptidase C1A family protein [Dictyostelium discoideum AX4]
 gi|74897075|sp|Q54QD9.1|CTSB_DICDI RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Flags:
           Precursor
 gi|60467425|gb|EAL65448.1| peptidase C1A family protein [Dictyostelium discoideum AX4]
          Length = 311

 Score = 57.4 bits (137), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 39/172 (22%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G + S W W+ K+G V+         C P + P C  A     +P    + TP   C
Sbjct: 145 CEGGDAFSAWNWLRKQGAVS-------EECLPYTIPTCPPA----QQPCLNFVNTP--SC 191

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
              C + N    + QDK++  + Y  + + A I QEI+ NGPV A   ++ D  +YKSG 
Sbjct: 192 TKECQS-NSSLIYSQDKHKMAKIYSFDSDEA-IMQEIVTNGPVEACFTVFEDFLAYKSGV 249

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYW 173
           Y                        V  + + +    VK+VG+G  NG  Y+
Sbjct: 250 Y------------------------VHTTGKDLGGHCVKLVGFGTLNGVDYY 277


>gi|189308076|gb|ACD86922.1| cysteine protease [Caenorhabditis brenneri]
          Length = 228

 Score = 57.4 bits (137), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 38/78 (48%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   + W ++ K G  TGG++ +  GC+P S  PC      T+ P C T     P C
Sbjct: 150 CEGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNTTWPACPTDGYDTPAC 209

Query: 62  HTRCTNDNYGRGFFQDKY 79
             +CTN NY   +  DK+
Sbjct: 210 VNKCTNSNYNVAYKDDKH 227


>gi|146386348|gb|ABQ23962.1| cathepsin B [Oryctolagus cuniculus]
          Length = 228

 Score = 57.4 bits (137), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 37/97 (38%), Positives = 52/97 (53%), Gaps = 3/97 (3%)

Query: 8   SSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 67
           S  W +  K+GLV+GG + S+ GC+P S PPC H +   S P C T     P+C   C  
Sbjct: 135 SGAWNFWTKKGLVSGGLYDSHVGCKPYSIPPCEH-HVNGSRPAC-TGEGDTPRCSKTC-E 191

Query: 68  DNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPV 104
             Y   + +DK+     Y V+ +  +I+ EI KNGPV
Sbjct: 192 PGYSPSYKEDKHYGYSSYSVSSDENEIKAEIYKNGPV 228


>gi|16768502|gb|AAL28470.1| GM06507p [Drosophila melanogaster]
          Length = 430

 Score = 57.0 bits (136), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 60/242 (24%), Positives = 84/242 (34%), Gaps = 75/242 (30%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   + W ++HK+G+V       +  C P          YT     CK   +   K 
Sbjct: 253 CEGGHLDAAWRYLHKKGVV-------DENCYP----------YTQHRDTCKIRHSRSLKA 295

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +      N  R      Y     Y +N E ADI  EI  +GPV A M +  D F+Y  G 
Sbjct: 296 NGCQKPVNVDRDSL---YTVGPAYSLNRE-ADIMAEIFHSGPVQATMRVNRDFFAYSGGV 351

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y                         +       + +VK+VGWGEE+             
Sbjct: 352 YRE---------------------TAANRKAPTGFHSVKLVGWGEEH------------- 377

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                               NG  YW   +++G  +G+ G  +ILRG NE  IE  V  +
Sbjct: 378 --------------------NGEKYWIAANSWGSWWGEHGYFRILRGSNECGIEEYVLAS 417

Query: 242 LP 243
            P
Sbjct: 418 WP 419


>gi|340503546|gb|EGR30116.1| hypothetical protein IMG5_141560 [Ichthyophthirius multifiliis]
          Length = 599

 Score = 57.0 bits (136), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 59/124 (47%), Gaps = 24/124 (19%)

Query: 122 YGNGPVVANMYLYSDIFSYKSGVY-AVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
           + NGP+V +     D   Y+ G+Y +V A+  I+          G+E+  P W  V    
Sbjct: 484 HKNGPIVVSFEPAMDFMYYQEGIYHSVDANDWIL----------GDEDKLPQWEKVD--- 530

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                      +V  +GWGE     YW + +++GE +G+KG  KI RG +E+ IES+   
Sbjct: 531 ----------HSVLCVGWGENEDGKYWLVQNSWGEDWGEKGYFKIRRGTDESNIESMGER 580

Query: 241 ALPK 244
           A  K
Sbjct: 581 AFIK 584


>gi|301775398|ref|XP_002923119.1| PREDICTED: LOW QUALITY PROTEIN: tubulointerstitial nephritis
           antigen-like [Ailuropoda melanoleuca]
          Length = 472

 Score = 57.0 bits (136), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 63/250 (25%), Positives = 95/250 (38%), Gaps = 77/250 (30%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANY-----TTSEPECKTLAT 56
           C+SG     W ++ KRGLV+         C P+ F   N  NY     + S+   K  AT
Sbjct: 280 CNSGSIDRAWWFLRKRGLVS-------HACYPL-FKDQNATNYGCAMASRSDGRGKRHAT 331

Query: 57  PQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFS 116
            +P     C N+          Y+    Y V+    +I +EIM+NGPV A M ++ D F 
Sbjct: 332 -KP-----CPNNIEKSNRI---YQCSPPYRVSSNETEIMKEIMQNGPVQAIMQVHEDFFH 382

Query: 117 YKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
           YK+G Y +                                                  + 
Sbjct: 383 YKTGIYRH--------------------------------------------------VT 392

Query: 177 RVYAVSASAEIVAYATVKLIGWGEENG-----RPYWTIVSTFGEQFGDKGTIKILRGRNE 231
           R    S+    +    +KL GWG   G       +W   +++G+ +G+ G  +ILRG NE
Sbjct: 393 RTNEESSKYRKLQTHAIKLTGWGTLKGARGQKEKFWIAANSWGKSWGENGYFRILRGVNE 452

Query: 232 AIIESLVNGA 241
           + IE L+  A
Sbjct: 453 SDIEKLIIAA 462


>gi|323448265|gb|EGB04166.1| hypothetical protein AURANDRAFT_32974 [Aureococcus anophagefferens]
          Length = 298

 Score = 57.0 bits (136), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 63/253 (24%), Positives = 86/253 (33%), Gaps = 72/253 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTG------CQPVSFPPCNHANYTTSEP------ 49
           C  G   + W +V K G VTGG  ++ TG      C     P C+H      +P      
Sbjct: 92  CDGGQIITPWTYVAKAGAVTGG-QYNGTGPFGAGLCADWFAPHCHHHGPRGDDPYPAEGD 150

Query: 50  -ECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANM 108
             C +  +P+       T       F  DK+ F          A I   I + GPV    
Sbjct: 151 AGCPSEKSPEGPKACDATAAAGHDAFAADKHTFAGDVQTASGEAAIMAMIAEGGPVETAF 210

Query: 109 YLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEEN 168
            +Y D  +Y  G            +Y  +   ++G +A            VK VGWG EN
Sbjct: 211 TVYEDFENYAGG------------IYHHVTGEEAGGHA------------VKFVGWGVEN 246

Query: 169 GRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 228
           G  YW +   +                         PYW          G+ G  +ILRG
Sbjct: 247 GTKYWKVANSW------------------------NPYW----------GEAGYFRILRG 272

Query: 229 RNEAIIESLVNGA 241
            NE  IE  V G+
Sbjct: 273 SNEGGIEDQVTGS 285


>gi|78042562|ref|NP_001030279.1| tubulointerstitial nephritis antigen [Bos taurus]
 gi|108861910|sp|Q3SZI1.1|TINAG_BOVIN RecName: Full=Tubulointerstitial nephritis antigen; Short=TIN-Ag
 gi|74354008|gb|AAI02844.1| Tubulointerstitial nephritis antigen [Bos taurus]
 gi|296474572|tpg|DAA16687.1| TPA: tubulointerstitial nephritis antigen [Bos taurus]
          Length = 476

 Score = 57.0 bits (136), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 60/247 (24%), Positives = 92/247 (37%), Gaps = 71/247 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAH-------HSNTGCQPVSFPPCNHANYTTSEPECKTL 54
           C+SG     W ++ KRGLV+   +        +N GC          A  + S+   K  
Sbjct: 284 CNSGSVDRAWWYLRKRGLVSHACYPLFKDQNATNNGC----------AMASRSDGRGKRH 333

Query: 55  ATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDI 114
           AT      T C N           Y+    Y V+    +I +EIM+NGPV A M ++ D 
Sbjct: 334 AT------TPCPNSIEKSNRI---YQCSPPYRVSSNETEIMREIMQNGPVQAIMQVHEDF 384

Query: 115 FSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWT 174
           F+YK+G                I+ + +     S          VK+ GWG   G     
Sbjct: 385 FNYKTG----------------IYRHITSTNEDSEKYRKFRTHAVKLTGWGTLRG----- 423

Query: 175 IVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 234
                                    +     +W   +++G+ +G+ G  +ILRG NE+ I
Sbjct: 424 ------------------------AQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDI 459

Query: 235 ESLVNGA 241
           E L+  A
Sbjct: 460 EKLIIAA 466


>gi|260782761|ref|XP_002586451.1| hypothetical protein BRAFLDRAFT_247264 [Branchiostoma floridae]
 gi|229271561|gb|EEN42462.1| hypothetical protein BRAFLDRAFT_247264 [Branchiostoma floridae]
          Length = 272

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 53/180 (29%), Positives = 75/180 (41%), Gaps = 63/180 (35%)

Query: 59  PKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
           P+C ++CT    G G    K+     Y V+ E   I+ EIM NGPV A   +YSDI    
Sbjct: 146 PECMSKCT----GEGHAYQKFYGLYLYTVSGE-NQIKVEIMTNGPVEAAFTVYSDIV--- 197

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
                                YKSGVY  ++  ++  +A VK++GWG E+          
Sbjct: 198 --------------------HYKSGVYHHTSGGKLGGHA-VKVLGWGVED---------- 226

Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                EE    YW + +++G  +GD+G  KI RG +E  IES V
Sbjct: 227 ---------------------EEE---YWLVANSWGPDWGDQGFFKIKRGSDECGIESRV 262


>gi|344287520|ref|XP_003415501.1| PREDICTED: tubulointerstitial nephritis antigen isoform 2
           [Loxodonta africana]
          Length = 437

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 61/250 (24%), Positives = 97/250 (38%), Gaps = 76/250 (30%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAH----HSNTGCQPVSFPPCNHANYTTSEPECKTLATP 57
           C  G     W ++ +RG+V+   +    H      PV  PPC    ++ +    K  AT 
Sbjct: 240 CRGGRLDGAWWFLRRRGVVSDHCYPFSGHERDKAGPV--PPC--MMHSRAMGRGKRQAT- 294

Query: 58  QPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
                +RC N +       D Y+    Y +     +I +E+M+NGPV A M ++      
Sbjct: 295 -----SRCPNSHV---HGNDIYQVTPAYRLGTNEKEIMKELMENGPVQALMEVHE----- 341

Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAE-------IVAYATVKIVGWGEENGR 170
                             D F Y+ G+Y+ +  ++            +VKI GWGEE   
Sbjct: 342 ------------------DFFLYQGGIYSHTPVSQERPEQYRRHGTHSVKITGWGEET-- 381

Query: 171 PYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 230
                           +    T+K           YWT  +++G  +G++G  +I+RG N
Sbjct: 382 ----------------LPDGRTLK-----------YWTAANSWGPAWGERGHFRIVRGAN 414

Query: 231 EAIIESLVNG 240
           E  IES V G
Sbjct: 415 ECDIESFVLG 424


>gi|328872536|gb|EGG20903.1| hypothetical protein DFA_00770 [Dictyostelium fasciculatum]
          Length = 313

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 29/67 (43%), Positives = 40/67 (59%)

Query: 107 NMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGE 166
           + Y  S I   K+  Y NGP++A   LY+DI++YKSGVY  S SA        +++GWG 
Sbjct: 154 DCYRLSSIEQAKADIYLNGPIIAVFDLYTDIYNYKSGVYIKSDSATYKETHAGRVIGWGV 213

Query: 167 ENGRPYW 173
           E+G  YW
Sbjct: 214 EDGVQYW 220


>gi|66911417|gb|AAH97299.1| Tubulointerstitial nephritis antigen-like 1 [Rattus norvegicus]
 gi|149024087|gb|EDL80584.1| lipocalin 7, isoform CRA_a [Rattus norvegicus]
 gi|149024088|gb|EDL80585.1| lipocalin 7, isoform CRA_a [Rattus norvegicus]
 gi|149024089|gb|EDL80586.1| lipocalin 7, isoform CRA_a [Rattus norvegicus]
          Length = 467

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 55/193 (28%), Positives = 85/193 (44%), Gaps = 48/193 (24%)

Query: 55  ATPQPKC--HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYS 112
           A+P P+C  H+R      GRG  Q   R    +  ++++  +        PV     L S
Sbjct: 303 ASPTPRCMMHSR----AMGRGKRQATSRCPNSHVDSNDIYQVT-------PVYR---LAS 348

Query: 113 DIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPY 172
           D          NGPV A M ++ D F Y+ G+Y+ +  ++                GRP 
Sbjct: 349 DEKEIMKELMENGPVQALMEVHEDFFLYQRGIYSHTPVSQ----------------GRP- 391

Query: 173 WTIVRVYAVSASAEIVAYATVKLIGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILR 227
               R +            +VK+ GWGEE   +GR   YWT  +++G  +G++G  +I+R
Sbjct: 392 -EQYRRHGTH---------SVKITGWGEETLPDGRTIKYWTAANSWGPWWGERGHFRIVR 441

Query: 228 GRNEAIIESLVNG 240
           G NE  IE+ V G
Sbjct: 442 GTNECDIETFVLG 454


>gi|338718488|ref|XP_001918155.2| PREDICTED: LOW QUALITY PROTEIN: tubulointerstitial nephritis
           antigen-like [Equus caballus]
          Length = 480

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 54/241 (22%), Positives = 92/241 (38%), Gaps = 59/241 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+SG     W ++ KRGLV+              +P     N T ++    + +  + K 
Sbjct: 288 CNSGSIDRAWWYLRKRGLVSHAC-----------YPLFKDQNATNNDCAMASRSDGRGKR 336

Query: 62  H-TRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           H T+   +N  +      Y+    Y V+    +I +EIM+NGPV A M ++ D F YK G
Sbjct: 337 HATKPCPNNIEKS--NRIYQCSPPYRVSSNETEIMKEIMQNGPVQAIMQVHDDFFHYKKG 394

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
                           I+ + +  +        +    +K+ GWG   G           
Sbjct: 395 ----------------IYRHVTSTHEEPEKYRKLRTHAIKLAGWGTLRG----------- 427

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                              +     +W   +++G+ +G+ G  +ILRG NE+ IE L+  
Sbjct: 428 ------------------AQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIA 469

Query: 241 A 241
           A
Sbjct: 470 A 470


>gi|297723949|ref|NP_001174338.1| Os05g0310500 [Oryza sativa Japonica Group]
 gi|255676228|dbj|BAH93066.1| Os05g0310500, partial [Oryza sativa Japonica Group]
          Length = 234

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 64/258 (24%), Positives = 89/258 (34%), Gaps = 82/258 (31%)

Query: 2   CSSGISSSTWVWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
           C  G     W +  + G+VT     +    GC+    P C        EP     A P P
Sbjct: 46  CDGGYPIMAWRYFVRNGVVTDECDPYFDQVGCK---HPGC--------EP-----AYPTP 89

Query: 60  KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
            C  +C   N  + + + K+     Y VN +  DI  E+ +NGPV     +Y D   YKS
Sbjct: 90  VCEKKCKVQN--QVWLEKKHFSVNAYRVNSDPHDIMAEVYQNGPVEVAFTVYEDFAHYKS 147

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
           G            +Y  I     G +A            VK++GWG  +           
Sbjct: 148 G------------VYKHITGGMMGGHA------------VKLIGWGTTDA---------- 173

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
                                  G  YW + + +   +GD G  KI+RG NE  IE  V 
Sbjct: 174 -----------------------GEDYWLLANQWNRGWGDDGYFKIIRGTNECGIEEDVV 210

Query: 240 GALPKD-----NYGVEFG 252
             +P       NY   FG
Sbjct: 211 AGMPSTKNMVRNYDSAFG 228


>gi|67613207|ref|XP_667285.1| preprocathepsin c precursor [Cryptosporidium hominis TU502]
 gi|54658406|gb|EAL37056.1| preprocathepsin c precursor [Cryptosporidium hominis]
          Length = 635

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 43/158 (27%), Positives = 70/158 (44%), Gaps = 39/158 (24%)

Query: 110 LYSDIFSYKSGKYG-------------NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAY 156
           +Y++ + Y  G YG             NGP+   M++ + +  Y++GVY  S   +   Y
Sbjct: 461 MYAEEYGYVGGCYGCCDEDRMKEEIFKNGPIAVAMHIDTSLLVYENGVYD-SIPNDHTKY 519

Query: 157 ATV---KIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTF 213
             +   ++ GW   N                        + ++GWGEENG PYW I +++
Sbjct: 520 CDLPNKQLNGWEYTN----------------------HAIAIVGWGEENGIPYWIIRNSW 557

Query: 214 GEQFGDKGTIKILRGRNEAIIESLVNGALPKDNYGVEF 251
           G  +G+KG  KI RG+N   IE+      P  + G+ F
Sbjct: 558 GANWGNKGYAKIRRGKNIGGIENQAVFIDPDFSRGMGF 595


>gi|86279341|gb|ABC88766.1| putative cathepsin B-like like proteinase [Tenebrio molitor]
          Length = 301

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 37/118 (31%), Positives = 54/118 (45%), Gaps = 4/118 (3%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G     W +    G+VTGG +  + GC+  S  PC+H       P C  +    P C
Sbjct: 154 CNGGWPDLAWSYWSSTGIVTGGLYGVDEGCKAYSIKPCDHHVDGNLGP-CGDIQR-TPAC 211

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
              C  D+     ++   R    Y +    + IQ EIM NGPV A+  +YSD  +YK+
Sbjct: 212 KKSC--DSTSDLEYKSDLRRGSAYSIPKSESQIQTEIMTNGPVEADYDVYSDFLTYKA 267


>gi|47125398|gb|AAH70278.1| Tubulointerstitial nephritis antigen [Homo sapiens]
 gi|190690249|gb|ACE86899.1| tubulointerstitial nephritis antigen protein [synthetic construct]
 gi|190691623|gb|ACE87586.1| tubulointerstitial nephritis antigen protein [synthetic construct]
 gi|312150986|gb|ADQ32005.1| tubulointerstitial nephritis antigen [synthetic construct]
          Length = 476

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 60/247 (24%), Positives = 94/247 (38%), Gaps = 71/247 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAH-------HSNTGCQPVSFPPCNHANYTTSEPECKTL 54
           C+SG     W ++ KRGLV+   +        +N GC          A  + S+   K  
Sbjct: 284 CNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNGC----------AMASRSDGRGKRH 333

Query: 55  ATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDI 114
           AT +P     C N+          Y+    Y V+    +I +EIM+NGPV A M ++ D 
Sbjct: 334 AT-KP-----CPNNVEKSNRI---YQCSPPYRVSSNETEIMKEIMQNGPVQAIMQVHEDF 384

Query: 115 FSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWT 174
           F YK+G                I+ + +     S     +    VK+ GWG   G     
Sbjct: 385 FHYKTG----------------IYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRG----- 423

Query: 175 IVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 234
                                    +     +W   +++G+ +G+ G  +ILRG NE+ I
Sbjct: 424 ------------------------AQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDI 459

Query: 235 ESLVNGA 241
           E L+  A
Sbjct: 460 EKLIIAA 466


>gi|6009533|dbj|BAA84949.1| tubulointerstitial nephritis antigen [Homo sapiens]
          Length = 476

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 60/247 (24%), Positives = 94/247 (38%), Gaps = 71/247 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAH-------HSNTGCQPVSFPPCNHANYTTSEPECKTL 54
           C+SG     W ++ KRGLV+   +        +N GC          A  + S+   K  
Sbjct: 284 CNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNGC----------AMASRSDGRGKRH 333

Query: 55  ATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDI 114
           AT +P     C N+          Y+    Y V+    +I +EIM+NGPV A M ++ D 
Sbjct: 334 AT-KP-----CPNNVEKSNRI---YQCSPPYRVSSNETEIMKEIMQNGPVQAIMQVHEDF 384

Query: 115 FSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWT 174
           F YK+G                I+ + +     S     +    VK+ GWG   G     
Sbjct: 385 FHYKTG----------------IYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRG----- 423

Query: 175 IVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 234
                                    +     +W   +++G+ +G+ G  +ILRG NE+ I
Sbjct: 424 ------------------------AQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDI 459

Query: 235 ESLVNGA 241
           E L+  A
Sbjct: 460 EKLIIAA 466


>gi|344287518|ref|XP_003415500.1| PREDICTED: tubulointerstitial nephritis antigen isoform 1
           [Loxodonta africana]
          Length = 468

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 61/250 (24%), Positives = 97/250 (38%), Gaps = 76/250 (30%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAH----HSNTGCQPVSFPPCNHANYTTSEPECKTLATP 57
           C  G     W ++ +RG+V+   +    H      PV  PPC    ++ +    K  AT 
Sbjct: 271 CRGGRLDGAWWFLRRRGVVSDHCYPFSGHERDKAGPV--PPC--MMHSRAMGRGKRQAT- 325

Query: 58  QPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
                +RC N +       D Y+    Y +     +I +E+M+NGPV A M ++      
Sbjct: 326 -----SRCPNSHV---HGNDIYQVTPAYRLGTNEKEIMKELMENGPVQALMEVHE----- 372

Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAE-------IVAYATVKIVGWGEENGR 170
                             D F Y+ G+Y+ +  ++            +VKI GWGEE   
Sbjct: 373 ------------------DFFLYQGGIYSHTPVSQERPEQYRRHGTHSVKITGWGEET-- 412

Query: 171 PYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 230
                           +    T+K           YWT  +++G  +G++G  +I+RG N
Sbjct: 413 ----------------LPDGRTLK-----------YWTAANSWGPAWGERGHFRIVRGAN 445

Query: 231 EAIIESLVNG 240
           E  IES V G
Sbjct: 446 ECDIESFVLG 455


>gi|297291062|ref|XP_002803846.1| PREDICTED: tubulointerstitial nephritis antigen-like [Macaca
           mulatta]
          Length = 463

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 60/246 (24%), Positives = 94/246 (38%), Gaps = 70/246 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAH------HSNTGCQPVSFPPCNHANYTTSEPECKTLA 55
           C+SG     W ++ KRGLV+   +      ++N GC          A  + S+   K  A
Sbjct: 272 CNSGSIDRAWWYLRKRGLVSHACYPLFKDQNANNGC----------AMASRSDGRGKRHA 321

Query: 56  TPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIF 115
           T +P     C N+          Y+    Y V+    +I +EIM+NGPV A M +  D F
Sbjct: 322 T-KP-----CPNNIEKSNRI---YQCSPPYRVSSSETEIMKEIMQNGPVQAIMQVREDFF 372

Query: 116 SYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
            YK+G                I+ + +     S     +    VK+ GWG   G      
Sbjct: 373 HYKTG----------------IYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRG------ 410

Query: 176 VRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
                                   +     +W   +++G+ +G+ G  +ILRG NE+ IE
Sbjct: 411 -----------------------AQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIE 447

Query: 236 SLVNGA 241
            L+  A
Sbjct: 448 KLIIAA 453


>gi|291236586|ref|XP_002738220.1| PREDICTED: cathepsin B preproprotein-like [Saccoglossus
           kowalevskii]
          Length = 93

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 37/120 (30%), Positives = 52/120 (43%), Gaps = 35/120 (29%)

Query: 125 GPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSAS 184
           GPV     +Y+D  SYKSGVY    + E +    +KI+GWG E+G               
Sbjct: 8   GPVEGAFTVYADFPSYKSGVYQ-HETGEALGGHAIKILGWGNEDG--------------- 51

Query: 185 AEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
                                YW + +++ E +GD+G  KILRG +E  IES +    PK
Sbjct: 52  -------------------HDYWLVANSWNEDWGDQGFFKILRGVDECGIESQITAGSPK 92


>gi|73973401|ref|XP_538969.2| PREDICTED: tubulointerstitial nephritis antigen [Canis lupus
           familiaris]
          Length = 476

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 65/250 (26%), Positives = 94/250 (37%), Gaps = 77/250 (30%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANY-----TTSEPECKTLAT 56
           C+SG     W ++ KRGLV+         C P+ F   N  NY     + S+   K  AT
Sbjct: 284 CNSGSIDRAWWFLRKRGLVSHA-------CYPL-FKDQNATNYGCAMASRSDGRGKRHAT 335

Query: 57  PQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFS 116
            +P     C N+          Y+    Y V+    +I +EIM+NGPV A M ++ D F 
Sbjct: 336 -KP-----CPNNIEKSNRI---YQCSPPYRVSSNETEIMKEIMQNGPVQAIMQVHEDFFH 386

Query: 117 YKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
           YK+G Y +                                                  I 
Sbjct: 387 YKTGIYRH--------------------------------------------------IT 396

Query: 177 RVYAVSASAEIVAYATVKLIGWGEENG-----RPYWTIVSTFGEQFGDKGTIKILRGRNE 231
           R    S   + +    VKL GWG   G       +W   +++G  +G+ G  +ILRG NE
Sbjct: 397 RTNEESRKYQKLQTHAVKLTGWGTLKGAQGQKEKFWIAANSWGISWGENGYFRILRGVNE 456

Query: 232 AIIESLVNGA 241
           + IE L+  A
Sbjct: 457 SDIEKLIIAA 466


>gi|355561807|gb|EHH18439.1| hypothetical protein EGK_15031 [Macaca mulatta]
          Length = 475

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 60/246 (24%), Positives = 94/246 (38%), Gaps = 70/246 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAH------HSNTGCQPVSFPPCNHANYTTSEPECKTLA 55
           C+SG     W ++ KRGLV+   +      ++N GC          A  + S+   K  A
Sbjct: 284 CNSGSIDRAWWYLRKRGLVSHACYPLFKDQNANNGC----------AMASRSDGRGKRHA 333

Query: 56  TPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIF 115
           T +P     C N+          Y+    Y V+    +I +EIM+NGPV A M +  D F
Sbjct: 334 T-KP-----CPNNIEKSNRI---YQCSPPYRVSSSETEIMKEIMQNGPVQAIMQVREDFF 384

Query: 116 SYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
            YK+G                I+ + +     S     +    VK+ GWG   G      
Sbjct: 385 HYKTG----------------IYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRG------ 422

Query: 176 VRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
                                   +     +W   +++G+ +G+ G  +ILRG NE+ IE
Sbjct: 423 -----------------------AQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIE 459

Query: 236 SLVNGA 241
            L+  A
Sbjct: 460 KLIIAA 465


>gi|355748654|gb|EHH53137.1| hypothetical protein EGM_13709 [Macaca fascicularis]
          Length = 475

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 60/246 (24%), Positives = 94/246 (38%), Gaps = 70/246 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAH------HSNTGCQPVSFPPCNHANYTTSEPECKTLA 55
           C+SG     W ++ KRGLV+   +      ++N GC          A  + S+   K  A
Sbjct: 284 CNSGSIDRAWWYLRKRGLVSHACYPLFKDQNANNGC----------AMASRSDGRGKRHA 333

Query: 56  TPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIF 115
           T +P     C N+          Y+    Y V+    +I +EIM+NGPV A M +  D F
Sbjct: 334 T-KP-----CPNNIEKSNRI---YQCSPPYRVSSSETEIMKEIMQNGPVQAIMQVREDFF 384

Query: 116 SYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
            YK+G                I+ + +     S     +    VK+ GWG   G      
Sbjct: 385 HYKTG----------------IYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRG------ 422

Query: 176 VRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
                                   +     +W   +++G+ +G+ G  +ILRG NE+ IE
Sbjct: 423 -----------------------AQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIE 459

Query: 236 SLVNGA 241
            L+  A
Sbjct: 460 KLIIAA 465


>gi|428169747|gb|EKX38678.1| hypothetical protein GUITHDRAFT_76993, partial [Guillardia theta
           CCMP2712]
          Length = 85

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 29/63 (46%), Positives = 36/63 (57%)

Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
           NGP V    +Y D +SYKSGVY  SA A+ V    V +VGWG ENG  YW +   +  S+
Sbjct: 8   NGPGVVVFDVYDDFYSYKSGVYTKSAKAQKVGGHAVVLVGWGRENGVDYWLVQNSWGKSS 67

Query: 184 SAE 186
             E
Sbjct: 68  GDE 70


>gi|402867308|ref|XP_003897801.1| PREDICTED: tubulointerstitial nephritis antigen [Papio anubis]
          Length = 475

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 60/246 (24%), Positives = 94/246 (38%), Gaps = 70/246 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAH------HSNTGCQPVSFPPCNHANYTTSEPECKTLA 55
           C+SG     W ++ KRGLV+   +      ++N GC          A  + S+   K  A
Sbjct: 284 CNSGSIDRAWWYLRKRGLVSHACYPLFKDQNANNGC----------AMASRSDGRGKRHA 333

Query: 56  TPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIF 115
           T +P     C N+          Y+    Y V+    +I +EIM+NGPV A M +  D F
Sbjct: 334 T-KP-----CPNNIEKSNRI---YQCSPPYRVSSSETEIMKEIMQNGPVQAIMQVREDFF 384

Query: 116 SYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
            YK+G                I+ + +     S     +    VK+ GWG   G      
Sbjct: 385 HYKTG----------------IYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRG------ 422

Query: 176 VRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
                                   +     +W   +++G+ +G+ G  +ILRG NE+ IE
Sbjct: 423 -----------------------AQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIE 459

Query: 236 SLVNGA 241
            L+  A
Sbjct: 460 KLIIAA 465


>gi|149694136|ref|XP_001503950.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 1
           [Equus caballus]
          Length = 467

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 41/122 (33%), Positives = 60/122 (49%), Gaps = 32/122 (26%)

Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
           NGPV A M ++ D F Y+ GVY+ +  +                +GRP     R +    
Sbjct: 360 NGPVQALMEVHEDFFLYQGGVYSHTPVS----------------HGRP--ERYRRHGTH- 400

Query: 184 SAEIVAYATVKLIGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                   +VK+ GWGEE   +GR   YWT  +++G  +G++G  +I+RG NE  IES V
Sbjct: 401 --------SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFV 452

Query: 239 NG 240
            G
Sbjct: 453 LG 454


>gi|320167003|gb|EFW43902.1| cathepsin B [Capsaspora owczarzaki ATCC 30864]
          Length = 306

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 48/104 (46%), Gaps = 24/104 (23%)

Query: 79  YRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIF 138
           Y+ K  Y V + +A IQ EI+ NGPV A   +Y D FSY SG                ++
Sbjct: 198 YKAKTAYQVANNMAAIQSEILANGPVEAAFSVYDDFFSYTSG----------------VY 241

Query: 139 SYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVS 182
           S++SG         +     VKIVGWG +   PYW +   +  S
Sbjct: 242 SHQSGA--------LDGGHAVKIVGWGVDGTTPYWIVANSWGTS 277


>gi|403268748|ref|XP_003926429.1| PREDICTED: tubulointerstitial nephritis antigen [Saimiri
           boliviensis boliviensis]
          Length = 476

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 57/241 (23%), Positives = 92/241 (38%), Gaps = 59/241 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+SG     W ++ KRGLV+              +P     N T S     + +  + K 
Sbjct: 284 CNSGSIDRAWWYLRKRGLVSHAC-----------YPLFKDQNATNSGCAMASRSDGRGKR 332

Query: 62  H-TRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           H T+   +N  +      Y+    Y V+    +I +EIM+NGPV A M ++ D F YK+G
Sbjct: 333 HATKPCPNNIEKS--NRIYQCSPPYRVSSSETEIMKEIMQNGPVQAIMKVHEDFFHYKTG 390

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
                           I+ + +     S     +    VK+ GWG   G           
Sbjct: 391 ----------------IYRHVTSTNKESEKFLKLQTHAVKLTGWGTLRG----------- 423

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                              +     +W   +++G+ +G+ G  +ILRG NE+ IE L+  
Sbjct: 424 ------------------AQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIA 465

Query: 241 A 241
           A
Sbjct: 466 A 466


>gi|145525479|ref|XP_001448556.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124416111|emb|CAK81159.1| unnamed protein product [Paramecium tetraurelia]
          Length = 490

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 37/122 (30%), Positives = 56/122 (45%), Gaps = 28/122 (22%)

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENG--RPYWTIVRVY 179
           Y NGPVV N     D   Y  G++  +    I+             NG  +P W  V   
Sbjct: 387 YNNGPVVLNFEPSFDFMFYVGGIFHSTTPDWII-------------NGLAKPEWEKVD-- 431

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
                       +V   GWGEENG  YW + +++G+Q+G+ G  ++ RG++E+ IES+  
Sbjct: 432 -----------HSVLCYGWGEENGVKYWLLQNSWGKQWGENGRFRMKRGQDESSIESMAE 480

Query: 240 GA 241
            A
Sbjct: 481 AA 482


>gi|195585648|ref|XP_002082593.1| GD25141 [Drosophila simulans]
 gi|194194602|gb|EDX08178.1| GD25141 [Drosophila simulans]
          Length = 484

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 69/278 (24%), Positives = 94/278 (33%), Gaps = 78/278 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   + W ++HK+G+V       +  C P          YT     CK     +   
Sbjct: 253 CEGGHLDAAWRYLHKKGVV-------DENCYP----------YTQHRDTCKIRHNSRSLR 295

Query: 62  HTRC-TNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
              C T  N  R      Y     Y +N E ADI  EI  +GPV A M +  D F+Y  G
Sbjct: 296 ANGCQTPVNVDRDTL---YTVGPAYSLNRE-ADIMAEIFHSGPVQATMRVNRDFFAYSGG 351

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
            Y                         +       + +VK+VGWGEE+            
Sbjct: 352 VYRE---------------------TAANRKAPTGFHSVKLVGWGEEH------------ 378

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                                NG  YW   +++G  +G+ G  +ILRG NE  IE  V  
Sbjct: 379 ---------------------NGEKYWIAANSWGSWWGEHGYFRILRGSNECGIEEYVLA 417

Query: 241 ALP--KDNYGVEFGEESGERLSEEFGVRAESSEEFREN 276
           + P   + Y  E G    +R    F     S     EN
Sbjct: 418 SWPYVYNYYKCEVGLRGIKRALPPFATEPISELCRNEN 455


>gi|48762497|dbj|BAD23818.1| cathepsin B-S [Tuberaphis coreana]
          Length = 99

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 37/120 (30%), Positives = 57/120 (47%), Gaps = 25/120 (20%)

Query: 57  PQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFS 116
           P  + H +C    YG+   Q++Y+ K  Y +N  +  I+Q                D+ +
Sbjct: 5   PMERNH-QCPKTCYGKTTVQNRYKTKSEYSINS-IKTIEQ----------------DLKT 46

Query: 117 YKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
           Y       GPV A+  +Y D   YKSG+Y  +  A+     ++KI+GWG+ENG  YW  V
Sbjct: 47  Y-------GPVEASFDVYDDFSVYKSGIYRKTPKAKYEGRHSIKIIGWGQENGTTYWLAV 99


>gi|338722032|ref|XP_003364468.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 2
           [Equus caballus]
          Length = 436

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 41/122 (33%), Positives = 60/122 (49%), Gaps = 32/122 (26%)

Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
           NGPV A M ++ D F Y+ GVY+ +  +                +GRP     R +    
Sbjct: 329 NGPVQALMEVHEDFFLYQGGVYSHTPVS----------------HGRP--ERYRRHGTH- 369

Query: 184 SAEIVAYATVKLIGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                   +VK+ GWGEE   +GR   YWT  +++G  +G++G  +I+RG NE  IES V
Sbjct: 370 --------SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFV 421

Query: 239 NG 240
            G
Sbjct: 422 LG 423


>gi|197100841|ref|NP_001126804.1| tubulointerstitial nephritis antigen [Pongo abelii]
 gi|55732702|emb|CAH93049.1| hypothetical protein [Pongo abelii]
          Length = 476

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 58/241 (24%), Positives = 93/241 (38%), Gaps = 59/241 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+SG     W ++ KRGLV+         C P+S       N T +     + +  + K 
Sbjct: 284 CNSGSIDRAWWYLRKRGLVSHA-------CYPLS----KDQNATNNGCAMASRSDGRGKR 332

Query: 62  H-TRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           H T+   +N  +      Y+    Y V+    +I +EIM+NGPV A M +  D F YK+G
Sbjct: 333 HATKPCPNNVEKS--NRIYQCSPPYRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTG 390

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
                           I+ + +     S     +    VK+ GWG   G           
Sbjct: 391 ----------------IYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRG----------- 423

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                              +     +W   +++G+ +G+ G  +ILRG NE+ IE L+  
Sbjct: 424 ------------------AQGQKEKFWVAANSWGKSWGENGYFRILRGVNESDIEKLIIA 465

Query: 241 A 241
           A
Sbjct: 466 A 466


>gi|294877495|ref|XP_002768009.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239870149|gb|EER00727.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 180

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/75 (37%), Positives = 34/75 (45%), Gaps = 6/75 (8%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAH------HSNTGCQPVSFPPCNHANYTTSEPECKTLA 55
           C  G   S W WVH +G+ TGG +        + GC P  FPPC H    T  P+C    
Sbjct: 100 CGGGDPYSAWSWVHDKGIATGGDYVAKDDMTKDDGCWPYDFPPCAHHINDTKYPKCPEGL 159

Query: 56  TPQPKCHTRCTNDNY 70
            P P C  +C N  Y
Sbjct: 160 YPTPNCVEQCHNPKY 174


>gi|297744106|emb|CBI37076.3| unnamed protein product [Vitis vinifera]
          Length = 392

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 59/248 (23%), Positives = 87/248 (35%), Gaps = 85/248 (34%)

Query: 2   CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
           C  G     W +    G+VT     +   TGC               S P C+    P P
Sbjct: 203 CDGGYPLYAWRYFIHHGVVTEECDPYFDATGC---------------SHPGCEP-GYPTP 246

Query: 60  KCHTRCTNDNYGRGFFQDKYRFKRY----YWVNDEVADIQQEIMKNGPVVANMYLYSDIF 115
           KC  +CT++N      Q   + KRY    Y ++ +   I  E+ KNGPV     +Y D  
Sbjct: 247 KCVRKCTDEN------QLWRKAKRYGQSAYRISSDPYQIMAEVYKNGPVEVAFTVYEDFA 300

Query: 116 SYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
            Y+SG                ++ Y +G        +++    VK++GWG          
Sbjct: 301 HYESG----------------VYRYTTG--------DVMGGHAVKLIGWGT--------- 327

Query: 176 VRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
                                    ++G  YW + + +   +GD G   I RG NE  IE
Sbjct: 328 ------------------------TDDGEDYWILANQWNRNWGDDGYFMIRRGVNECGIE 363

Query: 236 SLVNGALP 243
             V   LP
Sbjct: 364 EGVVAGLP 371


>gi|255647484|gb|ACU24206.1| unknown [Glycine max]
          Length = 327

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 59/233 (25%), Positives = 85/233 (36%), Gaps = 77/233 (33%)

Query: 2   CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
           C  G     W ++   G+VT     +    GC   S P C        EP  +T     P
Sbjct: 169 CDGGYPLYAWRYLAHHGVVTEECDPYFDQIGC---SHPGC--------EPAYRT-----P 212

Query: 60  KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
           KC  +C + N  + + + K+     Y VN +  DI  E+ KNGPV     +Y D   YKS
Sbjct: 213 KCVKKCVSGN--QVWKKSKHYSVSAYRVNSDPHDIMAEVYKNGPVEVAFTVYEDFAYYKS 270

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
           G            +Y  I  Y+ G +A            VK++GWG              
Sbjct: 271 G------------VYKHITGYELGGHA------------VKLIGWGT------------- 293

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 232
                                ++G  YW + + +  ++GD G  KI RG NE 
Sbjct: 294 --------------------TDDGEDYWLLANQWNREWGDDGYFKIRRGTNEC 326


>gi|24657813|ref|NP_726176.1| secreted Wg-interacting molecule, isoform A [Drosophila
           melanogaster]
 gi|24657819|ref|NP_611652.2| secreted Wg-interacting molecule, isoform B [Drosophila
           melanogaster]
 gi|21064305|gb|AAM29382.1| RE01730p [Drosophila melanogaster]
 gi|21626543|gb|AAF46818.2| secreted Wg-interacting molecule, isoform A [Drosophila
           melanogaster]
 gi|21626544|gb|AAM68213.1| secreted Wg-interacting molecule, isoform B [Drosophila
           melanogaster]
 gi|220949028|gb|ACL87057.1| CG3074-PA [synthetic construct]
 gi|220958134|gb|ACL91610.1| CG3074-PA [synthetic construct]
          Length = 431

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 60/243 (24%), Positives = 83/243 (34%), Gaps = 76/243 (31%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   + W ++HK+G+V       +  C P          YT     CK     +   
Sbjct: 253 CEGGHLDAAWRYLHKKGVV-------DENCYP----------YTQHRDTCKIRHNSRSLR 295

Query: 62  HTRCTND-NYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
              C    N  R      Y     Y +N E ADI  EI  +GPV A M +  D F+Y  G
Sbjct: 296 ANGCQKPVNVDRDSL---YTVGPAYSLNRE-ADIMAEIFHSGPVQATMRVNRDFFAYSGG 351

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
            Y                         +       + +VK+VGWGEE+            
Sbjct: 352 VYRE---------------------TAANRKAPTGFHSVKLVGWGEEH------------ 378

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                                NG  YW   +++G  +G+ G  +ILRG NE  IE  V  
Sbjct: 379 ---------------------NGEKYWIAANSWGSWWGEHGYFRILRGSNECGIEEYVLA 417

Query: 241 ALP 243
           + P
Sbjct: 418 SWP 420


>gi|52546914|gb|AAU81590.1| cysteine proteinase, partial [Petunia x hybrida]
          Length = 122

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 40/132 (30%), Positives = 53/132 (40%), Gaps = 34/132 (25%)

Query: 112 SDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRP 171
           SD +S  +  Y NGPV     +Y D   YKSGVY      E+  +A VK++GWG      
Sbjct: 5   SDPYSIMTEVYKNGPVEVAFTVYEDFAHYKSGVYKHVTGDELGGHA-VKLIGWGT----- 58

Query: 172 YWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 231
                                        E+G  YW + + +   +GD G  KI RG NE
Sbjct: 59  ----------------------------SEDGEDYWLLANQWNRGWGDDGYFKIRRGTNE 90

Query: 232 AIIESLVNGALP 243
             IE  V   +P
Sbjct: 91  CDIEDEVVAGMP 102


>gi|170028894|ref|XP_001842329.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167879379|gb|EDS42762.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 355

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 61/236 (25%), Positives = 80/236 (33%), Gaps = 83/236 (35%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPP-CNHANYTTSEPECKTLATPQPK 60
           C+ G     W +    GLV+         C P S  P C   N       C  L  P   
Sbjct: 93  CAGGDPLKVWNYWATTGLVS-------DSCMPFSLSPLCLGFN-------CPLLCAP--- 135

Query: 61  CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
                    Y      D+ +  +   V   V  IQ EI+ NGPV A+  LY D    K  
Sbjct: 136 --------GYAGSIVGDRKKGLKVVTVAPYVDAIQSEIILNGPVEASFDLYLDFVHLKQ- 186

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
                         S +++ +SG                         GR          
Sbjct: 187 --------------SQVYNSRSG----------------------PNLGR---------- 200

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
                      +VK+IGWG ENG  YW I STFG  +G++GT   LRG N  ++ S
Sbjct: 201 ----------QSVKIIGWGVENGTEYWLITSTFGIGWGNQGTAMFLRGVNHLVLPS 246


>gi|426250116|ref|XP_004018784.1| PREDICTED: tubulointerstitial nephritis antigen [Ovis aries]
          Length = 476

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 60/247 (24%), Positives = 91/247 (36%), Gaps = 71/247 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAH-------HSNTGCQPVSFPPCNHANYTTSEPECKTL 54
           C+SG     W ++ KRGLV+   +        +N GC          A  + S+   K  
Sbjct: 284 CNSGSVDRAWWYLRKRGLVSHACYPLFKDQNATNNGC----------AMASRSDGRGKRH 333

Query: 55  ATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDI 114
           AT      T C N           Y+    Y V+    +I +EIM+NGPV A M ++ D 
Sbjct: 334 AT------TPCPNSIEKSNRI---YQCSPPYRVSSNETEIMREIMQNGPVQAIMQVHEDF 384

Query: 115 FSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWT 174
           F+YK+G                I+ + +     S          VK+ GWG   G     
Sbjct: 385 FNYKTG----------------IYRHITSTNEDSEKYRKFRTHAVKLTGWGTLRG----- 423

Query: 175 IVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 234
                                          +W   +++G+ +G+ G  +ILRG NE+ I
Sbjct: 424 ------------------------AHGQKEKFWIAANSWGKSWGENGYFRILRGVNESDI 459

Query: 235 ESLVNGA 241
           E L+  A
Sbjct: 460 EKLIIAA 466


>gi|260821944|ref|XP_002606363.1| hypothetical protein BRAFLDRAFT_118514 [Branchiostoma floridae]
 gi|229291704|gb|EEN62373.1| hypothetical protein BRAFLDRAFT_118514 [Branchiostoma floridae]
          Length = 113

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 39/123 (31%), Positives = 58/123 (47%), Gaps = 35/123 (28%)

Query: 131 MYLYSDIFSYKSGVYAVSASAE-------IVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
           M +  D+FSY+SGVY  +  A+          + +V+I+GWG E   PY           
Sbjct: 1   MEVKPDLFSYRSGVYRHTELAQGEPPEYRRRGWHSVRIIGWGVEMSDPY----------- 49

Query: 184 SAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
                  A +K           YWT+ +++G Q+G++G  +I+RG NE  IES V G   
Sbjct: 50  ------QAPIK-----------YWTVANSWGTQWGEEGYFRIVRGENECQIESFVLGVWG 92

Query: 244 KDN 246
           K N
Sbjct: 93  KVN 95


>gi|388500062|gb|AFK38097.1| unknown [Lotus japonicus]
          Length = 357

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 62/244 (25%), Positives = 84/244 (34%), Gaps = 77/244 (31%)

Query: 2   CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
           C  G     W ++   G+VT     +    GC   S P C        EP  +T     P
Sbjct: 169 CDGGYPLYAWRYLAHHGVVTEECDPYFDQIGC---SHPGC--------EPAYQT-----P 212

Query: 60  KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
           KC  +C   N  + + + KY     Y V  +  DI  E+                     
Sbjct: 213 KCVRKCVKGN--QIWKKSKYFSVNAYSVKSDPYDIMAEV--------------------- 249

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
             Y NGPV     +Y D   YKSGVY     +++  +A VK++GWG              
Sbjct: 250 --YKNGPVEVAFTVYEDFAHYKSGVYKHITGSQLGGHA-VKLIGWGT------------- 293

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
                                + G  YW I + +   +GD G   I RG NE  IE  V 
Sbjct: 294 --------------------TDEGEDYWLIANQWNRSWGDDGYFMIRRGTNECGIEEDVT 333

Query: 240 GALP 243
             LP
Sbjct: 334 AGLP 337


>gi|357511629|ref|XP_003626103.1| Cathepsin B [Medicago truncatula]
 gi|87240982|gb|ABD32840.1| Peptidase C1A, papain; Somatotropin hormone; Peptidase C1,
           propeptide [Medicago truncatula]
 gi|355501118|gb|AES82321.1| Cathepsin B [Medicago truncatula]
          Length = 357

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 60/244 (24%), Positives = 86/244 (35%), Gaps = 77/244 (31%)

Query: 2   CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
           C  G     W ++   G+VT     +    GC   S P C        EP  +T     P
Sbjct: 169 CDGGTPIYAWRYLAHHGVVTEECDPYFDQIGC---SHPGC--------EPAYQT-----P 212

Query: 60  KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
           KC  +C   N  + + + K+   + Y V  +  DI  E+ KNGPV     ++ D   YKS
Sbjct: 213 KCVRKCVKGN--QIWKRSKHYSVKAYRVKSDPQDIMAEVYKNGPVEVAFTVFEDFAHYKS 270

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
           G Y +                      ++ SA  +    VK++GWG              
Sbjct: 271 GVYKH----------------------ITGSA--LGGHAVKLIGWGT------------- 293

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
                                + G  YW + + +   +GD G  KI RG NE  IE  V 
Sbjct: 294 --------------------SDEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGIEDDVT 333

Query: 240 GALP 243
             LP
Sbjct: 334 AGLP 337


>gi|66801417|ref|XP_629634.1| hypothetical protein DDB_G0292462 [Dictyostelium discoideum AX4]
 gi|60463014|gb|EAL61210.1| hypothetical protein DDB_G0292462 [Dictyostelium discoideum AX4]
          Length = 323

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 43/150 (28%), Positives = 65/150 (43%), Gaps = 42/150 (28%)

Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
           NGPV+A   LYSD   +K  VY  S++ ++ ++A V++VGWG  +               
Sbjct: 191 NGPVIATFMLYSDFKPHKWDVYIKSSNTQVESHA-VRVVGWGTTS--------------- 234

Query: 184 SAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE------SL 237
                             +G  YW   +++G  +GDKG  KI RG +EA  E      + 
Sbjct: 235 ------------------DGVDYWIAANSWGTGWGDKGYFKIRRGSDEAAFEEGFITVTA 276

Query: 238 VNGALPKDNYGVE--FGEESGERLSEEFGV 265
              ++P   YG+E  FG  S   L   F +
Sbjct: 277 DTASVPTSQYGLEYQFGGNSSTFLKPSFLI 306


>gi|327282776|ref|XP_003226118.1| PREDICTED: tubulointerstitial nephritis antigen-like [Anolis
           carolinensis]
          Length = 476

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 42/154 (27%), Positives = 64/154 (41%), Gaps = 45/154 (29%)

Query: 85  YWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGV 144
           Y ++ + ADI +EI +NGPV A M +Y D F YKSG            +Y  I+S +   
Sbjct: 361 YRISSQDADIMKEIKENGPVQAVMQVYDDFFLYKSG------------IYKHIWSLE--- 405

Query: 145 YAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGR 204
              + +       ++KIVGWG                                  E   +
Sbjct: 406 -GKTQNRHQKKPHSIKIVGWGTLRD-----------------------------AEGQRQ 435

Query: 205 PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
            +W   +++G  +G+ G  +ILRG+NE  IE  V
Sbjct: 436 KFWIAANSWGNSWGENGYFRILRGQNECDIEKTV 469


>gi|261328564|emb|CBH11542.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like,
           putative [Trypanosoma brucei gambiense DAL972]
          Length = 340

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 49/176 (27%), Positives = 68/176 (38%), Gaps = 37/176 (21%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPK 60
           C+ G     W +    GLV+         CQP  FP C+H + + +  P C       PK
Sbjct: 162 CNGGDPDRAWAYFSSTGLVS-------DYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPK 214

Query: 61  CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           C+  C +           YR    Y +  E  D  +E+   GP      +Y D  +Y SG
Sbjct: 215 CNYTCDDPT----IPVVNYRSWTSYALQGE-DDYMRELFFRGPFEVAFDVYEDFIAYNSG 269

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
            Y +   V+  YL         G +A            V++VGWG  NG PYW I 
Sbjct: 270 VYHH---VSGQYL---------GGHA------------VRLVGWGTSNGVPYWKIA 301


>gi|355332948|pdb|3MOR|A Chain A, Crystal Structure Of Cathepsin B From Trypanosoma Brucei
 gi|355332949|pdb|3MOR|B Chain B, Crystal Structure Of Cathepsin B From Trypanosoma Brucei
          Length = 317

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 49/176 (27%), Positives = 68/176 (38%), Gaps = 37/176 (21%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPK 60
           C+ G     W +    GLV+         CQP  FP C+H + + +  P C       PK
Sbjct: 139 CNGGDPDRAWAYFSSTGLVS-------DYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPK 191

Query: 61  CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           C+  C +           YR    Y +  E  D  +E+   GP      +Y D  +Y SG
Sbjct: 192 CNYTCDDPT----IPVVNYRSWTSYALQGE-DDYMRELFFRGPFEVAFDVYEDFIAYNSG 246

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
            Y +   V+  YL         G +A            V++VGWG  NG PYW I 
Sbjct: 247 VYHH---VSGQYL---------GGHA------------VRLVGWGTSNGVPYWKIA 278


>gi|217072748|gb|ACJ84734.1| unknown [Medicago truncatula]
 gi|388505480|gb|AFK40806.1| unknown [Medicago truncatula]
          Length = 359

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 60/244 (24%), Positives = 86/244 (35%), Gaps = 77/244 (31%)

Query: 2   CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
           C  G     W ++   G+VT     +    GC   S P C        EP  +T     P
Sbjct: 171 CDGGTPIYAWRYLAHHGVVTEECDPYFDQIGC---SHPGC--------EPAYQT-----P 214

Query: 60  KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
           KC  +C   N  + + + K+   + Y V  +  DI  E+ KNGPV     ++ D   YKS
Sbjct: 215 KCVRKCVKGN--QIWKRSKHYSVKAYRVKSDPQDIMAEVYKNGPVEVAFTVFEDFAHYKS 272

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
           G Y +                      ++ SA  +    VK++GWG              
Sbjct: 273 GVYKH----------------------ITGSA--LGGHAVKLIGWGT------------- 295

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
                                + G  YW + + +   +GD G  KI RG NE  IE  V 
Sbjct: 296 --------------------SDEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGIEDDVT 335

Query: 240 GALP 243
             LP
Sbjct: 336 AGLP 339


>gi|217073630|gb|ACJ85175.1| unknown [Medicago truncatula]
          Length = 359

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 60/244 (24%), Positives = 86/244 (35%), Gaps = 77/244 (31%)

Query: 2   CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
           C  G     W ++   G+VT     +    GC   S P C        EP  +T     P
Sbjct: 171 CDGGTPIYAWRYLAHHGVVTEECDPYFDQIGC---SHPGC--------EPAYQT-----P 214

Query: 60  KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
           KC  +C   N  + + + K+   + Y V  +  DI  E+ KNGPV     ++ D   YKS
Sbjct: 215 KCVRKCVKGN--QIWKRSKHYSVKAYRVKSDPQDIMTEVYKNGPVEVAFTVFEDFAHYKS 272

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
           G Y +                      ++ SA  +    VK++GWG              
Sbjct: 273 GVYKH----------------------ITGSA--LGGHAVKLIGWGT------------- 295

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
                                + G  YW + + +   +GD G  KI RG NE  IE  V 
Sbjct: 296 --------------------SDEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGIEDDVT 335

Query: 240 GALP 243
             LP
Sbjct: 336 AGLP 339


>gi|225437812|ref|XP_002281936.1| PREDICTED: cathepsin B-like isoform 1 [Vitis vinifera]
 gi|359480250|ref|XP_003632421.1| PREDICTED: cathepsin B-like [Vitis vinifera]
          Length = 358

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 59/248 (23%), Positives = 87/248 (35%), Gaps = 85/248 (34%)

Query: 2   CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
           C  G     W +    G+VT     +   TGC               S P C+    P P
Sbjct: 169 CDGGYPLYAWRYFIHHGVVTEECDPYFDATGC---------------SHPGCEP-GYPTP 212

Query: 60  KCHTRCTNDNYGRGFFQDKYRFKRY----YWVNDEVADIQQEIMKNGPVVANMYLYSDIF 115
           KC  +CT++N      Q   + KRY    Y ++ +   I  E+ KNGPV     +Y D  
Sbjct: 213 KCVRKCTDEN------QLWRKAKRYGQSAYRISSDPYQIMAEVYKNGPVEVAFTVYEDFA 266

Query: 116 SYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
            Y+SG                ++ Y +G        +++    VK++GWG          
Sbjct: 267 HYESG----------------VYRYTTG--------DVMGGHAVKLIGWGT--------- 293

Query: 176 VRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
                                    ++G  YW + + +   +GD G   I RG NE  IE
Sbjct: 294 ------------------------TDDGEDYWILANQWNRNWGDDGYFMIRRGVNECGIE 329

Query: 236 SLVNGALP 243
             V   LP
Sbjct: 330 EGVVAGLP 337


>gi|347546077|gb|AEP03186.1| cathepsin B [Diuraphis noxia]
          Length = 239

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 34/121 (28%), Positives = 53/121 (43%), Gaps = 12/121 (9%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W    + GLVTGG   S  GC+P   PP    +  +    C+         
Sbjct: 116 CDGGYPIKAWKQFSRHGLVTGGDFDSGEGCEPYRVPPSGSNSSNSYNHFCR--------- 166

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             +C  DN    + +D    + YY+++     IQ++++  GP+ A+  +Y D   YKSG 
Sbjct: 167 -GKCYGDNQNISYSEDHRYTRDYYYLSYNA--IQKDVLLYGPIEASFEVYDDFMIYKSGV 223

Query: 122 Y 122
           Y
Sbjct: 224 Y 224


>gi|332210168|ref|XP_003254178.1| PREDICTED: tubulointerstitial nephritis antigen [Nomascus
           leucogenys]
          Length = 476

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 60/246 (24%), Positives = 94/246 (38%), Gaps = 69/246 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+SG     W ++ KRGLV+              +P     N T++     + +  + K 
Sbjct: 284 CNSGSIDRAWWYLRKRGLVSHAC-----------YPLFKDQNATSNGCAMASRSDGRGKR 332

Query: 62  H-TRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           H T+   +N  +      Y+    Y V+    +I +EIM+NGPV A M +  D F YK+G
Sbjct: 333 HATKPCPNNVEKS--NRIYQCSPPYRVSSSETEIMKEIMQNGPVQAIMQVREDFFHYKTG 390

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
            Y                      +  SA+ E   Y  ++                    
Sbjct: 391 IY---------------------RHVTSANKESEKYRKLQT------------------- 410

Query: 181 VSASAEIVAYATVKLIGWGEENG-----RPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
                       VKL GWG   G       +W   +++G+ +G+ G  +ILRG NE+ IE
Sbjct: 411 ----------HAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIE 460

Query: 236 SLVNGA 241
            L+  A
Sbjct: 461 KLIIAA 466


>gi|126647906|ref|XP_001388062.1| preprocathepsin c precursor [Cryptosporidium parvum Iowa II]
 gi|126117150|gb|EAZ51250.1| preprocathepsin c precursor, putative [Cryptosporidium parvum Iowa
           II]
          Length = 635

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 40/143 (27%), Positives = 63/143 (44%), Gaps = 39/143 (27%)

Query: 110 LYSDIFSYKSGKYG-------------NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAY 156
           +Y++ + Y  G YG             NGP+   M++ + +  Y +GVY  S   +   Y
Sbjct: 461 MYAEEYGYVGGCYGCCDEDRMKEEIFKNGPIAVAMHIDTSLLVYDNGVYD-SIPNDHTKY 519

Query: 157 ATV---KIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTF 213
             +   ++ GW   N                        + ++GWGEENG PYW I +++
Sbjct: 520 CDLPNKQLNGWEYTN----------------------HAIAIVGWGEENGIPYWIIRNSW 557

Query: 214 GEQFGDKGTIKILRGRNEAIIES 236
           G  +G KG  KI RG+N   IE+
Sbjct: 558 GANWGKKGYAKIRRGKNIGGIEN 580


>gi|161343853|tpg|DAA06107.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 217

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 30/84 (35%), Positives = 40/84 (47%), Gaps = 1/84 (1%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQ-PK 60
           C  G    +W +  + G V+GG ++SN GCQP + PPC   N       C T    + P 
Sbjct: 130 CDGGSLFESWDYYRRHGFVSGGDYNSNQGCQPYTIPPCKLMNEKPPGHSCTTYHREETPI 189

Query: 61  CHTRCTNDNYGRGFFQDKYRFKRY 84
           C  +C N NY   F  D Y+ K Y
Sbjct: 190 CEKKCYNPNYYTSFRTDIYKGKYY 213


>gi|72389769|ref|XP_845179.1| cysteine peptidase C (CPC) [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
 gi|427931064|pdb|4HWY|A Chain A, Trypanosoma Brucei Procathepsin B Solved From 40 Fs
           Free-electron Laser Pulse Data By Serial Femtosecond
           X-ray Crystallography
 gi|40557577|gb|AAR88085.1| cathepsin B-like cysteine protease [Trypanosoma brucei]
 gi|62360039|gb|AAX80461.1| cysteine peptidase C (CPC) [Trypanosoma brucei]
 gi|70801714|gb|AAZ11620.1| cysteine peptidase C (CPC) [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
          Length = 340

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 49/176 (27%), Positives = 68/176 (38%), Gaps = 37/176 (21%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPK 60
           C+ G     W +    GLV+         CQP  FP C+H + + +  P C       PK
Sbjct: 162 CNGGDPDRAWAYFSSTGLVS-------DYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPK 214

Query: 61  CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           C+  C +           YR    Y +  E  D  +E+   GP      +Y D  +Y SG
Sbjct: 215 CNYTCDDPT----IPVVNYRSWTSYALQGE-DDYMRELFFRGPFEVAFDVYEDFIAYNSG 269

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
            Y +   V+  YL         G +A            V++VGWG  NG PYW I 
Sbjct: 270 VYHH---VSGQYL---------GGHA------------VRLVGWGTSNGVPYWKIA 301


>gi|296863454|pdb|3HHI|A Chain A, Crystal Structure Of Cathepsin B From T. Brucei In Complex
           With Ca074
 gi|296863455|pdb|3HHI|B Chain B, Crystal Structure Of Cathepsin B From T. Brucei In Complex
           With Ca074
          Length = 325

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 49/176 (27%), Positives = 67/176 (38%), Gaps = 37/176 (21%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPK 60
           C+ G     W +    GLV+         CQP  FP C+H + + +  P C       PK
Sbjct: 140 CNGGDPDRAWAYFSSTGLVS-------DYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPK 192

Query: 61  CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           C   C +           YR    Y +  E  D  +E+   GP      +Y D  +Y SG
Sbjct: 193 CDYTCDDPT----IPVVNYRSWTSYALQGE-DDYMRELFFRGPFEVAFDVYEDFIAYNSG 247

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
            Y +   V+  YL         G +A            V++VGWG  NG PYW I 
Sbjct: 248 VYHH---VSGQYL---------GGHA------------VRLVGWGTSNGVPYWKIA 279


>gi|326430261|gb|EGD75831.1| hypothetical protein PTSG_07950 [Salpingoeca sp. ATCC 50818]
          Length = 381

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 47/180 (26%), Positives = 73/180 (40%), Gaps = 57/180 (31%)

Query: 82  KRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYK 141
           + YY  + ++  IQ++IM++GPV+A         SY+              ++ D   Y 
Sbjct: 238 RCYYHSSSDIETIQRDIMQHGPVLA---------SYE--------------VFEDFGEYD 274

Query: 142 SGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEE 201
           SGVY                +GW                            V ++GWG E
Sbjct: 275 SGVYTCPDDGS-------DSIGW--------------------------HAVIIVGWGVE 301

Query: 202 NGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDNYGVEFGEESGERLSE 261
           +  PYW + +++G  FG  G  KI RG NE  IES +  +L  +  GV F   SG  +++
Sbjct: 302 DNTPYWLVQNSWGTGFGIDGYFKIARGTNECNIESRLVTSL-VNTEGVVFASTSGAAVAK 360


>gi|195384166|ref|XP_002050789.1| GJ20006 [Drosophila virilis]
 gi|194145586|gb|EDW61982.1| GJ20006 [Drosophila virilis]
          Length = 432

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 41/152 (26%), Positives = 56/152 (36%), Gaps = 54/152 (35%)

Query: 92  ADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASA 151
           ADI  EI  +GPV A M +Y D FSY  G Y                         +   
Sbjct: 324 ADIMAEIYHSGPVQATMRIYRDFFSYSGGIYRQ---------------------TAANRG 362

Query: 152 EIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVS 211
               + +VK+VGWGEE+                                 +G  YW   +
Sbjct: 363 APTGFHSVKLVGWGEEH---------------------------------DGVKYWIAAN 389

Query: 212 TFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
           ++G  +G+ G  +ILRG NE  IE  V  + P
Sbjct: 390 SWGPWWGEHGYFRILRGSNECGIEEYVLASWP 421


>gi|224586907|ref|NP_055279.3| tubulointerstitial nephritis antigen [Homo sapiens]
 gi|317373501|sp|Q9UJW2.3|TINAG_HUMAN RecName: Full=Tubulointerstitial nephritis antigen; Short=TIN-Ag
 gi|119624842|gb|EAX04437.1| tubulointerstitial nephritis antigen [Homo sapiens]
 gi|189066513|dbj|BAG35763.1| unnamed protein product [Homo sapiens]
          Length = 476

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 60/247 (24%), Positives = 93/247 (37%), Gaps = 71/247 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAH-------HSNTGCQPVSFPPCNHANYTTSEPECKTL 54
           C+SG     W ++ KRGLV+   +        +N GC          A  + S+   K  
Sbjct: 284 CNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNGC----------AMASRSDGRGKRH 333

Query: 55  ATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDI 114
           AT +P     C N+          Y+    Y V+    +I +EIM+NGPV A M +  D 
Sbjct: 334 AT-KP-----CPNNVEKSNRI---YQCSPPYRVSSNETEIMKEIMQNGPVQAIMQVREDF 384

Query: 115 FSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWT 174
           F YK+G                I+ + +     S     +    VK+ GWG   G     
Sbjct: 385 FHYKTG----------------IYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRG----- 423

Query: 175 IVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 234
                                    +     +W   +++G+ +G+ G  +ILRG NE+ I
Sbjct: 424 ------------------------AQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDI 459

Query: 235 ESLVNGA 241
           E L+  A
Sbjct: 460 EKLIIAA 466


>gi|294890618|ref|XP_002773230.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
 gi|239878281|gb|EER05046.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
          Length = 238

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 31/106 (29%), Positives = 48/106 (45%), Gaps = 4/106 (3%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHH---SNTGCQPVSFPPCNHANYTTSEPECKTLATPQ 58
           CS G   ++W ++H  G+V+G       +  GC P +FP C H    +    C       
Sbjct: 133 CSGGNPITSWTFLHTNGIVSGKLSKNMKAADGCWPYNFPKCAHHQKESDYKPCAKELYDT 192

Query: 59  PKCHTRCTNDNYGRGFFQDKYRFKRYYWVN-DEVADIQQEIMKNGP 103
           P C + C N  YG  F +D++  +          + I++EIM NGP
Sbjct: 193 PSCSSSCPNAKYGTAFDKDRHYTESLLPSRFGSTSSIKKEIMTNGP 238


>gi|397517574|ref|XP_003828984.1| PREDICTED: tubulointerstitial nephritis antigen [Pan paniscus]
          Length = 476

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 60/247 (24%), Positives = 93/247 (37%), Gaps = 71/247 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAH-------HSNTGCQPVSFPPCNHANYTTSEPECKTL 54
           C+SG     W ++ KRGLV+   +        +N GC          A  + S+   K  
Sbjct: 284 CNSGSIDRAWWYLRKRGLVSHACYPLFKDHNATNNGC----------AMASRSDGRGKRH 333

Query: 55  ATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDI 114
           AT +P     C N+          Y+    Y V+    +I +EIM+NGPV A M +  D 
Sbjct: 334 AT-KP-----CPNNVEKSNRI---YQCSPPYRVSSNETEIMKEIMQNGPVQAIMQVREDF 384

Query: 115 FSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWT 174
           F YK+G                I+ + +     S     +    VK+ GWG   G     
Sbjct: 385 FHYKTG----------------IYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRG----- 423

Query: 175 IVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 234
                                    +     +W   +++G+ +G+ G  +ILRG NE+ I
Sbjct: 424 ------------------------AQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDI 459

Query: 235 ESLVNGA 241
           E L+  A
Sbjct: 460 EKLIIAA 466


>gi|224064398|ref|XP_002301456.1| predicted protein [Populus trichocarpa]
 gi|222843182|gb|EEE80729.1| predicted protein [Populus trichocarpa]
          Length = 325

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 56/244 (22%), Positives = 87/244 (35%), Gaps = 77/244 (31%)

Query: 2   CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
           C  G     W +  + G+VT     +  + GC               S P C+    P P
Sbjct: 137 CDGGYPIDAWRYFVQSGVVTEECDPYFDDIGC---------------SHPGCEP-GFPTP 180

Query: 60  KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
           KC  +C + N  + + + K+     Y ++ +   I  E+  NGPV     +Y D F++  
Sbjct: 181 KCERKCADKN--KLWAESKHFSVNAYRIDSDPHSIMAEVSMNGPVEVAFTVYED-FAH-- 235

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
                               YKSGVY    + +++    VK++GWG              
Sbjct: 236 --------------------YKSGVYK-HITGDVMGGHAVKLIGWGT------------- 261

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
                                ++G  YW + + +   +GD G  KI RG NE  IE  V 
Sbjct: 262 --------------------SDDGEDYWLLANQWNRGWGDDGYFKIRRGTNECGIEEDVV 301

Query: 240 GALP 243
             LP
Sbjct: 302 AGLP 305


>gi|357623033|gb|EHJ74345.1| tubulointerstitial nephritis antigen [Danaus plexippus]
          Length = 426

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 34/119 (28%), Positives = 52/119 (43%), Gaps = 32/119 (26%)

Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
           +GPV A M ++ D F Y  G+Y  S                      PY           
Sbjct: 330 SGPVHAVMTVHQDFFHYHDGIYRRS----------------------PY----------G 357

Query: 184 SAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 242
              +    +V+++GWGE+ G  YW + +++G  +G+ G  +I RG NE+ IES V   L
Sbjct: 358 DNTLQGLHSVRIVGWGEDRGDKYWVVANSWGCDWGENGYFRIARGSNESGIESFVVTVL 416


>gi|332824268|ref|XP_518550.3| PREDICTED: tubulointerstitial nephritis antigen [Pan troglodytes]
          Length = 476

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 60/247 (24%), Positives = 93/247 (37%), Gaps = 71/247 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAH-------HSNTGCQPVSFPPCNHANYTTSEPECKTL 54
           C+SG     W ++ KRGLV+   +        +N GC          A  + S+   K  
Sbjct: 284 CNSGSIDRAWWYLRKRGLVSHACYPLFKDHNATNNGC----------AMASRSDGRGKRH 333

Query: 55  ATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDI 114
           AT +P     C N+          Y+    Y V+    +I +EIM+NGPV A M +  D 
Sbjct: 334 AT-KP-----CPNNVEKSNRI---YQCSPPYRVSSNETEIMKEIMQNGPVQAIMQVREDF 384

Query: 115 FSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWT 174
           F YK+G                I+ + +     S     +    VK+ GWG   G     
Sbjct: 385 FHYKTG----------------IYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRG----- 423

Query: 175 IVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 234
                                    +     +W   +++G+ +G+ G  +ILRG NE+ I
Sbjct: 424 ------------------------AQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDI 459

Query: 235 ESLVNGA 241
           E L+  A
Sbjct: 460 EKLIIAA 466


>gi|224128101|ref|XP_002320244.1| predicted protein [Populus trichocarpa]
 gi|222861017|gb|EEE98559.1| predicted protein [Populus trichocarpa]
          Length = 339

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 49/197 (24%), Positives = 73/197 (37%), Gaps = 60/197 (30%)

Query: 47  SEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVA 106
           S P C+    P PKC  +C + N  + + + K+     Y ++ +   I  E+  NGPV  
Sbjct: 183 SHPGCEP-GFPTPKCERKCADKN--KLWAESKHFSVNAYRIDSDPHSIMAEVSSNGPVEV 239

Query: 107 NMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGE 166
              +Y D F++                      YKSGVY    + + +    VK++GWG 
Sbjct: 240 AFTVYED-FAH----------------------YKSGVYK-HITGDAMGGHAVKLIGWGT 275

Query: 167 ENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKIL 226
                                             E+G  YW + + +   +GD G  KI 
Sbjct: 276 ---------------------------------SEDGEDYWLLANQWNRGWGDDGYFKIK 302

Query: 227 RGRNEAIIESLVNGALP 243
           RG NE  IE  V   LP
Sbjct: 303 RGTNECGIEGAVVAGLP 319


>gi|426353589|ref|XP_004044272.1| PREDICTED: tubulointerstitial nephritis antigen [Gorilla gorilla
           gorilla]
          Length = 476

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 60/247 (24%), Positives = 93/247 (37%), Gaps = 71/247 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAH-------HSNTGCQPVSFPPCNHANYTTSEPECKTL 54
           C+SG     W ++ KRGLV+   +        +N GC          A  + S+   K  
Sbjct: 284 CNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNGC----------AMASRSDGRGKRH 333

Query: 55  ATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDI 114
           AT +P     C N+          Y+    Y V+    +I +EIM+NGPV A M +  D 
Sbjct: 334 AT-KP-----CPNNVEKSNRI---YQCSPPYRVSSNETEIMKEIMQNGPVQAIMQVREDF 384

Query: 115 FSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWT 174
           F YK+G                I+ + +     S     +    VK+ GWG   G     
Sbjct: 385 FHYKTG----------------IYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRG----- 423

Query: 175 IVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 234
                                    +     +W   +++G+ +G+ G  +ILRG NE+ I
Sbjct: 424 ------------------------AQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDI 459

Query: 235 ESLVNGA 241
           E L+  A
Sbjct: 460 EKLIIAA 466


>gi|294895531|ref|XP_002775206.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239881224|gb|EER07022.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 130

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 34/113 (30%), Positives = 49/113 (43%), Gaps = 35/113 (30%)

Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
           K   + NGPV+  + LY DI  YK+GVY                                
Sbjct: 41  KQEIFTNGPVIGMLSLYEDIRVYKAGVY-------------------------------- 68

Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 230
              V  +       T+K+IGWG E+G+ YW  V+++ E++GD G IK+  GR 
Sbjct: 69  ---VHQTGSFQGIHTLKIIGWGVESGQDYWLAVNSWNEEWGDHGMIKLAVGRT 118



 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 31/103 (30%), Positives = 46/103 (44%), Gaps = 24/103 (23%)

Query: 74  FFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYL 133
           + +D +R K +  +     +I+QEI  NGPV+  + LY DI  YK+G Y           
Sbjct: 20  YIRDLHRAKSFGRLPAIPQNIKQEIFTNGPVIGMLSLYEDIRVYKAGVY----------- 68

Query: 134 YSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
                        V  +       T+KI+GWG E+G+ YW  V
Sbjct: 69  -------------VHQTGSFQGIHTLKIIGWGVESGQDYWLAV 98


>gi|328712819|ref|XP_001942906.2| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
           [Acyrthosiphon pisum]
 gi|328712821|ref|XP_003244911.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
           [Acyrthosiphon pisum]
          Length = 463

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 62/240 (25%), Positives = 87/240 (36%), Gaps = 67/240 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G  +  W W+ K GL+T   +            P      T + P+ K     Q   
Sbjct: 262 CQGGHLTRAWNWIRKFGLITEECY------------PWQGRMSTCAVPKKKKETMAQCPS 309

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
             R  ND   +      +R    Y V  E   I  EI+ +GPV A M +  D F YKSG 
Sbjct: 310 RVRSNNDRTTKTRL---HRVGPVYRVATEEG-IMHEILTSGPVQAVMKVSRDFFMYKSGV 365

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y      +N+                 AS     Y +V+IVGWGEE              
Sbjct: 366 YK----CSNL-----------------ASGSRTGYHSVRIVGWGEE-------------- 390

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                   Y   K++         YW   +++G  +G+ G  +IL+G +E  IE  V  A
Sbjct: 391 --------YQGGKIV--------KYWIASNSWGSWWGENGYFRILKGVDECEIEDFVIAA 434


>gi|198434980|ref|XP_002126076.1| PREDICTED: similar to LOC100124858 protein [Ciona intestinalis]
          Length = 541

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 43/166 (25%), Positives = 68/166 (40%), Gaps = 46/166 (27%)

Query: 79  YRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIF 138
           Y+    Y V+    +I +EI +NGPV A M +  D F YKSG Y +   + N+       
Sbjct: 414 YKTSPVYRVSSNEENIMKEIFENGPVQAVMRVQPDFFVYKSGVY-SSTAIDNI------- 465

Query: 139 SYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGW 198
                   V    +   Y +VKI+GWGE+  +                            
Sbjct: 466 --------VVEQVKDNTYHSVKIIGWGEKKSK---------------------------- 489

Query: 199 GEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
              N   YW + +++G  +G+ G  +I +G NE  IE ++  A P+
Sbjct: 490 --TNSGKYWIVQNSWGANWGEGGYFRIRKGVNECGIEEMILAAWPQ 533


>gi|449283627|gb|EMC90232.1| Tubulointerstitial nephritis antigen [Columba livia]
          Length = 469

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 44/163 (26%), Positives = 69/163 (42%), Gaps = 57/163 (34%)

Query: 79  YRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIF 138
           YR   +Y V+ +  +I +EIM  GPV A M +Y D F YK G Y +              
Sbjct: 354 YRCASHYRVSSKETNIMKEIMDKGPVQAIMKVYEDFFLYKEGIYRH-------------- 399

Query: 139 SYKSGVYAVSASAEIVAYATVKIVGWG---EENGRPYWTIVRVYAVSASAEIVAYATVKL 195
           S K+G    + S        VK++GWG   ++NG+                         
Sbjct: 400 SQKAGSKWKTHS--------VKLLGWGALADKNGQK------------------------ 427

Query: 196 IGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                   + +W   +++G+ +G+ G  +ILRG+NE  IE L+
Sbjct: 428 --------QKFWIAANSWGKSWGENGYFRILRGQNECDIEKLI 462


>gi|449498128|ref|XP_002193225.2| PREDICTED: tubulointerstitial nephritis antigen [Taeniopygia
           guttata]
          Length = 469

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 45/167 (26%), Positives = 69/167 (41%), Gaps = 57/167 (34%)

Query: 79  YRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIF 138
           YR   +Y V+ +  DI +EI   GPV A M +Y D F YK G Y +              
Sbjct: 354 YRCASHYRVSSKETDIMKEIKDRGPVQAIMKVYEDFFLYKEGIYQH-------------- 399

Query: 139 SYKSGVYAVSASAEIVAYATVKIVGWG---EENGRPYWTIVRVYAVSASAEIVAYATVKL 195
           S K+G    + S        VK++GWG   ++NG+                         
Sbjct: 400 SQKAGSKWKTHS--------VKLLGWGALPDKNGQK------------------------ 427

Query: 196 IGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 242
                   + +W   +++G+ +G+ G  +ILRG+NE  IE L+   L
Sbjct: 428 --------QKFWIAANSWGKSWGENGYFRILRGQNECDIEKLILATL 466


>gi|328712825|ref|XP_001945477.2| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
           [Acyrthosiphon pisum]
          Length = 487

 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 41/128 (32%), Positives = 59/128 (46%), Gaps = 37/128 (28%)

Query: 125 GPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSAS 184
           G V A M +  + F Y+SGVY  S             +  G + G               
Sbjct: 368 GSVQAMMKVSKEFFMYESGVYKCSK------------LDLGSKTG--------------- 400

Query: 185 AEIVAYATVKLIGWGEE--NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                Y TV+++GWGEE  NGR   YW + +++G  +G+ G  +IL+G NE  IE  V  
Sbjct: 401 -----YHTVRIVGWGEEQQNGRTVKYWIVSNSWGLWWGESGYFRILKGTNECQIEDFVVA 455

Query: 241 ALPK-DNY 247
           A+P  DN+
Sbjct: 456 AMPDIDNF 463


>gi|156708118|gb|ABU93317.1| cathepsin B8 cysteine protease, partial [Monocercomonoides sp. PA]
          Length = 275

 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 54/244 (22%), Positives = 84/244 (34%), Gaps = 91/244 (37%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G +   W W+ K+G+ T         C P          Y +            P C
Sbjct: 121 CEGGYADRVWNWIQKKGITT-------EQCLP----------YVSGSGRV-------PTC 156

Query: 62  HTRCTN-DNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
            ++C N  N  R F           W +     +  E+  NGPV A   ++ D  +YKSG
Sbjct: 157 PSKCKNGSNIVRSFVSS--------WGSFNSKTVMDEVANNGPVYACFEVFEDFLNYKSG 208

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
                           I+ +K+G        +   +  V ++GWG ENG P         
Sbjct: 209 ----------------IYQHKTG--------KSKGWHHVMLMGWGTENGVP--------- 235

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                                    YW + +++G  +G+KG  +I RG N+  I+ +   
Sbjct: 236 -------------------------YWLLQNSWGSGWGEKGFFRIRRGTNDCHIDEIFYS 270

Query: 241 ALPK 244
            LPK
Sbjct: 271 GLPK 274


>gi|268572247|ref|XP_002648914.1| Hypothetical protein CBG17827 [Caenorhabditis briggsae]
          Length = 150

 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 36/114 (31%), Positives = 50/114 (43%), Gaps = 25/114 (21%)

Query: 59  PKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
           PKC   C    Y   + +DK      Y+V    + IQ EIM NGPV A+  +Y D + YK
Sbjct: 62  PKCALSC-QSKYNTEYAKDKNFGSSAYYVGRNFSVIQTEIMTNGPVEASFTVYEDFYIYK 120

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPY 172
            G                ++ Y +G        E++    +KI+GWG ENG  Y
Sbjct: 121 KG----------------VYQYTAG--------EVLGGHAIKIIGWGTENGTDY 150


>gi|443686962|gb|ELT90079.1| hypothetical protein CAPTEDRAFT_166233 [Capitella teleta]
          Length = 495

 Score = 54.7 bits (130), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 37/121 (30%), Positives = 58/121 (47%), Gaps = 30/121 (24%)

Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
           NGPV A M++  D ++Y+ GVY  S + +   Y  +     G+E                
Sbjct: 364 NGPVQAVMHVKDDFYTYERGVYKHSHAPKPANYPHL-----GKE---------------- 402

Query: 184 SAEIVAYATVKLIGWGEE----NGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
                AY +V++IGWG +    +   YW   +T+G  +G+ G  +I RG +E+ IES V 
Sbjct: 403 -----AYHSVRIIGWGTDYTGDDPIKYWLAANTWGRHWGEGGFFRIARGSDESHIESFVV 457

Query: 240 G 240
           G
Sbjct: 458 G 458


>gi|15150360|gb|AAK85411.1| cathepsin B-like protease [Trypanosoma rangeli]
          Length = 207

 Score = 54.7 bits (130), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 48/175 (27%), Positives = 73/175 (41%), Gaps = 38/175 (21%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G     W++  + G+V+         CQP  FPPC H   +T    C ++    P C
Sbjct: 65  CNGGDPDWAWLYYVETGIVS-------EFCQPYPFPPCAHHVNSTHYTPC-SVEYDTPFC 116

Query: 62  HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
           +  CTN          KY+ +  Y ++ E  D ++E          ++LY          
Sbjct: 117 NITCTNT-----IPPIKYKGRISYSLSGE-EDYKRE----------LFLY---------- 150

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
              GP      +Y D  +Y  GVY   +   +  +A V++VGWG  NG PYW I 
Sbjct: 151 ---GPFEVAFTVYEDFVAYSDGVYKHFSGNALGGHA-VRLVGWGNLNGTPYWKIA 201


>gi|300176830|emb|CBK25399.2| unnamed protein product [Blastocystis hominis]
          Length = 563

 Score = 54.3 bits (129), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 30/123 (24%), Positives = 55/123 (44%), Gaps = 35/123 (28%)

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y  GP+  ++ +  D+  YK G+Y  +  A+ + +A                        
Sbjct: 189 YARGPITCSIAVPDDLMEYKGGIYRDTTGAKTLDHA------------------------ 224

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                      + ++GWGEE+G+ YW   +++G  +G+KG  +I+RG N   IE+    A
Sbjct: 225 -----------ISVVGWGEEDGQKYWIARNSWGTFWGEKGWFRIVRGENNLGIEADCQWA 273

Query: 242 LPK 244
           +P+
Sbjct: 274 VPR 276



 Score = 41.6 bits (96), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 42/201 (20%), Positives = 76/201 (37%), Gaps = 64/201 (31%)

Query: 44  YTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGP 103
           Y   + EC  +A        RC +   G   +  K  +KRY            ++ + G 
Sbjct: 422 YEAIDKECNDMA--------RCMDCPPGEDCYPVK-DYKRY------------KVSEYGE 460

Query: 104 VVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVG 163
           V   M + ++IF+        GPV  +M +  +  +Y+ G++                  
Sbjct: 461 VKGEMEIKAEIFA-------RGPVSCSMIVTEEFLAYQGGIF------------------ 495

Query: 164 WGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGE-ENGRPYWTIVSTFGEQFGDKGT 222
                            V     IV Y  V++ GWGE E+G  YW   +++G  +G+ G 
Sbjct: 496 -----------------VDDRGHIVGYHAVEVAGWGETEDGTKYWIARNSWGPYWGEHGW 538

Query: 223 IKILRGRNEAIIESLVNGALP 243
            +++ G ++ +I    N  +P
Sbjct: 539 FRMIVGVSKGLITGYCNWGVP 559


>gi|168020784|ref|XP_001762922.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162685734|gb|EDQ72127.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 345

 Score = 54.3 bits (129), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 59/247 (23%), Positives = 89/247 (36%), Gaps = 80/247 (32%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGA--HHSNTGC-QPVSFPPCNHANYTTSEPECKTLATPQ 58
           C  G     W +  + G+VT     +    GC  P  +P      Y T            
Sbjct: 169 CEGGYPIRAWQYFKRTGVVTSKCDPYFDQKGCGHPGCYP-----TYDT------------ 211

Query: 59  PKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
           PKC  RC +D     +   K+     Y V+ E  ++  E+  NGP+        D+F   
Sbjct: 212 PKCFKRCVDDEL---WVSSKHLGVSAYEVSMEPEELMAELFTNGPIEVAF----DVFE-- 262

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
                            D   YK+GVY       I  +A VK+VGWG             
Sbjct: 263 -----------------DFAHYKTGVYKHLYGGYIGGHA-VKLVGWGT------------ 292

Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                 ++G  YW++V+++   +G+ GT +ILRG++E  IES  
Sbjct: 293 ---------------------TDDGVDYWSMVNSWNTNWGEDGTFRILRGKDECGIESNA 331

Query: 239 NGALPKD 245
              LP +
Sbjct: 332 VAGLPSN 338


>gi|168000937|ref|XP_001753172.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162695871|gb|EDQ82213.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 347

 Score = 54.3 bits (129), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 61/245 (24%), Positives = 89/245 (36%), Gaps = 80/245 (32%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGA--HHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
           C  G     W +    G+VT     +    GC   + P C    Y T E          P
Sbjct: 171 CEGGYPIRAWKYFKHSGVVTNKCDPYFDQKGC---AHPGC----YPTYE---------TP 214

Query: 60  KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
           KC  +C +D +   + Q K+     Y ++ E  D+  E+  NGPV     +Y D   YK+
Sbjct: 215 KCEKQCVDDEF---WVQSKHLGVNAYEMSMEPEDLMAELYTNGPVEVAFEVYEDFAHYKT 271

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVRV 178
           G            +Y  +F    G +A            VK++GWG  ++G  YWTIV  
Sbjct: 272 G------------VYKHLFGGFMGGHA------------VKLIGWGTTDDGVDYWTIVNS 307

Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
           +  +               WGE+                   G  +I+RG +E  IES  
Sbjct: 308 WNTN---------------WGED-------------------GLFRIVRGNDECGIESNA 333

Query: 239 NGALP 243
              LP
Sbjct: 334 VAGLP 338


>gi|355724272|gb|AES08175.1| tubulointerstitial nephritis antigen [Mustela putorius furo]
          Length = 476

 Score = 54.3 bits (129), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 59/247 (23%), Positives = 94/247 (38%), Gaps = 71/247 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAH-------HSNTGCQPVSFPPCNHANYTTSEPECKTL 54
           C+SG     W ++ KRGLV+   +        +N GC          A  + S+   K  
Sbjct: 284 CNSGSIDRAWWFLRKRGLVSHACYPLFKDQNATNDGC----------AMASRSDGRGKRH 333

Query: 55  ATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDI 114
           AT +P     C N+          Y+    Y V+    +I +EIM+NGPV A M ++ D 
Sbjct: 334 AT-KP-----CPNNIEKSNRI---YQCSPPYRVSSNETEIMKEIMQNGPVQAIMQVHEDF 384

Query: 115 FSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWT 174
           F YK+G                I+ + +     ++         VK+ GWG   G     
Sbjct: 385 FHYKTG----------------IYRHVTRTNEEASKYRKFQTHAVKLTGWGTLKG----- 423

Query: 175 IVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 234
                                    +     +W   +++G+ +G+ G  +ILRG NE+ I
Sbjct: 424 ------------------------AQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDI 459

Query: 235 ESLVNGA 241
           E L+  A
Sbjct: 460 EKLIIAA 466


>gi|126310154|ref|XP_001364630.1| PREDICTED: tubulointerstitial nephritis antigen [Monodelphis
           domestica]
          Length = 468

 Score = 54.3 bits (129), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 60/247 (24%), Positives = 92/247 (37%), Gaps = 71/247 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAH-------HSNTGCQPVSFPPCNHANYTTSEPECKTL 54
           C  G     W ++ KRGLV+   +        +N GC   S           S+   K  
Sbjct: 276 CKGGSIDRAWWYLRKRGLVSHACYPLFKDQIFNNNGCDMAS----------RSDGRGKRH 325

Query: 55  ATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDI 114
           AT +P     C N+          Y+    Y V+    +I +EIM+NGPV A M ++ D 
Sbjct: 326 AT-KP-----CPNNIEKSNRI---YQCSPPYRVSSNETEIMKEIMQNGPVQAIMQVHEDF 376

Query: 115 FSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWT 174
           F YKSG                I+ + + +   S     +    VK+ GWG   G     
Sbjct: 377 FHYKSG----------------IYRHINNLKDESEKYRNLRTHAVKLTGWGVLRG----- 415

Query: 175 IVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 234
                                    +     +W   +++G+ +G+ G  +ILRG NE+ I
Sbjct: 416 ------------------------AQGKKEKFWIAANSWGKSWGENGYFRILRGVNESDI 451

Query: 235 ESLVNGA 241
           E L+  A
Sbjct: 452 EKLIIAA 458


>gi|440907441|gb|ELR57591.1| Tubulointerstitial nephritis antigen [Bos grunniens mutus]
          Length = 476

 Score = 54.3 bits (129), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 59/247 (23%), Positives = 91/247 (36%), Gaps = 71/247 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAH-------HSNTGCQPVSFPPCNHANYTTSEPECKTL 54
           C+S      W ++ KRGLV+   +        +N GC          A  + S+   K  
Sbjct: 284 CNSESVDRAWWYLRKRGLVSHACYPLFKDQNATNNGC----------AMASRSDGRGKRH 333

Query: 55  ATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDI 114
           AT      T C N           Y+    Y V+    +I +EIM+NGPV A M ++ D 
Sbjct: 334 AT------TPCPNSIEKSNRI---YQCSPPYRVSSNETEIMREIMQNGPVQAIMQVHEDF 384

Query: 115 FSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWT 174
           F+YK+G                I+ + +     S          VK+ GWG   G     
Sbjct: 385 FNYKTG----------------IYRHITSTNEDSEKYRKFRTHAVKLTGWGTLRG----- 423

Query: 175 IVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 234
                                    +     +W   +++G+ +G+ G  +ILRG NE+ I
Sbjct: 424 ------------------------AQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDI 459

Query: 235 ESLVNGA 241
           E L+  A
Sbjct: 460 EKLIIAA 466


>gi|195121981|ref|XP_002005491.1| GI19039 [Drosophila mojavensis]
 gi|193910559|gb|EDW09426.1| GI19039 [Drosophila mojavensis]
          Length = 432

 Score = 54.3 bits (129), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 62/246 (25%), Positives = 87/246 (35%), Gaps = 82/246 (33%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G   + W ++HK+G+V       +  C P          YT     CK       + 
Sbjct: 253 CEGGHLDAAWRYLHKKGVV-------DETCYP----------YTQRRDSCKI------RH 289

Query: 62  HTRCTNDNYGR---GFFQDK-YRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
           ++R    N  R   G  +D  Y     Y +  E  DI  EI  +GPV A M +Y D FSY
Sbjct: 290 NSRSLKANGCRPAYGVNRDSLYTVGPAYSLKGET-DIMAEIYHSGPVQATMRVYRDFFSY 348

Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
             G Y                         +       + +VKIVGWGEE+         
Sbjct: 349 SGGVYRQ---------------------TAANRGAPTGFHSVKIVGWGEEH--------- 378

Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
                                   +G  YW   +++G  +G+ G  +ILRG NE  IE  
Sbjct: 379 ------------------------DGVKYWIAANSWGPWWGEHGYFRILRGSNECGIEEY 414

Query: 238 VNGALP 243
           V  + P
Sbjct: 415 VLASWP 420


>gi|15723272|gb|AAL06324.1| cathepsin B-like protease [Trypanosoma cruzi]
          Length = 208

 Score = 54.3 bits (129), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 41/146 (28%), Positives = 64/146 (43%), Gaps = 32/146 (21%)

Query: 31  CQPVSFPPC-NHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVND 89
           CQP  FP C +H N +   P      TP   C++ CT+    +     KYR      ++ 
Sbjct: 87  CQPYPFPSCAHHVNSSDLSPCSGEYDTPT--CNSTCTD----KKIPLIKYRGNTSCILSG 140

Query: 90  EVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSA 149
           E    ++E++ NGP   +  +Y+D  +Y  G                ++ + +GV+    
Sbjct: 141 E-ESFKRELLLNGPFEVSFSVYADFVAYTGG----------------VYKHVTGVF---- 179

Query: 150 SAEIVAYATVKIVGWGEENGRPYWTI 175
               +    V+IVGWGE NG PYW I
Sbjct: 180 ----LGGHAVRIVGWGELNGEPYWKI 201


>gi|294956046|ref|XP_002788796.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239904363|gb|EER20592.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 130

 Score = 54.3 bits (129), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 33/113 (29%), Positives = 49/113 (43%), Gaps = 35/113 (30%)

Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
           K   + NGPV+  + +Y DI  YK+GVY                                
Sbjct: 41  KQEIFTNGPVIGMLSIYEDIRVYKAGVY-------------------------------- 68

Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 230
              V  +       T+K+IGWG E+G+ YW  V+++ E++GD G IK+  GR 
Sbjct: 69  ---VHQTGSFQGIHTLKIIGWGVESGQDYWLAVNSWNEEWGDHGMIKLAVGRT 118



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 30/103 (29%), Positives = 46/103 (44%), Gaps = 24/103 (23%)

Query: 74  FFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYL 133
           + +D +R K +  +     +I+QEI  NGPV+  + +Y DI  YK+G Y           
Sbjct: 20  YIRDLHRAKSFGRLPAIPQNIKQEIFTNGPVIGMLSIYEDIRVYKAGVY----------- 68

Query: 134 YSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
                        V  +       T+KI+GWG E+G+ YW  V
Sbjct: 69  -------------VHQTGSFQGIHTLKIIGWGVESGQDYWLAV 98


>gi|6449322|gb|AAF08931.1| tubulointerstitial nephritis antigen isoform TIN-ag [Homo sapiens]
          Length = 476

 Score = 54.3 bits (129), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 61/247 (24%), Positives = 92/247 (37%), Gaps = 71/247 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAH-------HSNTGCQPVSFPPCNHANYTTSEPECKTL 54
           C+SG     W ++ KRGLV+   +        +N GC          A  + S+   K  
Sbjct: 284 CNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNGC----------AMASRSDGRGKRD 333

Query: 55  ATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDI 114
           AT +P     C N+          Y+    Y V+    +I +EIM+NGPV A M +  D 
Sbjct: 334 AT-KP-----CPNNVEKSNRI---YQCSPPYRVSSNETEIMKEIMQNGPVQAIMQVREDF 384

Query: 115 FSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWT 174
           F YK+G                I+ + +     S     +    VK+ GWG   G     
Sbjct: 385 FHYKTG----------------IYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRG----- 423

Query: 175 IVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 234
                                    +     +W   + +G+ +G+ G  +ILRG NE+ I
Sbjct: 424 ------------------------AQGQKEKFWIAANFWGKSWGENGYFRILRGVNESDI 459

Query: 235 ESLVNGA 241
           E LV  A
Sbjct: 460 EKLVIAA 466


>gi|253744515|gb|EET00718.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
          Length = 306

 Score = 53.9 bits (128), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 35/119 (29%), Positives = 56/119 (47%), Gaps = 32/119 (26%)

Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
           +GPV A+M +Y D   Y+SGVY     ++I ++A V+I+G+G  +               
Sbjct: 215 DGPVQASMAVYRDFLYYRSGVYRHVYGSQISSHA-VEIIGYGAAD--------------- 258

Query: 184 SAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 242
                           +E+  PYW + ++ G  +G++G   I+RG NE  IES V   L
Sbjct: 259 ----------------DEDSTPYWIVKNSLGSGWGEEGYFNIVRGSNECDIESAVYSGL 301


>gi|53850626|ref|NP_001005549.1| tubulointerstitial nephritis antigen precursor [Rattus norvegicus]
 gi|51858645|gb|AAH81887.1| Tubulointerstitial nephritis antigen [Rattus norvegicus]
 gi|149019129|gb|EDL77770.1| tubulointerstitial nephritis antigen [Rattus norvegicus]
          Length = 475

 Score = 53.9 bits (128), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 63/256 (24%), Positives = 95/256 (37%), Gaps = 89/256 (34%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+SG     W ++ KRGLV+                   HA Y    P  K  +T    C
Sbjct: 283 CNSGSIDRAWWFLRKRGLVS-------------------HACY----PLFKEQSTNNNSC 319

Query: 62  HTRCTNDNYGRGF--------FQDKYRFKRY---YWVNDEVADIQQEIMKNGPVVANMYL 110
                +D  G+          F+   R  +    Y ++    +I +EI++NGPV A M +
Sbjct: 320 AMASRSDGRGKRHATRPCPNSFEKSNRIYQCSPPYRISSNETEIMREIIQNGPVQAIMQV 379

Query: 111 YSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGR 170
           + D F YK+G Y                      + VS + E   Y              
Sbjct: 380 HEDFFYYKTGIY---------------------RHVVSTNEEPEKYRK------------ 406

Query: 171 PYWTIVRVYAVSASAEIVAYATVKLIGWGEENG-----RPYWTIVSTFGEQFGDKGTIKI 225
                +R +A            VKL GWG   G       +W   +++G+ +G+ G  +I
Sbjct: 407 -----LRTHA------------VKLTGWGTLRGAQGKKEKFWIAANSWGKSWGENGYFRI 449

Query: 226 LRGRNEAIIESLVNGA 241
           LRG NE+ IE L+  A
Sbjct: 450 LRGVNESDIEKLIIAA 465


>gi|145546673|ref|XP_001459019.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124426842|emb|CAK91622.1| unnamed protein product [Paramecium tetraurelia]
          Length = 476

 Score = 53.9 bits (128), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 38/120 (31%), Positives = 59/120 (49%), Gaps = 26/120 (21%)

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           + NGPVV N     D   Y  GV+          ++T+           P W I  +   
Sbjct: 375 HKNGPVVLNFEPSFDFMFYVGGVF----------HSTI-----------PDWIINGL--- 410

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
            A  E V + +V   GWGEENG  YW + +++G+Q+G+ G  ++ RG++E+ IES+   A
Sbjct: 411 -AKPEWVDH-SVLCYGWGEENGVKYWLLQNSWGKQWGENGRFRMKRGQDESSIESMAEAA 468


>gi|294871893|ref|XP_002766082.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239866672|gb|EEQ98799.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 118

 Score = 53.9 bits (128), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 33/113 (29%), Positives = 49/113 (43%), Gaps = 35/113 (30%)

Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
           K   + NGPV+  + +Y DI  YK+GVY                                
Sbjct: 29  KQEIFTNGPVIGALTIYEDIRVYKAGVY-------------------------------- 56

Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 230
              V  +       T+K+IGWG E+G+ YW  V+++ E++GD G IK+  GR 
Sbjct: 57  ---VHQTGSFQGIHTLKIIGWGVESGQDYWLAVNSWNEEWGDHGMIKLAVGRT 106



 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 30/103 (29%), Positives = 46/103 (44%), Gaps = 24/103 (23%)

Query: 74  FFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYL 133
           + +D +R K +  +     +I+QEI  NGPV+  + +Y DI  YK+G Y           
Sbjct: 8   YIRDLHRAKSFGRLPAIPQNIKQEIFTNGPVIGALTIYEDIRVYKAGVY----------- 56

Query: 134 YSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
                        V  +       T+KI+GWG E+G+ YW  V
Sbjct: 57  -------------VHQTGSFQGIHTLKIIGWGVESGQDYWLAV 86


>gi|63115212|gb|AAY33830.1| cathepsin B, partial [Siniperca chuatsi]
          Length = 69

 Score = 53.9 bits (128), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 22/53 (41%), Positives = 33/53 (62%)

Query: 192 TVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
            +K++GWGEE+G PYW   +++   +GD G  K LRG +   IES +   +PK
Sbjct: 17  AIKILGWGEEDGVPYWLCANSWNTDWGDNGFFKFLRGSDHCRIESEIVAGIPK 69


>gi|14789619|gb|AAH10745.1| Tubulointerstitial nephritis antigen [Mus musculus]
          Length = 475

 Score = 53.9 bits (128), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 60/246 (24%), Positives = 97/246 (39%), Gaps = 69/246 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+SG     W ++ KRGLV+              +P     N T +     + +  + K 
Sbjct: 283 CNSGSIDRAWWFLRKRGLVSHAC-----------YPLFKDQNTTNNICAMASRSDGRGKR 331

Query: 62  H-TRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           H T+   +++ +      Y+    Y V+    +I +EI++NGPV A M ++ D F YK+G
Sbjct: 332 HATKPCPNSFEKS--NRIYQCSPPYRVSSNETEIMREIIQNGPVQAIMQVHEDFFYYKTG 389

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
            Y                      + VS + E   Y                   +R +A
Sbjct: 390 IY---------------------RHVVSTNEEPEKYKK-----------------LRTHA 411

Query: 181 VSASAEIVAYATVKLIGWGEENG-----RPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
                       VKL GWG   G       +W   +++G+ +G+ G  +ILRG NE+ IE
Sbjct: 412 ------------VKLTGWGTLRGARGKKEKFWIAANSWGKSWGENGYFRILRGVNESDIE 459

Query: 236 SLVNGA 241
            L+  A
Sbjct: 460 KLIIAA 465


>gi|227499499|ref|NP_036163.3| tubulointerstitial nephritis antigen precursor [Mus musculus]
 gi|4929827|gb|AAD34171.1| tubulo-interstitial nephritis antigen [Mus musculus]
 gi|148694397|gb|EDL26344.1| tubulointerstitial nephritis antigen, isoform CRA_a [Mus musculus]
          Length = 475

 Score = 53.9 bits (128), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 60/246 (24%), Positives = 97/246 (39%), Gaps = 69/246 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+SG     W ++ KRGLV+              +P     N T +     + +  + K 
Sbjct: 283 CNSGSIDRAWWFLRKRGLVSHAC-----------YPLFKDQNTTNNICAMASRSDGRGKR 331

Query: 62  H-TRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           H T+   +++ +      Y+    Y V+    +I +EI++NGPV A M ++ D F YK+G
Sbjct: 332 HATKPCPNSFEKS--NRIYQCSPPYRVSSNETEIMREIIQNGPVQAIMQVHEDFFYYKTG 389

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
            Y                      + VS + E   Y                   +R +A
Sbjct: 390 IY---------------------RHVVSTNEEPEKYKK-----------------LRTHA 411

Query: 181 VSASAEIVAYATVKLIGWGEENG-----RPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
                       VKL GWG   G       +W   +++G+ +G+ G  +ILRG NE+ IE
Sbjct: 412 ------------VKLTGWGTLRGARGKKEKFWIAANSWGKSWGENGYFRILRGVNESDIE 459

Query: 236 SLVNGA 241
            L+  A
Sbjct: 460 KLIIAA 465


>gi|221505681|gb|EEE31326.1| cathepsin L, putative [Toxoplasma gondii VEG]
          Length = 733

 Score = 53.9 bits (128), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 41/120 (34%), Positives = 59/120 (49%), Gaps = 22/120 (18%)

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y NGPV         +FSY+SGVY  +++   V            +N  P+ T      +
Sbjct: 602 YNNGPVPVAFDAPPSLFSYRSGVYDANSNHARVC-----------DNDLPHHT-----GI 645

Query: 182 SASAEIVAYATVKLIGWGE---ENGRP--YWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
               E   +A V ++GWGE   ENG+P  YW + +T+G  +G  G +KI RG+N   IES
Sbjct: 646 LTGWEYTNHA-VTIVGWGETDGENGKPQKYWIVRNTWGPNWGVDGYVKIARGKNLGGIES 704


>gi|156708112|gb|ABU93314.1| cathepsin B5 cysteine protease [Monocercomonoides sp. PA]
          Length = 281

 Score = 53.9 bits (128), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 36/115 (31%), Positives = 51/115 (44%), Gaps = 35/115 (30%)

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y  GP  A   +Y D  SYKSGVY    + +++    V +VGWG E+G PY         
Sbjct: 193 YSRGPFEAAFSVYEDFKSYKSGVYH-HITGKMLGGHAVMVVGWGVEDGTPY--------- 242

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
                                    W I +++G  +G++G  KILRG+NE  IE+
Sbjct: 243 -------------------------WLIQNSWGTTWGEQGFFKILRGKNECGIET 272


>gi|237838179|ref|XP_002368387.1| cathepsin C [Toxoplasma gondii ME49]
 gi|211966051|gb|EEB01247.1| cathepsin C [Toxoplasma gondii ME49]
 gi|221484340|gb|EEE22636.1| cathepsin C, putative [Toxoplasma gondii GT1]
          Length = 733

 Score = 53.9 bits (128), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 41/120 (34%), Positives = 59/120 (49%), Gaps = 22/120 (18%)

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y NGPV         +FSY+SGVY  +++   V            +N  P+ T      +
Sbjct: 602 YNNGPVPVAFDAPPSLFSYRSGVYDANSNHARVC-----------DNDLPHHT-----GI 645

Query: 182 SASAEIVAYATVKLIGWGE---ENGRP--YWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
               E   +A V ++GWGE   ENG+P  YW + +T+G  +G  G +KI RG+N   IES
Sbjct: 646 LTGWEYTNHA-VTIVGWGETDGENGKPQKYWIVRNTWGPNWGVDGYVKIARGKNLGGIES 704


>gi|70919569|gb|AAZ15654.1| cathepsin C1 [Toxoplasma gondii]
          Length = 730

 Score = 53.9 bits (128), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 41/120 (34%), Positives = 59/120 (49%), Gaps = 22/120 (18%)

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y NGPV         +FSY+SGVY  +++   V            +N  P+ T      +
Sbjct: 599 YNNGPVPVAFDAPPSLFSYRSGVYDANSNHARVC-----------DNDLPHHT-----GI 642

Query: 182 SASAEIVAYATVKLIGWGE---ENGRP--YWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
               E   +A V ++GWGE   ENG+P  YW + +T+G  +G  G +KI RG+N   IES
Sbjct: 643 LTGWEYTNHA-VTIVGWGETDGENGKPQKYWIVRNTWGPNWGVDGYVKIARGKNLGGIES 701


>gi|256086900|ref|XP_002579622.1| cathepsin B (C01 family) [Schistosoma mansoni]
          Length = 204

 Score = 53.5 bits (127), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 40/165 (24%), Positives = 67/165 (40%), Gaps = 37/165 (22%)

Query: 83  RYYWVN-DEVADIQQEIMKNGPVVANMY--LYSDIFSYKSGKYGNGPVVANMYLYSDIFS 139
           R +WVN   +  I  E       V+     +Y+D    +     NGPV+A++ +  D   
Sbjct: 73  RDHWVNCSTIKQIHDECCCRADWVSEKIYNVYADQEDIQKEILMNGPVIASILVKVDFLV 132

Query: 140 YKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWG 199
           YKSGVY  +  +  + +  ++I+GWG E   P                            
Sbjct: 133 YKSGVYFPTPKSSNLGWINLRIIGWGYEGKTP---------------------------- 164

Query: 200 EENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
                 YW   +++ +++G+ G +K+ RG     IES V   +PK
Sbjct: 165 ------YWLCANSWSKEWGENGYVKVRRGVQAGYIESYVRAPIPK 203


>gi|146163742|ref|XP_001012227.2| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|146145940|gb|EAR91982.2| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 581

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 25/88 (28%), Positives = 47/88 (53%)

Query: 193 VKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDNYGVEFG 252
           + ++GWG ENG  YW + +++G  +G+KG  +++RG N   IES    A+PKD +  +  
Sbjct: 229 ISVVGWGVENGTKYWIVRNSWGSYWGEKGYFRLVRGINSLNIESDCAWAVPKDTWTNDVR 288

Query: 253 EESGERLSEEFGVRAESSEEFRENGEEE 280
             +    + +   R       +EN +++
Sbjct: 289 NTTASNTNSQSNFRQLHDCVRQENNQKD 316


>gi|156708116|gb|ABU93316.1| cathepsin B7 cysteine protease, partial [Monocercomonoides sp. PA]
          Length = 273

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 52/244 (21%), Positives = 85/244 (34%), Gaps = 91/244 (37%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G +   W W+ K+G+ T         C P          Y +            P C
Sbjct: 119 CNGGYADRVWNWIQKKGITT-------EQCIP----------YVSGSGRV-------PTC 154

Query: 62  HTRCTN-DNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
            ++C N  N  R F           W +     +  E+  NGPV A   ++ D ++Y+SG
Sbjct: 155 PSKCKNGSNIVRSFVSS--------WGSFNSKTVMDEVANNGPVYACFEVFEDFYNYRSG 206

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
                           ++ +K+G            +  V ++GWG ENG P         
Sbjct: 207 ----------------VYQHKTG--------RSQGWHHVMLMGWGTENGVP--------- 233

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                                    YW + +++G  +G+KG  +I RG N+  I+ +   
Sbjct: 234 -------------------------YWLLQNSWGSGWGEKGFFRIRRGTNDCHIDEIFYS 268

Query: 241 ALPK 244
            LPK
Sbjct: 269 GLPK 272


>gi|268563232|ref|XP_002638788.1| Hypothetical protein CBG05143 [Caenorhabditis briggsae]
          Length = 426

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 61/224 (27%), Positives = 100/224 (44%), Gaps = 48/224 (21%)

Query: 13  WVHKRGLVTGGAHHSNTGCQPVSFP-----PCNHANYTTSEPECKTLATPQPKCHTRCTN 67
           WV++ GLVTGG      GC+P SF      PC+ A +  +E E +T       C  RC N
Sbjct: 223 WVNQ-GLVTGG----RDGCRPYSFDLSCGVPCSPATFFEAE-EKRT-------CMRRCQN 269

Query: 68  DNYGRGFFQDKYRFKRY----YWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYG 123
             Y + + +DK+ F  +    Y  +  V+   +E +K   ++ +       F+ K+ +  
Sbjct: 270 IYYQQKYEEDKH-FATFAYSLYPRSMTVSPDGKERVKVPTIIGH-------FNDKNTEKL 321

Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
           N     N+ +  +I  Y     A     E + Y++                + R + +  
Sbjct: 322 NVTEYRNV-IKKEILLYGPTTMAFPVPEEFLHYSS---------------GVFRPFPLDG 365

Query: 184 -SAEIVAYATVKLIGWGE-ENGRPYWTIVSTFGEQFGDKGTIKI 225
               IV +  V+LIGWGE ++G+ YW  V++FG  +GD G  KI
Sbjct: 366 FDDRIVYWHVVRLIGWGESDDGQHYWLAVNSFGNHWGDNGIFKI 409


>gi|353228747|emb|CCD74918.1| cathepsin B (C01 family) [Schistosoma mansoni]
          Length = 229

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 33/135 (24%), Positives = 57/135 (42%), Gaps = 34/135 (25%)

Query: 110 LYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENG 169
           +Y+D    +     NGPV+A++ +  D   YKSGVY  +  +  + +  ++I+GWG E  
Sbjct: 128 VYADQEDIQKEILMNGPVIASILVKVDFLVYKSGVYFPTPKSSNLGWINLRIIGWGYEGK 187

Query: 170 RPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 229
            P                                  YW   +++ +++G+ G +K+ RG 
Sbjct: 188 TP----------------------------------YWLCANSWSKEWGENGYVKVRRGV 213

Query: 230 NEAIIESLVNGALPK 244
               IES V   +PK
Sbjct: 214 QAGYIESYVRAPIPK 228


>gi|300121514|emb|CBK22033.2| unnamed protein product [Blastocystis hominis]
          Length = 476

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 32/123 (26%), Positives = 54/123 (43%), Gaps = 35/123 (28%)

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y +GPV  ++ +  D+  YK G+Y               I G G +              
Sbjct: 101 YAHGPVTCSIDVPDDLLEYKGGIYEDKTG----------IAGDGHD-------------- 136

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                      + ++GWGEENG PYW + +++G  +G++G  +I+RG+N   IE      
Sbjct: 137 -----------ISVVGWGEENGIPYWIVRNSWGTYWGEEGFFRIVRGKNNLGIEEGCTYG 185

Query: 242 LPK 244
           +P+
Sbjct: 186 IPR 188



 Score = 37.7 bits (86), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 18/53 (33%), Positives = 30/53 (56%), Gaps = 2/53 (3%)

Query: 193 VKLIGWG--EENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
           V++ GWG  EE   PYW + +++G  +G+ G  +I  G+N   IE +    +P
Sbjct: 420 VEVTGWGVDEETRTPYWIVRNSWGTYWGENGWFRIAMGQNLLNIEQMCTWGVP 472


>gi|388499754|gb|AFK37943.1| unknown [Lotus japonicus]
          Length = 209

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 52/197 (26%), Positives = 70/197 (35%), Gaps = 60/197 (30%)

Query: 47  SEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVA 106
           S P C+  A   PKC  +C   N  + + + K+     Y V  +  DI  E+ KNGPV  
Sbjct: 53  SHPGCEP-AYQTPKCVRKCVKGN--QIWKKSKHFSVNAYSVKSDPYDIMAEVYKNGPVEV 109

Query: 107 NMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGE 166
              +Y D   YKSG            +Y  I   + G +A            VK++GWG 
Sbjct: 110 AFTVYEDFAHYKSG------------VYKHITGSQLGGHA------------VKLIGWGT 145

Query: 167 ENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKIL 226
                                             + G  YW I + +   +GD G   I 
Sbjct: 146 ---------------------------------TDEGEDYWLIANQWNRSWGDDGYFMIR 172

Query: 227 RGRNEAIIESLVNGALP 243
           RG NE  IE  V   LP
Sbjct: 173 RGTNECGIEEDVTAGLP 189


>gi|123377855|ref|XP_001298125.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
           [Trichomonas vaginalis G3]
 gi|121878571|gb|EAX85195.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
           [Trichomonas vaginalis G3]
          Length = 135

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 46/191 (24%), Positives = 76/191 (39%), Gaps = 66/191 (34%)

Query: 57  PQPKCHT-----RCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLY 111
           P   CH       C  +N  +  ++ ++   ++++  DE   I+ EI++NGPV A     
Sbjct: 5   PNTTCHPFELNWTCVQNNCKK--YKTQHNSHKFFYGEDE---IKNEILQNGPVTA----- 54

Query: 112 SDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRP 171
             +F  +                 D+  YKSGVY    S E  ++               
Sbjct: 55  --VFDVRP----------------DLAYYKSGVYQSVLSEEESSFQ-------------- 82

Query: 172 YWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 231
                                V + GWG+E   P+W I++++G  +G  G++K LRG N 
Sbjct: 83  -------------------HAVVIYGWGKEKETPFWWILNSYGPNWGINGSMKFLRGSNH 123

Query: 232 AIIESLVNGAL 242
             IE+ V+ AL
Sbjct: 124 CNIETHVSSAL 134


>gi|145517168|ref|XP_001444467.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124411889|emb|CAK77070.1| unnamed protein product [Paramecium tetraurelia]
          Length = 339

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 35/66 (53%), Gaps = 1/66 (1%)

Query: 125 GPVVANMYLYSDIFSYKSGVYAV-SASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
           GPVVA M +Y D   Y+ GVY V   +        +KI+GWGE+NG  YW I   +  S 
Sbjct: 255 GPVVAIMQVYKDFLVYRDGVYQVLEGTPRFHGGHAIKIIGWGEQNGYQYWIIENTWGTSW 314

Query: 184 SAEIVA 189
             E +A
Sbjct: 315 GTEGLA 320


>gi|312082955|ref|XP_003143660.1| hypothetical protein LOAG_08080 [Loa loa]
 gi|307761175|gb|EFO20409.1| hypothetical protein LOAG_08080 [Loa loa]
          Length = 339

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 43/150 (28%), Positives = 67/150 (44%), Gaps = 10/150 (6%)

Query: 103 PVVANMYLYSDIFSYKSGKYGNGPVVA----NMYLYSDIFSYKSGVYAVSASAEIVAYAT 158
           P ++      +I   +  K+ NG        N  +Y    SY+         +EI+    
Sbjct: 171 PYISGTTRKPEICYMQKSKHANGRQCPSGHPNSRVYRTTPSYRVSSREQDIMSEILTNGP 230

Query: 159 VKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEE--NGRP--YWTIVSTFG 214
           V+       +G  +   V  +  +   EI  Y +V+L+GWGE+   G P  YW   +++G
Sbjct: 231 VQATF--RVHGDFFIAGVYKHLPTVGEEIEGYHSVRLLGWGEDYSTGIPVKYWIAANSWG 288

Query: 215 EQFGDKGTIKILRGRNEAIIESLVNGALPK 244
             +G+ GT +ILRG N   IES V GA  K
Sbjct: 289 TNWGENGTFRILRGENHCEIESFVIGAWGK 318


>gi|197304333|dbj|BAG69285.1| cathepsin B-like cysteine protease [Raphanus sativus]
          Length = 343

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 62/244 (25%), Positives = 83/244 (34%), Gaps = 77/244 (31%)

Query: 2   CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
           C  G   S W +    G+VT     +   TGC   S P C  A Y T            P
Sbjct: 172 CDGGYPISAWQYFSYSGVVTEECDPYFDQTGC---SHPGCEPA-YNT------------P 215

Query: 60  KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
           +C  +C   N  + + + K+     Y V     DI  EI                     
Sbjct: 216 QCLRKCVGRN--QLWSESKHYSINTYVVESNPQDIMAEI--------------------- 252

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
             Y NGPV  +  +Y D   YKSGVY     + I  +A VK++GWG              
Sbjct: 253 --YKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGGHA-VKLIGWGT------------- 296

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
                                ++G  YW + + +   +GD G   I RG NE  IE    
Sbjct: 297 --------------------TDDGEDYWLLANQWNRSWGDDGYFMIRRGTNECGIEDEPV 336

Query: 240 GALP 243
             LP
Sbjct: 337 AGLP 340


>gi|380805035|gb|AFE74393.1| tubulointerstitial nephritis antigen-like isoform 3, partial
           [Macaca mulatta]
          Length = 129

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 46/168 (27%), Positives = 69/168 (41%), Gaps = 48/168 (28%)

Query: 64  RCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYG 123
           RC N +       D Y+    Y +     +I +E+M+NGPV A M ++ D F YK G Y 
Sbjct: 9   RCPNSHVNN---NDIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYS 65

Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
           + PV     L       + G +            +VKI GWGEE                
Sbjct: 66  HTPVS----LGRPERYRRHGTH------------SVKITGWGEET--------------- 94

Query: 184 SAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 231
              +    T+K           YWT  +++G  +G++G  +I+RG NE
Sbjct: 95  ---LPDGRTLK-----------YWTAANSWGPAWGERGHFRIVRGVNE 128


>gi|321478457|gb|EFX89414.1| hypothetical protein DAPPUDRAFT_303204 [Daphnia pulex]
          Length = 442

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 33/122 (27%), Positives = 54/122 (44%), Gaps = 36/122 (29%)

Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
           +GPV A M ++ D F Y+ GVY                                 Y+ + 
Sbjct: 345 HGPVQATMRVHPDFFLYRGGVYR--------------------------------YSGTN 372

Query: 184 SAEIVAYATVKLIGWG----EENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
           S +   Y +V+++GWG    + N   YW + +++G  +G+ G  +I+RG NE+ IE  V 
Sbjct: 373 SQQRSGYHSVRIVGWGVDSSKRNPTKYWLVANSWGRLWGEDGYFRIVRGENESDIEKFVL 432

Query: 240 GA 241
            A
Sbjct: 433 AA 434


>gi|6449324|gb|AAF08932.1|AF195117_1 tubulointerstitial nephritis antigen isoform TIN2 [Homo sapiens]
          Length = 333

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 40/157 (25%), Positives = 60/157 (38%), Gaps = 45/157 (28%)

Query: 85  YWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGV 144
           Y V+    +I +EIM+NGPV A M +  D F YK+G                I+ + +  
Sbjct: 212 YRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTG----------------IYRHVTST 255

Query: 145 YAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGR 204
              S     +    VK+ GWG   G                              +    
Sbjct: 256 NKESEKYRKLQTHAVKLTGWGTRRG-----------------------------AQGQKE 286

Query: 205 PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
            +W   + +G+ +G+ G  +ILRG NE+ IE LV  A
Sbjct: 287 KFWIAANFWGKSWGENGYFRILRGVNESDIEKLVIAA 323


>gi|290975216|ref|XP_002670339.1| cathepsin B-like cysteine proteinase [Naegleria gruberi]
 gi|284083897|gb|EFC37595.1| cathepsin B-like cysteine proteinase [Naegleria gruberi]
          Length = 350

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 39/120 (32%), Positives = 49/120 (40%), Gaps = 33/120 (27%)

Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
           NGP+   M +Y D +SYKSGVY    S   V    VKIVGW                   
Sbjct: 262 NGPIQVAMGVYRDFYSYKSGVYH-HVSGRYVGGHAVKIVGW------------------- 301

Query: 184 SAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
                        G+   +  PYW   +++GE +G KG   ILRGR E  I  +V    P
Sbjct: 302 -------------GYDSASKLPYWICANSWGEDWGIKGYFWILRGRGECGIGKMVWSGKP 348


>gi|354483193|ref|XP_003503779.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cricetulus
           griseus]
          Length = 475

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 59/246 (23%), Positives = 97/246 (39%), Gaps = 69/246 (28%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+SG     W ++ KRGLV+              +P     N T +     + +  + K 
Sbjct: 283 CNSGSIDRAWWFLRKRGLVSHAC-----------YPLFKDQNTTNNICAMASRSDGRGKR 331

Query: 62  H-TRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           H T+   +++ +      Y+    Y V+    +I +EI++NGPV A M ++ D F YK+G
Sbjct: 332 HATKPCPNSFEKS--NRIYQCSPPYRVSSNETEIMREIIRNGPVQAIMQVHEDFFYYKTG 389

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
            Y                      + +S + E   Y                   +R +A
Sbjct: 390 IY---------------------RHVISTNEESEKYRK-----------------LRSHA 411

Query: 181 VSASAEIVAYATVKLIGWGEENG-----RPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
                       VKL GWG   G       +W   +++G+ +G+ G  +ILRG NE+ IE
Sbjct: 412 ------------VKLTGWGTLRGAGGKKEKFWIAANSWGKSWGENGYFRILRGVNESDIE 459

Query: 236 SLVNGA 241
            L+  A
Sbjct: 460 KLIIAA 465


>gi|123478051|ref|XP_001322190.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
           [Trichomonas vaginalis G3]
 gi|121905031|gb|EAY09967.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
           [Trichomonas vaginalis G3]
          Length = 288

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 35/128 (27%), Positives = 51/128 (39%), Gaps = 35/128 (27%)

Query: 111 YSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGR 170
           Y+ I   + G    GPV  ++ +YSD+  YKSG+Y                         
Sbjct: 190 YASIEEMQIGIMTEGPVTTSLKVYSDLMYYKSGIYT------------------------ 225

Query: 171 PYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 230
                          E + +  V++IGWG +NG  YW I +++   +G  G   I RG N
Sbjct: 226 -----------HTKGEFLGHHAVEIIGWGTKNGIDYWIISNSWNTTWGMNGLFLIKRGVN 274

Query: 231 EAIIESLV 238
           E  IE  V
Sbjct: 275 ECHIEDYV 282



 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 45/175 (25%), Positives = 69/175 (39%), Gaps = 54/175 (30%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  GI  + W ++  RGL           CQP         N T            +  C
Sbjct: 132 CGGGIEVNAWRYIDLRGLPL-------DSCQPYD------GNIT------------KYNC 166

Query: 62  HTRCTNDNYG-RGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
             +CTN++      F + +   RY      + ++Q  IM  GPV  ++ +YSD+  YKSG
Sbjct: 167 SKKCTNESETYEAQFTEYWSVARY----ASIEEMQIGIMTEGPVTTSLKVYSDLMYYKSG 222

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
                           I+++  G        E + +  V+I+GWG +NG  YW I
Sbjct: 223 ----------------IYTHTKG--------EFLGHHAVEIIGWGTKNGIDYWII 253


>gi|6562772|emb|CAB62590.1| putative cathepsin B-like protease [Pisum sativum]
          Length = 174

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 63/247 (25%), Positives = 88/247 (35%), Gaps = 79/247 (31%)

Query: 2   CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
           C  G   S W +    G+VT     +    GC   S P C        EP  +T     P
Sbjct: 1   CDGGYPISAWKYFAHHGVVTEECDPYFDQIGC---SHPGC--------EPGYQT-----P 44

Query: 60  KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
           KC  +C   N  + + + K+   + Y VN +  +I +E+ KNGPV     +Y D   YKS
Sbjct: 45  KCVRKCVKGN--QVWKKSKHYSVKPYKVNSDPQNIMEEVYKNGPVEVAFSVYEDFAHYKS 102

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
           G Y +                      ++ SA  +    VK+ GWG              
Sbjct: 103 GVYKH----------------------ITGSA--LGGHAVKLNGWGT------------- 125

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
                                + G  YW + + +   +GD G  KI RG NE  IE  V 
Sbjct: 126 --------------------SDEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGIEEDVT 165

Query: 240 GA--LPK 244
               LPK
Sbjct: 166 AVCLLPK 172


>gi|294916338|ref|XP_002778359.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239886683|gb|EER10154.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 105

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 36/121 (29%), Positives = 56/121 (46%), Gaps = 37/121 (30%)

Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
           +GPV A+  +Y D  +Y+SGVY  ++ + +  +A                          
Sbjct: 21  DGPVSASFTVYEDFLAYRSGVYKHTSGSYLGGHA-------------------------- 54

Query: 184 SAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
                    VK+IGWGE++G+ YW  V+++ E +GD G  KI  G N  I + L+ G  P
Sbjct: 55  ---------VKIIGWGEKSGQAYWLAVNSWNEDWGDHGLFKIALG-NCGIDDDLLGGT-P 103

Query: 244 K 244
           K
Sbjct: 104 K 104


>gi|300121248|emb|CBK21629.2| unnamed protein product [Blastocystis hominis]
          Length = 559

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 34/123 (27%), Positives = 53/123 (43%), Gaps = 35/123 (28%)

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y  GP+   + +  D   YK G+Y   + A                       + +V+A+
Sbjct: 190 YARGPITCGIAVPQDFVDYKGGIYKDESGA-----------------------VEKVHAI 226

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
           S            ++GWGEENG  YW   +++G  +G++G  +I RG N   IES    A
Sbjct: 227 S------------VVGWGEENGEKYWIGRNSWGNYWGEEGWFRIARGINNLAIESECQWA 274

Query: 242 LPK 244
           +PK
Sbjct: 275 VPK 277



 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/60 (43%), Positives = 34/60 (56%), Gaps = 1/60 (1%)

Query: 114 IFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYW 173
           I + K+  +  GPV  +M +      Y  GVY  S S+ +VA   V+I GWG ENGRPYW
Sbjct: 462 IDAIKAEIFARGPVSCSMTVRESFLDYHGGVYE-SDSSPMVAGHIVEIAGWGVENGRPYW 520


>gi|294900111|ref|XP_002776905.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
 gi|239884106|gb|EER08721.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
          Length = 207

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 26/66 (39%), Positives = 36/66 (54%)

Query: 57  PQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFS 116
           P   C T CTN  Y     +D +R K +  V ++V +I+QEI  +GPV +   +Y D   
Sbjct: 94  PLSSCQTTCTNKAYKTSLEKDVHRAKDWRKVPNDVQNIKQEIFDDGPVCSAFKMYEDFRY 153

Query: 117 YKSGKY 122
           YKSG Y
Sbjct: 154 YKSGVY 159


>gi|395734831|ref|XP_003776483.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin B-like [Pongo abelii]
          Length = 350

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 59/243 (24%), Positives = 86/243 (35%), Gaps = 66/243 (27%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPV-SFPPCNHANYTTSEPECKTLATPQPK 60
           C+ G  +  W +   +GLV+GG + S+ GC+   S  PC H  +    P   T     PK
Sbjct: 164 CNGGXPNEGWNFWTGKGLVSGGLYDSHVGCRLFPSLLPCKH--HIHGXPYVXT--GDSPK 219

Query: 61  CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
           C   C     G+ +  DK+     Y ++D   DI   I KN  V     +Y D   YK  
Sbjct: 220 CSMTCEP---GQTYKXDKHYGCSSYSISDSTKDIMTNIYKNDXVEEAFSVYLDFLMYKFK 276

Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
           +Y                    GV     + E+     + I+G   EN   Y        
Sbjct: 277 EY-------------------QGV-----TGEMXGGHAICILGCKVENSTSY-------- 304

Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                                     W + + +   +GD G  KILRG++   IES V  
Sbjct: 305 --------------------------WLVANXWNRDWGDNGFFKILRGQDHYGIESEVVA 338

Query: 241 ALP 243
            +P
Sbjct: 339 EIP 341


>gi|114153242|gb|ABI52787.1| cathepsin B-like protein [Argas monolakensis]
          Length = 91

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 22/53 (41%), Positives = 33/53 (62%)

Query: 192 TVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
            +++IGWG E   PYW + +++  ++GD G  KILRG NE  IE  +   +PK
Sbjct: 38  AIRIIGWGVEEDVPYWLVANSWNREWGDNGYFKILRGSNECGIEDDIVAGIPK 90


>gi|21695|emb|CAA46812.1| cathepsin B [Triticum aestivum]
          Length = 310

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 48/178 (26%), Positives = 71/178 (39%), Gaps = 43/178 (24%)

Query: 2   CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
           C+ G   S W +  + G+VT     +   TGCQ                P C+  A P P
Sbjct: 165 CNGGYPISAWRYFRRSGVVTEECDPYFDQTGCQ---------------HPGCEP-AYPTP 208

Query: 60  KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
           KC  +C  +N  + + ++K+     Y V+    DI  E+ KNGPV    + Y  I     
Sbjct: 209 KCQRKCKVEN--QAWKENKHFSVNAYRVHSNPHDIMAEVYKNGPVEV-AFTYCQIL---- 261

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEEN-GRPYWTIV 176
                           D   YKSGVY    +  ++    VK++GWG  + G  YW + 
Sbjct: 262 ----------------DFAHYKSGVYK-HITGGVMGGHAVKLIGWGTSDAGEDYWLLA 302


>gi|161343837|tpg|DAA06099.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 255

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 35/106 (33%), Positives = 51/106 (48%), Gaps = 10/106 (9%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G     W    KRGLVTGG + S  GC+P   PPC +     +E        P+   
Sbjct: 157 CNGGYPIKAWERFKKRGLVTGGDYQSGEGCEPYRVPPCPY----DAEGHNTCAGKPRESN 212

Query: 62  HTRCTNDNYGRG--FFQDKYRFKR-YYWVNDEVADIQQEIMKNGPV 104
           H RCT   YG     F + +R+ R +Y++      IQ+++M  GP+
Sbjct: 213 H-RCTRMCYGNXDLDFDEDHRYTRDFYYLT--YGSIQKDVMTYGPI 255


>gi|341891034|gb|EGT46969.1| hypothetical protein CAEBREN_30419 [Caenorhabditis brenneri]
          Length = 422

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 61/224 (27%), Positives = 99/224 (44%), Gaps = 48/224 (21%)

Query: 13  WVHKRGLVTGGAHHSNTGCQPVSFP-----PCNHANYTTSEPECKTLATPQPKCHTRCTN 67
           WV++ GLVTGG      GC+P SF      PC+ A +  +E E +T       C  RC N
Sbjct: 219 WVNQ-GLVTGG----RDGCRPYSFDLSCGVPCSPATFFEAE-EKRT-------CMRRCQN 265

Query: 68  DNYGRGFFQDKYRFKRY----YWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYG 123
             Y + + +DK+ F  +    Y  +  V+   +E +K   ++ +       F+ K+ +  
Sbjct: 266 IYYQQRYEEDKH-FATFAYSLYPRSMTVSPDGKERVKVPTIIGH-------FNDKNTEKL 317

Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
           N     N+ +  +I  Y     A     E + Y++                + R + +  
Sbjct: 318 NVTEYRNV-IKKEILLYGPTTMAFPVPEEFLHYSS---------------GVFRPFPLDG 361

Query: 184 -SAEIVAYATVKLIGWGE-ENGRPYWTIVSTFGEQFGDKGTIKI 225
               IV +  V+LIGWG+ E+G  YW  V++FG  +GD G  KI
Sbjct: 362 FDDRIVYWHVVRLIGWGQSEDGTHYWLAVNSFGSHWGDNGLFKI 405


>gi|161343845|tpg|DAA06103.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
          Length = 261

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 37/111 (33%), Positives = 51/111 (45%), Gaps = 12/111 (10%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G     W    KRGLVTGG + S  GC+P   PPC +     +E        P+   
Sbjct: 157 CNGGYPIKAWERFKKRGLVTGGDYQSGEGCEPYRVPPCPY----DAEGHNTCAGKPRESN 212

Query: 62  HTRCTNDNYGRG--FFQDKYRFKR--YYWVNDEVADIQQEIMKNGPVVANM 108
           H RCT   YG     F + +R+ R  YY        IQ+++M  GP+ A+ 
Sbjct: 213 H-RCTRMCYGNQDLDFDEDHRYTRDSYYLT---YGSIQKDVMTYGPIEASF 259


>gi|401401997|ref|XP_003881145.1| hypothetical protein NCLIV_041870 [Neospora caninum Liverpool]
 gi|325115557|emb|CBZ51112.1| hypothetical protein NCLIV_041870 [Neospora caninum Liverpool]
          Length = 736

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 40/120 (33%), Positives = 56/120 (46%), Gaps = 22/120 (18%)

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y NGPV         +FSY SG+Y  ++S   V            +N  P+ +      V
Sbjct: 603 YKNGPVPVAFDAPPSLFSYSSGIYDANSSHARVC-----------DNDSPHCS-----GV 646

Query: 182 SASAEIVAYATVKLIGWGEENG-----RPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
               E   +A V L+GWGE N      R YW + +T+G  +G +G +KI RG+N   IES
Sbjct: 647 LTGWEYTNHA-VTLVGWGETNAENEKPRKYWIVRNTWGPNWGVQGYLKIARGKNLGGIES 705


>gi|300176576|emb|CBK24241.2| unnamed protein product [Blastocystis hominis]
          Length = 563

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 31/122 (25%), Positives = 56/122 (45%), Gaps = 35/122 (28%)

Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
           Y NGP+   + + +D+ +Y++G+++ + S+ +  +                         
Sbjct: 164 YYNGPITCKISVTNDLQNYRNGIFSRNTSSSLYDHY------------------------ 199

Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
                      V +IGWG EN  PYW + +++G  +G+ G  +ILRG N   IES  + A
Sbjct: 200 -----------VNIIGWGSENETPYWIVRNSWGSSWGEDGYFRILRGVNLLGIESSCSYA 248

Query: 242 LP 243
           +P
Sbjct: 249 VP 250



 Score = 40.0 bits (92), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 20/52 (38%), Positives = 32/52 (61%), Gaps = 1/52 (1%)

Query: 193 VKLIGWGE-ENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
           V+++GWG  E G  YW   + +GE +G+KG  +I+ G N  +IES  +  +P
Sbjct: 509 VEVVGWGRTEEGVEYWIGRNNWGENWGEKGWFRIMMGGNNLLIESSCSWGVP 560


>gi|48762499|dbj|BAD23819.1| cathepsin B-N [Tuberaphis styraci]
 gi|48762501|dbj|BAD23820.1| cathepsi B-N [Tuberaphis coreana]
          Length = 105

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 38/112 (33%), Positives = 51/112 (45%), Gaps = 10/112 (8%)

Query: 5   GISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTR 64
           G     W    K GLVTGG + S  GCQP   PPC    Y  +   C+    P  K H R
Sbjct: 1   GYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPCPLDEYGNN--TCR--GKPAEKNH-R 55

Query: 65  CTNDNYGR---GFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSD 113
           CT   YG     F +D +  +  Y++      IQ +I+  GP+ A+  +Y D
Sbjct: 56  CTRMCYGNQDLDFKEDHHYTRDAYYLT--YGTIQNDILAYGPIEASFEVYDD 105


>gi|156375635|ref|XP_001630185.1| predicted protein [Nematostella vectensis]
 gi|156217201|gb|EDO38122.1| predicted protein [Nematostella vectensis]
          Length = 311

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 30/120 (25%), Positives = 52/120 (43%), Gaps = 35/120 (29%)

Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
           NGPV A+  ++ D ++Y+SG+Y  +   ++  +A                          
Sbjct: 225 NGPVEADFTIFQDFYAYRSGIYVHATGKQLGGHA-------------------------- 258

Query: 184 SAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
                    +K++GWG E+   YW   +++G  +G +G  KI RG +E  IE  +   LP
Sbjct: 259 ---------IKILGWGTEDNVDYWLCANSWGANWGIQGYFKIRRGTDECGIEDGLAAGLP 309


>gi|308485822|ref|XP_003105109.1| hypothetical protein CRE_20700 [Caenorhabditis remanei]
 gi|308257054|gb|EFP01007.1| hypothetical protein CRE_20700 [Caenorhabditis remanei]
          Length = 410

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 58/224 (25%), Positives = 97/224 (43%), Gaps = 48/224 (21%)

Query: 13  WVHKRGLVTGGAHHSNTGCQPVSFP-----PCNHANYTTSEPECKTLATPQPKCHTRCTN 67
           WV++ GLVTGG      GC+P SF      PC+ A +  +E         +  C  RC N
Sbjct: 207 WVNQ-GLVTGG----RDGCRPYSFDLSCGVPCSPATFFEAEE--------KRTCMRRCQN 253

Query: 68  DNYGRGFFQDKYRFKRY----YWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYG 123
             Y + + +DK+ F  +    Y  +  V+   +E +K   ++ +       F+ K+ +  
Sbjct: 254 IYYQQKYEEDKH-FATFAYSMYPRSMTVSPDGKERVKVPTIIGH-------FNDKNTEKL 305

Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
           N     N+ +  +I  Y     A     E + Y++                + R + +  
Sbjct: 306 NVTEYRNV-IKKEILLYGPTTMAFPVPEEFLHYSS---------------GVFRPFPLDG 349

Query: 184 -SAEIVAYATVKLIGWGEE-NGRPYWTIVSTFGEQFGDKGTIKI 225
               IV +  V+LIGWGE  +G+ YW  +++FG  +GD G  KI
Sbjct: 350 FDDRIVYWHVVRLIGWGESGDGQHYWLAINSFGNHWGDNGLFKI 393


>gi|300121294|emb|CBK21674.2| unnamed protein product [Blastocystis hominis]
          Length = 561

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 26/80 (32%), Positives = 44/80 (55%), Gaps = 15/80 (18%)

Query: 180 AVSASAEIVAYA---------------TVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIK 224
           A+ A+ E+VAY                 + ++GWGEE+G+ YW + +++G  +G+ G  +
Sbjct: 197 ALDATDELVAYKGGIFEDKTGTTSLNHAISVVGWGEEDGKKYWIVRNSWGTYWGENGWFR 256

Query: 225 ILRGRNEAIIESLVNGALPK 244
           I+RG N   IES    A+P+
Sbjct: 257 IVRGTNNLGIESECTWAVPR 276


>gi|242001446|ref|XP_002435366.1| cysteine proteinase, putative [Ixodes scapularis]
 gi|215498696|gb|EEC08190.1| cysteine proteinase, putative [Ixodes scapularis]
          Length = 238

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 53/197 (26%), Positives = 75/197 (38%), Gaps = 61/197 (30%)

Query: 45  TTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPV 104
           TT     + + T  P C T   +  Y   F    YR       N+E  DI QEI  NGPV
Sbjct: 69  TTCRIARRRVPTEDPICPTGRQDQKY---FSTPPYRVP----ANEE--DIMQEIYANGPV 119

Query: 105 VANMYLYSDIFSYKSGKYGNGPVVANM---YLYSDIFSYKSGVYAVSASAEIVAYATVKI 161
            A M +  D F Y SG Y +  +  N+   Y  SD                   + +V+I
Sbjct: 120 QALMLVKEDFFLYSSGVYKHTRLAHNLPPEYQKSD-------------------WHSVRI 160

Query: 162 VGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKG 221
           +GWG                    +   Y   K           YW   +++G  +G+ G
Sbjct: 161 LGWG-------------------VDRTQYRPQK-----------YWLCANSWGSGWGENG 190

Query: 222 TIKILRGRNEAIIESLV 238
             +I+RG +E+ IES V
Sbjct: 191 YFRIVRGEDESQIESFV 207


>gi|449670327|ref|XP_002160467.2| PREDICTED: dipeptidyl peptidase 1-like [Hydra magnipapillata]
          Length = 458

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 23/47 (48%), Positives = 36/47 (76%)

Query: 197 GWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
           G+GEE+G+ YW + +++GE++G+KG  +I RG +E  IESLV  A+P
Sbjct: 405 GYGEEDGQKYWIVKNSWGEEWGEKGYFRIRRGTDEIAIESLVVYAVP 451


>gi|291000228|ref|XP_002682681.1| predicted protein [Naegleria gruberi]
 gi|284096309|gb|EFC49937.1| predicted protein [Naegleria gruberi]
          Length = 225

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 59/239 (24%), Positives = 84/239 (35%), Gaps = 88/239 (36%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGA--HHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
           C  GI  + W+++   G+VT     + S  G  P     CN     TS P          
Sbjct: 73  CDGGILWAAWIYLKHTGIVTDQCLPYSSGNGVAPSCPKYCN----GTSTP---------- 118

Query: 60  KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
                             KY+ K +Y V      I  EI  NGPV +   +Y D  SYKS
Sbjct: 119 --------------IDSVKYKAKDWYEVGSIAEKIMNEIATNGPVQSGFSVYQDFMSYKS 164

Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
           G                ++++++G +        +    +KIVGWG EN           
Sbjct: 165 G----------------VYTHQTGSF--------LGGHAIKIVGWGVEN----------- 189

Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                        VK           YW + +++G  +G  G  KI RG NE  IE+ V
Sbjct: 190 ------------NVK-----------YWLVANSWGPDWGLNGLFKIKRGDNECGIEADV 225


>gi|405963121|gb|EKC28721.1| Tubulointerstitial nephritis antigen-like protein [Crassostrea
           gigas]
          Length = 464

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 35/121 (28%), Positives = 55/121 (45%), Gaps = 29/121 (23%)

Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
           K+  Y NGPV A   + SD F Y+SGVY  + +    +  +V+I+GWGE+          
Sbjct: 328 KAEIYRNGPVQATFRVSSDFFMYRSGVYRHTGADLGESRLSVRIIGWGEKTN-------- 379

Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
                                 +   R YW  ++++G ++G+KG  +I+RG N   IE  
Sbjct: 380 ---------------------KKGKKRKYWICLNSWGTKWGEKGAFRIVRGENHLGIEEN 418

Query: 238 V 238
           V
Sbjct: 419 V 419


>gi|145486176|ref|XP_001429095.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124396185|emb|CAK61697.1| unnamed protein product [Paramecium tetraurelia]
          Length = 464

 Score = 51.2 bits (121), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 34/120 (28%), Positives = 54/120 (45%), Gaps = 29/120 (24%)

Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
           NGPVV +     D   Y+SG+Y   + AE   Y+               W  V       
Sbjct: 334 NGPVVLSFEPSYDFMYYESGIY--HSKAETSDYSE--------------WEKVD------ 371

Query: 184 SAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
                   +V   GWGEE G  +W + +++G+Q+G+ G  ++ RG +E+ IES+   + P
Sbjct: 372 -------HSVLCYGWGEEEGVKFWMLQNSWGDQWGESGNFRMKRGVDESAIESMAEASDP 424


>gi|118378294|ref|XP_001022323.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89304090|gb|EAS02078.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 497

 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 44/142 (30%), Positives = 66/142 (46%), Gaps = 40/142 (28%)

Query: 97  EIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAY 156
           EIMKNGP+VAN    +D F Y                      YKSGVY    +A+ +  
Sbjct: 380 EIMKNGPIVANFKTSAD-FVY----------------------YKSGVYHSVEAADWILK 416

Query: 157 ATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWG--EENGRPYWTIVSTFG 214
             V+          P W  V  +AV    +      +   GWG  EE+G+ +W + +++G
Sbjct: 417 CEVE----------PEWRPVE-HAVMCQHQ---QQFLNSYGWGESEEDGK-FWLMQNSWG 461

Query: 215 EQFGDKGTIKILRGRNEAIIES 236
           + +G+KG  KI RG +E+ +ES
Sbjct: 462 DDWGEKGRFKIRRGTDESFVES 483


>gi|58617822|gb|AAW80530.1| cathepsin L-like cysteine protease [Leishmania infantum]
          Length = 234

 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 38/129 (29%), Positives = 64/129 (49%), Gaps = 14/129 (10%)

Query: 109 YLYSDIFSYKSGKY--GNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGE 166
           ++Y  +F+ KS  Y  GNG V   +     +   +   Y +  S E V  A      W  
Sbjct: 66  HMYGIVFTEKSYPYTSGNGDVPECLNSSKLVPGAQIDGYVMIPSNETVMAA------WLA 119

Query: 167 ENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKIL 226
           ENG          AV AS+ +   + V L+G+ +  G PYW I +++GE +G+KG ++++
Sbjct: 120 ENGP------IAIAVDASSFMSYQSGVLLVGYNKTGGVPYWVIKNSWGEDWGEKGYVRVV 173

Query: 227 RGRNEAIIE 235
            GRN  +++
Sbjct: 174 MGRNACLLK 182


>gi|224285256|gb|ACN40354.1| unknown [Picea sitchensis]
          Length = 350

 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 55/245 (22%), Positives = 90/245 (36%), Gaps = 80/245 (32%)

Query: 2   CSSGISSSTWVWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
           C+ G   S W +  +RG+VT     +  N GC           N+   EP     + P P
Sbjct: 163 CNGGFPLSAWRYFSRRGVVTDECDPYFDNDGC-----------NHPGCEP-----SYPTP 206

Query: 60  KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMY-LYSDIFSYK 118
           +C   C ++         ++   ++Y                    AN Y + SD ++  
Sbjct: 207 RCVKNCKDNQ--------RWSHSKHY-------------------SANAYRIKSDPYNIM 239

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
           +  + NGPV  +  +Y D   Y++GVY       +  +A VK++GWG             
Sbjct: 240 AEVFNNGPVEVSFSVYEDFAHYETGVYKHVQGRYLGGHA-VKLIGWGT------------ 286

Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
                                 ++G  YW I +++   +G+ G  KI RG NE  IE   
Sbjct: 287 ---------------------TDDGIDYWLIANSWNTAWGEGGYFKIARGVNECGIERDP 325

Query: 239 NGALP 243
              +P
Sbjct: 326 VAGMP 330


>gi|168026641|ref|XP_001765840.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162683017|gb|EDQ69431.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 339

 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 59/249 (23%), Positives = 87/249 (34%), Gaps = 82/249 (32%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGA--HHSNTGC-QPVSFPPCNHANYTTSEPECKTLATPQ 58
           C  G     W +  + G+VT     +    GC  P  +P      Y T            
Sbjct: 163 CDGGYPIRAWRYFKRTGVVTSKCDPYFDQIGCGHPGCYP-----TYRT------------ 205

Query: 59  PKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
           PKC   C +D     + + K+     Y V+ E  D+  E+                    
Sbjct: 206 PKCVKHCVDDEL---WVKSKHLSVNAYEVSKEPEDLMAEL-------------------- 242

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGE-ENGRPYWTIVR 177
              Y NGP+  +  ++ D   YK+GVY       I  +A VK++GWG  ++G  YW    
Sbjct: 243 ---YTNGPIEVSFEVFEDFAHYKTGVYKHVYGRYIGGHA-VKLIGWGTTDDGVDYW---- 294

Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
                                         TIV+++   +G+ G  +I RG NE  IES 
Sbjct: 295 ------------------------------TIVNSWNTNWGEHGLFRIARGGNECGIESY 324

Query: 238 VNGALPKDN 246
               LP D 
Sbjct: 325 AVAGLPFDK 333


>gi|255087666|ref|XP_002505756.1| cathepsin B-like cysteine proteinase [Micromonas sp. RCC299]
 gi|226521026|gb|ACO67014.1| cathepsin B-like cysteine proteinase [Micromonas sp. RCC299]
          Length = 273

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 41/136 (30%), Positives = 56/136 (41%), Gaps = 21/136 (15%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G S+  + +    G+VTGG       C P  F PC+H         C+    P P C
Sbjct: 87  CEGGESADAYEFAKSNGVVTGGGFDDQNTCAPYPFAPCHH--------PCEVF--PTPAC 136

Query: 62  HTRC---TNDNYGRGFFQDKYRFKRYYWVNDEVAD---IQQEIMKNGPVVANM-YLYSDI 114
              C   +ND    G    K  FK    V+    D   +  EI  NGPV +    +Y + 
Sbjct: 137 PATCVGGSNDGVQNG----KASFKVKAIVDCPSFDYGCVANEIYHNGPVSSYAGDIYEEF 192

Query: 115 FSYKSGKYGNGPVVAN 130
           ++YKSG +   P VA 
Sbjct: 193 YAYKSGVFRESPSVAQ 208


>gi|328712827|ref|XP_003244913.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
           [Acyrthosiphon pisum]
          Length = 487

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 39/122 (31%), Positives = 55/122 (45%), Gaps = 36/122 (29%)

Query: 125 GPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSAS 184
           G V A M +  + F Y+SGVY  S  A             G + G               
Sbjct: 368 GSVQAMMKVSKEFFMYESGVYRCSNLA------------LGSKTG--------------- 400

Query: 185 AEIVAYATVKLIGWGEE--NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
                Y TV+++GWGEE  NGR   YW + +++G  +G+ G  +IL+G NE  IE  V  
Sbjct: 401 -----YHTVRIVGWGEEQQNGRTVKYWIVSNSWGLWWGESGYFRILKGTNECQIEDFVVA 455

Query: 241 AL 242
           A+
Sbjct: 456 AM 457


>gi|145500930|ref|XP_001436448.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124403587|emb|CAK69051.1| unnamed protein product [Paramecium tetraurelia]
          Length = 339

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 24/52 (46%), Positives = 28/52 (53%), Gaps = 1/52 (1%)

Query: 125 GPVVANMYLYSDIFSYKSGVYAV-SASAEIVAYATVKIVGWGEENGRPYWTI 175
           GP VA M +Y D   YK G+Y V            VKI+GWGE NG+ YW I
Sbjct: 255 GPAVAIMPVYKDFLIYKDGIYQVLDGQPHFHGGQAVKIIGWGEHNGQQYWII 306


>gi|294891623|ref|XP_002773656.1| hypothetical protein Pmar_PMAR011495 [Perkinsus marinus ATCC 50983]
 gi|239878860|gb|EER05472.1| hypothetical protein Pmar_PMAR011495 [Perkinsus marinus ATCC 50983]
          Length = 815

 Score = 50.4 bits (119), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 22/66 (33%), Positives = 40/66 (60%)

Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
           +Y  +A +  +    V++IG+G E   P+W +++++G+ +G+ G  ++LRGRN   IE L
Sbjct: 572 LYTTTAGSPEIGNHAVRIIGFGVEGNVPFWLLMNSWGDDWGEHGCFRMLRGRNLCGIEEL 631

Query: 238 VNGALP 243
             G  P
Sbjct: 632 PVGMDP 637


>gi|256082975|ref|XP_002577726.1| subfamily C1A unassigned peptidase (C01 family) [Schistosoma
           mansoni]
          Length = 1471

 Score = 50.4 bits (119), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 20/36 (55%), Positives = 30/36 (83%)

Query: 193 VKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 228
           V ++G+GEENGR YW I +++GE++G+KG IKI +G
Sbjct: 319 VLVVGYGEENGRSYWLIKNSWGEEWGEKGYIKISKG 354


>gi|257215762|emb|CAX83033.1| Cysteine PRotease related protein [Schistosoma japonicum]
          Length = 233

 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 25/64 (39%), Positives = 31/64 (48%), Gaps = 1/64 (1%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W +  KRG+VTGG+  ++TGCQP  FP C H       P C T     P+C
Sbjct: 159 CKGGFPGQAWDYWVKRGIVTGGSEENHTGCQPYPFPKCEHLT-KGKYPACGTKIYKTPQC 217

Query: 62  HTRC 65
              C
Sbjct: 218 KQTC 221


>gi|444707360|gb|ELW48642.1| Tubulointerstitial nephritis antigen-like protein [Tupaia
           chinensis]
          Length = 989

 Score = 50.4 bits (119), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 40/146 (27%), Positives = 60/146 (41%), Gaps = 39/146 (26%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C  G     W ++ +RG+V+         C P+S        +   E      A P P+C
Sbjct: 828 CRGGHLDGAWWFLRRRGVVS-------NHCYPLS-------GHVQGE------AGPAPRC 867

Query: 62  --HTRCTNDNYGRGFFQ-------------DKYRFKRYYWVNDEVADIQQEIMKNGPVVA 106
             H+R      GRG  Q             D Y+    Y +     +I +E+M+NGPV A
Sbjct: 868 MMHSRAV----GRGKRQATARCPSGHVHANDIYQVTPAYRLGSSEKEIMKELMENGPVQA 923

Query: 107 NMYLYSDIFSYKSGKYGNGPVVANMY 132
            M ++ D F Y+ G Y + P  AN +
Sbjct: 924 LMEVHEDFFLYRGGVYSHTPTAANSW 949


>gi|111054118|gb|ABH04250.1| cathepsin B precursor [Sus scrofa]
          Length = 61

 Score = 50.4 bits (119), Expect = 0.001,   Method: Composition-based stats.
 Identities = 20/53 (37%), Positives = 35/53 (66%)

Query: 184 SAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
           + +++    ++++GWG ENG PYW + +++   +GD G  KILRG++   IES
Sbjct: 7   TGDLMGGHAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIES 59


>gi|290975817|ref|XP_002670638.1| predicted protein [Naegleria gruberi]
 gi|284084199|gb|EFC37894.1| predicted protein [Naegleria gruberi]
          Length = 528

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 45/165 (27%), Positives = 69/165 (41%), Gaps = 47/165 (28%)

Query: 79  YRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIF 138
           YR+   Y+    V ++Q +++K GP+  +M +Y+D+F+Y SG Y +   V++  L S   
Sbjct: 408 YRYTGGYYGAVTVENMQLDVLKYGPLSVSMEVYNDLFNYHSGIYRH---VSSSKLTS--- 461

Query: 139 SYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGW 198
                   V    E+  +  V IVGWGE                                
Sbjct: 462 -------PVPNPFELTNHV-VLIVGWGE-------------------------------- 481

Query: 199 GEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
             E G  YW + +++G  FG  G   I RG +E  IES    A+P
Sbjct: 482 -NEKGEKYWIVKNSWGTSFGMDGYFLIARGVDECAIESENASAIP 525


>gi|161343859|tpg|DAA06110.1| TPA_inf: cathepsin B [Myzus persicae]
          Length = 260

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 33/110 (30%), Positives = 52/110 (47%), Gaps = 10/110 (9%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G     W +    GLVTGG + S  GC+P   PPC   +   +         P+ K 
Sbjct: 157 CNGGDPIKAWKYFSTHGLVTGGNYKSGEGCEPYRVPPCPRDDKGKN----TCAGKPREKN 212

Query: 62  HTRCTNDNYGRG--FFQDKYRFKR-YYWVNDEVADIQQEIMKNGPVVANM 108
           H RCT   YG     +++ +R+ R +Y++      IQ+++M  GP+ A  
Sbjct: 213 H-RCTRMCYGNQDLDYREDHRYTRDFYYLT--YGSIQKDVMTYGPIEATF 259


>gi|182509202|ref|NP_001116812.1| tubulointerstitial nephritis antigen precursor [Bombyx mori]
 gi|81303350|gb|ABB71105.1| TIN-ag-RP [Bombyx mori]
          Length = 404

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 29/120 (24%), Positives = 51/120 (42%), Gaps = 32/120 (26%)

Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
           +GP +  M +Y D F Y+ G+Y  +   + +                             
Sbjct: 314 SGPALGIMTVYQDFFHYREGIYRHTRHGDQL----------------------------- 344

Query: 184 SAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
              +    +V+++GWGE+    YW + +++G  +G+KG  +I RG +   IES V   LP
Sbjct: 345 ---MRGLHSVRIVGWGEDAEDKYWIVANSWGTSWGEKGYFRIARGHSGTGIESSVLTVLP 401


>gi|339235559|ref|XP_003379334.1| dipeptidyl-peptidase 1 [Trichinella spiralis]
 gi|316978005|gb|EFV61034.1| dipeptidyl-peptidase 1 [Trichinella spiralis]
          Length = 465

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 47/201 (23%), Positives = 73/201 (36%), Gaps = 52/201 (25%)

Query: 43  NYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNG 102
           +Y      C      Q +C T  T + Y   +  D      YY  ++E+  + Q ++KNG
Sbjct: 314 DYGMVSERCVAYTGKQQQCRTPSTCERY---YATDYEYIGGYYGASNEIL-MMQALVKNG 369

Query: 103 PVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIV 162
           P+     ++ D  SY  G                I+ Y S V  +  +  +     V IV
Sbjct: 370 PIAVGFEVHDDFLSYSHG----------------IYHYTSAVSPLKWNPFVEVNHAVIIV 413

Query: 163 GWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGT 222
           G+G +                                E     YW + +++G +FG+ G 
Sbjct: 414 GYGTD--------------------------------EMTKEKYWIVKNSWGRKFGEDGY 441

Query: 223 IKILRGRNEAIIESLVNGALP 243
            +I RG NE  IESL   A P
Sbjct: 442 FRIRRGTNECGIESLAFQATP 462


>gi|427783627|gb|JAA57265.1| hypothetical protein [Rhipicephalus pulchellus]
          Length = 483

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 49/196 (25%), Positives = 79/196 (40%), Gaps = 53/196 (27%)

Query: 44  YTTSEPECKTLATPQPKCHTRC-TNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNG 102
           ++++   C+      P    RC T     + F    YR       N+E  DI QEI  NG
Sbjct: 297 HSSANATCRIPRRRDPIEDARCPTGRTEQKHFSTPPYRVP----ANEE--DIMQEIYANG 350

Query: 103 PVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIV 162
           PV A + +  D F Y+SG          +Y ++ I       Y+ S       + +V+I+
Sbjct: 351 PVQALILVKEDFFLYRSG----------VYRHTRIAESLRPQYSRS------GWHSVRIL 394

Query: 163 GWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGT 222
           GWG +  +                   Y  +K           YW   +++G  +G+ G 
Sbjct: 395 GWGVDRSQ-------------------YRPIK-----------YWLCANSWGHGWGENGY 424

Query: 223 IKILRGRNEAIIESLV 238
            +I+RG +E+ IES V
Sbjct: 425 FRIVRGEDESQIESFV 440


>gi|66270083|gb|AAY43371.1| cathepsin-like cysteine protease [Phytophthora infestans]
          Length = 635

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 21/69 (30%), Positives = 43/69 (62%), Gaps = 1/69 (1%)

Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
           ++    +A  V +A + ++GWGEENG P+W + +++G  +G+ G ++++RG N   +E  
Sbjct: 237 IFDDKTNATDVDHA-ISIVGWGEENGVPFWVLRNSWGSFWGESGWMRLVRGVNNVGVEGE 295

Query: 238 VNGALPKDN 246
               +P+D+
Sbjct: 296 CAFGVPRDD 304


>gi|157058737|gb|ABV03126.1| cathepsin B-16 [Myzus persicae]
          Length = 238

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 33/110 (30%), Positives = 52/110 (47%), Gaps = 10/110 (9%)

Query: 2   CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
           C+ G     W +    GLVTGG + S  GC+P   PPC   +   +         P+ K 
Sbjct: 135 CNGGDPIKAWKYFSTHGLVTGGNYKSGEGCEPYRVPPCPRDDKGKN----TCAGKPREKN 190

Query: 62  HTRCTNDNYGRG--FFQDKYRFKR-YYWVNDEVADIQQEIMKNGPVVANM 108
           H RCT   YG     +++ +R+ R +Y++      IQ+++M  GP+ A  
Sbjct: 191 H-RCTRMCYGNQDLDYREDHRYTRDFYYLT--YGSIQKDVMTYGPIEATF 237


>gi|428180143|gb|EKX49011.1| cathepsin B-like cysteine protease [Guillardia theta CCMP2712]
          Length = 330

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 51/191 (26%), Positives = 68/191 (35%), Gaps = 61/191 (31%)

Query: 59  PKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
           P C   C +     G    KY+   YY +  E  DI +EI  NGPV A   +Y+   SYK
Sbjct: 193 PSCRISCVD-----GEPYKKYKASDYYQLTTE-EDIMKEIYLNGPVEAGFRVYTSFMSYK 246

Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
           SG Y           +  I     G +A            +KIVGWG E  + +W     
Sbjct: 247 SGVY-----------HHRILDIMEGGHA------------IKIVGWGVEPPKRFW----- 278

Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN-----EAI 233
                                 +    YW   +++   +G  G  KI RG+N     E  
Sbjct: 279 ----------------------QKPTKYWICANSWTADWGMNGFFKIRRGKNRFGQSECG 316

Query: 234 IESLVNGALPK 244
           IE  V    PK
Sbjct: 317 IEDQVFAGHPK 327


>gi|340508280|gb|EGR34021.1| papain family cysteine protease, putative [Ichthyophthirius
           multifiliis]
          Length = 620

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 23/55 (41%), Positives = 34/55 (61%)

Query: 193 VKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDNY 247
           V ++GWG ENG  YW + +++G  +G+KG  + LRG N   IE     A+PKD +
Sbjct: 226 VSIVGWGVENGVKYWIVRNSWGSYWGEKGFYRQLRGVNMINIEQFCYWAVPKDTW 280


>gi|145490612|ref|XP_001431306.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124398410|emb|CAK63908.1| unnamed protein product [Paramecium tetraurelia]
          Length = 490

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 33/120 (27%), Positives = 52/120 (43%), Gaps = 29/120 (24%)

Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
           NGPVV +     D   Y+SG+Y   A                + N    W  V       
Sbjct: 360 NGPVVLSFEPSYDFMYYESGIYHSKA----------------QTNDYAEWEKVD------ 397

Query: 184 SAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
                   +V   GWGEE+G  +W + +++G Q+G+ G  ++ RG +E+ IES+   + P
Sbjct: 398 -------HSVLCYGWGEEDGVKFWMLQNSWGNQWGEGGNFRMKRGVDESAIESMAEASDP 450


>gi|145509603|ref|XP_001440740.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124407968|emb|CAK73343.1| unnamed protein product [Paramecium tetraurelia]
          Length = 357

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 33/117 (28%), Positives = 55/117 (47%), Gaps = 23/117 (19%)

Query: 77  DKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSD 136
           +KY+ + Y  ++ E  +I++EI+ NGPVVA + ++ D   YK G Y              
Sbjct: 240 EKYKIQDYCVISSE-ENIKREILNNGPVVAVIQVFKDFLVYKGGIYE------------- 285

Query: 137 IFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATV 193
                     V  S++      VK++GWG+++G  YW I   +  S   + +AY  V
Sbjct: 286 ---------VVEGSSKFQYGHAVKVIGWGKQDGVNYWVIENSWGDSWGLKGLAYVAV 333



 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 31/113 (27%), Positives = 51/113 (45%), Gaps = 33/113 (29%)

Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
           NGPVVA + ++ D   YK G+Y V                                 V  
Sbjct: 263 NGPVVAVIQVFKDFLVYKGGIYEV---------------------------------VEG 289

Query: 184 SAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
           S++      VK+IGWG+++G  YW I +++G+ +G KG   +  G+N+  +E+
Sbjct: 290 SSKFQYGHAVKVIGWGKQDGVNYWVIENSWGDSWGLKGLAYVAVGQNQLQLEA 342


>gi|403339807|gb|EJY69164.1| Cathepsin B [Oxytricha trifallax]
          Length = 345

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 30/94 (31%), Positives = 44/94 (46%), Gaps = 16/94 (17%)

Query: 111 YSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGR 170
           Y DI   K   Y NGPV+    +Y D  SY +G+Y V+  +       V + GWG +NGR
Sbjct: 249 YEDI---KEEIYTNGPVMVGFVVYDDFSSYSTGIYEVTPDSVEEGGHAVTLNGWGYDNGR 305

Query: 171 PYWT-------------IVRVYAVSASAEIVAYA 191
            YW                R+YA  A  +++A++
Sbjct: 306 LYWIGQNQWQNTWGESGFFRIYAGEAGIDLMAFS 339


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.315    0.133    0.414 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,872,915,150
Number of Sequences: 23463169
Number of extensions: 218359601
Number of successful extensions: 488940
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 2423
Number of HSP's successfully gapped in prelim test: 394
Number of HSP's that attempted gapping in prelim test: 480824
Number of HSP's gapped (non-prelim): 5460
length of query: 280
length of database: 8,064,228,071
effective HSP length: 140
effective length of query: 140
effective length of database: 9,074,351,707
effective search space: 1270409238980
effective search space used: 1270409238980
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 76 (33.9 bits)