BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy15346
(280 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|256086863|ref|XP_002579605.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|353228447|emb|CCD74618.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 271
Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 77/242 (31%), Positives = 113/242 (46%), Gaps = 58/242 (23%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C GI W + G+VTGG++ ++TGCQP FP CNH + + S P C++ P P+C
Sbjct: 84 CRGGIPGMAWDYWKYEGIVTGGSNETHTGCQPYPFPECNHHSSSKSYPPCESYYFPTPEC 143
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
H C D+YG+ + +DK+ K Y V E I +EI+ NGPV Y+Y D +YKSG
Sbjct: 144 HETC-QDDYGKPYKKDKFYGKSSYNVASEEISIMKEILLNGPVEGGFYVYEDFLNYKSG- 201
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
++ + +G Y + ++I+GWG
Sbjct: 202 ---------------VYKHITGSY--------LGGHAIRIIGWGI--------------- 223
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
++N PYW +++ Q+GD+G KILRG NE IES+V
Sbjct: 224 ------------------QQNHIPYWLCANSWNNQWGDQGYFKILRGTNECGIESMVTAG 265
Query: 242 LP 243
LP
Sbjct: 266 LP 267
>gi|242001640|ref|XP_002435463.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
gi|215498799|gb|EEC08293.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
Length = 223
Score = 123 bits (309), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 77/243 (31%), Positives = 109/243 (44%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
CS G+S++ W + GLV+GG +++ GC+P S PC H++ S PEC P PKC
Sbjct: 40 CSGGVSAAAWQYWKDAGLVSGGLYNTTDGCKPYSLAPCEHSS-QGSLPEC-VGTLPTPKC 97
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C + Y R + DKY K Y +N I+ EI +NGPV A Y+D SYKSG
Sbjct: 98 KRQC-REGYERSYDDDKYFAKNVYSINGSEKQIRTEIFQNGPVEAEFTAYADFLSYKSGV 156
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + S +I+ ++I+GWG E+ P
Sbjct: 157 YQH------------------------HSRDIIGRHAIRILGWGSEDNNP---------- 182
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW + +++ E +GD G K+LRG NE IES VN
Sbjct: 183 ------------------------YWLLANSWNEDWGDHGYFKMLRGVNECDIESFVNAG 218
Query: 242 LPK 244
+PK
Sbjct: 219 IPK 221
>gi|240992699|ref|XP_002404474.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
gi|215491571|gb|EEC01212.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
Length = 337
Score = 120 bits (300), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 74/244 (30%), Positives = 115/244 (47%), Gaps = 61/244 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G ++ W + + GLVTGG + ++ GC+P S PC H + S P C T P PKC
Sbjct: 154 CNGGYPAAAWEYWKESGLVTGGLYGTSDGCKPYSLAPCEH-HTKGSLPNC-TGTVPTPKC 211
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C YG+ + DK+ ++ Y ++ + IQ EI KNGPV A+ +Y+D SYKSG
Sbjct: 212 VHLCRK-GYGKDYQDDKHFGRKVYSISSDEKQIQTEIFKNGPVEADFTVYADFLSYKSG- 269
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
++ ++SG +++ ++I+GWG ENG P
Sbjct: 270 ---------------VYQHQSG--------DVLGGHAIRILGWGTENGTP---------- 296
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW + +++ E +GD G KILRG++E IE +N
Sbjct: 297 ------------------------YWLVANSWNEDWGDHGYFKILRGKDECGIEDDINAG 332
Query: 242 LPKD 245
+PK+
Sbjct: 333 IPKN 336
>gi|156255405|gb|ABU62925.1| cathepsin B [Fasciola hepatica]
Length = 337
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 76/242 (31%), Positives = 108/242 (44%), Gaps = 59/242 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ GI + +W + + G+VTGG + TGC P FP C+H T P C P PKC
Sbjct: 155 CNGGIPAMSWDYWTREGVVTGGTLENPTGCLPYPFPKCSHGVVTPGLPPCPRDIYPTPKC 214
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C + Y + + QDK + K Y V + DI EIMKNGPV Y++ D YKSG
Sbjct: 215 EKKC-HAGYNKTYEQDKVKGKSSYNVGGQETDIMMEIMKNGPVDGIFYMFEDFLVYKSG- 272
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ Y +G +V ++++GWG ENG
Sbjct: 273 ---------------IYHYTTG--------RLVGGHAIRVIGWGVENG------------ 297
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
VK YW I +++ E +G+KG ++ RG NE IE+ +N
Sbjct: 298 -----------VK-----------YWLIANSWNEGWGEKGYFRMRRGNNECGIEARINAG 335
Query: 242 LP 243
LP
Sbjct: 336 LP 337
>gi|29374027|gb|AAO73004.1| cathepsin B [Fasciola gigantica]
Length = 337
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 75/242 (30%), Positives = 108/242 (44%), Gaps = 59/242 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ GI + +W + + G+VTGG + TGC P FP C+H T P C P PKC
Sbjct: 155 CNGGIPAMSWDYWTREGVVTGGTLENPTGCLPYPFPKCSHGVVTPGLPPCPRDIYPTPKC 214
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C + Y + + QDK + K Y V ++ D EIMKNGPV Y++ D YKSG
Sbjct: 215 EKKC-HAGYNKTYEQDKVKGKSSYNVGEQETDFMMEIMKNGPVDGIFYMFEDFLVYKSG- 272
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ Y +G +V ++++GWG ENG
Sbjct: 273 ---------------IYHYTTG--------RLVGGHAIRVIGWGVENG------------ 297
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
VK YW I +++ E +G+KG ++ RG NE IE+ +N
Sbjct: 298 -----------VK-----------YWLIANSWNEGWGEKGYFRMRRGNNECGIEARINAG 335
Query: 242 LP 243
LP
Sbjct: 336 LP 337
>gi|240992702|ref|XP_002404475.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
gi|215491572|gb|EEC01213.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
Length = 337
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 75/244 (30%), Positives = 110/244 (45%), Gaps = 61/244 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G ++ W + + GLVTGG + +N GC+P S PC H + S P C T P PKC
Sbjct: 154 CNGGTPAAAWEYWKESGLVTGGLYGTNDGCKPYSLAPCEH-HTKGSLPNC-TGTVPTPKC 211
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C YG+ + DK+ K+ Y ++ + IQ EI KNGPV A+ + +D SYKSG
Sbjct: 212 VHLCRK-GYGKDYQDDKHFGKKVYSISSDEKQIQTEIFKNGPVEADFIVLADFLSYKSGV 270
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + S +++ ++I+GWG ENG P
Sbjct: 271 YQH------------------------HSDDVIGGHAIRILGWGTENGTP---------- 296
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW +++ E +GD G KILRG++E IE +N
Sbjct: 297 ------------------------YWLAANSWNEDWGDHGYFKILRGKDECGIEEDINAG 332
Query: 242 LPKD 245
+PK+
Sbjct: 333 IPKN 336
>gi|56753605|gb|AAW25005.1| SJCHGC02852 protein [Schistosoma japonicum]
Length = 346
Score = 117 bits (294), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 72/243 (29%), Positives = 108/243 (44%), Gaps = 57/243 (23%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ GI W + G+VTGG++ ++TGCQP FP C H + + + C+ P+C
Sbjct: 161 CNGGIPGMAWDYWKDEGIVTGGSNETHTGCQPYPFPECIHHSTSINHSSCEVKYYSTPEC 220
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ C D Y + DKY K Y+V + I +EI+ NGPV A Y++ D +YK+G
Sbjct: 221 YQTCQPD-YAIQYENDKYYGKSSYYVTSDEVSIMKEILLNGPVEATFYVFDDFLNYKTG- 278
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
++ Y +G ++ ++I+GWG
Sbjct: 279 ---------------VYKYVTG--------SLLGGHAIRIIGWGVST------------- 302
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
N PYW +++ +Q+GDKG KILRG NE IES+V
Sbjct: 303 -------------------LNHTPYWLCANSWNKQWGDKGYFKILRGSNECGIESMVTAG 343
Query: 242 LPK 244
LPK
Sbjct: 344 LPK 346
>gi|126681075|gb|ABO26563.1| cathepsin B-like cysteine protease form 1 [Ixodes ricinus]
Length = 337
Score = 117 bits (292), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 74/244 (30%), Positives = 108/244 (44%), Gaps = 61/244 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G ++ W + + GLV+ G + + GC+P S PC H + S P C T P PKC
Sbjct: 154 CDGGYPAAAWEYWKESGLVSDGLYGTPDGCKPYSLAPCEH-HTKGSLPNC-TGTVPTPKC 211
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C YG+ + DK+ K+ Y ++ IQ EI KNGPV A+ +Y+D SYKSG
Sbjct: 212 VHLCRK-GYGKDYQHDKHFGKKVYSISSNEKQIQTEIFKNGPVEADFTVYADFLSYKSGV 270
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + S +++ ++I+GWG ENG P
Sbjct: 271 YQH------------------------HSGDVLGGHAIRILGWGTENGTP---------- 296
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW + +++ E +GD G KILRG++E IE +N
Sbjct: 297 ------------------------YWLVANSWNEDWGDHGYFKILRGKDECGIEDDINAG 332
Query: 242 LPKD 245
+PKD
Sbjct: 333 IPKD 336
>gi|187105118|ref|NP_001119619.1| cathepsin B-5880 precursor [Acyrthosiphon pisum]
gi|163300442|tpg|DAA06127.1| TPA_inf: cathepsin B transcript 5880 [Acyrthosiphon pisum]
gi|239790051|dbj|BAH71611.1| ACYPI000015 [Acyrthosiphon pisum]
gi|239790053|dbj|BAH71612.1| ACYPI000015 [Acyrthosiphon pisum]
Length = 302
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 73/242 (30%), Positives = 104/242 (42%), Gaps = 62/242 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
CS G +W + + GLV+GG + SN GCQP + PC H T E C P+C
Sbjct: 122 CSGGQHFESWDFYRRHGLVSGGEYGSNEGCQPYTIEPCQHTE-TAVENACSNKTLFTPEC 180
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C N +YG + +D ++ Y +EI +NGP+ A+ Y+Y D +Y+SG
Sbjct: 181 KVQCYNPDYGTRYVKDNHQGTHY---RVPAYTAMKEIYENGPITASFYMYQDFVNYQSG- 236
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+++Y SG Y + + VKI+GWGEENG P
Sbjct: 237 ---------------VYAYNSGKYVTTQA--------VKILGWGEENGTP---------- 263
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW ++F +GD G +KILRG NE IE +
Sbjct: 264 ------------------------YWLAANSFNTYWGDNGFVKILRGANECYIEEFMYAG 299
Query: 242 LP 243
LP
Sbjct: 300 LP 301
>gi|22535408|emb|CAC87118.1| cathepsin B-like protease [Nilaparvata lugens]
Length = 347
Score = 114 bits (284), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 78/244 (31%), Positives = 102/244 (41%), Gaps = 61/244 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLAT-PQPK 60
C G + WV++ + GLVTGG +HS+ GCQP PC H + S+P C T P P
Sbjct: 161 CEGGFPDAAWVFIKRHGLVTGGDYHSHDGCQPYPIAPCEH-HMEGSKPNCSASPTEPTPA 219
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C T CT+ + + +D+ + K Y V Q EI KNGP+V
Sbjct: 220 CETTCTHGS-SLAYQKDRQKGKSAYLVPVGEKQTQLEIFKNGPIV--------------- 263
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
A +Y D F YKSGVY + VK++GWGE+NG PYW +
Sbjct: 264 --------AAFKVYEDFFMYKSGVYKRHPESPFRGRHAVKVIGWGEQNGLPYWLV----- 310
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
+N Y +GDKG KI RG NE E +
Sbjct: 311 --------------------QNSWDY---------DWGDKGLFKIARG-NECDFEKSMTA 340
Query: 241 ALPK 244
LPK
Sbjct: 341 GLPK 344
>gi|196009263|ref|XP_002114497.1| expressed hypothetical protein [Trichoplax adhaerens]
gi|190583516|gb|EDV23587.1| expressed hypothetical protein [Trichoplax adhaerens]
Length = 333
Score = 113 bits (283), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 79/243 (32%), Positives = 105/243 (43%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G W + G+VTGG +HS+ GCQP P C H + K L P PKC
Sbjct: 151 CNGGFLPQAWHYWVNNGIVTGGQYHSHKGCQPYEIPKCEHHVKGPFKACGKEL--PTPKC 208
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C Y + F QDK+ K+ Y + + + IQ+EIM NGPV A +Y+
Sbjct: 209 SQKC-QPGYNKTFNQDKHFGKKSYSITNNIQQIQKEIMMNGPVEAAFTVYA--------- 258
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
D SYKSGVY + + +A VKI+GWG EN PYW I +
Sbjct: 259 --------------DFPSYKSGVYQHTTGGPLGGHA-VKILGWGTENNTPYWLIANSW-- 301
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
P W GDKG KI+RG++E IES +
Sbjct: 302 ----------------------NPTW----------GDKGYFKIIRGKDECGIESSIVAG 329
Query: 242 LPK 244
+PK
Sbjct: 330 MPK 332
>gi|346470617|gb|AEO35153.1| hypothetical protein [Amblyomma maculatum]
Length = 335
Score = 113 bits (283), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 74/244 (30%), Positives = 111/244 (45%), Gaps = 61/244 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G ++ W + G+VTGG + ++ GCQP FPPC H + P C T P P+C
Sbjct: 153 CNGGYPAAAWEFYKTDGIVTGGLYGTDDGCQPYYFPPCEH-HTVGPLPNC-TGIKPTPQC 210
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + + +DK+ K+ Y ++ + I+ EI KNGPV A
Sbjct: 211 VRDCRK-GYEKSYSEDKHYAKKVYTLSADETQIKTEIFKNGPVEA--------------- 254
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+ +Y+D SYKSGVY + + +A ++I+GWG ENG P
Sbjct: 255 --------DFTVYADFVSYKSGVYQRHSDDALGGHA-IRILGWGTENGVP---------- 295
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW + +++ E +GDKG KILRG +E IE +N
Sbjct: 296 ------------------------YWLVANSWNEDWGDKGYFKILRGNDECGIEDDINAG 331
Query: 242 LPKD 245
+PK+
Sbjct: 332 IPKE 335
>gi|341891084|gb|EGT47019.1| CBN-CPR-4 protein [Caenorhabditis brenneri]
Length = 335
Score = 113 bits (283), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 73/243 (30%), Positives = 101/243 (41%), Gaps = 58/243 (23%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W ++ K G TGG++ + GC+P S PC T+ P C T P C
Sbjct: 150 CEGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNTTWPACPTDGYDTPAC 209
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+CTN NY + DK+ Y V +VA IQ EI+ +GPV A +Y D + YKSG
Sbjct: 210 VNKCTNSNYNVAYKDDKHFGSTAYAVGKKVAQIQAEIIAHGPVEAAFTVYEDFYQYKSGV 269
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y V + E + ++I+GWG +NG PYW + + V
Sbjct: 270 Y------------------------VHTTGEELGGHAIRILGWGTDNGTPYWLVANSWNV 305
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+ WGE G +I+RG NE IE V G
Sbjct: 306 N---------------WGE-------------------NGYFRIIRGTNECGIEHAVVGG 331
Query: 242 LPK 244
+PK
Sbjct: 332 VPK 334
>gi|300122171|emb|CBK22745.2| unnamed protein product [Blastocystis hominis]
Length = 319
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 72/241 (29%), Positives = 105/241 (43%), Gaps = 60/241 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G S+ W ++ + G+VTGG ++S C+ FPPC+H P+C T PKC
Sbjct: 138 CQGGYSAMAWEYLRRTGVVTGGQYNSTEWCKSYPFPPCSHG-IEGQYPQCSTKPPVVPKC 196
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
T C + Y + +D+Y+F Y + + V I+ EIM+NGPV A+ +Y D +YKSG
Sbjct: 197 ETTC-QEGYPIEYEKDRYKFSNVYQLENNVDQIKNEIMENGPVDASFQVYEDFMTYKSGI 255
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + + TVKI+GWGEENG
Sbjct: 256 YHH------------------------VEGKFMNLHTVKIIGWGEENG------------ 279
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW V+++ ++G+ G +I G NE IES V G
Sbjct: 280 ----------------------EAYWKAVNSWNSEWGENGLFRIRLGTNECTIESQVEGG 317
Query: 242 L 242
L
Sbjct: 318 L 318
>gi|225713216|gb|ACO12454.1| Cathepsin B precursor [Lepeophtheirus salmonis]
gi|290561811|gb|ADD38303.1| Cathepsin B [Lepeophtheirus salmonis]
Length = 333
Score = 112 bits (281), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 74/244 (30%), Positives = 106/244 (43%), Gaps = 62/244 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPK 60
C+ G + W + K+GLV+GG + S+ GCQP + PC +HAN T P C PK
Sbjct: 151 CNGGFPGAAWSFWKKKGLVSGGLYGSHKGCQPYAIAPCEHHANGT--RPPCSG-GGRTPK 207
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
CHT C N++Y + +DK + Y V + IQ EIM NGPV A +YSD +YKSG
Sbjct: 208 CHTFCENEDYSLPYEKDKSFGRSSYSVKSDPKQIQLEIMNNGPVEAAFSVYSDFLNYKSG 267
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
Y + ++ ++I+GWG ENG P
Sbjct: 268 VYRH------------------------VKGSLLGGHAIRILGWGVENGTP--------- 294
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
YW + +++ +GD GT KIL+G + IE +
Sbjct: 295 -------------------------YWLVANSWNTDWGDNGTFKILKGSDHCGIEGSIVA 329
Query: 241 ALPK 244
LP+
Sbjct: 330 GLPQ 333
>gi|268558600|ref|XP_002637291.1| C. briggsae CBR-CPR-4 protein [Caenorhabditis briggsae]
Length = 335
Score = 112 bits (281), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 74/243 (30%), Positives = 104/243 (42%), Gaps = 58/243 (23%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W ++ K G TGG++ S GC+P S PC T+ P+C P C
Sbjct: 150 CEGGYPINAWKYLVKSGFCTGGSYVSQFGCKPYSLAPCGETVGNTTWPDCPQDGYNTPSC 209
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+CTN+NY + DK+ Y V +VA IQ EI+ +GPV A +Y D
Sbjct: 210 VNKCTNNNYNIAYKDDKHFGSTAYAVGKKVAQIQAEILAHGPVEAAFTVYED-------- 261
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+ YKSGVY + E+ +A ++I+GWG +NG PYW + + V
Sbjct: 262 ---------------FYQYKSGVYVHTTGQELGGHA-IRILGWGTDNGTPYWLVANSWNV 305
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+ WGE G +I+RG NE IE V G
Sbjct: 306 N---------------WGE-------------------NGYFRIIRGTNECGIEHAVVGG 331
Query: 242 LPK 244
+PK
Sbjct: 332 VPK 334
>gi|340501578|gb|EGR28345.1| hypothetical protein IMG5_177790 [Ichthyophthirius multifiliis]
Length = 356
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 77/244 (31%), Positives = 105/244 (43%), Gaps = 60/244 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + + K GLVTG N CQ SFPPC H +T P CK P P+C
Sbjct: 165 CNGGYPEAAMQYFVKTGLVTGDLFGDNNFCQAYSFPPCAHHVASTKYPPCKG-EVPTPEC 223
Query: 62 HTRCTNDN-YGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
+C +D+ R + +D Y+ ++ Y V+ + I EIM NGPV +Y D +YKSG
Sbjct: 224 KKKCDDDSKVKRPYNEDLYKGQKSYSVSSDPKAIMTEIMNNGPVEVAFTVYEDFVTYKSG 283
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
Y + + E + VK++GWG EN PY
Sbjct: 284 VYQH------------------------VTGEQLGGHAVKMIGWGVENDTPY-------- 311
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
W IV+++ E +GD+GT KILRG NE IE V
Sbjct: 312 --------------------------WLIVNSWNETWGDQGTFKILRGSNECGIEDEVVT 345
Query: 241 ALPK 244
ALP+
Sbjct: 346 ALPQ 349
>gi|341888137|gb|EGT44072.1| hypothetical protein CAEBREN_10156 [Caenorhabditis brenneri]
Length = 344
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 76/243 (31%), Positives = 99/243 (40%), Gaps = 59/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W W K GLVTGG++ S GC+P S PC + P+C P PKC
Sbjct: 154 CEGGYPIQAWKWWGKHGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWPKCPEDTEPTPKC 213
Query: 62 HTRCT-NDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
CT N Y + QDK+ Y V +V IQ EI+KNGP+ +Y
Sbjct: 214 VDACTSNHTYPTAYLQDKHFGATAYAVGKKVEQIQTEILKNGPIEVAFTVY--------- 264
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
D + Y +GVY +A A + +A VKI+GWG +NG PYW + +
Sbjct: 265 --------------EDFYQYTTGVYVHTAGASLGGHA-VKILGWGVDNGTPYWLVANSWN 309
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
I WGE KG +I+RG NE IE
Sbjct: 310 ---------------INWGE-------------------KGYFRIIRGLNECGIEHSAVA 335
Query: 241 ALP 243
+P
Sbjct: 336 GIP 338
>gi|341900876|gb|EGT56811.1| hypothetical protein CAEBREN_29569 [Caenorhabditis brenneri]
Length = 344
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 76/243 (31%), Positives = 99/243 (40%), Gaps = 59/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W W K GLVTGG++ S GC+P S PC + P+C P PKC
Sbjct: 154 CEGGYPIQAWKWWGKHGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWPKCPEDTEPTPKC 213
Query: 62 HTRCT-NDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
CT N Y + QDK+ Y V +V IQ EI+KNGP+ +Y
Sbjct: 214 VDACTSNHTYPTAYLQDKHFGATAYAVGKKVEQIQTEILKNGPIEVAFTVY--------- 264
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
D + Y +GVY +A A + +A VKI+GWG +NG PYW + +
Sbjct: 265 --------------EDFYQYTTGVYVHTAGASLGGHA-VKILGWGVDNGTPYWLVANSWN 309
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
I WGE KG +I+RG NE IE
Sbjct: 310 ---------------INWGE-------------------KGYFRIIRGLNECGIEHSAVA 335
Query: 241 ALP 243
+P
Sbjct: 336 GIP 338
>gi|55793941|gb|AAV65881.1| cathepsin B1 isotype 1 precursor [Trichobilharzia regenti]
Length = 342
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 78/242 (32%), Positives = 110/242 (45%), Gaps = 60/242 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W + + G+VTG + ++TGCQP FP C H + T PEC PKC
Sbjct: 159 CQGGFPGAAWDYWVEDGIVTGSSKENHTGCQPYPFPKCEH-HTTGKYPECGEKIYKTPKC 217
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
H +C Y + +DKY + Y V + I++EIM +GPV A ++SD +YKSG
Sbjct: 218 HQKC-QKGYKTPYKKDKYYGRMSYNVLNNENAIKKEIMMHGPVEAAFTVHSDFLNYKSG- 275
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ Y +G AEI +A V+I+GWG E PY
Sbjct: 276 ---------------IYKYMTG-------AEIGGHA-VRIIGWGVEKKTPY--------- 303
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
W I +++ E +G+KG +ILRG++E IES V G
Sbjct: 304 -------------------------WLIANSWNEDWGEKGYFRILRGKDECGIESEVTGG 338
Query: 242 LP 243
LP
Sbjct: 339 LP 340
>gi|55793945|gb|AAV65883.1| cathepsin B1 isotype 3 precursor [Trichobilharzia regenti]
Length = 342
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 78/242 (32%), Positives = 110/242 (45%), Gaps = 60/242 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W + + G+VTG + ++TGCQP FP C H + T PEC PKC
Sbjct: 159 CQGGFPGAAWDYWVEDGIVTGSSKENHTGCQPYPFPKCEH-HTTGKYPECGEKIYKTPKC 217
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
H +C Y + +DKY + Y V + I++EIM +GPV A ++SD +YKSG
Sbjct: 218 HQKC-QKGYKTPYKKDKYYGRMSYNVLNNENAIKKEIMMHGPVEAAFTVHSDFLNYKSG- 275
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ Y +G AEI +A V+I+GWG E PY
Sbjct: 276 ---------------IYKYMTG-------AEIGGHA-VRIIGWGVEKKTPY--------- 303
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
W I +++ E +G+KG +ILRG++E IES V G
Sbjct: 304 -------------------------WLIANSWNEDWGEKGYFRILRGKDECGIESEVTGG 338
Query: 242 LP 243
LP
Sbjct: 339 LP 340
>gi|55793943|gb|AAV65882.1| cathepsin B1 isotype 2 precursor [Trichobilharzia regenti]
Length = 342
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 78/242 (32%), Positives = 110/242 (45%), Gaps = 60/242 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W + + G+VTG + ++TGCQP FP C H + T PEC PKC
Sbjct: 159 CQGGFPGAAWDYWVEDGIVTGSSKENHTGCQPYPFPKCEH-HTTGKYPECGEKIYKTPKC 217
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
H +C Y + +DKY + Y V + I++EIM +GPV A ++SD +YKSG
Sbjct: 218 HQKC-QKGYKTPYGKDKYYGRMSYNVLNNENAIKKEIMMHGPVEAAFTVHSDFLNYKSG- 275
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ Y +G AEI +A V+I+GWG E PY
Sbjct: 276 ---------------IYKYMTG-------AEIGGHA-VRIIGWGVEKKTPY--------- 303
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
W I +++ E +G+KG +ILRG++E IES V G
Sbjct: 304 -------------------------WLIANSWNEDWGEKGYFRILRGKDECGIESEVTGG 338
Query: 242 LP 243
LP
Sbjct: 339 LP 340
>gi|194246059|gb|ACF35521.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
variabilis]
Length = 217
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 74/244 (30%), Positives = 111/244 (45%), Gaps = 61/244 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S+ W + G+VTGG + + GCQP FPPC H + P C T P P+C
Sbjct: 32 CNGGYPSAAWQFYKDEGIVTGGLYGTEDGCQPYYFPPCEH-HTVGPLPNC-TGIKPTPEC 89
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C + Y + + +DK+ K+ Y ++ + I+ EI KNGPV A+ +
Sbjct: 90 AKTC-REGYEKSYTRDKHFGKKVYSISSDETQIKTEICKNGPVEADFNV----------- 137
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y+D SYKSGVY S E++ ++I+GWG E+G P
Sbjct: 138 ------------YADFPSYKSGVYQ-RHSKEMLGGHAIRILGWGTEDGVP---------- 174
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW + +++ E +GDKG KI RG +E IE+ +N
Sbjct: 175 ------------------------YWLVANSWNEDWGDKGYFKIRRGNDECGIENDINAG 210
Query: 242 LPKD 245
+PK+
Sbjct: 211 IPKE 214
>gi|55793951|gb|AAV65886.1| cathepsin B1 isotype 6 precursor [Trichobilharzia regenti]
Length = 342
Score = 111 bits (277), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 78/244 (31%), Positives = 110/244 (45%), Gaps = 60/244 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G S W + + G+VTG + ++TGCQP FP C H N T P C PKC
Sbjct: 159 CLGGFPGSAWDYWVEEGVVTGSSGENHTGCQPYPFPKCEH-NTTGKYPACGQKIYETPKC 217
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C Y + +DK+ K Y V + I++EIM +GPV + +YSD +YKSG
Sbjct: 218 QKKC-QKGYKTPYKKDKHYGKVAYNVPNNEDSIKKEIMMHGPVGSFFTVYSDFLNYKSG- 275
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+Y + + GV+ TV+IVGWG E G PY
Sbjct: 276 -----------IYKHMKGTEIGVH------------TVRIVGWGVEKGTPY--------- 303
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
W I +++ E +G+KG +ILRG++E IESLV G
Sbjct: 304 -------------------------WLIANSWNEGWGEKGYFRILRGKDECDIESLVIGG 338
Query: 242 LPKD 245
LP++
Sbjct: 339 LPRN 342
>gi|308488550|ref|XP_003106469.1| hypothetical protein CRE_16049 [Caenorhabditis remanei]
gi|308253819|gb|EFO97771.1| hypothetical protein CRE_16049 [Caenorhabditis remanei]
Length = 205
Score = 111 bits (277), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 75/243 (30%), Positives = 101/243 (41%), Gaps = 59/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W W K GLVTGG++ S GC+P S PC + P+C P PKC
Sbjct: 14 CEGGYPIQAWKWWVKHGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWPKCPEDTEPTPKC 73
Query: 62 HTRCTNDN-YGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
CT++N Y G+ QDK+ Y V +V IQ EI+ +GP+ +
Sbjct: 74 VEACTSNNTYPTGYLQDKHFGATAYAVGKKVEQIQTEILAHGPIEVAFTV---------- 123
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
Y D + Y +GVY +A + +A VKI+GWG +NG PYW + +
Sbjct: 124 -------------YEDFYQYTTGVYVHTAGKSLGGHA-VKILGWGVDNGTPYWLVANSWN 169
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
V+ WGE KG +I+RG NE IE
Sbjct: 170 VN---------------WGE-------------------KGYFRIIRGLNECGIEHSAVA 195
Query: 241 ALP 243
LP
Sbjct: 196 GLP 198
>gi|268555788|ref|XP_002635883.1| C. briggsae CBR-CPR-5 protein [Caenorhabditis briggsae]
Length = 345
Score = 111 bits (277), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 76/243 (31%), Positives = 102/243 (41%), Gaps = 59/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W W K GLVTGG++ S GC+P S PC + P+C P PKC
Sbjct: 155 CEGGYPIQAWKWWVKHGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWPKCPDDTEPTPKC 214
Query: 62 HTRCTNDN-YGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
CT++N Y + QDK+ Y V +V IQ EI+KNGPV +Y
Sbjct: 215 VEACTSNNTYPTPYLQDKHFGATAYAVGKKVEQIQTEILKNGPVEVAFTVY--------- 265
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
D + Y +GVY ++ A + +A VKI+GWG +NG PYW + +
Sbjct: 266 --------------EDFYQYTTGVYVHTSGASLGGHA-VKILGWGVDNGTPYWLVANSWN 310
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
V+ WGE KG +I+RG NE IE
Sbjct: 311 VN---------------WGE-------------------KGYFRIIRGLNECGIEHSAVA 336
Query: 241 ALP 243
+P
Sbjct: 337 GIP 339
>gi|55793947|gb|AAV65884.1| cathepsin B1 isotype 4 precursor [Trichobilharzia regenti]
Length = 342
Score = 110 bits (276), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 76/244 (31%), Positives = 110/244 (45%), Gaps = 60/244 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W + + G+VTG + ++TGCQP FP C H + T PEC PKC
Sbjct: 159 CQGGFPGAAWDYWVEDGIVTGSSKENHTGCQPYPFPKCEH-HTTGKYPECGEKIYKTPKC 217
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
H +C Y + +DKY + Y V + I++EIM +GPV ++SD +YKSG
Sbjct: 218 HQKC-QKGYKTPYKKDKYYGRMSYNVLNNENAIKKEIMMHGPVEVAFTVHSDFLNYKSG- 275
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ Y +G AEI +A V+I+GWG E PY
Sbjct: 276 ---------------IYKYMTG-------AEIGEHA-VRIIGWGVEKKTPY--------- 303
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
W I +++ E +G+KG ++LRG++E IES V
Sbjct: 304 -------------------------WLIANSWNEDWGEKGYFRMLRGKDECGIESAVTSG 338
Query: 242 LPKD 245
LP+D
Sbjct: 339 LPRD 342
>gi|308511959|ref|XP_003118162.1| CRE-CPR-6 protein [Caenorhabditis remanei]
gi|308238808|gb|EFO82760.1| CRE-CPR-6 protein [Caenorhabditis remanei]
Length = 387
Score = 110 bits (275), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 73/245 (29%), Positives = 102/245 (41%), Gaps = 58/245 (23%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + K G+VTG + +N+GC+P FPPC H + T C P PKC
Sbjct: 174 CNGGDPLAAWRYWVKDGIVTGSNYTANSGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKC 233
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C D + + +DK+ Y V D+V IQ+E+M +GP+ +Y D +Y G
Sbjct: 234 EKKCIADYTDKTYSEDKFYGASAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGV 293
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y V ++ VK+VGWG ENG PYWT +
Sbjct: 294 Y------------------------VHTGGKLGGGHAVKLVGWGIENGIPYWTCANSWNT 329
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
WGE+ G +ILRG +E IES V G
Sbjct: 330 D---------------WGED-------------------GFFRILRGVDECGIESGVVGG 355
Query: 242 LPKDN 246
+PK N
Sbjct: 356 VPKLN 360
>gi|55793949|gb|AAV65885.1| cathepsin B1 isotype 5 precursor [Trichobilharzia regenti]
Length = 342
Score = 110 bits (275), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 72/244 (29%), Positives = 109/244 (44%), Gaps = 60/244 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + + G+VTGG+ ++TGCQP FP C H + PEC + +PKC
Sbjct: 159 CQMGFPGIAWDYWVQEGIVTGGSKENHTGCQPYPFPKCEH-HTKGRYPECGEIIYMKPKC 217
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
H +C Y + +DKY K Y + I++EIM +GPV A+ ++SD +YKSG
Sbjct: 218 HQKCQK-GYKTPYEKDKYYGKVSYNLLKNEDSIKKEIMMHGPVEASFRVHSDFLNYKSG- 275
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ + +G+ + V+I+GWG E PY
Sbjct: 276 ---------------IYKHMTGID--------IGSHVVRIIGWGVEKETPY--------- 303
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
W I +++ E +G+KG ++LRG++E IES V
Sbjct: 304 -------------------------WLIANSWNEDWGEKGYFRMLRGKDECGIESAVTSG 338
Query: 242 LPKD 245
LP+D
Sbjct: 339 LPRD 342
>gi|333408990|gb|AEF32260.1| cathepsin B [Cristaria plicata]
Length = 347
Score = 110 bits (274), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 79/243 (32%), Positives = 102/243 (41%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W + + GLVTGG ++S+ GCQP P C+H +P C PKC
Sbjct: 164 CQGGFPAEAWRYYEREGLVTGGLYNSSQGCQPYMIPACDHHVVGHLQP-CPKEEAKTPKC 222
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C NY + DK+ K Y V D V I EIM NGPV A +Y D
Sbjct: 223 SKKC-EANYNVTYKDDKHYGKNSYSV-DSVEKIMTEIMTNGPVEAAFTVYED-------- 272
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
SYKSGVY E+ +A VKI+GWGE+NG PYW + +
Sbjct: 273 ---------------FLSYKSGVYQHRTGQELGGHA-VKILGWGEDNGTPYWIVANSW-- 314
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
P W G++G ILRG++E IES +
Sbjct: 315 ----------------------NPDW----------GNQGFFNILRGKDECGIESQIVAG 342
Query: 242 LPK 244
LPK
Sbjct: 343 LPK 345
>gi|194246067|gb|ACF35525.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
variabilis]
Length = 192
Score = 110 bits (274), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 73/244 (29%), Positives = 109/244 (44%), Gaps = 61/244 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S+ W + +VTGG + + GCQP FPPC H + P C T P P+C
Sbjct: 7 CNGGYPSAAWQFYKDEDIVTGGLYGTEDGCQPYYFPPCEH-HTVGPLPNC-TGIKPTPEC 64
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C + Y + + +DK+ K+ Y ++ + I+ EI KNGPV
Sbjct: 65 AKTC-REGYQKSYTRDKHFGKKVYSISSDETQIKTEIYKNGPVE---------------- 107
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
A+ +Y+D SYKSGVY S E++ ++I+GWG E+G P
Sbjct: 108 -------ADFSVYADFPSYKSGVYQ-RHSEEMLGGHAIRILGWGTEDGVP---------- 149
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW + +++ E +GDKG KI RG +E IE +N
Sbjct: 150 ------------------------YWLVANSWNEDWGDKGYFKIRRGNDECGIEDDINAG 185
Query: 242 LPKD 245
+PK+
Sbjct: 186 IPKE 189
>gi|25146613|ref|NP_741818.1| Protein CPR-6, isoform a [Caenorhabditis elegans]
gi|1169087|sp|P43510.1|CPR6_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 6; AltName:
Full=Cysteine protease-related 6; Flags: Precursor
gi|671715|gb|AAA98787.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|695294|gb|AAA98789.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|351058213|emb|CCD65628.1| Protein CPR-6, isoform a [Caenorhabditis elegans]
Length = 379
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 70/245 (28%), Positives = 103/245 (42%), Gaps = 58/245 (23%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + K G+VTG + +N GC+P FPPC H + T C P PKC
Sbjct: 174 CNGGDPLAAWRYWVKDGIVTGSNYTANNGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKC 233
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C +D + + +DK+ Y V D+V IQ+E+M +GP+ +Y D +Y G
Sbjct: 234 EKKCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGV 293
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y V ++ VK++GWG ++G PYWT+ +
Sbjct: 294 Y------------------------VHTGGKLGGGHAVKLIGWGIDDGIPYWTVANSWNT 329
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
WGE+ G +ILRG +E IES V G
Sbjct: 330 D---------------WGED-------------------GFFRILRGVDECGIESGVVGG 355
Query: 242 LPKDN 246
+PK N
Sbjct: 356 IPKLN 360
>gi|71984043|ref|NP_001024426.1| Protein CPR-6, isoform b [Caenorhabditis elegans]
gi|351058214|emb|CCD65629.1| Protein CPR-6, isoform b [Caenorhabditis elegans]
Length = 378
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 70/245 (28%), Positives = 103/245 (42%), Gaps = 58/245 (23%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + K G+VTG + +N GC+P FPPC H + T C P PKC
Sbjct: 173 CNGGDPLAAWRYWVKDGIVTGSNYTANNGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKC 232
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C +D + + +DK+ Y V D+V IQ+E+M +GP+ +Y D +Y G
Sbjct: 233 EKKCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGV 292
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y V ++ VK++GWG ++G PYWT+ +
Sbjct: 293 Y------------------------VHTGGKLGGGHAVKLIGWGIDDGIPYWTVANSWNT 328
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
WGE+ G +ILRG +E IES V G
Sbjct: 329 D---------------WGED-------------------GFFRILRGVDECGIESGVVGG 354
Query: 242 LPKDN 246
+PK N
Sbjct: 355 IPKLN 359
>gi|193209594|ref|NP_001123113.1| Protein CPR-6, isoform c [Caenorhabditis elegans]
gi|351058222|emb|CCD65637.1| Protein CPR-6, isoform c [Caenorhabditis elegans]
Length = 369
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 70/245 (28%), Positives = 103/245 (42%), Gaps = 58/245 (23%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + K G+VTG + +N GC+P FPPC H + T C P PKC
Sbjct: 164 CNGGDPLAAWRYWVKDGIVTGSNYTANNGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKC 223
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C +D + + +DK+ Y V D+V IQ+E+M +GP+ +Y D +Y G
Sbjct: 224 EKKCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGV 283
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y V ++ VK++GWG ++G PYWT+ +
Sbjct: 284 Y------------------------VHTGGKLGGGHAVKLIGWGIDDGIPYWTVANSWNT 319
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
WGE+ G +ILRG +E IES V G
Sbjct: 320 D---------------WGED-------------------GFFRILRGVDECGIESGVVGG 345
Query: 242 LPKDN 246
+PK N
Sbjct: 346 IPKLN 350
>gi|341887135|gb|EGT43070.1| hypothetical protein CAEBREN_13756 [Caenorhabditis brenneri]
Length = 398
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 71/245 (28%), Positives = 102/245 (41%), Gaps = 58/245 (23%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + K G+VTG +N+GC+P FPPC H + T C P PKC
Sbjct: 189 CNGGDPLAAWRYWVKDGIVTGSNFTANSGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKC 248
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
RC + + + +DK+ Y V D+V IQ+E+M +GP+ +Y D +Y G
Sbjct: 249 EKRCNAEYTDKTYSEDKFYGSSAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGV 308
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y V ++ VK++GWG E+G PYWT+ +
Sbjct: 309 Y------------------------VHTGGKLGGGHAVKLIGWGIEDGIPYWTVANSWNT 344
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
WGE+ G +ILRG +E IES V G
Sbjct: 345 D---------------WGED-------------------GFFRILRGVDECGIESGVVGG 370
Query: 242 LPKDN 246
+PK N
Sbjct: 371 IPKLN 375
>gi|341888136|gb|EGT44071.1| hypothetical protein CAEBREN_13576 [Caenorhabditis brenneri]
Length = 337
Score = 109 bits (272), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 80/244 (32%), Positives = 103/244 (42%), Gaps = 61/244 (25%)
Query: 2 CSSGISSSTW-VWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 60
C G W WVH GLVTGG++ S GC+P S PC + P+C P+
Sbjct: 147 CEGGYPIQAWRYWVHN-GLVTGGSYESQYGCKPYSIAPCGQTVNGVTWPKCAADEVATPE 205
Query: 61 CHTRCTN-DNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
C +CT+ +Y + QDK+ Y + VA IQ EIM+NGPV +YSD
Sbjct: 206 CVKQCTSKSDYAVPYDQDKHYGSSAYAIRQNVAQIQTEIMRNGPVEVGFLVYSD------ 259
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
+ YKSG+Y A E+ +A VKI+GWG ENG PYW +
Sbjct: 260 -----------------FYQYKSGIYKHVAGRELGGHA-VKILGWGVENGTPYWLAANSW 301
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
V+ WGE KG +I RG NE IES V
Sbjct: 302 NVN---------------WGE-------------------KGYFRIRRGTNECGIESSVV 327
Query: 240 GALP 243
+P
Sbjct: 328 AGIP 331
>gi|17565164|ref|NP_503383.1| Protein CPR-5 [Caenorhabditis elegans]
gi|1169086|sp|P43509.1|CPR5_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 5; AltName:
Full=Cysteine protease-related 5; Flags: Precursor
gi|671713|gb|AAA98786.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|675502|gb|AAA98784.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|351059399|emb|CCD74289.1| Protein CPR-5 [Caenorhabditis elegans]
Length = 344
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 74/243 (30%), Positives = 99/243 (40%), Gaps = 59/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W W K GLVTGG++ + GC+P S PC P C P PKC
Sbjct: 154 CEGGYPIQAWKWWVKHGLVTGGSYETQFGCKPYSIAPCGETVNGVKWPACPEDTEPTPKC 213
Query: 62 HTRCTN-DNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
CT+ +NY + QDK+ Y V +V IQ EI+ NGP+ +Y
Sbjct: 214 VDSCTSKNNYATPYLQDKHFGSTAYAVGKKVEQIQTEILTNGPIEVAFTVY--------- 264
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
D + Y +GVY +A A + +A VKI+GWG +NG PYW + +
Sbjct: 265 --------------EDFYQYTTGVYVHTAGASLGGHA-VKILGWGVDNGTPYWLVANSWN 309
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
V+ WGE KG +I+RG NE IE
Sbjct: 310 VA---------------WGE-------------------KGYFRIIRGLNECGIEHSAVA 335
Query: 241 ALP 243
+P
Sbjct: 336 GIP 338
>gi|268579855|ref|XP_002644910.1| C. briggsae CBR-CPR-6 protein [Caenorhabditis briggsae]
Length = 376
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 71/245 (28%), Positives = 102/245 (41%), Gaps = 58/245 (23%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + K G+VTG + +N+GC+P FPPC H + T C P PKC
Sbjct: 175 CNGGDPLAAWRYWVKDGIVTGSNYTANSGCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKC 234
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C D + + +DK+ Y V D+V IQ+E+M +GP+ +Y D +Y G
Sbjct: 235 EKKCIADYTDKTYSEDKFYGHSAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFLNYDGGV 294
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y V ++ VK++GWG E+G PYWT +
Sbjct: 295 Y------------------------VHTGGKLGGGHAVKLIGWGIEDGIPYWTCANSWNT 330
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
WGE+ G +ILRG +E IES V G
Sbjct: 331 D---------------WGED-------------------GFFRILRGVDECGIESGVVGG 356
Query: 242 LPKDN 246
+PK N
Sbjct: 357 IPKLN 361
>gi|156365510|ref|XP_001626688.1| predicted protein [Nematostella vectensis]
gi|156213574|gb|EDO34588.1| predicted protein [Nematostella vectensis]
Length = 259
Score = 108 bits (270), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 76/243 (31%), Positives = 108/243 (44%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W +GLVTGG + S+ GCQP C+H +P CK +P PKC
Sbjct: 73 CNGGYPESAWDHWKSKGLVTGGQYDSHKGCQPYKIAACDHHVVGKLKP-CKG-DSPTPKC 130
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C Y + DK+ + Y V + A+IQ+EIM NGPV +Y
Sbjct: 131 ERKCEA-GYNVSYSDDKHFGQSAYSVRSDPAEIQKEIMTNGPVEGAFTVY---------- 179
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+D +YKSGVY ++ + + +A +KI+GWGEENG P
Sbjct: 180 -------------ADFPTYKSGVYQHTSGSALGGHA-IKILGWGEENGTP---------- 215
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW + +++ +GD+G KI RG +E IES + G
Sbjct: 216 ------------------------YWLVANSWNSDWGDEGFFKIKRGNDECGIESGIVGG 251
Query: 242 LPK 244
LPK
Sbjct: 252 LPK 254
>gi|427787723|gb|JAA59313.1| Putative cathepsin b-like cysteine protease form 2 [Rhipicephalus
pulchellus]
Length = 338
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 74/244 (30%), Positives = 113/244 (46%), Gaps = 60/244 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G+ S W++ ++G+VTGG + + GCQP S + P L +P P C
Sbjct: 152 CKGGVPSYAWMFYKEKGIVTGGLYGTEDGCQPYSIHTTRYTTTGLLPPPINDL-SPMPPC 210
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C +YG+ + +DK+ ++ Y ++ + A I+ EI KNGPV A+ +Y+D +
Sbjct: 211 KREC-RKSYGKKYSEDKHYGEKVYTLSGDEAQIKTEIFKNGPVEADFAVYADFY------ 263
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
SYKSGVY + ++A ++I+GWG ENG PYW +
Sbjct: 264 -----------------SYKSGVYQAHSRVRCGSHA-IRILGWGTENGVPYW-------L 298
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+A++ WT E +GDKG KI RG NE IE +N
Sbjct: 299 AANS---------------------WT------EHWGDKGYFKIRRGNNECGIEEDINAG 331
Query: 242 LPKD 245
+PK+
Sbjct: 332 IPKE 335
>gi|195729973|gb|ACG50797.1| cathepsin B2 [Trichobilharzia szidati]
Length = 344
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 73/243 (30%), Positives = 104/243 (42%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W + + G+VTG +++ GCQP FPPC H + P C PKC
Sbjct: 161 CNGGFPHSAWSYWKRSGIVTGDLYNTTDGCQPYEFPPCEH-HVVGPRPSCGG-DVETPKC 218
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
T C Y + +DK+ K Y V+ I +E+M +GPV + +Y+D +YKSG
Sbjct: 219 KTTC-QPGYNIPYNKDKWYGKTVYRVHSNQEAIMKEVMDHGPVEVDFEVYADFPNYKSGV 277
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + S ++ V+++GWGEENG PY
Sbjct: 278 YQH------------------------VSGGLLGGHAVRLLGWGEENGVPY--------- 304
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
W I +++ +GD G KI+RGRNE IES VN
Sbjct: 305 -------------------------WLIANSWNSDWGDNGYFKIIRGRNECGIESDVNAG 339
Query: 242 LPK 244
+PK
Sbjct: 340 IPK 342
>gi|308500570|ref|XP_003112470.1| CRE-CPR-4 protein [Caenorhabditis remanei]
gi|308267038|gb|EFP10991.1| CRE-CPR-4 protein [Caenorhabditis remanei]
Length = 335
Score = 108 bits (269), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 71/243 (29%), Positives = 101/243 (41%), Gaps = 58/243 (23%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W ++ K G TGG++ + GC+P S PC + P+C P C
Sbjct: 150 CDGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNVTWPDCPDDGYNTPAC 209
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+CTN Y + DK+ Y V +VA IQ EI+ +GPV A +Y D
Sbjct: 210 VNKCTNTKYNTAYKDDKHFGSTAYAVGKKVAQIQAEIIAHGPVEAAFTVYED-------- 261
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+ YKSGVY + E+ +A ++I+GWG +NG PYW + + V
Sbjct: 262 ---------------FYQYKSGVYVHTTGQELGGHA-IRILGWGTDNGTPYWLVANSWNV 305
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+ WGE G +I+RG NE IE V G
Sbjct: 306 N---------------WGE-------------------NGYFRIIRGTNECGIEHAVVGG 331
Query: 242 LPK 244
+PK
Sbjct: 332 VPK 334
>gi|51038793|gb|AAT94175.1| cathepsin B [Paralichthys olivaceus]
gi|121053785|gb|ABM47001.1| cathepsin B [Paralichthys olivaceus]
Length = 330
Score = 108 bits (269), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 74/243 (30%), Positives = 101/243 (41%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G SS W + K GLV+GG ++S+ GC+P + PC H + S P C P+C
Sbjct: 148 CNGGYPSSAWDFWTKEGLVSGGLYNSHIGCRPYTISPCEH-HVNGSRPPCTGEGGDTPEC 206
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+RC Y + QDK+ K Y V V IQ EI KNGPV +Y D YKSG
Sbjct: 207 ISRC-EAGYSPSYKQDKHYGKSSYSVEGSVEQIQAEISKNGPVEGAFTVYEDFVMYKSGV 265
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + S ++ +K++GWGEE+G P
Sbjct: 266 YQH------------------------VSGSVLGGHAIKVLGWGEEDGIP---------- 291
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW +++ +GD G KILRG N IES +
Sbjct: 292 ------------------------YWLCANSWNTDWGDNGFFKILRGSNHCGIESEIVAG 327
Query: 242 LPK 244
+PK
Sbjct: 328 IPK 330
>gi|225708580|gb|ACO10136.1| Cathepsin B precursor [Osmerus mordax]
Length = 329
Score = 107 bits (268), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 78/243 (32%), Positives = 107/243 (44%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G S+ W + K GLVTGG + SN GC+P S PPC H + + P C+ PKC
Sbjct: 147 CFGGYPSAAWEYWAKSGLVTGGLYGSNKGCRPYSIPPCEH-HVNGTRPPCQGEGD-TPKC 204
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
T+C D Y + +DKY K+ Y V + I E+ KNGPV A +Y D YKSG
Sbjct: 205 QTKCI-DGYTPAYEKDKYFGKKTYSVPSKQEQIMTELYKNGPVEAAFSVYEDFLLYKSGV 263
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y +L D+ G +A +KI+GWG+EN P
Sbjct: 264 Y--------QHLTGDML----GGHA------------IKILGWGKENNTP---------- 289
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW +++ +G++G KILRG +E IES V
Sbjct: 290 ------------------------YWLAANSWNTDWGNQGFFKILRGGDECGIESEVVAG 325
Query: 242 LPK 244
+P+
Sbjct: 326 IPQ 328
>gi|256077361|ref|XP_002574974.1| SmCB2 peptidase (C01 family) [Schistosoma mansoni]
gi|18181863|emb|CAC85211.2| cathepsin B endopeptidase [Schistosoma mansoni]
gi|353231645|emb|CCD79000.1| SmCB2 peptidase (C01 family) [Schistosoma mansoni]
Length = 347
Score = 107 bits (267), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 70/243 (28%), Positives = 105/243 (43%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W++ +G+VTG +++ GCQP FPPC H + P C P C
Sbjct: 163 CNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPYEFPPCEH-HVIGPLPSCDG-DVETPSC 220
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
T C Y + +DK+ ++ Y ++ I E+M+NGPV + +Y+D +YKSG
Sbjct: 221 KTNC-QPGYNIPYEKDKWYGEKVYRIHSNPEAIMLELMRNGPVEVDFEVYADFPNYKSGV 279
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + S ++ V+++GWGEEN PY
Sbjct: 280 YQH------------------------VSGALLGGHAVRLLGWGEENNVPY--------- 306
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
W I +++ +GDKG KI+RG+NE IES VN
Sbjct: 307 -------------------------WLIANSWNSDWGDKGYFKIVRGKNECGIESDVNAG 341
Query: 242 LPK 244
+PK
Sbjct: 342 IPK 344
>gi|17559068|ref|NP_504682.1| Protein CPR-4 [Caenorhabditis elegans]
gi|1169085|sp|P43508.1|CPR4_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 4; AltName:
Full=Cysteine protease-related 4; Flags: Precursor
gi|675500|gb|AAA98785.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|695293|gb|AAA98783.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|351063163|emb|CCD71204.1| Protein CPR-4 [Caenorhabditis elegans]
Length = 335
Score = 107 bits (267), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 70/243 (28%), Positives = 101/243 (41%), Gaps = 58/243 (23%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W ++ K G TGG++ + GC+P S PC + P C P C
Sbjct: 150 CEGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNVTWPSCPDDGYDTPAC 209
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+CTN NY + DK+ Y V +V+ IQ EI+ +GPV A +Y D
Sbjct: 210 VNKCTNKNYNVAYTADKHFGSTAYAVGKKVSQIQAEIIAHGPVEAAFTVYED-------- 261
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+ YK+GVY + E+ +A ++I+GWG +NG PYW + + V
Sbjct: 262 ---------------FYQYKTGVYVHTTGQELGGHA-IRILGWGTDNGTPYWLVANSWNV 305
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+ WGE G +I+RG NE IE V G
Sbjct: 306 N---------------WGE-------------------NGYFRIIRGTNECGIEHAVVGG 331
Query: 242 LPK 244
+PK
Sbjct: 332 VPK 334
>gi|417399216|gb|JAA46636.1| Putative cathepsin b [Desmodus rotundus]
Length = 340
Score = 107 bits (266), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 76/242 (31%), Positives = 103/242 (42%), Gaps = 60/242 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W + K+GLV+GG + S+ GC+P S PPC H + S P C PKC
Sbjct: 150 CNGGFPSGAWNFWKKQGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPCSGEGGDTPKC 208
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + +DK+ Y V + +I EI KNGPV A +YSD YKSG
Sbjct: 209 SKIC-EPGYSPSYKEDKHFGCDTYSVPSDEKEIMVEIYKNGPVEAAFSVYSDFLLYKSGV 267
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + E+V V+I+GWG ENG PYW
Sbjct: 268 YQH------------------------VTGEMVGGHAVRILGWGVENGTPYW-------- 295
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
L+G +++ +GD G KILRGR+ IES +
Sbjct: 296 -------------LVG-------------NSWNTDWGDNGFFKILRGRDHCGIESEIVAG 329
Query: 242 LP 243
+P
Sbjct: 330 IP 331
>gi|107921798|gb|ABF85680.1| cathepsin B3 [Fasciola hepatica]
Length = 278
Score = 107 bits (266), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 60/172 (34%), Positives = 84/172 (48%), Gaps = 25/172 (14%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ GI + +W + + G+VTGG + TGC P FP C+H T P C P PKC
Sbjct: 132 CNGGIPAMSWDYWTREGVVTGGTLENPTGCLPYPFPKCSHGVVTPGLPPCPRDIYPTPKC 191
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C + Y + + QDK + K Y V ++ DI EIMKNGPV Y++ D YKSG
Sbjct: 192 EKKC-HAGYNKTYEQDKVKGKSSYNVGEQETDIMMEIMKNGPVDGIFYMFEDFLVYKSG- 249
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYW 173
I+ Y +G +V ++++GWG ENG YW
Sbjct: 250 ---------------IYHYTTG--------RLVGGHAIRVIGWGVENGVNYW 278
>gi|268555790|ref|XP_002635884.1| Hypothetical protein CBG01104 [Caenorhabditis briggsae]
Length = 337
Score = 107 bits (266), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 77/243 (31%), Positives = 108/243 (44%), Gaps = 59/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + K+GLV+GG++ S GC+P S PC + P+C P+C
Sbjct: 147 CDGGYPLQAWRYWVKQGLVSGGSYESQYGCKPYSIAPCGQTVNGVTWPKCPAQEEATPEC 206
Query: 62 HTRCTN-DNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
+ CT+ +Y + +DK+ Y V + A IQ EI+++GPV A +YSD + YKSG
Sbjct: 207 ASHCTSKSSYSVAYEKDKHYGLSAYPVGRKEAQIQTEILQHGPVEAGFLVYSDFYRYKSG 266
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
I+++ SG E+ +A VKI+GWG ENG YW + +
Sbjct: 267 ----------------IYTHVSG-------QELGGHA-VKILGWGVENGTKYWLVANSWN 302
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
I WGE KG +ILRGRNE IES V
Sbjct: 303 ---------------INWGE-------------------KGYFRILRGRNECGIESAVVA 328
Query: 241 ALP 243
+P
Sbjct: 329 GIP 331
>gi|56756436|gb|AAW26391.1| unknown [Schistosoma japonicum]
Length = 342
Score = 107 bits (266), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 76/244 (31%), Positives = 105/244 (43%), Gaps = 62/244 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + KRG+VTGG+ ++TGCQP FP C H P C T P+C
Sbjct: 159 CKGGFPGQAWDYWVKRGIVTGGSEENHTGCQPYPFPKCEHLT-KGKYPACGTKIYKTPQC 217
Query: 62 HTRCTNDNYGRGFFQDK-YRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C Y + QDK Y +RY +++E A IQ+EIM GPV A +Y D +YKSG
Sbjct: 218 KQTCQK-GYKTPYEQDKHYGDQRYNVISNEKA-IQREIMMYGPVEAAFDVYEDFLNYKSG 275
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
Y + + IV ++I+GWG E G+PY
Sbjct: 276 IYRH------------------------VTGSIVGGHAIRIIGWGVEKGKPY-------- 303
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
W I +++ E +G+KG +++RGR+E IES V
Sbjct: 304 --------------------------WLIANSWNEDWGEKGLFRMVRGRDECSIESHVVA 337
Query: 241 ALPK 244
L K
Sbjct: 338 GLIK 341
>gi|344195776|gb|AEM98130.1| cathepsin B [Cynoglossus semilaevis]
Length = 332
Score = 106 bits (265), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 75/243 (30%), Positives = 103/243 (42%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S+ W + GLV+GG + S+ GC+P S PC H + S P C P+C
Sbjct: 148 CNGGYPSAAWEFWTTDGLVSGGLYDSHIGCRPYSIAPCEH-HVNGSRPPCTGEGGDTPQC 206
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C Y G+ QDK+ K Y V+D +IQ EI KNGPV +Y D YK+G
Sbjct: 207 TKKC-EAGYTPGYTQDKHYGKLSYSVDDSEKEIQLEIYKNGPVEGAFTVYEDFLLYKTGV 265
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + V+ SA V +K++GWGEENG P
Sbjct: 266 YQH----------------------VTGSA--VGGHAIKVLGWGEENGTP---------- 291
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW +++ +GD G KILRG + IES +
Sbjct: 292 ------------------------YWLCANSWNTDWGDNGFFKILRGSDHCGIESEIVAG 327
Query: 242 LPK 244
+PK
Sbjct: 328 IPK 330
>gi|49036808|gb|AAT48985.1| cathepsin B-like proteinase [Triatoma vitticeps]
Length = 332
Score = 106 bits (265), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 69/243 (28%), Positives = 107/243 (44%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + + W + HK G+V+GG + S GCQP S PC H+ S P C+ + PKC
Sbjct: 150 CLGGSAENAWEYWHKFGIVSGGNYGSKQGCQPYSIAPCEHS-IPGSRPACEGVRD-TPKC 207
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C YG + D + Y + ++ IQ EI+KNGP+VA++ +Y D+FSYK+G
Sbjct: 208 KKQCEK-GYGIPYGDDLCYGQPGYTIENDAQKIQAEILKNGPIVASILVYEDLFSYKAGV 266
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + E++ +KI+GWG EN P
Sbjct: 267 YQH------------------------VAGEVLGGHVIKILGWGVENDTP---------- 292
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW + +++ +G+ G KILRG +E IE +
Sbjct: 293 ------------------------YWLVANSWNTDWGNNGFFKILRGSDECGIEDQIVAG 328
Query: 242 LPK 244
+P+
Sbjct: 329 IPR 331
>gi|355697726|gb|EHH28274.1| Cathepsin B [Macaca mulatta]
Length = 339
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 74/245 (30%), Positives = 107/245 (43%), Gaps = 61/245 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W ++ ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGYPAGAWNFLTRKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ Y V++ DI EI KNGPV FS
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPV-------EGAFS----- 254
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+YSD YKSGVY
Sbjct: 255 -----------VYSDFLLYKSGVYQ----------------------------------- 268
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+ E++ ++++GWG ENG PYW + +++ +GD G KILRG++ IES V
Sbjct: 269 HVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 328
Query: 242 LPKDN 246
+P+ +
Sbjct: 329 IPRTD 333
>gi|201023319|ref|NP_001128401.1| cathepsin B-10270 precursor [Acyrthosiphon pisum]
gi|239788119|dbj|BAH70754.1| ACYPI000021 [Acyrthosiphon pisum]
Length = 341
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 73/244 (29%), Positives = 100/244 (40%), Gaps = 62/244 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPE--CKTLATPQP 59
C+ G S + W + KRGLVTGG + SN GCQP PPCNH P C + P
Sbjct: 156 CNGGYSGAAWQYWMKRGLVTGGDYGSNEGCQPWLIPPCNHTVMDERSPSYMCGKYKSETP 215
Query: 60 KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
+C C N NY + F +D + R W + I+ E+ K+GP A M +Y D +YKS
Sbjct: 216 QCTLNCYNPNYSKPFLKDISKGIRIDWHCSGM--IRNELKKHGPATAIMRVYEDFLTYKS 273
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
G Y + + +++ TVK++GWG VY
Sbjct: 274 GIYQH------------------------VTGKLLGQITVKVIGWG------------VY 297
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
G YW +++G +GDKG KI RG NE + E
Sbjct: 298 ----------------------RGVQYWLAANSWGTSWGDKGFFKIRRGYNECLFEDYFI 335
Query: 240 GALP 243
P
Sbjct: 336 SGRP 339
>gi|181178|gb|AAA52125.1| lysosomal proteinase cathepsin B, partial [Homo sapiens]
Length = 209
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 74/245 (30%), Positives = 106/245 (43%), Gaps = 61/245 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 20 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 77
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ Y V++ DI EI KNGPV FS
Sbjct: 78 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPV-------EGAFS----- 124
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+YSD YKSGVY
Sbjct: 125 -----------VYSDFLLYKSGVYQ----------------------------------- 138
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+ E++ ++++GWG ENG PYW + +++ +GD G KILRG++ IES V
Sbjct: 139 HVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 198
Query: 242 LPKDN 246
+P+ +
Sbjct: 199 IPRTD 203
>gi|225717770|gb|ACO14731.1| Cathepsin B precursor [Caligus clemensi]
Length = 331
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 70/243 (28%), Positives = 99/243 (40%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + +GLV+GG + S+ GCQP PC H T +P + TP KC
Sbjct: 149 CNGGFPGAAWRFWENKGLVSGGLYGSHKGCQPYLIEPCEHHVNGTRKPCAEGGRTP--KC 206
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
H C N NY + +D + Y + + IQ +IM NGPV A +YSD SYKSG
Sbjct: 207 HKTCDNKNYPISYEKDLSFGRSSYSIRSDPKQIQMDIMTNGPVEAAFSVYSDFMSYKSGV 266
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + ++ ++I+GWG E G P
Sbjct: 267 YRH------------------------VKGSLLGGHAIRILGWGMEKGTP---------- 292
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW + +++ +GD GT KILRG + IE V
Sbjct: 293 ------------------------YWLVANSWNTDWGDNGTFKILRGSDHCGIEDSVVAG 328
Query: 242 LPK 244
LP+
Sbjct: 329 LPR 331
>gi|193783549|dbj|BAG53460.1| unnamed protein product [Homo sapiens]
Length = 276
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 74/245 (30%), Positives = 106/245 (43%), Gaps = 61/245 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 87 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 144
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ Y V++ DI EI KNGPV FS
Sbjct: 145 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPV-------EGAFS----- 191
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+YSD YKSGVY
Sbjct: 192 -----------VYSDFLLYKSGVYQ----------------------------------- 205
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+ E++ ++++GWG ENG PYW + +++ +GD G KILRG++ IES V
Sbjct: 206 HVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 265
Query: 242 LPKDN 246
+P+ +
Sbjct: 266 IPRTD 270
>gi|154089579|gb|ABS57370.1| cathepsin B2 [Trichobilharzia regenti]
Length = 344
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 72/243 (29%), Positives = 104/243 (42%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W + + G+VTG ++ GCQP FPPC H + P C+ PKC
Sbjct: 161 CNGGFPHSAWSYWKRSGIVTGDLYNPTDGCQPYEFPPCEH-HVVGPRPSCEG-DVETPKC 218
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
T C Y + +DK+ K Y V+ I +E+ ++GPV + +Y+D +YKSG
Sbjct: 219 KTTC-QPGYNIPYNKDKWYGKTVYRVHSNQEAIMKEVKEHGPVEVDFEVYADFPNYKSGV 277
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + S ++ V+++GWGEENG PY
Sbjct: 278 YQH------------------------VSGGLLGGHAVRLLGWGEENGVPY--------- 304
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
W I +++ +GD G KI+RGRNE IES VN
Sbjct: 305 -------------------------WLIANSWNSDWGDNGYFKIIRGRNECGIESDVNAG 339
Query: 242 LPK 244
+PK
Sbjct: 340 IPK 342
>gi|393909827|gb|EJD75608.1| cysteine endopeptidase [Loa loa]
Length = 383
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 74/243 (30%), Positives = 103/243 (42%), Gaps = 59/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W + RG+VTG + +++GC+P FPPC H N T CK P PKC
Sbjct: 192 CFGGEPMAAWKYWVLRGIVTGSEYTNHSGCRPYPFPPCEHHNNKTHYEPCKHDLYPTPKC 251
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C + NYG+ + DKY ++ Y V V IQ+EIM GPV A+ +Y+D Y G
Sbjct: 252 VKKC-DKNYGKSYKADKYYGEQVYNVESNVESIQKEIMTLGPVEASFEVYTDFLYYTGG- 309
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+Y + G +A VK++GWG + G PYW +
Sbjct: 310 -----------IYKHVAGSMGGGHA------------VKVLGWGIDQGVPYWLAANSWNT 346
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
WGE+ G +ILRG NE IES +
Sbjct: 347 D---------------WGED-------------------GYFRILRGVNECGIESGIIAG 372
Query: 242 LPK 244
+PK
Sbjct: 373 IPK 375
>gi|194387364|dbj|BAG60046.1| unnamed protein product [Homo sapiens]
Length = 245
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 74/245 (30%), Positives = 106/245 (43%), Gaps = 61/245 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 56 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 113
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ Y V++ DI EI KNGPV FS
Sbjct: 114 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPV-------EGAFS----- 160
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+YSD YKSGVY
Sbjct: 161 -----------VYSDFLLYKSGVYQ----------------------------------- 174
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+ E++ ++++GWG ENG PYW + +++ +GD G KILRG++ IES V
Sbjct: 175 HVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 234
Query: 242 LPKDN 246
+P+ +
Sbjct: 235 IPRTD 239
>gi|34979797|gb|AAQ83887.1| cathepsin B [Branchiostoma belcheri tsingtauense]
Length = 332
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 77/243 (31%), Positives = 106/243 (43%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + + GLVTGG + S+ GCQP PC H + S P C L P P+C
Sbjct: 149 CNGGFPEAAWEYWKRDGLVTGGPYGSHQGCQPYEIKPCEH-HINGSRPACGKL-EPTPRC 206
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C + Y F +DK+ K Y V+ +V IQ EIM NGPV A +Y+D F +
Sbjct: 207 KKSCES-GYNVTFAKDKHYAKTAYSVSSKVQQIQMEIMTNGPVEAAFTVYAD-FPH---- 260
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
YKSGVY + AE+ +A VK++GWG E PY
Sbjct: 261 ------------------YKSGVYQHESGAELGGHA-VKMIGWGTEGSTPY--------- 292
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
W I +++ +G+ G KILRG++E IE +
Sbjct: 293 -------------------------WLIANSWNTDWGNMGFFKILRGQDECGIERDIVAG 327
Query: 242 LPK 244
PK
Sbjct: 328 EPK 330
>gi|312091331|ref|XP_003146940.1| cathepsin B [Loa loa]
Length = 249
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 74/243 (30%), Positives = 102/243 (41%), Gaps = 59/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W + RG+VTG + +++GC+P FPPC H N T CK P PKC
Sbjct: 58 CFGGEPMAAWKYWVLRGIVTGSEYTNHSGCRPYPFPPCEHHNNKTHYEPCKHDLYPTPKC 117
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C + NYG+ + DKY + Y V V IQ+EIM GPV A+ +Y+D Y G
Sbjct: 118 VKKC-DKNYGKSYKADKYYGQSVYNVESNVESIQKEIMTLGPVEASFEVYTDFLYYTGG- 175
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+Y + G +A VK++GWG + G PYW +
Sbjct: 176 -----------IYKHVAGSMGGGHA------------VKVLGWGIDQGVPYWLAANSWNT 212
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
WGE+ G +ILRG NE IES +
Sbjct: 213 D---------------WGED-------------------GYFRILRGVNECGIESGIIAG 238
Query: 242 LPK 244
+PK
Sbjct: 239 IPK 241
>gi|24158605|pdb|1GMY|A Chain A, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
gi|24158606|pdb|1GMY|B Chain B, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
gi|24158607|pdb|1GMY|C Chain C, Cathepsin B Complexed With Dipeptidyl Nitrile Inhibitor
Length = 261
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 74/245 (30%), Positives = 106/245 (43%), Gaps = 61/245 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 72 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 129
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ Y V++ DI EI KNGPV FS
Sbjct: 130 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPV-------EGAFS----- 176
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+YSD YKSGVY
Sbjct: 177 -----------VYSDFLLYKSGVYQ----------------------------------- 190
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+ E++ ++++GWG ENG PYW + +++ +GD G KILRG++ IES V
Sbjct: 191 HVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 250
Query: 242 LPKDN 246
+P+ +
Sbjct: 251 IPRTD 255
>gi|346472613|gb|AEO36151.1| hypothetical protein [Amblyomma maculatum]
Length = 373
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 76/244 (31%), Positives = 101/244 (41%), Gaps = 62/244 (25%)
Query: 2 CSSGISSSTWV-WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 60
C+ G S W WVHK G+VTGG + S+ GC P C+H T P C P P+
Sbjct: 190 CNGGFPGSAWSYWVHK-GIVTGGNYDSDEGCMPYPIKACDHHVNGTLGP-CDKTIPPTPR 247
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C R Y F DK+ + Y V + IQ EIM NGPV A+ +
Sbjct: 248 C-VRMCRKGYDVDFMDDKHYGRHAYSVPAKAKQIQAEIMMNGPVEADFTV---------- 296
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
Y D YKSGVY + + +A ++++GWG ENG P
Sbjct: 297 -------------YEDFLHYKSGVYQRHTDSALGGHA-IRLLGWGVENGVP--------- 333
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
YW +++ ++GDKG KILRG +E IES +
Sbjct: 334 -------------------------YWLAANSWNTEWGDKGFFKILRGSDECGIESDIVA 368
Query: 241 ALPK 244
LPK
Sbjct: 369 GLPK 372
>gi|410916585|ref|XP_003971767.1| PREDICTED: cathepsin B-like [Takifugu rubripes]
Length = 328
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 77/242 (31%), Positives = 101/242 (41%), Gaps = 61/242 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
CS G SS W + K+GLVTGG S GC+P S PC H T P T T PKC
Sbjct: 146 CSGGYPSSAWEFWTKKGLVTGGLCGSEVGCRPYSIAPCEHHVNGTRPPCQGTQET--PKC 203
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C D Y + +DK+ KR Y + + I E+ KNGPV A +Y+D YK+G
Sbjct: 204 EKKCI-DGYLTSYLKDKHFGKRSYSLPSQQEQIMTELYKNGPVEAAFTVYADFLLYKTGV 262
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + E++ +KI+GWGEE+G PYW +
Sbjct: 263 YQH------------------------VTGEVLGGHAIKILGWGEESGTPYWLAANSW-- 296
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
NG +GDKG KI RG +E IES +
Sbjct: 297 --------------------NG------------DWGDKGFFKIKRGNDECGIESEMVAG 324
Query: 242 LP 243
P
Sbjct: 325 TP 326
>gi|1777779|gb|AAB40605.1| cathepsin B-like cysteine proteinase [Ascaris suum]
gi|324515014|gb|ADY46062.1| Cathepsin B cysteine proteinase 6 [Ascaris suum]
Length = 398
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 71/245 (28%), Positives = 97/245 (39%), Gaps = 58/245 (23%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W + K G+VTG GC+P FPPC H + T CK P PKC
Sbjct: 190 CDGGDPMAAWKYWVKEGIVTGSNFTMKQGCKPYPFPPCEHHSNKTHYQPCKHDLYPTPKC 249
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C + + + +DK+ + Y V D+V IQ+EI+ +GPV +Y D Y G
Sbjct: 250 EKKCLDIYTEKTYAEDKFFGETAYGVEDDVTSIQKEILTHGPVEVAFEVYEDFLMYDGGI 309
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y V +I VK++GWG E G PYW + +
Sbjct: 310 Y------------------------VHTGGKIGGGHAVKMLGWGVEQGVPYWLVANSWNT 345
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
WGE+ G +I+RG +E IES V G
Sbjct: 346 D---------------WGED-------------------GFFRIIRGIDECGIESSVVGG 371
Query: 242 LPKDN 246
LPK N
Sbjct: 372 LPKLN 376
>gi|225711544|gb|ACO11618.1| Cathepsin B precursor [Caligus rogercresseyi]
Length = 332
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 70/243 (28%), Positives = 101/243 (41%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + +GLV+GG + S++GCQP PC H T +P + TP KC
Sbjct: 150 CNGGFPGAAWKYWTSKGLVSGGLYGSHSGCQPYDIEPCEHHVNGTRQPCAEGGRTP--KC 207
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
H C N+NY + +D + Y + + IQ EIM NGPV A +YSD + KSG
Sbjct: 208 HRTCENENYSVPYDKDLSFGRSSYSIRSDPKQIQLEIMDNGPVEAAFSVYSDFMNDKSGV 267
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + ++ ++I+GWG E G P
Sbjct: 268 YRH------------------------VKGSLLGGHAIRILGWGVEKGTP---------- 293
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW + +++ +GDKGT KILRG + IE V
Sbjct: 294 ------------------------YWLVANSWNTDWGDKGTFKILRGSDHCGIEGSVVTG 329
Query: 242 LPK 244
LP+
Sbjct: 330 LPR 332
>gi|181192|gb|AAA52129.1| preprocathepsin B [Homo sapiens]
gi|193787271|dbj|BAG52477.1| unnamed protein product [Homo sapiens]
Length = 339
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 74/245 (30%), Positives = 106/245 (43%), Gaps = 61/245 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ Y V++ DI EI KNGPV FS
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPV-------EGAFS----- 254
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+YSD YKSGVY
Sbjct: 255 -----------VYSDFLLYKSGVYQ----------------------------------- 268
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+ E++ ++++GWG ENG PYW + +++ +GD G KILRG++ IES V
Sbjct: 269 HVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 328
Query: 242 LPKDN 246
+P+ +
Sbjct: 329 IPRTD 333
>gi|91078958|ref|XP_974220.1| PREDICTED: similar to cathepsin b [Tribolium castaneum]
gi|270004841|gb|EFA01289.1| cathepsin B precursor [Tribolium castaneum]
Length = 334
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 69/245 (28%), Positives = 100/245 (40%), Gaps = 60/245 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + +G+V+GG+ SN GC+P PC H + + P C P C
Sbjct: 149 CNGGFPGAAWHYWVNKGIVSGGSFGSNQGCRPYEIAPCEH-HVNGTRPPCTGDDNKTPSC 207
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C Y + +DK K Y ++ EV IQ+EIM NGPV +Y D+ SYK G
Sbjct: 208 KQQCEK-GYNVPYKKDKNFGKEAYSISSEVQQIQKEIMTNGPVEGAFEVYEDLLSYKKGV 266
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + E + ++I+GWG E G PY
Sbjct: 267 YQH------------------------VKGEALGGHAIRILGWGTEKGTPY--------- 293
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
W I +++ +GD GT KILRG + IES +
Sbjct: 294 -------------------------WLIANSWNSDWGDNGTFKILRGEDHCGIESSIVAG 328
Query: 242 LPKDN 246
+PKD+
Sbjct: 329 IPKDS 333
>gi|4503139|ref|NP_001899.1| cathepsin B preproprotein [Homo sapiens]
gi|22538431|ref|NP_680090.1| cathepsin B preproprotein [Homo sapiens]
gi|22538433|ref|NP_680091.1| cathepsin B preproprotein [Homo sapiens]
gi|22538435|ref|NP_680092.1| cathepsin B preproprotein [Homo sapiens]
gi|22538437|ref|NP_680093.1| cathepsin B preproprotein [Homo sapiens]
gi|68067549|sp|P07858.3|CATB_HUMAN RecName: Full=Cathepsin B; AltName: Full=APP secretase; Short=APPS;
AltName: Full=Cathepsin B1; Contains: RecName:
Full=Cathepsin B light chain; Contains: RecName:
Full=Cathepsin B heavy chain; Flags: Precursor
gi|291888|gb|AAC37547.1| cathepsin B [Homo sapiens]
gi|63102437|gb|AAH95408.1| Cathepsin B [Homo sapiens]
gi|119586034|gb|EAW65630.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586036|gb|EAW65632.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586037|gb|EAW65633.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586038|gb|EAW65634.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586039|gb|EAW65635.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|119586040|gb|EAW65636.1| cathepsin B, isoform CRA_a [Homo sapiens]
gi|168277954|dbj|BAG10955.1| cathepsin B precursor [synthetic construct]
gi|193786804|dbj|BAG52127.1| unnamed protein product [Homo sapiens]
Length = 339
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 74/245 (30%), Positives = 106/245 (43%), Gaps = 61/245 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ Y V++ DI EI KNGPV FS
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPV-------EGAFS----- 254
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+YSD YKSGVY
Sbjct: 255 -----------VYSDFLLYKSGVYQ----------------------------------- 268
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+ E++ ++++GWG ENG PYW + +++ +GD G KILRG++ IES V
Sbjct: 269 HVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 328
Query: 242 LPKDN 246
+P+ +
Sbjct: 329 IPRTD 333
>gi|426358853|ref|XP_004046705.1| PREDICTED: cathepsin B isoform 1 [Gorilla gorilla gorilla]
gi|426358855|ref|XP_004046706.1| PREDICTED: cathepsin B isoform 2 [Gorilla gorilla gorilla]
gi|426358857|ref|XP_004046707.1| PREDICTED: cathepsin B isoform 3 [Gorilla gorilla gorilla]
Length = 339
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 74/245 (30%), Positives = 106/245 (43%), Gaps = 61/245 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ Y V++ DI EI KNGPV FS
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPV-------EGAFS----- 254
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+YSD YKSGVY
Sbjct: 255 -----------VYSDFLLYKSGVYQ----------------------------------- 268
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+ E++ ++++GWG ENG PYW + +++ +GD G KILRG++ IES V
Sbjct: 269 HVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 328
Query: 242 LPKDN 246
+P+ +
Sbjct: 329 IPRTD 333
>gi|30583753|gb|AAP36125.1| Homo sapiens cathepsin B [synthetic construct]
gi|61370555|gb|AAX43516.1| cathepsin B [synthetic construct]
Length = 340
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 74/245 (30%), Positives = 106/245 (43%), Gaps = 61/245 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ Y V++ DI EI KNGPV FS
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPV-------EGAFS----- 254
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+YSD YKSGVY
Sbjct: 255 -----------VYSDFLLYKSGVYQ----------------------------------- 268
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+ E++ ++++GWG ENG PYW + +++ +GD G KILRG++ IES V
Sbjct: 269 HVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 328
Query: 242 LPKDN 246
+P+ +
Sbjct: 329 IPRTD 333
>gi|118365170|ref|XP_001015806.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89297573|gb|EAR95561.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 340
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 68/242 (28%), Positives = 108/242 (44%), Gaps = 59/242 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G S+ W ++ ++G+ TGG + +T C+P FPPC+H +P C + P P+C
Sbjct: 158 CKGGYPSAAWGYMKRQGVSTGGLYGDDTSCKPYIFPPCDHHVTGQYQP-CGPI-QPTPQC 215
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C ++ + +D + + Y + V IQ+EIM +GPV A+ + +D +YKSG
Sbjct: 216 VKECNSEYTQNTYEKDLHFASQTYSIKQNVQAIQREIMAHGPVQASFKVAADFLTYKSGV 275
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y P + Y+ G +VKI+GWG+E PY
Sbjct: 276 YIRNPKL----------KYEGG-------------HSVKIIGWGKEGNTPY--------- 303
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
W I +++ E +G+KG ++LRGRNE IE+ +
Sbjct: 304 -------------------------WLIANSWNEDWGEKGLFRMLRGRNECGIEAQIVAG 338
Query: 242 LP 243
LP
Sbjct: 339 LP 340
>gi|324507953|gb|ADY43363.1| Cathepsin B cysteine proteinase 6 [Ascaris suum]
Length = 352
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 71/245 (28%), Positives = 97/245 (39%), Gaps = 58/245 (23%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W + K G+VTG GC+P FPPC H + T CK P PKC
Sbjct: 149 CDGGDPMAAWKYWVKEGIVTGSNFTMKQGCKPYPFPPCEHHSNKTHYQPCKHDLYPTPKC 208
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C + + + +DK+ + Y V D+V IQ+EI+ +GPV +Y D Y G
Sbjct: 209 EKKCLDIYTEKTYAEDKFFGETAYGVEDDVTSIQKEILTHGPVEVAFEVYEDFLMYDGGI 268
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y V +I VK++GWG E G PYW + +
Sbjct: 269 Y------------------------VHTGGKIGGGHAVKMLGWGVEQGVPYWLVANSWNT 304
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
WGE+ G +I+RG +E IES V G
Sbjct: 305 D---------------WGED-------------------GFFRIIRGIDECGIESSVVGG 330
Query: 242 LPKDN 246
LPK N
Sbjct: 331 LPKLN 335
>gi|332862712|ref|XP_003317964.1| PREDICTED: cathepsin B isoform 1 [Pan troglodytes]
gi|332862714|ref|XP_003317965.1| PREDICTED: cathepsin B isoform 2 [Pan troglodytes]
gi|332862716|ref|XP_003317966.1| PREDICTED: cathepsin B isoform 3 [Pan troglodytes]
gi|332862718|ref|XP_519607.3| PREDICTED: cathepsin B isoform 5 [Pan troglodytes]
gi|410057614|ref|XP_003954244.1| PREDICTED: cathepsin B [Pan troglodytes]
gi|410262606|gb|JAA19269.1| cathepsin B [Pan troglodytes]
gi|410262608|gb|JAA19270.1| cathepsin B [Pan troglodytes]
gi|410359820|gb|JAA44654.1| cathepsin B [Pan troglodytes]
gi|410359822|gb|JAA44655.1| cathepsin B [Pan troglodytes]
gi|410359824|gb|JAA44656.1| cathepsin B [Pan troglodytes]
gi|410359826|gb|JAA44657.1| cathepsin B [Pan troglodytes]
gi|410359828|gb|JAA44658.1| cathepsin B [Pan troglodytes]
Length = 339
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 74/245 (30%), Positives = 106/245 (43%), Gaps = 61/245 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ Y V++ DI EI KNGPV FS
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPV-------EGAFS----- 254
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+YSD YKSGVY
Sbjct: 255 -----------VYSDFLLYKSGVYQ----------------------------------- 268
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+ E++ ++++GWG ENG PYW + +++ +GD G KILRG++ IES V
Sbjct: 269 HVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 328
Query: 242 LPKDN 246
+P+ +
Sbjct: 329 IPRTD 333
>gi|226821413|gb|ACO82382.1| cathepsin B [Lutjanus argentimaculatus]
Length = 330
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 75/243 (30%), Positives = 104/243 (42%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S+ W + K GLV+GG + S+ GC+P + PPC H + S P C P+C
Sbjct: 148 CNGGYPSAAWDFWTKEGLVSGGLYDSHVGCRPYTIPPCEH-HVNGSRPPCTGEGGDTPQC 206
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
++C Y + +DK+ K Y V + A+IQ EI KNGPV +Y D YKSG
Sbjct: 207 LSQC-EAGYTPSYREDKHYGKTSYSVLSDEAEIQYEIYKNGPVEGAFTVYEDFVLYKSGV 265
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + VS SA V +K++GWGEENG P
Sbjct: 266 YQH----------------------VSGSA--VGGHAIKVLGWGEENGVP---------- 291
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW +++ +GD G K LRG + IES +
Sbjct: 292 ------------------------YWLCANSWNTDWGDNGFFKFLRGSDHCGIESEIVAG 327
Query: 242 LPK 244
+PK
Sbjct: 328 IPK 330
>gi|87246247|gb|ABD35300.1| cathepsin B-like cysteine protease [Triatoma infestans]
Length = 333
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 72/243 (29%), Positives = 106/243 (43%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G S W + HK G+V+GG + S GCQP S PC H+ + +S P C + T PKC
Sbjct: 151 CLGGSPESAWEYWHKFGIVSGGNYGSKQGCQPYSIAPCEHSIHGSS-PACGGV-TDTPKC 208
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C Y + + Y + Y + ++ IQ EI+KNGP+VA+ +Y D+FSYK G
Sbjct: 209 KKQCEK-GYSIPYDKAFYYGQPGYAIPNDAQKIQAEILKNGPIVASFLVYEDLFSYKEGV 267
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + E + +KI GWG ENG P
Sbjct: 268 YQH------------------------VAGEFLGGHVIKIFGWGIENGTP---------- 293
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW + +++ +G+ G KI RG++E IE V+
Sbjct: 294 ------------------------YWLVANSWNTDWGNNGFFKIPRGKDECGIEIDVSAG 329
Query: 242 LPK 244
LP+
Sbjct: 330 LPR 332
>gi|16307393|gb|AAH10240.1| Cathepsin B [Homo sapiens]
Length = 339
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 74/245 (30%), Positives = 106/245 (43%), Gaps = 61/245 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ Y V++ DI EI KNGPV FS
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPV-------EGAFS----- 254
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+YSD YKSGVY
Sbjct: 255 -----------VYSDFLLYKSGVYQ----------------------------------- 268
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+ E++ ++++GWG ENG PYW + +++ +GD G KILRG++ IES V
Sbjct: 269 HVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 328
Query: 242 LPKDN 246
+P+ +
Sbjct: 329 IPRTD 333
>gi|397467300|ref|XP_003805362.1| PREDICTED: cathepsin B [Pan paniscus]
Length = 339
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 74/245 (30%), Positives = 106/245 (43%), Gaps = 61/245 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ Y V++ DI EI KNGPV FS
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPV-------EGAFS----- 254
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+YSD YKSGVY
Sbjct: 255 -----------VYSDFLLYKSGVYQ----------------------------------- 268
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+ E++ ++++GWG ENG PYW + +++ +GD G KILRG++ IES V
Sbjct: 269 HVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 328
Query: 242 LPKDN 246
+P+ +
Sbjct: 329 IPRTD 333
>gi|344281458|ref|XP_003412496.1| PREDICTED: cathepsin B-like [Loxodonta africana]
Length = 340
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 72/245 (29%), Positives = 103/245 (42%), Gaps = 60/245 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + K+GLV+GG + S+ GC+P S PPC H + S P CK PKC
Sbjct: 150 CNGGFPAGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPCKGEGGETPKC 208
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + +DK+ Y V +I EI KNGPV FS
Sbjct: 209 SKTC-EPGYSPSYKEDKHYGYSSYGVPSSEQEIMAEIYKNGPV-------EGAFS----- 255
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+Y+D YKSGVY E+ +A
Sbjct: 256 -----------VYTDFLVYKSGVYQHVTGEEVGGHA------------------------ 280
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
++++GWG ENG PYW +++ +GD G KILRG++ IES +
Sbjct: 281 -----------IRILGWGVENGTPYWLAANSWNTDWGDNGFFKILRGQDHCGIESEIVAG 329
Query: 242 LPKDN 246
+P+ +
Sbjct: 330 IPRTD 334
>gi|60816353|gb|AAX36379.1| cathepsin B [synthetic construct]
gi|61358313|gb|AAX41546.1| cathepsin B [synthetic construct]
Length = 339
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 74/245 (30%), Positives = 106/245 (43%), Gaps = 61/245 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ Y V++ DI EI KNGPV FS
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPV-------EGAFS----- 254
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+YSD YKSGVY
Sbjct: 255 -----------VYSDFLLYKSGVYQ----------------------------------- 268
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+ E++ ++++GWG ENG PYW + +++ +GD G KILRG++ IES V
Sbjct: 269 HVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 328
Query: 242 LPKDN 246
+P+ +
Sbjct: 329 IPRTD 333
>gi|197098184|ref|NP_001126573.1| cathepsin B precursor [Pongo abelii]
gi|75061687|sp|Q5R6D1.1|CATB_PONAB RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
light chain; Contains: RecName: Full=Cathepsin B heavy
chain; Flags: Precursor
gi|55731764|emb|CAH92586.1| hypothetical protein [Pongo abelii]
gi|55731953|emb|CAH92685.1| hypothetical protein [Pongo abelii]
Length = 339
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 74/245 (30%), Positives = 106/245 (43%), Gaps = 61/245 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ Y V++ DI EI KNGPV FS
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSERDIMAEIYKNGPV-------EGAFS----- 254
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+YSD YKSGVY
Sbjct: 255 -----------VYSDFLLYKSGVYQ----------------------------------- 268
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+ E++ ++++GWG ENG PYW + +++ +GD G KILRG++ IES V
Sbjct: 269 HVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 328
Query: 242 LPKDN 246
+P+ +
Sbjct: 329 IPRTD 333
>gi|268578113|ref|XP_002644039.1| Hypothetical protein CBG17499 [Caenorhabditis briggsae]
Length = 355
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 77/234 (32%), Positives = 98/234 (41%), Gaps = 64/234 (27%)
Query: 13 WVHKRGLVTGGAHHSNTGCQPVSFPPC--NHANYTTSEPECKTLATPQPKCHTRCT-NDN 69
W GL TGG + GC+P S PC N+ N TTS P C TP C CT N
Sbjct: 177 WWQTHGLCTGGNYDDQFGCKPYSIYPCDKNYPNGTTSVP-CPGYHTP--PCEDHCTSNIT 233
Query: 70 YGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVA 129
+ + QDK+ K +Y V ++ DIQ EIM NGPV+A+ +Y D + YKSG Y
Sbjct: 234 WPIAYKQDKHFGKAHYNVGKKMTDIQTEIMTNGPVIASFIIYEDFWDYKSGIY------- 286
Query: 130 NMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVA 189
V + + KI+GWG +NG PYW V
Sbjct: 287 -----------------VHTAGDQEGGMDTKIIGWGVDNGVPYWLCVH------------ 317
Query: 190 YATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
+G FG+ G ++ILRG NE IE V ALP
Sbjct: 318 ----------------------QWGTDFGENGFVRILRGVNEVNIEHQVLAALP 349
>gi|75076082|sp|Q4R5M2.1|CATB_MACFA RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
light chain; Contains: RecName: Full=Cathepsin B heavy
chain; Flags: Precursor
gi|67970521|dbj|BAE01603.1| unnamed protein product [Macaca fascicularis]
gi|355779504|gb|EHH63980.1| Cathepsin B [Macaca fascicularis]
gi|383411999|gb|AFH29213.1| cathepsin B preproprotein [Macaca mulatta]
gi|384942194|gb|AFI34702.1| cathepsin B preproprotein [Macaca mulatta]
Length = 339
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 74/245 (30%), Positives = 106/245 (43%), Gaps = 61/245 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGYPAGAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ Y V++ DI EI KNGPV FS
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPV-------EGAFS----- 254
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+YSD YKSGVY
Sbjct: 255 -----------VYSDFLLYKSGVYQ----------------------------------- 268
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+ E++ ++++GWG ENG PYW + +++ +GD G KILRG++ IES V
Sbjct: 269 HVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 328
Query: 242 LPKDN 246
+P+ +
Sbjct: 329 IPRTD 333
>gi|402877481|ref|XP_003902454.1| PREDICTED: cathepsin B [Papio anubis]
Length = 339
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 74/245 (30%), Positives = 106/245 (43%), Gaps = 61/245 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGYPAGAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ Y V++ DI EI KNGPV FS
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPV-------EGAFS----- 254
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+YSD YKSGVY
Sbjct: 255 -----------VYSDFLLYKSGVYQ----------------------------------- 268
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+ E++ ++++GWG ENG PYW + +++ +GD G KILRG++ IES V
Sbjct: 269 HVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 328
Query: 242 LPKDN 246
+P+ +
Sbjct: 329 IPRTD 333
>gi|302564570|ref|NP_001181828.1| cathepsin B precursor [Macaca mulatta]
Length = 339
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 74/245 (30%), Positives = 106/245 (43%), Gaps = 61/245 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGYPAGAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ Y V++ DI EI KNGPV FS
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPV-------EGAFS----- 254
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+YSD YKSGVY
Sbjct: 255 -----------VYSDFLLYKSGVYQ----------------------------------- 268
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+ E++ ++++GWG ENG PYW + +++ +GD G KILRG++ IES V
Sbjct: 269 HVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 328
Query: 242 LPKDN 246
+P+ +
Sbjct: 329 IPRTD 333
>gi|332244666|ref|XP_003271495.1| PREDICTED: cathepsin B [Nomascus leucogenys]
Length = 351
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 74/245 (30%), Positives = 106/245 (43%), Gaps = 61/245 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 162 CNGGYPAEAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 219
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ Y V++ DI EI KNGPV FS
Sbjct: 220 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPV-------EGAFS----- 266
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+YSD YKSGVY
Sbjct: 267 -----------VYSDFLLYKSGVYQ----------------------------------- 280
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+ E++ ++++GWG ENG PYW + +++ +GD G KILRG++ IES V
Sbjct: 281 HITGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 340
Query: 242 LPKDN 246
+P+ +
Sbjct: 341 IPRTD 345
>gi|348587350|ref|XP_003479431.1| PREDICTED: cathepsin B-like [Cavia porcellus]
Length = 340
Score = 104 bits (260), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 73/251 (29%), Positives = 105/251 (41%), Gaps = 61/251 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + + P+C PKC
Sbjct: 150 CNGGYPTEAWKYWTRKGLVSGGLYGSHVGCRPYSIPPCEH-HVNGTRPKCTGEGGDTPKC 208
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + +DKY Y V +I EI KNGPV A ++SD +YKSG
Sbjct: 209 SKTC-EPGYSPSYKEDKYYGYSSYSVPSTEKEIMAEIYKNGPVEAAFSVFSDFLTYKSGV 267
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + E++ ++I+GWG+ENG PYW + + V
Sbjct: 268 YKH------------------------VAGEVLGGHAIRILGWGKENGVPYWLVGNSWNV 303
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+GD G KILRG + IES V
Sbjct: 304 ----------------------------------DWGDNGFFKILRGEDHCGIESEVVAG 329
Query: 242 LPK-DNYGVEF 251
+P+ D Y F
Sbjct: 330 IPRTDQYWGRF 340
>gi|410912140|ref|XP_003969548.1| PREDICTED: cathepsin B-like [Takifugu rubripes]
Length = 246
Score = 104 bits (260), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 74/242 (30%), Positives = 103/242 (42%), Gaps = 60/242 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S+ W + K GLV+GG + S+ GC+P + PPC H + S P C P+C
Sbjct: 64 CNGGYPSAAWDFWTKDGLVSGGLYDSHIGCRPYTIPPCEH-HVNGSRPSCSGEGGETPQC 122
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
RC Y + QDK+ K Y V+ + DI+ EI KNGPV +Y D YK+G
Sbjct: 123 VYRC-EAGYTPSYKQDKHYGKTSYSVSSDEDDIKHEIYKNGPVEGAFTVYEDFVLYKTGV 181
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + V+ SA + +KI+GWGEENG P
Sbjct: 182 YQH----------------------VTGSA--LGGHAIKILGWGEENGIP---------- 207
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW +++ +G+ G KILRG N IES +
Sbjct: 208 ------------------------YWLCANSWNTDWGNNGFFKILRGSNHCGIESEIVAG 243
Query: 242 LP 243
+P
Sbjct: 244 IP 245
>gi|146165818|ref|XP_001015807.2| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|146145394|gb|EAR95562.2| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 338
Score = 104 bits (260), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 68/242 (28%), Positives = 103/242 (42%), Gaps = 59/242 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G S+ W ++ G+ TGG + ++ C+P FPPC+H + P C + P PKC
Sbjct: 156 CQGGYPSAAWKYMKATGVSTGGLYGDDSSCKPYVFPPCDH-HVVGQYPPCGPIK-PTPKC 213
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C + + + QD + + Y + + IQ+EIM
Sbjct: 214 VKQCNSQYTEKTYQQDLHHPSKVYQLPNNAEAIQREIM---------------------- 251
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+GPV A+ + SD +YKSGVY + +VKI+GWG E G PY
Sbjct: 252 -AHGPVQASFRVASDFLTYKSGVYIRDPKLKYEGGHSVKIIGWGVEQGTPY--------- 301
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
W I +++ E +G+ G K+LRG+NE IE+ V
Sbjct: 302 -------------------------WLIANSWNEDWGENGLFKMLRGKNECGIEAEVVAG 336
Query: 242 LP 243
LP
Sbjct: 337 LP 338
>gi|308488328|ref|XP_003106358.1| hypothetical protein CRE_16047 [Caenorhabditis remanei]
gi|308253708|gb|EFO97660.1| hypothetical protein CRE_16047 [Caenorhabditis remanei]
Length = 343
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 74/243 (30%), Positives = 102/243 (41%), Gaps = 59/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + K GLVTGG++ S GC+P S PC + P+C PKC
Sbjct: 153 CEGGYPIQAWKYWVKNGLVTGGSYESQFGCKPYSIAPCGQTVNGVTWPKCPNSDADTPKC 212
Query: 62 HTRCT-NDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
CT N +Y + +DK+ Y V+ +V IQ EI+KNGPV +Y+D
Sbjct: 213 VDHCTSNSSYPIPYEKDKHYGATAYAVSRKVDQIQSEILKNGPVEVGFTVYAD------- 265
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
+ YKSGVY A E+ +A VK++GWG +NG P
Sbjct: 266 ----------------FYQYKSGVYVHVAGPELGGHA-VKLLGWGVDNGTP--------- 299
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
YW +++ +G+ G +ILRG NE IES V
Sbjct: 300 -------------------------YWLAANSWNTNWGENGYFRILRGVNECGIESQVVA 334
Query: 241 ALP 243
+P
Sbjct: 335 GMP 337
>gi|260786791|ref|XP_002588440.1| hypothetical protein BRAFLDRAFT_199166 [Branchiostoma floridae]
gi|229273602|gb|EEN44451.1| hypothetical protein BRAFLDRAFT_199166 [Branchiostoma floridae]
Length = 332
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 76/243 (31%), Positives = 104/243 (42%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W + + GLVTGG + S GCQP PC H + S P C + P P+C
Sbjct: 149 CHGGFPEAAWEYWKQDGLVTGGPYGSMQGCQPYEIAPCEH-HINGSRPACGKI-EPTPRC 206
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C + Y F +DK+ K Y V+ +V IQ EIM NGPV A +Y+D F +
Sbjct: 207 KKTCES-GYNVTFNKDKHYAKSAYSVSSKVQQIQMEIMTNGPVEAAFTVYAD-FPH---- 260
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
YKSGVY + AE+ +A VK++GWG E PY
Sbjct: 261 ------------------YKSGVYQHESGAELGGHA-VKMIGWGMEGSTPY--------- 292
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
W I +++ +GD G KILRG++E IE +
Sbjct: 293 -------------------------WLIANSWNSDWGDMGFFKILRGQDECGIERDIVAG 327
Query: 242 LPK 244
P+
Sbjct: 328 EPR 330
>gi|56753443|gb|AAW24925.1| unknown [Schistosoma japonicum]
Length = 342
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 74/242 (30%), Positives = 105/242 (43%), Gaps = 62/242 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + KRG+VTGG+ ++TGCQP FP C H + P C T P+C
Sbjct: 159 CQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEH-HTKGKYPACGTKIYKTPQC 217
Query: 62 HTRCTNDNYGRGFFQDK-YRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
+C Y + QDK Y +RY +++E A IQ+EIM GPV A +Y D +YKSG
Sbjct: 218 KQKCQK-GYKTPYEQDKNYGDQRYNVISNEKA-IQREIMMYGPVEAAFDVYEDFLNYKSG 275
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
Y + + IV ++I+GWG E G+PY
Sbjct: 276 IYRH------------------------VAGSIVGGHAIRIIGWGVEKGKPY-------- 303
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
W I +++ E +G+ G +++RGR+E IES V
Sbjct: 304 --------------------------WLIANSWNEDWGENGLFRMVRGRDECSIESHVVA 337
Query: 241 AL 242
L
Sbjct: 338 GL 339
>gi|294894292|ref|XP_002774787.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239880404|gb|EER06603.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 414
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 70/249 (28%), Positives = 103/249 (41%), Gaps = 51/249 (20%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHS------NTGCQPVSFPPCNHANYTTSEPECKTLA 55
C GI S W WVH +G+ TGG + + + GC P FPPC H + P+C +
Sbjct: 210 CDGGIPSLAWSWVHNKGIATGGDYLAEDDMTKDDGCWPYDFPPCAHHVNDSKYPKCPKDS 269
Query: 56 TPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIF 115
P C +C N Y D++ V D + I +GPV +Y
Sbjct: 270 YETPNCAEQCHNPKYTTTLRDDRHFLVESVPYEYSVNDAKNAIRTDGPV-GPIYFCDPSV 328
Query: 116 SYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
++ V A+ +Y D +Y+SGVY ++ E+ +A
Sbjct: 329 NFDQ-------VSASFIVYEDFLAYRSGVYKHTSGKELGGHA------------------ 363
Query: 176 VRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
VK+IGWGEE G+ YW +V+++ E +GD G KI G E I+
Sbjct: 364 -----------------VKIIGWGEETGQAYWLVVNSWNEDWGDNGLFKIALGNCE--ID 404
Query: 236 SLVNGALPK 244
+ G PK
Sbjct: 405 DDLLGGTPK 413
>gi|256052331|ref|XP_002569726.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|353228435|emb|CCD74606.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 319
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 70/234 (29%), Positives = 101/234 (43%), Gaps = 60/234 (25%)
Query: 5 GISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTR 64
G + W + K G+VTG + ++T CQP FP C H + P C P C
Sbjct: 139 GFPALAWDYWVKEGIVTGSSKENHTSCQPYPFPKCEH-HTKGKYPACFEEIYKTPNCENT 197
Query: 65 CTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGN 124
C +Y + QDK+R K Y V ++ IQ+EIMK GPV AN +Y D +YKSG Y +
Sbjct: 198 CQK-SYKTPYAQDKHRGKSRYNVKNDEKAIQKEIMKYGPVEANFIVYEDFLNYKSGIYKH 256
Query: 125 GPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSAS 184
+ ++V++ ++I+GWG EN PY
Sbjct: 257 ------------------------ITGKLVSWHAIRIIGWGVENNTPY------------ 280
Query: 185 AEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
W I +++ E +G+ G +ILRGR+E IES V
Sbjct: 281 ----------------------WLIPNSWNEDWGENGNFRILRGRHECSIESEV 312
>gi|37788265|gb|AAO64472.1| cathepsin B precursor [Fundulus heteroclitus]
Length = 330
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 76/243 (31%), Positives = 105/243 (43%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G ++ W + ++GLVTGG ++S+ GC+P + PC H + S P C P+C
Sbjct: 148 CNGGYPANAWEFWTEQGLVTGGLYNSHIGCRPYTIEPCEH-HVNGSRPPCTGEGGDTPEC 206
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
T+C Y + +DK+ K Y V E IQ EI KNGPV +Y D SYKSG
Sbjct: 207 VTQC-EAGYTPSYQKDKHYGKTSYGVPSEEEQIQSEIYKNGPVEGAFIVYEDFPSYKSGV 265
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + V+ SA + +K++GWGEENG P
Sbjct: 266 YQH----------------------VTGSA--LGGHAIKMIGWGEENGVP---------- 291
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW +++ +GD G KILRG N IES V
Sbjct: 292 ------------------------YWLCANSWNTDWGDNGFFKILRGSNHCGIESEVVAG 327
Query: 242 LPK 244
+PK
Sbjct: 328 IPK 330
>gi|432946172|ref|XP_004083803.1| PREDICTED: cathepsin B-like [Oryzias latipes]
Length = 330
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 73/241 (30%), Positives = 105/241 (43%), Gaps = 62/241 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G S+ W + +GLVTGG S GC+P + PC H + S P C+ PKC
Sbjct: 148 CFGGFPSAAWEFWTNKGLVTGGLFDSKVGCRPYTLAPCEH-HVNGSRPPCQG-EVETPKC 205
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
T+C N+ Y + +DK+ +R Y + + I E+ KNGPV A +Y+D YK+G
Sbjct: 206 VTQC-NNGYSLSYPKDKHFGQRSYSIPSQQEQIMTELYKNGPVEAAFSVYADFLLYKNGV 264
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + +++ VKI+GWGEENG P
Sbjct: 265 YQH------------------------VTGDMLGGHAVKILGWGEENGTP---------- 290
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES-LVNG 240
YW + +++ +GDKG KI RG +E IES +V G
Sbjct: 291 ------------------------YWLVANSWNSDWGDKGFFKIKRGNDECGIESEMVAG 326
Query: 241 A 241
A
Sbjct: 327 A 327
>gi|256052329|ref|XP_002569725.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|353228436|emb|CCD74607.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 345
Score = 104 bits (259), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 70/237 (29%), Positives = 101/237 (42%), Gaps = 60/237 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C GI W + K G+VTG + ++TGC+P FP C H + P C + P+C
Sbjct: 163 CEGGILGPAWDFWVKEGIVTGSSKENHTGCEPYPFPKCEH-HTKGKYPPCGSKIYKTPRC 221
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+R K Y V ++ IQ+EIMK GPV A+ +Y D +YKSG
Sbjct: 222 KQTCQK-KYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEASFTVYEDFLNYKSGI 280
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + E + ++I+GWG EN PY
Sbjct: 281 YKH------------------------ITGEALGGHAIRIIGWGVENKTPY--------- 307
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
W I +++ E +G+ G +I+RGR+E IES V
Sbjct: 308 -------------------------WLIANSWNEDWGENGYFRIVRGRDECFIESEV 339
>gi|22531389|emb|CAD44625.1| cathepsin B1 isotype 2 [Schistosoma mansoni]
Length = 340
Score = 103 bits (258), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 70/237 (29%), Positives = 101/237 (42%), Gaps = 60/237 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C GI W + K G+VTG + ++TGC+P FP C H + P C + P+C
Sbjct: 158 CEGGILGPAWDFWVKEGIVTGSSKENHTGCEPYPFPKCEH-HTKGKYPPCGSKIYKTPRC 216
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+R K Y V ++ IQ+EIMK GPV A+ +Y D +YKSG
Sbjct: 217 KQTCQK-KYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEASFTVYEDFLNYKSGI 275
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + E + ++I+GWG EN PY
Sbjct: 276 YKH------------------------ITGEALGGHAIRIIGWGVENKTPY--------- 302
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
W I +++ E +G+ G +I+RGR+E IES V
Sbjct: 303 -------------------------WLIANSWNEDWGENGYFRIVRGRDECFIESEV 334
>gi|339236191|ref|XP_003379650.1| cathepsin B [Trichinella spiralis]
gi|316977649|gb|EFV60721.1| cathepsin B [Trichinella spiralis]
Length = 356
Score = 103 bits (258), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 68/235 (28%), Positives = 101/235 (42%), Gaps = 60/235 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + K GLVTGG + ++ GC+P F PCNH + T P C P P C
Sbjct: 171 CQGGDPHQAWSFWVKYGLVTGGNYTTHDGCRPYPFAPCNHHSNGTYGP-CSHDLEPTPVC 229
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + +DKY + Y ++++ +D+Q+E+M NGP+ +Y D YK+G
Sbjct: 230 KKAC-QSTYKIQYNKDKYYGLKAYSLHNKASDLQKELMMNGPMEVAFEVYEDFLLYKTGV 288
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + ++ V+++GWGEENG P
Sbjct: 289 YQH------------------------HTGSVLGGHAVRLLGWGEENGVP---------- 314
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
YW + +++ ++GDKG KI RGRNE IES
Sbjct: 315 ------------------------YWLLANSWNTEWGDKGFFKIYRGRNECGIES 345
>gi|194766882|ref|XP_001965553.1| GF22391 [Drosophila ananassae]
gi|190619544|gb|EDV35068.1| GF22391 [Drosophila ananassae]
Length = 342
Score = 103 bits (258), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 73/243 (30%), Positives = 106/243 (43%), Gaps = 59/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++G+V+GG + S TGC+P PC H T P C + PKC
Sbjct: 157 CNGGFPGAAWSYWTRKGIVSGGRYGSKTGCRPYEIAPCEHHVNGTRAP-CNH-DSKTPKC 214
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C Y + +DK+ + Y V V DIQ+EIM NGPV +Y D
Sbjct: 215 QHQC-EAGYNVEYSKDKHFGSKSYSVRRNVRDIQEEIMTNGPVEGAFTVYED-------- 265
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+ YKSGVY E+ +A ++I+GWG
Sbjct: 266 ---------------LILYKSGVYQHEHGKELGGHA-IRILGWGV--------------- 294
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
WG+E PYW I +++ + +GDKG +ILRG + IES ++
Sbjct: 295 ----------------WGKEE-VPYWLIANSWNDDWGDKGFFRILRGEDHCGIESSISAG 337
Query: 242 LPK 244
LPK
Sbjct: 338 LPK 340
>gi|193603738|ref|XP_001943652.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 337
Score = 103 bits (258), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 76/236 (32%), Positives = 100/236 (42%), Gaps = 66/236 (27%)
Query: 11 WVWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPKCHT-RCTND 68
W ++ K GL TGG + SN GCQP S PC +AN + E E P+C+ +CTN+
Sbjct: 165 WKYIKKNGLCTGGEYGSNEGCQPYSIVPCPRNANSCSKENE------DTPQCYKDQCTNN 218
Query: 69 NYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVV 128
NY D Y + Y V + I E+ KNGPVVA M +Y D YK G
Sbjct: 219 NYETPLVSDLYYAYKVYSVKPKPEIIMSEVFKNGPVVAAMKVYDDFLCYKGG-------- 270
Query: 129 ANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIV 188
I+ Y +G + VKI+GWGE+
Sbjct: 271 --------IYQYTTG--------GLKGDHAVKIMGWGED--------------------- 293
Query: 189 AYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
+G YW +T+G +G G KI RGRNE IE+ + G LPK
Sbjct: 294 -------------DGIDYWLCANTWGNSWGMGGMFKIRRGRNECGIENRITGGLPK 336
>gi|403307501|ref|XP_003944231.1| PREDICTED: cathepsin B [Saimiri boliviensis boliviensis]
Length = 351
Score = 103 bits (258), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 74/245 (30%), Positives = 106/245 (43%), Gaps = 61/245 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 162 CNGGYPAEAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 219
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ Y V++ DI EI KNGPV FS
Sbjct: 220 SKSC-EPGYTPTYKQDKHYGYNSYSVSNSERDIMAEIYKNGPV-------EGAFS----- 266
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+YSD YKSGVY
Sbjct: 267 -----------VYSDFLLYKSGVYQ----------------------------------- 280
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+ E++ ++++GWG ENG PYW + +++ +GD G KILRG++ IES V
Sbjct: 281 HVTGEMMGGHAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEVVAG 340
Query: 242 LPKDN 246
+P+ +
Sbjct: 341 IPRTD 345
>gi|226466816|emb|CAX69543.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 337
Score = 103 bits (258), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 71/239 (29%), Positives = 105/239 (43%), Gaps = 60/239 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S S W + + GLVTG ++ +N+GC P FP C+H + + S P C + P C
Sbjct: 153 CNFGYSESAWYYWVENGLVTGESNGNNSGCLPYPFPKCDHGS-SDSYPMCGYVVYTPPVC 211
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ C Y + DK+ K Y V +DI++EIM GPV A++++Y D YKSG
Sbjct: 212 NGTC-RPGYPIPYNDDKHFGKSAYQVKQNESDIRREIMLYGPVEASIFIYDDFVDYKSGV 270
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + ++ +V+I+GWG ENG P
Sbjct: 271 YKH------------------------LTGRLITIQSVRIIGWGIENGIP---------- 296
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
YW +++ E++G G KILRG NE IE+ VN
Sbjct: 297 ------------------------YWLCANSWNEEWGLNGFFKILRGSNECEIEAFVNA 331
>gi|343961899|dbj|BAK62537.1| cathepsin B precursor [Pan troglodytes]
Length = 195
Score = 103 bits (257), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 73/245 (29%), Positives = 105/245 (42%), Gaps = 61/245 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 6 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 63
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ Y V++ I EI KNGPV FS
Sbjct: 64 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKGIMAEIYKNGPV-------EGAFS----- 110
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+YSD YKSGVY
Sbjct: 111 -----------VYSDFLLYKSGVYQ----------------------------------- 124
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+ E++ ++++GWG ENG PYW + +++ +GD G KILRG++ IES V
Sbjct: 125 HVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 184
Query: 242 LPKDN 246
+P+ +
Sbjct: 185 IPRTD 189
>gi|161343861|tpg|DAA06111.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 323
Score = 103 bits (257), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 74/245 (30%), Positives = 99/245 (40%), Gaps = 64/245 (26%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK- 60
C G + W + +G+VTGG + SN GCQP PC+H +S C +L Q
Sbjct: 134 CDGGSAYKAWEFTMGKGIVTGGPYDSNEGCQPYKNRPCDHYG-DSSLTNCSSLRRTQMMF 192
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRYY---WVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
C +C N NY + D Y+ Y W N V IQQEIM GPV A MY+Y + Y
Sbjct: 193 CRDKCVNKNYKVKYEDDLYKTSVVYMTSWTN--VKQIQQEIMTYGPVTAFMYVYENFMGY 250
Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
K G Y S + E++ Y VK++GWG
Sbjct: 251 KEGVYK------------------------STAGELIGYHHVKLIGWGV----------- 275
Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
+E G YW ++++ +G+ G KILRG N IE L
Sbjct: 276 ----------------------DEAGIEYWLAMNSWNSNWGNDGLFKILRGYNFCSIELL 313
Query: 238 VNGAL 242
V L
Sbjct: 314 VMAGL 318
>gi|158261501|dbj|BAF82928.1| unnamed protein product [Homo sapiens]
Length = 339
Score = 103 bits (257), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 73/245 (29%), Positives = 105/245 (42%), Gaps = 61/245 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ Y V++ DI EI KNGP FS
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPA-------EGAFS----- 254
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+YSD YKSGVY
Sbjct: 255 -----------VYSDFLLYKSGVYQ----------------------------------- 268
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+ E++ ++++GWG ENG PYW + +++ +GD G KILRG++ IES V
Sbjct: 269 HVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 328
Query: 242 LPKDN 246
+P+ +
Sbjct: 329 IPRTD 333
>gi|999909|pdb|1HUC|B Chain B, The Refined 2.15 Angstroms X-Ray Crystal Structure Of
Human Liver Cathepsin B: The Structural Basis For Its
Specificity
gi|999911|pdb|1HUC|D Chain D, The Refined 2.15 Angstroms X-Ray Crystal Structure Of
Human Liver Cathepsin B: The Structural Basis For Its
Specificity
gi|1421164|pdb|1CSB|B Chain B, Crystal Structure Of Cathepsin B Inhibited With Ca030 At
2.1 Angstroms Resolution: A Basis For The Design Of
Specific Epoxysuccinyl Inhibitors
gi|1421167|pdb|1CSB|E Chain E, Crystal Structure Of Cathepsin B Inhibited With Ca030 At
2.1 Angstroms Resolution: A Basis For The Design Of
Specific Epoxysuccinyl Inhibitors
gi|122920711|pdb|2IPP|B Chain B, Crystal Structure Of The Tetragonal Form Of Human Liver
Cathepsin B
Length = 205
Score = 103 bits (257), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 72/245 (29%), Positives = 104/245 (42%), Gaps = 61/245 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 22 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 79
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ Y V++ DI EI KNGPV +YSD YKSG
Sbjct: 80 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGV 138
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + E++ ++I+GWG ENG P
Sbjct: 139 YQH------------------------VTGEMMGGHAIRILGWGVENGTP---------- 164
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW + +++ +GD G KILRG++ IES V
Sbjct: 165 ------------------------YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 200
Query: 242 LPKDN 246
+P+ +
Sbjct: 201 IPRTD 205
>gi|431918315|gb|ELK17542.1| Cathepsin B [Pteropus alecto]
Length = 359
Score = 103 bits (257), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 74/242 (30%), Positives = 103/242 (42%), Gaps = 60/242 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W + K+GLV+GG + S+ GC+P S PPC H + S P C PKC
Sbjct: 173 CNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPCTGEGGSTPKC 231
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+R Y + +DK+ Y V +I EI KNGPV A +YSD YKSG
Sbjct: 232 -SRICEAGYTPSYKEDKHFGCSSYSVPSSETEIMAEIYKNGPVEAAFSVYSDFLLYKSGV 290
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + E++ V+I+GWG E+G PYW
Sbjct: 291 YQH------------------------VTGEMMGGHAVRILGWGVEDGTPYW-------- 318
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
L+G +++ +GD G KILRG++ IES +
Sbjct: 319 -------------LVG-------------NSWNTDWGDSGFFKILRGQDHCGIESEIVAG 352
Query: 242 LP 243
LP
Sbjct: 353 LP 354
>gi|256090368|ref|XP_002581167.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|22531387|emb|CAD44624.1| cathepsin B1 isotype 1 [Schistosoma mansoni]
gi|353228442|emb|CCD74613.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 340
Score = 103 bits (256), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 70/237 (29%), Positives = 100/237 (42%), Gaps = 60/237 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C GI W + K G+VTG + ++TGC+P FP C H + P C + P+C
Sbjct: 158 CEGGILGPAWDYWVKEGIVTGSSKENHTGCEPYPFPKCEH-HTKGKYPPCGSKIYKTPRC 216
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+R K Y V ++ IQ+EIMK GPV A +Y D +YKSG
Sbjct: 217 KQTCQK-KYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEAGFTVYEDFLNYKSGI 275
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + E + ++I+GWG EN PY
Sbjct: 276 YKH------------------------ITGETLGGHAIRIIGWGVENKTPY--------- 302
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
W I +++ E +G+ G +I+RGR+E IES V
Sbjct: 303 -------------------------WLIANSWNEDWGENGYFRIVRGRDECSIESEV 334
>gi|189096178|pdb|3CBJ|A Chain A, Chagasin-cathepsin B Complex
gi|189096180|pdb|3CBK|A Chain A, Chagasin-Cathepsin B
Length = 266
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 73/245 (29%), Positives = 106/245 (43%), Gaps = 61/245 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++GLV+GG + S+ GC+P S PPC A+ + P C T PKC
Sbjct: 77 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCE-AHVNGARPPC-TGEGDTPKC 134
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ Y V++ DI EI KNGPV FS
Sbjct: 135 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPV-------EGAFS----- 181
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+YSD YKSGVY
Sbjct: 182 -----------VYSDFLLYKSGVYQ----------------------------------- 195
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+ E++ ++++GWG ENG PYW + +++ +GD G KILRG++ IES V
Sbjct: 196 HVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 255
Query: 242 LPKDN 246
+P+ +
Sbjct: 256 IPRTD 260
>gi|296221607|ref|XP_002756833.1| PREDICTED: cathepsin B, partial [Callithrix jacchus]
Length = 330
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 73/245 (29%), Positives = 106/245 (43%), Gaps = 61/245 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 141 CNGGYPAEAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 198
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ Y V++ DI EI KNGPV FS
Sbjct: 199 SKSC-EPGYSPTYKQDKHYGYDSYSVSNNERDIMAEIYKNGPV-------EGAFS----- 245
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+Y+D YKSGVY
Sbjct: 246 -----------VYADFLLYKSGVYQ----------------------------------- 259
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+ E++ ++++GWG ENG PYW + +++ +GD G KILRG++ IES V
Sbjct: 260 HVTGEMMGGHAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEVVAG 319
Query: 242 LPKDN 246
+P+ +
Sbjct: 320 IPRTD 324
>gi|262368170|pdb|3K9M|A Chain A, Cathepsin B In Complex With Stefin A
gi|262368172|pdb|3K9M|B Chain B, Cathepsin B In Complex With Stefin A
Length = 254
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 72/245 (29%), Positives = 104/245 (42%), Gaps = 61/245 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 71 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 128
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ Y V++ DI EI KNGPV +YSD YKSG
Sbjct: 129 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGV 187
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + E++ ++I+GWG ENG P
Sbjct: 188 YQH------------------------VTGEMMGGHAIRILGWGVENGTP---------- 213
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW + +++ +GD G KILRG++ IES V
Sbjct: 214 ------------------------YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 249
Query: 242 LPKDN 246
+P+ +
Sbjct: 250 IPRTD 254
>gi|45361295|ref|NP_989225.1| cathepsin B precursor [Xenopus (Silurana) tropicalis]
gi|38969948|gb|AAH63365.1| hypothetical protein MGC75969 [Xenopus (Silurana) tropicalis]
Length = 333
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 75/244 (30%), Positives = 103/244 (42%), Gaps = 60/244 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W + + GLV+GG + S+ GC+P S PPC H + S P CK PKC
Sbjct: 150 CNGGYPSGAWKFWTETGLVSGGLYDSHLGCRPYSIPPCEH-HVNGSRPACKGEEGDTPKC 208
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C D Y + DK+ Y V +I EI KNGPV +Y+D
Sbjct: 209 VKQC-EDGYAPVYGSDKHFGATSYGVPSSEKEIMAEIYKNGPVEGAFLVYADF------- 260
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
P+ YKSGVY E+ +A +KI+GWG ENG P
Sbjct: 261 ----PM------------YKSGVYQHETGEELGGHA-IKILGWGVENGTP---------- 293
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW +++ +GD G KILRG++ IES +
Sbjct: 294 ------------------------YWLCANSWNTDWGDNGFFKILRGKDHCGIESEIVAG 329
Query: 242 LPKD 245
+PK+
Sbjct: 330 IPKN 333
>gi|345790427|ref|XP_543203.3| PREDICTED: cathepsin B [Canis lupus familiaris]
Length = 339
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 75/242 (30%), Positives = 105/242 (43%), Gaps = 61/242 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + K+GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGFPAEAWNFWTKQGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + +DK+ Y V+D +I EI KNGPV A +YSD YKSG
Sbjct: 208 SKIC-EPGYSPSYKEDKHYGCSSYSVSDNEKEIMAEIYKNGPVEAAFTVYSDFLLYKSGV 266
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + E++ V+I+GWG E+G PYW
Sbjct: 267 YQH------------------------VTGEMMGGHAVRILGWGVEDGTPYW-------- 294
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
L+G +++ +GD G KILRGR+ IES +
Sbjct: 295 -------------LVG-------------NSWNTDWGDNGFFKILRGRDHCGIESEIVAG 328
Query: 242 LP 243
+P
Sbjct: 329 IP 330
>gi|333361087|pdb|3AI8|B Chain B, Cathepsin B In Complex With The Nitroxoline
gi|333361088|pdb|3AI8|A Chain A, Cathepsin B In Complex With The Nitroxoline
Length = 256
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 72/245 (29%), Positives = 104/245 (42%), Gaps = 61/245 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 73 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 130
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ Y V++ DI EI KNGPV +YSD YKSG
Sbjct: 131 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGV 189
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + E++ ++I+GWG ENG P
Sbjct: 190 YQH------------------------VTGEMMGGHAIRILGWGVENGTP---------- 215
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW + +++ +GD G KILRG++ IES V
Sbjct: 216 ------------------------YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 251
Query: 242 LPKDN 246
+P+ +
Sbjct: 252 IPRTD 256
>gi|56752811|gb|AAW24617.1| unknown [Schistosoma japonicum]
Length = 342
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 75/233 (32%), Positives = 103/233 (44%), Gaps = 63/233 (27%)
Query: 13 WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGR 72
WV KRG+VTGG+ ++TGCQP FP C H P C T P+C C Y
Sbjct: 171 WV-KRGIVTGGSEENHTGCQPYPFPKCEHLT-KGKYPACGTKIYKTPQCKQTCQK-GYKT 227
Query: 73 GFFQDK-YRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANM 131
+ QDK Y +RY +++E A IQ+EIM GPV A +Y D +YKSG Y +
Sbjct: 228 PYEQDKHYGDQRYNVISNEKA-IQREIMMYGPVEAAFDVYEDFLNYKSGIYRH------- 279
Query: 132 YLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYA 191
+ IV ++I+GWG E G+PY
Sbjct: 280 -----------------VTGSIVGGHAIRIIGWGVEKGKPY------------------- 303
Query: 192 TVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
W I +++ E +G+KG +++RGR+E IES V L K
Sbjct: 304 ---------------WLIANSWNEDWGEKGLFRMVRGRDECSIESHVVAGLIK 341
>gi|157833437|pdb|1PBH|A Chain A, Crystal Structure Of Human Recombinant Procathepsin B At
3.2 Angstrom Resolution
gi|157835646|pdb|2PBH|A Chain A, Crystal Structure Of Human Procathepsin B At 3.3 Angstrom
Resolution
gi|157836863|pdb|3PBH|A Chain A, Refined Crystal Structure Of Human Procathepsin B At 2.5
Angstrom Resolution
Length = 317
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 72/245 (29%), Positives = 104/245 (42%), Gaps = 61/245 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 134 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 191
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ Y V++ DI EI KNGPV +YSD YKSG
Sbjct: 192 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGV 250
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + E++ ++I+GWG ENG P
Sbjct: 251 YQH------------------------VTGEMMGGHAIRILGWGVENGTP---------- 276
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW + +++ +GD G KILRG++ IES V
Sbjct: 277 ------------------------YWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAG 312
Query: 242 LPKDN 246
+P+ +
Sbjct: 313 IPRTD 317
>gi|73586701|gb|AAI02998.1| CTSB protein [Bos taurus]
Length = 335
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 75/242 (30%), Positives = 104/242 (42%), Gaps = 61/242 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W + K+GLV+GG ++S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + +DK+ Y V + +I EI KNGPV +YSD YKSG
Sbjct: 208 SKTC-EPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGV 266
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + S EI+ ++I+GWG ENG PYW
Sbjct: 267 YQH------------------------VSGEIMGGHAIRILGWGVENGTPYW-------- 294
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
L+G +++ +GD G KILRG++ IES +
Sbjct: 295 -------------LVG-------------NSWNTDWGDNGFFKILRGQDHCGIESEIVAG 328
Query: 242 LP 243
+P
Sbjct: 329 MP 330
>gi|226471004|emb|CAX70583.1| Cysteine PRotease related protein [Schistosoma japonicum]
Length = 304
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 75/242 (30%), Positives = 102/242 (42%), Gaps = 62/242 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + KRG+VTGG+ ++TGCQP FP C H P C T P+C
Sbjct: 121 CKGGFPGQAWDYWVKRGIVTGGSEENHTGCQPYPFPKCEHLT-KGKYPACGTKIYKTPQC 179
Query: 62 HTRCTNDNYGRGFFQDK-YRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C Y + QDK Y +RY +++E A IQ+EIM GPV A +Y D +YKSG
Sbjct: 180 KQTCQK-GYKTPYEQDKHYGDQRYNVISNEKA-IQREIMMYGPVEAAFDVYEDFLNYKSG 237
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
Y + + IV ++I+GWG E PY
Sbjct: 238 IYRH------------------------VTGSIVGGHAIRIIGWGVEKRTPY-------- 265
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
W I +++ E +G+KG +I+RGR+E IES V
Sbjct: 266 --------------------------WLIANSWNEDWGEKGLFRIVRGRDECSIESHVVA 299
Query: 241 AL 242
L
Sbjct: 300 GL 301
>gi|449667614|ref|XP_002166962.2| PREDICTED: cathepsin B-like [Hydra magnipapillata]
Length = 330
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 73/243 (30%), Positives = 102/243 (41%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W + +G+VTGG ++S+ GCQP + P C+H + P +L P PKC
Sbjct: 148 CNGGYPQSAWEFFKTKGIVTGGPYNSHKGCQPYAIPACDHHVPHSKNPCNGSL--PTPKC 205
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + DK+ Y +N++ +I +EIM NGPV A +++D +YKSG
Sbjct: 206 EKVC-EKGYNITYKNDKHYGVTSYSINNDQNEIMREIMTNGPVEAAFTVFADFPNYKSGV 264
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + S E + +KI+GWG EN PYW + +
Sbjct: 265 YQH------------------------VSGEELGGHAIKILGWGVENNTPYWLVANSW-- 298
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
P W GD G KILRG +E IE V
Sbjct: 299 ----------------------NPSW----------GDNGFFKILRGSDECGIEDEVVAG 326
Query: 242 LPK 244
LPK
Sbjct: 327 LPK 329
>gi|17560488|ref|NP_506310.1| Protein F32H5.1 [Caenorhabditis elegans]
gi|3876629|emb|CAB04249.1| Protein F32H5.1 [Caenorhabditis elegans]
Length = 356
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 75/234 (32%), Positives = 99/234 (42%), Gaps = 64/234 (27%)
Query: 13 WVHKRGLVTGGAHHSNTGCQPVSFPPCN--HANYTTSEPECKTLATPQPKCHTRCT-NDN 69
W GL TGG ++ GC+P S PC+ +AN TTS P C TP C CT N
Sbjct: 178 WWQTHGLCTGGNYNDQFGCKPYSIYPCDKKYANGTTSVP-CPGYHTP--TCEEHCTSNIT 234
Query: 70 YGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVA 129
+ + QDK+ K +Y V ++ DIQ EIM NGPV+A+ +Y D + YK+G Y
Sbjct: 235 WPIAYKQDKHFGKAHYNVGKKMTDIQIEIMTNGPVIASFIIYDDFWDYKTGIY------- 287
Query: 130 NMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVA 189
V + + KI+GWG +NG PYW V
Sbjct: 288 -----------------VHTAGDQEGGMDTKIIGWGVDNGVPYWLCVH------------ 318
Query: 190 YATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
+G FG+ G ++ LRG NE IE V ALP
Sbjct: 319 ----------------------QWGTDFGENGFVRFLRGVNEVNIEHQVLAALP 350
>gi|201023321|ref|NP_001128402.1| cathepsin B-1874 precursor [Acyrthosiphon pisum]
Length = 315
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 68/243 (27%), Positives = 97/243 (39%), Gaps = 59/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQ-PK 60
C G +W + + G V+GG ++SN GCQP + PPC N C T + P
Sbjct: 130 CDGGSLFESWDYYRRHGFVSGGDYNSNQGCQPYTIPPCKLMNEKPPGHSCTTYHREETPI 189
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C +C N NY F D Y+ +YY ++ +A ++I NGP+ Y+Y D+ YKSG
Sbjct: 190 CEKKCYNPNYYTSFRTDIYK-GKYYKLSPYMA--MKDIFDNGPITTQFYMYRDLVDYKSG 246
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
Y Y + + +VKI GWGEENG P
Sbjct: 247 VY---------------------QYDEQSDFDFFTVHSVKIFGWGEENGVP--------- 276
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
YW + ++FG +G GT KI RG + + +
Sbjct: 277 -------------------------YWLVANSFGTDWGYNGTFKISRGNDGCFFQEKMYA 311
Query: 241 ALP 243
LP
Sbjct: 312 GLP 314
>gi|203648|gb|AAA40993.1| cathepsin (EC 3.4.22.1), partial [Rattus norvegicus]
Length = 271
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 70/243 (28%), Positives = 106/243 (43%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W + ++GLV+GG ++S+ GC P + PPC H + S P C T PKC
Sbjct: 82 CNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH-HVNGSRPPC-TGEGDTPKC 139
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ C Y + +DK+ Y V+D +I EI KNGPV ++SD +YKSG
Sbjct: 140 NKMC-EAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSDFLTYKSG- 197
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
++ +++G +++ ++I+GWG ENG PYW + + V
Sbjct: 198 ---------------VYKHEAG--------DVMGGHAIRILGWGIENGVPYWLVANSWNV 234
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+GD G KILRG N IES +
Sbjct: 235 ----------------------------------DWGDNGFFKILRGENHCGIESEIVAG 260
Query: 242 LPK 244
+P+
Sbjct: 261 IPR 263
>gi|25988674|gb|AAN76202.1| lysosomal cysteine proteinase cathepsin B/green fluorescent protein
EGFP fusion protein [synthetic construct]
Length = 578
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 70/243 (28%), Positives = 106/243 (43%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W + ++GLV+GG ++S+ GC P + PPC H + S P C T PKC
Sbjct: 150 CNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ C Y + +DK+ Y V+D +I EI KNGPV ++SD +YKSG
Sbjct: 208 NKMC-EAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSDFLTYKSG- 265
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
++ +++G +++ ++I+GWG ENG PYW + + V
Sbjct: 266 ---------------VYKHEAG--------DVMGGHAIRILGWGIENGVPYWLVANSWNV 302
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+GD G KILRG N IES +
Sbjct: 303 ----------------------------------DWGDNGFFKILRGENHCGIESEIVAG 328
Query: 242 LPK 244
+P+
Sbjct: 329 IPR 331
>gi|341886633|gb|EGT42568.1| hypothetical protein CAEBREN_17563 [Caenorhabditis brenneri]
Length = 358
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 75/234 (32%), Positives = 99/234 (42%), Gaps = 64/234 (27%)
Query: 13 WVHKRGLVTGGAHHSNTGCQPVSFPPCN--HANYTTSEPECKTLATPQPKCHTRCT-NDN 69
W GL TGG + GC+P + PC+ + N TTS P C TP C RCT N
Sbjct: 180 WWQTHGLCTGGNYDDQFGCKPYTIYPCDKKYPNGTTSVP-CPGYHTP--VCEERCTSNIT 236
Query: 70 YGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVA 129
+ + QDK+ K +Y V ++ DIQ EIM+NGPV+A+ +Y D + YKSG Y
Sbjct: 237 WPISYKQDKHFGKAHYNVGKKMTDIQTEIMRNGPVIASFIIYDDFWDYKSGIY------- 289
Query: 130 NMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVA 189
V + + KI+GWG +NG PYW V
Sbjct: 290 -----------------VHTAGDQEGGMDTKIIGWGVDNGVPYWLCVH------------ 320
Query: 190 YATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
+G FG+ G ++ILRG NE IE V A P
Sbjct: 321 ----------------------QWGTDFGENGFVRILRGVNEVNIEHQVLAAQP 352
>gi|56752997|gb|AAW24710.1| unknown [Schistosoma japonicum]
Length = 342
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 74/231 (32%), Positives = 102/231 (44%), Gaps = 63/231 (27%)
Query: 13 WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGR 72
WV KRG+VTGG+ ++TGCQP FP C H P C T P+C C Y
Sbjct: 171 WV-KRGIVTGGSEENHTGCQPYPFPKCEHLT-KGKYPACGTKIYKTPQCKQTCQK-GYKT 227
Query: 73 GFFQDK-YRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANM 131
+ QDK Y +RY +++E A IQ+EIM GPV A +Y D +YKSG Y +
Sbjct: 228 PYEQDKHYGDQRYNVISNEKA-IQREIMMYGPVEAAFDVYEDFLNYKSGIYRH------- 279
Query: 132 YLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYA 191
+ IV ++I+GWG E G+PY
Sbjct: 280 -----------------VTGSIVGGHAIRIIGWGVEKGKPY------------------- 303
Query: 192 TVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 242
W I +++ E +G+KG +++RGR+E IES V L
Sbjct: 304 ---------------WLIANSWNEDWGEKGLFRMVRGRDECSIESHVVAGL 339
>gi|226468762|emb|CAX76409.1| cathepsin B [Schistosoma japonicum]
gi|257206178|emb|CAX82740.1| cathepsin B [Schistosoma japonicum]
Length = 348
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 70/243 (28%), Positives = 101/243 (41%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W++ +G+VTG +++ GCQP FPPC H N P C P C
Sbjct: 164 CNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPYEFPPCEH-NTLGPLPVCDG-DVETPPC 221
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + DK+ K Y V I +E+M++GPV + +Y+D +YKSG
Sbjct: 222 KRTC-QAGYNVSYENDKWYGKVVYRVKSNQEAIMKELMQHGPVEVDFEVYADFPNYKSGV 280
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + S ++ V+++GWGEEN P
Sbjct: 281 YQH------------------------VSGALLGGHAVRLLGWGEENNVP---------- 306
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW I +++ +GD G KI+RG+NE IES VN
Sbjct: 307 ------------------------YWLIANSWNTDWGDNGYFKIIRGKNECGIESDVNAG 342
Query: 242 LPK 244
+PK
Sbjct: 343 IPK 345
>gi|1705630|sp|P00787.2|CATB_RAT RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; AltName:
Full=RSG-2; Contains: RecName: Full=Cathepsin B light
chain; Contains: RecName: Full=Cathepsin B heavy chain;
Flags: Precursor
gi|1524328|emb|CAA57792.1| cathepsin b [Rattus norvegicus]
Length = 339
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 70/243 (28%), Positives = 106/243 (43%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W + ++GLV+GG ++S+ GC P + PPC H + S P C T PKC
Sbjct: 150 CNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ C Y + +DK+ Y V+D +I EI KNGPV ++SD +YKSG
Sbjct: 208 NKMC-EAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSDFLTYKSG- 265
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
++ +++G +++ ++I+GWG ENG PYW + + V
Sbjct: 266 ---------------VYKHEAG--------DVMGGHAIRILGWGIENGVPYWLVANSWNV 302
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+GD G KILRG N IES +
Sbjct: 303 ----------------------------------DWGDNGFFKILRGENHCGIESEIVAG 328
Query: 242 LPK 244
+P+
Sbjct: 329 IPR 331
>gi|82830420|ref|NP_072119.2| cathepsin B preproprotein [Rattus norvegicus]
gi|47939014|gb|AAH72490.1| Cathepsin B [Rattus norvegicus]
gi|149030258|gb|EDL85314.1| rCG52258, isoform CRA_a [Rattus norvegicus]
Length = 339
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 70/243 (28%), Positives = 106/243 (43%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W + ++GLV+GG ++S+ GC P + PPC H + S P C T PKC
Sbjct: 150 CNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ C Y + +DK+ Y V+D +I EI KNGPV ++SD +YKSG
Sbjct: 208 NKMC-EAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSDFLTYKSG- 265
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
++ +++G +++ ++I+GWG ENG PYW + + V
Sbjct: 266 ---------------VYKHEAG--------DVMGGHAIRILGWGIENGVPYWLVANSWNV 302
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+GD G KILRG N IES +
Sbjct: 303 ----------------------------------DWGDNGFFKILRGENHCGIESEIVAG 328
Query: 242 LPK 244
+P+
Sbjct: 329 IPR 331
>gi|347972086|ref|XP_313835.5| AGAP004533-PA [Anopheles gambiae str. PEST]
gi|333469165|gb|EAA09183.5| AGAP004533-PA [Anopheles gambiae str. PEST]
Length = 337
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 67/243 (27%), Positives = 101/243 (41%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++GLV+GG SN GCQP + PC H + + P C+ PKC
Sbjct: 154 CNGGFPGAAWSYWVRKGLVSGGPFGSNLGCQPYAIAPCEH-HVNGTRPSCEGEGGKTPKC 212
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C ++Y + +DK Y + A IQ+EIM NGPV +Y D+ YK G
Sbjct: 213 VKKC-QESYNVPYQKDKRFGASSYSIARHEAQIQKEIMTNGPVEGAFTVYEDLLHYKEGV 271
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + +++ ++I+GWG ENG Y
Sbjct: 272 YQH------------------------VTGKMLGGHAIRILGWGVENGTKY--------- 298
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
W I +++ +GD G KILRG + IES ++
Sbjct: 299 -------------------------WLIANSWNSDWGDNGFFKILRGEDHLGIESSISAG 333
Query: 242 LPK 244
LPK
Sbjct: 334 LPK 336
>gi|427785213|gb|JAA58058.1| Putative cathepsin l culex quinquefasciatus cathepsin l
[Rhipicephalus pulchellus]
Length = 346
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 75/243 (30%), Positives = 98/243 (40%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W + K G+VTGG + S+ GC P C+H T P C P P+C
Sbjct: 163 CNGGFPGSAWSFWVKTGIVTGGNYDSDDGCMPYPIKACDHHVNGTLGP-CDKKIPPTPRC 221
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + DK+ K Y V E IQ EIM NGPV A+ +
Sbjct: 222 VHMCRK-GYDVDYHDDKHYGKSSYSVPSEEKQIQAEIMTNGPVEADFTV----------- 269
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
YSD YKSGVY + +A ++++GWG ENG P
Sbjct: 270 ------------YSDFVHYKSGVYQRHTDEALGGHA-IRLLGWGVENGVP---------- 306
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW +++ ++GDKG KILRG +E IE V
Sbjct: 307 ------------------------YWLAANSWNTEWGDKGFFKILRGSDECGIEDDVVAG 342
Query: 242 LPK 244
LPK
Sbjct: 343 LPK 345
>gi|294876463|ref|XP_002767679.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239869446|gb|EER00397.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 348
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 70/240 (29%), Positives = 101/240 (42%), Gaps = 57/240 (23%)
Query: 2 CSSGISSSTWVWVHKRGLVTGG------AHHSNTGCQPVSFPPCNHANYTTSEPECKTLA 55
C G++ WV+++K G+ TGG + + GC P +FP C H + C +
Sbjct: 157 CKGGVARDAWVFLNKHGIATGGDFVPKSSMEAVDGCWPYNFPRCAHYQKKSKYGPCPKKS 216
Query: 56 TPQPKCHTRCTNDNYGRGFFQDKYRFKRY--YWVNDEVADIQQEIMKNGPVVANMYLYSD 113
P C RC N+ YG +D++ R YW N + I++EIMK+GP A+ + Y D
Sbjct: 217 YETPSCLDRCPNEKYGTPLDKDRHFTARAVPYWFNG-IRSIKKEIMKHGPTSASFFTYED 275
Query: 114 IFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYW 173
FSYKSG ++ Y SG Y V + TV+++GWG E G YW
Sbjct: 276 FFSYKSG----------------VYKYTSGAY--------VEFHTVELIGWGTEKGVDYW 311
Query: 174 TIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 233
W EE W + TF GD G ++ G A+
Sbjct: 312 LAKN-------------------DWNEE-----WADLGTFKIAQGDCGINDLVLGAPAAL 347
>gi|161343855|tpg|DAA06108.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 342
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 74/246 (30%), Positives = 105/246 (42%), Gaps = 69/246 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP-K 60
C G W + + G+VTGG + S GC P PPC SE + QP +
Sbjct: 159 CHGGYPIKAWSYFRRHGIVTGGGYQSGEGCAPYRVPPC------FSEEDGNNTCRGQPME 212
Query: 61 CHTRCTNDNYGRG--FFQDKYRFKR-YYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
H RCT YG + D +RF R YY++ A IQ+++M GP+ A+M +Y
Sbjct: 213 KHHRCTRMCYGDQEIDYDDDHRFTRDYYYLT--YASIQKDVMTYGPIEASMEVYD----- 265
Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
D SYKSGVY S +A + VK++GWGEE+G PY
Sbjct: 266 ------------------DFPSYKSGVYEKSENATYLGGHAVKLIGWGEEDGVPY----- 302
Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
W +V+++ E +GDKG KI RG NE +++
Sbjct: 303 -----------------------------WLMVNSWSEMWGDKGLFKIRRGTNECSVDNS 333
Query: 238 VNGALP 243
+ +P
Sbjct: 334 MTAGVP 339
>gi|223646922|gb|ACN10219.1| Cathepsin B precursor [Salmo salar]
gi|223647940|gb|ACN10728.1| Cathepsin B precursor [Salmo salar]
gi|223672785|gb|ACN12574.1| Cathepsin B precursor [Salmo salar]
Length = 330
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 72/242 (29%), Positives = 101/242 (41%), Gaps = 60/242 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S+ W + GLVTGG + S+ GC+P S PPC H + + P C P+C
Sbjct: 148 CNGGYPSAAWDFWTTEGLVTGGLYDSHVGCRPYSIPPCEH-HVNGTRPPCTGEEGDTPQC 206
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C Y G+ QDK+ K Y + E I E++KNGPV +Y D YKSG
Sbjct: 207 SNQCET-GYTPGYKQDKHFGKNSYSLPSEEQQIMAELLKNGPVEGAFTVYEDFLLYKSGV 265
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + VS SA V +K++GWGEE G P
Sbjct: 266 YQH----------------------VSGSA--VGGHAIKVLGWGEEGGTP---------- 291
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW +++ +G+ G KILRG++ IES +
Sbjct: 292 ------------------------YWLAANSWNTDWGENGFFKILRGKDHCGIESEMVAG 327
Query: 242 LP 243
+P
Sbjct: 328 VP 329
>gi|204022100|dbj|BAG71147.1| cathepsin B-N1 [Tuberaphis takenouchii]
Length = 334
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 74/242 (30%), Positives = 101/242 (41%), Gaps = 61/242 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W W K GLVTGG + S GCQP PPC Y + C+ P K
Sbjct: 154 CHGGYPIKAWEWFKKHGLVTGGDYDSGEGCQPYRVPPCPLDEYGNNT--CR--GKPAEKN 209
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
H RCT YG K + ++W D I K D+ +Y
Sbjct: 210 H-RCTRMCYGNQELDFK---EDHHWTRDAYYLTYTTIQK------------DVMAY---- 249
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
GP+ A+ +Y D +YKSGVY + +A + VK++GWGEE G PY
Sbjct: 250 ---GPIEASFDVYDDFPNYKSGVYMKTENASYLGGHAVKLIGWGEEYGVPY--------- 297
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
W +V+++ +Q+GD+G KILRG NE I++ G
Sbjct: 298 -------------------------WLLVNSWNDQWGDQGLFKILRGTNECGIDNSTTGG 332
Query: 242 LP 243
+P
Sbjct: 333 VP 334
>gi|148229459|ref|NP_001079570.1| cathepsin B precursor [Xenopus laevis]
gi|28277314|gb|AAH44689.1| MGC53360 protein [Xenopus laevis]
Length = 333
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 74/244 (30%), Positives = 103/244 (42%), Gaps = 60/244 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W + + GLV+GG + S+ GC+P S PPC H + S P CK PKC
Sbjct: 150 CNGGYPSGAWQFWTETGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPACKGEEGDTPKC 208
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C + Y + DK+ Y V +I EI KNGPV +Y+D
Sbjct: 209 VKQC-EEGYSPAYGTDKHFGTTSYGVPTSEKEIMAEIYKNGPVEGAFLVYADF------- 260
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
P+ YKSGVY E+ +A +KI+GWG ENG P
Sbjct: 261 ----PL------------YKSGVYQHETGEELGGHA-IKILGWGVENGTP---------- 293
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW +++ +GD G KILRG++ IES +
Sbjct: 294 ------------------------YWLCANSWNTDWGDNGFFKILRGKDHCGIESEIVAG 329
Query: 242 LPKD 245
+PK+
Sbjct: 330 VPKN 333
>gi|348534156|ref|XP_003454569.1| PREDICTED: cathepsin B-like [Oreochromis niloticus]
Length = 330
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 73/243 (30%), Positives = 101/243 (41%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S+ W + GLV+GG + S+ GC+P + PC H + S P C P+C
Sbjct: 148 CNGGYPSAAWDFWASEGLVSGGLYESHIGCRPYTIAPCEH-HVNGSRPPCTGEGGDTPEC 206
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C + Y + QDK+ K Y V + IQ EI KNGPV +Y D YK+G
Sbjct: 207 VRQCES-GYTPSYIQDKHYGKTSYSVPSDEQQIQTEIYKNGPVEGAFTVYEDFLLYKTGV 265
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + VS SA V +K++GWGEENG P
Sbjct: 266 YQH----------------------VSGSA--VGGHAIKVLGWGEENGTP---------- 291
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW +++ +GD G KILRG + IES +
Sbjct: 292 ------------------------YWLCANSWNTDWGDNGYFKILRGSDHCGIESEIVAG 327
Query: 242 LPK 244
+PK
Sbjct: 328 IPK 330
>gi|308504233|ref|XP_003114300.1| hypothetical protein CRE_27039 [Caenorhabditis remanei]
gi|308261685|gb|EFP05638.1| hypothetical protein CRE_27039 [Caenorhabditis remanei]
Length = 351
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 73/243 (30%), Positives = 104/243 (42%), Gaps = 59/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G W K+G VTGG++ TGC+P +PPC H T C + P KC
Sbjct: 167 CNGGYPIEAWRHYVKKGYVTGGSYQEKTGCKPYPYPPCEHHVNGTHYKPCPSNMYPTDKC 226
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QD + + Y V+ +V +IQ+EIM +GPV +Y D F + SG
Sbjct: 227 ERSC-QAGYALTYTQDLHFGQSAYAVSKKVTEIQKEIMTHGPVEVAFSVYED-FEHYSG- 283
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
GVY +A A + +A VK++GWG +NG P
Sbjct: 284 ---------------------GVYVHTAGASLGGHA-VKMLGWGVDNGTP---------- 311
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW +++ E +G+ G +I+RG NE IES V G
Sbjct: 312 ------------------------YWLCANSWNEDWGENGYFRIIRGVNECGIESGVVGG 347
Query: 242 LPK 244
+PK
Sbjct: 348 IPK 350
>gi|195478432|ref|XP_002100515.1| GE16138 [Drosophila yakuba]
gi|194188039|gb|EDX01623.1| GE16138 [Drosophila yakuba]
Length = 340
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 72/243 (29%), Positives = 106/243 (43%), Gaps = 59/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++G+V+GG + SN GC+P PC H T P ATP KC
Sbjct: 156 CNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPYEISPCEHHVNGTRPPCAHGGATP--KC 213
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C + +Y + +DK+ + Y V V DIQ+EIM NGPV +Y D
Sbjct: 214 SHVCQS-SYTVDYAKDKHFGSKSYSVRRNVRDIQEEIMTNGPVEGAFTVYED-------- 264
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+ YK GVY E+ +A ++I+GWG
Sbjct: 265 ---------------LILYKDGVYQHEHGKELGGHA-IRILGWGV--------------- 293
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
WG+E PYW I +++ +GD+G +ILRG++ IES ++
Sbjct: 294 ----------------WGDEK-IPYWLIGNSWNTDWGDQGFFRILRGQDHCGIESSISAG 336
Query: 242 LPK 244
LPK
Sbjct: 337 LPK 339
>gi|118358706|ref|XP_001012594.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89294361|gb|EAR92349.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 346
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 73/227 (32%), Positives = 99/227 (43%), Gaps = 60/227 (26%)
Query: 18 GLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRC-TNDNYGRGFFQ 76
GLVTG + +N+ CQ +F PC H + P C T P P C C +N + + +
Sbjct: 177 GLVTGDLYGNNSWCQAYTFAPCAHHVTSDIYPPC-TGELPTPPCINSCDSNSTHTIPYSK 235
Query: 77 DKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSD 136
D +R + Y + + I EI KNGP+ + +Y D
Sbjct: 236 DIHRGSKAYGIAKDEKAIMAEIYKNGPIEVALTVYED----------------------- 272
Query: 137 IFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLI 196
+YK+GVY E+ +A VK+VGWG ENG PYW
Sbjct: 273 FLTYKTGVYQHVTGDELGGHA-VKMVGWGVENGTPYW----------------------- 308
Query: 197 GWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
TIV+++ E +GDKGT KILRG+NE IES ALP
Sbjct: 309 -----------TIVNSWNESWGDKGTFKILRGKNECGIESSCVTALP 344
>gi|312271211|gb|ADQ57303.1| cathepsin B-like cysteine proteinase 1 [Angiostrongylus
cantonensis]
Length = 394
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 68/243 (27%), Positives = 95/243 (39%), Gaps = 58/243 (23%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + G+VTG +N GC+P FPPC H + T C+ P PKC
Sbjct: 190 CEGGDPMFAWQYWVDHGIVTGSNFTANQGCKPYPFPPCEHHSNKTRFDPCRHDLYPTPKC 249
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C + + D++ + Y V ++VA IQ+EI+ +GPV +Y D Y G
Sbjct: 250 SKKCVPSYKEKNYDDDRFYGRTAYGVKNDVAAIQKEILTHGPVEVAFEVYEDFLHYAGGI 309
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y V ++ VK++GWG + G PYW I +
Sbjct: 310 Y------------------------VHTGGKLGGGHAVKLIGWGIDQGTPYWLIANSWNT 345
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
WGEE G +ILRG +E IES V G
Sbjct: 346 D---------------WGEE-------------------GFFRILRGVDECGIESGVVGG 371
Query: 242 LPK 244
+PK
Sbjct: 372 IPK 374
>gi|209863077|ref|NP_001119612.2| cathepsin B-912 precursor [Acyrthosiphon pisum]
Length = 342
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 74/246 (30%), Positives = 105/246 (42%), Gaps = 69/246 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP-K 60
C G W + + G+VTGG + S GC P PPC SE + QP +
Sbjct: 159 CHGGYPIKAWSYFRRHGIVTGGDYQSGEGCAPYRVPPC------FSEEDGNNTCRGQPME 212
Query: 61 CHTRCTNDNYGRG--FFQDKYRFKR-YYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
H RCT YG + D +RF R YY++ A IQ+++M GP+ A+M +Y
Sbjct: 213 KHHRCTRMCYGDQEIDYDDDHRFTRDYYYLT--YASIQKDVMTYGPIEASMEVYD----- 265
Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
D SYKSGVY S +A + VK++GWGEE+G PY
Sbjct: 266 ------------------DFPSYKSGVYEKSENATYLGGHAVKLIGWGEEDGVPY----- 302
Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
W +V+++ E +GDKG KI RG NE +++
Sbjct: 303 -----------------------------WLMVNSWSEMWGDKGLFKIRRGTNECSVDNS 333
Query: 238 VNGALP 243
+ +P
Sbjct: 334 MTAGVP 339
>gi|351695295|gb|EHA98213.1| Cathepsin B [Heterocephalus glaber]
Length = 340
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 70/251 (27%), Positives = 105/251 (41%), Gaps = 61/251 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S+ W + K+GLV+GG + S+ GC+P S PPC H + + P+C PKC
Sbjct: 150 CNGGYPSAAWKYWTKKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGTRPQCTGEGGDTPKC 208
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + +DK+ Y V+ +I EI KNGPV ++SD YK+G
Sbjct: 209 SKTC-EPGYSPSYKEDKHFGYDSYSVSSNEKEIMAEIYKNGPVEGAFTVFSDFLMYKTGV 267
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + E++ ++I+GWG+ENG PYW + + V
Sbjct: 268 YKH------------------------LAGEMLGGHAIRILGWGKENGVPYWLVGNSWNV 303
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+GD G KI+RG + IES +
Sbjct: 304 ----------------------------------DWGDSGFFKIVRGEDHCGIESEIVAG 329
Query: 242 LPK-DNYGVEF 251
+P+ D Y F
Sbjct: 330 IPRTDQYWGRF 340
>gi|74179506|dbj|BAE44111.1| cathepsin B preproprotein [Cyprinus carpio]
Length = 330
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 71/243 (29%), Positives = 97/243 (39%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S+ W + GLVTGG ++S+ GC+P + PC H + S P C P C
Sbjct: 148 CNGGYPSAAWDFWSSDGLVTGGLYNSHIGCRPYTIEPCEH-HVNGSRPPCTGEGGDTPNC 206
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ K Y V DI +E+ KNGPV +Y D SYKSG
Sbjct: 207 DMSC-EPGYSPSYKQDKHFGKTSYSVPSNQKDIMKELYKNGPVEGAFTVYEDFLSYKSGV 265
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + S + +KI+GWGEENG P
Sbjct: 266 YQH------------------------VSGPALGGHAIKILGWGEENGVP---------- 291
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW +++ +GD G KILRG + IES +
Sbjct: 292 ------------------------YWLAANSWNTDWGDNGYFKILRGEDHCGIESEIVAG 327
Query: 242 LPK 244
+P+
Sbjct: 328 IPQ 330
>gi|343197337|pdb|3QSD|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With Ca074 Inhibitor
gi|343197588|pdb|3S3Q|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With K11017 Inhibitor
gi|343197589|pdb|3S3R|A Chain A, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With K11777 Inhibitor
gi|343197590|pdb|3S3R|B Chain B, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With K11777 Inhibitor
gi|343197591|pdb|3S3R|C Chain C, Structure Of Cathepsin B1 From Schistosoma Mansoni In
Complex With K11777 Inhibitor
Length = 254
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 70/237 (29%), Positives = 100/237 (42%), Gaps = 60/237 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C GI W + K G+VTG + ++ GC+P FP C H + P C + P+C
Sbjct: 72 CEGGILGPAWDYWVKEGIVTGSSKENHAGCEPYPFPKCEH-HTKGKYPPCGSKIYKTPRC 130
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+R K Y V ++ IQ+EIMK GPV A +Y D +YKSG
Sbjct: 131 KQTC-QKKYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEAGFTVYEDFLNYKSG- 188
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+Y I + E + ++I+GWG EN PY
Sbjct: 189 -----------IYKHI------------TGETLGGHAIRIIGWGVENKAPY--------- 216
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
W I +++ E +G+ G +I+RGR+E IES V
Sbjct: 217 -------------------------WLIANSWNEDWGENGYFRIVRGRDECSIESEV 248
>gi|116177489|gb|ABJ80691.1| cathepsin B [Hippoglossus hippoglossus]
Length = 330
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 71/243 (29%), Positives = 102/243 (41%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S+ W + K GLV+GG ++S+ GC+P + PPC H + S P C PKC
Sbjct: 148 CNGGYPSAAWDFWTKEGLVSGGLYNSHIGCRPYTIPPCEH-HVNGSRPHCSGEGGDTPKC 206
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + +DK+ K Y V V IQ EI +NGPV +Y D
Sbjct: 207 VHSC-EAGYSPTYTKDKHYGKSSYSVEASVEQIQAEISQNGPVEGAFIVYED-------- 257
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
YKSGVY + + + +A +K++GWGEE+G P
Sbjct: 258 ---------------FVMYKSGVYQHTTGSALGGHA-IKVLGWGEEDGVP---------- 291
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW +++ +G+ G KILRG + IES +
Sbjct: 292 ------------------------YWLCANSWNTDWGENGFFKILRGSDHCGIESEIVAG 327
Query: 242 LPK 244
+PK
Sbjct: 328 IPK 330
>gi|321452279|gb|EFX63703.1| hypothetical protein DAPPUDRAFT_306608 [Daphnia pulex]
Length = 340
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 73/243 (30%), Positives = 101/243 (41%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W K+G+VTGG +S+ GCQP P C H + T P C PKC
Sbjct: 158 CNGGFPGAAWSHWVKKGIVTGGNFNSSQGCQPYIIPACEH-HTTGDRPPCSE-GGGTPKC 215
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C D Y + QD + Y V+ + DIQ EIM NGPV + +Y D +YKSG
Sbjct: 216 LKTC-EDGYTVDYTQDLHYGASSYSVHKRMEDIQLEIMNNGPVEGALTVYEDFPTYKSG- 273
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+Y + G +A ++I+GWG E G PY
Sbjct: 274 -----------VYQHVHGKALGGHA------------IRILGWGVEEGVPY--------- 301
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
W I +++ +GD G IK+LRG++ IES +
Sbjct: 302 -------------------------WLIANSWNTDWGDNGYIKLLRGKDHCGIESQITAG 336
Query: 242 LPK 244
LPK
Sbjct: 337 LPK 339
>gi|154761391|gb|ABS85545.1| cathepsin B preproprotein [Biomphalaria glabrata]
Length = 333
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 70/242 (28%), Positives = 107/242 (44%), Gaps = 62/242 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G ++ W W G+V+GG + +N GC P S P C+H +TT + + P PKC
Sbjct: 154 CNGGYPAAAWEWYVDTGVVSGGQYGTNEGCMPYSLPHCDH--HTTGKYQPCPAVVPTPKC 211
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C Y + + DK R K+ Y V V I QE++ NGPV A +YSD SYK+G
Sbjct: 212 EKKCLT-GYPKSYSNDKTRGKKSYGVRG-VQSIMQELVDNGPVTAAFDVYSDFLSYKTG- 268
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
++ + +G Y + VKI+G+G E+G
Sbjct: 269 ---------------VYRHTTGSYEGGHA--------VKIIGYGTESG------------ 293
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+ YW + +++ E +GDKG KI +G++E IES +
Sbjct: 294 ----------------------QDYWLVANSWNEDWGDKGFFKIAKGKDECGIESSIVAG 331
Query: 242 LP 243
P
Sbjct: 332 DP 333
>gi|1311050|pdb|1CPJ|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B-Inhibitor Complex: Implications For
Structure- Based Inhibitor Design
gi|1311051|pdb|1CPJ|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B-Inhibitor Complex: Implications For
Structure- Based Inhibitor Design
gi|1421561|pdb|1THE|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B- Inhibitor Complex: Implications For
Structure-Based Inhibitor Design
gi|1421562|pdb|1THE|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B- Inhibitor Complex: Implications For
Structure-Based Inhibitor Design
Length = 260
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 68/243 (27%), Positives = 107/243 (44%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W + ++GLV+GG ++S+ GC P + PPC H + + P C T PKC
Sbjct: 77 CNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH-HVNGARPPC-TGEGDTPKC 134
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ C Y + +DK+ Y V+D +I EI KNGPV ++SD +YKSG
Sbjct: 135 NKMC-EAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSDFLTYKSG- 192
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
++ +++G +++ ++I+GWG ENG P
Sbjct: 193 ---------------VYKHEAG--------DVMGGHAIRILGWGIENGVP---------- 219
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW + +++ +GD G KILRG N IES +
Sbjct: 220 ------------------------YWLVANSWNADWGDNGFFKILRGENHCGIESEIVAG 255
Query: 242 LPK 244
+P+
Sbjct: 256 IPR 258
>gi|444525951|gb|ELV14228.1| Cathepsin B [Tupaia chinensis]
Length = 339
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 70/239 (29%), Positives = 104/239 (43%), Gaps = 61/239 (25%)
Query: 8 SSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 67
S+ W + K+GLV+GG + S+ GC+P S PPC H + S P C T PKC C
Sbjct: 156 SAAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKCSKSC-E 212
Query: 68 DNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPV 127
Y + +DK+ Y + + V I++EIM Y NGPV
Sbjct: 213 PGYSSSYKEDKH----YGYSSYSVPGIEKEIMAE-------------------IYKNGPV 249
Query: 128 VANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEI 187
+YSD YKSGVY + E+
Sbjct: 250 EGAFSVYSDFLLYKSGVYQ-----------------------------------HVTGEM 274
Query: 188 VAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDN 246
+ ++++GWG ENG PYW + +++ +GD G KILRG++ IES + +P+ +
Sbjct: 275 MGGHAIRILGWGTENGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEIVAGIPRTD 333
>gi|118153|sp|P25792.1|CYSP_SCHMA RecName: Full=Cathepsin B-like cysteine proteinase; AltName:
Full=Antigen Sm31; Flags: Precursor
gi|160950|gb|AAA29865.1| cathepsin B [Schistosoma mansoni]
Length = 340
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 69/237 (29%), Positives = 100/237 (42%), Gaps = 60/237 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C GI W + K G+VT + ++TGC+P FP C H + P C + P+C
Sbjct: 158 CEGGILGPAWDYWVKEGIVTASSKENHTGCEPYPFPKCEH-HTKGKYPPCGSKIYNTPRC 216
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+R K Y V ++ IQ+EIMK GPV A+ +Y D +YKSG
Sbjct: 217 KQTCQR-KYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEASFTVYEDFLNYKSGI 275
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + E + ++I+GWG EN PY
Sbjct: 276 YKH------------------------ITGEALGGHAIRIIGWGVENKTPY--------- 302
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
W I +++ E +G+ G +I+RGR+E IES V
Sbjct: 303 -------------------------WLIANSWNEDWGENGYFRIVRGRDECSIESEV 334
>gi|1127275|pdb|1CTE|A Chain A, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B-Inhibitor Complex: Implications For
Structure- Based Inhibitor Design
gi|1127276|pdb|1CTE|B Chain B, Crystal Structures Of Recombinant Rat Cathepsin B And A
Cathepsin B-Inhibitor Complex: Implications For
Structure- Based Inhibitor Design
Length = 254
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 68/243 (27%), Positives = 107/243 (44%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W + ++GLV+GG ++S+ GC P + PPC H + + P C T PKC
Sbjct: 71 CNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH-HVNGARPPC-TGEGDTPKC 128
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ C Y + +DK+ Y V+D +I EI KNGPV ++SD +YKSG
Sbjct: 129 NKMC-EAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSDFLTYKSG- 186
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
++ +++G +++ ++I+GWG ENG P
Sbjct: 187 ---------------VYKHEAG--------DVMGGHAIRILGWGIENGVP---------- 213
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW + +++ +GD G KILRG N IES +
Sbjct: 214 ------------------------YWLVANSWNADWGDNGFFKILRGENHCGIESEIVAG 249
Query: 242 LPK 244
+P+
Sbjct: 250 IPR 252
>gi|325302582|dbj|BAJ83491.1| cathepsin B-like peptidase [Echinococcus multilocularis]
Length = 338
Score = 100 bits (250), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 72/243 (29%), Positives = 104/243 (42%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G+ + W + G+V+GG + S+ GC+P PPC H + + + P+CK + PKC
Sbjct: 154 CNGGLPENAWRYWAIDGIVSGGLYGSHVGCRPYEIPPCEH-HTSGNRPDCKG-NSKTPKC 211
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C G+ + DK+ Y V DI EI+ GPV A+ +Y+D +YKSG
Sbjct: 212 QRQCVESFDGK-YQADKHFASNVYNVRASEEDIMNEILVYGPVEADFIVYADFLTYKSGV 270
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + K G A VKI+GWGEENG P
Sbjct: 271 YQH---------------VKGGFLGGHA---------VKILGWGEENGVP---------- 296
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW +++ +GD G KILRG N IE+ +N
Sbjct: 297 ------------------------YWLCANSWNTDWGDGGFFKILRGYNHCKIEADINAG 332
Query: 242 LPK 244
+PK
Sbjct: 333 IPK 335
>gi|6681079|ref|NP_031824.1| cathepsin B preproprotein [Mus musculus]
gi|115712|sp|P10605.2|CATB_MOUSE RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Contains:
RecName: Full=Cathepsin B light chain; Contains:
RecName: Full=Cathepsin B heavy chain; Flags: Precursor
gi|239907|gb|AAB20536.1| preprocathepsin B [Mus sp.]
gi|309152|gb|AAA37375.1| cathepsin B [Mus musculus]
gi|13879360|gb|AAH06656.1| Cathepsin B [Mus musculus]
gi|26350521|dbj|BAC38900.1| unnamed protein product [Mus musculus]
gi|74180941|dbj|BAE27751.1| unnamed protein product [Mus musculus]
gi|74191261|dbj|BAE39458.1| unnamed protein product [Mus musculus]
gi|74198944|dbj|BAE30691.1| unnamed protein product [Mus musculus]
gi|74208073|dbj|BAE29144.1| unnamed protein product [Mus musculus]
gi|148704123|gb|EDL36070.1| cathepsin B, isoform CRA_a [Mus musculus]
Length = 339
Score = 100 bits (250), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 72/251 (28%), Positives = 109/251 (43%), Gaps = 62/251 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W + K+GLV+GG ++S+ GC P + PPC H + S P C T P+C
Sbjct: 150 CNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEH-HVNGSRPPC-TGEGDTPRC 207
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ C Y + +DK+ Y V++ V +I EI KNGPV ++SD +YKSG
Sbjct: 208 NKSC-EAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNGPVEGAFTVFSDFLTYKSG- 265
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
++ +++G +++ ++I+GWG ENG PYW + +
Sbjct: 266 ---------------VYKHEAG--------DMMGGHAIRILGWGVENGVPYWLAANSWNL 302
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+GD G KILRG N IES +
Sbjct: 303 ----------------------------------DWGDNGFFKILRGENHCGIESEIVAG 328
Query: 242 LPK-DNYGVEF 251
+P+ D Y F
Sbjct: 329 IPRTDQYWGRF 339
>gi|148222779|ref|NP_001080410.1| uncharacterized protein LOC380102 precursor [Xenopus laevis]
gi|28302291|gb|AAH46667.1| Cg10992 protein [Xenopus laevis]
Length = 333
Score = 100 bits (250), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 74/244 (30%), Positives = 102/244 (41%), Gaps = 60/244 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W + + GLV+GG + S+ GC+P S PPC H + S P CK PKC
Sbjct: 150 CNGGYPSGAWRFWTETGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPSCKGEEGDTPKC 208
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C + Y + DK+ Y V +I +I KNGPV +Y+D
Sbjct: 209 MKTC-EEGYTPAYGSDKHFGATSYGVPSSEKEIMADIYKNGPVEGAFVVYADF------- 260
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
P+ YKSGVY E+ +A +KI+GWG ENG P
Sbjct: 261 ----PL------------YKSGVYQHETGEELGGHA-IKILGWGVENGTP---------- 293
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW +++ +GD G KILRG++ IES V
Sbjct: 294 ------------------------YWLCANSWNTDWGDNGFFKILRGKDHCGIESEVVAG 329
Query: 242 LPKD 245
+PK+
Sbjct: 330 IPKN 333
>gi|1942645|pdb|1MIR|A Chain A, Rat Procathepsin B
gi|1942646|pdb|1MIR|B Chain B, Rat Procathepsin B
Length = 322
Score = 100 bits (250), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 68/243 (27%), Positives = 107/243 (44%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W + ++GLV+GG ++S+ GC P + PPC H + + P C T PKC
Sbjct: 133 CNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEH-HVNGARPPC-TGEGDTPKC 190
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ C Y + +DK+ Y V+D +I EI KNGPV ++SD +YKSG
Sbjct: 191 NKMC-EAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSDFLTYKSG- 248
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
++ +++G +++ ++I+GWG ENG P
Sbjct: 249 ---------------VYKHEAG--------DVMGGHAIRILGWGIENGVP---------- 275
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW + +++ +GD G KILRG N IES +
Sbjct: 276 ------------------------YWLVANSWNADWGDNGFFKILRGENHCGIESEIVAG 311
Query: 242 LPK 244
+P+
Sbjct: 312 IPR 314
>gi|56759504|gb|AAW27892.1| unknown [Schistosoma japonicum]
Length = 279
Score = 100 bits (250), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 72/243 (29%), Positives = 100/243 (41%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + KRG+VTGG+ ++TGCQP FP C H + P C T P+C
Sbjct: 96 CQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEH-HTKGKYPACGTKIYKTPQC 154
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ + Y V + IQ++IM GPV A +Y D +YKSG
Sbjct: 155 KQTCQK-GYKTPYEQDKHYGEESYNVQNNEKVIQRDIMMYGPVEAAFDVYEDFLNYKSGI 213
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + IV ++I+GWG E PY
Sbjct: 214 YRH------------------------VTGSIVGGHAIRIIGWGVEKRTPY--------- 240
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
W I +++ E +G+KG +I+RGR+E IES V
Sbjct: 241 -------------------------WLIANSWNEDWGEKGLFRIVRGRDECSIESNVVAG 275
Query: 242 LPK 244
L K
Sbjct: 276 LIK 278
>gi|56758040|gb|AAW27160.1| unknown [Schistosoma japonicum]
Length = 216
Score = 100 bits (250), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 72/243 (29%), Positives = 98/243 (40%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + +G+VTGG+ ++TGCQP FP C H + P C T P+C
Sbjct: 33 CQGGFPGQAWDYWVTQGIVTGGSKENHTGCQPYPFPKCEH-HTKGKYPACGTKIYKTPQC 91
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ Y V IQ+EIM NGPV A +Y D +YKSG
Sbjct: 92 KQTC-QKGYKTPYEQDKHYGDESYNVISNEKAIQKEIMMNGPVEAAFDVYEDFLNYKSGI 150
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + IV ++I+GWG E P
Sbjct: 151 YRH------------------------VTGSIVGGHAIRIIGWGVEKRTP---------- 176
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW I +++ E +G+KG +I+RGR+E IES V
Sbjct: 177 ------------------------YWLIANSWNEDWGEKGLFRIVRGRDECSIESHVVAG 212
Query: 242 LPK 244
L K
Sbjct: 213 LIK 215
>gi|327281751|ref|XP_003225610.1| PREDICTED: cathepsin B-like [Anolis carolinensis]
Length = 330
Score = 100 bits (249), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 71/243 (29%), Positives = 98/243 (40%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W + ++GLV+GG + S+ GC+P S PPC H T P C P+C
Sbjct: 140 CNGGYPSGAWKYWTEKGLVSGGLYDSHVGCRPYSIPPCEHHTNGT-RPPCSGEGGETPEC 198
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C D Y + QDK+ Y + +I EI KNGPV +YSD YKSG
Sbjct: 199 VKKC-EDGYTPAYKQDKHYGVTSYGIPRSEKEIMAEIYKNGPVEGAFVVYSDFLMYKSGV 257
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + S E V ++I+GWG +NG PYW +
Sbjct: 258 YQH------------------------VSGEEVGGHAIRILGWGVDNGTPYWLAANSWNT 293
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
WGE+ G +ILRG++ IES +
Sbjct: 294 D---------------WGED-------------------GFFRILRGQDHCGIESEIVAG 319
Query: 242 LPK 244
+PK
Sbjct: 320 IPK 322
>gi|226472810|emb|CAX71091.1| cathepsin B [Schistosoma japonicum]
Length = 348
Score = 100 bits (249), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 69/243 (28%), Positives = 101/243 (41%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W++ +G+VTG +++ GCQP FPPC H + P C P C
Sbjct: 164 CNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPYEFPPCEH-HTLGPLPVCDG-DVETPPC 221
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + DK+ K Y V I +E+M++GPV + +Y+D +YKSG
Sbjct: 222 KRTC-QAGYNVSYENDKWYGKVVYRVKSNQEAIMKELMQHGPVEVDFEVYADFPNYKSGV 280
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + S ++ V+++GWGEEN P
Sbjct: 281 YQH------------------------VSGALLGGHAVRLLGWGEENNVP---------- 306
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW I +++ +GD G KI+RG+NE IES VN
Sbjct: 307 ------------------------YWLIANSWNTDWGDNGYFKIIRGKNECGIESDVNAG 342
Query: 242 LPK 244
+PK
Sbjct: 343 IPK 345
>gi|27526823|emb|CAD32937.1| pro-cathepsin B2 [Fasciola hepatica]
Length = 337
Score = 100 bits (249), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 73/243 (30%), Positives = 98/243 (40%), Gaps = 59/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W + + G+VTGG + TGC P FP C H + C P P C
Sbjct: 145 CQGGSPPAAWDYWWRNGIVTGGTLENPTGCLPYPFPQCRHPGSRSQLNPCPRYTYPTPSC 204
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ C Y + + +DK K Y V+ I +EIMKNGPV A +Y+D YKSG
Sbjct: 205 YPYC-QAGYDKTYEKDKVYGKTSYNVDRHEYTIMEEIMKNGPVEAGFIVYTDFAVYKSG- 262
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ + SG YA ++I+GWG ENG YW + V
Sbjct: 263 ---------------IYHHVSGRYA--------GKHAIRIIGWGVENGVKYWLTANSWNV 299
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
GWGE G +ILRG +E IES+V
Sbjct: 300 ---------------GWGE-------------------NGYFRILRGTDECRIESIVVAG 325
Query: 242 LPK 244
+P+
Sbjct: 326 MPR 328
>gi|226471002|emb|CAX70582.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 100 bits (249), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 71/230 (30%), Positives = 96/230 (41%), Gaps = 61/230 (26%)
Query: 13 WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGR 72
WV KRG+VTGG+ ++TGCQP FP C H P C T P+C C Y
Sbjct: 171 WV-KRGIVTGGSEENHTGCQPYPFPKCEHLT-KGKYPACGTKIYKTPQCKQTCQK-GYKT 227
Query: 73 GFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMY 132
+ QDK+ Y V IQ+EIM GPV A +Y D +YKSG Y +
Sbjct: 228 PYKQDKHYGDESYNVISNEKAIQKEIMMYGPVEAAFDVYEDFLNYKSGIYRH-------- 279
Query: 133 LYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYAT 192
+ IV ++I+GWG E G+PY
Sbjct: 280 ----------------VTGSIVGGHAIRIIGWGVEKGKPY-------------------- 303
Query: 193 VKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 242
W I +++ E +G+KG +++RGR+E IES V L
Sbjct: 304 --------------WLIANSWNEDWGEKGLFRMVRGRDECSIESHVVAGL 339
>gi|226472800|emb|CAX71086.1| cathepsin B [Schistosoma japonicum]
gi|226472804|emb|CAX71088.1| cathepsin B [Schistosoma japonicum]
Length = 348
Score = 100 bits (249), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 69/243 (28%), Positives = 101/243 (41%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W++ +G+VTG +++ GCQP FPPC H + P C P C
Sbjct: 164 CNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPYEFPPCEH-HTLGPLPVCDG-DVETPPC 221
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + DK+ K Y V I +E+M++GPV + +Y+D +YKSG
Sbjct: 222 KRTC-QAGYNVSYENDKWYGKVVYRVKSNQEAIMKELMQHGPVEVDFEVYADFPNYKSGV 280
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + S ++ V+++GWGEEN P
Sbjct: 281 YQH------------------------VSGALLGGHAVRLLGWGEENNVP---------- 306
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW I +++ +GD G KI+RG+NE IES VN
Sbjct: 307 ------------------------YWLIANSWNTDWGDNGYFKIIRGKNECGIESDVNAG 342
Query: 242 LPK 244
+PK
Sbjct: 343 IPK 345
>gi|30995341|gb|AAO59414.2| cathepsin B endopeptidase [Schistosoma japonicum]
gi|226472794|emb|CAX71083.1| cathepsin B [Schistosoma japonicum]
gi|226472796|emb|CAX71084.1| cathepsin B [Schistosoma japonicum]
gi|226472798|emb|CAX71085.1| cathepsin B [Schistosoma japonicum]
gi|226472802|emb|CAX71087.1| cathepsin B [Schistosoma japonicum]
gi|226472806|emb|CAX71089.1| cathepsin B [Schistosoma japonicum]
Length = 348
Score = 100 bits (249), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 69/243 (28%), Positives = 101/243 (41%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W++ +G+VTG +++ GCQP FPPC H + P C P C
Sbjct: 164 CNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPYEFPPCEH-HTLGPLPVCDG-DVETPPC 221
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + DK+ K Y V I +E+M++GPV + +Y+D +YKSG
Sbjct: 222 KRTC-QAGYNVSYENDKWYGKVVYRVKSNQEAIMKELMQHGPVEVDFEVYADFPNYKSGV 280
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + S ++ V+++GWGEEN P
Sbjct: 281 YQH------------------------VSGALLGGHAVRLLGWGEENNVP---------- 306
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW I +++ +GD G KI+RG+NE IES VN
Sbjct: 307 ------------------------YWLIANSWNTDWGDNGYFKIIRGKNECGIESDVNAG 342
Query: 242 LPK 244
+PK
Sbjct: 343 IPK 345
>gi|45822203|emb|CAE47498.1| cathepsin B-like proteinase [Diabrotica virgifera virgifera]
Length = 328
Score = 100 bits (249), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 66/243 (27%), Positives = 100/243 (41%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++GLV+GG + + GC+P PPC H + S P C PKC
Sbjct: 146 CNGGYPGAAWHYWVRKGLVSGGQYGTKQGCRPYEIPPCEH-HTNGSRPACDASEGNTPKC 204
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C + NY + D + + Y ++ +V IQ EI++NGPV +Y+D +YK+G
Sbjct: 205 AKSCES-NYKINYSNDLHFGSKAYSISSDVKQIQAEILQNGPVEGAFSVYADFVNYKTGV 263
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + + ++I GWG EN PY
Sbjct: 264 YQH------------------------IKGQFLGGHAIRIFGWGVENNTPY--------- 290
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
W I +++ +GD GT KILRG + IES +
Sbjct: 291 -------------------------WLIANSWNTDWGDSGTFKILRGSDHCGIESGIVAG 325
Query: 242 LPK 244
LPK
Sbjct: 326 LPK 328
>gi|195393194|ref|XP_002055239.1| GJ19262 [Drosophila virilis]
gi|194149749|gb|EDW65440.1| GJ19262 [Drosophila virilis]
Length = 338
Score = 100 bits (248), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 71/244 (29%), Positives = 109/244 (44%), Gaps = 62/244 (25%)
Query: 2 CSSGISSSTWV-WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 60
C+ G + W W HK G+V+GG++ S GC+P PC H + + P C + +TP +
Sbjct: 155 CNGGFPGAAWSYWTHK-GIVSGGSYGSKEGCRPYEVEPCEH-HVNGTRPPCHSGSTP--R 210
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C +C + Y + +DK+ + Y VN DIQ+EIM NGPV +Y D+ YK+G
Sbjct: 211 CMHKCES-GYSVDYAKDKHFGAKAYSVNRNPLDIQREIMTNGPVEGAFTVYEDLILYKTG 269
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
+Y + + G +A ++I+GWG
Sbjct: 270 ------------VYQHVHGRQLGGHA------------IRILGWGV-------------- 291
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
WG +N PYW I +++ +GD G +ILRG + IES ++
Sbjct: 292 -----------------WG-DNKVPYWLIGNSWNTDWGDNGFFRILRGEDHCGIESAISA 333
Query: 241 ALPK 244
LPK
Sbjct: 334 GLPK 337
>gi|426220597|ref|XP_004004501.1| PREDICTED: cathepsin B [Ovis aries]
Length = 335
Score = 100 bits (248), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 74/242 (30%), Positives = 102/242 (42%), Gaps = 61/242 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W + K+GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + DK+ Y V+ +I EI KNGPV +YSD YKSG
Sbjct: 208 SKIC-EPGYSPSYKDDKHFGCSSYSVSSNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGV 266
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + S E++ ++I+GWG EN PYW
Sbjct: 267 YQH------------------------VSGEMMGGHAIRILGWGVENDTPYW-------- 294
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
L+G +++ +GDKG KILRG++ IES +
Sbjct: 295 -------------LVG-------------NSWNTDWGDKGFFKILRGQDHCGIESEIVAG 328
Query: 242 LP 243
+P
Sbjct: 329 MP 330
>gi|379067374|gb|AFC90100.1| cathepsin B [Capra hircus]
Length = 335
Score = 100 bits (248), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 74/242 (30%), Positives = 102/242 (42%), Gaps = 61/242 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W + K+GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + DK+ Y V+ +I EI KNGPV +YSD YKSG
Sbjct: 208 SKIC-EPGYSPSYKDDKHFGCSSYSVSSNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGV 266
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + S E++ ++I+GWG EN PYW
Sbjct: 267 YQH------------------------VSGEMMGGHAIRILGWGVENDTPYW-------- 294
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
L+G +++ +GDKG KILRG++ IES +
Sbjct: 295 -------------LVG-------------NSWNTDWGDKGFFKILRGQDHCGIESEIVAG 328
Query: 242 LP 243
+P
Sbjct: 329 MP 330
>gi|241998314|ref|XP_002433800.1| longipain, putative [Ixodes scapularis]
gi|215495559|gb|EEC05200.1| longipain, putative [Ixodes scapularis]
Length = 339
Score = 100 bits (248), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 71/243 (29%), Positives = 100/243 (41%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++G+VTGG + ++ GC P P C+H T P + P PKC
Sbjct: 157 CNGGFPGAAWSYWVEKGIVTGGNYDTDEGCMPYPVPSCDHHVNGTLGPCGQD--PPTPKC 214
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
R Y F DK+ K Y V+ IQ EIMKNGPV +Y+D YKSG
Sbjct: 215 -VRLCRKGYNIDFKDDKHYGKSSYSVSSNETQIQMEIMKNGPVEGAFTVYADFPLYKSGV 273
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y S S + + ++I+GWG ENG P
Sbjct: 274 YK------------------------SHSTDALGGHAIRILGWGVENGVP---------- 299
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+W + +++ ++GDKG KILRG NE IE +
Sbjct: 300 ------------------------FWLVANSWNTEWGDKGYFKILRGSNECGIEEDIVAG 335
Query: 242 LPK 244
+PK
Sbjct: 336 IPK 338
>gi|145498570|ref|XP_001435272.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124402403|emb|CAK67875.1| unnamed protein product [Paramecium tetraurelia]
Length = 325
Score = 100 bits (248), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 73/234 (31%), Positives = 97/234 (41%), Gaps = 60/234 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W + +GLVTG N+ C+P +FPPC+H C + P P C
Sbjct: 143 CNGGFPSGAWNYFKNKGLVTGDLFGDNSWCRPYTFPPCDHHVDDGKYGPCGD-SQPTPAC 201
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
CT + GR + DK R Y V+ +V IQ EIM GPV A+ +Y D +YKSG
Sbjct: 202 VKSCTAQS-GRNYDSDKIRSIDSYSVSSKVEQIQNEIMTFGPVEASFTVYEDFLTYKSGV 260
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y N A A + +A VKI+GWG E PY
Sbjct: 261 YQN-----------------------VAGANLGGHA-VKIIGWGVEKNVPY--------- 287
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
W +V+++ E +G+ G KILRG N IE
Sbjct: 288 -------------------------WLVVNSWNEGWGENGLFKILRGSNHVGIE 316
>gi|125981197|ref|XP_001354605.1| GA10694 [Drosophila pseudoobscura pseudoobscura]
gi|54642915|gb|EAL31659.1| GA10694 [Drosophila pseudoobscura pseudoobscura]
Length = 338
Score = 100 bits (248), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 71/243 (29%), Positives = 105/243 (43%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++G+V+GG + S GC+P PC H + + P C +TP C
Sbjct: 155 CNGGFPGAAWSYWTRKGIVSGGPYGSTQGCRPYEIAPCEH-HVNGTRPPCSHGSTPS--C 211
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C +Y + +DK + Y V VA+IQQEIM NGPV +Y D
Sbjct: 212 QHKC-QASYSVEYAKDKNFGSKSYSVRRNVAEIQQEIMTNGPVEGAFTVYED-------- 262
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+ YKSGVY E+ +A ++I+GWG
Sbjct: 263 ---------------LILYKSGVYQHEHGKELGGHA-IRILGWGV--------------- 291
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
WGE PYW I +++ +GD G +ILRG++ IES ++
Sbjct: 292 ----------------WGESK-VPYWLIGNSWNTDWGDNGFFRILRGQDHCGIESSISAG 334
Query: 242 LPK 244
LPK
Sbjct: 335 LPK 337
>gi|226471008|emb|CAX70585.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 100 bits (248), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 73/231 (31%), Positives = 101/231 (43%), Gaps = 63/231 (27%)
Query: 13 WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGR 72
WV KRG+VTGG+ ++TGCQP FP C H P C T P+C C Y
Sbjct: 171 WV-KRGIVTGGSEENHTGCQPYPFPKCEHLT-KGKYPACGTKIYKTPQCKQTCQK-GYKT 227
Query: 73 GFFQDK-YRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANM 131
+ QDK Y +RY +++E A IQ+EIM GPV A +Y D +YKSG Y +
Sbjct: 228 PYEQDKHYGDQRYNVISNEKA-IQREIMMYGPVEAAFDVYEDFLNYKSGIYRH------- 279
Query: 132 YLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYA 191
+ IV ++I+GWG E G+PY
Sbjct: 280 -----------------VAGSIVGGHAIRIIGWGVEKGKPY------------------- 303
Query: 192 TVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 242
W I +++ E +G+ G +++RGR+E IES V L
Sbjct: 304 ---------------WLIANSWNEDWGENGLFRMVRGRDECSIESHVVAGL 339
>gi|56758658|gb|AAW27469.1| unknown [Schistosoma japonicum]
Length = 181
Score = 100 bits (248), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 71/232 (30%), Positives = 97/232 (41%), Gaps = 60/232 (25%)
Query: 13 WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGR 72
++ KRG+VTGG+ ++TGCQP FP C H P C T P+C +C Y
Sbjct: 9 YLVKRGIVTGGSKENHTGCQPYPFPKCEHLT-KGKYPACGTKIYKTPQCKQKC-QKGYKT 66
Query: 73 GFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMY 132
+ QDK + Y V IQ+EIM NGPV A +Y D +YKSG Y +
Sbjct: 67 PYEQDKNYGDQRYNVISNAKAIQKEIMMNGPVEAAFDVYEDFLNYKSGIYRH-------- 118
Query: 133 LYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYAT 192
+ IV ++I+GWG E PY
Sbjct: 119 ----------------VTGSIVGGHAIRIIGWGVEKRTPY-------------------- 142
Query: 193 VKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
W I +++ E +G+KG +I+RGR+E IES V L K
Sbjct: 143 --------------WLIANSWNEDWGEKGLFRIVRGRDECSIESNVVAGLIK 180
>gi|226471006|emb|CAX70584.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 100 bits (248), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 73/231 (31%), Positives = 101/231 (43%), Gaps = 63/231 (27%)
Query: 13 WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGR 72
WV KRG+VTGG+ ++TGCQP FP C H P C T P+C C Y
Sbjct: 171 WV-KRGIVTGGSEENHTGCQPYPFPKCEHLT-KGKYPACGTKIYKTPQCKQTCQK-GYKT 227
Query: 73 GFFQDK-YRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANM 131
+ QDK Y +RY +++E A IQ+EIM GPV A +Y D +YKSG Y +
Sbjct: 228 PYEQDKHYGDQRYNVISNEKA-IQREIMMYGPVEAAFDVYEDFLNYKSGIYRH------- 279
Query: 132 YLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYA 191
+ IV ++I+GWG E G+PY
Sbjct: 280 -----------------VAGSIVGGHAIRIIGWGVEKGKPY------------------- 303
Query: 192 TVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 242
W I +++ E +G+ G +++RGR+E IES V L
Sbjct: 304 ---------------WLIANSWNEDWGENGLFRMVRGRDECSIESHVVAGL 339
>gi|194895314|ref|XP_001978227.1| GG19486 [Drosophila erecta]
gi|190649876|gb|EDV47154.1| GG19486 [Drosophila erecta]
Length = 340
Score = 100 bits (248), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 71/243 (29%), Positives = 104/243 (42%), Gaps = 59/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++G+V+GG + SN GC+P PC H + + P C PKC
Sbjct: 156 CNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPYEIAPCEH-HVNGTRPPCGH-GGGTPKC 213
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C + Y + +DK+ + Y V V DIQ+EIM NGPV +Y
Sbjct: 214 SHVCES-GYTVDYAKDKHFGSKSYSVKRNVRDIQEEIMTNGPVEGAFTVY---------- 262
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
D+ YK GVY E+ +A ++I+GWG
Sbjct: 263 -------------EDLILYKDGVYQHQHGKELGGHA-IRILGWGV--------------- 293
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
WGEE PYW I +++ +GD G +ILRG++ IES ++
Sbjct: 294 ----------------WGEEK-IPYWLIGNSWNTDWGDNGFFRILRGQDHCGIESSISAG 336
Query: 242 LPK 244
LPK
Sbjct: 337 LPK 339
>gi|31872149|gb|AAP59456.1| cathepsin B precursor [Araneus ventricosus]
Length = 334
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 73/243 (30%), Positives = 103/243 (42%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W + +GLVTGG ++S+ GCQP + C H P + TPQ C
Sbjct: 152 CNGGFPGSAWEYWVDKGLVTGGLYNSHVGCQPYTIASCEHHTKGKLPPCGDIVDTPQ--C 209
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + DKY K+ Y ++++ I+ EI NGPV A +Y+
Sbjct: 210 VHMC-EKGYNVSYRADKYFGKKSYSIDEQEDQIKTEISTNGPVEAAFTVYA--------- 259
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
D +YKSGVY E+ +A V+I+GWG E+G P
Sbjct: 260 --------------DFVTYKSGVYRHVTGEEMGGHA-VRILGWGTESGTP---------- 294
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW + +++ +GDKG KILRG +E IES +
Sbjct: 295 ------------------------YWLVANSWNTDWGDKGYFKILRGSDECGIESSIVAG 330
Query: 242 LPK 244
LPK
Sbjct: 331 LPK 333
>gi|195729971|gb|ACG50796.1| cathepsin B1 [Trichobilharzia szidati]
Length = 342
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 70/244 (28%), Positives = 102/244 (41%), Gaps = 60/244 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W + + G+VTG + ++TGCQP FP C H + P C PKC
Sbjct: 159 CQGGFPGAAWDYWVEEGIVTGSSKENHTGCQPYPFPKCEH-HTKGKYPACGEKIYKTPKC 217
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C Y + +DKY K Y V + I++EIM +GPV A +YSD +YKSG
Sbjct: 218 QQKC-QKGYKTPYKKDKYYGKLSYNVLSKEDAIKKEIMMHGPVEAAFTVYSDFLNYKSG- 275
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ + G ++ V+I+GWG E PY
Sbjct: 276 ---------------IYKHMKGT--------VIGGHAVRIIGWGVEKKTPY--------- 303
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
W I +++ E +G+KG +ILRG++ IES V
Sbjct: 304 -------------------------WLIANSWNEDWGEKGYFRILRGKDVCGIESAVTAG 338
Query: 242 LPKD 245
LP +
Sbjct: 339 LPHN 342
>gi|410956528|ref|XP_003984894.1| PREDICTED: cathepsin B [Felis catus]
Length = 339
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 74/242 (30%), Positives = 104/242 (42%), Gaps = 61/242 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + K+GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGFPAEAWNFWTKQGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + +DK+ Y V++ +I EI KNGPV A ++SD YKSG
Sbjct: 208 SKIC-EPGYTPSYKEDKHYGCNSYSVSNSEKEIMAEIYKNGPVEAAFSVFSDFLQYKSGV 266
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + E++ V+I+GWG EN PYW
Sbjct: 267 YQH------------------------VTGEMMGGHAVRILGWGVENDTPYW-------- 294
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
L+G +++ +GD G KILRGR+ IES V
Sbjct: 295 -------------LVG-------------NSWNTDWGDHGFFKILRGRDHCGIESEVVAG 328
Query: 242 LP 243
+P
Sbjct: 329 IP 330
>gi|185135431|ref|NP_001117776.1| procathepsin B precursor [Oncorhynchus mykiss]
gi|14582897|gb|AAK69705.1|AF358667_1 procathepsin B [Oncorhynchus mykiss]
Length = 330
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 70/243 (28%), Positives = 103/243 (42%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G S+ W + + GLVTGG + SN GC+P S PC H + + P C T PKC
Sbjct: 148 CMGGFPSAAWDYWAESGLVTGGLYGSNIGCRPYSIAPCEH-HVNGTRPPC-TGEGDTPKC 205
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ C N Y + +DK K+ Y V + I E+ KNGPV A +Y D YK+G
Sbjct: 206 VSEC-NAGYTPSYKKDKRFGKQTYSVPPKEQQIMTELYKNGPVEAAFSVYEDFLLYKTGV 264
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + +++ +KI+GWG+EN P
Sbjct: 265 YQH------------------------VTGQMLGGHAIKILGWGKENNTP---------- 290
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW + +++ +GD G KILRG++E IES +
Sbjct: 291 ------------------------YWLVANSWNTDWGDNGFFKILRGKDECGIESEIVAG 326
Query: 242 LPK 244
+P+
Sbjct: 327 IPR 329
>gi|1169189|sp|P43157.1|CYSP_SCHJA RecName: Full=Cathepsin B-like cysteine proteinase; AltName:
Full=Antigen Sj31; Flags: Precursor
gi|11167|emb|CAA50305.1| cathepsin B [Schistosoma japonicum]
Length = 342
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 71/243 (29%), Positives = 99/243 (40%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + KRG+VTGG+ ++TGCQP FP C H + P C T P+C
Sbjct: 159 CQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEH-HTKGKYPACGTKIYKTPQC 217
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ Y V + IQ++IM GPV A +Y D +YKSG
Sbjct: 218 KQTCQK-GYKTPYEQDKHYGDESYNVQNNEKVIQRDIMMYGPVEAAFDVYEDFLNYKSGI 276
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + IV ++I+GWG E PY
Sbjct: 277 YRH------------------------VTGSIVGGHAIRIIGWGVEKRTPY--------- 303
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
W I +++ E +G+KG +++RGR+E IES V
Sbjct: 304 -------------------------WLIANSWNEDWGEKGLFRMVRGRDECSIESDVVAG 338
Query: 242 LPK 244
L K
Sbjct: 339 LIK 341
>gi|308504721|ref|XP_003114544.1| hypothetical protein CRE_27547 [Caenorhabditis remanei]
gi|308261929|gb|EFP05882.1| hypothetical protein CRE_27547 [Caenorhabditis remanei]
Length = 358
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 74/234 (31%), Positives = 97/234 (41%), Gaps = 64/234 (27%)
Query: 13 WVHKRGLVTGGAHHSNTGCQPVSFPPCN--HANYTTSEPECKTLATPQPKCHTRCT-NDN 69
W GL TGG + GC+P S PC+ + N TTS P C TP C CT N
Sbjct: 180 WWQTHGLCTGGNYEDQFGCKPYSIYPCDKKYPNGTTSVP-CPGYHTP--TCEEHCTSNIT 236
Query: 70 YGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVA 129
+ + QDK+ K +Y V ++ DIQ EIM NGPV+A+ +Y D + YKSG Y
Sbjct: 237 WPIAYKQDKHFGKAHYNVGKKMTDIQTEIMTNGPVIASFVIYDDFWDYKSGIY------- 289
Query: 130 NMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVA 189
V + + KI+GWG ++G PYW V
Sbjct: 290 -----------------VHTAGDQEGGMDTKIIGWGVDSGVPYWLCVH------------ 320
Query: 190 YATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
+G FG+ G ++ LRG NE IE V ALP
Sbjct: 321 ----------------------QWGTDFGENGFVRFLRGVNEVNIEHQVLAALP 352
>gi|340380665|ref|XP_003388842.1| PREDICTED: cathepsin B-like [Amphimedon queenslandica]
Length = 333
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 76/246 (30%), Positives = 103/246 (41%), Gaps = 67/246 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNH---ANYTTSEPECKTLATPQ 58
C+ G W + K GLVTGG + S+ GCQP P CNH Y E KT
Sbjct: 149 CAGGFLQQAWEYWVKDGLVTGGQYGSDEGCQPYLIPKCNHHEPGPYENCTGEGKT----- 203
Query: 59 PKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
P+C C + Y + D + ++ Y V+ EV IQ EIM NGPV +YSD +YK
Sbjct: 204 PQCERTCRS-GYTTSYEADLHYGEKAYAVHREVEAIQTEIMTNGPVEGAFTVYSDFPTYK 262
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
SG +Y + + G +A ++I+GWG ENG PYW I
Sbjct: 263 SG------------VYQHVVGHALGGHA------------IRILGWGTENGVPYWLIANS 298
Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
+ P W GDKG K++RG+++ IES +
Sbjct: 299 W------------------------NPSW----------GDKGYFKMIRGKDDCGIESNI 324
Query: 239 NGALPK 244
PK
Sbjct: 325 VAGTPK 330
>gi|195058549|ref|XP_001995463.1| GH17748 [Drosophila grimshawi]
gi|193896249|gb|EDV95115.1| GH17748 [Drosophila grimshawi]
Length = 340
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 68/243 (27%), Positives = 106/243 (43%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + RG+V+GG+++S GC+P PC H + P C + +TP C
Sbjct: 157 CNGGFPGAAWSYWTTRGIVSGGSYNSTEGCRPYEVEPCEH-HVDGPRPPCHSGSTPH--C 213
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C NY + +DK+ Y +N +IQ+EIM NGPV +Y D+ YK+G
Sbjct: 214 KHQC-QPNYSVDYEKDKHFGASSYSINRNPRNIQREIMTNGPVEGAFTVYEDLILYKTG- 271
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+Y + + G +A ++I+GWG
Sbjct: 272 -----------VYQHVHGKQLGGHA------------IRIIGWGV--------------- 293
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
WGE PYW I +++ +GD G +ILRG++ IES ++
Sbjct: 294 ----------------WGESK-VPYWLIANSWNTDWGDNGFFRILRGKDHCGIESQISAG 336
Query: 242 LPK 244
LPK
Sbjct: 337 LPK 339
>gi|312374701|gb|EFR22198.1| hypothetical protein AND_15621 [Anopheles darlingi]
Length = 335
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 69/244 (28%), Positives = 100/244 (40%), Gaps = 62/244 (25%)
Query: 2 CSSGISSSTWV-WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 60
C+ G + W WVHK GLV+GG SN GCQP + PC H + + P C+ PK
Sbjct: 152 CNGGFPGAAWSYWVHK-GLVSGGPFGSNLGCQPYAIAPCEH-HVNGTRPSCEGEGGKTPK 209
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C +C D+Y + +DK + Y + I++EIM NGPV +Y D+ YK G
Sbjct: 210 CVKKC-QDSYTVPYAKDKRYGSKSYSIPRHEDQIRKEIMTNGPVEGAFTVYEDLLHYKEG 268
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
Y + + +++ ++I+GWG EN Y
Sbjct: 269 VYQH------------------------VTGKMLGGHAIRILGWGVENNTKY-------- 296
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
W I +++ +GD G KILRG + IES +
Sbjct: 297 --------------------------WLIANSWNSDWGDNGFFKILRGEDHLGIESSIAA 330
Query: 241 ALPK 244
LPK
Sbjct: 331 GLPK 334
>gi|294954734|ref|XP_002788292.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239903555|gb|EER20088.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 317
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 74/249 (29%), Positives = 105/249 (42%), Gaps = 66/249 (26%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSN------TGCQPVSFPPCNHANYTTSEPECKTLA 55
C G S W WVH G+ TGG + + GC P FPPC H T P+C +
Sbjct: 128 CDGGYPDSAWSWVHDEGIATGGDYVARGNLTKGDGCWPYDFPPCAHHINDTKYPKCPKGS 187
Query: 56 TPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIF 115
P C +C N Y D R+Y ++++ P Y YS +
Sbjct: 188 YETPNCVEQCHNPKYSTSLKND-----RHY------------MLESSP-----YQYS-VN 224
Query: 116 SYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
+ K+ +GPV A+ +Y D +YKSGVY ++ + + +A
Sbjct: 225 NAKNAIRTDGPVSASYLVYEDFLAYKSGVYKHTSGSYLGGHA------------------ 266
Query: 176 VRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
VK+IGWGEENG YW +V+++ E +GD G KI G N I +
Sbjct: 267 -----------------VKIIGWGEENGEAYWLVVNSWNEDWGDHGLFKIALG-NCQIDD 308
Query: 236 SLVNGALPK 244
L+ G PK
Sbjct: 309 DLL-GGTPK 316
>gi|157167366|ref|XP_001653890.1| cathepsin b [Aedes aegypti]
gi|54289254|gb|AAV31917.1| lysosomal cathepsin B [Aedes aegypti]
gi|108874249|gb|EAT38474.1| AAEL009637-PA [Aedes aegypti]
Length = 340
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 66/243 (27%), Positives = 100/243 (41%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++GLV+GG S+ GCQP + PC H + S P C+ PKC
Sbjct: 157 CNGGFPGAAWSYWVRKGLVSGGPFGSDQGCQPYAIAPCEH-HVNGSRPSCEGEGGKTPKC 215
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C +Y + +DK K Y + + IQ+EIM NGPV +Y D+ +YK G
Sbjct: 216 VKKC-QASYNVPYAKDKMYGKSSYSIANHEKQIQKEIMTNGPVEGAFTVYEDLLNYKEGV 274
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + +++ ++I+GWG E+G Y
Sbjct: 275 YHH------------------------VHGKMLGGHAIRILGWGVEDGTKY--------- 301
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
W I +++ +GD G KILRG + IES +
Sbjct: 302 -------------------------WLIANSWNSDWGDNGFFKILRGEDHLGIESSIAAG 336
Query: 242 LPK 244
LPK
Sbjct: 337 LPK 339
>gi|56756587|gb|AAW26466.1| unknown [Schistosoma japonicum]
Length = 216
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 71/243 (29%), Positives = 99/243 (40%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + +G+VTGG+ ++TGCQP FP C H + P C T P+C
Sbjct: 33 CQGGFPGVAWDYWVTQGIVTGGSKENHTGCQPYPFPKCEH-HTKGKYPACGTKIYKTPQC 91
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C Y + QDK+ Y V IQ+EIM NGPV A +Y D +YKSG
Sbjct: 92 KQKC-QKGYKTPYKQDKHYGDESYNVISNEKAIQKEIMMNGPVEAAFDVYEDFLNYKSGI 150
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + IV ++I+GWG + P
Sbjct: 151 YRH------------------------VTGSIVGGHAIRIIGWGVKKRTP---------- 176
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW I +++ E +G+KG +I+RGR+E IES V
Sbjct: 177 ------------------------YWLIANSWNEDWGEKGLFRIVRGRDECSIESNVVAG 212
Query: 242 LPK 244
L K
Sbjct: 213 LIK 215
>gi|50657025|emb|CAH04630.1| cathepsin B [Suberites domuncula]
Length = 331
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 69/243 (28%), Positives = 101/243 (41%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + GLVTGG ++S GCQP C+H +P C + P+C
Sbjct: 145 CNGGYLGAAWRYFEHTGLVTGGQYNSKEGCQPYLIASCDHHVVGKKQP-CASKEEHTPRC 203
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y F +DK+ Y V V IQ EIM NGPV +Y
Sbjct: 204 SKTC-EAGYDVSFEKDKHFGASAYSVRSSVEAIQTEIMTNGPVEGAFTVY---------- 252
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+D +YKSGVY ++ A + +A ++I+GWG ENG P
Sbjct: 253 -------------ADFPTYKSGVYQHTSGAMLGGHA-IRILGWGTENGTP---------- 288
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW + +++ E +G G KI+RG+++ IES +
Sbjct: 289 ------------------------YWLVANSWNEDWGAMGYFKIIRGKDDCGIESQITAG 324
Query: 242 LPK 244
+PK
Sbjct: 325 MPK 327
>gi|48425700|pdb|1SP4|B Chain B, Crystal Structure Of Ns-134 In Complex With Bovine
Cathepsin B: A Two Headed Epoxysuccinyl Inhibitor
Extends Along The Whole Active Site Cleft
Length = 205
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 73/236 (30%), Positives = 102/236 (43%), Gaps = 61/236 (25%)
Query: 8 SSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 67
S W + K+GLV+GG ++S+ GC+P S PPC H + S P C T PKC+ C
Sbjct: 29 SGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKCNKTC-E 85
Query: 68 DNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPV 127
Y + +DK+ Y V + +I EI KNGPV +YSD YKSG Y +
Sbjct: 86 PGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQH--- 142
Query: 128 VANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEI 187
S EI+ ++I+GWG ENG PYW
Sbjct: 143 ---------------------VSGEIMGGHAIRILGWGVENGTPYW-------------- 167
Query: 188 VAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
L+G +++ +GD G KILRG++ IES + +P
Sbjct: 168 -------LVG-------------NSWNTDWGDNGFFKILRGQDHCGIESEIVAGMP 203
>gi|195437434|ref|XP_002066645.1| GK24603 [Drosophila willistoni]
gi|194162730|gb|EDW77631.1| GK24603 [Drosophila willistoni]
Length = 341
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 69/243 (28%), Positives = 100/243 (41%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W + ++GLV+GG + S GCQP + PC+H+ S P C +C
Sbjct: 158 CQGGYPGAAWAYWARKGLVSGGDYGSQQGCQPYTIEPCDHSG-NGSRPVCTVGGGV--RC 214
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C +Y F +DK + Y ++++V +IQ+EIM NGPV A + +Y D SYK+G
Sbjct: 215 QHLC-EPSYKVDFQRDKNFASKVYSISNDVLEIQKEIMTNGPVQAILTVYEDFLSYKTGV 273
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y E V V+I+GWG
Sbjct: 274 Y------------------------YHLEGEKVGPHAVRILGWGV--------------- 294
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
WG + PYW + +++G +GD G I RG N IE +
Sbjct: 295 ----------------WGTKK-VPYWLVANSWGSDWGDNGFFHIFRGENHCDIEGYIMAG 337
Query: 242 LPK 244
LPK
Sbjct: 338 LPK 340
>gi|432852559|ref|XP_004067308.1| PREDICTED: cathepsin B-like [Oryzias latipes]
Length = 330
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 70/243 (28%), Positives = 99/243 (40%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G ++ W + K GLVTGG + S+ GC+P + PPC H + + P C P+C
Sbjct: 148 CNGGYPTAAWDFWTKEGLVTGGLYDSHVGCRPYTIPPCEH-HVNGTRPPCTGEGGDTPQC 206
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C + Y + +DK+ K Y V IQ EI KNGPV +Y D YKSG
Sbjct: 207 INQCES-GYTPSYKKDKHYGKTSYSVEANENQIQTEIYKNGPVEGAFMVYEDFPMYKSGV 265
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + S ++ +KI+GWG E+G P
Sbjct: 266 YQH------------------------VSGSLIGGHAIKILGWGVEDGVP---------- 291
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW +++ +GD G KILRG + IES V
Sbjct: 292 ------------------------YWLCANSWNTDWGDNGYFKILRGSDHCGIESEVVAG 327
Query: 242 LPK 244
+PK
Sbjct: 328 IPK 330
>gi|327322926|gb|AEA48884.1| cathepsin B [Oplegnathus fasciatus]
Length = 330
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 74/243 (30%), Positives = 100/243 (41%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S+ W + K GLV+GG + S+ GC+P + PC H + S P C P+C
Sbjct: 148 CNGGYPSAAWDFWTKEGLVSGGLYDSHIGCRPYTIAPCEH-HVNGSRPSCTGEGGDTPQC 206
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
T+C Y + +DK+ K Y V + IQ EI KNGPV +Y D YKSG
Sbjct: 207 ITKC-EAGYTPSYKEDKHFGKTSYTVLSDEEQIQSEIFKNGPVEGAFIVYEDFVLYKSGV 265
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + VS SA V +KI+GWG E+G P
Sbjct: 266 YQH----------------------VSGSA--VGGHAIKILGWGVEDGVP---------- 291
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW +++ +GD G K LRG + IES V
Sbjct: 292 ------------------------YWLCANSWNTDWGDNGFFKFLRGSDHCGIESEVVAG 327
Query: 242 LPK 244
+PK
Sbjct: 328 IPK 330
>gi|395507317|ref|XP_003757972.1| PREDICTED: cathepsin B [Sarcophilus harrisii]
Length = 342
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 65/243 (26%), Positives = 100/243 (41%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + K+GLV+GG + S+ GC+P S PPC H + S P C PKC
Sbjct: 152 CNGGFPAGAWKYWIKKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPACTGEGGDTPKC 210
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ +C Y + DK+ Y V +I EI KNGPV +Y+D YKSG
Sbjct: 211 NKKC-EAGYSPDYKDDKHYGTTAYNVPSSEKEIMAEIYKNGPVEGAFIVYADFLQYKSGV 269
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + +++ ++++GWG E+G P
Sbjct: 270 YQH------------------------VTGDMLGGHAIRVLGWGVEDGVP---------- 295
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW +++ +GD G KILRG++ IES +
Sbjct: 296 ------------------------YWLAANSWNTDWGDNGFFKILRGKDHCGIESEMVAG 331
Query: 242 LPK 244
+P+
Sbjct: 332 IPR 334
>gi|187103108|ref|NP_001119614.1| cathepsin B-1418 precursor [Acyrthosiphon pisum]
gi|163300438|tpg|DAA06126.1| TPA_inf: cathepsin B transcript 1418 [Acyrthosiphon pisum]
gi|239788654|dbj|BAH70998.1| ACYPI000010 [Acyrthosiphon pisum]
Length = 346
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 71/243 (29%), Positives = 98/243 (40%), Gaps = 64/243 (26%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G S W + + G+VTGG + S GCQP S PC T E + T P C
Sbjct: 160 CDGGSPESAWYFFMRHGIVTGGDYGSEDGCQPYSIYPCGKGRNTCIEDDPDT-----PDC 214
Query: 62 HTR-CTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
+ CTN NY + + D + Y ++ DI +++ KNGPV A Y+Y+D YKSG
Sbjct: 215 SIKTCTNSNYSKNYRADLHYVDTVYSLSRSEEDIMKDLYKNGPVQAAFYVYTDFMYYKSG 274
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
++SY G +I +KI+GWG ++G YW ++
Sbjct: 275 ----------------VYSYTRG--------QIEGGHAIKILGWGVDDGTKYWLCANSWS 310
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
S WGE G +ILRG NE IE V
Sbjct: 311 RS---------------WGE-------------------NGLFRILRGNNECHIEDRVIA 336
Query: 241 ALP 243
+P
Sbjct: 337 GMP 339
>gi|984958|gb|AAC46877.1| cathepsin B-like proteinase [Ancylostoma caninum]
Length = 343
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 69/242 (28%), Positives = 102/242 (42%), Gaps = 60/242 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W+ + G+VTGG + C+P +F PC H C P PKC
Sbjct: 159 CQGGWPIEAYKWMQRDGVVTGGKYRQKKVCKPYAFYPCGHHQNDPYYGPCPGGLWPTPKC 218
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + + +DK+ R Y++ + +I+QEI
Sbjct: 219 RKTCQR-KYNKSYQEDKHFATRAYYLPNNERNIRQEI----------------------- 254
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y NGPVVA +Y D YK G+Y + WG + G
Sbjct: 255 YKNGPVVAAFRVYQDFSYYKKGIY---------------VHKWGGQTG------------ 287
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES-LVNG 240
A+A VK++GWG EN YW I +++ +G+ G +I+RG NE IE+ +V G
Sbjct: 288 -------AHA-VKVVGWGRENATDYWLIANSWNTDWGESGYFRIVRGTNECGIEAQMVGG 339
Query: 241 AL 242
A+
Sbjct: 340 AM 341
>gi|167538317|ref|XP_001750823.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163770644|gb|EDQ84327.1| predicted protein [Monosiga brevicollis MX1]
Length = 341
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 69/243 (28%), Positives = 99/243 (40%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
CS G S+ W W G+VTGG ++S+ GCQP S P C+H + + P C P P C
Sbjct: 159 CSGGYPSAAWSWFKTTGIVTGGNYNSSQGCQPYSLPNCDH-HVSGQYPACSGEG-PTPAC 216
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + DK+ Y V E I EIM NGPV +Y D+ +YKSG
Sbjct: 217 KKSC-EAGYNNTYSNDKHFGATAYSVAGEADKIATEIMTNGPVEGAFTVYEDLLTYKSGV 275
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + +++ +KI+GWG E+G YW
Sbjct: 276 YQH------------------------TTGQVLGGHAIKIIGWGVESGVDYW-------- 303
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
W + +++ +GD G KI +G +E IES +
Sbjct: 304 ----------------W----------VANSWNNDWGDNGFFKIKKGVDECGIESQIVAG 337
Query: 242 LPK 244
+PK
Sbjct: 338 MPK 340
>gi|294951797|ref|XP_002787132.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239901778|gb|EER18928.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 278
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 69/249 (27%), Positives = 100/249 (40%), Gaps = 66/249 (26%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAH------HSNTGCQPVSFPPCNHANYTTSEPECKTLA 55
C+ G +S W WVH +G+ TGG + + GC P FPPC H + P+C +
Sbjct: 89 CNGGFPNSAWSWVHDKGIATGGDYVAEDDMTKDDGCWPYDFPPCAHHVNDSKYPKCPKDS 148
Query: 56 TPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIF 115
P C +C N Y D++ V ++ Y YS +
Sbjct: 149 YETPNCAEQCHNPKYTTTLRDDRHFM----------------------VESSPYQYS-VN 185
Query: 116 SYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
K+ +GPV A+ +Y D +YKSGVY
Sbjct: 186 DAKNAIRTDGPVSASFTVYEDFLAYKSGVYK----------------------------- 216
Query: 176 VRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
S E + VK+IGWGEE+G+ YW +V+++ E +GD G KI G I+
Sbjct: 217 ------HTSGEYLGGHAVKIIGWGEESGQAYWLVVNSWNEDWGDHGLFKIALGN--CGID 268
Query: 236 SLVNGALPK 244
+ G PK
Sbjct: 269 DYLLGGTPK 277
>gi|9955277|pdb|1QDQ|A Chain A, X-Ray Crystal Structure Of Bovine Cathepsin B-Ca074
Complex
Length = 253
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 71/236 (30%), Positives = 99/236 (41%), Gaps = 61/236 (25%)
Query: 8 SSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 67
S W + K+GLV+GG ++S+ GC+P S PPC H + S P C T PKC C
Sbjct: 77 SGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKCSKTC-E 133
Query: 68 DNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPV 127
Y + +DK+ Y V + +I EI KNGPV +YSD YKSG Y +
Sbjct: 134 PGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQH--- 190
Query: 128 VANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEI 187
S EI+ ++I+GWG ENG P
Sbjct: 191 ---------------------VSGEIMGGHAIRILGWGVENGTP---------------- 213
Query: 188 VAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
YW + +++ +GD G KILRG++ IES + +P
Sbjct: 214 ------------------YWLVANSWNTDWGDNGFFKILRGQDHCGIESEIVAGMP 251
>gi|17565162|ref|NP_503382.1| Protein W07B8.4 [Caenorhabditis elegans]
gi|351059398|emb|CCD74288.1| Protein W07B8.4 [Caenorhabditis elegans]
Length = 335
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 73/246 (29%), Positives = 103/246 (41%), Gaps = 59/246 (23%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + K GLVTGG+ S GC+P S PC + PEC + PKC
Sbjct: 145 CEGGYPIQAWRYWVKNGLVTGGSFESQYGCKPYSIAPCGETIDGVTWPECPMKISDTPKC 204
Query: 62 HTRCT-NDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
CT N++Y + QDK+ Y + IQ EI+ +GPV +Y
Sbjct: 205 EHHCTGNNSYPIPYDQDKHFGASAYAIGRSAKQIQTEILAHGPVEVGFIVY--------- 255
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
D + YK+G+Y A E+ +A VK++GWG +NG PYW
Sbjct: 256 --------------EDFYLYKTGIYTHVAGGELGGHA-VKMLGWGVDNGTPYW------- 293
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
++A++ W V +G+KG +ILRG +E IES
Sbjct: 294 LAANS---------------------WNTV------WGEKGYFRILRGVDECGIESAAVA 326
Query: 241 ALPKDN 246
+P N
Sbjct: 327 GMPDLN 332
>gi|213514196|ref|NP_001133994.1| Cathepsin B precursor [Salmo salar]
gi|209156086|gb|ACI34275.1| Cathepsin B precursor [Salmo salar]
Length = 330
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 72/242 (29%), Positives = 103/242 (42%), Gaps = 60/242 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S+ + K GLV+GG + S+ GC+P S PPC H + + P CK P+C
Sbjct: 148 CNGGYPSAACDFWTKEGLVSGGLYDSHIGCRPYSIPPCEH-HVNGTRPPCKGEEGDTPQC 206
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C Y G+ QDK+ KR Y V + +I +E+ KNGPV +Y D YKSG
Sbjct: 207 TNQC-EPGYTPGYKQDKHFGKRSYSVPSDEKEIMKELYKNGPVEGAFTVYEDFLLYKSGV 265
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + VS SA V +K++GWGEE G P
Sbjct: 266 YRH----------------------VSGSA--VGGHAIKVLGWGEEGGIP---------- 291
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW +++ +G+ G KI+RG + IES +
Sbjct: 292 ------------------------YWLAANSWNTDWGENGFFKIVRGEDHCGIESEMVAG 327
Query: 242 LP 243
+P
Sbjct: 328 IP 329
>gi|442754445|gb|JAA69382.1| Putative cathepsin b precursor [Ixodes ricinus]
Length = 340
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 71/243 (29%), Positives = 98/243 (40%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G ++ W + +G+VTGG + ++ GC P P C+H T P + P PKC
Sbjct: 158 CNGGFPAAAWSYWVDKGIVTGGNYDTDEGCMPYPVPSCDHHVNGTLGPCGQD--PPTPKC 215
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
R Y F DK+ K Y V IQ EIMKNGPV +Y+D YKSG
Sbjct: 216 -VRLCRKGYNVDFKDDKHYGKSSYSVPSNETQIQMEIMKNGPVEGAFTVYADFPLYKSGV 274
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y S S + + ++I+GWG EN P
Sbjct: 275 YK------------------------SHSTDALGGHAIRILGWGVENDVP---------- 300
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW + +++ ++GDKG KILRG NE IE +
Sbjct: 301 ------------------------YWLVANSWNTEWGDKGYFKILRGSNECGIEEDIVAG 336
Query: 242 LPK 244
+PK
Sbjct: 337 IPK 339
>gi|28373366|pdb|1ITO|A Chain A, Crystal Structure Analysis Of Bovine Spleen Cathepsin B-
E64c Complex
gi|88192750|pdb|2DC6|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-ca073 Complex
gi|88192751|pdb|2DC7|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-ca042 Complex
gi|88192752|pdb|2DC8|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-ca059 Complex
gi|88192753|pdb|2DC9|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-Ca074me Complex
gi|88192754|pdb|2DCA|A Chain A, X-ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-ca075 Complex
gi|88192755|pdb|2DCB|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-Ca076 Complex
gi|88192756|pdb|2DCC|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-Ca077 Complex
gi|88192757|pdb|2DCD|A Chain A, X-Ray Crystal Structure Analysis Of Bovine Spleen
Cathepsin B-Ca078 Complex
Length = 256
Score = 98.6 bits (244), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 73/236 (30%), Positives = 101/236 (42%), Gaps = 61/236 (25%)
Query: 8 SSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 67
S W + K+GLV+GG ++S+ GC+P S PPC H + S P C T PKC C
Sbjct: 77 SGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKCSKTC-E 133
Query: 68 DNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPV 127
Y + +DK+ Y V + +I EI KNGPV +YSD YKSG Y +
Sbjct: 134 PGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQH--- 190
Query: 128 VANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEI 187
S EI+ ++I+GWG ENG PYW
Sbjct: 191 ---------------------VSGEIMGGHAIRILGWGVENGTPYW-------------- 215
Query: 188 VAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
L+G +++ +GD G KILRG++ IES + +P
Sbjct: 216 -------LVG-------------NSWNTDWGDNGFFKILRGQDHCGIESEIVAGMP 251
>gi|255040225|gb|ACT99885.1| cathepsin B2 [Opisthorchis viverrini]
Length = 337
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 69/243 (28%), Positives = 94/243 (38%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W + G+VTGG+ GC+ FP C+H + P C PKC
Sbjct: 149 CQGGYPPAAWDFWQAYGIVTGGSKEDPMGCRSYPFPKCSHHG-SKKYPPCPHRIYDTPKC 207
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C N + DK R Y V I +EIM NGPV A +Y D F YK G
Sbjct: 208 VPKCDTPNID--YETDKTRANITYNVQRSQMAIMKEIMINGPVEAAFEVYEDFFGYKQGV 265
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y ++ E + ++I+GWGEENG PY
Sbjct: 266 Y------------------------FHSTGEFIGGHAIRILGWGEENGTPY--------- 292
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
W I +++ E +G+ G K+LRG+NE IE V
Sbjct: 293 -------------------------WLIANSWNEGWGEDGYFKMLRGKNECGIEDEVTAG 327
Query: 242 LPK 244
LP+
Sbjct: 328 LPE 330
>gi|161343839|tpg|DAA06100.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 323
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 72/246 (29%), Positives = 98/246 (39%), Gaps = 66/246 (26%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPK 60
C G + W +G+VTGG SN GCQP PC+H Y S C +L Q
Sbjct: 134 CDGGSAFKAWELTMNKGIVTGGNFDSNEGCQPYKNRPCDH--YGDSRLTNCSSLRRTQMT 191
Query: 61 -CHTRCTNDNYGRGFFQDKYRFKRYY---WVNDEVADIQQEIMKNGPVVANMYLYSDIFS 116
C +C N NY + D ++ Y W N V IQQEIM +GPV A MY+Y +
Sbjct: 192 VCRKKCVNKNYKVKYEDDLHKTSIVYMTSWTN--VKQIQQEIMTHGPVTAFMYVYENFMG 249
Query: 117 YKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
YK G Y S + E++ Y VK++GWG +
Sbjct: 250 YKEGIYK------------------------STTGELIGYHHVKLIGWGVDG-------- 277
Query: 177 RVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
+G YW ++++ +G+ G KILRG N IE
Sbjct: 278 -------------------------DGTEYWLAMNSWNSNWGNDGLFKILRGYNFCSIEL 312
Query: 237 LVNGAL 242
LV +
Sbjct: 313 LVMAGI 318
>gi|195438776|ref|XP_002067308.1| GK16352 [Drosophila willistoni]
gi|194163393|gb|EDW78294.1| GK16352 [Drosophila willistoni]
Length = 340
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 68/243 (27%), Positives = 107/243 (44%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++G+V+GG S GC+P PC H + + P C + +TP +C
Sbjct: 157 CNGGFPGAAWSYWTRKGIVSGGNFGSQQGCRPYEIEPCEH-HVNGTRPPCSSGSTP--RC 213
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C + +Y + +DK + Y + + V DIQ+EIM NGPV +Y
Sbjct: 214 QHVCES-SYKVDYKKDKNFGSKSYSIKNNVLDIQKEIMNNGPVEGAFTVY---------- 262
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
D+ YKSGVY E+ +A ++I+GWG
Sbjct: 263 -------------EDLILYKSGVYEHVHGKELGGHA-IRILGWGV--------------- 293
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
WG+E PYW I +++ +GD G +I+RG++ IES ++
Sbjct: 294 ----------------WGDEK-IPYWLIANSWNTDWGDNGFFRIVRGKDHCGIESSISAG 336
Query: 242 LPK 244
LPK
Sbjct: 337 LPK 339
>gi|49036806|gb|AAT48984.1| cathepsin B-like proteinase [Triatoma sordida]
Length = 331
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 69/243 (28%), Positives = 99/243 (40%), Gaps = 62/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G +S W + G+V+GG + S GCQP S PC H + S P C P C
Sbjct: 150 CDGGFPASAWDYWQNEGIVSGGNYGSKQGCQPYSIAPCEH-HVPGSRPACSG-GGDTPDC 207
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C ++ G + QD Y + Y + DE IQ EI+KNGPV A +Y D+ +YK G
Sbjct: 208 RNQC-DEGSGISYDQDHYYGETVYTL-DEAKQIQAEILKNGPVEAAFTVYEDLLNYKEGV 265
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + E + +KI+GWG EN P
Sbjct: 266 YQH------------------------VAGEALGGHAIKILGWGVENDTP---------- 291
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW + +++ +G+ G KILRG +E IE +
Sbjct: 292 ------------------------YWLVANSWNTDWGNNGFFKILRGSDECGIEDQIVAG 327
Query: 242 LPK 244
LP+
Sbjct: 328 LPR 330
>gi|353228456|emb|CCD74627.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 333
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 71/239 (29%), Positives = 102/239 (42%), Gaps = 64/239 (26%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G S +W + K GLVTG TGC P FP C+H + + S P+C + P C
Sbjct: 152 CQIGFSEFSWDYWLKNGLVTGDP----TGCLPYPFPKCDHRS-SNSYPKCGYITYTAPPC 206
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C + Y + DK+ + Y + +DI++EIM NGPV A ++++SD +YKSG
Sbjct: 207 TKTCRS-GYPIPYKADKHYGRVIYSLRPNESDIRKEIMMNGPVEAGIFVHSDFLNYKSGV 265
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + ++V +V+I+GWG EN P
Sbjct: 266 YRH------------------------ITGQLVTIHSVRIIGWGIENDIP---------- 291
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
YW +++ E +G G KILRG NE IES VN
Sbjct: 292 ------------------------YWLCANSWNEDWGLNGYFKILRGSNECEIESFVNA 326
>gi|256090674|ref|XP_002581308.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 250
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 71/239 (29%), Positives = 101/239 (42%), Gaps = 64/239 (26%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G S +W + K GLVTG TGC P FP C+H + + S P+C + P C
Sbjct: 69 CQIGFSEFSWDYWLKNGLVTGDP----TGCLPYPFPKCDHRS-SNSYPKCGYITYTAPPC 123
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + DK+ + Y + +DI++EIM NGPV A ++++SD +YKSG
Sbjct: 124 TKTC-RSGYPIPYKADKHYGRVIYSLRPNESDIRKEIMMNGPVEAGIFVHSDFLNYKSGV 182
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + ++V +V+I+GWG EN P
Sbjct: 183 YRH------------------------ITGQLVTIHSVRIIGWGIENDIP---------- 208
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
YW +++ E +G G KILRG NE IES VN
Sbjct: 209 ------------------------YWLCANSWNEDWGLNGYFKILRGSNECEIESFVNA 243
>gi|341904470|gb|EGT60303.1| hypothetical protein CAEBREN_20420 [Caenorhabditis brenneri]
Length = 351
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 72/243 (29%), Positives = 101/243 (41%), Gaps = 59/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G W K G VTGG++ TGC+P +PPC H T C + P KC
Sbjct: 167 CNGGYPIEAWRHYVKNGYVTGGSYQEKTGCKPYPYPPCEHHVNGTHYKPCPSDMYPTDKC 226
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QD + + Y V+ + +IQ+EIM NGPV +Y+D F SG
Sbjct: 227 ERSC-QAGYSLTYKQDLHFGQSAYAVSKKATEIQKEIMTNGPVEVAFTVYAD-FEVYSG- 283
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
GVY +A A + +A VK++GWG +NG P
Sbjct: 284 ---------------------GVYVHTAGASLGGHA-VKMLGWGVDNGTP---------- 311
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW +++ E +G+ G +I+RG NE IE V G
Sbjct: 312 ------------------------YWLCANSWNEDWGENGYFRIIRGVNECGIEHGVVGG 347
Query: 242 LPK 244
+PK
Sbjct: 348 IPK 350
>gi|147906534|ref|NP_001090927.1| cathepsin B precursor [Sus scrofa]
gi|187470655|sp|A1E295.1|CATB_PIG RecName: Full=Cathepsin B; Contains: RecName: Full=Cathepsin B
light chain; Contains: RecName: Full=Cathepsin B heavy
chain; Flags: Precursor
gi|118490058|gb|ABK96810.1| cathepsin B [Sus scrofa]
Length = 335
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 71/242 (29%), Positives = 103/242 (42%), Gaps = 61/242 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W + K+GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + +DK+ Y ++ +I EI KNGPV +YSD YKSG
Sbjct: 208 SKIC-EPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYKNGPVEGAFTVYSDFLQYKSGV 266
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + +++ ++I+GWG ENG PYW
Sbjct: 267 YQH------------------------VTGDLMGGHAIRILGWGVENGTPYW-------- 294
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
L+G +++ +GD G KILRG++ IES +
Sbjct: 295 -------------LVG-------------NSWNTDWGDNGFFKILRGQDHCGIESEIVAG 328
Query: 242 LP 243
+P
Sbjct: 329 IP 330
>gi|442616292|ref|NP_001259536.1| cathepsin B1, isoform B [Drosophila melanogaster]
gi|440216755|gb|AGB95378.1| cathepsin B1, isoform B [Drosophila melanogaster]
Length = 330
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 70/243 (28%), Positives = 104/243 (42%), Gaps = 59/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++G+V+GG + SN GC+P PC H + + P C PKC
Sbjct: 146 CNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPYEISPCEH-HVNGTRPPCAH-GGRTPKC 203
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C + Y + +DK+ + Y V V +IQ+EIM NGPV +Y
Sbjct: 204 SHVCQS-GYTVDYAKDKHFGSKSYSVRRNVREIQEEIMTNGPVEGAFTVY---------- 252
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
D+ YK GVY E+ +A ++I+GWG
Sbjct: 253 -------------EDLILYKDGVYQHEHGKELGGHA-IRILGWGV--------------- 283
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
WGEE PYW I +++ +GD G +ILRG++ IES ++
Sbjct: 284 ----------------WGEEK-IPYWLIGNSWNTDWGDHGFFRILRGQDHCGIESSISAG 326
Query: 242 LPK 244
LPK
Sbjct: 327 LPK 329
>gi|91078964|ref|XP_974298.1| PREDICTED: similar to putative cathepsin B-like like proteinase
[Tribolium castaneum]
gi|270004838|gb|EFA01286.1| cathepsin B precursor [Tribolium castaneum]
Length = 335
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 69/242 (28%), Positives = 103/242 (42%), Gaps = 62/242 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + + G+VTGG + + GC+ + PPC H + P C + P P+C
Sbjct: 154 CNGGWPAEAWAYWAETGIVTGGKYETKDGCKAYTVPPCEH-HTEGDLPACGDI-VPTPQC 211
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C D ++ R Y + + + IQ EIM NGPV A+ +Y D +YKSG
Sbjct: 212 KKEC--DAGVDIEYKSDLRKGSAYQTSSDESQIQTEIMTNGPVEADFDVYEDFLNYKSG- 268
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
++ +G YA + +KI+GWG E+G P
Sbjct: 269 ---------------VYQQTTGNYAGGHA--------IKILGWGVEDGTP---------- 295
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW +++ E +GDKG KILRG+NE IES + G
Sbjct: 296 ------------------------YWLAANSWNEDWGDKGYFKILRGQNECGIESDIIGG 331
Query: 242 LP 243
+P
Sbjct: 332 IP 333
>gi|354471594|ref|XP_003498026.1| PREDICTED: cathepsin B-like [Cricetulus griseus]
gi|344254255|gb|EGW10359.1| Cathepsin B [Cricetulus griseus]
Length = 339
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 69/245 (28%), Positives = 106/245 (43%), Gaps = 61/245 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W + K+GLV+GG ++S+ GC P + PPC H + S P+C T PKC
Sbjct: 150 CNGGYPSGAWNFWIKKGLVSGGLYNSHVGCLPYTIPPCEH-HVNGSRPQC-TGEGDTPKC 207
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + +DK+ Y V++ +I EI KNGPV ++SD +YKSG
Sbjct: 208 TKSC-EAGYSPSYKEDKHYGYTSYSVSNNEKEIMAEIYKNGPVEGAFTVFSDFLTYKSG- 265
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
++ +++G +I+ ++I+GWG EN PYW + + V
Sbjct: 266 ---------------VYKHEAG--------DIMGGHAIRILGWGVENSVPYWLVANSWNV 302
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+GD G KILRG + IES +
Sbjct: 303 ----------------------------------DWGDNGLFKILRGEDHCGIESEIVAG 328
Query: 242 LPKDN 246
+P+ +
Sbjct: 329 IPRTD 333
>gi|121309133|dbj|BAF43801.1| Longipain [Haemaphysalis longicornis]
Length = 341
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 74/244 (30%), Positives = 100/244 (40%), Gaps = 62/244 (25%)
Query: 2 CSSGISSSTW-VWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 60
C+ G + W WVHK G+VTGG + S+ GC P C+H T P C P P+
Sbjct: 158 CNGGFPGAAWSYWVHK-GIVTGGNYDSDEGCMPYPIKACDHHVNGTLGP-CDKSIPPTPR 215
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C R Y F DK+ K+ Y V V IQ EIM NGPV A+ +Y
Sbjct: 216 C-VRMCRKGYNVDFADDKHYGKKSYSVPSNVTQIQVEIMTNGPVEADFTVY--------- 265
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
+D YKSGVY + +A ++++GWG E G P
Sbjct: 266 --------------ADFPLYKSGVYQRHTDQALGGHA-IRLLGWGVEKGVP--------- 301
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
YW +++ ++GDKG KILRG +E IE V
Sbjct: 302 -------------------------YWLAANSWNTEWGDKGFFKILRGSDECGIEDDVVA 336
Query: 241 ALPK 244
+P+
Sbjct: 337 GIPR 340
>gi|27806671|ref|NP_776456.1| cathepsin B precursor [Bos taurus]
gi|115312124|sp|P07688.5|CATB_BOVIN RecName: Full=Cathepsin B; AltName: Full=BCSB; Contains: RecName:
Full=Cathepsin B light chain; Contains: RecName:
Full=Cathepsin B heavy chain; Flags: Precursor
gi|289402|gb|AAA03064.1| cathepsin B [Bos taurus]
gi|809479|gb|AAA80198.1| cathepsin B [Bos taurus]
gi|296484950|tpg|DAA27065.1| TPA: cathepsin B precursor [Bos taurus]
Length = 335
Score = 98.2 bits (243), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 73/236 (30%), Positives = 101/236 (42%), Gaps = 61/236 (25%)
Query: 8 SSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 67
S W + K+GLV+GG ++S+ GC+P S PPC H + S P C T PKC C
Sbjct: 156 SGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKCSKTC-E 212
Query: 68 DNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPV 127
Y + +DK+ Y V + +I EI KNGPV +YSD YKSG Y +
Sbjct: 213 PGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQH--- 269
Query: 128 VANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEI 187
S EI+ ++I+GWG ENG PYW
Sbjct: 270 ---------------------VSGEIMGGHAIRILGWGVENGTPYW-------------- 294
Query: 188 VAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
L+G +++ +GD G KILRG++ IES + +P
Sbjct: 295 -------LVG-------------NSWNTDWGDNGFFKILRGQDHCGIESEIVAGMP 330
>gi|171948776|gb|ACB59245.1| cathepsin B [Sus scrofa]
Length = 335
Score = 98.2 bits (243), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 71/242 (29%), Positives = 103/242 (42%), Gaps = 61/242 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W + K+GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGFPSGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + +DK+ Y ++ +I EI KNGPV +YSD YKSG
Sbjct: 208 SKIC-EPGYTPSYKEDKHFGCSSYSISRNEKEIMAEIYKNGPVEGAFTVYSDFLQYKSGV 266
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + +++ ++I+GWG ENG PYW
Sbjct: 267 YQH------------------------VTGDLMGGHAIRILGWGVENGTPYW-------- 294
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
L+G +++ +GD G KILRG++ IES +
Sbjct: 295 -------------LVG-------------NSWNTDWGDNGFFKILRGQDHCGIESEIVAG 328
Query: 242 LP 243
+P
Sbjct: 329 IP 330
>gi|149698064|ref|XP_001498242.1| PREDICTED: cathepsin B [Equus caballus]
Length = 340
Score = 98.2 bits (243), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 72/242 (29%), Positives = 103/242 (42%), Gaps = 60/242 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + K+GLV+GG + S+ GC+P S PPC H + S P C PKC
Sbjct: 150 CNGGFPAEAWNFWTKQGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPCTGEGGDTPKC 208
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + +DK+ Y V+ +I EI KNGPV A +YSD YKSG
Sbjct: 209 SKIC-EPGYSPSYKEDKHYGCSSYSVSSSEKEIMAEIFKNGPVEAAFTVYSDFLQYKSGV 267
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + +++ V+I+GWG ENG PYW
Sbjct: 268 YQH------------------------VAGDMMGGHAVRILGWGVENGTPYW-------- 295
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
L+G +++ +GD G KILRG++ IES +
Sbjct: 296 -------------LVG-------------NSWNTDWGDNGFFKILRGQDHCGIESEIVAG 329
Query: 242 LP 243
+P
Sbjct: 330 IP 331
>gi|291385792|ref|XP_002709482.1| PREDICTED: cathepsin B [Oryctolagus cuniculus]
Length = 339
Score = 98.2 bits (243), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 70/239 (29%), Positives = 104/239 (43%), Gaps = 61/239 (25%)
Query: 8 SSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 67
S W + K+GLV+GG + S+ GC+P S PPC H + S P C T P+C C
Sbjct: 156 SGAWNFWTKKGLVSGGLYDSHVGCKPYSIPPCEH-HVNGSRPAC-TGEGDTPRCSKTC-E 212
Query: 68 DNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPV 127
Y + +DK+ Y V+ + +I+ EI KNGPV +YSD YKSG Y +
Sbjct: 213 PGYSPSYKEDKHYGYSSYSVSSDENEIKAEIYKNGPVEGAFTVYSDFLMYKSGVYQH--- 269
Query: 128 VANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEI 187
+ +I+ ++I+GWGEENG P
Sbjct: 270 ---------------------TTGDIMGGHAIRILGWGEENGVP---------------- 292
Query: 188 VAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDN 246
YW + +++ +GDKG KILRG++ IES + +P+ +
Sbjct: 293 ------------------YWLVANSWNTDWGDKGFFKILRGQDHCGIESEIVAGIPRTD 333
>gi|440913587|gb|ELR63025.1| Cathepsin B [Bos grunniens mutus]
Length = 335
Score = 98.2 bits (243), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 73/236 (30%), Positives = 101/236 (42%), Gaps = 61/236 (25%)
Query: 8 SSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 67
S W + K+GLV+GG ++S+ GC+P S PPC H + S P C T PKC C
Sbjct: 156 SGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKCSKTC-E 212
Query: 68 DNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPV 127
Y + +DK+ Y V + +I EI KNGPV +YSD YKSG Y +
Sbjct: 213 PGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQH--- 269
Query: 128 VANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEI 187
S EI+ ++I+GWG ENG PYW
Sbjct: 270 ---------------------VSGEIMGGHAIRILGWGVENGTPYW-------------- 294
Query: 188 VAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
L+G +++ +GD G KILRG++ IES + +P
Sbjct: 295 -------LVG-------------NSWNTDWGDNGFFKILRGQDHCGIESEIVAGMP 330
>gi|17565158|ref|NP_503384.1| Protein W07B8.1 [Caenorhabditis elegans]
gi|351059396|emb|CCD74286.1| Protein W07B8.1 [Caenorhabditis elegans]
Length = 335
Score = 98.2 bits (243), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 62/246 (25%), Positives = 97/246 (39%), Gaps = 59/246 (23%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W ++ K G+ TGG++ S GC+P S PPC + P C +P P C
Sbjct: 148 CEGGNPFKAWQYIQKHGIPTGGSYESQFGCKPYSIPPCGKTVGNVTYPACTNTTSPTPSC 207
Query: 62 HTRCTND-NYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
+CT+ Y +D++ + + +IQ ++M NGP+ A +Y D Y +G
Sbjct: 208 EKKCTSRIGYPIDIDKDRHYGVSVDQLPNSQIEIQSDVMLNGPIQATFEVYDDFLQYTTG 267
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
Y V + + +V+I+GWG G P
Sbjct: 268 IY------------------------VHLTGNKQGHLSVRIIGWGVWQGVP--------- 294
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
YW +++G Q+G+ GT ++LRG NE +ES
Sbjct: 295 -------------------------YWLCANSWGRQWGENGTFRVLRGTNECGLESNCVS 329
Query: 241 ALPKDN 246
+PK N
Sbjct: 330 GMPKLN 335
>gi|18921171|ref|NP_572920.1| cathepsin B1, isoform A [Drosophila melanogaster]
gi|7292926|gb|AAF48317.1| cathepsin B1, isoform A [Drosophila melanogaster]
gi|16767940|gb|AAL28188.1| GH06546p [Drosophila melanogaster]
gi|220944992|gb|ACL85039.1| CG10992-PA [synthetic construct]
gi|220954816|gb|ACL89951.1| CG10992-PA [synthetic construct]
Length = 340
Score = 98.2 bits (243), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 70/243 (28%), Positives = 104/243 (42%), Gaps = 59/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++G+V+GG + SN GC+P PC H + + P C PKC
Sbjct: 156 CNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPYEISPCEH-HVNGTRPPCAH-GGRTPKC 213
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C + Y + +DK+ + Y V V +IQ+EIM NGPV +Y
Sbjct: 214 SHVCQS-GYTVDYAKDKHFGSKSYSVRRNVREIQEEIMTNGPVEGAFTVY---------- 262
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
D+ YK GVY E+ +A ++I+GWG
Sbjct: 263 -------------EDLILYKDGVYQHEHGKELGGHA-IRILGWGV--------------- 293
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
WGEE PYW I +++ +GD G +ILRG++ IES ++
Sbjct: 294 ----------------WGEEK-IPYWLIGNSWNTDWGDHGFFRILRGQDHCGIESSISAG 336
Query: 242 LPK 244
LPK
Sbjct: 337 LPK 339
>gi|326916753|ref|XP_003204669.1| PREDICTED: cathepsin B-like [Meleagris gallopavo]
Length = 340
Score = 98.2 bits (243), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 69/243 (28%), Positives = 96/243 (39%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W + +RGLV+GG + S+ GC+P + PPC H + S P C P+C
Sbjct: 150 CNGGYPSGAWRYWTERGLVSGGLYDSHVGCRPYTIPPCEH-HVNGSRPPCTGEGGETPRC 208
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + +DK+ Y V +I EI KNGPV +Y D YKSG
Sbjct: 209 SRHC-EPGYSPSYKEDKHYGITSYGVPRSEKEIMAEIYKNGPVEGAFIVYEDFLMYKSGV 267
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + S E V ++I+GWG ENG P
Sbjct: 268 YQH------------------------VSGEQVGGHAIRILGWGVENGTP---------- 293
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW +++ +GD G KILRG + IES +
Sbjct: 294 ------------------------YWLAANSWNTDWGDNGFFKILRGEDHCGIESEIVAG 329
Query: 242 LPK 244
+P+
Sbjct: 330 VPR 332
>gi|195352458|ref|XP_002042729.1| GM17589 [Drosophila sechellia]
gi|194126760|gb|EDW48803.1| GM17589 [Drosophila sechellia]
Length = 340
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 69/243 (28%), Positives = 105/243 (43%), Gaps = 59/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++G+V+GG + SN GC+P PC H + + P C + PKC
Sbjct: 156 CNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPYEISPCEH-HVNGTRPPCAN-GSGTPKC 213
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C + +Y + +DK+ + Y V V +IQ+EIM NGPV +Y D
Sbjct: 214 SHVCQS-SYTVDYAKDKHFGSKSYSVKRNVREIQEEIMTNGPVEGAFTVYED-------- 264
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+ YK GVY E+ +A ++I+GWG
Sbjct: 265 ---------------LILYKDGVYQHEHGKELGGHA-IRILGWGV--------------- 293
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
WG E PYW I +++ +GD G +ILRG++ IES ++
Sbjct: 294 ----------------WGNEK-IPYWLIGNSWNTDWGDHGFFRILRGQDHCGIESSISAG 336
Query: 242 LPK 244
LPK
Sbjct: 337 LPK 339
>gi|160333103|ref|NP_001103948.1| capthepsin B, b precursor [Danio rerio]
gi|133777414|gb|AAI15255.1| Ctsbb protein [Danio rerio]
Length = 326
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 74/243 (30%), Positives = 102/243 (41%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
CS G + W + + GLVTGG ++S+ GC+P S PC H + + P C PKC
Sbjct: 144 CSGGFPAEAWDYWRRSGLVTGGLYNSDVGCRPYSIAPCEH-HVNGTRPPCSG-EQDTPKC 201
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ + Y V + I E+ NGPV A +Y D
Sbjct: 202 TGVCI-PKYSVPYKQDKHFGSKVYNVPSDQQQIMTELYTNGPVEAAFTVYEDF------- 253
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
P+ YKSGVY + + +A VKI+GWGEENG P
Sbjct: 254 ----PL------------YKSGVYQHLTGSALGGHA-VKILGWGEENGTP---------- 286
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+W + +++ +GD G KILRG +E IES +
Sbjct: 287 ------------------------FWLVANSWNSDWGDNGYFKILRGHDECGIESEMVAG 322
Query: 242 LPK 244
LPK
Sbjct: 323 LPK 325
>gi|195566634|ref|XP_002106884.1| GD15875 [Drosophila simulans]
gi|194204277|gb|EDX17853.1| GD15875 [Drosophila simulans]
Length = 340
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 70/243 (28%), Positives = 104/243 (42%), Gaps = 59/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++G+V+GG + SN GC+P PC H T P TP KC
Sbjct: 156 CNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPYEISPCEHHVNGTRPPCAHGGGTP--KC 213
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C + +Y + +DK+ + Y V V +IQ+EIM NGPV +Y D
Sbjct: 214 SHVCQS-SYTVDYAKDKHFGSKSYSVKRNVREIQEEIMTNGPVEGAFTVYED-------- 264
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+ YK GVY E+ +A ++I+GWG
Sbjct: 265 ---------------LILYKDGVYQHEHGKELGGHA-IRILGWGV--------------- 293
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
WG+E PYW I +++ +GD G +ILRG++ IES ++
Sbjct: 294 ----------------WGDEK-IPYWLIGNSWNTDWGDHGFFRILRGQDHCGIESSISAG 336
Query: 242 LPK 244
LPK
Sbjct: 337 LPK 339
>gi|38147393|gb|AAR12009.1| cathepsin B-like proteinase [Triatoma infestans]
Length = 332
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 67/243 (27%), Positives = 100/243 (41%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G +S W + G+V+GG + S GCQP S PC H + P C + P C
Sbjct: 150 CDGGYPASAWDYWQNVGIVSGGNYGSKQGCQPYSIAPCEH-HVPGPRPACSGEGS-TPDC 207
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C + G + +D Y + Y + DE IQ EI+KNGPV A +Y D+ +YK G
Sbjct: 208 RNQC-DKRSGISYDKDLYYGESAYSLEDEAKQIQAEILKNGPVEAAFTVYEDLVNYKEGV 266
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + ++ +KI+GWG EN P
Sbjct: 267 YQH------------------------VAGSVLGGHAIKILGWGVENDTP---------- 292
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW + +++ +G+ G KILRG++E IE V+
Sbjct: 293 ------------------------YWLVANSWNTDWGNNGFFKILRGKDECGIEIDVSAG 328
Query: 242 LPK 244
LP+
Sbjct: 329 LPR 331
>gi|3087803|emb|CAA93279.1| cysteine protease [Haemonchus contortus]
Length = 325
Score = 97.4 bits (241), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 65/174 (37%), Positives = 83/174 (47%), Gaps = 26/174 (14%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W +V + G VTGG + + C+ FPPC H T EC A PKC
Sbjct: 163 CQGGWPIEAWEYVAREGAVTGGRLLAKSCCRSHPFPPCGHHGNETYYGECGGRAR-TPKC 221
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
T CT Y + DK R K Y + + V IQ+EIMKNGPVVA +Y+D FSY
Sbjct: 222 RTSCT-PGYKNSYSDDKIRGKDAYELPNSVKAIQREIMKNGPVVAAFTVYAD-FSY---- 275
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
YK G+Y +A ++A VK++GWGEE PYW +
Sbjct: 276 ------------------YKKGIYKHTAGRARGSHA-VKVIGWGEEGDVPYWIV 310
>gi|161343829|tpg|DAA06095.1| TPA_inf: cathepsin B [Aphis gossypii]
Length = 280
Score = 97.4 bits (241), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 58/171 (33%), Positives = 77/171 (45%), Gaps = 25/171 (14%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQ-PK 60
C G +W + + G V+GG ++SN GCQP PPC N + C T + P
Sbjct: 132 CDGGSQFESWDFYRRHGFVSGGDYNSNQGCQPYMIPPCKLINEKSPRHSCTTYNREETPA 191
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C +C N NY F D Y+ K YY V +A +EI NGP+ Y+Y D+ YKSG
Sbjct: 192 CEIKCNNPNYYSSFKTDIYKGK-YYQVYPFMA--MKEIFDNGPITTQFYMYRDLIDYKSG 248
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRP 171
++ Y G Y + KI+GWGEENG P
Sbjct: 249 ----------------VYQYDEGFY-----GDFFTVQGXKIIGWGEENGDP 278
>gi|496317|dbj|BAA04103.1| Sarcophaga pro-cathepsin B [Sarcophaga peregrina]
Length = 344
Score = 97.4 bits (241), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 65/243 (26%), Positives = 100/243 (41%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++G+V+GG + S+ GC+P PC H + + P C P C
Sbjct: 161 CNGGFPGAAWAYWTRKGIVSGGPYGSSQGCRPYEIAPCEH-HVNGTRPPCDGEHGKTPSC 219
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C +Y + DK+ + Y V V DIQ+EIM+NGPV +Y D+ YK G
Sbjct: 220 RHECQK-SYDVDYKTDKHFGSKSYSVKRNVKDIQKEIMQNGPVEGAFTVYEDLILYKDG- 277
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+Y + + G +A ++I+GWG EN PY
Sbjct: 278 -----------VYQHVHGRELGGHA------------IRILGWGVENKTPY--------- 305
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
W I +++ +G+ G K+LRG + IES +
Sbjct: 306 -------------------------WLIANSWNTDWGNNGFFKMLRGEDHCGIESAIAAG 340
Query: 242 LPK 244
LPK
Sbjct: 341 LPK 343
>gi|48762476|dbj|BAD23809.1| cathepsin B-S [Tuberaphis styraci]
gi|204022069|dbj|BAG71132.1| cathepsin B-S1 [Tuberaphis styraci]
Length = 349
Score = 97.4 bits (241), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 70/249 (28%), Positives = 102/249 (40%), Gaps = 64/249 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + +G+ TGG + + GC P PPC + P +
Sbjct: 154 CGGGYPIKAWKYFRTQGVTTGGDYDTKEGCMPYKVPPCYDEQGKNT-----CGGKPMERN 208
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
H +C YG+ QD+Y+ K Y +N + I+Q++M GPV A+ +Y D FS
Sbjct: 209 H-QCPKTCYGKTTVQDRYKTKNEYVIN-SIETIEQDLMTYGPVEASFDVYDD-FSV---- 261
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
YKSG+Y + A+ ++KI+GWGEENG PYW V ++
Sbjct: 262 ------------------YKSGIYRKTPKAKYEGGHSIKIIGWGEENGTPYWLAVNSWS- 302
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+W GD GT KI++GRNE IE V
Sbjct: 303 -----------------------KFW----------GDHGTFKIIKGRNECGIERAVTAG 329
Query: 242 LPKDNYGVE 250
+P + G +
Sbjct: 330 IPSTSRGPQ 338
>gi|118358710|ref|XP_001012596.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89294363|gb|EAR92351.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 346
Score = 97.4 bits (241), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 72/227 (31%), Positives = 96/227 (42%), Gaps = 60/227 (26%)
Query: 18 GLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRC-TNDNYGRGFFQ 76
GLVTG + +N+ CQ S PC H + P C T P P C C +N Y + +
Sbjct: 177 GLVTGDLYGNNSWCQAYSLAPCAHHVTSDVYPPC-TGELPTPPCVKSCDSNSTYTIPYPK 235
Query: 77 DKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSD 136
D ++ + Y ++ I EI NGP+ +Y D
Sbjct: 236 DLHKGSKAYSIDQNEQAIMTEIQTNGPIEVAFTVYED----------------------- 272
Query: 137 IFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLI 196
+YKSGVY +E+ +A VK+VGWG ENG PY
Sbjct: 273 FLTYKSGVYQHVTGSELGGHA-VKMVGWGVENGTPY------------------------ 307
Query: 197 GWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
W IV+++ E +GDKGT KILRG+NE IES ALP
Sbjct: 308 ----------WIIVNSWNESWGDKGTFKILRGQNECGIESECVTALP 344
>gi|268557308|ref|XP_002636643.1| Hypothetical protein CBG23351 [Caenorhabditis briggsae]
Length = 351
Score = 97.4 bits (241), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 72/243 (29%), Positives = 103/243 (42%), Gaps = 59/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G W K+G VTGG++ +GC+P +PPC H T C + P KC
Sbjct: 167 CNGGYPIEAWRHYVKKGYVTGGSYQEKSGCKPYPYPPCEHHVNGTHYKPCPSNMYPTDKC 226
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QD + + Y V+ + A+IQ+EIM +GPV +Y D F + SG
Sbjct: 227 EHSC-QAGYPLTYTQDLHFGQSAYAVSKKPAEIQKEIMTHGPVEVAFTVYED-FEHYSG- 283
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
GVY +A A + +A VK++GWG +NG P
Sbjct: 284 ---------------------GVYVHTAGASLGGHA-VKMLGWGVDNGTP---------- 311
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW +++ E +G+ G +I+RG NE IES V G
Sbjct: 312 ------------------------YWLCANSWNEDWGENGYFRIIRGVNECGIESGVVGG 347
Query: 242 LPK 244
PK
Sbjct: 348 TPK 350
>gi|50540542|ref|NP_998501.1| cathepsin B, a precursor [Danio rerio]
gi|34784038|gb|AAH56688.1| Cathepsin B, a [Danio rerio]
gi|37681773|gb|AAQ97764.1| cathepsin B [Danio rerio]
gi|41351445|gb|AAH65589.1| Cathepsin B, a [Danio rerio]
Length = 330
Score = 97.4 bits (241), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 70/242 (28%), Positives = 94/242 (38%), Gaps = 60/242 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S+ W + GLVTGG ++S+ GC+P + PC H + S P C P C
Sbjct: 148 CNGGYPSAAWDFWATEGLVTGGLYNSHIGCRPYTIEPCEH-HVNGSRPPCSGEGGDTPNC 206
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C Y + QDK+ K Y V I E+ KNGPV +Y D YKSG
Sbjct: 207 DMKC-EPGYSPSYKQDKHFGKTSYSVPSNQNSIMAELFKNGPVEGAFTVYEDFLLYKSGV 265
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + S V +KI+GWGEENG P
Sbjct: 266 YQH------------------------MSGSPVGGHAIKILGWGEENGVP---------- 291
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW +++ +GD G KILRG + IES +
Sbjct: 292 ------------------------YWLAANSWNTDWGDNGYFKILRGEDHCGIESEIVAG 327
Query: 242 LP 243
+P
Sbjct: 328 IP 329
>gi|56756907|gb|AAW26625.1| unknown [Schistosoma japonicum]
Length = 342
Score = 97.4 bits (241), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 70/243 (28%), Positives = 104/243 (42%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G +W + RG+VTGG+ ++TGC+P FP C+H C P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ C Y + QDK+ Y V + IQ++IM +GPV A + +Y D +YKSG
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 275
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ Y +G Y ++ V+++GWG EN
Sbjct: 276 ---------------IYRYTTGKY--------ISGHAVRLIGWGVEN------------- 299
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
G YW +T+ E +G+KG +I+RGRNE +IES +
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAG 338
Query: 242 LPK 244
L K
Sbjct: 339 LIK 341
>gi|313229093|emb|CBY18245.1| unnamed protein product [Oikopleura dioica]
Length = 355
Score = 97.4 bits (241), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 75/240 (31%), Positives = 100/240 (41%), Gaps = 60/240 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + + RGLVTGG + + CQP + C H + P C T PKC
Sbjct: 163 CNGGYPLAAMEYFVTRGLVTGGLYGTKDTCQPYTLEACEH-HVPGDRPPC-TEGGGTPKC 220
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C D + + DK + Y V ++V IQQEIM GPV A +YS
Sbjct: 221 SHQCIPDYTTKAYKDDKVHGHKAYSVPNDVGKIQQEIMHYGPVEAAFTVYS--------- 271
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
D SYKSGVY ++ +E+ +A +KI+GWG E G
Sbjct: 272 --------------DFPSYKSGVYRHTSGSELGGHA-IKIIGWGTEGG------------ 304
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW I +++ +GDKGT KILRG NE IE V A
Sbjct: 305 ----------------------DDYWLINNSWNSDWGDKGTFKILRGSNECGIEGEVVAA 342
>gi|209863079|ref|NP_001119613.2| cathepsin B precursor [Acyrthosiphon pisum]
Length = 323
Score = 97.4 bits (241), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 72/246 (29%), Positives = 97/246 (39%), Gaps = 66/246 (26%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPK 60
C G + W +G+VTGG SN GCQP PC+H Y S C +L Q
Sbjct: 134 CDGGSAFKAWELTMNKGIVTGGNFDSNEGCQPYKNRPCDH--YGDSRLTNCSSLRRTQMT 191
Query: 61 -CHTRCTNDNYGRGFFQDKYRFKRYY---WVNDEVADIQQEIMKNGPVVANMYLYSDIFS 116
C +C N NY + D ++ Y W N V IQQEIM GPV A MY+Y +
Sbjct: 192 VCRKKCVNKNYKVKYEDDLHKTSIVYMTSWTN--VKQIQQEIMTYGPVTAFMYVYENFMG 249
Query: 117 YKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
YK G Y S + E++ Y VK++GWG +
Sbjct: 250 YKEGIYK------------------------STTGELIGYHHVKLIGWGVDG-------- 277
Query: 177 RVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
+G YW ++++ +G+ G KILRG N IE
Sbjct: 278 -------------------------DGTEYWLAMNSWNSNWGNDGLFKILRGYNFCSIEL 312
Query: 237 LVNGAL 242
LV +
Sbjct: 313 LVMAGI 318
>gi|170586854|ref|XP_001898194.1| cathepsin B-like cysteine proteinase [Brugia malayi]
gi|158594589|gb|EDP33173.1| cathepsin B-like cysteine proteinase, putative [Brugia malayi]
Length = 384
Score = 97.4 bits (241), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 74/246 (30%), Positives = 110/246 (44%), Gaps = 58/246 (23%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPK 60
C G + W + G+VTG + +++GC+P FPPC +H+N T EP CK P PK
Sbjct: 190 CFGGEPMAAWKYWVLSGIVTGSDYTNHSGCRPYPFPPCEHHSNKTHYEP-CKHDLYPTPK 248
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C+ +C + NY + + DKY ++ Y V ++V IQ+EIM GPV A+ +Y+D Y SG
Sbjct: 249 CYKQC-DKNYTKSYKADKYYGEQAYNVENDVESIQKEIMTLGPVEASFEVYTDFLHYTSG 307
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
+Y + G +A VKI+GWG + G YW +
Sbjct: 308 ------------IYKHVAGSVGGGHA------------VKILGWGIDQGVSYWLAANSWN 343
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
WGE+ F G +ILRG +E IES +
Sbjct: 344 ND---------------WGED----------VFS------GYFRILRGADECGIESGIVA 372
Query: 241 ALPKDN 246
+P+ +
Sbjct: 373 GIPRKD 378
>gi|74213457|dbj|BAE35542.1| unnamed protein product [Mus musculus]
Length = 339
Score = 97.4 bits (241), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 71/251 (28%), Positives = 108/251 (43%), Gaps = 62/251 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W + K+GLV+GG ++S+ GC P + PPC H + S P C T +C
Sbjct: 150 CNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEH-HVNGSRPPC-TGEGDTHRC 207
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ C Y + +DK+ Y V++ V +I EI KNGPV ++SD +YKSG
Sbjct: 208 NKSC-EAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNGPVEGAFTVFSDFLTYKSG- 265
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
++ +++G +++ ++I+GWG ENG PYW + +
Sbjct: 266 ---------------VYKHEAG--------DMMGGHAIRILGWGVENGVPYWLAANSWNL 302
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+GD G KILRG N IES +
Sbjct: 303 ----------------------------------DWGDNGFFKILRGENHCGIESEIVAG 328
Query: 242 LPK-DNYGVEF 251
+P+ D Y F
Sbjct: 329 IPRTDQYWGRF 339
>gi|226473756|emb|CAX71563.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 70/243 (28%), Positives = 104/243 (42%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G +W + RG+VTGG+ ++TGC+P FP C+H C P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ C Y + QDK+ Y V + IQ++IM +GPV A + +Y D +YKSG
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 275
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ Y +G Y ++ V+++GWG EN
Sbjct: 276 ---------------IYRYTTGKY--------ISGHAVRLIGWGVEN------------- 299
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
G YW +T+ E +G+KG +I+RGRNE +IES +
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAG 338
Query: 242 LPK 244
L K
Sbjct: 339 LIK 341
>gi|40557606|gb|AAR88096.1| cathepsin B-like cysteine protease [Callosobruchus maculatus]
Length = 330
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 71/244 (29%), Positives = 103/244 (42%), Gaps = 67/244 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C GI S T+ G V+GG ++S GC P CN P CKTL P C
Sbjct: 153 CHGGIPSFTFTEWKDSGFVSGGEYNSTNGCMSYPLPRCN--------PSCKTLYDA-PTC 203
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVA-DIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C + + + +DK+ K+ Y + +V IQ EI+KNGPVV
Sbjct: 204 KKECDKGSPLK-YEEDKHYAKQAYRIMSKVERQIQLEIIKNGPVV--------------- 247
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
A+ +Y+D Y SGVY ++++ V+I+GWG ENG
Sbjct: 248 --------ASFTVYADFIHYLSGVYKFDGESKLLGGHAVRIIGWGIENGT---------- 289
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
PYW + +++ E++GD+G KI RG+NE IE +
Sbjct: 290 -----------------------YPYWLVSNSWNERWGDQGLFKIWRGKNECGIEEEITA 326
Query: 241 ALPK 244
LP+
Sbjct: 327 GLPR 330
>gi|56757271|gb|AAW26807.1| unknown [Schistosoma japonicum]
Length = 342
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 67/237 (28%), Positives = 104/237 (43%), Gaps = 60/237 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G++ +W + K G+VTGG+ ++TGC+P FP C+H C P+C
Sbjct: 159 CDGGVTGYSWDYWVKHGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ + Y V + IQ+EIM GPV A +++Y D +YKSG
Sbjct: 218 KQTCQK-GYNTSYEQDKHYGEFSYNVIGVESVIQKEIMMYGPVEAYLHIYEDFLNYKSG- 275
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ Y +G + ++ V+++GWG EN
Sbjct: 276 ---------------IYRYTTG--------QFISGHAVRLIGWGVEN------------- 299
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
G YW +T+ E +G+KG +I+RGR+E +IES +
Sbjct: 300 ---------------------GTSYWLAANTWNEDWGEKGYFRIVRGRDECLIESFI 335
>gi|56759588|gb|AAW28820.1| Parcxpwnx02 [Periplaneta americana]
Length = 343
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 69/243 (28%), Positives = 108/243 (44%), Gaps = 62/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + G+V+GG+++S+ GCQP + PC H T +P C T P+C
Sbjct: 162 CNGGEPGAAWDYWVSTGIVSGGSYNSHQGCQPYAIEPCEHHVNGTRKP-CGEGDT--PRC 218
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
RC + Y + +D++ K Y V V IQ+E++ NGP A + +Y D Y++G
Sbjct: 219 VKRC-EEGYDVPYGKDRHFGKSAYAVPGSVKAIQKELLLNGPAEAALTVYDDFLHYRTGV 277
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + VS A + V+++GWG E+G P
Sbjct: 278 YQH----------------------VSGGA--LGGHAVRLLGWGVEDGTP---------- 303
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW + +++ +GD G +ILRG++E IES +NG
Sbjct: 304 ------------------------YWLLANSWNYDWGDNGYFRILRGQDECGIESDINGG 339
Query: 242 LPK 244
LPK
Sbjct: 340 LPK 342
>gi|56754499|gb|AAW25437.1| unknown [Schistosoma japonicum]
Length = 342
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 70/243 (28%), Positives = 104/243 (42%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G +W + RG+VTGG+ ++TGC+P FP C+H C P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ C Y + QDK+ Y V + IQ++IM +GPV A + +Y D +YKSG
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 275
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ Y +G Y ++ V+++GWG EN
Sbjct: 276 ---------------IYRYTTGKY--------ISGHAVRLIGWGVEN------------- 299
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
G YW +T+ E +G+KG +I+RGRNE +IES +
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAG 338
Query: 242 LPK 244
L K
Sbjct: 339 LIK 341
>gi|204022102|dbj|BAG71148.1| cathepsin B-N2 [Tuberaphis takenouchii]
Length = 334
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 75/245 (30%), Positives = 107/245 (43%), Gaps = 67/245 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W K GLVTGG ++S GCQP PPC Y + C+ P K
Sbjct: 154 CHGGYPIKAWERFRKHGLVTGGDYNSGEGCQPYRVPPCPFDEYGNNT--CR--GKPAEKN 209
Query: 62 HTRCTNDNYGRGF--FQDKYRFKR-YYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
H RCT YG F++ +R+ R Y++N ++ IQ ++M GP+ A SY
Sbjct: 210 H-RCTRMCYGNQNLDFKEDHRYTRDAYYLNYQI--IQNDLMTYGPIEA---------SYD 257
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
+Y D +YKSGVY + +A + VK++GWGEE G PY
Sbjct: 258 --------------VYDDFPNYKSGVYMKTENASYLGGHAVKLIGWGEEYGVPY------ 297
Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
W +V+++ +Q+GD+G KI RG NE I++
Sbjct: 298 ----------------------------WLLVNSWNDQWGDQGLFKIRRGTNECGIDNST 329
Query: 239 NGALP 243
G +P
Sbjct: 330 TGGVP 334
>gi|29374025|gb|AAO73003.1| cathepsin B [Fasciola gigantica]
Length = 339
Score = 97.1 bits (240), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 64/243 (26%), Positives = 97/243 (39%), Gaps = 59/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + + G+VTGG + TGCQP F C+H + C P P C
Sbjct: 155 CRGGYPPKAWDYWMREGIVTGGTWENRTGCQPWMFTKCDHVGDSRKYSRCPHYTYPTPPC 214
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + + QDK+ Y V + + I QEIMKNGPV ++ D Y+SG
Sbjct: 215 ARACQT-GYNKTYEQDKFYGNSSYNVGEHESYIMQEIMKNGPVEVTFAIFQDFGVYRSGI 273
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + + + V+++GWG EN
Sbjct: 274 YHH------------------------VAGKFIGRHAVRMIGWGVEN------------- 296
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
G YW + +++ E++G+ G +++RGRNE IES V
Sbjct: 297 ---------------------GVNYWLMANSWNEEWGENGYFRMVRGRNECGIESEVVAG 335
Query: 242 LPK 244
+P+
Sbjct: 336 MPR 338
>gi|5764077|emb|CAB53367.1| necpain [Necator americanus]
Length = 339
Score = 97.1 bits (240), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 62/241 (25%), Positives = 102/241 (42%), Gaps = 59/241 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
CS G W WV K G+ TGG + + C+P +F PC + C + P P+C
Sbjct: 155 CSGGWPFQAWEWVRKYGVCTGGDYRAKGVCKPYAFHPCGNHENQVYYGVCPKGSWPTPRC 214
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + + +DK+ K+ YW+ ++ +I+ +IMKNGPV A +Y D YK G
Sbjct: 215 EKFCQR-GYIKPYKKDKFYAKKSYWLPNDEKEIRLDIMKNGPVQAAFDVYEDFKLYKRG- 272
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ +K G+ + VKI+GWG++NG Y
Sbjct: 273 ---------------IYKHKEGIQTGGHA--------VKIIGWGKDNGTDY--------- 300
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
W I +++ + +G+ G +++RG N+ IE ++
Sbjct: 301 -------------------------WLIANSWSKDWGESGFFRMVRGENDCEIEDMITAG 335
Query: 242 L 242
+
Sbjct: 336 I 336
>gi|161671340|gb|ABX75522.1| cathepsin b [Lycosa singoriensis]
Length = 247
Score = 97.1 bits (240), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 70/243 (28%), Positives = 96/243 (39%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G S W + +G+ TGG +S+ GCQP P C H + T P C + PKC
Sbjct: 65 CDGGFPPSAWEFWVDKGIATGGLWNSHIGCQPYEIPACEH-HTTGDRPPCSDIVD-TPKC 122
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + DK+ K+ Y + IQ EI KNGPV +YSD +YKSG
Sbjct: 123 VHLCEK-GYNTSYRDDKHFGKKSYSIESLEQQIQTEIFKNGPVEGAFSVYSDFINYKSGV 181
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + S E + ++++GWG EN P
Sbjct: 182 YQH------------------------HSGESLGGHAIRVLGWGYENDVP---------- 207
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW +++ +GDKG KILRG +E IES +
Sbjct: 208 ------------------------YWLCANSWNTDWGDKGYFKILRGSDECGIESSIVAG 243
Query: 242 LPK 244
+PK
Sbjct: 244 IPK 246
>gi|289743429|gb|ADD20462.1| putative cathepsin B-like cysteine proteinase precursor [Glossina
morsitans morsitans]
Length = 340
Score = 97.1 bits (240), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 66/243 (27%), Positives = 102/243 (41%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++G+V+GG + S+ GC+P PC H + + P C+ P+C
Sbjct: 157 CNGGFPGAAWGYWVRKGIVSGGPYGSSQGCRPYEIAPCEH-HVNGTRPPCEKEYGKTPRC 215
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C +Y + DK+ R Y ++ V DIQ EIM NGPV +Y
Sbjct: 216 QHKC-QASYKVDYKTDKHFGSRAYSISKNVRDIQGEIMTNGPVEGAFTVY---------- 264
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
D+ YK GVY E+ +A ++I+GWG E PY
Sbjct: 265 -------------EDLILYKDGVYEHVHGKELGGHA-IRIIGWGVEKDTPY--------- 301
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
W I +++ +G+ G KILRG++ IES ++
Sbjct: 302 -------------------------WLIANSWNTDWGNNGFFKILRGKDHCGIESSISAG 336
Query: 242 LPK 244
LPK
Sbjct: 337 LPK 339
>gi|226473762|emb|CAX71566.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
gi|226474170|emb|CAX71571.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 97.1 bits (240), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 70/243 (28%), Positives = 104/243 (42%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G +W + RG+VTGG+ ++TGC+P FP C+H C P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ C Y + QDK+ Y V + IQ++IM +GPV A + +Y D +YKSG
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYIEIYEDFLNYKSG- 275
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ Y +G Y ++ V+++GWG EN
Sbjct: 276 ---------------IYRYTTGKY--------ISGHAVRLIGWGVEN------------- 299
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
G YW +T+ E +G+KG +I+RGRNE +IES +
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAG 338
Query: 242 LPK 244
L K
Sbjct: 339 LIK 341
>gi|226474184|emb|CAX71578.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 97.1 bits (240), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 70/243 (28%), Positives = 104/243 (42%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G +W + RG+VTGG+ ++TGC+P FP C+H C P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ C Y + QDK+ Y V + IQ++IM +GPV A + +Y D +YKSG
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMVHGPVEAYLEIYEDFLNYKSG- 275
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ Y +G Y ++ V+++GWG EN
Sbjct: 276 ---------------IYRYTTGKY--------ISGHAVRLIGWGVEN------------- 299
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
G YW +T+ E +G+KG +I+RGRNE +IES +
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAG 338
Query: 242 LPK 244
L K
Sbjct: 339 LIK 341
>gi|4204370|gb|AAD11445.1| cathepsin B protease, partial [Fasciola hepatica]
Length = 247
Score = 97.1 bits (240), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 64/243 (26%), Positives = 97/243 (39%), Gaps = 59/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + + G+VTGG + TGCQP F C+H + C P P C
Sbjct: 63 CRGGYPPKAWDYWMREGIVTGGTWENRTGCQPWMFTKCDHVGDSRKYSRCPHYTYPTPPC 122
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + + QDK+ Y V + + I QEIMKNGPV ++ D Y+SG
Sbjct: 123 ARAC-QTGYNKTYEQDKFYGNSSYNVGEHESYIMQEIMKNGPVEVTFAIFQDFGVYRSGI 181
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + + + V+++GWG EN
Sbjct: 182 YHH------------------------VAGKFIGRHAVRMIGWGVEN------------- 204
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
G YW + +++ E++G+ G +++RGRNE IES V
Sbjct: 205 ---------------------GVNYWLMANSWNEEWGENGYFRMVRGRNECGIESEVVAG 243
Query: 242 LPK 244
+P+
Sbjct: 244 MPR 246
>gi|45822211|emb|CAE47502.1| cathepsin B-like proteinase [Diabrotica virgifera virgifera]
Length = 331
Score = 97.1 bits (240), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 74/242 (30%), Positives = 103/242 (42%), Gaps = 65/242 (26%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W + G+ TGG + S GCQP S PC H + ++ +C TL P C
Sbjct: 149 CEGGYPTMAWSYWIDTGITTGGLYGSKQGCQPYSLQPCEH-HTEGNKVQCSTLDYDTPSC 207
Query: 62 HTRCTND--NYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
+C + NY + +Y VA+IQ+EI+ NGPV A +YSD +YKS
Sbjct: 208 KHKCDDSALNYKSELTFGSGSVRNFY----SVANIQKEILTNGPVEAAFDVYSDFVNYKS 263
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
G Y + VA YL G +A V+I+GWGEE+G P
Sbjct: 264 GVYQH---VAGEYL---------GGHA------------VRILGWGEESGVP-------- 291
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
YW + +++ E +GDKG KI RG NE+ E +
Sbjct: 292 --------------------------YWLVANSWNEDWGDKGLFKIRRGNNESGFEDSIV 325
Query: 240 GA 241
A
Sbjct: 326 AA 327
>gi|171474007|gb|AAX31052.2| SJCHGC09761 protein [Schistosoma japonicum]
Length = 342
Score = 97.1 bits (240), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 70/243 (28%), Positives = 103/243 (42%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G +W + RG+VTGG+ ++TGC+P FP C+H C P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ Y V + IQ++IM +GPV A + +Y D +YKSG
Sbjct: 218 KQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 275
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ Y +G Y ++ V+++GWG EN
Sbjct: 276 ---------------IYRYTTGKY--------ISGHAVRLIGWGVEN------------- 299
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
G YW +T+ E +G+KG +I+RGRNE +IES +
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAG 338
Query: 242 LPK 244
L K
Sbjct: 339 LIK 341
>gi|56756475|gb|AAW26410.1| unknown [Schistosoma japonicum]
Length = 342
Score = 96.7 bits (239), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 70/243 (28%), Positives = 103/243 (42%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G +W + RG+VTGG+ ++TGC+P FP C+H C P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ Y V + IQ++IM +GPV A + +Y D +YKSG
Sbjct: 218 KQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 275
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ Y +G Y ++ V+++GWG EN
Sbjct: 276 ---------------IYRYTTGKY--------ISGHAVRLIGWGVEN------------- 299
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
G YW +T+ E +G+KG +I+RGRNE +IES +
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAG 338
Query: 242 LPK 244
L K
Sbjct: 339 LIK 341
>gi|56758864|gb|AAW27572.1| unknown [Schistosoma japonicum]
Length = 342
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 70/243 (28%), Positives = 104/243 (42%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G +W + RG+VTGG+ ++TGC+P FP C+H C P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ C Y + QDK+ Y V + IQ++IM +GPV A + +Y D +YKSG
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLGIESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 275
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ Y +G Y ++ V+++GWG EN
Sbjct: 276 ---------------IYRYTTGKY--------ISGHAVRLIGWGVEN------------- 299
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
G YW +T+ E +G+KG +I+RGRNE +IES +
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAG 338
Query: 242 LPK 244
L K
Sbjct: 339 LIK 341
>gi|226474172|emb|CAX71572.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 69/243 (28%), Positives = 104/243 (42%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G +W + RG+VTGG+ ++TGC+P FP C+H C P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ C Y + QDK+ Y V + IQ++IM +GPV A + +Y D +YKSG
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 275
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ Y +G + ++ V+++GWG EN
Sbjct: 276 ---------------IYRYTTG--------QFISGHAVRLIGWGVEN------------- 299
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
G YW +T+ E +G+KG +I+RGRNE +IES +
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAG 338
Query: 242 LPK 244
L K
Sbjct: 339 LIK 341
>gi|56752925|gb|AAW24674.1| unknown [Schistosoma japonicum]
Length = 342
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 68/237 (28%), Positives = 102/237 (43%), Gaps = 60/237 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G++ +W + K G+VTGG+ ++TGC+P FP C+H C P+C
Sbjct: 159 CDGGVTGYSWDYWVKHGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ Y V + IQ+EIM GPV A + +Y D +YKSG
Sbjct: 218 KQTCQK-GYNTSYEQDKHYGGFSYSVIGVESAIQKEIMMYGPVEAYLQIYEDFLNYKSG- 275
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ Y +G Y ++ V+++GWG EN
Sbjct: 276 ---------------IYRYTTGKY--------ISGHAVRLIGWGVEN------------- 299
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
G YW +T+ E +G+KG +I+RGR+E +IES +
Sbjct: 300 ---------------------GTSYWLAANTWNEDWGEKGYFRIVRGRDECLIESFI 335
>gi|226474174|emb|CAX71573.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 69/243 (28%), Positives = 104/243 (42%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G +W + RG+VTGG+ ++TGC+P FP C+H C P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ C Y + QDK+ Y V + IQ++IM +GPV A + +Y D +YKSG
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 275
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ Y +G + ++ V+++GWG EN
Sbjct: 276 ---------------IYRYTTG--------QFISGHAVRLIGWGVEN------------- 299
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
G YW +T+ E +G+KG +I+RGRNE +IES +
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAG 338
Query: 242 LPK 244
L K
Sbjct: 339 LIK 341
>gi|301776581|ref|XP_002923704.1| PREDICTED: cathepsin B-like [Ailuropoda melanoleuca]
gi|281347694|gb|EFB23278.1| hypothetical protein PANDA_012896 [Ailuropoda melanoleuca]
Length = 339
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 75/242 (30%), Positives = 104/242 (42%), Gaps = 61/242 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + K+GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGFPAEAWNFWTKQGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + +DK+ Y V+ +I EI KNGPV A +YSD YKSG
Sbjct: 208 SKFC-EPGYTPSYKEDKHYGCSSYSVSSSEKEIMAEIYKNGPVEAAFTVYSDFLLYKSGV 266
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + E++ V+I+GWG ENG PYW
Sbjct: 267 YQH------------------------VTGEMMGGHAVRILGWGVENGTPYW-------- 294
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
L+G +++ +GD G KILRGR+ IES +
Sbjct: 295 -------------LVG-------------NSWNTDWGDNGFFKILRGRDHCGIESEIVAG 328
Query: 242 LP 243
+P
Sbjct: 329 IP 330
>gi|56758716|gb|AAW27498.1| unknown [Schistosoma japonicum]
Length = 342
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 68/237 (28%), Positives = 102/237 (43%), Gaps = 60/237 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G++ +W + K G+VTGG+ ++TGC+P FP C+H C P+C
Sbjct: 159 CDGGVTGYSWDYWVKHGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ Y V + IQ+EIM GPV A + +Y D +YKSG
Sbjct: 218 KQTCQK-GYNTSYEQDKHYGGFSYSVIGVESAIQKEIMMYGPVEAYLEIYEDFLNYKSG- 275
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ Y +G Y ++ V+++GWG EN
Sbjct: 276 ---------------IYRYTTGKY--------ISGHAVRLIGWGVEN------------- 299
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
G YW +T+ E +G+KG +I+RGR+E +IES +
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRDECLIESFI 335
>gi|389611087|dbj|BAM19154.1| cathepsin B [Papilio polytes]
Length = 334
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 69/242 (28%), Positives = 99/242 (40%), Gaps = 61/242 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G+ + W + GLV+GG+++S GC+P PPC H P + T PKC
Sbjct: 151 CNGGMPTLAWEYWKHFGLVSGGSYNSTQGCRPYEIPPCEHHVPGNRLP--CSGDTKTPKC 208
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C DNY + QDK+ K Y V I+ E+ KNGPV +Y+D+ SYKSG
Sbjct: 209 IKKC-EDNYNVAYKQDKHYGKHIYSVRGGEDHIKAELYKNGPVEGAFTVYADLLSYKSGV 267
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + + + +KI+GWG ENG Y
Sbjct: 268 YKH------------------------VAGDALGGHAIKIMGWGVENGNKY--------- 294
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
W I +++ +GD G KILRG + IES +
Sbjct: 295 -------------------------WLIANSWNSDWGDNGFFKILRGEDHCGIESSIVAG 329
Query: 242 LP 243
P
Sbjct: 330 EP 331
>gi|204022071|dbj|BAG71133.1| cathepsin B-S2 [Tuberaphis coreana]
Length = 334
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 69/242 (28%), Positives = 103/242 (42%), Gaps = 64/242 (26%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + +G+ TGG +++ GC P PPC + E C P +
Sbjct: 154 CGGGNPMKAWEYFRTQGVTTGGDYNTKEGCMPYKVPPCRNKQ---GENICD--EQPMERN 208
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
H +C YG+ Q++Y+ K Y++N + I+Q DI +Y
Sbjct: 209 H-QCPKTCYGKTTVQNRYKTKSEYYINS-IKTIEQ----------------DIKTY---- 246
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
GPV A+ Y D+ YKSG+Y S +A+ ++KI+GWG+E+G PYW V ++
Sbjct: 247 ---GPVEASFDCYDDLSVYKSGIYRKSPNAKYKGGHSIKIIGWGQEDGTPYWLAVNSWS- 302
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+W GD GT KI++GRNE IE V
Sbjct: 303 -----------------------KFW----------GDHGTFKIIKGRNECGIERAVTAG 329
Query: 242 LP 243
+P
Sbjct: 330 IP 331
>gi|121073189|gb|ABM47071.1| cathepsin B2 [Clonorchis sinensis]
gi|358341868|dbj|GAA36574.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 343
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 70/242 (28%), Positives = 92/242 (38%), Gaps = 61/242 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + G+VTGG+ TGC+P FP C H + P C P PKC
Sbjct: 155 CDGGFPPMAWDFWKTHGIVTGGSKEEPTGCRPYPFPKCQHHS-QGHYPPCPRRIYPTPKC 213
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C D + +DK R Y V+ I +EI+ NGPV A ++ D YKSG
Sbjct: 214 VKHC--DTPKIDYQKDKTRANTSYNVHQSEVAIMKEILLNGPVEATFEVHEDFPEYKSGI 271
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y A V ++I+GWGEENG PY
Sbjct: 272 Y------------------------FHAWGGSVGGHAIRILGWGEENGVPY--------- 298
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
W I +++ E +G+KG ++ LRG NE IE
Sbjct: 299 -------------------------WLIANSWNEDWGEKGYLRFLRGHNECGIEEEATAG 333
Query: 242 LP 243
LP
Sbjct: 334 LP 335
>gi|56756380|gb|AAW26363.1| unknown [Schistosoma japonicum]
Length = 342
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 69/243 (28%), Positives = 104/243 (42%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G +W + RG+VTGG+ ++TGC+P FP C+H C P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ C Y + QDK+ Y V + IQ++IM +GPV A + +Y D +YKSG
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 275
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ Y +G + ++ V+++GWG EN
Sbjct: 276 ---------------IYRYTTG--------QFISGHAVRLIGWGVEN------------- 299
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
G YW +T+ E +G+KG +I+RGRNE +IES +
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAG 338
Query: 242 LPK 244
L K
Sbjct: 339 LIK 341
>gi|56752787|gb|AAW24605.1| unknown [Schistosoma japonicum]
Length = 309
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 69/243 (28%), Positives = 104/243 (42%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G +W + RG+VTGG+ ++TGC+P FP C+H C P+C
Sbjct: 126 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 184
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ C Y + QDK+ Y V + IQ++IM +GPV A + +Y D +YKSG
Sbjct: 185 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 242
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ Y +G + ++ V+++GWG EN
Sbjct: 243 ---------------IYRYTTG--------QFISGHAVRLIGWGVEN------------- 266
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
G YW +T+ E +G+KG +I+RGRNE +IES +
Sbjct: 267 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAG 305
Query: 242 LPK 244
L K
Sbjct: 306 LIK 308
>gi|226473758|emb|CAX71564.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 69/243 (28%), Positives = 104/243 (42%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G +W + RG+VTGG+ ++TGC+P FP C+H C P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ C Y + QDK+ Y V + IQ++IM +GPV A + +Y D +YKSG
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 275
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ Y +G + ++ V+++GWG EN
Sbjct: 276 ---------------IYRYTTG--------QFISGHAVRLIGWGVEN------------- 299
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
G YW +T+ E +G+KG +I+RGRNE +IES +
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAG 338
Query: 242 LPK 244
L K
Sbjct: 339 LIK 341
>gi|56759488|gb|AAW27884.1| unknown [Schistosoma japonicum]
Length = 342
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 70/243 (28%), Positives = 103/243 (42%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G +W + RG+VTGG+ ++TGC+P FP C+H C P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ Y V + IQ++IM +GPV A + +Y D +YKSG
Sbjct: 218 KQTCQK-GYNTSYEQDKHYGGFSYNVLGIESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 275
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ Y +G Y ++ V+++GWG EN
Sbjct: 276 ---------------IYRYTTGKY--------ISGHAVRLIGWGVEN------------- 299
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
G YW +T+ E +G+KG +I+RGRNE +IES +
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAG 338
Query: 242 LPK 244
L K
Sbjct: 339 LIK 341
>gi|118364222|ref|XP_001015333.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89297100|gb|EAR95088.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 341
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 75/242 (30%), Positives = 99/242 (40%), Gaps = 62/242 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G +S + K GLVTG +++ CQ SF PC H T P C T P PKC
Sbjct: 160 CNGGYPASAMSYYVKTGLVTGDLYNTTGWCQAYSFAPCAHHVDTPLYPAC-TGELPTPKC 218
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C + G G ++ + Y V I EI NGPV A +Y D +YKSG
Sbjct: 219 AKTC---DSGSGQTYTVHKGSKAYSVGKTQEAIMTEIQTNGPVEAAFTVYEDFLNYKSG- 274
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+Y + G +A +KIVGWG EN PY
Sbjct: 275 -----------VYKHVTGKALGGHA------------IKIVGWGVENNTPY--------- 302
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
W +V+++ + +GD GT KILRG+NE IE+ V A
Sbjct: 303 -------------------------WIVVNSWNQTWGDNGTFKILRGKNECGIEAQVVTA 337
Query: 242 LP 243
LP
Sbjct: 338 LP 339
>gi|309202|gb|AAA37494.1| mouse preprocathepsin B [Mus musculus]
Length = 339
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 71/251 (28%), Positives = 107/251 (42%), Gaps = 62/251 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W + K+GLV+GG + S+ GC P + PPC H + S P C T P+C
Sbjct: 150 CNGGYPSGAWNFWTKKGLVSGGVYDSHIGCLPYTIPPCEH-HVNGSRPPC-TGEGDTPRC 207
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ C Y + +DK+ Y V++ V +I EI KNGPV ++SD +YKSG
Sbjct: 208 NKSC-EAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNGPVEGAFTVFSDFLTYKSG- 265
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
++ +++G +++ ++I+ WG ENG PYW + +
Sbjct: 266 ---------------VYKHEAG--------DMMGGHAIRILVWGVENGVPYWLAANSWNL 302
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+GD G KILRG N IES +
Sbjct: 303 ----------------------------------DWGDNGFFKILRGENHCGIESEIVAG 328
Query: 242 LPK-DNYGVEF 251
+P+ D Y F
Sbjct: 329 IPRTDQYWGRF 339
>gi|74221319|dbj|BAE42140.1| unnamed protein product [Mus musculus]
Length = 339
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 70/251 (27%), Positives = 107/251 (42%), Gaps = 62/251 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W + K+GLV+GG ++S+ GC P + PPC H + S P C T P+C
Sbjct: 150 CNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEH-HVNGSRPPC-TGEGDTPRC 207
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ C Y + +DK+ Y V++ V +I EI KN PV ++SD +YKSG
Sbjct: 208 NKSC-EAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNDPVEGAFTVFSDFLTYKSG- 265
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
++ +++G +++ ++I+GWG NG PYW + +
Sbjct: 266 ---------------VYKHEAG--------DMMGGHAIRILGWGVGNGVPYWLAANSWNL 302
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+GD G KILRG N IES +
Sbjct: 303 ----------------------------------DWGDNGFFKILRGENHCGIESEIVAG 328
Query: 242 LPK-DNYGVEF 251
+P+ D Y F
Sbjct: 329 IPRTDQYWGRF 339
>gi|221107055|ref|XP_002166984.1| PREDICTED: cathepsin B-like [Hydra magnipapillata]
Length = 330
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 70/237 (29%), Positives = 99/237 (41%), Gaps = 61/237 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G W + G VTGG ++S+ GCQP P C H + +P C+ + P PKC
Sbjct: 148 CNGGRLGPAWNFFKYAGAVTGGQYNSSEGCQPYEIPSCEHHTSGSKKP-CEG-SEPTPKC 205
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C + Y + DK++ +Y + ++ I+ EI NGPV A +YSD +YKSG
Sbjct: 206 KRSC-REGYNVSYSDDKHKVSSHYSIANDEEQIKNEIYLNGPVEAAFTVYSDFPNYKSG- 263
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
++ Y +G + +KI+GWG EN PYW + +
Sbjct: 264 ---------------VYKYTTG--------NALGGHAIKILGWGVENNVPYWLVANSW-- 298
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
P W GDKG KILRG NE IE+ V
Sbjct: 299 ----------------------NPDW----------GDKGFFKILRGSNECGIEASV 323
>gi|14141821|gb|AAK07477.2|AF329480_1 probable cathepsin B-like cysteine proteinase precursor [Glossina
morsitans morsitans]
gi|289743431|gb|ADD20463.1| putative cathepsin B-like cysteine proteinase precursor [Glossina
morsitans morsitans]
Length = 340
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 64/243 (26%), Positives = 103/243 (42%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++G+V+GG + S+ GC+P PC H + + P C+ P+C
Sbjct: 157 CNGGFPGAAWSYWVRKGIVSGGPYGSSQGCRPYEIAPCEH-HVNGTRPPCEKEYGKTPRC 215
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C +Y + DK+ R Y ++ V DIQ+EIM +GPV +Y
Sbjct: 216 QHKC-QASYKVDYKTDKHFGSRAYSISKNVHDIQEEIMTHGPVEGAFTVY---------- 264
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
D+ YK GVY E+ +A ++I+GWG E P
Sbjct: 265 -------------EDLILYKDGVYEHVHGKELGGHA-IRIIGWGVEKDIP---------- 300
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW + +++ +G+ G KILRG++ IES ++
Sbjct: 301 ------------------------YWLVANSWNTDWGNNGFFKILRGKDHCGIESSISAG 336
Query: 242 LPK 244
LPK
Sbjct: 337 LPK 339
>gi|330434688|gb|AEC22812.1| cathepsin B [Macrobrachium nipponense]
Length = 331
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 69/244 (28%), Positives = 104/244 (42%), Gaps = 63/244 (25%)
Query: 2 CSSGISSSTW-VWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 60
C+ G + + WVH G+V+GGA +S GCQP PC H + + P+C + PK
Sbjct: 148 CNGGFPGAAFQYWVHS-GIVSGGAFNSTQGCQPYEIAPCEH-HVSGPRPKCAEGGS-TPK 204
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
CH C + NY + D + ++Y V+ + I+ +IM NGPV +Y
Sbjct: 205 CHKNCES-NYVVDYESDLHHGSKHYSVDKDETQIKYDIMTNGPVEGAFTVY--------- 254
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
D YKSGVY + + +A ++++GWGEE+G P
Sbjct: 255 --------------VDFLHYKSGVYQHTHGLPLGGHA-IRVLGWGEEDGTP--------- 290
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
YW +++ +GD G KILRG + IES ++
Sbjct: 291 -------------------------YWLCANSWNTDWGDNGYFKILRGSDHCGIESEISA 325
Query: 241 ALPK 244
LPK
Sbjct: 326 GLPK 329
>gi|194246069|gb|ACF35526.1| putative cathepsin B-like cysteine protease form 1 [Dermacentor
variabilis]
Length = 277
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 71/244 (29%), Positives = 102/244 (41%), Gaps = 63/244 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G S+ W + G+VTGG + ++ GCQP FPPC H + P C T P PKC
Sbjct: 94 CFGGYPSAAWDYYKDEGIVTGGLYGTDDGCQPYYFPPCEH-HTKGPLPNC-TDTKPTPKC 151
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + + +DKY K Y ++ + I+ EI
Sbjct: 152 LQVCRK-GYEKSYSEDKYFAKTVYSLHSDETQIKTEI----------------------- 187
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y NGPV A+ +Y+D +YKSGVY S E+ W + W + R
Sbjct: 188 YKNGPVEADFSVYTDFLAYKSGVYQ-RHSYEL----------WEARHQNLGWALKR---- 232
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
R W + +++ + +GDKG KI RG NE IE+ +N
Sbjct: 233 ----------------------RSVWLVANSWNQDWGDKGYFKIRRGNNECGIENDINAG 270
Query: 242 LPKD 245
+PK+
Sbjct: 271 IPKE 274
>gi|338815385|gb|AEJ08755.1| cathepsin B [Crassostrea ariakensis]
Length = 341
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 74/243 (30%), Positives = 102/243 (41%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G S+ W + + GLVTGG ++S+ GCQP + C+H +P C P PKC
Sbjct: 158 CEGGFPSAAWSYYKRDGLVTGGQYNSHQGCQPYTIKACDHHVVGKLQP-CSKDIGPTPKC 216
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + +DK+ Y V+ V I EIM NGPV G
Sbjct: 217 KHTC-EAGYNVTYEKDKHYGMSAYSVHG-VEKIMTEIMTNGPV--------------EGA 260
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+ +Y+D YKSGVY + + +A +KI+GWG ENG YW + +
Sbjct: 261 F---------TVYADFPQYKSGVYKHTTGQPLGGHA-IKILGWGTENGDDYWLVANSW-- 308
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
P W GD+G KILRG++E IES ++
Sbjct: 309 ----------------------NPDW----------GDQGFFKILRGQDECGIESQISAG 336
Query: 242 LPK 244
PK
Sbjct: 337 EPK 339
>gi|193716207|ref|XP_001950562.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
Length = 340
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 75/249 (30%), Positives = 105/249 (42%), Gaps = 69/249 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G W + RGLVTGG + S GC+P PPC + +E P+ K
Sbjct: 157 CNGGYPIKAWESFNNRGLVTGGDYQSGEGCEPYRVPPCPY----DAEGHNTCAGKPREKN 212
Query: 62 HTRCTNDNYGRGF--FQDKYRFKR--YYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
H RCT YG + D +RF R YY + IQ+++M+ GP+ A+ +Y D
Sbjct: 213 H-RCTRTCYGNQDLDYNDDHRFTRDSYYLT---YSSIQKDVMRYGPIEASFDMYDDF--- 265
Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
P SYKSGVY S +A + VK++GWGEE+G Y
Sbjct: 266 --------P------------SYKSGVYVRSENASYLGGHAVKLIGWGEEHGVLY----- 300
Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
W +V+++ E +GD G KI RG NE I++
Sbjct: 301 -----------------------------WLMVNSWNEGWGDNGLFKIRRGTNECGIDNS 331
Query: 238 VNGALPKDN 246
G +P N
Sbjct: 332 TTGGVPVAN 340
>gi|204022083|dbj|BAG71139.1| cathepsin B-S [Astegopteryx styracophila]
Length = 335
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 68/243 (27%), Positives = 104/243 (42%), Gaps = 66/243 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + +RG+ TGG + SN GC P PPC + + + L +P
Sbjct: 155 CQGGNPIKAWKYFKRRGITTGGDYGSNEGCAPYKVPPC-------YDDQGEFLCQGKPTE 207
Query: 62 HT-RCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
H +C YG +++Y+ + Y V D I+Q DI +Y
Sbjct: 208 HNHKCPRACYGNSTVENRYKVESIY-VLDSFKTIEQ----------------DIRTY--- 247
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
GPV A+ +Y D +YKSG+Y + +A V +VK++GWGEE+G PYW +V ++
Sbjct: 248 ----GPVEASFDVYDDFITYKSGIYQKTPNALYVGGHSVKLIGWGEEDGIPYWLLVNSWS 303
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
+W G++GT +I++GRNE IE
Sbjct: 304 ------------------------KFW----------GEQGTFRIIKGRNECGIERSATA 329
Query: 241 ALP 243
+P
Sbjct: 330 GIP 332
>gi|170028916|ref|XP_001842340.1| cathepsin B [Culex quinquefasciatus]
gi|167879390|gb|EDS42773.1| cathepsin B [Culex quinquefasciatus]
Length = 339
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 71/232 (30%), Positives = 98/232 (42%), Gaps = 66/232 (28%)
Query: 13 WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGR 72
WV GLV+G ++S+ GC+P F PC++ + E K PKC C N Y R
Sbjct: 173 WV-DAGLVSGAPYNSSEGCKPYPFEPCSYP-FVGCHHEKK-----NPKCLHHCIN-GYDR 224
Query: 73 GFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMY 132
+ +DK+ Y + ++ IQ EIM NGPV ++ D + Y SG
Sbjct: 225 KYRKDKFFGATAYKIPNDARMIQLEIMTNGPVATGFEVFEDFYFYHSG------------ 272
Query: 133 LYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYAT 192
+Y + K G++A ++IVGWG ENG PYW I Y
Sbjct: 273 VYKHVVGKKVGMHA------------IRIVGWGTENGTPYWLIANSY------------- 307
Query: 193 VKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
G+ +GDKG K+LRG N IES V LP+
Sbjct: 308 ---------------------GDTWGDKGFFKMLRGSNHLGIESTVIAGLPQ 338
>gi|126116630|gb|ABN79675.1| cathepsin B3 [Clonorchis sinensis]
Length = 337
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 71/243 (29%), Positives = 96/243 (39%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W + G+VTGG+ + TGC+ FP C+H + P C P C
Sbjct: 149 CQGGFPPTAWDFWQTEGIVTGGSKENPTGCRSYPFPRCSHHG-SKKYPPCSHRIYDTPNC 207
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C D + DK R Y V + I +EIM NGPV A +Y D YKSG
Sbjct: 208 VQKC--DTPDTDYATDKTRANITYNVKAKQNAIMKEIMINGPVEAAFQVYEDFLGYKSG- 264
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+Y +SD ++ ++I+GWGEENG YW I +
Sbjct: 265 ---------VYFHSD--------------GTLLGGHAIRILGWGEENGVAYWLIANSWN- 300
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
GWGE+ G K+LRG+NE IE V
Sbjct: 301 --------------DGWGED-------------------GYFKMLRGKNECGIEDEVTAG 327
Query: 242 LPK 244
LP+
Sbjct: 328 LPE 330
>gi|226474168|emb|CAX71570.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 68/243 (27%), Positives = 103/243 (42%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G +W + RG+VTGG+ ++TGC+P FP C+H C P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ C Y + QDK+ Y V + IQ++IM +GP A + +Y D +YKSG
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPAEAYLEIYEDFLNYKSG- 275
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ Y +G + ++ V+++GWG EN
Sbjct: 276 ---------------IYRYTTG--------QFISGHAVRLIGWGVEN------------- 299
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
G YW +T+ E +G+KG +I+RGRNE +IES +
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAG 338
Query: 242 LPK 244
L K
Sbjct: 339 LIK 341
>gi|27882093|gb|AAH44517.1| Zgc:55862 [Danio rerio]
Length = 330
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 69/242 (28%), Positives = 95/242 (39%), Gaps = 60/242 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S+ W + GLVTGG ++S+ GC+P + PC H + S P C P C
Sbjct: 148 CNGGYPSAAWDFWTTDGLVTGGLYNSHIGCRPYTIEPCEH-HVNGSRPPCTGEGGDTPNC 206
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C Y + +DK+ K Y V I E+ KNGPV A +Y D YKSG
Sbjct: 207 DMKC-EPGYSPLYKEDKHFGKTSYSVPSNQNGIMAELFKNGPVEAAFTVYEDFLLYKSGV 265
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + S + +KI+GWGEENG P
Sbjct: 266 YQH------------------------MSGSALGGHAIKILGWGEENGVP---------- 291
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW +++ +GD G KILRG + IES +
Sbjct: 292 ------------------------YWLAANSWNTDWGDNGYFKILRGEDHCGIESEIVAG 327
Query: 242 LP 243
+P
Sbjct: 328 IP 329
>gi|167541036|gb|ABZ82028.1| cathepsin B endopeptidase [Clonorchis sinensis]
Length = 228
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 71/243 (29%), Positives = 96/243 (39%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W + G+VTGG+ + TGC+ FP C+H + P C P C
Sbjct: 40 CQGGFPPTAWDFWQTEGIVTGGSKENPTGCRSYPFPRCSHHG-SKKYPPCSHRIYDTPNC 98
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C D + DK R Y V + I +EIM NGPV A +Y D YKSG
Sbjct: 99 VQKC--DTPDTDYATDKTRANITYNVKAKQNAIMKEIMINGPVEAAFQVYEDFLGYKSG- 155
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+Y +SD ++ ++I+GWGEENG YW I +
Sbjct: 156 ---------VYFHSD--------------GTLLGGHAIRILGWGEENGVAYWLIANSWN- 191
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
GWGE+ G K+LRG+NE IE V
Sbjct: 192 --------------DGWGED-------------------GYFKMLRGKNECGIEDEVTAG 218
Query: 242 LPK 244
LP+
Sbjct: 219 LPE 221
>gi|29374023|gb|AAO73002.1| cathepsin B [Fasciola gigantica]
Length = 335
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 72/242 (29%), Positives = 94/242 (38%), Gaps = 61/242 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G S W + + G+V+GG + TGC P FP C+H T C PKC
Sbjct: 155 CEGGYPSMAWDYWWRHGIVSGGTLENPTGCLPYPFPKCSHLEETPGLAPCPRELYATPKC 214
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C Y + +DK + K Y V D DI EI+ NGPV Y++ D YKSG
Sbjct: 215 EKQC-QAGYSKTSEEDKIKGKSSYNVGDRETDIMMEIITNGPVSTIYYIFEDFTVYKSG- 272
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ Y SG I+GWG ENG
Sbjct: 273 ---------------IYQYTSGSLMGGHG----------IIGWGVENG------------ 295
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
VK YW +++ E +G+ G +I RG NE IES +N
Sbjct: 296 -----------VK-----------YWLAANSWNEGWGENGYFRIRRGTNECGIESRINAG 333
Query: 242 LP 243
LP
Sbjct: 334 LP 335
>gi|384597848|gb|AFI23675.1| cathepsin B, partial [Brugia malayi]
Length = 319
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 68/232 (29%), Positives = 103/232 (44%), Gaps = 61/232 (26%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPK 60
C G + W + G+VTG + +++GC+P FPPC +H+N T EP CK P PK
Sbjct: 146 CFGGEPMAAWKYWVLSGIVTGSDYTNHSGCRPYPFPPCEHHSNKTHYEP-CKHDLYPTPK 204
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C+ +C + NY + + DKY ++ Y V ++V IQ+EIM GPV A+ +Y+
Sbjct: 205 CYKQC-DKNYTKSYKADKYYGEQAYNVENDVESIQKEIMTLGPVEASFEVYT-------- 255
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
D Y SG+Y A + VG G
Sbjct: 256 ---------------DFLHYTSGIYKHVAGS----------VGGGH-------------- 276
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 232
VK++GWG + G YW +++ +G+ G +ILRG +E
Sbjct: 277 -----------AVKILGWGIDQGVSYWLAANSWNNDWGEDGYFRILRGADEC 317
>gi|401758196|gb|AFQ01133.1| cathepsin B [Chilo suppressalis]
Length = 350
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 69/243 (28%), Positives = 102/243 (41%), Gaps = 78/243 (32%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSE--------PECKT 53
C G+ + W++ K G+V+GG + S GCQP + PPCNH + E P+CK
Sbjct: 153 CEGGVLTRAWIYYKKIGIVSGGGYKSKQGCQPYTIPPCNHLVWGEIEQCKNIPMTPKCKN 212
Query: 54 L-ATPQ--------PKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPV 104
+ P+ P+C +C N NY + +DK+R K Y V + EI K
Sbjct: 213 IPVIPEQCKYIPITPECEKKC-NKNYKVCYSKDKHRGKSVYRVK------KSEIFK---- 261
Query: 105 VANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGW 164
+I+ Y GPV + +Y D +YK G+Y +
Sbjct: 262 --------EIYEY-------GPVTSYFTVYEDFLNYKEGIYNYT---------------- 290
Query: 165 GEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIK 224
S + + +VK+IGWGEE G YW ++F +GDKG K
Sbjct: 291 -------------------SGQKLGLHSVKIIGWGEERGIKYWLAANSFNTDWGDKGFFK 331
Query: 225 ILR 227
I+R
Sbjct: 332 IIR 334
>gi|126303983|ref|XP_001381634.1| PREDICTED: cathepsin B-like [Monodelphis domestica]
Length = 337
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 71/247 (28%), Positives = 101/247 (40%), Gaps = 61/247 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + K+GLV+GG + S+ GC+P S PPC H + S P C P C
Sbjct: 151 CNGGFPAGAWNFWTKKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPACTGEEGDTPTC 209
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C + Y + DK Y V +I EI KNGPV +Y D YKSG
Sbjct: 210 RKKC-EEGYSTQYKDDKNYGSTSYSVPSSEQEIMAEIYKNGPVEGAFSVYEDFLHYKSGV 268
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + E++ ++I+GWG ENG YW +
Sbjct: 269 YQH------------------------VAGEMLGGHAIRILGWGVENGIRYW-------L 297
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+A++ W I +GD G K LRG+N IES +
Sbjct: 298 AANS---------------------WNI------DWGDNGFFKFLRGKNHCGIESEIIAG 330
Query: 242 LPK-DNY 247
+P+ D Y
Sbjct: 331 IPRTDQY 337
>gi|308390275|gb|ADO32581.1| cathepsin B [Marsupenaeus japonicus]
Length = 332
Score = 94.7 bits (234), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 70/246 (28%), Positives = 102/246 (41%), Gaps = 63/246 (25%)
Query: 2 CSSGISSSTW-VWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 60
C+ G + + WVH G+V+GG+ +S GCQP PC H + P+C PK
Sbjct: 149 CNGGFPGAAFKYWVHS-GIVSGGSFNSTQGCQPYEIAPCEH-HVPGPRPKCSE-GGGTPK 205
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C RC N Y + D + + Y + + I+ EIMKNGPV +Y D YKSG
Sbjct: 206 CVKRCEN-GYTVDYESDLHHGGKAYSIMKDEDQIKYEIMKNGPVEGAFTVYVDFLHYKSG 264
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
++ ++ G+ + ++I+GWGEENG P
Sbjct: 265 ----------------VYQHRHGL--------PLGGHAIRILGWGEENGTP--------- 291
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
YW +++ +GD G KILRG + IES ++
Sbjct: 292 -------------------------YWLCANSWNTDWGDNGLFKILRGSDHCGIESEISA 326
Query: 241 ALPKDN 246
LPK N
Sbjct: 327 GLPKLN 332
>gi|187105116|ref|NP_001119618.1| cathepsin B-84 precursor [Acyrthosiphon pisum]
gi|161343843|tpg|DAA06102.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 335
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 67/247 (27%), Positives = 103/247 (41%), Gaps = 71/247 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP-- 59
C+ G W + + G+VTGG + + GCQP PPC + E + QP
Sbjct: 153 CNGGYPLKAWQYFKRHGVVTGGDYDTTDGCQPYRVPPC------VKDDEGHNSCSGQPTE 206
Query: 60 ---KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFS 116
KC +C D+ + ++ Y+ K Y+ +KN + + +Y
Sbjct: 207 RNHKCSKKCYGDD-TIDYKKNHYKTKDAYY------------LKNTTMQKDTMVY----- 248
Query: 117 YKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
GP+ A+ +Y D +Y+SGVY + +A + VK++GWG E G PY
Sbjct: 249 --------GPIEASFDVYDDFMNYESGVYQRTGNASYLGGHAVKMIGWGVEEGTPY---- 296
Query: 177 RVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
W +V+++GEQ+GDKG KILRG +E IES
Sbjct: 297 ------------------------------WLMVNSWGEQWGDKGMFKILRGTDECGIES 326
Query: 237 LVNGALP 243
+P
Sbjct: 327 SCTAGVP 333
>gi|204022085|dbj|BAG71140.1| cathepsin B-S [Astegopteryx spinocephala]
Length = 335
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 67/243 (27%), Positives = 102/243 (41%), Gaps = 66/243 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + + G+ TGG + SN GC P PPC + + + L +P
Sbjct: 155 CQGGNPIKAWKYFKRHGITTGGDYGSNEGCAPYKVPPC-------YDDQGEFLCQGKPTE 207
Query: 62 HT-RCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
H +C YG +++Y+ K Y V D I+Q+I K GPV A+ +Y D
Sbjct: 208 HNHKCPRACYGNSTVENRYKVKSIY-VLDSSKTIEQDIRKYGPVEASFDVYDDF------ 260
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
+YKSG+Y + +A V +VK++GWGEE+G PYW +V ++
Sbjct: 261 -----------------ITYKSGIYQKTPNAFYVGGHSVKLIGWGEEDGIPYWLLVNSWS 303
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
+W G++GT +I++GRNE IE
Sbjct: 304 ------------------------KFW----------GEQGTFRIIKGRNECGIERSATA 329
Query: 241 ALP 243
+P
Sbjct: 330 GVP 332
>gi|28971815|dbj|BAC65419.1| cathepsin B [Pandalus borealis]
Length = 328
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 67/229 (29%), Positives = 98/229 (42%), Gaps = 62/229 (27%)
Query: 13 WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGR 72
WV K G V+GG H+SN GCQP S C H + P C+ P+ C C ++ YG+
Sbjct: 157 WVTK-GFVSGGRHNSNEGCQPYSVEECEH-HIEGPRPPCEG-DMPELVCSETC-HEEYGK 212
Query: 73 GFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMY 132
+ +D Y + +V IQ+EIM NGPV A +Y D SYKSG
Sbjct: 213 TYEEDLEYGLEAYVLPQDVTQIQEEIMTNGPVTAAFAVYDDFLSYKSG------------ 260
Query: 133 LYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYAT 192
++ +++G+ + Y V+++GWGEE G P
Sbjct: 261 ----VYQHETGL--------LDGYHAVRVIGWGEEEGTP--------------------- 287
Query: 193 VKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW + +++ +GD G KILRG +E E + A
Sbjct: 288 -------------YWLVANSWNTDWGDNGLFKILRGSDECEFEGDMAAA 323
>gi|405971658|gb|EKC36483.1| Cathepsin B [Crassostrea gigas]
Length = 341
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 74/243 (30%), Positives = 103/243 (42%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G S+ W + K GLVTGG ++S+ GC P + C+H +P K++ P PKC
Sbjct: 158 CEGGFPSAAWSYYKKDGLVTGGQYNSHQGCLPYTIKACDHHVVGKLQPCSKSIG-PTPKC 216
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + +DK+ Y V+ V I EIM NGPV G
Sbjct: 217 KHTC-EAGYNVTYEKDKHYGSSAYSVHG-VEKIMTEIMTNGPV--------------EGA 260
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+ +Y+D YKSGVY + + +A +KI+GWG ENG YW + +
Sbjct: 261 F---------TVYADFPQYKSGVYKHTTGQPLGGHA-IKILGWGTENGDDYWLVANSW-- 308
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
P W GD+G KILRG++E IES ++
Sbjct: 309 ----------------------NPDW----------GDQGFFKILRGQDECGIESQISAG 336
Query: 242 LPK 244
PK
Sbjct: 337 EPK 339
>gi|282400164|ref|NP_001164205.1| cathepsin B precursor [Tribolium castaneum]
gi|270004839|gb|EFA01287.1| cathepsin B precursor [Tribolium castaneum]
Length = 335
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 69/243 (28%), Positives = 98/243 (40%), Gaps = 62/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G+ + W+ G+VTGG + GC+ SF PC H + P C P P C
Sbjct: 154 CNGGMPAMAWLHWTVNGIVTGGNYEDTNGCKAYSFAPCEH-HVDGDLPPCGP-TKPTPDC 211
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C D+ +Q+ Y ++ IQ EIM NGPV A+ +Y D SYKSG
Sbjct: 212 KKEC--DSGSSLTYQNDLTHGSNYGIDPYPKQIQTEIMTNGPVEASFSVYEDFLSYKSG- 268
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
++ + G YA + +KI+GWG EN P
Sbjct: 269 ---------------VYQHLEGEYAGGHA--------IKILGWGVENDTP---------- 295
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW + +++ E +GDKG KILRG NE IE +
Sbjct: 296 ------------------------YWLVANSWNEDWGDKGYFKILRGSNECGIEGSIVAG 331
Query: 242 LPK 244
+P+
Sbjct: 332 IPE 334
>gi|239788404|dbj|BAH70886.1| ACYPI000014 [Acyrthosiphon pisum]
Length = 335
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 67/247 (27%), Positives = 103/247 (41%), Gaps = 71/247 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP-- 59
C+ G W + + G+VTGG + + GCQP PPC + E + QP
Sbjct: 153 CNGGYPLKAWQYFKRHGVVTGGDYDTTDGCQPYRVPPC------VKDDEGHNSCSGQPTE 206
Query: 60 ---KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFS 116
KC +C D+ + ++ Y+ K Y+ +KN + + +Y
Sbjct: 207 RNHKCSKKCYGDD-TIDYKKNHYKTKDAYY------------LKNTTMQKDTMVY----- 248
Query: 117 YKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
GP+ A+ +Y D +Y+SGVY + +A + VK++GWG E G PY
Sbjct: 249 --------GPIEASFDVYDDFMNYESGVYQRTGNASYLGGHAVKMIGWGVEEGTPY---- 296
Query: 177 RVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
W +V+++GEQ+GDKG KILRG +E IES
Sbjct: 297 ------------------------------WLMVNSWGEQWGDKGMFKILRGTDECGIES 326
Query: 237 LVNGALP 243
+P
Sbjct: 327 SCTAGVP 333
>gi|449267314|gb|EMC78276.1| Cathepsin B [Columba livia]
Length = 340
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 66/242 (27%), Positives = 95/242 (39%), Gaps = 60/242 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W + ++GLV+GG + S+ GC+P S PPC H + S P C P+C
Sbjct: 150 CNGGYPSGAWRYWTEKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPCTGEGGETPRC 208
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + +DK+ Y V +I EI KNGPV +Y D YKSG
Sbjct: 209 SRHC-EPGYSPSYKEDKHYGITSYGVPRSEKEIMAEIYKNGPVEGAFIVYEDFLMYKSGV 267
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + E V ++++GWG +NG P
Sbjct: 268 YQH------------------------VTGEQVGGHAIRLLGWGVDNGTP---------- 293
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW +++ +GD G KILRG + IES +
Sbjct: 294 ------------------------YWLAANSWNTDWGDNGFFKILRGEDHCGIESEIVAG 329
Query: 242 LP 243
+P
Sbjct: 330 IP 331
>gi|402594312|gb|EJW88238.1| cathepsin B5 [Wuchereria bancrofti]
Length = 407
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 71/243 (29%), Positives = 103/243 (42%), Gaps = 56/243 (23%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W + G+VTG + +++GC+P FPPC H N T CK P PKC
Sbjct: 205 CFGGEPMAAWKYWVLSGIVTGSDYTNHSGCRPYPFPPCEHHNNKTHYEPCKHDLYPTPKC 264
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C + NY + + DKY ++ Y V ++V IQ+EIM GPV A+ +Y+D Y G
Sbjct: 265 DRQC-DKNYKKPYKADKYYGEQAYNVENDVELIQKEIMTLGPVEASFEVYTDFLHYIGG- 322
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+Y + G +A VKI+GWG + G YW +
Sbjct: 323 -----------IYKHVAGSVGGGHA------------VKILGWGIDQGVSYWLAANSWNT 359
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
WGE+ F G +ILRG +E IES +
Sbjct: 360 D---------------WGED----------VFS------GYFRILRGVDECGIESGIVAG 388
Query: 242 LPK 244
+P+
Sbjct: 389 IPR 391
>gi|349956183|dbj|GAA30948.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 337
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 71/243 (29%), Positives = 95/243 (39%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + G+VTGG+ + TGC+ FP C+H + P C P C
Sbjct: 149 CQGGFPPIAWDFWQTEGIVTGGSKENPTGCRSYPFPRCSHHG-SKKYPPCSHRIYDTPNC 207
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C D + DK R Y V + I +EIM NGPV A +Y D YKSG
Sbjct: 208 VQKC--DTPDTDYATDKTRANITYNVKAKQNAIMKEIMINGPVEAAFQVYEDFLGYKSG- 264
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+Y +SD ++ ++I+GWGEENG YW I +
Sbjct: 265 ---------VYFHSD--------------GTLLGGHAIRILGWGEENGVAYWLIANSWND 301
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
GWGE+ G K+LRG+NE IE V
Sbjct: 302 ---------------GWGED-------------------GCFKMLRGKNECGIEDEVTAG 327
Query: 242 LPK 244
LP+
Sbjct: 328 LPE 330
>gi|226474176|emb|CAX71574.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 69/243 (28%), Positives = 103/243 (42%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G +W + RG+VTGG+ ++TGC+P FP C+H C P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ C Y + QDK+ Y V + IQ++IM +GPV A + +Y D +YKSG
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 275
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ Y +G Y ++ V+++GWG EN
Sbjct: 276 ---------------IYRYTTGKY--------ISGHAVRLIGWGVEN------------- 299
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
G YW +T+ E +G+KG +I+RGRNE I+S +
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECSIDSEIAAG 338
Query: 242 LPK 244
L K
Sbjct: 339 LIK 341
>gi|167508668|gb|ABZ81540.1| cathepsin B-like cysteine protease [Caenorhabditis brenneri]
Length = 193
Score = 94.4 bits (233), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 71/226 (31%), Positives = 95/226 (42%), Gaps = 64/226 (28%)
Query: 13 WVHKRGLVTGGAHHSNTGCQPVSFPPCN--HANYTTSEPECKTLATPQPKCHTRCT-NDN 69
W GL TGG + GC+P + PC+ + N TTS P C TP C RCT N
Sbjct: 29 WWQTHGLCTGGNYDDQFGCKPYTIYPCDKTYPNGTTSVP-CPGYHTPV--CEERCTSNIT 85
Query: 70 YGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVA 129
+ + Q K+ K +Y V ++ DIQ EIM+NGPV+A+ +Y D + YKSG Y
Sbjct: 86 WPISYKQVKHFGKAHYNVGKKMTDIQTEIMRNGPVIASFIIYDDFWDYKSGIY------- 138
Query: 130 NMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVA 189
V + + KI+GWG +NG PYW V
Sbjct: 139 -----------------VHTAGDQEGGMDTKIIGWGVDNGVPYWLCVH------------ 169
Query: 190 YATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
+G FG+ G ++ILRG NE IE
Sbjct: 170 ----------------------QWGTDFGENGFMRILRGVNEVHIE 193
>gi|3929733|emb|CAA77178.1| cathepsin B [Homo sapiens]
Length = 195
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 59/176 (33%), Positives = 82/176 (46%), Gaps = 27/176 (15%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 44 CNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 101
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ Y V++ DI EI KNGPV +YSD YKSG
Sbjct: 102 SKIC-EPGYSPTYKQDKHYGYDSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGV 160
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
Y + + E++ ++I+GWG ENG PYW +
Sbjct: 161 YQH------------------------VTGEMMGGHAIRILGWGVENGTPYWLVAN 192
>gi|195130519|ref|XP_002009699.1| GI15503 [Drosophila mojavensis]
gi|193908149|gb|EDW07016.1| GI15503 [Drosophila mojavensis]
Length = 342
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 69/243 (28%), Positives = 106/243 (43%), Gaps = 62/243 (25%)
Query: 2 CSSGISSSTWV-WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 60
C+ G + W W HK G+V+GG+++SN GC+P PC H + + P CK TP
Sbjct: 159 CNGGFPGAAWSYWTHK-GIVSGGSYNSNEGCRPYEIEPCEH-HVNGTRPPCKNGRTPS-- 214
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C +C + +Y + +DK+ + Y + +IQ+EIM NGPV +Y
Sbjct: 215 CKHQCES-SYSVDYAKDKHFGSKSYSIRRNPREIQREIMTNGPVEGAFTVY--------- 264
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
D+ YKSGVY E+ +A ++I+GWG
Sbjct: 265 --------------EDLILYKSGVYKHVHGKELGGHA-IRILGWGV-------------- 295
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
WG+ PYW I +++ +GD G +I+RG + IES ++
Sbjct: 296 -----------------WGDSK-VPYWLIGNSWNTDWGDNGFFRIVRGEDHCGIESAISA 337
Query: 241 ALP 243
LP
Sbjct: 338 GLP 340
>gi|226474180|emb|CAX71576.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 69/243 (28%), Positives = 102/243 (41%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G +W + RG+VTGG+ ++TGC+P FP C+H C P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYETPQC 217
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ Y V + IQ++IM +GPV A + +Y D +YKSG
Sbjct: 218 KQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 275
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ Y +G + ++ V+++GWG EN
Sbjct: 276 ---------------IYRYTTG--------QFISGHAVRLIGWGVEN------------- 299
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
G YW +T+ E +G+KG +I+RGRNE IES +
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAG 338
Query: 242 LPK 244
L K
Sbjct: 339 LIK 341
>gi|56756410|gb|AAW26378.1| unknown [Schistosoma japonicum]
Length = 342
Score = 94.4 bits (233), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 69/243 (28%), Positives = 102/243 (41%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G +W + RG+VTGG+ ++TGC+P FP C+H C P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ Y V + IQ++IM +GPV A + +Y D +YKSG
Sbjct: 218 KQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 275
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ Y +G + ++ V+++GWG EN
Sbjct: 276 ---------------IYRYTTG--------QFISGHAVRLIGWGVEN------------- 299
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
G YW +T+ E +G+KG +I+RGRNE IES +
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAG 338
Query: 242 LPK 244
L K
Sbjct: 339 LIK 341
>gi|48762485|dbj|BAD23812.1| cathepsin B-N1 [Tuberaphis styraci]
Length = 340
Score = 94.4 bits (233), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 75/245 (30%), Positives = 102/245 (41%), Gaps = 67/245 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
CS G W K GLVTGG + S GCQP PPC Y + C+ P K
Sbjct: 157 CSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPCPLDEYGNNT--CR--GKPAEKN 212
Query: 62 HTRCTNDNYGR---GFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
H RCT YG F +D + + Y++ IQ +DI +Y
Sbjct: 213 H-RCTRMCYGNQDLDFKEDHHYTRDAYYLT--YGTIQ----------------NDILAY- 252
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
GP+ A+ +Y D SYKSGVY +A + VK++GWGEE G PY
Sbjct: 253 ------GPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWGEEYGVPY------ 300
Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
W +V+++ +Q+GD+G KI RG NE I++
Sbjct: 301 ----------------------------WLLVNSWNDQWGDQGLFKIRRGTNECGIDNST 332
Query: 239 NGALP 243
G +P
Sbjct: 333 TGGVP 337
>gi|157058747|gb|ABV03131.1| cathepsin B-2744 [Myzus persicae]
Length = 261
Score = 94.4 bits (233), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 61/177 (34%), Positives = 80/177 (45%), Gaps = 32/177 (18%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK- 60
C G + W + +G+VTGG + SN GCQP PC+H +S C +L Q
Sbjct: 98 CDGGSAYKAWEFTMGKGIVTGGPYDSNEGCQPYKNRPCDHYG-DSSLTNCSSLRRTQMMF 156
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRYY---WVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
C +C N NY + D Y+ Y W N V IQQEIM GPV A MY+Y + Y
Sbjct: 157 CRDKCVNKNYKVKYEDDLYKTSVVYMTSWTN--VKQIQQEIMTYGPVTAFMYVYENFMGY 214
Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWG-EENGRPYW 173
K G Y S + E++ Y VK++GWG +E G YW
Sbjct: 215 KEGVYK------------------------STAGELIGYHHVKLIGWGVDEAGIEYW 247
>gi|48762493|dbj|BAD23816.1| cathepsin B-N1 [Tuberaphis coreana]
Length = 340
Score = 94.0 bits (232), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 75/245 (30%), Positives = 102/245 (41%), Gaps = 67/245 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
CS G W K GLVTGG + S GCQP PPC Y + C+ P K
Sbjct: 157 CSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPCPLDEYGNNT--CR--GKPAEKN 212
Query: 62 HTRCTNDNYGR---GFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
H RCT YG F +D + + Y++ IQ +DI +Y
Sbjct: 213 H-RCTRMCYGNQDLDFKEDHHYTRDAYYLT--YGTIQ----------------NDILAY- 252
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
GP+ A+ +Y D SYKSGVY +A + VK++GWGEE G PY
Sbjct: 253 ------GPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWGEEYGVPY------ 300
Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
W +V+++ +Q+GD+G KI RG NE I++
Sbjct: 301 ----------------------------WLLVNSWNDQWGDQGLFKIRRGTNECGIDNST 332
Query: 239 NGALP 243
G +P
Sbjct: 333 TGGVP 337
>gi|395842321|ref|XP_003793966.1| PREDICTED: cathepsin B [Otolemur garnettii]
Length = 339
Score = 94.0 bits (232), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 68/239 (28%), Positives = 102/239 (42%), Gaps = 61/239 (25%)
Query: 8 SSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 67
+ W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC C
Sbjct: 156 AEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEH-HVNGSRPAC-TGEGDTPKCSKTC-E 212
Query: 68 DNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPV 127
Y + +DK+ Y + +I EI KNGPV FS
Sbjct: 213 PGYSPTYKEDKHFGYTSYSLPTNEWEIMAEIYKNGPV-------EGAFS----------- 254
Query: 128 VANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEI 187
+YSD YKSGVY + ++
Sbjct: 255 -----VYSDFLLYKSGVYQ-----------------------------------HLTGDM 274
Query: 188 VAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDN 246
+ ++++GWGEENG PYW + +++ +GD G +ILRG++ IES V +P+ +
Sbjct: 275 MGGHAIRILGWGEENGVPYWLVANSWNTDWGDGGFFRILRGQDHCGIESEVVAGIPRTD 333
>gi|380791571|gb|AFE67661.1| cathepsin B preproprotein, partial [Macaca mulatta]
Length = 311
Score = 94.0 bits (232), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 59/178 (33%), Positives = 83/178 (46%), Gaps = 27/178 (15%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGYPAGAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ Y V++ DI EI KNGPV +YSD YKSG
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSGV 266
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
Y + + E++ ++I+GWG ENG PYW + +
Sbjct: 267 YQH------------------------VTGEMMGGHAIRILGWGVENGTPYWLVANSW 300
>gi|157058741|gb|ABV03128.1| cathepsin B-2744 [Aulacorthum solani]
Length = 255
Score = 94.0 bits (232), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 60/177 (33%), Positives = 80/177 (45%), Gaps = 32/177 (18%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK- 60
C G + W +G+VTGG + SN GCQP PC+H +S C +L Q
Sbjct: 96 CDGGSAFKAWELTMSKGIVTGGNYDSNEGCQPYKNRPCDHYG-DSSLTNCSSLRRTQMTV 154
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRYY---WVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
C +C N NY + D ++ Y W N V IQQEIM GPV A MY+Y + Y
Sbjct: 155 CREKCVNKNYKVKYEDDLHKTSIVYMTSWTN--VKQIQQEIMTYGPVTALMYVYENFMGY 212
Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWG-EENGRPYW 173
K G Y S + E++ Y VK++GWG +E+G YW
Sbjct: 213 KKGIYK------------------------STAGELIGYHHVKLIGWGVDEDGTEYW 245
>gi|91089437|ref|XP_966750.1| PREDICTED: similar to putative cathepsin B-like proteinase
[Tribolium castaneum]
gi|270012705|gb|EFA09153.1| cathepsin B precursor [Tribolium castaneum]
Length = 324
Score = 94.0 bits (232), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 74/233 (31%), Positives = 98/233 (42%), Gaps = 73/233 (31%)
Query: 13 WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGR 72
WV K G+V+GG ++SN GCQP Y S L + PKC T+C N Y
Sbjct: 163 WVAK-GIVSGGDYNSNEGCQP----------YEGSA----FLNSVTPKCSTKCLNSKYTT 207
Query: 73 GFFQDKYRFKRY-YWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANM 131
+ +DK+ + Y + VA+IQ EIM NGPVV +M +Y D +SYKSG Y +
Sbjct: 208 PYAKDKHYGTDFIYMTSKNVAEIQTEIMNNGPVVTHMDVYEDFYSYKSGVYQH------- 260
Query: 132 YLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYA 191
S + VKI+GWG E G PYW I
Sbjct: 261 -----------------VSGNSMGGHAVKIIGWGTEKGVPYWLIAN-------------- 289
Query: 192 TVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
WG + W + F KILRG+N IE+ + G P+
Sbjct: 290 -----SWGAK-----WADLDGF---------YKILRGKNHCKIETYIYGGTPQ 323
>gi|204022094|dbj|BAG71144.1| cathepsin B-N1 [Tuberaphis taiwana]
Length = 334
Score = 94.0 bits (232), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 75/245 (30%), Positives = 102/245 (41%), Gaps = 67/245 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
CS G W K GLVTGG + S GCQP PPC Y + C+ P K
Sbjct: 154 CSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPCPLDEYGNNT--CR--GKPAEKN 209
Query: 62 HTRCTNDNYGR---GFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
H RCT YG F +D + + Y++ IQ +DI +Y
Sbjct: 210 H-RCTRMCYGNQDLDFKEDHHYTRDAYYLT--YGTIQ----------------NDILAY- 249
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
GP+ A+ +Y D SYKSGVY +A + VK++GWGEE G PY
Sbjct: 250 ------GPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWGEEYGVPY------ 297
Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
W +V+++ +Q+GD+G KI RG NE I++
Sbjct: 298 ----------------------------WLLVNSWNDQWGDQGLFKIRRGTNECGIDNST 329
Query: 239 NGALP 243
G +P
Sbjct: 330 TGGVP 334
>gi|46812327|gb|AAT02230.1| cathepsin B-like proteinase [Triatoma dimidiata]
Length = 332
Score = 94.0 bits (232), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 71/243 (29%), Positives = 101/243 (41%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G S W + G+V+GG + S GCQP S PC H + S P C+ C
Sbjct: 150 CFGGDPGSAWEYWRDVGIVSGGNYGSKEGCQPYSIAPCEH-HIPGSRPPCRGEGH-TADC 207
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C Y + +D + + Y +V +IQ EI+KNGPV A ++Y D
Sbjct: 208 RKQCEK-GYSIPYDKDLHYAEFVYSTERDVKEIQTEILKNGPVEAAFFVYED-------- 258
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+ +YK GVY A A + +A +KI+GWG ENG PY
Sbjct: 259 ---------------LLTYKEGVYKHVAGAPVGGHA-IKILGWGVENGTPY--------- 293
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
W I +++ +G+ G KILRG +E IE V+
Sbjct: 294 -------------------------WLIANSWNTDWGNNGFFKILRGSDECGIEIDVSAG 328
Query: 242 LPK 244
LP+
Sbjct: 329 LPR 331
>gi|294894290|ref|XP_002774786.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239880403|gb|EER06602.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 830
Score = 94.0 bits (232), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 73/270 (27%), Positives = 104/270 (38%), Gaps = 72/270 (26%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAH------HSNTGCQPVSFPPCNHANYTTSEPECKTLA 55
C+ G +S W WVH +G+ TGG + + GC P FPPC H T PEC ++
Sbjct: 605 CNGGFPNSAWSWVHDKGIATGGDYVAKDDMTKDDGCWPYDFPPCAHHINDTKYPECPKVS 664
Query: 56 TP---------------------QPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADI 94
P C +C N Y D++ V D
Sbjct: 665 CSGESPPATAETATVIAYQNSYETPNCAEQCHNPKYTTTLRDDRHFMLESSPYQYSVNDA 724
Query: 95 QQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIV 154
+ I +GPV +Y ++ V A+ +Y D +YKSGVY
Sbjct: 725 KNAIRTDGPV-GPIYFCDPNVNFDQ-------VSASFSVYEDFLAYKSGVYK-------- 768
Query: 155 AYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFG 214
S E + VK+IGWGEE+G+ YW +V+++
Sbjct: 769 ---------------------------HTSGEYLGGHAVKIIGWGEESGQAYWIVVNSWN 801
Query: 215 EQFGDKGTIKILRGRNEAIIESLVNGALPK 244
E +GD G KI G N I ++L+ G PK
Sbjct: 802 EDWGDHGLFKIALG-NCGIDDNLLGGT-PK 829
>gi|204022092|dbj|BAG71143.1| cathepsin B-N2 [Tuberaphis coreana]
Length = 334
Score = 94.0 bits (232), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 75/245 (30%), Positives = 102/245 (41%), Gaps = 67/245 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
CS G W K GLVTGG + S GCQP PPC Y + C+ P K
Sbjct: 154 CSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPCPLDEYGNNT--CR--GKPAEKN 209
Query: 62 HTRCTNDNYGR---GFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
H RCT YG F +D + + Y++ IQ +DI +Y
Sbjct: 210 H-RCTRMCYGNQDLDFKEDHHYTRDAYYLT--YGTIQ----------------NDILAY- 249
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
GP+ A+ +Y D SYKSGVY +A + VK++GWGEE G PY
Sbjct: 250 ------GPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWGEEYGVPY------ 297
Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
W +V+++ +Q+GD+G KI RG NE I++
Sbjct: 298 ----------------------------WLLVNSWNDQWGDQGLFKIRRGTNECGIDNST 329
Query: 239 NGALP 243
G +P
Sbjct: 330 TGGVP 334
>gi|226473754|emb|CAX71562.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 329
Score = 94.0 bits (232), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 69/243 (28%), Positives = 102/243 (41%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G +W + RG+VTGG+ ++TGC+P FP C+H C P+C
Sbjct: 146 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 204
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ Y V + IQ++IM +GPV A + +Y D +YKSG
Sbjct: 205 KQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 262
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ Y +G + ++ V+++GWG EN
Sbjct: 263 ---------------IYRYTTG--------QFISGHAVRLIGWGVEN------------- 286
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
G YW +T+ E +G+KG +I+RGRNE IES +
Sbjct: 287 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAG 325
Query: 242 LPK 244
L K
Sbjct: 326 LIK 328
>gi|56752809|gb|AAW24616.1| unknown [Schistosoma japonicum]
Length = 342
Score = 94.0 bits (232), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 69/243 (28%), Positives = 102/243 (41%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G +W + RG+VTGG+ ++TGC+P FP C+H C P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ Y V + IQ++IM +GPV A + +Y D +YKSG
Sbjct: 218 KQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 275
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ Y +G + ++ V+++GWG EN
Sbjct: 276 ---------------IYRYTTG--------QFISGHAVRLIGWGVEN------------- 299
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
G YW +T+ E +G+KG +I+RGRNE IES +
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAG 338
Query: 242 LPK 244
L K
Sbjct: 339 LIK 341
>gi|56757646|gb|AAW26973.1| unknown [Schistosoma japonicum]
Length = 342
Score = 94.0 bits (232), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 69/243 (28%), Positives = 102/243 (41%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G +W + RG+VTGG+ ++TGC+P FP C+H C P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ C Y + QDK+ Y V + Q++IM +GPV A + +Y D +YKSG
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSGESVFQKDIMMHGPVEAYLEIYEDFLNYKSG- 275
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ Y +G Y ++ V+++GWG EN
Sbjct: 276 ---------------IYRYTTGKY--------ISGHAVRLIGWGVEN------------- 299
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
G YW +T+ E +G+KG +I+RGRNE IES +
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAG 338
Query: 242 LPK 244
L K
Sbjct: 339 LIK 341
>gi|268566089|ref|XP_002647469.1| Hypothetical protein CBG06541 [Caenorhabditis briggsae]
Length = 280
Score = 94.0 bits (232), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 72/243 (29%), Positives = 98/243 (40%), Gaps = 69/243 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W + RG+VTGG +GC+P F PCN + PE KT P C
Sbjct: 106 CEGGYPIQAFRWWNSRGVVTGG-DFRGSGCRPYPFAPCN----SYKCPEEKT-----PTC 155
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + +DK Y V VA IQ EIM NGPVV +Y D++ YKSG
Sbjct: 156 SLSC-QFGYSTAYAKDKRFGVSAYAVARNVAAIQTEIMTNGPVVGAFTMYEDMYKYKSGV 214
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + ++ +KI+GWG +NG P
Sbjct: 215 YRH------------------------TAGRLLGGHAIKIIGWGTQNGIP---------- 240
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW I +++G +G+ G +K+ RG NE IES V
Sbjct: 241 ------------------------YWLIANSWGADWGENGFLKMRRGVNECGIESAVVAG 276
Query: 242 LPK 244
+PK
Sbjct: 277 MPK 279
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 27/73 (36%), Positives = 42/73 (57%), Gaps = 2/73 (2%)
Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
NGPV A+ +Y D + YK GVY +A ++V +KI+GWG E+G YW I +
Sbjct: 3 NGPVEASFTVYEDFYIYKKGVYQYTA-GQVVGVHAIKIMGWGTEHGTDYWLIANSWGAQC 61
Query: 184 SAEIVAYATVKLI 196
+ A++T ++I
Sbjct: 62 GS-CWAFSTAEVI 73
>gi|204022106|dbj|BAG71150.1| cathepsin B-N [Astegopteryx spinocephala]
Length = 332
Score = 93.6 bits (231), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 78/247 (31%), Positives = 105/247 (42%), Gaps = 71/247 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W K GLVTGG + S+ GCQP PC Y + C+ P K
Sbjct: 152 CHGGYPIKAWERFKKHGLVTGGNYDSSEGCQPYRVSPCPLDEYGNNT--CR--GKPAEKN 207
Query: 62 HTRCTNDNYG---RGFFQDKYRFKR--YYWVNDEVADIQQEIMKNGPVVANMYLYSDIFS 116
H RCT YG R F +D +RF R YY IQ+++M GP+ A S
Sbjct: 208 H-RCTRMCYGDQDRDFKED-HRFTRDAYYLT---YGTIQKDVMTYGPIEA---------S 253
Query: 117 YKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
Y+ +Y D SYKSGVY + +A + VK++GWGEE G PY
Sbjct: 254 YE--------------VYDDFPSYKSGVYVRTENATYLGGHAVKLIGWGEEYGVPY---- 295
Query: 177 RVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
W +V+++ +Q+GD+G KI RG NE I++
Sbjct: 296 ------------------------------WLMVNSWNDQWGDRGLFKIRRGTNECGIDN 325
Query: 237 LVNGALP 243
G +P
Sbjct: 326 STTGGVP 332
>gi|226474164|emb|CAX71568.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
gi|226474166|emb|CAX71569.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 93.6 bits (231), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 69/243 (28%), Positives = 102/243 (41%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G +W + RG+VTGG+ ++TGC+P FP C+H C P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ Y V + IQ++IM +GPV A + +Y D +YKSG
Sbjct: 218 KQICQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 275
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ Y +G + ++ V+++GWG EN
Sbjct: 276 ---------------IYRYTTG--------QFISGHAVRLIGWGVEN------------- 299
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
G YW +T+ E +G+KG +I+RGRNE IES +
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAG 338
Query: 242 LPK 244
L K
Sbjct: 339 LIK 341
>gi|226474160|emb|CAX71567.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 93.6 bits (231), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 69/243 (28%), Positives = 102/243 (41%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G +W + RG+VTGG+ ++TGC+P FP C+H C P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ Y V + IQ++IM +GPV A + +Y D +YKSG
Sbjct: 218 KQICQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 275
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ Y +G + ++ V+++GWG EN
Sbjct: 276 ---------------IYRYTTG--------QFISGHAVRLIGWGVEN------------- 299
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
G YW +T+ E +G+KG +I+RGRNE IES +
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAG 338
Query: 242 LPK 244
L K
Sbjct: 339 LIK 341
>gi|56756114|gb|AAW26235.1| unknown [Schistosoma japonicum]
Length = 342
Score = 93.6 bits (231), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 69/243 (28%), Positives = 102/243 (41%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G +W + G+VTGG+ ++TGC+P FP C+H C P+C
Sbjct: 159 CDGGYFLPSWDYWVSHGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYETPQC 217
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ Y V + IQ++IM +GPV A + +Y D +YKSG
Sbjct: 218 KQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 275
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ Y +G Y ++ V+++GWG EN
Sbjct: 276 ---------------IYRYTTGKY--------ISGHAVRLIGWGVEN------------- 299
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
G YW +T+ E +G+KG +I+RGRNE +IES +
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAG 338
Query: 242 LPK 244
L K
Sbjct: 339 LIK 341
>gi|226469952|emb|CAX70257.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 93.6 bits (231), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 71/238 (29%), Positives = 100/238 (42%), Gaps = 62/238 (26%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + G+VTGG+ ++TGCQP FP C H + P C P+C
Sbjct: 159 CDGGFPGPAWDYWVSHGIVTGGSKENHTGCQPYPFPKCEHHS-IGKYPSCGDKMYKTPQC 217
Query: 62 HTRCTNDNYGRGFFQDK-YRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
+C Y + DK Y + +E+A IQ+EIM GPV A + ++ D +YKSG
Sbjct: 218 KRKCQK-GYTTPYEHDKHYGGIAINVIKNELA-IQKEIMMYGPVEAYLLIFEDFLNYKSG 275
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
I+ Y +G + V V+I+GWG EN
Sbjct: 276 ----------------IYKYTTGSF--------VGEHYVRIIGWGIEN------------ 299
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
G YW +T+ E +G+KG +I+RGRNE IES+V
Sbjct: 300 ----------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESVV 335
>gi|226469950|emb|CAX70256.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 93.6 bits (231), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 69/237 (29%), Positives = 97/237 (40%), Gaps = 60/237 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + G+VTGG+ ++TGCQP FP C H + P C P+C
Sbjct: 159 CDGGFPGPAWDYWVSHGIVTGGSKENHTGCQPYPFPKCEHHS-IGKYPSCGDKIYKTPQC 217
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C Y + DK+ V + IQ+EIM GPV A + ++ D +YKSG
Sbjct: 218 KRKCQK-GYTTPYEHDKHYGGISINVIKNESAIQKEIMMYGPVEAYLLIFEDFLNYKSG- 275
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ Y +G + V V+I+GWG EN
Sbjct: 276 ---------------IYRYTTGSF--------VGEHYVRIIGWGIEN------------- 299
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
G YW +T+ E +G+KG +I+RGRNE IES+V
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESVV 335
>gi|161343863|tpg|DAA06112.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 340
Score = 93.6 bits (231), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 69/242 (28%), Positives = 100/242 (41%), Gaps = 60/242 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + +G+V+GG + S GC P PC H T P CK P C
Sbjct: 158 CNGGFPGAAWHYWKTKGIVSGGPYGSKMGCIPYEIAPCEHHVNGTRGP-CKE-GGKTPAC 215
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C D Y + QD +R K Y + ++V I+QEI NGPV +Y
Sbjct: 216 VKKC-EDGYKVPYAQDLHRGKSAYSLGNDVDQIRQEIYTNGPVEGAFTVY---------- 264
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
D +Y++GVY A + +A ++I+GWG +NG
Sbjct: 265 -------------EDFIAYRAGVYKHVAGKALGGHA-IRILGWGVQNG------------ 298
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
EI PYW + +++ +G G KILRG +E IE +N
Sbjct: 299 ----EI-----------------PYWLVANSWNSDWGSDGFFKILRGSDECGIEGQINAG 337
Query: 242 LP 243
LP
Sbjct: 338 LP 339
>gi|161343865|tpg|DAA06113.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 335
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 69/250 (27%), Positives = 102/250 (40%), Gaps = 77/250 (30%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP-- 59
C+ G W + + G+VTGG +++ GCQP PPC + E + QP
Sbjct: 153 CNGGNPLKAWKYFKRHGVVTGGNYNTTDGCQPYRVPPC------VRDDEGHNSCSGQPTE 206
Query: 60 ---KCHTRCTND---NYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSD 113
KC +C D NY + ++ K YY ++N + D
Sbjct: 207 RNHKCSKKCYGDETINYKKNHYKTK---DAYY-------------------LSNTTMQKD 244
Query: 114 IFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYW 173
Y GP+ A+ +Y D SY+SGVY + +A + VK++GWG E G PY
Sbjct: 245 TMVY-------GPIEASFDVYDDFTSYESGVYQKTENASYLGGHAVKMIGWGVEEGTPY- 296
Query: 174 TIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 233
W +V+++GEQ+GDKG KILRG +E
Sbjct: 297 ---------------------------------WLMVNSWGEQWGDKGMFKILRGTDECG 323
Query: 234 IESLVNGALP 243
+ES +P
Sbjct: 324 VESSCTAGVP 333
>gi|204022098|dbj|BAG71146.1| cathepsin B-N2 [Tuberaphis sumatrana]
Length = 334
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 74/245 (30%), Positives = 101/245 (41%), Gaps = 67/245 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
CS G W K GLVTGG + S GCQP PPC Y + K P K
Sbjct: 154 CSGGYPIKAWERFKKHGLVTGGNYESGEGCQPYRVPPCPLDEYGNNTCSGK----PTEKN 209
Query: 62 HTRCTNDNYGR---GFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
H RCT YG F +D + + Y++ IQ +D+ +Y
Sbjct: 210 H-RCTRMCYGNQDLDFKEDHHYTRDAYYLT--YGTIQ----------------NDVLAY- 249
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
GP+ A+ +Y D SYKSGVY +A + VK++GWGEE G PY
Sbjct: 250 ------GPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWGEEYGVPY------ 297
Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
W +V+++ +Q+GD+G KI RG NE I++
Sbjct: 298 ----------------------------WLLVNSWNDQWGDQGLFKIRRGTNECGIDNST 329
Query: 239 NGALP 243
G +P
Sbjct: 330 TGGVP 334
>gi|56755451|gb|AAW25905.1| unknown [Schistosoma japonicum]
Length = 342
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 69/243 (28%), Positives = 102/243 (41%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G +W + RG+VTGG+ ++TGC+P FP C+H C P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ Y V + IQ++IM +GPV A + +Y D +YKSG
Sbjct: 218 KQICQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 275
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ Y +G + ++ V+++GWG EN
Sbjct: 276 ---------------IYRYTTG--------QFISGHAVRLIGWGVEN------------- 299
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
G YW +T+ E +G+KG +I+RGRNE IES +
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAG 338
Query: 242 LPK 244
L K
Sbjct: 339 LIK 341
>gi|1345924|sp|P25802.3|CYSP1_OSTOS RecName: Full=Cathepsin B-like cysteine proteinase 1; Flags:
Precursor
Length = 341
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 58/175 (33%), Positives = 85/175 (48%), Gaps = 27/175 (15%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G S + + G+VTGG +++ C+P PC H T EC +A P+C
Sbjct: 160 CEGGWPISAFRFHADEGVVTGGDYNTKGSCRPYEIHPCGHHGNETYYGECVGMAD-TPRC 218
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
RC Y + + D+Y +K+ Y + + V IQ++IMKNGPVVA +Y D Y+SG
Sbjct: 219 KRRCLL-GYPKSYPSDRY-YKKAYQLKNSVKAIQKDIMKNGPVVATYTVYEDFAHYRSG- 275
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
+Y K+G++A VK++GWGEE G PYW +
Sbjct: 276 -----------IYKHKAGRKTGLHA------------VKVIGWGEEKGTPYWIVA 307
>gi|52630945|gb|AAU84936.1| putative cathepsin B-S [Toxoptera citricida]
Length = 335
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 68/249 (27%), Positives = 102/249 (40%), Gaps = 75/249 (30%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPC----NHANYTTSEPECKTLATP 57
C+ G W + + G+VTGG +++ GCQP PPC N + +P P
Sbjct: 153 CNGGNPLKAWQYFKRHGVVTGGNYNTTDGCQPYKVPPCVKDEEGHNSCSGQP-----TEP 207
Query: 58 QPKCHTRCTND---NYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDI 114
KC C D +Y +G Y+ K Y++N + + D
Sbjct: 208 NHKCSRSCYGDKTCDYKKG----HYKTKNAYYLNIDT------------------MQKDT 245
Query: 115 FSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWT 174
+Y GP+ A+ +Y D +Y+SGVY + A+ + VK++GWGEE+G PY
Sbjct: 246 IAY-------GPIEASFDVYDDFVNYESGVYQKTEDAKYLGGHAVKMIGWGEEDGTPY-- 296
Query: 175 IVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 234
W +V+++GEQ+G G KILRG NE I
Sbjct: 297 --------------------------------WLMVNSWGEQWGANGMFKILRGTNECGI 324
Query: 235 ESLVNGALP 243
E +P
Sbjct: 325 EGSPTAGVP 333
>gi|443692853|gb|ELT94358.1| hypothetical protein CAPTEDRAFT_221292 [Capitella teleta]
Length = 374
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 69/243 (28%), Positives = 99/243 (40%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + GLV+GG + ++ GC+P S PC H T P C P PKC
Sbjct: 191 CNGGFPPAAWEYFRDTGLVSGGQYGTHQGCRPYSIAPCEHHVNGTRLP-CSGEG-PTPKC 248
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + DK Y V+++ I EIM NGPV +Y
Sbjct: 249 ERTC-EKGYKVKYEDDKNFGYTAYSVDNDEKQIMTEIMTNGPVEGAFTVY---------- 297
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+D +YKSGVY + E+ +A ++++GWG E+G P
Sbjct: 298 -------------ADFPTYKSGVYQHVSGGELGGHA-IRVLGWGVEDGTP---------- 333
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW + +++ +GD G KILRG+NE IE +
Sbjct: 334 ------------------------YWLVANSWNSDWGDNGFFKILRGQNECGIEGEIVAG 369
Query: 242 LPK 244
LPK
Sbjct: 370 LPK 372
>gi|211853248|emb|CAP17587.1| cathepsin-like protein 4 [Crateromorpha meyeri]
Length = 325
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 73/240 (30%), Positives = 103/240 (42%), Gaps = 66/240 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTGG-----AHHSNTGCQPVSFPPCNHANYTTSEPECKTLAT 56
C G + W + + GLVTGG A S+T CQP P C H + S+P C +
Sbjct: 147 CEGGFLGAAWNYWKQEGLVTGGLYNPSATESDT-CQPYPLPSCEH-HINGSKPACPSKIA 204
Query: 57 PQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFS 116
P+C C + Y + QD + + Y V VA+IQ EIM NGPV A +Y+
Sbjct: 205 KTPECVHTC-HAGYPTSYEQDLHYGESAYSVRRRVAEIQTEIMTNGPVEAAFTVYA---- 259
Query: 117 YKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
D +YKSGVY + ++ +A VK++GWGEE+G PY
Sbjct: 260 -------------------DFPAYKSGVYKRHSLRQLGGHA-VKMIGWGEEDGIPY---- 295
Query: 177 RVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
W I +++ +GD G KI+RG++E IES
Sbjct: 296 ------------------------------WLIANSWNSDWGDHGYFKIVRGQDECGIES 325
>gi|226469948|emb|CAX70255.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 69/237 (29%), Positives = 96/237 (40%), Gaps = 60/237 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + G+VTGG+ ++TGCQP FP C H + P C P+C
Sbjct: 159 CDGGFPGPAWDYWVSHGIVTGGSKENHTGCQPYPFPKCEHHS-IGKYPSCGDKIYKTPQC 217
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C Y + DK+ V + IQ EIM GPV A + ++ D +YKSG
Sbjct: 218 KRKCQK-GYTTPYEHDKHYGGISINVIKNESAIQNEIMMYGPVEAYLLIFEDFLNYKSG- 275
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ Y +G + V V+I+GWG EN
Sbjct: 276 ---------------IYRYTTGSF--------VGEHYVRIIGWGIEN------------- 299
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
G YW +T+ E +G+KG +I+RGRNE IES+V
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESVV 335
>gi|56754307|gb|AAW25341.1| unknown [Schistosoma japonicum]
Length = 309
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 67/243 (27%), Positives = 103/243 (42%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G++ +W + G+VTGG+ ++TGC+P FP C+H C P+C
Sbjct: 126 CDGGVTGYSWDYWVSHGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 184
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ Y V + IQ++IM +G V A + +Y D +YKSG
Sbjct: 185 KQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGTVEAYLEIYEDFLNYKSG- 242
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ Y +G + ++ V+++GWG EN
Sbjct: 243 ---------------IYRYTTG--------QFISGHAVRLIGWGVEN------------- 266
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
G YW +T+ E +G+KG +I+RGRNE +IES +
Sbjct: 267 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAG 305
Query: 242 LPK 244
L K
Sbjct: 306 LIK 308
>gi|332376204|gb|AEE63242.1| unknown [Dendroctonus ponderosae]
Length = 338
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 65/242 (26%), Positives = 94/242 (38%), Gaps = 60/242 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W + G+VTGGA++S+ GC+ S PC H S P+C +L P+C
Sbjct: 156 CDGGYVAEPWDYWRTDGIVTGGAYNSSQGCKDYSLEPCEHHVEVGSRPQCSSLNFDTPEC 215
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C + + + F + +Q EI+KNGP+ A +Y+
Sbjct: 216 VRSCYESSLD---YTESLTFGQQVSTFTNEKQMQLEILKNGPIEAAFTVYN--------- 263
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
D SYKSGVY +A E V +K++GWG E G Y
Sbjct: 264 --------------DFLSYKSGVYQATAQDESVGGHAIKVLGWGVEEGTKY--------- 300
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
W I +++ +GD G K LRG + IES +
Sbjct: 301 -------------------------WLIANSWNTDWGDNGYFKFLRGVDHCGIESETAAS 335
Query: 242 LP 243
LP
Sbjct: 336 LP 337
>gi|345308|pir||S31909 cathepsin B-like cysteine proteinase (EC 3.4.22.-) - fluke
(Schistosoma japonicum)
Length = 316
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 68/237 (28%), Positives = 97/237 (40%), Gaps = 60/237 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + G+VTGG+ ++TGCQP FP C H + P C P+C
Sbjct: 133 CDGGFPGPAWDYWVSHGIVTGGSKENHTGCQPYPFPKCEHHS-KGKYPSCGDKMYKTPQC 191
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C Y + DK+ V + IQ+EIM GPV A + ++ D +YKSG
Sbjct: 192 KRKCQK-GYKTPYEHDKHYGGISINVIKNESAIQKEIMMYGPVEAYLLIFEDFLNYKSG- 249
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ Y +G + V V+I+GWG EN
Sbjct: 250 ---------------IYRYTTGSF--------VGEHYVRIIGWGIEN------------- 273
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
G YW +T+ E +G+KG +I+RGRNE +ES+V
Sbjct: 274 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECSVESVV 309
>gi|328697984|ref|XP_003240502.1| PREDICTED: cathepsin B [Acyrthosiphon pisum]
Length = 339
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 72/248 (29%), Positives = 100/248 (40%), Gaps = 67/248 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G W + GLVTGG + S GC+P PPC TS P K
Sbjct: 156 CNGGYPIKAWKYFSSHGLVTGGNYKSGEGCEPYRVPPCPRNEDGTSS----CAGQPIEKN 211
Query: 62 HTRCTNDNYGRGF--FQDKYRFKR-YYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
H RCT YG + D +RF R YY++ IQ+++M GP+ A+ +Y D
Sbjct: 212 H-RCTRMCYGNQDLDYNDDHRFTRDYYYLT--YGSIQKDVMNYGPIEASFDVYDDF---- 264
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
+SYKSGVY + +A + VK++GWG E G PY
Sbjct: 265 -------------------YSYKSGVYQRTPNATKLGGHAVKLIGWGVEEGIPY------ 299
Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
W +V+++ Q+GD G KI RG +E I+S
Sbjct: 300 ----------------------------WLMVNSWSAQWGDNGLFKIRRGTDECGIDSAT 331
Query: 239 NGALPKDN 246
+P N
Sbjct: 332 TAGVPVTN 339
>gi|241154720|ref|XP_002407359.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
gi|215494103|gb|EEC03744.1| cathepsin B endopeptidase, putative [Ixodes scapularis]
Length = 337
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 66/243 (27%), Positives = 103/243 (42%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G ++ W +RG+V+GG + + GC+P S PC + + P C + P+C
Sbjct: 154 CKGGFPAAAWEHWKERGIVSGGLYGTPDGCKPYSLAPCEY-HTKCRIPNCIPIVH-TPEC 211
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + + +DK+ ++ Y ++ + IQ EI NGPV A+ ++Y D YKSG
Sbjct: 212 VHHCRK-GYDKDYQEDKHFGQKVYSISRDEKQIQTEIFTNGPVEADFHVYGDFLCYKSG- 269
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+Y + G++A ++I+GWG ENG P
Sbjct: 270 -----------VYQRHSNDGRGMHA------------IRILGWGTENGTP---------- 296
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW +++ E +GDKG KILR NE IE +
Sbjct: 297 ------------------------YWLAANSWNENWGDKGYFKILRRTNECGIEEHIYAG 332
Query: 242 LPK 244
+PK
Sbjct: 333 IPK 335
>gi|392920988|ref|NP_506011.2| Protein F57F5.1 [Caenorhabditis elegans]
gi|206994319|emb|CAB00098.2| Protein F57F5.1 [Caenorhabditis elegans]
Length = 351
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 72/243 (29%), Positives = 103/243 (42%), Gaps = 59/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G W K+G VTGG++ TGC+P +PPC H T C + P KC
Sbjct: 167 CNGGYPIEAWRHYVKKGYVTGGSYQDKTGCKPYPYPPCEHHVNGTHYKPCPSNMYPTDKC 226
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QD + + Y V+ + A+IQ+EIM +GPV +Y D F + SG
Sbjct: 227 ERSC-QAGYALTYQQDLHFGQSAYAVSKKAAEIQKEIMTHGPVEVAFTVYED-FEHYSG- 283
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
GVY +A A + +A VK++GWG +NG P
Sbjct: 284 ---------------------GVYVHTAGASLGGHA-VKMLGWGVDNGTP---------- 311
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW +++ E +G+ G +I+RG NE IE V G
Sbjct: 312 ------------------------YWLCANSWNEDWGENGYFRIIRGVNECGIEGGVVGG 347
Query: 242 LPK 244
+PK
Sbjct: 348 IPK 350
>gi|226473760|emb|CAX71565.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 69/243 (28%), Positives = 103/243 (42%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G +W + RG+VTGG+ ++TGC+P FP C+H C P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ C Y + QDK+ Y V + IQ++IM +GPV A + +Y D +YKSG
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 275
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ Y +G Y ++ V+++G G EN
Sbjct: 276 ---------------IYRYTTGKY--------ISGHAVRLIGCGVEN------------- 299
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
G YW +T+ E +G+KG +I+RGRNE +IES +
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECLIESEIAAG 338
Query: 242 LPK 244
L K
Sbjct: 339 LIK 341
>gi|226474178|emb|CAX71575.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 67/237 (28%), Positives = 101/237 (42%), Gaps = 60/237 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G +W + RG+VTGG+ ++TGC+P FP C+H C P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTGCRPYPFPKCDHF-VKGKYRACGDKLYKTPQC 217
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ C Y + QDK+ Y V + IQ++IM +GPV A + +Y D +YKSG
Sbjct: 218 NQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 275
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ Y +G + ++ V+++GWG EN
Sbjct: 276 ---------------IYRYTTG--------QFISGHAVRLIGWGVEN------------- 299
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
G YW +T+ E +G+KG +I+RGRNE IES +
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEI 335
>gi|204022096|dbj|BAG71145.1| cathepsin B-N1 [Tuberaphis sumatrana]
Length = 334
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 74/245 (30%), Positives = 100/245 (40%), Gaps = 67/245 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
CS G W K GLVTGG + S GCQP PPC Y + K P K
Sbjct: 154 CSGGNPIKAWERFQKHGLVTGGNYDSGEGCQPYKVPPCPLDEYGNNTCSGK----PAEKN 209
Query: 62 HTRCTNDNYGR---GFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
H RCT YG F +D + + Y++ IQ D+ +Y
Sbjct: 210 H-RCTRMCYGNQNLDFKEDHHYTRDAYYLT--YGTIQY----------------DVLAY- 249
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
GP+ A+ +Y D SYKSGVY +A + VK++GWGEE G PY
Sbjct: 250 ------GPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWGEEYGVPY------ 297
Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
W +V+++ +Q+GD+G KI RG NE I++
Sbjct: 298 ----------------------------WLLVNSWNDQWGDQGLFKIRRGTNECGIDNST 329
Query: 239 NGALP 243
G +P
Sbjct: 330 TGGVP 334
>gi|10803443|emb|CAC13134.1| putative cathepsin B.8 [Ostertagia ostertagi]
Length = 197
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 55/172 (31%), Positives = 74/172 (43%), Gaps = 24/172 (13%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W + K G+VTG H +N GC+P FP C H + T CK P PKC
Sbjct: 43 CNGGDPLSAWKFWVKEGIVTGSNHSTNAGCKPYPFPACEHHSNKTHYDPCKHDLFPTPKC 102
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C R + +DKY + Y V + + IQ+EI+ GPV +Y D +Y G
Sbjct: 103 EKSCQATFGERTYKEDKYFGRSAYGVKNHMEAIQKEIITYGPVEVAFEVYEDFLNYAGGI 162
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYW 173
Y V + VK++GWG +NG PYW
Sbjct: 163 Y------------------------VHQGGALGGGHAVKMIGWGIDNGVPYW 190
>gi|390994431|gb|AFM37365.1| cathepsin B2 [Dictyocaulus viviparus]
Length = 346
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 67/243 (27%), Positives = 97/243 (39%), Gaps = 59/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W + ++G+V+GG++ S +GC+P FPPC H T C P C
Sbjct: 162 CDGGFPYAAWNYWVEKGIVSGGSYTSKSGCKPYPFPPCEHHTNGTHYHPCPKDLYPTNTC 221
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C + Y + DK + Y V V IQ+EIM +GPV +Y D Y G
Sbjct: 222 EHKCQS-GYATAYTNDKRYGAKAYTVAARVKAIQKEIMLHGPVEVAYDVYEDFEHYLKG- 279
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ + +G Y + VK++GWG ENG P
Sbjct: 280 ---------------IYKHTAGSY--------LGGHAVKMIGWGTENGIP---------- 306
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW +++ +G+ G +ILRG +E IES V
Sbjct: 307 ------------------------YWICSNSWNSDWGENGFFRILRGTDECGIESGVVAG 342
Query: 242 LPK 244
LPK
Sbjct: 343 LPK 345
>gi|390357905|ref|XP_003729132.1| PREDICTED: cathepsin B-like [Strongylocentrotus purpuratus]
Length = 354
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 68/243 (27%), Positives = 100/243 (41%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W + G+VTGG +S+ GCQP C+H T P C+ P P+C
Sbjct: 173 CNGGFPGSAWEYYKDTGIVTGGQWNSSQGCQPYQIKSCDHHVNGTKGP-CQGEG-PTPEC 230
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C +Y + QDK+ +++ Q EIM NGPV A+ +Y D +YKSG
Sbjct: 231 KHKC-EASYSTPYEQDKHYALSVNSISNNPEATQTEIMTNGPVEADFTVYEDFPTYKSGV 289
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + ++ +KI+GWG E G
Sbjct: 290 YQH------------------------TTGGVLGGHAIKILGWGVEEG------------ 313
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW + +++ ++GD G KILRG NE IES +N
Sbjct: 314 ----------------------TKYWLVANSWNNEWGDNGFFKILRGSNECGIESDINFG 351
Query: 242 LPK 244
+PK
Sbjct: 352 IPK 354
>gi|198429088|ref|XP_002120307.1| PREDICTED: similar to cathepsin B [Ciona intestinalis]
Length = 364
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 70/243 (28%), Positives = 100/243 (41%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W + + GLVTGG + S TGC P PC H + P+C P C
Sbjct: 182 CNGGFPGSAWKYWNSDGLVTGGLYGSKTGCLPYQIKPCEH-HVPGDRPKCSE-GGGTPSC 239
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
++C N + QDK+ Y V + IQ EIM +GPV +Y+D +YKSG
Sbjct: 240 VSKCKG-NTTIHYNQDKHYGLSSYAVGSDPTQIQTEIMTHGPVEGAFTVYADFPTYKSGV 298
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + ++ ++I+GWG ENG
Sbjct: 299 YKH------------------------VTGGVLGGHAIRILGWGSENG------------ 322
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
VA YW + +++ +GDKG KILRG +E IES V
Sbjct: 323 ------VA----------------YWLVANSWNTDWGDKGYFKILRGSDECGIESSVVAG 360
Query: 242 LPK 244
+P+
Sbjct: 361 IPQ 363
>gi|204022108|dbj|BAG71151.1| cathepsin B-N [Cerataphis jamuritsu]
Length = 333
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 73/245 (29%), Positives = 101/245 (41%), Gaps = 67/245 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G W K GLVTGG + S GCQP PPC Y + K P K
Sbjct: 153 CNGGYPIRAWERFRKHGLVTGGNYDSYEGCQPYRVPPCPLDEYGNNTCHGK----PMEKN 208
Query: 62 HTRCTNDNYGR---GFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
H RCT YG F D + + Y++ IQ +D+ +Y
Sbjct: 209 H-RCTRMCYGDQDLDFNNDHHYTRDAYYLT--YGTIQ----------------NDVLTY- 248
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
GP+ A+ +Y D SYKSGVY + +A + VK++GWGEE G PY
Sbjct: 249 ------GPIEASFEVYDDFPSYKSGVYVKTENASYLGGHAVKLIGWGEEYGVPY------ 296
Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
W +V+++ +Q+GD+G KI RG NE I++
Sbjct: 297 ----------------------------WLLVNSWNDQWGDQGLFKIRRGTNECGIDNST 328
Query: 239 NGALP 243
G +P
Sbjct: 329 TGGVP 333
>gi|47217183|emb|CAG11019.1| unnamed protein product [Tetraodon nigroviridis]
Length = 351
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 77/264 (29%), Positives = 103/264 (39%), Gaps = 81/264 (30%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTG---------------------CQPVSFPPCN 40
C+ G SS W + GLV+GG + S+ G C+P + PPC
Sbjct: 148 CNGGYPSSAWNFWVSDGLVSGGLYDSHIGRIQVSLCVLLLAVDRDFVSPGCRPYTIPPCE 207
Query: 41 HANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMK 100
H + S P C P+C RC Y + QDK+ K Y V+ E +I+QEI K
Sbjct: 208 H-HVNGSRPSCSGEGGDTPECIFRC-EAGYSPSYKQDKHFGKTSYSVSSEEDEIKQEIYK 265
Query: 101 NGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVK 160
NGPV +Y D YKSG Y + VS SA + +K
Sbjct: 266 NGPVEGAFTVYEDFVLYKSGVYQH----------------------VSGSA--LGGHAIK 301
Query: 161 IVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDK 220
++GWGEENG P YW +++ +GD
Sbjct: 302 MLGWGEENGVP----------------------------------YWLCANSWNTDWGDN 327
Query: 221 GTIKILRGRNEAIIESLVNGALPK 244
G KILRG + IES + PK
Sbjct: 328 GFFKILRGADHCGIESEIVAGNPK 351
>gi|226474182|emb|CAX71577.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 342
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 68/243 (27%), Positives = 101/243 (41%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G +W + RG+VTGG+ ++T C+P FP C+H C P+C
Sbjct: 159 CDGGFLGPSWDYWVLRGIVTGGSKENHTSCRPYPFPKCDHF-VKGKYRACGDKLYETPQC 217
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + QDK+ Y V + IQ++IM +GPV A + +Y D +YKSG
Sbjct: 218 KQTCQK-GYNTSYEQDKHYGGFSYNVLSVESVIQKDIMMHGPVEAYLEIYEDFLNYKSG- 275
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ Y +G + ++ V+++GWG EN
Sbjct: 276 ---------------IYRYTTG--------QFISGHAVRLIGWGVEN------------- 299
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
G YW +T+ E +G+KG +I+RGRNE IES +
Sbjct: 300 ---------------------GTAYWLAANTWNEDWGEKGYFRIVRGRNECSIESEIAAG 338
Query: 242 LPK 244
L K
Sbjct: 339 LIK 341
>gi|313233819|emb|CBY09988.1| unnamed protein product [Oikopleura dioica]
Length = 356
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 70/242 (28%), Positives = 99/242 (40%), Gaps = 61/242 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + + GLV+GG +H TGCQP + PC H + P C PKC
Sbjct: 165 CNGGFPQAAWEYWVQNGLVSGGLYHG-TGCQPYAIEPCEH-HTEGDRPPCTGEEGTTPKC 222
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C D Y F QDK+ Y + I EI KNGPV +Y D +YKSG
Sbjct: 223 SHKCV-DGYTGNFAQDKHYGSVAYRIPANEKAIMNEIYKNGPVEGAFIVYEDFPTYKSG- 280
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
++S+ +G + ++++GWGEENG YW
Sbjct: 281 ---------------VYSHHTG--------SALGGHAIRVLGWGEENGEKYW-------- 309
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
L G +++ +G+ G KI RG NE IES + G
Sbjct: 310 -------------LCG-------------NSWNTDWGNNGFFKIKRGVNECGIESEMVGG 343
Query: 242 LP 243
+P
Sbjct: 344 IP 345
>gi|187097096|ref|NP_001119608.1| cathepsin B-348 precursor [Acyrthosiphon pisum]
gi|161343833|tpg|DAA06097.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 342
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 68/242 (28%), Positives = 100/242 (41%), Gaps = 60/242 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + +G+V+GG + SN GC P PC H T P CK P C
Sbjct: 160 CNGGFPGAAWNYWKTKGIVSGGPYGSNMGCIPYEIAPCEHHVNGTRGP-CKE-GGKTPTC 217
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C + Y + QD + K Y + ++V I+QEI NGPV +Y
Sbjct: 218 VKKC-EEGYKVPYAQDLHHGKSAYSIRNDVDQIRQEIYTNGPVEGAFTVY---------- 266
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
D +Y++GVY A + +A ++I+GWG +NG
Sbjct: 267 -------------EDFIAYRAGVYKHVAGKALGGHA-IRILGWGVQNG------------ 300
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
EI PYW + +++ +G G KILRG +E IE +N
Sbjct: 301 ----EI-----------------PYWLVANSWNTDWGSDGFFKILRGSDECGIEGQINAG 339
Query: 242 LP 243
LP
Sbjct: 340 LP 341
>gi|355681635|gb|AER96808.1| cathepsin B [Mustela putorius furo]
Length = 338
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 73/242 (30%), Positives = 102/242 (42%), Gaps = 61/242 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGFPAEAWNFWTXXGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + +DK+ Y V+ +I EI KNGPV A +YSD YKSG
Sbjct: 208 SKIC-EPGYTPSYKEDKHYGCSSYSVSSSEKEIMAEIYKNGPVEAAFSVYSDFLMYKSGV 266
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + E++ V+I+GWG ENG PYW
Sbjct: 267 YQH------------------------VTGEMMGGHAVRILGWGVENGTPYW-------- 294
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
L+G +++ +GD G KILRG++ IES +
Sbjct: 295 -------------LVG-------------NSWNTDWGDNGFFKILRGQDHCGIESEIVAG 328
Query: 242 LP 243
+P
Sbjct: 329 IP 330
>gi|112983908|ref|NP_001036850.1| cathepsin B precursor [Bombyx mori]
gi|13548667|dbj|BAB40804.1| cathepsin B [Bombyx mori]
Length = 337
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 71/237 (29%), Positives = 98/237 (41%), Gaps = 61/237 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
CS G+ W + GLV+GG+++S+ GC+P PPC H P C T PKC
Sbjct: 152 CSGGMPRLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMP-CSG-DTKTPKC 209
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C + Y + QDK K Y V+ + I+ E+ KNGPV +YS
Sbjct: 210 TKKCES-GYDVNYKQDKQYGKHVYTVSGDEDHIRAELFKNGPVEGAFTVYS--------- 259
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
D+ SYKSGVY + + +A VKI+GWG EN Y
Sbjct: 260 --------------DLLSYKSGVYKHTQGDALGGHA-VKILGWGVENDNKY--------- 295
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
W I +++ +GD G KILRG + IES +
Sbjct: 296 -------------------------WLIANSWNSDWGDNGFFKILRGEDHCGIESSI 327
>gi|118424551|gb|ABK90823.1| cathepsin B-like cysteine proteinase [Spodoptera exigua]
Length = 341
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 69/245 (28%), Positives = 103/245 (42%), Gaps = 67/245 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNH---ANYTTSEPECKTLATPQ 58
C+ G+ + W + GLV+GG+++S+ GC+P PPC H N + KT
Sbjct: 156 CNGGMPTLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCNGDSKT----- 210
Query: 59 PKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
PKCH C + +Y + +DK K Y V+ + I+ E+ KNGPV +YSD
Sbjct: 211 PKCHKTCES-SYNVDYHKDKRYGKHVYSVSSKEDHIKAELYKNGPVEGAFTVYSD----- 264
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
+ +YK+GVY + + +A +KI+GWG ENG Y
Sbjct: 265 ------------------LLNYKNGVYKHTVGNALGGHA-IKILGWGVENGNKY------ 299
Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
W I +++ +GD G KILRG + IES +
Sbjct: 300 ----------------------------WLIANSWNSDWGDNGFFKILRGEDHCGIESSI 331
Query: 239 NGALP 243
P
Sbjct: 332 VAGEP 336
>gi|308488594|ref|XP_003106491.1| hypothetical protein CRE_15919 [Caenorhabditis remanei]
gi|308253841|gb|EFO97793.1| hypothetical protein CRE_15919 [Caenorhabditis remanei]
Length = 342
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 59/243 (24%), Positives = 95/243 (39%), Gaps = 59/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G W + K G+ TGG++ S GC+P S PC + P C P P C
Sbjct: 157 CAGGNPLKAWQYWQKHGIPTGGSYESQFGCKPYSIAPCGKTIGNVTYPPCTNTTLPTPTC 216
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C Y +D++ + + +IQ ++M NGPV A M +Y D Y +G
Sbjct: 217 EKKC-KPGYPVDLDKDRHYGVSVDQLPNRQIEIQSDVMLNGPVEATMEIYDDFLQYTTGI 275
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y V + + +V+I+GWG G P
Sbjct: 276 Y------------------------VHLAGNKQGHLSVRILGWGMFEGVP---------- 301
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW + +++G+++G+ GT ++LRG NE +E+
Sbjct: 302 ------------------------YWLLANSWGKEWGENGTFRVLRGVNECGLEANCISG 337
Query: 242 LPK 244
+PK
Sbjct: 338 MPK 340
>gi|256090364|ref|XP_002581165.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
gi|353228444|emb|CCD74615.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 303
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 66/228 (28%), Positives = 94/228 (41%), Gaps = 74/228 (32%)
Query: 14 VHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPKCHTRCTNDNYGR 72
V G+VTG + +NTGC+P FP C H +T + P C + P+C T C Y
Sbjct: 145 VDLEGIVTGSSKENNTGCEPYPFPKCEH--FTKGQYPPCGSKIYKTPRCKTTC-QKRYKT 201
Query: 73 GFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMY 132
+ QDK+R IQ+EIMK GPV A+ +Y D +YKSG Y +
Sbjct: 202 SYAQDKHRA------------IQKEIMKYGPVEASFTVYEDFLNYKSGIYKH-------- 241
Query: 133 LYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYAT 192
+ E + ++I+GWG EN PY
Sbjct: 242 ----------------ITGETLGGHAIRIIGWGVENKTPY-------------------- 265
Query: 193 VKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
W I +++ E +G+ G +I+RGR+E IES V
Sbjct: 266 --------------WLIANSWNEDWGENGYFRIVRGRDECSIESEVTA 299
>gi|392922404|ref|NP_507186.3| Protein CPR-2 [Caenorhabditis elegans]
gi|206994217|emb|CAB04322.3| Protein CPR-2 [Caenorhabditis elegans]
Length = 326
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 75/243 (30%), Positives = 96/243 (39%), Gaps = 69/243 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W +RG+VTGG + TGC+P PCN N C L TP C
Sbjct: 153 CDGGFPYRAFQWWARRGVVTGG-DYLGTGCKPYPIRPCNSDN-------CVNLQTP--PC 202
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + DK Y V VA IQ +I NGPVVA +Y D YKSG
Sbjct: 203 RLSC-QPGYRTTYTNDKNYGNSAYPVPRTVAAIQADIYYNGPVVAAFIVYEDFEKYKSG- 260
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+Y I G +A VK++GWG E G P
Sbjct: 261 -----------IYRHIAGRSKGGHA------------VKLIGWGTERGTP---------- 287
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW V+++G Q+G+ GT +ILRG +E IES +
Sbjct: 288 ------------------------YWLAVNSWGSQWGESGTFRILRGVDECGIESRIVAG 323
Query: 242 LPK 244
LP+
Sbjct: 324 LPR 326
>gi|46195455|ref|NP_990702.1| cathepsin B precursor [Gallus gallus]
gi|1168790|sp|P43233.1|CATB_CHICK RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Contains:
RecName: Full=Cathepsin B light chain; Contains:
RecName: Full=Cathepsin B heavy chain; Flags: Precursor
gi|603203|gb|AAA87075.1| cathepsin B [Gallus gallus]
Length = 340
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 67/243 (27%), Positives = 94/243 (38%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W + +RGLV+GG + S+ GC+ + PPC H + S P C P+C
Sbjct: 150 CNGGYPSGAWRYWTERGLVSGGLYDSHVGCRAYTIPPCEH-HVNGSRPPCTGEGGETPRC 208
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + +DK+ Y V +I EI KNGPV +Y D YKSG
Sbjct: 209 SRHC-EPGYSPSYKEDKHYGITSYGVPRSEKEIMAEIYKNGPVEGAFIVYEDFLMYKSGV 267
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + S E V ++I+GWG ENG P
Sbjct: 268 YQH------------------------VSGEQVGGHAIRILGWGVENGTP---------- 293
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW +++ +G G KILRG + IES +
Sbjct: 294 ------------------------YWLAANSWNTDWGITGFFKILRGEDHCGIESEIVAG 329
Query: 242 LPK 244
+P+
Sbjct: 330 VPR 332
>gi|204022104|dbj|BAG71149.1| cathepsin B-N [Astegopteryx styracophila]
Length = 332
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 74/245 (30%), Positives = 102/245 (41%), Gaps = 67/245 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W K GLVTGG + S GCQP PC Y + C+ P K
Sbjct: 152 CHGGYPIKAWERFQKHGLVTGGDYDSGEGCQPYRVSPCPLDEYGNNT--CR--GKPAEKN 207
Query: 62 HTRCTNDNYGR---GFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
H RCT YG F +D + + Y++ + IQ+++M GP+ A SY
Sbjct: 208 H-RCTRMCYGNQDLDFKKDHHFTRDAYYLTFGI--IQRDVMAYGPIEA---------SYD 255
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
+Y D SYKSGVY + +A + VK++GWGEE G PY
Sbjct: 256 --------------VYDDFPSYKSGVYVRTENATYLGGHAVKLIGWGEEYGVPY------ 295
Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
W +V+++ +Q+GDKG KI RG NE I++
Sbjct: 296 ----------------------------WLMVNSWNDQWGDKGLFKIRRGTNECGIDNST 327
Query: 239 NGALP 243
G +P
Sbjct: 328 TGGVP 332
>gi|76576341|gb|ABA53864.1| cathepsin B-like cysteine protease 2 [Parelaphostrongylus tenuis]
Length = 344
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 69/237 (29%), Positives = 91/237 (38%), Gaps = 60/237 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G S W + + G+VTGG + + C+P PPC H T C +A P C
Sbjct: 163 CDGGYPISAWEYFVETGVVTGGLYGTKDSCRPYEIPPCGHHRNETFYGNCTQIAD-TPDC 221
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
T C Y + DK K Y + V IQ+EIM GPV A +Y
Sbjct: 222 VTTC-QAGYPISYDDDKTFGKDSYTIESSVTAIQKEIMTYGPVTAAFIVYE--------- 271
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
D F Y G+Y K V GEE G
Sbjct: 272 --------------DFFHYHRGIY--------------KHVSGGEEGGH----------- 292
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
V+++GWGEE G YW + +++ +G+ G +ILRG NE IE V
Sbjct: 293 ----------AVRILGWGEEKGTAYWLVANSWNTDWGENGYFRILRGSNECGIEENV 339
>gi|300176937|emb|CBK25506.2| unnamed protein product [Blastocystis hominis]
Length = 320
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 70/243 (28%), Positives = 101/243 (41%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W W H G+ TGG + S C FP C+H + P C P P+C
Sbjct: 138 CNGGWPSMAWSWFHSTGVTTGGEYGSKDWCNAYEFPKCDH-HVEGKYPPCGE-TQPTPEC 195
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C + Y + +DK+ F Y V V I+ E+M NGP+ + +Y D +YKSG
Sbjct: 196 VEKC-QEGYPVEYKKDKHFFGEAYHVPSNVEAIKTELMTNGPIEVDFSVYEDFMTYKSGI 254
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + VA YL G +A VK+VGWG E+G
Sbjct: 255 YQH---VAGKYL---------GGHA------------VKLVGWGVEDG------------ 278
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW I +++ E +G+ G +I+ G+NE IES
Sbjct: 279 ----------------------VEYWKIANSWNEDWGENGYFRIIAGKNECGIESDGVAG 316
Query: 242 LPK 244
+P+
Sbjct: 317 IPE 319
>gi|118429531|gb|ABK91813.1| cathepsin B-like cysteine proteinase precursor [Clonorchis
sinensis]
gi|358331549|dbj|GAA37857.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 343
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 71/242 (29%), Positives = 97/242 (40%), Gaps = 61/242 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
CS G + W + G+VTGG+ +GC+ FP C H + P C P P+C
Sbjct: 155 CSGGYPAVAWDYWGAHGIVTGGSKEDPSGCRSYPFPKCEH-HVQGHYPPCPHQYYPTPEC 213
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C D G + +DK R Y + I +EIM GPV A +F+
Sbjct: 214 VQHC--DTPGIDYVKDKTRANMSYNIYSSEILIMKEIMLRGPVEA-------VFT----- 259
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+Y D YK GVY S A + +A ++I+GWGEE PY
Sbjct: 260 -----------VYEDFLQYKFGVYFHSWGAPLSEHA-IRILGWGEEGDVPY--------- 298
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
W I +++ E +G+KG +K LRG NE IE V
Sbjct: 299 -------------------------WLIANSWNEDWGEKGYMKFLRGLNECGIEDDVTAG 333
Query: 242 LP 243
LP
Sbjct: 334 LP 335
>gi|226466652|emb|CAX69461.1| Cathepsin B-like cysteine proteinase precursor [Schistosoma
japonicum]
Length = 340
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 65/243 (26%), Positives = 101/243 (41%), Gaps = 59/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G V+ G+VTGG++ +GCQP P C++ + + +C P+C
Sbjct: 156 CFHGSEVEVLVYWITYGIVTGGSYEDQSGCQPYPLPKCSY-HPESRFLDCNNNTFEFPQC 214
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C D Y + + DK+ +R Y V DIQ+EI+ NGPV+A+
Sbjct: 215 TNEC-QDGYNKTYDDDKFYGERIYNVYGTQEDIQKEILMNGPVIAS-------------- 259
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+ + +D YKSGVY + + + + T++I+GWG E P
Sbjct: 260 ---------ISVNTDFLVYKSGVYLPTPRSRNLGWITLRIIGWGYEGKIP---------- 300
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW +++ E++GD G +KI RG IES V
Sbjct: 301 ------------------------YWLCANSWNEEWGDNGYVKIQRGVQAGYIESYVRAP 336
Query: 242 LPK 244
+PK
Sbjct: 337 IPK 339
>gi|204022090|dbj|BAG71142.1| cathepsin B-N3 [Tuberaphis styraci]
Length = 334
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 73/245 (29%), Positives = 100/245 (40%), Gaps = 67/245 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
CS G W K GLVTGG + S GCQP PPC Y + K P K
Sbjct: 154 CSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYKVPPCPLDEYGNNTCSGK----PAEKN 209
Query: 62 HTRCTNDNYGR---GFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
H RCT YG F +D + + Y++ IQ +D+ +Y
Sbjct: 210 H-RCTQMCYGNQNLDFKEDHHYTRDAYYLT--YGTIQ----------------NDVLAY- 249
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
GP+ A+ +Y D SYKSGVY +A + VK++GWGEE G PY
Sbjct: 250 ------GPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWGEEYGVPY------ 297
Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
W +V+++ +Q+GD+G KI RG NE ++
Sbjct: 298 ----------------------------WLLVNSWNDQWGDQGLFKIRRGTNECGTDNST 329
Query: 239 NGALP 243
G +P
Sbjct: 330 TGGVP 334
>gi|357613937|gb|EHJ68797.1| cathepsin B-like cysteine proteinase [Danaus plexippus]
Length = 334
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 70/242 (28%), Positives = 94/242 (38%), Gaps = 61/242 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ GI S W + G+V+GG ++S+ GC P PPC H P C T PKC
Sbjct: 151 CNGGIPSFAWEYWKHFGIVSGGNYNSSQGCLPYEIPPCEHHVPGNRIP-CNG-ETSTPKC 208
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
H C + Y + DK K Y V I+ EI KNGPV +Y+D+ +YKSG
Sbjct: 209 HRSCRKE-YTNSYKSDKKYGKHVYSVGGGEEHIKAEIFKNGPVEGAFTVYADLLTYKSGV 267
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + E + +KI+GWG ENG Y
Sbjct: 268 YKH------------------------TEGEALGGHAIKIMGWGVENGNKY--------- 294
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
W I +++ +GD G KILRG + IES +
Sbjct: 295 -------------------------WLIANSWNSDWGDNGFFKILRGEDHCGIESSIVAG 329
Query: 242 LP 243
P
Sbjct: 330 EP 331
>gi|157167368|ref|XP_001653891.1| cathepsin b [Aedes aegypti]
gi|108874250|gb|EAT38475.1| AAEL009642-PA [Aedes aegypti]
Length = 332
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 73/241 (30%), Positives = 104/241 (43%), Gaps = 69/241 (28%)
Query: 4 SGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHT 63
G S WV V GLV+G A++S GC+P F PC + + PE KT P C
Sbjct: 160 DGTSFQYWVDV---GLVSGAAYNSTDGCKPYPFKPCLYP-FVGCHPE-KT-----PSCTH 209
Query: 64 RCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYG 123
CT + Y + +DKY Y + ++ IQ EIM NGPV + +Y D++ YK+G
Sbjct: 210 HCT-EGYDGTYRRDKYYGSAAYKLPNDERMIQLEIMTNGPVESGFSVYQDLYLYKTG--- 265
Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
+Y + + G +A V+++GWG+E G PY
Sbjct: 266 ---------VYQHVVGREVGKHA------------VRLIGWGKERGVPY----------- 293
Query: 184 SAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
W I +++GE +G+ G K LRG N IES+V LP
Sbjct: 294 -----------------------WLIANSYGEDWGEHGYFKFLRGSNHLGIESVVIAGLP 330
Query: 244 K 244
K
Sbjct: 331 K 331
>gi|300835056|gb|ADK37857.1| putative cathepsin precursor [Sitobion avenae]
Length = 340
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 72/248 (29%), Positives = 101/248 (40%), Gaps = 67/248 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + K GLVTGG + S GC+P PPC + + P K
Sbjct: 157 CHGGYPIKAWKYFSKHGLVTGGNYKSGEGCEPYRVPPCPRDDKGNN----TCAGKPIEKN 212
Query: 62 HTRCTNDNYGRGF--FQDKYRFKR-YYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
H RCT YG + D +RF R +Y++ IQ+++M GP+ A+ +Y D
Sbjct: 213 H-RCTRMCYGDQDLDYNDDHRFTRDFYYLT--YGSIQKDVMTYGPIEASFDVYDDF---- 265
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
P SYKSGVY + +A + VK++GWG E G PY
Sbjct: 266 -------P------------SYKSGVYEKTENASYLGGHAVKLIGWGVEEGTPY------ 300
Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
W +V+++ Q+GDKG KI RG NE I++
Sbjct: 301 ----------------------------WLMVNSWNAQWGDKGLFKIRRGTNECGIDNST 332
Query: 239 NGALPKDN 246
+P N
Sbjct: 333 TAGVPVTN 340
>gi|227293|prf||1701299A cathepsin B
Length = 339
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 69/251 (27%), Positives = 105/251 (41%), Gaps = 62/251 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W + K+GLV+GG + S+ GC P + PPC H + S P C + +C
Sbjct: 150 CNGGYPSGAWNFWTKKGLVSGGYYDSHIGCLPYTIPPCEH-HVNGSRPPCTGEGDTR-RC 207
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ C Y + +DK+ Y V++ V I EI KNGPV ++SD +YKSG
Sbjct: 208 NKSC-EAGYSPSYKEDKHFGYTSYSVSNSVKKIMAEIYKNGPVEGAFTVFSDFLTYKSG- 265
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
++ +++G +++ ++I+ WG ENG PYW + +
Sbjct: 266 ---------------VYKHEAG--------DMMGGHAIRILVWGVENGVPYWAAANSWNL 302
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+GD G KILRG N IES +
Sbjct: 303 ----------------------------------DWGDNGFFKILRGENHCGIESEIVAG 328
Query: 242 LPK-DNYGVEF 251
+P+ D Y F
Sbjct: 329 IPRTDQYWGRF 339
>gi|294952611|ref|XP_002787376.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239902348|gb|EER19172.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 203
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 68/237 (28%), Positives = 100/237 (42%), Gaps = 66/237 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTG------GAHHSNTGCQPVSFPPCNHANYTTSE-PECKTL 54
C+ G ++ G+VTG G GC P F CNH SE P+CK +
Sbjct: 12 CNGGTFVEAMSFLEDYGVVTGNDFKPQGQLSEADGCWPYPFQKCNHVPTENSEYPKCKDV 71
Query: 55 A-TPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSD 113
A P P C T CTN Y + +D +R K + V ++ I+QEI NGPV + +Y D
Sbjct: 72 AHQPLPPCRTTCTNKAYKKSLKKDVHRAKSWRKVFNDAQSIKQEIFDNGPVFSAFKMYED 131
Query: 114 IFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYW 173
F Y YKSGVY V + E++++ VKI+GW
Sbjct: 132 -FRY----------------------YKSGVY-VPTTKEVLSFHLVKIIGW--------- 158
Query: 174 TIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 230
G ++ + YW ++++ E++GD G IK+ G+N
Sbjct: 159 -------------------------GADSVQEYWLAMNSWNEEWGDHGLIKMAFGKN 190
>gi|729283|sp|Q06544.1|CYSP3_OSTOS RecName: Full=Cathepsin B-like cysteine proteinase 3
gi|159952|gb|AAA29436.1| cathepsin B-like cysteine protease, partial [Ostertagia ostertagi]
Length = 174
Score = 90.9 bits (224), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 64/229 (27%), Positives = 92/229 (40%), Gaps = 60/229 (26%)
Query: 10 TWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDN 69
W + G+VTGG + C+P FPPC EC A PKC C
Sbjct: 1 AWQYFALEGVVTGGNYRKQGCCRPYEFPPCGRHGKEPYYGECYDTAK-TPKCQKTCQR-G 58
Query: 70 YGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVA 129
Y + + +DK+ K Y + + V IQ++IMKNGPVVA +Y D YKSG Y +
Sbjct: 59 YLKAYKEDKHFGKSAYRLPNNVKAIQRDIMKNGPVVAGFIVYEDFAHYKSGIYKH----- 113
Query: 130 NMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVA 189
+ + VKI+GWG+E G P
Sbjct: 114 -------------------TAGRMTGGHAVKIIGWGKEKGTP------------------ 136
Query: 190 YATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
YW I +++ + +G+KG +++RG N IE +V
Sbjct: 137 ----------------YWLIANSWHDDWGEKGFYRMIRGINNCRIEEMV 169
>gi|321461662|gb|EFX72692.1| hypothetical protein DAPPUDRAFT_308155 [Daphnia pulex]
Length = 379
Score = 90.9 bits (224), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 69/244 (28%), Positives = 96/244 (39%), Gaps = 60/244 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTS-EPECKTLATPQPK 60
C G W+ K G+VTGG++ S+ GCQ F PC S + +C +
Sbjct: 182 CKGGFPGGAWMHWSKHGIVTGGSYSSDYGCQKYQFFPCYQPRTKGSIKNKCPKTDNTLLE 241
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C C +Y + + QD Y + Y + ++ IQ EIM+NGPV AN
Sbjct: 242 CRETCRT-SYNKSYKQDLYYGESVYRIPNDARAIQLEIMENGPVQAN------------- 287
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
+ +Y D YK GVY + + Y VKI GWG E G PYW ++
Sbjct: 288 ----------LRIYEDFLHYKFGVYR-HVHGQGLEYHAVKIFGWGTEGGTPYWLAANPWS 336
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
+++G+ G KILRG N A IE V
Sbjct: 337 ----------------------------------KRWGNGGFFKILRGSNHAEIEDHVMA 362
Query: 241 ALPK 244
+PK
Sbjct: 363 GIPK 366
>gi|341900875|gb|EGT56810.1| hypothetical protein CAEBREN_32632 [Caenorhabditis brenneri]
Length = 287
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 59/244 (24%), Positives = 95/244 (38%), Gaps = 59/244 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W + K GL TGG++ S GC+P S PC + P C P P C
Sbjct: 100 CGGGNAFKAWQYWGKHGLPTGGSYESQFGCKPYSIAPCGKTVGNVTYPACTNTTLPTPSC 159
Query: 62 HTRCTNDN-YGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
+CT+ N Y +D++ + + +IQ ++M NGP+ +Y D Y +G
Sbjct: 160 EKKCTSKNGYPVDIDKDRHYGASVDQLPNRQIEIQSDVMLNGPIETTFEVYDDFLQYTTG 219
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
Y V + + +V+I+GWG G P
Sbjct: 220 IY------------------------VHLTGNKQGHLSVRILGWGMYEGVP--------- 246
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
YW + +++G+++G+ GT + LRG NE +E+
Sbjct: 247 -------------------------YWLLANSWGKEWGENGTFRALRGTNECGLEANCVS 281
Query: 241 ALPK 244
+PK
Sbjct: 282 GMPK 285
>gi|157058745|gb|ABV03130.1| cathepsin B-2744 [Sitobion avenae]
Length = 260
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 59/177 (33%), Positives = 77/177 (43%), Gaps = 32/177 (18%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK- 60
C G + W +G+VTGG SN GCQP PCNH + C +L Q
Sbjct: 96 CDGGSAFKAWELTMSKGIVTGGNFDSNEGCQPYKIRPCNHYG-NGNLKNCSSLRRTQMTV 154
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRYY---WVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
C +C N NY + D ++ Y W N V IQQEIM GPV A MY+Y + Y
Sbjct: 155 CREKCVNKNYKVKYEDDLHKTSIVYMTSWTN--VKQIQQEIMTYGPVTAFMYVYENFMGY 212
Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWG-EENGRPYW 173
K G Y S + E++ Y VK++GWG + +G YW
Sbjct: 213 KEGIYK------------------------STAGELIGYHHVKLIGWGVDGDGTEYW 245
>gi|984960|gb|AAC46878.1| cathepsin B proteinase, partial [Ancylostoma caninum]
Length = 340
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 68/232 (29%), Positives = 98/232 (42%), Gaps = 62/232 (26%)
Query: 13 WVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPKCHTRCTNDNYG 71
W+ + +VTGG + C+P +F PC NH N P C P PKC C Y
Sbjct: 167 WMQRSVVVTGGKYRQKDVCKPYAFYPCGNHTNERYYGP-CPRGLWPTPKCRKACQR-KYN 224
Query: 72 RGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANM 131
+ + +DKY R Y++ I++EI Y NGPVVA
Sbjct: 225 KSYNEDKYFATRSYYLPSNERSIREEI-----------------------YKNGPVVAAF 261
Query: 132 YLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYA 191
+Y D Y+ G+Y + WG + G A+A
Sbjct: 262 KVYQDFSYYRGGIY---------------VHKWGGQTG-------------------AHA 287
Query: 192 TVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE-SLVNGAL 242
VK++GWG ENG YW I +++ +G+ G +I RG NE IE +V+G +
Sbjct: 288 -VKVVGWGRENGTDYWLIANSWNTDWGENGYFRIARGSNECGIEGQMVSGVM 338
>gi|339242313|ref|XP_003377082.1| Gut-specific cysteine proteinase [Trichinella spiralis]
gi|316974149|gb|EFV57673.1| Gut-specific cysteine proteinase [Trichinella spiralis]
Length = 517
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 70/244 (28%), Positives = 100/244 (40%), Gaps = 69/244 (28%)
Query: 1 VCSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 60
+ + G+ S + + K G+ TGG + + CQP S PC+ +YT S P CK
Sbjct: 338 ILACGMIPSPFNYWKKMGIATGGPYGDKSCCQPYSIAPCSKCSYTASTPSCK-------- 389
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C D Y DK+ +Y V+ +I EI +GPVVA +Y D F+Y
Sbjct: 390 --YDCQAD-YDIPISDDKFYASEHYHVSSNQYEIMNEIYTHGPVVAGFIVYED-FTY--- 442
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
Y SG+Y + + +A ++I+GWGEENG PY
Sbjct: 443 -------------------YISGIYQQTTYVAMGGHA-IRIIGWGEENGIPY-------- 474
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
W I +++ FG+KG +I RG NE IES V
Sbjct: 475 --------------------------WLIANSWNTTFGEKGFFRIRRGTNECRIESEVYT 508
Query: 241 ALPK 244
+PK
Sbjct: 509 GIPK 512
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 46/124 (37%), Positives = 61/124 (49%), Gaps = 11/124 (8%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C SG + +++ + GLVTGG + C P S PC T P LA PKC
Sbjct: 70 CRSGKIEAAFIYWQRSGLVTGGPYGEKACCLPYSISPC-----TMCRP--YMLA---PKC 119
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C +Y +DKY K +Y+VN + DI QEI + GPVVA +Y D Y SG+
Sbjct: 120 QRTC-QASYNLSLKRDKYYGKSHYYVNQDEFDIMQEIYQRGPVVAGFKVYHDFLYYISGQ 178
Query: 122 YGNG 125
+ G
Sbjct: 179 FICG 182
>gi|294883442|ref|XP_002770942.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
gi|239874068|gb|EER02758.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
Length = 393
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 57/181 (31%), Positives = 86/181 (47%), Gaps = 29/181 (16%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G S W W + G+V+ ++GC P +FP C+H T CK +P P C
Sbjct: 202 CDGGQPDSAWRWFSEHGVVS----ELDSGCWPYNFPECSHHVETKGMEPCKG-NSPSPVC 256
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
T C N ++ F D++ + + DEV +I++EI+ NGPV A +Y D
Sbjct: 257 STTCRNHHFKPSFESDRHFTEDEGYSLDEVDEIKKEIIDNGPVAAAFTVYED-------- 308
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+LY YKSGVY +E+ +A VKI+GWG + YW ++ + V
Sbjct: 309 ----------FLY-----YKSGVYKHVNGSELGGHA-VKIIGWGTDQNEQYWLVMNSWNV 352
Query: 182 S 182
+
Sbjct: 353 N 353
>gi|56462338|gb|AAV91452.1| cysteine peptidase 2 cathepsin-B-like [Lonomia obliqua]
Length = 338
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 71/237 (29%), Positives = 100/237 (42%), Gaps = 61/237 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G+ + W + G+V+GG+++S GC P PPC H P C T PKC
Sbjct: 153 CNGGMPTLAWEYWKHAGIVSGGSYNSTQGCIPYEVPPCEHHVPGNRLP-CNG-DTKTPKC 210
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y F +DK+ K Y V+ +I+ E+ KNGPV G
Sbjct: 211 QKTC-EAGYNVPFKKDKHYGKHVYSVSGNEDNIKAELFKNGPV--------------EGA 255
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+ +YSD+ SYKSGVY + + + +A VKI+GWG ENG Y
Sbjct: 256 F---------TVYSDLLSYKSGVYQHTDGSALGGHA-VKILGWGVENGSKY--------- 296
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
W I +++ +GD G KILRG + IES +
Sbjct: 297 -------------------------WLIANSWNSDWGDNGFFKILRGEDHCGIESSI 328
>gi|51947600|gb|AAU14266.1| cathepsin B-N [Myzus persicae]
Length = 338
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 71/248 (28%), Positives = 104/248 (41%), Gaps = 67/248 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G W K+GLVTGG + S GC+P PPC + + + T A +
Sbjct: 155 CNGGYPIKAWKRFSKKGLVTGGDYKSGEGCEPYRVPPCPNDDQGNN-----TCAGKPMES 209
Query: 62 HTRCTNDNYGRGF--FQDKYRFKR-YYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
+ RCT YG F + +R+ R YY++ IQ+++M GP+ A+ +Y D
Sbjct: 210 NHRCTRMCYGDQDLDFDEDHRYTRDYYYLT--YGSIQKDVMTYGPIEASFDVYDDF---- 263
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
P SYKSGVY S +A + VK++GWGEE G PY
Sbjct: 264 -------P------------SYKSGVYVKSENASYLGGHAVKLIGWGEEYGVPY------ 298
Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
W +V+++ E +GD G KI RG NE +++
Sbjct: 299 ----------------------------WLMVNSWNEDWGDHGFFKIQRGTNECGVDNST 330
Query: 239 NGALPKDN 246
+P N
Sbjct: 331 TAGVPVTN 338
>gi|183988832|gb|ACC66065.1| cathepsin B [Antheraea assama]
Length = 287
Score = 90.5 bits (223), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 70/235 (29%), Positives = 99/235 (42%), Gaps = 61/235 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G+ + W + GLV+GG ++S+ GC+P PPC H P C T PKC
Sbjct: 113 CNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMP-CNG-DTKTPKC 170
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C + +Y F +DK K Y V+ +I+ E+ KNGPV +YS
Sbjct: 171 EKTCES-SYTVPFKKDKRYGKHVYSVSGHEDNIKAELFKNGPVEGAFTVYS--------- 220
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
D+ SYKSGVY + + +A +KI+GWG ENG Y
Sbjct: 221 --------------DLLSYKSGVYQHTHGNALGGHA-IKILGWGVENGSKY--------- 256
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
W I +++ +GD G +KILRG + IES
Sbjct: 257 -------------------------WLIANSWNSDWGDNGFLKILRGEDHCGIES 286
>gi|204022077|dbj|BAG71136.1| cathepsin B-S1 [Tuberaphis sumatrana]
gi|204022079|dbj|BAG71137.1| cathepsin B-S2 [Tuberaphis sumatrana]
Length = 334
Score = 90.5 bits (223), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 67/242 (27%), Positives = 103/242 (42%), Gaps = 64/242 (26%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + +G+ TGG + + GC+P PC + + P +
Sbjct: 154 CEGGYPIKAWRYFRTQGVTTGGDYDTKEGCKPYKVAPCYNKQGKNT-----CGGKPMERN 208
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
H +C YG+ Q +Y+ K Y +N + I+Q DI +Y
Sbjct: 209 H-QCPKTCYGKTTDQKRYKTKSEYVINS-IKTIEQ----------------DIKTY---- 246
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
GPV A+ +Y D YKSG+Y + +A+ +NG
Sbjct: 247 ---GPVEASFDVYDDFSVYKSGIYRKTPNAKY-------------QNGH----------- 279
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+VK+IGWG+ENG PYW V+++ + +GD GT KI++G+NE IE V
Sbjct: 280 ----------SVKIIGWGQENGTPYWLAVNSWSKFWGDHGTFKIIKGKNECGIERAVTAG 329
Query: 242 LP 243
+P
Sbjct: 330 IP 331
>gi|7537454|gb|AAF35867.2| cathepsin B-like cysteine proteinase [Helicoverpa armigera]
Length = 338
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 69/245 (28%), Positives = 102/245 (41%), Gaps = 67/245 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNH---ANYTTSEPECKTLATPQ 58
C+ G+ + W + GLV+GG+++S+ GC+P PPC H N + KT
Sbjct: 153 CNGGMPTLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCNGDSKT----- 207
Query: 59 PKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
PKC C + NY + +DK K + V+ + I+ E+ KNGPV +YSD
Sbjct: 208 PKCEKTCES-NYNVDYRKDKRYGKHVFSVSSKEDHIRAELFKNGPVEGAFTVYSD----- 261
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
+ +YK+GVY + + +A VKI+GWG ENG Y
Sbjct: 262 ------------------LLNYKTGVYKHTIGDALGGHA-VKILGWGVENGNKY------ 296
Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
W I +++ +GD G KILRG + IES +
Sbjct: 297 ----------------------------WLIANSWNSDWGDNGFFKILRGEDHCGIESSI 328
Query: 239 NGALP 243
P
Sbjct: 329 VAGEP 333
>gi|119887749|gb|ABM05925.1| cathepsin B-like cysteine proteinase [Helicoverpa assulta]
Length = 338
Score = 90.1 bits (222), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 69/245 (28%), Positives = 102/245 (41%), Gaps = 67/245 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNH---ANYTTSEPECKTLATPQ 58
C+ G+ + W + GLV+GG+++S+ GC+P PPC H N + KT
Sbjct: 153 CNGGMPTLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRMPCNGDSKT----- 207
Query: 59 PKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
PKC C + NY + +DK K + V+ + I+ E+ KNGPV +YSD
Sbjct: 208 PKCEKTCES-NYNVDYRKDKRYGKHVFSVSSKEDHIRAELFKNGPVEGAFTVYSD----- 261
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
+ +YK+GVY + + +A VKI+GWG ENG Y
Sbjct: 262 ------------------LLNYKTGVYKHTIGDALGGHA-VKILGWGVENGNKY------ 296
Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
W I +++ +GD G KILRG + IES +
Sbjct: 297 ----------------------------WLIANSWNSDWGDNGFFKILRGEDHCGIESSI 328
Query: 239 NGALP 243
P
Sbjct: 329 VAGEP 333
>gi|132566367|gb|ABO34080.1| cathepsin B5 [Clonorchis sinensis]
Length = 343
Score = 90.1 bits (222), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 69/242 (28%), Positives = 96/242 (39%), Gaps = 61/242 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W + G+VTGG+ +GC+ FP C H + P C P P+C
Sbjct: 155 CRGGYPAVAWDYWKTHGIVTGGSKEDPSGCRSYPFPKCEH-HVQGHYPPCPRELYPTPEC 213
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C D G+ +DK R Y + I +EIM GPV A IF+
Sbjct: 214 VQQC--DTPDVGYLEDKTRANMSYNIYASEISIMKEIMLRGPVEA-------IFT----- 259
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+Y D Y SGVY + A + +A V+I+GWGE PY
Sbjct: 260 -----------MYEDFLRYSSGVYFHALGAPMSGHA-VRILGWGELGNVPY--------- 298
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
W I +++ E +G++G +K LRG NE IE V
Sbjct: 299 -------------------------WLIANSWNEDWGEEGYMKFLRGYNECGIEDDVTAG 333
Query: 242 LP 243
LP
Sbjct: 334 LP 335
>gi|268572243|ref|XP_002648913.1| Hypothetical protein CBG17826 [Caenorhabditis briggsae]
Length = 323
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 70/243 (28%), Positives = 96/243 (39%), Gaps = 71/243 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W + RG+VTGG +GC+P F PC S PE KT P C
Sbjct: 151 CKGGYPIQAFRWWNSRGVVTGG-DFRGSGCRPYPFAPC------ISCPEEKT-----PTC 198
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + +DK Y V VA IQ EIM NGPVV +Y D++ YKSG
Sbjct: 199 SLSC-QFGYSTAYAKDKRFGVSAYAVARNVAAIQTEIMTNGPVVGAFTMYEDMYKYKSGV 257
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + ++ +KI+GWG +NG PY
Sbjct: 258 YRH------------------------TAGRLLGGHAIKIIGWGTQNGIPY--------- 284
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
W I +++G +G+ G +K+ RG NE IE V
Sbjct: 285 -------------------------WLIANSWGANWGENGFLKMRRGVNECGIERAVVAG 319
Query: 242 LPK 244
+P+
Sbjct: 320 MPR 322
>gi|239938584|gb|ACS36091.1| cysteine proteinase [Haemonchus contortus]
Length = 346
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 61/159 (38%), Positives = 75/159 (47%), Gaps = 26/159 (16%)
Query: 18 GLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQD 77
G VTGG + S GC+P F PC H T EC A PKC RC +Y + ++ D
Sbjct: 179 GAVTGGDYGSKDGCRPYPFHPCGHHGNDTYYGECPKGAK-TPKCRRRCQR-SYKKAYYMD 236
Query: 78 KYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDI 137
K + Y V V IQ+EIMKNGPVV +Y D FSY
Sbjct: 237 KSYGEDAYEVPHSVKAIQREIMKNGPVVGAFTVYED-FSY-------------------- 275
Query: 138 FSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
YK G+Y +A +A +KI+GWG EN PYW I
Sbjct: 276 --YKKGIYKHTAGQARGGHA-IKIIGWGVENDVPYWLIA 311
>gi|239938582|gb|ACS36090.1| cysteine proteinase [Haemonchus contortus]
Length = 346
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 61/158 (38%), Positives = 75/158 (47%), Gaps = 26/158 (16%)
Query: 18 GLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQD 77
G VTGG + S GC+P F PC H T EC A PKC RC +Y + ++ D
Sbjct: 179 GAVTGGDYGSKDGCRPYPFHPCGHHGNDTYYGECPKGAK-TPKCRRRCQR-SYKKAYYMD 236
Query: 78 KYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDI 137
K + Y V V IQ+EIMKNGPVV +Y D FSY
Sbjct: 237 KSYGEDAYEVPHSVKAIQREIMKNGPVVGAFTVYED-FSY-------------------- 275
Query: 138 FSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
YK G+Y +A +A +KI+GWG EN PYW I
Sbjct: 276 --YKKGIYKHTAGQARGGHA-IKIIGWGVENDVPYWLI 310
>gi|268557292|ref|XP_002636635.1| C. briggsae CBR-CPR-1 protein [Caenorhabditis briggsae]
Length = 330
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 70/237 (29%), Positives = 93/237 (39%), Gaps = 69/237 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W +G+VTGG +H GC+P PC N PE KT P C
Sbjct: 156 CEGGYPIQALRWWDSKGVVTGGDYH-GAGCKPYPIAPCTSGNC----PESKT-----PAC 205
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + +DK+ Y V VA IQ EIM NGPV A +Y D
Sbjct: 206 SLSC-QSGYSTAYAKDKHFGASAYAVARSVAAIQTEIMTNGPVEAAFTVYED-------- 256
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+ YKSGVY +A + +A +KI+GWG E+G P
Sbjct: 257 ---------------FYKYKSGVYKHTAGKALGGHA-IKIIGWGTESGSP---------- 290
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
YW + +++G +G+ G KILRG ++ IE V
Sbjct: 291 ------------------------YWLVANSWGTNWGESGFFKILRGDDQCGIEGAV 323
>gi|187104114|ref|NP_001119617.1| cathepsin B-16A precursor [Acyrthosiphon pisum]
gi|161343835|tpg|DAA06098.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 340
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 71/248 (28%), Positives = 100/248 (40%), Gaps = 67/248 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G W + G+VTGG + S GC+P PPC E + P K
Sbjct: 157 CNGGYPIKAWKYFSSHGIVTGGNYKSGEGCEPYRVPPCPQ----DEEGKSSCAGKPIEKN 212
Query: 62 HTRCTNDNYGRGF--FQDKYRFKR-YYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
H RCT YG + D +RF R YY++ IQ+++M GP+ A+ +Y D
Sbjct: 213 H-RCTRMCYGNQDLDYNDDHRFTRDYYYLT--YGSIQKDVMNYGPIEASFDVYDDF---- 265
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
P SYKSGVY + +A + VK++GWG E G PY
Sbjct: 266 -------P------------SYKSGVYQRTPNATKLGGHAVKLIGWGVEEGTPY------ 300
Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
W +V+++ Q+GD G KI RG +E I+S
Sbjct: 301 ----------------------------WLMVNSWNAQWGDNGLFKIRRGTDECGIDSAA 332
Query: 239 NGALPKDN 246
+P N
Sbjct: 333 TAGVPVTN 340
>gi|54289256|gb|AAV31918.1| putative vitellogenic cathepsin B [Aedes aegypti]
Length = 332
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 72/241 (29%), Positives = 104/241 (43%), Gaps = 69/241 (28%)
Query: 4 SGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHT 63
G S WV V GLV+G A+++ GC+P F PC + + PE KT P C
Sbjct: 160 DGTSFQYWVDV---GLVSGAAYNNTDGCKPYPFKPCLYP-FVGCHPE-KT-----PSCTH 209
Query: 64 RCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYG 123
CT + Y + +DKY Y + ++ IQ EIM NGPV + +Y D++ YK+G
Sbjct: 210 HCT-EGYDGTYRRDKYYGSAAYKLPNDERMIQLEIMTNGPVESGFSVYQDLYLYKTG--- 265
Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
+Y + + G +A V+++GWG+E G PY
Sbjct: 266 ---------VYQHVVGREVGKHA------------VRLIGWGKERGVPY----------- 293
Query: 184 SAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
W I +++GE +G+ G K LRG N IES+V LP
Sbjct: 294 -----------------------WLIANSYGEDWGEHGYFKFLRGSNHLGIESVVIAGLP 330
Query: 244 K 244
K
Sbjct: 331 K 331
>gi|10803452|emb|CAB97365.2| putative cathepsin B.2 [Ostertagia ostertagi]
Length = 194
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 58/175 (33%), Positives = 77/175 (44%), Gaps = 26/175 (14%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + G+VTGG + C+P FPPC EC A PKC
Sbjct: 43 CEGGWPMKAWQYFXLEGVVTGGNYRKQGCCRPYEFPPCGRHGKEPYYGECYDSAK-TPKC 101
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + + +DK+ K Y + + V IQ++IMKNGPVVA +Y D YKSG
Sbjct: 102 QKTCQR-GYLKPYKEDKHFGKSAYRLPNNVKAIQRDIMKNGPVVAGFIVYEDFAHYKSG- 159
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
I+ + +G + VKI+GWG+E G PYW I
Sbjct: 160 ---------------IYKHTAG--------RMTGGHAVKIIGWGKEXGTPYWLIA 191
>gi|323147412|gb|ADX32985.1| cathepsin B [Pinctada fucata]
Length = 366
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 71/243 (29%), Positives = 98/243 (40%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W + + GLVTGG ++S+ GCQP + C+H +P C P C
Sbjct: 183 CNGGFLSGAWEYYKRDGLVTGGQYNSHQGCQPYTVKACDHHVVGKLQP-CSKKEEHTPVC 241
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C + Y + +DK+ Y V V I EIM NGPV +Y+
Sbjct: 242 KHECES-GYNVSYTKDKHYGATAYSVRG-VQQIMTEIMTNGPVEGAFTVYA--------- 290
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
D YKSGVY + + + +A +KI+GWG E G YW + +
Sbjct: 291 --------------DFPQYKSGVYKHTTGSPLGGHA-IKIMGWGTEGGDDYWLVANSW-- 333
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
P W G++GT KILRGR+E IES +
Sbjct: 334 ----------------------NPDW----------GNQGTFKILRGRDECGIESQIAAG 361
Query: 242 LPK 244
PK
Sbjct: 362 EPK 364
>gi|350535627|ref|NP_001233013.1| uncharacterized protein LOC100164982 precursor [Acyrthosiphon
pisum]
gi|239789514|dbj|BAH71377.1| ACYPI005957 [Acyrthosiphon pisum]
Length = 339
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 70/248 (28%), Positives = 101/248 (40%), Gaps = 67/248 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G W + GLVTGG + S GC+P PPC + + P+ K
Sbjct: 156 CNGGYPIKAWKYFSTHGLVTGGNYKSGKGCEPYRVPPCPR----NEDGKSSCAGKPKEKN 211
Query: 62 HTRCTNDNYGRGF--FQDKYRFKR-YYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
H RCT YG + D +RF R +Y++ IQ+ D+ +Y
Sbjct: 212 H-RCTRMCYGNQDLDYDDDHRFTRDFYYLT--YGSIQK----------------DVLNY- 251
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
GP+ A+ +Y D SYKSGVY + +A + VK++GWG E G PY
Sbjct: 252 ------GPIEASFDVYDDFPSYKSGVYQRTPNATKLGGHAVKLIGWGVEEGTPY------ 299
Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
W +V+++ Q+GD G KI RG +E I+S
Sbjct: 300 ----------------------------WLMVNSWNAQWGDNGLFKIRRGTDECRIDSAT 331
Query: 239 NGALPKDN 246
+P N
Sbjct: 332 TAGVPVTN 339
>gi|170787211|gb|ACB38229.1| cathepsin B [Meretrix meretrix]
Length = 337
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 71/242 (29%), Positives = 105/242 (43%), Gaps = 63/242 (26%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G W ++ + G+VTGG ++S+ GC P C+H +P CK P P+C
Sbjct: 155 CNGGFLEGAWNYLKRDGIVTGGPYNSHQGCLPYEIKACDHHVVGKLQP-CKGDG-PTPRC 212
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C + Y + +D++ K + V + V I EIM NGPV A +YSD +YKSG
Sbjct: 213 KKECES-GYNNTYSKDEHHAKTVHAV-EGVEQIMTEIMTNGPVEAAFTVYSDFPTYKSG- 269
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
++ +KSG + +A +K +GWG E+G+ YW + +
Sbjct: 270 ---------------VYEHKSG-------GPLGGHA-IKTLGWGNEDGKDYWLVANSW-- 304
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES-LVNG 240
P W GD G KILRGR+E IES +V G
Sbjct: 305 ----------------------NPDW----------GDNGFFKILRGRDECGIESNIVAG 332
Query: 241 AL 242
+
Sbjct: 333 MM 334
>gi|170028910|ref|XP_001842337.1| cathepsin L [Culex quinquefasciatus]
gi|167879387|gb|EDS42770.1| cathepsin L [Culex quinquefasciatus]
Length = 334
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 68/246 (27%), Positives = 101/246 (41%), Gaps = 67/246 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEP---ECKTLATPQ 58
C+ G + W + ++GLV+GG + S+ GCQP + PC H T P E KT
Sbjct: 152 CNGGFPGAAWSYWVRKGLVSGGPYGSDQGCQPYAISPCEHHVNGTRGPCNGEGKT----- 206
Query: 59 PKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
PKC +C +Y + +DK+ K Y + IQ+E+ NGPV +Y
Sbjct: 207 PKCVKKC-QASYNVPYAKDKFFGKSSYSIASHEQQIQKELFTNGPVEGAFTVY------- 258
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
D+ +YK GVY +A + +A ++I+GWG E
Sbjct: 259 ----------------EDLLNYKEGVYQHTAGKMLGGHA-IRILGWGVE----------- 290
Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
N +W I +++ +GD G KILRG + IES +
Sbjct: 291 -----------------------NDTKFWLIANSWNSDWGDNGYFKILRGSDHLGIESSI 327
Query: 239 NGALPK 244
LPK
Sbjct: 328 AAGLPK 333
>gi|28932700|gb|AAO60044.1| midgut cysteine proteinase 1 [Rhipicephalus appendiculatus]
Length = 332
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 66/217 (30%), Positives = 94/217 (43%), Gaps = 68/217 (31%)
Query: 30 GCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVND 89
GCQP S PPC P C T P PKC C Y + + +DK+ K Y +
Sbjct: 183 GCQPYSLPPC--------VPNC-THPEPTPKCQHVC-RKGYEKSYEEDKHFAKNVYRLLK 232
Query: 90 EVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSA 149
+ I+ +I KNGPV + ++Y+D SYKSG Y +M + GV+A
Sbjct: 233 KCDAIKTDIYKNGPVESAFFVYADFPSYKSGVYQQ-----HMIKF-------MGVHA--- 277
Query: 150 SAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTI 209
+KI+GWG E+G PYW + + V GW
Sbjct: 278 ---------IKILGWGTEDGVPYWLVANSWNV---------------GW----------- 302
Query: 210 VSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDN 246
GDKG KILRG++E IE +++ +P ++
Sbjct: 303 --------GDKGYFKILRGKDECGIEEVIDAGIPMED 331
>gi|389608541|dbj|BAM17880.1| cathepsin B [Papilio xuthus]
Length = 334
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 68/245 (27%), Positives = 99/245 (40%), Gaps = 61/245 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G+ + W + GLV+GG+++S+ GC+P PPC H P + T PKC
Sbjct: 151 CNGGMPTLAWEYWKHFGLVSGGSYNSSQGCRPYEIPPCEHHVPGNRLP--CSGDTKTPKC 208
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C + Y + QDK+ K Y V I+ E+ KNGPV +Y+D+ SYKSG
Sbjct: 209 VKECES-GYKVPYKQDKHYGKHVYSVRGGEDHIKAELYKNGPVEGAFTVYADLLSYKSGV 267
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + + + +KI+GWG ENG Y
Sbjct: 268 YKH------------------------VTGDALGGHAIKIMGWGVENGNKY--------- 294
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
W I +++ +GD G KILRG + IES +
Sbjct: 295 -------------------------WLIANSWNSDWGDNGFFKILRGEDHCGIESSIVAG 329
Query: 242 LPKDN 246
P N
Sbjct: 330 EPLFN 334
>gi|294891881|ref|XP_002773785.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239878989|gb|EER05601.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 455
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 66/220 (30%), Positives = 91/220 (41%), Gaps = 66/220 (30%)
Query: 13 WVHKRGLVTGGAHH------SNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPKCHTRC 65
++ GLVTGG + ++ GC P FP CNH S+ P C + P C T C
Sbjct: 231 FMKNHGLVTGGEYKPPEELGNDDGCWPYPFPKCNHVPGLESKYPRCAQVRD-LPACATTC 289
Query: 66 TNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNG 125
N YG +D +R K + + I+QEI + NG
Sbjct: 290 PNKAYGTSMQKDTHRAKSWGRLPIGPEKIKQEI-----------------------FDNG 326
Query: 126 PVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASA 185
PV A M LY D YKSGVY V +
Sbjct: 327 PVAAMMTLYEDFRFYKSGVY-----------------------------------VHKTG 351
Query: 186 EIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKI 225
+++A T+KLIGWG E+G+ YW V+ + E++GD G IK+
Sbjct: 352 QMLAAHTLKLIGWGVESGQEYWLAVNAWNEEWGDHGMIKL 391
>gi|107921773|gb|ABF85678.1| cathepsin B1 [Fasciola hepatica]
Length = 278
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 58/172 (33%), Positives = 76/172 (44%), Gaps = 25/172 (14%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W + + G+VTGG + TGC P FP C H + C P P C
Sbjct: 132 CQGGSPPAAWDYWWRNGIVTGGTLENPTGCLPYPFPQCRHPGSRSQLNPCPGYIYPTPSC 191
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ C Y + + +DK K Y V+ I QEIMKNGPV A +Y+D YKSG
Sbjct: 192 YPYC-QAGYDKTYEEDKVYGKTSYNVDRHEYTIMQEIMKNGPVEAGFIVYTDFAVYKSG- 249
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYW 173
I+ + SG YA ++I+GWG ENG YW
Sbjct: 250 ---------------IYHHVSGRYA--------GKHAIRIIGWGVENGVNYW 278
>gi|29840882|gb|AAP05883.1| similar to GenBank Accession Number X70968 cathepsin B in
Schistosoma japonicum [Schistosoma japonicum]
Length = 312
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 45/121 (37%), Positives = 66/121 (54%), Gaps = 1/121 (0%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ GI W + G+VTGG++ ++TGCQP FP C H + + + C+ P+C
Sbjct: 161 CNGGIPGMAWDYWKDEGIVTGGSNETHTGCQPYPFPECIHHSTSINHSSCEVKYYSTPEC 220
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ C D Y + DKY K Y+V + I +EI+ NGPV A Y+Y D +YK+G
Sbjct: 221 YQTCQPD-YAIQYENDKYYGKSSYYVTSDEVSIMKEILLNGPVEATFYVYDDFLNYKTGV 279
Query: 122 Y 122
Y
Sbjct: 280 Y 280
>gi|268555420|ref|XP_002635699.1| Hypothetical protein CBG22436 [Caenorhabditis briggsae]
Length = 317
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 72/242 (29%), Positives = 96/242 (39%), Gaps = 70/242 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C + W +K+G+VTGG + +GC+P F PC T SE P+C
Sbjct: 145 CKGASPLQAFRWWNKKGVVTGG-DYRGSGCKPYPFAPCTALPCTKSE---------TPRC 194
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + + +DKY Y V +VA IQ EI
Sbjct: 195 SLNC-QPAYSKAYSKDKYFGTPAYIVGMDVAAIQTEIT---------------------- 231
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
NGPV A +Y D Y+SGVY + ++V VKI+GWG +NG PYW +
Sbjct: 232 --NGPVEAAFIVYDDFNHYRSGVYR-HVAGKLVGGHAVKIIGWGIQNGAPYWLMAN---- 284
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
WG PYW G+ G K+LRG +E IES +
Sbjct: 285 ---------------SWG-----PYW----------GENGFFKMLRGVDECGIESTIVAG 314
Query: 242 LP 243
P
Sbjct: 315 KP 316
>gi|201023315|ref|NP_001128400.1| cathepsin B-16D2 precursor [Acyrthosiphon pisum]
Length = 340
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 73/249 (29%), Positives = 100/249 (40%), Gaps = 69/249 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G W KRGLVTGG + S GC+P PPC + +E P+
Sbjct: 157 CNGGYPIKAWERFKKRGLVTGGDYQSGEGCEPYRVPPCPY----DAEGHNTCAGKPRESN 212
Query: 62 HTRCTNDNYGRGF--FQDKYRFKR--YYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
H RCT YG F + +R+ R YY IQ+++M GP+ A+ +Y D
Sbjct: 213 H-RCTRMCYGNQDLDFDEDHRYTRDSYYLT---YGSIQKDVMTYGPIEASFDVYDDF--- 265
Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
P SYKSGVY S +A + VK++GWGEE G PY
Sbjct: 266 --------P------------SYKSGVYVKSENATYLGGHAVKLIGWGEEYGVPY----- 300
Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
W +V+++ +GD G KI RG NE I++
Sbjct: 301 -----------------------------WLMVNSWNADWGDNGLFKIRRGTNECGIDNS 331
Query: 238 VNGALPKDN 246
+P N
Sbjct: 332 TTAGVPVTN 340
>gi|204022081|dbj|BAG71138.1| cathepsin B-S1 [Tuberaphis takenouchii]
Length = 332
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 61/242 (25%), Positives = 99/242 (40%), Gaps = 64/242 (26%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + +G+ TGG + S GC P PPC + P +
Sbjct: 154 CEGGYPIKAWQYFRTQGVPTGGDYDSKEGCAPYKIPPCFDQKGKNT-----CAGKPLERN 208
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
H +C YG Q +Y+ K Y +N ++Q+++K GP+ A+ L+ D
Sbjct: 209 H-QCPKTCYGSTTVQKRYKVKNEYVLNSP-NTMEQDLIKYGPIEASFNLFDD-------- 258
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+ +YKSG+Y + A+ ++ ++KI+GWG+ENG PYW V ++
Sbjct: 259 ---------------LSAYKSGIYQKTPKAKFLSGHSIKIIGWGKENGVPYWLAVNSWS- 302
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+W G++GT +I++GRNE IE
Sbjct: 303 -----------------------KFW----------GEQGTFRIIKGRNECGIERSATAG 329
Query: 242 LP 243
+P
Sbjct: 330 IP 331
>gi|308504375|ref|XP_003114371.1| CRE-CPR-1 protein [Caenorhabditis remanei]
gi|308261756|gb|EFP05709.1| CRE-CPR-1 protein [Caenorhabditis remanei]
Length = 366
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 69/237 (29%), Positives = 94/237 (39%), Gaps = 69/237 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W +G+VTGG +H GC+P PC N PE KT P C
Sbjct: 192 CEGGYPIQALRWWDSKGVVTGGDYH-GAGCKPYPIAPCTSGNC----PESKT-----PSC 241
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + +DK+ Y V +VA IQ EIM NGPV A +Y D
Sbjct: 242 SLSC-QSGYTTAYAKDKHFGTSAYAVARKVASIQTEIMTNGPVEAAFTVYED-------- 292
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+ YKSGVY +A + +A +KI+GWG E+G P
Sbjct: 293 ---------------FYKYKSGVYKHTAGKALGGHA-IKIIGWGTESGSP---------- 326
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
YW + +++G +G+ G +I RG ++ IES V
Sbjct: 327 ------------------------YWLVANSWGNSWGESGFFRIFRGDDQCGIESAV 359
>gi|328718094|ref|XP_003246386.1| PREDICTED: cathepsin B [Acyrthosiphon pisum]
Length = 340
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 73/249 (29%), Positives = 100/249 (40%), Gaps = 69/249 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G W KRGLVTGG + S GC+P PPC + +E P+
Sbjct: 157 CNGGYPIKAWERFKKRGLVTGGDYQSGEGCEPYRVPPCPY----DAEGHNTCAGKPRESN 212
Query: 62 HTRCTNDNYGRGF--FQDKYRFKR--YYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
H RCT YG F + +R+ R YY IQ+++M GP+ A+ +Y D
Sbjct: 213 H-RCTRMCYGNQDLDFDEDHRYTRDSYYLT---YGSIQKDVMTYGPIEASFDVYDDF--- 265
Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
P SYKSGVY S +A + VK++GWGEE G PY
Sbjct: 266 --------P------------SYKSGVYVKSENATYLGGHAVKLIGWGEEYGVPY----- 300
Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
W +V+++ +GD G KI RG NE I++
Sbjct: 301 -----------------------------WLMVNSWNADWGDNGLFKIRRGTNECGIDNS 331
Query: 238 VNGALPKDN 246
+P N
Sbjct: 332 TTAGVPVTN 340
>gi|312271213|gb|ADQ57304.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
Length = 347
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 67/243 (27%), Positives = 94/243 (38%), Gaps = 59/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W + G+VTG + S +GC+P +PPC H +C P C
Sbjct: 164 CDGGDPYAAWSYWVSNGIVTGSNYTSKSGCKPYPYPPCEHHIPEHHYKKCPKDIYPTNTC 223
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C D Y + DK+ Y V +VA IQ+EIM NGPV +Y D Y SG
Sbjct: 224 EYKC-QDGYSISYNSDKHYGASVYAVAQDVASIQKEIMTNGPVEVAFDVYEDFEHYSSG- 281
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ + +G Y + VK++GWG EN
Sbjct: 282 ---------------IYKHTTGDY--------LGGHAVKMLGWGTEN------------- 305
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
G YW +++ +G+ G +ILRG +E IES V
Sbjct: 306 ---------------------GTDYWICANSWNSDWGENGFFRILRGVDECQIESSVVAG 344
Query: 242 LPK 244
PK
Sbjct: 345 EPK 347
>gi|341904369|gb|EGT60202.1| hypothetical protein CAEBREN_08101 [Caenorhabditis brenneri]
Length = 330
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 70/237 (29%), Positives = 95/237 (40%), Gaps = 69/237 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W +G+VTGG +H GC+P PC + S PE KT P C
Sbjct: 156 CEGGYPIQALRWWDSKGVVTGGDYH-GAGCKPYPIAPCT----SGSCPESKT-----PAC 205
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + +DK+ Y V +VA IQ EIM NGPV A +Y D
Sbjct: 206 SLSC-QSGYTTAYAKDKHFGTSAYAVAKKVASIQTEIMTNGPVEAAFTVYED-------- 256
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+ YKSGVY +A + +A +KI+GWG E+G PY
Sbjct: 257 ---------------FYKYKSGVYKHTAGKALGGHA-IKIIGWGTESGSPY--------- 291
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
W + +++G +G+ G KI RG ++ IES V
Sbjct: 292 -------------------------WLVANSWGTSWGESGFFKIFRGDDQCGIESAV 323
>gi|294916952|ref|XP_002778399.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239886773|gb|EER10194.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 228
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 66/224 (29%), Positives = 93/224 (41%), Gaps = 66/224 (29%)
Query: 13 WVHKRGLVTGGAHH------SNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPKCHTRC 65
++ GLVTGG + ++ GC P FP CNH S+ P C + P C T C
Sbjct: 50 FMKNHGLVTGGEYKPPEKLGNDDGCWPYPFPKCNHVPGLESKYPRCAQVRD-LPACATTC 108
Query: 66 TNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNG 125
N YG +D +R K + + I+QEI + NG
Sbjct: 109 PNKAYGTSMQKDTHRAKSWGRLPIGPEKIKQEI-----------------------FDNG 145
Query: 126 PVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASA 185
PV A M LY D YKSGVY V +
Sbjct: 146 PVAAMMTLYEDFRYYKSGVY-----------------------------------VHKTG 170
Query: 186 EIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 229
+++A T+KLIGWG E+G+ YW ++ + E++GD G IK+ G+
Sbjct: 171 QLLAAHTLKLIGWGVESGQEYWLAMNAWNEEWGDHGMIKLAVGK 214
>gi|204022073|dbj|BAG71134.1| cathepsin B-S1 [Tuberaphis taiwana]
Length = 334
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 63/233 (27%), Positives = 100/233 (42%), Gaps = 64/233 (27%)
Query: 11 WVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNY 70
W + +G+ TGG + + GC P PPC + + C P + H +C Y
Sbjct: 163 WKYFRTQGVTTGGDYGTKEGCMPYKVPPCYNKQ---GKNTCG--GQPMERNH-QCPKTCY 216
Query: 71 GRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVAN 130
G+ Q++Y+ K Y +N + I+++IM GPV A+ D+
Sbjct: 217 GKTTVQNRYKTKSEYVINS-IKTIERDIMTYGPVEASF----DV---------------- 255
Query: 131 MYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAY 190
Y D+ +YKSG+Y + A+ ++KI+GWG++NG PYW V ++
Sbjct: 256 ---YDDLSAYKSGIYRKTPKAKYQGGHSIKIIGWGQQNGTPYWLAVNSWS---------- 302
Query: 191 ATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
+W G+ GT KI++GRNE IE V +P
Sbjct: 303 --------------KFW----------GEHGTFKIIKGRNECGIERAVTAGIP 331
>gi|325302580|dbj|BAJ83490.1| cathepsin B-like peptidase [Echinococcus multilocularis]
Length = 351
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 63/244 (25%), Positives = 103/244 (42%), Gaps = 61/244 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W + GLV+GG + + C+ PPC H + + P C+ A P PKC
Sbjct: 169 CNGGFPSQAWNFWKHEGLVSGGLYGTKGVCRAYEIPPCEH-HVNGTRPPCEGDA-PTPKC 226
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C + Y + +DK+ + Y V+ I+ E++ +GPV A+ +Y+D +YKSG
Sbjct: 227 KNVC-QEEYKVPYKKDKHYAVKVYSVHSNEDAIKHELITHGPVEADFEVYADFPTYKSGV 285
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + S ++ +K++GWGEE+G P
Sbjct: 286 YQH------------------------VSGALLGGHAIKLMGWGEEDGVP---------- 311
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW +++ +G+ G KILRG+N IES +
Sbjct: 312 ------------------------YWLCANSWNTDWGEGGFFKILRGKNHCGIESDIVAG 347
Query: 242 LPKD 245
+P++
Sbjct: 348 IPQN 351
>gi|298370749|gb|ADI80349.1| cathepsin B [Litopenaeus vannamei]
Length = 331
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 67/246 (27%), Positives = 101/246 (41%), Gaps = 63/246 (25%)
Query: 2 CSSGISSSTW-VWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 60
C+ G + + WVH G+V+GG+ +S GCQP PC H + P+C + PK
Sbjct: 148 CNGGFPGAAFKYWVHS-GIVSGGSFNSTQGCQPYEIAPCEH-HVPGPRPKC-SEGGGTPK 204
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C C Y + D + + Y + + I+ EIMKNGPV +Y D YKSG
Sbjct: 205 CAKTCEK-GYIVDYESDLHHGGKAYSIMKDEDQIKYEIMKNGPVEGAFTVYVDFLHYKSG 263
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
++ ++ G+ + ++++GWGEENG P
Sbjct: 264 ----------------VYQHRHGL--------PLGGHAIRVLGWGEENGTP--------- 290
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
YW +++ +GD G KILRG + IES ++
Sbjct: 291 -------------------------YWLCANSWNTDWGDNGLFKILRGSDHCGIESEISA 325
Query: 241 ALPKDN 246
LPK N
Sbjct: 326 GLPKLN 331
>gi|340380685|ref|XP_003388852.1| PREDICTED: cathepsin B-like [Amphimedon queenslandica]
Length = 341
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 70/247 (28%), Positives = 103/247 (41%), Gaps = 65/247 (26%)
Query: 2 CSSGISSSTWV-WVHK---RGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATP 57
C G ++ W W K G+VTGG + SN GCQP + P C+H E + +TP
Sbjct: 153 CDGGYPAAAWRHWADKLLYEGIVTGGQYDSNAGCQPYTIPKCDHHEPGPYENCSGSQSTP 212
Query: 58 QPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
C C + +Y + + DK+ K Y ++ +V+ IQ EIM NGPV +Y+D +Y
Sbjct: 213 S--CKRSCIS-SYDKSYRSDKHYGKNSYSISSDVSSIQTEIMTNGPVEGAFSVYADFPTY 269
Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
SG Y + + + +KI+GWG ENG PYW +
Sbjct: 270 TSGVYQH------------------------TTGSFLGGHAIKILGWGTENGVPYWLVAN 305
Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
+ P W GD G KI+RG++E IES
Sbjct: 306 SW------------------------NPSW----------GDSGFFKIIRGKDECGIESS 331
Query: 238 VNGALPK 244
+ +P+
Sbjct: 332 IVAGMPE 338
>gi|341888224|gb|EGT44159.1| hypothetical protein CAEBREN_15022 [Caenorhabditis brenneri]
Length = 332
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 60/245 (24%), Positives = 96/245 (39%), Gaps = 60/245 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W + K GL TGG++ + GC+P S PC + P C P P C
Sbjct: 144 CGGGNAFKAWQYWGKHGLPTGGSYETQFGCKPYSIAPCGKTVGNVTYPACTNTTLPTPSC 203
Query: 62 HTRCTNDN-YGRGFFQDK-YRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
+CT+ N Y +D+ Y + + +IQ ++M NGP+ +Y D Y +
Sbjct: 204 EKKCTSKNGYPVDIDKDRHYGASSVDQLPNRQIEIQSDVMLNGPIETTFEVYDDFLQYTT 263
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
G Y V + + +V+I+GWG G P
Sbjct: 264 GIY------------------------VHLTGNKQGHLSVRILGWGMYEGVP-------- 291
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
YW + +++G+++G+ GT + LRG NE +E+
Sbjct: 292 --------------------------YWLLANSWGKEWGENGTFRALRGTNECGLEANCV 325
Query: 240 GALPK 244
A+PK
Sbjct: 326 SAMPK 330
>gi|341878049|gb|EGT33984.1| CBN-CPR-1 protein [Caenorhabditis brenneri]
Length = 330
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 70/237 (29%), Positives = 95/237 (40%), Gaps = 69/237 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W +G+VTGG +H GC+P PC + S PE KT P C
Sbjct: 156 CEGGYPIQALRWWDSKGVVTGGDYH-GAGCKPYPIAPCT----SGSCPESKT-----PAC 205
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + +DK+ Y V +VA IQ EIM NGPV A +Y D
Sbjct: 206 SLSC-QPGYTTAYAKDKHFGTSAYAVAKKVASIQTEIMTNGPVEAAFTVYED-------- 256
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+ YKSGVY +A + +A +KI+GWG E+G PY
Sbjct: 257 ---------------FYKYKSGVYKHTAGKALGGHA-IKIIGWGTESGSPY--------- 291
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
W + +++G +G+ G KI RG ++ IES V
Sbjct: 292 -------------------------WLVANSWGTSWGESGFFKIFRGDDQCGIESAV 323
>gi|157058739|gb|ABV03127.1| cathepsin B-2744 [Acyrthosiphon pisum]
Length = 260
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 60/178 (33%), Positives = 78/178 (43%), Gaps = 34/178 (19%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPK 60
C G + W +G+VTGG SN GCQP PC+H Y S C +L Q
Sbjct: 96 CDGGSAFKAWELTMNKGIVTGGNFDSNEGCQPYKNRPCDH--YGDSRLTNCSSLRRTQMT 153
Query: 61 -CHTRCTNDNYGRGFFQDKYRFKRYY---WVNDEVADIQQEIMKNGPVVANMYLYSDIFS 116
C +C N NY + D ++ Y W N V IQQEIM GPV A MY+Y +
Sbjct: 154 VCRKKCVNKNYKVKYEDDLHKTSIVYMTSWTN--VKQIQQEIMTYGPVTAFMYVYENFMG 211
Query: 117 YKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWG-EENGRPYW 173
YK G Y S + E++ Y VK++GWG + +G YW
Sbjct: 212 YKEGIYK------------------------STTGELIGYHHVKLIGWGVDGDGTEYW 245
>gi|170028912|ref|XP_001842338.1| oryzain gamma chain [Culex quinquefasciatus]
gi|167879388|gb|EDS42771.1| oryzain gamma chain [Culex quinquefasciatus]
Length = 333
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 69/243 (28%), Positives = 96/243 (39%), Gaps = 62/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W ++GLV+GG S+ GC+P + PC H P CK T PKC
Sbjct: 152 CDGGAPGAGWKHWIEKGLVSGGPFGSDQGCRPYTIEPCVHVENGAQSP-CKDSIT--PKC 208
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C Y + +DK K Y + ++ I++EI NGPV A ++ D SYK G
Sbjct: 209 IKKCL-PGYNVPYAKDKSFGKSTYSIANDERQIRKEIFTNGPVEATFTVFDDFASYKHG- 266
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ + SG + V+I+GWG EN
Sbjct: 267 ---------------IYQHTSG--------NLAGEHAVRILGWGVEN------------- 290
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
G YW +++ +GD G KILRG N IES +
Sbjct: 291 ---------------------GTKYWLAANSWNSDWGDNGYFKILRGSNHVDIESAIVAG 329
Query: 242 LPK 244
LPK
Sbjct: 330 LPK 332
>gi|56755295|gb|AAW25827.1| SJCHGC06356 protein [Schistosoma japonicum]
Length = 279
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 64/243 (26%), Positives = 100/243 (41%), Gaps = 59/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G V+ G+VTGG++ +GCQP P C++ + + +C P+C
Sbjct: 95 CFHGSEVEVLVYWITYGIVTGGSYEDQSGCQPYPLPKCSY-HPESRFLDCNNNTFEFPQC 153
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C D Y + + DK+ +R Y V DIQ+EI+ NGPV+A+
Sbjct: 154 TNEC-QDGYNKTYDDDKFYGERIYNVYGTQEDIQKEILMNGPVIAS-------------- 198
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+ + +D YKSGVY + + + + T++I+GWG E P
Sbjct: 199 ---------ISVNTDFLVYKSGVYLPTPRSRNLGWITLRIIGWGYEGKIP---------- 239
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW +++ E++G G +KI RG IES V
Sbjct: 240 ------------------------YWLCANSWNEEWGANGYVKIQRGVQAGYIESYVRAP 275
Query: 242 LPK 244
+PK
Sbjct: 276 IPK 278
>gi|204022088|dbj|BAG71141.1| cathepsin B-N2 [Tuberaphis styraci]
Length = 334
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 72/245 (29%), Positives = 99/245 (40%), Gaps = 67/245 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
CS G W K GLVTGG + S GCQP PC Y + K P K
Sbjct: 154 CSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYKVSPCPLDEYGNNTCSGK----PAEKN 209
Query: 62 HTRCTNDNYGR---GFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
H RCT YG F +D + + Y++ IQ +D+ +Y
Sbjct: 210 H-RCTQMCYGNQNLDFKEDHHYTRDAYYLT--YGTIQ----------------NDVLAY- 249
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
GP+ A+ +Y D SYKSGVY +A + VK++GWGEE G PY
Sbjct: 250 ------GPIEASFEVYDDFPSYKSGVYTKMENATYLGGHAVKLIGWGEEYGVPY------ 297
Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
W +V+++ +Q+GD+G KI RG NE ++
Sbjct: 298 ----------------------------WLLVNSWNDQWGDQGLFKIRRGTNECGTDNST 329
Query: 239 NGALP 243
G +P
Sbjct: 330 TGGVP 334
>gi|239938574|gb|ACS36086.1| cysteine proteinase [Haemonchus contortus]
Length = 253
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 72/237 (30%), Positives = 100/237 (42%), Gaps = 60/237 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + + K+G VTGG + + +GC+P F PC H T EC AT PKC
Sbjct: 72 CNGGWPIQAFNYFSKQGAVTGGDYKATSGCRPYPFHPCGHHGKDTYYGECPNEAT-TPKC 130
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C + + +D+ K Y V + IQ+EIMKNGPVV +Y D FSY
Sbjct: 131 VRKCQKSYK-KSYKKDRSIGKDAYEVPNSEKAIQREIMKNGPVVGAFTVYED-FSY---- 184
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
YK G+Y +A +A +KI+GWG+ENG PY
Sbjct: 185 ------------------YKKGIYKHTAGKARGGHA-IKIIGWGKENGVPY--------- 216
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
W I +++ +G+ G +ILRG N IE V
Sbjct: 217 -------------------------WLIANSWHNDWGENGYFRILRGSNHCGIEENV 248
>gi|201023369|ref|NP_001128426.1| cathepsin B-3483 [Acyrthosiphon pisum]
gi|328712086|ref|XP_003244726.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
Length = 355
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 63/250 (25%), Positives = 103/250 (41%), Gaps = 69/250 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPE--------CKT 53
CS G +++ W ++ K+G+VTGG + SN GCQP PCN A+ T ++P C
Sbjct: 165 CSGGYTAAAWRYILKKGIVTGGDYGSNEGCQPWLVQPCN-ASTTAADPSSVLGPHGVCGG 223
Query: 54 LATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSD 113
PKC C N + + D + K+ + + A ++ + K+GP V M +Y D
Sbjct: 224 DPATTPKCDLSCYNARHEGKYLDDIIKAKKVFTFDGCSA--RKNLRKHGPYVVTMRVYED 281
Query: 114 IFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYW 173
+YKSG Y + + + + +V+++GWG E G
Sbjct: 282 FLAYKSGVYHH------------------------VTGDYLGLLSVRMIGWGLEGG---- 313
Query: 174 TIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 233
+ +W + +++G +GDKG KI R NE
Sbjct: 314 ------------------------------QAFWLLANSWGTSWGDKGFFKIRRFVNECW 343
Query: 234 IESLVNGALP 243
IE+ +P
Sbjct: 344 IENFRYAGVP 353
>gi|146217390|gb|ABQ10737.1| cathepsin B [Penaeus monodon]
Length = 331
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 66/246 (26%), Positives = 100/246 (40%), Gaps = 63/246 (25%)
Query: 2 CSSGISSSTW-VWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 60
C+ G + + WVH G+V+GG+ +S GCQP PC H + + P+C PK
Sbjct: 148 CNGGFPGAAFKYWVHS-GIVSGGSFNSTQGCQPYEIAPCEH-HVSGPRPKCSE-GGGTPK 204
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C C Y + D + + Y + + I+ EIM NGPV +Y D YKSG
Sbjct: 205 CAKTCEK-GYIVDYESDLHHGGKAYSIMKDEDQIKYEIMNNGPVEGAFTVYVDFLHYKSG 263
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
++ ++ G+ + ++++GWGEENG P
Sbjct: 264 ----------------VYQHRHGL--------PLGGHAIRVLGWGEENGTP--------- 290
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
YW +++ +GD G KILRG + IES ++
Sbjct: 291 -------------------------YWLCANSWNTDWGDNGLFKILRGSDHCGIESEISA 325
Query: 241 ALPKDN 246
LPK N
Sbjct: 326 GLPKVN 331
>gi|38373697|gb|AAR19103.1| cathepsin B [Uronema marinum]
Length = 350
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 76/251 (30%), Positives = 102/251 (40%), Gaps = 70/251 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAH-----HSNTGCQPVSFPPCNHANYTTSEPECKTLAT 56
C+ G ++ W + K GLV+G + +S T CQP SFPPC+H + C L
Sbjct: 158 CNGGYTAGAWNYYVKTGLVSGNLYTDDNQNSKTECQPYSFPPCSH-HVQGEYQACTDL-- 214
Query: 57 PQ---PKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSD 113
PQ PKC+T C + + QD ++ Y V I+ EI + G A+ +YSD
Sbjct: 215 PQFNTPKCYTECNSQYTQNSYEQDLHKGVSSYSVPKSEEQIKAEIYQYGSTTASFNVYSD 274
Query: 114 IFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYW 173
+Y SG Y N SG Y + +K++GWG ENG PYW
Sbjct: 275 FLTYSSGVYQN----------------TSGSY--------MGGHAIKMLGWGVENGTPYW 310
Query: 174 TIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 233
+ S WGE G KILRG NE
Sbjct: 311 LCANSWNSS---------------WGE-------------------NGFFKILRGSNECG 336
Query: 234 IES-LVNGALP 243
IES +V G +P
Sbjct: 337 IESGMVAGFVP 347
>gi|268570495|ref|XP_002648548.1| Hypothetical protein CBG24861 [Caenorhabditis briggsae]
Length = 323
Score = 87.8 bits (216), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 68/232 (29%), Positives = 93/232 (40%), Gaps = 71/232 (30%)
Query: 13 WVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGR 72
W + RG+VTGG +GC+P F PC S PE KT P C C Y
Sbjct: 162 WWNSRGVVTGG-DFRGSGCRPYPFAPC------ISCPEEKT-----PTCSLSC-QFGYST 208
Query: 73 GFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMY 132
+ +DK Y V VA IQ EIM NGPVV +Y D++ YKSG Y +
Sbjct: 209 AYAKDKRFGVSAYAVARNVAAIQTEIMTNGPVVGAFTMYEDMYKYKSGVYRH-------- 260
Query: 133 LYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYAT 192
+ ++ +KI+GWG +NG PY
Sbjct: 261 ----------------TAGRLLGGHAIKIIGWGTQNGIPY-------------------- 284
Query: 193 VKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
W I +++G +G+ G +K+ RG NE IE V +P+
Sbjct: 285 --------------WLIANSWGANWGENGFLKMRRGVNECGIERAVVAGMPR 322
>gi|118429529|gb|ABK91812.1| cathepsin B precursor [Clonorchis sinensis]
Length = 342
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 67/242 (27%), Positives = 94/242 (38%), Gaps = 61/242 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G S W G+VTGG+ TGC+ FP C H P C P P+C
Sbjct: 155 CRGGYSPIAWDLWKTHGIVTGGSKEKPTGCRSYPFPSCEHRG-KGQYPPCPHQLYPTPEC 213
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
RC D + +DK R Y V + +EIM GPV A +++Y
Sbjct: 214 IKRC--DTKEIDYEKDKTRANISYNVYPAEQAVMKEIMLRGPVGAILHVYE--------- 262
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
D+ YKSGVY + + ++I+GWGEE+G P
Sbjct: 263 --------------DLLDYKSGVYFHVWGGHLGEHG-IRILGWGEEDGVP---------- 297
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW + +++ E +G+KG +++LR RNE I V
Sbjct: 298 ------------------------YWLVANSWNEDWGEKGYMRVLRWRNECGIVDQVTAG 333
Query: 242 LP 243
LP
Sbjct: 334 LP 335
>gi|52630925|gb|AAU84926.1| putative cathepsin B-N [Toxoptera citricida]
Length = 340
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 73/249 (29%), Positives = 99/249 (39%), Gaps = 69/249 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G W K GLVTGG + S GC+P PPC + E T A +
Sbjct: 157 CNGGYPIKAWEHFKKHGLVTGGDYKSGEGCEPYRVPPCPY-----DESGNNTCAGKPMEA 211
Query: 62 HTRCTNDNYGRGF--FQDKYRFKR--YYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
+ RCT YG F + +R+ R YY IQ+ D+ +Y
Sbjct: 212 NHRCTRMCYGDQDLDFDEDHRYTRDSYYLT---YGSIQK----------------DVLTY 252
Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
GPV A+ +Y D SYKSGVY S +A + K++GWGEE G PY
Sbjct: 253 -------GPVEASFDVYDDFPSYKSGVYIRSENASYLGGHAAKLIGWGEEYGVPY----- 300
Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
W +V+++ +GD G KI RG NE I++
Sbjct: 301 -----------------------------WLMVNSWNADWGDNGLFKIQRGTNECGIDNS 331
Query: 238 VNGALPKDN 246
G +P N
Sbjct: 332 TTGGVPITN 340
>gi|48762491|dbj|BAD23815.1| cathepsin B-S1 [Tuberaphis coreana]
Length = 334
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 64/242 (26%), Positives = 98/242 (40%), Gaps = 64/242 (26%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + +G+ TGG + + GC P PPC + + P +
Sbjct: 154 CGGGYPIKAWKYFRTQGVTTGGDYDTKEGCMPYKVPPCYNKQGKNT-----CGGQPMERN 208
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
H +C YG+ Q++Y+ K Y +N + I+Q D+ +Y
Sbjct: 209 H-QCPKTCYGKTTVQNRYKTKSEYSINS-IKTIEQ----------------DLKTY---- 246
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
GPV A+ +Y D YKSG+Y + A+ ++KI+GWG+ENG YW V ++
Sbjct: 247 ---GPVEASFDVYDDFSVYKSGIYRKTPKAKYEGRHSIKIIGWGQENGTTYWLAVNSWS- 302
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+W G+ GT KI++GRNE IE V
Sbjct: 303 -----------------------KFW----------GEHGTFKIIKGRNECGIERAVTAG 329
Query: 242 LP 243
+P
Sbjct: 330 IP 331
>gi|3929817|emb|CAA77181.1| cathepsin B [Mus musculus]
Length = 194
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 54/172 (31%), Positives = 84/172 (48%), Gaps = 27/172 (15%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W + K+GLV+GG + S+ GC P + PPC H + P T P+C
Sbjct: 46 CNGGYPSGAWNFWTKKGLVSGGVYDSHIGCLPYTIPPCEHHVNGSRPPMHGEGDT--PRC 103
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ C Y + +DK+ Y V++ V +I EI KNGPV ++SD +YKSG
Sbjct: 104 NKSC-EAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNGPVEGAFTVFSDFLTYKSG- 161
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYW 173
++ +++G +++ ++I+GWG ENG PYW
Sbjct: 162 ---------------VYKHEAG--------DMMGGHAIRILGWGVENGVPYW 190
>gi|161343871|tpg|DAA06116.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 276
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 73/247 (29%), Positives = 100/247 (40%), Gaps = 71/247 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPC---NHANYTTSEPECKTLATPQ 58
CS G W K GLVTGG + S GC+P PPC + N T S P
Sbjct: 94 CSGGYPIRAWKRYKKHGLVTGGNYKSGEGCEPYRVPPCPNDDQGNNTCS-------GQPM 146
Query: 59 PKCHTRCTNDNYGRGF--FQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFS 116
K H RCT YG F + +R+ R D + I K D+ +
Sbjct: 147 EKNH-RCTRMCYGDQDLDFDEDHRYTR-----DHYYLTYRGIQK------------DVIN 188
Query: 117 YKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
Y GP+ A+ +Y D SYKSG+Y S +A + +VK++GWGEE G Y
Sbjct: 189 Y-------GPIEASFDVYDDFPSYKSGIYVKSENASYLGGHSVKLIGWGEEYGVLY---- 237
Query: 177 RVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
W +V+++ +GDKG KI RG NE +++
Sbjct: 238 ------------------------------WLMVNSWNADWGDKGLFKIRRGTNECGVDN 267
Query: 237 LVNGALP 243
G +P
Sbjct: 268 STTGGVP 274
>gi|299471123|emb|CBN78981.1| cathepsin B-like proteinase [Ectocarpus siliculosus]
Length = 557
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 60/242 (24%), Positives = 100/242 (41%), Gaps = 61/242 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHH---SNTGCQPVSFPPCNH--ANYTTSEPECKTLAT 56
C+ G S W W K G+VTGG + + T C+P F PC H + P C
Sbjct: 367 CNGGQPGSAWKWFTKTGVVTGGDYADIGTGTTCKPYEFMPCAHHVDPGASGYPACPDGEY 426
Query: 57 PQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFS 116
P P+C + C+ N+ G + + + R + + +IQ+++MK G V A ++SD +
Sbjct: 427 PTPECLSECSETNFSGGSYGEDKKMAREAYSLAGIENIQRDMMKYGSVTAAFSVFSDFLT 486
Query: 117 YKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
Y G +++++SG + + VK++GWG +
Sbjct: 487 YSGG----------------VYTHESGSF--------MGGHAVKMIGWGTD--------- 513
Query: 177 RVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
E +G YW I +++ +G+ G +ILRG NE IE
Sbjct: 514 -----------------------EVSGEDYWLIANSWNPSWGEGGLFRILRGVNECGIEG 550
Query: 237 LV 238
+
Sbjct: 551 QI 552
>gi|44965401|gb|AAS49537.1| cathepsin B [Latimeria chalumnae]
Length = 225
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 55/168 (32%), Positives = 76/168 (45%), Gaps = 26/168 (15%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W + + GLV+GG S+ GC+P + PPC H + S P C PKC
Sbjct: 83 CNGGYPSGAWNFWTETGLVSGGLFKSHIGCRPYTIPPCEH-HVNGSRPSCTGEEGDTPKC 141
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C Y +F+DK+ Y V+ ADIQ EI KNGPV +Y D YKSG
Sbjct: 142 VMQC-EAGYTPSYFKDKHFGSTSYAVSSNEADIQIEIYKNGPVEGAFTVYEDFLQYKSGV 200
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENG 169
Y + + + V ++I+GWG E+G
Sbjct: 201 YKH------------------------VTGDAVGGHAIRILGWGVESG 224
>gi|268555786|ref|XP_002635882.1| Hypothetical protein CBG01102 [Caenorhabditis briggsae]
Length = 374
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 58/243 (23%), Positives = 96/243 (39%), Gaps = 59/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G W + K GL TGG++ S GC+P S PC+ + P C P C
Sbjct: 189 CAGGNVFKAWQYWQKHGLPTGGSYESQFGCKPYSISPCDTVIGNITFPGCLNSTVQTPSC 248
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C + Y +D++ + + +IQ ++M NGP+ A M +Y D Y +G
Sbjct: 249 EKKCKS-GYPVELDKDRHYGVSVDQLPNRQIEIQSDVMLNGPISATMEVYDDFLQYTTGI 307
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y V + + +V+I+GWG G P
Sbjct: 308 Y------------------------VHLTGNKQGHLSVRILGWGMYEGVP---------- 333
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW + +++G+Q+G+ GT ++LRG NE +E+
Sbjct: 334 ------------------------YWLLANSWGKQWGENGTFRVLRGVNECGLEANCVSG 369
Query: 242 LPK 244
+P+
Sbjct: 370 MPR 372
>gi|339239305|ref|XP_003381207.1| cathepsin B [Trichinella spiralis]
gi|316975778|gb|EFV59177.1| cathepsin B [Trichinella spiralis]
Length = 343
Score = 87.0 bits (214), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 71/239 (29%), Positives = 100/239 (41%), Gaps = 69/239 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + + ++ G+ TGG + S +GC+P S P P + A P C
Sbjct: 163 CNGGFPLLAFKYWNEIGVPTGGPYGSKSGCKPFSIAP----------PTSSSTAAQTPLC 212
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWV---NDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
+C +D Y R +D+Y + YY + N V IQ+EIM +GPVVA M ++ YK
Sbjct: 213 QLKCISD-YKRKLDKDRYYGESYYLITSSNQPVKTIQREIMDHGPVVAAMEIFESFLYYK 271
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
SG Y + L G++A VK++GWGE+ PYW +V
Sbjct: 272 SGVYSANKRNDDPSL---------GLHA------------VKLIGWGEQKRIPYWLVVN- 309
Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
W +TFGEQ G KI RG NE IE+L
Sbjct: 310 ------------------SWN-----------TTFGEQ----GLFKIRRGTNECGIENL 335
>gi|294939825|ref|XP_002782575.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239894358|gb|EER14370.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 398
Score = 87.0 bits (214), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 67/249 (26%), Positives = 96/249 (38%), Gaps = 68/249 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAH------HSNTGCQPVSFPPCNHANYTTSEPECKTLA 55
C G S W WVH G+ TGG + + GC P FPPC H P C A
Sbjct: 211 CRGGFPYSAWSWVHDEGIATGGDYVPRDNMTEDDGCWPYDFPPCAHFFKDPKYPACPKFA 270
Query: 56 TPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIF 115
+C ++ + +F D RY+ V + KN
Sbjct: 271 RVNLRCVSKLRH--MMVVYFSD-----RYFMVESVPYHFSADDAKNAIRT---------- 313
Query: 116 SYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
+GPV A Y+Y D +YKSGVY ++ + + A+A
Sbjct: 314 --------DGPVSATFYVYEDFLAYKSGVYKHTSGSLLGAHA------------------ 347
Query: 176 VRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
VK+IGWGE+ G YW +V+++ E +GD G KI G + I+
Sbjct: 348 -----------------VKIIGWGEDGGEAYWLVVNSWNEGWGDHGLFKIALG--DCGID 388
Query: 236 SLVNGALPK 244
+ + G PK
Sbjct: 389 NELLGGTPK 397
>gi|76576339|gb|ABA53863.1| cathepsin B-like cysteine protease 1 [Parelaphostrongylus tenuis]
Length = 346
Score = 87.0 bits (214), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 66/243 (27%), Positives = 95/243 (39%), Gaps = 59/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W + G+VTG + + +GC+P +PPC H +C P C
Sbjct: 163 CEGGDTYKAWNYWTTDGIVTGSNYTTKSGCKPYPYPPCEHYIDAGRYKKCPKDLYPTNTC 222
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C DNY + +DK+ Y + + + IQQEIM +GPV +Y D Y SG
Sbjct: 223 EYKC-QDNYTISYDEDKHYGAYPYVLVGDASFIQQEIMNHGPVEVTFDVYEDFEHYSSG- 280
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ + +G Y V VK++GWG EN
Sbjct: 281 ---------------IYKHMAGEY--------VGVHAVKMLGWGTEN------------- 304
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
G YW +++ +G+ G +ILRG NE IES V
Sbjct: 305 ---------------------GVDYWICANSWNSDWGENGFFRILRGENECGIESNVVAG 343
Query: 242 LPK 244
PK
Sbjct: 344 KPK 346
>gi|145481831|ref|XP_001426938.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124394016|emb|CAK59540.1| unnamed protein product [Paramecium tetraurelia]
Length = 332
Score = 87.0 bits (214), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 68/247 (27%), Positives = 101/247 (40%), Gaps = 66/247 (26%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHAN----YTTSEPECKTLATP 57
C G W ++ G+VTGG ++ + C+P SFPPC+H N Y+ E + L
Sbjct: 145 CDGGYPYGAWKYLRVDGIVTGGTYNDFSLCKPYSFPPCSHGNDSGKYSKCENDFFMLTEV 204
Query: 58 QPKCHTRCTNDNYGRGFFQDKYRFKR--YYWVNDEVADIQQEIMKNGPVVANMYLYSDIF 115
P C +C + + R + DK R + Y + D+ I+ EI NGPV A ++ D
Sbjct: 205 TPSCTKKC-HPQFSRTYDVDKIRSRENPYKLIKDQ-EQIKNEIYLNGPVQAVFTVFDDFL 262
Query: 116 SYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
+YKSG +Y + G +A VKI+GWG ENG P
Sbjct: 263 NYKSG------------VYQQTTGQRRGKHA------------VKIIGWGTENGVP---- 294
Query: 176 VRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
YW ++++ + +G G KILRG N IE
Sbjct: 295 ------------------------------YWEAINSWNDGWGINGKFKILRGFNHLDIE 324
Query: 236 SLVNGAL 242
V ++
Sbjct: 325 GEVYASI 331
>gi|308512693|gb|ADO33000.1| cathepsin B [Biston betularia]
Length = 217
Score = 87.0 bits (214), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 67/242 (27%), Positives = 95/242 (39%), Gaps = 61/242 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G+ + W + GLV+GG ++S+ GC P PPC H P C T PKC
Sbjct: 32 CNGGMPTLAWEYWKHMGLVSGGNYNSSQGCSPYVIPPCEHHVPGNRLP-CNG-DTKTPKC 89
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C N Y + +DK K Y V I+ E+ KNGPV A +Y+D+ +YKSG
Sbjct: 90 SKTCEN-GYNVLYKKDKRYGKHVYAVRGGEDHIKAELFKNGPVEAAFTVYADLLAYKSGV 148
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + + +KI+GWG ENG Y
Sbjct: 149 YKH------------------------VEGDALGGHAIKIIGWGVENGNKY--------- 175
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
W I +++ +G+ G KILRG + IES +
Sbjct: 176 -------------------------WLIANSWNTDWGNNGFFKILRGEDHCGIESSIVAG 210
Query: 242 LP 243
P
Sbjct: 211 EP 212
>gi|268561878|ref|XP_002638441.1| Hypothetical protein CBG18657 [Caenorhabditis briggsae]
Length = 372
Score = 87.0 bits (214), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 72/250 (28%), Positives = 104/250 (41%), Gaps = 67/250 (26%)
Query: 18 GLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQD 77
G+VTGG ++ TGCQP +FPPC+ + S P C Q KC T G+ +
Sbjct: 162 GVVTGG-DYNGTGCQPYTFPPCSSCEASKSTPSC------QKKCQT---------GYLEA 205
Query: 78 KYRFKRYYWVNDEVADIQQE-------IMKNGP--------VVANMYLYSDIFSYKSGKY 122
Y+ + + ++ + E I+K G +N I + ++ Y
Sbjct: 206 TYKNDKRFENEEQDSSYMSENFYQVLIILKGGKSAYRLSTTTSSNKISTDAIITIQTEIY 265
Query: 123 GNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVS 182
NGPV + ++ D + YKSGVY S ++ VKI+GWG E
Sbjct: 266 NNGPVEVSYRVFEDFYQYKSGVYHY-VSGKLTGAHAVKIIGWGTE--------------- 309
Query: 183 ASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 242
N YW + +++G FG+KG KI RG NE IE V L
Sbjct: 310 -------------------NKVDYWLVANSWGTDFGEKGFFKIRRGTNECGIEENVVAGL 350
Query: 243 PKDNYGVEFG 252
K N G +FG
Sbjct: 351 AK-NGGTKFG 359
>gi|306992171|gb|ADN19566.1| cathepsin B-like proteinase [Spodoptera frugiperda]
Length = 341
Score = 87.0 bits (214), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 69/245 (28%), Positives = 102/245 (41%), Gaps = 67/245 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNH---ANYTTSEPECKTLATPQ 58
C+ G+ + W + GLV+GG+++S GC+P PPC H N + KT
Sbjct: 156 CNGGMPTLAWEYWKHFGLVSGGSYNSGQGCRPYEIPPCEHHVPGNRVPCNGDSKT----- 210
Query: 59 PKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
PKCH C +Y + +DK K Y V+ + I+ E+ KNGPV +YSD
Sbjct: 211 PKCHKTC-EASYSVDYHKDKRYGKHVYSVSSKEDHIKAELFKNGPVEGAFTVYSD----- 264
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
+ +YK+GVY + + +A +KI+GWG ENG Y R+
Sbjct: 265 ------------------LLNYKNGVYKHTVGNALGGHA-IKILGWGVENGNKY----RL 301
Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
I +++ +GD G KILRG + IES +
Sbjct: 302 ------------------------------IANSWNSDWGDNGFFKILRGEDHCGIESSI 331
Query: 239 NGALP 243
P
Sbjct: 332 VAGEP 336
>gi|347972088|ref|XP_313836.5| AGAP004534-PA [Anopheles gambiae str. PEST]
gi|333469166|gb|EAA09182.5| AGAP004534-PA [Anopheles gambiae str. PEST]
Length = 334
Score = 87.0 bits (214), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 70/240 (29%), Positives = 96/240 (40%), Gaps = 69/240 (28%)
Query: 4 SGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHT 63
G S WV GLV+GGA++S GC+P F PC + P PKC
Sbjct: 162 DGTSFQYWV---DAGLVSGGAYNSTDGCKPYPFKPCEY-------PFNDCHVEISPKCTH 211
Query: 64 RCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYG 123
C D R + +DK K Y V + I+ EIM NGPV A +Y D+ YKSG
Sbjct: 212 HC-RDGVDRHYSKDKLFGKVAYSVPRDERAIRYEIMTNGPVEAGFDVYEDVLLYKSG--- 267
Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
+Y ++ + G +A V+I+GWG + G PY
Sbjct: 268 ---------VYRHVYGEQIGKHA------------VRIIGWGRDGGIPY----------- 295
Query: 184 SAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
W I +++G+ +GD G K +RG N IES + LP
Sbjct: 296 -----------------------WLIANSYGDDWGDHGYFKFVRGSNHLGIESKIITGLP 332
>gi|1181143|emb|CAA93278.1| cysteine proteinase [Haemonchus contortus]
Length = 341
Score = 86.7 bits (213), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 61/175 (34%), Positives = 83/175 (47%), Gaps = 26/175 (14%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + + K+G VTGG + + +GC+P F PC H T EC AT PKC
Sbjct: 160 CNGGWPIQAFNYFSKQGAVTGGDYKATSGCRPYPFHPCGHHGKDTYYGECPNEAT-TPKC 218
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C + + +D+ K Y V + IQ+EIMKNGPVV +Y D FSY
Sbjct: 219 VRKCQKSYK-KSYKKDRSIGKDAYEVPNSEKAIQREIMKNGPVVGAFTVYED-FSY---- 272
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
YK G+Y +A +A +KI+GWG+E G PYW I
Sbjct: 273 ------------------YKKGIYKHTAGKARGGHA-IKIIGWGKEGGVPYWLIA 308
>gi|335347291|gb|AEH42093.1| cysteine proteinase 6 [Haemonchus contortus]
Length = 346
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 65/222 (29%), Positives = 96/222 (43%), Gaps = 60/222 (27%)
Query: 17 RGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQ 76
+G+VTGG + S TGC+P F PC H T EC + P+C +C Y + +
Sbjct: 178 QGVVTGGDYGSKTGCRPYPFHPCGHHGNETYYGECPKEES-TPECVKQCQK-GYKNSYRR 235
Query: 77 DKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSD 136
DK + YY V + V IQ+EIM++GPVV++ +Y D FSY
Sbjct: 236 DKTWGEDYYEVENSVKAIQREIMRSGPVVSSFTVYDD-FSY------------------- 275
Query: 137 IFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLI 196
Y G+Y +A ++A +K+I
Sbjct: 276 ---YVKGIYKHTAGKARGSHA-----------------------------------IKII 297
Query: 197 GWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
GWG E PYW I +++ +G+KG +++RG N IE V
Sbjct: 298 GWGTEKNVPYWIIANSWHNDWGEKGFFRMVRGTNHCGIEEDV 339
>gi|3087799|emb|CAA93276.1| cysteine proteinase [Haemonchus contortus]
Length = 350
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 56/177 (31%), Positives = 81/177 (45%), Gaps = 32/177 (18%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCN-HANYTTSEPECKTLATP--Q 58
C G W WV + G+VTGG + C+P +F PC H P + +TP +
Sbjct: 164 CEGGYDHLAWEWVQRFGVVTGGPYQQKGVCRPYAFHPCGLHHGRRYDCPWDHSFSTPACK 223
Query: 59 PKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
P C YG+ + +DK+ K Y ++++ IQ+E+MKNGPV A Y D YK
Sbjct: 224 PYCQF-----GYGKRYEKDKFFVKSTYILDNDEKVIQREMMKNGPVQAAFITYEDFSPYK 278
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
G +Y + + G +A VK++GWG ENG YWT+
Sbjct: 279 GG------------IYVHVKGRERGAHA------------VKLIGWGVENGTKYWTV 311
>gi|358331547|dbj|GAA35870.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 508
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 68/241 (28%), Positives = 95/241 (39%), Gaps = 61/241 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W + G+VTGG+ +GC+ FP C H + P C P P+C
Sbjct: 155 CRGGYPAVAWDYWKTHGIVTGGSKEDPSGCRSYPFPKCEH-HVQGHYPPCPRELYPTPEC 213
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C D G+ +DK R Y + I +EIM GPV A IF+
Sbjct: 214 VQQC--DTPDVGYLEDKTRANMSYNIYASEISIMKEIMLRGPVEA-------IFT----- 259
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+Y D Y SGVY + A + +A V+I+GWGE PY
Sbjct: 260 -----------MYEDFLRYSSGVYFHALGAPMSGHA-VRILGWGELGNVPY--------- 298
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
W I +++ E +G++G +K LRG NE IE V
Sbjct: 299 -------------------------WLIANSWNEDWGEEGYMKFLRGYNECGIEDDVTAV 333
Query: 242 L 242
L
Sbjct: 334 L 334
>gi|294914603|ref|XP_002778294.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239886508|gb|EER10089.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 365
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 62/252 (24%), Positives = 101/252 (40%), Gaps = 67/252 (26%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGA------HHSNTGCQPVSFPPCNHANYTTSEPECKTLA 55
CS G ++W ++H G+V+GG + GC P +FP C H + C
Sbjct: 174 CSGGNPITSWTFLHTNGIVSGGGFVPEKNMKAADGCWPYNFPKCAHHQKESDYKPCAKEI 233
Query: 56 TPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVN-DEVADIQQEIMKNGPVVANMYLYSDI 114
P C + C N YG F +D++ + + + I++EIM NGP A +Y D
Sbjct: 234 YDTPSCSSSCPNAKYGTAFDKDRHYTESLFPSRFGSTSSIKKEIMTNGPTSAAFSVYEDF 293
Query: 115 FSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWT 174
SYKSG ++ + SG + + V+I+GWG E G Y
Sbjct: 294 LSYKSG----------------VYKHTSGGF--------LGGHAVEIIGWGTEKGVDY-- 327
Query: 175 IVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 234
W +++++ E++GD GT KI++G + I
Sbjct: 328 --------------------------------WLVMNSWNEEWGDHGTFKIVQG--DCGI 353
Query: 235 ESLVNGALPKDN 246
+ ++ P N
Sbjct: 354 DDMILAGTPAIN 365
>gi|239793607|dbj|BAH72912.1| ACYPI000019 [Acyrthosiphon pisum]
Length = 188
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 67/237 (28%), Positives = 88/237 (37%), Gaps = 70/237 (29%)
Query: 17 RGLVTGGAHHSNT-------GCQPVSFPPCNHANYTTSEPECKTLATPQ-PKCHTRCTND 68
RG++TG GCQP + PPC N C T + P C +C N
Sbjct: 11 RGIITGDMGLCQVEIITPTQGCQPYTIPPCKLMNEKPPGHSCTTYHREETPICEKKCYNP 70
Query: 69 NYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVV 128
NY F D Y+ K Y K P +A DIF NGP+
Sbjct: 71 NYYTSFRTDIYKGKYY---------------KLSPYMAM----KDIFD-------NGPIT 104
Query: 129 ANMYLYSDIFSYKSGVYAVSASAEIVAYA--TVKIVGWGEENGRPYWTIVRVYAVSASAE 186
Y+Y D+ YKSGVY ++ + +VKI GWGEENG P
Sbjct: 105 TQFYMYRDLVDYKSGVYQYDEQSDFDFFTVHSVKIFGWGEENGVP--------------- 149
Query: 187 IVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
YW + ++FG +G GT KI RG + + + LP
Sbjct: 150 -------------------YWLVANSFGTDWGYNGTFKISRGNDGCFFQEKMYAGLP 187
>gi|161343867|tpg|DAA06114.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 340
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 71/249 (28%), Positives = 96/249 (38%), Gaps = 69/249 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G W GLVTGG + S GC+P PPC H +E P K
Sbjct: 157 CNGGYPIKAWERFKSHGLVTGGDYKSGEGCEPYRVPPCRHH----AEGNNSCSDKPMEKN 212
Query: 62 HTRCTNDNYGRGFFQ----DKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
H RCT YG +Y YY IQ+++M GP+ A+ D+
Sbjct: 213 H-RCTRMCYGDQDLDFDDDHRYTRDSYYLT---YGSIQKDVMNYGPIEASF----DV--- 261
Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
Y D SYKSGVY S +A + VK++GWGEE+G PY
Sbjct: 262 ----------------YDDFPSYKSGVYIRSDNASYLGGHAVKLIGWGEESGVPY----- 300
Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
W +V+++ +GDKG KI RG NE +++
Sbjct: 301 -----------------------------WLMVNSWNTDWGDKGLFKIQRGTNECGVDNS 331
Query: 238 VNGALPKDN 246
+P N
Sbjct: 332 TTAGVPVTN 340
>gi|358341561|dbj|GAA37330.2| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 347
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 65/235 (27%), Positives = 91/235 (38%), Gaps = 61/235 (25%)
Query: 11 WVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPKCHTRCTNDN 69
W + G+VTGG + + C P FPPC H SE P C P+C + C
Sbjct: 165 WDYWRDNGIVTGGEYKDSHTCLPYPFPPCRHHGAKGSEYPPCPEKMYSTPQCVSEC-QKG 223
Query: 70 YGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVA 129
Y + DK R Y + V IQ+EI GPV A M +Y+D +Y G Y +
Sbjct: 224 YATKYEDDKIRASTSYNLYRSVTTIQKEIWMRGPVEATMNVYTDFANYAGGVYKH----- 278
Query: 130 NMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVRVYAVSASAEIV 188
+ E++ ++++GWG EE+G PYW +
Sbjct: 279 -------------------TTGELLGGHAIRLLGWGVEEDGTPYWLAANSW--------- 310
Query: 189 AYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
P W G+KG +ILRG + IES V+ LP
Sbjct: 311 ---------------NPSW----------GEKGFFRILRGSDHCGIESDVSAGLP 340
>gi|320166129|gb|EFW43028.1| cathepsin B [Capsaspora owczarzaki ATCC 30864]
Length = 332
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 65/243 (26%), Positives = 94/243 (38%), Gaps = 60/243 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G +W + +G+VTG +++ C+P FP C H + P+C + PKC
Sbjct: 148 CDGGQLGPSWDYYKNKGIVTGYLYNTTGYCKPYDFPACAHHEASPDYPDCPSTDYSTPKC 207
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C + D + + Y V A IQ EI+ +GPV A +YSD +Y+SG
Sbjct: 208 TKSCVAGYTANTYTADLHYGQSSYSVGRTDAAIQTEILNHGPVEAAFTVYSDFPTYRSGV 267
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + S ++ + IVGWG E+G PYW +
Sbjct: 268 YKH------------------------TSGSVLGGHAISIVGWGTESGSPYWLV------ 297
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+ + P W GD G KILRG + I + V G
Sbjct: 298 ------------------KNSWNPSW----------GDGGFFKILRG--DCGINNDVVGG 327
Query: 242 LPK 244
LPK
Sbjct: 328 LPK 330
>gi|161343875|tpg|DAA06118.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 210
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 56/182 (30%), Positives = 80/182 (43%), Gaps = 30/182 (16%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W + + G+VTGG + S GCQP S P T + + T P C
Sbjct: 45 CDGGSPEAAWYFFMRHGIVTGGDYESGDGCQPYSIYPRGKGRNTCIDDDIDT-----PDC 99
Query: 62 HTR-CTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
R CTN NY +G+ D + Y ++ DI +I KNGPV A Y+Y+D YKSG
Sbjct: 100 SIRTCTNSNYTKGYRADLHYVDTVYSLSRSEEDIMTDIYKNGPVQAAFYVYTDFMYYKSG 159
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
++SY G +I +KI+GWG ++ YW ++
Sbjct: 160 ----------------VYSYTRG--------QIEGGHAIKILGWGVDDNTKYWLCANSWS 195
Query: 181 VS 182
S
Sbjct: 196 RS 197
>gi|358341867|dbj|GAA49438.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 952
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 71/259 (27%), Positives = 105/259 (40%), Gaps = 62/259 (23%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C +G W + G+VTGG+ +GC+ FP C H P C P P+C
Sbjct: 120 CGAGFHPMAWDFWKTHGIVTGGSKEEPSGCRSFPFPKCGHRR-KGRYPPCPRHIYPTPEC 178
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C D + +DK R Y V I +EIM NGPV A+
Sbjct: 179 IKQC--DEPEVNYEKDKTRANISYNVYPSDISIMKEIMLNGPVEAS-------------- 222
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+G +Y+D Y GVY I +A ++I+GWGE++G PY
Sbjct: 223 FG---------IYADFLEYNGGVYFHCWGGPISRHA-IRILGWGEDDGVPY--------- 263
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
W I +++ E +G+KG ++ LRG NE IE V A
Sbjct: 264 -------------------------WLIANSWNEDWGEKGYVRFLRGHNECGIEEEVT-A 297
Query: 242 LPKDNYGVEFGEESGERLS 260
+P D + + ++S R +
Sbjct: 298 VPIDWFLRQMIKQSTLRCT 316
Score = 77.8 bits (190), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 67/276 (24%), Positives = 102/276 (36%), Gaps = 72/276 (26%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G S W + G+VTGG+ TGC+ FP C H P C P P+C
Sbjct: 708 CRGGYSPIAWDFWKTHGIVTGGSKEKPTGCRSYPFPSCEHRG-KGQYPPCPHQLYPTPEC 766
Query: 62 HTRCTNDNYGRGFFQDKYR----------FKRYYWVNDEVADIQQEIMKNGPVVANMYLY 111
RC D + +DK R R+ + + + + + M+
Sbjct: 767 IKRC--DTKEIDYEKDKTRGFDSASSEQLADRHCFHTSNFGEASAQRTLHLTCLNFMHHS 824
Query: 112 SDIFSYKSGK------------------------YGNGPVVANMYLYSDIFSYKSGVYAV 147
D+ S + K GPV A +++Y D+ YKSGVY
Sbjct: 825 IDLLSSRLEKAVLRSTANISYNVYPAEQAVMKEIMLRGPVGAILHVYEDLLDYKSGVYFH 884
Query: 148 SASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYW 207
+ + ++I+GWGEE+G P YW
Sbjct: 885 VWGGHLGEHG-IRILGWGEEDGVP----------------------------------YW 909
Query: 208 TIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
+ +++ E +G+KG +++LR RNE I V LP
Sbjct: 910 LVANSWNEDWGEKGYMRVLRWRNECGIVDQVTAGLP 945
>gi|157092993|gb|ABV22151.1| cysteine proteinase [Perkinsus chesapeaki]
Length = 396
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 58/182 (31%), Positives = 81/182 (44%), Gaps = 35/182 (19%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHS------NTGCQPVSFPPCNHANYTTSEPECKTLA 55
C G S W W+H G+VTGG + + + GC P PPC H +T P+C
Sbjct: 207 CHGGSSLDAWQWLHTTGVVTGGDYSAEKDMTESDGCWPYDIPPCAHYTNSTLYPKCPKTK 266
Query: 56 TPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVAD-IQQEIMKNGPVVANMYLYSDI 114
P C C N Y +D++ + D I++EIM NGPV A+ YL
Sbjct: 267 YDFPTCQESCPNKKYDTPMEKDRHFVEEESLSALRSIDAIKKEIMTNGPVSAS-YL---- 321
Query: 115 FSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWT 174
+Y D +YKSGVY ++ + +A VKI+GWGE+ YW
Sbjct: 322 ------------------VYDDFLTYKSGVYKRTSHNALGGHA-VKIIGWGED----YWL 358
Query: 175 IV 176
+V
Sbjct: 359 VV 360
>gi|226472808|emb|CAX71090.1| cathepsin B [Schistosoma japonicum]
Length = 325
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 53/174 (30%), Positives = 78/174 (44%), Gaps = 27/174 (15%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W++ +G+VTG +++ GCQP FPPC H + P C P C
Sbjct: 164 CNGGFPHSAWLYWKNQGIVTGDLYNTTNGCQPYEFPPCEH-HTLGPLPVCDG-DVETPPC 221
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + DK+ K Y V I +E+M++GPV + +Y+D +YKSG
Sbjct: 222 KRTC-QAGYNVSYENDKWYGKVVYRVKSNQEAIMKELMQHGPVEVDFEVYADFPNYKSGV 280
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
Y + S ++ V+++GWGEEN PYW I
Sbjct: 281 YQH------------------------VSGALLGGHAVRLLGWGEENNVPYWLI 310
>gi|193688336|ref|XP_001945899.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 308
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 67/239 (28%), Positives = 96/239 (40%), Gaps = 76/239 (31%)
Query: 9 STWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTND 68
S W ++ G+V+GG ++SN GCQP FPP AN P+ C +
Sbjct: 141 SIWEYLKSHGVVSGGKYNSNDGCQPFKFPPI--AN------------IPKHLHKHTCDDH 186
Query: 69 NYGR---GFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNG 125
YG + D R + YY + DIQ+E+ GPVV ++ D
Sbjct: 187 CYGNSTINYNHDHVRVRNYYTI--RTRDIQKEVQTYGPVVVR-FMVCD------------ 231
Query: 126 PVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASA 185
D F YKSGVYA S A+ + K++GWG ENG Y
Sbjct: 232 ----------DFFLYKSGVYAKSDKAKGIRTQYAKLIGWGVENGVDY------------- 268
Query: 186 EIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
W +++++G ++G KG KI G N+ +ES V LP+
Sbjct: 269 ---------------------WLVINSWGHEWGQKGLFKIKSGTNQCGVESFVYAGLPE 306
>gi|239938576|gb|ACS36087.1| cysteine proteinase [Haemonchus contortus]
Length = 253
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 71/237 (29%), Positives = 99/237 (41%), Gaps = 60/237 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + + K+G VTGG + + +GC+P F PC H T EC AT PKC
Sbjct: 72 CNGGWPIQAFNYFSKQGAVTGGDYKATSGCRPYPFHPCGHHGKDTYYGECPNEAT-TPKC 130
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C + + +D+ K Y V + IQ+EIMKNGPVV +Y D FSY
Sbjct: 131 VRKCQKSYK-KSYKKDRSIGKDAYEVPNSEKAIQREIMKNGPVVGAFTVYED-FSY---- 184
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
YK G+Y +A +A +KI+GWG+E G PY
Sbjct: 185 ------------------YKKGIYKHTAGKARGGHA-IKIIGWGKEGGVPY--------- 216
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
W I +++ +G+ G +ILRG N IE V
Sbjct: 217 -------------------------WLIANSWHNDWGENGYFRILRGSNHCGIEENV 248
>gi|329669000|gb|AEB96388.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
Length = 232
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 65/236 (27%), Positives = 92/236 (38%), Gaps = 59/236 (25%)
Query: 9 STWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTND 68
+ W + G+VTG + S +GC+P +PPC H +C P C +C D
Sbjct: 56 AAWSYWVSNGIVTGSNYTSKSGCKPYPYPPCEHHIPEHHYKKCPKDIYPTNTCEYKC-QD 114
Query: 69 NYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVV 128
Y + DK+ Y V +VA IQ+EIM NGPV +Y D Y SG
Sbjct: 115 GYSISYNSDKHYGASVYAVAQDVASIQKEIMTNGPVEVAFDVYEDFEHYSSG-------- 166
Query: 129 ANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIV 188
I+ + +G Y + VK++GWG EN
Sbjct: 167 --------IYKHTTGDY--------LGGHAVKMLGWGTEN-------------------- 190
Query: 189 AYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
G YW +++ +G+ G +ILRG +E IES V PK
Sbjct: 191 --------------GTDYWICANSWNSDWGENGFFRILRGVDECEIESGVVAGEPK 232
>gi|161343821|tpg|DAA06091.1| TPA_inf: cathepsin B [Aphis gossypii]
Length = 196
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 68/245 (27%), Positives = 99/245 (40%), Gaps = 67/245 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W GLVTGG + S GC+P PPC + + P K
Sbjct: 13 CHGGYPIRAWKRFKNHGLVTGGDYKSGEGCEPYRVPPCPYDEQGNN----TCAGKPMEKN 68
Query: 62 HTRCTNDNYGRG--FFQDKYRFKR-YYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
H RCT YG F + +R+ R YY++ IQ+++M GP+ A+ D+
Sbjct: 69 H-RCTRICYGDQELDFDEDHRYTRDYYYLT--YGSIQKDVMTYGPIEASF----DV---- 117
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
YSD SYKSG+Y + +A + VK++GWGE+ G PY
Sbjct: 118 ---------------YSDFPSYKSGIYERTENATYLGGHAVKLIGWGEQYGIPY------ 156
Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
W +V+++ E +GD G KI RG NE +++
Sbjct: 157 ----------------------------WLMVNSWNEDWGDNGLFKIRRGTNECGVDNST 188
Query: 239 NGALP 243
+P
Sbjct: 189 TAGVP 193
>gi|300952942|gb|ADK46902.1| cathepsin B [Radopholus similis]
Length = 356
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 66/236 (27%), Positives = 97/236 (41%), Gaps = 69/236 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + + G+VTG + +N GC+P F P Y+T P+C
Sbjct: 178 CNGGYPDEAFEHYAQSGVVTGSGNSANQGCKPYPFLPHTTVEYST------------PEC 225
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVN-DEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
+C N Y + + QDK+ Y V + DIQ EIM NGPV ANM +Y D YKSG
Sbjct: 226 SKKCENYQYKKAYKQDKHFGMSVYNVQFSDPVDIQYEIMNNGPVEANMIVYYDFMFYKSG 285
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
+Y +F + G +A V+IVGWG
Sbjct: 286 ------------VYQTVFPWPLGGHA------------VRIVGWG--------------- 306
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
V ++ PYW + +++ +G+ G +I RG +E+ IES
Sbjct: 307 VDGPTKV-----------------PYWLVANSWNTDWGEDGYFRIRRGTDESYIES 345
>gi|239792046|dbj|BAH72408.1| ACYPI000003 [Acyrthosiphon pisum]
Length = 182
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 65/227 (28%), Positives = 94/227 (41%), Gaps = 60/227 (26%)
Query: 17 RGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQ 76
+G+V+GG + SN GC P PC H T P CK P C +C + Y + Q
Sbjct: 15 KGIVSGGPYGSNMGCIPYEIAPCEHHVNGTRGP-CKE-GGKTPTCVKKC-EEGYKVPYAQ 71
Query: 77 DKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSD 136
D + K Y + ++V I+QEI NGPV +Y D
Sbjct: 72 DLHHGKSAYSIRNDVDQIRQEIYTNGPVEGAFTVY-----------------------ED 108
Query: 137 IFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLI 196
+Y++GVY A + +A ++I+GWG +NG EI
Sbjct: 109 FIAYRAGVYKHVAGKALGGHA-IRILGWGVQNG----------------EI--------- 142
Query: 197 GWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
PYW + +++ +G G KILRG +E IE +N LP
Sbjct: 143 --------PYWLVANSWNTDWGSDGFFKILRGSDECGIEGQINAGLP 181
>gi|144952804|gb|ABP04056.1| cathepsin B-4 [Clonorchis sinensis]
Length = 347
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 65/235 (27%), Positives = 91/235 (38%), Gaps = 61/235 (25%)
Query: 11 WVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPKCHTRCTNDN 69
W + G+VTGG + + C P FPPC H SE P C P+C + C
Sbjct: 165 WDYWRDNGIVTGGEYKDSHTCLPYPFPPCRHHGAKGSEYPPCPEKMYSTPQCVSEC-QKG 223
Query: 70 YGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVA 129
Y + DK R Y + V IQ+EI GPV A M +Y+D +Y G Y +
Sbjct: 224 YATKYEDDKIRASTSYNLYRSVTAIQKEIWMRGPVEATMNVYTDFANYAGGVYKH----- 278
Query: 130 NMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVRVYAVSASAEIV 188
+ E++ ++++GWG EE+G PYW +
Sbjct: 279 -------------------TTGELLGGHAIRLLGWGVEEDGTPYWLAANSW--------- 310
Query: 189 AYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
P W G+KG +ILRG + IES V+ LP
Sbjct: 311 ---------------NPSW----------GEKGFFRILRGSDHCGIESDVSAGLP 340
>gi|269146930|gb|ACZ28411.1| cathepsin b [Simulium nigrimanum]
Length = 168
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 61/224 (27%), Positives = 87/224 (38%), Gaps = 60/224 (26%)
Query: 21 TGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYR 80
+GG SN GC P PC H + + P C PKC C +Y + QDK
Sbjct: 5 SGGPFGSNQGCHPYKIAPCEH-HVNGTRPACNGEEGKTPKCIKHC-QASYTVAYEQDKSY 62
Query: 81 FKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSY 140
+ Y V VA IQ+EIM NGPV +Y D+ YK G Y +
Sbjct: 63 GAKSYSVPHHVAQIQKEIMTNGPVEGAFTVYEDLVQYKDGVYQH---------------- 106
Query: 141 KSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGE 200
+ +++ ++I+GWG EN PY
Sbjct: 107 --------VTGKMLGGHAIRILGWGVENDVPY---------------------------- 130
Query: 201 ENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
W I +++ +G+ G KILRG + IES ++ +PK
Sbjct: 131 ------WLIANSWNTDWGNNGFFKILRGSDHCGIESQISAGIPK 168
>gi|329668994|gb|AEB96385.1| cathepsin B-like cysteine protease 2 [Angiostrongylus cantonensis]
Length = 316
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 69/237 (29%), Positives = 96/237 (40%), Gaps = 60/237 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G S W + + G+VTGG + + C+P PPC T C T P C
Sbjct: 135 CDGGWPVSAWQYFVETGVVTGGLYGTKDACRPYEIPPCGIHKNETFYSNC-TQEIDTPDC 193
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
T C Y + DK K Y V++ V IQ+EIM GPVVA +Y D F YK+G
Sbjct: 194 KTTC-QAGYPISYDDDKTYGKTAYSVSNSVHAIQKEIMTYGPVVAAFTVYDDFFHYKTG- 251
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ + SG AE +A V+I+GWG++ G P
Sbjct: 252 ---------------IYKHVSG-------AEAGGHA-VRILGWGQQGGVP---------- 278
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
YW + +++ +G+ G +ILRG +E IE V
Sbjct: 279 ------------------------YWLVANSWNTDWGENGYFRILRGSDECGIEDGV 311
>gi|294929081|ref|XP_002779258.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239888294|gb|EER11053.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 288
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 62/223 (27%), Positives = 89/223 (39%), Gaps = 76/223 (34%)
Query: 13 WVHKRGLVTG------GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCT 66
++ G+VTG G S GC P FP C HA Y++ P C T+CT
Sbjct: 123 FLKNHGIVTGDEFKPAGQLSSADGCWPYPFPKCKHAGYSS------------PACQTKCT 170
Query: 67 NDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGP 126
N Y QD +R K + + +I+QEI + NGP
Sbjct: 171 NKAYKTSLQQDLHRAKSFGRLPAIPQNIKQEI-----------------------FTNGP 207
Query: 127 VVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAE 186
V+ + +Y DI YK+GVY V +
Sbjct: 208 VIGMLSIYEDIRVYKAGVY-----------------------------------VHQTGS 232
Query: 187 IVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 229
T+K+IGWG E+G+ YW V+++ E++GD G IK+ GR
Sbjct: 233 FQGIHTLKIIGWGVESGQDYWLAVNSWNEEWGDHGMIKLAVGR 275
>gi|209863086|ref|NP_001119616.2| cathepsin B-1674 precursor [Acyrthosiphon pisum]
gi|239799412|dbj|BAH70627.1| ACYPI000012 [Acyrthosiphon pisum]
Length = 334
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 63/234 (26%), Positives = 96/234 (41%), Gaps = 69/234 (29%)
Query: 11 WVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNY 70
W ++ GLV+GG +++N GCQP PP N T E C RC +N
Sbjct: 168 WEYLKNHGLVSGGKYNTNNGCQPSKIPPI--GNLPTGLYE--------NTCEKRCYGNN- 216
Query: 71 GRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVAN 130
+ QD + K +Y + E DIQ+E+ GPV ++ +
Sbjct: 217 TINYNQDHVKIKNHYDI--EYEDIQREVQNYGPVSMAFRVFDN----------------- 257
Query: 131 MYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAY 190
D F YKSGVY + ++E + + K++GWG ENG Y
Sbjct: 258 -----DFFLYKSGVYEKTTNSEFIQWQYAKLIGWGVENGVDY------------------ 294
Query: 191 ATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
W +V+++G ++G G KI RG +E IE+ V+ P+
Sbjct: 295 ----------------WLLVNSWGYEWGQNGLFKIKRGTDECNIETFVHAGEPQ 332
>gi|161343873|tpg|DAA06117.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 254
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 47/122 (38%), Positives = 62/122 (50%), Gaps = 4/122 (3%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQ-PK 60
C G +W + + G V+GG ++SN GCQP + PPC N C T + P
Sbjct: 132 CDGGSLFESWDFYRRHGFVSGGEYNSNQGCQPYTIPPCKLINEKPPGHSCTTFNREETPT 191
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C +C N NY F D YR K YY V+ +A +EI NGP+ Y+Y D+ YKSG
Sbjct: 192 CEKKCNNPNYYTSFRADIYRGK-YYKVSPYMA--MKEIFDNGPITTQFYMYRDLVDYKSG 248
Query: 121 KY 122
Y
Sbjct: 249 VY 250
>gi|183988834|gb|ACC66066.1| cathepsin B [Samia ricini]
Length = 283
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 67/233 (28%), Positives = 95/233 (40%), Gaps = 61/233 (26%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G+ + W + GLV+GG ++S+ GC+P PPC H P C T PKC
Sbjct: 112 CNGGMPTLAWEYWKHVGLVSGGNYNSSQGCRPYEIPPCEHHVPGNRMP-CNG-DTKTPKC 169
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C + +Y F +DK K Y V+ I+ E+ KNGPV A +Y
Sbjct: 170 QKNCES-SYNVPFKKDKRYGKHVYSVSGHEDHIKAELFKNGPVEAAFTVY---------- 218
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
SD+ SYK+GVY + + +A +KI+GWG EN Y
Sbjct: 219 -------------SDLLSYKNGVYKHTEGNALGGHA-IKIIGWGVENNNKY--------- 255
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 234
W I +++ +GD G KILRG + I
Sbjct: 256 -------------------------WLIANSWNSDWGDNGFFKILRGEDHCGI 283
>gi|161343849|tpg|DAA06105.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 334
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 64/234 (27%), Positives = 95/234 (40%), Gaps = 69/234 (29%)
Query: 11 WVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNY 70
W ++ GLV+GG +++N GCQP PP N T E C RC +N
Sbjct: 168 WEYLKNHGLVSGGKYNTNNGCQPSKIPPI--GNLPTGLYE--------NTCEKRCYGNN- 216
Query: 71 GRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVAN 130
+ QD + K +Y + E DIQ+E+ GPV ++ +
Sbjct: 217 TINYNQDHVKIKNHYDI--EYEDIQREVQNYGPVSMAFKVFDN----------------- 257
Query: 131 MYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAY 190
D F YKSGVY + ++E + + K++GWG ENG YW +V
Sbjct: 258 -----DFFLYKSGVYEKTTNSEFIQWQYAKLIGWGVENGVDYWLLVN------------- 299
Query: 191 ATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
+W G ++G G KI RG +E IE+ V+ P+
Sbjct: 300 ---------------FW------GYEWGQNGLFKIKRGTDECNIETFVHAGEPQ 332
>gi|107921791|gb|ABF85679.1| cathepsin B2 [Fasciola hepatica]
Length = 278
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 51/172 (29%), Positives = 73/172 (42%), Gaps = 25/172 (14%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + + G+VTGG + TGCQP F C+H + C P+P C
Sbjct: 132 CRGGYPPKAWDYWMREGIVTGGTWENRTGCQPWMFTKCDHVGDSRKYSRCPHYTYPKPPC 191
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + + QDK+ Y V + + I QEIMKNGPV ++ D Y+SG
Sbjct: 192 ARACQT-GYNKTYEQDKFYGNSSYNVGEHESYIMQEIMKNGPVEVTFAIFQDFGVYRSGI 250
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYW 173
Y + + + + V+++GWG ENG YW
Sbjct: 251 YHH------------------------VAGKFIGRHAVRMIGWGVENGVNYW 278
>gi|91089435|ref|XP_966663.1| PREDICTED: similar to AGAP004533-PA [Tribolium castaneum]
gi|270012706|gb|EFA09154.1| cathepsin B precursor [Tribolium castaneum]
Length = 320
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 69/243 (28%), Positives = 97/243 (39%), Gaps = 71/243 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G SS W + G+V+GG +++ GC P S A ++ P C +
Sbjct: 148 CGGGYSSRAWQYWVTDGIVSGGDFNTSQGCHPYSV----QAFRDSTTPNCSSF------- 196
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
CTN Y + + +DK R Y + + IQ EIM +GPV A+ +Y D +SY++G
Sbjct: 197 ---CTNPKYQKNYSEDKRYGARSYRIAKNIEQIQAEIMTSGPVQASYVVYDDFYSYQNG- 252
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+Y + SG + +VKI+GWG ENG YW +
Sbjct: 253 -----------VYQHVLGNVSGRH------------SVKILGWGRENGTDYWLVAN---- 285
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
WG + GR G G K LRG N IES + G
Sbjct: 286 ---------------SWGRDWGR--------LG------GFFKFLRGENHCDIESNILGG 316
Query: 242 LPK 244
PK
Sbjct: 317 DPK 319
>gi|326427908|gb|EGD73478.1| cathepsin B [Salpingoeca sp. ATCC 50818]
Length = 341
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 68/243 (27%), Positives = 96/243 (39%), Gaps = 62/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G ++ W + +G+VTGG + SN GCQP S C H +P C + P P C
Sbjct: 160 CNGGYPAAAWEYWKNQGIVTGGQYDSNQGCQPYSLAKCEHHTTGPYKP-CGDI-VPTPAC 217
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + DK+ Y V V I EIM NGPV A +YSD SYKSG
Sbjct: 218 KRSC-RQGYNVTYPNDKHFGASSYGVRG-VDQIATEIMTNGPVEAAFTVYSDFLSYKSGV 275
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + S + + +KI+GWG ++G
Sbjct: 276 YQH------------------------TSGQPLGGHAIKIIGWGVQDG------------ 299
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW + +++ + +G+ G I +G +E IES V
Sbjct: 300 ----------------------TDYWIVANSWNDSWGNDGFFWIKKGTDECGIESQVVAG 337
Query: 242 LPK 244
LPK
Sbjct: 338 LPK 340
>gi|300176938|emb|CBK25507.2| unnamed protein product [Blastocystis hominis]
Length = 320
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 67/243 (27%), Positives = 99/243 (40%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W W G+ TGG + S C SFP C H P ++ TP+ C
Sbjct: 138 CDGGWLDMAWRWFQSTGVTTGGEYGSKDWCNAYSFPKCEHHAEGKYPPCGESQETPE--C 195
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C + Y + +DK+ F Y+V + I+ E+M NGP+ + ++Y D +YKSG
Sbjct: 196 VKQC-QEGYPVEYEKDKHFFGEAYYVQGGIDAIKTELMTNGPLEVSFFVYEDFLTYKSGI 254
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + VA YL G +A VK+VGWG E+G
Sbjct: 255 YQH---VAGKYL---------GGHA------------VKLVGWGVEDG------------ 278
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW I +++ E +G+ G +I+ G+ E IE G
Sbjct: 279 ----------------------IEYWKIANSWNEDWGENGYFRIVAGKGECGIEVGPIGG 316
Query: 242 LPK 244
+PK
Sbjct: 317 IPK 319
>gi|44968648|gb|AAS49594.1| cathepsin B [Scyliorhinus canicula]
Length = 206
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 56/168 (33%), Positives = 83/168 (49%), Gaps = 27/168 (16%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W + GLV+GG ++S+ GC+P S PC H + S P+C + P+C
Sbjct: 65 CNGGYPSGAWEFWTNDGLVSGGLYYSHIGCRPYSISPCEH-HVNGSRPKC-SGEIETPRC 122
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
RC Y + +DK+ Y + +V +I EI KNGPV A + ++ D YKSG
Sbjct: 123 SRRC-EAGYSPKYSEDKHYGLTSYSIGSDVTEIMTEIYKNGPVEAALEVFKDFLLYKSG- 180
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENG 169
++ +K+G I +A +KI+GWGEENG
Sbjct: 181 ---------------VYQHKTG-------GSIGGHA-IKILGWGEENG 205
>gi|204022075|dbj|BAG71135.1| cathepsin B-S2 [Tuberaphis taiwana]
Length = 334
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 63/233 (27%), Positives = 98/233 (42%), Gaps = 64/233 (27%)
Query: 11 WVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNY 70
W + +G+ TGG + + GC P PPC + + C P + H +C Y
Sbjct: 163 WKYFRTQGVTTGGDYGTKEGCMPYKVPPCYNKQ---GKNTCG--GQPMERNH-QCPKTCY 216
Query: 71 GRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVAN 130
G+ Q++Y+ K Y +N + I+Q D+ +Y GPV A+
Sbjct: 217 GKTTVQNRYKTKSEYVMNS-IKTIEQ----------------DLKTY-------GPVEAS 252
Query: 131 MYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAY 190
+Y D YKSG+Y + A+ ++KI+GWG++NG PYW V ++
Sbjct: 253 FDVYDDFSVYKSGIYRKTPKAKYQGGHSIKIIGWGQQNGTPYWLAVNSWS---------- 302
Query: 191 ATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
+W G+ GT KI++GRNE IE V +P
Sbjct: 303 --------------KFW----------GEHGTFKIIKGRNECGIERAVTAGIP 331
>gi|157058743|gb|ABV03129.1| cathepsin B-2744 [Pterocomma populeum]
Length = 244
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 57/176 (32%), Positives = 77/176 (43%), Gaps = 32/176 (18%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK- 60
C G + W + G+VTGG +SN GCQP PC+H +S C + Q
Sbjct: 96 CHGGSAFKAWEFTMGNGIVTGGNFNSNEGCQPYKNRPCDHYG-DSSMTNCSSFRRTQMSI 154
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRYY---WVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
C +C N NY + D ++ Y W N V IQQEIM GPV A MY+Y + Y
Sbjct: 155 CREKCVNKNYKVKYEDDLHKTSVVYMTSWTN--VTQIQQEIMTYGPVTALMYVYENFMGY 212
Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWG-EENGRPY 172
K G Y S ++V Y VK++GWG +++G Y
Sbjct: 213 KEGIYK------------------------STVGDLVGYHHVKLIGWGVDDDGNEY 244
>gi|118122|sp|P25793.1|CYSP2_HAECO RecName: Full=Cathepsin B-like cysteine proteinase 2; Flags:
Precursor
gi|159165|gb|AAA29171.1| cathepsin B-like cysteine protease [Haemonchus contortus]
Length = 342
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 65/241 (26%), Positives = 95/241 (39%), Gaps = 60/241 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + G+V+GG + + C+P PC H T EC+ A P P C
Sbjct: 156 CEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPYPIHPCGHHGNDTYYGECRGTA-PTPPC 214
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C + + DK K Y V V IQ EI+KNGPVVA+ +Y D YKSG
Sbjct: 215 KRKC-RPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILKNGPVVASFAVYEDFRHYKSG- 272
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ + +G E+ Y VK++GWG E
Sbjct: 273 ---------------IYKHTAG--------ELRGYHAVKMIGWGNE-------------- 295
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
N +W I +++ +G+KG +I+RG N+ IE +
Sbjct: 296 --------------------NNTDFWLIANSWHNDWGEKGYFRIVRGSNDCGIEGTIAAG 335
Query: 242 L 242
+
Sbjct: 336 I 336
>gi|294935195|ref|XP_002781337.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239891887|gb|EER13132.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 317
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 60/245 (24%), Positives = 96/245 (39%), Gaps = 60/245 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G+ + W ++ G+ T G+ + GC P +FP C H + C P C
Sbjct: 133 CKGGMILNAWSFLKTHGIATEGSMSAADGCWPYNFPKCAHHQKKSKYEPCSKKLYDTPSC 192
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
RC N+ YG +D++ + + +I++EIM NGP A +Y D SYKSG
Sbjct: 193 LDRCPNEKYGIPLDKDRHFTAHSPDLFEGTDNIKKEIMTNGPTSATFSVYEDFVSYKSGV 252
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + ++ +V+I+GWG E G Y
Sbjct: 253 YKH------------------------TNGTLMGIHSVEIIGWGTEKGVDY--------- 279
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
W +++++ E +GD GT KI +G + I+ V G+
Sbjct: 280 -------------------------WLVMNSWNEGWGDHGTFKIAQG--DCGIDDAVLGS 312
Query: 242 LPKDN 246
P N
Sbjct: 313 PPAMN 317
>gi|254746338|emb|CAX16634.1| putative C1A cysteine protease precursor [Manduca sexta]
Length = 337
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 66/242 (27%), Positives = 93/242 (38%), Gaps = 61/242 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ GI S W + G+V+GG ++S GC+P PPC H P C T PKC
Sbjct: 152 CNGGIPSLAWEYWKHFGIVSGGNYNSTQGCRPYEIPPCEHHVPGNRMP-CSG-DTKTPKC 209
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C N Y + +DK K Y V+ I+ E+ KNGPV +Y+D+ +YKSG
Sbjct: 210 QKNCEN-GYNVMYKKDKRYGKHVYSVSAGEDHIRAELYKNGPVEGAFTVYADLLAYKSGV 268
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + + +KI+GWG EN
Sbjct: 269 YKH------------------------IQGDALGGHAIKILGWGVEN------------- 291
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW + +++ +GD G KILRG N IE +
Sbjct: 292 ---------------------DNKYWLVANSWNTDWGDNGFFKILRGENHCGIEGSIIAG 330
Query: 242 LP 243
P
Sbjct: 331 EP 332
>gi|32566081|ref|NP_506002.2| Protein CPR-1 [Caenorhabditis elegans]
gi|32172429|sp|P25807.2|CPR1_CAEEL RecName: Full=Gut-specific cysteine proteinase; Flags: Precursor
gi|1395200|gb|AAB88058.1| gut-specific cysteine protease-1 [Caenorhabditis elegans]
gi|24817276|emb|CAB01410.2| Protein CPR-1 [Caenorhabditis elegans]
Length = 329
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 72/237 (30%), Positives = 91/237 (38%), Gaps = 69/237 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W +G+VTGG +H GC+P PC N PE KT P C
Sbjct: 155 CEGGYPIQALRWWDSKGVVTGGDYH-GAGCKPYPIAPCTSGNC----PESKT-----PSC 204
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + +DK+ Y V A IQ EI NGPV A +Y D + YKSG
Sbjct: 205 SMSC-QSGYSTAYAKDKHFGVSAYAVPKNAASIQAEIYANGPVEAAFSVYEDFYKYKSGV 263
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + A YL G +A +KI+GWG E+G PYW + + V
Sbjct: 264 YKH---TAGKYL---------GGHA------------IKIIGWGTESGSPYWLVANSWGV 299
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
+ WGE G KI RG ++ IES V
Sbjct: 300 N---------------WGES-------------------GFFKIYRGDDQCGIESAV 322
>gi|294873367|ref|XP_002766594.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
gi|239867622|gb|EEQ99311.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
Length = 244
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 63/252 (25%), Positives = 96/252 (38%), Gaps = 67/252 (26%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAH------HSNTGCQPVSFPPCNHANYTTSEPECKTLA 55
CS G ++W ++H G+V+GG + GC P SFP C H + C
Sbjct: 53 CSGGNPITSWTFLHTNGIVSGGGFVPEKNMKAADGCWPYSFPKCAHHQDGSDYKPCAKEI 112
Query: 56 TPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVN-DEVADIQQEIMKNGPVVANMYLYSDI 114
P C + C N YG F +D++ + + + I++EIM NGP A +Y D
Sbjct: 113 YDTPSCSSSCPNAKYGTAFDKDRHYTESLFPSRFGSTSSIKKEIMTNGPTSAAFSVYEDF 172
Query: 115 FSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWT 174
SYKSG Y + S + V+I+GWG E G YW
Sbjct: 173 LSYKSGVYKH------------------------TSGGFLGGHAVEIIGWGTEKGVDYWL 208
Query: 175 IVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 234
++ ++ E++GD GT KI++G + I
Sbjct: 209 VMN----------------------------------SWNEEWGDHGTFKIVQG--DCGI 232
Query: 235 ESLVNGALPKDN 246
+ + P N
Sbjct: 233 DDTILAGTPAMN 244
>gi|193629592|ref|XP_001944624.1| PREDICTED: cathepsin B-like isoform 4 [Acyrthosiphon pisum]
Length = 331
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 68/239 (28%), Positives = 100/239 (41%), Gaps = 79/239 (33%)
Query: 11 WVWVHKRGLVTGGA-HHSNTGCQPVSFPP-CNHANYTTSEPECKTLATPQPKCHTRCTND 68
W + GLV+GG+ +++N GCQP PP CN P C +
Sbjct: 165 WEYFKTHGLVSGGSIYNTNDGCQPSKIPPVCN---------------LPTKINKRTCVDY 209
Query: 69 NYGRG---FFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNG 125
YG + D + + YY V + DIQ+E+ +Y G
Sbjct: 210 CYGNDTIKYNHDHVKVRYYYHVKPK--DIQKEVQ----------------TY-------G 244
Query: 126 PVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASA 185
PV A + LY DIF +KSGVY ++ +A+ V VK++GWG ENG Y
Sbjct: 245 PVTAALNLYDDIFLHKSGVYTLTKNAKYVRLQYVKLIGWGVENGVDY------------- 291
Query: 186 EIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
W +V+++G ++G G +KI RG+ +ES V A+PK
Sbjct: 292 ---------------------WLLVNSWGNEWGQNGLLKIKRGKYGCAVESFVYAAVPK 329
>gi|294891885|ref|XP_002773787.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239878991|gb|EER05603.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 234
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 60/200 (30%), Positives = 82/200 (41%), Gaps = 60/200 (30%)
Query: 27 SNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYY 85
++ GC P FP CNH S+ P C + P C T C N YG +D +R K +
Sbjct: 38 NDDGCWPYPFPKCNHVPGLESKYPRCAQVRD-LPACATTCPNKAYGTSMQKDTHRAKSWG 96
Query: 86 WVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVY 145
+ I+QEI + NGPV A M LY D YKSGVY
Sbjct: 97 RLPIGPEKIKQEI-----------------------FDNGPVAAMMTLYEDFRFYKSGVY 133
Query: 146 AVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRP 205
V + +++A T+KLIGWG E+G+
Sbjct: 134 -----------------------------------VHKTGQMLAAHTLKLIGWGVESGQE 158
Query: 206 YWTIVSTFGEQFGDKGTIKI 225
YW V+ + E++GD G IK+
Sbjct: 159 YWLAVNAWNEEWGDHGMIKL 178
>gi|193688334|ref|XP_001945855.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 313
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 60/236 (25%), Positives = 92/236 (38%), Gaps = 71/236 (30%)
Query: 9 STWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTND 68
S W ++ G+V+GG ++SN GCQP FPP L Q C C
Sbjct: 147 SIWEYLKSHGVVSGGKYNSNDGCQPFKFPPI-----------ANILTHLQHTCDDHCYG- 194
Query: 69 NYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVV 128
N + D R + YY + Y+ ++ +Y GPV
Sbjct: 195 NTSINYNHDHVRVRNYY------------------TIRTGYIQKEVQTY-------GPVA 229
Query: 129 ANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIV 188
+ D YKSGVY S +A+++ K++GWG ENG Y
Sbjct: 230 VQFKVCDDFLLYKSGVYVKSDNAKVIRTQYAKLIGWGVENGVDY---------------- 273
Query: 189 AYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
W +++++G ++G KG KI RG N+ +ES+V +P+
Sbjct: 274 ------------------WLVINSWGHEWGQKGLFKIKRGTNQCGVESVVYAGVPE 311
>gi|294945206|ref|XP_002784584.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239897729|gb|EER16380.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 298
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 63/237 (26%), Positives = 92/237 (38%), Gaps = 66/237 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNT------GCQPVSFPPCNHA-NYTTSEPECK-T 53
C+ G ++ G+VTG GC P F CNH T P+CK
Sbjct: 108 CNGGRLVEAMSFLRDHGVVTGNDFKPQDQLREADGCWPYPFQKCNHVPTEGTGYPKCKDV 167
Query: 54 LATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSD 113
+ P P C T CTN Y + +D +R K + V ++ I+QEI
Sbjct: 168 VQQPVPPCRTTCTNKAYKKSLEKDVHRAKSWRKVLNDAQSIKQEI--------------- 212
Query: 114 IFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYW 173
+ NGPV + +Y D YKSGVY
Sbjct: 213 --------FDNGPVFSAFEMYKDFRYYKSGVY---------------------------- 236
Query: 174 TIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 230
V + E+ +K+IGWG ++ R YW ++ + E++GD G IK+ G+N
Sbjct: 237 -------VPTTKEVDCLHVIKIIGWGADSVREYWLAMNAWNEEWGDHGLIKMAFGKN 286
>gi|356984175|gb|AET43950.1| cathepsin B, partial [Reishia clavigera]
Length = 209
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 71/243 (29%), Positives = 98/243 (40%), Gaps = 62/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S+ W G+VTGG ++S GCQP C+H +P CK P+C
Sbjct: 28 CNGGYPSAAWEVFDHDGVVTGGQYNSKQGCQPYLIAACDHHVVGKLKP-CKGDGK-TPRC 85
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C Y F DK+ +R Y V+ V DI +E++ GPV A +Y
Sbjct: 86 EKKCEA-GYNVTFKDDKHYGQRSYSVS-SVNDIMEELVTRGPVEAAFTVY---------- 133
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
SD Y SGVY + + + +A VKI+G+G ENG YW + +
Sbjct: 134 -------------SDFLQYHSGVYRHTTGSALGGHA-VKILGYGVENGDKYWLVANSW-- 177
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
P W GD+G KILRG +E IE +
Sbjct: 178 ----------------------NPDW----------GDQGFFKILRGVDECGIEGQIVAG 205
Query: 242 LPK 244
PK
Sbjct: 206 EPK 208
>gi|86279343|gb|ABC88767.1| putative cathepsin B-like proteinase [Tenebrio molitor]
Length = 321
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 66/243 (27%), Positives = 94/243 (38%), Gaps = 73/243 (30%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G S + G+V+GG +SN GC+P YT + P C
Sbjct: 151 CGGGYMMSALDFYINEGIVSGGDVNSNEGCRP----------YTADAHD----QGQTPAC 196
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C N Y + DK+ Y V+ + IQ E+M NGP++ N ++ D ++Y SG
Sbjct: 197 TKSCRN-GYSTSYSADKHYGSNDYVVSSVIDQIQYEVMTNGPIIVNFEVFQDFYNYVSGV 255
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + S E V + VKIVGWG ENG PY
Sbjct: 256 YRH------------------------VSGESVGFHVVKIVGWGVENGVPY--------- 282
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
W I +++G +GD G K+LRG+NE IE+
Sbjct: 283 -------------------------WLIANSWGSSWGDHGFFKMLRGQNECGIENYPYAV 317
Query: 242 LPK 244
+P+
Sbjct: 318 MPR 320
>gi|118118|sp|P19092.1|CYSP1_HAECO RecName: Full=Cathepsin B-like cysteine proteinase 1; Flags:
Precursor
gi|159173|gb|AAA29175.1| cysteine protease (AC-1) [Haemonchus contortus]
Length = 342
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 64/241 (26%), Positives = 95/241 (39%), Gaps = 60/241 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + G+V+GG + + C+P PC H T EC+ A P P C
Sbjct: 156 CEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPYPIHPCGHHGNDTYYGECRGTA-PTPPC 214
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C + + DK K Y V V IQ EI++NGPVVA+ +Y D YKSG
Sbjct: 215 KRKC-RPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILRNGPVVASFAVYEDFRHYKSG- 272
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ + +G E+ Y VK++GWG E
Sbjct: 273 ---------------IYKHTAG--------ELRGYHAVKMIGWGNE-------------- 295
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
N +W I +++ +G+KG +I+RG N+ IE +
Sbjct: 296 --------------------NNTDFWLIANSWHNDWGEKGYFRIIRGTNDCGIEGTIAAG 335
Query: 242 L 242
+
Sbjct: 336 I 336
>gi|341888694|gb|EGT44629.1| hypothetical protein CAEBREN_31940 [Caenorhabditis brenneri]
Length = 374
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 69/234 (29%), Positives = 91/234 (38%), Gaps = 68/234 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G S + G VTGG ++ GC P SF PC + C TP C
Sbjct: 167 CQGGYSIEALRFWKSSGAVTGG-DYNGAGCMPYSFAPCK-------KDSCAQGTTPS--C 216
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
T C + + +DK+ Y + + VA IQ EI NGPV A+ +Y D + YKSG
Sbjct: 217 KTTCQSSYKTAEYTKDKHFGTTAYKITNSVAAIQTEIYHNGPVEASFKVYEDFYKYKSG- 275
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
++ Y SG ++V VKI+GWG ENG Y
Sbjct: 276 ---------------VYQYTSG--------KLVGGHAVKIIGWGTENGVDY--------- 303
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
W I +++G FGD G K+ RG NE IE
Sbjct: 304 -------------------------WLIANSWGTTFGDSGFFKMRRGTNEVGIE 332
>gi|294877489|ref|XP_002768007.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239870145|gb|EER00725.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 344
Score = 84.3 bits (207), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 69/272 (25%), Positives = 106/272 (38%), Gaps = 87/272 (31%)
Query: 2 CSSGISSSTWVWVHKRGLVTGG------AHHSNTGCQPVSFPPCNHANYTTSEPECKTLA 55
C GI+ + W ++ G+VTGG + + GC P SFP C H + C +
Sbjct: 133 CQGGIARAAWSFLKMHGIVTGGDFVPKGSMSAADGCWPYSFPKCAHDQEDSKYEPCPEVR 192
Query: 56 TP--------------------QPKCHTRCTNDNYGRGFFQDKYRFKRYY-WVNDEVADI 94
P P C RC N+ YG +D++ R ++ + +I
Sbjct: 193 VPPLGERHQRGAGASIHQKLYDTPSCLDRCPNEKYGTPRDKDRHFTARALPYLFEGTDNI 252
Query: 95 QQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIV 154
++EIM NGP A+ Y D SYKSG ++ + SG Y +
Sbjct: 253 KKEIMTNGPTSASFSTYEDFSSYKSG----------------VYKHTSGGY--------L 288
Query: 155 AYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFG 214
+V+I+GWG E G Y W +++++
Sbjct: 289 GDHSVEIIGWGTEKGVDY----------------------------------WLVMNSWN 314
Query: 215 EQFGDKGTIKILRGRNEAIIESLVNGALPKDN 246
E +GD GT KI +G + I+ V G+LP N
Sbjct: 315 EGWGDHGTFKIAQG--DCGIDDAVQGSLPAMN 344
>gi|157058767|gb|ABV03141.1| cathepsin B-348 [Sitobion avenae]
Length = 252
Score = 84.3 bits (207), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 56/179 (31%), Positives = 83/179 (46%), Gaps = 28/179 (15%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + +G+V+GG + SN GC P PC H T P CK PKC
Sbjct: 97 CNGGFPGAAWHYWKTKGIVSGGPYGSNMGCIPYEIAPCEHHVNGTRGP-CKE-GGKTPKC 154
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C D Y + QD +R K Y ++++V I+QEI NGPV +Y
Sbjct: 155 VKKC-EDGYKVPYEQDLHRGKSAYSLSNDVDQIRQEIYTNGPVEGAFTVY---------- 203
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGR-PYWTIVRVY 179
D +Y++GVY A + +A ++I+GWG +NG PYW + +
Sbjct: 204 -------------EDFIAYRAGVYKHVAGKALGGHA-IRILGWGVQNGEIPYWLVANSW 248
>gi|56758644|gb|AAW27462.1| unknown [Schistosoma japonicum]
Length = 294
Score = 84.3 bits (207), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 48/121 (39%), Positives = 62/121 (51%), Gaps = 2/121 (1%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + KRG+VTGG+ ++TGCQP FP C H + P C T P+C
Sbjct: 159 CQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEH-HTKGKYPACGTKIYKTPQC 217
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C Y + QDK+ + Y V IQ+EIM NGPV A +Y D +YKSG
Sbjct: 218 KQKCQK-GYKTPYEQDKHYGEESYNVISNEKAIQKEIMMNGPVEAAFDVYEDFLNYKSGI 276
Query: 122 Y 122
Y
Sbjct: 277 Y 277
>gi|328722316|ref|XP_003247542.1| PREDICTED: cathepsin B-like isoform 2 [Acyrthosiphon pisum]
gi|328722318|ref|XP_003247543.1| PREDICTED: cathepsin B-like isoform 3 [Acyrthosiphon pisum]
Length = 276
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 68/239 (28%), Positives = 100/239 (41%), Gaps = 79/239 (33%)
Query: 11 WVWVHKRGLVTGGA-HHSNTGCQPVSFPP-CNHANYTTSEPECKTLATPQPKCHTRCTND 68
W + GLV+GG+ +++N GCQP PP CN P C +
Sbjct: 110 WEYFKTHGLVSGGSIYNTNDGCQPSKIPPVCN---------------LPTKINKRTCVDY 154
Query: 69 NYGRG---FFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNG 125
YG + D + + YY V + DIQ+E+ +Y G
Sbjct: 155 CYGNDTIKYNHDHVKVRYYYHVKPK--DIQKEVQ----------------TY-------G 189
Query: 126 PVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASA 185
PV A + LY DIF +KSGVY ++ +A+ V VK++GWG ENG Y
Sbjct: 190 PVTAALNLYDDIFLHKSGVYTLTKNAKYVRLQYVKLIGWGVENGVDY------------- 236
Query: 186 EIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
W +V+++G ++G G +KI RG+ +ES V A+PK
Sbjct: 237 ---------------------WLLVNSWGNEWGQNGLLKIKRGKYGCAVESFVYAAVPK 274
>gi|312374702|gb|EFR22199.1| hypothetical protein AND_15622 [Anopheles darlingi]
Length = 339
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 73/240 (30%), Positives = 94/240 (39%), Gaps = 69/240 (28%)
Query: 4 SGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHT 63
G S WV GLV+GGA++S GC+P F PC + E PKC
Sbjct: 167 DGTSFQYWV---DAGLVSGGAYNSTEGCKPYPFKPCLYPFTDCHREE-------SPKCKH 216
Query: 64 RCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYG 123
C + + + +DK Y V + I+ EIM NGPV +Y D+F YKSG Y
Sbjct: 217 HCQH-GVDKRYARDKVFGSVAYSVPRDERVIRYEIMTNGPVEGGFDVYEDVFLYKSGVYR 275
Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
+ VY E V V+I+GWG E G PY
Sbjct: 276 H-------------------VY-----GEHVGKHAVRIIGWGREGGIPY----------- 300
Query: 184 SAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
W I +++GE +GD G KI+RG N IES V LP
Sbjct: 301 -----------------------WLISNSYGEDWGDHGYFKIVRGINHLGIESKVITGLP 337
>gi|161343877|tpg|DAA06119.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 145
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 56/206 (27%), Positives = 83/206 (40%), Gaps = 62/206 (30%)
Query: 38 PCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQE 97
PC H P C P+C +C N +YG + +D ++ +Y +E
Sbjct: 1 PCQHTESAVENP-CSNKTFFTPECKVQCYNPDYGTRYVKDNHKGTQY---RIPGYTAMKE 56
Query: 98 IMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYA 157
I +NGP+ A+ Y+Y D +Y+SG ++++ SG Y + +
Sbjct: 57 IYENGPITASFYMYQDFVNYQSG----------------VYAFNSGKYVTTQA------- 93
Query: 158 TVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQF 217
VKI+GWGEENG P YW ++F +
Sbjct: 94 -VKILGWGEENGTP----------------------------------YWLAANSFNTYW 118
Query: 218 GDKGTIKILRGRNEAIIESLVNGALP 243
GD G +KILRG NE IE + LP
Sbjct: 119 GDNGFVKILRGANECYIEEFMYAGLP 144
>gi|254575663|gb|ACT68328.1| cysteine proteinase [Haemonchus contortus]
Length = 348
Score = 83.6 bits (205), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 55/202 (27%), Positives = 81/202 (40%), Gaps = 41/202 (20%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G ++ W W G+VTGGA+ C+P FP C A+ + C + P C
Sbjct: 166 CDGGYNARAWKWATIAGVVTGGAYKEKGNCKPYVFPQCG-AHKGKAFNNCPSHPYATPAC 224
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C YG+ + DK + K +YW+ ++ IQ EIMK GPV A +Y D Y G
Sbjct: 225 KPYCQY-GYGKRYENDKIKAKTWYWLPNDERTIQLEIMKKGPVHATFNIYEDFEHYNGGV 283
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + + ++KI+GWG + G YW I ++
Sbjct: 284 Y------------------------IHTAGAMEGGHSIKIIGWGVDKGVKYWLIANSWST 319
Query: 182 SASAEIVAYATVKLIGWGEENG 203
WGE+ G
Sbjct: 320 D---------------WGEDGG 326
>gi|161343879|tpg|DAA06120.1| TPA_inf: cathepsin B [Toxoptera citricida]
Length = 340
Score = 83.6 bits (205), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 71/252 (28%), Positives = 99/252 (39%), Gaps = 75/252 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNH---ANYTTSEPECKTLATPQ 58
C+ G W K GLVTGG + S GC+P PPC + N T S P
Sbjct: 157 CNGGYPIKAWERFKKHGLVTGGEYKSGEGCEPYRVPPCPYDESGNNTCS-------GKPM 209
Query: 59 PKCHTRCTNDNYGRGFFQDK--YRFKR--YYWVNDEVADIQQEIMKNGPVVANMYLYSDI 114
+ H RCT YG +R R YY + IQ+++M GP+ A+ D+
Sbjct: 210 EQNH-RCTRMCYGDQDLDFDDDHRHTRDSYYLT---IGSIQKDVMTYGPIEASF----DV 261
Query: 115 FSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWT 174
Y D SYKSGVY S +A + VK++GWGEE G P
Sbjct: 262 -------------------YDDFLSYKSGVYVRSENASYLGGHAVKLIGWGEEYGTP--- 299
Query: 175 IVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 234
YW +++++ +GD+G KI RG NE +
Sbjct: 300 -------------------------------YWLMMNSWNADWGDEGLFKIRRGTNECGV 328
Query: 235 ESLVNGALPKDN 246
++ +P N
Sbjct: 329 DNSTTAGVPVTN 340
>gi|195165479|ref|XP_002023566.1| GL19846 [Drosophila persimilis]
gi|194105700|gb|EDW27743.1| GL19846 [Drosophila persimilis]
Length = 329
Score = 83.6 bits (205), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 56/176 (31%), Positives = 82/176 (46%), Gaps = 30/176 (17%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++G+V+GG + S GC+P PC H + + P C +TP C
Sbjct: 155 CNGGFPGAAWSYWTRKGIVSGGPYGSTQGCRPYEIAPCEH-HVNGTRPPCSHGSTPS--C 211
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C +Y + +DK + Y V VA+IQQEIM NGPV +Y D
Sbjct: 212 QHKC-QASYSVEYAKDKNFGSKSYSVRRNVAEIQQEIMTNGPVEGAFTVYED-------- 262
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWG--EENGRPYWTI 175
+ YKSGVY E+ +A ++I+GWG E+ PYW I
Sbjct: 263 ---------------LILYKSGVYQHEHGKELGGHA-IRILGWGVWGESKVPYWLI 302
>gi|119638965|gb|ABL85237.1| cysteine proteinase 3 [Necator americanus]
Length = 360
Score = 83.6 bits (205), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 67/244 (27%), Positives = 99/244 (40%), Gaps = 60/244 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W + G+ TGG + + C+P +F PC +Y +C + P PKC
Sbjct: 159 CGGGHTIRAWEYFKNTGVCTGGLYGTKDSCKPYAFYPCKDESYG----KCPKDSFPTPKC 214
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + + DKY Y + I+ EIM+NGPV A+ +Y D F +
Sbjct: 215 RKICQY-KYSKKYADDKYYANSAYRIPQNETWIKLEIMRNGPVTASFRIYPD-FGF---- 268
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y+ GVY S E+ +A +KI+GWG E
Sbjct: 269 ------------------YEKGVYVTSGGRELGGHA-IKIIGWGTE-------------- 295
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGD-KGTIKILRGRNEAIIESLVNG 240
K+ G PYW I +++G +G+ G +ILRG+N IE V
Sbjct: 296 ------------KVNG----TDLPYWLIANSWGTDWGENNGYFRILRGQNHCQIEQKVIA 339
Query: 241 ALPK 244
+ K
Sbjct: 340 GMIK 343
>gi|166030312|gb|ABY78823.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 335
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 62/234 (26%), Positives = 89/234 (38%), Gaps = 70/234 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W + GL +++ CQP FP C+H +P C PKC
Sbjct: 158 CDGGYPDAAWRYYVSHGL-------ASSYCQPYPFPHCDHHGGKGKKPPCSKYDFHTPKC 210
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+T CT+ + KYR Y V+ E YK
Sbjct: 211 NTTCTD----KAIPLIKYRGNHSYEVHGEE------------------------DYKREL 242
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y NGP V +YSD F+YK+GVY
Sbjct: 243 YFNGPFVVAFQVYSDFFAYKTGVYR----------------------------------- 267
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
S +++ V+++GWG+ NG PYW I +++ +G G ILRG++E IE
Sbjct: 268 HVSGDVLGGHAVRIVGWGKLNGTPYWKIANSWDTDWGMNGHFLILRGKDECGIE 321
>gi|159179|gb|AAA29178.1| cysteine proteinase, partial [Haemonchus contortus]
Length = 341
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 65/241 (26%), Positives = 95/241 (39%), Gaps = 60/241 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G S W + GLV+GG + S C+P PC H T EC A+ P C
Sbjct: 155 CDGGWSIKAWEYFTYAGLVSGGEYRSKRCCRPYPIHPCGHHGNDTYYGECPEEAS-TPSC 213
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C Y + + DK + + V IQ+E++KNGPV A+ +Y D YKSG
Sbjct: 214 KKKC-QPGYRKLYRMDKRYGTDAFQLPKSVEAIQKELLKNGPVTASFAVYEDFSLYKSG- 271
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ + +G E+ Y VK++GWG EN Y
Sbjct: 272 ---------------IYRHTAG--------ELRGYHAVKMIGWGTENRTDY--------- 299
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
W I +++ + +G+ G +I+RG N+ IE V
Sbjct: 300 -------------------------WLIANSWHDDWGENGYFRIIRGINDCGIEENVAAG 334
Query: 242 L 242
L
Sbjct: 335 L 335
>gi|294885809|ref|XP_002771442.1| cathepsin L precursor, putative [Perkinsus marinus ATCC 50983]
gi|239875086|gb|EER03258.1| cathepsin L precursor, putative [Perkinsus marinus ATCC 50983]
Length = 527
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 65/228 (28%), Positives = 96/228 (42%), Gaps = 65/228 (28%)
Query: 17 RGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQ 76
RG +T G GC P FPPC H T P+C + P C +C N Y
Sbjct: 364 RGNLTKG-----DGCWPYDFPPCAHHINDTKYPKCPKGSYETPNCVEQCHNPKYTTSLKN 418
Query: 77 DKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSD 136
D R+Y ++++ P Y YS + + K+ +GP+ A+ +Y D
Sbjct: 419 D-----RHY------------MLESSP-----YQYS-VNNAKNAIRTDGPISASYLVYED 455
Query: 137 IFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLI 196
+YKSGVY ++ + + +A VK+I
Sbjct: 456 FLAYKSGVYKHTSGSYLGGHA-----------------------------------VKII 480
Query: 197 GWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
GWGEENG YW +V+++ E +GD+G KI G E I+ + G PK
Sbjct: 481 GWGEENGEAYWLVVNSWNEDWGDQGLFKIALGNCE--IDDDLLGGTPK 526
>gi|7507648|pir||T24819 hypothetical protein T10H4.12 - Caenorhabditis elegans
Length = 324
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 74/263 (28%), Positives = 99/263 (37%), Gaps = 67/263 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G S + G VTGG + + GC P SF PC ++ P CKT K
Sbjct: 100 CKGGYSIEALRFWASSGAVTGGDYGGH-GCMPYSFAPCTKNCPESTTPSCKTTCQSSYKT 158
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYY--------WVNDEVADIQQEIMKNGPVVANMYLYSD 113
+ +YG + RF+R+ V +IQ EI GPV A+ +Y D
Sbjct: 159 EEYKKDKHYGELVWHSFNRFQRFLNRASAYKVTTTKSVTEIQTEIYHYGPVEASYKVYED 218
Query: 114 IFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYW 173
+ YKSG ++ Y SG ++V VKI+GWG ENG Y
Sbjct: 219 FYHYKSG----------------VYHYTSG--------KLVGGHAVKIIGWGVENGVDY- 253
Query: 174 TIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAI 233
W I +++G FG+KG KI RG NE
Sbjct: 254 ---------------------------------WLIANSWGTSFGEKGFFKIRRGTNECQ 280
Query: 234 IESLVNGALPKDNYGVEFGEESG 256
IE V + K E E+ G
Sbjct: 281 IEGNVVAGIAKLGTHSETYEDDG 303
>gi|121073168|gb|ABM47070.1| cathepsin B1 [Clonorchis sinensis]
gi|358341105|dbj|GAA29748.2| cathepsin B [Clonorchis sinensis]
Length = 339
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 62/243 (25%), Positives = 97/243 (39%), Gaps = 61/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W + + GLVTG +++ C+P SFPPC H +P TPQ C
Sbjct: 157 CQGGYPAQAWEYWVRNGLVTGDLYNTTDTCRPYSFPPCEHHVVGPRKPCTGDPTTPQ--C 214
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C + Y + + DK+ + Y ++ + I +++M GP+ + +Y+D SY SG
Sbjct: 215 VKKCQPE-YPKTYENDKWYGLKAYSIHSDQEAIMRDLMTYGPLEVDFEVYADFPSYSSGV 273
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + ++ V++VGWG E+G
Sbjct: 274 YRH------------------------VAGGLLGGHAVRLVGWGVEDG------------ 297
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW I +++ +GD G KI RG NE IES N
Sbjct: 298 ----------------------ADYWLIANSWNTDWGDGGYFKIRRGVNECGIESDANAG 335
Query: 242 LPK 244
PK
Sbjct: 336 HPK 338
>gi|161343851|tpg|DAA06106.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 333
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 67/243 (27%), Positives = 96/243 (39%), Gaps = 65/243 (26%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + K GLVTGG +S GCQP FPPC T C + KC
Sbjct: 153 CQGGYPIRAWRYYSKHGLVTGGNFNSFEGCQPYMFPPC------TGNNSCSGQSEKNHKC 206
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C + YR R Y + ++ V+A + +DI +Y
Sbjct: 207 QKKCFGNT------SISYRGDRRY------------VERSPYVLAYDNMQNDIMTY---- 244
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
GP+ ++ +Y D SYKSGVY S +A + +VK +GWG E V
Sbjct: 245 ---GPIESSFDVYDDFISYKSGVYFKSPNATYLGGHSVKCIGWGVERN-----------V 290
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
S YW +++++ +GD G KI RG NE +E
Sbjct: 291 S-----------------------YWLMMNSWNNTWGDGGNFKIRRGTNECQVEDSSTAG 327
Query: 242 LPK 244
+P+
Sbjct: 328 MPE 330
>gi|124502519|gb|ABN13633.1| cysteine proteinase [Haemonchus contortus]
Length = 342
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 64/241 (26%), Positives = 94/241 (39%), Gaps = 60/241 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + G+V+GG + + C+P PC H T EC+ A P P C
Sbjct: 156 CEGGWPIEAWKYFIYDGVVSGGEYLTKGVCRPYPIHPCGHHGNDTYYGECRGTA-PTPPC 214
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C + + DK K Y V V IQ EI++NGPVVA+ +Y D YKSG
Sbjct: 215 KKEC-RPGVRKVYRIDKRYGKDAYIVKQSVKAIQSEILRNGPVVASFAVYEDFRHYKSG- 272
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ + +G E+ Y VK++GWG E
Sbjct: 273 ---------------IYKHTAG--------ELRGYHAVKMIGWGNE-------------- 295
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
N +W I +++ +G+KG +I+RG N+ IE +
Sbjct: 296 --------------------NNTDFWLIANSWHNDWGEKGYFRIIRGTNDCGIEGTIAAG 335
Query: 242 L 242
+
Sbjct: 336 I 336
>gi|294955270|ref|XP_002788457.1| cysteine protease, putative [Perkinsus marinus ATCC 50983]
gi|239903926|gb|EER20253.1| cysteine protease, putative [Perkinsus marinus ATCC 50983]
Length = 392
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 49/178 (27%), Positives = 76/178 (42%), Gaps = 29/178 (16%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W +++ G+ T G+ + GC P +FP C H + C P C
Sbjct: 159 CTKGRPDAAWSFLNVYGIATEGSMSAADGCWPYNFPKCGHHQQDSKYQPCPEKNYDTPPC 218
Query: 62 HTRCTNDNYGRGFFQDKY---RFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
RC N NYG +D++ F Y + +I++EIM NGP A +Y D SY+
Sbjct: 219 LDRCPNKNYGTPLDKDRHFTAHFSPYQLKGTD--NIKKEIMTNGPTSAAFSMYDDFLSYE 276
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
SG Y + S ++ V+I+GWG + G YW ++
Sbjct: 277 SGVYKH------------------------TSGTLMGEHGVEIIGWGTKQGVDYWLVM 310
>gi|3912916|gb|AAC78691.1| thiol protease [Trichuris suis]
Length = 348
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 69/250 (27%), Positives = 97/250 (38%), Gaps = 76/250 (30%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPV-------------SFPPCNHANYTTSE 48
C+ G W G TGG GC+P + PC + Y
Sbjct: 153 CNGGFPIEAWRHFTVAGNCTGGKTIDKYGCKPYKPTGPIGRHLKRNDYAPCPNDTYYG-- 210
Query: 49 PECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANM 108
EC +A P+C RC Y + + D+Y K Y V V IQ+EIMKNGPVVA+
Sbjct: 211 -ECVGMAD-TPRCKRRCLL-GYPKSYPSDRYYGKSAYIVKQSVKAIQREIMKNGPVVASF 267
Query: 109 YLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEEN 168
+Y D YKSG Y + + E+ Y VKI+GWG+E
Sbjct: 268 AVYEDFRHYKSGIYKH------------------------TAGELRGYHAVKIIGWGKE- 302
Query: 169 GRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 228
N +W I +++ + +G+KG +I+RG
Sbjct: 303 ---------------------------------NNTDFWLIANSWHQDWGEKGYFRIVRG 329
Query: 229 RNEAIIESLV 238
+NE IE+ V
Sbjct: 330 KNECGIETDV 339
>gi|239938578|gb|ACS36088.1| cysteine proteinase [Haemonchus contortus]
Length = 332
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 57/175 (32%), Positives = 82/175 (46%), Gaps = 28/175 (16%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPK 60
C+ G+ W +V + G+VTGG + C+P PC NH S P + TP
Sbjct: 165 CNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYPLHPCGNHGGKFWSCPRDHSFRTPA-- 222
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C C YG+ + +DK K Y ++++ IQ+E+MKNGPV A Y D FS+
Sbjct: 223 CKKYCQY-GYGKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAASITYED-FSF--- 277
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
Y+ G+Y + + A+A VK+VGWG ENG YW +
Sbjct: 278 -------------------YRRGIYVHTRGRQRGAHA-VKVVGWGVENGTKYWNV 312
>gi|166030308|gb|ABY78821.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 64/242 (26%), Positives = 91/242 (37%), Gaps = 69/242 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G S W + GL +++ CQP FP C H +P C PKC
Sbjct: 158 CKGGAPDSAWEYYVSHGL-------ASSYCQPYPFPHCGHHGGKGKKPPCSKYHFHTPKC 210
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+T CT+ + KYR Y + + D ++E+ NGP V + +YSD +YK+G
Sbjct: 211 NTTCTD----KAIPLIKYRGNNSYMLLNGEDDYKRELYFNGPFVVDFGVYSDFLAYKTGV 266
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + S +++ V+IVGWG+ NG PY
Sbjct: 267 YRH------------------------VSGDVLGGHAVRIVGWGKLNGTPY--------- 293
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
W I +++ +G G ILRG NE IES
Sbjct: 294 -------------------------WKIANSWDTDWGMNGHFLILRGNNECGIESTGYAG 328
Query: 242 LP 243
LP
Sbjct: 329 LP 330
>gi|335347289|gb|AEH42092.1| cysteine proteinase 1 [Haemonchus contortus]
Length = 332
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 57/175 (32%), Positives = 81/175 (46%), Gaps = 28/175 (16%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPK 60
C+ G+ W +V + G+VTGG + C+P PC NH S P + TP
Sbjct: 165 CNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYPLHPCGNHGGKFWSCPRDHSFRTPA-- 222
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C C YG+ + +DK K Y ++++ IQ+E+MKNGPV A Y D FS+
Sbjct: 223 CKKYCQY-GYGKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAAFITYED-FSF--- 277
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
Y G+Y + + A+A VK+VGWG ENG YW +
Sbjct: 278 -------------------YTKGIYVHTRGRQRGAHA-VKVVGWGVENGTKYWNV 312
>gi|239938580|gb|ACS36089.1| cysteine proteinase [Haemonchus contortus]
Length = 332
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 57/175 (32%), Positives = 81/175 (46%), Gaps = 28/175 (16%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPK 60
C+ G+ W +V + G+VTGG + C+P PC NH S P + TP
Sbjct: 165 CNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYPLHPCGNHGGKFWSCPRDHSFRTPA-- 222
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C C YG+ + +DK K Y ++++ IQ+E+MKNGPV A Y D FS+
Sbjct: 223 CKKYCQY-GYGKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAAFITYED-FSF--- 277
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
Y G+Y + + A+A VK+VGWG ENG YW +
Sbjct: 278 -------------------YTKGIYVHTRGRQRGAHA-VKVVGWGVENGTKYWNV 312
>gi|404250524|gb|AFR54113.1| cysteine proteinase, partial [Haemonchus contortus]
Length = 332
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 57/175 (32%), Positives = 81/175 (46%), Gaps = 28/175 (16%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPK 60
C+ G+ W +V + G+VTGG + C+P PC NH S P + TP
Sbjct: 165 CNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYPLHPCGNHGGKFWSCPRDHSFRTP--A 222
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C C YG+ + +DK K Y ++++ IQ+E+MKNGPV A Y D FS+
Sbjct: 223 CKKYCQY-GYGKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAAFITYED-FSF--- 277
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
Y G+Y + + A+A VK+VGWG ENG YW +
Sbjct: 278 -------------------YTKGIYVHTRGRQRGAHA-VKVVGWGVENGTKYWNV 312
>gi|170060936|ref|XP_001866022.1| cathepsin B [Culex quinquefasciatus]
gi|167879259|gb|EDS42642.1| cathepsin B [Culex quinquefasciatus]
Length = 341
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 66/243 (27%), Positives = 97/243 (39%), Gaps = 70/243 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + +RG+ +GG ++S GC P C+ A+ P+C
Sbjct: 158 CQGGNLGPAWQFWVQRGVSSGGPYNSRQGCHPYPVDVCHSADEDADTPKC---------- 207
Query: 62 HTRCTNDNYGRGFFQDKYRFKRY-YWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
TR Y D RF R Y V+ + I++EI +NGPV A+ +Y D +YK+G
Sbjct: 208 -TRKCQSMYNVTNVSDDRRFGRVAYSVSQDEERIKEEIFRNGPVQASFDVYLDFKAYKTG 266
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
+Y +F G +A VK++GWG EN
Sbjct: 267 ------------VYRHVFGPMEGGHA------------VKMIGWGVEN------------ 290
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
G YW +++GE +G++G KI+RG N IES V+
Sbjct: 291 ----------------------GTKYWLCSNSWGEDWGERGFFKIVRGENHCGIESDVHA 328
Query: 241 ALP 243
LP
Sbjct: 329 GLP 331
>gi|390994433|gb|AFM37366.1| cathepsin B3 [Dictyocaulus viviparus]
Length = 342
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 66/242 (27%), Positives = 97/242 (40%), Gaps = 63/242 (26%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPK 60
C G + W + G+VTGG + + C+P PC NH N T C ++TP
Sbjct: 162 CDGGFPDAAWEYFVSTGVVTGGLYGTKNACRPYEISPCGNHPNETFYR-NCTGVSTPS-- 218
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C T C Y + DK R ++ Y + + V+ IQ++I+K+GP+VA +Y D YK G
Sbjct: 219 CKTSC-QKGYPVSYKDDKTRGRKSYNLANSVSAIQKDILKHGPLVATFSVYEDFMYYKKG 277
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
I+ Y G Y + V+I+GWG EN YW I +
Sbjct: 278 ----------------IYRYTHGGYEGGHA--------VRILGWGVENNVKYWIIANSWN 313
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
WGE+ G +++RG N+ IE V+
Sbjct: 314 TD---------------WGED-------------------GFFRMVRGINDCGIEESVSA 339
Query: 241 AL 242
L
Sbjct: 340 GL 341
>gi|407080581|gb|AFS89610.1| procathepsin B precursor [Phenacoccus solenopsis]
Length = 309
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 67/245 (27%), Positives = 101/245 (41%), Gaps = 67/245 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C ++ WV K G+V+GG++ S GCQP PPC H + C T P P C
Sbjct: 123 CDHHLAWDHWV---KHGIVSGGSYGSKEGCQPYHLPPCEH-HRAGPRRNC-TKYGPTPSC 177
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWV---NDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
C D Y + D + K++Y + N+++ I+ EI NGPV A M Y D ++Y+
Sbjct: 178 ARVCQPD-YKISYEDDLHFGKQWYALAPHNEKI--IRTEIFHNGPVEATMAAYEDFYTYE 234
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
SG I+ + G + V VKI+GWG +
Sbjct: 235 SG----------------IYHHIEGTF--------VCDHAVKIIGWGTD----------- 259
Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
++ PYW + ++F +G+ G KI RG NE IE+ +
Sbjct: 260 ---------------------KKTNTPYWLVANSFNTDWGEYGFFKIKRGVNECGIENKI 298
Query: 239 NGALP 243
+P
Sbjct: 299 TAGIP 303
>gi|166030328|gb|ABY78831.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 60/236 (25%), Positives = 91/236 (38%), Gaps = 69/236 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W++ + G+ +++GCQP FP C H ++ C PKC
Sbjct: 158 CKGGFPGFAWLYYVEYGI-------ASSGCQPYPFPHCEHRGAQGNKTPCSKYKFDTPKC 210
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ CT+ + KYR Y + D ++E+ NGP VA ++Y+D+F+YKSG
Sbjct: 211 NATCTD----KSIPLVKYRGNATYLLLHGEEDYKRELYFNGPFVAVFFVYTDLFAYKSGV 266
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y N + + V+IVGWG+ NG P
Sbjct: 267 YRN------------------------VDGDFLGGQAVRIVGWGKLNGTP---------- 292
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
YW + +++ +G G + ILRG NE IE L
Sbjct: 293 ------------------------YWKVANSWDTDWGMNGYMLILRGNNECNIEHL 324
>gi|268561802|ref|XP_002638421.1| C. briggsae CBR-CPR-3 protein [Caenorhabditis briggsae]
Length = 375
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 70/246 (28%), Positives = 94/246 (38%), Gaps = 71/246 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTT-SEPECKTLATPQPK 60
C G + + G+VTGG ++ GC P SFPPC + S P CKT
Sbjct: 165 CQGGYTIEAMKYWMNSGVVTGG-DYNGAGCMPYSFPPCKKSPCVEFSTPSCKT------T 217
Query: 61 CHTRCTNDNY--GRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
C + T +Y + F Y+ + V IQ EI NGPV A+ ++ D + YK
Sbjct: 218 CQEKYTTADYKNDKHFATSAYKLST---TKNAVPTIQYEIYHNGPVEASYRVFEDFYQYK 274
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
SG Y + S +V VKI+GWG ENG Y
Sbjct: 275 SGVYHH------------------------VSGNLVGGHAVKIIGWGTENGVDY------ 304
Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
W + +++G FG+KG KI RG NE IES +
Sbjct: 305 ----------------------------WLVANSWGTSFGEKGFFKIRRGTNECQIESNI 336
Query: 239 NGALPK 244
L K
Sbjct: 337 VAGLAK 342
>gi|189239879|ref|XP_968767.2| PREDICTED: similar to putative cathepsin B-like proteinase
[Tribolium castaneum]
gi|270012755|gb|EFA09203.1| cathepsin B precursor [Tribolium castaneum]
Length = 353
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 68/245 (27%), Positives = 98/245 (40%), Gaps = 71/245 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G S W + GLV+GG ++++ GCQP S +N+ P+C
Sbjct: 143 CKGGYSYYAWKYYTSTGLVSGGDYNTSRGCQPYS-----KSNFNDGV---------SPEC 188
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C N Y + D++ Y++ V IQQEI+ G
Sbjct: 189 SKTCQNTKYPTSYLNDRHFGDGTYYILKNVTTIQQEILLRG------------------- 229
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
GPV+A +Y D Y+ GVY ++ A + ++A VKI+GWG ENG YW +
Sbjct: 230 ---GPVMAGFDVYEDFKLYREGVYVHTSGALLGSHA-VKIIGWGTENGWAYWLVAN---- 281
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE-SLVNG 240
WG++ G G KI RG NE IE S++ G
Sbjct: 282 ---------------SWGKDWG--------------ALGGVFKIRRGTNECKIEQSIITG 312
Query: 241 ALPKD 245
+ KD
Sbjct: 313 HVRKD 317
>gi|2944340|gb|AAC05262.1| cathepsin B-like cysteine protease GCP7 [Haemonchus contortus]
Length = 348
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 53/202 (26%), Positives = 82/202 (40%), Gaps = 41/202 (20%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G ++ W W G+VTGGA+ C+P FP C A+ + C + P C
Sbjct: 166 CDGGYNARAWKWATIAGVVTGGAYKEKGNCKPYVFPQCG-AHKGKAFNNCPSHPYATPAC 224
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C YG+ + DK + + +YW+ ++ IQ EIM+ GPV A +Y D Y+ G
Sbjct: 225 KPYCQY-GYGKRYENDKIKARTWYWLPNDERTIQLEIMQKGPVHATFNIYEDFEHYEGGV 283
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + + ++KI+GWG + G YW I ++
Sbjct: 284 Y------------------------IHTAGAMEGGHSIKIIGWGVDKGVKYWLIANSWST 319
Query: 182 SASAEIVAYATVKLIGWGEENG 203
WGE+ G
Sbjct: 320 D---------------WGEDGG 326
>gi|3087797|emb|CAA93275.1| cysteine proteinase [Haemonchus contortus]
Length = 330
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 56/174 (32%), Positives = 80/174 (45%), Gaps = 27/174 (15%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G+ W +V + G+VTGG + C+P PC S P + TP C
Sbjct: 165 CNGGMDHKAWEYVKEFGVVTGGRYQEKGVCKPYHLHPCEITGKFWSCPRDHSFRTPA--C 222
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C YG+ + +DK K Y ++++ IQ+E+MKNGPV A Y D FS+
Sbjct: 223 KKYCQY-GYGKRYEKDKSYVKSVYILDEDEKAIQREMMKNGPVQAAFTTYED-FSF---- 276
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
Y+ G+Y S + A+A VK+VGWG ENG YW +
Sbjct: 277 ------------------YRKGIYVHSYGRQRGAHA-VKVVGWGVENGTKYWNV 311
>gi|291291827|gb|ADD91786.1| cysteine proteinase [Haemonchus contortus]
Length = 253
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 60/175 (34%), Positives = 77/175 (44%), Gaps = 26/175 (14%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + + K+G VTGG + + +GC+P F PC H T EC AT PKC
Sbjct: 72 CNGGWPIQAFNYFSKQGAVTGGDYKATSGCRPYPFHPCGHHGKDTYYGECPNEAT-TPKC 130
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C Y N E A Q+EIMKNGPVV +Y D FSY
Sbjct: 131 VRKCQKSYKKSYKKDRSIGKDAYEEPNAEKA-TQREIMKNGPVVGAFTVYED-FSY---- 184
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
YK G+Y +A +A +KI+GWG+E G PYW I
Sbjct: 185 ------------------YKKGIYKHTAGKARGGHA-IKIIGWGKEGGVPYWLIA 220
>gi|209863073|ref|NP_001119610.2| cathepsin B-1852 [Acyrthosiphon pisum]
Length = 333
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 67/243 (27%), Positives = 96/243 (39%), Gaps = 65/243 (26%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + K GLVTGG +S GCQP FPPC T C + KC
Sbjct: 153 CQGGYPIRAWRYYSKHGLVTGGNFNSFEGCQPYMFPPC------TGNNSCSGQSEKNHKC 206
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C + YR R Y + ++ V+A + +DI +Y
Sbjct: 207 QKKCFGNT------SISYRGDRRY------------VERSPYVLAYDNMQNDIMTY---- 244
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
GP+ ++ +Y D SYKSGVY S +A + +VK +GWG E V
Sbjct: 245 ---GPIESSFDVYDDFISYKSGVYFKSPNATYLGGHSVKCIGWGVERN-----------V 290
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
S YW +++++ +GD G KI RG NE +E
Sbjct: 291 S-----------------------YWLMMNSWNSTWGDGGYFKIRRGTNECQVEDSSTAG 327
Query: 242 LPK 244
+P+
Sbjct: 328 VPE 330
>gi|48762483|dbj|BAD23811.1| cathepsin B-S [Tuberaphis takenouchii]
Length = 155
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 49/179 (27%), Positives = 80/179 (44%), Gaps = 30/179 (16%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + +G+ TGG + S GC P PPC + P +
Sbjct: 5 CEGGYPIKAWQYFRTQGVPTGGDYDSKEGCAPYKIPPCFDQKGKNT-----CAGKPLERN 59
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
H +C YG Q +Y+ K Y +N ++Q+++K GP+ A+ L+ D
Sbjct: 60 H-QCPKTCYGSTTVQKRYKVKNEYVLNSPNT-MEQDLIKYGPIEASFNLFDD-------- 109
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
+ +YKSG+Y + A+ ++ ++KI+GWG+ENG PYW V ++
Sbjct: 110 ---------------LSAYKSGIYQKTPKAKFLSGHSIKIIGWGKENGVPYWLAVNSWS 153
>gi|160688716|gb|ABX45136.1| cathepsin B-like cysteine protease 2 [Callosobruchus maculatus]
Length = 260
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 61/216 (28%), Positives = 89/216 (41%), Gaps = 67/216 (31%)
Query: 30 GCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVND 89
GC P CN P CKTL P C C + + + +DK+ K+ Y +
Sbjct: 111 GCMSYPLPRCN--------PSCKTLYD-APTCKKECDKGSPLK-YEEDKHYAKQAYRIMS 160
Query: 90 EVA-DIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVS 148
+V IQ EI+KNGPVV A+ +Y+D Y SGVY
Sbjct: 161 KVERQIQLEIIKNGPVV-----------------------ASFTVYADFIHYLSGVYKFD 197
Query: 149 ASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWT 208
++++ V+I+GWG ENG PYW
Sbjct: 198 GESKLLGGHAVRIIGWGIENGT---------------------------------YPYWL 224
Query: 209 IVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
+ +++ E++GD+G KI RG+NE IE + LP+
Sbjct: 225 VSNSWNERWGDQGLFKIWRGKNECGIEEEITAGLPR 260
>gi|326515156|dbj|BAK03491.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 471
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 65/243 (26%), Positives = 94/243 (38%), Gaps = 63/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G S+ W + GLVTGG +SN GC P C+H +P C + P P C
Sbjct: 286 CEGGYPSAAWDYFQSTGLVTGGDWNSNQGCYPYQLQACDHHVTGKYQP-CGDI-QPTPAC 343
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C N+ + DK+ Y V + I EI NGPV A+ +Y+D SYKSG
Sbjct: 344 ANSCQNN---ATWSSDKHFGASSYSVGTDQQSIMTEIYTNGPVEASYDVYADFVSYKSG- 399
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
++ + +G Y + VKI+GWG + P
Sbjct: 400 ---------------VYQHVTGDY--------LGGHAVKIIGWGVDGSTP---------- 426
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW + +++ +G+ G ILRG +E IE +
Sbjct: 427 ------------------------YWIVANSWNNDWGNNGFFNILRGSDECGIEDGIVAG 462
Query: 242 LPK 244
+PK
Sbjct: 463 IPK 465
>gi|308466896|ref|XP_003095699.1| CRE-CPR-3 protein [Caenorhabditis remanei]
gi|308244581|gb|EFO88533.1| CRE-CPR-3 protein [Caenorhabditis remanei]
Length = 373
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 72/254 (28%), Positives = 100/254 (39%), Gaps = 67/254 (26%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYT-TSEPECKTLATPQPK 60
C G S + G VTGG ++ N GC P SF PC + ++ P CKT
Sbjct: 163 CQGGYSIEAMRFWKSNGAVTGGDYNGN-GCMPYSFAPCQKSPCVESTTPTCKTTCQSSYT 221
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
T+ +YG + R N+ V+ IQ EI NGPV A+ +Y D + YKSG
Sbjct: 222 TANYTTDKHYGTSAY-------RLATTNNVVSTIQYEIYHNGPVEASYKVYEDFYQYKSG 274
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
++ Y SG ++V VKI+GWG EN Y
Sbjct: 275 ----------------VYHYVSG--------KLVGGHAVKIIGWGTENDVDY-------- 302
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
W + +++G +FG+ G KI RG NE IES V
Sbjct: 303 --------------------------WLVANSWGIKFGEGGFFKIRRGTNECQIESNVVA 336
Query: 241 ALPKDNYGVEFGEE 254
+ K E G++
Sbjct: 337 GVAKLGTHAEKGDD 350
>gi|44965462|gb|AAS49538.1| cathepsin B [Protopterus dolloi]
Length = 225
Score = 80.9 bits (198), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 52/167 (31%), Positives = 75/167 (44%), Gaps = 26/167 (15%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W + ++GLV+GG + S GC+P + PPC H + S P C PKC
Sbjct: 83 CNGGYPSGAWQYWTEKGLVSGGLYGSGIGCRPYTIPPCEH-HVNGSRPSCSGEGGDTPKC 141
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C + Y + +DK + Y V I +EI K+GPV +Y D YKSG
Sbjct: 142 VQKC-DSGYTPAYEKDKIYGQSAYSVPSSPESIMEEIYKDGPVEGAFTVYEDFLLYKSG- 199
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEEN 168
++ + +G E V +KI+GWG EN
Sbjct: 200 ---------------VYQHHTG--------EAVGGHAIKILGWGIEN 223
>gi|161343869|tpg|DAA06115.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 337
Score = 80.9 bits (198), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 68/246 (27%), Positives = 101/246 (41%), Gaps = 66/246 (26%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W GLVTGG ++S GC+P PP N N ++S+ + C
Sbjct: 157 CHGGYPIKAWKRFSTHGLVTGGDYNSGEGCEPYRVPPSNDGNSSSSDQPLAINHICRRHC 216
Query: 62 HTRCTNDNYGRGFFQDKYRFKR-YYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
+ + D F D +R+ R YY++ IQ+ D+ +Y
Sbjct: 217 YGNQSID------FNDDHRYTRDYYYLT--YGSIQK----------------DVLTY--- 249
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
GP+ A+ +Y D SYKSGVY S +A + VK++GWGEE+G PY
Sbjct: 250 ----GPIEASFDVYDDFPSYKSGVYVKSDNASYLGGHAVKLIGWGEEDGTPY-------- 297
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
W +V+++ Q+GD G KI RG NE +++
Sbjct: 298 --------------------------WLMVNSWNTQWGDNGFFKIRRGTNECGVDNSTTA 331
Query: 241 ALPKDN 246
+P N
Sbjct: 332 GVPVTN 337
>gi|166030322|gb|ABY78828.1| cathepsin B-like protease [Trypanosoma congolense]
gi|343471419|emb|CCD16168.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 80.9 bits (198), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 61/207 (29%), Positives = 86/207 (41%), Gaps = 62/207 (29%)
Query: 31 CQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDE 90
CQP FP C H ++ C P+C+T CT+ + KYR K Y +
Sbjct: 180 CQPYPFPQCEHQGAQGNKTPCSNYKFVTPQCNTTCTD----KTIPLIKYRGKDAYMLLPG 235
Query: 91 VADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSAS 150
+ ++E+ NGP VA +++Y+D+F+YKSG Y N V Y+ GV A
Sbjct: 236 EEEFKRELYFNGPFVAILFVYTDLFAYKSGVYRN---VDGSYM---------GVTA---- 279
Query: 151 AEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIV 210
VK+VGWG+ NG P YW +
Sbjct: 280 --------VKVVGWGKLNGTP----------------------------------YWKVA 297
Query: 211 STFGEQFGDKGTIKILRGRNEAIIESL 237
+T+ +G G + ILRG NE IE L
Sbjct: 298 NTWDTDWGMDGYLLILRGNNECNIEHL 324
>gi|389593817|ref|XP_003722157.1| cysteine peptidase C (CPC) [Leishmania major strain Friedlin]
gi|321438655|emb|CBZ12414.1| cysteine peptidase C (CPC) [Leishmania major strain Friedlin]
Length = 340
Score = 80.9 bits (198), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 63/235 (26%), Positives = 86/235 (36%), Gaps = 70/235 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C GI + W+W G+ T CQP F PC+H + P C + PKC
Sbjct: 166 CHGGIPTVAWLWWVWVGIAT-------EDCQPYPFDPCSHHGNSEKYPPCPSTIYDTPKC 218
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+T C KY+ Y V E ++ E+M NGP+ M +YSD YKSG
Sbjct: 219 NTTCERSEMDL----VKYKGSTSYSVKGE-KELMIELMTNGPLELTMQVYSDFVGYKSGV 273
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + E + VK+VGWG ++G P
Sbjct: 274 YKH------------------------VLGEFLGGHAVKLVGWGTQDGVP---------- 299
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
YW + +++ +GDKG I RG NE IES
Sbjct: 300 ------------------------YWKVANSWNTDWGDKGYFLIQRGNNECKIES 330
>gi|237836005|ref|XP_002367300.1| cysteine proteinase, putative [Toxoplasma gondii ME49]
gi|211964964|gb|EEB00160.1| cysteine proteinase, putative [Toxoplasma gondii ME49]
gi|221506020|gb|EEE31655.1| cysteine proteinase, putative [Toxoplasma gondii VEG]
Length = 572
Score = 80.9 bits (198), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 54/183 (29%), Positives = 77/183 (42%), Gaps = 34/183 (18%)
Query: 2 CSSGISSSTWVWVHKRGLVTGG---AHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQ 58
C+ G W W ++G+VTGG A T C P P C H + P+C P+
Sbjct: 350 CNGGQPGMAWRWFERKGVVTGGDFDALGKGTTCWPYEVPFCAH-HAKAPFPDCDATLVPR 408
Query: 59 --PKCHTRCTNDNYGRG---FFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSD 113
PKC C Y F QD ++ Y + D+++++M +GPV +Y D
Sbjct: 409 KTPKCRKDCEEQAYADNVHPFDQDTHKATSAYSLRSR-DDVKRDMMTHGPVSGAFMVYED 467
Query: 114 IFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYW 173
SYKSG ++ + SG+ V +KI+GWG ENG YW
Sbjct: 468 FLSYKSG----------------VYKHVSGL--------PVGGHAIKIIGWGTENGEEYW 503
Query: 174 TIV 176
V
Sbjct: 504 HAV 506
>gi|21700775|gb|AAL60053.1| cysteine proteinase [Toxoplasma gondii]
Length = 569
Score = 80.9 bits (198), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 54/183 (29%), Positives = 77/183 (42%), Gaps = 34/183 (18%)
Query: 2 CSSGISSSTWVWVHKRGLVTGG---AHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQ 58
C+ G W W ++G+VTGG A T C P P C H + P+C P+
Sbjct: 347 CNGGQPGMAWRWFERKGVVTGGDFDALGKGTTCWPYEVPFCAH-HAKAPFPDCDATLVPR 405
Query: 59 --PKCHTRCTNDNYGRG---FFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSD 113
PKC C Y F QD ++ Y + D+++++M +GPV +Y D
Sbjct: 406 KTPKCRKDCEEQAYADNVHPFDQDTHKATSAYSLRSR-DDVKRDMMTHGPVSGAFMVYED 464
Query: 114 IFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYW 173
SYKSG ++ + SG+ V +KI+GWG ENG YW
Sbjct: 465 FLSYKSG----------------VYKHVSGL--------PVGGHAIKIIGWGTENGEEYW 500
Query: 174 TIV 176
V
Sbjct: 501 HAV 503
>gi|343476073|emb|CCD12715.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 80.5 bits (197), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 61/207 (29%), Positives = 86/207 (41%), Gaps = 62/207 (29%)
Query: 31 CQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDE 90
CQP FP C H ++ C P+C+T CT+ + KYR K Y +
Sbjct: 180 CQPYPFPQCEHHGAQGNKTPCSNYKFVTPQCNTTCTD----KTIPLIKYRGKDAYMLLPG 235
Query: 91 VADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSAS 150
+ ++E+ NGP VA +++Y+D+F+YKSG Y N V Y+ GV A
Sbjct: 236 EEEFKRELYFNGPFVAILFVYTDLFAYKSGVYRN---VDGSYM---------GVTA---- 279
Query: 151 AEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIV 210
VK+VGWG+ NG P YW +
Sbjct: 280 --------VKVVGWGKLNGTP----------------------------------YWKVA 297
Query: 211 STFGEQFGDKGTIKILRGRNEAIIESL 237
+T+ +G G + ILRG NE IE L
Sbjct: 298 NTWDTDWGMDGYLLILRGNNECNIEHL 324
>gi|221484923|gb|EEE23213.1| cysteine proteinase, putative [Toxoplasma gondii GT1]
Length = 569
Score = 80.5 bits (197), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 54/183 (29%), Positives = 77/183 (42%), Gaps = 34/183 (18%)
Query: 2 CSSGISSSTWVWVHKRGLVTGG---AHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQ 58
C+ G W W ++G+VTGG A T C P P C H + P+C P+
Sbjct: 347 CNGGQPGMAWRWFERKGVVTGGDFDALGKGTTCWPYEVPFCAH-HAKAPFPDCDATLVPR 405
Query: 59 --PKCHTRCTNDNYGRG---FFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSD 113
PKC C Y F QD ++ Y + D+++++M +GPV +Y D
Sbjct: 406 KTPKCRKDCEEQAYADNVHPFDQDTHKATSAYSLRSR-DDVKRDMMTHGPVSGAFMVYED 464
Query: 114 IFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYW 173
SYKSG ++ + SG+ V +KI+GWG ENG YW
Sbjct: 465 FLSYKSG----------------VYKHVSGL--------PVGGHAIKIIGWGTENGEEYW 500
Query: 174 TIV 176
V
Sbjct: 501 HAV 503
>gi|1008858|gb|AAA79004.1| cathepsin B-like thiol protease [Aedes aegypti]
Length = 342
Score = 80.5 bits (197), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 64/245 (26%), Positives = 98/245 (40%), Gaps = 68/245 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + ++G+ +GG ++S GC P C+ S E T PKC
Sbjct: 156 CKGGYLGPAWQFWVEQGVSSGGPYNSRQGCHPYPIDVCD-----ASGEEADT-----PKC 205
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
RC + +QD+ + Y + ++ I +EI NGPV A Y D+ +YKSG
Sbjct: 206 SKRCQSGYNVTDVWQDRRYGRVAYSIPNDEQKIMEEIYINGPVQAAFMTYQDLHAYKSG- 264
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+Y ++ + +G +A VK++GWG ENG
Sbjct: 265 -----------VYRHVWGHMAGGHA------------VKLMGWGVENG------------ 289
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW + +++G+ +GD G KI+RG N IE V+
Sbjct: 290 ----------------------LKYWLVANSWGDDWGDNGFFKIVRGENHCGIEKDVHAG 327
Query: 242 LPKDN 246
LP N
Sbjct: 328 LPSFN 332
>gi|268619140|gb|ACZ13346.1| cathepsin B-like cysteine proteinase [Bursaphelenchus xylophilus]
Length = 405
Score = 80.5 bits (197), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 57/177 (32%), Positives = 77/177 (43%), Gaps = 30/177 (16%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK- 60
C+ G+ + + G TG + GCQP F C H +T P C ++ P+ K
Sbjct: 140 CNGGLEEVAFEKFIENGFPTGSEVDKHQGCQPYPFKHCAHHVNSTEYPPCDSV--PEYKA 197
Query: 61 --CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
C C D Y R + +D Y K Y +DE A IQ+EIM NGPV + +Y
Sbjct: 198 DTCSHECQKD-YDRKYEEDLYYGKEQYGFSDE-APIQREIMTNGPVAVSFTVYES----- 250
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
+LY Y G+Y + I Y V++VGWG ENG YW I
Sbjct: 251 -------------FLY-----YSGGIYRSTPGERIKGYHAVRVVGWGVENGTKYWKI 289
>gi|157167283|ref|XP_001658486.1| cathepsin b [Aedes aegypti]
gi|108876477|gb|EAT40702.1| AAEL007599-PA [Aedes aegypti]
Length = 342
Score = 80.5 bits (197), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 64/245 (26%), Positives = 98/245 (40%), Gaps = 68/245 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + ++G+ +GG ++S GC P C+ S E T PKC
Sbjct: 156 CKGGYLGPAWQFWVEQGVSSGGPYNSRQGCHPYPIDVCD-----ASGEEADT-----PKC 205
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
RC + +QD+ + Y + ++ I +EI NGPV A Y D+ +YKSG
Sbjct: 206 SKRCQSGYNVTDVWQDRRYGRVAYSIPNDEQKIMEEIYINGPVQAAFMTYQDLHAYKSG- 264
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+Y ++ + +G +A VK++GWG ENG
Sbjct: 265 -----------VYRHVWGHMAGGHA------------VKLMGWGVENG------------ 289
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW + +++G+ +GD G KI+RG N IE V+
Sbjct: 290 ----------------------LKYWLVANSWGDDWGDNGFFKIVRGENHCGIEKDVHAG 327
Query: 242 LPKDN 246
LP N
Sbjct: 328 LPSFN 332
>gi|342181301|emb|CCC90780.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 335
Score = 80.5 bits (197), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 61/235 (25%), Positives = 91/235 (38%), Gaps = 70/235 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G ++W + GL +++ CQP FP C H +P C PKC
Sbjct: 158 CDGGYPGTSWEYYVSHGL-------ASSYCQPYPFPHCGHHGGKGKKPPCSKYHFHTPKC 210
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+T CT+ + KYR Y V+ E D ++E+
Sbjct: 211 NTTCTD----KAIPLIKYRGNHSYEVHGE-DDYKREL----------------------- 242
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y NGP V ++YSD +YK+GVY
Sbjct: 243 YFNGPFVVVFWVYSDFLAYKTGVYR----------------------------------- 267
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
S + + V+++GWG+ NG PYW I +++ +G G + LRG NE IE+
Sbjct: 268 HVSGDFLGGHAVRIVGWGKLNGTPYWKIANSWDTDWGMNGHLLFLRGNNECGIEA 322
>gi|166030310|gb|ABY78822.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 335
Score = 80.5 bits (197), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 61/235 (25%), Positives = 91/235 (38%), Gaps = 70/235 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G ++W + GL +++ CQP FP C H +P C PKC
Sbjct: 158 CDGGYPGTSWEYYVSHGL-------ASSYCQPYPFPHCGHHGGKGKKPPCSKYHFHTPKC 210
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+T CT+ + KYR Y V+ E D ++E+
Sbjct: 211 NTTCTD----KAIPLIKYRGNHSYEVHGE-DDYKREL----------------------- 242
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y NGP V ++YSD +YK+GVY
Sbjct: 243 YFNGPFVVVFWVYSDFLAYKTGVYR----------------------------------- 267
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
S + + V+++GWG+ NG PYW I +++ +G G + LRG NE IE+
Sbjct: 268 HVSGDFLGGHAVRIVGWGKLNGTPYWKIANSWDTDWGMNGHLLFLRGNNECGIEA 322
>gi|294914336|ref|XP_002778250.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239886453|gb|EER10045.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 388
Score = 80.5 bits (197), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 61/217 (28%), Positives = 91/217 (41%), Gaps = 61/217 (28%)
Query: 28 NTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWV 87
++GC P +FP C+H T CK +P P C T C N ++ F D++ + +
Sbjct: 219 DSGCWPYNFPECSHHVDTKGMEPCKG-NSPSPVCSTTCRNHHFKPSFESDRHFTEDEGYS 277
Query: 88 NDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAV 147
DEV +I++EI+ NGPV A +Y D F Y YKSGVY
Sbjct: 278 LDEVDEIKREIIDNGPVAAAFTVYED-FPY----------------------YKSGVYKH 314
Query: 148 SASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYW 207
+E+ +A VK+IGWG + YW
Sbjct: 315 VNGSELGGHA-----------------------------------VKIIGWGIDQNEQYW 339
Query: 208 TIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
+++++ +GD+G KI G E I+S V +PK
Sbjct: 340 LVMNSWNVNWGDQGIFKIAIG--ECGIDSEVTAGIPK 374
>gi|1848229|gb|AAB48119.1| cathepsin B-like protease [Leishmania major]
Length = 340
Score = 80.5 bits (197), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 62/235 (26%), Positives = 87/235 (37%), Gaps = 70/235 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C GI + W+W G+ T CQP F PC+H + P C + PKC
Sbjct: 166 CHGGIPTVAWLWWVWVGIAT-------EDCQPYPFDPCSHHGNSEKYPPCPSTIYDTPKC 218
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+T C + KY+ Y V E ++ E+M NGP+ M +YSD YKSG
Sbjct: 219 NTTCERNEMDL----VKYKGSTSYSVKGE-KELMIELMTNGPLELTMQVYSDFVGYKSGV 273
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + + VK+VGWG ++G P
Sbjct: 274 YKH------------------------VLGDFLGGHAVKLVGWGTQDGVP---------- 299
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
YW + +++ +GDKG I RG NE IES
Sbjct: 300 ------------------------YWKVANSWNTDWGDKGYFLIQRGNNECKIES 330
>gi|187107122|ref|NP_001119621.1| cathepsin B-3098 precursor [Acyrthosiphon pisum]
gi|161343841|tpg|DAA06101.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 337
Score = 80.1 bits (196), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 65/247 (26%), Positives = 98/247 (39%), Gaps = 71/247 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP-- 59
C+ G W GLVTGG + S GC+P PPC + + + K + QP
Sbjct: 155 CNGGYPIRAWKRFKNHGLVTGGNYKSGEGCEPYRVPPCPY------DKDGKNTCSGQPME 208
Query: 60 ---KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFS 116
KC +C D F +R+ R D+ + I K D+ +
Sbjct: 209 SNHKCSKKCYGDE--DIDFNKDHRYTR-----DDYYLTYRGIQK------------DVIN 249
Query: 117 YKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
Y GP+ + +Y D +YKSG+Y S +A + +VK++GWGEE G Y
Sbjct: 250 Y-------GPIETSFDVYDDFPNYKSGIYVKSENASYLGGHSVKLIGWGEEYGVLY---- 298
Query: 177 RVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
W +V+++ +GDKG KI RG NE +++
Sbjct: 299 ------------------------------WLMVNSWNADWGDKGLFKIRRGTNECRVDN 328
Query: 237 LVNGALP 243
G +P
Sbjct: 329 STTGGVP 335
>gi|4325188|gb|AAD17297.1| cysteine proteinase [Ancylostoma ceylanicum]
Length = 341
Score = 80.1 bits (196), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 52/175 (29%), Positives = 74/175 (42%), Gaps = 26/175 (14%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W+ + G+VTGG + C+P SF PC C P PKC
Sbjct: 158 CQGGWPIEAYRWMQRDGVVTGGKYRQRDVCKPYSFYPCGQHKDVPYYGPCPGGLWPTPKC 217
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ + Y + + +DK+ R Y + + I+QEI KNGPVVA +Y D
Sbjct: 218 R-KSSQRKYNKTYQEDKHFATRSYSLPNNERSIRQEIYKNGPVVAAFKVYED-------- 268
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
+S G+Y + A+A K++GWG ENG YW I
Sbjct: 269 ----------------YSSTGGIYVHKWGIQTGAHAD-KVIGWGRENGTDYWLIA 306
>gi|343474530|emb|CCD13852.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 335
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 64/242 (26%), Positives = 97/242 (40%), Gaps = 70/242 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + G+ +G CQP FP C+H +T+ P+C L P C
Sbjct: 158 CLGGDPDMAWAYFSSEGIASGR-------CQPYPFPRCSHYTNSTTYPQCSALHLWTPTC 210
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ CT+ + KYR + Y ++ E D ++E+ GP A ++SD
Sbjct: 211 NPACTDSTISK----KKYRGLKSYSLSGE-EDFRRELYFRGPFQAVFDVWSD-------- 257
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+F+YK GVY A I A+A V+IVGWG ++G P
Sbjct: 258 ---------------LFAYKHGVYKHVGGAFIGAHA-VRIVGWGNQSGVP---------- 291
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW I +++ ++GD+G +LRG NE IE +
Sbjct: 292 ------------------------YWKIANSWNAEWGDRGYFFMLRGDNECGIEDSGSAG 327
Query: 242 LP 243
+P
Sbjct: 328 VP 329
>gi|12004577|gb|AAG44098.1| cathepsin B cysteine protease [Leishmania chagasi]
Length = 340
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 64/235 (27%), Positives = 86/235 (36%), Gaps = 70/235 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C GI + W+W G+ T CQP F PC+H + P C PKC
Sbjct: 166 CYGGIPTMAWLWWVWVGITT-------EVCQPYPFGPCSHHGNSDKYPPCPNTIYDTPKC 218
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+T C KY+ Y V E ++ E+M NGP+ M +YSD YKSG
Sbjct: 219 NTTCEKSEMDL----VKYKGGTSYSVKGE-KELMIELMTNGPLEVTMQVYSDFVGYKSGG 273
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + S +++ VK+VGWG + G P
Sbjct: 274 YKH------------------------VSGDLLGGHAVKLVGWGTQGGVP---------- 299
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
YW I +++ +GDKG I RG NE IES
Sbjct: 300 ------------------------YWKIANSWNTDWGDKGYFLIQRGSNECGIES 330
>gi|409905640|gb|AFV46426.1| cysteine protease C [Leishmania donovani]
Length = 345
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 64/235 (27%), Positives = 86/235 (36%), Gaps = 70/235 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C GI + W+W G+ T CQP F PC+H + P C PKC
Sbjct: 171 CYGGIPTMAWLWWVWVGITT-------EVCQPYPFGPCSHHGNSDKYPPCPNTIYDTPKC 223
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+T C KY+ Y V E ++ E+M NGP+ M +YSD YKSG
Sbjct: 224 NTTCEKSEMDL----VKYKGGTSYSVKGE-KELMIELMTNGPLEVTMQVYSDFVGYKSGV 278
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + S +++ VK+VGWG + G P
Sbjct: 279 YKH------------------------VSGDLLGGHAVKLVGWGTQGGVP---------- 304
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
YW I +++ +GDKG I RG NE IES
Sbjct: 305 ------------------------YWKIANSWNTDWGDKGYFLIQRGSNECGIES 335
>gi|343470805|emb|CCD16605.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 337
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 61/243 (25%), Positives = 91/243 (37%), Gaps = 69/243 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + + G+ +++ CQP FP C H ++ C PKC
Sbjct: 159 CKGGFPGFAWRYYVEYGI-------TSSSCQPYPFPRCEHQGAQGNKTPCSKYNFDTPKC 211
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ CT+ + KYR Y + D ++E+ NGP VA Y+Y+D+F+YKSG
Sbjct: 212 NATCTD----KSVPLIKYRGNATYLLLHGEEDYKRELYFNGPFVAVFYVYTDLFAYKSGV 267
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y N + + VK+VGWG+ NG P
Sbjct: 268 YRN------------------------VDGDFLGGTAVKVVGWGKLNGTP---------- 293
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW + +++ +G G + ILRG NE IE L
Sbjct: 294 ------------------------YWKVANSWDTDWGMDGYLLILRGNNECNIEHLGFAG 329
Query: 242 LPK 244
P+
Sbjct: 330 TPE 332
>gi|166030318|gb|ABY78826.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 335
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 65/243 (26%), Positives = 97/243 (39%), Gaps = 72/243 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + G+ +G CQP FP C+H +T+ P+C L P C
Sbjct: 158 CLGGDPDMAWAYFSSEGIASGR-------CQPYPFPRCSHYTNSTTYPQCSALHLWTPTC 210
Query: 62 HTRCTNDNYGRGFFQDKYR-FKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
+ CT+ + KYR K Y + +E D ++E+ GP A ++SD
Sbjct: 211 NPACTDSTISK----KKYRGLKSYSFSGEE--DFRRELYFRGPFQAVFDVWSD------- 257
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
+F+YK GVY A I A+A V+IVGWG ++G P
Sbjct: 258 ----------------LFAYKHGVYKHVGGAFIGAHA-VRIVGWGNQSGVP--------- 291
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
YW I +++ ++GD+G +LRG NE IE +
Sbjct: 292 -------------------------YWKIANSWNAEWGDRGYFFMLRGDNECGIEDSGSA 326
Query: 241 ALP 243
+P
Sbjct: 327 GVP 329
>gi|146092987|ref|XP_001466605.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
[Leishmania infantum JPCM5]
gi|398018677|ref|XP_003862503.1| cysteine peptidase C (CPC) [Leishmania donovani]
gi|12005276|gb|AAG44365.1| cathepsin B-like cysteine protease [Leishmania donovani]
gi|134070968|emb|CAM69644.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
[Leishmania infantum JPCM5]
gi|322500733|emb|CBZ35810.1| cysteine peptidase C (CPC) [Leishmania donovani]
Length = 340
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 64/235 (27%), Positives = 86/235 (36%), Gaps = 70/235 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C GI + W+W G+ T CQP F PC+H + P C PKC
Sbjct: 166 CYGGIPTMAWLWWVWVGITT-------EVCQPYPFGPCSHHGNSDKYPPCPNTIYDTPKC 218
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+T C KY+ Y V E ++ E+M NGP+ M +YSD YKSG
Sbjct: 219 NTTCEKSEMDL----VKYKGGTSYSVKGE-KELMIELMTNGPLEVTMQVYSDFVGYKSGV 273
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + S +++ VK+VGWG + G P
Sbjct: 274 YKH------------------------VSGDLLGGHAVKLVGWGTQGGVP---------- 299
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
YW I +++ +GDKG I RG NE IES
Sbjct: 300 ------------------------YWKIANSWNTDWGDKGYFLIQRGSNECGIES 330
>gi|343476048|emb|CCD12737.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 64/242 (26%), Positives = 88/242 (36%), Gaps = 69/242 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G S W + GL +++ CQP FP C H +P C PKC
Sbjct: 158 CDGGYPDSAWEYYVSHGL-------ASSYCQPYPFPHCGHHGGKGKKPPCSKYDFHTPKC 210
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+T CT+ + KYR Y + D ++E+ NGP V +YSD +YK+G
Sbjct: 211 NTTCTD----KAIPLIKYRGNDSYVLLHGEDDFKRELYFNGPFVVAFQVYSDFLAYKTGV 266
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + S + + V+IVGWG+ NG PY
Sbjct: 267 YRH------------------------VSGDFLGGHAVRIVGWGKLNGTPY--------- 293
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
W I +++ +G G ILRG NE IES
Sbjct: 294 -------------------------WKIANSWDTDWGMNGHFLILRGNNECGIESTGYAG 328
Query: 242 LP 243
LP
Sbjct: 329 LP 330
>gi|157058769|gb|ABV03142.1| cathepsin B-348 [Myzus persicae]
Length = 246
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 54/176 (30%), Positives = 79/176 (44%), Gaps = 28/176 (15%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + +G+V+GG + S GC P PC H T P CK P C
Sbjct: 93 CNGGFPGAAWHYWKTKGIVSGGPYGSKMGCIPYEIAPCEHHVNGTRGP-CKE-GGKTPAC 150
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C D Y + QD +R K Y + ++V I+QEI NGPV +Y
Sbjct: 151 VKKC-EDGYKVPYAQDLHRGKSAYSLGNDVDQIRQEIYTNGPVEGAFTVY---------- 199
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGR-PYWTIV 176
D +Y++GVY A + +A ++I+GWG +NG PYW +
Sbjct: 200 -------------EDFIAYRAGVYKHVAGKALGGHA-IRILGWGVQNGEIPYWLVA 241
>gi|17384033|emb|CAD12394.1| cysteine proteinase [Leishmania infantum]
Length = 340
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 64/235 (27%), Positives = 86/235 (36%), Gaps = 70/235 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C GI + W+W G+ T CQP F PC+H + P C PKC
Sbjct: 166 CYGGIPTMAWLWWVWVGITT-------EVCQPYPFGPCSHHGNSDKYPPCPNTIYDTPKC 218
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+T C KY+ Y V E ++ E+M NGP+ M +YSD YKSG
Sbjct: 219 NTTCEKSEMDL----VKYKGGTSYSVKGE-KELMIELMTNGPLEVTMQVYSDFVGYKSGV 273
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + S +++ VK+VGWG + G P
Sbjct: 274 YKH------------------------VSGDLLGGHAVKLVGWGTQGGVP---------- 299
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
YW I +++ +GDKG I RG NE IES
Sbjct: 300 ------------------------YWKIANSWNTDWGDKGYFLIQRGSNECGIES 330
>gi|166030320|gb|ABY78827.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 58/214 (27%), Positives = 82/214 (38%), Gaps = 62/214 (28%)
Query: 31 CQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDE 90
CQP FP C H ++ C PKC+ CT+ + KYR Y +
Sbjct: 180 CQPYPFPHCEHRGAQGNKTPCSKYNFDTPKCNATCTD----KSIPLVKYRGNATYLLLHG 235
Query: 91 VADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSAS 150
D ++E+ NGP VA ++Y+D+F+YKSG Y N
Sbjct: 236 EEDYKRELYFNGPFVAVFFVYTDLFAYKSGVYRN------------------------VD 271
Query: 151 AEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIV 210
+I+ V+IVGWG+ NG P YW +
Sbjct: 272 GDILGGQAVRIVGWGKLNGTP----------------------------------YWKVA 297
Query: 211 STFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
+T+ +G G + ILRG NE IE L P+
Sbjct: 298 NTWDTDWGMDGYLLILRGNNECNIEHLGFAGTPE 331
>gi|119638992|gb|ABL85238.1| cysteine proteinase 4 [Necator americanus]
Length = 339
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 68/238 (28%), Positives = 92/238 (38%), Gaps = 64/238 (26%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + ++ G+ +GG + C+P F PC+ NY P K A PKC
Sbjct: 158 CEGGYPIQAYFYLENTGVCSGGEYREKNVCKPYPFYPCD-GNYG---PCPKEGAFDTPKC 213
Query: 62 HTRCTNDNYGRGFFQDKYRFKR-YYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C Y + +DK K + + D A I+QEI NGPV AN Y++ D YK G
Sbjct: 214 RKIC-QFRYPVPYEEDKVFGKNSHILLQDNEARIRQEIFINGPVGANFYVFEDFIHYKEG 272
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
+Y + GV+A +K++GWG ENG YW + Y
Sbjct: 273 ------------IYKQTYGKWIGVHA------------IKLIGWGTENGTDYWLVANSYN 308
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
WGE GT +ILRG N +IES V
Sbjct: 309 YD---------------WGE-------------------NGTFRILRGTNHCLIESQV 332
>gi|115605092|gb|ABJ15785.1| cathepsin B [Bos taurus]
Length = 118
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 48/121 (39%), Positives = 63/121 (52%), Gaps = 3/121 (2%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S W + K+GLV+GG ++S+ GC+P S PPC H + S P C T PKC
Sbjct: 1 CNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 58
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + +DK+ Y V + +I EI KNGPV +YSD YKSG
Sbjct: 59 SKTC-EPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGV 117
Query: 122 Y 122
Y
Sbjct: 118 Y 118
>gi|166030326|gb|ABY78830.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 45/149 (30%), Positives = 67/149 (44%), Gaps = 28/149 (18%)
Query: 27 SNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYW 86
+++GCQP FP C H ++ C PKC+ CT+ + KYR Y
Sbjct: 176 ASSGCQPYPFPHCEHRGAQGNKTPCSKYKFDTPKCNATCTD----KSIPLVKYRGNATYL 231
Query: 87 VNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYA 146
+ D ++E+ NGP VA ++Y+D+F+YKSG Y N
Sbjct: 232 LLHGEEDYKRELYFNGPFVAVFFVYTDLFAYKSGVYRN---------------------- 269
Query: 147 VSASAEIVAYATVKIVGWGEENGRPYWTI 175
+ + V+IVGWG+ NG PYW +
Sbjct: 270 --VDGDFLGGQAVRIVGWGKLNGTPYWKV 296
>gi|12658201|gb|AAK01061.1| cysteine proteinase [Metagonimus yokogawai]
Length = 179
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 50/168 (29%), Positives = 71/168 (42%), Gaps = 27/168 (16%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + + GLVTGG+ + +GC+ FP CNH P C P P C
Sbjct: 38 CHGGFPPRAWDFWMENGLVTGGSKENPSGCRSYPFPKCNHHGKGPDAP-CPEKIFPTPAC 96
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ C D + DK + K Y V + I +EIM+NGPV A +Y D Y+SG
Sbjct: 97 NKTC--DTPEVNYILDKTKAKSSYNVPNSEKAIMKEIMQNGPVEAAFEVYEDFLHYESGV 154
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENG 169
Y + ++ ++++GWGEENG
Sbjct: 155 Y------------------------FHSFGRMIGGHAIRMLGWGEENG 178
>gi|343472937|emb|CCD15042.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 60/242 (24%), Positives = 91/242 (37%), Gaps = 69/242 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W++ + G+ +++ CQP FP C H ++ C PKC
Sbjct: 158 CKGGFPGFAWLYYVEYGI-------TSSQCQPYPFPHCEHRGAQGNKTPCSKYKFDTPKC 210
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ CT+ + KYR Y + D ++E+ NGP VA ++Y+D+F+YKSG
Sbjct: 211 NATCTD----KSIPLVKYRGNATYLLLHGEEDYKRELYFNGPFVAVFFVYTDLFAYKSGV 266
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y N + + V+IVGWG+ NG P
Sbjct: 267 YRN------------------------VDGDFLGGQAVRIVGWGKLNGTP---------- 292
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW + +++ +G G + ILRG NE IE L
Sbjct: 293 ------------------------YWKVANSWDTDWGMNGYMLILRGNNECNIEHLGFTG 328
Query: 242 LP 243
P
Sbjct: 329 FP 330
>gi|308507719|ref|XP_003116043.1| hypothetical protein CRE_08645 [Caenorhabditis remanei]
gi|308250987|gb|EFO94939.1| hypothetical protein CRE_08645 [Caenorhabditis remanei]
Length = 356
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 66/233 (28%), Positives = 92/233 (39%), Gaps = 63/233 (27%)
Query: 18 GLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCT------NDNYG 71
G VTGG + + GC+P SF PC++ + + P C Q KC + T + +YG
Sbjct: 156 GAVTGGDYKGD-GCKPYSFAPCSNCVESKTTPSC------QSKCQSTYTVTNYKGDKHYG 208
Query: 72 RGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANM 131
+ + R K + D + P++ N Y NGPV
Sbjct: 209 KNEGKVTERHKHLECTSAYRLDTSSNAV---PIIQNEI------------YQNGPVEVAY 253
Query: 132 YLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYA 191
+Y D + YKSGVY + +A
Sbjct: 254 TVYDDFYHYKSGVYHHVTGKDTGGHA---------------------------------- 279
Query: 192 TVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
VK+IGWG E G YW + +++G FGDKG KI RG NE IES V + K
Sbjct: 280 -VKIIGWGTEKGVDYWLVTNSWGTSFGDKGFFKIRRGTNECGIESNVVAGMAK 331
>gi|255040223|gb|ACT99884.1| truncated cathepsin B [Opisthorchis viverrini]
Length = 313
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 54/175 (30%), Positives = 76/175 (43%), Gaps = 27/175 (15%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W + G+VTGG+ +GC+ FP C+H + P C P P+C
Sbjct: 155 CRGGYPAVAWDYWRTHGIVTGGSKEDPSGCRSYPFPKCDH-HVQGHYPPCPRQIYPTPEC 213
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C D G+ +DK R Y + I +EIM GPV A +F+
Sbjct: 214 VQDC--DTPELGYLEDKTRANISYNIYASEISIMKEIMLRGPVEA-------VFT----- 259
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
+Y D YKS VY + A + +A ++I+GWGEE PYW I
Sbjct: 260 -----------VYEDFLQYKSRVYFHAWGAPMSGHA-IRILGWGEEGDVPYWLIA 302
>gi|166030314|gb|ABY78824.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 335
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 63/242 (26%), Positives = 92/242 (38%), Gaps = 70/242 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W + GL +++ CQP FP C H +P C PKC
Sbjct: 158 CDGGYPGTAWEYYVSHGL-------ASSYCQPYPFPHCGHHGGKGKKPPCSKYDFHTPKC 210
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+T CT+ + KYR Y ++ E D ++E+ NGP V +YSD +YK+G
Sbjct: 211 NTTCTD----KAIPLIKYRGNHSYGLDGE-DDYKRELYFNGPFVVAFQVYSDFLAYKTGV 265
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + S +++ V+IVGWG+ NG PY
Sbjct: 266 YRH------------------------VSGDVLGGHAVRIVGWGKLNGTPY--------- 292
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
W I +++ +G G ILRG++E IES
Sbjct: 293 -------------------------WKIANSWDTDWGMNGHFLILRGKDECGIESEGYAG 327
Query: 242 LP 243
LP
Sbjct: 328 LP 329
>gi|3087801|emb|CAA93277.1| cysteine proteinase [Haemonchus contortus]
Length = 344
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 56/175 (32%), Positives = 83/175 (47%), Gaps = 27/175 (15%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + + ++G VTGG + + C+P F PC H T EC + P+C
Sbjct: 160 CDGGYVIDAFKFFAEQGAVTGGDYGAKDCCRPYPFHPCGHHGNETYYGECPEDGS-TPEC 218
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVN-DEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
+C + Y + +D+ R + Y + V IQ+EIM+NGPVVA ++ D FS+
Sbjct: 219 VRKC-QEGYETEYHEDRVRGEDAYRLPIGSVKAIQKEIMRNGPVVAAFIVFDD-FSF--- 273
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
Y+ G+YA A + +A VKI+GWG E+G PYW I
Sbjct: 274 -------------------YRKGIYAHVAGSPRGGHA-VKIIGWGTEHGVPYWII 308
>gi|340053922|emb|CCC48215.1| cysteine peptidase C (CPC) [Trypanosoma vivax Y486]
Length = 334
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 61/248 (24%), Positives = 90/248 (36%), Gaps = 82/248 (33%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W++ + GLV+ CQP FPPC H+ + P C + PKC
Sbjct: 159 CDGGYPDEAWLYFTESGLVS-------DYCQPYPFPPCKHSGGRSKNPSCHDMHFHTPKC 211
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFS----- 116
+ CT+ K PVV Y S+ +S
Sbjct: 212 NATCTD--------------------------------KRIPVV--RYFASESYSLQGEE 237
Query: 117 -YKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
YK Y GP +Y D +Y+SGVY + + +A
Sbjct: 238 DYKRELYLRGPFEVAFTVYEDFLAYESGVYKHVSGGPVGGHA------------------ 279
Query: 176 VRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
V+++GWGE NG PYW I +++ +G+ G + RG++E IE
Sbjct: 280 -----------------VRVVGWGERNGVPYWKIANSWNTDWGENGYLYFYRGKDECGIE 322
Query: 236 SLVNGALP 243
S + P
Sbjct: 323 SQGSAGTP 330
>gi|254575665|gb|ACT68329.1| cysteine proteinase [Haemonchus contortus]
Length = 348
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 54/205 (26%), Positives = 81/205 (39%), Gaps = 47/205 (22%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCN-HANYTTSEPECKTLATP--Q 58
C G ++ W W G+VTGGA+ C+P FP C H + ATP +
Sbjct: 166 CDGGYNARAWKWATIAGVVTGGAYKEKGNCKPYVFPQCGAHKGKAFNNCPSHPYATPARK 225
Query: 59 PKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
P C YG+ + DK + + +YW+ ++ IQ EIM+ GPV A +Y D Y
Sbjct: 226 PYCQY-----GYGKRYENDKIKARTWYWLPNDERTIQLEIMQKGPVHATFNIYEDFEHYN 280
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
G Y + + + ++KI+GWG + G YW I
Sbjct: 281 GGVY------------------------IHTAGAMEGGHSIKIIGWGVDKGVKYWLIANS 316
Query: 179 YAVSASAEIVAYATVKLIGWGEENG 203
++ WGE+ G
Sbjct: 317 WSTD---------------WGEDGG 326
>gi|157058763|gb|ABV03139.1| cathepsin B-348 [Acyrthosiphon pisum]
Length = 248
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 53/176 (30%), Positives = 79/176 (44%), Gaps = 28/176 (15%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + +G+V+GG + SN GC P PC H T P CK P C
Sbjct: 95 CNGGFPGAAWNYWKTKGIVSGGPYGSNMGCIPYEIAPCEHHVNGTRGP-CKE-GGKTPTC 152
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C + Y + QD + K Y + ++V I+QEI NGPV +Y
Sbjct: 153 VKKC-EEGYKVPYAQDLHHGKSAYSIRNDVDQIRQEIYTNGPVEGAFTVY---------- 201
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGR-PYWTIV 176
D +Y++GVY A + +A ++I+GWG +NG PYW +
Sbjct: 202 -------------EDFIAYRAGVYKHVAGKALGGHA-IRILGWGVQNGEIPYWLVA 243
>gi|10803450|emb|CAB97364.2| putative cathepsin B.1 [Ostertagia ostertagi]
Length = 199
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 58/179 (32%), Positives = 80/179 (44%), Gaps = 36/179 (20%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEP---ECKTLATPQ 58
C G S W + ++G+VTGG +++ C+P PC Y EP EC LA
Sbjct: 43 CQGGWSIRAWYYFAEQGVVTGGNYNTKGSCRPYEIHPCG---YHKDEPYYGECDDLAD-T 98
Query: 59 PKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
P+C RC Y + + DK+ + Y + V IQ+EIM+NGPVVA +Y D YK
Sbjct: 99 PRCKRRC-QLGYPKSYPSDKHYGRTAYQLPMSVESIQREIMRNGPVVAGFTVYEDFAHYK 157
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGR----PYW 173
G +Y K+G +A VK++GWG E PYW
Sbjct: 158 GG------------IYKHTSGKKTGGHA------------VKVIGWGSEQKGSEKIPYW 192
>gi|91088083|ref|XP_968689.1| PREDICTED: similar to AGAP004533-PA [Tribolium castaneum]
Length = 360
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 54/172 (31%), Positives = 72/172 (41%), Gaps = 38/172 (22%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W + GLV+GG ++++TGCQP S NY P C
Sbjct: 142 CKGGYTYYAWNYFMLTGLVSGGDYNTSTGCQPYS-----ELNYYRI----------TPPC 186
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+T C ND Y + DK+ Y++ IQ EI+
Sbjct: 187 NTTCQNDKYPIPYVSDKHFGDSIYYIPQNETAIQNEILS--------------------- 225
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYW 173
G GPVVA +Y D Y+ GVY + S + VKI+GWG ENG YW
Sbjct: 226 -GGGPVVAAFDVYGDFKIYRDGVY-IYTSGALFGRTAVKIIGWGTENGWAYW 275
>gi|10803437|emb|CAC13131.1| putative cathepsin B.5 [Ostertagia ostertagi]
Length = 196
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 54/170 (31%), Positives = 71/170 (41%), Gaps = 25/170 (14%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + K G+ TGG++ S +GC+P PPC H T C T P C
Sbjct: 44 CEGGYPIEAWKYWVKTGICTGGSYESQSGCKPYPIPPCGHHKNQTYFGPCPTDEYDTPVC 103
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C Y + DK+ Y V VA IQ+EIM NGPV
Sbjct: 104 TNKCIA-AYKTPYSDDKHYGTSAYNVAKTVAGIQKEIMTNGPV----------------- 145
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRP 171
A +Y D + Y GVY + AE+ +A V+I+GWG P
Sbjct: 146 ------EAAYTVYEDFYQYTGGVYTHTGGAEVGGHA-VRILGWGVRQQDP 188
>gi|294952601|ref|XP_002787371.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239902343|gb|EER19167.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 744
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 49/138 (35%), Positives = 64/138 (46%), Gaps = 26/138 (18%)
Query: 30 GCQPVSFPPCNHANYTTSE-PECKTLA-TPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWV 87
GC P F CNH +E P+CK A P P C T CTN Y R +D +R K + V
Sbjct: 494 GCWPYPFQKCNHVPTEKTEYPKCKDAAHPPLPPCRTTCTNKAYKRSLKKDVHRAKGWRKV 553
Query: 88 NDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAV 147
+ ++QEI NGPV + +Y D F Y YKSGVY V
Sbjct: 554 LNNAQSVKQEIFDNGPVFSAFKMYED-FRY----------------------YKSGVY-V 589
Query: 148 SASAEIVAYATVKIVGWG 165
+ E ++ +KI+GWG
Sbjct: 590 PTTEEFHSFHLIKIIGWG 607
>gi|390994429|gb|AFM37364.1| cathepsin B1 [Dictyocaulus viviparus]
Length = 350
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 60/242 (24%), Positives = 89/242 (36%), Gaps = 58/242 (23%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + G+ TGG + C+P +F PC H EC P P+C
Sbjct: 165 CRGGYPIEAWRYFMLHGVCTGGHYAEKDVCKPYAFHPCGHHRNEIYYGECPKEIFPTPQC 224
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + DK K Y + + IQ+EIM NGPV A +Y D Y+SG
Sbjct: 225 TQSC-QAGYASDYEDDKIYGKSAYALPNNEKAIQREIMTNGPVQAAFMVYEDFSRYRSG- 282
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+Y + G +A VK++GWG
Sbjct: 283 -----------IYVHTAGRREGGHA------------VKLIGWGV--------------- 304
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+++G YW +++ +G+ G +I+RG + IES V
Sbjct: 305 ------------------DDDGNKYWLAANSWNSDWGENGYFRIVRGVDHCGIESAVVAG 346
Query: 242 LP 243
+P
Sbjct: 347 MP 348
>gi|17559066|ref|NP_506790.1| Protein CPR-3 [Caenorhabditis elegans]
gi|1169083|sp|P43507.1|CPR3_CAEEL RecName: Full=Cathepsin B-like cysteine proteinase 3; AltName:
Full=Cysteine protease-related 3; Flags: Precursor
gi|675494|gb|AAA98788.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|675496|gb|AAA98782.1| cathepsin B-like cysteine proteinase [Caenorhabditis elegans]
gi|14530554|emb|CAB61032.2| Protein CPR-3 [Caenorhabditis elegans]
Length = 370
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 79/257 (30%), Positives = 100/257 (38%), Gaps = 71/257 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G S + G VTGG + + GC P SF PC T + PE T P C
Sbjct: 162 CKGGYSIEALRFWASSGAVTGGDYGGH-GCMPYSFAPC-----TKNCPESTT-----PSC 210
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVN--DEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
T C + + +DK+ Y V V +IQ EI GPV A SYK
Sbjct: 211 KTTCQSSYKTEEYKKDKHYGASAYKVTTTKSVTEIQTEIYHYGPVEA---------SYK- 260
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
+Y D + YKSGVY + S ++V VKI+GWG ENG Y
Sbjct: 261 -------------VYEDFYHYKSGVYHYT-SGKLVGGHAVKIIGWGVENGVDY------- 299
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
W I +++G FG+KG KI RG NE IE V
Sbjct: 300 ---------------------------WLIANSWGTSFGEKGFFKIRRGTNECQIEGNVV 332
Query: 240 GALPKDNYGVEFGEESG 256
+ K E E+ G
Sbjct: 333 AGIAKLGTHSETYEDDG 349
>gi|343474137|emb|CCD14154.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 337
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 60/243 (24%), Positives = 91/243 (37%), Gaps = 69/243 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + + G+ +++ CQP FP C H ++ C PKC
Sbjct: 159 CKGGFPGFAWRYYVEYGI-------TSSSCQPYPFPRCEHQGAQGNKTPCSKYNFDTPKC 211
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ CT+ + KYR Y + D ++E+ NGP VA Y+Y+D+F+YKSG
Sbjct: 212 NATCTD----KAIPLIKYRGNATYLLLHGEEDYKRELYFNGPFVAVFYVYTDLFAYKSGV 267
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + + VK+VGWG+ NG P
Sbjct: 268 YRH------------------------VDGDFLGGTAVKVVGWGKLNGTP---------- 293
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW + +++ +G G + ILRG NE IE L
Sbjct: 294 ------------------------YWKLANSWDTDWGMGGYLLILRGNNECNIEHLGFAG 329
Query: 242 LPK 244
P+
Sbjct: 330 TPE 332
>gi|281208776|gb|EFA82951.1| peptidase C1A family protein [Polysphondylium pallidum PN500]
Length = 1308
Score = 78.2 bits (191), Expect = 4e-12, Method: Composition-based stats.
Identities = 54/172 (31%), Positives = 74/172 (43%), Gaps = 39/172 (22%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + + +V K G+VT + CQP + P C A + C P C
Sbjct: 136 CEGGDPYTAYKYVQKNGVVT-------SNCQPYTIPTCPPA-----QQPCMNFVN-TPPC 182
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C N + F QD + K Y V VA IQ EI+ NGPV A +Y D YKSG
Sbjct: 183 SAKCANSSVN--FQQDLHHLKTVYAVKPNVAAIQNEIVTNGPVEACFEVYEDFLGYKSG- 239
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYW 173
++++KSG + + +KIVG+G NG PYW
Sbjct: 240 ---------------VYTHKSG--------KDLGGHCIKIVGFGVSNGTPYW 268
>gi|343474132|emb|CCD14149.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 337
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 60/243 (24%), Positives = 91/243 (37%), Gaps = 69/243 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + + G+ +++ CQP FP C H ++ C PKC
Sbjct: 159 CKGGFPGFAWRYYVEYGI-------TSSSCQPYPFPRCEHQGAQGNKTPCSKYNFDTPKC 211
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ CT+ + KYR Y + D ++E+ NGP VA Y+Y+D+F+YKSG
Sbjct: 212 NATCTD----KAIPLIKYRGNATYLLLHGEEDYKRELYFNGPFVAVFYVYTDLFAYKSGV 267
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + + VK+VGWG+ NG P
Sbjct: 268 YRH------------------------VDGDFLGGTAVKVVGWGKLNGTP---------- 293
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW + +++ +G G + ILRG NE IE L
Sbjct: 294 ------------------------YWKLANSWDTDWGMGGYLLILRGNNECNIEHLGFAG 329
Query: 242 LPK 244
P+
Sbjct: 330 TPE 332
>gi|166030324|gb|ABY78829.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 57/214 (26%), Positives = 81/214 (37%), Gaps = 62/214 (28%)
Query: 31 CQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDE 90
CQP FP C H ++ C PKC+ CT+ + KYR Y +
Sbjct: 180 CQPYPFPHCEHRGAQGNKTPCSKYNFDTPKCNATCTD----KSIPLVKYRGNATYLLLHG 235
Query: 91 VADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSAS 150
D ++E+ NGP VA Y+Y+D+F+YKSG Y +
Sbjct: 236 EEDYKRELYFNGPFVAVFYVYTDLFAYKSGVYRH------------------------VD 271
Query: 151 AEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIV 210
+ + VK+VGWG+ NG P YW +
Sbjct: 272 GDFLGGTAVKVVGWGKLNGTP----------------------------------YWKVA 297
Query: 211 STFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
+T+ +G G + ILRG NE IE L P+
Sbjct: 298 NTWDTDWGMDGYLLILRGNNECNIEHLGFAGTPE 331
>gi|166030330|gb|ABY78832.1| cathepsin B-like protease [Trypanosoma congolense]
gi|343476577|emb|CCD12360.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 337
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 60/243 (24%), Positives = 91/243 (37%), Gaps = 69/243 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + + G+ +++ CQP FP C H ++ C PKC
Sbjct: 159 CKGGFPGFAWRYYVEYGI-------TSSSCQPYPFPRCEHQGAQGNKTPCSKYNFDTPKC 211
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ CT+ + KYR Y + D ++E+ NGP VA Y+Y+D+F+YKSG
Sbjct: 212 NATCTD----KAIPLIKYRGNATYLLLHGEEDYKRELYFNGPFVAVFYVYTDLFAYKSGV 267
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + + VK+VGWG+ NG P
Sbjct: 268 YRH------------------------VDGDFLGGTAVKVVGWGKLNGTP---------- 293
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW + +++ +G G + ILRG NE IE L
Sbjct: 294 ------------------------YWKLANSWDTDWGMGGYLLILRGNNECNIEHLGFAG 329
Query: 242 LPK 244
P+
Sbjct: 330 TPE 332
>gi|339242629|ref|XP_003377240.1| Gut-specific cysteine proteinase [Trichinella spiralis]
gi|316973974|gb|EFV57515.1| Gut-specific cysteine proteinase [Trichinella spiralis]
Length = 325
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 64/243 (26%), Positives = 95/243 (39%), Gaps = 71/243 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G +++ RG+ TGG + S GC+P S + SE E +T P C
Sbjct: 153 CDGGYPDKAFIYWATRGIPTGGPYGSTKGCKPYSIG-------SNSEDEAET-----PLC 200
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C N+ Y QD++ ++ YWVN I QE+ KNGPVV +Y D
Sbjct: 201 TRQCINE-YPYNLSQDRHFGEKPYWVNSNEEQIMQELYKNGPVVVAFNVYEDF------- 252
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
MY ++ ++ G + + VK++GWG EN + YW I +
Sbjct: 253 ---------MYYIKGVYEHRFG--------KFLGGHAVKLIGWGIENSKKYWLISNSWNT 295
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+ WGE G KI+RG+N IES V
Sbjct: 296 T---------------WGE-------------------NGFFKIIRGKNCCAIESYVVAG 321
Query: 242 LPK 244
+ +
Sbjct: 322 MAR 324
>gi|294888035|ref|XP_002772321.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
gi|239876433|gb|EER04137.1| Cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
Length = 200
Score = 77.8 bits (190), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 56/184 (30%), Positives = 81/184 (44%), Gaps = 47/184 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAH------HSNTGCQPVSFPPCNHANYTTSEPECKTLA 55
C G S W WVH +G+ TGG + + GC P FPPC H T P+C
Sbjct: 47 CGGGDPYSAWSWVHDKGIATGGDYVAKDDMTKDDGCWPYDFPPCAHHINDTKYPKC---- 102
Query: 56 TPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIF 115
PK C+ D+ R F + + +Y VN D + I +GPV A+ +Y D
Sbjct: 103 ---PK--VSCSGDD--RHFMLESSPY--HYSVN----DAKNAIRTDGPVSASFTVYEDFL 149
Query: 116 SYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
+Y+SG ++ + SG Y + VKI+GWGE++G+ YW
Sbjct: 150 AYRSG----------------VYKHTSGSY--------LGGHAVKIIGWGEKSGQAYWLA 185
Query: 176 VRVY 179
V +
Sbjct: 186 VNSW 189
>gi|21930117|gb|AAM82155.1| cysteine proteinase [Ancylostoma ceylanicum]
Length = 348
Score = 77.8 bits (190), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 65/242 (26%), Positives = 93/242 (38%), Gaps = 61/242 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPK 60
C G + + + + GL TGG + CQP +F PC NHA+ P C P P
Sbjct: 164 CKGGYPARAFGYAWRYGLSTGGPYGEKDACQPYAFYPCGNHAHEPYYGP-CPDELWPTPT 222
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C C Y F +DK + Y++ +I+ EIM GPVVA +Y D F Y
Sbjct: 223 CRRTC-QLGYPIPFEKDKIFNDQTYYIFGNETEIKYEIMTRGPVVATYKVYRD-FDY--- 277
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
YK GVY + E+ VKI+GWG+ N P
Sbjct: 278 -------------------YKKGVY-IHREGEVTGLHAVKIIGWGKGNDVP--------- 308
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
YW + +++ +GD G +I+RG + IE + G
Sbjct: 309 -------------------------YWLVANSWNTDWGDNGYFRIVRGTDNCEIERQMVG 343
Query: 241 AL 242
+
Sbjct: 344 GI 345
>gi|193610664|ref|XP_001948185.1| PREDICTED: cathepsin B-like [Acyrthosiphon pisum]
Length = 324
Score = 77.8 bits (190), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 54/183 (29%), Positives = 77/183 (42%), Gaps = 44/183 (24%)
Query: 1 VCSSGISSST------WVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTL 54
+ SGI S+ W + K+GLV+GG +++N GCQP PP
Sbjct: 144 ISCSGIKSNAMADDQAWKFFKKQGLVSGGKYNTNDGCQPSKIPP--------------IF 189
Query: 55 ATPQPKCHTRCTNDNYGRGFFQDKY-RFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSD 113
P+ + C N YG + K Y + +IQ+E+ GPV A LY D
Sbjct: 190 NLPKKIYNRTCDNFCYGNSLIDYNHDHVKVSYTYHVLYKNIQREVQTYGPVSAYFSLYDD 249
Query: 114 IFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYW 173
+F Y SGVYA + ++ V Y + K++GWG ENG YW
Sbjct: 250 -----------------------LFLYTSGVYARTEKSKFVRYQSAKLIGWGVENGVDYW 286
Query: 174 TIV 176
+V
Sbjct: 287 LLV 289
>gi|270012756|gb|EFA09204.1| cathepsin B precursor [Tribolium castaneum]
Length = 369
Score = 77.4 bits (189), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 66/243 (27%), Positives = 96/243 (39%), Gaps = 65/243 (26%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W + GLV+GG ++++TGCQP S N+ T P C
Sbjct: 142 CKGGYTYYAWNYFMLTGLVSGGDYNTSTGCQPYS--ELNYYRIT-------------PPC 186
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+T C ND Y + DK+ Y++ IQ EI+ G
Sbjct: 187 NTTCQNDKYPIPYVSDKHFGDSIYYIPQNETAIQNEILSGG------------------- 227
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
GPVVA +Y D Y+ +G + TI+ +
Sbjct: 228 ---GPVVAAFDVYGDFKIYR--------------------------DGEQHDTILEGVYI 258
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGT-IKILRGRNEA-IIESLVN 239
S + VK+IGWG ENG YW +++G+ +G G KI RG NE ES++
Sbjct: 259 YTSGALFGRTAVKIIGWGTENGWAYWLAANSWGKDWGALGGFFKIRRGTNECGFEESIIA 318
Query: 240 GAL 242
G +
Sbjct: 319 GQV 321
>gi|157058765|gb|ABV03140.1| cathepsin B-348 [Aulacorthum solani]
Length = 237
Score = 77.4 bits (189), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 52/168 (30%), Positives = 77/168 (45%), Gaps = 27/168 (16%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + +G+V+GG + SN GC P PC H T P CK PKC
Sbjct: 97 CNGGFPGAAWNYWKTKGIVSGGPYGSNMGCIPYEVAPCEHHVNGTRGP-CKE-GGKTPKC 154
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C D Y + QD + K Y ++++V I+QEI NGPV +Y
Sbjct: 155 VKKC-EDGYKVPYAQDLHHGKSAYSLSNDVDQIRQEIYTNGPVEGAFTVY---------- 203
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENG 169
D +Y++GVY A + +A ++I+GWG +NG
Sbjct: 204 -------------EDFIAYRAGVYKHVAGKALGGHA-IRILGWGVQNG 237
>gi|294891865|ref|XP_002773777.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239878981|gb|EER05593.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 156
Score = 77.4 bits (189), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 53/198 (26%), Positives = 81/198 (40%), Gaps = 59/198 (29%)
Query: 34 VSFPPCNHANYTTSE-PECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVA 92
+ F NHA+ S+ P+C + A QP C T C N++Y QD +R K + +
Sbjct: 5 IQFIXXNHASSAASQYPKCPSEALSQPACQTECINESYKTSLQQDLHRAKSWGRLPTSPQ 64
Query: 93 DIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAE 152
I+QEI + NG V+ + +Y D YKSGVY
Sbjct: 65 KIKQEI-----------------------FDNGTVLGVISMYEDFRLYKSGVY------- 94
Query: 153 IVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVST 212
V + +V ++K+IGWG E+G+ YW V++
Sbjct: 95 ----------------------------VHTTGGLVGVHSLKIIGWGVESGQDYWLAVNS 126
Query: 213 FGEQFGDKGTIKILRGRN 230
+ E++GD G IK+ G
Sbjct: 127 WNEEWGDHGMIKLAVGET 144
>gi|56758130|gb|AAW27205.1| unknown [Schistosoma japonicum]
Length = 279
Score = 77.4 bits (189), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 46/119 (38%), Positives = 58/119 (48%), Gaps = 2/119 (1%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + KRG+VTGG+ ++TGCQP FP C H + P C T P+C
Sbjct: 159 CQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEH-HTKGKYPACGTKIYKTPQC 217
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C Y + QDK+ Y V IQ+EIM GPV A +Y D +YKSG
Sbjct: 218 KQTCQK-GYKTPYEQDKHYGDESYNVISNEKAIQREIMMYGPVEAAFDVYEDFLNYKSG 275
>gi|193606095|ref|XP_001951499.1| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 330
Score = 77.4 bits (189), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 62/237 (26%), Positives = 89/237 (37%), Gaps = 69/237 (29%)
Query: 8 SSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 67
W ++ GLV+GG +++N GCQP PP N T C RC
Sbjct: 161 DDVWEYLKSHGLVSGGKYNTNDGCQPSKIPPI--GNIPTH--------LYNHTCEERCYG 210
Query: 68 DNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPV 127
+N ++ D + YY + DIQ+E+ +Y GPV
Sbjct: 211 NNTIH-YYHDHVKVSHYYNIKSN-EDIQKEVQ----------------TY-------GPV 245
Query: 128 VANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEI 187
+Y D F YKSGVY + + V K++GWG ENG Y
Sbjct: 246 SVKFRVYDDFFLYKSGVYVKTEKSLYVRRHFAKLIGWGVENGVDY--------------- 290
Query: 188 VAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
W +V+++G ++G G KI RG NE +E V P+
Sbjct: 291 -------------------WLLVNSWGNEWGQNGLFKIKRGTNEVHVEDYVYAGEPE 328
>gi|91078960|ref|XP_974244.1| PREDICTED: similar to putative cathepsin B-like proteinase
[Tribolium castaneum]
gi|270004840|gb|EFA01288.1| cathepsin B precursor [Tribolium castaneum]
Length = 319
Score = 77.0 bits (188), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 66/243 (27%), Positives = 100/243 (41%), Gaps = 73/243 (30%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
CS G + + + K+G+V+GG +SN GC+P T++ K + P C
Sbjct: 149 CSGGYMMAAFDFYIKQGVVSGGDLNSNEGCRPY-----------TADAHDKGVT---PSC 194
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C Y + DK+ + Y V+ V++IQ EIM NGP++ + +Y D ++Y SG
Sbjct: 195 TKSCRK-GYPTSYSSDKHYGSKDYIVDAGVSNIQYEIMTNGPIIVSFKVYQDFYNYGSG- 252
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
++ + SG Y VKIVGWG E + Y
Sbjct: 253 ---------------VYHHVSGNY--------TGNHIVKIVGWGTEKEQDY--------- 280
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
W I +++G +G+ G KILRG+NE IE+
Sbjct: 281 -------------------------WLIANSWGSSWGEHGFFKILRGKNECGIENNPYAV 315
Query: 242 LPK 244
LPK
Sbjct: 316 LPK 318
>gi|294879717|ref|XP_002768767.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239871616|gb|EER01485.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 157
Score = 77.0 bits (188), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 57/211 (27%), Positives = 87/211 (41%), Gaps = 59/211 (27%)
Query: 30 GCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVND 89
GC P FPPC H T P+C P P C +C N Y D++
Sbjct: 2 GCWPYDFPPCAHHINDTKYPKCPKGLYPTPNCVEQCHNPKYTTTLRDDRHF--------- 52
Query: 90 EVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSA 149
++++ P Y YS + K+ +GPV A+ +Y D +Y+SGVY ++
Sbjct: 53 --------MLESSP-----YHYS-VNDAKNAIRTDGPVSASFTVYEDFLAYRSGVYKHTS 98
Query: 150 SAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTI 209
+ + +A VK+IGWGE++G+ YW
Sbjct: 99 GSYLGGHA-----------------------------------VKIIGWGEKSGQAYWLA 123
Query: 210 VSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
V+++ E +GD G KI G N I + L+ G
Sbjct: 124 VNSWNEDWGDHGLFKIALG-NCGIDDDLLGG 153
>gi|119638954|gb|ABL85236.1| cysteine proteinase 2 [Necator americanus]
Length = 347
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 61/239 (25%), Positives = 97/239 (40%), Gaps = 60/239 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+SG+ + + ++G+ +GG + + C+P F PC + + C P P C
Sbjct: 164 CTSGVPRQAFNYAIRKGVCSGGPYGTKGVCKPYPFYPCGYHAHLPYYGPCPDGMWPTPTC 223
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C +D + Y R + V +++I + +IF+
Sbjct: 224 EKACQSD------YTVPYNDDRIFGSKTIVLTGEEKIKR------------EIFN----- 260
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
NGP+VA +Y D YK+G+Y + G G G
Sbjct: 261 --NGPLVATYTVYEDFAYYKNGIY---------------MTGLGRATG------------ 291
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
A+A VK+IGWGEENG YW I +++ +G+ G ++LRG N IE G
Sbjct: 292 -------AHA-VKIIGWGEENGVKYWLIANSWNTDWGENGFFRMLRGTNLCDIELSATG 342
>gi|327239610|gb|AEA39649.1| cathepsin B [Epinephelus coioides]
Length = 171
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 43/121 (35%), Positives = 61/121 (50%), Gaps = 2/121 (1%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S+ W + GLV+GG + S+ GC+P + PPC H + + P C P+C
Sbjct: 44 CNGGYPSAAWDFWTDVGLVSGGLYDSHVGCRPYTIPPCEH-HVNGTRPPCTGEGGDTPQC 102
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C + Y + DK+ K Y V + IQ EI KNGPV +Y D YK+G
Sbjct: 103 ILQCES-GYTPSYKADKHYGKSSYSVPSDEEQIQSEIYKNGPVEGAFTVYEDFLLYKTGV 161
Query: 122 Y 122
Y
Sbjct: 162 Y 162
>gi|728602|emb|CAA88490.1| cathepsin B-like enzyme [Leishmania mexicana]
gi|1586011|prf||2202319A cathepsin B-like Cys protease
Length = 340
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 63/235 (26%), Positives = 90/235 (38%), Gaps = 70/235 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C GI + W+W G+ T CQP F PC+H ++ P C PKC
Sbjct: 166 CYGGIPAMAWLWWVWVGVTT-------ELCQPYPFGPCSHHGNSSKYPPCPNTIYNTPKC 218
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+T C DN KY+ Y + E ++ E+M NGP+ M +Y+D +YKSG
Sbjct: 219 NTTC--DNVEMELV--KYKGVSSYSIKGE-RELDHELMNNGPLEVAMQVYADFVAYKSGV 273
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + S + + VK+VGWG ++G P
Sbjct: 274 YKH------------------------VSGDHLGGHAVKLVGWGVKDGIP---------- 299
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
YW I +++ +GDKG I RG +E IES
Sbjct: 300 ------------------------YWKIANSWNTDWGDKGYFLIQRGNDECGIES 330
>gi|239790303|dbj|BAH71722.1| ACYPI001175 [Acyrthosiphon pisum]
Length = 330
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 63/237 (26%), Positives = 88/237 (37%), Gaps = 69/237 (29%)
Query: 8 SSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 67
W ++ GLV+GG +++N GCQP PP N T C RC
Sbjct: 161 DDVWEYLKSHGLVSGGKYNTNDGCQPSKIPPI--GNIPTH--------LYNHTCEERCYG 210
Query: 68 DNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPV 127
+N ++ D + YY + DIQ+E+ +Y GPV
Sbjct: 211 NNTIH-YYHDHVKVSHYYNIKSN-EDIQKEVQ----------------TY-------GPV 245
Query: 128 VANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEI 187
+Y D F YKSGVY + + V K++GWG ENG YW +V
Sbjct: 246 SVKFRVYDDFFLYKSGVYVKTEKSLYVRRHFAKLIGWGVENGVDYWLLVN---------- 295
Query: 188 VAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
+W G ++G G KI RG NE +E V P+
Sbjct: 296 ------------------FW------GNEWGQNGLFKIKRGTNEVHVEDYVYAGEPE 328
>gi|328701234|ref|XP_001948885.2| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 326
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 63/236 (26%), Positives = 94/236 (39%), Gaps = 78/236 (33%)
Query: 11 WVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTR-CTNDN 69
W + GLV+GG +++N GCQP P T+ Q K + R C
Sbjct: 164 WEYFKTHGLVSGGKYNTNEGCQPSKVP---------------TVYNSQTKIYKRTCVEYC 208
Query: 70 YGR---GFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGP 126
YG+ + D + +Y++ + DIQ+E+ GPV S F
Sbjct: 209 YGKDTINYNHDHVKVSNHYFI--RIKDIQKEVQTYGPV-------SVFFD---------- 249
Query: 127 VVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAE 186
L+ D+F YKSGVYA + ++ Y K++GWG ENG Y
Sbjct: 250 ------LHDDLFLYKSGVYAKTEKSKDKRYHHAKLIGWGVENGVDY-------------- 289
Query: 187 IVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 242
W +V+++G ++G G KI RG +E +ES V L
Sbjct: 290 --------------------WLLVNSWGYEWGQNGLFKIKRGTDECSVESHVYAGL 325
>gi|10803454|emb|CAB97366.2| putative cathepsin B.3 [Ostertagia ostertagi]
Length = 196
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 51/175 (29%), Positives = 73/175 (41%), Gaps = 25/175 (14%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S+ W++ G+ +GG + C+P +F PC + T EC P C
Sbjct: 44 CNGGYSARAWLYARNSGVCSGGRYQEKGVCKPYTFHPCGYHKNQTYYGECPKHTYQTPAC 103
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C YG+ + +DK Y V+ + A I+ EI GPV A+ Y
Sbjct: 104 KKYCQY-GYGKRYEKDKIYAXDAYRVSSDEAAIRAEIFARGPVQASFATY---------- 152
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
D YKSG+Y +A +A VKI+GWG ENG W +
Sbjct: 153 -------------EDFAHYKSGIYVHTAGKRRGGHA-VKIIGWGVENGTKXWIVA 193
>gi|242014495|ref|XP_002427925.1| tubulointerstitial nephritis antigen, putative [Pediculus humanus
corporis]
gi|212512409|gb|EEB15187.1| tubulointerstitial nephritis antigen, putative [Pediculus humanus
corporis]
Length = 473
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 72/247 (29%), Positives = 97/247 (39%), Gaps = 71/247 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G S W ++ K GLV + C P +T + +CK P
Sbjct: 256 CQGGHLSRAWTFIRKFGLV-------DDYCYP----------WTGTPTKCKIPKRPNFDA 298
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ + G + YR Y + DE DI +EIM++GPV A M +Y D FSYKSG
Sbjct: 299 LSSICPPSLGSNLRSELYRVGPAYKIQDE-KDIMEEIMQSGPVQATMKVYQDFFSYKSGV 357
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y +N S F Y S VKI+GWGEE +Y
Sbjct: 358 Y----TKSNTERESSNFGYHS----------------VKILGWGEE--------TNIY-- 387
Query: 182 SASAEIVAYATVKLIGWGEENGRP--YWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
G+P YW +++G+Q+G+ G KI RG NE IE V
Sbjct: 388 ---------------------GQPIKYWLAANSWGQQWGENGFFKIRRGTNECEIEEFVL 426
Query: 240 GALPKDN 246
A + N
Sbjct: 427 AAWAETN 433
>gi|56755425|gb|AAW25892.1| unknown [Schistosoma japonicum]
Length = 226
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 52/169 (30%), Positives = 73/169 (43%), Gaps = 26/169 (15%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + G+VTGG+ ++TGCQP FP C H + P C P+C
Sbjct: 81 CDGGFPGPAWDYWVSHGIVTGGSKENHTGCQPYPFPKCEHHS-IGKYPSCGDKIYKTPQC 139
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C Y + DK+ V + IQ+EIM GPV A + ++ D +YKSG
Sbjct: 140 KRKC-QKGYTTPYEHDKHYGGISINVIKNESAIQKEIMMYGPVEAYLLIFEDFLNYKSG- 197
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGR 170
I+ Y +G + V V+I+GWG EN R
Sbjct: 198 ---------------IYRYTTGSF--------VGEHYVRIIGWGIENER 223
>gi|197725747|gb|ACH73069.1| cathepsin B precursor [Epinephelus coioides]
Length = 333
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 43/121 (35%), Positives = 61/121 (50%), Gaps = 2/121 (1%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S+ W + GLV+GG + S+ GC+P + PPC H + + P C P+C
Sbjct: 148 CNGGYPSAAWDFWTDVGLVSGGLYDSHVGCRPYTIPPCEH-HVNGTRPPCTGEGGDTPQC 206
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C + Y + DK+ K Y V + IQ EI KNGPV +Y D YK+G
Sbjct: 207 ILQCES-GYTPSYKADKHYGKSSYSVPSDEEQIQSEIYKNGPVEGAFTVYEDFLLYKTGV 265
Query: 122 Y 122
Y
Sbjct: 266 Y 266
>gi|157167281|ref|XP_001658485.1| cathepsin b [Aedes aegypti]
gi|108876476|gb|EAT40701.1| AAEL007585-PA [Aedes aegypti]
Length = 386
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 62/242 (25%), Positives = 96/242 (39%), Gaps = 68/242 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + ++GL +GG +S GC P Y E PKC
Sbjct: 194 CRGGTLGPAWQFWVEKGLSSGGPLNSRQGCHP----------YPIGECRIPGEDEDTPKC 243
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C + +QD++ + Y + ++ I +EI NGPV A + Y D+ +YKSG
Sbjct: 244 SNKCRSGYNVTDVWQDRHYGRVAYSLPNDERKIMEEIFINGPVQAAFHTYLDLHAYKSG- 302
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+Y ++ SG +A VK++GWG ENG
Sbjct: 303 -----------IYRHVWGPLSGGHA------------VKLLGWGVENG------------ 327
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
VK YW + +++G ++G+ G KI+RG N IE ++
Sbjct: 328 -----------VK-----------YWLVANSWGREWGENGFFKIVRGENHCGIEENIHAG 365
Query: 242 LP 243
LP
Sbjct: 366 LP 367
>gi|48762481|dbj|BAD23810.1| cathepsin B-S [Tuberaphis taiwana]
Length = 182
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 49/166 (29%), Positives = 77/166 (46%), Gaps = 30/166 (18%)
Query: 11 WVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNY 70
W + +G+ TGG + + GC P PPC + + C P + H +C Y
Sbjct: 47 WKYFRTQGVTTGGDYDTKEGCMPYKVPPCYNKQ---GKNTCG--GQPMERNH-QCPKTCY 100
Query: 71 GRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVAN 130
G+ Q++Y+ K Y +N + I+Q D+ +Y GPV A+
Sbjct: 101 GKTTVQNRYKTKSEYVMN-SIKTIEQ----------------DLKTY-------GPVEAS 136
Query: 131 MYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
+Y D YKSG+Y + A+ ++KI+GWG++NG PYW V
Sbjct: 137 FDVYDDFSVYKSGIYRKTPKAKYQGGHSIKIIGWGQQNGTPYWLAV 182
>gi|401415968|ref|XP_003872479.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
[Leishmania mexicana MHOM/GT/2001/U1103]
gi|322488703|emb|CBZ23950.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like
[Leishmania mexicana MHOM/GT/2001/U1103]
Length = 340
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 63/235 (26%), Positives = 90/235 (38%), Gaps = 70/235 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C GI + W+W G+ T CQP F PC+H ++ P C PKC
Sbjct: 166 CYGGIPAMAWLWWVWVGVTT-------ELCQPYPFGPCSHHGNSSKYPPCPNTIYNTPKC 218
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+T C DN KY+ Y + E ++ E+M NGP+ M +Y+D +YKSG
Sbjct: 219 NTTC--DNVEMELV--KYKGVSSYSIKGE-RELMVELMNNGPLEVAMQVYADFVAYKSGV 273
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + S + + VK+VGWG ++G P
Sbjct: 274 YKH------------------------VSGDHLGGHAVKLVGWGVKDGIP---------- 299
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
YW I +++ +GDKG I RG +E IES
Sbjct: 300 ------------------------YWKIANSWNTDWGDKGYFLIQRGNDECGIES 330
>gi|159177|gb|AAA29177.1| cysteine proteinase [Haemonchus contortus]
Length = 342
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 64/241 (26%), Positives = 97/241 (40%), Gaps = 58/241 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G S W + G+V+GG + + C+P PC H T EC A P C
Sbjct: 154 CGGGWSIRAWEYFVYEGVVSGGEYLTKGVCRPYPIHPCGHHGNDTYYGECPREAA-TPPC 212
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C Y + F DK + K Y V + IQ+EI+++GPVVA+ +Y D FS
Sbjct: 213 KKKC-QPGYKKIFRMDKRQGKVAYGVEPKEEAIQREILRHGPVVASFAVYED-FSL---- 266
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
YK+GVY +A A + Y VK++GWG ++
Sbjct: 267 ------------------YKTGVYKHTAGA-LRGYHAVKMMGWGVDS------------- 294
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+ YW I +++ +G+ G + +RG N+ IE V
Sbjct: 295 -------------------KTKAKYWLIANSWHNDWGENGYFRFIRGINDCEIEDTVAAG 335
Query: 242 L 242
+
Sbjct: 336 I 336
>gi|157058749|gb|ABV03132.1| cathepsin B-3098 [Acyrthosiphon pisum]
Length = 256
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 54/179 (30%), Positives = 78/179 (43%), Gaps = 35/179 (19%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPC----NHANYTTSEPECKTLATP 57
C+ G W GLVTGG + S GC+P PPC + N + +P P
Sbjct: 97 CNGGYPIRAWKRFKNHGLVTGGNYKSGEGCEPYRVPPCPYDKDGKNTCSGQP-----MEP 151
Query: 58 QPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
KC +C D F +R+ R D+ + I K D+ +Y
Sbjct: 152 NHKCSKKCYGDE--DIDFNKDHRYTR-----DDYYLTYRGIQK------------DVINY 192
Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
GP+ A+ +Y D +YKSG+Y S +A + +VK++GWGEE G YW +V
Sbjct: 193 -------GPIEASFDVYDDFPNYKSGIYVKSENASYLGGHSVKLIGWGEEYGVLYWLMV 244
>gi|323448735|gb|EGB04630.1| hypothetical protein AURANDRAFT_32318 [Aureococcus anophagefferens]
Length = 253
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 59/240 (24%), Positives = 102/240 (42%), Gaps = 54/240 (22%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ GI SS + + G+V GG + +GC PC H ++ P C PKC
Sbjct: 57 CNGGIPSSVYSYWALSGIVDGGNYGDKSGCWSYQLEPCAHHVNSSKYPACPD-EVRAPKC 115
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C +++ + + + K + ++ Y V QQ ++ + + +DI
Sbjct: 116 ARKCESED--KDWTKAKVKGEKGYSV------CQQGELEG---TCAIKMAADI------- 157
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y NGP+ ++ D +YKSGVY + + +KI+G+G E+G+
Sbjct: 158 YQNGPITGMFFVKQDFLAYKSGVYEPKLLSPPLGGHAIKIMGFGTEDGK----------- 206
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES-LVNG 240
YW + +++ E +GD G KI+RG+N IE ++NG
Sbjct: 207 -----------------------DYWLVANSWNEDWGDDGYFKIIRGKNACQIEDPVING 243
>gi|13469701|gb|AAK27318.1| cysteine proteinase [Clonorchis sinensis]
Length = 179
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 52/168 (30%), Positives = 67/168 (39%), Gaps = 27/168 (16%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + G+VTGG+ GC+P FP C H + P C P PKC
Sbjct: 38 CDGGFPPMAWDFWKTHGIVTGGSKEEPAGCRPYPFPKCQHHS-QGHYPPCPRRIYPTPKC 96
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C D + +DK R Y V+ I +EI+ NGPV A ++ D YKSG
Sbjct: 97 VKHC--DTPKIDYQKDKTRANTSYNVHQSEVAIMKEILLNGPVEATFEVHEDFPEYKSGI 154
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENG 169
Y A V ++I+GWGEENG
Sbjct: 155 Y------------------------FHAWGGSVGGHAIRILGWGEENG 178
>gi|166030316|gb|ABY78825.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 59/242 (24%), Positives = 87/242 (35%), Gaps = 69/242 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W + GL +++ CQP FP C H +P C PKC
Sbjct: 158 CDGGYPDAAWRYYVSHGL-------ASSYCQPYPFPHCGHHGGKGKKPPCSKYDFHTPKC 210
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+T CT+ + +YR Y + D ++E+
Sbjct: 211 NTTCTD----KAIPLIEYRGNDSYVLLHGEDDFKREL----------------------- 243
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y NGP V ++SD +YK+GVY
Sbjct: 244 YFNGPFVVAFQVFSDFLAYKTGVYR----------------------------------- 268
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
S + + V+++GWG+ NG PYW I +++ +G G LRG NE IE
Sbjct: 269 HVSGDFLGGHAVRIVGWGKLNGTPYWKIANSWDTDWGMNGHFLFLRGNNECGIEFEGYAG 328
Query: 242 LP 243
LP
Sbjct: 329 LP 330
>gi|157131748|ref|XP_001662318.1| cathepsin b [Aedes aegypti]
gi|108871395|gb|EAT35620.1| AAEL012216-PA [Aedes aegypti]
Length = 386
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 61/242 (25%), Positives = 96/242 (39%), Gaps = 68/242 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + ++GL +GG +S GC P Y E PKC
Sbjct: 194 CRGGTLGPAWQFWVEKGLSSGGPLNSRQGCHP----------YPIGECRIPGEDEDTPKC 243
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C + +QD++ + Y + ++ I +EI NGPV A + Y D+ +YKSG
Sbjct: 244 SNKCRSGYNVTDVWQDRHYGRVAYSLPNDERKIMEEIFINGPVQAAFHTYLDLHAYKSG- 302
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+Y ++ SG +A VK++GWG ENG
Sbjct: 303 -----------IYRHVWGPLSGGHA------------VKLLGWGVENG------------ 327
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
VK YW + +++G ++G+ G K++RG N IE ++
Sbjct: 328 -----------VK-----------YWLVANSWGREWGENGFFKMVRGENHCGIEENIHAG 365
Query: 242 LP 243
LP
Sbjct: 366 LP 367
>gi|157111449|ref|XP_001651570.1| cathepsin b [Aedes aegypti]
gi|108868331|gb|EAT32556.1| AAEL015312-PA [Aedes aegypti]
Length = 386
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 61/242 (25%), Positives = 96/242 (39%), Gaps = 68/242 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + ++GL +GG +S GC P Y E PKC
Sbjct: 194 CRGGTLGPAWQFWVEKGLSSGGPLNSRQGCHP----------YPIGECRIPGEDEDTPKC 243
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C + +QD++ + Y + ++ I +EI NGPV A + Y D+ +YKSG
Sbjct: 244 SNKCRSGYNVTDVWQDRHYGRVAYSLPNDERKIMEEIFINGPVQAAFHTYLDLHAYKSG- 302
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+Y ++ SG +A VK++GWG ENG
Sbjct: 303 -----------IYRHVWGPLSGGHA------------VKLLGWGVENG------------ 327
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
VK YW + +++G ++G+ G K++RG N IE ++
Sbjct: 328 -----------VK-----------YWLVANSWGREWGENGFFKMVRGENHCGIEENIHAG 365
Query: 242 LP 243
LP
Sbjct: 366 LP 367
>gi|268566077|ref|XP_002647467.1| Hypothetical protein CBG06539 [Caenorhabditis briggsae]
Length = 332
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 67/237 (28%), Positives = 95/237 (40%), Gaps = 71/237 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G S W G+VTGG + + GC+P F CN A C TP+ C
Sbjct: 157 CDGGYSIQALRWWVFDGVVTGGDYQGD-GCKPYQF--CNSAG-------CPDAVTPE--C 204
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C + Y + +DK Y+V V IQ +IM NGPV A+ +Y D + YKSG
Sbjct: 205 ALSCQS-KYNTEYAKDKNFGTSAYYVGMTVNAIQTDIMTNGPVEASFKVYEDFYKYKSG- 262
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
++ Y +G +++ +KI+GWG ENG Y
Sbjct: 263 ---------------VYKYIAG--------KMLGGHAIKIIGWGTENGTAY--------- 290
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
W I +++G ++G+ G KI RG NE IE+ V
Sbjct: 291 -------------------------WLIANSWGTKWGENGFFKIRRGVNECGIENNV 322
>gi|328726600|ref|XP_003248962.1| PREDICTED: cathepsin B-like cysteine proteinase-like [Acyrthosiphon
pisum]
Length = 169
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 62/220 (28%), Positives = 87/220 (39%), Gaps = 67/220 (30%)
Query: 30 GCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGF--FQDKYRFKR-YYW 86
GC+P PPC TS P K H RCT YG + D +RF R YY+
Sbjct: 14 GCEPYRVPPCPRNEDGTSS----CAGQPIEKNH-RCTRMCYGNQDLDYNDDHRFTRDYYY 68
Query: 87 VNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYA 146
+ IQ+++M GP+ A+ +Y D +SYKSGVY
Sbjct: 69 LT--YGSIQKDVMNYGPIEASFDVYDDF-----------------------YSYKSGVYQ 103
Query: 147 VSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPY 206
+ +A + VK++GWG E G PY
Sbjct: 104 RTPNATKLGGHAVKLIGWGVEEGIPY---------------------------------- 129
Query: 207 WTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDN 246
W +V+++ Q+GD G KI RG +E I+S +P N
Sbjct: 130 WLMVNSWSAQWGDNGLFKIRRGTDECGIDSATTAGVPVTN 169
>gi|19526442|gb|AAL89717.1|AF483623_1 cathepsin B [Apriona germari]
Length = 324
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 64/242 (26%), Positives = 94/242 (38%), Gaps = 75/242 (30%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + + + G+ +GG + S GC+P YT + ++ P+C
Sbjct: 153 CRGGFLNEPYKYWVTNGIPSGGDYGSKLGCKP----------YTAA------VSGETPQC 196
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C + Y + + +D Y VN V IQ+EI+ NGPV A M +Y D +SY +G
Sbjct: 197 QKACVS-GYEKSWEKDLRHATSAYQVNGGVLQIQREILDNGPVTAYMEVYEDFYSYGTG- 254
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
I+ + SG + V VKI+GWG EN P
Sbjct: 255 ---------------IYQHTSGSF--------VGGHAVKIIGWGSENDVP---------- 281
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW +++G FG+ G +ILRG N A IES +
Sbjct: 282 ------------------------YWIAANSWGTGFGEDGFFRILRGSNCAGIESYIVAG 317
Query: 242 LP 243
P
Sbjct: 318 YP 319
>gi|5031250|gb|AAD38132.1|AF127592_1 vitellogenic cathepsin-B like protease [Aedes aegypti]
Length = 386
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 61/242 (25%), Positives = 96/242 (39%), Gaps = 68/242 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + ++GL +GG +S GC P Y E PKC
Sbjct: 194 CRGGTLGPAWQFWVEKGLSSGGPLNSRQGCHP----------YPIGECRIPGEDEDTPKC 243
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C + +QD++ + Y + ++ I +EI NGPV A + Y D+ +YKSG
Sbjct: 244 SNKCRSGYNVTDVWQDRHIGRVAYSLPNDERKIMEEIFINGPVQAAFHTYLDLHAYKSG- 302
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+Y ++ SG +A VK++GWG ENG
Sbjct: 303 -----------IYRHVWGPLSGGHA------------VKLLGWGVENG------------ 327
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
VK YW + +++G ++G+ G K++RG N IE ++
Sbjct: 328 -----------VK-----------YWLVANSWGREWGENGFFKMVRGENHCGIEENIHAG 365
Query: 242 LP 243
LP
Sbjct: 366 LP 367
>gi|294898471|ref|XP_002776250.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239883121|gb|EER08066.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 219
Score = 74.3 bits (181), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 42/117 (35%), Positives = 58/117 (49%), Gaps = 7/117 (5%)
Query: 13 WVHKRGLVTGGAHH------SNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPKCHTRC 65
++ G+VTG S GC P P CNHA+ S+ P+C + A QP C T C
Sbjct: 96 FMKNHGIVTGNEFKPADQLASADGCWPYPLPKCNHASSAASQYPKCPSEALSQPACQTEC 155
Query: 66 TNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKY 122
N++Y QD +R K + + I+QEI NG V+ + +Y D YKSG Y
Sbjct: 156 INESYKTSLQQDLHRAKSWGRLPTSPQKIKQEIFDNGTVLGVISMYEDFRLYKSGVY 212
>gi|294891889|ref|XP_002773789.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239878993|gb|EER05605.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 422
Score = 74.3 bits (181), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 57/204 (27%), Positives = 81/204 (39%), Gaps = 58/204 (28%)
Query: 27 SNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYY 85
++ GC P FP CNH S+ P C + P C T C N YG +D +R K +
Sbjct: 262 NDDGCWPYPFPKCNHVPGLESKYPRCAQVRD-LPACATTCPNKAYGTSMQKDTHRAKSWG 320
Query: 86 WVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVY 145
+ I+QEI NGP+ A M LY D F + VY
Sbjct: 321 RLPIGPEKIKQEIFDNGPLRX--------------------XAAMMTLYED-FDLQVCVY 359
Query: 146 AVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRP 205
V + +++A T+KLIGWG E+G+
Sbjct: 360 -----------------------------------VHKTGQMLAAHTLKLIGWGVESGQE 384
Query: 206 YWTIVSTFGEQFGDKGTIKILRGR 229
YW V+ + E++GD G IK+ G+
Sbjct: 385 YWLAVNAWNEEWGDHGMIKLAVGK 408
>gi|339831342|gb|AEK20867.1| cathepsin B [Eimeria tenella]
Length = 512
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 67/248 (27%), Positives = 90/248 (36%), Gaps = 66/248 (26%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAH---HSNTGCQPVSFPPCNHANYTTSEPECKTLATPQ 58
CS G W W G+VTGG + H+ C P P C H + P+C+
Sbjct: 310 CSGGQPRMAWRWFSNDGVVTGGDYNELHTGKSCWPYEIPFCRH-HSEGPYPKCEGPLPKA 368
Query: 59 PKCHTRCTNDNYGRGF--FQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFS 116
PKC C Y F+D F + + I++E+M+NG + +Y D
Sbjct: 369 PKCRKDCEEAEYTSKVKPFKDDLHFATSAYSVEGRDQIKRELMENGTLTGAFLVYEDFLL 428
Query: 117 YKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
YK G +Y + G +A VK++G+G E+GR YW V
Sbjct: 429 YKEG------------VYHHVTGMPMGGHA------------VKVIGFGNEDGRDYWLAV 464
Query: 177 RVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
W E YW GDKGT KI G EA I+
Sbjct: 465 N-------------------SWNE-----YW----------GDKGTFKIEMG--EAGIDK 488
Query: 237 LVNGALPK 244
G PK
Sbjct: 489 EFCGGEPK 496
>gi|359427491|gb|AEV46267.1| eimeripain [Eimeria tenella]
Length = 512
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 67/248 (27%), Positives = 90/248 (36%), Gaps = 66/248 (26%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAH---HSNTGCQPVSFPPCNHANYTTSEPECKTLATPQ 58
CS G W W G+VTGG + H+ C P P C H + P+C+
Sbjct: 310 CSGGQPRMAWRWFSNDGVVTGGDYNELHTGKSCWPYEIPFCRH-HSEGPYPKCEGPLPKA 368
Query: 59 PKCHTRCTNDNYGRGF--FQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFS 116
PKC C Y F+D F + + I++E+M+NG + +Y D
Sbjct: 369 PKCRKDCEEAEYTSKVKPFKDDLHFATSAYSVEGRDQIKRELMENGTLTGAFLVYEDFLL 428
Query: 117 YKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
YK G +Y + G +A VK++G+G E+GR YW V
Sbjct: 429 YKEG------------VYHHVTGMPMGGHA------------VKVIGFGNEDGRDYWLAV 464
Query: 177 RVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
W E YW GDKGT KI G EA I+
Sbjct: 465 N-------------------SWNE-----YW----------GDKGTFKIEMG--EAGIDK 488
Query: 237 LVNGALPK 244
G PK
Sbjct: 489 EFCGGEPK 496
>gi|347972080|ref|XP_313831.5| AGAP004531-PA [Anopheles gambiae str. PEST]
gi|333469162|gb|EAA09191.5| AGAP004531-PA [Anopheles gambiae str. PEST]
Length = 375
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 60/242 (24%), Positives = 90/242 (37%), Gaps = 69/242 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G+ S+ W + + G+ +GGA S+ GCQ F C + + P C P
Sbjct: 200 CDGGVPSAVWHYWVENGITSGGAFGSHEGCQSYPFDVCKKSGDSNDTPRCLRFCQP---- 255
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
Y + +DK+ + Y V + I E+ GP A +Y+D YKSG
Sbjct: 256 -------GYNVTYPEDKHYGRVAYTVPKDEERIMYEVFNFGPAQATFTMYTDFVQYKSG- 307
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+Y F + G + +VK++GWG EN
Sbjct: 308 -----------VYRHTFGVRVGTH------------SVKVMGWGVEN------------- 331
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
VK YW +++G Q+GD G KI+RG + E+ V
Sbjct: 332 ----------DVK-----------YWLCANSWGAQWGDGGFFKIVRGEDHLSFETNVVAG 370
Query: 242 LP 243
LP
Sbjct: 371 LP 372
>gi|4099305|gb|AAD00577.1| cysteine proteinase [Clonorchis sinensis]
Length = 180
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 52/170 (30%), Positives = 72/170 (42%), Gaps = 27/170 (15%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W + G+VTGG+ +GC+ FP C H + P C P P+C
Sbjct: 38 CRGGYPAVAWDYWKTHGIVTGGSKEDPSGCRSYPFPKCEH-HVQGHYPPCPRELYPTPEC 96
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C D G+ +DK R Y + I +EIM GPV A IF+
Sbjct: 97 VQQC--DTPDVGYLEDKTRANMSYNIYASEISIMKEIMLRGPVEA-------IFT----- 142
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRP 171
+Y D Y SGVY + A + +A V+I+GWGE P
Sbjct: 143 -----------MYEDFLRYSSGVYFHALGAPMSGHA-VRILGWGELGNVP 180
>gi|170045773|ref|XP_001850470.1| tubulointerstitial nephritis antigen [Culex quinquefasciatus]
gi|167868692|gb|EDS32075.1| tubulointerstitial nephritis antigen [Culex quinquefasciatus]
Length = 463
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 73/261 (27%), Positives = 104/261 (39%), Gaps = 80/261 (30%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECK-----TLAT 56
CS G + W +V K G V N C P Y +++ CK TL T
Sbjct: 252 CSGGHLDTAWNYVRKVGTV-------NDECYP----------YISAQNACKIRPSDTLIT 294
Query: 57 PQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFS 116
T+ N Y+ + +N+E DI EI K+GPV A + ++ D FS
Sbjct: 295 ANCDLPTKVDRTNM--------YKMGPAFSLNNET-DIMIEIKKHGPVQAILRVHRDFFS 345
Query: 117 YKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
YKSG Y + A SA E Y +V+++GWGEE
Sbjct: 346 YKSGIYRHSA-------------------ASSAGDERAGYHSVRLIGWGEERN------- 379
Query: 177 RVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
Y T K YW V+++G +G+ G +I+RG+NE IES
Sbjct: 380 ------------GYETTK-----------YWVAVNSWGRWWGENGRFRIVRGQNECEIES 416
Query: 237 LVNGALPKDNYGVEFGEESGE 257
V +LP + V+ + GE
Sbjct: 417 YVLASLPYVHQQVKPMRQVGE 437
>gi|741376|prf||2007265A cathepsin B
Length = 153
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 59/208 (28%), Positives = 82/208 (39%), Gaps = 61/208 (29%)
Query: 39 CNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEI 98
C H + S P C T PKC C Y + QDK+ Y V++ DI EI
Sbjct: 1 CEH-HVNGSRPPC-TGEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEI 57
Query: 99 MKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYAT 158
KNGPV FS +YSD YKSGVY
Sbjct: 58 YKNGPVEG-------AFS----------------VYSDFLLYKSGVYQ------------ 82
Query: 159 VKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFG 218
+ E++ ++++GWG ENG PYW + +++ +G
Sbjct: 83 -----------------------HVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWG 119
Query: 219 DKGTIKILRGRNEAIIESLVNGALPKDN 246
D G KILRG++ IES V +P+ +
Sbjct: 120 DNGFFKILRGQDHCGIESEVVAGIPRTD 147
>gi|194384502|dbj|BAG59411.1| unnamed protein product [Homo sapiens]
Length = 273
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 60/208 (28%), Positives = 82/208 (39%), Gaps = 62/208 (29%)
Query: 39 CNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEI 98
C H N S P C T PKC C Y + QDK+ Y V++ DI EI
Sbjct: 122 CIHVN--GSRPPC-TGEGDTPKCSKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEI 177
Query: 99 MKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYAT 158
KNGPV FS +YSD YKSGVY
Sbjct: 178 YKNGPV-------EGAFS----------------VYSDFLLYKSGVYQ------------ 202
Query: 159 VKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFG 218
+ E++ ++++GWG ENG PYW + +++ +G
Sbjct: 203 -----------------------HVTGEMMGGHAIRILGWGVENGTPYWLVANSWNTDWG 239
Query: 219 DKGTIKILRGRNEAIIESLVNGALPKDN 246
D G KILRG++ IES V +P+ +
Sbjct: 240 DNGFFKILRGQDHCGIESEVVAGIPRTD 267
>gi|339241013|ref|XP_003376432.1| Gut-specific cysteine proteinase [Trichinella spiralis]
gi|316974853|gb|EFV58323.1| Gut-specific cysteine proteinase [Trichinella spiralis]
Length = 551
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 61/246 (24%), Positives = 101/246 (41%), Gaps = 76/246 (30%)
Query: 2 CSSGISSSTW-VWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 60
C+ G T+ WV+ G+ TGG + SN C+P PPC++ + T + PK
Sbjct: 359 CNGGYPQRTFKYWVYS-GMPTGGPYGSNDTCKPYPIPPCSNCSETRT-----------PK 406
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRYY--WVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
C C + Y +D++ YY W+ ++ + ++I GP+VA M +Y D YK
Sbjct: 407 CSKSCIS-TYPLSLNEDRHYGSTYYQFWLGEK--SMMKDISLYGPIVAGMSVYEDFLHYK 463
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
G +++ +SG++ + V+I+GWGE++ P
Sbjct: 464 EG----------------VYTQESGIF--------LGGHAVRIIGWGEQDNIP------- 492
Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
YW + +++ FG+ G KI RG +E IES V
Sbjct: 493 ---------------------------YWLVANSWNTTFGEDGLFKIRRGFDECGIESYV 525
Query: 239 NGALPK 244
+ K
Sbjct: 526 SAGRAK 531
>gi|170060938|ref|XP_001866023.1| cathepsin B [Culex quinquefasciatus]
gi|167879260|gb|EDS42643.1| cathepsin B [Culex quinquefasciatus]
Length = 353
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 67/246 (27%), Positives = 97/246 (39%), Gaps = 76/246 (30%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G+ W + ++G+ +GG ++S GC F C+ + P KC
Sbjct: 167 CQGGVLGPAWDYWVQKGVSSGGPYNSKQGCHSYPFDTCHSPDEDDDAP----------KC 216
Query: 62 HTRCTNDNYGRGFFQDKYRFKR--YYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
+C + + +D+ RF R Y V DE I +EI NGPV A +Y D +YKS
Sbjct: 217 SRKCQSSYSVQDVSKDR-RFGRVAYSVVADE-HRIMEEIFVNGPVQAAFQVYLDFKTYKS 274
Query: 120 GKYGN--GPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
G Y + GP+ G +A +KI+GWG EN
Sbjct: 275 GVYRHVTGPL--------------EGGHA------------IKILGWGVEN--------- 299
Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
G YW +++GE +GD G KI+RG N IE+
Sbjct: 300 -------------------------GTKYWLCSNSWGEDWGDHGFFKIVRGENHLGIETD 334
Query: 238 VNGALP 243
V+ LP
Sbjct: 335 VHAGLP 340
>gi|154340956|ref|XP_001566431.1| cysteine peptidase C (CPC) [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134063754|emb|CAM39941.1| cysteine peptidase C (CPC) [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 340
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 61/242 (25%), Positives = 91/242 (37%), Gaps = 70/242 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C GI + W+W GL ++ CQP FPPC H P C + P C
Sbjct: 166 CQGGIPTMAWLWWVWVGL-------TSEVCQPYPFPPCGHHTDGGKYPACPSTIYDTPTC 218
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
++ C + + K++ ++ Y + E + E+M GP +Y+D SYKSG
Sbjct: 219 NSTCADSHTA----LTKHKGEKSYSLRGE-REYMIELMTYGPFEVAFDVYADFVSYKSG- 272
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
++S+ +G E + VK+VGWG +NG P
Sbjct: 273 ---------------VYSHTTG--------ERLGGHAVKLVGWGVQNGTP---------- 299
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW I +++ +GD G I RG +E IES
Sbjct: 300 ------------------------YWKIANSWNSDWGDNGYFLIRRGTDECGIESTGVAG 335
Query: 242 LP 243
LP
Sbjct: 336 LP 337
>gi|428174191|gb|EKX43088.1| hypothetical protein GUITHDRAFT_73372 [Guillardia theta CCMP2712]
Length = 255
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 67/241 (27%), Positives = 91/241 (37%), Gaps = 76/241 (31%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + GL T + P FPPC H T C + P PKC
Sbjct: 85 CNGGFPTGAWRFFKMHGLTTESKY-------PYVFPPCEHHINKTHYKPCGP-SQPTPKC 136
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
R + + +Y K Y V+ A IQ EIM NGPV A +Y
Sbjct: 137 -VRASEK-------KPRYHGKSVYSVSP--AKIQAEIMTNGPVEAAFTVY---------- 176
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
D +Y+SGVY + E+ +A +KI+GWG E G
Sbjct: 177 -------------QDFLAYQSGVYRHVSGPELGGHA-IKIMGWGVEAG------------ 210
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW + +++ E +GDKGT KI RG +E IES V
Sbjct: 211 ----------------------NKYWLVANSWNEDWGDKGTFKIARGDDECGIESSVVAG 248
Query: 242 L 242
+
Sbjct: 249 M 249
>gi|126330441|ref|XP_001381244.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Monodelphis
domestica]
Length = 466
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 65/243 (26%), Positives = 97/243 (39%), Gaps = 62/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W ++ +RGLV+ + + G + + P ++ S K AT
Sbjct: 268 CRGGRLDGAWWFLRRRGLVSNHCYPFSAGNRDATAPAAPCMMHSRSMGRGKRQAT----- 322
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C N R Y+ Y ++ + DI +E+M+NGPV A M ++ D F YKSG
Sbjct: 323 -AHCPNS---RAHANHIYQATPPYRLSSDEKDIMKELMENGPVQALMEVHEDFFLYKSGI 378
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYAT--VKIVGWGEENGRPYWTIVRVY 179
Y + P ++ A + T VKI GWGEE
Sbjct: 379 YKHTPA------------------SLGKPARYRQHGTHSVKITGWGEER----------- 409
Query: 180 AVSASAEIVAYATVKLIGWGEENGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
+ +G+ YWT +++G +G+KG +ILRG NE IES
Sbjct: 410 --------------------QPDGQRLKYWTAANSWGPTWGEKGHFRILRGANECDIESF 449
Query: 238 VNG 240
V G
Sbjct: 450 VVG 452
>gi|270011021|gb|EFA07469.1| cathepsin B precursor [Tribolium castaneum]
Length = 327
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 66/240 (27%), Positives = 93/240 (38%), Gaps = 72/240 (30%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G W ++ K GLV + C P S N P L T +
Sbjct: 146 CNGGYLDRAWSYIRKIGLV-------DEQCFPYS-----ATNEKCRIPRRGDLVTANCQL 193
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
T N R + KY+ Y V +E DI EI+ +GPV A M +Y D F+YK G
Sbjct: 194 PT-----NVDR---RSKYKVAPAYRVGNET-DIMYEILHSGPVQATMKVYHDFFTYKRGI 244
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + P+ N + Y +V+IVGWGEE
Sbjct: 245 YRHSPISTN---------------------DRTGYHSVRIVGWGEE-------------- 269
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+ E + YW + +++G ++G+ G +ILRG NE IES V G
Sbjct: 270 ----------------YSPEGLKKYWKVANSWGPEWGENGYFRILRGSNECEIESFVLGT 313
>gi|166030332|gb|ABY78833.1| cathepsin B-like protease [Trypanosoma congolense]
Length = 336
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 61/236 (25%), Positives = 87/236 (36%), Gaps = 70/236 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W + G+ +++ CQP FP C H +P C P+C
Sbjct: 159 CEGGYPDAAWEYYVSHGI-------TSSQCQPYPFPRCEHRGAQGKKPPCSKYKFVTPQC 211
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ CT+ + KYR Y V E D ++E+ NGP V ++SD +YKSG
Sbjct: 212 NATCTD----KSVPLIKYRGNHSYEVRGE-EDYKRELYFNGPFVVRFQVHSDFLAYKSGV 266
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + VA +L V+IVGWG+ NG P
Sbjct: 267 YQH---VAGNFLGGK---------------------AVRIVGWGKLNGTP---------- 292
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
YW + +++ +G G ILRG NE IE L
Sbjct: 293 ------------------------YWKVANSWDTDWGMNGYFLILRGDNECNIEHL 324
>gi|189238903|ref|XP_967834.2| PREDICTED: similar to tubulointerstitial nephritis antigen
[Tribolium castaneum]
Length = 453
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 66/240 (27%), Positives = 93/240 (38%), Gaps = 72/240 (30%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G W ++ K GLV + C P S N P L T +
Sbjct: 272 CNGGYLDRAWSYIRKIGLV-------DEQCFPYS-----ATNEKCRIPRRGDLVTANCQL 319
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
T N R + KY+ Y V +E DI EI+ +GPV A M +Y D F+YK G
Sbjct: 320 PT-----NVDR---RSKYKVAPAYRVGNET-DIMYEILHSGPVQATMKVYHDFFTYKRGI 370
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + P+ N + Y +V+IVGWGEE
Sbjct: 371 YRHSPISTN---------------------DRTGYHSVRIVGWGEE-------------- 395
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+ E + YW + +++G ++G+ G +ILRG NE IES V G
Sbjct: 396 ----------------YSPEGLKKYWKVANSWGPEWGENGYFRILRGSNECEIESFVLGT 439
>gi|12330246|gb|AAG52660.1| cysteine proteinase [Metagonimus yokogawai]
Length = 179
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 49/168 (29%), Positives = 69/168 (41%), Gaps = 27/168 (16%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + + GLVTGG+ + +GC+ FP C+H P C P C
Sbjct: 38 CHGGFPPRAWDFWMENGLVTGGSKENPSGCRSYPFPRCSHHG-KGKYPPCPKTIFDTPNC 96
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C D + DK K Y V I +EIM+NGPV A +Y D YKSG
Sbjct: 97 VDHC--DKPDIDYAADKTHAKSSYNVQSNERVIMKEIMRNGPVEAAFMVYEDFIEYKSG- 153
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENG 169
+Y +S +++ ++++GWGEE G
Sbjct: 154 ---------IYFHS--------------HGKLLGGHAIRMLGWGEEKG 178
>gi|268560898|ref|XP_002638183.1| Hypothetical protein CBG22612 [Caenorhabditis briggsae]
Length = 721
Score = 70.9 bits (172), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 66/246 (26%), Positives = 98/246 (39%), Gaps = 73/246 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + +G+VTGG + GC P S+ C+ + + P+CK +C
Sbjct: 148 CQGGFVLEAMKFWKSKGVVTGGDFQGD-GCIPYSYGSCSDCHTAQTTPKCKN------EC 200
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVN--DEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
+ T + Y +DKY Y ++ + V IQ EI++NGPV A +Y D + YKS
Sbjct: 201 QVKYTKNEYK----EDKYYGSSAYRLSTSNAVRTIQSEILRNGPVEATYQVYEDFYYYKS 256
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVRV 178
G ++ Y SG + + VKI+GWG EEN
Sbjct: 257 G----------------VYEYISGRH--------MGGHAVKIIGWGVEENVN-------- 284
Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
YW I +++G FG+ G K+ RG NE IE+ V
Sbjct: 285 ---------------------------YWLIANSWGTGFGENGFFKMRRGNNECGIENYV 317
Query: 239 NGALPK 244
+ K
Sbjct: 318 VAGMAK 323
>gi|157058751|gb|ABV03133.1| cathepsin B-3098 [Aulacorthum solani]
Length = 215
Score = 70.9 bits (172), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 53/171 (30%), Positives = 76/171 (44%), Gaps = 33/171 (19%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W K GLVTGG + S GC+P PPC + Y + T + +
Sbjct: 75 CYGGYPIRAWKSFKKHGLVTGGNYKSGEGCEPYRVPPCPYDEYGNN-----TCSGQPMES 129
Query: 62 HTRCTNDNYGR---GFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
+ RCT YG F QD + +Y++ IQ+ D+ +Y
Sbjct: 130 NHRCTRMCYGNQDLDFDQDHRYTRDHYYLT--YRGIQK----------------DVINY- 170
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENG 169
GP+ A+ +Y D SYKSG+Y S +A + +VK++GWGEE G
Sbjct: 171 ------GPIEASFDVYDDFPSYKSGIYVKSENASYLGGHSVKLIGWGEEYG 215
>gi|119638996|gb|ABL85239.1| cysteine proteinase 5 [Necator americanus]
Length = 342
Score = 70.9 bits (172), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 62/238 (26%), Positives = 88/238 (36%), Gaps = 61/238 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPK 60
C G + + + G+ TGG C+P +F PC H N P C P PK
Sbjct: 158 CDGGRPFAAFFFAIDNGVCTGGPFREPNVCKPYAFYPCGRHQNQKYFGP-CPKELWPTPK 216
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C C Y + DK Y + + I QEI NGPVV + +++D YK G
Sbjct: 217 CRKMC-QLKYNVAYKDDKIYGNDAYSLPNNETRIMQEIFTNGPVVGSFSVFADFAIYKKG 275
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
Y + + N G +A VKI+GWG ++G
Sbjct: 276 VYVSNGIQQN------------GAHA------------VKIIGWGVQDG----------- 300
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
YW I +++ +GD+G ++ LRG N IES V
Sbjct: 301 -----------------------LKYWLIANSWNNDWGDEGYVRFLRGDNHCGIESRV 335
>gi|294889976|ref|XP_002773021.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239877724|gb|EER04837.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 342
Score = 70.9 bits (172), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 64/246 (26%), Positives = 97/246 (39%), Gaps = 76/246 (30%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHH------SNTGCQPVSFPPCNHA-NYTTSEPECKT- 53
C G + +++ G+VTGG + ++ GC P FP CNH P C +
Sbjct: 114 CRRGSVAEGLIFMKNHGIVTGGEYKPPKKLGNDDGCWPYPFPKCNHVPGMKVKYPRCGSK 173
Query: 54 ---LATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYL 110
LA P + C + R D +R K + + I+QEI NGPV A M +
Sbjct: 174 VGRLAAP-----SHCDGLHCRRA--GDVHRAKSWGRLPISPEKIKQEIFDNGPVAAIMTI 226
Query: 111 YSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGR 170
+ D YKSG ++ YK+G +V T+K++GWG E G
Sbjct: 227 HEDFRLYKSG----------------VYEYKTGA--------MVGAHTLKLIGWGVEAG- 261
Query: 171 PYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 230
+ YW V+++ E++GD+G IK+ G+N
Sbjct: 262 ---------------------------------QEYWLAVNSWNEEWGDQGKIKLAVGKN 288
Query: 231 EAIIES 236
ES
Sbjct: 289 ALDEES 294
>gi|312382740|gb|EFR28091.1| hypothetical protein AND_04395 [Anopheles darlingi]
Length = 381
Score = 70.5 bits (171), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 61/242 (25%), Positives = 91/242 (37%), Gaps = 67/242 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G+ S+ W + + G+ +GGA+ S+ GCQ F C + L QP
Sbjct: 204 CDGGVPSAVWHYWVENGITSGGAYESHEGCQSYPFGVCKPQEIFAPHVDLICLRQCQP-- 261
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
Y + +DK+ + Y V + I E+ GPV A+ +Y+D YKSG
Sbjct: 262 -------GYNTTYLEDKHFGRVAYSVPRDEDRILYELFYFGPVQASFTVYTDFIQYKSGV 314
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + Y V V +VKIVGWG EN
Sbjct: 315 YRH-------------------TYGVR-----VGDHSVKIVGWGVEN------------- 337
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
G +W +++G ++G+ G KI+RG + +ES V
Sbjct: 338 ---------------------GTKFWLCANSWGAEWGENGFFKIIRGEDHLSVESNVVAG 376
Query: 242 LP 243
LP
Sbjct: 377 LP 378
>gi|291000017|ref|XP_002682576.1| cathepsin C [Naegleria gruberi]
gi|284096203|gb|EFC49832.1| cathepsin C [Naegleria gruberi]
Length = 430
Score = 70.5 bits (171), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 42/122 (34%), Positives = 62/122 (50%), Gaps = 19/122 (15%)
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y NGP+ +Y D+ +YK GVY + E+ K G E+ P++
Sbjct: 326 YQNGPLAIGFEVYPDLRNYKHGVYKHVTAEEL------KAQGLSEDEMIPHF-------- 371
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
E+V +A V ++GWG ENG PYW I +++ +GD G KILRG +E +ES
Sbjct: 372 ----EVVNHA-VLMVGWGVENGTPYWKIKNSWSTTWGDNGYFKILRGSDECGVESDAEAG 426
Query: 242 LP 243
+P
Sbjct: 427 IP 428
>gi|294952605|ref|XP_002787373.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239902345|gb|EER19169.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 185
Score = 70.5 bits (171), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 53/187 (28%), Positives = 79/187 (42%), Gaps = 60/187 (32%)
Query: 53 TLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYS 112
+ P P C T CTN Y + +D +R K + V ++ I+QEI NGPV+++ +Y
Sbjct: 53 VVQQPVPPCRTTCTNKAYKKSLEKDVHRAKSWRKVLNDAQSIKQEIFDNGPVLSSFKMYE 112
Query: 113 DIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPY 172
D F Y YKSGVY V + E ++KI+GWG +GR
Sbjct: 113 D-FRY----------------------YKSGVY-VPTTKESSTSHSIKIIGWGGASGR-- 146
Query: 173 WTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN-- 230
YW V+++ E++GD G IK+ G+N
Sbjct: 147 --------------------------------EYWLAVNSWNEEWGDHGLIKMAFGKNRL 174
Query: 231 EAIIESL 237
E I+ S+
Sbjct: 175 EKIVLSI 181
>gi|56754337|gb|AAW25356.1| SJCHGC00056 protein [Schistosoma japonicum]
Length = 342
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 59/216 (27%), Positives = 83/216 (38%), Gaps = 60/216 (27%)
Query: 27 SNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYW 86
++TGCQP FP C H P C T P+C C Y F QDK +
Sbjct: 184 NHTGCQPYPFPKCEHLT-KGKYPACGTKIYKTPQCKQTCQK-GYKTPFEQDKPFGEGSSN 241
Query: 87 VNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYA 146
V + Q++IM MY GPV A +Y D + KSG+ +
Sbjct: 242 VQNNEKVFQRDIM--------MY---------------GPVEAAFDVYEDFLNSKSGI-S 277
Query: 147 VSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPY 206
+ IV ++I+GWG E G PY
Sbjct: 278 RHVTGSIVGGHPIRIIGWGVEKGNPY---------------------------------- 303
Query: 207 WTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 242
W I +++ E +G+ G +++RGR+E IES V L
Sbjct: 304 WLIANSWNEDWGENGLFRMVRGRDECSIESHVVAGL 339
>gi|357511627|ref|XP_003626102.1| Cathepsin L-like proteinase [Medicago truncatula]
gi|355501117|gb|AES82320.1| Cathepsin L-like proteinase [Medicago truncatula]
Length = 351
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 66/244 (27%), Positives = 91/244 (37%), Gaps = 77/244 (31%)
Query: 2 CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
C+ G S W+++ G+VT + GC S P C EP +T P
Sbjct: 163 CAGGTPFSAWIYLAHHGVVTEECDPYFDQIGC---SHPGC--------EPTYRT-----P 206
Query: 60 KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
KC +C N N + + K+ + Y VN + DI E+ KNGPV +Y D YKS
Sbjct: 207 KCVKKCVNGN--QLWETSKHYSVKAYTVNSDPQDIMAEVYKNGPVEVAFTVYEDFAHYKS 264
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
G +Y I + G +A VK+VGWG +
Sbjct: 265 G------------VYKHITGFALGGHA------------VKLVGWGTSH----------- 289
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
G YW + + + +GD G KI RG NE IE+ V
Sbjct: 290 ----------------------EGEDYWLLANQWNTNWGDDGYFKIKRGTNECGIENAVT 327
Query: 240 GALP 243
LP
Sbjct: 328 AGLP 331
>gi|87240981|gb|ABD32839.1| Peptidase C1A, papain; Somatotropin hormone; Peptidase C1,
propeptide [Medicago truncatula]
Length = 356
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 66/244 (27%), Positives = 91/244 (37%), Gaps = 77/244 (31%)
Query: 2 CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
C+ G S W+++ G+VT + GC S P C EP +T P
Sbjct: 168 CAGGTPFSAWIYLAHHGVVTEECDPYFDQIGC---SHPGC--------EPTYRT-----P 211
Query: 60 KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
KC +C N N + + K+ + Y VN + DI E+ KNGPV +Y D YKS
Sbjct: 212 KCVKKCVNGN--QLWETSKHYSVKAYTVNSDPQDIMAEVYKNGPVEVAFTVYEDFAHYKS 269
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
G +Y I + G +A VK+VGWG +
Sbjct: 270 G------------VYKHITGFALGGHA------------VKLVGWGTSH----------- 294
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
G YW + + + +GD G KI RG NE IE+ V
Sbjct: 295 ----------------------EGEDYWLLANQWNTNWGDDGYFKIKRGTNECGIENAVT 332
Query: 240 GALP 243
LP
Sbjct: 333 AGLP 336
>gi|343475054|emb|CCD13447.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 55/217 (25%), Positives = 78/217 (35%), Gaps = 63/217 (29%)
Query: 27 SNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYW 86
+++ CQP FP C H +P C P C+ CT+ + KYR Y
Sbjct: 177 TSSQCQPYPFPRCEHRGAQGKKPPCSKYNFDTPTCNATCTD----KSVPLIKYRGNHSYE 232
Query: 87 VNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYA 146
V E D ++E+ NGP V ++SD +YKSG Y +
Sbjct: 233 VRGE-EDYKRELYFNGPFVVRFQVHSDFLAYKSGVYQH---------------------- 269
Query: 147 VSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPY 206
+ + V+IVGWG+ NG P Y
Sbjct: 270 --VAGNFLGGKAVRIVGWGKMNGTP----------------------------------Y 293
Query: 207 WTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
W + +++ +G G ILRG NE IE L P
Sbjct: 294 WKVANSWDTDWGMNGYFLILRGNNECNIEHLGFAGTP 330
>gi|194352768|emb|CAQ00112.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326488519|dbj|BAJ93928.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326508126|dbj|BAJ99330.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 355
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 60/244 (24%), Positives = 90/244 (36%), Gaps = 77/244 (31%)
Query: 2 CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
C+ G S W + + G+VT + TGCQ P C+ A P P
Sbjct: 169 CNGGYPISAWRYFRRSGVVTEECDPYFDQTGCQ---------------HPGCEP-AYPTP 212
Query: 60 KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
KCH +C +N + + ++K+ Y V+ DI E+ KNGPV +Y D YKS
Sbjct: 213 KCHRKCKVEN--QVWKKNKHFSVNAYRVHSNPHDIMAEVYKNGPVEVAFTVYEDFAHYKS 270
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
G Y + + ++ VK++GWG +
Sbjct: 271 GVYKH------------------------ITGGVMGGHAVKLIGWGTSDA---------- 296
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
G YW + + + +GD G KI+RG+NE IE V
Sbjct: 297 -----------------------GEDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEDVT 333
Query: 240 GALP 243
+P
Sbjct: 334 AGMP 337
>gi|189502866|gb|ACE06814.1| unknown [Schistosoma japonicum]
Length = 121
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 41/121 (33%), Positives = 58/121 (47%), Gaps = 35/121 (28%)
Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
+GPV + +Y+D +YKSGVY + A + +A
Sbjct: 33 HGPVEVDFEVYADFPNYKSGVYQHVSGALLGGHA-------------------------- 66
Query: 184 SAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
V+L+GWGEEN PYW I +++ +GD G KI+RG+NE IES VN +P
Sbjct: 67 ---------VRLLGWGEENNVPYWLIANSWNTDWGDNGYFKIIRGKNECGIESDVNAGIP 117
Query: 244 K 244
K
Sbjct: 118 K 118
>gi|66506619|ref|XP_393283.2| PREDICTED: uncharacterized peptidase C1-like protein F26E4.3-like
[Apis mellifera]
Length = 439
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 67/250 (26%), Positives = 101/250 (40%), Gaps = 80/250 (32%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVS--FPPCNHANYTTSEPE-CKTLATPQ 58
C G W+++ K GLV + C P + C T E C+ A P
Sbjct: 264 CDGGYLDRAWLFMRKFGLV-------DEQCYPWKGVYEQCKLQKRTNLEAAGCRAPANPL 316
Query: 59 PKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
K + Y+ Y + +E DI +EI+ +GPV A M +Y D FSY+
Sbjct: 317 RK----------------ELYKVGPAYRLGNET-DIMREILTSGPVQATMKVYQDFFSYE 359
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
SG Y + P+ Y+SG Y +V+I+GWGE+
Sbjct: 360 SGIYMHTPIAE---------LYESG------------YHSVRIIGWGED----------- 387
Query: 179 YAVSASAEIVAYATVKLIGWGEENGRP--YWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
+S ++G P YW +V+++G+++G+ G +I RG NE IES
Sbjct: 388 --IST-----------------DSGLPIKYWLVVNSWGQEWGENGLFRIRRGINECDIES 428
Query: 237 LVNGALPKDN 246
V K N
Sbjct: 429 FVVAVWAKTN 438
>gi|157167285|ref|XP_001658487.1| cathepsin b [Aedes aegypti]
gi|108876478|gb|EAT40703.1| AAEL007590-PA [Aedes aegypti]
Length = 313
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 61/236 (25%), Positives = 90/236 (38%), Gaps = 71/236 (30%)
Query: 11 WVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNY 70
W + K+G+ +GG + SN GC P PP P+ +P C TRC N
Sbjct: 141 WSYWVKQGVSSGGPYGSNQGCHPYPMPPSCPKPSEGDYPD-------EPNCSTRC---NA 190
Query: 71 GRGFFQD--KYRFKRY-YWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPV 127
G +D RF R Y + + I ++I NGPV A Y DI +Y G
Sbjct: 191 GYNVTEDLRDRRFGRVAYSIPADERKIMEDIFVNGPVQAVFQWYEDIVNYSGG------- 243
Query: 128 VANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEI 187
++ ++SG + VK++GWG E+G
Sbjct: 244 ---------VYRHQSG--------RLKGGHAVKLIGWGVEDG------------------ 268
Query: 188 VAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
YW + +++G +GD G K++RG N IE V+ LP
Sbjct: 269 ----------------TKYWLVANSWGRVWGDDGFFKMVRGENHCGIEENVHAGLP 308
>gi|327408413|emb|CCA30060.1| unnamed protein product [Neospora caninum Liverpool]
Length = 463
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 49/180 (27%), Positives = 72/180 (40%), Gaps = 28/180 (15%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHS---NTGCQPVSFPPCNHANYTTSEPECKTLATPQ 58
C+ G W W ++G+VTGG + T C P P C H + P C T P+
Sbjct: 241 CNGGQPGMAWRWFERKGVVTGGDFDTLGKGTTCWPYEIPFCAH-HAKAPFPNCDTDVRPR 299
Query: 59 --PKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFS 116
PKC C Y ++ V +++ K ++ Y +
Sbjct: 300 KTPKCRKDCEEAAY-----------------SEHVLPFDKDVHK----ASSSYSLRSRDA 338
Query: 117 YKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
K +G V +Y D +YKSGVY + +A +KI+GWG E+G YW V
Sbjct: 339 VKRDMMAHGTVTGAFMVYEDFLNYKSGVYKHVYGGPLGGHA-IKIIGWGTEDGEEYWHAV 397
>gi|609175|emb|CAA57522.1| cathepsin B-like cysteine proteinase [Nicotiana rustica]
Length = 356
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 65/253 (25%), Positives = 91/253 (35%), Gaps = 78/253 (30%)
Query: 2 CSSGISSSTWVWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
C G W + ++G+VT + N GC S P C EP A P P
Sbjct: 168 CDGGYPLQAWKYFVRKGVVTDECDPYFDNEGC---SHPGC--------EP-----AYPTP 211
Query: 60 KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
KCH +C N + + K+ Y ++ + I E+ KNGPV + +Y D YKS
Sbjct: 212 KCHRKCVKQNLL--WSRSKHFGVNAYMISSDPHSIMTEVYKNGPVEVSFTVYEDFAHYKS 269
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
G Y + + +I+ VK++GWG
Sbjct: 270 GVYKH------------------------VTGDIMGGHAVKLIGWGT------------- 292
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
E+G YW + + + +GD G KI RG NE IE V
Sbjct: 293 --------------------SEDGEDYWLLANQWNRGWGDDGYFKIRRGTNECEIEDEVV 332
Query: 240 GALPK-DNYGVEF 251
LP N VE
Sbjct: 333 AGLPSARNLNVEL 345
>gi|48762489|dbj|BAD23814.1| cathepsin B-N [Tuberaphis takenouchii]
Length = 163
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 45/124 (36%), Positives = 60/124 (48%), Gaps = 10/124 (8%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W W K GLVTGG + S GCQP PPC Y + C+ P K
Sbjct: 38 CHGGYPIKAWEWFKKHGLVTGGDYDSGEGCQPYRVPPCPLDEYGNN--TCR--GKPAEKN 93
Query: 62 HTRCTNDNYGR---GFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
H RCT YG F +D + + Y++ IQ+++M GP+ A+ +Y D +YK
Sbjct: 94 H-RCTRMCYGNQELDFKEDHHWTRDAYYLT--YTTIQKDVMAYGPIEASFDVYDDFPNYK 150
Query: 119 SGKY 122
SG Y
Sbjct: 151 SGVY 154
>gi|47212965|emb|CAF93376.1| unnamed protein product [Tetraodon nigroviridis]
Length = 271
Score = 68.2 bits (165), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 59/239 (24%), Positives = 94/239 (39%), Gaps = 55/239 (23%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G W ++ +RG+VT C P P A + + +++ + +
Sbjct: 75 CAGGRLDGAWWYLRRRGVVT-------EDCYPYRPPQQTPAELSRCMMQSRSVGRGKRQA 127
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
RC N N + D Y+ Y ++ +I +EI NGPV A M ++ D F Y SG
Sbjct: 128 TQRCPNTN---NYQNDIYQSTPPYRLSTSEKEIMKEIQDNGPVQAIMEVHEDFFMYNSG- 183
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+Y ++D+ K Y + +VKI GWGEE
Sbjct: 184 ---------IYKHTDVSFTKPPHYRKHGT------HSVKITGWGEERN------------ 216
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
+ R YW +++G+ +G+ G +I RG NE IE+ V G
Sbjct: 217 -----------------FDGTTRKYWIAANSWGKNWGENGYFRIARGENECEIEAFVIG 258
>gi|496968|gb|AAA96831.1| cysteine protease homologue, partial [Ancylostoma caninum]
Length = 197
Score = 68.2 bits (165), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 56/177 (31%), Positives = 74/177 (41%), Gaps = 28/177 (15%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPV--SFPPCNHANYTTSEPECKTLATPQP 59
C G S + W+ + + C+PV S NH N P C P P
Sbjct: 44 CQGGWSIEAYKWMQRERCCYRWENTDRRVCKPVRPSIRVGNHPNDPYYGP-CPGGLWPTP 102
Query: 60 KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
KC C Y + + +DK+ R Y++ + I+QEI KNGPVVA +Y D FSY
Sbjct: 103 KCRKTCQRKYY-KSYQEDKHFATRAYYLPNNERSIRQEIYKNGPVVAAFRVYQD-FSY-- 158
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
YK G+Y + A+A VK+VGWG EN YW I
Sbjct: 159 --------------------YKKGIYVHKWGGQTGAHA-VKVVGWGRENATDYWLIA 194
>gi|12330244|gb|AAG52659.1| cysteine proteinase [Metagonimus yokogawai]
Length = 183
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 47/167 (28%), Positives = 76/167 (45%), Gaps = 28/167 (16%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNH--ANYTTSEPECKTLATPQP 59
C G W + G+VTGG + + C P FPP +H + T E +TL P P
Sbjct: 38 CVGGWIGDAWDYWRDNGIVTGGDYQDKSTCLPYPFPPSHHLVSKGTPFEIYPQTLY-PTP 96
Query: 60 KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
C ++C + Y + +DK Y ++ +IQ+EI+ NGPV A M +Y+D +YK+
Sbjct: 97 PCVSKC-QEGYPGEYEKDKIFALSSYKIDRNATEIQKEILINGPVEAGMNVYADFPNYKT 155
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGE 166
G Y + + EI+ ++++GWG+
Sbjct: 156 GVYQH------------------------TTGEILGGHAIRLLGWGK 178
>gi|410910940|ref|XP_003968948.1| PREDICTED: tubulointerstitial nephritis antigen-like [Takifugu
rubripes]
Length = 477
Score = 67.8 bits (164), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 61/239 (25%), Positives = 96/239 (40%), Gaps = 55/239 (23%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G W ++ +RG+VT C P P A + +++ + +
Sbjct: 272 CTGGRIDGAWWFLRRRGVVT-------EDCYPYRPPQQTPAELGRCMMQSRSVGRGKRQA 324
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
RC N N + D Y+ Y ++ +I +EI NGPV A M ++ D F YKSG
Sbjct: 325 TQRCPNTN---NYQNDIYQSTPPYRLSTNEKEIMKEIQDNGPVQAIMEVHEDFFVYKSG- 380
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+Y ++D+ K Y + +VKI GWGEE V
Sbjct: 381 ---------IYKHTDVSFTKPPQYRKHGT------HSVKITGWGEERN-----------V 414
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
+ R YW +++G+ +G++G +I RG NE IE+ V G
Sbjct: 415 DGAK------------------RKYWIAANSWGKNWGEEGYFRIARGENECEIEAFVIG 455
>gi|345327151|ref|XP_001507103.2| PREDICTED: tubulointerstitial nephritis antigen-like
[Ornithorhynchus anatinus]
Length = 327
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 61/246 (24%), Positives = 97/246 (39%), Gaps = 68/246 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G W ++ +RGLV+ C P++ + + +EP C+ + P +
Sbjct: 124 CNGGRLDRAWSFLRRRGLVS-------DKCYPLA------SQNSIAEP-CRMYSRPMGRG 169
Query: 62 HTRCT-----NDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFS 116
+ T N ++ + D Y+ Y ++ DI +EIM+NGPV A M ++ D F
Sbjct: 170 KRQATGPCPNNFHHSNDYSNDIYQSTPPYRLSSNEKDIMKEIMENGPVQALMEVHEDFFL 229
Query: 117 YKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
YK G I+ + +VKI GWGEE
Sbjct: 230 YKDG----------------IYRHTPASNGKPPQFRRQGTHSVKITGWGEEL-------- 265
Query: 177 RVYAVSASAEIVAYATVKLIGWGEENGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAII 234
+ NGR +W +++G +G+ G+ +ILRG NE I
Sbjct: 266 -----------------------QPNGRRVKFWRAANSWGPTWGEGGSFRILRGCNECDI 302
Query: 235 ESLVNG 240
ES V G
Sbjct: 303 ESFVVG 308
>gi|129270160|ref|NP_001038442.2| tubulointerstitial nephritis antigen-like precursor [Danio rerio]
gi|126632071|gb|AAI33830.1| Si:dkey-158b13.1 [Danio rerio]
Length = 471
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 60/239 (25%), Positives = 98/239 (41%), Gaps = 55/239 (23%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G W ++ +RG+VT C P S P + + + + + +
Sbjct: 267 CAGGRIDGAWWFMRRRGVVT-------QDCYPFSPPEQSAVEVARCMMQSRAVGRGKRQA 319
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C N + + D Y+ Y ++ +I +EIM NGPV A M ++ D F YKSG
Sbjct: 320 TAHCPNSH---SYHNDIYQSTPPYRLSTNENEIMKEIMDNGPVQAIMEVHEDFFVYKSG- 375
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
++ ++D+ +K Y A+ +V+I GWGEE R Y+
Sbjct: 376 ---------IFRHTDVNYHKPSQYRKHAT------HSVRITGWGEE---------RDYSG 411
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
R YW +++G+ +G+ G +I RG NE IE+ V G
Sbjct: 412 RT--------------------RKYWIGANSWGKNWGEDGYFRIARGVNECDIETFVIG 450
>gi|10803441|emb|CAC13133.1| putative cathepsin B.7 [Ostertagia ostertagi]
Length = 198
Score = 67.8 bits (164), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 59/182 (32%), Positives = 77/182 (42%), Gaps = 36/182 (19%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEP---ECKTLATPQ 58
C G W + G+VTGG C+ PC Y +EP C ++A
Sbjct: 43 CEGGWPIEAWKYGVTEGVVTGGNFGRKECCRSYEIHPCG---YHGNEPFYGHCHSMAR-T 98
Query: 59 PKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
P C RC Y + DK Y + + V IQ++IM+NGPVVA +Y D F Y
Sbjct: 99 PPCKKRC-RPGYKNSYMMDKRYGTSAYELPNSVXAIQRDIMENGPVVAGFDVYED-FKY- 155
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGE---ENGR-PYWT 174
YKSG+Y +A +A VK++GWGE ENG PYW
Sbjct: 156 ---------------------YKSGIYRHTAGKXTGGHA-VKVIGWGEEXTENGTIPYWI 193
Query: 175 IV 176
I
Sbjct: 194 IA 195
>gi|2317912|gb|AAC24376.1| cathepsin B-like cysteine proteinase [Arabidopsis thaliana]
Length = 357
Score = 67.4 bits (163), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 65/246 (26%), Positives = 89/246 (36%), Gaps = 77/246 (31%)
Query: 2 CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
C+ G W++ G+VT + NTGC S P C EP P P
Sbjct: 169 CNGGFPMGAWLYFKYHGVVTQECDPYFDNTGC---SHPGC--------EP-----TYPTP 212
Query: 60 KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
KC +C + N G + K+ Y +N + DI E+
Sbjct: 213 KCERKCVSRNQLWG--ESKHYGVGAYRINPDPQDIMAEV--------------------- 249
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
Y NGPV +Y D YKSGVY +I +A VK++GWG
Sbjct: 250 --YKNGPVEVAFTVYEDFAHYKSGVYKYITGTKIGGHA-VKLIGWGT------------- 293
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
++G YW + + + +GD G KI RG NE IE V
Sbjct: 294 --------------------SDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEQSVV 333
Query: 240 GALPKD 245
LP +
Sbjct: 334 AGLPSE 339
>gi|308494436|ref|XP_003109407.1| hypothetical protein CRE_08204 [Caenorhabditis remanei]
gi|308246820|gb|EFO90772.1| hypothetical protein CRE_08204 [Caenorhabditis remanei]
Length = 470
Score = 67.4 bits (163), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 55/173 (31%), Positives = 75/173 (43%), Gaps = 52/173 (30%)
Query: 76 QDKYRFKRY--YWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYL 133
QD FK Y V+ DIQ E+M NGPV A ++ D F Y G +Y
Sbjct: 325 QDSTAFKMTPPYKVSSREEDIQTELMTNGPVQATFVVHEDFFMYAGG----------VYQ 374
Query: 134 YSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATV 193
+SD+ + K AS+ Y +V+++GWG V ++T
Sbjct: 375 HSDLAAQK------GASSVAEGYHSVRVLGWG----------------------VDHST- 405
Query: 194 KLIGWGEENGRP--YWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
GRP YW +++G Q+G+ G KILRG N IES V GA K
Sbjct: 406 ---------GRPIKYWLCANSWGTQWGEDGYFKILRGENHCEIESFVIGAWGK 449
>gi|170030060|ref|XP_001842908.1| cathepsin B [Culex quinquefasciatus]
gi|167865914|gb|EDS29297.1| cathepsin B [Culex quinquefasciatus]
Length = 320
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 66/244 (27%), Positives = 94/244 (38%), Gaps = 74/244 (30%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G TW + GL + G + S GC F +Y ++P P C
Sbjct: 149 CDGGYVGKTWQYWVDSGLTSEGPYKSGQGCNSYPF-----GSYCVNDP--------LPTC 195
Query: 62 HTRCTNDNYGRGFFQD-KYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C Y + QD KY Y + +E A I EI +NGPVV +++D + YKSG
Sbjct: 196 SRTC-QAGYPLTYSQDLKYGGSAYRVMWNENA-IMTEIYQNGPVVVQFEVFADFYQYKSG 253
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
Y + V+ + E + V+++GWG ENG
Sbjct: 254 VYRH----------------------VTGATE--GWHAVRVIGWGVENG----------- 278
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
VK YW + +++G ++GDKG K +RG N IE V
Sbjct: 279 ------------VK-----------YWLVANSWGVRWGDKGFFKFVRGENHLGIEDFVYA 315
Query: 241 ALPK 244
LPK
Sbjct: 316 GLPK 319
>gi|38639325|gb|AAR25800.1| cathepsin B-like cysteine proteinase [Solanum tuberosum]
Length = 354
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 73/257 (28%), Positives = 93/257 (36%), Gaps = 82/257 (31%)
Query: 2 CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
C G + W + + G+VT + TGC S P C+ L P P
Sbjct: 168 CDGGYPIAAWRYFKRSGVVTEECDPYFDTTGC---------------SHPGCEPL-YPTP 211
Query: 60 KCHTRCTNDNYGRGFFQDKYRFKRYYWVND-EVADIQQEIMKNGPVVANMYLYSDIFSYK 118
KCH +C N +R ++Y VN V+ Q IM
Sbjct: 212 KCHRKCVKGNV-------LWRKSKHYGVNAYRVSHDPQSIMAE----------------- 247
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVR 177
Y NGPV + +Y D YKSGVY + +A VK++GWG E G YW IV
Sbjct: 248 --VYKNGPVEVSFTVYEDFAHYKSGVYKHVTGGNMGGHA-VKLIGWGTSEQGEDYWLIVN 304
Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
+ GWGE+ G KI RG NE IE
Sbjct: 305 SWNR---------------GWGED-------------------GYFKIRRGTNECGIEHS 330
Query: 238 VNGALPK-DNYGVEFGE 253
V LP N VE G+
Sbjct: 331 VVAGLPSARNLNVELGD 347
>gi|283468816|emb|CAO98753.1| putative cathepsin B [Fasciola hepatica]
Length = 112
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 50/168 (29%), Positives = 71/168 (42%), Gaps = 58/168 (34%)
Query: 76 QDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYS 135
QDK + K Y V ++ DI EIMKNGPV Y++ D YKSG
Sbjct: 3 QDKVKGKSSYNVGEQETDIMMEIMKNGPVDGIFYMFEDFLVYKSG--------------- 47
Query: 136 DIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKL 195
I+ Y +G +V ++++GWG ENG VK
Sbjct: 48 -IYHYTTG--------RLVGGHAIRVIGWGVENG-----------------------VK- 74
Query: 196 IGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
YW I +++ E +G+KG ++ RG NE IE+ +N LP
Sbjct: 75 ----------YWLIANSWNEGWGEKGYFRMRRGNNECGIEARINAGLP 112
>gi|343477197|emb|CCD11909.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 336
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 59/242 (24%), Positives = 83/242 (34%), Gaps = 70/242 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W + G+ +++ CQP FP C H + C P+C
Sbjct: 159 CEGGYPDAAWEYYVSHGI-------ASSQCQPYPFPRCEHRGAQGKKTPCSKYKFVTPQC 211
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ CT+ + KYR Y V E YK
Sbjct: 212 NATCTD----KTIPLIKYRGNHSYEVRGEE------------------------DYKREL 243
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y NGP V ++SD +YK+GVY + + V+IVGWG+ NG P
Sbjct: 244 YFNGPFVVRFQVHSDFLAYKNGVYQ-HVAGNFLGGKAVRIVGWGKLNGTP---------- 292
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW + +++ +G G ILRG NE IE L
Sbjct: 293 ------------------------YWKVANSWDTDWGMNGYFLILRGDNECNIEHLGFAG 328
Query: 242 LP 243
P
Sbjct: 329 TP 330
>gi|417409900|gb|JAA51439.1| Putative cysteine proteinase tin-ag, partial [Desmodus rotundus]
Length = 346
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 63/243 (25%), Positives = 97/243 (39%), Gaps = 62/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPEC----KTLATP 57
C G S W ++ +RG+V+ C P S T P C + +
Sbjct: 149 CQGGHLDSAWWFLRRRGVVS-------DHCYPFSG---QGRTETGPAPRCMMHSRAMGRG 198
Query: 58 QPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
+ + RC N D Y+ Y + +I +E+M+NGPV A M ++ D F Y
Sbjct: 199 KRQATARCPNHQV---HANDIYQVTPAYRLGSSEKEIMKELMENGPVQALMEVHEDFFLY 255
Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
++G Y + PV L + G + +VKI GWGEE+
Sbjct: 256 QNGIYSHTPVS----LGRPERYRRHGTH------------SVKITGWGEES--------- 290
Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
+ T+K YWT +++G +G++G +I+RG NE IES
Sbjct: 291 ---------LPDGRTLK-----------YWTAANSWGPAWGERGHFRIVRGANECDIESF 330
Query: 238 VNG 240
V G
Sbjct: 331 VLG 333
>gi|18378945|ref|NP_563647.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|332189291|gb|AEE27412.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
Length = 379
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 65/246 (26%), Positives = 89/246 (36%), Gaps = 77/246 (31%)
Query: 2 CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
C+ G W++ G+VT + NTGC S P C EP P P
Sbjct: 191 CNGGFPMGAWLYFKYHGVVTQECDPYFDNTGC---SHPGC--------EP-----TYPTP 234
Query: 60 KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
KC +C + N G + K+ Y +N + DI E+
Sbjct: 235 KCERKCVSRNQLWG--ESKHYGVGAYRINPDPQDIMAEV--------------------- 271
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
Y NGPV +Y D YKSGVY +I +A VK++GWG
Sbjct: 272 --YKNGPVEVAFTVYEDFAHYKSGVYKYITGTKIGGHA-VKLIGWGT------------- 315
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
++G YW + + + +GD G KI RG NE IE V
Sbjct: 316 --------------------SDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEQSVV 355
Query: 240 GALPKD 245
LP +
Sbjct: 356 AGLPSE 361
>gi|341891358|gb|EGT47293.1| hypothetical protein CAEBREN_29072 [Caenorhabditis brenneri]
Length = 349
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 59/188 (31%), Positives = 80/188 (42%), Gaps = 57/188 (30%)
Query: 66 TNDNYGRGF-----FQDKYRFKRY--YWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
+ DN RG QD FK Y V+ DIQ E+M NGPV A ++ D F Y
Sbjct: 189 SRDNDRRGLRCPSGSQDSTAFKMTPPYKVSSREEDIQTELMTNGPVQATFVVHEDFFMYA 248
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
G +Y +SD+ + K AS+ Y +V+++GWG
Sbjct: 249 GG----------VYQHSDLAAQK------GASSVAEGYHSVRVLGWG------------- 279
Query: 179 YAVSASAEIVAYATVKLIGWGEENGRP--YWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
V ++T GRP YW +++G Q+G+ G KILRG N IES
Sbjct: 280 ---------VDHST----------GRPIKYWLCANSWGTQWGEDGYFKILRGDNHCEIES 320
Query: 237 LVNGALPK 244
V GA K
Sbjct: 321 FVVGAWGK 328
>gi|294937366|ref|XP_002782055.1| cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
gi|239893340|gb|EER13850.1| cathepsin B precursor, putative [Perkinsus marinus ATCC 50983]
Length = 159
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 52/205 (25%), Positives = 78/205 (38%), Gaps = 62/205 (30%)
Query: 27 SNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYY 85
S GC P FP CNH S P C ++ H T+ + + +D +R K +
Sbjct: 4 SADGCWPYPFPKCNHVRSAASRYPACPAVSPSAVGAHQMETSYSL---YIRDLHRAKSFG 60
Query: 86 WVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVY 145
+ +I+QEI NGPV+ + +Y DI YK+G Y
Sbjct: 61 RLPAIPQNIKQEIFTNGPVIGMLSIYEDIRVYKAGVY----------------------- 97
Query: 146 AVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRP 205
V + T+KI+GWG E+G +
Sbjct: 98 -VHQTGSFQGIHTLKIIGWGVESG----------------------------------QD 122
Query: 206 YWTIVSTFGEQFGDKGTIKILRGRN 230
YW V+++ E++GD G IK+ GR
Sbjct: 123 YWLAVNSWNEEWGDHGMIKLAVGRT 147
>gi|348513320|ref|XP_003444190.1| PREDICTED: tubulointerstitial nephritis antigen-like [Oreochromis
niloticus]
Length = 499
Score = 67.0 bits (162), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 58/240 (24%), Positives = 98/240 (40%), Gaps = 55/240 (22%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G W ++ +RG+VT C P P A + +++ + +
Sbjct: 294 CAGGRIDGAWWYLRRRGVVT-------EDCYPYQPPHQTPAEVGRCMMQSRSVGRGKRQA 346
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
RC N + + D Y+ Y ++ +I +EIM NGPV A M ++ D F YK+G
Sbjct: 347 TQRCPNT---QNYHNDIYQSTPPYRLSSNEKEIMKEIMDNGPVQAIMEVHEDFFVYKTG- 402
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+Y ++D+ K Y + +V+I GWGE+ V
Sbjct: 403 ---------IYKHTDVSFTKPPQYRKHGT------HSVRITGWGEDRN-----------V 436
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
++ R YW +++G+ +G+ G +I+RG NE IE+ V G
Sbjct: 437 DGTS------------------RKYWIAANSWGKNWGENGYFRIVRGENECEIETFVIGV 478
>gi|14582576|gb|AAK69541.1|AF283476_1 cathepsin B-like cysteine proteinase [Ipomoea batatas]
Length = 352
Score = 67.0 bits (162), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 65/260 (25%), Positives = 92/260 (35%), Gaps = 77/260 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
C G + W + + G+VT + TGC S P C EP A P P
Sbjct: 164 CDGGYPIAAWQYFKRTGVVTSECDPYFDQTGC---SHPGC--------EP-----AYPTP 207
Query: 60 KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
C +C N + + K+ Y VN + I E+
Sbjct: 208 ACEKKCVKKNLL--WSESKHFSVNAYRVNSDQHSIMTEV--------------------- 244
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
Y NGP + +Y D YKSGVY +E+ +A VK++GWG
Sbjct: 245 --YTNGPAEVSFTVYEDFAHYKSGVYKHVTGSEMGGHA-VKLIGWGT------------- 288
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
E+G YW + + + +GD G KI+RG NE IE +
Sbjct: 289 --------------------SEDGEDYWLLANQWNRSWGDDGYFKIIRGTNECGIEDVTA 328
Query: 240 GALPKDNYGVEFGEESGERL 259
G N +E G + L
Sbjct: 329 GMPSTKNLDIESGVRDDDSL 348
>gi|432884030|ref|XP_004074413.1| PREDICTED: tubulointerstitial nephritis antigen-like [Oryzias
latipes]
Length = 474
Score = 67.0 bits (162), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 61/243 (25%), Positives = 98/243 (40%), Gaps = 63/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G W ++ +RG+VT C P P A + + + + +
Sbjct: 269 CAGGRIDGAWWYLRRRGVVT-------ENCYPYQPPQQAPAEVGRCMMQSRAVGRGKRQA 321
Query: 62 HTRCTND-NYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
RC N NY +Q +K ++ +I +EIM+NGPV A M ++ D F YK+G
Sbjct: 322 TQRCPNTYNYHNDIYQSTPPYK----LSSNEKEIMKEIMENGPVQAIMEVHEDFFVYKNG 377
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEE---NGRPYWTIVR 177
+Y ++D+ S K Y + +V+I GWGE+ +G P
Sbjct: 378 ----------IYKHTDVSSTKPPQYRKHGT------HSVRITGWGEDKDYDGTP------ 415
Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
R YW +++G+ +G+ G +I RG NE IE+
Sbjct: 416 --------------------------RKYWIAANSWGKNWGENGFFRIARGANECEIEAF 449
Query: 238 VNG 240
V G
Sbjct: 450 VIG 452
>gi|157058733|gb|ABV03124.1| cathepsin B-16a [Acyrthosiphon pisum]
Length = 274
Score = 67.0 bits (162), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 61/128 (47%), Gaps = 10/128 (7%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G W + G+VTGG + S GC+P PPC E + P K
Sbjct: 153 CNGGYPIKAWKYFSSHGIVTGGNYKSGEGCEPYRVPPCPQ----DEEGKSSCAGKPIEKN 208
Query: 62 HTRCTNDNYGRG--FFQDKYRFKR-YYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
H RCT YG + + +RF R YY++ IQ+++M GP+ A+ +Y D SYK
Sbjct: 209 H-RCTRMCYGNQDLDYNEDHRFTRDYYYLT--YGSIQKDVMNYGPIEASFDVYDDFPSYK 265
Query: 119 SGKYGNGP 126
SG Y P
Sbjct: 266 SGVYQRTP 273
>gi|358421824|ref|XP_003585145.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bos taurus]
Length = 428
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 65/243 (26%), Positives = 98/243 (40%), Gaps = 62/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAH----HSNTGCQPVSFPPCNHANYTTSEPECKTLATP 57
C G W ++ +RG+V+ + H + V PPC ++ + K AT
Sbjct: 231 CRGGRLDGAWWFLRRRGVVSDHCYPFSGHGRD--EAVPAPPC--MMHSRAMGRGKRQAT- 285
Query: 58 QPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
RC N D Y+ Y + +I +E+M+NGPV A M ++ D F Y
Sbjct: 286 -----ARCPNSYV---HANDIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLY 337
Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
+SG Y + PV L + G + +VKI GWGEE
Sbjct: 338 QSGIYSHTPVS----LGRPERYRRHGTH------------SVKITGWGEET--------- 372
Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
+ T+K YWT +++G +G++G +I+RG NE IES
Sbjct: 373 ---------LPDGRTIK-----------YWTAANSWGPAWGERGHFRIVRGANECDIESF 412
Query: 238 VNG 240
V G
Sbjct: 413 VLG 415
>gi|431891156|gb|ELK02033.1| Tubulointerstitial nephritis antigen-like protein [Pteropus alecto]
Length = 467
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 63/243 (25%), Positives = 96/243 (39%), Gaps = 62/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPEC----KTLATP 57
C G W ++ +RG+V+ C P S N EP C + +
Sbjct: 270 CRGGRLDGAWWFLRRRGVVS-------DHCYPFSG---QERNEAGPEPRCMMHSRAMGRG 319
Query: 58 QPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
+ + RC N + D Y+ Y + +I +E+M+NGPV A M ++ D F Y
Sbjct: 320 KRQAIARCPNHHV---HANDIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLY 376
Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
+ G Y + PV L + G + +VKI GWGEE
Sbjct: 377 QGGIYSHTPVS----LGKPERYRRHGTH------------SVKITGWGEET--------- 411
Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
+ T+K YWT +++G +G++G +I+RG NE IES
Sbjct: 412 ---------LPDGRTLK-----------YWTAANSWGPAWGERGHFRIVRGTNECDIESF 451
Query: 238 VNG 240
V G
Sbjct: 452 VLG 454
>gi|341898422|gb|EGT54357.1| hypothetical protein CAEBREN_10381 [Caenorhabditis brenneri]
Length = 466
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 53/173 (30%), Positives = 73/173 (42%), Gaps = 52/173 (30%)
Query: 76 QDKYRFKRY--YWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYL 133
QD FK Y V+ DIQ E+M NGPV A ++ D F Y G +Y
Sbjct: 321 QDSTAFKMTPPYKVSSREEDIQTELMTNGPVQATFVVHEDFFMYAGG----------VYQ 370
Query: 134 YSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATV 193
+SD+ + K AS+ Y +V+++GWG ++
Sbjct: 371 HSDLAAQK------GASSVAEGYHSVRVLGWGVDH------------------------- 399
Query: 194 KLIGWGEENGRP--YWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
GRP YW +++G Q+G+ G KILRG N IES V GA K
Sbjct: 400 -------STGRPIKYWLCANSWGTQWGEDGYFKILRGDNHCEIESFVIGAWGK 445
>gi|268564843|ref|XP_002639246.1| Hypothetical protein CBG03805 [Caenorhabditis briggsae]
Length = 526
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 54/170 (31%), Positives = 74/170 (43%), Gaps = 52/170 (30%)
Query: 76 QDKYRFKRY--YWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYL 133
QD FK Y V+ DIQ E+M NGPV A ++ D F Y G +Y
Sbjct: 381 QDSTAFKMTPPYKVSSREEDIQTELMTNGPVQATFVVHEDFFMYAGG----------VYQ 430
Query: 134 YSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATV 193
+SD+ + K AS+ Y +V+++GWG V ++T
Sbjct: 431 HSDLAAQK------GASSVAEGYHSVRVLGWG----------------------VDHST- 461
Query: 194 KLIGWGEENGRP--YWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
GRP YW +++G Q+G+ G KILRG N IES V GA
Sbjct: 462 ---------GRPIKYWLCANSWGTQWGEDGYFKILRGENHCEIESFVIGA 502
>gi|159175|gb|AAA29176.1| cysteine proteinase [Haemonchus contortus]
Length = 348
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 58/244 (23%), Positives = 93/244 (38%), Gaps = 63/244 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + G+V+GG + C P PC T C +A P P C
Sbjct: 159 CRGGWPIEAWKFFEYDGVVSGGPYLGKGCCSPYPLHPCGRHGNDTFYGNCVGMA-PTPPC 217
Query: 62 HTRCTNDNYGRGFFQDKYRFK---RYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
+C RG ++ R+ R Y + I+++I + G VVA +F+
Sbjct: 218 KRKCQPGF--RGMYRVDKRYGEPGRTYTLPRSEVKIRRDIKERGSVVA-------VFA-- 266
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
+Y D Y+SG+Y +A + G
Sbjct: 267 --------------VYEDFSHYQSGIYKHTAG---------RFTG--------------- 288
Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
Y VK+IGWG++NG YW I +++ + +G+ G +++RG N IE V
Sbjct: 289 ----------GYHAVKMIGWGKDNGTDYWLIANSWHDDWGENGFFRMIRGINNCGIEEQV 338
Query: 239 NGAL 242
+ +
Sbjct: 339 DAGI 342
>gi|268561866|ref|XP_002638438.1| Hypothetical protein CBG18654 [Caenorhabditis briggsae]
Length = 396
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 66/243 (27%), Positives = 87/243 (35%), Gaps = 69/243 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + + G+VTGG + GC P SF PC+ T EP+ P C
Sbjct: 155 CQGGYTIEAMKYWMNSGVVTGGDYQ-GAGCIPYSFRPCS----TCKEPK------DAPSC 203
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
T C + ++ V + V IQ EI NGPV +Y D + YKSG
Sbjct: 204 KTTCQASYKAKSAYRLPTTTSSNAIVANAVQMIQTEIYNNGPVEVAYQVYDDFYHYKSGV 263
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y ++Y D K +A VKI+GWG E YW + ++
Sbjct: 264 Y--------YHVYGD----KPSGHA------------VKIIGWGTEKKVDYWLVANSWST 299
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+ FG+ G KI RG NE IE V
Sbjct: 300 T----------------------------------FGENGFFKIRRGTNECGIEENVVAG 325
Query: 242 LPK 244
LPK
Sbjct: 326 LPK 328
>gi|326492684|dbj|BAJ90198.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 355
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 59/244 (24%), Positives = 89/244 (36%), Gaps = 77/244 (31%)
Query: 2 CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
C+ G S W + + G+VT + TGCQ P C+ A P P
Sbjct: 169 CNGGYPISAWRYFRRSGVVTEECDPYFDQTGCQ---------------HPGCEP-AYPTP 212
Query: 60 KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
KCH +C +N + + ++K+ Y V+ DI E+ KNGPV +Y D YKS
Sbjct: 213 KCHRKCKVEN--QVWKKNKHSSVNAYRVHSNPHDIMAEVYKNGPVEVAFTVYEDFAHYKS 270
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
G Y + + ++ VK++GWG +
Sbjct: 271 GVYKH------------------------ITGGVMGGHAVKLIGWGTSDA---------- 296
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
G YW + + + +G G KI+RG+NE IE V
Sbjct: 297 -----------------------GEDYWLLANQWNRGWGGDGYFKIIRGKNECGIEEDVT 333
Query: 240 GALP 243
+P
Sbjct: 334 AGMP 337
>gi|294898091|ref|XP_002776152.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239882839|gb|EER07968.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 382
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 61/243 (25%), Positives = 95/243 (39%), Gaps = 71/243 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G S W WVH +G+ TG + + P + + P P C
Sbjct: 210 CGGGDPYSAWSWVHDKGIATGEGSRPKRVSESEAIPVIAYQDIY-----------PTPNC 258
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C N Y D++ ++++ P Y YS + K+
Sbjct: 259 VEQCRNPKYTTTLRDDRHF-----------------MLESSP-----YHYS-VNDAKNAI 295
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+GPV A+ +Y D +YKSGVY ++ + + +A VKI+GWGE++G
Sbjct: 296 RTDGPVSASFTVYEDFLAYKSGVYKHTSGSYLGGHA-VKIIGWGEKSG------------ 342
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+ YW V+++ E +GDKG KI G N I + L+ G
Sbjct: 343 ----------------------QAYWLAVNSWNEDWGDKGLFKIALG-NCGIDDDLLGGT 379
Query: 242 LPK 244
PK
Sbjct: 380 -PK 381
>gi|417401428|gb|JAA47600.1| Putative cysteine proteinase tin-ag [Desmodus rotundus]
Length = 466
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 63/243 (25%), Positives = 97/243 (39%), Gaps = 62/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPEC----KTLATP 57
C G S W ++ +RG+V+ C P S T P C + +
Sbjct: 269 CQGGHLDSAWWFLRRRGVVS-------DHCYPFSG---QGRTETGPAPRCMMHSRAMGRG 318
Query: 58 QPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
+ + RC N D Y+ Y + +I +E+M+NGPV A M ++ D F Y
Sbjct: 319 KRQATARCPNHQV---HANDIYQVTPAYRLGSSEKEIMKELMENGPVQALMEVHEDFFLY 375
Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
++G Y + PV L + G + +VKI GWGEE+
Sbjct: 376 QNGIYSHTPVS----LGRPERYRRHGTH------------SVKITGWGEES--------- 410
Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
+ T+K YWT +++G +G++G +I+RG NE IES
Sbjct: 411 ---------LPDGRTLK-----------YWTAANSWGPAWGERGHFRIVRGANECDIESF 450
Query: 238 VNG 240
V G
Sbjct: 451 VLG 453
>gi|363742306|ref|XP_428202.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Gallus
gallus]
Length = 464
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 59/240 (24%), Positives = 92/240 (38%), Gaps = 58/240 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAH-HSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPK 60
CS G W ++ +RG+VT + ++ QP + P H+ T T P P+
Sbjct: 269 CSGGRLDGAWWYLRRRGVVTDECYPFTSQDSQPAAQPCMMHSRSTGRGKRQATARCPNPQ 328
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
H D Y+ Y + +I +E+M+NGPV A + ++ D F YKSG
Sbjct: 329 THA------------NDIYQSTPAYRLAPSEKEIMKELMENGPVQAILEVHEDFFLYKSG 376
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
I+ + + + +VKI GWGEE P + +
Sbjct: 377 ----------------IYRHTAVAEGKGPKHQQHGTHSVKITGWGEEQ-LPDGQVQK--- 416
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
YWT +++G +G+ G +I RG NE +ES V G
Sbjct: 417 -------------------------YWTAANSWGRAWGEDGHFRIARGVNECEVESFVVG 451
>gi|297843026|ref|XP_002889394.1| hypothetical protein ARALYDRAFT_887367 [Arabidopsis lyrata subsp.
lyrata]
gi|297335236|gb|EFH65653.1| hypothetical protein ARALYDRAFT_887367 [Arabidopsis lyrata subsp.
lyrata]
Length = 359
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 65/237 (27%), Positives = 88/237 (37%), Gaps = 77/237 (32%)
Query: 11 WVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTND 68
W++ G+VT + NTGC S P C EP P PKC +C ++
Sbjct: 180 WLYFKYHGVVTEECDPYFDNTGC---SHPGC--------EP-----GYPTPKCVRKCVSE 223
Query: 69 NYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVV 128
N G + K+ Y +N + DI E+ KNGPV +Y D YKSG
Sbjct: 224 NQLWG--ESKHYGVSAYRINHDPQDIMAEVYKNGPVEVAFTVYEDFAHYKSG-------- 273
Query: 129 ANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIV 188
+Y I K G +A VK++GWG
Sbjct: 274 ----VYKHITGTKIGGHA------------VKLIGWGT---------------------- 295
Query: 189 AYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKD 245
++G YW + + + +GD G KI RG NE IE V LP D
Sbjct: 296 -----------SDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEHGVVAGLPSD 341
>gi|110456454|gb|ABG74712.1| cathepsin B preproprotein-like protein [Diaphorina citri]
Length = 125
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 40/123 (32%), Positives = 56/123 (45%), Gaps = 35/123 (28%)
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y +GP+VA +Y+D YKSGVY + I +A
Sbjct: 34 YEHGPLVAIFSVYADFLQYKSGVYQHNFGDSIGLHA------------------------ 69
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
V+++GWG EN PYW + +++ + +GD GT KILRG NEA IE N
Sbjct: 70 -----------VRVLGWGVENDIPYWLVANSWNDHWGDHGTFKILRGENEADIEMGFNVG 118
Query: 242 LPK 244
P+
Sbjct: 119 YPQ 121
>gi|48762487|dbj|BAD23813.1| cathepsin B-N [Tuberaphis taiwana]
Length = 163
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 46/124 (37%), Positives = 59/124 (47%), Gaps = 10/124 (8%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
CS G W K GLVTGG + S GCQP PPC Y + C+ P K
Sbjct: 38 CSGGYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPCPLDEYGNN--TCR--GKPAEKN 93
Query: 62 HTRCTNDNYGR---GFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
H RCT YG F +D + + Y++ IQ +I+ GP+ A+ +Y D SYK
Sbjct: 94 H-RCTRMCYGNQDLDFKEDHHYTRDAYYLT--YGTIQNDILAYGPIEASFEVYDDFPSYK 150
Query: 119 SGKY 122
SG Y
Sbjct: 151 SGVY 154
>gi|348570708|ref|XP_003471139.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cavia
porcellus]
Length = 468
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 63/246 (25%), Positives = 99/246 (40%), Gaps = 68/246 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVT------GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLA 55
C G W ++ +RG+V+ G + G PPC + + +
Sbjct: 271 CRGGHLDGAWWFLRRRGVVSDHCYPFSGREQAEAG----PAPPCMMHS--------RAMG 318
Query: 56 TPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIF 115
+ + RC N + D Y+ Y + + +I +E+M+NGPV A M ++ D F
Sbjct: 319 RGKRQATRRCPNSHTD---ANDIYQVTPAYRLGSDEKEIMKELMENGPVQALMEVHEDFF 375
Query: 116 SYKSGKYGNGPV-VANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWT 174
YK G Y + P+ +A Y + G + +VKI GWGEE
Sbjct: 376 LYKGGIYSHTPLSMARPEQYR-----RHGTH------------SVKITGWGEET------ 412
Query: 175 IVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 234
+ T+K YWT +++G +G++G +ILRG NE I
Sbjct: 413 ------------LPDGRTLK-----------YWTAANSWGPSWGERGHFRILRGSNECDI 449
Query: 235 ESLVNG 240
ES V G
Sbjct: 450 ESFVLG 455
>gi|222424744|dbj|BAH20325.1| AT1G02305 [Arabidopsis thaliana]
Length = 293
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 70/264 (26%), Positives = 94/264 (35%), Gaps = 77/264 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
C+ G + W + G+VT + NTGC S P C EP A P P
Sbjct: 105 CNGGYPIAAWRYFKHHGVVTEECDPYFDNTGC---SHPGC--------EP-----AYPTP 148
Query: 60 KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
KC +C + N + + + K+ Y V DI E+
Sbjct: 149 KCARKCVSGN--QLWRESKHYGVSAYKVRSHPDDIMAEV--------------------- 185
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
Y NGPV +Y D YKSGVY I +A VK++GWG
Sbjct: 186 --YKNGPVEVAFTVYEDFAHYKSGVYKHITGTNIGGHA-VKLIGWGT------------- 229
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
++G YW + + + +GD G KI RG NE IE V
Sbjct: 230 --------------------SDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEHGVV 269
Query: 240 GALPKDNYGVEFGEESGERLSEEF 263
LP D V+ S + L F
Sbjct: 270 AGLPSDRNVVKGITTSDDLLVSSF 293
>gi|426221788|ref|XP_004005089.1| PREDICTED: tubulointerstitial nephritis antigen-like [Ovis aries]
Length = 362
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 68/246 (27%), Positives = 100/246 (40%), Gaps = 68/246 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAH----HSNTGCQPVSFPPCNHANYTTSEPECKTLATP 57
C G W ++ +RG+V+ + H + V PPC ++ + K AT
Sbjct: 165 CHGGRLDGAWWFLRRRGVVSDHCYPFSGHGRD--EAVPAPPC--MMHSRAMGRGKRQAT- 219
Query: 58 QPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
RC N D Y+ Y + +I +E+M+NGPV A M ++ D F Y
Sbjct: 220 -----ARCPNSYV---HANDIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLY 271
Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEE---NGRPYWT 174
+SG Y + PV L + G + +VKI GWGEE +GR
Sbjct: 272 QSGIYSHTPVS----LGRPERYRRHGTH------------SVKITGWGEETLPDGR---- 311
Query: 175 IVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 234
TVK YWT +++G +G++G +I+RG NE I
Sbjct: 312 -----------------TVK-----------YWTAANSWGPAWGERGHFRIVRGANECDI 343
Query: 235 ESLVNG 240
ES V G
Sbjct: 344 ESFVLG 349
>gi|297465285|ref|XP_887401.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 2
[Bos taurus]
gi|297472148|ref|XP_002685665.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Bos taurus]
gi|296490232|tpg|DAA32345.1| TPA: tubulointerstitial nephritis antigen-like 1-like [Bos taurus]
Length = 534
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 65/243 (26%), Positives = 98/243 (40%), Gaps = 62/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAH----HSNTGCQPVSFPPCNHANYTTSEPECKTLATP 57
C G W ++ +RG+V+ + H + V PPC ++ + K AT
Sbjct: 337 CRGGRLDGAWWFLRRRGVVSDHCYPFSGHGRD--EAVPAPPC--MMHSRAMGRGKRQAT- 391
Query: 58 QPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
RC N D Y+ Y + +I +E+M+NGPV A M ++ D F Y
Sbjct: 392 -----ARCPNSYV---HANDIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLY 443
Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
+SG Y + PV L + G + +VKI GWGEE
Sbjct: 444 QSGIYSHTPVS----LGRPERYRRHGTH------------SVKITGWGEET--------- 478
Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
+ T+K YWT +++G +G++G +I+RG NE IES
Sbjct: 479 ---------LPDGRTIK-----------YWTAANSWGPAWGERGHFRIVRGANECDIESF 518
Query: 238 VNG 240
V G
Sbjct: 519 VLG 521
>gi|18378947|ref|NP_563648.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|16226808|gb|AAL16267.1|AF428337_1 At1g02300/T6A9_10 [Arabidopsis thaliana]
gi|14532526|gb|AAK63991.1| At1g02300/T6A9_10 [Arabidopsis thaliana]
gi|25090140|gb|AAN72238.1| At1g02300/T6A9_10 [Arabidopsis thaliana]
gi|332189292|gb|AEE27413.1| putative cathepsin B-like cysteine protease [Arabidopsis thaliana]
Length = 362
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 70/264 (26%), Positives = 94/264 (35%), Gaps = 77/264 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
C+ G + W + G+VT + NTGC S P C EP A P P
Sbjct: 174 CNGGYPIAAWRYFKHHGVVTEECDPYFDNTGC---SHPGC--------EP-----AYPTP 217
Query: 60 KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
KC +C + N + + + K+ Y V DI E+
Sbjct: 218 KCARKCVSGN--QLWRESKHYGVSAYKVRSHPDDIMAEV--------------------- 254
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
Y NGPV +Y D YKSGVY I +A VK++GWG
Sbjct: 255 --YKNGPVEVAFTVYEDFAHYKSGVYKHITGTNIGGHA-VKLIGWGT------------- 298
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
++G YW + + + +GD G KI RG NE IE V
Sbjct: 299 --------------------SDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEHGVV 338
Query: 240 GALPKDNYGVEFGEESGERLSEEF 263
LP D V+ S + L F
Sbjct: 339 AGLPSDRNVVKGITTSDDLLVSSF 362
>gi|157116531|ref|XP_001658537.1| tubulointerstitial nephritis antigen [Aedes aegypti]
gi|108883447|gb|EAT47672.1| AAEL001232-PA [Aedes aegypti]
Length = 462
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 53/168 (31%), Positives = 74/168 (44%), Gaps = 52/168 (30%)
Query: 77 DKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSD 136
+ Y+ + +N+E DI EI K+GPV A M ++ D FSYKSG Y +
Sbjct: 306 NMYKMGPAFSLNNE-TDIMLEIKKHGPVQAIMRVHRDFFSYKSGIYRHS----------- 353
Query: 137 IFSYKSGVYAVSASAEIVA-YATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKL 195
A S SA+ A Y +V+++GWGEE Y K
Sbjct: 354 ---------AASTSADQRAGYHSVRLIGWGEERH-------------------GYEVTK- 384
Query: 196 IGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
YW V+++G +G+ G +ILRG NE IES V +LP
Sbjct: 385 ----------YWIAVNSWGTWWGENGRFRILRGSNECEIESYVLASLP 422
>gi|357116869|ref|XP_003560199.1| PREDICTED: cathepsin B-like [Brachypodium distachyon]
Length = 350
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 64/254 (25%), Positives = 91/254 (35%), Gaps = 79/254 (31%)
Query: 2 CSSGISSSTWVWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
C G S W ++ + G+VT + GC+ P C EP A P P
Sbjct: 166 CDGGYPISAWQYLVENGVVTDECDPYFDQVGCK---HPGC--------EP-----AYPTP 209
Query: 60 KCHTRCTNDNYGRGFFQDKYRFK-RYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
C +C N +Q+K F Y VN + DI E+ KNGPV +Y D YK
Sbjct: 210 ACEKKCKVQNQ---VWQEKKHFSINAYRVNSDPHDIMAEVYKNGPVEVAFTVYEDFAHYK 266
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
SG Y + + E++ VK++GWG
Sbjct: 267 SGVYEH------------------------ITGEMMGGHAVKLIGWGT------------ 290
Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
+G+ YW + + + +GD G KI+RG+NE IE V
Sbjct: 291 ---------------------SADGKDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEDV 329
Query: 239 NGALPKDNYGVEFG 252
+P V G
Sbjct: 330 VAGMPSTKNTVRTG 343
>gi|94958151|gb|ABF47216.1| cathepsin B [Nicotiana benthamiana]
Length = 356
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 60/244 (24%), Positives = 88/244 (36%), Gaps = 77/244 (31%)
Query: 2 CSSGISSSTWVWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
C G W + ++G+VT + N GC S P C EP A P P
Sbjct: 168 CDGGYPLQAWKYFVRKGVVTDECDPYFDNEGC---SHPGC--------EP-----AYPTP 211
Query: 60 KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
KCH +C N + + K+ Y ++ + I E+ KNGPV + +Y D YKS
Sbjct: 212 KCHRKCVKQNLL--WSKSKHFGVNAYMISSDPHSIMTELYKNGPVEVSFTVYEDFAHYKS 269
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
G Y + + +++ VK++GWG
Sbjct: 270 GVYKH------------------------VTGDVMGGHAVKLIGWGT------------- 292
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
E+G YW + + + +GD G KI RG +E IE V
Sbjct: 293 --------------------SEDGEDYWLLANQWNRGWGDDGYFKIRRGTDECEIEDEVV 332
Query: 240 GALP 243
LP
Sbjct: 333 AGLP 336
>gi|389608479|dbj|BAM17849.1| tubulointerstitial nephritis antigen [Papilio xuthus]
Length = 429
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 55/119 (46%), Gaps = 32/119 (26%)
Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
+GPV A M +Y D F Y+ GVY S
Sbjct: 334 SGPVQAVMTVYQDFFHYRDGVYRRS--------------------------------YHG 361
Query: 184 SAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 242
+ E+ + +V++IGWGE+ G YW + +++G Q+G+ G +I RG NEA IES V L
Sbjct: 362 NNELKGFHSVRIIGWGEDRGDRYWVVANSWGRQWGENGYFRIARGSNEADIESFVVTGL 420
>gi|358341865|dbj|GAA49436.1| cathepsin B-like cysteine proteinase [Clonorchis sinensis]
Length = 515
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 39/127 (30%), Positives = 58/127 (45%), Gaps = 3/127 (2%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W + G+VTGG+ + +GC+ FP C+H + P C + P+C
Sbjct: 150 CDGGFPAQAWNYWSTDGIVTGGSKENPSGCRSYPFPSCSH-DERGRHPLCPSEIYHTPRC 208
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C D + + + Y V D +I EIM NGPV A +Y D Y+ G
Sbjct: 209 TKKCDTDKL--HYSAELTKANSSYNVLDSDREIMMEIMNNGPVEAVFDVYEDFLQYEKGI 266
Query: 122 YGNGPVV 128
Y N V+
Sbjct: 267 YFNAWVL 273
>gi|328871084|gb|EGG19455.1| peptidase C1A family protein [Dictyostelium fasciculatum]
Length = 352
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 50/172 (29%), Positives = 74/172 (43%), Gaps = 39/172 (22%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + + ++ K+G+V+ C P + P C A +P + TPQ C
Sbjct: 136 CQGGDAYTAMKFIQKKGIVS-------NDCLPYTIPTCAPAQ----QPCLNFVDTPQ--C 182
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C+N +Y + QD + Y +N V IQQEIM NGPV A +Y
Sbjct: 183 VEKCSNASYT--YAQDLHFIDGVYSMNPTVNAIQQEIMTNGPVEACFEVYE--------- 231
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYW 173
D YKSGVY + ++ + VK++GWG +N YW
Sbjct: 232 --------------DFLGYKSGVYQHTTGKDLGGHC-VKMIGWGTQNNELYW 268
>gi|193202653|ref|NP_492593.2| Protein F26E4.3 [Caenorhabditis elegans]
gi|205371857|sp|P90850.3|YCF2E_CAEEL RecName: Full=Uncharacterized peptidase C1-like protein F26E4.3;
Flags: Precursor
gi|166157004|emb|CAB03007.2| Protein F26E4.3 [Caenorhabditis elegans]
Length = 452
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 53/171 (30%), Positives = 74/171 (43%), Gaps = 48/171 (28%)
Query: 76 QDKYRFKRY--YWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYL 133
QD FK Y V+ DIQ E+M NGPV A ++ D F Y G +Y
Sbjct: 307 QDSTAFKMTPPYKVSSREEDIQTELMTNGPVQATFVVHEDFFMYAGG----------VYQ 356
Query: 134 YSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATV 193
+SD+ + K AS+ Y +V+++GWG V ++T
Sbjct: 357 HSDLAAQK------GASSVAEGYHSVRVLGWG----------------------VDHSTG 388
Query: 194 KLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
K I YW +++G Q+G+ G K+LRG N IES V GA K
Sbjct: 389 KPI--------KYWLCANSWGTQWGEDGYFKVLRGENHCEIESFVIGAWGK 431
>gi|414886872|tpg|DAA62886.1| TPA: hypothetical protein ZEAMMB73_253741 [Zea mays]
gi|414886873|tpg|DAA62887.1| TPA: hypothetical protein ZEAMMB73_253741 [Zea mays]
Length = 208
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 62/244 (25%), Positives = 86/244 (35%), Gaps = 77/244 (31%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFP-PCNHANYTTSEPECKTLATPQPK 60
C G W + + G+VT C P P C H P C+ A P PK
Sbjct: 22 CDGGYPIEAWRYFVQNGVVT-------DECDPYFDPVGCKH-------PGCEP-AYPTPK 66
Query: 61 CHTRCTNDNYGRGFFQDKYRFK-RYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
C +C N +Q+K F Y +N + DI E+ KNGPV +Y D YKS
Sbjct: 67 CEKKCKEQNQ---VWQEKKHFSIDAYRINSDPHDIMAEVYKNGPVEVAFTVYEDFAHYKS 123
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
G Y + + I+ VK++GWG +
Sbjct: 124 GVYKH------------------------ITGGIMGGHAVKLIGWGTSDA---------- 149
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
G YW + + + +GD G KI+RG+NE IE V
Sbjct: 150 -----------------------GEDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEGVV 186
Query: 240 GALP 243
+P
Sbjct: 187 AGMP 190
>gi|395856781|ref|XP_003800797.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
[Otolemur garnettii]
Length = 436
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 65/246 (26%), Positives = 98/246 (39%), Gaps = 68/246 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATP---- 57
C G W ++ +RG+V+ C P S + P C + P
Sbjct: 239 CHGGRLDGAWWFLRRRGVVS-------DHCYPFSG---QERDKAGPAPLCMMHSRPMGRG 288
Query: 58 QPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
+ + RC N+ D Y+ Y + +I +E+M+NGPV A M ++ D F Y
Sbjct: 289 KRQATARCPNNQVQA---NDIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLY 345
Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEE---NGRPYWT 174
+SG Y + PV L + G + +VKI GWGEE +GR
Sbjct: 346 QSGIYSHTPVS----LQRPEGYRRHGTH------------SVKITGWGEETLPDGR---- 385
Query: 175 IVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 234
T+K YWT +++G +G++G +I+RG NE I
Sbjct: 386 -----------------TLK-----------YWTAANSWGPAWGERGHFRIVRGANECDI 417
Query: 235 ESLVNG 240
ES V G
Sbjct: 418 ESFVLG 423
>gi|297282815|ref|XP_002802331.1| PREDICTED: tubulointerstitial nephritis antigen-like [Macaca
mulatta]
Length = 322
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 65/251 (25%), Positives = 97/251 (38%), Gaps = 78/251 (31%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVS---------FPPCNHANYTTSEPECK 52
C G W ++ +RG+V+ C P S PPC +
Sbjct: 125 CRGGRLDGAWWFLRRRGVVS-------DHCYPFSGRERDEAGPAPPCMM--------HSR 169
Query: 53 TLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYS 112
+ + + RC N + D Y+ Y + +I +E+M+NGPV A M ++
Sbjct: 170 AMGRGKRQATARCPNSHVNN---NDIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHE 226
Query: 113 DIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEE---NG 169
D F YK G Y + PV L + G + +VKI GWGEE +G
Sbjct: 227 DFFLYKGGIYSHTPVS----LGRPERYRRHGTH------------SVKITGWGEETLPDG 270
Query: 170 RPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 229
R T+K YWT +++G +G++G +I+RG
Sbjct: 271 R---------------------TLK-----------YWTAANSWGPAWGERGHFRIVRGV 298
Query: 230 NEAIIESLVNG 240
NE IES V G
Sbjct: 299 NECDIESFVLG 309
>gi|603044|gb|AAA96832.1| cysteine protease homolog, partial [Strongyloides ratti]
Length = 202
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 51/180 (28%), Positives = 72/180 (40%), Gaps = 29/180 (16%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W V + G+ TGG GC+P +F PC EC + P+C
Sbjct: 44 CRGGANIRAWKHVMRNGVCTGGPCGYKYGCRPYAFHPCGVHKDQVYYGECPRKSYDTPEC 103
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C + +D+Y Y+V ++ I +EIM+ GPV G
Sbjct: 104 RKICQRGCIQLQYGKDRYYAASAYFVKNDTKAIMREIMRGGPV--------------HGA 149
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWG---EENGR--PYWTIV 176
Y Y+D YK GVY +A E ++KI+GWG NG PYW +
Sbjct: 150 YDT---------YTDFRLYKGGVYEHTA-GERTGGHSIKIMGWGNYKHPNGTVIPYWLVA 199
>gi|383861394|ref|XP_003706171.1| PREDICTED: tubulointerstitial nephritis antigen-like [Megachile
rotundata]
Length = 442
Score = 65.5 bits (158), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 48/153 (31%), Positives = 69/153 (45%), Gaps = 48/153 (31%)
Query: 92 ADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASA 151
DI QEI+ +GPV A M +Y D FSY+SG Y + V A +Y SD
Sbjct: 336 TDIMQEILTSGPVQATMRVYQDFFSYESGVYKHS-VTAELY-ESD--------------- 378
Query: 152 EIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVS 211
Y +V+I+GWGEE Y+ + + YW + +
Sbjct: 379 ----YHSVRIIGWGEEPP--------TYSRNTPLK-------------------YWLVAN 407
Query: 212 TFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
++G+Q+G+ G +I +G NE IES V G K
Sbjct: 408 SWGQQWGENGLFRIQKGTNECEIESFVLGVWAK 440
>gi|414886870|tpg|DAA62884.1| TPA: cathepsin B-like cysteine proteinase 3 [Zea mays]
Length = 347
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 62/244 (25%), Positives = 86/244 (35%), Gaps = 77/244 (31%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFP-PCNHANYTTSEPECKTLATPQPK 60
C G W + + G+VT C P P C H P C+ A P PK
Sbjct: 161 CDGGYPIEAWRYFVQNGVVT-------DECDPYFDPVGCKH-------PGCEP-AYPTPK 205
Query: 61 CHTRCTNDNYGRGFFQDKYRFK-RYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
C +C N +Q+K F Y +N + DI E+ KNGPV +Y D YKS
Sbjct: 206 CEKKCKEQNQ---VWQEKKHFSIDAYRINSDPHDIMAEVYKNGPVEVAFTVYEDFAHYKS 262
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
G Y + + I+ VK++GWG +
Sbjct: 263 GVYKH------------------------ITGGIMGGHAVKLIGWGTSDA---------- 288
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
G YW + + + +GD G KI+RG+NE IE V
Sbjct: 289 -----------------------GEDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEGVV 325
Query: 240 GALP 243
+P
Sbjct: 326 AGMP 329
>gi|395856779|ref|XP_003800796.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
[Otolemur garnettii]
Length = 467
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 63/243 (25%), Positives = 96/243 (39%), Gaps = 62/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATP---- 57
C G W ++ +RG+V+ C P S + P C + P
Sbjct: 270 CHGGRLDGAWWFLRRRGVVS-------DHCYPFSG---QERDKAGPAPLCMMHSRPMGRG 319
Query: 58 QPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
+ + RC N+ D Y+ Y + +I +E+M+NGPV A M ++ D F Y
Sbjct: 320 KRQATARCPNNQVQA---NDIYQVTPAYRLGSNEKEIMKELMENGPVQALMEVHEDFFLY 376
Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
+SG Y + PV L + G + +VKI GWGEE
Sbjct: 377 QSGIYSHTPVS----LQRPEGYRRHGTH------------SVKITGWGEET--------- 411
Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
+ T+K YWT +++G +G++G +I+RG NE IES
Sbjct: 412 ---------LPDGRTLK-----------YWTAANSWGPAWGERGHFRIVRGANECDIESF 451
Query: 238 VNG 240
V G
Sbjct: 452 VLG 454
>gi|291408920|ref|XP_002720687.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Oryctolagus
cuniculus]
Length = 467
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 61/243 (25%), Positives = 96/243 (39%), Gaps = 62/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAH----HSNTGCQPVSFPPCNHANYTTSEPECKTLATP 57
C G W ++ +RG+V+ + H P PPC + + +
Sbjct: 270 CRGGRLDGAWWFLRRRGVVSDHCYPFSGHEQDEAGPA--PPCMMHS--------RAMGRG 319
Query: 58 QPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
+ + RC N + D Y+ Y + +I +E+++NGPV A M ++ D F Y
Sbjct: 320 KRQATARCPNSHV---HANDIYQVTPAYRLGSNEKEIMKELLENGPVQALMEVHEDFFLY 376
Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
+ G Y + PV L + G + +VKI GWGEE
Sbjct: 377 QGGIYSHTPVS----LERPERYRRHGTH------------SVKITGWGEET--------- 411
Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
+ T+K YWT +++G +G++G +ILRG NE IES
Sbjct: 412 ---------LPDGRTLK-----------YWTAANSWGPAWGERGHFRILRGTNECDIESF 451
Query: 238 VNG 240
V G
Sbjct: 452 VLG 454
>gi|159111216|ref|XP_001705840.1| Hypothetical protein GL50803_113303 [Giardia lamblia ATCC 50803]
gi|157433930|gb|EDO78166.1| hypothetical protein GL50803_113303 [Giardia lamblia ATCC 50803]
Length = 804
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 44/139 (31%), Positives = 61/139 (43%), Gaps = 35/139 (25%)
Query: 109 YLYSDIFSYKSGKYGNGPVVANMYLYSDIFSY-KSGVYAVSASAEIVAYATVKIVGWGEE 167
Y S + + Y NGP+ +MYL +D S K G+Y+ + ++ G G
Sbjct: 184 YRLSGVDAMMRDIYQNGPIAVSMYLANDFPSKDKKGIYSSGPNTKL---------GGGH- 233
Query: 168 NGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILR 227
V ++GWGEENG PYW +T+G +GD+G KI R
Sbjct: 234 ------------------------AVMIVGWGEENGVPYWDCANTYGTNWGDQGYFKIKR 269
Query: 228 GRNEAIIESLVNGALPKDN 246
G NE IE+ ALP D
Sbjct: 270 GSNELKIETWPGSALPIDT 288
>gi|159120206|ref|XP_001710319.1| Cathepsin B-like cysteine proteinase 3 precursor [Giardia lamblia
ATCC 50803]
gi|157438437|gb|EDO82645.1| Cathepsin B-like cysteine proteinase 3 precursor [Giardia lamblia
ATCC 50803]
Length = 804
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 44/139 (31%), Positives = 61/139 (43%), Gaps = 35/139 (25%)
Query: 109 YLYSDIFSYKSGKYGNGPVVANMYLYSDIFSY-KSGVYAVSASAEIVAYATVKIVGWGEE 167
Y S + + Y NGP+ +MYL +D S K G+Y+ + ++ G G
Sbjct: 184 YRLSGVDAMMRDIYQNGPIAVSMYLANDFPSKDKKGIYSSGPNTKL---------GGGH- 233
Query: 168 NGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILR 227
V ++GWGEENG PYW +T+G +GD+G KI R
Sbjct: 234 ------------------------AVMIVGWGEENGVPYWDCANTYGTNWGDQGYFKIKR 269
Query: 228 GRNEAIIESLVNGALPKDN 246
G NE IE+ ALP D
Sbjct: 270 GSNELKIETWPGSALPIDT 288
>gi|226497010|ref|NP_001150152.1| LOC100283781 precursor [Zea mays]
gi|195637168|gb|ACG38052.1| cathepsin B-like cysteine proteinase 3 precursor [Zea mays]
Length = 347
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 62/244 (25%), Positives = 86/244 (35%), Gaps = 77/244 (31%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFP-PCNHANYTTSEPECKTLATPQPK 60
C G W + + G+VT C P P C H P C+ A P PK
Sbjct: 161 CDGGYPIEAWRYFVQNGVVT-------DECDPYFDPVGCKH-------PGCEP-AYPTPK 205
Query: 61 CHTRCTNDNYGRGFFQDKYRFK-RYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
C +C N +Q+K F Y +N + DI E+ KNGPV +Y D YKS
Sbjct: 206 CEKKCKEQNQ---VWQEKKHFSIDAYRINSDPHDIMAEVYKNGPVEVAFTVYEDFAHYKS 262
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
G Y + + I+ VK++GWG +
Sbjct: 263 GVYKH------------------------ITGGIMGGHAVKLIGWGTSDA---------- 288
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
G YW + + + +GD G KI+RG+NE IE V
Sbjct: 289 -----------------------GEDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEGVV 325
Query: 240 GALP 243
+P
Sbjct: 326 AGMP 329
>gi|508264|gb|AAA96833.1| cysteine protease, partial [Caenorhabditis elegans]
Length = 198
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 48/177 (27%), Positives = 75/177 (42%), Gaps = 25/177 (14%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G W K+G VTGG++ TGC+P +PPC H T C + P +
Sbjct: 44 CNGGYPIEAWRHYVKKGYVTGGSYQDKTGCKPYPYPPCEHHVNGTHYKPCPSNMYPTGQN 103
Query: 62 HTRCTNDNYGRGFFQD-KYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
+ + +D +R + + E A I + I +G + + ++ D F + SG
Sbjct: 104 ANALGKLDIALTYHKDLHFRTILHTPASKEAAGIPKGIKTHGQLRGGITVFED-FEHYSG 162
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
GVY +A A + +A VK++GWG +NG PYW I
Sbjct: 163 ----------------------GVYVHTAGASLGGHA-VKMLGWGVDNGTPYWLIAN 196
>gi|162813|gb|AAA30434.1| cathepsin B, partial [Bos taurus]
Length = 122
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 38/122 (31%), Positives = 53/122 (43%), Gaps = 35/122 (28%)
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y NGPV +YSD YKSGVY
Sbjct: 31 YKNGPVEGAFSVYSDFLLYKSGVYQ----------------------------------- 55
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
S EI+ ++++GWG ENG PYW + +++ +GD G KILRG++ IES +
Sbjct: 56 HVSGEIMGGHAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAG 115
Query: 242 LP 243
+P
Sbjct: 116 MP 117
>gi|297843028|ref|XP_002889395.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp.
lyrata]
gi|297335237|gb|EFH65654.1| hypothetical protein ARALYDRAFT_887368 [Arabidopsis lyrata subsp.
lyrata]
Length = 360
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 66/246 (26%), Positives = 88/246 (35%), Gaps = 77/246 (31%)
Query: 2 CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
C+ G + W + G+VT + NTGC S P C EP A P P
Sbjct: 172 CNGGYPIAAWRYFKHHGVVTEECDPYFDNTGC---SHPGC--------EP-----AYPTP 215
Query: 60 KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
KC +C + N + + + K+ Y V DI E+
Sbjct: 216 KCARKCVSGN--QLWRESKHYGVSAYKVRSHPDDIMAEV--------------------- 252
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
Y NGPV +Y D YKSGVY I +A VK++GWG
Sbjct: 253 --YKNGPVEVAFTVYEDFAHYKSGVYKHITGTNIGGHA-VKLIGWGT------------- 296
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
++G YW + + + +GD G KI RG NE IE V
Sbjct: 297 --------------------SDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEHGVV 336
Query: 240 GALPKD 245
LP D
Sbjct: 337 AGLPSD 342
>gi|402853710|ref|XP_003891533.1| PREDICTED: tubulointerstitial nephritis antigen-like [Papio anubis]
Length = 362
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 66/248 (26%), Positives = 97/248 (39%), Gaps = 72/248 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVS---------FPPCNHANYTTSEPECK 52
C G W ++ +RG+V+ C P S PPC ++ + K
Sbjct: 165 CRGGRLDGAWWFLRRRGVVS-------DHCYPFSGRERDEAGPAPPC--MMHSRAMGRGK 215
Query: 53 TLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYS 112
AT RC N + D Y+ Y + +I +E+M+NGPV A M ++
Sbjct: 216 RQAT------ARCPNSHVNN---NDIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHE 266
Query: 113 DIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPY 172
D F YK G Y + PV L + G + +VKI GWGEE
Sbjct: 267 DFFLYKGGIYSHTPVS----LGRPERYRRHGTH------------SVKITGWGEET---- 306
Query: 173 WTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 232
+ T+K YWT +++G +G++G +I+RG NE
Sbjct: 307 --------------LPDGRTLK-----------YWTAANSWGPAWGERGHFRIVRGVNEC 341
Query: 233 IIESLVNG 240
IES V G
Sbjct: 342 DIESFVLG 349
>gi|158285208|ref|XP_001687862.1| AGAP007684-PA [Anopheles gambiae str. PEST]
gi|158285210|ref|XP_308187.4| AGAP007684-PB [Anopheles gambiae str. PEST]
gi|157019881|gb|EDO64511.1| AGAP007684-PA [Anopheles gambiae str. PEST]
gi|157019882|gb|EAA04576.4| AGAP007684-PB [Anopheles gambiae str. PEST]
Length = 463
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 52/179 (29%), Positives = 77/179 (43%), Gaps = 50/179 (27%)
Query: 79 YRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIF 138
Y+ + +N+E DI EI G V A M +Y D FSY+SG Y +
Sbjct: 310 YKMGPAFSLNNET-DIMAEIKDRGTVQAIMRVYRDFFSYRSGIYRHSA------------ 356
Query: 139 SYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGW 198
A + + E AY +V+++GWGEE V Y VK
Sbjct: 357 -------AATPAEERSAYHSVRLIGWGEER-------------------VGYDVVK---- 386
Query: 199 GEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDNYGVEFGEESGE 257
YW ++++G+ +G+ G +ILRG NE IES V + P + V+ + GE
Sbjct: 387 -------YWIAINSWGQWWGENGRFRILRGSNECDIESYVLASNPYVHEHVQAIRKVGE 438
>gi|21699|emb|CAA46811.1| cathepsin B [Triticum aestivum]
Length = 353
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 61/244 (25%), Positives = 89/244 (36%), Gaps = 75/244 (30%)
Query: 2 CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
C+ G S W + + G+VT + TGCQ P C+ A P P
Sbjct: 165 CNGGYPISAWRYFRRSGVVTEECDPYFDQTGCQ---------------HPGCEP-AYPTP 208
Query: 60 KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
KC +C +N + + ++K+ Y V+ DI E+ KNGPV + Y I
Sbjct: 209 KCQRKCKVEN--QAWKENKHFSVNAYRVHSNPHDIMAEVYKNGPVEV-AFTYCQIL---- 261
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
D YKSGVY + ++ VK++GWG +
Sbjct: 262 ----------------DFAHYKSGVYK-HITGGVMGGHAVKLIGWGTSDA---------- 294
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
G YW + + + +GD G KI+RG NE IE V
Sbjct: 295 -----------------------GEDYWLLANQWNRGWGDDGYFKIIRGENECGIEGDVT 331
Query: 240 GALP 243
+P
Sbjct: 332 AGMP 335
>gi|156708110|gb|ABU93313.1| cathepsin B4 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 51/186 (27%), Positives = 73/186 (39%), Gaps = 65/186 (34%)
Query: 59 PKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
P C ++C N + R+K W + E ++I Q +M+ GP+ +YSD +Y+
Sbjct: 160 PACPSKCDNGS-------QIIRYKLQSWKSVEPSEIMQALMEYGPLSCGFMVYSDFMNYR 212
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
SG ++ +KSG + + V + GWG ENG PYW +
Sbjct: 213 SG----------------VYQHKSGYFEGGHA--------VLLCGWGVENGLPYWLVQN- 247
Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
WG P W G+KG KILRG N IES V
Sbjct: 248 ------------------SWG-----PAW----------GEKGFFKILRGSNHCEIESYV 274
Query: 239 NGALPK 244
+PK
Sbjct: 275 TLGVPK 280
>gi|324512900|gb|ADY45327.1| Peptidase C1-like protein [Ascaris suum]
Length = 450
Score = 64.3 bits (155), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 49/180 (27%), Positives = 73/180 (40%), Gaps = 46/180 (25%)
Query: 79 YRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIF 138
YR Y V+ DI EI+ NGPV A +Y D F Y G +Y + D+
Sbjct: 312 YRMTPPYRVSSREQDIMTEIITNGPVQATFLVYEDFFMYSGG----------VYQHLDLH 361
Query: 139 SYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGW 198
+K ++ Y +V+I+GWGE+ Y+T +
Sbjct: 362 EHK------EEERKVQGYHSVRIIGWGED----------------------YSTGPQV-- 391
Query: 199 GEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDNYGVEFGEESGER 258
YW +++G ++G+ G +ILRG N IES V GA K F + +R
Sbjct: 392 ------KYWLAANSWGNEWGEDGLFRILRGENHCEIESFVIGAWGKGAKKRRFKVQKLQR 445
>gi|308157698|gb|EFO60800.1| Cathepsin B-like cysteine proteinase 3 precursor [Giardia lamblia
P15]
Length = 627
Score = 64.3 bits (155), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 44/139 (31%), Positives = 59/139 (42%), Gaps = 35/139 (25%)
Query: 109 YLYSDIFSYKSGKYGNGPVVANMYLYSDIFSY-KSGVYAVSASAEIVAYATVKIVGWGEE 167
Y S + + Y NGP+ +MYL +D S K G+Y+ + ++ V IVG
Sbjct: 184 YRLSGVDAMMRDIYQNGPIAVSMYLANDFPSKDKKGIYSSGPNTKLRGGHAVMIVG---- 239
Query: 168 NGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILR 227
WGEENG PYW +T+G +GD+G KI R
Sbjct: 240 ------------------------------WGEENGVPYWDCANTYGTNWGDQGYFKIKR 269
Query: 228 GRNEAIIESLVNGALPKDN 246
G NE IE+ ALP D
Sbjct: 270 GSNELKIETWPGSALPIDT 288
>gi|14290553|gb|AAH09048.1| TINAGL1 protein [Homo sapiens]
Length = 218
Score = 64.3 bits (155), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 66/248 (26%), Positives = 97/248 (39%), Gaps = 72/248 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVS---------FPPCNHANYTTSEPECK 52
C G W ++ +RG+V+ C P S PPC ++ + K
Sbjct: 21 CRGGRLDGAWWFLRRRGVVS-------DHCYPFSGRERDEAGPAPPC--MMHSRAMGRGK 71
Query: 53 TLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYS 112
AT C N+N D Y+ Y + +I +E+M+NGPV A M ++
Sbjct: 72 RQATAH--CPNSYVNNN-------DIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHE 122
Query: 113 DIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPY 172
D F YK G Y + PV L + G + +VKI GWGEE
Sbjct: 123 DFFLYKGGIYSHTPVS----LGRPERYRRHGTH------------SVKITGWGEET---- 162
Query: 173 WTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 232
+ T+K YWT +++G +G++G +I+RG NE
Sbjct: 163 --------------LPDGRTLK-----------YWTAANSWGPAWGERGHFRIVRGVNEC 197
Query: 233 IIESLVNG 240
IES V G
Sbjct: 198 DIESFVLG 205
>gi|48762495|dbj|BAD23817.1| cathepsin B-S [Tuberaphis styraci]
Length = 99
Score = 63.9 bits (154), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 41/120 (34%), Positives = 59/120 (49%), Gaps = 25/120 (20%)
Query: 57 PQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFS 116
P + H +C YG+ QD+Y+ K Y +N + I+Q++M GPV A+ +Y D FS
Sbjct: 5 PMERNH-QCPKTCYGKTTVQDRYKTKNEYVINS-IETIEQDLMTYGPVEASFDVYDD-FS 61
Query: 117 YKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
YKSG+Y + A+ ++KI+GWGEENG PYW V
Sbjct: 62 ----------------------VYKSGIYRKTPKAKYEGGHSIKIIGWGEENGTPYWLAV 99
>gi|6165885|gb|AAF04727.1|AF101239_1 cathepsin B-like cysteine proteinase [Ipomoea batatas]
Length = 352
Score = 63.9 bits (154), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 64/260 (24%), Positives = 91/260 (35%), Gaps = 77/260 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
C G + W + + G+VT + TGC S P C EP A P P
Sbjct: 164 CDGGYPIAAWQYFKRTGVVTSECDPYFDQTGC---SHPGC--------EP-----AYPTP 207
Query: 60 KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
C +C N + + K+ Y VN + I E+
Sbjct: 208 ACEKKCVKKNLL--WSESKHFSVNAYRVNSDQHSIMTEV--------------------- 244
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
Y NGP + +Y D YKSGVY +E+ +A VK++GWG
Sbjct: 245 --YTNGPAEVSFTVYEDFAHYKSGVYKHVTGSEMGGHA-VKLIGWGT------------- 288
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
E+G YW + + + +G G KI+RG NE IE +
Sbjct: 289 --------------------SEDGEDYWLLANQWNRSWGGDGYFKIIRGTNECGIEDVTA 328
Query: 240 GALPKDNYGVEFGEESGERL 259
G N +E G + L
Sbjct: 329 GTPSTKNLDIESGVRDDDSL 348
>gi|345488309|ref|XP_001605531.2| PREDICTED: uncharacterized peptidase C1-like protein F26E4.3-like
[Nasonia vitripennis]
Length = 481
Score = 63.9 bits (154), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 46/152 (30%), Positives = 67/152 (44%), Gaps = 53/152 (34%)
Query: 92 ADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASA 151
DI QEI+ +GPV A M ++ D F Y+SG +Y++S F +
Sbjct: 372 TDIMQEILTSGPVQATMRVHRDFFHYESG----------IYVHSRPFDTRQS-------- 413
Query: 152 EIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRP--YWTI 209
Y +V+IVGWGEE PY NG+P +W +
Sbjct: 414 ---GYHSVRIVGWGEEPS-PY-----------------------------NGKPIKFWRV 440
Query: 210 VSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+++G +G+ G +I+RG NE IES V G
Sbjct: 441 ANSWGRDWGEDGYFRIVRGNNECEIESFVLGV 472
>gi|308161545|gb|EFO63987.1| Cathepsin B-like cysteine proteinase [Giardia lamblia P15]
Length = 804
Score = 63.9 bits (154), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 44/139 (31%), Positives = 59/139 (42%), Gaps = 35/139 (25%)
Query: 109 YLYSDIFSYKSGKYGNGPVVANMYLYSDIFSY-KSGVYAVSASAEIVAYATVKIVGWGEE 167
Y S + + Y NGP+ +MYL +D S K G+Y+ + ++ V IVG
Sbjct: 184 YRLSGVDAMMRDIYQNGPIAVSMYLANDFPSKDKKGIYSSGPNTKLRGGHAVMIVG---- 239
Query: 168 NGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILR 227
WGEENG PYW +T+G +GD+G KI R
Sbjct: 240 ------------------------------WGEENGVPYWDCANTYGTNWGDQGYFKIKR 269
Query: 228 GRNEAIIESLVNGALPKDN 246
G NE IE+ ALP D
Sbjct: 270 GSNELKIETWPGSALPIDT 288
>gi|355557764|gb|EHH14544.1| hypothetical protein EGK_00488 [Macaca mulatta]
gi|355745087|gb|EHH49712.1| hypothetical protein EGM_00421 [Macaca fascicularis]
gi|384948750|gb|AFI37980.1| tubulointerstitial nephritis antigen-like isoform 1 precursor
[Macaca mulatta]
gi|384948752|gb|AFI37981.1| tubulointerstitial nephritis antigen-like isoform 1 precursor
[Macaca mulatta]
gi|387540550|gb|AFJ70902.1| tubulointerstitial nephritis antigen-like isoform 1 precursor
[Macaca mulatta]
Length = 467
Score = 63.9 bits (154), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 66/248 (26%), Positives = 97/248 (39%), Gaps = 72/248 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVS---------FPPCNHANYTTSEPECK 52
C G W ++ +RG+V+ C P S PPC ++ + K
Sbjct: 270 CRGGRLDGAWWFLRRRGVVS-------DHCYPFSGRERDEAGPAPPC--MMHSRAMGRGK 320
Query: 53 TLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYS 112
AT RC N + D Y+ Y + +I +E+M+NGPV A M ++
Sbjct: 321 RQAT------ARCPNSHVNN---NDIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHE 371
Query: 113 DIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPY 172
D F YK G Y + PV L + G + +VKI GWGEE
Sbjct: 372 DFFLYKGGIYSHTPVS----LGRPERYRRHGTH------------SVKITGWGEET---- 411
Query: 173 WTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 232
+ T+K YWT +++G +G++G +I+RG NE
Sbjct: 412 --------------LPDGRTLK-----------YWTAANSWGPAWGERGHFRIVRGVNEC 446
Query: 233 IIESLVNG 240
IES V G
Sbjct: 447 DIESFVLG 454
>gi|195154396|ref|XP_002018108.1| GL16940 [Drosophila persimilis]
gi|194113904|gb|EDW35947.1| GL16940 [Drosophila persimilis]
Length = 433
Score = 63.9 bits (154), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 65/243 (26%), Positives = 89/243 (36%), Gaps = 76/243 (31%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W ++HK+G+V + C P YT CK +
Sbjct: 255 CEGGHLDAAWRYLHKKGVV-------DESCYP----------YTQHRDTCKIRHNSRSLK 297
Query: 62 HTRCT-NDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C + N R F Y Y +N E +DI EI +GPV A M +Y D FSY SG
Sbjct: 298 ANGCRPSANVDRDSF---YTVGPAYTLNKE-SDIMAEIYHSGPVQATMRVYRDFFSYSSG 353
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
Y AN + +VK+VGWGEE+
Sbjct: 354 VYRQ--TAANR-------------------GAPTGFHSVKLVGWGEEH------------ 380
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
NG YW +++G +G++G +ILRG NE IE V
Sbjct: 381 ---------------------NGDKYWIAANSWGPWWGERGYFRILRGSNECGIEDYVLA 419
Query: 241 ALP 243
+ P
Sbjct: 420 SWP 422
>gi|125810908|ref|XP_001361665.1| GA15908 [Drosophila pseudoobscura pseudoobscura]
gi|54636841|gb|EAL26244.1| GA15908 [Drosophila pseudoobscura pseudoobscura]
Length = 433
Score = 63.9 bits (154), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 65/243 (26%), Positives = 89/243 (36%), Gaps = 76/243 (31%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W ++HK+G+V + C P YT CK +
Sbjct: 255 CEGGHLDAAWRYLHKKGVV-------DESCYP----------YTQHRDTCKIRHNSRSLK 297
Query: 62 HTRCT-NDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C + N R F Y Y +N E +DI EI +GPV A M +Y D FSY SG
Sbjct: 298 ANGCRPSANVDRDSF---YTVGPAYTLNKE-SDIMAEIYHSGPVQATMRVYRDFFSYSSG 353
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
Y AN + +VK+VGWGEE+
Sbjct: 354 VYRQ--TAANR-------------------GAPTGFHSVKLVGWGEEH------------ 380
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
NG YW +++G +G++G +ILRG NE IE V
Sbjct: 381 ---------------------NGDKYWIAANSWGPWWGERGYFRILRGSNECGIEDYVLA 419
Query: 241 ALP 243
+ P
Sbjct: 420 SWP 422
>gi|170030062|ref|XP_001842909.1| cathepsin B-like thiol protease [Culex quinquefasciatus]
gi|167865915|gb|EDS29298.1| cathepsin B-like thiol protease [Culex quinquefasciatus]
Length = 288
Score = 63.9 bits (154), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 62/245 (25%), Positives = 94/245 (38%), Gaps = 74/245 (30%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G T+ + K GL +GG +HS GC+P F KC
Sbjct: 115 CDGGYVHKTFDYWVKYGLTSGGPYHSGQGCKPYPFGGATQD------------VNIVLKC 162
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWV--NDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
+C Y + QD Y + DE A ++ EI +NGP+V + +Y D F Y+S
Sbjct: 163 DRQC-QAGYPLTYSQDLKHGASSYILPWGDENA-MKAEIYQNGPIVTSFDVYGDFFQYRS 220
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
G ++ + +G Y S + V+++GWG ENG
Sbjct: 221 G----------------VYRHVTGAYKGSHA--------VRVIGWGVENG---------- 246
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
VK YW +++ E++G+ G KI+RG N +E +
Sbjct: 247 -------------VK-----------YWLCANSWNERWGENGFFKIVRGENHVGVEDISY 282
Query: 240 GALPK 244
LPK
Sbjct: 283 AGLPK 287
>gi|10803435|emb|CAC13130.1| putative cathepsin B.4 [Ostertagia ostertagi]
Length = 194
Score = 63.5 bits (153), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 39/121 (32%), Positives = 55/121 (45%), Gaps = 2/121 (1%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + + G+VTGG + C+P PC H EC A P+C
Sbjct: 43 CDGGWPIKAWQFFAREGVVTGGNYGRQGCCRPYEITPCGHHGREPYYGECYDDAQ-TPRC 101
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C Y + +DK ++ Y + + V IQ+EIM +GPVVA +Y D Y G
Sbjct: 102 KRKC-QSGYKTTYKKDKRYGRKAYQLPNSVKAIQREIMMHGPVVAGYTVYEDFSYYTKGI 160
Query: 122 Y 122
Y
Sbjct: 161 Y 161
>gi|332374788|gb|AEE62535.1| unknown [Dendroctonus ponderosae]
Length = 328
Score = 63.5 bits (153), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 70/246 (28%), Positives = 104/246 (42%), Gaps = 75/246 (30%)
Query: 2 CSSGISSSTWV-WVHKRGLVTGGAHHS-NTGCQPVSFPPCN-HANYTTSEPECKTLATPQ 58
C+ G + W W + G+VTGG + + GC+ C+ H N +C+ +
Sbjct: 149 CNGGWPAVAWSDWTN--GIVTGGLYGALEQGCKSYFLEGCDDHPN------KCRNYVS-T 199
Query: 59 PKCHTRCTNDN-YGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
P C +C + Y + Q+ Y Y +E IQ EIM NGPV A M +Y D Y
Sbjct: 200 PACVEQCDEPSLYYKA--QETYGQTPYEIQGEE--QIQYEIMTNGPVEATMDVYVDFAQY 255
Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
+SG Y L +D Y+ G VKI+GWG E+G
Sbjct: 256 QSGIY---------QLTTD--EYEGG-------------HAVKILGWGVEDG-------- 283
Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
VK YW + +++ E++G+ G +I+RGR+E IES
Sbjct: 284 ---------------VK-----------YWLVANSWNERWGENGLFRIIRGRDEVGIEST 317
Query: 238 VNGALP 243
++ ALP
Sbjct: 318 IDAALP 323
>gi|395730851|ref|XP_003775799.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Pongo
abelii]
Length = 362
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 64/248 (25%), Positives = 96/248 (38%), Gaps = 72/248 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVS---------FPPCNHANYTTSEPECK 52
C G W ++ +RG+V+ C P S PPC + + +
Sbjct: 165 CRGGRLDGAWWFLRRRGVVS-------DHCYPFSGRERDEAGPTPPCMMHSRAMGRGKRQ 217
Query: 53 TLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYS 112
A+ C N+N D Y+ Y + +I +E+M+NGPV A M ++
Sbjct: 218 ATAS----CPNSHVNNN-------DIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHE 266
Query: 113 DIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPY 172
D F YK G Y + PV L + G + +VKI GWGEE
Sbjct: 267 DFFLYKGGIYSHTPVS----LGRPERYRRHGTH------------SVKITGWGEET---- 306
Query: 173 WTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 232
+ T+K YWT +++G +G++G +I+RG NE
Sbjct: 307 --------------LPDGRTLK-----------YWTAANSWGPAWGERGHFRIVRGVNEC 341
Query: 233 IIESLVNG 240
IES V G
Sbjct: 342 DIESFVLG 349
>gi|407425570|gb|EKF39488.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi
marinkellei]
Length = 333
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 59/236 (25%), Positives = 89/236 (37%), Gaps = 73/236 (30%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPK 60
C+ G WV+ GLV+ CQP FP C +H N + P TP K
Sbjct: 160 CNGGFPEVAWVFYVVHGLVS-------EYCQPYPFPSCAHHVNSSDLAPCSGDYKTP--K 210
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C++ CT + +YR Y ++ E ++E++ NGP + F
Sbjct: 211 CNSTCTE----KKIPLIRYRGNHSYVLSGE-EHFKRELLLNGP-------FEVAFE---- 254
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
+Y+D +Y GVY
Sbjct: 255 ------------VYADFMAYTGGVYK---------------------------------- 268
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
+ +++ V+L+GWGE NG PYW I +++ ++G G I RG NE IES
Sbjct: 269 -HVAGDLLGGHAVRLVGWGELNGEPYWKIANSWNHEWGMNGYFLIARGVNECGIES 323
>gi|145541902|ref|XP_001456639.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124424451|emb|CAK89242.1| unnamed protein product [Paramecium tetraurelia]
Length = 487
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 38/122 (31%), Positives = 57/122 (46%), Gaps = 25/122 (20%)
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+ NGPV+ N D Y SG+Y A + W + RP W V
Sbjct: 370 FNNGPVIMNFEPGQDFMYYSSGIYHSVAQHD-----------WSSSD-RPEWEKVD---- 413
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+V GWGEENG +W + +++GEQ+G++G ++ RG +E+ IES+ A
Sbjct: 414 ---------HSVLCYGWGEENGVKFWLLQNSWGEQWGEQGNFRMKRGTDESAIESMAEAA 464
Query: 242 LP 243
P
Sbjct: 465 DP 466
>gi|350408961|ref|XP_003488566.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bombus
impatiens]
Length = 445
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 46/149 (30%), Positives = 66/149 (44%), Gaps = 51/149 (34%)
Query: 92 ADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASA 151
DI EI+ +GPV A M +Y D F SY+SG+Y +A+
Sbjct: 338 TDIMYEILTSGPVQATMKVYQDFF-----------------------SYESGIYKHTATT 374
Query: 152 EIVA--YATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTI 209
E A Y +V+I+GWGE+ SA +K YW +
Sbjct: 375 EHYAFGYHSVRIIGWGED---------------TSAHRYRNLPIK-----------YWLV 408
Query: 210 VSTFGEQFGDKGTIKILRGRNEAIIESLV 238
V+++G+Q+G+ G +I RG NE IES V
Sbjct: 409 VNSWGQQWGESGLFRIQRGTNECDIESFV 437
>gi|355724275|gb|AES08176.1| tubulointerstitial nephritis antigen-like 1 [Mustela putorius furo]
Length = 454
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 83/201 (41%), Gaps = 64/201 (31%)
Query: 55 ATPQPKC--HTRCTNDNYGRGFFQ-------------DKYRFKRYYWVNDEVADIQQEIM 99
A P P+C H+R GRG Q D Y+ Y + +I +E+M
Sbjct: 290 AGPAPRCMMHSR----AMGRGKRQATARCPSSHAHANDIYQVTPAYRLGSNEKEIMKELM 345
Query: 100 KNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATV 159
+NGPV A M ++ D F Y+SG Y + PV L + G + +V
Sbjct: 346 ENGPVQALMEVHEDFFLYQSGIYSHTPVS----LGRPERYRRHGTH------------SV 389
Query: 160 KIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGD 219
KI GWGEE + T+K YWT +++G +G+
Sbjct: 390 KITGWGEET------------------LPDGRTLK-----------YWTAANSWGPAWGE 420
Query: 220 KGTIKILRGRNEAIIESLVNG 240
+G +I+RG NE IES V G
Sbjct: 421 RGHFRIVRGANECDIESFVLG 441
>gi|312383398|gb|EFR28501.1| hypothetical protein AND_03481 [Anopheles darlingi]
Length = 573
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 50/165 (30%), Positives = 71/165 (43%), Gaps = 50/165 (30%)
Query: 79 YRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIF 138
YR Y +N+E DI EI + G V A + +Y D FSY++G Y +
Sbjct: 419 YRMGPAYSLNNET-DIMTEIKERGTVQAILRVYRDFFSYQNGIYRHSA------------ 465
Query: 139 SYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGW 198
A + + E AY +V+++GWGEE V Y VK
Sbjct: 466 -------AATPAEERSAYHSVRLIGWGEER-------------------VGYDMVK---- 495
Query: 199 GEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
YW V+++G +G+ G +ILRG NE IES V + P
Sbjct: 496 -------YWIAVNSWGTWWGENGRFRILRGTNECEIESYVLASNP 533
>gi|297814171|ref|XP_002874969.1| hypothetical protein ARALYDRAFT_490415 [Arabidopsis lyrata subsp.
lyrata]
gi|297320806|gb|EFH51228.1| hypothetical protein ARALYDRAFT_490415 [Arabidopsis lyrata subsp.
lyrata]
Length = 359
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 64/244 (26%), Positives = 89/244 (36%), Gaps = 77/244 (31%)
Query: 2 CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
C G + W + G+VT + NTGC S P C+ A P P
Sbjct: 171 CDGGYPIAAWQYFSYSGVVTEECDPYFDNTGC---------------SHPGCEP-AYPTP 214
Query: 60 KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
+C +C +DN + + + K+ Y VN DI E+ KNGPV + +Y D F++
Sbjct: 215 RCLRKCVSDN--KLWSESKHYSVSTYTVNSSPQDIMAEVYKNGPVEVSFTVYED-FAH-- 269
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
YKSGVY + I +A VK++GWG N
Sbjct: 270 --------------------YKSGVYKHITGSNIGGHA-VKLIGWGTSN----------- 297
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
G YW + + + +GD G I RG NE IE
Sbjct: 298 ----------------------EGEDYWLMANQWNRGWGDDGYFMIRRGTNECGIEDEPV 335
Query: 240 GALP 243
LP
Sbjct: 336 AGLP 339
>gi|328726763|ref|XP_003249034.1| PREDICTED: cathepsin B-like cysteine proteinase-like, partial
[Acyrthosiphon pisum]
Length = 129
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 41/122 (33%), Positives = 55/122 (45%), Gaps = 34/122 (27%)
Query: 125 GPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSAS 184
GP+ A+ +Y D SYKSGVY + +A K+ G
Sbjct: 42 GPIEASFDVYDDFPSYKSGVYQRTPNA-------TKLGG--------------------- 73
Query: 185 AEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
VKLIGWG E G PYW +V+++ Q+GD G KI RG +E I+S +P
Sbjct: 74 ------HAVKLIGWGVEEGTPYWLMVNSWNAQWGDNGLFKIRRGTDECRIDSATTAGVPV 127
Query: 245 DN 246
N
Sbjct: 128 TN 129
>gi|339248603|ref|XP_003373289.1| cathepsin B [Trichinella spiralis]
gi|316970616|gb|EFV54519.1| cathepsin B [Trichinella spiralis]
Length = 576
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 48/163 (29%), Positives = 69/163 (42%), Gaps = 46/163 (28%)
Query: 79 YRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIF 138
Y+ Y ++ +I EIM NGPV A ++ D F YKSG Y + P +
Sbjct: 432 YKMTPPYRISTNEREIMTEIMANGPVQATFLVHEDFFMYKSGVYQHLPYAND-------- 483
Query: 139 SYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGW 198
K YA S Y +V+I+GWG V ++T I
Sbjct: 484 --KGPAYARS------GYHSVRILGWG----------------------VDHSTGVPIK- 512
Query: 199 GEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
YW +++GE++G+ G +ILRG N IES + GA
Sbjct: 513 -------YWLCANSWGEEWGENGLFRILRGENHCDIESFIIGA 548
>gi|324713036|ref|NP_001191344.1| tubulointerstitial nephritis antigen-like isoform 3 [Homo sapiens]
gi|119628008|gb|EAX07603.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_a [Homo
sapiens]
Length = 362
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 66/248 (26%), Positives = 97/248 (39%), Gaps = 72/248 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVS---------FPPCNHANYTTSEPECK 52
C G W ++ +RG+V+ C P S PPC ++ + K
Sbjct: 165 CRGGRLDGAWWFLRRRGVVS-------DHCYPFSGRERDEAGPAPPC--MMHSRAMGRGK 215
Query: 53 TLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYS 112
AT C N+N D Y+ Y + +I +E+M+NGPV A M ++
Sbjct: 216 RQATAH--CPNSYVNNN-------DIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHE 266
Query: 113 DIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPY 172
D F YK G Y + PV L + G + +VKI GWGEE
Sbjct: 267 DFFLYKGGIYSHTPVS----LGRPERYRRHGTH------------SVKITGWGEET---- 306
Query: 173 WTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 232
+ T+K YWT +++G +G++G +I+RG NE
Sbjct: 307 --------------LPDGRTLK-----------YWTAANSWGPAWGERGHFRIVRGVNEC 341
Query: 233 IIESLVNG 240
IES V G
Sbjct: 342 DIESFVLG 349
>gi|356505709|ref|XP_003521632.1| PREDICTED: cathepsin B-like [Glycine max]
Length = 357
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 64/244 (26%), Positives = 90/244 (36%), Gaps = 77/244 (31%)
Query: 2 CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
C G W ++ G+VT + GC S P C EP +T P
Sbjct: 169 CDGGYPLYAWRYLAHHGVVTEECDPYFDQIGC---SHPGC--------EPAYRT-----P 212
Query: 60 KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
KC +C + N + + + K+ Y VN + DI E+ KNGPV +Y D YKS
Sbjct: 213 KCVKKCVSGN--QVWKKSKHYSVSAYRVNSDPHDIMAEVYKNGPVEVAFTVYEDFAYYKS 270
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
G +Y I Y+ G +A VK++GWG
Sbjct: 271 G------------VYKHITGYELGGHA------------VKLIGWGT------------- 293
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
++G YW + + + ++GD G KI RG NE IE V
Sbjct: 294 --------------------TDDGEDYWLLANQWNREWGDDGYFKIRRGTNECGIEEDVT 333
Query: 240 GALP 243
LP
Sbjct: 334 AGLP 337
>gi|330805199|ref|XP_003290573.1| hypothetical protein DICPUDRAFT_155103 [Dictyostelium purpureum]
gi|325079281|gb|EGC32888.1| hypothetical protein DICPUDRAFT_155103 [Dictyostelium purpureum]
Length = 313
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 62/242 (25%), Positives = 89/242 (36%), Gaps = 73/242 (30%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G S W ++ K+G+VT C+P + P C A +P + TP C
Sbjct: 144 CEGGDDVSAWNFLKKQGVVT-------QECKPYTIPTCPPAQ----QPCLNFVNTPN--C 190
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C + N + QDK++ + Y +N V I QEI NGPV A +Y D YKSG
Sbjct: 191 VKQCES-NSTLIYSQDKHKMAKIYSIN-SVEAIMQEISTNGPVEACFSVYEDFLGYKSGV 248
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + + + VKI G+G NG YW++ +
Sbjct: 249 YQH------------------------TTGKFLGGHCVKIFGYGTLNGVNYWSVANSWTT 284
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
S +GD G I RG +E IE V
Sbjct: 285 S----------------------------------WGDNGIFLIKRGSDECGIEDEVVAG 310
Query: 242 LP 243
+P
Sbjct: 311 IP 312
>gi|62320420|dbj|BAD94873.1| cathepsin B-like cysteine proteinase like protein [Arabidopsis
thaliana]
Length = 183
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 63/237 (26%), Positives = 86/237 (36%), Gaps = 77/237 (32%)
Query: 11 WVWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTND 68
W++ G+VT + NTGC S P C EP P PKC +C +
Sbjct: 4 WLYFKYHGVVTQECDPYFDNTGC---SHPGC--------EP-----TYPTPKCERKCVSR 47
Query: 69 NYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVV 128
N G + K+ Y +N + DI E+ Y NGPV
Sbjct: 48 NQLWG--ESKHYGVGAYRINPDPQDIMAEV-----------------------YKNGPVE 82
Query: 129 ANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIV 188
+Y D YKSGVY +I +A VK++GWG
Sbjct: 83 VAFTVYEDFAHYKSGVYKYITGTKIGGHA-VKLIGWGT---------------------- 119
Query: 189 AYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKD 245
++G YW + + + +GD G KI RG NE IE V LP +
Sbjct: 120 -----------SDDGEDYWLLANQWNRSWGDDGYFKIRRGTNECGIEQSVVAGLPSE 165
>gi|156708120|gb|ABU93318.1| cathepsin B9 cysteine protease, partial [Monocercomonoides sp. PA]
Length = 382
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 69/272 (25%), Positives = 102/272 (37%), Gaps = 74/272 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + + + + K G+ T C P C+H P C + TP C
Sbjct: 138 CNGGWTETAFEYAKKAGVPT-------EECVPYLMGKCHH-------PGCSSWQTPT--C 181
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C++ + Y K Y + V IQ E+M+NGPV A Y D+ Y G
Sbjct: 182 KKECSSLSNYNYSSNRYYASKSYS-IQRNVEAIQLELMRNGPVTAVFTTYDDLAVYWRG- 239
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWG---------------- 165
+Y+ + + G++A +KIVGWG
Sbjct: 240 -----------VYNHVMGSEQGLHA------------IKIVGWGVWRESEHMLTEEEKKA 276
Query: 166 -------------EENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVST 212
+E W + A+ S ++ T +E G PYW IV++
Sbjct: 277 EEEKRKRIEEEIKKEKREDKWHDFKQNALEKSKKVKRDETKN----NKEEGIPYWIIVNS 332
Query: 213 FGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
+GE FG G + I RG NE IES V +PK
Sbjct: 333 WGEDFGMDGILLIKRGVNECGIESDVYTGIPK 364
>gi|297665716|ref|XP_002811185.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 3
[Pongo abelii]
Length = 436
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 64/248 (25%), Positives = 96/248 (38%), Gaps = 72/248 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVS---------FPPCNHANYTTSEPECK 52
C G W ++ +RG+V+ C P S PPC + + +
Sbjct: 239 CRGGRLDGAWWFLRRRGVVS-------DHCYPFSGRERDEAGPTPPCMMHSRAMGRGKRQ 291
Query: 53 TLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYS 112
A+ C N+N D Y+ Y + +I +E+M+NGPV A M ++
Sbjct: 292 ATAS----CPNSHVNNN-------DIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHE 340
Query: 113 DIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPY 172
D F YK G Y + PV L + G + +VKI GWGEE
Sbjct: 341 DFFLYKGGIYSHTPVS----LGRPERYRRHGTH------------SVKITGWGEET---- 380
Query: 173 WTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 232
+ T+K YWT +++G +G++G +I+RG NE
Sbjct: 381 --------------LPDGRTLK-----------YWTAANSWGPAWGERGHFRIVRGVNEC 415
Query: 233 IIESLVNG 240
IES V G
Sbjct: 416 DIESFVLG 423
>gi|296207307|ref|XP_002750588.1| PREDICTED: tubulointerstitial nephritis antigen-like [Callithrix
jacchus]
Length = 467
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 64/245 (26%), Positives = 95/245 (38%), Gaps = 66/245 (26%)
Query: 2 CSSGISSSTWVWVHKRGLVTG------GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLA 55
C G W ++ +RG+V+ G G PV PPC + T + + A
Sbjct: 270 CRGGHLDGAWWFLRRRGVVSDHCYPFLGRERDKAG--PV--PPCMMHSRATGRGKRQATA 325
Query: 56 TPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIF 115
C N+N + Y+ Y + +I +E+M+NGPV A M ++ D F
Sbjct: 326 ----HCPNGHVNNN-------NIYQVTPAYRLGSNDTEIMKELMENGPVQALMEVHEDFF 374
Query: 116 SYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
YK G Y + PV L + G + +VKI GWGEE
Sbjct: 375 LYKGGIYSHTPV----NLGRPERYRRHGTH------------SVKITGWGEET------- 411
Query: 176 VRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
W + YWT +++G +G++G +I+RG NE IE
Sbjct: 412 ----------------------WPDGRKLKYWTAANSWGPAWGERGHFRIVRGVNECDIE 449
Query: 236 SLVNG 240
S V G
Sbjct: 450 SFVLG 454
>gi|332254562|ref|XP_003276398.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 3
[Nomascus leucogenys]
Length = 362
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 65/248 (26%), Positives = 97/248 (39%), Gaps = 72/248 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVS---------FPPCNHANYTTSEPECK 52
C G W ++ +RG+V+ C P S PPC ++ + K
Sbjct: 165 CRGGRLDGAWWFLRRRGVVS-------DHCYPFSGRERDEAGPAPPC--MMHSRAMGRGK 215
Query: 53 TLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYS 112
AT C N+N D Y+ Y + ++ +E+M+NGPV A M ++
Sbjct: 216 RQATAH--CPNSHVNNN-------DIYQVTPVYRLGSNDKEVMKELMENGPVQALMEVHE 266
Query: 113 DIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPY 172
D F YK G Y + PV L + G + +VKI GWGEE
Sbjct: 267 DFFLYKGGIYSHTPVS----LGRPERYRRHGTH------------SVKITGWGEET---- 306
Query: 173 WTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 232
+ T+K YWT +++G +G++G +I+RG NE
Sbjct: 307 --------------LPDGRTLK-----------YWTAANSWGPAWGERGHFRIVRGVNEC 341
Query: 233 IIESLVNG 240
IES V G
Sbjct: 342 DIESFVLG 349
>gi|340712697|ref|XP_003394892.1| PREDICTED: tubulointerstitial nephritis antigen-like [Bombus
terrestris]
Length = 445
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 46/149 (30%), Positives = 66/149 (44%), Gaps = 51/149 (34%)
Query: 92 ADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASA 151
DI EI+ +GPV A M +Y D F SY+SG+Y +A+
Sbjct: 338 TDIMYEILTSGPVQATMKVYQDFF-----------------------SYESGIYKHTATT 374
Query: 152 EIVA--YATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTI 209
E A Y +V+I+GWGE+ SA +K YW +
Sbjct: 375 EHYAFGYHSVRIIGWGED---------------TSAHRHHNLPIK-----------YWLV 408
Query: 210 VSTFGEQFGDKGTIKILRGRNEAIIESLV 238
V+++G+Q+G+ G +I RG NE IES V
Sbjct: 409 VNSWGQQWGESGLFRIQRGTNECDIESFV 437
>gi|395833440|ref|XP_003789742.1| PREDICTED: tubulointerstitial nephritis antigen [Otolemur
garnettii]
Length = 464
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 61/247 (24%), Positives = 98/247 (39%), Gaps = 71/247 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGA-------HHSNTGCQPVSFPPCNHANYTTSEPECKTL 54
C+SG W ++ KRGLV+ H +N+GC A + S+ K
Sbjct: 272 CNSGSIDRAWWYLRKRGLVSHACYPLFKDQHATNSGC----------AMASRSDGRGKRH 321
Query: 55 ATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDI 114
AT +P C N+ Y+ Y ++ +I +EIM+NGPV A M ++ D
Sbjct: 322 AT-KP-----CPNNIEKSNRI---YQCSPPYRISSNETEIMKEIMQNGPVQAIMQVHEDF 372
Query: 115 FSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWT 174
F YKSG I+ + + + S + + VK++GWG G
Sbjct: 373 FHYKSG----------------IYRHVASTHGESENYRKLRTHAVKLLGWGTLRG----- 411
Query: 175 IVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 234
+ +W +++G+ +G+ G +ILRG NE+ I
Sbjct: 412 ------------------------AQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDI 447
Query: 235 ESLVNGA 241
E L+ A
Sbjct: 448 EKLIIAA 454
>gi|14042811|dbj|BAB55403.1| unnamed protein product [Homo sapiens]
Length = 218
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 65/248 (26%), Positives = 97/248 (39%), Gaps = 72/248 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVS---------FPPCNHANYTTSEPECK 52
C G W ++ +RG+V+ C P S PPC ++ + K
Sbjct: 21 CRGGRLDGAWWFLRRRGVVS-------DHCYPFSGRERDEAGPAPPC--MMHSRAMGRGK 71
Query: 53 TLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYS 112
AT C N+N D Y+ Y + +I +E+M+NGPV A M ++
Sbjct: 72 RQATAH--CPNSYVNNN-------DIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHE 122
Query: 113 DIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPY 172
D F Y+ G Y + PV L + G + +VKI GWGEE
Sbjct: 123 DFFLYEGGIYSHTPVS----LGRPERYRRHGTH------------SVKITGWGEET---- 162
Query: 173 WTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 232
+ T+K YWT +++G +G++G +I+RG NE
Sbjct: 163 --------------LPDGRTLK-----------YWTAANSWGPAWGERGHFRIVRGVNEC 197
Query: 233 IIESLVNG 240
IES V G
Sbjct: 198 DIESFVLG 205
>gi|297665714|ref|XP_002811184.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 2
[Pongo abelii]
Length = 467
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 64/248 (25%), Positives = 96/248 (38%), Gaps = 72/248 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVS---------FPPCNHANYTTSEPECK 52
C G W ++ +RG+V+ C P S PPC + + +
Sbjct: 270 CRGGRLDGAWWFLRRRGVVS-------DHCYPFSGRERDEAGPTPPCMMHSRAMGRGKRQ 322
Query: 53 TLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYS 112
A+ C N+N D Y+ Y + +I +E+M+NGPV A M ++
Sbjct: 323 ATAS----CPNSHVNNN-------DIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHE 371
Query: 113 DIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPY 172
D F YK G Y + PV L + G + +VKI GWGEE
Sbjct: 372 DFFLYKGGIYSHTPVS----LGRPERYRRHGTH------------SVKITGWGEET---- 411
Query: 173 WTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 232
+ T+K YWT +++G +G++G +I+RG NE
Sbjct: 412 --------------LPDGRTLK-----------YWTAANSWGPAWGERGHFRIVRGVNEC 446
Query: 233 IIESLVNG 240
IES V G
Sbjct: 447 DIESFVLG 454
>gi|255548165|ref|XP_002515139.1| cathepsin B, putative [Ricinus communis]
gi|223545619|gb|EEF47123.1| cathepsin B, putative [Ricinus communis]
Length = 376
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 58/242 (23%), Positives = 84/242 (34%), Gaps = 73/242 (30%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + G+VT C P N S P C+ P PKC
Sbjct: 186 CDGGYPMYAWRYFVHHGVVT-------EECDPY------FDNIGCSHPGCEP-GFPTPKC 231
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C + N + + Q K+ Y ++ + D+ E+ KNGPV + +Y D YKSG
Sbjct: 232 VRKCIDKN--QLWRQSKHYSVNAYRISSDPHDVMAEVYKNGPVEVSFTVYEDFAHYKSGV 289
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + E++ VK++GWG
Sbjct: 290 YKH------------------------ITGEVMGGHAVKLIGWGT--------------- 310
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+NG YW + + + +GD G KI RG NE IE
Sbjct: 311 ------------------SDNGEDYWLLANQWNRGWGDDGYFKIRRGTNECGIEDDAVAG 352
Query: 242 LP 243
LP
Sbjct: 353 LP 354
>gi|351709947|gb|EHB12866.1| Tubulointerstitial nephritis antigen-like protein [Heterocephalus
glaber]
Length = 467
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 63/242 (26%), Positives = 96/242 (39%), Gaps = 60/242 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W ++ +RG+V+ + + Q + P ++ + K AT
Sbjct: 270 CQGGRLDGAWWFLRRRGVVSDHCYPFSGHEQAEAGPATPCMMHSRAMGRGKRQAT----- 324
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
RC N + + Y+ Y + + +I +E+M+NGPV A M +Y D F YKSG
Sbjct: 325 -RRCPNSHDDA---NEIYQVTPAYRLGSDEKEIMKELMENGPVQALMEVYEDFFLYKSG- 379
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEE---NGRPYWTIVRV 178
I+S+ +VKI GWGEE +GR
Sbjct: 380 ---------------IYSHTLVSMGRPEQYRRHGTHSVKITGWGEEMLPDGR-------- 416
Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
T+K YWT +++G +G++G +ILRG NE IES V
Sbjct: 417 -------------TLK-----------YWTAANSWGPSWGERGYFRILRGSNECDIESFV 452
Query: 239 NG 240
G
Sbjct: 453 LG 454
>gi|194882138|ref|XP_001975170.1| GG20712 [Drosophila erecta]
gi|190658357|gb|EDV55570.1| GG20712 [Drosophila erecta]
Length = 431
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 66/244 (27%), Positives = 89/244 (36%), Gaps = 78/244 (31%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W ++HK+G+V + C P YT CK +
Sbjct: 253 CDGGHLDAAWRYLHKKGVV-------DESCYP----------YTQHRDTCKIRHNSRSLR 295
Query: 62 HTRC-TNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C T N R F Y Y +N E ADI EI +GPV A M + D FSY G
Sbjct: 296 ANGCETPVNVDRDTF---YTVGPAYSLNRE-ADIMAEIFNSGPVQATMRVNRDFFSYSRG 351
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEI-VAYATVKIVGWGEENGRPYWTIVRVY 179
Y +A+ E + +VK+VGWGEE+
Sbjct: 352 VYRQ----------------------TAANREAPTGFHSVKLVGWGEEH----------- 378
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
NG YW +++G +G+KG +ILRG NE IE V
Sbjct: 379 ----------------------NGEKYWIAANSWGSWWGEKGYFRILRGSNECGIEEYVL 416
Query: 240 GALP 243
+ P
Sbjct: 417 ASWP 420
>gi|324711034|ref|NP_001191343.1| tubulointerstitial nephritis antigen-like isoform 2 precursor [Homo
sapiens]
gi|194391000|dbj|BAG60618.1| unnamed protein product [Homo sapiens]
Length = 436
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 66/248 (26%), Positives = 97/248 (39%), Gaps = 72/248 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVS---------FPPCNHANYTTSEPECK 52
C G W ++ +RG+V+ C P S PPC ++ + K
Sbjct: 239 CRGGRLDGAWWFLRRRGVVS-------DHCYPFSGRERDEAGPAPPC--MMHSRAMGRGK 289
Query: 53 TLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYS 112
AT C N+N D Y+ Y + +I +E+M+NGPV A M ++
Sbjct: 290 RQATAH--CPNSYVNNN-------DIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHE 340
Query: 113 DIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPY 172
D F YK G Y + PV L + G + +VKI GWGEE
Sbjct: 341 DFFLYKGGIYSHTPVS----LGRPERYRRHGTH------------SVKITGWGEET---- 380
Query: 173 WTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 232
+ T+K YWT +++G +G++G +I+RG NE
Sbjct: 381 --------------LPDGRTLK-----------YWTAANSWGPAWGERGHFRIVRGVNEC 415
Query: 233 IIESLVNG 240
IES V G
Sbjct: 416 DIESFVLG 423
>gi|397515891|ref|XP_003828175.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2 [Pan
paniscus]
Length = 436
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 66/248 (26%), Positives = 97/248 (39%), Gaps = 72/248 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVS---------FPPCNHANYTTSEPECK 52
C G W ++ +RG+V+ C P S PPC ++ + K
Sbjct: 239 CRGGRLDGAWWFLRRRGVVS-------DHCYPFSGRERDEAGPAPPC--MMHSRAMGRGK 289
Query: 53 TLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYS 112
AT C N+N D Y+ Y + +I +E+M+NGPV A M ++
Sbjct: 290 RQATAH--CPNSYVNNN-------DIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHE 340
Query: 113 DIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPY 172
D F YK G Y + PV L + G + +VKI GWGEE
Sbjct: 341 DFFLYKGGIYSHTPVS----LGRPERYRRHGTH------------SVKITGWGEET---- 380
Query: 173 WTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 232
+ T+K YWT +++G +G++G +I+RG NE
Sbjct: 381 --------------LPDGRTLK-----------YWTAANSWGPAWGERGHFRIVRGVNEC 415
Query: 233 IIESLVNG 240
IES V G
Sbjct: 416 DIESFVLG 423
>gi|256052327|ref|XP_002569724.1| cathepsin B-like peptidase (C01 family) [Schistosoma mansoni]
Length = 96
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 42/145 (28%), Positives = 61/145 (42%), Gaps = 58/145 (40%)
Query: 94 IQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEI 153
IQ+EIMK GPV AN +Y D +YKSG Y + + ++
Sbjct: 3 IQKEIMKYGPVEANFIVYEDFLNYKSGIYKH------------------------ITGKL 38
Query: 154 VAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTF 213
++ ++I+GWGEEN P YW I +++
Sbjct: 39 FSWHAIRIIGWGEENNTP----------------------------------YWLIPNSW 64
Query: 214 GEQFGDKGTIKILRGRNEAIIESLV 238
E +G+ G +ILRGR+E IES V
Sbjct: 65 NEDWGENGNFRILRGRHECSIESEV 89
>gi|426328832|ref|XP_004025452.1| PREDICTED: tubulointerstitial nephritis antigen-like [Gorilla
gorilla gorilla]
Length = 462
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 68/251 (27%), Positives = 99/251 (39%), Gaps = 78/251 (31%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVS---------FPPCNHANYTTSEPECK 52
C G W ++ +RG+V+ C P S PPC ++ + K
Sbjct: 265 CRGGRLDGAWWFLRRRGVVS-------DHCYPFSGRERDEAGPAPPC--MMHSQAMGRGK 315
Query: 53 TLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYS 112
AT C N+N D Y+ Y + +I +E+M+NGPV A M ++
Sbjct: 316 RQATAH--CPNSYVNNN-------DIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHE 366
Query: 113 DIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEE---NG 169
D F YK G Y + PV L + G + +VKI GWGEE +G
Sbjct: 367 DFFLYKGGIYSHTPVS----LGRPERYRRHGTH------------SVKITGWGEETLPDG 410
Query: 170 RPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 229
R T+K YWT +++G +G++G +I+RG
Sbjct: 411 R---------------------TLK-----------YWTAANSWGPAWGERGHFRIVRGV 438
Query: 230 NEAIIESLVNG 240
NE IES V G
Sbjct: 439 NECDIESFVLG 449
>gi|195426329|ref|XP_002061289.1| GK20838 [Drosophila willistoni]
gi|194157374|gb|EDW72275.1| GK20838 [Drosophila willistoni]
Length = 432
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 62/243 (25%), Positives = 89/243 (36%), Gaps = 77/243 (31%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W ++HK+G++ + C P YT S CK + K
Sbjct: 255 CEGGHLDAAWRYLHKKGVL-------DESCYP----------YTQSRGTCKVRHSGSLKA 297
Query: 62 HTRCTNDNYGRGFFQDK-YRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
H G +D Y Y ++ E ADI+ EI +GPV A M +Y D FSY G
Sbjct: 298 H----GCRPAPGVDRDSLYTVGPAYSLSRE-ADIKAEIFHSGPVQATMRVYRDFFSYSGG 352
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
Y + + +VK+VGWGEE+
Sbjct: 353 IYRQ---------------------TAANRGAPTGFHSVKLVGWGEEH------------ 379
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
NG YW +++G +G++G +ILRG NE IE V
Sbjct: 380 ---------------------NGDKYWIAANSWGPWWGERGYFRILRGSNECGIEDYVLA 418
Query: 241 ALP 243
+ P
Sbjct: 419 SWP 421
>gi|40643250|emb|CAC83720.1| cathepsin B [Hordeum vulgare subsp. vulgare]
gi|326494236|dbj|BAJ90387.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326499864|dbj|BAJ90767.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 344
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 61/244 (25%), Positives = 85/244 (34%), Gaps = 77/244 (31%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQP-VSFPPCNHANYTTSEPECKTLATPQPK 60
C G S W + + G+VT C P C H P C+ A P P
Sbjct: 164 CDGGYPISAWQYFVQNGVVT-------EECDPYFDQVGCKH-------PGCEP-AYPTPV 208
Query: 61 CHTRCTNDNYGRGFFQDKYRFK-RYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
C +C N +Q+K F Y VN + DI E+ KNGPV +Y D YKS
Sbjct: 209 CEKKCKVQNQ---VWQEKKHFSIDAYQVNSDPHDIMAEVYKNGPVEVAFTVYEDFAHYKS 265
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
G Y + + ++ VK++GWG +
Sbjct: 266 GVYKH------------------------ITGGVMGGHAVKLIGWGTSDA---------- 291
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
G YW + + + +GD G KI+RG+NE IE V
Sbjct: 292 -----------------------GEDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEDVT 328
Query: 240 GALP 243
+P
Sbjct: 329 AGMP 332
>gi|332808277|ref|XP_524645.3| PREDICTED: LOW QUALITY PROTEIN: tubulointerstitial nephritis
antigen-like 1 [Pan troglodytes]
Length = 472
Score = 62.0 bits (149), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 66/248 (26%), Positives = 97/248 (39%), Gaps = 72/248 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVS---------FPPCNHANYTTSEPECK 52
C G W ++ +RG+V+ C P S PPC ++ + K
Sbjct: 275 CRGGRLDGAWWFLRRRGVVS-------DHCYPFSGRERDEAGPAPPC--MMHSRAMGRGK 325
Query: 53 TLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYS 112
AT C N+N D Y+ Y + +I +E+M+NGPV A M ++
Sbjct: 326 RQATAH--CPNSYVNNN-------DIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHE 376
Query: 113 DIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPY 172
D F YK G Y + PV L + G + +VKI GWGEE
Sbjct: 377 DFFLYKGGIYSHTPVS----LGRPERYRRHGTH------------SVKITGWGEET---- 416
Query: 173 WTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 232
+ T+K YWT +++G +G++G +I+RG NE
Sbjct: 417 --------------LPDGRTLK-----------YWTAANSWGPAWGERGHFRIVRGVNEC 451
Query: 233 IIESLVNG 240
IES V G
Sbjct: 452 DIESFVLG 459
>gi|395526635|ref|XP_003765465.1| PREDICTED: tubulointerstitial nephritis antigen-like [Sarcophilus
harrisii]
Length = 467
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 61/242 (25%), Positives = 92/242 (38%), Gaps = 60/242 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFP--PCN-HANYTTSEPECKTLATPQ 58
C G W ++ +RGLV+ + + G + P PC H+ + T P
Sbjct: 269 CRGGRLDGAWWFLRRRGLVSNNCYPFSEGDHNGAAPAAPCMMHSRHMGRGKRQATAHCPN 328
Query: 59 PKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
+ H Y+ Y ++ DI +E+M+NGPV A + ++ D F YK
Sbjct: 329 SRTHA------------NHIYQATPPYRLSSHEKDIMKELMENGPVQALLEVHEDFFLYK 376
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
SG Y + P K Y + +VKI GWGEE
Sbjct: 377 SGIYKHTPASLG----------KPERYRQHGT------HSVKITGWGEE----------- 409
Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
+ + V YWT +++G +G+ G +I+RG NE IES V
Sbjct: 410 --IQPDGQKVK----------------YWTAANSWGPTWGENGYFRIVRGANECDIESFV 451
Query: 239 NG 240
G
Sbjct: 452 VG 453
>gi|11545918|ref|NP_071447.1| tubulointerstitial nephritis antigen-like isoform 1 precursor [Homo
sapiens]
gi|61213628|sp|Q9GZM7.1|TINAL_HUMAN RecName: Full=Tubulointerstitial nephritis antigen-like; AltName:
Full=Glucocorticoid-inducible protein 5; AltName:
Full=Oxidized LDL-responsive gene 2 protein;
Short=OLRG-2; AltName: Full=Tubulointerstitial nephritis
antigen-related protein; Short=TIN Ag-related protein;
Short=TIN-Ag-RP; Flags: Precursor
gi|11602840|gb|AAG38876.1|AF236150_1 tubulointerstitial nephritis antigen-related protein precursor
[Homo sapiens]
gi|11275667|gb|AAG33699.1| oxidized-LDL responsive gene 2 [Homo sapiens]
gi|11527793|dbj|BAB18636.1| glucocorticoid-inducible protein [Homo sapiens]
gi|11527809|dbj|BAB18727.1| glucocorticoid-inducible protein [Homo sapiens]
gi|11761715|gb|AAG40154.1| tubulointerstitial nephritis antigen-related protein [Homo sapiens]
gi|22761462|dbj|BAC11596.1| unnamed protein product [Homo sapiens]
gi|37181967|gb|AAQ88787.1| LCN7 [Homo sapiens]
gi|40353044|gb|AAH64633.1| Tubulointerstitial nephritis antigen-like 1 [Homo sapiens]
gi|119628009|gb|EAX07604.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_b [Homo
sapiens]
gi|119628010|gb|EAX07605.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_b [Homo
sapiens]
gi|119628011|gb|EAX07606.1| tubulointerstitial nephritis antigen-like 1, isoform CRA_b [Homo
sapiens]
gi|158258977|dbj|BAF85459.1| unnamed protein product [Homo sapiens]
gi|261858502|dbj|BAI45773.1| tubulointerstitial nephritis antigen-like 1 [synthetic construct]
gi|410265400|gb|JAA20666.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
gi|410307560|gb|JAA32380.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
gi|410307562|gb|JAA32381.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
gi|410307564|gb|JAA32382.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
gi|410335249|gb|JAA36571.1| tubulointerstitial nephritis antigen-like 1 [Pan troglodytes]
Length = 467
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 66/248 (26%), Positives = 97/248 (39%), Gaps = 72/248 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVS---------FPPCNHANYTTSEPECK 52
C G W ++ +RG+V+ C P S PPC ++ + K
Sbjct: 270 CRGGRLDGAWWFLRRRGVVS-------DHCYPFSGRERDEAGPAPPC--MMHSRAMGRGK 320
Query: 53 TLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYS 112
AT C N+N D Y+ Y + +I +E+M+NGPV A M ++
Sbjct: 321 RQATAH--CPNSYVNNN-------DIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHE 371
Query: 113 DIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPY 172
D F YK G Y + PV L + G + +VKI GWGEE
Sbjct: 372 DFFLYKGGIYSHTPVS----LGRPERYRRHGTH------------SVKITGWGEET---- 411
Query: 173 WTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 232
+ T+K YWT +++G +G++G +I+RG NE
Sbjct: 412 --------------LPDGRTLK-----------YWTAANSWGPAWGERGHFRIVRGVNEC 446
Query: 233 IIESLVNG 240
IES V G
Sbjct: 447 DIESFVLG 454
>gi|397515889|ref|XP_003828174.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1 [Pan
paniscus]
Length = 467
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 66/248 (26%), Positives = 97/248 (39%), Gaps = 72/248 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVS---------FPPCNHANYTTSEPECK 52
C G W ++ +RG+V+ C P S PPC ++ + K
Sbjct: 270 CRGGRLDGAWWFLRRRGVVS-------DHCYPFSGRERDEAGPAPPC--MMHSRAMGRGK 320
Query: 53 TLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYS 112
AT C N+N D Y+ Y + +I +E+M+NGPV A M ++
Sbjct: 321 RQATAH--CPNSYVNNN-------DIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHE 371
Query: 113 DIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPY 172
D F YK G Y + PV L + G + +VKI GWGEE
Sbjct: 372 DFFLYKGGIYSHTPVS----LGRPERYRRHGTH------------SVKITGWGEET---- 411
Query: 173 WTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 232
+ T+K YWT +++G +G++G +I+RG NE
Sbjct: 412 --------------LPDGRTLK-----------YWTAANSWGPAWGERGHFRIVRGVNEC 446
Query: 233 IIESLVNG 240
IES V G
Sbjct: 447 DIESFVLG 454
>gi|291228863|ref|XP_002734398.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 451
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 63/237 (26%), Positives = 94/237 (39%), Gaps = 70/237 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
CS G W ++ KRG+V+ C P YT+ + + K + K
Sbjct: 246 CSGGHIDRAWWFMRKRGVVS-------NDCYP----------YTSGDQDKKGVCMMPGKL 288
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ C GR + + Y + +IQ EIM+NGPV A+ + D F Y SG
Sbjct: 289 PSDCPT---GRERNNELHHSTPPYRIAANEREIQVEIMENGPVQASFEVKEDFFMYGSGV 345
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + P+ +N D Y + + +VK++GWG ENG YW
Sbjct: 346 YRHTPIASN-----DAEQYHAS-----------EWHSVKLLGWGVENGIKYW-------- 381
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
+G +++G ++G+ G KILRG NE IES V
Sbjct: 382 --------------LG------------ANSWGTKWGEDGYFKILRGENECNIESYV 412
>gi|332254560|ref|XP_003276397.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
[Nomascus leucogenys]
Length = 436
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 65/248 (26%), Positives = 97/248 (39%), Gaps = 72/248 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVS---------FPPCNHANYTTSEPECK 52
C G W ++ +RG+V+ C P S PPC ++ + K
Sbjct: 239 CRGGRLDGAWWFLRRRGVVS-------DHCYPFSGRERDEAGPAPPC--MMHSRAMGRGK 289
Query: 53 TLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYS 112
AT C N+N D Y+ Y + ++ +E+M+NGPV A M ++
Sbjct: 290 RQATAH--CPNSHVNNN-------DIYQVTPVYRLGSNDKEVMKELMENGPVQALMEVHE 340
Query: 113 DIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPY 172
D F YK G Y + PV L + G + +VKI GWGEE
Sbjct: 341 DFFLYKGGIYSHTPVS----LGRPERYRRHGTH------------SVKITGWGEET---- 380
Query: 173 WTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 232
+ T+K YWT +++G +G++G +I+RG NE
Sbjct: 381 --------------LPDGRTLK-----------YWTAANSWGPAWGERGHFRIVRGVNEC 415
Query: 233 IIESLVNG 240
IES V G
Sbjct: 416 DIESFVLG 423
>gi|332030944|gb|EGI70570.1| Uncharacterized peptidase C1-like protein F26E4.3 [Acromyrmex
echinatior]
Length = 501
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 49/150 (32%), Positives = 64/150 (42%), Gaps = 58/150 (38%)
Query: 93 DIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAE 152
DI QEI+ +GPV A M +Y D F YK+G+Y S SAE
Sbjct: 398 DIMQEILTSGPVQATMRVYQD-----------------------FFVYKNGIYRHSQSAE 434
Query: 153 I--VAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRP--YWT 208
+ Y +V+I+GWGEE R Y G P YW
Sbjct: 435 LHDSGYHSVRIIGWGEE---------RSY----------------------RGPPLKYWL 463
Query: 209 IVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
+V+++G +G+ G KI RG NE IES V
Sbjct: 464 VVNSWGYNWGENGLFKIQRGTNECEIESYV 493
>gi|332254558|ref|XP_003276396.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
[Nomascus leucogenys]
Length = 467
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 65/248 (26%), Positives = 97/248 (39%), Gaps = 72/248 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVS---------FPPCNHANYTTSEPECK 52
C G W ++ +RG+V+ C P S PPC ++ + K
Sbjct: 270 CRGGRLDGAWWFLRRRGVVS-------DHCYPFSGRERDEAGPAPPC--MMHSRAMGRGK 320
Query: 53 TLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYS 112
AT C N+N D Y+ Y + ++ +E+M+NGPV A M ++
Sbjct: 321 RQATAH--CPNSHVNNN-------DIYQVTPVYRLGSNDKEVMKELMENGPVQALMEVHE 371
Query: 113 DIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPY 172
D F YK G Y + PV L + G + +VKI GWGEE
Sbjct: 372 DFFLYKGGIYSHTPVS----LGRPERYRRHGTH------------SVKITGWGEET---- 411
Query: 173 WTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 232
+ T+K YWT +++G +G++G +I+RG NE
Sbjct: 412 --------------LPDGRTLK-----------YWTAANSWGPAWGERGHFRIVRGVNEC 446
Query: 233 IIESLVNG 240
IES V G
Sbjct: 447 DIESFVLG 454
>gi|335290878|ref|XP_003127800.2| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Sus scrofa]
Length = 362
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 63/248 (25%), Positives = 98/248 (39%), Gaps = 72/248 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPEC----KTLATP 57
C G W ++ +RG+V+ C P S + N P C + +
Sbjct: 165 CQGGRLDGAWWFLRRRGVVS-------DHCYPFSG---HERNEAGPAPRCMMHSRAMGRG 214
Query: 58 QPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
+ + RC N D Y+ Y + DI +E+M+NGPV A M ++
Sbjct: 215 KRQATARCPNSYV---HANDIYQVTPAYRLGSNEKDIMKELMENGPVQALMEVHE----- 266
Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
D F Y+SG+Y+ + + +GRP R
Sbjct: 267 ------------------DFFLYQSGIYSHTPVS----------------HGRP--ERYR 290
Query: 178 VYAVSASAEIVAYATVKLIGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEA 232
+ +VK+ GWGEE +GR YWT +++G +G++G +I+RG NE
Sbjct: 291 RHGTH---------SVKITGWGEETLPDGRMLKYWTAANSWGPGWGERGHFRIVRGANEC 341
Query: 233 IIESLVNG 240
IES V G
Sbjct: 342 DIESFVLG 349
>gi|322788703|gb|EFZ14296.1| hypothetical protein SINV_07506 [Solenopsis invicta]
Length = 443
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 49/151 (32%), Positives = 63/151 (41%), Gaps = 58/151 (38%)
Query: 92 ADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASA 151
DI QEI+ +GPV A M +Y D F YKSG+Y S SA
Sbjct: 339 TDIMQEILTSGPVQATMRVYQDFF-----------------------IYKSGIYRHSRSA 375
Query: 152 EI--VAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRP--YW 207
E+ Y +V+I+GWGEE R Y G P YW
Sbjct: 376 ELHDSGYHSVRIIGWGEE---------RSY----------------------RGPPLKYW 404
Query: 208 TIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
+ +++G +GD G KI +G NE IES V
Sbjct: 405 LVANSWGYNWGDNGLFKIQKGTNECEIESYV 435
>gi|290982673|ref|XP_002674054.1| predicted protein [Naegleria gruberi]
gi|284087642|gb|EFC41310.1| predicted protein [Naegleria gruberi]
Length = 673
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 30/66 (45%), Positives = 43/66 (65%), Gaps = 1/66 (1%)
Query: 109 YLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWG-EE 167
Y+YS I +Y++ NGPV A+ +YSD +SYKSG+Y +A + V VK++GW +
Sbjct: 214 YIYSPITNYQTEIMTNGPVEADFDVYSDFYSYKSGIYQKTAGSTYVGGHAVKVLGWASDS 273
Query: 168 NGRPYW 173
NG PYW
Sbjct: 274 NGTPYW 279
>gi|449446774|ref|XP_004141146.1| PREDICTED: cathepsin B-like [Cucumis sativus]
Length = 348
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 61/244 (25%), Positives = 88/244 (36%), Gaps = 77/244 (31%)
Query: 2 CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
C G S W + + G+VT + TGC S P C EP A P P
Sbjct: 169 CDGGYPISAWRYFVRHGVVTEQCDPYFDTTGC---SHPGC--------EP-----AYPTP 212
Query: 60 KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
+C C + N + + + K+ Y V + DI E+ KNGPV + +Y D YKS
Sbjct: 213 RCVRHCVDKN--QIWRKTKHYGVSAYRVKRDPNDIMAEVYKNGPVEVSFTVYEDFAHYKS 270
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
G Y + + +++ VK++GWG
Sbjct: 271 GVYKH------------------------ITGDVMGGHAVKLIGWGT------------- 293
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
++G YW + + + +GD G KI RG NE IE V
Sbjct: 294 --------------------TDDGEDYWLLANQWNRGWGDDGYFKIRRGTNECGIEEDVV 333
Query: 240 GALP 243
LP
Sbjct: 334 AGLP 337
>gi|1644295|emb|CAB03627.1| cysteine proteinase [Haemonchus contortus]
Length = 345
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 40/123 (32%), Positives = 56/123 (45%), Gaps = 35/123 (28%)
Query: 116 SYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
S+ G +GNGPVVA +Y D YK G+Y A A+A +KI+GWG ENG PY
Sbjct: 251 SHSEGDHGNGPVVAVFTVYEDFSYYKKGIYVHIAGKARGAHA-IKIIGWGVENGLPY--- 306
Query: 176 VRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
W I +++ + +G++G +I+RG NE IE
Sbjct: 307 -------------------------------WLIANSWHDDWGEQGLFRIVRGINECGIE 335
Query: 236 SLV 238
V
Sbjct: 336 QEV 338
>gi|345794363|ref|XP_535330.3| PREDICTED: tubulointerstitial nephritis antigen-like 1 [Canis lupus
familiaris]
Length = 467
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 57/201 (28%), Positives = 82/201 (40%), Gaps = 64/201 (31%)
Query: 55 ATPQPKC--HTRCTNDNYGRGFFQ-------------DKYRFKRYYWVNDEVADIQQEIM 99
A P P+C H+R GRG Q D Y+ Y + +I +E+M
Sbjct: 303 AGPAPRCMMHSR----AMGRGKRQATARCPSSHVHANDIYQVTPAYRLGTNEKEIMKELM 358
Query: 100 KNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATV 159
+NGPV A M ++ D F Y+ G Y + PV L + G + +V
Sbjct: 359 ENGPVQALMEVHEDFFLYQGGIYSHTPVS----LGRPERYRRHGTH------------SV 402
Query: 160 KIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGD 219
KI GWGEE + T+K YWT +++G +G+
Sbjct: 403 KITGWGEET------------------LPDGRTLK-----------YWTAANSWGPAWGE 433
Query: 220 KGTIKILRGRNEAIIESLVNG 240
+G +I+RG NE IES V G
Sbjct: 434 RGHFRIVRGANECDIESFVLG 454
>gi|449489527|ref|XP_004158338.1| PREDICTED: cathepsin B-like [Cucumis sativus]
Length = 349
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 61/244 (25%), Positives = 88/244 (36%), Gaps = 77/244 (31%)
Query: 2 CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
C G S W + + G+VT + TGC S P C EP A P P
Sbjct: 170 CDGGYPISAWRYFVRHGVVTEQCDPYFDTTGC---SHPGC--------EP-----AYPTP 213
Query: 60 KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
+C C + N + + + K+ Y V + DI E+ KNGPV + +Y D YKS
Sbjct: 214 RCVRHCVDKN--QIWRKTKHYGVSAYRVKRDPNDIMAEVYKNGPVEVSFTVYEDFAHYKS 271
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
G Y + + +++ VK++GWG
Sbjct: 272 GVYKH------------------------ITGDVMGGHAVKLIGWGT------------- 294
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
++G YW + + + +GD G KI RG NE IE V
Sbjct: 295 --------------------TDDGEDYWLLANQWNRGWGDDGYFKIRRGTNECGIEEDVV 334
Query: 240 GALP 243
LP
Sbjct: 335 AGLP 338
>gi|10803439|emb|CAC13132.1| putative cathepsin B.6 [Ostertagia ostertagi]
Length = 197
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 44/162 (27%), Positives = 60/162 (37%), Gaps = 25/162 (15%)
Query: 8 SSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 67
S W + + G+ +GG + C+P PC T EC P C C
Sbjct: 50 SQAWEFAXRNGVCSGGWYGEKGVCKPYPLHPCGKHXNQTYYGECPDHXYXTPACKKYCQY 109
Query: 68 DNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPV 127
Y + + DK Y V + A I+ EIM GPV A +Y D Y G Y
Sbjct: 110 -GYDKRYXNDKVXVTSAYQVXSDEAAIRAEIMSRGPVQAAFTVYGDFMLYTXGIY----- 163
Query: 128 VANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENG 169
V + +++ VKI+GWG ENG
Sbjct: 164 -------------------VHTAGKLMGGHGVKIIGWGVENG 186
>gi|327281715|ref|XP_003225592.1| PREDICTED: tubulointerstitial nephritis antigen-like [Anolis
carolinensis]
Length = 520
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 57/243 (23%), Positives = 99/243 (40%), Gaps = 62/243 (25%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPEC----KTLATP 57
C+ G W ++ +RG+VT C P S NH + + P C ++
Sbjct: 321 CNGGRIDGAWWFLRRRGVVT-------DECYPFSNQETNH---SPNAPACMMHSRSTGRG 370
Query: 58 QPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
+ + RC N R + Y+ Y ++ +I +E+M+NGPV A + ++ D F Y
Sbjct: 371 KRQAIARCPNP---RSHANEIYQSTPAYRLSSNEKEIMKELMENGPVQAILEVHEDFFMY 427
Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
++G +Y ++ + + K Y + +VKI GWGEE
Sbjct: 428 RTG----------IYRHTAVAAGKPEQYRRHGT------HSVKITGWGEEQ--------- 462
Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
+ + + YW +++G+ +G+ G +I RG NE IE+
Sbjct: 463 --------------------MPDGSNQKYWIAANSWGKDWGEHGYFRITRGENECEIETF 502
Query: 238 VNG 240
V G
Sbjct: 503 VVG 505
>gi|156708114|gb|ABU93315.1| cathepsin B6 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 52/177 (29%), Positives = 70/177 (39%), Gaps = 65/177 (36%)
Query: 59 PKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
P C +C+N G + KY Y V +IQ+E+MKNGPV +YSD +YK
Sbjct: 160 PACAAKCSN---GSQIIRYKYEKAETY----TVQNIQEELMKNGPVYFRFTVYSDFMNYK 212
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
SG +Y Y+ G +A V ++GWG E+G PYW +
Sbjct: 213 SG------------VYQHKSGYQEGGHA------------VLLIGWGVEDGVPYWLLQN- 247
Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
WG P W G+KG KI+RG+NE E
Sbjct: 248 ------------------SWG-----PAW----------GEKGHFKIIRGKNECGCE 271
>gi|326490902|dbj|BAJ90118.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326508404|dbj|BAJ99469.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514912|dbj|BAJ99817.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 345
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 59/245 (24%), Positives = 82/245 (33%), Gaps = 79/245 (32%)
Query: 2 CSSGISSSTWVWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
C G W + + G+VT GCQ P C+ A P P
Sbjct: 165 CDGGYPIFAWQYFVENGVVTDECDPFFDQVGCQ---------------HPGCEP-AYPTP 208
Query: 60 KCHTRCTNDNYGRGFFQDKYRFK-RYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
C +C N +++K F Y VN + DI E+ KNGPV + +Y D YK
Sbjct: 209 VCEKKCKVQNQ---VWEEKKHFSIDAYQVNSDPHDIMAEVYKNGPVEVSFIIYEDFAHYK 265
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
SG Y + +V K++GWG +
Sbjct: 266 SGVYKQ------------------------ITGRMVGGHAAKLIGWGTSDA--------- 292
Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
G YW + + + +GD G KI+RG NE IE V
Sbjct: 293 ------------------------GEDYWLLANQWNRGWGDDGYFKIIRGTNECGIEGDV 328
Query: 239 NGALP 243
N +P
Sbjct: 329 NAGMP 333
>gi|224285427|gb|ACN40436.1| unknown [Picea sitchensis]
Length = 350
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 62/245 (25%), Positives = 88/245 (35%), Gaps = 80/245 (32%)
Query: 2 CSSGISSSTWVWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
C G S W + G+VT + + GCQ P C+ L P P
Sbjct: 164 CDGGYPISAWQYFISTGVVTAECDPYFDDAGCQ---------------HPGCEPL-YPTP 207
Query: 60 KCHTRCTNDNYGRGFFQDKYRFK-RYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
+C +C ++N G + RF Y ++ + DI E+ NGPV + +Y D YK
Sbjct: 208 QCVKQCKDENQKWG---NSKRFSATAYRISSKPYDIMAEVYTNGPVEVSFSVYEDFAHYK 264
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
SG ++ Y G Y + VK+VGWG E+G YW +
Sbjct: 265 SG----------------VYKYTKGDY--------MGGHAVKLVGWGTEDGTDYWLVANS 300
Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
+ + WGE+ G KI RG NE IE V
Sbjct: 301 WNTA---------------WGED-------------------GYFKIARGSNECGIEGDV 326
Query: 239 NGALP 243
+P
Sbjct: 327 VAGMP 331
>gi|307201161|gb|EFN81067.1| Uncharacterized peptidase C1-like protein F26E4.3 [Harpegnathos
saltator]
Length = 443
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 49/165 (29%), Positives = 70/165 (42%), Gaps = 55/165 (33%)
Query: 76 QDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYS 135
Q+ Y+ Y + +E DI QEI+ +GPV A M +Y D
Sbjct: 324 QELYKVGPAYRLGNET-DIMQEILTSGPVQATMRVYQD---------------------- 360
Query: 136 DIFSYKSGVYAVSASAEI--VAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATV 193
F YK+GVY S SAE+ Y +++I+GWGEE +Y
Sbjct: 361 -FFVYKNGVYRHSRSAELHDSGYHSMRIIGWGEE--------------------PSYRGP 399
Query: 194 KLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
L YW + +++G +G+ G +I RG NE IES V
Sbjct: 400 PL---------KYWLVANSWGRHWGENGLFRIQRGTNECEIESYV 435
>gi|157058735|gb|ABV03125.1| cathepsin B-16 [Aulacorthum solani]
Length = 246
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 40/116 (34%), Positives = 55/116 (47%), Gaps = 10/116 (8%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + GLVTGG + S GC+P PPC H + + K P K
Sbjct: 137 CHGGYPIKAWKYFSTHGLVTGGNYKSGEGCEPYRVPPCQHHHQGNNSCSDK----PMEKN 192
Query: 62 HTRCTNDNYGRG--FFQDKYRFKR-YYWVNDEVADIQQEIMKNGPVVANMYLYSDI 114
H RCT YG + D +RF R YY++ IQ+++M GP+ A+ +Y D
Sbjct: 193 H-RCTRMCYGDQDLDYNDDHRFTRDYYYLT--YGSIQKDVMNYGPIEASFDVYDDF 245
>gi|116784401|gb|ABK23329.1| unknown [Picea sitchensis]
Length = 350
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 62/245 (25%), Positives = 88/245 (35%), Gaps = 80/245 (32%)
Query: 2 CSSGISSSTWVWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
C G S W + G+VT + + GCQ P C+ L P P
Sbjct: 164 CDGGYPISAWQYFISTGVVTAECDPYFDDAGCQ---------------HPGCEPL-YPTP 207
Query: 60 KCHTRCTNDNYGRGFFQDKYRFK-RYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
+C +C ++N G + RF Y ++ + DI E+ NGPV + +Y D YK
Sbjct: 208 QCVKQCKDENQKWG---NSKRFSATAYRISSKPYDIMAEVYTNGPVEVSFSVYEDFAHYK 264
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
SG ++ Y G Y + VK+VGWG E+G YW +
Sbjct: 265 SG----------------VYKYTKGDY--------MGGHAVKLVGWGTEDGTDYWLVANS 300
Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
+ + WGE+ G KI RG NE IE V
Sbjct: 301 WNTA---------------WGED-------------------GYFKIARGSNECGIEGDV 326
Query: 239 NGALP 243
+P
Sbjct: 327 VAGMP 331
>gi|90074902|dbj|BAE87131.1| unnamed protein product [Macaca fascicularis]
Length = 296
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 41/111 (36%), Positives = 57/111 (51%), Gaps = 6/111 (5%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W + ++GLV+GG + S+ GC+P S PPC H + S P C T PKC
Sbjct: 150 CNGGYPAGAWNFWTRKGLVSGGLYDSHVGCRPYSIPPCEH-HVNGSRPPC-TGEGDTPKC 207
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGP---VVANMY 109
C Y + QDK+ Y V++ DI EI KNG +VAN +
Sbjct: 208 SKIC-EPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGTPYWLVANSW 257
Score = 44.3 bits (103), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 18/46 (39%), Positives = 29/46 (63%)
Query: 201 ENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDN 246
+NG PYW + +++ +GD G KILRG++ IES V +P+ +
Sbjct: 245 KNGTPYWLVANSWNTDWGDNGFFKILRGQDHCGIESEVVAGIPRTD 290
>gi|302823081|ref|XP_002993195.1| hypothetical protein SELMODRAFT_270024 [Selaginella moellendorffii]
gi|300138965|gb|EFJ05715.1| hypothetical protein SELMODRAFT_270024 [Selaginella moellendorffii]
Length = 342
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 61/248 (24%), Positives = 91/248 (36%), Gaps = 76/248 (30%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQP-VSFPPCNHANYTTSEPECKTLATPQPK 60
C G + W + + G+VT + C P C H EPE T P
Sbjct: 166 CDGGYPYAAWEYFAQTGVVT-------SQCDPYFDGKGCKHPG---CEPEYDT-----PV 210
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C +C ++ R K+ + Y VN ++ DIQ EI KNGPV + +Y D YKSG
Sbjct: 211 CVKQCVDNEQWR---DSKHFTVQTYAVNSDIYDIQAEIYKNGPVEVSYTVYEDFAHYKSG 267
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
+Y +F E++ VK +GWG
Sbjct: 268 ------------VYKHVF------------GEVLGGHAVKFIGWGT-------------- 289
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
++G+ YW + +++ +G+ G +I RG NE IES
Sbjct: 290 -------------------TDDGKDYWIVANSWNRSWGEDGFFQISRGSNECGIESEPVA 330
Query: 241 ALPKDNYG 248
+P G
Sbjct: 331 GIPLKKTG 338
>gi|270132817|ref|NP_075965.2| tubulointerstitial nephritis antigen-like precursor [Mus musculus]
gi|270132824|ref|NP_001161805.1| tubulointerstitial nephritis antigen-like precursor [Mus musculus]
gi|61213616|sp|Q99JR5.1|TINAL_MOUSE RecName: Full=Tubulointerstitial nephritis antigen-like; AltName:
Full=Adrenocortical zonation factor 1; Short=AZ-1;
AltName: Full=Androgen-regulated gene 1 protein;
AltName: Full=Tubulointerstitial nephritis
antigen-related protein; Short=TARP; Flags: Precursor
gi|13543125|gb|AAH05738.1| Tinagl1 protein [Mus musculus]
gi|17391278|gb|AAH18539.1| Tinagl1 protein [Mus musculus]
gi|30314458|dbj|BAC76038.1| tubulointersititial nephritis antigen-related protein [Mus
musculus]
gi|148698197|gb|EDL30144.1| tubulointerstitial nephritis antigen-like, isoform CRA_a [Mus
musculus]
gi|148698198|gb|EDL30145.1| tubulointerstitial nephritis antigen-like, isoform CRA_a [Mus
musculus]
Length = 466
Score = 60.8 bits (146), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 63/248 (25%), Positives = 103/248 (41%), Gaps = 72/248 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPEC----KTLATP 57
C G W ++ +RG+V+ C P S N A+ T P C + +
Sbjct: 269 CRGGRLDGAWWFLRRRGVVS-------DNCYPFSGREQNEASPT---PRCMMHSRAMGRG 318
Query: 58 QPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
+ + +RC N G+ D Y+ Y + + +I +E+M+NGPV A M ++
Sbjct: 319 KRQATSRCPN---GQVDSNDIYQVTPAYRLGSDEKEIMKELMENGPVQALMEVHE----- 370
Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
D F Y+ G+Y+ + ++ GRP R
Sbjct: 371 ------------------DFFLYQRGIYSHTPVSQ----------------GRP--EQYR 394
Query: 178 VYAVSASAEIVAYATVKLIGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEA 232
+ +VK+ GWGEE +GR YWT +++G +G++G +I+RG NE
Sbjct: 395 RHGTH---------SVKITGWGEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGTNEC 445
Query: 233 IIESLVNG 240
IE+ V G
Sbjct: 446 DIETFVLG 453
>gi|349604734|gb|AEQ00202.1| Cathepsin B-like protein, partial [Equus caballus]
Length = 134
Score = 60.8 bits (146), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 41/120 (34%), Positives = 55/120 (45%), Gaps = 35/120 (29%)
Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
NGPV A +YSD YKSGVY A + +A V+I+GWG ENG PYW
Sbjct: 41 NGPVEAAFTVYSDFLQYKSGVYQHVAGDMMGGHA-VRILGWGVENGTPYW---------- 89
Query: 184 SAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
L+G +++ +GD G KILRG++ IES + +P
Sbjct: 90 -----------LVG-------------NSWNTDWGDNGFFKILRGQDHCGIESEIVAGIP 125
>gi|194753202|ref|XP_001958906.1| GF12327 [Drosophila ananassae]
gi|190620204|gb|EDV35728.1| GF12327 [Drosophila ananassae]
Length = 431
Score = 60.8 bits (146), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 62/243 (25%), Positives = 86/243 (35%), Gaps = 76/243 (31%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W ++HK+G+V + C P YT CK +
Sbjct: 252 CDGGHLDAAWRFLHKKGVV-------DDSCYP----------YTQQRDTCKIRHNSRSLK 294
Query: 62 HTRCT-NDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C + N R F Y Y +N E DI EI +GPV A M +Y D FSY G
Sbjct: 295 ANGCRPSPNVDRDSF---YTVGPAYTLNRE-GDIMAEIYHSGPVQATMRVYRDFFSYSGG 350
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
Y + + +VK+VGWGEE+
Sbjct: 351 IYRQ---------------------TAANRGAPQGFHSVKLVGWGEEH------------ 377
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
NG YW +++G +G++G +ILRG NE IE V
Sbjct: 378 ---------------------NGDKYWIAANSWGPWWGERGYFRILRGSNECGIEEYVLA 416
Query: 241 ALP 243
+ P
Sbjct: 417 SWP 419
>gi|116779190|gb|ABK21175.1| unknown [Picea sitchensis]
gi|148907952|gb|ABR17096.1| unknown [Picea sitchensis]
gi|224284884|gb|ACN40172.1| unknown [Picea sitchensis]
Length = 350
Score = 60.5 bits (145), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 60/245 (24%), Positives = 86/245 (35%), Gaps = 80/245 (32%)
Query: 2 CSSGISSSTWVWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
C G S W + G+VT + GCQ P C+ L P P
Sbjct: 164 CDGGYPLSAWQYFISTGVVTAECDPYFDEAGCQ---------------HPGCEPL-YPTP 207
Query: 60 KCHTRCTNDNYGRGFFQDKYRFK-RYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
+C +C ++N G + RF Y + + DI E+ GPV + +Y D YK
Sbjct: 208 QCVKQCKDENQNWG---NSKRFSATAYRITSKPYDIMAEVYTKGPVEVDFLVYEDFAHYK 264
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
SG ++ Y +G + + VK++GWG ENG YW +
Sbjct: 265 SG----------------VYKYITG--------DFLGGHAVKLIGWGTENGTDYWLVANS 300
Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
+ + WGE+ G KI RG NE IE V
Sbjct: 301 WNTA---------------WGED-------------------GYFKIARGSNECSIEEDV 326
Query: 239 NGALP 243
+P
Sbjct: 327 VAGMP 331
>gi|195488613|ref|XP_002092389.1| GE11695 [Drosophila yakuba]
gi|194178490|gb|EDW92101.1| GE11695 [Drosophila yakuba]
Length = 431
Score = 60.5 bits (145), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 64/245 (26%), Positives = 90/245 (36%), Gaps = 80/245 (32%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W ++HK+G+V + C P YT CK +
Sbjct: 253 CEGGHLDAAWRYLHKKGVV-------DESCYP----------YTQQRDTCKIRHNSRSLR 295
Query: 62 HTRC-TNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C T N R F Y Y +N E ADI EI +GPV A
Sbjct: 296 ANGCQTPYNVDRDTF---YTVGPAYSLNRE-ADIMAEIFHSGPVQA-------------- 337
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEI--VAYATVKIVGWGEENGRPYWTIVRV 178
M + D F+Y GVY +A+ + + +VK+VGWGEE+
Sbjct: 338 ---------TMRVNRDFFAYAGGVYRQTAANRMAPTGFHSVKLVGWGEEH---------- 378
Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
NG YW +++G +G++G +ILRG NE IE V
Sbjct: 379 -----------------------NGEKYWIAANSWGPWWGERGYFRILRGSNECGIEEYV 415
Query: 239 NGALP 243
+ P
Sbjct: 416 LASWP 420
>gi|30678927|ref|NP_849281.1| cathepsin B [Arabidopsis thaliana]
gi|3859606|gb|AAC72872.1| contains similarity to cysteine proteases (Pfam: PF00112,
E=1.3e-79, N=1) [Arabidopsis thaliana]
gi|7268205|emb|CAB77732.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|332656653|gb|AEE82053.1| cathepsin B [Arabidopsis thaliana]
Length = 359
Score = 60.5 bits (145), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 63/244 (25%), Positives = 88/244 (36%), Gaps = 77/244 (31%)
Query: 2 CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
C G + W + G+VT + NTGC S P C+ A P P
Sbjct: 171 CDGGYPIAAWQYFSYSGVVTEECDPYFDNTGC---------------SHPGCEP-AYPTP 214
Query: 60 KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
KC +C +DN + + + K+ Y V DI E+ KNGPV + +Y D F++
Sbjct: 215 KCSRKCVSDN--KLWSESKHYSVSTYTVKSNPQDIMAEVYKNGPVEVSFTVYED-FAH-- 269
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
YKSGVY + I +A VK++GWG +
Sbjct: 270 --------------------YKSGVYKHITGSNIGGHA-VKLIGWGTSS----------- 297
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
G YW + + + +GD G I RG NE IE
Sbjct: 298 ----------------------EGEDYWLMANQWNRGWGDDGYFMIRRGTNECGIEDEPV 335
Query: 240 GALP 243
LP
Sbjct: 336 AGLP 339
>gi|157058753|gb|ABV03134.1| cathepsin B-84 [Acyrthosiphon pisum]
Length = 230
Score = 60.5 bits (145), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 36/126 (28%), Positives = 61/126 (48%), Gaps = 14/126 (11%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP-- 59
C+ G W + + G+VTGG + + GCQP PPC + E + QP
Sbjct: 113 CNGGYPLKAWQYFKRHGVVTGGDYDTTDGCQPYRVPPC------VKDDEGHNSCSGQPTE 166
Query: 60 ---KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFS 116
KC +C D+ + ++ Y+ K Y++ + +Q++ M GP+ A+ +Y D +
Sbjct: 167 RNHKCSKKCYGDD-TIDYKKNHYKTKDAYYLKNTT--MQKDTMVYGPIEASFDVYDDFMN 223
Query: 117 YKSGKY 122
Y+SG Y
Sbjct: 224 YESGVY 229
>gi|18411686|ref|NP_567215.1| cathepsin B [Arabidopsis thaliana]
gi|13877861|gb|AAK44008.1|AF370193_1 putative cathepsin B cysteine protease [Arabidopsis thaliana]
gi|17473834|gb|AAL38343.1| unknown protein [Arabidopsis thaliana]
gi|21281113|gb|AAM45063.1| putative cathepsin B cysteine protease [Arabidopsis thaliana]
gi|21554165|gb|AAM63244.1| cathepsin B-like cysteine protease, putative [Arabidopsis thaliana]
gi|24417490|gb|AAN60355.1| unknown [Arabidopsis thaliana]
gi|24899725|gb|AAN65077.1| unknown protein [Arabidopsis thaliana]
gi|51968702|dbj|BAD43043.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51969104|dbj|BAD43244.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51969220|dbj|BAD43302.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970472|dbj|BAD43928.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970630|dbj|BAD44007.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970704|dbj|BAD44044.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970802|dbj|BAD44093.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51970974|dbj|BAD44179.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51971008|dbj|BAD44196.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|51971116|dbj|BAD44250.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|62320144|dbj|BAD94342.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|110740287|dbj|BAF02040.1| cathepsin B-like cysteine protease [Arabidopsis thaliana]
gi|332656652|gb|AEE82052.1| cathepsin B [Arabidopsis thaliana]
Length = 359
Score = 60.5 bits (145), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 63/244 (25%), Positives = 88/244 (36%), Gaps = 77/244 (31%)
Query: 2 CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
C G + W + G+VT + NTGC S P C+ A P P
Sbjct: 171 CDGGYPIAAWQYFSYSGVVTEECDPYFDNTGC---------------SHPGCEP-AYPTP 214
Query: 60 KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
KC +C +DN + + + K+ Y V DI E+ KNGPV + +Y D F++
Sbjct: 215 KCSRKCVSDN--KLWSESKHYSVSTYTVKSNPQDIMAEVYKNGPVEVSFTVYED-FAH-- 269
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
YKSGVY + I +A VK++GWG +
Sbjct: 270 --------------------YKSGVYKHITGSNIGGHA-VKLIGWGTSS----------- 297
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
G YW + + + +GD G I RG NE IE
Sbjct: 298 ----------------------EGEDYWLMANQWNRGWGDDGYFMIRRGTNECGIEDEPV 335
Query: 240 GALP 243
LP
Sbjct: 336 AGLP 339
>gi|195026034|ref|XP_001986167.1| GH20676 [Drosophila grimshawi]
gi|193902167|gb|EDW01034.1| GH20676 [Drosophila grimshawi]
Length = 432
Score = 60.5 bits (145), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 43/152 (28%), Positives = 59/152 (38%), Gaps = 54/152 (35%)
Query: 92 ADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASA 151
ADI EI +GPV A M +Y D FSY SG Y + +
Sbjct: 324 ADIMAEIYHSGPVQATMTVYRDFFSYSSGVYQ---------------------HTAANRG 362
Query: 152 EIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVS 211
+ +VK+VGWGEE+ NG YW +
Sbjct: 363 AATGFHSVKLVGWGEEH---------------------------------NGVKYWIAAN 389
Query: 212 TFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
++G +G++G +ILRG NE IE V + P
Sbjct: 390 SWGPWWGERGYFRILRGSNECGIEEYVLASWP 421
>gi|12060418|dbj|BAB20596.1| ARG1 [Mus musculus]
gi|71059879|emb|CAJ18483.1| Lcn7 [Mus musculus]
Length = 415
Score = 60.5 bits (145), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 63/248 (25%), Positives = 103/248 (41%), Gaps = 72/248 (29%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPEC----KTLATP 57
C G W ++ +RG+V+ C P S N A+ T P C + +
Sbjct: 218 CRGGRLDGAWWFLRRRGVVS-------DNCYPFSGREQNEASPT---PRCMMHSRAMGRG 267
Query: 58 QPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
+ + +RC N G+ D Y+ Y + + +I +E+M+NGPV A M ++
Sbjct: 268 KRQATSRCPN---GQVDSNDIYQVTPAYRLGSDEKEIMKELMENGPVQALMEVHE----- 319
Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
D F Y+ G+Y+ + ++ GRP R
Sbjct: 320 ------------------DFFLYQRGIYSHTPVSQ----------------GRP--EQYR 343
Query: 178 VYAVSASAEIVAYATVKLIGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEA 232
+ +VK+ GWGEE +GR YWT +++G +G++G +I+RG NE
Sbjct: 344 RHGTH---------SVKITGWGEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGTNEC 394
Query: 233 IIESLVNG 240
IE+ V G
Sbjct: 395 DIETFVLG 402
>gi|321446975|gb|EFX60976.1| hypothetical protein DAPPUDRAFT_274869 [Daphnia pulex]
Length = 71
Score = 60.5 bits (145), Expect = 9e-07, Method: Composition-based stats.
Identities = 27/70 (38%), Positives = 41/70 (58%)
Query: 175 IVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 234
+VR + + V ++++GWG E G PYW I + + +GD G IK+LRG++ I
Sbjct: 1 MVRRLPTNVHGKAVGGHAIRILGWGVEEGVPYWLIANNWNTDWGDNGYIKLLRGKDHCGI 60
Query: 235 ESLVNGALPK 244
ES + G LPK
Sbjct: 61 ESQITGGLPK 70
>gi|289724789|gb|ADD18342.1| putative cysteine proteinase TIN-ag [Glossina morsitans morsitans]
Length = 387
Score = 60.5 bits (145), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 60/246 (24%), Positives = 92/246 (37%), Gaps = 82/246 (33%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W ++HK+G+V + C P Y CK P
Sbjct: 208 CNGGHLDAAWRYLHKQGVV-------DESCYP----------YVGYRDACKI---PH--- 244
Query: 62 HTRCTNDNYGRGFF----QDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
++R +N R + + Y Y +N+E DI EI +GPV A + +Y D FSY
Sbjct: 245 NSRSLRNNGCRSYSGVDRDELYTVGPAYSLNNET-DIMAEIFMSGPVQATLTVYRDFFSY 303
Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
G Y + ++ V + +VK++GWGEE+
Sbjct: 304 SGGIY---------------------RHTAASRGSPVGFHSVKLIGWGEEH--------- 333
Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
+G YW +++G +G+ G +ILRG NE IE
Sbjct: 334 ------------------------DGNKYWIATNSWGTWWGEHGNFRILRGSNECGIEEY 369
Query: 238 VNGALP 243
V A P
Sbjct: 370 VLAAWP 375
>gi|324105223|gb|ADY18374.1| cathepsin B [Glycera tridactyla]
Length = 117
Score = 60.5 bits (145), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 58/118 (49%), Gaps = 3/118 (2%)
Query: 5 GISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTR 64
G S W + G+VTGG ++++ GC+P + P C H + + P C + P P+C +
Sbjct: 1 GFPRSAWEYFKVTGIVTGGQYNTHEGCRPYTIPKCEH-HVNGTLPPCSSTIKPTPRCERK 59
Query: 65 CTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKY 122
C Y + + K+ Y V + A I+QEI KNG + + +D +SG Y
Sbjct: 60 C-ESGYSTDYQKXKHHGVTVYNVESDEAQIRQEIYKNG-QRSCFHRLADFPQLQSGVY 115
>gi|403293251|ref|XP_003937634.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
[Saimiri boliviensis boliviensis]
Length = 436
Score = 60.1 bits (144), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 67/250 (26%), Positives = 98/250 (39%), Gaps = 76/250 (30%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVS---------FPPCNHANYTTSEPECK 52
C G W ++ +RG+V+ C P S PPC ++ + K
Sbjct: 239 CRGGRLDGAWWFLRRRGVVS-------DHCYPFSGRERDKAGPAPPC--MMHSRAMGRGK 289
Query: 53 TLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYS 112
AT C N+N + Y+ Y + +I +E+M+NGPV A M ++
Sbjct: 290 RQATAH--CPNGHVNNN-------NIYQVTPAYRLGSNDTEIMKELMENGPVQALMEVHE 340
Query: 113 DIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPY 172
D F YK G Y + PV L + G + +VKI GWGEE RP
Sbjct: 341 DFFLYKGGIYSHTPV----NLGRPERYRRHGTH------------SVKITGWGEET-RP- 382
Query: 173 WTIVRVYAVSASAEIVAYATVKLIGWGEENGR--PYWTIVSTFGEQFGDKGTIKILRGRN 230
+GR YWT +++G +G++G +I+RG N
Sbjct: 383 -----------------------------DGRKLKYWTAANSWGPAWGERGHFRIVRGVN 413
Query: 231 EAIIESLVNG 240
E IES V G
Sbjct: 414 ECDIESFVLG 423
>gi|149030260|gb|EDL85316.1| rCG52258, isoform CRA_c [Rattus norvegicus]
Length = 130
Score = 60.1 bits (144), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 45/175 (25%), Positives = 67/175 (38%), Gaps = 58/175 (33%)
Query: 70 YGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVA 129
Y + +DK+ Y V+D +I EI KNGPV ++SD +YKSG Y +
Sbjct: 6 YSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKH----- 60
Query: 130 NMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVA 189
+ +++ ++I+GWG ENG PYW + + V
Sbjct: 61 -------------------EAGDVMGGHAIRILGWGIENGVPYWLVANSWNV-------- 93
Query: 190 YATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
+GD G KILRG N IES + +P+
Sbjct: 94 --------------------------DWGDNGFFKILRGENHCGIESEIVAGIPR 122
>gi|145540170|ref|XP_001455775.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124423583|emb|CAK88378.1| unnamed protein product [Paramecium tetraurelia]
Length = 500
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 37/122 (30%), Positives = 57/122 (46%), Gaps = 25/122 (20%)
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y NGPV+ N D Y+SG+Y A + W + RP W V
Sbjct: 382 YTNGPVIMNFEPSYDFMYYESGIYHSVAEHD-----------WSTQE-RPEWEKVD---- 425
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+V GWGEE+G +W + +++G Q+G+ G+ ++ RG +E+ IES+ A
Sbjct: 426 ---------HSVLCYGWGEEDGVKFWLLQNSWGSQWGENGSFRMKRGVDESAIESMAEAA 476
Query: 242 LP 243
P
Sbjct: 477 DP 478
>gi|159950|gb|AAA29435.1| cathepsin B-like cysteine protease, partial [Ostertagia ostertagi]
Length = 105
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 34/95 (35%), Positives = 48/95 (50%), Gaps = 24/95 (25%)
Query: 82 KRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYK 141
K+ Y + + V IQ++IMKNGPVVA +Y D Y+SG +Y K
Sbjct: 1 KKAYQLKNSVKAIQKDIMKNGPVVATYTVYEDFAHYRSG------------IYKHKAGRK 48
Query: 142 SGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
+G++A VK++GWGEE G PYW +
Sbjct: 49 TGLHA------------VKVIGWGEEKGTPYWIVA 71
>gi|302764096|ref|XP_002965469.1| hypothetical protein SELMODRAFT_143272 [Selaginella moellendorffii]
gi|300166283|gb|EFJ32889.1| hypothetical protein SELMODRAFT_143272 [Selaginella moellendorffii]
Length = 331
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 62/248 (25%), Positives = 91/248 (36%), Gaps = 76/248 (30%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQP-VSFPPCNHANYTTSEPECKTLATPQPK 60
C G + W + + G+VT + C P C H EPE T P
Sbjct: 155 CEGGYPYAAWEYFAQTGVVT-------SQCDPYFDGKGCKHPG---CEPEYDT-----PV 199
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C +C ++ R K+ + Y VN ++ DIQ EI KNGPV + +Y D YKSG
Sbjct: 200 CVKQCVDNEQWR---DSKHFTVQTYAVNSDIYDIQAEIYKNGPVEVSYTVYEDFAHYKSG 256
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
+Y +F G +A VK +GWG
Sbjct: 257 ------------VYKHVFGQVLGGHA------------VKFIGWGT-------------- 278
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
++G+ YW + +++ +G+ G +I RG NE IES
Sbjct: 279 -------------------TDDGKDYWIVANSWNRSWGEDGFFQISRGSNECGIESEPVA 319
Query: 241 ALPKDNYG 248
+P G
Sbjct: 320 GIPLKKTG 327
>gi|403293249|ref|XP_003937633.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
[Saimiri boliviensis boliviensis]
Length = 467
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 67/250 (26%), Positives = 98/250 (39%), Gaps = 76/250 (30%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVS---------FPPCNHANYTTSEPECK 52
C G W ++ +RG+V+ C P S PPC ++ + K
Sbjct: 270 CRGGRLDGAWWFLRRRGVVS-------DHCYPFSGRERDKAGPAPPC--MMHSRAMGRGK 320
Query: 53 TLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYS 112
AT C N+N + Y+ Y + +I +E+M+NGPV A M ++
Sbjct: 321 RQATAH--CPNGHVNNN-------NIYQVTPAYRLGSNDTEIMKELMENGPVQALMEVHE 371
Query: 113 DIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPY 172
D F YK G Y + PV L + G + +VKI GWGEE RP
Sbjct: 372 DFFLYKGGIYSHTPV----NLGRPERYRRHGTH------------SVKITGWGEET-RP- 413
Query: 173 WTIVRVYAVSASAEIVAYATVKLIGWGEENGR--PYWTIVSTFGEQFGDKGTIKILRGRN 230
+GR YWT +++G +G++G +I+RG N
Sbjct: 414 -----------------------------DGRKLKYWTAANSWGPAWGERGHFRIVRGVN 444
Query: 231 EAIIESLVNG 240
E IES V G
Sbjct: 445 ECDIESFVLG 454
>gi|71656032|ref|XP_816569.1| cysteine peptidase C (CPC) [Trypanosoma cruzi strain CL Brener]
gi|70881707|gb|EAN94718.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi]
Length = 333
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 49/175 (28%), Positives = 73/175 (41%), Gaps = 37/175 (21%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G W + G+V+ CQP FP C H ++ C P C
Sbjct: 160 CNGGYPEVAWEYYAVHGIVS-------EYCQPYPFPSCAHHVNSSDLSPCSG-EYDTPTC 211
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
++ CT+ + KYR Y ++ E ++E++ NGP + +Y+D +Y G
Sbjct: 212 NSTCTD----KKVPLIKYRGNTSYLLSGE-ESFKRELLLNGPFEVSFSVYADFLAYTGGV 266
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
Y + VA +L G +A V+IVGWGE NG PYW I
Sbjct: 267 YKH---VAGTFL---------GGHA------------VRIVGWGELNGEPYWKIA 297
>gi|290973351|ref|XP_002669412.1| predicted protein [Naegleria gruberi]
gi|284082959|gb|EFC36668.1| predicted protein [Naegleria gruberi]
Length = 488
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 38/115 (33%), Positives = 51/115 (44%), Gaps = 26/115 (22%)
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y GP+ +Y D F+YK GVY S + + + GW E N
Sbjct: 390 YHGGPLAIAFEVYDDFFNYKGGVYTHSTALK----TKIAEPGWEETN------------- 432
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
V L+GWGEENG PYW + +++G +G G KI RG +E ES
Sbjct: 433 ---------HAVLLVGWGEENGVPYWLVKNSWGTSWGINGFFKIKRGTDECDCES 478
Score = 41.2 bits (95), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 34/140 (24%), Positives = 54/140 (38%), Gaps = 19/140 (13%)
Query: 43 NYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNG 102
+Y +E C P + C D + + Y + ++ ++ E+ G
Sbjct: 338 DYGLAEESCD----PYKGVDSVCKKDQCPKRAYGTNYAYTGGFYGATNAKNMMYELYHGG 393
Query: 103 PVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIV 162
P+ +Y D F+YK G +Y +S K E +A V +V
Sbjct: 394 PLAIAFEVYDDFFNYKGG----------VYTHSTALKTK----IAEPGWEETNHA-VLLV 438
Query: 163 GWGEENGRPYWTIVRVYAVS 182
GWGEENG PYW + + S
Sbjct: 439 GWGEENGVPYWLVKNSWGTS 458
>gi|356572872|ref|XP_003554589.1| PREDICTED: cathepsin B-like [Glycine max]
Length = 356
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 63/244 (25%), Positives = 88/244 (36%), Gaps = 77/244 (31%)
Query: 2 CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
C G W ++ G+VT + GC S P C EP +T P
Sbjct: 168 CDGGYPLYAWQYLAHHGVVTEECDPYFDQIGC---SHPGC--------EPAYRT-----P 211
Query: 60 KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
KC +C + N + + + K+ Y V+ + DI E+
Sbjct: 212 KCVKKCVSGN--QVWKKSKHYSVNAYRVSSDPHDIMTEV--------------------- 248
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
Y NGPV +Y D YKSGVY E+ +A VK++GWG
Sbjct: 249 --YKNGPVEVAFTVYEDFAHYKSGVYKHITGYELGGHA-VKLIGWGT------------- 292
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
E+G YW + + + ++GD G KI RG NE IE V
Sbjct: 293 --------------------TEDGEDYWLLANQWNREWGDDGYFKIRRGTNECGIEEDVT 332
Query: 240 GALP 243
LP
Sbjct: 333 AGLP 336
>gi|328702238|ref|XP_001943280.2| PREDICTED: cathepsin B-like cysteine proteinase 5-like
[Acyrthosiphon pisum]
Length = 328
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 46/171 (26%), Positives = 72/171 (42%), Gaps = 44/171 (25%)
Query: 10 TWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDN 69
W ++ GLV+GG ++++ GCQP PP + + + K +T C +
Sbjct: 161 VWEYLKSHGLVSGGKYNTSDGCQPSKIPPIE-----------EYMEYSEIKNYT-CNDHC 208
Query: 70 YGR---GFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGP 126
YG + D + YY V E DIQ+E+ GPV Y+ DIF+ P
Sbjct: 209 YGNKTINYNDDHVKVSNYYQVQYE--DIQEEVQNYGPVSVEFYIRDDIFT---------P 257
Query: 127 VVANMYLYSDIFSYKSGVYAVSASAEIVAY-ATVKIVGWGEENGRPYWTIV 176
++ ++ + Y VK++GWG ENG YW +V
Sbjct: 258 FLS-----------------INPRFQRRKYKGYVKLIGWGVENGEDYWLLV 291
>gi|224064400|ref|XP_002301457.1| predicted protein [Populus trichocarpa]
gi|222843183|gb|EEE80730.1| predicted protein [Populus trichocarpa]
Length = 357
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 61/244 (25%), Positives = 86/244 (35%), Gaps = 77/244 (31%)
Query: 2 CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
C+ G S W + G+VT + + GC S P C+ P P
Sbjct: 169 CNGGYPISAWRYFVHHGVVTEECDPYFDDIGC---------------SHPGCEP-GYPTP 212
Query: 60 KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
KC +C N N + + + K+ + Y ++ + I EI
Sbjct: 213 KCARKCVNKN--QLWKKSKHYGVKPYRIDSDPESIMAEI--------------------- 249
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
Y NGPV +Y D YKSGVY + +A VK++GWG
Sbjct: 250 --YKNGPVEVAFTVYEDFAHYKSGVYKHITGGMMGGHA-VKLIGWGT------------- 293
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
E+G YW + + + +GD G KI RG NE IE V
Sbjct: 294 --------------------SEDGEAYWLLANQWNRGWGDDGYFKIRRGTNECGIEGDVV 333
Query: 240 GALP 243
LP
Sbjct: 334 AGLP 337
>gi|262217337|gb|ACY38050.1| cathepsin B [Dactylis glomerata]
Length = 348
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 63/258 (24%), Positives = 89/258 (34%), Gaps = 82/258 (31%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQP-VSFPPCNHANYTTSEPECKTLATPQPK 60
C G W + + G+VT C P C H P C+ A PK
Sbjct: 162 CDGGYPIKAWQYFVQSGVVT-------EECDPYFDQVGCKH-------PGCEP-AYDTPK 206
Query: 61 CHTRCTNDNYGRGFFQDKYRFK-RYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
C +C N +++K F Y VN + DI E+ KNGPV +Y D YKS
Sbjct: 207 CEKKCKVQNQ---VWEEKKHFSINAYRVNSDPHDIMAEVYKNGPVEVAFTVYEDFAHYKS 263
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
G Y + + ++ VK++GWG +
Sbjct: 264 GVYKH------------------------VTGGVMGGHAVKLIGWGTSDA---------- 289
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
G YW + + + +GD G KI+RG+NE IE V
Sbjct: 290 -----------------------GEDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEEVV 326
Query: 240 GALPKD-----NYGVEFG 252
+P N+G FG
Sbjct: 327 AGMPSTKNMAGNHGSAFG 344
>gi|326916361|ref|XP_003204476.1| PREDICTED: tubulointerstitial nephritis antigen-like [Meleagris
gallopavo]
Length = 467
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 47/166 (28%), Positives = 70/166 (42%), Gaps = 55/166 (33%)
Query: 79 YRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIF 138
YR +Y ++ + DI +EIM GPV A M +Y D F YK G Y +
Sbjct: 354 YRCASHYRISSKETDIMEEIMAKGPVQAIMKVYEDFFLYKEGIYRH-------------- 399
Query: 139 SYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGW 198
SYK+G + S VK++GWG G+
Sbjct: 400 SYKAGSKWKTHS--------VKLLGWGSLPGK---------------------------- 423
Query: 199 GEENG--RPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 242
NG + +W +++G+ +G+ G +ILRG+NE IE L+ L
Sbjct: 424 ---NGQKQKFWIAANSWGKYWGENGYFRILRGQNECDIEKLILTTL 466
>gi|86451908|gb|ABC97349.1| cathepsin B [Streblomastix strix]
Length = 312
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 65/245 (26%), Positives = 93/245 (37%), Gaps = 80/245 (32%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G S+ + ++ G++ C P C H P C T P PKC
Sbjct: 144 CNGGWMSTAFGFMQSNGIL-------GEDCIPYQMGKCKH-------PGCSTW--PTPKC 187
Query: 62 H-TRC-TNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
+ T+C ND + + Y V ADIQ+EI +NGPV A+ +Y D+ Y+S
Sbjct: 188 NKTKCYPNDTKS----TELWHAASSYSVRSNEADIQKEIYENGPVTASFAVYEDLSVYQS 243
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
G +Y + G++A +K+VGWG +G YWTIV +
Sbjct: 244 G------------VYQHVTGGFEGLHA------------IKVVGWGILDGVKYWTIVNSW 279
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
A E +G G + I RG +E IES V
Sbjct: 280 A----------------------------------EDWGFDGLLLIRRGVDECGIESDVV 305
Query: 240 GALPK 244
PK
Sbjct: 306 AGQPK 310
>gi|428168267|gb|EKX37214.1| hypothetical protein GUITHDRAFT_78289 [Guillardia theta CCMP2712]
Length = 224
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 45/156 (28%), Positives = 62/156 (39%), Gaps = 62/156 (39%)
Query: 87 VNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYA 146
+ D V IQ EI+ NGPV A ++YSD +Y G Y
Sbjct: 125 IQDNVRQIQSEILSNGPVFAAFWVYSDFMAYTGGVY------------------------ 160
Query: 147 VSASAEIVAYA-----TVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEE 201
SAS E +A V +VGWG + +E
Sbjct: 161 -SASKEALAQGKTGGHAVMMVGWGTD--------------------------------KE 187
Query: 202 NGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
G+ YW + +++ E++GDKG KI RG +E IESL
Sbjct: 188 TGQDYWLLQNSWSEKWGDKGRFKIKRGVDECGIESL 223
>gi|363732245|ref|XP_419905.3| PREDICTED: tubulointerstitial nephritis antigen [Gallus gallus]
Length = 467
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 48/166 (28%), Positives = 70/166 (42%), Gaps = 55/166 (33%)
Query: 79 YRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIF 138
YR +Y V+ + DI +EIM GPV A M +Y D F YK G Y +
Sbjct: 354 YRCGSHYRVSSKETDIMEEIMAKGPVQAIMKVYEDFFLYKEGIYRH-------------- 399
Query: 139 SYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGW 198
SYK+G + S VK++GWG G+
Sbjct: 400 SYKAGSKWKTHS--------VKLLGWGSLPGK---------------------------- 423
Query: 199 GEENG--RPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 242
NG + +W +++G+ +G+ G +ILRG+NE IE L+ L
Sbjct: 424 ---NGQKQKFWIAANSWGKYWGENGYFRILRGQNECDIEKLILTTL 466
>gi|134023803|gb|AAI35570.1| LOC100124858 protein [Xenopus (Silurana) tropicalis]
Length = 484
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 56/239 (23%), Positives = 89/239 (37%), Gaps = 56/239 (23%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W ++ +RG+V+ C P + N + + +++ + +
Sbjct: 288 CRGGRVDGAWWYLRRRGVVS-------EPCYPFTSLNTN-GHSAPCMMQSRSMGRGKRQA 339
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C N Y + Y+ Y + DI +E+ +NGPV A M ++ D F YKSG
Sbjct: 340 TNNCPNQYYSS---NEIYQSTPAYRLASSEKDIMKELYENGPVQAIMEVHEDFFMYKSGI 396
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y PV + G + +VKI GWGEE GR
Sbjct: 397 YRRTPVTER----EPEHHRRHGTH------------SVKITGWGEERGR----------- 429
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
+ YW +++G +G+ G +I RG NE IE+ + G
Sbjct: 430 ------------------DGQTHKYWLAANSWGRDWGEDGYFRIARGENECEIETFIVG 470
>gi|15723280|gb|AAL06328.1| cathepsin B-like protease [Trypanosoma cruzi]
Length = 208
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 51/176 (28%), Positives = 76/176 (43%), Gaps = 39/176 (22%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPK 60
C+ G W + G+V+ CQP FP C +H N + P TP
Sbjct: 65 CNGGYPEVAWEYYAVHGIVS-------EYCQPYPFPSCAHHVNSSDLSPCSGEYDTPT-- 115
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C++ CT+ + KYR Y ++ E ++E++ NGP + +Y+D +Y G
Sbjct: 116 CNSTCTD----KKVPLIKYRGNTSYLLSGE-ESFKRELLLNGPFEVSFSVYADFLAYTGG 170
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
Y + VA ++L G +A V+IVGWGE NG PYW I
Sbjct: 171 VYKH---VAGIFL---------GGHA------------VRIVGWGELNGEPYWKIA 202
>gi|130502070|ref|NP_001076255.1| tubulointerstitial nephritis antigen [Oryctolagus cuniculus]
gi|818411|gb|AAC48477.1| tubulointerstitial nephritis antigen [Oryctolagus cuniculus]
Length = 474
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 64/246 (26%), Positives = 98/246 (39%), Gaps = 69/246 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+SG W ++ KRGLV+ C P+ F N +N T + + A + K
Sbjct: 282 CNSGSIDRAWWYLRKRGLVSHA-------CYPL-FKDQNISNNTCAM---TSKADGRGKR 330
Query: 62 H-TRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
H TR +N + Y+ Y V+ +I +EIM+NGPV A M ++ D F YK+G
Sbjct: 331 HATRPCPNNIEKS--NRIYQCSPPYRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKTG 388
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
Y + +S + E Y ++
Sbjct: 389 IYR---------------------HVISTNEESEKYRKLQT------------------- 408
Query: 181 VSASAEIVAYATVKLIGWGEENG-----RPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
VKL GWG G +W +++G+ +G+ G +ILRG NE+ IE
Sbjct: 409 ----------HAVKLTGWGTLKGARGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIE 458
Query: 236 SLVNGA 241
L+ A
Sbjct: 459 KLIIAA 464
>gi|21693|emb|CAA46810.1| cathepsin B [Triticum aestivum]
Length = 305
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 59/245 (24%), Positives = 85/245 (34%), Gaps = 79/245 (32%)
Query: 2 CSSGISSSTWVWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
C G S W + + G+VT + GC+ P C EP A P P
Sbjct: 125 CDGGYPISAWQYFVQNGVVTDECDPYFDQVGCK---HPGC--------EP-----AYPTP 168
Query: 60 KCHTRCTNDNYGRGFFQDKYRFK-RYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
C +C N +++K F Y VN + DI E+ NGPV +Y D YK
Sbjct: 169 VCEKKCKVQNQ---VWEEKKHFSINAYQVNSDPHDIMAEVYNNGPVEVAFTVYEDFAHYK 225
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
SG Y + + ++ VK++GWG +
Sbjct: 226 SGVYKH------------------------ITGGVMGGHAVKLIGWGTSDA--------- 252
Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
G YW + + + +GD G KI+RG+NE IE V
Sbjct: 253 ------------------------GEDYWLLANQWNRGWGDDGYFKIIRGKNECGIEEDV 288
Query: 239 NGALP 243
+P
Sbjct: 289 TAGMP 293
>gi|3088522|gb|AAD03404.1| cathepsin B-like protease precursor [Trypanosoma cruzi]
gi|407859283|gb|EKG06969.1| cysteine peptidase C (CPC) [Trypanosoma cruzi]
Length = 333
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 45/174 (25%), Positives = 72/174 (41%), Gaps = 37/174 (21%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G W + G+V+ CQP FP C H ++ C P C
Sbjct: 160 CNGGYPEVAWEYYAVHGIVS-------EYCQPYPFPSCAHHVNSSDLSPCSG-EYDTPTC 211
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
++ CT+ + KYR Y ++ E ++E++ NGP + +Y+D +Y G
Sbjct: 212 NSTCTD----KKIPLIKYRGNTSYILSGE-ESFKRELLLNGPFEVSFSVYADFVAYTGG- 265
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
++ + +GV+ + V+IVGWGE NG PYW I
Sbjct: 266 ---------------VYKHVTGVF--------LGGHAVRIVGWGELNGEPYWKI 296
>gi|268572255|ref|XP_002648916.1| Hypothetical protein CBG17829 [Caenorhabditis briggsae]
Length = 220
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/152 (27%), Positives = 61/152 (40%), Gaps = 58/152 (38%)
Query: 85 YWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGV 144
Y+V V+ IQ EIM NGPVV +Y D++ YKSG Y +
Sbjct: 115 YYVGMTVSAIQTEIMTNGPVVGVFTMYEDMYKYKSGVYRH-------------------- 154
Query: 145 YAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGR 204
+ ++ +KI+GWG +NG PY
Sbjct: 155 ----TAGRLLGGHAIKIIGWGTQNGIPY-------------------------------- 178
Query: 205 PYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
W I +++G ++G+ G KI RG NE IE+
Sbjct: 179 --WLIANSWGTKWGENGFFKIRRGVNECGIEN 208
>gi|161343857|tpg|DAA06109.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 163
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 51/223 (22%), Positives = 82/223 (36%), Gaps = 69/223 (30%)
Query: 30 GCQPVSFPPCNHANYTTSEPE--------CKTLATPQPKCHTRCTNDNYGRGFFQDKYRF 81
G QP PCN A+ T ++P C PKC C N + + D +
Sbjct: 1 GRQPWLVQPCN-ASTTAADPSSVLGPHGVCGGDPATTPKCDLSCYNARHEGKYLDDIIKA 59
Query: 82 KRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYK 141
K+ + + A ++ + K+GP V M +Y D +YKSG Y +
Sbjct: 60 KKVFTFDGCSA--RKNLRKHGPYVVTMRVYEDFLAYKSGVYHH----------------- 100
Query: 142 SGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEE 201
+ + + +V+++GWG E G
Sbjct: 101 -------VTGDYLGLLSVRMIGWGLEGG-------------------------------- 121
Query: 202 NGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
+ +W +++G +GDKG KI R NE IE+ +PK
Sbjct: 122 --QAFWLFANSWGTSWGDKGFFKIRRFVNERWIENFRYAGVPK 162
>gi|253747613|gb|EET02212.1| Hypothetical protein GL50581_498 [Giardia intestinalis ATCC 50581]
Length = 807
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/135 (28%), Positives = 54/135 (40%), Gaps = 33/135 (24%)
Query: 109 YLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEEN 168
Y S + + Y NGP+ +MYL +D
Sbjct: 184 YRLSGVDAMMRDIYQNGPIAVSMYLANDF------------------------------- 212
Query: 169 GRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 228
P +Y + ++ V ++GWGEENG PYW +T+G +GD G +I RG
Sbjct: 213 --PPKDKKSIYVSGPNTKLSGGHAVMIVGWGEENGVPYWDCANTYGTNWGDHGYFRIKRG 270
Query: 229 RNEAIIESLVNGALP 243
NE IE+ ALP
Sbjct: 271 SNELKIETWPGAALP 285
>gi|71424150|ref|XP_812694.1| cysteine peptidase C (CPC) [Trypanosoma cruzi strain CL Brener]
gi|70877506|gb|EAN90843.1| cysteine peptidase C (CPC), putative [Trypanosoma cruzi]
Length = 333
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 49/174 (28%), Positives = 74/174 (42%), Gaps = 37/174 (21%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G W + G+V+ CQP FP C H ++ C P C
Sbjct: 160 CNGGYPEVAWEYYAVHGIVS-------EYCQPYPFPSCAHHVNSSDLSPCSG-EYDTPTC 211
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
++ CT+ + KYR Y ++ E ++E++ NGP + +Y+D +Y G
Sbjct: 212 NSTCTD----KKIPLIKYRGNTSYVLSGE-EPFKRELILNGPFEVSFSVYADFVAYTGGV 266
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
Y + VA ++L G +A V+IVGWGE NG PYW I
Sbjct: 267 YKH---VAGIFL---------GGHA------------VRIVGWGELNGEPYWKI 296
>gi|294890224|ref|XP_002773108.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239878009|gb|EER04924.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 109
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 37/121 (30%), Positives = 55/121 (45%), Gaps = 37/121 (30%)
Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
+GPV A+ +Y D +Y+SGVY ++ E+ +A
Sbjct: 25 DGPVSASFIVYEDFLAYRSGVYKHTSGKELGGHA-------------------------- 58
Query: 184 SAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
VK+IGWGEE G+ YW +V+++ E +GD G KI G E I+ + G P
Sbjct: 59 ---------VKIIGWGEETGQAYWLVVNSWNEDWGDNGLFKIALGNCE--IDDDLLGGTP 107
Query: 244 K 244
K
Sbjct: 108 K 108
>gi|195346663|ref|XP_002039877.1| GM15657 [Drosophila sechellia]
gi|194135226|gb|EDW56742.1| GM15657 [Drosophila sechellia]
Length = 431
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 61/243 (25%), Positives = 85/243 (34%), Gaps = 76/243 (31%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W ++HK+G+V + C P YT CK +
Sbjct: 253 CEGGHLDAAWRYLHKKGVV-------DENCYP----------YTQHRDTCKIRHNSRSLR 295
Query: 62 HTRC-TNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C T N R Y Y +N E ADI EI +GPV A M + D F+Y G
Sbjct: 296 ANGCQTPVNVDRDTL---YTVGPAYSLNRE-ADIMAEIFHSGPVQATMRVNRDFFAYSGG 351
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
Y + + + +VK+VGWGEE+
Sbjct: 352 VYRE---------------------TAANRKALTGFHSVKLVGWGEEH------------ 378
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
NG YW +++G +G+ G +ILRG NE IE V
Sbjct: 379 ---------------------NGEKYWIAANSWGSWWGEHGYFRILRGSNECGIEDYVLA 417
Query: 241 ALP 243
+ P
Sbjct: 418 SWP 420
>gi|307175943|gb|EFN65753.1| Uncharacterized peptidase C1-like protein F26E4.3 [Camponotus
floridanus]
Length = 443
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 45/149 (30%), Positives = 62/149 (41%), Gaps = 54/149 (36%)
Query: 92 ADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASA 151
DI QEI+ +GPV A M +Y D F Y+SGVY S SA
Sbjct: 339 TDIMQEILTSGPVQATMRVYQD-----------------------FFVYQSGVYRHSRSA 375
Query: 152 EI--VAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTI 209
E+ Y +V+I+GWGEE +Y L YW +
Sbjct: 376 ELHDSGYHSVRIIGWGEEP--------------------SYRGPPL---------KYWLV 406
Query: 210 VSTFGEQFGDKGTIKILRGRNEAIIESLV 238
+++G +G+ G +I +G NE IES V
Sbjct: 407 ANSWGHNWGENGLFRIQKGTNECEIESYV 435
>gi|294897889|ref|XP_002776090.1| cysteine protease Cys2, putative [Perkinsus marinus ATCC 50983]
gi|239882699|gb|EER07906.1| cysteine protease Cys2, putative [Perkinsus marinus ATCC 50983]
Length = 134
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 45/189 (23%), Positives = 72/189 (38%), Gaps = 61/189 (32%)
Query: 59 PKCHTRCTNDNYGRGFFQDKYRFKRYYWVN-DEVADIQQEIMKNGPVVANMYLYSDIFSY 117
P C + C N YG F +D++ + + + I++EIM NGP A +Y D SY
Sbjct: 6 PSCSSSCPNAKYGTAFDKDRHYTESLFPSRFGSTSSIKKEIMTNGPTSAAFSVYEDFLSY 65
Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
KSG Y + S + V+I+GWG E G
Sbjct: 66 KSGVYKH------------------------TSGGFLGGHAVEIIGWGTEKG-------- 93
Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
YW +++++ E++GD GT KI++G + I+ +
Sbjct: 94 --------------------------VDYWLVMNSWNEEWGDHGTFKIVQG--DCGIDDM 125
Query: 238 VNGALPKDN 246
+ P N
Sbjct: 126 ILAGTPAIN 134
>gi|15723276|gb|AAL06326.1| cathepsin B-like protease [Trypanosoma cruzi]
Length = 208
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 51/176 (28%), Positives = 75/176 (42%), Gaps = 39/176 (22%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPK 60
C+ G W + G+V+ CQP FP C +H N + P TP
Sbjct: 65 CNGGYPEVAWEYYAVHGIVS-------EYCQPYPFPSCAHHVNSSDLSPCSGEYDTPT-- 115
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C++ CT+ + KYR Y ++ E ++E++ NGP + +Y+D +Y G
Sbjct: 116 CNSTCTD----KKVPLIKYRGNTSYLLSGE-ESFKRELLLNGPFEVSFSVYADFLAYTGG 170
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
Y + VA +L G +A V+IVGWGE NG PYW I
Sbjct: 171 VYKH---VAGTFL---------GGHA------------VRIVGWGELNGEPYWKIA 202
>gi|312283137|dbj|BAJ34434.1| unnamed protein product [Thellungiella halophila]
Length = 362
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 64/244 (26%), Positives = 88/244 (36%), Gaps = 77/244 (31%)
Query: 2 CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
C G + W + G+VT + +TGC S P C EP A P P
Sbjct: 174 CDGGYPIAAWQYFSYSGVVTEECDPYFDDTGC---SHPGC--------EP-----AYPTP 217
Query: 60 KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
KC +C + N + + Q K+ Y V DI E+ KNGPV + +Y D F++
Sbjct: 218 KCMRKCVSGN--QLWSQSKHYSVSTYTVKSNPQDIMAEVYKNGPVEVSFTVYED-FAH-- 272
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
YKSGVY + I +A VK++GWG
Sbjct: 273 --------------------YKSGVYKHITGSNIGGHA-VKLIGWGT------------- 298
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
+ G YW + + + +GD G I RG NE IE
Sbjct: 299 --------------------TDEGEDYWLLANQWNRSWGDDGYFMIRRGTNECGIEDEPV 338
Query: 240 GALP 243
LP
Sbjct: 339 AGLP 342
>gi|431838263|gb|ELK00195.1| Tubulointerstitial nephritis antigen [Pteropus alecto]
Length = 425
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 57/241 (23%), Positives = 92/241 (38%), Gaps = 59/241 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
CSSG W ++ KRGLV+ +P N T + + + + K
Sbjct: 233 CSSGSIDRAWWYLRKRGLVSHAC-----------YPFLKDQNTTNNACAMASRSDGRGKR 281
Query: 62 H-TRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
H T+ +N + Y+ Y V+ +I +EI+ NGPV A M ++ D F YKSG
Sbjct: 282 HATKPCPNNIEKS--NRIYQCSPPYRVSSNETEIMKEIIHNGPVQAIMQVHEDFFHYKSG 339
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
I+ + + S + + VK+ GWG G
Sbjct: 340 ----------------IYRHVTSTNEKSEKYQKLQTHAVKLTGWGTLRG----------- 372
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
+ +W + +++G +G+ G +ILRG NE+ IE L+
Sbjct: 373 ------------------AQGRKEKFWIVANSWGNSWGENGYFRILRGVNESDIEKLIIA 414
Query: 241 A 241
A
Sbjct: 415 A 415
>gi|354472325|ref|XP_003498390.1| PREDICTED: tubulointerstitial nephritis antigen [Cricetulus
griseus]
gi|344245030|gb|EGW01134.1| Tubulointerstitial nephritis antigen-like [Cricetulus griseus]
Length = 465
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 61/245 (24%), Positives = 101/245 (41%), Gaps = 66/245 (26%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTT-SEPECKTLATPQPK 60
C G W ++ +RG+V+ C P N A ++ + + + +
Sbjct: 269 CRGGRLDGAWWFLRRRGVVS-------DNCYPFVGREQNEAGTSSRCMMHSRAMGRGKRQ 321
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
+RC N G+ D Y+ Y + + +I +E+M+NGPV A M ++
Sbjct: 322 ATSRCPN---GQVDSNDIYQVTPAYRLGSDEKEIMKELMENGPVQALMEVHE-------- 370
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
D F Y+SG+Y+ + ++ GRP R +
Sbjct: 371 ---------------DFFLYQSGIYSHTPISQ----------------GRP--EQYRRHG 397
Query: 181 VSASAEIVAYATVKLIGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
+VK+ GWGEE +GR YWT +++G +G++G +I+RG NE IE
Sbjct: 398 TH---------SVKITGWGEEKLPDGRTIKYWTAANSWGPWWGERGHFRIVRGTNECDIE 448
Query: 236 SLVNG 240
S V G
Sbjct: 449 SFVLG 453
>gi|270012757|gb|EFA09205.1| cathepsin B precursor [Tribolium castaneum]
Length = 348
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 62/235 (26%), Positives = 87/235 (37%), Gaps = 82/235 (34%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G ++ W + G+V+GG ++S+ GCQP S +A + KC
Sbjct: 146 CVGGYTAKAWDYYINEGIVSGGDYNSSEGCQPYSKASFQYAVAS--------------KC 191
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C ND Y + DK+ +Y + V IQ EI+ NGPV+A +
Sbjct: 192 VKACQNDKYDVKYDDDKHYGDSFYTLETNVTQIQTEILTNGPVMATFNV----------- 240
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+ DI YKSG + + V I+ WG E G P
Sbjct: 241 ------------FEDIIYYKSG----------IQLSNVSILRWGTEEGVP---------- 268
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGT-IKILRGRNEAIIE 235
YW I +++G +GD G IKI RG NE IE
Sbjct: 269 ------------------------YWLIANSWGTWWGDLGGFIKIKRGTNECAIE 299
>gi|290989996|ref|XP_002677623.1| cathepsin B [Naegleria gruberi]
gi|284091231|gb|EFC44879.1| cathepsin B [Naegleria gruberi]
Length = 321
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 47/183 (25%), Positives = 68/183 (37%), Gaps = 61/183 (33%)
Query: 59 PKCHTRCTNDNY---GRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIF 115
P C + C N + ++ + + + VADIQQEI NGPV +Y D
Sbjct: 185 PACPSNCNGTNIPISSQLYYAKSFSHISPWMFWERVADIQQEIYTNGPVQGGFSVYQDFM 244
Query: 116 SYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
+YKSG ++S+K+G + + +KI+GWG E G YW +
Sbjct: 245 NYKSG----------------VYSHKTGSF--------LGGHAIKIIGWGVEGGVDYWLV 280
Query: 176 VRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
++ +G GT KILRG NE IE
Sbjct: 281 ANSWST----------------------------------DWGIDGTFKILRGHNECGIE 306
Query: 236 SLV 238
V
Sbjct: 307 DDV 309
>gi|344264196|ref|XP_003404179.1| PREDICTED: tubulointerstitial nephritis antigen [Loxodonta
africana]
Length = 476
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 62/252 (24%), Positives = 95/252 (37%), Gaps = 81/252 (32%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAH-------HSNTGCQPVSFPPCNHANYTTSEPECKTL 54
C+SG W ++ KRGLV+ + +N GC A + S+ K
Sbjct: 284 CNSGSVDRAWWYLRKRGLVSHACYPLFKDQNANNNGC----------AMASRSDGRGKRH 333
Query: 55 ATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDI 114
AT +P C N+ Y+ Y V+ +I +EIM+NGPV A M ++ D
Sbjct: 334 AT-KP-----CPNNIEKSNVI---YQCSPPYRVSSNETEIMKEIMQNGPVQAIMQVHEDF 384
Query: 115 FSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWT 174
F YK+G Y +
Sbjct: 385 FHYKTGIYRH-------------------------------------------------- 394
Query: 175 IVRVYAVSASAEIVAYATVKLIGWGEENG-----RPYWTIVSTFGEQFGDKGTIKILRGR 229
++R S + + VKL GWG G +W +++G+ +G+ G +ILRG
Sbjct: 395 VIRTSEESEKYQKLRTHAVKLTGWGMMKGAKGRKEKFWVAANSWGKSWGEDGYFRILRGV 454
Query: 230 NEAIIESLVNGA 241
NE+ IE L+ A
Sbjct: 455 NESDIEKLIIAA 466
>gi|296198446|ref|XP_002746707.1| PREDICTED: tubulointerstitial nephritis antigen [Callithrix
jacchus]
Length = 476
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 57/241 (23%), Positives = 93/241 (38%), Gaps = 59/241 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+SG W ++ KRGLV+ +P N T S + + + K
Sbjct: 284 CNSGSIDRAWWYLRKRGLVSHAC-----------YPLFKDQNATNSGCAMASRSDGRGKR 332
Query: 62 H-TRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
H T+ +N + Y+ Y V+ +I +EIM+NGPV A M ++ D F YK+G
Sbjct: 333 HATKPCPNNIEKS--NRIYQCSPPYRVSSSETEIMKEIMQNGPVQAIMKVHEDFFHYKTG 390
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
I+ + + S + + VK+ GWG G
Sbjct: 391 ----------------IYRHVTSTNKESEKFQKLQTHAVKLTGWGTLRG----------- 423
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
+ +W +++G+ +G+ G +ILRG NE+ IE L+
Sbjct: 424 ------------------AQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIA 465
Query: 241 A 241
A
Sbjct: 466 A 466
>gi|239799410|dbj|BAH70626.1| ACYPI000012 [Acyrthosiphon pisum]
Length = 265
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 38/111 (34%), Positives = 54/111 (48%), Gaps = 14/111 (12%)
Query: 11 WVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTNDNY 70
W ++ GLV+GG +++N GCQP PP N T E C RC +N
Sbjct: 168 WEYLKNHGLVSGGKYNTNNGCQPSKIPPI--GNLPTGSYE--------NTCEKRCYGNN- 216
Query: 71 GRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLY-SDIFSYKSG 120
+ QD + K +Y + E DIQ+E+ GPV ++ +D F YKSG
Sbjct: 217 TINYNQDHVKIKNHYDI--EYEDIQREVQNYGPVSMAFRVFDNDFFLYKSG 265
>gi|115621283|ref|XP_782184.2| PREDICTED: tubulointerstitial nephritis antigen-like
[Strongylocentrotus purpuratus]
Length = 450
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 48/169 (28%), Positives = 67/169 (39%), Gaps = 44/169 (26%)
Query: 72 RGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANM 131
RG D Y Y + DI EI +NGPV A + +D F Y G Y N
Sbjct: 316 RGVTSDLYLSTPPYRIAAREVDIMTEIYQNGPVQATFNVKNDFFVYNRGVYRN------- 368
Query: 132 YLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYA 191
K A + ++ + +VKIVGWG + R W Y
Sbjct: 369 --------VKQEFTASQSDSDQAGWHSVKIVGWGID--RSDW----------------YN 402
Query: 192 TVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
+K YW +++G +G++G +I+RG NE IES V G
Sbjct: 403 PIK-----------YWLCTNSWGRNWGEQGMFRIVRGVNECEIESFVLG 440
>gi|270012758|gb|EFA09206.1| cathepsin B precursor [Tribolium castaneum]
Length = 326
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 45/167 (26%), Positives = 68/167 (40%), Gaps = 58/167 (34%)
Query: 9 STWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTND 68
+ W + G+ +GG ++S+ GCQP S +A + EC
Sbjct: 153 NAWDYYINEGIASGGDYNSSEGCQPYSESSFQYAEAS----ECV---------------- 192
Query: 69 NYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVV 128
++Y + VA IQ EI+ NGPV+A ++ D +KSG
Sbjct: 193 --------------KFYTLETNVAQIQMEILTNGPVMAYYNVFEDFACHKSG-------- 230
Query: 129 ANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
++ YKSG + V +VK++GWG E G PYW I
Sbjct: 231 --------VYYYKSGKF--------VGRHSVKVIGWGTEEGIPYWLI 261
>gi|290990464|ref|XP_002677856.1| predicted protein [Naegleria gruberi]
gi|284091466|gb|EFC45112.1| predicted protein [Naegleria gruberi]
Length = 231
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 56/227 (24%), Positives = 85/227 (37%), Gaps = 75/227 (33%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ GI + ++HK GLV+ FP ++ T KC
Sbjct: 68 CNGGIPGLVFDYIHKDGLVSDAC-----------FPYLSYDGNT------------HVKC 104
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C N N + F DK+ + Y V + + D + +++ + +I ++
Sbjct: 105 PDFCYN-NKTKSFKSDKHFADKVYHVGEFLEDKAKRVLE---------IQKEILTH---- 150
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
GPV A+ +YSD YKSGVY
Sbjct: 151 ---GPVNADFMVYSDFTVYKSGVYR----------------------------------- 172
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 228
+ VK+IGWG ENG YW I +++G FG +G KI+RG
Sbjct: 173 HQTGSFEGIHAVKIIGWGTENGVDYWLIANSWGTTFGLQGFFKIVRG 219
>gi|123469339|ref|XP_001317882.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
[Trichomonas vaginalis G3]
gi|121900627|gb|EAY05659.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
[Trichomonas vaginalis G3]
Length = 241
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 25/52 (48%), Positives = 35/52 (67%)
Query: 192 TVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
V+LIGWG+ENG YW +++ G+ +G GT+ I G NE +IES + GA P
Sbjct: 187 AVELIGWGKENGVEYWILLNQHGKNWGINGTMHIKMGSNEGLIESFIYGATP 238
>gi|15723274|gb|AAL06325.1| cathepsin B-like protease [Trypanosoma cruzi]
gi|15723278|gb|AAL06327.1| cathepsin B-like protease [Trypanosoma cruzi]
Length = 208
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 51/175 (29%), Positives = 76/175 (43%), Gaps = 39/175 (22%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPC-NHANYTTSEPECKTLATPQPK 60
C+ G W + G+V+ CQP FP C +H N + P TP
Sbjct: 65 CNGGYPEVAWEYYAVHGIVS-------EYCQPYPFPSCAHHVNSSDLSPCSGEYDTPT-- 115
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C++ CT+ + KYR Y ++ E ++E++ NGP + +Y+D +Y G
Sbjct: 116 CNSTCTD----KKIPLIKYRGNTSYVLSGE-EPFKRELILNGPFEVSFSVYADFVAYTGG 170
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
Y + VA ++L G +A V+IVGWGE NG PYW I
Sbjct: 171 VYKH---VAGIFL---------GGHA------------VRIVGWGELNGEPYWKI 201
>gi|357116879|ref|XP_003560204.1| PREDICTED: cathepsin B-like [Brachypodium distachyon]
Length = 351
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 65/262 (24%), Positives = 91/262 (34%), Gaps = 90/262 (34%)
Query: 2 CSSGISSSTWVWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
C+ G S W + ++G+VT + GC+ P C EP +T P
Sbjct: 165 CNGGYPISAWRYFRRKGVVTDECDPYFDQVGCK---HPGC--------EPAYRT-----P 208
Query: 60 KCHTRCTNDNY----GRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIF 115
KC +C N + F D YR V+ DI E+
Sbjct: 209 KCEKKCKVQNEVWKEQKHFSVDAYR------VHSNPHDIMAEV----------------- 245
Query: 116 SYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
Y NGPV +Y D YKSGVY + ++ VK++GWG +
Sbjct: 246 ------YTNGPVEVAFTVYEDFAHYKSGVYK-HITGGVMGGHAVKLIGWGTSDA------ 292
Query: 176 VRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
G YW + + + +GD G KI+RG+NE IE
Sbjct: 293 ---------------------------GEDYWLLANQWNRGWGDDGYFKIIRGKNECGIE 325
Query: 236 SLVNGALPKD-----NYGVEFG 252
V +P NY FG
Sbjct: 326 EDVVAGMPSTKNMARNYDDAFG 347
>gi|16758354|ref|NP_446034.1| tubulointerstitial nephritis antigen-like precursor [Rattus
norvegicus]
gi|61213054|sp|Q9EQT5.1|TINAL_RAT RecName: Full=Tubulointerstitial nephritis antigen-like; AltName:
Full=Glucocorticoid-inducible protein 5; Flags:
Precursor
gi|11527795|dbj|BAB18637.1| glucocorticoid-inducible protein [Rattus norvegicus]
Length = 467
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 67/246 (27%), Positives = 99/246 (40%), Gaps = 67/246 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W ++ +RG+V+ Y S E A+P P+C
Sbjct: 269 CRGGRLDGAWWFLRRRGVVSDNC-------------------YPFSGREQNDEASPTPRC 309
Query: 62 --HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
H+R GRG Q R N +V +I + PV L SD
Sbjct: 310 MMHSR----AMGRGKRQATSRCP-----NSQVD--SNDIYQVTPVYR---LASDEKEIMK 355
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
NGPV A M ++ D F Y+ G+Y+ + ++ GRP R +
Sbjct: 356 ELMENGPVQALMEVHEDFFLYQRGIYSHTPVSQ----------------GRP--EQYRRH 397
Query: 180 AVSASAEIVAYATVKLIGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAII 234
+VK+ GWGEE +GR YWT +++G +G++G +I+RG NE I
Sbjct: 398 GTH---------SVKITGWGEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGINECDI 448
Query: 235 ESLVNG 240
E+ V G
Sbjct: 449 ETFVLG 454
>gi|351704465|gb|EHB07384.1| Tubulointerstitial nephritis antigen [Heterocephalus glaber]
Length = 475
Score = 57.4 bits (137), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 61/246 (24%), Positives = 92/246 (37%), Gaps = 70/246 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAH------HSNTGCQPVSFPPCNHANYTTSEPECKTLA 55
CS G W ++ KRGLV+ + +S GC A + S+ K A
Sbjct: 284 CSGGSIDRAWWYLRKRGLVSHACYPLFKDQNSTNGC----------AMASRSDGRGKRHA 333
Query: 56 TPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIF 115
T T C N+ Y+ Y V+ I +EIMKNGPV A M ++ D F
Sbjct: 334 T------TPCPNNIEKSNRI---YQCSPPYRVSSNETQIMKEIMKNGPVQAIMQVHEDFF 384
Query: 116 SYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
YK+G I+ + + S + + VK+ GWG G
Sbjct: 385 YYKTG----------------IYRHVTSTIEDSEKYQKLRTHAVKLTGWGTLRG------ 422
Query: 176 VRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
+ +W +++G+ +G+ G +ILRG NE+ IE
Sbjct: 423 -----------------------AKGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIE 459
Query: 236 SLVNGA 241
L+ A
Sbjct: 460 KLIIAA 465
>gi|410959397|ref|XP_003986297.1| PREDICTED: tubulointerstitial nephritis antigen [Felis catus]
Length = 474
Score = 57.4 bits (137), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 60/242 (24%), Positives = 91/242 (37%), Gaps = 60/242 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPV--SFPPCNHANYTTSEPECKTLATPQP 59
C+SG W ++ KRGLV+ C P+ + NH S + +
Sbjct: 281 CNSGSIDRAWWFLRKRGLVSHA-------CYPLFKNQNATNHGCAMASRSDGRGKRHATK 333
Query: 60 KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
C N Y+ Y V+ +I +EIM+NGPV A M ++ D F YK+
Sbjct: 334 PCPNNIEKSN-------RIYQCSPPYRVSSNETEIMKEIMQNGPVQAIMQVHEDFFHYKT 386
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
G Y + AN +SG Y + VK+ GWG G
Sbjct: 387 GIYRHITKKANE---------ESGKY------RKLQTHAVKLTGWGTLKG---------- 421
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
+ +W +++G+ +G+ G +ILRG NE+ IE L+
Sbjct: 422 -------------------AQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLII 462
Query: 240 GA 241
A
Sbjct: 463 AA 464
>gi|215687149|dbj|BAG90919.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 403
Score = 57.4 bits (137), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 62/258 (24%), Positives = 88/258 (34%), Gaps = 82/258 (31%)
Query: 2 CSSGISSSTWVWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
C G W + + G+VT + GC+ P C+ A P P
Sbjct: 215 CDGGYPIMAWRYFVRNGVVTDECDPYFDQVGCK---------------HPGCEP-AYPTP 258
Query: 60 KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
C +C N + + + K+ Y VN + DI E+ +NGPV +Y D YKS
Sbjct: 259 VCEKKCKVQN--QVWLEKKHFSVNAYRVNSDPHDIMAEVYQNGPVEVAFTVYEDFAHYKS 316
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
G +Y I G +A VK++GWG +
Sbjct: 317 G------------VYKHITGGMMGGHA------------VKLIGWGTTDA---------- 342
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
G YW + + + +GD G KI+RG NE IE V
Sbjct: 343 -----------------------GEDYWLLANQWNRGWGDDGYFKIIRGTNECGIEEDVV 379
Query: 240 GALPKD-----NYGVEFG 252
+P NY FG
Sbjct: 380 AGMPSTKNMVRNYDSAFG 397
>gi|123483120|ref|XP_001323959.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
[Trichomonas vaginalis G3]
gi|121906833|gb|EAY11736.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
[Trichomonas vaginalis G3]
Length = 255
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 24/52 (46%), Positives = 37/52 (71%)
Query: 192 TVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
TV++IGWG+E G PYW I++ +G +G+ G ++I GR++A +ES V A P
Sbjct: 200 TVEIIGWGQEKGIPYWIILNQYGRLWGENGMMRIRMGRDDARVESYVLAAEP 251
>gi|350596935|ref|XP_001927698.4| PREDICTED: tubulointerstitial nephritis antigen, partial [Sus
scrofa]
Length = 368
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 61/246 (24%), Positives = 96/246 (39%), Gaps = 69/246 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+SG W ++ KRGLV+ +P N T + + + + K
Sbjct: 176 CNSGSIDRAWWYLRKRGLVSHAC-----------YPLFKDQNATNNGCAMASRSDGRGKR 224
Query: 62 H-TRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
H T+ +N+ + Y+ Y V+ +I +EIM+NGPV A M ++ D F YK+G
Sbjct: 225 HATKPCPNNFEKS--NRIYQCSPPYRVSSNETEIMREIMQNGPVQAIMQVHEDFFHYKTG 282
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
Y + S + E Y +R +A
Sbjct: 283 IY---------------------RHVTSTNEESDKYRK-----------------LRTHA 304
Query: 181 VSASAEIVAYATVKLIGWGEENG-----RPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
VKL GWG G +W +++G+ +G+ G +ILRG NE+ IE
Sbjct: 305 ------------VKLTGWGTLKGAQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIE 352
Query: 236 SLVNGA 241
L+ A
Sbjct: 353 KLIIAA 358
>gi|348553066|ref|XP_003462348.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cavia
porcellus]
Length = 475
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 59/246 (23%), Positives = 89/246 (36%), Gaps = 70/246 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAH------HSNTGCQPVSFPPCNHANYTTSEPECKTLA 55
C G W ++ KRGLV+ + ++ GC S + T
Sbjct: 284 CGGGSVDRAWWYLRKRGLVSHACYPLFKDQNATNGCAMASRSDGRGKRHAT--------- 334
Query: 56 TPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIF 115
TP P H +N Y+ Y V+ I +EIM+NGPV A M ++ D F
Sbjct: 335 TPCPN-HIEKSNR---------IYQCSPPYRVSSNETQIMKEIMQNGPVQAIMKVHEDFF 384
Query: 116 SYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
SYK+G I+ + + S + + VK+ GWG G
Sbjct: 385 SYKTG----------------IYRHVTSTSEDSEKYQKLRTHAVKLTGWGTLKG------ 422
Query: 176 VRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
+W +++G+ +G+ G KILRG NE+ IE
Sbjct: 423 -----------------------ARGKKEKFWIAANSWGKSWGENGYFKILRGVNESDIE 459
Query: 236 SLVNGA 241
L+ A
Sbjct: 460 KLIIAA 465
>gi|59895951|gb|AAX11351.1| cathepsin B-like cysteine protease [Oryza sativa Japonica Group]
gi|125551767|gb|EAY97476.1| hypothetical protein OsI_19406 [Oryza sativa Indica Group]
gi|215694023|dbj|BAG89222.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215712372|dbj|BAG94499.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765382|dbj|BAG87079.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222631058|gb|EEE63190.1| hypothetical protein OsJ_17999 [Oryza sativa Japonica Group]
Length = 358
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 62/258 (24%), Positives = 88/258 (34%), Gaps = 82/258 (31%)
Query: 2 CSSGISSSTWVWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
C G W + + G+VT + GC+ P C+ A P P
Sbjct: 170 CDGGYPIMAWRYFVRNGVVTDECDPYFDQVGCK---------------HPGCEP-AYPTP 213
Query: 60 KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
C +C N + + + K+ Y VN + DI E+ +NGPV +Y D YKS
Sbjct: 214 VCEKKCKVQN--QVWLEKKHFSVNAYRVNSDPHDIMAEVYQNGPVEVAFTVYEDFAHYKS 271
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
G +Y I G +A VK++GWG +
Sbjct: 272 G------------VYKHITGGMMGGHA------------VKLIGWGTTDA---------- 297
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
G YW + + + +GD G KI+RG NE IE V
Sbjct: 298 -----------------------GEDYWLLANQWNRGWGDDGYFKIIRGTNECGIEEDVV 334
Query: 240 GALPKD-----NYGVEFG 252
+P NY FG
Sbjct: 335 AGMPSTKNMVRNYDSAFG 352
>gi|66810163|ref|XP_638805.1| peptidase C1A family protein [Dictyostelium discoideum AX4]
gi|74897075|sp|Q54QD9.1|CTSB_DICDI RecName: Full=Cathepsin B; AltName: Full=Cathepsin B1; Flags:
Precursor
gi|60467425|gb|EAL65448.1| peptidase C1A family protein [Dictyostelium discoideum AX4]
Length = 311
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 48/172 (27%), Positives = 72/172 (41%), Gaps = 39/172 (22%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + S W W+ K+G V+ C P + P C A +P + TP C
Sbjct: 145 CEGGDAFSAWNWLRKQGAVS-------EECLPYTIPTCPPA----QQPCLNFVNTP--SC 191
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
C + N + QDK++ + Y + + A I QEI+ NGPV A ++ D +YKSG
Sbjct: 192 TKECQS-NSSLIYSQDKHKMAKIYSFDSDEA-IMQEIVTNGPVEACFTVFEDFLAYKSGV 249
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYW 173
Y V + + + VK+VG+G NG Y+
Sbjct: 250 Y------------------------VHTTGKDLGGHCVKLVGFGTLNGVDYY 277
>gi|189308076|gb|ACD86922.1| cysteine protease [Caenorhabditis brenneri]
Length = 228
Score = 57.4 bits (137), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 27/78 (34%), Positives = 38/78 (48%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W ++ K G TGG++ + GC+P S PC T+ P C T P C
Sbjct: 150 CEGGYPINAWKYLVKSGFCTGGSYEAQFGCKPYSLAPCGETVGNTTWPACPTDGYDTPAC 209
Query: 62 HTRCTNDNYGRGFFQDKY 79
+CTN NY + DK+
Sbjct: 210 VNKCTNSNYNVAYKDDKH 227
>gi|146386348|gb|ABQ23962.1| cathepsin B [Oryctolagus cuniculus]
Length = 228
Score = 57.4 bits (137), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 37/97 (38%), Positives = 52/97 (53%), Gaps = 3/97 (3%)
Query: 8 SSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTRCTN 67
S W + K+GLV+GG + S+ GC+P S PPC H + S P C T P+C C
Sbjct: 135 SGAWNFWTKKGLVSGGLYDSHVGCKPYSIPPCEH-HVNGSRPAC-TGEGDTPRCSKTC-E 191
Query: 68 DNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPV 104
Y + +DK+ Y V+ + +I+ EI KNGPV
Sbjct: 192 PGYSPSYKEDKHYGYSSYSVSSDENEIKAEIYKNGPV 228
>gi|16768502|gb|AAL28470.1| GM06507p [Drosophila melanogaster]
Length = 430
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 60/242 (24%), Positives = 84/242 (34%), Gaps = 75/242 (30%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W ++HK+G+V + C P YT CK + K
Sbjct: 253 CEGGHLDAAWRYLHKKGVV-------DENCYP----------YTQHRDTCKIRHSRSLKA 295
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ N R Y Y +N E ADI EI +GPV A M + D F+Y G
Sbjct: 296 NGCQKPVNVDRDSL---YTVGPAYSLNRE-ADIMAEIFHSGPVQATMRVNRDFFAYSGGV 351
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y + + +VK+VGWGEE+
Sbjct: 352 YRE---------------------TAANRKAPTGFHSVKLVGWGEEH------------- 377
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
NG YW +++G +G+ G +ILRG NE IE V +
Sbjct: 378 --------------------NGEKYWIAANSWGSWWGEHGYFRILRGSNECGIEEYVLAS 417
Query: 242 LP 243
P
Sbjct: 418 WP 419
>gi|340503546|gb|EGR30116.1| hypothetical protein IMG5_141560 [Ichthyophthirius multifiliis]
Length = 599
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 59/124 (47%), Gaps = 24/124 (19%)
Query: 122 YGNGPVVANMYLYSDIFSYKSGVY-AVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
+ NGP+V + D Y+ G+Y +V A+ I+ G+E+ P W V
Sbjct: 484 HKNGPIVVSFEPAMDFMYYQEGIYHSVDANDWIL----------GDEDKLPQWEKVD--- 530
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
+V +GWGE YW + +++GE +G+KG KI RG +E+ IES+
Sbjct: 531 ----------HSVLCVGWGENEDGKYWLVQNSWGEDWGEKGYFKIRRGTDESNIESMGER 580
Query: 241 ALPK 244
A K
Sbjct: 581 AFIK 584
>gi|301775398|ref|XP_002923119.1| PREDICTED: LOW QUALITY PROTEIN: tubulointerstitial nephritis
antigen-like [Ailuropoda melanoleuca]
Length = 472
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 63/250 (25%), Positives = 95/250 (38%), Gaps = 77/250 (30%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANY-----TTSEPECKTLAT 56
C+SG W ++ KRGLV+ C P+ F N NY + S+ K AT
Sbjct: 280 CNSGSIDRAWWFLRKRGLVS-------HACYPL-FKDQNATNYGCAMASRSDGRGKRHAT 331
Query: 57 PQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFS 116
+P C N+ Y+ Y V+ +I +EIM+NGPV A M ++ D F
Sbjct: 332 -KP-----CPNNIEKSNRI---YQCSPPYRVSSNETEIMKEIMQNGPVQAIMQVHEDFFH 382
Query: 117 YKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
YK+G Y + +
Sbjct: 383 YKTGIYRH--------------------------------------------------VT 392
Query: 177 RVYAVSASAEIVAYATVKLIGWGEENG-----RPYWTIVSTFGEQFGDKGTIKILRGRNE 231
R S+ + +KL GWG G +W +++G+ +G+ G +ILRG NE
Sbjct: 393 RTNEESSKYRKLQTHAIKLTGWGTLKGARGQKEKFWIAANSWGKSWGENGYFRILRGVNE 452
Query: 232 AIIESLVNGA 241
+ IE L+ A
Sbjct: 453 SDIEKLIIAA 462
>gi|323448265|gb|EGB04166.1| hypothetical protein AURANDRAFT_32974 [Aureococcus anophagefferens]
Length = 298
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 63/253 (24%), Positives = 86/253 (33%), Gaps = 72/253 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTG------CQPVSFPPCNHANYTTSEP------ 49
C G + W +V K G VTGG ++ TG C P C+H +P
Sbjct: 92 CDGGQIITPWTYVAKAGAVTGG-QYNGTGPFGAGLCADWFAPHCHHHGPRGDDPYPAEGD 150
Query: 50 -ECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANM 108
C + +P+ T F DK+ F A I I + GPV
Sbjct: 151 AGCPSEKSPEGPKACDATAAAGHDAFAADKHTFAGDVQTASGEAAIMAMIAEGGPVETAF 210
Query: 109 YLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEEN 168
+Y D +Y G +Y + ++G +A VK VGWG EN
Sbjct: 211 TVYEDFENYAGG------------IYHHVTGEEAGGHA------------VKFVGWGVEN 246
Query: 169 GRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 228
G YW + + PYW G+ G +ILRG
Sbjct: 247 GTKYWKVANSW------------------------NPYW----------GEAGYFRILRG 272
Query: 229 RNEAIIESLVNGA 241
NE IE V G+
Sbjct: 273 SNEGGIEDQVTGS 285
>gi|78042562|ref|NP_001030279.1| tubulointerstitial nephritis antigen [Bos taurus]
gi|108861910|sp|Q3SZI1.1|TINAG_BOVIN RecName: Full=Tubulointerstitial nephritis antigen; Short=TIN-Ag
gi|74354008|gb|AAI02844.1| Tubulointerstitial nephritis antigen [Bos taurus]
gi|296474572|tpg|DAA16687.1| TPA: tubulointerstitial nephritis antigen [Bos taurus]
Length = 476
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 60/247 (24%), Positives = 92/247 (37%), Gaps = 71/247 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAH-------HSNTGCQPVSFPPCNHANYTTSEPECKTL 54
C+SG W ++ KRGLV+ + +N GC A + S+ K
Sbjct: 284 CNSGSVDRAWWYLRKRGLVSHACYPLFKDQNATNNGC----------AMASRSDGRGKRH 333
Query: 55 ATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDI 114
AT T C N Y+ Y V+ +I +EIM+NGPV A M ++ D
Sbjct: 334 AT------TPCPNSIEKSNRI---YQCSPPYRVSSNETEIMREIMQNGPVQAIMQVHEDF 384
Query: 115 FSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWT 174
F+YK+G I+ + + S VK+ GWG G
Sbjct: 385 FNYKTG----------------IYRHITSTNEDSEKYRKFRTHAVKLTGWGTLRG----- 423
Query: 175 IVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 234
+ +W +++G+ +G+ G +ILRG NE+ I
Sbjct: 424 ------------------------AQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDI 459
Query: 235 ESLVNGA 241
E L+ A
Sbjct: 460 EKLIIAA 466
>gi|260782761|ref|XP_002586451.1| hypothetical protein BRAFLDRAFT_247264 [Branchiostoma floridae]
gi|229271561|gb|EEN42462.1| hypothetical protein BRAFLDRAFT_247264 [Branchiostoma floridae]
Length = 272
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 53/180 (29%), Positives = 75/180 (41%), Gaps = 63/180 (35%)
Query: 59 PKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
P+C ++CT G G K+ Y V+ E I+ EIM NGPV A +YSDI
Sbjct: 146 PECMSKCT----GEGHAYQKFYGLYLYTVSGE-NQIKVEIMTNGPVEAAFTVYSDIV--- 197
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
YKSGVY ++ ++ +A VK++GWG E+
Sbjct: 198 --------------------HYKSGVYHHTSGGKLGGHA-VKVLGWGVED---------- 226
Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
EE YW + +++G +GD+G KI RG +E IES V
Sbjct: 227 ---------------------EEE---YWLVANSWGPDWGDQGFFKIKRGSDECGIESRV 262
>gi|344287520|ref|XP_003415501.1| PREDICTED: tubulointerstitial nephritis antigen isoform 2
[Loxodonta africana]
Length = 437
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 61/250 (24%), Positives = 97/250 (38%), Gaps = 76/250 (30%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAH----HSNTGCQPVSFPPCNHANYTTSEPECKTLATP 57
C G W ++ +RG+V+ + H PV PPC ++ + K AT
Sbjct: 240 CRGGRLDGAWWFLRRRGVVSDHCYPFSGHERDKAGPV--PPC--MMHSRAMGRGKRQAT- 294
Query: 58 QPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
+RC N + D Y+ Y + +I +E+M+NGPV A M ++
Sbjct: 295 -----SRCPNSHV---HGNDIYQVTPAYRLGTNEKEIMKELMENGPVQALMEVHE----- 341
Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAE-------IVAYATVKIVGWGEENGR 170
D F Y+ G+Y+ + ++ +VKI GWGEE
Sbjct: 342 ------------------DFFLYQGGIYSHTPVSQERPEQYRRHGTHSVKITGWGEET-- 381
Query: 171 PYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 230
+ T+K YWT +++G +G++G +I+RG N
Sbjct: 382 ----------------LPDGRTLK-----------YWTAANSWGPAWGERGHFRIVRGAN 414
Query: 231 EAIIESLVNG 240
E IES V G
Sbjct: 415 ECDIESFVLG 424
>gi|328872536|gb|EGG20903.1| hypothetical protein DFA_00770 [Dictyostelium fasciculatum]
Length = 313
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/67 (43%), Positives = 40/67 (59%)
Query: 107 NMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGE 166
+ Y S I K+ Y NGP++A LY+DI++YKSGVY S SA +++GWG
Sbjct: 154 DCYRLSSIEQAKADIYLNGPIIAVFDLYTDIYNYKSGVYIKSDSATYKETHAGRVIGWGV 213
Query: 167 ENGRPYW 173
E+G YW
Sbjct: 214 EDGVQYW 220
>gi|66911417|gb|AAH97299.1| Tubulointerstitial nephritis antigen-like 1 [Rattus norvegicus]
gi|149024087|gb|EDL80584.1| lipocalin 7, isoform CRA_a [Rattus norvegicus]
gi|149024088|gb|EDL80585.1| lipocalin 7, isoform CRA_a [Rattus norvegicus]
gi|149024089|gb|EDL80586.1| lipocalin 7, isoform CRA_a [Rattus norvegicus]
Length = 467
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 55/193 (28%), Positives = 85/193 (44%), Gaps = 48/193 (24%)
Query: 55 ATPQPKC--HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYS 112
A+P P+C H+R GRG Q R + ++++ + PV L S
Sbjct: 303 ASPTPRCMMHSR----AMGRGKRQATSRCPNSHVDSNDIYQVT-------PVYR---LAS 348
Query: 113 DIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPY 172
D NGPV A M ++ D F Y+ G+Y+ + ++ GRP
Sbjct: 349 DEKEIMKELMENGPVQALMEVHEDFFLYQRGIYSHTPVSQ----------------GRP- 391
Query: 173 WTIVRVYAVSASAEIVAYATVKLIGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILR 227
R + +VK+ GWGEE +GR YWT +++G +G++G +I+R
Sbjct: 392 -EQYRRHGTH---------SVKITGWGEETLPDGRTIKYWTAANSWGPWWGERGHFRIVR 441
Query: 228 GRNEAIIESLVNG 240
G NE IE+ V G
Sbjct: 442 GTNECDIETFVLG 454
>gi|338718488|ref|XP_001918155.2| PREDICTED: LOW QUALITY PROTEIN: tubulointerstitial nephritis
antigen-like [Equus caballus]
Length = 480
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 54/241 (22%), Positives = 92/241 (38%), Gaps = 59/241 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+SG W ++ KRGLV+ +P N T ++ + + + K
Sbjct: 288 CNSGSIDRAWWYLRKRGLVSHAC-----------YPLFKDQNATNNDCAMASRSDGRGKR 336
Query: 62 H-TRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
H T+ +N + Y+ Y V+ +I +EIM+NGPV A M ++ D F YK G
Sbjct: 337 HATKPCPNNIEKS--NRIYQCSPPYRVSSNETEIMKEIMQNGPVQAIMQVHDDFFHYKKG 394
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
I+ + + + + +K+ GWG G
Sbjct: 395 ----------------IYRHVTSTHEEPEKYRKLRTHAIKLAGWGTLRG----------- 427
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
+ +W +++G+ +G+ G +ILRG NE+ IE L+
Sbjct: 428 ------------------AQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIA 469
Query: 241 A 241
A
Sbjct: 470 A 470
>gi|297723949|ref|NP_001174338.1| Os05g0310500 [Oryza sativa Japonica Group]
gi|255676228|dbj|BAH93066.1| Os05g0310500, partial [Oryza sativa Japonica Group]
Length = 234
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 64/258 (24%), Positives = 89/258 (34%), Gaps = 82/258 (31%)
Query: 2 CSSGISSSTWVWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
C G W + + G+VT + GC+ P C EP A P P
Sbjct: 46 CDGGYPIMAWRYFVRNGVVTDECDPYFDQVGCK---HPGC--------EP-----AYPTP 89
Query: 60 KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
C +C N + + + K+ Y VN + DI E+ +NGPV +Y D YKS
Sbjct: 90 VCEKKCKVQN--QVWLEKKHFSVNAYRVNSDPHDIMAEVYQNGPVEVAFTVYEDFAHYKS 147
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
G +Y I G +A VK++GWG +
Sbjct: 148 G------------VYKHITGGMMGGHA------------VKLIGWGTTDA---------- 173
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
G YW + + + +GD G KI+RG NE IE V
Sbjct: 174 -----------------------GEDYWLLANQWNRGWGDDGYFKIIRGTNECGIEEDVV 210
Query: 240 GALPKD-----NYGVEFG 252
+P NY FG
Sbjct: 211 AGMPSTKNMVRNYDSAFG 228
>gi|67613207|ref|XP_667285.1| preprocathepsin c precursor [Cryptosporidium hominis TU502]
gi|54658406|gb|EAL37056.1| preprocathepsin c precursor [Cryptosporidium hominis]
Length = 635
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 43/158 (27%), Positives = 70/158 (44%), Gaps = 39/158 (24%)
Query: 110 LYSDIFSYKSGKYG-------------NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAY 156
+Y++ + Y G YG NGP+ M++ + + Y++GVY S + Y
Sbjct: 461 MYAEEYGYVGGCYGCCDEDRMKEEIFKNGPIAVAMHIDTSLLVYENGVYD-SIPNDHTKY 519
Query: 157 ATV---KIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTF 213
+ ++ GW N + ++GWGEENG PYW I +++
Sbjct: 520 CDLPNKQLNGWEYTN----------------------HAIAIVGWGEENGIPYWIIRNSW 557
Query: 214 GEQFGDKGTIKILRGRNEAIIESLVNGALPKDNYGVEF 251
G +G+KG KI RG+N IE+ P + G+ F
Sbjct: 558 GANWGNKGYAKIRRGKNIGGIENQAVFIDPDFSRGMGF 595
>gi|86279341|gb|ABC88766.1| putative cathepsin B-like like proteinase [Tenebrio molitor]
Length = 301
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 37/118 (31%), Positives = 54/118 (45%), Gaps = 4/118 (3%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G W + G+VTGG + + GC+ S PC+H P C + P C
Sbjct: 154 CNGGWPDLAWSYWSSTGIVTGGLYGVDEGCKAYSIKPCDHHVDGNLGP-CGDIQR-TPAC 211
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
C D+ ++ R Y + + IQ EIM NGPV A+ +YSD +YK+
Sbjct: 212 KKSC--DSTSDLEYKSDLRRGSAYSIPKSESQIQTEIMTNGPVEADYDVYSDFLTYKA 267
>gi|47125398|gb|AAH70278.1| Tubulointerstitial nephritis antigen [Homo sapiens]
gi|190690249|gb|ACE86899.1| tubulointerstitial nephritis antigen protein [synthetic construct]
gi|190691623|gb|ACE87586.1| tubulointerstitial nephritis antigen protein [synthetic construct]
gi|312150986|gb|ADQ32005.1| tubulointerstitial nephritis antigen [synthetic construct]
Length = 476
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 60/247 (24%), Positives = 94/247 (38%), Gaps = 71/247 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAH-------HSNTGCQPVSFPPCNHANYTTSEPECKTL 54
C+SG W ++ KRGLV+ + +N GC A + S+ K
Sbjct: 284 CNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNGC----------AMASRSDGRGKRH 333
Query: 55 ATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDI 114
AT +P C N+ Y+ Y V+ +I +EIM+NGPV A M ++ D
Sbjct: 334 AT-KP-----CPNNVEKSNRI---YQCSPPYRVSSNETEIMKEIMQNGPVQAIMQVHEDF 384
Query: 115 FSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWT 174
F YK+G I+ + + S + VK+ GWG G
Sbjct: 385 FHYKTG----------------IYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRG----- 423
Query: 175 IVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 234
+ +W +++G+ +G+ G +ILRG NE+ I
Sbjct: 424 ------------------------AQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDI 459
Query: 235 ESLVNGA 241
E L+ A
Sbjct: 460 EKLIIAA 466
>gi|6009533|dbj|BAA84949.1| tubulointerstitial nephritis antigen [Homo sapiens]
Length = 476
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 60/247 (24%), Positives = 94/247 (38%), Gaps = 71/247 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAH-------HSNTGCQPVSFPPCNHANYTTSEPECKTL 54
C+SG W ++ KRGLV+ + +N GC A + S+ K
Sbjct: 284 CNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNGC----------AMASRSDGRGKRH 333
Query: 55 ATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDI 114
AT +P C N+ Y+ Y V+ +I +EIM+NGPV A M ++ D
Sbjct: 334 AT-KP-----CPNNVEKSNRI---YQCSPPYRVSSNETEIMKEIMQNGPVQAIMQVHEDF 384
Query: 115 FSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWT 174
F YK+G I+ + + S + VK+ GWG G
Sbjct: 385 FHYKTG----------------IYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRG----- 423
Query: 175 IVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 234
+ +W +++G+ +G+ G +ILRG NE+ I
Sbjct: 424 ------------------------AQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDI 459
Query: 235 ESLVNGA 241
E L+ A
Sbjct: 460 EKLIIAA 466
>gi|344287518|ref|XP_003415500.1| PREDICTED: tubulointerstitial nephritis antigen isoform 1
[Loxodonta africana]
Length = 468
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 61/250 (24%), Positives = 97/250 (38%), Gaps = 76/250 (30%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAH----HSNTGCQPVSFPPCNHANYTTSEPECKTLATP 57
C G W ++ +RG+V+ + H PV PPC ++ + K AT
Sbjct: 271 CRGGRLDGAWWFLRRRGVVSDHCYPFSGHERDKAGPV--PPC--MMHSRAMGRGKRQAT- 325
Query: 58 QPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
+RC N + D Y+ Y + +I +E+M+NGPV A M ++
Sbjct: 326 -----SRCPNSHV---HGNDIYQVTPAYRLGTNEKEIMKELMENGPVQALMEVHE----- 372
Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAE-------IVAYATVKIVGWGEENGR 170
D F Y+ G+Y+ + ++ +VKI GWGEE
Sbjct: 373 ------------------DFFLYQGGIYSHTPVSQERPEQYRRHGTHSVKITGWGEET-- 412
Query: 171 PYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 230
+ T+K YWT +++G +G++G +I+RG N
Sbjct: 413 ----------------LPDGRTLK-----------YWTAANSWGPAWGERGHFRIVRGAN 445
Query: 231 EAIIESLVNG 240
E IES V G
Sbjct: 446 ECDIESFVLG 455
>gi|297291062|ref|XP_002803846.1| PREDICTED: tubulointerstitial nephritis antigen-like [Macaca
mulatta]
Length = 463
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 60/246 (24%), Positives = 94/246 (38%), Gaps = 70/246 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAH------HSNTGCQPVSFPPCNHANYTTSEPECKTLA 55
C+SG W ++ KRGLV+ + ++N GC A + S+ K A
Sbjct: 272 CNSGSIDRAWWYLRKRGLVSHACYPLFKDQNANNGC----------AMASRSDGRGKRHA 321
Query: 56 TPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIF 115
T +P C N+ Y+ Y V+ +I +EIM+NGPV A M + D F
Sbjct: 322 T-KP-----CPNNIEKSNRI---YQCSPPYRVSSSETEIMKEIMQNGPVQAIMQVREDFF 372
Query: 116 SYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
YK+G I+ + + S + VK+ GWG G
Sbjct: 373 HYKTG----------------IYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRG------ 410
Query: 176 VRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
+ +W +++G+ +G+ G +ILRG NE+ IE
Sbjct: 411 -----------------------AQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIE 447
Query: 236 SLVNGA 241
L+ A
Sbjct: 448 KLIIAA 453
>gi|291236586|ref|XP_002738220.1| PREDICTED: cathepsin B preproprotein-like [Saccoglossus
kowalevskii]
Length = 93
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 37/120 (30%), Positives = 52/120 (43%), Gaps = 35/120 (29%)
Query: 125 GPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSAS 184
GPV +Y+D SYKSGVY + E + +KI+GWG E+G
Sbjct: 8 GPVEGAFTVYADFPSYKSGVYQ-HETGEALGGHAIKILGWGNEDG--------------- 51
Query: 185 AEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
YW + +++ E +GD+G KILRG +E IES + PK
Sbjct: 52 -------------------HDYWLVANSWNEDWGDQGFFKILRGVDECGIESQITAGSPK 92
>gi|73973401|ref|XP_538969.2| PREDICTED: tubulointerstitial nephritis antigen [Canis lupus
familiaris]
Length = 476
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 65/250 (26%), Positives = 94/250 (37%), Gaps = 77/250 (30%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANY-----TTSEPECKTLAT 56
C+SG W ++ KRGLV+ C P+ F N NY + S+ K AT
Sbjct: 284 CNSGSIDRAWWFLRKRGLVSHA-------CYPL-FKDQNATNYGCAMASRSDGRGKRHAT 335
Query: 57 PQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFS 116
+P C N+ Y+ Y V+ +I +EIM+NGPV A M ++ D F
Sbjct: 336 -KP-----CPNNIEKSNRI---YQCSPPYRVSSNETEIMKEIMQNGPVQAIMQVHEDFFH 386
Query: 117 YKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
YK+G Y + I
Sbjct: 387 YKTGIYRH--------------------------------------------------IT 396
Query: 177 RVYAVSASAEIVAYATVKLIGWGEENG-----RPYWTIVSTFGEQFGDKGTIKILRGRNE 231
R S + + VKL GWG G +W +++G +G+ G +ILRG NE
Sbjct: 397 RTNEESRKYQKLQTHAVKLTGWGTLKGAQGQKEKFWIAANSWGISWGENGYFRILRGVNE 456
Query: 232 AIIESLVNGA 241
+ IE L+ A
Sbjct: 457 SDIEKLIIAA 466
>gi|355561807|gb|EHH18439.1| hypothetical protein EGK_15031 [Macaca mulatta]
Length = 475
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 60/246 (24%), Positives = 94/246 (38%), Gaps = 70/246 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAH------HSNTGCQPVSFPPCNHANYTTSEPECKTLA 55
C+SG W ++ KRGLV+ + ++N GC A + S+ K A
Sbjct: 284 CNSGSIDRAWWYLRKRGLVSHACYPLFKDQNANNGC----------AMASRSDGRGKRHA 333
Query: 56 TPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIF 115
T +P C N+ Y+ Y V+ +I +EIM+NGPV A M + D F
Sbjct: 334 T-KP-----CPNNIEKSNRI---YQCSPPYRVSSSETEIMKEIMQNGPVQAIMQVREDFF 384
Query: 116 SYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
YK+G I+ + + S + VK+ GWG G
Sbjct: 385 HYKTG----------------IYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRG------ 422
Query: 176 VRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
+ +W +++G+ +G+ G +ILRG NE+ IE
Sbjct: 423 -----------------------AQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIE 459
Query: 236 SLVNGA 241
L+ A
Sbjct: 460 KLIIAA 465
>gi|355748654|gb|EHH53137.1| hypothetical protein EGM_13709 [Macaca fascicularis]
Length = 475
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 60/246 (24%), Positives = 94/246 (38%), Gaps = 70/246 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAH------HSNTGCQPVSFPPCNHANYTTSEPECKTLA 55
C+SG W ++ KRGLV+ + ++N GC A + S+ K A
Sbjct: 284 CNSGSIDRAWWYLRKRGLVSHACYPLFKDQNANNGC----------AMASRSDGRGKRHA 333
Query: 56 TPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIF 115
T +P C N+ Y+ Y V+ +I +EIM+NGPV A M + D F
Sbjct: 334 T-KP-----CPNNIEKSNRI---YQCSPPYRVSSSETEIMKEIMQNGPVQAIMQVREDFF 384
Query: 116 SYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
YK+G I+ + + S + VK+ GWG G
Sbjct: 385 HYKTG----------------IYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRG------ 422
Query: 176 VRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
+ +W +++G+ +G+ G +ILRG NE+ IE
Sbjct: 423 -----------------------AQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIE 459
Query: 236 SLVNGA 241
L+ A
Sbjct: 460 KLIIAA 465
>gi|428169747|gb|EKX38678.1| hypothetical protein GUITHDRAFT_76993, partial [Guillardia theta
CCMP2712]
Length = 85
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/63 (46%), Positives = 36/63 (57%)
Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
NGP V +Y D +SYKSGVY SA A+ V V +VGWG ENG YW + + S+
Sbjct: 8 NGPGVVVFDVYDDFYSYKSGVYTKSAKAQKVGGHAVVLVGWGRENGVDYWLVQNSWGKSS 67
Query: 184 SAE 186
E
Sbjct: 68 GDE 70
>gi|402867308|ref|XP_003897801.1| PREDICTED: tubulointerstitial nephritis antigen [Papio anubis]
Length = 475
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 60/246 (24%), Positives = 94/246 (38%), Gaps = 70/246 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAH------HSNTGCQPVSFPPCNHANYTTSEPECKTLA 55
C+SG W ++ KRGLV+ + ++N GC A + S+ K A
Sbjct: 284 CNSGSIDRAWWYLRKRGLVSHACYPLFKDQNANNGC----------AMASRSDGRGKRHA 333
Query: 56 TPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIF 115
T +P C N+ Y+ Y V+ +I +EIM+NGPV A M + D F
Sbjct: 334 T-KP-----CPNNIEKSNRI---YQCSPPYRVSSSETEIMKEIMQNGPVQAIMQVREDFF 384
Query: 116 SYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
YK+G I+ + + S + VK+ GWG G
Sbjct: 385 HYKTG----------------IYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRG------ 422
Query: 176 VRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
+ +W +++G+ +G+ G +ILRG NE+ IE
Sbjct: 423 -----------------------AQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIE 459
Query: 236 SLVNGA 241
L+ A
Sbjct: 460 KLIIAA 465
>gi|149694136|ref|XP_001503950.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 1
[Equus caballus]
Length = 467
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 41/122 (33%), Positives = 60/122 (49%), Gaps = 32/122 (26%)
Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
NGPV A M ++ D F Y+ GVY+ + + +GRP R +
Sbjct: 360 NGPVQALMEVHEDFFLYQGGVYSHTPVS----------------HGRP--ERYRRHGTH- 400
Query: 184 SAEIVAYATVKLIGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
+VK+ GWGEE +GR YWT +++G +G++G +I+RG NE IES V
Sbjct: 401 --------SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFV 452
Query: 239 NG 240
G
Sbjct: 453 LG 454
>gi|320167003|gb|EFW43902.1| cathepsin B [Capsaspora owczarzaki ATCC 30864]
Length = 306
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 48/104 (46%), Gaps = 24/104 (23%)
Query: 79 YRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIF 138
Y+ K Y V + +A IQ EI+ NGPV A +Y D FSY SG ++
Sbjct: 198 YKAKTAYQVANNMAAIQSEILANGPVEAAFSVYDDFFSYTSG----------------VY 241
Query: 139 SYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVS 182
S++SG + VKIVGWG + PYW + + S
Sbjct: 242 SHQSGA--------LDGGHAVKIVGWGVDGTTPYWIVANSWGTS 277
>gi|403268748|ref|XP_003926429.1| PREDICTED: tubulointerstitial nephritis antigen [Saimiri
boliviensis boliviensis]
Length = 476
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 57/241 (23%), Positives = 92/241 (38%), Gaps = 59/241 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+SG W ++ KRGLV+ +P N T S + + + K
Sbjct: 284 CNSGSIDRAWWYLRKRGLVSHAC-----------YPLFKDQNATNSGCAMASRSDGRGKR 332
Query: 62 H-TRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
H T+ +N + Y+ Y V+ +I +EIM+NGPV A M ++ D F YK+G
Sbjct: 333 HATKPCPNNIEKS--NRIYQCSPPYRVSSSETEIMKEIMQNGPVQAIMKVHEDFFHYKTG 390
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
I+ + + S + VK+ GWG G
Sbjct: 391 ----------------IYRHVTSTNKESEKFLKLQTHAVKLTGWGTLRG----------- 423
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
+ +W +++G+ +G+ G +ILRG NE+ IE L+
Sbjct: 424 ------------------AQGRKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIA 465
Query: 241 A 241
A
Sbjct: 466 A 466
>gi|145525479|ref|XP_001448556.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124416111|emb|CAK81159.1| unnamed protein product [Paramecium tetraurelia]
Length = 490
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 37/122 (30%), Positives = 56/122 (45%), Gaps = 28/122 (22%)
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENG--RPYWTIVRVY 179
Y NGPVV N D Y G++ + I+ NG +P W V
Sbjct: 387 YNNGPVVLNFEPSFDFMFYVGGIFHSTTPDWII-------------NGLAKPEWEKVD-- 431
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
+V GWGEENG YW + +++G+Q+G+ G ++ RG++E+ IES+
Sbjct: 432 -----------HSVLCYGWGEENGVKYWLLQNSWGKQWGENGRFRMKRGQDESSIESMAE 480
Query: 240 GA 241
A
Sbjct: 481 AA 482
>gi|195585648|ref|XP_002082593.1| GD25141 [Drosophila simulans]
gi|194194602|gb|EDX08178.1| GD25141 [Drosophila simulans]
Length = 484
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 69/278 (24%), Positives = 94/278 (33%), Gaps = 78/278 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W ++HK+G+V + C P YT CK +
Sbjct: 253 CEGGHLDAAWRYLHKKGVV-------DENCYP----------YTQHRDTCKIRHNSRSLR 295
Query: 62 HTRC-TNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C T N R Y Y +N E ADI EI +GPV A M + D F+Y G
Sbjct: 296 ANGCQTPVNVDRDTL---YTVGPAYSLNRE-ADIMAEIFHSGPVQATMRVNRDFFAYSGG 351
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
Y + + +VK+VGWGEE+
Sbjct: 352 VYRE---------------------TAANRKAPTGFHSVKLVGWGEEH------------ 378
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
NG YW +++G +G+ G +ILRG NE IE V
Sbjct: 379 ---------------------NGEKYWIAANSWGSWWGEHGYFRILRGSNECGIEEYVLA 417
Query: 241 ALP--KDNYGVEFGEESGERLSEEFGVRAESSEEFREN 276
+ P + Y E G +R F S EN
Sbjct: 418 SWPYVYNYYKCEVGLRGIKRALPPFATEPISELCRNEN 455
>gi|48762497|dbj|BAD23818.1| cathepsin B-S [Tuberaphis coreana]
Length = 99
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 37/120 (30%), Positives = 57/120 (47%), Gaps = 25/120 (20%)
Query: 57 PQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFS 116
P + H +C YG+ Q++Y+ K Y +N + I+Q D+ +
Sbjct: 5 PMERNH-QCPKTCYGKTTVQNRYKTKSEYSINS-IKTIEQ----------------DLKT 46
Query: 117 YKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
Y GPV A+ +Y D YKSG+Y + A+ ++KI+GWG+ENG YW V
Sbjct: 47 Y-------GPVEASFDVYDDFSVYKSGIYRKTPKAKYEGRHSIKIIGWGQENGTTYWLAV 99
>gi|338722032|ref|XP_003364468.1| PREDICTED: tubulointerstitial nephritis antigen-like 1 isoform 2
[Equus caballus]
Length = 436
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/122 (33%), Positives = 60/122 (49%), Gaps = 32/122 (26%)
Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
NGPV A M ++ D F Y+ GVY+ + + +GRP R +
Sbjct: 329 NGPVQALMEVHEDFFLYQGGVYSHTPVS----------------HGRP--ERYRRHGTH- 369
Query: 184 SAEIVAYATVKLIGWGEE---NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
+VK+ GWGEE +GR YWT +++G +G++G +I+RG NE IES V
Sbjct: 370 --------SVKITGWGEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANECDIESFV 421
Query: 239 NG 240
G
Sbjct: 422 LG 423
>gi|197100841|ref|NP_001126804.1| tubulointerstitial nephritis antigen [Pongo abelii]
gi|55732702|emb|CAH93049.1| hypothetical protein [Pongo abelii]
Length = 476
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 58/241 (24%), Positives = 93/241 (38%), Gaps = 59/241 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+SG W ++ KRGLV+ C P+S N T + + + + K
Sbjct: 284 CNSGSIDRAWWYLRKRGLVSHA-------CYPLS----KDQNATNNGCAMASRSDGRGKR 332
Query: 62 H-TRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
H T+ +N + Y+ Y V+ +I +EIM+NGPV A M + D F YK+G
Sbjct: 333 HATKPCPNNVEKS--NRIYQCSPPYRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTG 390
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
I+ + + S + VK+ GWG G
Sbjct: 391 ----------------IYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRG----------- 423
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
+ +W +++G+ +G+ G +ILRG NE+ IE L+
Sbjct: 424 ------------------AQGQKEKFWVAANSWGKSWGENGYFRILRGVNESDIEKLIIA 465
Query: 241 A 241
A
Sbjct: 466 A 466
>gi|294877495|ref|XP_002768009.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239870149|gb|EER00727.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 180
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/75 (37%), Positives = 34/75 (45%), Gaps = 6/75 (8%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAH------HSNTGCQPVSFPPCNHANYTTSEPECKTLA 55
C G S W WVH +G+ TGG + + GC P FPPC H T P+C
Sbjct: 100 CGGGDPYSAWSWVHDKGIATGGDYVAKDDMTKDDGCWPYDFPPCAHHINDTKYPKCPEGL 159
Query: 56 TPQPKCHTRCTNDNY 70
P P C +C N Y
Sbjct: 160 YPTPNCVEQCHNPKY 174
>gi|297744106|emb|CBI37076.3| unnamed protein product [Vitis vinifera]
Length = 392
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 59/248 (23%), Positives = 87/248 (35%), Gaps = 85/248 (34%)
Query: 2 CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
C G W + G+VT + TGC S P C+ P P
Sbjct: 203 CDGGYPLYAWRYFIHHGVVTEECDPYFDATGC---------------SHPGCEP-GYPTP 246
Query: 60 KCHTRCTNDNYGRGFFQDKYRFKRY----YWVNDEVADIQQEIMKNGPVVANMYLYSDIF 115
KC +CT++N Q + KRY Y ++ + I E+ KNGPV +Y D
Sbjct: 247 KCVRKCTDEN------QLWRKAKRYGQSAYRISSDPYQIMAEVYKNGPVEVAFTVYEDFA 300
Query: 116 SYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
Y+SG ++ Y +G +++ VK++GWG
Sbjct: 301 HYESG----------------VYRYTTG--------DVMGGHAVKLIGWGT--------- 327
Query: 176 VRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
++G YW + + + +GD G I RG NE IE
Sbjct: 328 ------------------------TDDGEDYWILANQWNRNWGDDGYFMIRRGVNECGIE 363
Query: 236 SLVNGALP 243
V LP
Sbjct: 364 EGVVAGLP 371
>gi|255647484|gb|ACU24206.1| unknown [Glycine max]
Length = 327
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 59/233 (25%), Positives = 85/233 (36%), Gaps = 77/233 (33%)
Query: 2 CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
C G W ++ G+VT + GC S P C EP +T P
Sbjct: 169 CDGGYPLYAWRYLAHHGVVTEECDPYFDQIGC---SHPGC--------EPAYRT-----P 212
Query: 60 KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
KC +C + N + + + K+ Y VN + DI E+ KNGPV +Y D YKS
Sbjct: 213 KCVKKCVSGN--QVWKKSKHYSVSAYRVNSDPHDIMAEVYKNGPVEVAFTVYEDFAYYKS 270
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
G +Y I Y+ G +A VK++GWG
Sbjct: 271 G------------VYKHITGYELGGHA------------VKLIGWGT------------- 293
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEA 232
++G YW + + + ++GD G KI RG NE
Sbjct: 294 --------------------TDDGEDYWLLANQWNREWGDDGYFKIRRGTNEC 326
>gi|24657813|ref|NP_726176.1| secreted Wg-interacting molecule, isoform A [Drosophila
melanogaster]
gi|24657819|ref|NP_611652.2| secreted Wg-interacting molecule, isoform B [Drosophila
melanogaster]
gi|21064305|gb|AAM29382.1| RE01730p [Drosophila melanogaster]
gi|21626543|gb|AAF46818.2| secreted Wg-interacting molecule, isoform A [Drosophila
melanogaster]
gi|21626544|gb|AAM68213.1| secreted Wg-interacting molecule, isoform B [Drosophila
melanogaster]
gi|220949028|gb|ACL87057.1| CG3074-PA [synthetic construct]
gi|220958134|gb|ACL91610.1| CG3074-PA [synthetic construct]
Length = 431
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 60/243 (24%), Positives = 83/243 (34%), Gaps = 76/243 (31%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W ++HK+G+V + C P YT CK +
Sbjct: 253 CEGGHLDAAWRYLHKKGVV-------DENCYP----------YTQHRDTCKIRHNSRSLR 295
Query: 62 HTRCTND-NYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C N R Y Y +N E ADI EI +GPV A M + D F+Y G
Sbjct: 296 ANGCQKPVNVDRDSL---YTVGPAYSLNRE-ADIMAEIFHSGPVQATMRVNRDFFAYSGG 351
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
Y + + +VK+VGWGEE+
Sbjct: 352 VYRE---------------------TAANRKAPTGFHSVKLVGWGEEH------------ 378
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
NG YW +++G +G+ G +ILRG NE IE V
Sbjct: 379 ---------------------NGEKYWIAANSWGSWWGEHGYFRILRGSNECGIEEYVLA 417
Query: 241 ALP 243
+ P
Sbjct: 418 SWP 420
>gi|52546914|gb|AAU81590.1| cysteine proteinase, partial [Petunia x hybrida]
Length = 122
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/132 (30%), Positives = 53/132 (40%), Gaps = 34/132 (25%)
Query: 112 SDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRP 171
SD +S + Y NGPV +Y D YKSGVY E+ +A VK++GWG
Sbjct: 5 SDPYSIMTEVYKNGPVEVAFTVYEDFAHYKSGVYKHVTGDELGGHA-VKLIGWGT----- 58
Query: 172 YWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 231
E+G YW + + + +GD G KI RG NE
Sbjct: 59 ----------------------------SEDGEDYWLLANQWNRGWGDDGYFKIRRGTNE 90
Query: 232 AIIESLVNGALP 243
IE V +P
Sbjct: 91 CDIEDEVVAGMP 102
>gi|170028894|ref|XP_001842329.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167879379|gb|EDS42762.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 355
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 61/236 (25%), Positives = 80/236 (33%), Gaps = 83/236 (35%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPP-CNHANYTTSEPECKTLATPQPK 60
C+ G W + GLV+ C P S P C N C L P
Sbjct: 93 CAGGDPLKVWNYWATTGLVS-------DSCMPFSLSPLCLGFN-------CPLLCAP--- 135
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
Y D+ + + V V IQ EI+ NGPV A+ LY D K
Sbjct: 136 --------GYAGSIVGDRKKGLKVVTVAPYVDAIQSEIILNGPVEASFDLYLDFVHLKQ- 186
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
S +++ +SG GR
Sbjct: 187 --------------SQVYNSRSG----------------------PNLGR---------- 200
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
+VK+IGWG ENG YW I STFG +G++GT LRG N ++ S
Sbjct: 201 ----------QSVKIIGWGVENGTEYWLITSTFGIGWGNQGTAMFLRGVNHLVLPS 246
>gi|426250116|ref|XP_004018784.1| PREDICTED: tubulointerstitial nephritis antigen [Ovis aries]
Length = 476
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 60/247 (24%), Positives = 91/247 (36%), Gaps = 71/247 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAH-------HSNTGCQPVSFPPCNHANYTTSEPECKTL 54
C+SG W ++ KRGLV+ + +N GC A + S+ K
Sbjct: 284 CNSGSVDRAWWYLRKRGLVSHACYPLFKDQNATNNGC----------AMASRSDGRGKRH 333
Query: 55 ATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDI 114
AT T C N Y+ Y V+ +I +EIM+NGPV A M ++ D
Sbjct: 334 AT------TPCPNSIEKSNRI---YQCSPPYRVSSNETEIMREIMQNGPVQAIMQVHEDF 384
Query: 115 FSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWT 174
F+YK+G I+ + + S VK+ GWG G
Sbjct: 385 FNYKTG----------------IYRHITSTNEDSEKYRKFRTHAVKLTGWGTLRG----- 423
Query: 175 IVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 234
+W +++G+ +G+ G +ILRG NE+ I
Sbjct: 424 ------------------------AHGQKEKFWIAANSWGKSWGENGYFRILRGVNESDI 459
Query: 235 ESLVNGA 241
E L+ A
Sbjct: 460 EKLIIAA 466
>gi|260821944|ref|XP_002606363.1| hypothetical protein BRAFLDRAFT_118514 [Branchiostoma floridae]
gi|229291704|gb|EEN62373.1| hypothetical protein BRAFLDRAFT_118514 [Branchiostoma floridae]
Length = 113
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 39/123 (31%), Positives = 58/123 (47%), Gaps = 35/123 (28%)
Query: 131 MYLYSDIFSYKSGVYAVSASAE-------IVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
M + D+FSY+SGVY + A+ + +V+I+GWG E PY
Sbjct: 1 MEVKPDLFSYRSGVYRHTELAQGEPPEYRRRGWHSVRIIGWGVEMSDPY----------- 49
Query: 184 SAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
A +K YWT+ +++G Q+G++G +I+RG NE IES V G
Sbjct: 50 ------QAPIK-----------YWTVANSWGTQWGEEGYFRIVRGENECQIESFVLGVWG 92
Query: 244 KDN 246
K N
Sbjct: 93 KVN 95
>gi|388500062|gb|AFK38097.1| unknown [Lotus japonicus]
Length = 357
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 62/244 (25%), Positives = 84/244 (34%), Gaps = 77/244 (31%)
Query: 2 CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
C G W ++ G+VT + GC S P C EP +T P
Sbjct: 169 CDGGYPLYAWRYLAHHGVVTEECDPYFDQIGC---SHPGC--------EPAYQT-----P 212
Query: 60 KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
KC +C N + + + KY Y V + DI E+
Sbjct: 213 KCVRKCVKGN--QIWKKSKYFSVNAYSVKSDPYDIMAEV--------------------- 249
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
Y NGPV +Y D YKSGVY +++ +A VK++GWG
Sbjct: 250 --YKNGPVEVAFTVYEDFAHYKSGVYKHITGSQLGGHA-VKLIGWGT------------- 293
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
+ G YW I + + +GD G I RG NE IE V
Sbjct: 294 --------------------TDEGEDYWLIANQWNRSWGDDGYFMIRRGTNECGIEEDVT 333
Query: 240 GALP 243
LP
Sbjct: 334 AGLP 337
>gi|357511629|ref|XP_003626103.1| Cathepsin B [Medicago truncatula]
gi|87240982|gb|ABD32840.1| Peptidase C1A, papain; Somatotropin hormone; Peptidase C1,
propeptide [Medicago truncatula]
gi|355501118|gb|AES82321.1| Cathepsin B [Medicago truncatula]
Length = 357
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 60/244 (24%), Positives = 86/244 (35%), Gaps = 77/244 (31%)
Query: 2 CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
C G W ++ G+VT + GC S P C EP +T P
Sbjct: 169 CDGGTPIYAWRYLAHHGVVTEECDPYFDQIGC---SHPGC--------EPAYQT-----P 212
Query: 60 KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
KC +C N + + + K+ + Y V + DI E+ KNGPV ++ D YKS
Sbjct: 213 KCVRKCVKGN--QIWKRSKHYSVKAYRVKSDPQDIMAEVYKNGPVEVAFTVFEDFAHYKS 270
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
G Y + ++ SA + VK++GWG
Sbjct: 271 GVYKH----------------------ITGSA--LGGHAVKLIGWGT------------- 293
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
+ G YW + + + +GD G KI RG NE IE V
Sbjct: 294 --------------------SDEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGIEDDVT 333
Query: 240 GALP 243
LP
Sbjct: 334 AGLP 337
>gi|66801417|ref|XP_629634.1| hypothetical protein DDB_G0292462 [Dictyostelium discoideum AX4]
gi|60463014|gb|EAL61210.1| hypothetical protein DDB_G0292462 [Dictyostelium discoideum AX4]
Length = 323
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 43/150 (28%), Positives = 65/150 (43%), Gaps = 42/150 (28%)
Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
NGPV+A LYSD +K VY S++ ++ ++A V++VGWG +
Sbjct: 191 NGPVIATFMLYSDFKPHKWDVYIKSSNTQVESHA-VRVVGWGTTS--------------- 234
Query: 184 SAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE------SL 237
+G YW +++G +GDKG KI RG +EA E +
Sbjct: 235 ------------------DGVDYWIAANSWGTGWGDKGYFKIRRGSDEAAFEEGFITVTA 276
Query: 238 VNGALPKDNYGVE--FGEESGERLSEEFGV 265
++P YG+E FG S L F +
Sbjct: 277 DTASVPTSQYGLEYQFGGNSSTFLKPSFLI 306
>gi|327282776|ref|XP_003226118.1| PREDICTED: tubulointerstitial nephritis antigen-like [Anolis
carolinensis]
Length = 476
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 42/154 (27%), Positives = 64/154 (41%), Gaps = 45/154 (29%)
Query: 85 YWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGV 144
Y ++ + ADI +EI +NGPV A M +Y D F YKSG +Y I+S +
Sbjct: 361 YRISSQDADIMKEIKENGPVQAVMQVYDDFFLYKSG------------IYKHIWSLE--- 405
Query: 145 YAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGR 204
+ + ++KIVGWG E +
Sbjct: 406 -GKTQNRHQKKPHSIKIVGWGTLRD-----------------------------AEGQRQ 435
Query: 205 PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
+W +++G +G+ G +ILRG+NE IE V
Sbjct: 436 KFWIAANSWGNSWGENGYFRILRGQNECDIEKTV 469
>gi|261328564|emb|CBH11542.1| CPC cysteine peptidase, Clan CA, family C1,Cathepsin B-like,
putative [Trypanosoma brucei gambiense DAL972]
Length = 340
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 49/176 (27%), Positives = 68/176 (38%), Gaps = 37/176 (21%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPK 60
C+ G W + GLV+ CQP FP C+H + + + P C PK
Sbjct: 162 CNGGDPDRAWAYFSSTGLVS-------DYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPK 214
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C+ C + YR Y + E D +E+ GP +Y D +Y SG
Sbjct: 215 CNYTCDDPT----IPVVNYRSWTSYALQGE-DDYMRELFFRGPFEVAFDVYEDFIAYNSG 269
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
Y + V+ YL G +A V++VGWG NG PYW I
Sbjct: 270 VYHH---VSGQYL---------GGHA------------VRLVGWGTSNGVPYWKIA 301
>gi|355332948|pdb|3MOR|A Chain A, Crystal Structure Of Cathepsin B From Trypanosoma Brucei
gi|355332949|pdb|3MOR|B Chain B, Crystal Structure Of Cathepsin B From Trypanosoma Brucei
Length = 317
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 49/176 (27%), Positives = 68/176 (38%), Gaps = 37/176 (21%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPK 60
C+ G W + GLV+ CQP FP C+H + + + P C PK
Sbjct: 139 CNGGDPDRAWAYFSSTGLVS-------DYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPK 191
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C+ C + YR Y + E D +E+ GP +Y D +Y SG
Sbjct: 192 CNYTCDDPT----IPVVNYRSWTSYALQGE-DDYMRELFFRGPFEVAFDVYEDFIAYNSG 246
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
Y + V+ YL G +A V++VGWG NG PYW I
Sbjct: 247 VYHH---VSGQYL---------GGHA------------VRLVGWGTSNGVPYWKIA 278
>gi|217072748|gb|ACJ84734.1| unknown [Medicago truncatula]
gi|388505480|gb|AFK40806.1| unknown [Medicago truncatula]
Length = 359
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 60/244 (24%), Positives = 86/244 (35%), Gaps = 77/244 (31%)
Query: 2 CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
C G W ++ G+VT + GC S P C EP +T P
Sbjct: 171 CDGGTPIYAWRYLAHHGVVTEECDPYFDQIGC---SHPGC--------EPAYQT-----P 214
Query: 60 KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
KC +C N + + + K+ + Y V + DI E+ KNGPV ++ D YKS
Sbjct: 215 KCVRKCVKGN--QIWKRSKHYSVKAYRVKSDPQDIMAEVYKNGPVEVAFTVFEDFAHYKS 272
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
G Y + ++ SA + VK++GWG
Sbjct: 273 GVYKH----------------------ITGSA--LGGHAVKLIGWGT------------- 295
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
+ G YW + + + +GD G KI RG NE IE V
Sbjct: 296 --------------------SDEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGIEDDVT 335
Query: 240 GALP 243
LP
Sbjct: 336 AGLP 339
>gi|217073630|gb|ACJ85175.1| unknown [Medicago truncatula]
Length = 359
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 60/244 (24%), Positives = 86/244 (35%), Gaps = 77/244 (31%)
Query: 2 CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
C G W ++ G+VT + GC S P C EP +T P
Sbjct: 171 CDGGTPIYAWRYLAHHGVVTEECDPYFDQIGC---SHPGC--------EPAYQT-----P 214
Query: 60 KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
KC +C N + + + K+ + Y V + DI E+ KNGPV ++ D YKS
Sbjct: 215 KCVRKCVKGN--QIWKRSKHYSVKAYRVKSDPQDIMTEVYKNGPVEVAFTVFEDFAHYKS 272
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
G Y + ++ SA + VK++GWG
Sbjct: 273 GVYKH----------------------ITGSA--LGGHAVKLIGWGT------------- 295
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
+ G YW + + + +GD G KI RG NE IE V
Sbjct: 296 --------------------SDEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGIEDDVT 335
Query: 240 GALP 243
LP
Sbjct: 336 AGLP 339
>gi|225437812|ref|XP_002281936.1| PREDICTED: cathepsin B-like isoform 1 [Vitis vinifera]
gi|359480250|ref|XP_003632421.1| PREDICTED: cathepsin B-like [Vitis vinifera]
Length = 358
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 59/248 (23%), Positives = 87/248 (35%), Gaps = 85/248 (34%)
Query: 2 CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
C G W + G+VT + TGC S P C+ P P
Sbjct: 169 CDGGYPLYAWRYFIHHGVVTEECDPYFDATGC---------------SHPGCEP-GYPTP 212
Query: 60 KCHTRCTNDNYGRGFFQDKYRFKRY----YWVNDEVADIQQEIMKNGPVVANMYLYSDIF 115
KC +CT++N Q + KRY Y ++ + I E+ KNGPV +Y D
Sbjct: 213 KCVRKCTDEN------QLWRKAKRYGQSAYRISSDPYQIMAEVYKNGPVEVAFTVYEDFA 266
Query: 116 SYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
Y+SG ++ Y +G +++ VK++GWG
Sbjct: 267 HYESG----------------VYRYTTG--------DVMGGHAVKLIGWGT--------- 293
Query: 176 VRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
++G YW + + + +GD G I RG NE IE
Sbjct: 294 ------------------------TDDGEDYWILANQWNRNWGDDGYFMIRRGVNECGIE 329
Query: 236 SLVNGALP 243
V LP
Sbjct: 330 EGVVAGLP 337
>gi|347546077|gb|AEP03186.1| cathepsin B [Diuraphis noxia]
Length = 239
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 34/121 (28%), Positives = 53/121 (43%), Gaps = 12/121 (9%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + GLVTGG S GC+P PP + + C+
Sbjct: 116 CDGGYPIKAWKQFSRHGLVTGGDFDSGEGCEPYRVPPSGSNSSNSYNHFCR--------- 166
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+C DN + +D + YY+++ IQ++++ GP+ A+ +Y D YKSG
Sbjct: 167 -GKCYGDNQNISYSEDHRYTRDYYYLSYNA--IQKDVLLYGPIEASFEVYDDFMIYKSGV 223
Query: 122 Y 122
Y
Sbjct: 224 Y 224
>gi|332210168|ref|XP_003254178.1| PREDICTED: tubulointerstitial nephritis antigen [Nomascus
leucogenys]
Length = 476
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 60/246 (24%), Positives = 94/246 (38%), Gaps = 69/246 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+SG W ++ KRGLV+ +P N T++ + + + K
Sbjct: 284 CNSGSIDRAWWYLRKRGLVSHAC-----------YPLFKDQNATSNGCAMASRSDGRGKR 332
Query: 62 H-TRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
H T+ +N + Y+ Y V+ +I +EIM+NGPV A M + D F YK+G
Sbjct: 333 HATKPCPNNVEKS--NRIYQCSPPYRVSSSETEIMKEIMQNGPVQAIMQVREDFFHYKTG 390
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
Y + SA+ E Y ++
Sbjct: 391 IY---------------------RHVTSANKESEKYRKLQT------------------- 410
Query: 181 VSASAEIVAYATVKLIGWGEENG-----RPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
VKL GWG G +W +++G+ +G+ G +ILRG NE+ IE
Sbjct: 411 ----------HAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIE 460
Query: 236 SLVNGA 241
L+ A
Sbjct: 461 KLIIAA 466
>gi|126647906|ref|XP_001388062.1| preprocathepsin c precursor [Cryptosporidium parvum Iowa II]
gi|126117150|gb|EAZ51250.1| preprocathepsin c precursor, putative [Cryptosporidium parvum Iowa
II]
Length = 635
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/143 (27%), Positives = 63/143 (44%), Gaps = 39/143 (27%)
Query: 110 LYSDIFSYKSGKYG-------------NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAY 156
+Y++ + Y G YG NGP+ M++ + + Y +GVY S + Y
Sbjct: 461 MYAEEYGYVGGCYGCCDEDRMKEEIFKNGPIAVAMHIDTSLLVYDNGVYD-SIPNDHTKY 519
Query: 157 ATV---KIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTF 213
+ ++ GW N + ++GWGEENG PYW I +++
Sbjct: 520 CDLPNKQLNGWEYTN----------------------HAIAIVGWGEENGIPYWIIRNSW 557
Query: 214 GEQFGDKGTIKILRGRNEAIIES 236
G +G KG KI RG+N IE+
Sbjct: 558 GANWGKKGYAKIRRGKNIGGIEN 580
>gi|161343853|tpg|DAA06107.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 217
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 30/84 (35%), Positives = 40/84 (47%), Gaps = 1/84 (1%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQ-PK 60
C G +W + + G V+GG ++SN GCQP + PPC N C T + P
Sbjct: 130 CDGGSLFESWDYYRRHGFVSGGDYNSNQGCQPYTIPPCKLMNEKPPGHSCTTYHREETPI 189
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRY 84
C +C N NY F D Y+ K Y
Sbjct: 190 CEKKCYNPNYYTSFRTDIYKGKYY 213
>gi|72389769|ref|XP_845179.1| cysteine peptidase C (CPC) [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|427931064|pdb|4HWY|A Chain A, Trypanosoma Brucei Procathepsin B Solved From 40 Fs
Free-electron Laser Pulse Data By Serial Femtosecond
X-ray Crystallography
gi|40557577|gb|AAR88085.1| cathepsin B-like cysteine protease [Trypanosoma brucei]
gi|62360039|gb|AAX80461.1| cysteine peptidase C (CPC) [Trypanosoma brucei]
gi|70801714|gb|AAZ11620.1| cysteine peptidase C (CPC) [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
Length = 340
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 49/176 (27%), Positives = 68/176 (38%), Gaps = 37/176 (21%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPK 60
C+ G W + GLV+ CQP FP C+H + + + P C PK
Sbjct: 162 CNGGDPDRAWAYFSSTGLVS-------DYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPK 214
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C+ C + YR Y + E D +E+ GP +Y D +Y SG
Sbjct: 215 CNYTCDDPT----IPVVNYRSWTSYALQGE-DDYMRELFFRGPFEVAFDVYEDFIAYNSG 269
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
Y + V+ YL G +A V++VGWG NG PYW I
Sbjct: 270 VYHH---VSGQYL---------GGHA------------VRLVGWGTSNGVPYWKIA 301
>gi|296863454|pdb|3HHI|A Chain A, Crystal Structure Of Cathepsin B From T. Brucei In Complex
With Ca074
gi|296863455|pdb|3HHI|B Chain B, Crystal Structure Of Cathepsin B From T. Brucei In Complex
With Ca074
Length = 325
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 49/176 (27%), Positives = 67/176 (38%), Gaps = 37/176 (21%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSE-PECKTLATPQPK 60
C+ G W + GLV+ CQP FP C+H + + + P C PK
Sbjct: 140 CNGGDPDRAWAYFSSTGLVS-------DYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPK 192
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C C + YR Y + E D +E+ GP +Y D +Y SG
Sbjct: 193 CDYTCDDPT----IPVVNYRSWTSYALQGE-DDYMRELFFRGPFEVAFDVYEDFIAYNSG 247
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
Y + V+ YL G +A V++VGWG NG PYW I
Sbjct: 248 VYHH---VSGQYL---------GGHA------------VRLVGWGTSNGVPYWKIA 279
>gi|326430261|gb|EGD75831.1| hypothetical protein PTSG_07950 [Salpingoeca sp. ATCC 50818]
Length = 381
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 47/180 (26%), Positives = 73/180 (40%), Gaps = 57/180 (31%)
Query: 82 KRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYK 141
+ YY + ++ IQ++IM++GPV+A SY+ ++ D Y
Sbjct: 238 RCYYHSSSDIETIQRDIMQHGPVLA---------SYE--------------VFEDFGEYD 274
Query: 142 SGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEE 201
SGVY +GW V ++GWG E
Sbjct: 275 SGVYTCPDDGS-------DSIGW--------------------------HAVIIVGWGVE 301
Query: 202 NGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDNYGVEFGEESGERLSE 261
+ PYW + +++G FG G KI RG NE IES + +L + GV F SG +++
Sbjct: 302 DNTPYWLVQNSWGTGFGIDGYFKIARGTNECNIESRLVTSL-VNTEGVVFASTSGAAVAK 360
>gi|195384166|ref|XP_002050789.1| GJ20006 [Drosophila virilis]
gi|194145586|gb|EDW61982.1| GJ20006 [Drosophila virilis]
Length = 432
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 41/152 (26%), Positives = 56/152 (36%), Gaps = 54/152 (35%)
Query: 92 ADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASA 151
ADI EI +GPV A M +Y D FSY G Y +
Sbjct: 324 ADIMAEIYHSGPVQATMRIYRDFFSYSGGIYRQ---------------------TAANRG 362
Query: 152 EIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVS 211
+ +VK+VGWGEE+ +G YW +
Sbjct: 363 APTGFHSVKLVGWGEEH---------------------------------DGVKYWIAAN 389
Query: 212 TFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
++G +G+ G +ILRG NE IE V + P
Sbjct: 390 SWGPWWGEHGYFRILRGSNECGIEEYVLASWP 421
>gi|224586907|ref|NP_055279.3| tubulointerstitial nephritis antigen [Homo sapiens]
gi|317373501|sp|Q9UJW2.3|TINAG_HUMAN RecName: Full=Tubulointerstitial nephritis antigen; Short=TIN-Ag
gi|119624842|gb|EAX04437.1| tubulointerstitial nephritis antigen [Homo sapiens]
gi|189066513|dbj|BAG35763.1| unnamed protein product [Homo sapiens]
Length = 476
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 60/247 (24%), Positives = 93/247 (37%), Gaps = 71/247 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAH-------HSNTGCQPVSFPPCNHANYTTSEPECKTL 54
C+SG W ++ KRGLV+ + +N GC A + S+ K
Sbjct: 284 CNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNGC----------AMASRSDGRGKRH 333
Query: 55 ATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDI 114
AT +P C N+ Y+ Y V+ +I +EIM+NGPV A M + D
Sbjct: 334 AT-KP-----CPNNVEKSNRI---YQCSPPYRVSSNETEIMKEIMQNGPVQAIMQVREDF 384
Query: 115 FSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWT 174
F YK+G I+ + + S + VK+ GWG G
Sbjct: 385 FHYKTG----------------IYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRG----- 423
Query: 175 IVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 234
+ +W +++G+ +G+ G +ILRG NE+ I
Sbjct: 424 ------------------------AQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDI 459
Query: 235 ESLVNGA 241
E L+ A
Sbjct: 460 EKLIIAA 466
>gi|294890618|ref|XP_002773230.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
gi|239878281|gb|EER05046.1| cathepsin B, putative [Perkinsus marinus ATCC 50983]
Length = 238
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 48/106 (45%), Gaps = 4/106 (3%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHH---SNTGCQPVSFPPCNHANYTTSEPECKTLATPQ 58
CS G ++W ++H G+V+G + GC P +FP C H + C
Sbjct: 133 CSGGNPITSWTFLHTNGIVSGKLSKNMKAADGCWPYNFPKCAHHQKESDYKPCAKELYDT 192
Query: 59 PKCHTRCTNDNYGRGFFQDKYRFKRYYWVN-DEVADIQQEIMKNGP 103
P C + C N YG F +D++ + + I++EIM NGP
Sbjct: 193 PSCSSSCPNAKYGTAFDKDRHYTESLLPSRFGSTSSIKKEIMTNGP 238
>gi|397517574|ref|XP_003828984.1| PREDICTED: tubulointerstitial nephritis antigen [Pan paniscus]
Length = 476
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 60/247 (24%), Positives = 93/247 (37%), Gaps = 71/247 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAH-------HSNTGCQPVSFPPCNHANYTTSEPECKTL 54
C+SG W ++ KRGLV+ + +N GC A + S+ K
Sbjct: 284 CNSGSIDRAWWYLRKRGLVSHACYPLFKDHNATNNGC----------AMASRSDGRGKRH 333
Query: 55 ATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDI 114
AT +P C N+ Y+ Y V+ +I +EIM+NGPV A M + D
Sbjct: 334 AT-KP-----CPNNVEKSNRI---YQCSPPYRVSSNETEIMKEIMQNGPVQAIMQVREDF 384
Query: 115 FSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWT 174
F YK+G I+ + + S + VK+ GWG G
Sbjct: 385 FHYKTG----------------IYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRG----- 423
Query: 175 IVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 234
+ +W +++G+ +G+ G +ILRG NE+ I
Sbjct: 424 ------------------------AQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDI 459
Query: 235 ESLVNGA 241
E L+ A
Sbjct: 460 EKLIIAA 466
>gi|224064398|ref|XP_002301456.1| predicted protein [Populus trichocarpa]
gi|222843182|gb|EEE80729.1| predicted protein [Populus trichocarpa]
Length = 325
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 56/244 (22%), Positives = 87/244 (35%), Gaps = 77/244 (31%)
Query: 2 CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
C G W + + G+VT + + GC S P C+ P P
Sbjct: 137 CDGGYPIDAWRYFVQSGVVTEECDPYFDDIGC---------------SHPGCEP-GFPTP 180
Query: 60 KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
KC +C + N + + + K+ Y ++ + I E+ NGPV +Y D F++
Sbjct: 181 KCERKCADKN--KLWAESKHFSVNAYRIDSDPHSIMAEVSMNGPVEVAFTVYED-FAH-- 235
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
YKSGVY + +++ VK++GWG
Sbjct: 236 --------------------YKSGVYK-HITGDVMGGHAVKLIGWGT------------- 261
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
++G YW + + + +GD G KI RG NE IE V
Sbjct: 262 --------------------SDDGEDYWLLANQWNRGWGDDGYFKIRRGTNECGIEEDVV 301
Query: 240 GALP 243
LP
Sbjct: 302 AGLP 305
>gi|357623033|gb|EHJ74345.1| tubulointerstitial nephritis antigen [Danaus plexippus]
Length = 426
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 34/119 (28%), Positives = 52/119 (43%), Gaps = 32/119 (26%)
Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
+GPV A M ++ D F Y G+Y S PY
Sbjct: 330 SGPVHAVMTVHQDFFHYHDGIYRRS----------------------PY----------G 357
Query: 184 SAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 242
+ +V+++GWGE+ G YW + +++G +G+ G +I RG NE+ IES V L
Sbjct: 358 DNTLQGLHSVRIVGWGEDRGDKYWVVANSWGCDWGENGYFRIARGSNESGIESFVVTVL 416
>gi|332824268|ref|XP_518550.3| PREDICTED: tubulointerstitial nephritis antigen [Pan troglodytes]
Length = 476
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 60/247 (24%), Positives = 93/247 (37%), Gaps = 71/247 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAH-------HSNTGCQPVSFPPCNHANYTTSEPECKTL 54
C+SG W ++ KRGLV+ + +N GC A + S+ K
Sbjct: 284 CNSGSIDRAWWYLRKRGLVSHACYPLFKDHNATNNGC----------AMASRSDGRGKRH 333
Query: 55 ATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDI 114
AT +P C N+ Y+ Y V+ +I +EIM+NGPV A M + D
Sbjct: 334 AT-KP-----CPNNVEKSNRI---YQCSPPYRVSSNETEIMKEIMQNGPVQAIMQVREDF 384
Query: 115 FSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWT 174
F YK+G I+ + + S + VK+ GWG G
Sbjct: 385 FHYKTG----------------IYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRG----- 423
Query: 175 IVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 234
+ +W +++G+ +G+ G +ILRG NE+ I
Sbjct: 424 ------------------------AQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDI 459
Query: 235 ESLVNGA 241
E L+ A
Sbjct: 460 EKLIIAA 466
>gi|224128101|ref|XP_002320244.1| predicted protein [Populus trichocarpa]
gi|222861017|gb|EEE98559.1| predicted protein [Populus trichocarpa]
Length = 339
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 49/197 (24%), Positives = 73/197 (37%), Gaps = 60/197 (30%)
Query: 47 SEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVA 106
S P C+ P PKC +C + N + + + K+ Y ++ + I E+ NGPV
Sbjct: 183 SHPGCEP-GFPTPKCERKCADKN--KLWAESKHFSVNAYRIDSDPHSIMAEVSSNGPVEV 239
Query: 107 NMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGE 166
+Y D F++ YKSGVY + + + VK++GWG
Sbjct: 240 AFTVYED-FAH----------------------YKSGVYK-HITGDAMGGHAVKLIGWGT 275
Query: 167 ENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKIL 226
E+G YW + + + +GD G KI
Sbjct: 276 ---------------------------------SEDGEDYWLLANQWNRGWGDDGYFKIK 302
Query: 227 RGRNEAIIESLVNGALP 243
RG NE IE V LP
Sbjct: 303 RGTNECGIEGAVVAGLP 319
>gi|426353589|ref|XP_004044272.1| PREDICTED: tubulointerstitial nephritis antigen [Gorilla gorilla
gorilla]
Length = 476
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 60/247 (24%), Positives = 93/247 (37%), Gaps = 71/247 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAH-------HSNTGCQPVSFPPCNHANYTTSEPECKTL 54
C+SG W ++ KRGLV+ + +N GC A + S+ K
Sbjct: 284 CNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNGC----------AMASRSDGRGKRH 333
Query: 55 ATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDI 114
AT +P C N+ Y+ Y V+ +I +EIM+NGPV A M + D
Sbjct: 334 AT-KP-----CPNNVEKSNRI---YQCSPPYRVSSNETEIMKEIMQNGPVQAIMQVREDF 384
Query: 115 FSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWT 174
F YK+G I+ + + S + VK+ GWG G
Sbjct: 385 FHYKTG----------------IYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRG----- 423
Query: 175 IVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 234
+ +W +++G+ +G+ G +ILRG NE+ I
Sbjct: 424 ------------------------AQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDI 459
Query: 235 ESLVNGA 241
E L+ A
Sbjct: 460 EKLIIAA 466
>gi|294895531|ref|XP_002775206.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239881224|gb|EER07022.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 130
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 49/113 (43%), Gaps = 35/113 (30%)
Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
K + NGPV+ + LY DI YK+GVY
Sbjct: 41 KQEIFTNGPVIGMLSLYEDIRVYKAGVY-------------------------------- 68
Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 230
V + T+K+IGWG E+G+ YW V+++ E++GD G IK+ GR
Sbjct: 69 ---VHQTGSFQGIHTLKIIGWGVESGQDYWLAVNSWNEEWGDHGMIKLAVGRT 118
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 31/103 (30%), Positives = 46/103 (44%), Gaps = 24/103 (23%)
Query: 74 FFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYL 133
+ +D +R K + + +I+QEI NGPV+ + LY DI YK+G Y
Sbjct: 20 YIRDLHRAKSFGRLPAIPQNIKQEIFTNGPVIGMLSLYEDIRVYKAGVY----------- 68
Query: 134 YSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
V + T+KI+GWG E+G+ YW V
Sbjct: 69 -------------VHQTGSFQGIHTLKIIGWGVESGQDYWLAV 98
>gi|328712819|ref|XP_001942906.2| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
[Acyrthosiphon pisum]
gi|328712821|ref|XP_003244911.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
[Acyrthosiphon pisum]
Length = 463
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 62/240 (25%), Positives = 87/240 (36%), Gaps = 67/240 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W W+ K GL+T + P T + P+ K Q
Sbjct: 262 CQGGHLTRAWNWIRKFGLITEECY------------PWQGRMSTCAVPKKKKETMAQCPS 309
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
R ND + +R Y V E I EI+ +GPV A M + D F YKSG
Sbjct: 310 RVRSNNDRTTKTRL---HRVGPVYRVATEEG-IMHEILTSGPVQAVMKVSRDFFMYKSGV 365
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y +N+ AS Y +V+IVGWGEE
Sbjct: 366 YK----CSNL-----------------ASGSRTGYHSVRIVGWGEE-------------- 390
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
Y K++ YW +++G +G+ G +IL+G +E IE V A
Sbjct: 391 --------YQGGKIV--------KYWIASNSWGSWWGENGYFRILKGVDECEIEDFVIAA 434
>gi|198434980|ref|XP_002126076.1| PREDICTED: similar to LOC100124858 protein [Ciona intestinalis]
Length = 541
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 43/166 (25%), Positives = 68/166 (40%), Gaps = 46/166 (27%)
Query: 79 YRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIF 138
Y+ Y V+ +I +EI +NGPV A M + D F YKSG Y + + N+
Sbjct: 414 YKTSPVYRVSSNEENIMKEIFENGPVQAVMRVQPDFFVYKSGVY-SSTAIDNI------- 465
Query: 139 SYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGW 198
V + Y +VKI+GWGE+ +
Sbjct: 466 --------VVEQVKDNTYHSVKIIGWGEKKSK---------------------------- 489
Query: 199 GEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
N YW + +++G +G+ G +I +G NE IE ++ A P+
Sbjct: 490 --TNSGKYWIVQNSWGANWGEGGYFRIRKGVNECGIEEMILAAWPQ 533
>gi|449283627|gb|EMC90232.1| Tubulointerstitial nephritis antigen [Columba livia]
Length = 469
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 44/163 (26%), Positives = 69/163 (42%), Gaps = 57/163 (34%)
Query: 79 YRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIF 138
YR +Y V+ + +I +EIM GPV A M +Y D F YK G Y +
Sbjct: 354 YRCASHYRVSSKETNIMKEIMDKGPVQAIMKVYEDFFLYKEGIYRH-------------- 399
Query: 139 SYKSGVYAVSASAEIVAYATVKIVGWG---EENGRPYWTIVRVYAVSASAEIVAYATVKL 195
S K+G + S VK++GWG ++NG+
Sbjct: 400 SQKAGSKWKTHS--------VKLLGWGALADKNGQK------------------------ 427
Query: 196 IGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
+ +W +++G+ +G+ G +ILRG+NE IE L+
Sbjct: 428 --------QKFWIAANSWGKSWGENGYFRILRGQNECDIEKLI 462
>gi|449498128|ref|XP_002193225.2| PREDICTED: tubulointerstitial nephritis antigen [Taeniopygia
guttata]
Length = 469
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 45/167 (26%), Positives = 69/167 (41%), Gaps = 57/167 (34%)
Query: 79 YRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIF 138
YR +Y V+ + DI +EI GPV A M +Y D F YK G Y +
Sbjct: 354 YRCASHYRVSSKETDIMKEIKDRGPVQAIMKVYEDFFLYKEGIYQH-------------- 399
Query: 139 SYKSGVYAVSASAEIVAYATVKIVGWG---EENGRPYWTIVRVYAVSASAEIVAYATVKL 195
S K+G + S VK++GWG ++NG+
Sbjct: 400 SQKAGSKWKTHS--------VKLLGWGALPDKNGQK------------------------ 427
Query: 196 IGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 242
+ +W +++G+ +G+ G +ILRG+NE IE L+ L
Sbjct: 428 --------QKFWIAANSWGKSWGENGYFRILRGQNECDIEKLILATL 466
>gi|328712825|ref|XP_001945477.2| PREDICTED: tubulointerstitial nephritis antigen-like isoform 1
[Acyrthosiphon pisum]
Length = 487
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 41/128 (32%), Positives = 59/128 (46%), Gaps = 37/128 (28%)
Query: 125 GPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSAS 184
G V A M + + F Y+SGVY S + G + G
Sbjct: 368 GSVQAMMKVSKEFFMYESGVYKCSK------------LDLGSKTG--------------- 400
Query: 185 AEIVAYATVKLIGWGEE--NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
Y TV+++GWGEE NGR YW + +++G +G+ G +IL+G NE IE V
Sbjct: 401 -----YHTVRIVGWGEEQQNGRTVKYWIVSNSWGLWWGESGYFRILKGTNECQIEDFVVA 455
Query: 241 ALPK-DNY 247
A+P DN+
Sbjct: 456 AMPDIDNF 463
>gi|156708118|gb|ABU93317.1| cathepsin B8 cysteine protease, partial [Monocercomonoides sp. PA]
Length = 275
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 54/244 (22%), Positives = 84/244 (34%), Gaps = 91/244 (37%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W W+ K+G+ T C P Y + P C
Sbjct: 121 CEGGYADRVWNWIQKKGITT-------EQCLP----------YVSGSGRV-------PTC 156
Query: 62 HTRCTN-DNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
++C N N R F W + + E+ NGPV A ++ D +YKSG
Sbjct: 157 PSKCKNGSNIVRSFVSS--------WGSFNSKTVMDEVANNGPVYACFEVFEDFLNYKSG 208
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
I+ +K+G + + V ++GWG ENG P
Sbjct: 209 ----------------IYQHKTG--------KSKGWHHVMLMGWGTENGVP--------- 235
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
YW + +++G +G+KG +I RG N+ I+ +
Sbjct: 236 -------------------------YWLLQNSWGSGWGEKGFFRIRRGTNDCHIDEIFYS 270
Query: 241 ALPK 244
LPK
Sbjct: 271 GLPK 274
>gi|268572247|ref|XP_002648914.1| Hypothetical protein CBG17827 [Caenorhabditis briggsae]
Length = 150
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 36/114 (31%), Positives = 50/114 (43%), Gaps = 25/114 (21%)
Query: 59 PKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
PKC C Y + +DK Y+V + IQ EIM NGPV A+ +Y D + YK
Sbjct: 62 PKCALSC-QSKYNTEYAKDKNFGSSAYYVGRNFSVIQTEIMTNGPVEASFTVYEDFYIYK 120
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPY 172
G ++ Y +G E++ +KI+GWG ENG Y
Sbjct: 121 KG----------------VYQYTAG--------EVLGGHAIKIIGWGTENGTDY 150
>gi|443686962|gb|ELT90079.1| hypothetical protein CAPTEDRAFT_166233 [Capitella teleta]
Length = 495
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 37/121 (30%), Positives = 58/121 (47%), Gaps = 30/121 (24%)
Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
NGPV A M++ D ++Y+ GVY S + + Y + G+E
Sbjct: 364 NGPVQAVMHVKDDFYTYERGVYKHSHAPKPANYPHL-----GKE---------------- 402
Query: 184 SAEIVAYATVKLIGWGEE----NGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
AY +V++IGWG + + YW +T+G +G+ G +I RG +E+ IES V
Sbjct: 403 -----AYHSVRIIGWGTDYTGDDPIKYWLAANTWGRHWGEGGFFRIARGSDESHIESFVV 457
Query: 240 G 240
G
Sbjct: 458 G 458
>gi|15150360|gb|AAK85411.1| cathepsin B-like protease [Trypanosoma rangeli]
Length = 207
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 48/175 (27%), Positives = 73/175 (41%), Gaps = 38/175 (21%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G W++ + G+V+ CQP FPPC H +T C ++ P C
Sbjct: 65 CNGGDPDWAWLYYVETGIVS-------EFCQPYPFPPCAHHVNSTHYTPC-SVEYDTPFC 116
Query: 62 HTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGK 121
+ CTN KY+ + Y ++ E D ++E ++LY
Sbjct: 117 NITCTNT-----IPPIKYKGRISYSLSGE-EDYKRE----------LFLY---------- 150
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
GP +Y D +Y GVY + + +A V++VGWG NG PYW I
Sbjct: 151 ---GPFEVAFTVYEDFVAYSDGVYKHFSGNALGGHA-VRLVGWGNLNGTPYWKIA 201
>gi|300176830|emb|CBK25399.2| unnamed protein product [Blastocystis hominis]
Length = 563
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 30/123 (24%), Positives = 55/123 (44%), Gaps = 35/123 (28%)
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y GP+ ++ + D+ YK G+Y + A+ + +A
Sbjct: 189 YARGPITCSIAVPDDLMEYKGGIYRDTTGAKTLDHA------------------------ 224
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+ ++GWGEE+G+ YW +++G +G+KG +I+RG N IE+ A
Sbjct: 225 -----------ISVVGWGEEDGQKYWIARNSWGTFWGEKGWFRIVRGENNLGIEADCQWA 273
Query: 242 LPK 244
+P+
Sbjct: 274 VPR 276
Score = 41.6 bits (96), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 42/201 (20%), Positives = 76/201 (37%), Gaps = 64/201 (31%)
Query: 44 YTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGP 103
Y + EC +A RC + G + K +KRY ++ + G
Sbjct: 422 YEAIDKECNDMA--------RCMDCPPGEDCYPVK-DYKRY------------KVSEYGE 460
Query: 104 VVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVG 163
V M + ++IF+ GPV +M + + +Y+ G++
Sbjct: 461 VKGEMEIKAEIFA-------RGPVSCSMIVTEEFLAYQGGIF------------------ 495
Query: 164 WGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGE-ENGRPYWTIVSTFGEQFGDKGT 222
V IV Y V++ GWGE E+G YW +++G +G+ G
Sbjct: 496 -----------------VDDRGHIVGYHAVEVAGWGETEDGTKYWIARNSWGPYWGEHGW 538
Query: 223 IKILRGRNEAIIESLVNGALP 243
+++ G ++ +I N +P
Sbjct: 539 FRMIVGVSKGLITGYCNWGVP 559
>gi|168020784|ref|XP_001762922.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685734|gb|EDQ72127.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 345
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 59/247 (23%), Positives = 89/247 (36%), Gaps = 80/247 (32%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGA--HHSNTGC-QPVSFPPCNHANYTTSEPECKTLATPQ 58
C G W + + G+VT + GC P +P Y T
Sbjct: 169 CEGGYPIRAWQYFKRTGVVTSKCDPYFDQKGCGHPGCYP-----TYDT------------ 211
Query: 59 PKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
PKC RC +D + K+ Y V+ E ++ E+ NGP+ D+F
Sbjct: 212 PKCFKRCVDDEL---WVSSKHLGVSAYEVSMEPEELMAELFTNGPIEVAF----DVFE-- 262
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
D YK+GVY I +A VK+VGWG
Sbjct: 263 -----------------DFAHYKTGVYKHLYGGYIGGHA-VKLVGWGT------------ 292
Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
++G YW++V+++ +G+ GT +ILRG++E IES
Sbjct: 293 ---------------------TDDGVDYWSMVNSWNTNWGEDGTFRILRGKDECGIESNA 331
Query: 239 NGALPKD 245
LP +
Sbjct: 332 VAGLPSN 338
>gi|168000937|ref|XP_001753172.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695871|gb|EDQ82213.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 347
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 61/245 (24%), Positives = 89/245 (36%), Gaps = 80/245 (32%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGA--HHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
C G W + G+VT + GC + P C Y T E P
Sbjct: 171 CEGGYPIRAWKYFKHSGVVTNKCDPYFDQKGC---AHPGC----YPTYE---------TP 214
Query: 60 KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
KC +C +D + + Q K+ Y ++ E D+ E+ NGPV +Y D YK+
Sbjct: 215 KCEKQCVDDEF---WVQSKHLGVNAYEMSMEPEDLMAELYTNGPVEVAFEVYEDFAHYKT 271
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWG-EENGRPYWTIVRV 178
G +Y +F G +A VK++GWG ++G YWTIV
Sbjct: 272 G------------VYKHLFGGFMGGHA------------VKLIGWGTTDDGVDYWTIVNS 307
Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
+ + WGE+ G +I+RG +E IES
Sbjct: 308 WNTN---------------WGED-------------------GLFRIVRGNDECGIESNA 333
Query: 239 NGALP 243
LP
Sbjct: 334 VAGLP 338
>gi|355724272|gb|AES08175.1| tubulointerstitial nephritis antigen [Mustela putorius furo]
Length = 476
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 59/247 (23%), Positives = 94/247 (38%), Gaps = 71/247 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAH-------HSNTGCQPVSFPPCNHANYTTSEPECKTL 54
C+SG W ++ KRGLV+ + +N GC A + S+ K
Sbjct: 284 CNSGSIDRAWWFLRKRGLVSHACYPLFKDQNATNDGC----------AMASRSDGRGKRH 333
Query: 55 ATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDI 114
AT +P C N+ Y+ Y V+ +I +EIM+NGPV A M ++ D
Sbjct: 334 AT-KP-----CPNNIEKSNRI---YQCSPPYRVSSNETEIMKEIMQNGPVQAIMQVHEDF 384
Query: 115 FSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWT 174
F YK+G I+ + + ++ VK+ GWG G
Sbjct: 385 FHYKTG----------------IYRHVTRTNEEASKYRKFQTHAVKLTGWGTLKG----- 423
Query: 175 IVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 234
+ +W +++G+ +G+ G +ILRG NE+ I
Sbjct: 424 ------------------------AQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDI 459
Query: 235 ESLVNGA 241
E L+ A
Sbjct: 460 EKLIIAA 466
>gi|126310154|ref|XP_001364630.1| PREDICTED: tubulointerstitial nephritis antigen [Monodelphis
domestica]
Length = 468
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 60/247 (24%), Positives = 92/247 (37%), Gaps = 71/247 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAH-------HSNTGCQPVSFPPCNHANYTTSEPECKTL 54
C G W ++ KRGLV+ + +N GC S S+ K
Sbjct: 276 CKGGSIDRAWWYLRKRGLVSHACYPLFKDQIFNNNGCDMAS----------RSDGRGKRH 325
Query: 55 ATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDI 114
AT +P C N+ Y+ Y V+ +I +EIM+NGPV A M ++ D
Sbjct: 326 AT-KP-----CPNNIEKSNRI---YQCSPPYRVSSNETEIMKEIMQNGPVQAIMQVHEDF 376
Query: 115 FSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWT 174
F YKSG I+ + + + S + VK+ GWG G
Sbjct: 377 FHYKSG----------------IYRHINNLKDESEKYRNLRTHAVKLTGWGVLRG----- 415
Query: 175 IVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 234
+ +W +++G+ +G+ G +ILRG NE+ I
Sbjct: 416 ------------------------AQGKKEKFWIAANSWGKSWGENGYFRILRGVNESDI 451
Query: 235 ESLVNGA 241
E L+ A
Sbjct: 452 EKLIIAA 458
>gi|440907441|gb|ELR57591.1| Tubulointerstitial nephritis antigen [Bos grunniens mutus]
Length = 476
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 59/247 (23%), Positives = 91/247 (36%), Gaps = 71/247 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAH-------HSNTGCQPVSFPPCNHANYTTSEPECKTL 54
C+S W ++ KRGLV+ + +N GC A + S+ K
Sbjct: 284 CNSESVDRAWWYLRKRGLVSHACYPLFKDQNATNNGC----------AMASRSDGRGKRH 333
Query: 55 ATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDI 114
AT T C N Y+ Y V+ +I +EIM+NGPV A M ++ D
Sbjct: 334 AT------TPCPNSIEKSNRI---YQCSPPYRVSSNETEIMREIMQNGPVQAIMQVHEDF 384
Query: 115 FSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWT 174
F+YK+G I+ + + S VK+ GWG G
Sbjct: 385 FNYKTG----------------IYRHITSTNEDSEKYRKFRTHAVKLTGWGTLRG----- 423
Query: 175 IVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 234
+ +W +++G+ +G+ G +ILRG NE+ I
Sbjct: 424 ------------------------AQGQKEKFWIAANSWGKSWGENGYFRILRGVNESDI 459
Query: 235 ESLVNGA 241
E L+ A
Sbjct: 460 EKLIIAA 466
>gi|195121981|ref|XP_002005491.1| GI19039 [Drosophila mojavensis]
gi|193910559|gb|EDW09426.1| GI19039 [Drosophila mojavensis]
Length = 432
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 62/246 (25%), Positives = 87/246 (35%), Gaps = 82/246 (33%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G + W ++HK+G+V + C P YT CK +
Sbjct: 253 CEGGHLDAAWRYLHKKGVV-------DETCYP----------YTQRRDSCKI------RH 289
Query: 62 HTRCTNDNYGR---GFFQDK-YRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSY 117
++R N R G +D Y Y + E DI EI +GPV A M +Y D FSY
Sbjct: 290 NSRSLKANGCRPAYGVNRDSLYTVGPAYSLKGET-DIMAEIYHSGPVQATMRVYRDFFSY 348
Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
G Y + + +VKIVGWGEE+
Sbjct: 349 SGGVYRQ---------------------TAANRGAPTGFHSVKIVGWGEEH--------- 378
Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
+G YW +++G +G+ G +ILRG NE IE
Sbjct: 379 ------------------------DGVKYWIAANSWGPWWGEHGYFRILRGSNECGIEEY 414
Query: 238 VNGALP 243
V + P
Sbjct: 415 VLASWP 420
>gi|15723272|gb|AAL06324.1| cathepsin B-like protease [Trypanosoma cruzi]
Length = 208
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 41/146 (28%), Positives = 64/146 (43%), Gaps = 32/146 (21%)
Query: 31 CQPVSFPPC-NHANYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVND 89
CQP FP C +H N + P TP C++ CT+ + KYR ++
Sbjct: 87 CQPYPFPSCAHHVNSSDLSPCSGEYDTPT--CNSTCTD----KKIPLIKYRGNTSCILSG 140
Query: 90 EVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSA 149
E ++E++ NGP + +Y+D +Y G ++ + +GV+
Sbjct: 141 E-ESFKRELLLNGPFEVSFSVYADFVAYTGG----------------VYKHVTGVF---- 179
Query: 150 SAEIVAYATVKIVGWGEENGRPYWTI 175
+ V+IVGWGE NG PYW I
Sbjct: 180 ----LGGHAVRIVGWGELNGEPYWKI 201
>gi|294956046|ref|XP_002788796.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239904363|gb|EER20592.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 130
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 33/113 (29%), Positives = 49/113 (43%), Gaps = 35/113 (30%)
Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
K + NGPV+ + +Y DI YK+GVY
Sbjct: 41 KQEIFTNGPVIGMLSIYEDIRVYKAGVY-------------------------------- 68
Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 230
V + T+K+IGWG E+G+ YW V+++ E++GD G IK+ GR
Sbjct: 69 ---VHQTGSFQGIHTLKIIGWGVESGQDYWLAVNSWNEEWGDHGMIKLAVGRT 118
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 46/103 (44%), Gaps = 24/103 (23%)
Query: 74 FFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYL 133
+ +D +R K + + +I+QEI NGPV+ + +Y DI YK+G Y
Sbjct: 20 YIRDLHRAKSFGRLPAIPQNIKQEIFTNGPVIGMLSIYEDIRVYKAGVY----------- 68
Query: 134 YSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
V + T+KI+GWG E+G+ YW V
Sbjct: 69 -------------VHQTGSFQGIHTLKIIGWGVESGQDYWLAV 98
>gi|6449322|gb|AAF08931.1| tubulointerstitial nephritis antigen isoform TIN-ag [Homo sapiens]
Length = 476
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 61/247 (24%), Positives = 92/247 (37%), Gaps = 71/247 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAH-------HSNTGCQPVSFPPCNHANYTTSEPECKTL 54
C+SG W ++ KRGLV+ + +N GC A + S+ K
Sbjct: 284 CNSGSIDRAWWYLRKRGLVSHACYPLFKDQNATNNGC----------AMASRSDGRGKRD 333
Query: 55 ATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDI 114
AT +P C N+ Y+ Y V+ +I +EIM+NGPV A M + D
Sbjct: 334 AT-KP-----CPNNVEKSNRI---YQCSPPYRVSSNETEIMKEIMQNGPVQAIMQVREDF 384
Query: 115 FSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWT 174
F YK+G I+ + + S + VK+ GWG G
Sbjct: 385 FHYKTG----------------IYRHVTSTNKESEKYRKLQTHAVKLTGWGTLRG----- 423
Query: 175 IVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAII 234
+ +W + +G+ +G+ G +ILRG NE+ I
Sbjct: 424 ------------------------AQGQKEKFWIAANFWGKSWGENGYFRILRGVNESDI 459
Query: 235 ESLVNGA 241
E LV A
Sbjct: 460 EKLVIAA 466
>gi|253744515|gb|EET00718.1| Cathepsin B precursor [Giardia intestinalis ATCC 50581]
Length = 306
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 35/119 (29%), Positives = 56/119 (47%), Gaps = 32/119 (26%)
Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
+GPV A+M +Y D Y+SGVY ++I ++A V+I+G+G +
Sbjct: 215 DGPVQASMAVYRDFLYYRSGVYRHVYGSQISSHA-VEIIGYGAAD--------------- 258
Query: 184 SAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGAL 242
+E+ PYW + ++ G +G++G I+RG NE IES V L
Sbjct: 259 ----------------DEDSTPYWIVKNSLGSGWGEEGYFNIVRGSNECDIESAVYSGL 301
>gi|53850626|ref|NP_001005549.1| tubulointerstitial nephritis antigen precursor [Rattus norvegicus]
gi|51858645|gb|AAH81887.1| Tubulointerstitial nephritis antigen [Rattus norvegicus]
gi|149019129|gb|EDL77770.1| tubulointerstitial nephritis antigen [Rattus norvegicus]
Length = 475
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 63/256 (24%), Positives = 95/256 (37%), Gaps = 89/256 (34%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+SG W ++ KRGLV+ HA Y P K +T C
Sbjct: 283 CNSGSIDRAWWFLRKRGLVS-------------------HACY----PLFKEQSTNNNSC 319
Query: 62 HTRCTNDNYGRGF--------FQDKYRFKRY---YWVNDEVADIQQEIMKNGPVVANMYL 110
+D G+ F+ R + Y ++ +I +EI++NGPV A M +
Sbjct: 320 AMASRSDGRGKRHATRPCPNSFEKSNRIYQCSPPYRISSNETEIMREIIQNGPVQAIMQV 379
Query: 111 YSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGR 170
+ D F YK+G Y + VS + E Y
Sbjct: 380 HEDFFYYKTGIY---------------------RHVVSTNEEPEKYRK------------ 406
Query: 171 PYWTIVRVYAVSASAEIVAYATVKLIGWGEENG-----RPYWTIVSTFGEQFGDKGTIKI 225
+R +A VKL GWG G +W +++G+ +G+ G +I
Sbjct: 407 -----LRTHA------------VKLTGWGTLRGAQGKKEKFWIAANSWGKSWGENGYFRI 449
Query: 226 LRGRNEAIIESLVNGA 241
LRG NE+ IE L+ A
Sbjct: 450 LRGVNESDIEKLIIAA 465
>gi|145546673|ref|XP_001459019.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124426842|emb|CAK91622.1| unnamed protein product [Paramecium tetraurelia]
Length = 476
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 38/120 (31%), Positives = 59/120 (49%), Gaps = 26/120 (21%)
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
+ NGPVV N D Y GV+ ++T+ P W I +
Sbjct: 375 HKNGPVVLNFEPSFDFMFYVGGVF----------HSTI-----------PDWIINGL--- 410
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
A E V + +V GWGEENG YW + +++G+Q+G+ G ++ RG++E+ IES+ A
Sbjct: 411 -AKPEWVDH-SVLCYGWGEENGVKYWLLQNSWGKQWGENGRFRMKRGQDESSIESMAEAA 468
>gi|294871893|ref|XP_002766082.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239866672|gb|EEQ98799.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 118
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 33/113 (29%), Positives = 49/113 (43%), Gaps = 35/113 (30%)
Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
K + NGPV+ + +Y DI YK+GVY
Sbjct: 29 KQEIFTNGPVIGALTIYEDIRVYKAGVY-------------------------------- 56
Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 230
V + T+K+IGWG E+G+ YW V+++ E++GD G IK+ GR
Sbjct: 57 ---VHQTGSFQGIHTLKIIGWGVESGQDYWLAVNSWNEEWGDHGMIKLAVGRT 106
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/103 (29%), Positives = 46/103 (44%), Gaps = 24/103 (23%)
Query: 74 FFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYL 133
+ +D +R K + + +I+QEI NGPV+ + +Y DI YK+G Y
Sbjct: 8 YIRDLHRAKSFGRLPAIPQNIKQEIFTNGPVIGALTIYEDIRVYKAGVY----------- 56
Query: 134 YSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIV 176
V + T+KI+GWG E+G+ YW V
Sbjct: 57 -------------VHQTGSFQGIHTLKIIGWGVESGQDYWLAV 86
>gi|63115212|gb|AAY33830.1| cathepsin B, partial [Siniperca chuatsi]
Length = 69
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 33/53 (62%)
Query: 192 TVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
+K++GWGEE+G PYW +++ +GD G K LRG + IES + +PK
Sbjct: 17 AIKILGWGEEDGVPYWLCANSWNTDWGDNGFFKFLRGSDHCRIESEIVAGIPK 69
>gi|14789619|gb|AAH10745.1| Tubulointerstitial nephritis antigen [Mus musculus]
Length = 475
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 60/246 (24%), Positives = 97/246 (39%), Gaps = 69/246 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+SG W ++ KRGLV+ +P N T + + + + K
Sbjct: 283 CNSGSIDRAWWFLRKRGLVSHAC-----------YPLFKDQNTTNNICAMASRSDGRGKR 331
Query: 62 H-TRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
H T+ +++ + Y+ Y V+ +I +EI++NGPV A M ++ D F YK+G
Sbjct: 332 HATKPCPNSFEKS--NRIYQCSPPYRVSSNETEIMREIIQNGPVQAIMQVHEDFFYYKTG 389
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
Y + VS + E Y +R +A
Sbjct: 390 IY---------------------RHVVSTNEEPEKYKK-----------------LRTHA 411
Query: 181 VSASAEIVAYATVKLIGWGEENG-----RPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
VKL GWG G +W +++G+ +G+ G +ILRG NE+ IE
Sbjct: 412 ------------VKLTGWGTLRGARGKKEKFWIAANSWGKSWGENGYFRILRGVNESDIE 459
Query: 236 SLVNGA 241
L+ A
Sbjct: 460 KLIIAA 465
>gi|227499499|ref|NP_036163.3| tubulointerstitial nephritis antigen precursor [Mus musculus]
gi|4929827|gb|AAD34171.1| tubulo-interstitial nephritis antigen [Mus musculus]
gi|148694397|gb|EDL26344.1| tubulointerstitial nephritis antigen, isoform CRA_a [Mus musculus]
Length = 475
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 60/246 (24%), Positives = 97/246 (39%), Gaps = 69/246 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+SG W ++ KRGLV+ +P N T + + + + K
Sbjct: 283 CNSGSIDRAWWFLRKRGLVSHAC-----------YPLFKDQNTTNNICAMASRSDGRGKR 331
Query: 62 H-TRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
H T+ +++ + Y+ Y V+ +I +EI++NGPV A M ++ D F YK+G
Sbjct: 332 HATKPCPNSFEKS--NRIYQCSPPYRVSSNETEIMREIIQNGPVQAIMQVHEDFFYYKTG 389
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
Y + VS + E Y +R +A
Sbjct: 390 IY---------------------RHVVSTNEEPEKYKK-----------------LRTHA 411
Query: 181 VSASAEIVAYATVKLIGWGEENG-----RPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
VKL GWG G +W +++G+ +G+ G +ILRG NE+ IE
Sbjct: 412 ------------VKLTGWGTLRGARGKKEKFWIAANSWGKSWGENGYFRILRGVNESDIE 459
Query: 236 SLVNGA 241
L+ A
Sbjct: 460 KLIIAA 465
>gi|221505681|gb|EEE31326.1| cathepsin L, putative [Toxoplasma gondii VEG]
Length = 733
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 41/120 (34%), Positives = 59/120 (49%), Gaps = 22/120 (18%)
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y NGPV +FSY+SGVY +++ V +N P+ T +
Sbjct: 602 YNNGPVPVAFDAPPSLFSYRSGVYDANSNHARVC-----------DNDLPHHT-----GI 645
Query: 182 SASAEIVAYATVKLIGWGE---ENGRP--YWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
E +A V ++GWGE ENG+P YW + +T+G +G G +KI RG+N IES
Sbjct: 646 LTGWEYTNHA-VTIVGWGETDGENGKPQKYWIVRNTWGPNWGVDGYVKIARGKNLGGIES 704
>gi|156708112|gb|ABU93314.1| cathepsin B5 cysteine protease [Monocercomonoides sp. PA]
Length = 281
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 36/115 (31%), Positives = 51/115 (44%), Gaps = 35/115 (30%)
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y GP A +Y D SYKSGVY + +++ V +VGWG E+G PY
Sbjct: 193 YSRGPFEAAFSVYEDFKSYKSGVYH-HITGKMLGGHAVMVVGWGVEDGTPY--------- 242
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
W I +++G +G++G KILRG+NE IE+
Sbjct: 243 -------------------------WLIQNSWGTTWGEQGFFKILRGKNECGIET 272
>gi|237838179|ref|XP_002368387.1| cathepsin C [Toxoplasma gondii ME49]
gi|211966051|gb|EEB01247.1| cathepsin C [Toxoplasma gondii ME49]
gi|221484340|gb|EEE22636.1| cathepsin C, putative [Toxoplasma gondii GT1]
Length = 733
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 41/120 (34%), Positives = 59/120 (49%), Gaps = 22/120 (18%)
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y NGPV +FSY+SGVY +++ V +N P+ T +
Sbjct: 602 YNNGPVPVAFDAPPSLFSYRSGVYDANSNHARVC-----------DNDLPHHT-----GI 645
Query: 182 SASAEIVAYATVKLIGWGE---ENGRP--YWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
E +A V ++GWGE ENG+P YW + +T+G +G G +KI RG+N IES
Sbjct: 646 LTGWEYTNHA-VTIVGWGETDGENGKPQKYWIVRNTWGPNWGVDGYVKIARGKNLGGIES 704
>gi|70919569|gb|AAZ15654.1| cathepsin C1 [Toxoplasma gondii]
Length = 730
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 41/120 (34%), Positives = 59/120 (49%), Gaps = 22/120 (18%)
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y NGPV +FSY+SGVY +++ V +N P+ T +
Sbjct: 599 YNNGPVPVAFDAPPSLFSYRSGVYDANSNHARVC-----------DNDLPHHT-----GI 642
Query: 182 SASAEIVAYATVKLIGWGE---ENGRP--YWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
E +A V ++GWGE ENG+P YW + +T+G +G G +KI RG+N IES
Sbjct: 643 LTGWEYTNHA-VTIVGWGETDGENGKPQKYWIVRNTWGPNWGVDGYVKIARGKNLGGIES 701
>gi|256086900|ref|XP_002579622.1| cathepsin B (C01 family) [Schistosoma mansoni]
Length = 204
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 40/165 (24%), Positives = 67/165 (40%), Gaps = 37/165 (22%)
Query: 83 RYYWVN-DEVADIQQEIMKNGPVVANMY--LYSDIFSYKSGKYGNGPVVANMYLYSDIFS 139
R +WVN + I E V+ +Y+D + NGPV+A++ + D
Sbjct: 73 RDHWVNCSTIKQIHDECCCRADWVSEKIYNVYADQEDIQKEILMNGPVIASILVKVDFLV 132
Query: 140 YKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWG 199
YKSGVY + + + + ++I+GWG E P
Sbjct: 133 YKSGVYFPTPKSSNLGWINLRIIGWGYEGKTP---------------------------- 164
Query: 200 EENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
YW +++ +++G+ G +K+ RG IES V +PK
Sbjct: 165 ------YWLCANSWSKEWGENGYVKVRRGVQAGYIESYVRAPIPK 203
>gi|146163742|ref|XP_001012227.2| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|146145940|gb|EAR91982.2| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 581
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 25/88 (28%), Positives = 47/88 (53%)
Query: 193 VKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDNYGVEFG 252
+ ++GWG ENG YW + +++G +G+KG +++RG N IES A+PKD + +
Sbjct: 229 ISVVGWGVENGTKYWIVRNSWGSYWGEKGYFRLVRGINSLNIESDCAWAVPKDTWTNDVR 288
Query: 253 EESGERLSEEFGVRAESSEEFRENGEEE 280
+ + + R +EN +++
Sbjct: 289 NTTASNTNSQSNFRQLHDCVRQENNQKD 316
>gi|156708116|gb|ABU93316.1| cathepsin B7 cysteine protease, partial [Monocercomonoides sp. PA]
Length = 273
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 52/244 (21%), Positives = 85/244 (34%), Gaps = 91/244 (37%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G + W W+ K+G+ T C P Y + P C
Sbjct: 119 CNGGYADRVWNWIQKKGITT-------EQCIP----------YVSGSGRV-------PTC 154
Query: 62 HTRCTN-DNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
++C N N R F W + + E+ NGPV A ++ D ++Y+SG
Sbjct: 155 PSKCKNGSNIVRSFVSS--------WGSFNSKTVMDEVANNGPVYACFEVFEDFYNYRSG 206
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
++ +K+G + V ++GWG ENG P
Sbjct: 207 ----------------VYQHKTG--------RSQGWHHVMLMGWGTENGVP--------- 233
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
YW + +++G +G+KG +I RG N+ I+ +
Sbjct: 234 -------------------------YWLLQNSWGSGWGEKGFFRIRRGTNDCHIDEIFYS 268
Query: 241 ALPK 244
LPK
Sbjct: 269 GLPK 272
>gi|268563232|ref|XP_002638788.1| Hypothetical protein CBG05143 [Caenorhabditis briggsae]
Length = 426
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 61/224 (27%), Positives = 100/224 (44%), Gaps = 48/224 (21%)
Query: 13 WVHKRGLVTGGAHHSNTGCQPVSFP-----PCNHANYTTSEPECKTLATPQPKCHTRCTN 67
WV++ GLVTGG GC+P SF PC+ A + +E E +T C RC N
Sbjct: 223 WVNQ-GLVTGG----RDGCRPYSFDLSCGVPCSPATFFEAE-EKRT-------CMRRCQN 269
Query: 68 DNYGRGFFQDKYRFKRY----YWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYG 123
Y + + +DK+ F + Y + V+ +E +K ++ + F+ K+ +
Sbjct: 270 IYYQQKYEEDKH-FATFAYSLYPRSMTVSPDGKERVKVPTIIGH-------FNDKNTEKL 321
Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
N N+ + +I Y A E + Y++ + R + +
Sbjct: 322 NVTEYRNV-IKKEILLYGPTTMAFPVPEEFLHYSS---------------GVFRPFPLDG 365
Query: 184 -SAEIVAYATVKLIGWGE-ENGRPYWTIVSTFGEQFGDKGTIKI 225
IV + V+LIGWGE ++G+ YW V++FG +GD G KI
Sbjct: 366 FDDRIVYWHVVRLIGWGESDDGQHYWLAVNSFGNHWGDNGIFKI 409
>gi|353228747|emb|CCD74918.1| cathepsin B (C01 family) [Schistosoma mansoni]
Length = 229
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 33/135 (24%), Positives = 57/135 (42%), Gaps = 34/135 (25%)
Query: 110 LYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENG 169
+Y+D + NGPV+A++ + D YKSGVY + + + + ++I+GWG E
Sbjct: 128 VYADQEDIQKEILMNGPVIASILVKVDFLVYKSGVYFPTPKSSNLGWINLRIIGWGYEGK 187
Query: 170 RPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGR 229
P YW +++ +++G+ G +K+ RG
Sbjct: 188 TP----------------------------------YWLCANSWSKEWGENGYVKVRRGV 213
Query: 230 NEAIIESLVNGALPK 244
IES V +PK
Sbjct: 214 QAGYIESYVRAPIPK 228
>gi|300121514|emb|CBK22033.2| unnamed protein product [Blastocystis hominis]
Length = 476
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 32/123 (26%), Positives = 54/123 (43%), Gaps = 35/123 (28%)
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y +GPV ++ + D+ YK G+Y I G G +
Sbjct: 101 YAHGPVTCSIDVPDDLLEYKGGIYEDKTG----------IAGDGHD-------------- 136
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+ ++GWGEENG PYW + +++G +G++G +I+RG+N IE
Sbjct: 137 -----------ISVVGWGEENGIPYWIVRNSWGTYWGEEGFFRIVRGKNNLGIEEGCTYG 185
Query: 242 LPK 244
+P+
Sbjct: 186 IPR 188
Score = 37.7 bits (86), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 18/53 (33%), Positives = 30/53 (56%), Gaps = 2/53 (3%)
Query: 193 VKLIGWG--EENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
V++ GWG EE PYW + +++G +G+ G +I G+N IE + +P
Sbjct: 420 VEVTGWGVDEETRTPYWIVRNSWGTYWGENGWFRIAMGQNLLNIEQMCTWGVP 472
>gi|388499754|gb|AFK37943.1| unknown [Lotus japonicus]
Length = 209
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 52/197 (26%), Positives = 70/197 (35%), Gaps = 60/197 (30%)
Query: 47 SEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVA 106
S P C+ A PKC +C N + + + K+ Y V + DI E+ KNGPV
Sbjct: 53 SHPGCEP-AYQTPKCVRKCVKGN--QIWKKSKHFSVNAYSVKSDPYDIMAEVYKNGPVEV 109
Query: 107 NMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGE 166
+Y D YKSG +Y I + G +A VK++GWG
Sbjct: 110 AFTVYEDFAHYKSG------------VYKHITGSQLGGHA------------VKLIGWGT 145
Query: 167 ENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKIL 226
+ G YW I + + +GD G I
Sbjct: 146 ---------------------------------TDEGEDYWLIANQWNRSWGDDGYFMIR 172
Query: 227 RGRNEAIIESLVNGALP 243
RG NE IE V LP
Sbjct: 173 RGTNECGIEEDVTAGLP 189
>gi|123377855|ref|XP_001298125.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
[Trichomonas vaginalis G3]
gi|121878571|gb|EAX85195.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
[Trichomonas vaginalis G3]
Length = 135
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 46/191 (24%), Positives = 76/191 (39%), Gaps = 66/191 (34%)
Query: 57 PQPKCHT-----RCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLY 111
P CH C +N + ++ ++ ++++ DE I+ EI++NGPV A
Sbjct: 5 PNTTCHPFELNWTCVQNNCKK--YKTQHNSHKFFYGEDE---IKNEILQNGPVTA----- 54
Query: 112 SDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRP 171
+F + D+ YKSGVY S E ++
Sbjct: 55 --VFDVRP----------------DLAYYKSGVYQSVLSEEESSFQ-------------- 82
Query: 172 YWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 231
V + GWG+E P+W I++++G +G G++K LRG N
Sbjct: 83 -------------------HAVVIYGWGKEKETPFWWILNSYGPNWGINGSMKFLRGSNH 123
Query: 232 AIIESLVNGAL 242
IE+ V+ AL
Sbjct: 124 CNIETHVSSAL 134
>gi|145517168|ref|XP_001444467.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124411889|emb|CAK77070.1| unnamed protein product [Paramecium tetraurelia]
Length = 339
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 35/66 (53%), Gaps = 1/66 (1%)
Query: 125 GPVVANMYLYSDIFSYKSGVYAV-SASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
GPVVA M +Y D Y+ GVY V + +KI+GWGE+NG YW I + S
Sbjct: 255 GPVVAIMQVYKDFLVYRDGVYQVLEGTPRFHGGHAIKIIGWGEQNGYQYWIIENTWGTSW 314
Query: 184 SAEIVA 189
E +A
Sbjct: 315 GTEGLA 320
>gi|312082955|ref|XP_003143660.1| hypothetical protein LOAG_08080 [Loa loa]
gi|307761175|gb|EFO20409.1| hypothetical protein LOAG_08080 [Loa loa]
Length = 339
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 43/150 (28%), Positives = 67/150 (44%), Gaps = 10/150 (6%)
Query: 103 PVVANMYLYSDIFSYKSGKYGNGPVVA----NMYLYSDIFSYKSGVYAVSASAEIVAYAT 158
P ++ +I + K+ NG N +Y SY+ +EI+
Sbjct: 171 PYISGTTRKPEICYMQKSKHANGRQCPSGHPNSRVYRTTPSYRVSSREQDIMSEILTNGP 230
Query: 159 VKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEE--NGRP--YWTIVSTFG 214
V+ +G + V + + EI Y +V+L+GWGE+ G P YW +++G
Sbjct: 231 VQATF--RVHGDFFIAGVYKHLPTVGEEIEGYHSVRLLGWGEDYSTGIPVKYWIAANSWG 288
Query: 215 EQFGDKGTIKILRGRNEAIIESLVNGALPK 244
+G+ GT +ILRG N IES V GA K
Sbjct: 289 TNWGENGTFRILRGENHCEIESFVIGAWGK 318
>gi|197304333|dbj|BAG69285.1| cathepsin B-like cysteine protease [Raphanus sativus]
Length = 343
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 62/244 (25%), Positives = 83/244 (34%), Gaps = 77/244 (31%)
Query: 2 CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
C G S W + G+VT + TGC S P C A Y T P
Sbjct: 172 CDGGYPISAWQYFSYSGVVTEECDPYFDQTGC---SHPGCEPA-YNT------------P 215
Query: 60 KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
+C +C N + + + K+ Y V DI EI
Sbjct: 216 QCLRKCVGRN--QLWSESKHYSINTYVVESNPQDIMAEI--------------------- 252
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
Y NGPV + +Y D YKSGVY + I +A VK++GWG
Sbjct: 253 --YKNGPVEVSFTVYEDFAHYKSGVYKHITGSNIGGHA-VKLIGWGT------------- 296
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
++G YW + + + +GD G I RG NE IE
Sbjct: 297 --------------------TDDGEDYWLLANQWNRSWGDDGYFMIRRGTNECGIEDEPV 336
Query: 240 GALP 243
LP
Sbjct: 337 AGLP 340
>gi|380805035|gb|AFE74393.1| tubulointerstitial nephritis antigen-like isoform 3, partial
[Macaca mulatta]
Length = 129
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 46/168 (27%), Positives = 69/168 (41%), Gaps = 48/168 (28%)
Query: 64 RCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYG 123
RC N + D Y+ Y + +I +E+M+NGPV A M ++ D F YK G Y
Sbjct: 9 RCPNSHVNN---NDIYQVTPVYRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIYS 65
Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
+ PV L + G + +VKI GWGEE
Sbjct: 66 HTPVS----LGRPERYRRHGTH------------SVKITGWGEET--------------- 94
Query: 184 SAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNE 231
+ T+K YWT +++G +G++G +I+RG NE
Sbjct: 95 ---LPDGRTLK-----------YWTAANSWGPAWGERGHFRIVRGVNE 128
>gi|321478457|gb|EFX89414.1| hypothetical protein DAPPUDRAFT_303204 [Daphnia pulex]
Length = 442
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 33/122 (27%), Positives = 54/122 (44%), Gaps = 36/122 (29%)
Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
+GPV A M ++ D F Y+ GVY Y+ +
Sbjct: 345 HGPVQATMRVHPDFFLYRGGVYR--------------------------------YSGTN 372
Query: 184 SAEIVAYATVKLIGWG----EENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
S + Y +V+++GWG + N YW + +++G +G+ G +I+RG NE+ IE V
Sbjct: 373 SQQRSGYHSVRIVGWGVDSSKRNPTKYWLVANSWGRLWGEDGYFRIVRGENESDIEKFVL 432
Query: 240 GA 241
A
Sbjct: 433 AA 434
>gi|6449324|gb|AAF08932.1|AF195117_1 tubulointerstitial nephritis antigen isoform TIN2 [Homo sapiens]
Length = 333
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 40/157 (25%), Positives = 60/157 (38%), Gaps = 45/157 (28%)
Query: 85 YWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGV 144
Y V+ +I +EIM+NGPV A M + D F YK+G I+ + +
Sbjct: 212 YRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTG----------------IYRHVTST 255
Query: 145 YAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGR 204
S + VK+ GWG G +
Sbjct: 256 NKESEKYRKLQTHAVKLTGWGTRRG-----------------------------AQGQKE 286
Query: 205 PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
+W + +G+ +G+ G +ILRG NE+ IE LV A
Sbjct: 287 KFWIAANFWGKSWGENGYFRILRGVNESDIEKLVIAA 323
>gi|290975216|ref|XP_002670339.1| cathepsin B-like cysteine proteinase [Naegleria gruberi]
gi|284083897|gb|EFC37595.1| cathepsin B-like cysteine proteinase [Naegleria gruberi]
Length = 350
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 39/120 (32%), Positives = 49/120 (40%), Gaps = 33/120 (27%)
Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
NGP+ M +Y D +SYKSGVY S V VKIVGW
Sbjct: 262 NGPIQVAMGVYRDFYSYKSGVYH-HVSGRYVGGHAVKIVGW------------------- 301
Query: 184 SAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
G+ + PYW +++GE +G KG ILRGR E I +V P
Sbjct: 302 -------------GYDSASKLPYWICANSWGEDWGIKGYFWILRGRGECGIGKMVWSGKP 348
>gi|354483193|ref|XP_003503779.1| PREDICTED: tubulointerstitial nephritis antigen-like [Cricetulus
griseus]
Length = 475
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 59/246 (23%), Positives = 97/246 (39%), Gaps = 69/246 (28%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+SG W ++ KRGLV+ +P N T + + + + K
Sbjct: 283 CNSGSIDRAWWFLRKRGLVSHAC-----------YPLFKDQNTTNNICAMASRSDGRGKR 331
Query: 62 H-TRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
H T+ +++ + Y+ Y V+ +I +EI++NGPV A M ++ D F YK+G
Sbjct: 332 HATKPCPNSFEKS--NRIYQCSPPYRVSSNETEIMREIIRNGPVQAIMQVHEDFFYYKTG 389
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
Y + +S + E Y +R +A
Sbjct: 390 IY---------------------RHVISTNEESEKYRK-----------------LRSHA 411
Query: 181 VSASAEIVAYATVKLIGWGEENG-----RPYWTIVSTFGEQFGDKGTIKILRGRNEAIIE 235
VKL GWG G +W +++G+ +G+ G +ILRG NE+ IE
Sbjct: 412 ------------VKLTGWGTLRGAGGKKEKFWIAANSWGKSWGENGYFRILRGVNESDIE 459
Query: 236 SLVNGA 241
L+ A
Sbjct: 460 KLIIAA 465
>gi|123478051|ref|XP_001322190.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
[Trichomonas vaginalis G3]
gi|121905031|gb|EAY09967.1| Clan CA, family C1, cathepsin B-like cysteine peptidase
[Trichomonas vaginalis G3]
Length = 288
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 35/128 (27%), Positives = 51/128 (39%), Gaps = 35/128 (27%)
Query: 111 YSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGR 170
Y+ I + G GPV ++ +YSD+ YKSG+Y
Sbjct: 190 YASIEEMQIGIMTEGPVTTSLKVYSDLMYYKSGIYT------------------------ 225
Query: 171 PYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN 230
E + + V++IGWG +NG YW I +++ +G G I RG N
Sbjct: 226 -----------HTKGEFLGHHAVEIIGWGTKNGIDYWIISNSWNTTWGMNGLFLIKRGVN 274
Query: 231 EAIIESLV 238
E IE V
Sbjct: 275 ECHIEDYV 282
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 45/175 (25%), Positives = 69/175 (39%), Gaps = 54/175 (30%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C GI + W ++ RGL CQP N T + C
Sbjct: 132 CGGGIEVNAWRYIDLRGLPL-------DSCQPYD------GNIT------------KYNC 166
Query: 62 HTRCTNDNYG-RGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
+CTN++ F + + RY + ++Q IM GPV ++ +YSD+ YKSG
Sbjct: 167 SKKCTNESETYEAQFTEYWSVARY----ASIEEMQIGIMTEGPVTTSLKVYSDLMYYKSG 222
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTI 175
I+++ G E + + V+I+GWG +NG YW I
Sbjct: 223 ----------------IYTHTKG--------EFLGHHAVEIIGWGTKNGIDYWII 253
>gi|6562772|emb|CAB62590.1| putative cathepsin B-like protease [Pisum sativum]
Length = 174
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 63/247 (25%), Positives = 88/247 (35%), Gaps = 79/247 (31%)
Query: 2 CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
C G S W + G+VT + GC S P C EP +T P
Sbjct: 1 CDGGYPISAWKYFAHHGVVTEECDPYFDQIGC---SHPGC--------EPGYQT-----P 44
Query: 60 KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
KC +C N + + + K+ + Y VN + +I +E+ KNGPV +Y D YKS
Sbjct: 45 KCVRKCVKGN--QVWKKSKHYSVKPYKVNSDPQNIMEEVYKNGPVEVAFSVYEDFAHYKS 102
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
G Y + ++ SA + VK+ GWG
Sbjct: 103 GVYKH----------------------ITGSA--LGGHAVKLNGWGT------------- 125
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVN 239
+ G YW + + + +GD G KI RG NE IE V
Sbjct: 126 --------------------SDEGEDYWLLANQWNTNWGDDGYFKIKRGTNECGIEEDVT 165
Query: 240 GA--LPK 244
LPK
Sbjct: 166 AVCLLPK 172
>gi|294916338|ref|XP_002778359.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239886683|gb|EER10154.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 105
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 56/121 (46%), Gaps = 37/121 (30%)
Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
+GPV A+ +Y D +Y+SGVY ++ + + +A
Sbjct: 21 DGPVSASFTVYEDFLAYRSGVYKHTSGSYLGGHA-------------------------- 54
Query: 184 SAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
VK+IGWGE++G+ YW V+++ E +GD G KI G N I + L+ G P
Sbjct: 55 ---------VKIIGWGEKSGQAYWLAVNSWNEDWGDHGLFKIALG-NCGIDDDLLGGT-P 103
Query: 244 K 244
K
Sbjct: 104 K 104
>gi|300121248|emb|CBK21629.2| unnamed protein product [Blastocystis hominis]
Length = 559
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/123 (27%), Positives = 53/123 (43%), Gaps = 35/123 (28%)
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y GP+ + + D YK G+Y + A + +V+A+
Sbjct: 190 YARGPITCGIAVPQDFVDYKGGIYKDESGA-----------------------VEKVHAI 226
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
S ++GWGEENG YW +++G +G++G +I RG N IES A
Sbjct: 227 S------------VVGWGEENGEKYWIGRNSWGNYWGEEGWFRIARGINNLAIESECQWA 274
Query: 242 LPK 244
+PK
Sbjct: 275 VPK 277
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 26/60 (43%), Positives = 34/60 (56%), Gaps = 1/60 (1%)
Query: 114 IFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYW 173
I + K+ + GPV +M + Y GVY S S+ +VA V+I GWG ENGRPYW
Sbjct: 462 IDAIKAEIFARGPVSCSMTVRESFLDYHGGVYE-SDSSPMVAGHIVEIAGWGVENGRPYW 520
>gi|294900111|ref|XP_002776905.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
gi|239884106|gb|EER08721.1| cathepsin b, putative [Perkinsus marinus ATCC 50983]
Length = 207
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 36/66 (54%)
Query: 57 PQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFS 116
P C T CTN Y +D +R K + V ++V +I+QEI +GPV + +Y D
Sbjct: 94 PLSSCQTTCTNKAYKTSLEKDVHRAKDWRKVPNDVQNIKQEIFDDGPVCSAFKMYEDFRY 153
Query: 117 YKSGKY 122
YKSG Y
Sbjct: 154 YKSGVY 159
>gi|395734831|ref|XP_003776483.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin B-like [Pongo abelii]
Length = 350
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 59/243 (24%), Positives = 86/243 (35%), Gaps = 66/243 (27%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPV-SFPPCNHANYTTSEPECKTLATPQPK 60
C+ G + W + +GLV+GG + S+ GC+ S PC H + P T PK
Sbjct: 164 CNGGXPNEGWNFWTGKGLVSGGLYDSHVGCRLFPSLLPCKH--HIHGXPYVXT--GDSPK 219
Query: 61 CHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSG 120
C C G+ + DK+ Y ++D DI I KN V +Y D YK
Sbjct: 220 CSMTCEP---GQTYKXDKHYGCSSYSISDSTKDIMTNIYKNDXVEEAFSVYLDFLMYKFK 276
Query: 121 KYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYA 180
+Y GV + E+ + I+G EN Y
Sbjct: 277 EY-------------------QGV-----TGEMXGGHAICILGCKVENSTSY-------- 304
Query: 181 VSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
W + + + +GD G KILRG++ IES V
Sbjct: 305 --------------------------WLVANXWNRDWGDNGFFKILRGQDHYGIESEVVA 338
Query: 241 ALP 243
+P
Sbjct: 339 EIP 341
>gi|114153242|gb|ABI52787.1| cathepsin B-like protein [Argas monolakensis]
Length = 91
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 33/53 (62%)
Query: 192 TVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPK 244
+++IGWG E PYW + +++ ++GD G KILRG NE IE + +PK
Sbjct: 38 AIRIIGWGVEEDVPYWLVANSWNREWGDNGYFKILRGSNECGIEDDIVAGIPK 90
>gi|21695|emb|CAA46812.1| cathepsin B [Triticum aestivum]
Length = 310
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 48/178 (26%), Positives = 71/178 (39%), Gaps = 43/178 (24%)
Query: 2 CSSGISSSTWVWVHKRGLVT--GGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
C+ G S W + + G+VT + TGCQ P C+ A P P
Sbjct: 165 CNGGYPISAWRYFRRSGVVTEECDPYFDQTGCQ---------------HPGCEP-AYPTP 208
Query: 60 KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
KC +C +N + + ++K+ Y V+ DI E+ KNGPV + Y I
Sbjct: 209 KCQRKCKVEN--QAWKENKHFSVNAYRVHSNPHDIMAEVYKNGPVEV-AFTYCQIL---- 261
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEEN-GRPYWTIV 176
D YKSGVY + ++ VK++GWG + G YW +
Sbjct: 262 ----------------DFAHYKSGVYK-HITGGVMGGHAVKLIGWGTSDAGEDYWLLA 302
>gi|161343837|tpg|DAA06099.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 255
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 51/106 (48%), Gaps = 10/106 (9%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G W KRGLVTGG + S GC+P PPC + +E P+
Sbjct: 157 CNGGYPIKAWERFKKRGLVTGGDYQSGEGCEPYRVPPCPY----DAEGHNTCAGKPRESN 212
Query: 62 HTRCTNDNYGRG--FFQDKYRFKR-YYWVNDEVADIQQEIMKNGPV 104
H RCT YG F + +R+ R +Y++ IQ+++M GP+
Sbjct: 213 H-RCTRMCYGNXDLDFDEDHRYTRDFYYLT--YGSIQKDVMTYGPI 255
>gi|341891034|gb|EGT46969.1| hypothetical protein CAEBREN_30419 [Caenorhabditis brenneri]
Length = 422
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 61/224 (27%), Positives = 99/224 (44%), Gaps = 48/224 (21%)
Query: 13 WVHKRGLVTGGAHHSNTGCQPVSFP-----PCNHANYTTSEPECKTLATPQPKCHTRCTN 67
WV++ GLVTGG GC+P SF PC+ A + +E E +T C RC N
Sbjct: 219 WVNQ-GLVTGG----RDGCRPYSFDLSCGVPCSPATFFEAE-EKRT-------CMRRCQN 265
Query: 68 DNYGRGFFQDKYRFKRY----YWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYG 123
Y + + +DK+ F + Y + V+ +E +K ++ + F+ K+ +
Sbjct: 266 IYYQQRYEEDKH-FATFAYSLYPRSMTVSPDGKERVKVPTIIGH-------FNDKNTEKL 317
Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
N N+ + +I Y A E + Y++ + R + +
Sbjct: 318 NVTEYRNV-IKKEILLYGPTTMAFPVPEEFLHYSS---------------GVFRPFPLDG 361
Query: 184 -SAEIVAYATVKLIGWGE-ENGRPYWTIVSTFGEQFGDKGTIKI 225
IV + V+LIGWG+ E+G YW V++FG +GD G KI
Sbjct: 362 FDDRIVYWHVVRLIGWGQSEDGTHYWLAVNSFGSHWGDNGLFKI 405
>gi|161343845|tpg|DAA06103.1| TPA_inf: cathepsin B [Acyrthosiphon pisum]
Length = 261
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 51/111 (45%), Gaps = 12/111 (10%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G W KRGLVTGG + S GC+P PPC + +E P+
Sbjct: 157 CNGGYPIKAWERFKKRGLVTGGDYQSGEGCEPYRVPPCPY----DAEGHNTCAGKPRESN 212
Query: 62 HTRCTNDNYGRG--FFQDKYRFKR--YYWVNDEVADIQQEIMKNGPVVANM 108
H RCT YG F + +R+ R YY IQ+++M GP+ A+
Sbjct: 213 H-RCTRMCYGNQDLDFDEDHRYTRDSYYLT---YGSIQKDVMTYGPIEASF 259
>gi|401401997|ref|XP_003881145.1| hypothetical protein NCLIV_041870 [Neospora caninum Liverpool]
gi|325115557|emb|CBZ51112.1| hypothetical protein NCLIV_041870 [Neospora caninum Liverpool]
Length = 736
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 40/120 (33%), Positives = 56/120 (46%), Gaps = 22/120 (18%)
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y NGPV +FSY SG+Y ++S V +N P+ + V
Sbjct: 603 YKNGPVPVAFDAPPSLFSYSSGIYDANSSHARVC-----------DNDSPHCS-----GV 646
Query: 182 SASAEIVAYATVKLIGWGEENG-----RPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
E +A V L+GWGE N R YW + +T+G +G +G +KI RG+N IES
Sbjct: 647 LTGWEYTNHA-VTLVGWGETNAENEKPRKYWIVRNTWGPNWGVQGYLKIARGKNLGGIES 705
>gi|300176576|emb|CBK24241.2| unnamed protein product [Blastocystis hominis]
Length = 563
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 31/122 (25%), Positives = 56/122 (45%), Gaps = 35/122 (28%)
Query: 122 YGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAV 181
Y NGP+ + + +D+ +Y++G+++ + S+ + +
Sbjct: 164 YYNGPITCKISVTNDLQNYRNGIFSRNTSSSLYDHY------------------------ 199
Query: 182 SASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGA 241
V +IGWG EN PYW + +++G +G+ G +ILRG N IES + A
Sbjct: 200 -----------VNIIGWGSENETPYWIVRNSWGSSWGEDGYFRILRGVNLLGIESSCSYA 248
Query: 242 LP 243
+P
Sbjct: 249 VP 250
Score = 40.0 bits (92), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 20/52 (38%), Positives = 32/52 (61%), Gaps = 1/52 (1%)
Query: 193 VKLIGWGE-ENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
V+++GWG E G YW + +GE +G+KG +I+ G N +IES + +P
Sbjct: 509 VEVVGWGRTEEGVEYWIGRNNWGENWGEKGWFRIMMGGNNLLIESSCSWGVP 560
>gi|48762499|dbj|BAD23819.1| cathepsin B-N [Tuberaphis styraci]
gi|48762501|dbj|BAD23820.1| cathepsi B-N [Tuberaphis coreana]
Length = 105
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 38/112 (33%), Positives = 51/112 (45%), Gaps = 10/112 (8%)
Query: 5 GISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKCHTR 64
G W K GLVTGG + S GCQP PPC Y + C+ P K H R
Sbjct: 1 GYPIRAWERFKKHGLVTGGNYDSGEGCQPYRVPPCPLDEYGNN--TCR--GKPAEKNH-R 55
Query: 65 CTNDNYGR---GFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSD 113
CT YG F +D + + Y++ IQ +I+ GP+ A+ +Y D
Sbjct: 56 CTRMCYGNQDLDFKEDHHYTRDAYYLT--YGTIQNDILAYGPIEASFEVYDD 105
>gi|156375635|ref|XP_001630185.1| predicted protein [Nematostella vectensis]
gi|156217201|gb|EDO38122.1| predicted protein [Nematostella vectensis]
Length = 311
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 30/120 (25%), Positives = 52/120 (43%), Gaps = 35/120 (29%)
Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
NGPV A+ ++ D ++Y+SG+Y + ++ +A
Sbjct: 225 NGPVEADFTIFQDFYAYRSGIYVHATGKQLGGHA-------------------------- 258
Query: 184 SAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
+K++GWG E+ YW +++G +G +G KI RG +E IE + LP
Sbjct: 259 ---------IKILGWGTEDNVDYWLCANSWGANWGIQGYFKIRRGTDECGIEDGLAAGLP 309
>gi|308485822|ref|XP_003105109.1| hypothetical protein CRE_20700 [Caenorhabditis remanei]
gi|308257054|gb|EFP01007.1| hypothetical protein CRE_20700 [Caenorhabditis remanei]
Length = 410
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 58/224 (25%), Positives = 97/224 (43%), Gaps = 48/224 (21%)
Query: 13 WVHKRGLVTGGAHHSNTGCQPVSFP-----PCNHANYTTSEPECKTLATPQPKCHTRCTN 67
WV++ GLVTGG GC+P SF PC+ A + +E + C RC N
Sbjct: 207 WVNQ-GLVTGG----RDGCRPYSFDLSCGVPCSPATFFEAEE--------KRTCMRRCQN 253
Query: 68 DNYGRGFFQDKYRFKRY----YWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYG 123
Y + + +DK+ F + Y + V+ +E +K ++ + F+ K+ +
Sbjct: 254 IYYQQKYEEDKH-FATFAYSMYPRSMTVSPDGKERVKVPTIIGH-------FNDKNTEKL 305
Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
N N+ + +I Y A E + Y++ + R + +
Sbjct: 306 NVTEYRNV-IKKEILLYGPTTMAFPVPEEFLHYSS---------------GVFRPFPLDG 349
Query: 184 -SAEIVAYATVKLIGWGEE-NGRPYWTIVSTFGEQFGDKGTIKI 225
IV + V+LIGWGE +G+ YW +++FG +GD G KI
Sbjct: 350 FDDRIVYWHVVRLIGWGESGDGQHYWLAINSFGNHWGDNGLFKI 393
>gi|300121294|emb|CBK21674.2| unnamed protein product [Blastocystis hominis]
Length = 561
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 44/80 (55%), Gaps = 15/80 (18%)
Query: 180 AVSASAEIVAYA---------------TVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIK 224
A+ A+ E+VAY + ++GWGEE+G+ YW + +++G +G+ G +
Sbjct: 197 ALDATDELVAYKGGIFEDKTGTTSLNHAISVVGWGEEDGKKYWIVRNSWGTYWGENGWFR 256
Query: 225 ILRGRNEAIIESLVNGALPK 244
I+RG N IES A+P+
Sbjct: 257 IVRGTNNLGIESECTWAVPR 276
>gi|242001446|ref|XP_002435366.1| cysteine proteinase, putative [Ixodes scapularis]
gi|215498696|gb|EEC08190.1| cysteine proteinase, putative [Ixodes scapularis]
Length = 238
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 53/197 (26%), Positives = 75/197 (38%), Gaps = 61/197 (30%)
Query: 45 TTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPV 104
TT + + T P C T + Y F YR N+E DI QEI NGPV
Sbjct: 69 TTCRIARRRVPTEDPICPTGRQDQKY---FSTPPYRVP----ANEE--DIMQEIYANGPV 119
Query: 105 VANMYLYSDIFSYKSGKYGNGPVVANM---YLYSDIFSYKSGVYAVSASAEIVAYATVKI 161
A M + D F Y SG Y + + N+ Y SD + +V+I
Sbjct: 120 QALMLVKEDFFLYSSGVYKHTRLAHNLPPEYQKSD-------------------WHSVRI 160
Query: 162 VGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKG 221
+GWG + Y K YW +++G +G+ G
Sbjct: 161 LGWG-------------------VDRTQYRPQK-----------YWLCANSWGSGWGENG 190
Query: 222 TIKILRGRNEAIIESLV 238
+I+RG +E+ IES V
Sbjct: 191 YFRIVRGEDESQIESFV 207
>gi|449670327|ref|XP_002160467.2| PREDICTED: dipeptidyl peptidase 1-like [Hydra magnipapillata]
Length = 458
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 23/47 (48%), Positives = 36/47 (76%)
Query: 197 GWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
G+GEE+G+ YW + +++GE++G+KG +I RG +E IESLV A+P
Sbjct: 405 GYGEEDGQKYWIVKNSWGEEWGEKGYFRIRRGTDEIAIESLVVYAVP 451
>gi|291000228|ref|XP_002682681.1| predicted protein [Naegleria gruberi]
gi|284096309|gb|EFC49937.1| predicted protein [Naegleria gruberi]
Length = 225
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 59/239 (24%), Positives = 84/239 (35%), Gaps = 88/239 (36%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGA--HHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
C GI + W+++ G+VT + S G P CN TS P
Sbjct: 73 CDGGILWAAWIYLKHTGIVTDQCLPYSSGNGVAPSCPKYCN----GTSTP---------- 118
Query: 60 KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKS 119
KY+ K +Y V I EI NGPV + +Y D SYKS
Sbjct: 119 --------------IDSVKYKAKDWYEVGSIAEKIMNEIATNGPVQSGFSVYQDFMSYKS 164
Query: 120 GKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVY 179
G ++++++G + + +KIVGWG EN
Sbjct: 165 G----------------VYTHQTGSF--------LGGHAIKIVGWGVEN----------- 189
Query: 180 AVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
VK YW + +++G +G G KI RG NE IE+ V
Sbjct: 190 ------------NVK-----------YWLVANSWGPDWGLNGLFKIKRGDNECGIEADV 225
>gi|405963121|gb|EKC28721.1| Tubulointerstitial nephritis antigen-like protein [Crassostrea
gigas]
Length = 464
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 35/121 (28%), Positives = 55/121 (45%), Gaps = 29/121 (23%)
Query: 118 KSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVR 177
K+ Y NGPV A + SD F Y+SGVY + + + +V+I+GWGE+
Sbjct: 328 KAEIYRNGPVQATFRVSSDFFMYRSGVYRHTGADLGESRLSVRIIGWGEKTN-------- 379
Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
+ R YW ++++G ++G+KG +I+RG N IE
Sbjct: 380 ---------------------KKGKKRKYWICLNSWGTKWGEKGAFRIVRGENHLGIEEN 418
Query: 238 V 238
V
Sbjct: 419 V 419
>gi|145486176|ref|XP_001429095.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124396185|emb|CAK61697.1| unnamed protein product [Paramecium tetraurelia]
Length = 464
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 34/120 (28%), Positives = 54/120 (45%), Gaps = 29/120 (24%)
Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
NGPVV + D Y+SG+Y + AE Y+ W V
Sbjct: 334 NGPVVLSFEPSYDFMYYESGIY--HSKAETSDYSE--------------WEKVD------ 371
Query: 184 SAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
+V GWGEE G +W + +++G+Q+G+ G ++ RG +E+ IES+ + P
Sbjct: 372 -------HSVLCYGWGEEEGVKFWMLQNSWGDQWGESGNFRMKRGVDESAIESMAEASDP 424
>gi|118378294|ref|XP_001022323.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89304090|gb|EAS02078.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 497
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 44/142 (30%), Positives = 66/142 (46%), Gaps = 40/142 (28%)
Query: 97 EIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAY 156
EIMKNGP+VAN +D F Y YKSGVY +A+ +
Sbjct: 380 EIMKNGPIVANFKTSAD-FVY----------------------YKSGVYHSVEAADWILK 416
Query: 157 ATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWG--EENGRPYWTIVSTFG 214
V+ P W V +AV + + GWG EE+G+ +W + +++G
Sbjct: 417 CEVE----------PEWRPVE-HAVMCQHQ---QQFLNSYGWGESEEDGK-FWLMQNSWG 461
Query: 215 EQFGDKGTIKILRGRNEAIIES 236
+ +G+KG KI RG +E+ +ES
Sbjct: 462 DDWGEKGRFKIRRGTDESFVES 483
>gi|58617822|gb|AAW80530.1| cathepsin L-like cysteine protease [Leishmania infantum]
Length = 234
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 38/129 (29%), Positives = 64/129 (49%), Gaps = 14/129 (10%)
Query: 109 YLYSDIFSYKSGKY--GNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGE 166
++Y +F+ KS Y GNG V + + + Y + S E V A W
Sbjct: 66 HMYGIVFTEKSYPYTSGNGDVPECLNSSKLVPGAQIDGYVMIPSNETVMAA------WLA 119
Query: 167 ENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKIL 226
ENG AV AS+ + + V L+G+ + G PYW I +++GE +G+KG ++++
Sbjct: 120 ENGP------IAIAVDASSFMSYQSGVLLVGYNKTGGVPYWVIKNSWGEDWGEKGYVRVV 173
Query: 227 RGRNEAIIE 235
GRN +++
Sbjct: 174 MGRNACLLK 182
>gi|224285256|gb|ACN40354.1| unknown [Picea sitchensis]
Length = 350
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 55/245 (22%), Positives = 90/245 (36%), Gaps = 80/245 (32%)
Query: 2 CSSGISSSTWVWVHKRGLVTG--GAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQP 59
C+ G S W + +RG+VT + N GC N+ EP + P P
Sbjct: 163 CNGGFPLSAWRYFSRRGVVTDECDPYFDNDGC-----------NHPGCEP-----SYPTP 206
Query: 60 KCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMY-LYSDIFSYK 118
+C C ++ ++ ++Y AN Y + SD ++
Sbjct: 207 RCVKNCKDNQ--------RWSHSKHY-------------------SANAYRIKSDPYNIM 239
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
+ + NGPV + +Y D Y++GVY + +A VK++GWG
Sbjct: 240 AEVFNNGPVEVSFSVYEDFAHYETGVYKHVQGRYLGGHA-VKLIGWGT------------ 286
Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLV 238
++G YW I +++ +G+ G KI RG NE IE
Sbjct: 287 ---------------------TDDGIDYWLIANSWNTAWGEGGYFKIARGVNECGIERDP 325
Query: 239 NGALP 243
+P
Sbjct: 326 VAGMP 330
>gi|168026641|ref|XP_001765840.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683017|gb|EDQ69431.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 339
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 59/249 (23%), Positives = 87/249 (34%), Gaps = 82/249 (32%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGA--HHSNTGC-QPVSFPPCNHANYTTSEPECKTLATPQ 58
C G W + + G+VT + GC P +P Y T
Sbjct: 163 CDGGYPIRAWRYFKRTGVVTSKCDPYFDQIGCGHPGCYP-----TYRT------------ 205
Query: 59 PKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
PKC C +D + + K+ Y V+ E D+ E+
Sbjct: 206 PKCVKHCVDDEL---WVKSKHLSVNAYEVSKEPEDLMAEL-------------------- 242
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGE-ENGRPYWTIVR 177
Y NGP+ + ++ D YK+GVY I +A VK++GWG ++G YW
Sbjct: 243 ---YTNGPIEVSFEVFEDFAHYKTGVYKHVYGRYIGGHA-VKLIGWGTTDDGVDYW---- 294
Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
TIV+++ +G+ G +I RG NE IES
Sbjct: 295 ------------------------------TIVNSWNTNWGEHGLFRIARGGNECGIESY 324
Query: 238 VNGALPKDN 246
LP D
Sbjct: 325 AVAGLPFDK 333
>gi|255087666|ref|XP_002505756.1| cathepsin B-like cysteine proteinase [Micromonas sp. RCC299]
gi|226521026|gb|ACO67014.1| cathepsin B-like cysteine proteinase [Micromonas sp. RCC299]
Length = 273
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 41/136 (30%), Positives = 56/136 (41%), Gaps = 21/136 (15%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G S+ + + G+VTGG C P F PC+H C+ P P C
Sbjct: 87 CEGGESADAYEFAKSNGVVTGGGFDDQNTCAPYPFAPCHH--------PCEVF--PTPAC 136
Query: 62 HTRC---TNDNYGRGFFQDKYRFKRYYWVNDEVAD---IQQEIMKNGPVVANM-YLYSDI 114
C +ND G K FK V+ D + EI NGPV + +Y +
Sbjct: 137 PATCVGGSNDGVQNG----KASFKVKAIVDCPSFDYGCVANEIYHNGPVSSYAGDIYEEF 192
Query: 115 FSYKSGKYGNGPVVAN 130
++YKSG + P VA
Sbjct: 193 YAYKSGVFRESPSVAQ 208
>gi|328712827|ref|XP_003244913.1| PREDICTED: tubulointerstitial nephritis antigen-like isoform 2
[Acyrthosiphon pisum]
Length = 487
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 39/122 (31%), Positives = 55/122 (45%), Gaps = 36/122 (29%)
Query: 125 GPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSAS 184
G V A M + + F Y+SGVY S A G + G
Sbjct: 368 GSVQAMMKVSKEFFMYESGVYRCSNLA------------LGSKTG--------------- 400
Query: 185 AEIVAYATVKLIGWGEE--NGR--PYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNG 240
Y TV+++GWGEE NGR YW + +++G +G+ G +IL+G NE IE V
Sbjct: 401 -----YHTVRIVGWGEEQQNGRTVKYWIVSNSWGLWWGESGYFRILKGTNECQIEDFVVA 455
Query: 241 AL 242
A+
Sbjct: 456 AM 457
>gi|145500930|ref|XP_001436448.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124403587|emb|CAK69051.1| unnamed protein product [Paramecium tetraurelia]
Length = 339
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 24/52 (46%), Positives = 28/52 (53%), Gaps = 1/52 (1%)
Query: 125 GPVVANMYLYSDIFSYKSGVYAV-SASAEIVAYATVKIVGWGEENGRPYWTI 175
GP VA M +Y D YK G+Y V VKI+GWGE NG+ YW I
Sbjct: 255 GPAVAIMPVYKDFLIYKDGIYQVLDGQPHFHGGQAVKIIGWGEHNGQQYWII 306
>gi|294891623|ref|XP_002773656.1| hypothetical protein Pmar_PMAR011495 [Perkinsus marinus ATCC 50983]
gi|239878860|gb|EER05472.1| hypothetical protein Pmar_PMAR011495 [Perkinsus marinus ATCC 50983]
Length = 815
Score = 50.4 bits (119), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 22/66 (33%), Positives = 40/66 (60%)
Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
+Y +A + + V++IG+G E P+W +++++G+ +G+ G ++LRGRN IE L
Sbjct: 572 LYTTTAGSPEIGNHAVRIIGFGVEGNVPFWLLMNSWGDDWGEHGCFRMLRGRNLCGIEEL 631
Query: 238 VNGALP 243
G P
Sbjct: 632 PVGMDP 637
>gi|256082975|ref|XP_002577726.1| subfamily C1A unassigned peptidase (C01 family) [Schistosoma
mansoni]
Length = 1471
Score = 50.4 bits (119), Expect = 8e-04, Method: Composition-based stats.
Identities = 20/36 (55%), Positives = 30/36 (83%)
Query: 193 VKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRG 228
V ++G+GEENGR YW I +++GE++G+KG IKI +G
Sbjct: 319 VLVVGYGEENGRSYWLIKNSWGEEWGEKGYIKISKG 354
>gi|257215762|emb|CAX83033.1| Cysteine PRotease related protein [Schistosoma japonicum]
Length = 233
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 25/64 (39%), Positives = 31/64 (48%), Gaps = 1/64 (1%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W + KRG+VTGG+ ++TGCQP FP C H P C T P+C
Sbjct: 159 CKGGFPGQAWDYWVKRGIVTGGSEENHTGCQPYPFPKCEHLT-KGKYPACGTKIYKTPQC 217
Query: 62 HTRC 65
C
Sbjct: 218 KQTC 221
>gi|444707360|gb|ELW48642.1| Tubulointerstitial nephritis antigen-like protein [Tupaia
chinensis]
Length = 989
Score = 50.4 bits (119), Expect = 8e-04, Method: Composition-based stats.
Identities = 40/146 (27%), Positives = 60/146 (41%), Gaps = 39/146 (26%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C G W ++ +RG+V+ C P+S + E A P P+C
Sbjct: 828 CRGGHLDGAWWFLRRRGVVS-------NHCYPLS-------GHVQGE------AGPAPRC 867
Query: 62 --HTRCTNDNYGRGFFQ-------------DKYRFKRYYWVNDEVADIQQEIMKNGPVVA 106
H+R GRG Q D Y+ Y + +I +E+M+NGPV A
Sbjct: 868 MMHSRAV----GRGKRQATARCPSGHVHANDIYQVTPAYRLGSSEKEIMKELMENGPVQA 923
Query: 107 NMYLYSDIFSYKSGKYGNGPVVANMY 132
M ++ D F Y+ G Y + P AN +
Sbjct: 924 LMEVHEDFFLYRGGVYSHTPTAANSW 949
>gi|111054118|gb|ABH04250.1| cathepsin B precursor [Sus scrofa]
Length = 61
Score = 50.4 bits (119), Expect = 0.001, Method: Composition-based stats.
Identities = 20/53 (37%), Positives = 35/53 (66%)
Query: 184 SAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
+ +++ ++++GWG ENG PYW + +++ +GD G KILRG++ IES
Sbjct: 7 TGDLMGGHAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIES 59
>gi|290975817|ref|XP_002670638.1| predicted protein [Naegleria gruberi]
gi|284084199|gb|EFC37894.1| predicted protein [Naegleria gruberi]
Length = 528
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 45/165 (27%), Positives = 69/165 (41%), Gaps = 47/165 (28%)
Query: 79 YRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIF 138
YR+ Y+ V ++Q +++K GP+ +M +Y+D+F+Y SG Y + V++ L S
Sbjct: 408 YRYTGGYYGAVTVENMQLDVLKYGPLSVSMEVYNDLFNYHSGIYRH---VSSSKLTS--- 461
Query: 139 SYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGW 198
V E+ + V IVGWGE
Sbjct: 462 -------PVPNPFELTNHV-VLIVGWGE-------------------------------- 481
Query: 199 GEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
E G YW + +++G FG G I RG +E IES A+P
Sbjct: 482 -NEKGEKYWIVKNSWGTSFGMDGYFLIARGVDECAIESENASAIP 525
>gi|161343859|tpg|DAA06110.1| TPA_inf: cathepsin B [Myzus persicae]
Length = 260
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/110 (30%), Positives = 52/110 (47%), Gaps = 10/110 (9%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G W + GLVTGG + S GC+P PPC + + P+ K
Sbjct: 157 CNGGDPIKAWKYFSTHGLVTGGNYKSGEGCEPYRVPPCPRDDKGKN----TCAGKPREKN 212
Query: 62 HTRCTNDNYGRG--FFQDKYRFKR-YYWVNDEVADIQQEIMKNGPVVANM 108
H RCT YG +++ +R+ R +Y++ IQ+++M GP+ A
Sbjct: 213 H-RCTRMCYGNQDLDYREDHRYTRDFYYLT--YGSIQKDVMTYGPIEATF 259
>gi|182509202|ref|NP_001116812.1| tubulointerstitial nephritis antigen precursor [Bombyx mori]
gi|81303350|gb|ABB71105.1| TIN-ag-RP [Bombyx mori]
Length = 404
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/120 (24%), Positives = 51/120 (42%), Gaps = 32/120 (26%)
Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
+GP + M +Y D F Y+ G+Y + + +
Sbjct: 314 SGPALGIMTVYQDFFHYREGIYRHTRHGDQL----------------------------- 344
Query: 184 SAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
+ +V+++GWGE+ YW + +++G +G+KG +I RG + IES V LP
Sbjct: 345 ---MRGLHSVRIVGWGEDAEDKYWIVANSWGTSWGEKGYFRIARGHSGTGIESSVLTVLP 401
>gi|339235559|ref|XP_003379334.1| dipeptidyl-peptidase 1 [Trichinella spiralis]
gi|316978005|gb|EFV61034.1| dipeptidyl-peptidase 1 [Trichinella spiralis]
Length = 465
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 47/201 (23%), Positives = 73/201 (36%), Gaps = 52/201 (25%)
Query: 43 NYTTSEPECKTLATPQPKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNG 102
+Y C Q +C T T + Y + D YY ++E+ + Q ++KNG
Sbjct: 314 DYGMVSERCVAYTGKQQQCRTPSTCERY---YATDYEYIGGYYGASNEIL-MMQALVKNG 369
Query: 103 PVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIV 162
P+ ++ D SY G I+ Y S V + + + V IV
Sbjct: 370 PIAVGFEVHDDFLSYSHG----------------IYHYTSAVSPLKWNPFVEVNHAVIIV 413
Query: 163 GWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGT 222
G+G + E YW + +++G +FG+ G
Sbjct: 414 GYGTD--------------------------------EMTKEKYWIVKNSWGRKFGEDGY 441
Query: 223 IKILRGRNEAIIESLVNGALP 243
+I RG NE IESL A P
Sbjct: 442 FRIRRGTNECGIESLAFQATP 462
>gi|427783627|gb|JAA57265.1| hypothetical protein [Rhipicephalus pulchellus]
Length = 483
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 49/196 (25%), Positives = 79/196 (40%), Gaps = 53/196 (27%)
Query: 44 YTTSEPECKTLATPQPKCHTRC-TNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNG 102
++++ C+ P RC T + F YR N+E DI QEI NG
Sbjct: 297 HSSANATCRIPRRRDPIEDARCPTGRTEQKHFSTPPYRVP----ANEE--DIMQEIYANG 350
Query: 103 PVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIV 162
PV A + + D F Y+SG +Y ++ I Y+ S + +V+I+
Sbjct: 351 PVQALILVKEDFFLYRSG----------VYRHTRIAESLRPQYSRS------GWHSVRIL 394
Query: 163 GWGEENGRPYWTIVRVYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGT 222
GWG + + Y +K YW +++G +G+ G
Sbjct: 395 GWGVDRSQ-------------------YRPIK-----------YWLCANSWGHGWGENGY 424
Query: 223 IKILRGRNEAIIESLV 238
+I+RG +E+ IES V
Sbjct: 425 FRIVRGEDESQIESFV 440
>gi|66270083|gb|AAY43371.1| cathepsin-like cysteine protease [Phytophthora infestans]
Length = 635
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/69 (30%), Positives = 43/69 (62%), Gaps = 1/69 (1%)
Query: 178 VYAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESL 237
++ +A V +A + ++GWGEENG P+W + +++G +G+ G ++++RG N +E
Sbjct: 237 IFDDKTNATDVDHA-ISIVGWGEENGVPFWVLRNSWGSFWGESGWMRLVRGVNNVGVEGE 295
Query: 238 VNGALPKDN 246
+P+D+
Sbjct: 296 CAFGVPRDD 304
>gi|157058737|gb|ABV03126.1| cathepsin B-16 [Myzus persicae]
Length = 238
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/110 (30%), Positives = 52/110 (47%), Gaps = 10/110 (9%)
Query: 2 CSSGISSSTWVWVHKRGLVTGGAHHSNTGCQPVSFPPCNHANYTTSEPECKTLATPQPKC 61
C+ G W + GLVTGG + S GC+P PPC + + P+ K
Sbjct: 135 CNGGDPIKAWKYFSTHGLVTGGNYKSGEGCEPYRVPPCPRDDKGKN----TCAGKPREKN 190
Query: 62 HTRCTNDNYGRG--FFQDKYRFKR-YYWVNDEVADIQQEIMKNGPVVANM 108
H RCT YG +++ +R+ R +Y++ IQ+++M GP+ A
Sbjct: 191 H-RCTRMCYGNQDLDYREDHRYTRDFYYLT--YGSIQKDVMTYGPIEATF 237
>gi|428180143|gb|EKX49011.1| cathepsin B-like cysteine protease [Guillardia theta CCMP2712]
Length = 330
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 51/191 (26%), Positives = 68/191 (35%), Gaps = 61/191 (31%)
Query: 59 PKCHTRCTNDNYGRGFFQDKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYK 118
P C C + G KY+ YY + E DI +EI NGPV A +Y+ SYK
Sbjct: 193 PSCRISCVD-----GEPYKKYKASDYYQLTTE-EDIMKEIYLNGPVEAGFRVYTSFMSYK 246
Query: 119 SGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRV 178
SG Y + I G +A +KIVGWG E + +W
Sbjct: 247 SGVY-----------HHRILDIMEGGHA------------IKIVGWGVEPPKRFW----- 278
Query: 179 YAVSASAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRN-----EAI 233
+ YW +++ +G G KI RG+N E
Sbjct: 279 ----------------------QKPTKYWICANSWTADWGMNGFFKIRRGKNRFGQSECG 316
Query: 234 IESLVNGALPK 244
IE V PK
Sbjct: 317 IEDQVFAGHPK 327
>gi|340508280|gb|EGR34021.1| papain family cysteine protease, putative [Ichthyophthirius
multifiliis]
Length = 620
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/55 (41%), Positives = 34/55 (61%)
Query: 193 VKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALPKDNY 247
V ++GWG ENG YW + +++G +G+KG + LRG N IE A+PKD +
Sbjct: 226 VSIVGWGVENGVKYWIVRNSWGSYWGEKGFYRQLRGVNMINIEQFCYWAVPKDTW 280
>gi|145490612|ref|XP_001431306.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124398410|emb|CAK63908.1| unnamed protein product [Paramecium tetraurelia]
Length = 490
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/120 (27%), Positives = 52/120 (43%), Gaps = 29/120 (24%)
Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
NGPVV + D Y+SG+Y A + N W V
Sbjct: 360 NGPVVLSFEPSYDFMYYESGIYHSKA----------------QTNDYAEWEKVD------ 397
Query: 184 SAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIESLVNGALP 243
+V GWGEE+G +W + +++G Q+G+ G ++ RG +E+ IES+ + P
Sbjct: 398 -------HSVLCYGWGEEDGVKFWMLQNSWGNQWGEGGNFRMKRGVDESAIESMAEASDP 450
>gi|145509603|ref|XP_001440740.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124407968|emb|CAK73343.1| unnamed protein product [Paramecium tetraurelia]
Length = 357
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 33/117 (28%), Positives = 55/117 (47%), Gaps = 23/117 (19%)
Query: 77 DKYRFKRYYWVNDEVADIQQEIMKNGPVVANMYLYSDIFSYKSGKYGNGPVVANMYLYSD 136
+KY+ + Y ++ E +I++EI+ NGPVVA + ++ D YK G Y
Sbjct: 240 EKYKIQDYCVISSE-ENIKREILNNGPVVAVIQVFKDFLVYKGGIYE------------- 285
Query: 137 IFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSASAEIVAYATV 193
V S++ VK++GWG+++G YW I + S + +AY V
Sbjct: 286 ---------VVEGSSKFQYGHAVKVIGWGKQDGVNYWVIENSWGDSWGLKGLAYVAV 333
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 31/113 (27%), Positives = 51/113 (45%), Gaps = 33/113 (29%)
Query: 124 NGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGRPYWTIVRVYAVSA 183
NGPVVA + ++ D YK G+Y V V
Sbjct: 263 NGPVVAVIQVFKDFLVYKGGIYEV---------------------------------VEG 289
Query: 184 SAEIVAYATVKLIGWGEENGRPYWTIVSTFGEQFGDKGTIKILRGRNEAIIES 236
S++ VK+IGWG+++G YW I +++G+ +G KG + G+N+ +E+
Sbjct: 290 SSKFQYGHAVKVIGWGKQDGVNYWVIENSWGDSWGLKGLAYVAVGQNQLQLEA 342
>gi|403339807|gb|EJY69164.1| Cathepsin B [Oxytricha trifallax]
Length = 345
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 30/94 (31%), Positives = 44/94 (46%), Gaps = 16/94 (17%)
Query: 111 YSDIFSYKSGKYGNGPVVANMYLYSDIFSYKSGVYAVSASAEIVAYATVKIVGWGEENGR 170
Y DI K Y NGPV+ +Y D SY +G+Y V+ + V + GWG +NGR
Sbjct: 249 YEDI---KEEIYTNGPVMVGFVVYDDFSSYSTGIYEVTPDSVEEGGHAVTLNGWGYDNGR 305
Query: 171 PYWT-------------IVRVYAVSASAEIVAYA 191
YW R+YA A +++A++
Sbjct: 306 LYWIGQNQWQNTWGESGFFRIYAGEAGIDLMAFS 339
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.315 0.133 0.414
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,872,915,150
Number of Sequences: 23463169
Number of extensions: 218359601
Number of successful extensions: 488940
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 2423
Number of HSP's successfully gapped in prelim test: 394
Number of HSP's that attempted gapping in prelim test: 480824
Number of HSP's gapped (non-prelim): 5460
length of query: 280
length of database: 8,064,228,071
effective HSP length: 140
effective length of query: 140
effective length of database: 9,074,351,707
effective search space: 1270409238980
effective search space used: 1270409238980
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 76 (33.9 bits)