BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy8713
(309 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|P07858|CATB_HUMAN Cathepsin B OS=Homo sapiens GN=CTSB PE=1 SV=3
Length = 339
Score = 192 bits (489), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 97/196 (49%), Positives = 123/196 (62%), Gaps = 40/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D ++G SYSVS++E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 236
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAF+V+ D +LYKSG +
Sbjct: 237 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 267
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G+ +GGHAIRILGWG + + YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 317
Query: 294 ECGIESSITAGVPKLD 309
CGIES + AG+P+ D
Sbjct: 318 HCGIESEVVAGIPRTD 333
Score = 46.6 bits (109), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 23/31 (74%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
+CG GCNGG+P AW +W + G+VSGG Y S
Sbjct: 145 MCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 175
>sp|P07688|CATB_BOVIN Cathepsin B OS=Bos taurus GN=CTSB PE=1 SV=5
Length = 335
Score = 192 bits (489), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 98/193 (50%), Positives = 120/193 (62%), Gaps = 40/193 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D +FG SYSV++NE
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVANNE 236
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAF+V+ D +LYKSG +
Sbjct: 237 KEIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 267
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ SG+ +GGHAIRILGWG + + YWL+ NSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVSGEIMGGHAIRILGWGVENGT--PYWLVGNSWNTDWGDNGFFKILRGQD 317
Query: 294 ECGIESSITAGVP 306
CGIES I AG+P
Sbjct: 318 HCGIESEIVAGMP 330
Score = 64.3 bits (155), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 33/83 (39%), Positives = 43/83 (51%), Gaps = 12/83 (14%)
Query: 44 NSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYS------------EVDEDLPANFDSR 91
+ L N +W H YN+ + + +L G D LP +FD+R
Sbjct: 28 DELVNFVNKQNTTWKAGHNFYNVDLSYVKKLCGAILGGPKLPQRDAFAADVVLPESFDAR 87
Query: 92 TKWPNCPTIREIRDQGSCGSCWG 114
+WPNCPTI+EIRDQGSCGSCW
Sbjct: 88 EQWPNCPTIKEIRDQGSCGSCWA 110
Score = 32.0 bits (71), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 15/24 (62%), Positives = 17/24 (70%)
Query: 15 GGFPGMAWRYWVKSGIVSGGAYGS 38
GGFP AW +W K G+VSGG Y S
Sbjct: 152 GGFPSGAWNFWTKKGLVSGGLYNS 175
>sp|Q4R5M2|CATB_MACFA Cathepsin B OS=Macaca fascicularis GN=CTSB PE=2 SV=1
Length = 339
Score = 192 bits (488), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 97/196 (49%), Positives = 123/196 (62%), Gaps = 40/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D ++G SYSVS++E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 236
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAF+V+ D +LYKSG +
Sbjct: 237 KDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 267
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G+ +GGHAIRILGWG + + YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 317
Query: 294 ECGIESSITAGVPKLD 309
CGIES + AG+P+ D
Sbjct: 318 HCGIESEVVAGIPRTD 333
Score = 46.2 bits (108), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 23/31 (74%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
+CG GCNGG+P AW +W + G+VSGG Y S
Sbjct: 145 MCGDGCNGGYPAGAWNFWTRKGLVSGGLYDS 175
>sp|Q5R6D1|CATB_PONAB Cathepsin B OS=Pongo abelii GN=CTSB PE=2 SV=1
Length = 339
Score = 191 bits (486), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 96/196 (48%), Positives = 123/196 (62%), Gaps = 40/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D ++G SYSVS++E
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSE 236
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
+ IM EIY++GPVEGAF+V+ D +LYKSG +
Sbjct: 237 RDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY----------------------------- 267
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G+ +GGHAIRILGWG + + YWL+ANSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVTGEMMGGHAIRILGWGVENGT--PYWLVANSWNTDWGDNGFFKILRGQD 317
Query: 294 ECGIESSITAGVPKLD 309
CGIES + AG+P+ D
Sbjct: 318 HCGIESEVVAGIPRTD 333
Score = 46.6 bits (109), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 23/31 (74%)
Query: 8 LCGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
+CG GCNGG+P AW +W + G+VSGG Y S
Sbjct: 145 MCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 175
>sp|A1E295|CATB_PIG Cathepsin B OS=Sus scrofa GN=CTSB PE=1 SV=1
Length = 335
Score = 189 bits (480), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 97/193 (50%), Positives = 117/193 (60%), Gaps = 40/193 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D +FG SYS+S NE
Sbjct: 178 GCRPYSIPPCEHHVNGSRPPC-TGEGDTPKCSKICEPGYTPSYKEDKHFGCSSYSISRNE 236
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAFTV+ D + YKSG +
Sbjct: 237 KEIMAEIYKNGPVEGAFTVYSDFLQYKSGVY----------------------------- 267
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ +G +GGHAIRILGWG + + YWL+ NSWNTDWGDNG FKILRG+D
Sbjct: 268 --------QHVTGDLMGGHAIRILGWGVENGT--PYWLVGNSWNTDWGDNGFFKILRGQD 317
Query: 294 ECGIESSITAGVP 306
CGIES I AG+P
Sbjct: 318 HCGIESEIVAGIP 330
Score = 47.0 bits (110), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 20/30 (66%), Positives = 22/30 (73%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGGFP AW +W K G+VSGG Y S
Sbjct: 146 CGDGCNGGFPSGAWNFWTKKGLVSGGLYDS 175
>sp|P43233|CATB_CHICK Cathepsin B OS=Gallus gallus GN=CTSB PE=2 SV=1
Length = 340
Score = 184 bits (468), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 93/196 (47%), Positives = 116/196 (59%), Gaps = 39/196 (19%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GCR Y I PCEHHVNG+RP C G TP+C R C+ Y YK+D ++G SY V +E
Sbjct: 178 GCRAYTIPPCEHHVNGSRPPCTGEGGETPRCSRHCEPGYSPSYKEDKHYGITSYGVPRSE 237
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAF V++D ++YKSG +
Sbjct: 238 KEIMAEIYKNGPVEGAFIVYEDFLMYKSGVY----------------------------- 268
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+ SG+ +GGHAIRILGWG + + YWL ANSWNTDWG G FKILRG+D
Sbjct: 269 --------QHVSGEQVGGHAIRILGWGVENGT--PYWLAANSWNTDWGITGFFKILRGED 318
Query: 294 ECGIESSITAGVPKLD 309
CGIES I AGVP+++
Sbjct: 319 HCGIESEIVAGVPRME 334
Score = 49.3 bits (116), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 20/30 (66%), Positives = 23/30 (76%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGG+P AWRYW + G+VSGG Y S
Sbjct: 146 CGMGCNGGYPSGAWRYWTERGLVSGGLYDS 175
>sp|P00787|CATB_RAT Cathepsin B OS=Rattus norvegicus GN=Ctsb PE=1 SV=2
Length = 339
Score = 184 bits (468), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 95/194 (48%), Positives = 118/194 (60%), Gaps = 40/194 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D ++G SYSVS +E
Sbjct: 178 GCLPYTIPPCEHHVNGSRPPC-TGEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSE 236
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAFTVF D + YKSG +
Sbjct: 237 KEIMAEIYKNGPVEGAFTVFSDFLTYKSGVY----------------------------- 267
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+++G +GGHAIRILGWG + + YWL+ANSWN DWGDNG FKILRG++
Sbjct: 268 --------KHEAGDVMGGHAIRILGWGIE--NGVPYWLVANSWNVDWGDNGFFKILRGEN 317
Query: 294 ECGIESSITAGVPK 307
CGIES I AG+P+
Sbjct: 318 HCGIESEIVAGIPR 331
Score = 45.8 bits (107), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 18/30 (60%), Positives = 22/30 (73%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGG+P AW +W + G+VSGG Y S
Sbjct: 146 CGDGCNGGYPSGAWNFWTRKGLVSGGVYNS 175
>sp|P10605|CATB_MOUSE Cathepsin B OS=Mus musculus GN=Ctsb PE=1 SV=2
Length = 339
Score = 182 bits (462), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 95/196 (48%), Positives = 118/196 (60%), Gaps = 40/196 (20%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY I PCEHHVNG+RP C +G TP+C + C+ Y YK+D +FG SYSVS++
Sbjct: 178 GCLPYTIPPCEHHVNGSRPPC-TGEGDTPRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSV 236
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K IM EIY++GPVEGAFTVF D + YKSG +
Sbjct: 237 KEIMAEIYKNGPVEGAFTVFSDFLTYKSGVY----------------------------- 267
Query: 234 FTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD 293
+++G +GGHAIRILGWG + + YWL ANSWN DWGDNG FKILRG++
Sbjct: 268 --------KHEAGDMMGGHAIRILGWGVE--NGVPYWLAANSWNLDWGDNGFFKILRGEN 317
Query: 294 ECGIESSITAGVPKLD 309
CGIES I AG+P+ D
Sbjct: 318 HCGIESEIVAGIPRTD 333
Score = 46.2 bits (108), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 19/30 (63%), Positives = 22/30 (73%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGS 38
CG GCNGG+P AW +W K G+VSGG Y S
Sbjct: 146 CGDGCNGGYPSGAWSFWTKKGLVSGGVYNS 175
>sp|P25807|CPR1_CAEEL Gut-specific cysteine proteinase OS=Caenorhabditis elegans GN=cpr-1
PE=1 SV=2
Length = 329
Score = 155 bits (393), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 97/288 (33%), Positives = 129/288 (44%), Gaps = 98/288 (34%)
Query: 80 VDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW-----------------GCRPYEIAP 122
V +PA FDSRT+W C +I+ IRDQ +CGSCW G + I+P
Sbjct: 81 VLASVPATFDSRTQWSECKSIKLIRDQATCGSCWAFGAAEMISDRTCIETKGAQQPIISP 140
Query: 123 --------------CE---------------------HHVNGTRP-------SCDASKGH 140
CE +H G +P S + +
Sbjct: 141 DDLLSCCGSSCGNGCEGGYPIQALRWWDSKGVVTGGDYHGAGCKPYPIAPCTSGNCPESK 200
Query: 141 TPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYK 200
TP C CQ Y Y KD +FG +Y+V N SI EIY +GPVE AF+V++D YK
Sbjct: 201 TPSCSMSCQSGYSTAYAKDKHFGVSAYAVPKNAASIQAEIYANGPVEAAFSVYEDFYKYK 260
Query: 201 SGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWG 260
SG + + +GK LGGHAI+I+GWG
Sbjct: 261 SGVY-------------------------------------KHTAGKYLGGHAIKIIGWG 283
Query: 261 EDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
+ S YWL+ANSW +WG++G FKI RG D+CGIES++ AG K+
Sbjct: 284 TE--SGSPYWLVANSWGVNWGESGFFKIYRGDDQCGIESAVVAGKAKV 329
Score = 37.0 bits (84), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 15/28 (53%), Positives = 19/28 (67%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
CG GC GG+P A R+W G+V+GG Y
Sbjct: 151 CGNGCEGGYPIQALRWWDSKGVVTGGDY 178
>sp|P25792|CYSP_SCHMA Cathepsin B-like cysteine proteinase OS=Schistosoma mansoni PE=2
SV=1
Length = 340
Score = 151 bits (382), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 81/198 (40%), Positives = 106/198 (53%), Gaps = 53/198 (26%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC PY CEHH G P C + +TP+C + CQ Y PY +D + G SY+V ++E
Sbjct: 186 GCEPYPFPKCEHHTKGKYPPCGSKIYNTPRCKQTCQRKYKTPYTQDKHRGKSSYNVKNDE 245
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K+I KEI ++GPVE +
Sbjct: 246 KAIQKEIMKYGPVEAS-------------------------------------------- 261
Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
FTV++D + YKSG +ALGGHAIRI+GWG + K+ YWLIANSWN DWG+NG F
Sbjct: 262 FTVYEDFLNYKSGIYKHITGEALGGHAIRIIGWGVENKTP--YWLIANSWNEDWGENGYF 319
Query: 287 KILRGKDECGIESSITAG 304
+I+RG+DEC IES + AG
Sbjct: 320 RIVRGRDECSIESEVIAG 337
Score = 53.5 bits (127), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 23/49 (46%), Positives = 33/49 (67%), Gaps = 1/49 (2%)
Query: 65 NLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
+L R P + +++ + ++P+NFDSR KWP C +I IRDQ CGSCW
Sbjct: 71 DLRRKRRP-TVDHNDWNVEIPSNFDSRKKWPGCKSIATIRDQSRCGSCW 118
Score = 40.4 bits (93), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 16/27 (59%), Positives = 18/27 (66%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CG GC GG G AW YWVK GIV+ +
Sbjct: 154 CGLGCEGGILGPAWDYWVKEGIVTASS 180
>sp|P43509|CPR5_CAEEL Cathepsin B-like cysteine proteinase 5 OS=Caenorhabditis elegans
GN=cpr-5 PE=2 SV=1
Length = 344
Score = 150 bits (378), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 80/205 (39%), Positives = 111/205 (54%), Gaps = 42/205 (20%)
Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVREC--QENYDVPYKKDLNFG 163
GS + +GC+PY IAPC VNG + P+C TPKCV C + NY PY +D +FG
Sbjct: 175 GSYETQFGCKPYSIAPCGETVNGVKWPACPEDTEPTPKCVDSCTSKNNYATPYLQDKHFG 234
Query: 164 AKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRD 223
+ +Y+V + I EI +GP+E AFTV++D Y +G +
Sbjct: 235 STAYAVGKKVEQIQTEILTNGPIEVAFTVYEDFYQYTTGVY------------------- 275
Query: 224 NTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDN 283
++ +G +LGGHA++ILGWG D + YWL+ANSWN WG+
Sbjct: 276 ------------------VHTAGASLGGHAVKILGWGVDNGTP--YWLVANSWNVAWGEK 315
Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
G F+I+RG +ECGIE S AG+P L
Sbjct: 316 GYFRIIRGLNECGIEHSAVAGIPDL 340
>sp|P43157|CYSP_SCHJA Cathepsin B-like cysteine proteinase OS=Schistosoma japonicum
GN=CATB PE=2 SV=1
Length = 342
Score = 147 bits (372), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 77/201 (38%), Positives = 108/201 (53%), Gaps = 53/201 (26%)
Query: 114 GCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNE 173
GC+PY CEHH G P+C TP+C + CQ+ Y PY++D ++G +SY+V +NE
Sbjct: 187 GCQPYPFPKCEHHTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDESYNVQNNE 246
Query: 174 KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGA 233
K I ++I +GPVE A
Sbjct: 247 KVIQRDIMMYGPVEAA-------------------------------------------- 262
Query: 234 FTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
F V++D + YKSG +GGHAIRI+GWG ++++ YWLIANSWN DWG+ GLF
Sbjct: 263 FDVYEDFLNYKSGIYRHVTGSIVGGHAIRIIGWGVEKRTP--YWLIANSWNEDWGEKGLF 320
Query: 287 KILRGKDECGIESSITAGVPK 307
+++RG+DEC IES + AG+ K
Sbjct: 321 RMVRGRDECSIESDVVAGLIK 341
Score = 50.4 bits (119), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 20/27 (74%), Positives = 23/27 (85%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGA 35
CG GC GGFPG+AW YWVK GIV+GG+
Sbjct: 155 CGDGCQGGFPGVAWDYWVKRGIVTGGS 181
>sp|P43508|CPR4_CAEEL Cathepsin B-like cysteine proteinase 4 OS=Caenorhabditis elegans
GN=cpr-4 PE=2 SV=1
Length = 335
Score = 145 bits (365), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 96/300 (32%), Positives = 131/300 (43%), Gaps = 110/300 (36%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPA 68
CG+GC GG+P AW+Y VKSG +GG+Y ++ G P P
Sbjct: 146 CGYGCEGGYPINAWKYLVKSGFCTGGSYEAQ------------------FGCKPYSLAPC 187
Query: 69 NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVN 128
E +G WP+CP D G Y+ C +
Sbjct: 188 G---ETVG--------------NVTWPSCP------DDG----------YDTPACVN--- 211
Query: 129 GTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEG 188
KC +NY+V Y D +FG+ +Y+V I EI HGPVE
Sbjct: 212 --------------KCT---NKNYNVAYTADKHFGSTAYAVGKKVSQIQAEIIAHGPVEA 254
Query: 189 AFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKA 248
AFTV++D YK+G + ++ +G+
Sbjct: 255 AFTVYEDFYQYKTGVY-------------------------------------VHTTGQE 277
Query: 249 LGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
LGGHAIRILGWG D + YWL+ANSWN +WG+NG F+I+RG +ECGIE ++ GVPK+
Sbjct: 278 LGGHAIRILGWGTDNGT--PYWLVANSWNVNWGENGYFRIIRGTNECGIEHAVVGGVPKV 335
>sp|P43510|CPR6_CAEEL Cathepsin B-like cysteine proteinase 6 OS=Caenorhabditis elegans
GN=cpr-6 PE=1 SV=1
Length = 379
Score = 140 bits (353), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 76/198 (38%), Positives = 105/198 (53%), Gaps = 41/198 (20%)
Query: 114 GCRPYEIAPCEHHVNGTR-PSCDASKGHTPKCVRECQENY-DVPYKKDLNFGAKSYSVSS 171
GC+PY PCEHH T C TPKC ++C +Y D Y +D FGA +Y V
Sbjct: 202 GCKPYPFPPCEHHSKKTHFDPCPHDLYPTPKCEKKCVSDYTDKTYSEDKFFGASAYGVKD 261
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
+ ++I KE+ HGP+E AF V++D + Y G +
Sbjct: 262 DVEAIQKELMTHGPLEIAFEVYEDFLNYDGGVY--------------------------- 294
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
++ GK GGHA++++GWG D+ YW +ANSWNTDWG++G F+ILRG
Sbjct: 295 ----------VHTGGKLGGGHAVKLIGWGIDDGIP--YWTVANSWNTDWGEDGFFRILRG 342
Query: 292 KDECGIESSITAGVPKLD 309
DECGIES + G+PKL+
Sbjct: 343 VDECGIESGVVGGIPKLN 360
Score = 54.7 bits (130), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 21/36 (58%), Positives = 27/36 (75%)
Query: 79 EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
++D D+P +FDSR WP C +I+ IRDQ SCGSCW
Sbjct: 100 DLDLDIPESFDSRDNWPKCDSIKVIRDQSSCGSCWA 135
Score = 50.4 bits (119), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 25/37 (67%)
Query: 7 RLCGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK 43
+ CGFGCNGG P AWRYWVK GIV+G Y + K
Sbjct: 168 KSCGFGCNGGDPLAAWRYWVKDGIVTGSNYTANNGCK 204
>sp|P43507|CPR3_CAEEL Cathepsin B-like cysteine proteinase 3 OS=Caenorhabditis elegans
GN=cpr-3 PE=2 SV=1
Length = 370
Score = 123 bits (309), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 88/290 (30%), Positives = 124/290 (42%), Gaps = 100/290 (34%)
Query: 80 VDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHV--NGTR------ 131
V E LP FD+R KWP+C TI+ IR+Q +CGSCW E+ + NGT+
Sbjct: 88 VPEPLPDTFDAREKWPDCNTIKLIRNQATCGSCWAFGAAEVISDRVCIQSNGTQQPVISV 147
Query: 132 ----PSCDASKGHTPK----------------------------------CVREC----- 148
C + G+ K C + C
Sbjct: 148 EDILSCCGTTCGYGCKGGYSIEALRFWASSGAVTGGDYGGHGCMPYSFAPCTKNCPESTT 207
Query: 149 -------QENYDVPYKK-DLNFGAKSYSVSSNEK--SIMKEIYEHGPVEGAFTVFDDLIL 198
Q +Y K D ++GA +Y V++ + I EIY +GPVE ++ V++D
Sbjct: 208 PSCKTTCQSSYKTEEYKKDKHYGASAYKVTTTKSVTEIQTEIYHYGPVEASYKVYEDFYH 267
Query: 199 YKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILG 258
YKSG + Y SGK +GGHA++I+G
Sbjct: 268 YKSGVYH-------------------------------------YTSGKLVGGHAVKIIG 290
Query: 259 WGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVPKL 308
WG + + YWLIANSW T +G+ G FKI RG +EC IE ++ AG+ KL
Sbjct: 291 WGVE--NGVDYWLIANSWGTSFGEKGFFKIRRGTNECQIEGNVVAGIAKL 338
Score = 40.0 bits (92), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 16/29 (55%), Positives = 20/29 (68%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYG 37
CG+GC GG+ A R+W SG V+GG YG
Sbjct: 158 CGYGCKGGYSIEALRFWASSGAVTGGDYG 186
>sp|P25802|CYSP1_OSTOS Cathepsin B-like cysteine proteinase 1 OS=Ostertagia ostertagi
GN=CP-1 PE=3 SV=3
Length = 341
Score = 119 bits (298), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 63/190 (33%), Positives = 97/190 (51%), Gaps = 40/190 (21%)
Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEK 174
CRPYEI PC HH N T TP+C R C Y Y D + K+Y + ++ K
Sbjct: 189 CRPYEIHPCGHHGNETYYGECVGMADTPRCKRRCLLGYPKSYPSD-RYYKKAYQLKNSVK 247
Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
+I K+I ++GPV +TV++D Y+SG
Sbjct: 248 AIQKDIMKNGPVVATYTVYEDFAHYRSG-------------------------------- 275
Query: 235 TVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE 294
+ +K+G+ G HA++++GWGE++ + YW++ANSW+ DWG+NG F++ RG ++
Sbjct: 276 -----IYKHKAGRKTGLHAVKVIGWGEEKGTP--YWIVANSWHDDWGENGFFRMHRGSND 328
Query: 295 CGIESSITAG 304
CG E + AG
Sbjct: 329 CGFEERMAAG 338
Score = 41.6 bits (96), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 14/31 (45%), Positives = 21/31 (67%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
+P ++D R +W NC ++ I DQ +CGSCW
Sbjct: 91 IPESYDPRIQWANCSSLFHIPDQANCGSCWA 121
Score = 35.4 bits (80), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 15/31 (48%), Positives = 21/31 (67%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
CG GC GG+P A+R+ G+V+GG Y +K
Sbjct: 156 CGDGCEGGWPISAFRFHADEGVVTGGDYNTK 186
>sp|Q54QD9|CTSB_DICDI Cathepsin B OS=Dictyostelium discoideum GN=ctsB PE=3 SV=1
Length = 311
Score = 112 bits (279), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 82/293 (27%), Positives = 119/293 (40%), Gaps = 108/293 (36%)
Query: 73 ELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYE------------- 119
++ Y + +P +F+++T WPNC TI +I++Q CGSCW E
Sbjct: 68 QIKSYDPLGVQIPTSFNAQTNWPNCTTISQIQNQARCGSCWAFGATESATDRLCIHNNEN 127
Query: 120 -------IAPCEHHVNG--------------------------TRPSCDASKG------H 140
+ C+ NG T P+C ++ +
Sbjct: 128 VQLSFMDMVTCDETDNGCEGGDAFSAWNWLRKQGAVSEECLPYTIPTCPPAQQPCLNFVN 187
Query: 141 TPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYK 200
TP C +ECQ N + Y +D + AK YS S+E +IM+EI +GPVE
Sbjct: 188 TPSCTKECQSNSSLIYSQDKHKMAKIYSFDSDE-AIMQEIVTNGPVEAC----------- 235
Query: 201 SGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSG-------KALGGHA 253
FTVF+D + YKSG K LGGH
Sbjct: 236 ---------------------------------FTVFEDFLAYKSGVYVHTTGKDLGGHC 262
Query: 254 IRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
++++G+G + Y+ N W T WGDNG F I RG +CGI + AG+P
Sbjct: 263 VKLVGFGT--LNGVDYYAANNQWTTSWGDNGTFLIKRG--DCGISDDVVAGLP 311
>sp|P25793|CYSP2_HAECO Cathepsin B-like cysteine proteinase 2 OS=Haemonchus contortus
GN=AC-2 PE=2 SV=1
Length = 342
Score = 109 bits (272), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 62/191 (32%), Positives = 93/191 (48%), Gaps = 39/191 (20%)
Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEK 174
CRPY I PC HH N T TP C R+C+ Y+ D +G +Y V + K
Sbjct: 185 CRPYPIHPCGHHGNDTYYGECRGTAPTPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSVK 244
Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
+I EI ++GPV +F V Y+ R + G
Sbjct: 245 AIQSEILKNGPVVASFAV------YEDFRHYKSG-------------------------- 272
Query: 235 TVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE 294
+ + +G+ G HA++++GWG + + +WLIANSW+ DWG+ G F+I+RG ++
Sbjct: 273 -----IYKHTAGELRGYHAVKMIGWGNENNTD--FWLIANSWHNDWGEKGYFRIVRGSND 325
Query: 295 CGIESSITAGV 305
CGIE +I AG+
Sbjct: 326 CGIEGTIAAGI 336
Score = 41.6 bits (96), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 17/31 (54%), Positives = 23/31 (74%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
CG GC GG+P AW+Y++ G+VSGG Y +K
Sbjct: 152 CGDGCEGGWPIEAWKYFIYDGVVSGGEYLTK 182
>sp|P19092|CYSP1_HAECO Cathepsin B-like cysteine proteinase 1 OS=Haemonchus contortus
GN=AC-1 PE=2 SV=1
Length = 342
Score = 109 bits (272), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 62/191 (32%), Positives = 92/191 (48%), Gaps = 39/191 (20%)
Query: 115 CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEK 174
CRPY I PC HH N T TP C R+C+ Y+ D +G +Y V + K
Sbjct: 185 CRPYPIHPCGHHGNDTYYGECRGTAPTPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSVK 244
Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
+I EI +GPV +F V Y+ R + G
Sbjct: 245 AIQSEILRNGPVVASFAV------YEDFRHYKSG-------------------------- 272
Query: 235 TVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE 294
+ + +G+ G HA++++GWG + + +WLIANSW+ DWG+ G F+I+RG ++
Sbjct: 273 -----IYKHTAGELRGYHAVKMIGWGNENNTD--FWLIANSWHNDWGEKGYFRIIRGTND 325
Query: 295 CGIESSITAGV 305
CGIE +I AG+
Sbjct: 326 CGIEGTIAAGI 336
Score = 41.2 bits (95), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 17/31 (54%), Positives = 23/31 (74%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSK 39
CG GC GG+P AW+Y++ G+VSGG Y +K
Sbjct: 152 CGDGCEGGWPIEAWKYFIYDGVVSGGEYLTK 182
>sp|Q06544|CYSP3_OSTOS Cathepsin B-like cysteine proteinase 3 (Fragment) OS=Ostertagia
ostertagi GN=CP-3 PE=3 SV=1
Length = 174
Score = 109 bits (272), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 67/202 (33%), Positives = 98/202 (48%), Gaps = 61/202 (30%)
Query: 115 CRPYEIAPCEHHVNGTRP---SC-DASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVS 170
CRPYE PC H G P C D +K TPKC + CQ Y YK+D +FG +Y +
Sbjct: 22 CRPYEFPPCGRH--GKEPYYGECYDTAK--TPKCQKTCQRGYLKAYKEDKHFGKSAYRLP 77
Query: 171 SNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGA 230
+N K+I ++I ++GPV F
Sbjct: 78 NNVKAIQRDIMKNGPVVAGFI--------------------------------------- 98
Query: 231 EGAFTVFDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDN 283
V++D YKSG + GGHA++I+GWG+++ + YWLIANSW+ DWG+
Sbjct: 99 -----VYEDFAHYKSGIYKHTAGRMTGGHAVKIIGWGKEKGTP--YWLIANSWHDDWGEK 151
Query: 284 GLFKILRGKDECGIESSITAGV 305
G ++++RG + C IE + AG+
Sbjct: 152 GFYRMIRGINNCRIEEMVFAGI 173
>sp|P90850|YCF2E_CAEEL Uncharacterized peptidase C1-like protein F26E4.3 OS=Caenorhabditis
elegans GN=F26E4.3 PE=1 SV=3
Length = 452
Score = 86.7 bits (213), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 50/137 (36%), Positives = 69/137 (50%), Gaps = 31/137 (22%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y VSS E+ I E+ +GPV+ F V +D +Y G + D +
Sbjct: 318 YKVSSREEDIQTELMTNGPVQATFVVHEDFFMYAGGVY---------------QHSDLAA 362
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKE--KYWLIANSWNTDWGDNG 284
Q GA S A G H++R+LGWG D + + KYWL ANSW T WG++G
Sbjct: 363 QKGA--------------SSVAEGYHSVRVLGWGVDHSTGKPIKYWLCANSWGTQWGEDG 408
Query: 285 LFKILRGKDECGIESSI 301
FK+LRG++ C IES +
Sbjct: 409 YFKVLRGENHCEIESFV 425
Score = 35.8 bits (81), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 16/32 (50%), Positives = 20/32 (62%), Gaps = 2/32 (6%)
Query: 82 EDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
+LP +FD+R KW P I + DQG CGS W
Sbjct: 182 RELPEHFDARDKWG--PLIHPVADQGDCGSSW 211
>sp|P53634|CATC_HUMAN Dipeptidyl peptidase 1 OS=Homo sapiens GN=CTSC PE=1 SV=2
Length = 463
Score = 85.5 bits (210), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 71/267 (26%), Positives = 101/267 (37%), Gaps = 82/267 (30%)
Query: 84 LPANFDSRTKWPNCPTIR---EIRDQGSCGSCWG----------------------CRPY 118
LP ++D W N I +R+Q SCGSC+ P
Sbjct: 231 LPTSWD----WRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQ 286
Query: 119 EIAPCEHHVNG-------------------TRPSCDASKGHTPKCVRECQENYDVPYKKD 159
E+ C + G +C G C + +E+ Y +
Sbjct: 287 EVVSCSQYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYTGTDSPC--KMKEDCFRYYSSE 344
Query: 160 LNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKW 219
++ Y NE + E+ HGP+ AF V+DD + YK G + G
Sbjct: 345 YHYVGGFYG-GCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTG----------- 392
Query: 220 TIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTD 279
+RD F F+ L HA+ ++G+G D S YW++ NSW T
Sbjct: 393 -LRD---------PFNPFE----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTG 432
Query: 280 WGDNGLFKILRGKDECGIESSITAGVP 306
WG+NG F+I RG DEC IES A P
Sbjct: 433 WGENGYFRIRRGTDECAIESIAVAATP 459
>sp|O97578|CATC_CANFA Dipeptidyl peptidase 1 (Fragment) OS=Canis familiaris GN=CTSC PE=1
SV=1
Length = 435
Score = 84.3 bits (207), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 64/244 (26%), Positives = 93/244 (38%), Gaps = 68/244 (27%)
Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNGTR---PSC 134
+ +R+Q SCGSC+ P EI C + G P
Sbjct: 219 VSPVRNQASCGSCYAFASTAMLEARIRILTNNTQTPILSPQEIVSCSQYAQGCEGGFPYL 278
Query: 135 DASKGHTPKCVRE------------CQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYE 182
A K + E C+ N Y + + + NE + E+
Sbjct: 279 IAGKYAQDFGLVEEACFPYAGSDSPCKPNDCFRYYSSEYYYVGGFYGACNEALMKLELVR 338
Query: 183 HGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLIL 242
HGP+ AF V+DD Y+ G ++ G +RD F F+
Sbjct: 339 HGPMAVAFEVYDDFFHYQKGIYYHTG------------LRD---------PFNPFE---- 373
Query: 243 YKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSIT 302
L HA+ ++G+G D S YW++ NSW + WG++G F+I RG DEC IES
Sbjct: 374 ------LTNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFRIRRGTDECAIESIAV 427
Query: 303 AGVP 306
A P
Sbjct: 428 AATP 431
>sp|P92131|CATB1_GIAIN Cathepsin B-like CP1 OS=Giardia intestinalis GN=CP1 PE=2 SV=3
Length = 303
Score = 84.3 bits (207), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 78/286 (27%), Positives = 119/286 (41%), Gaps = 39/286 (13%)
Query: 39 KQAEKNSLSNIPRAHLKSWMGVHPDY------NLPANRLPELIGYSEVDEDLPANFDSRT 92
K N+ +S M + PD +LP + E+ E+ + +P FD R
Sbjct: 32 KAGMPKRFENVTEDEFRS-MLIRPDRLRARSGSLPPISITEV---QELVDPIPPQFDFRD 87
Query: 93 KWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGT-RPSCDASKGHTPKCVRE---C 148
++P C ++ DQGSCGSCW + G + + S+ H C E C
Sbjct: 88 EYPQC--VKPALDQGSCGSCWAFSAIGVFGDRRCAMGIDKEAVSYSQQHLISCSLENFGC 145
Query: 149 QENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDD---LILYKSGRFF 205
P L F ++ + + Y H V DD + LYK+ +
Sbjct: 146 DGGDFQPTWSFLTFTG-----ATTAECVKYVDYGHTVASPCPAVCDDGSPIQLYKAHGY- 199
Query: 206 VPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKA--------LGGHAIRIL 257
G + ++ I + + V+ DL Y+SG LG HA+ I+
Sbjct: 200 --GQVSKSVPAIMGMLVAGGP---LQTMIVVYADLSYYESGVYKHTYGTINLGFHALEIV 254
Query: 258 GWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITA 303
G+G + + YW+I NSW DWG+NG F+I+RG +EC IE I A
Sbjct: 255 GYGTTDDGTD-YWIIKNSWGPDWGENGYFRIVRGVNECRIEDEIYA 299
>sp|P92133|CATB3_GIAIN Cathepsin B-like CP3 OS=Giardia intestinalis GN=CP3 PE=2 SV=2
Length = 299
Score = 83.6 bits (205), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 75/252 (29%), Positives = 112/252 (44%), Gaps = 63/252 (25%)
Query: 85 PANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKC 144
P +FD R ++P+C I E+ DQG CGSCW S AS G
Sbjct: 75 PDSFDFREEYPHC--IPEVVDQGGCGSCWAF-----------------SSVASVGD---- 111
Query: 145 VRECQENYDVPYKKDLNFGAKSYSVSSNEKSI------MKEIYEHGPVEGAFTVFDDLIL 198
R C D KK + + + Y VS + + + ++ G T D+ +
Sbjct: 112 -RRCFAGLD---KKAVKY-SPQYVVSCDRGDMACDGGWLPSVWRFLTKTG--TTTDECVP 164
Query: 199 YKSGRFFVPGNETTAMS-------LIKWTIR-----DNTSQLGA-------EGAFTVFDD 239
Y+SG G T + L K T D + + A + AFTV+ D
Sbjct: 165 YQSGSTGARGTCPTKCADGSDLPHLYKATKAVDYGLDAPAIMKALATGGPLQTAFTVYSD 224
Query: 240 LILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGK 292
+ Y+SG + GGHA+ ++G+G D+ + YW+I NSW DWG++G F+I+R
Sbjct: 225 FMYYESGVYQHTYGRVEGGHAVDMVGYGTDDDGVD-YWIIKNSWGPDWGEDGYFRIIRMT 283
Query: 293 DECGIESSITAG 304
+ECGIE + G
Sbjct: 284 NECGIEEQVIGG 295
>sp|Q5RB02|CATC_PONAB Dipeptidyl peptidase 1 OS=Pongo abelii GN=CTSC PE=2 SV=1
Length = 463
Score = 83.6 bits (205), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 64/248 (25%), Positives = 94/248 (37%), Gaps = 75/248 (30%)
Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
+ +R+Q SCGSC+ P E+ C + G
Sbjct: 246 VSPVRNQASCGSCYSFASMGMLEARIRILTSNSQTPILSPQEVVSCSQYAQGCEGGFPYL 305
Query: 130 -----------TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
+C G C + +E+ Y + ++ Y NE +
Sbjct: 306 IAGKYAQDFGLVEEACFPYTGTDSPC--KMKEDCFRYYSSEYHYVGGFYG-GCNEALMKL 362
Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
E+ HGP+ AF V+DD + YK G + G +RD F F+
Sbjct: 363 ELVHHGPMAVAFEVYDDFLHYKKGIYHHTG------------LRD---------PFNPFE 401
Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
L HA+ ++G+G D S YW++ NSW T WG++G F+I RG DEC IE
Sbjct: 402 ----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGEDGYFRIRRGTDECAIE 451
Query: 299 SSITAGVP 306
S A P
Sbjct: 452 SIAVAATP 459
>sp|Q3ZCJ8|CATC_BOVIN Dipeptidyl peptidase 1 OS=Bos taurus GN=CTSC PE=2 SV=1
Length = 463
Score = 82.0 bits (201), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 64/248 (25%), Positives = 91/248 (36%), Gaps = 75/248 (30%)
Query: 100 IREIRDQGSCGSCWG----------------------CRPYEIAPCEHHVNG-------- 129
+ +R+QGSCGSC+ P E+ C + G
Sbjct: 246 VTPVRNQGSCGSCYSFASMGMMEARIRILTNNTQTPILSPQEVVSCSQYAQGCEGGFPYL 305
Query: 130 -----------TRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMK 178
C G C +E Y + ++ Y NE +
Sbjct: 306 IAGKYAQDFGLVEEDCFPYTGTDSPC--RLKEGCFRYYSSEYHYVGGFYG-GCNEALMKL 362
Query: 179 EIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFD 238
E+ GP+ AF V+DD + Y+ G + G +RD F F+
Sbjct: 363 ELVHQGPMAVAFEVYDDFLHYRKGVYHHTG------------LRD---------PFNPFE 401
Query: 239 DLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIE 298
L HA+ ++G+G D S YW++ NSW T WG+NG F+I RG DEC IE
Sbjct: 402 ----------LTNHAVLLVGYGTDAASGLDYWIVKNSWGTSWGENGYFRIRRGTDECAIE 451
Query: 299 SSITAGVP 306
S A P
Sbjct: 452 SIALAATP 459
>sp|Q9UJW2|TINAG_HUMAN Tubulointerstitial nephritis antigen OS=Homo sapiens GN=TINAG PE=2
SV=3
Length = 476
Score = 82.0 bits (201), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 51/145 (35%), Positives = 69/145 (47%), Gaps = 32/145 (22%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y VSSNE IMKEI ++GPV+ V +D YK+G + R TS
Sbjct: 355 YRVSSNETEIMKEIMQNGPVQAIMQVREDFFHYKTGIY-----------------RHVTS 397
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWGDN 283
+ + L HA+++ GWG + KEK+W+ ANSW WG+N
Sbjct: 398 TNKESEKY------------RKLQTHAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWGEN 445
Query: 284 GLFKILRGKDECGIESSITAGVPKL 308
G F+ILRG +E IE I A +L
Sbjct: 446 GYFRILRGVNESDIEKLIIAAWGQL 470
>sp|P92132|CATB2_GIAIN Cathepsin B-like CP2 OS=Giardia intestinalis GN=CP2 PE=1 SV=2
Length = 300
Score = 82.0 bits (201), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 72/255 (28%), Positives = 114/255 (44%), Gaps = 63/255 (24%)
Query: 82 EDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHT 141
+D+P +FD R ++P+C I E+ DQG CGSCW S A+ G
Sbjct: 73 DDVPESFDFREEYPHC--IPEVVDQGGCGSCWAF-----------------SSVATFGD- 112
Query: 142 PKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSI------MKEIYEHGPVEGAFTVFDD 195
R C D KK + + + Y VS + + + +++ G T D+
Sbjct: 113 ----RRCVAGLD---KKPVKYSPQ-YVVSCDHGDMACNGGWLPNVWKFLTKTG--TTTDE 162
Query: 196 LILYKSGRFFVPG-------------NETTAMSL------IKWTIRDNTSQLGAEGAFTV 236
+ YKSG + G + TA S I ++ ++ + AF V
Sbjct: 163 CVPYKSGSTTLRGTCPTKCADGSSKVHLATATSYKDYGLDIPAMMKALSTSGPLQVAFLV 222
Query: 237 FDDLILYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKIL 289
D + Y+SG GGHA+ ++G+G D+ + YW+I NSW DWG++G F+++
Sbjct: 223 HSDFMYYESGVYQHTYGYMEGGHAVEMVGYGTDDDGVD-YWIIKNSWGPDWGEDGYFRMI 281
Query: 290 RGKDECGIESSITAG 304
RG ++C IE AG
Sbjct: 282 RGINDCSIEEQAYAG 296
>sp|Q60HG6|CATC_MACFA Dipeptidyl peptidase 1 OS=Macaca fascicularis GN=CTSC PE=2 SV=1
Length = 463
Score = 81.6 bits (200), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 46/135 (34%), Positives = 65/135 (48%), Gaps = 31/135 (22%)
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
NE + E+ HGP+ AF V+DD + Y++G + G +RD
Sbjct: 356 NEALMKLELVYHGPLAVAFEVYDDFLHYQNGIYHHTG------------LRD-------- 395
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
F F+ L HA+ ++G+G D S YW++ NSW T WG++G F+I RG
Sbjct: 396 -PFNPFE----------LTNHAVLLVGYGTDSASGMDYWIVKNSWGTSWGEDGYFRIRRG 444
Query: 292 KDECGIESSITAGVP 306
DEC IES A P
Sbjct: 445 TDECAIESIAVAATP 459
>sp|P97821|CATC_MOUSE Dipeptidyl peptidase 1 OS=Mus musculus GN=Ctsc PE=2 SV=1
Length = 462
Score = 80.9 bits (198), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 44/135 (32%), Positives = 64/135 (47%), Gaps = 31/135 (22%)
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
NE + E+ +HGP+ AF V DD + Y SG + G
Sbjct: 355 NEALMKLELVKHGPMAVAFEVHDDFLHYHSGIY---------------------HHTGLS 393
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
F F+ L HA+ ++G+G D + +YW+I NSW ++WG++G F+I RG
Sbjct: 394 DPFNPFE----------LTNHAVLLVGYGRDPVTGIEYWIIKNSWGSNWGESGYFRIRRG 443
Query: 292 KDECGIESSITAGVP 306
DEC IES A +P
Sbjct: 444 TDECAIESIAVAAIP 458
>sp|Q3SZI1|TINAG_BOVIN Tubulointerstitial nephritis antigen OS=Bos taurus GN=TINAG PE=2
SV=1
Length = 476
Score = 80.9 bits (198), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 49/147 (33%), Positives = 68/147 (46%), Gaps = 36/147 (24%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSG--RFFVPGNETTAMSLIKWTIRDN 224
Y VSSNE IM+EI ++GPV+ V +D YK+G R NE +
Sbjct: 355 YRVSSNETEIMREIMQNGPVQAIMQVHEDFFNYKTGIYRHITSTNEDS------------ 402
Query: 225 TSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGE---DEKSKEKYWLIANSWNTDWG 281
+ + HA+++ GWG + KEK+W+ ANSW WG
Sbjct: 403 -------------------EKYRKFRTHAVKLTGWGTLRGAQGQKEKFWIAANSWGKSWG 443
Query: 282 DNGLFKILRGKDECGIESSITAGVPKL 308
+NG F+ILRG +E IE I A +L
Sbjct: 444 ENGYFRILRGVNESDIEKLIIAAWGQL 470
>sp|Q26563|CATC_SCHMA Cathepsin C OS=Schistosoma mansoni PE=2 SV=1
Length = 454
Score = 79.7 bits (195), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 75/282 (26%), Positives = 104/282 (36%), Gaps = 92/282 (32%)
Query: 67 PANRLPELIGYSEVDEDLPANFDSRTKWPNCP-----TIREIRDQGSCGSCWG------- 114
P+ L L G +LP FD W + P + IR+QG CGSC+
Sbjct: 207 PSKELISLTG------NLPLEFD----WTSPPDGSRSPVTPIRNQGICGSCYASPSAAAL 256
Query: 115 ---------------CRPYEIAPCEHH---VNGTRPSCDASK-----------------G 139
P + C + NG P A K
Sbjct: 257 EARIRLVSNFSEQPILSPQTVVDCSPYSEGCNGGFPFLIAGKYGEDFGLPQKIVIPYTGE 316
Query: 140 HTPKCV--RECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLI 197
T KC + C Y Y Y ++NEK + E+ +GP F V++D
Sbjct: 317 DTGKCTVSKNCTRYYTTDYSY-----IGGYYGATNEKLMQLELISNGPFPVGFEVYEDFQ 371
Query: 198 LYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRIL 257
YK G I +T+ F F+ L HA+ ++
Sbjct: 372 FYKEG------------------IYHHTTVQTDHYNFNPFE----------LTNHAVLLV 403
Query: 258 GWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIES 299
G+G D+ S E YW + NSW +WG+ G F+ILRG DECG+ES
Sbjct: 404 GYGVDKLSGEPYWKVKNSWGVEWGEQGYFRILRGTDECGVES 445
>sp|P80067|CATC_RAT Dipeptidyl peptidase 1 OS=Rattus norvegicus GN=Ctsc PE=1 SV=3
Length = 462
Score = 79.3 bits (194), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 43/135 (31%), Positives = 63/135 (46%), Gaps = 31/135 (22%)
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
NE + E+ +HGP+ AF V DD + Y SG + G
Sbjct: 355 NEALMKLELVKHGPMAVAFEVHDDFLHYHSGIY---------------------HHTGLS 393
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
F F+ L HA+ ++G+G+D + YW++ NSW + WG++G F+I RG
Sbjct: 394 DPFNPFE----------LTNHAVLLVGYGKDPVTGLDYWIVKNSWGSQWGESGYFRIRRG 443
Query: 292 KDECGIESSITAGVP 306
DEC IES A +P
Sbjct: 444 TDECAIESIAMAAIP 458
>sp|Q9GZM7|TINAL_HUMAN Tubulointerstitial nephritis antigen-like OS=Homo sapiens
GN=TINAGL1 PE=1 SV=1
Length = 467
Score = 78.6 bits (192), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 47/138 (34%), Positives = 68/138 (49%), Gaps = 32/138 (23%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
Y + SN+K IMKE+ E+GPV+ V +D LYK G + T +SL +
Sbjct: 344 YRLGSNDKEIMKELMENGPVQALMEVHEDFFLYKGGIY-----SHTPVSLGR-------- 390
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGED---EKSKEKYWLIANSWNTDWGDN 283
+ + G H+++I GWGE+ + KYW ANSW WG+
Sbjct: 391 ----------------PERYRRHGTHSVKITGWGEETLPDGRTLKYWTAANSWGPAWGER 434
Query: 284 GLFKILRGKDECGIESSI 301
G F+I+RG +EC IES +
Sbjct: 435 GHFRIVRGVNECDIESFV 452
>sp|Q99JR5|TINAL_MOUSE Tubulointerstitial nephritis antigen-like OS=Mus musculus
GN=Tinagl1 PE=1 SV=1
Length = 466
Score = 78.6 bits (192), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 76/296 (25%), Positives = 113/296 (38%), Gaps = 97/296 (32%)
Query: 69 NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG-------------- 114
N + ++G EV LP F++ KWPN I E DQG+C W
Sbjct: 190 NEIYTVLGQGEV---LPTAFEASEKWPN--LIHEPLDQGNCAGSWAFSTAAVASDRVSIH 244
Query: 115 --------CRPYEIAPCE-HHVNGTR------------------PSCDASKGH------- 140
P + C+ HH G R +C G
Sbjct: 245 SLGHMTPILSPQNLLSCDTHHQQGCRGGRLDGAWWFLRRRGVVSDNCYPFSGREQNEASP 304
Query: 141 TPKCVREC--------QENYDVPYKK----DLNFGAKSYSVSSNEKSIMKEIYEHGPVEG 188
TP+C+ Q P + D+ +Y + S+EK IMKE+ E+GPV+
Sbjct: 305 TPRCMMHSRAMGRGKRQATSRCPNGQVDSNDIYQVTPAYRLGSDEKEIMKELMENGPVQA 364
Query: 189 AFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKA 248
V +D LY+ G + T +S + + +
Sbjct: 365 LMEVHEDFFLYQRGIY-----SHTPVSQGR------------------------PEQYRR 395
Query: 249 LGGHAIRILGWGED---EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSI 301
G H+++I GWGE+ + KYW ANSW WG+ G F+I+RG +EC IE+ +
Sbjct: 396 HGTHSVKITGWGEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGTNECDIETFV 451
>sp|Q9EQT5|TINAL_RAT Tubulointerstitial nephritis antigen-like OS=Rattus norvegicus
GN=Tinagl1 PE=2 SV=1
Length = 467
Score = 77.8 bits (190), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 77/297 (25%), Positives = 114/297 (38%), Gaps = 98/297 (32%)
Query: 69 NRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG-------------- 114
N + ++G EV LP F++ KWPN I E DQG+C W
Sbjct: 190 NEIYTVLGQGEV---LPTAFEASEKWPN--LIHEPLDQGNCAGSWAFSTAAVASDRVSIH 244
Query: 115 --------CRPYEIAPCE-HHVNGTR------------------PSCDASKGH------- 140
P + C+ HH G R +C G
Sbjct: 245 SLGHMTPILSPQNLLSCDTHHQKGCRGGRLDGAWWFLRRRGVVSDNCYPFSGREQNDEAS 304
Query: 141 -TPKCVREC--------QENYDVPYKK----DLNFGAKSYSVSSNEKSIMKEIYEHGPVE 187
TP+C+ Q P + D+ Y ++S+EK IMKE+ E+GPV+
Sbjct: 305 PTPRCMMHSRAMGRGKRQATSRCPNSQVDSNDIYQVTPVYRLASDEKEIMKELMENGPVQ 364
Query: 188 GAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGK 247
V +D LY+ G + T +S +G + +
Sbjct: 365 ALMEVHEDFFLYQRGIY-----SHTPVS---------------QGRPEQY---------R 395
Query: 248 ALGGHAIRILGWGED---EKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSI 301
G H+++I GWGE+ + KYW ANSW WG+ G F+I+RG +EC IE+ +
Sbjct: 396 RHGTHSVKITGWGEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGINECDIETFV 452
>sp|Q8IIJ9|CATC_PLAF7 Probable cathepsin C OS=Plasmodium falciparum (isolate 3D7)
GN=PF11_0174 PE=1 SV=1
Length = 700
Score = 73.2 bits (178), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 87/313 (27%), Positives = 118/313 (37%), Gaps = 88/313 (28%)
Query: 4 QQIRLCGF---GCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEKNSLSNIPRAHLKSWMGV 60
Q + C F GCNGGFP + SK A+ L IP
Sbjct: 434 QTVLSCSFYDQGCNGGFPYLV----------------SKLAK---LQGIP---------- 464
Query: 61 HPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPN----CPTIREIRDQGSCGSCWGCR 116
L YS +E P N +K PN +REI S
Sbjct: 465 ----------LNVYFPYSATEETCPYNI---SKHPNDMNGSAKLREI--NAIFNSNNNMS 509
Query: 117 PYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVP-----YKKDLNFGAKSYSVS- 170
Y +HH G + +S QE + + Y KD N+ Y +
Sbjct: 510 TYNNINNDHHQLGVYANTASS-----------QEQHGISEENRWYAKDFNYVGGCYGCNQ 558
Query: 171 -SNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLG 229
+ EK +M EIY +GP+ +F D Y G +FV R T +
Sbjct: 559 CNGEKIMMNEIYRNGPIVSSFEASPDFYDYADGVYFVEDFPHA---------RRCTIEPK 609
Query: 230 AEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKE--KYWLIANSWNTDWGDNGLFK 287
+G + + +G HAI +LGWGE+E + + KYW+ NSW WG G FK
Sbjct: 610 NDGVYNI--------TGWDRVNHAIVLLGWGEEEINGKLYKYWIGRNSWGNGWGKEGYFK 661
Query: 288 ILRGKDECGIESS 300
ILRG++ GIES
Sbjct: 662 ILRGQNFSGIESQ 674
>sp|Q26534|CATL_SCHMA Cathepsin L OS=Schistosoma mansoni GN=CL1 PE=2 SV=1
Length = 319
Score = 67.8 bits (164), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 75/310 (24%), Positives = 116/310 (37%), Gaps = 82/310 (26%)
Query: 30 IVSGGA-YGSKQAEKNSLSNIPRAHLK-SWMGVHPDYNLPANRLPELIGYSEVDEDLPAN 87
V G A YG + R HL SW+ N P + E+ ++P N
Sbjct: 56 FVRGSAIYGVTPYSDLTTDEFARTHLTASWVVPSSRSNTPTSLGKEV-------NNIPKN 108
Query: 88 FDSRTKWPNCPTIREIRDQGSCGSCWGCRPY-------------EIAPCEHHVNGTRPSC 134
FD R K + E+++QG CGSCW ++ E +
Sbjct: 109 FDWREK----GAVTEVKNQGMCGSCWAFSTTGNVESQWFRKTGKLLSLSEQQLVDCDGLD 164
Query: 135 DASKGHTPKCVRE---------CQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGP 185
D G P E ++NY PY + NEK +K
Sbjct: 165 DGCNGGLPSNAYESIIKMGGLMLEDNY--PYD------------AKNEKCHLKT------ 204
Query: 186 VEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKS 245
D + +Y + + +ET L W ++T +G F Y+
Sbjct: 205 --------DGVAVYINSSVNLTQDET---ELAAWLYHNSTISVGMNALLLQF-----YQH 248
Query: 246 G----------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDEC 295
G K L HA+ ++G+G EK+ E +W++ NSW +WG+NG F++ RG C
Sbjct: 249 GISHPWWIFCSKYLLDHAVLLVGYGVSEKN-EPFWIVKNSWGVEWGENGYFRMYRGDGSC 307
Query: 296 GIESSITAGV 305
GI + T+ +
Sbjct: 308 GINTVATSAM 317
>sp|P43295|A494_ARATH Probable cysteine proteinase A494 OS=Arabidopsis thaliana
GN=At2g21430 PE=2 SV=2
Length = 361
Score = 64.7 bits (156), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 76/285 (26%), Positives = 119/285 (41%), Gaps = 37/285 (12%)
Query: 36 YGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLP--ANRLPELIGYSEVDEDLPANFDSRTK 93
+G Q + S R HL GV + LP AN+ P L ++LP FD
Sbjct: 91 HGVTQFSDLTRSEFRRKHL----GVKGGFKLPKDANQAPIL-----PTQNLPEEFD---- 137
Query: 94 WPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYD 153
W + + +++QGSCGSCW H + T S+ C EC +
Sbjct: 138 WRDRGAVTPVKNQGSCGSCWSFSTTGALEGAHFL-ATGKLVSLSEQQLVDCDHECDPEEE 196
Query: 154 VPYKKDLNFGAKSYSVSSNEKS--IMKEI-YEHGPVEGAFTVFDDLILYKSGRFF--VPG 208
N G + + K+ +M+E Y + +G D + S F V
Sbjct: 197 GSCDSGCNGGLMNSAFEYTLKTGGLMREKDYPYTGTDGGSCKLDRSKIVASVSNFSVVSI 256
Query: 209 NE-TTAMSLIK---WTIRDNTSQLGAE-GAFTVFDDLILYKSGKALGGHAIRILGWGE-- 261
NE A +LIK + N + + G + Y + L H + ++G+G
Sbjct: 257 NEDQIAANLIKNGPLAVAINAAYMQTYIGGVSC-----PYICSRRLN-HGVLLVGYGSAG 310
Query: 262 --DEKSKEK-YWLIANSWNTDWGDNGLFKILRGKDECGIESSITA 303
+ KEK YW+I NSW WG+NG +KI +G++ CG++S ++
Sbjct: 311 FSQARLKEKPYWIIKNSWGESWGENGFYKICKGRNICGVDSLVST 355
>sp|P22497|CYSP_THEPA Cysteine proteinase OS=Theileria parva GN=TP03_0285 PE=3 SV=2
Length = 440
Score = 64.3 bits (155), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 66/261 (25%), Positives = 100/261 (38%), Gaps = 83/261 (31%)
Query: 81 DEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGC---------------RPYEIA---- 121
D DL W ++ ++DQ +CG CW + YE++
Sbjct: 222 DVDLAKLTGENLDWRRSSSVTSVKDQSNCGGCWAFSTVGSVEGYYMSHFDKSYELSVQEL 281
Query: 122 -PCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNF----------GAKSYSVS 170
C+ NG +G + E Y + KDL F AK SV
Sbjct: 282 LDCDSFSNGC-------QGGLLESAYEYVRKYGLVSAKDLPFVDKARRCSVPKAKKVSVP 334
Query: 171 SNE----KSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
S K +M P +V +L YKSG F G
Sbjct: 335 SYHVFKGKEVMTRSLTSSPCSVYLSVSPELAKYKSGVF--TG------------------ 374
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
+ GK+L HA+ ++G G DE +K++YW++ NSW TDWG+NG
Sbjct: 375 -----------------ECGKSLN-HAVVLVGEGYDEVTKKRYWVVQNSWGTDWGENGYM 416
Query: 287 KILR---GKDECGI-ESSITA 303
++ R G D+CG+ ++S++A
Sbjct: 417 RLERTNMGTDKCGVLDTSMSA 437
>sp|P43296|RD19A_ARATH Cysteine proteinase RD19a OS=Arabidopsis thaliana GN=RD19A PE=2
SV=1
Length = 368
Score = 63.9 bits (154), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 69/275 (25%), Positives = 114/275 (41%), Gaps = 34/275 (12%)
Query: 46 LSNIPRAHL-KSWMGVHPDYNLP--ANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIRE 102
S++ R+ K +GV + LP AN+ P L E+LP +FD W + +
Sbjct: 99 FSDLTRSEFRKKHLGVRSGFKLPKDANKAPIL-----PTENLPEDFD----WRDHGAVTP 149
Query: 103 IRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNF 162
+++QGSCGSCW + + T S+ C EC N
Sbjct: 150 VKNQGSCGSCWSFSATGALEGANFL-ATGKLVSLSEQQLVDCDHECDPEEADSCDSGCNG 208
Query: 163 GAKSYSVSSNEKS--IMKE-IYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKW 219
G + + K+ +MKE Y + +G D + S F + +S+ +
Sbjct: 209 GLMNSAFEYTLKTGGLMKEEDYPYTGKDGKTCKLDKSKIVASVSNF------SVISIDEE 262
Query: 220 TIRDNTSQLGAEGAFTVFDDLILYKSGKALG-------GHAIRILGWGED----EKSKEK 268
I N + G + Y G + H + ++G+G + KEK
Sbjct: 263 QIAANLVKNGPLAVAINAGYMQTYIGGVSCPYICTRRLNHGVLLVGYGAAGYAPARFKEK 322
Query: 269 -YWLIANSWNTDWGDNGLFKILRGKDECGIESSIT 302
YW+I NSW WG+NG +KI +G++ CG++S ++
Sbjct: 323 PYWIIKNSWGETWGENGFYKICKGRNICGVDSMVS 357
>sp|P25804|CYSP_PEA Cysteine proteinase 15A OS=Pisum sativum PE=2 SV=1
Length = 363
Score = 62.4 bits (150), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 68/282 (24%), Positives = 115/282 (40%), Gaps = 67/282 (23%)
Query: 55 KSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
+ ++G+ LPA+ I + +LP +FD R K P ++DQGSCGSCW
Sbjct: 106 RQFLGLKKRLRLPAHAQKAPILPTT---NLPEDFDWREKGAVTP----VKDQGSCGSCWA 158
Query: 115 --------------------CRPYEIAPCEHHVNGTRP-SCDA--SKGHTPKCVRECQEN 151
++ C+H + + SCD+ + G E+
Sbjct: 159 FSTTGALEGAHYLATGKLVSLSEQQLVDCDHVCDPEQAGSCDSGCNGGLMNNAFEYLLES 218
Query: 152 YDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLI---LYKSGRFFVPG 208
V +KD + + S ++ ++ + V T+ +D I L K+G V
Sbjct: 219 GGVVQEKDYAYTGRDGSCKFDKSKVVASVSNFSVV----TLDEDQIAANLVKNGPLAVAI 274
Query: 209 NET---TAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDE-- 263
N T MS + Y K+ H + ++G+G+
Sbjct: 275 NAAWMQTYMSGVSCP----------------------YVCAKSRLDHGVLLVGFGKGAYA 312
Query: 264 --KSKEK-YWLIANSWNTDWGDNGLFKILRGKDECGIESSIT 302
+ KEK YW+I NSW +WG+ G +KI RG++ CG++S ++
Sbjct: 313 PIRLKEKPYWIIKNSWGQNWGEQGYYKICRGRNVCGVDSMVS 354
>sp|P00786|CATH_RAT Pro-cathepsin H OS=Rattus norvegicus GN=Ctsh PE=1 SV=1
Length = 333
Score = 61.2 bits (147), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 50/86 (58%), Gaps = 12/86 (13%)
Query: 233 AFTVFDDLILYKSGKALGG----------HAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
AF V +D ++YKSG HA+ +G+GE ++ YW++ NSW ++WG+
Sbjct: 250 AFEVTEDFMMYKSGVYSSNSCHKTPDKVNHAVLAVGYGE--QNGLLYWIVKNSWGSNWGN 307
Query: 283 NGLFKILRGKDECGIESSITAGVPKL 308
NG F I RGK+ CG+ + + +P++
Sbjct: 308 NGYFLIERGKNMCGLAACASYPIPQV 333
>sp|Q80LP4|CATV_NPVAH Viral cathepsin OS=Adoxophyes honmai nucleopolyhedrovirus GN=VCATH
PE=3 SV=1
Length = 337
Score = 60.8 bits (146), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 58/232 (25%), Positives = 98/232 (42%), Gaps = 27/232 (11%)
Query: 79 EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASK 138
+V ++LP NFD W + ++DQG+CGSCW H GT + A K
Sbjct: 121 DVHDELPQNFD----WRVNNKMTSVKDQGACGSCWA----------HAAVGTLETLYAIK 166
Query: 139 GHTPKCVRECQENYDVPYKK---DLNFGAKSYSVSSNEKSIMKEI-YEHGPVEGAFTVFD 194
+ + E Q+ D D ++ N +M+EI Y + +G + +
Sbjct: 167 HNYLINLSE-QQLIDCDSANMACDGGLMHTAFEQLMNAGGLMEEIDYPYQGTKGVCKIDN 225
Query: 195 D---LILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGG 251
L + R+ E LI T+ + A T +I + L
Sbjct: 226 KKFALSVSSCKRYIFQNEENLKKELI--TMGPIAMAIDAASISTYSKGIIHFCENLGLN- 282
Query: 252 HAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIESSITA 303
HA+ ++G+G + YW + NSW +DWG++G F++ R + CG+ + + A
Sbjct: 283 HAVLLVGYGTE--GGVSYWTLKNSWGSDWGEDGYFRVKRNINACGLNNQLAA 332
>sp|P49935|CATH_MOUSE Pro-cathepsin H OS=Mus musculus GN=Ctsh PE=2 SV=2
Length = 333
Score = 60.8 bits (146), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 30/86 (34%), Positives = 49/86 (56%), Gaps = 12/86 (13%)
Query: 233 AFTVFDDLILYKSGKALG----------GHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
AF V +D ++YKSG HA+ +G+GE ++ YW++ NSW + WG+
Sbjct: 250 AFEVTEDFLMYKSGVYSSKSCHKTPDKVNHAVLAVGYGE--QNGLLYWIVKNSWGSQWGE 307
Query: 283 NGLFKILRGKDECGIESSITAGVPKL 308
NG F I RGK+ CG+ + + +P++
Sbjct: 308 NGYFLIERGKNMCGLAACASYPIPQV 333
>sp|Q95029|CATL_DROME Cathepsin L OS=Drosophila melanogaster GN=Cp1 PE=2 SV=2
Length = 371
Score = 57.4 bits (137), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 26/50 (52%), Positives = 35/50 (70%), Gaps = 2/50 (4%)
Query: 252 HAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD-ECGIESS 300
H + ++G+G DE S E YWL+ NSW T WGD G K+LR K+ +CGI S+
Sbjct: 317 HGVLVVGFGTDE-SGEDYWLVKNSWGTTWGDKGFIKMLRNKENQCGIASA 365
Score = 31.6 bits (70), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 14/31 (45%), Positives = 18/31 (58%), Gaps = 4/31 (12%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCWG 114
LP + D RTK + ++DQG CGSCW
Sbjct: 154 LPKSVDWRTK----GAVTAVKDQGHCGSCWA 180
>sp|Q10716|CYSP1_MAIZE Cysteine proteinase 1 OS=Zea mays GN=CCP1 PE=2 SV=1
Length = 371
Score = 57.0 bits (136), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 57/235 (24%), Positives = 101/235 (42%), Gaps = 26/235 (11%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPK 143
LP +FD W + + +++QGSCGSCW H++ + S+
Sbjct: 137 LPDDFD----WRDHGAVGPVKNQGSCGSCWSFSASGALEGAHYLATGKLEV-LSEQQFVD 191
Query: 144 CVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGR 203
C EC + N G + + S +K+ E + P G+ D + +
Sbjct: 192 CDHECDSSEPDSCDSGCNGGLMTTAFSYLQKAGGLESEKDYPYTGS----DGKCKFDKSK 247
Query: 204 FFVPGNETTAMSLIKWTIRDNTSQLG--AEGAFTVFDDLIL------YKSGKALGGHAIR 255
+ +S+ + I N + G A G + + Y G+ L H +
Sbjct: 248 IVASVQNFSVVSVDEAQISANLIKHGPLAIGINAAYMQTYIGGVSCPYICGRHLD-HGVL 306
Query: 256 ILGWGEDE----KSKEK-YWLIANSWNTDWGDNGLFKILRG---KDECGIESSIT 302
++G+G + K+K YW+I NSW +WG+NG +KI RG +++CG++S ++
Sbjct: 307 LVGYGASGFAPIRLKDKPYWIIKNSWGENWGENGYYKICRGSNVRNKCGVDSMVS 361
>sp|P25778|ORYC_ORYSJ Oryzain gamma chain OS=Oryza sativa subsp. japonica GN=Os09g0442300
PE=2 SV=2
Length = 362
Score = 57.0 bits (136), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 31/77 (40%), Positives = 41/77 (53%), Gaps = 12/77 (15%)
Query: 233 AFTVFDDLILYKSGKALG----------GHAIRILGWGEDEKSKEKYWLIANSWNTDWGD 282
AF V + +YKSG HA+ +G+G + + YWLI NSW DWGD
Sbjct: 280 AFQVINGFRMYKSGVYTSDHCGTSPMDVNHAVLAVGYGVE--NGVPYWLIKNSWGADWGD 337
Query: 283 NGLFKILRGKDECGIES 299
NG FK+ GK+ CGI +
Sbjct: 338 NGYFKMEMGKNMCGIAT 354
>sp|P04988|CYSP1_DICDI Cysteine proteinase 1 OS=Dictyostelium discoideum GN=cprA PE=1 SV=2
Length = 343
Score = 56.6 bits (135), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 60/238 (25%), Positives = 95/238 (39%), Gaps = 19/238 (7%)
Query: 79 EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASK 138
E +P FD RT+ P +++QG CGSCW +H ++ + S+
Sbjct: 113 EFINSIPTAFDWRTRGAVTP----VKNQGQCGSCWSFSTTGNVEGQHFISQNKL-VSLSE 167
Query: 139 GHTPKCVRECQENYDVPYKKD------LNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTV 192
+ C EC E Y+ D L A +Y + N + Y + G
Sbjct: 168 QNLVDCDHECME-YEGEQACDEGCNGGLQPNAYNYIIK-NGGIQTESSYPYTAETGTQCN 225
Query: 193 FDDL-ILYKSGRF-FVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALG 250
F+ I K F +P NET I T + E F + + + +L
Sbjct: 226 FNSANIGAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLD 285
Query: 251 GHAIRILGWGEDEKSKEK---YWLIANSWNTDWGDNGLFKILRGKDECGIESSITAGV 305
H I I+G+ K YW++ NSW DWG+ G + RGK+ CG+ + ++ +
Sbjct: 286 -HGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSI 342
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.317 0.137 0.447
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 132,098,251
Number of Sequences: 539616
Number of extensions: 5941334
Number of successful extensions: 11865
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 192
Number of HSP's successfully gapped in prelim test: 20
Number of HSP's that attempted gapping in prelim test: 11199
Number of HSP's gapped (non-prelim): 587
length of query: 309
length of database: 191,569,459
effective HSP length: 117
effective length of query: 192
effective length of database: 128,434,387
effective search space: 24659402304
effective search space used: 24659402304
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 61 (28.1 bits)