RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy7460
(1026 letters)
>gnl|CDD|239068 cd02248, Peptidase_C1A, Peptidase C1A subfamily (MEROPS database
nomenclature); composed of cysteine peptidases (CPs)
similar to papain, including the mammalian CPs
(cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain
is an endopeptidase with specific substrate preferences,
primarily for bulky hydrophobic or aromatic residues at
the S2 subsite, a hydrophobic pocket in papain that
accommodates the P2 sidechain of the substrate (the
second residue away from the scissile bond). Most
members of the papain subfamily are endopeptidases. Some
exceptions to this rule can be explained by specific
details of the catalytic domains like the occluding loop
in cathepsin B which confers an additional
carboxydipeptidyl activity and the mini-chain of
cathepsin H resulting in an N-terminal exopeptidase
activity. Papain-like CPs have different functions in
various organisms. Plant CPs are used to mobilize
storage proteins in seeds. Parasitic CPs act
extracellularly to help invade tissues and cells, to
hatch or to evade the host immune system. Mammalian CPs
are primarily lysosomal enzymes with the exception of
cathepsin W, which is retained in the endoplasmic
reticulum. They are responsible for protein degradation
in the lysosome. Papain-like CPs are synthesized as
inactive proenzymes with N-terminal propeptide regions,
which are removed upon activation. In addition to its
inhibitory role, the propeptide is required for proper
folding of the newly synthesized enzyme and its
stabilization in denaturing pH conditions. Residues
within the propeptide region also play a role in the
transport of the proenzyme to lysosomes or acidified
vesicles. Also included in this subfamily are proteins
classified as non-peptidase homologs, which lack
peptidase activity or have missing active site residues.
Length = 210
Score = 222 bits (569), Expect = 6e-67
Identities = 95/215 (44%), Positives = 124/215 (57%), Gaps = 8/215 (3%)
Query: 12 PDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS 71
P++ DWR+K P DQ CGSCWAFS G LEG YAIKTGKLV S+ QLV+C+ +
Sbjct: 1 PESVDWREKGAVTPVKDQGSCGSCWAFSTVGALEGAYAIKTGKLVSLSEQQLVDCSTSGN 60
Query: 72 -GCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLF-TGKDFLHFNGS 129
GC+G + + EY GL SE DYPY +G C Y+ SKV TG +
Sbjct: 61 NGCNGGNPDNAFEYVKNGGLASESDYPYTGKDG---TCKYNSSKVGAKITGYSNVPPGDE 117
Query: 130 ETMKKILYKYGPLSVLLN-SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPY 188
E +K L YGP+SV ++ S Y G + CS +L HAVLLVGYG ++ + Y
Sbjct: 118 EALKAALANYGPVSVAIDASSSFQFYKGGIY--SGPCCSNTNLNHAVLLVGYGTENGVDY 175
Query: 189 WLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYA 223
W+V+NSWG ++G+ +I RG+N CGI A Y
Sbjct: 176 WIVKNSWGTSWGEKGYIRIARGSNLCGIASYASYP 210
Score = 222 bits (567), Expect = 1e-66
Identities = 95/215 (44%), Positives = 125/215 (58%), Gaps = 8/215 (3%)
Query: 743 PDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS 802
P++ DWR+K P DQ +CGSCWAFS G LEG YAIKTGKLV S+ QLV+C+ +
Sbjct: 1 PESVDWREKGAVTPVKDQGSCGSCWAFSTVGALEGAYAIKTGKLVSLSEQQLVDCSTSGN 60
Query: 803 -GCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLF-TGKDFLHFNGS 860
GC+G + + EY GL SE DYPY +G C Y+ SKV TG +
Sbjct: 61 NGCNGGNPDNAFEYVKNGGLASESDYPYTGKDG---TCKYNSSKVGAKITGYSNVPPGDE 117
Query: 861 ETMKKILYKYGPLSVLLN-SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPY 919
E +K L YGP+SV ++ S Y G + CS +L HAVLLVGYG ++ + Y
Sbjct: 118 EALKAALANYGPVSVAIDASSSFQFYKGGIY--SGPCCSNTNLNHAVLLVGYGTENGVDY 175
Query: 920 WLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYA 954
W+V+NSWG ++G+ +I RG+N CGI A Y
Sbjct: 176 WIVKNSWGTSWGEKGYIRIARGSNLCGIASYASYP 210
Score = 221 bits (566), Expect = 2e-66
Identities = 97/217 (44%), Positives = 125/217 (57%), Gaps = 11/217 (5%)
Query: 377 PDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCS 436
P++ DWR+K P DQ +CGSCWAFS G LEG YAIKTGKLV S+ QLV+C+ S
Sbjct: 1 PESVDWREKGAVTPVKDQGSCGSCWAFSTVGALEGAYAIKTGKLVSLSEQQLVDCST--S 58
Query: 437 GCGGCDG--LEQPIEYTHQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLF-TGKDFLYFN 493
G GC+G + EY GL SE DYPY +G C Y+ SKV TG +
Sbjct: 59 GNNGCNGGNPDNAFEYVKNGGLASESDYPYTGKDG---TCKYNSSKVGAKITGYSNVPPG 115
Query: 494 GSETMKKILYKYGPLSVGLN-SHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDI 552
E +K L YGP+SV ++ S FY G + CS +L HAVLLVGYG ++ +
Sbjct: 116 DEEALKAALANYGPVSVAIDASSSFQFYKGGIY--SGPCCSNTNLNHAVLLVGYGTENGV 173
Query: 553 PYWLARNSWGPIGPDEGFFKIERGNNACGIEQIAGYA 589
YW+ +NSWG ++G+ +I RG+N CGI A Y
Sbjct: 174 DYWIVKNSWGTSWGEKGYIRIARGSNLCGIASYASYP 210
Score = 91.5 bits (228), Expect = 7e-21
Identities = 29/62 (46%), Positives = 41/62 (66%)
Query: 961 NDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAG 1020
+ CS +L HAVLLVGYG ++ + YW+V+NSWG ++G+ +I RG+N CGI A
Sbjct: 149 SGPCCSNTNLNHAVLLVGYGTENGVDYWIVKNSWGTSWGEKGYIRIARGSNLCGIASYAS 208
Query: 1021 YA 1022
Y
Sbjct: 209 YP 210
>gnl|CDD|215726 pfam00112, Peptidase_C1, Papain family cysteine protease.
Length = 213
Score = 220 bits (564), Expect = 4e-66
Identities = 90/219 (41%), Positives = 124/219 (56%), Gaps = 11/219 (5%)
Query: 11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
+P+++DWR+K P DQ CGSCWAFS G LEG+Y IKTGKLV S+ QLV+C
Sbjct: 1 LPESFDWREKGAVTPVKDQGQCGSCWAFSAVGALEGRYCIKTGKLVSLSEQQLVDCDTGN 60
Query: 71 SGCDGCFFEPSIEYTHQA-GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDF--LHFN 127
+GC+G + + EY + G+ +E DYPY +G C + KS K K + + +N
Sbjct: 61 NGCNGGLPDNAFEYIKKNGGIVTESDYPYTAHDG---TCKFKKSNSKYAKIKGYGDVPYN 117
Query: 128 GSETMKKILYKYGPLSVLLN--SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDN 185
E ++ L K GP+SV ++ D Y CS L HAVL+VGYG ++
Sbjct: 118 DEEALQAALAKNGPVSVAIDAYEDDFQLYKSGVY--KHTECSGE-LDHAVLIVGYGTENG 174
Query: 186 IPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 224
+PYW+V+NSWG + G+F+I RG N CGI A Y
Sbjct: 175 VPYWIVKNSWGTDWGENGYFRIARGVNECGIASEASYPI 213
Score = 220 bits (563), Expect = 6e-66
Identities = 90/219 (41%), Positives = 124/219 (56%), Gaps = 11/219 (5%)
Query: 742 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 801
+P+++DWR+K P DQ CGSCWAFS G LEG+Y IKTGKLV S+ QLV+C
Sbjct: 1 LPESFDWREKGAVTPVKDQGQCGSCWAFSAVGALEGRYCIKTGKLVSLSEQQLVDCDTGN 60
Query: 802 SGCDGCFFEPSIEYTHQA-GLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDF--LHFN 858
+GC+G + + EY + G+ +E DYPY +G C + KS K K + + +N
Sbjct: 61 NGCNGGLPDNAFEYIKKNGGIVTESDYPYTAHDG---TCKFKKSNSKYAKIKGYGDVPYN 117
Query: 859 GSETMKKILYKYGPLSVLLN--SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDN 916
E ++ L K GP+SV ++ D Y CS L HAVL+VGYG ++
Sbjct: 118 DEEALQAALAKNGPVSVAIDAYEDDFQLYKSGVY--KHTECSGE-LDHAVLIVGYGTENG 174
Query: 917 IPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 955
+PYW+V+NSWG + G+F+I RG N CGI A Y
Sbjct: 175 VPYWIVKNSWGTDWGENGYFRIARGVNECGIASEASYPI 213
Score = 213 bits (545), Expect = 2e-63
Identities = 91/227 (40%), Positives = 125/227 (55%), Gaps = 26/227 (11%)
Query: 376 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 435
+P+++DWR+K P DQ CGSCWAFS G LEG+Y IKTGKLV S+ QLV+C
Sbjct: 1 LPESFDWREKGAVTPVKDQGQCGSCWAFSAVGALEGRYCIKTGKLVSLSEQQLVDC---D 57
Query: 436 SGCGGCDG--LEQPIEYTHQA-GLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLY- 491
+G GC+G + EY + G+ +E DYPY +G C + KS K K +
Sbjct: 58 TGNNGCNGGLPDNAFEYIKKNGGIVTESDYPYTAHDG---TCKFKKSNSKYAKIKGYGDV 114
Query: 492 -FNGSETMKKILYKYGPLSVGLNSHLIHF-------YNGTPIRKNDETCSPYDLGHAVLL 543
+N E ++ L K GP+SV ++++ F Y T CS L HAVL+
Sbjct: 115 PYNDEEALQAALAKNGPVSVAIDAYEDDFQLYKSGVYKHTE-------CSGE-LDHAVLI 166
Query: 544 VGYGKQDDIPYWLARNSWGPIGPDEGFFKIERGNNACGIEQIAGYAT 590
VGYG ++ +PYW+ +NSWG + G+F+I RG N CGI A Y
Sbjct: 167 VGYGTENGVPYWIVKNSWGTDWGENGYFRIARGVNECGIASEASYPI 213
Score = 88.4 bits (220), Expect = 1e-19
Identities = 30/63 (47%), Positives = 39/63 (61%), Gaps = 1/63 (1%)
Query: 961 NDETCSPYDLGHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAG 1020
CS L HAVL+VGYG ++ +PYW+V+NSWG + G+F+I RG N CGI A
Sbjct: 152 KHTECSGE-LDHAVLIVGYGTENGVPYWIVKNSWGTDWGENGYFRIARGVNECGIASEAS 210
Query: 1021 YAT 1023
Y
Sbjct: 211 YPI 213
>gnl|CDD|214761 smart00645, Pept_C1, Papain family cysteine protease.
Length = 175
Score = 170 bits (433), Expect = 1e-48
Identities = 80/218 (36%), Positives = 104/218 (47%), Gaps = 49/218 (22%)
Query: 11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
+P+++DWRKK P DQ CGSCWAFS G LEG+Y IKTGKLV S+ QLV+C+
Sbjct: 1 LPESFDWRKKGAVTPVKDQGQCGSCWAFSATGALEGRYCIKTGKLVSLSEQQLVDCSGGG 60
Query: 71 S-GCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNG 128
+ GC+G + + EY + GLE+E YPY + DF
Sbjct: 61 NCGCNGGLPDNAFEYIKKNGGLETESCYPYTGSVAID--------------ASDFQF--- 103
Query: 129 SETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ--DNI 186
YK G Y+ C L HAVL+VGYG + +
Sbjct: 104 --------YKSG------------IYDHP-------GCGSGTLDHAVLIVGYGTEVENGK 136
Query: 187 PYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYA 223
YW+V+NSWG + G+F+I RG NN CGIE
Sbjct: 137 DYWIVKNSWGTDWGENGYFRIARGKNNECGIEASVASY 174
Score = 169 bits (431), Expect = 2e-48
Identities = 80/218 (36%), Positives = 104/218 (47%), Gaps = 49/218 (22%)
Query: 742 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 801
+P+++DWRKK P DQ CGSCWAFS G LEG+Y IKTGKLV S+ QLV+C+
Sbjct: 1 LPESFDWRKKGAVTPVKDQGQCGSCWAFSATGALEGRYCIKTGKLVSLSEQQLVDCSGGG 60
Query: 802 S-GCDGCFFEPSIEYTHQ-AGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNG 859
+ GC+G + + EY + GLE+E YPY + DF
Sbjct: 61 NCGCNGGLPDNAFEYIKKNGGLETESCYPYTGSVAID--------------ASDFQF--- 103
Query: 860 SETMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ--DNI 917
YK G Y+ C L HAVL+VGYG + +
Sbjct: 104 --------YKSG------------IYDHP-------GCGSGTLDHAVLIVGYGTEVENGK 136
Query: 918 PYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQIAGYA 954
YW+V+NSWG + G+F+I RG NN CGIE
Sbjct: 137 DYWIVKNSWGTDWGENGYFRIARGKNNECGIEASVASY 174
Score = 161 bits (410), Expect = 1e-45
Identities = 79/220 (35%), Positives = 101/220 (45%), Gaps = 52/220 (23%)
Query: 376 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 435
+P+++DWRKK P DQ CGSCWAFS G LEG+Y IKTGKLV S+ QLV+C+
Sbjct: 1 LPESFDWRKKGAVTPVKDQGQCGSCWAFSATGALEGRYCIKTGKLVSLSEQQLVDCSGGG 60
Query: 436 SGCGGCDG--LEQPIEYTHQ-AGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYF 492
+ GC+G + EY + GLE+E YPY DF
Sbjct: 61 N--CGCNGGLPDNAFEYIKKNGGLETESCYPYTGSVAID--------------ASDFQ-- 102
Query: 493 NGSETMKKILYKYGPLSVGLNSHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQ--D 550
YK G Y+ C L HAVL+VGYG + +
Sbjct: 103 ---------FYKSG------------IYDHP-------GCGSGTLDHAVLIVGYGTEVEN 134
Query: 551 DIPYWLARNSWGPIGPDEGFFKIERG-NNACGIEQIAGYA 589
YW+ +NSWG + G+F+I RG NN CGIE
Sbjct: 135 GKDYWIVKNSWGTDWGENGYFRIARGKNNECGIEASVASY 174
Score = 79.9 bits (198), Expect = 4e-17
Identities = 28/65 (43%), Positives = 37/65 (56%), Gaps = 3/65 (4%)
Query: 961 NDETCSPYDLGHAVLLVGYGKQ--DDIPYWLVRNSWGPIGPDEGFFKIERG-NNACGIEQ 1017
+ C L HAVL+VGYG + + YW+V+NSWG + G+F+I RG NN CGIE
Sbjct: 110 DHPGCGSGTLDHAVLIVGYGTEVENGKDYWIVKNSWGTDWGENGYFRIARGKNNECGIEA 169
Query: 1018 IAGYA 1022
Sbjct: 170 SVASY 174
>gnl|CDD|185513 PTZ00203, PTZ00203, cathepsin L protease; Provisional.
Length = 348
Score = 151 bits (381), Expect = 7e-40
Identities = 91/305 (29%), Positives = 142/305 (46%), Gaps = 22/305 (7%)
Query: 658 FKAFIVKRGRQYANDEEIKERFEYFKQD--------GHKKHERYGTSEFSDRSPEEILCK 709
F+ F R Y E ++R F+++ H R+G ++F D S E +
Sbjct: 38 FEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 710 TGFKWSERTYERIVADREKVEKMLMEVEKD-GPVPDAWDWRKKNVTGPAGDQAACGSCWA 768
+ Y A ++ + + D VPDA DWR+K P +Q ACGSCWA
Sbjct: 98 Y---LNGAAY--FAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWA 152
Query: 769 FSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCDGCFFEPSIEY--THQAG-LESEK 825
FS G +E Q+A+ KLV S+ QLV C +GC G + E+ + G + +EK
Sbjct: 153 FSAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEK 212
Query: 826 DYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSE-TMKKILYKYGPLSVLLNSDLIHD 884
YPY + NG+ +C+ ++ SE M L K GP+S+ +++
Sbjct: 213 SYPYVSGNGDVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMS 272
Query: 885 YNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNA 944
Y+ + +C L H VLLVGY +PYW+++NSWG ++G+ ++ G NA
Sbjct: 273 YHSGVL----TSCIGEQLNHGVLLVGYNMTGEVPYWVIKNSWGEDWGEKGYVRVTMGVNA 328
Query: 945 CGIEQ 949
C +
Sbjct: 329 CLLTG 333
Score = 151 bits (381), Expect = 8e-40
Identities = 96/307 (31%), Positives = 146/307 (47%), Gaps = 25/307 (8%)
Query: 292 FKAFIVKRGRQYANDEEIKERFEYFKQD--------GHKKHERYGTSEFSDRSPEEILCK 343
F+ F R Y E ++R F+++ H R+G ++F D S E +
Sbjct: 38 FEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEFAAR 97
Query: 344 TGFKWSERTYERIVADREKVEKMLMEVEKD-GPVPDAWDWRKKNVTGPAGDQAACGSCWA 402
+ Y A ++ + + D VPDA DWR+K P +Q ACGSCWA
Sbjct: 98 Y---LNGAAY--FAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWA 152
Query: 403 FSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCDGLE-QPIEY--THQAG-LES 458
FS G +E Q+A+ KLV S+ QLV C +GCGG GL Q E+ + G + +
Sbjct: 153 FSAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDNGCGG--GLMLQAFEWVLRNMNGTVFT 210
Query: 459 EKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSE-TMKKILYKYGPLSVGLNSHLI 517
EK YPY +GNG+ +C+ ++ SE M L K GP+S+ +++
Sbjct: 211 EKSYPYVSGNGDVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDASSF 270
Query: 518 HFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFKIERGN 577
Y+ + +C L H VLLVGY ++PYW+ +NSWG ++G+ ++ G
Sbjct: 271 MSYHSGVL----TSCIGEQLNHGVLLVGYNMTGEVPYWVIKNSWGEDWGEKGYVRVTMGV 326
Query: 578 NACGIEQ 584
NAC +
Sbjct: 327 NACLLTG 333
Score = 140 bits (355), Expect = 2e-36
Identities = 73/212 (34%), Positives = 109/212 (51%), Gaps = 8/212 (3%)
Query: 11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
VPDA DWR+K P +Q CGSCWAFS G +E Q+A+ KLV S+ QLV C
Sbjct: 126 VPDAVDWREKGAVTPVKNQGACGSCWAFSAVGNIESQWAVAGHKLVRLSEQQLVSCDHVD 185
Query: 71 SGCDGCFFEPSIEY--THQAG-LESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFN 127
+GC G + E+ + G + +EK YPY + NG+ +C+ ++
Sbjct: 186 NGCGGGLMLQAFEWVLRNMNGTVFTEKSYPYVSGNGDVPECSNSSELAPGARIDGYVSME 245
Query: 128 GSE-TMKKILYKYGPLSVLLNSDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNI 186
SE M L K GP+S+ +++ Y+ + +C L H VLLVGY +
Sbjct: 246 SSERVMAAWLAKNGPISIAVDASSFMSYHSGVL----TSCIGEQLNHGVLLVGYNMTGEV 301
Query: 187 PYWLVRNSWGPIGPDEGFFKIERGNNACGIEQ 218
PYW+++NSWG ++G+ ++ G NAC +
Sbjct: 302 PYWVIKNSWGEDWGEKGYVRVTMGVNACLLTG 333
>gnl|CDD|240310 PTZ00200, PTZ00200, cysteine proteinase; Provisional.
Length = 448
Score = 144 bits (365), Expect = 1e-36
Identities = 103/364 (28%), Positives = 151/364 (41%), Gaps = 54/364 (14%)
Query: 623 LPSLTDRITDQVVARVDTLAIEGSLTFDNENILET---FKAFIVKRGRQYANDEEIKERF 679
P L D V + L +G ++ D + E F+ F K R++A E RF
Sbjct: 88 FPRLDKSKRDSYVDELTRLFKDGYISDDPKLEFEVYLEFEEFNKKYNRKHATHAERLNRF 147
Query: 680 EYFKQD-----GHKKHERY--GTSEFSDRSPEEI--------LCKTGFKWSERTY--ERI 722
F+ + HK E Y ++FSD + EE + S R
Sbjct: 148 LTFRNNYLEVKSHKGDEPYSKEINKFSDLTEEEFRKLFPVIKVPPKSNSTSHNNDFKARH 207
Query: 723 VADREKVEKMLMEVEKDGPVPDA-------WDWRKKNVTGPAGDQAA-CGSCWAFSIAGM 774
V++ ++ + D V D DWR+ + DQ CGSCWAFS G
Sbjct: 208 VSNPTYLKNLKKAKNTDEDVKDPSKITGEGLDWRRADAVTKVKDQGLNCGSCWAFSSVGS 267
Query: 775 LEGQYAIKTGKLVEFSKSQLVECAKQCSGCDGCFFEPSIEYTHQAGLESEKDYPYKNANG 834
+E Y I K V+ S+ +LV C + GC G + + ++EY GL S D PY +G
Sbjct: 268 VESLYKIYRDKSVDLSEQELVNCDTKSQGCSGGYPDTALEYVKNKGLSSSSDVPYLAKDG 327
Query: 835 EKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSVLLNS--DLIHD----YNGT 888
KC +K +L G + + K L P V + +L+ YNG
Sbjct: 328 ---KCVVSSTKKVYIDS--YLVAKGKDVLNKSLV-ISPTVVYIAVSRELLKYKSGVYNG- 380
Query: 889 PIRKNDETCSPYDLGHAVLLV--GYGKQDNIPYWLVRNSWGPIGPDEGFFKIER---GNN 943
C L HAVLLV GY ++ YW+++NSWG + G+ ++ER G +
Sbjct: 381 -------ECGKS-LNHAVLLVGEGYDEKTKKRYWIIKNSWGTDWGENGYMRLERTNEGTD 432
Query: 944 ACGI 947
CGI
Sbjct: 433 KCGI 436
Score = 140 bits (355), Expect = 2e-35
Identities = 103/366 (28%), Positives = 148/366 (40%), Gaps = 57/366 (15%)
Query: 257 LPSLTDRITDQVVARVDTLAIEGSLTFDNENILET---FKAFIVKRGRQYANDEEIKERF 313
P L D V + L +G ++ D + E F+ F K R++A E RF
Sbjct: 88 FPRLDKSKRDSYVDELTRLFKDGYISDDPKLEFEVYLEFEEFNKKYNRKHATHAERLNRF 147
Query: 314 EYFKQD-----GHKKHERY--GTSEFSDRSPEEI--------LCKTGFKWSERTY--ERI 356
F+ + HK E Y ++FSD + EE + S R
Sbjct: 148 LTFRNNYLEVKSHKGDEPYSKEINKFSDLTEEEFRKLFPVIKVPPKSNSTSHNNDFKARH 207
Query: 357 VADREKVEKMLMEVEKDGPVPDA-------WDWRKKNVTGPAGDQAA-CGSCWAFSIAGM 408
V++ ++ + D V D DWR+ + DQ CGSCWAFS G
Sbjct: 208 VSNPTYLKNLKKAKNTDEDVKDPSKITGEGLDWRRADAVTKVKDQGLNCGSCWAFSSVGS 267
Query: 409 LEGQYAIKTGKLVEFSKSQLVECAKQCSGCGGCDGL-EQPIEYTHQAGLESEKDYPYRNG 467
+E Y I K V+ S+ +LV C + GC G G + +EY GL S D PY
Sbjct: 268 VESLYKIYRDKSVDLSEQELVNCDTKSQGCSG--GYPDTALEYVKNKGLSSSSDVPYL-- 323
Query: 468 NGEKFKCAYDKSKVKLFTGKDFLYFNGSETMKKILYKYGPLSVGLNSH--LIHF----YN 521
+ KC +K +L G + + K L P V + L+ + YN
Sbjct: 324 -AKDGKCVVSSTKKVYIDS--YLVAKGKDVLNKSLV-ISPTVVYIAVSRELLKYKSGVYN 379
Query: 522 GTPIRKNDETCSPYDLGHAVLLV--GYGKQDDIPYWLARNSWGPIGPDEGFFKIER---G 576
G C L HAVLLV GY ++ YW+ +NSWG + G+ ++ER G
Sbjct: 380 G--------ECGKS-LNHAVLLVGEGYDEKTKKRYWIIKNSWGTDWGENGYMRLERTNEG 430
Query: 577 NNACGI 582
+ CGI
Sbjct: 431 TDKCGI 436
Score = 128 bits (324), Expect = 2e-31
Identities = 71/213 (33%), Positives = 101/213 (47%), Gaps = 27/213 (12%)
Query: 16 DWRKKNVTGPAGDQA-DCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCD 74
DWR+ + DQ +CGSCWAFS G +E Y I K V+ S+ +LV C + GC
Sbjct: 239 DWRRADAVTKVKDQGLNCGSCWAFSSVGSVESLYKIYRDKSVDLSEQELVNCDTKSQGCS 298
Query: 75 GCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKK 134
G + + ++EY GL S D PY +G KC +K +L G + + K
Sbjct: 299 GGYPDTALEYVKNKGLSSSSDVPYLAKDG---KCVVSSTKKVYIDS--YLVAKGKDVLNK 353
Query: 135 ILYKYGPLSVLLNS--DLIHD----YNGTPIRKNDETCSPYDLGHAVLLV--GYGKQDNI 186
L P V + +L+ YNG C L HAVLLV GY ++
Sbjct: 354 SLV-ISPTVVYIAVSRELLKYKSGVYNG--------ECGKS-LNHAVLLVGEGYDEKTKK 403
Query: 187 PYWLVRNSWGPIGPDEGFFKIER---GNNACGI 216
YW+++NSWG + G+ ++ER G + CGI
Sbjct: 404 RYWIIKNSWGTDWGENGYMRLERTNEGTDKCGI 436
Score = 54.7 bits (132), Expect = 2e-07
Identities = 23/56 (41%), Positives = 33/56 (58%), Gaps = 6/56 (10%)
Query: 965 CSPYDLGHAVLLVG--YGKQDDIPYWLVRNSWGPIGPDEGFFKIER---GNNACGI 1015
C L HAVLLVG Y ++ YW+++NSWG + G+ ++ER G + CGI
Sbjct: 382 CGKS-LNHAVLLVGEGYDEKTKKRYWIIKNSWGTDWGENGYMRLERTNEGTDKCGI 436
>gnl|CDD|240232 PTZ00021, PTZ00021, falcipain-2; Provisional.
Length = 489
Score = 141 bits (358), Expect = 1e-35
Identities = 103/319 (32%), Positives = 151/319 (47%), Gaps = 46/319 (14%)
Query: 652 ENILETFKAFIVKRGRQYANDEEIKERFEYFKQDGHK--KHE-------RYGTSEFSDRS 702
EN+ +F FI + G++Y +E+++R+ F ++ K H + G + F D S
Sbjct: 164 ENV-NSFYLFIKEHGKKYQTPDEMQQRYLSFVENLAKINAHNNKENVLYKKGMNRFGDLS 222
Query: 703 PEEI------LCKTGFKWSERTYERIVADREKVEKMLMEVEKDGPVPD-AWDWRKKNVTG 755
EE L FK S V + + V K KD +DWR N
Sbjct: 223 FEEFKKKYLTLKSFDFK-SNGKKSPRVINYDDVIKKYKP--KDATFDHAKYDWRLHNGVT 279
Query: 756 PAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCDGCF----FEP 811
P DQ CGSCWAFS G++E QYAI+ +LV S+ +LV+C+ + +GC G FE
Sbjct: 280 PVKDQKNCGSCWAFSTVGVVESQYAIRKNELVSLSEQELVDCSFKNNGCYGGLIPNAFED 339
Query: 812 SIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYG 871
IE GL SE DYPY + E C D+ K K + K ++ + K+ + G
Sbjct: 340 MIEL---GGLCSEDDYPYVSDTPE--LCNIDRCKEK-YKIKSYVSIP-EDKFKEAIRFLG 392
Query: 872 PLSVLLN-SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIP----------YW 920
P+SV + SD Y G D C + HAV+LVGYG ++ Y+
Sbjct: 393 PISVSIAVSDDFAFYKGGIF---DGECG-EEPNHAVILVGYGMEEIYNSDTKKMEKRYYY 448
Query: 921 LVRNSWGPIGPDEGFFKIE 939
+++NSWG ++GF +IE
Sbjct: 449 IIKNSWGESWGEKGFIRIE 467
Score = 133 bits (337), Expect = 7e-33
Identities = 102/319 (31%), Positives = 150/319 (47%), Gaps = 45/319 (14%)
Query: 286 ENILETFKAFIVKRGRQYANDEEIKERFEYFKQDGHK--KHE-------RYGTSEFSDRS 336
EN+ +F FI + G++Y +E+++R+ F ++ K H + G + F D S
Sbjct: 164 ENV-NSFYLFIKEHGKKYQTPDEMQQRYLSFVENLAKINAHNNKENVLYKKGMNRFGDLS 222
Query: 337 PEEI------LCKTGFKWSERTYERIVADREKVEKMLMEVEKDGPVPD-AWDWRKKNVTG 389
EE L FK S V + + V K KD +DWR N
Sbjct: 223 FEEFKKKYLTLKSFDFK-SNGKKSPRVINYDDVIKKYKP--KDATFDHAKYDWRLHNGVT 279
Query: 390 PAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGCGG---CDGLEQ 446
P DQ CGSCWAFS G++E QYAI+ +LV S+ +LV+C+ + +GC G + E
Sbjct: 280 PVKDQKNCGSCWAFSTVGVVESQYAIRKNELVSLSEQELVDCSFKNNGCYGGLIPNAFED 339
Query: 447 PIEYTHQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDFLYFNGSETMKKILYKYG 506
IE GL SE DYPY + E C D+ K K + K ++ + K+ + G
Sbjct: 340 MIEL---GGLCSEDDYPYVSDTPE--LCNIDRCKEK-YKIKSYVSIP-EDKFKEAIRFLG 392
Query: 507 PLSVGLN-SHLIHFYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDDIP----------YW 555
P+SV + S FY G D C + HAV+LVGYG ++ Y+
Sbjct: 393 PISVSIAVSDDFAFYKGGIF---DGECG-EEPNHAVILVGYGMEEIYNSDTKKMEKRYYY 448
Query: 556 LARNSWGPIGPDEGFFKIE 574
+ +NSWG ++GF +IE
Sbjct: 449 IIKNSWGESWGEKGFIRIE 467
Score = 125 bits (315), Expect = 5e-30
Identities = 76/210 (36%), Positives = 109/210 (51%), Gaps = 26/210 (12%)
Query: 14 AWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQCSGC 73
+DWR N P DQ +CGSCWAFS G++E QYAI+ +LV S+ +LV+C+ + +GC
Sbjct: 269 KYDWRLHNGVTPVKDQKNCGSCWAFSTVGVVESQYAIRKNELVSLSEQELVDCSFKNNGC 328
Query: 74 DGCF----FEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDFLHFNGS 129
G FE IE GL SE DYPY + E C D+ K K + K ++
Sbjct: 329 YGGLIPNAFEDMIEL---GGLCSEDDYPYVSDTPE--LCNIDRCKEK-YKIKSYVSIP-E 381
Query: 130 ETMKKILYKYGPLSVLLN-SDLIHDYNGTPIRKNDETCSPYDLGHAVLLVGYGKQDNIP- 187
+ K+ + GP+SV + SD Y G D C + HAV+LVGYG ++
Sbjct: 382 DKFKEAIRFLGPISVSIAVSDDFAFYKGGIF---DGECG-EEPNHAVILVGYGMEEIYNS 437
Query: 188 ---------YWLVRNSWGPIGPDEGFFKIE 208
Y++++NSWG ++GF +IE
Sbjct: 438 DTKKMEKRYYYIIKNSWGESWGEKGFIRIE 467
>gnl|CDD|239112 cd02621, Peptidase_C1A_CathepsinC, Cathepsin C; also known as
Dipeptidyl Peptidase I (DPPI), an atypical papain-like
cysteine peptidase with chloride dependency and
dipeptidyl aminopeptidase activity, resulting from its
tetrameric structure which limits substrate access. Each
subunit of the tetramer is composed of three peptides:
the heavy and light chains, which together adopts the
papain fold and forms the catalytic domain; and the
residual propeptide region, which forms a beta barrel
and points towards the substrate's N-terminus. The
subunit composition is the result of the unique
characteristic of procathepsin C maturation involving
the cleavage of the catalytic domain and the
non-autocatalytic excision of an activation peptide
within its propeptide region. By removing N-terminal
dipeptide extensions, cathepsin C activates granule
serine peptidases (granzymes) involved in cell-mediated
apoptosis, inflammation and tissue remodelling.
Loss-of-function mutations in cathepsin C are associated
with Papillon-Lefevre and Haim-Munk syndromes, rare
diseases characterized by hyperkeratosis and early-onset
periodontitis. Cathepsin C is widely expressed in many
tissues with high levels in lung, kidney and placenta.
It is also highly expressed in cytotoxic lymphocytes and
mature myeloid cells.
Length = 243
Score = 134 bits (340), Expect = 2e-35
Identities = 73/242 (30%), Positives = 110/242 (45%), Gaps = 31/242 (12%)
Query: 12 PDAWDWR----KKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVE------FSKS 61
P ++DW N P +Q CGSC+AF+ LE + I + K S
Sbjct: 2 PKSFDWGDVNNGFNYVSPVRNQGGCGSCYAFASVYALEARIMIASNKTDPLGQQPILSPQ 61
Query: 62 QLVECAKQCSGCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGK 121
++ C++ GCDG F ++ G+ +E +PY + C S+ + +
Sbjct: 62 HVLSCSQYSQGCDGGFPFLVGKFAEDFGIVTEDYFPYTADDDRP--CKASPSECRRYYFS 119
Query: 122 DFLHFNG------SETMKKILYKYGPLSVLL--NSDLIHDYNGT-PIRKNDETCS----- 167
D+ + G + MK +Y+ GP+ V SD G NDE
Sbjct: 120 DYNYVGGCYGCTNEDEMKWEIYRNGPIVVAFEVYSDFDFYKEGVYHHTDNDEVSDGDNDN 179
Query: 168 --PYDL-GHAVLLVGYGKQD--NIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGY 222
P++L HAVLLVG+G+ + YW+V+NSWG ++G+FKI RG N CGIE A +
Sbjct: 180 FNPFELTNHAVLLVGWGEDEIKGEKYWIVKNSWGSSWGEKGYFKIRRGTNECGIESQAVF 239
Query: 223 AT 224
A
Sbjct: 240 AY 241
Score = 134 bits (340), Expect = 2e-35
Identities = 73/242 (30%), Positives = 110/242 (45%), Gaps = 31/242 (12%)
Query: 743 PDAWDWR----KKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVE------FSKS 792
P ++DW N P +Q CGSC+AF+ LE + I + K S
Sbjct: 2 PKSFDWGDVNNGFNYVSPVRNQGGCGSCYAFASVYALEARIMIASNKTDPLGQQPILSPQ 61
Query: 793 QLVECAKQCSGCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGK 852
++ C++ GCDG F ++ G+ +E +PY + C S+ + +
Sbjct: 62 HVLSCSQYSQGCDGGFPFLVGKFAEDFGIVTEDYFPYTADDDRP--CKASPSECRRYYFS 119
Query: 853 DFLHFNG------SETMKKILYKYGPLSVLL--NSDLIHDYNGT-PIRKNDETCS----- 898
D+ + G + MK +Y+ GP+ V SD G NDE
Sbjct: 120 DYNYVGGCYGCTNEDEMKWEIYRNGPIVVAFEVYSDFDFYKEGVYHHTDNDEVSDGDNDN 179
Query: 899 --PYDL-GHAVLLVGYGKQD--NIPYWLVRNSWGPIGPDEGFFKIERGNNACGIEQIAGY 953
P++L HAVLLVG+G+ + YW+V+NSWG ++G+FKI RG N CGIE A +
Sbjct: 180 FNPFELTNHAVLLVGWGEDEIKGEKYWIVKNSWGSSWGEKGYFKIRRGTNECGIESQAVF 239
Query: 954 AT 955
A
Sbjct: 240 AY 241
Score = 128 bits (324), Expect = 3e-33
Identities = 71/244 (29%), Positives = 109/244 (44%), Gaps = 34/244 (13%)
Query: 377 PDAWDWR----KKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVE------FSKS 426
P ++DW N P +Q CGSC+AF+ LE + I + K S
Sbjct: 2 PKSFDWGDVNNGFNYVSPVRNQGGCGSCYAFASVYALEARIMIASNKTDPLGQQPILSPQ 61
Query: 427 QLVECAKQCSGCGGCDGLEQPI-EYTHQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFT 485
++ C++ GC G G + ++ G+ +E +PY + C S+ + +
Sbjct: 62 HVLSCSQYSQGCDG--GFPFLVGKFAEDFGIVTEDYFPYTADDDRP--CKASPSECRRYY 117
Query: 486 GKDFLYFNG------SETMKKILYKYGPLSVGL--NSHLIHFYNGT-PIRKNDETCS--- 533
D+ Y G + MK +Y+ GP+ V S + G NDE
Sbjct: 118 FSDYNYVGGCYGCTNEDEMKWEIYRNGPIVVAFEVYSDFDFYKEGVYHHTDNDEVSDGDN 177
Query: 534 ----PYDL-GHAVLLVGYGKQDD--IPYWLARNSWGPIGPDEGFFKIERGNNACGIEQIA 586
P++L HAVLLVG+G+ + YW+ +NSWG ++G+FKI RG N CGIE A
Sbjct: 178 DNFNPFELTNHAVLLVGWGEDEIKGEKYWIVKNSWGSSWGEKGYFKIRRGTNECGIESQA 237
Query: 587 GYAT 590
+A
Sbjct: 238 VFAY 241
>gnl|CDD|239111 cd02620, Peptidase_C1A_CathepsinB, Cathepsin B group; composed of
cathepsin B and similar proteins, including
tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin
B is a lysosomal papain-like cysteine peptidase which is
expressed in all tissues and functions primarily as an
exopeptidase through its carboxydipeptidyl activity.
Together with other cathepsins, it is involved in the
degradation of proteins, proenzyme activation, Ag
processing, metabolism and apoptosis. Cathepsin B has
been implicated in a number of human diseases such as
cancer, rheumatoid arthritis, osteoporosis and
Alzheimer's disease. The unique carboxydipeptidyl
activity of cathepsin B is attributed to the presence of
an occluding loop in its active site which favors the
binding of the C-termini of substrate proteins. Some
members of this group do not possess the occluding loop.
TIN-Ag is an extracellular matrix basement protein which
was originally identified as a target Ag involved in
anti-tubular basement membrane antibody-mediated
interstitial nephritis. It plays a role in renal
tubulogenesis and is defective in hereditary
tubulointerstitial disorders. TIN-Ag is exclusively
expressed in kidney tissues. .
Length = 236
Score = 119 bits (300), Expect = 3e-30
Identities = 60/234 (25%), Positives = 97/234 (41%), Gaps = 32/234 (13%)
Query: 12 PDAWDWRKK----NVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTG--KLVEFSKSQLVE 65
P+++D R+K G DQ +CGSCWAFS + I++ + V S L+
Sbjct: 1 PESFDAREKWPNCISIGEIRDQGNCGSCWAFSAVEAFSDRLCIQSNGKENVLLSAQDLLS 60
Query: 66 CAKQCS-GCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKF------------------ 106
C C GC+G + + + +Y G+ + PY
Sbjct: 61 CCSGCGDGCNGGYPDAAWKYLTTTGVVTGGCQPYTIPPCGHHPEGPPPCCGTPYCTPKCQ 120
Query: 107 -KCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSV--LLNSDLIHDYNGTPIRKND 163
C + K + + + K + GP+ + D ++ Y +
Sbjct: 121 DGCEKTYEEDKHKGKSAYSVPSDETDIMKEIMTNGPVQAAFTVYEDFLY-YKSGVYQH-- 177
Query: 164 ETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIE 217
T GHAV ++G+G ++ +PYWL NSWG + G+F+I RG+N CGIE
Sbjct: 178 -TSGKQLGGHAVKIIGWGVENGVPYWLAANSWGTDWGENGYFRILRGSNECGIE 230
Score = 118 bits (298), Expect = 6e-30
Identities = 60/234 (25%), Positives = 96/234 (41%), Gaps = 32/234 (13%)
Query: 743 PDAWDWRKK----NVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTG--KLVEFSKSQLVE 796
P+++D R+K G DQ CGSCWAFS + I++ + V S L+
Sbjct: 1 PESFDAREKWPNCISIGEIRDQGNCGSCWAFSAVEAFSDRLCIQSNGKENVLLSAQDLLS 60
Query: 797 CAKQCS-GCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKF------------------ 837
C C GC+G + + + +Y G+ + PY
Sbjct: 61 CCSGCGDGCNGGYPDAAWKYLTTTGVVTGGCQPYTIPPCGHHPEGPPPCCGTPYCTPKCQ 120
Query: 838 -KCAYDKSKVKLFTGKDFLHFNGSETMKKILYKYGPLSV--LLNSDLIHDYNGTPIRKND 894
C + K + + + K + GP+ + D ++ Y +
Sbjct: 121 DGCEKTYEEDKHKGKSAYSVPSDETDIMKEIMTNGPVQAAFTVYEDFLY-YKSGVYQH-- 177
Query: 895 ETCSPYDLGHAVLLVGYGKQDNIPYWLVRNSWGPIGPDEGFFKIERGNNACGIE 948
T GHAV ++G+G ++ +PYWL NSWG + G+F+I RG+N CGIE
Sbjct: 178 -TSGKQLGGHAVKIIGWGVENGVPYWLAANSWGTDWGENGYFRILRGSNECGIE 230
Score = 116 bits (292), Expect = 4e-29
Identities = 64/235 (27%), Positives = 99/235 (42%), Gaps = 33/235 (14%)
Query: 377 PDAWDWRKK----NVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTG--KLVEFSKSQLVE 430
P+++D R+K G DQ CGSCWAFS + I++ + V S L+
Sbjct: 1 PESFDAREKWPNCISIGEIRDQGNCGSCWAFSAVEAFSDRLCIQSNGKENVLLSAQDLLS 60
Query: 431 CAKQCSGCG-GCDG--LEQPIEYTHQAGLESEKDYPYRNGNGEKF--------------- 472
C CSGCG GC+G + +Y G+ + PY
Sbjct: 61 C---CSGCGDGCNGGYPDAAWKYLTTTGVVTGGCQPYTIPPCGHHPEGPPPCCGTPYCTP 117
Query: 473 ----KCAYDKSKVKLFTGKDFLYFNGSETMKKILYKYGPLSVGLNSHLIHFYNGTPIRKN 528
C + K + + + K + GP+ + Y + + ++
Sbjct: 118 KCQDGCEKTYEEDKHKGKSAYSVPSDETDIMKEIMTNGPVQAAFTVYEDFLYYKSGVYQH 177
Query: 529 DETCSPYDLGHAVLLVGYGKQDDIPYWLARNSWGPIGPDEGFFKIERGNNACGIE 583
T GHAV ++G+G ++ +PYWLA NSWG + G+F+I RG+N CGIE
Sbjct: 178 --TSGKQLGGHAVKIIGWGVENGVPYWLAANSWGTDWGENGYFRILRGSNECGIE 230
Score = 76.9 bits (190), Expect = 1e-15
Identities = 24/46 (52%), Positives = 34/46 (73%)
Query: 971 GHAVLLVGYGKQDDIPYWLVRNSWGPIGPDEGFFKIERGNNACGIE 1016
GHAV ++G+G ++ +PYWL NSWG + G+F+I RG+N CGIE
Sbjct: 185 GHAVKIIGWGVENGVPYWLAANSWGTDWGENGYFRILRGSNECGIE 230
>gnl|CDD|239110 cd02619, Peptidase_C1, C1 Peptidase family (MEROPS database
nomenclature), also referred to as the papain family;
composed of two subfamilies of cysteine peptidases
(CPs), C1A (papain) and C1B (bleomycin hydrolase).
Papain-like enzymes are mostly endopeptidases with some
exceptions like cathepsins B, C, H and X, which are
exopeptidases. Papain-like CPs have different functions
in various organisms. Plant CPs are used to mobilize
storage proteins in seeds while mammalian CPs are
primarily lysosomal enzymes responsible for protein
degradation in the lysosome. Papain-like CPs are
synthesized as inactive proenzymes with N-terminal
propeptide regions, which are removed upon activation.
Bleomycin hydrolase (BH) is a CP that detoxifies
bleomycin by hydrolysis of an amide group. It acts as a
carboxypeptidase on its C-terminus to convert itself
into an aminopeptidase and peptide ligase. BH is found
in all tissues in mammals as well as in many other
eukaryotes. It forms a hexameric ring barrel structure
with the active sites imbedded in the central channel.
Some members of the C1 family are proteins classified as
non-peptidase homologs which lack peptidase activity or
have missing active site residues.
Length = 223
Score = 114 bits (286), Expect = 2e-28
Identities = 57/209 (27%), Positives = 83/209 (39%), Gaps = 17/209 (8%)
Query: 15 WDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTG--KLVEFSKSQLVECAKQ--- 69
D R +T P +Q GSCWAF+ A LE Y IK G + V+ S L CA
Sbjct: 2 VDLRPLRLT-PVKNQGSRGSCWAFASAYALESAYRIKGGEDEYVDLSPQYLYICANDECL 60
Query: 70 ---CSGCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDF--L 124
S G ++ G+ E+DYPY + + + KD+ +
Sbjct: 61 GINGSCDGGGPLSALLKLVALKGIPPEEDYPYGAESDGEEPKSEAALNAAKVKLKDYRRV 120
Query: 125 HFNGSETMKKILYKYGPLSV--LLNSDLIHDYNGTPIR--KNDETCSPYDLGHAVLLVGY 180
N E +K+ L K GP+ + S G GHAV++VGY
Sbjct: 121 LKNNIEDIKEALAKGGPVVAGFDVYSGFDRLKEGIIYEEIVYLLYEDGDLGGHAVVIVGY 180
Query: 181 GKQ--DNIPYWLVRNSWGPIGPDEGFFKI 207
+ ++V+NSWG D G+ +I
Sbjct: 181 DDNYVEGKGAFIVKNSWGTDWGDNGYGRI 209
Score = 113 bits (285), Expect = 2e-28
Identities = 59/209 (28%), Positives = 87/209 (41%), Gaps = 16/209 (7%)
Query: 380 WDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTG--KLVEFSKSQLVECAKQ-CS 436
D R +T P +Q + GSCWAF+ A LE Y IK G + V+ S L CA C
Sbjct: 2 VDLRPLRLT-PVKNQGSRGSCWAFASAYALESAYRIKGGEDEYVDLSPQYLYICANDECL 60
Query: 437 GCG-GCDG---LEQPIEYTHQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDF--L 490
G CDG L ++ G+ E+DYPY + + + KD+ +
Sbjct: 61 GINGSCDGGGPLSALLKLVALKGIPPEEDYPYGAESDGEEPKSEAALNAAKVKLKDYRRV 120
Query: 491 YFNGSETMKKILYKYGPLSVGLNSHLIHFY----NGTPIRKNDETCSPYDLGHAVLLVGY 546
N E +K+ L K GP+ G + + GHAV++VGY
Sbjct: 121 LKNNIEDIKEALAKGGPVVAGFDVYSGFDRLKEGIIYEEIVYLLYEDGDLGGHAVVIVGY 180
Query: 547 GKQ--DDIPYWLARNSWGPIGPDEGFFKI 573
+ ++ +NSWG D G+ +I
Sbjct: 181 DDNYVEGKGAFIVKNSWGTDWGDNGYGRI 209
Score = 112 bits (283), Expect = 4e-28
Identities = 57/209 (27%), Positives = 84/209 (40%), Gaps = 17/209 (8%)
Query: 746 WDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTG--KLVEFSKSQLVECAKQ--- 800
D R +T P +Q + GSCWAF+ A LE Y IK G + V+ S L CA
Sbjct: 2 VDLRPLRLT-PVKNQGSRGSCWAFASAYALESAYRIKGGEDEYVDLSPQYLYICANDECL 60
Query: 801 ---CSGCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDF--L 855
S G ++ G+ E+DYPY + + + KD+ +
Sbjct: 61 GINGSCDGGGPLSALLKLVALKGIPPEEDYPYGAESDGEEPKSEAALNAAKVKLKDYRRV 120
Query: 856 HFNGSETMKKILYKYGPLSV--LLNSDLIHDYNGTPIR--KNDETCSPYDLGHAVLLVGY 911
N E +K+ L K GP+ + S G GHAV++VGY
Sbjct: 121 LKNNIEDIKEALAKGGPVVAGFDVYSGFDRLKEGIIYEEIVYLLYEDGDLGGHAVVIVGY 180
Query: 912 GKQ--DNIPYWLVRNSWGPIGPDEGFFKI 938
+ ++V+NSWG D G+ +I
Sbjct: 181 DDNYVEGKGAFIVKNSWGTDWGDNGYGRI 209
Score = 52.1 bits (125), Expect = 3e-07
Identities = 15/50 (30%), Positives = 24/50 (48%), Gaps = 2/50 (4%)
Query: 959 VKNDETCSPYDLGHAVLLVGYGKQ--DDIPYWLVRNSWGPIGPDEGFFKI 1006
+ GHAV++VGY + ++V+NSWG D G+ +I
Sbjct: 160 IVYLLYEDGDLGGHAVVIVGYDDNYVEGKGAFIVKNSWGTDWGDNGYGRI 209
>gnl|CDD|239149 cd02698, Peptidase_C1A_CathepsinX, Cathepsin X; the only
papain-like lysosomal cysteine peptidase exhibiting
carboxymonopeptidase activity. It can also act as a
carboxydipeptidase, like cathepsin B, but has been shown
to preferentially cleave substrates through a
monopeptidyl carboxypeptidase pathway. The propeptide
region of cathepsin X, the shortest among papain-like
peptidases, is covalently attached to the active site
cysteine in the inactive form of the enzyme. Little is
known about the biological function of cathepsin X. Some
studies point to a role in early tumorigenesis. A more
recent study indicates that cathepsin X expression is
restricted to immune cells suggesting a role in
phagocytosis and the regulation of the immune response.
Length = 239
Score = 87.1 bits (216), Expect = 5e-19
Identities = 59/225 (26%), Positives = 96/225 (42%), Gaps = 31/225 (13%)
Query: 376 VPDAWDWRKKNVTG-----PAGDQ---AACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQ 427
+P +WDWR NV G P +Q CGSCWA L + I + +
Sbjct: 1 LPKSWDWR--NVNGVNYVSPTRNQHIPQYCGSCWAHGSTSALADR--INIARKGAWPSVY 56
Query: 428 L-VECAKQCSGCGGCDGLE--QPIEYTHQAGLESEKDYPYRNGNGEKFKCAYDKS----- 479
L V+ C+G G C G + EY H+ G+ E PY+ +GE +
Sbjct: 57 LSVQVVIDCAGGGSCHGGDPGGVYEYAHKHGIPDETCNPYQAKDGECNPFNRCGTCNPFG 116
Query: 480 ---KVKLFTG---KDFLYFNGSETMKKILYKYGPLSVGLNSH-LIHFYNGTPIRKNDETC 532
+K +T D+ +G + M +Y GP+S G+ + + Y G ++ +
Sbjct: 117 ECFAIKNYTLYFVSDYGSVSGRDKMMAEIYARGPISCGIMATEALENYTGGVYKEYVQDP 176
Query: 533 SPYDLGHAVLLVGYGKQDD-IPYWLARNSWGPIGPDEGFFKIERG 576
H + + G+G ++ + YW+ RNSWG + G+F+I
Sbjct: 177 LI---NHIISVAGWGVDENGVEYWIVRNSWGEPWGERGWFRIVTS 218
Score = 85.2 bits (211), Expect = 3e-18
Identities = 60/224 (26%), Positives = 97/224 (43%), Gaps = 30/224 (13%)
Query: 742 VPDAWDWRKKNVTG-----PAGDQ---AACGSCWAFSIAGMLEGQYAIKT---GKLVEFS 790
+P +WDWR NV G P +Q CGSCWA L + I V S
Sbjct: 1 LPKSWDWR--NVNGVNYVSPTRNQHIPQYCGSCWAHGSTSALADRINIARKGAWPSVYLS 58
Query: 791 KSQLVECAKQCSGCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKS------ 844
+++CA S C G EY H+ G+ E PY+ +GE +
Sbjct: 59 VQVVIDCAGGGS-CHGGDPGGVYEYAHKHGIPDETCNPYQAKDGECNPFNRCGTCNPFGE 117
Query: 845 --KVKLFTG---KDFLHFNGSETMKKILYKYGPLSV-LLNSDLIHDYNGTPIRKNDETCS 898
+K +T D+ +G + M +Y GP+S ++ ++ + +Y G ++ +
Sbjct: 118 CFAIKNYTLYFVSDYGSVSGRDKMMAEIYARGPISCGIMATEALENYTGGVYKEYVQDPL 177
Query: 899 PYDLGHAVLLVGYGKQDN-IPYWLVRNSWGPIGPDEGFFKIERG 941
H + + G+G +N + YW+VRNSWG + G+F+I
Sbjct: 178 I---NHIISVAGWGVDENGVEYWIVRNSWGEPWGERGWFRIVTS 218
Score = 84.4 bits (209), Expect = 4e-18
Identities = 60/224 (26%), Positives = 97/224 (43%), Gaps = 30/224 (13%)
Query: 11 VPDAWDWRKKNVTG-----PAGDQ---ADCGSCWAFSIAGMLEGQYAIKT---GKLVEFS 59
+P +WDWR NV G P +Q CGSCWA L + I V S
Sbjct: 1 LPKSWDWR--NVNGVNYVSPTRNQHIPQYCGSCWAHGSTSALADRINIARKGAWPSVYLS 58
Query: 60 KSQLVECAKQCSGCDGCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKS------ 113
+++CA S C G EY H+ G+ E PY+ +GE +
Sbjct: 59 VQVVIDCAGGGS-CHGGDPGGVYEYAHKHGIPDETCNPYQAKDGECNPFNRCGTCNPFGE 117
Query: 114 --KVKLFTG---KDFLHFNGSETMKKILYKYGPLSV-LLNSDLIHDYNGTPIRKNDETCS 167
+K +T D+ +G + M +Y GP+S ++ ++ + +Y G ++ +
Sbjct: 118 CFAIKNYTLYFVSDYGSVSGRDKMMAEIYARGPISCGIMATEALENYTGGVYKEYVQDPL 177
Query: 168 PYDLGHAVLLVGYGKQDN-IPYWLVRNSWGPIGPDEGFFKIERG 210
H + + G+G +N + YW+VRNSWG + G+F+I
Sbjct: 178 I---NHIISVAGWGVDENGVEYWIVRNSWGEPWGERGWFRIVTS 218
Score = 45.5 bits (108), Expect = 5e-05
Identities = 14/39 (35%), Positives = 24/39 (61%), Gaps = 1/39 (2%)
Query: 972 HAVLLVGYGKQDD-IPYWLVRNSWGPIGPDEGFFKIERG 1009
H + + G+G ++ + YW+VRNSWG + G+F+I
Sbjct: 180 HIISVAGWGVDENGVEYWIVRNSWGEPWGERGWFRIVTS 218
>gnl|CDD|240381 PTZ00364, PTZ00364, dipeptidyl-peptidase I precursor; Provisional.
Length = 548
Score = 85.3 bits (211), Expect = 4e-17
Identities = 60/250 (24%), Positives = 93/250 (37%), Gaps = 44/250 (17%)
Query: 741 PVPDAWDWRKKNVTG--------PAGDQAACGSC------WAFSIAGMLEGQYAIKTGKL 786
P P AW W +V G PA C S A M+ G+
Sbjct: 204 PPPAAWSWG--DVGGASFLPAAPPASPGRGCNSSYVEAALAAMMARVMVASNRTDPLGQQ 261
Query: 787 VEFSKSQLVECAKQCSGCDGCFFEPSIEYTHQAGLESEKDY--PYKNANGEKFKCAYDKS 844
S +++C++ GC G F E ++ G+ + Y PY + +G + C +
Sbjct: 262 TFLSARHVLDCSQYGQGCAGGFPEEVGKFAETFGILTTDSYYIPYDSGDGVERACKTRRP 321
Query: 845 KVKLF------TGKDFLHFNGSETMKKILYKYGPL--SVLLNSDLI---HDYNGT----- 888
+ + G + + + +Y++GP+ SV NSD +
Sbjct: 322 SRRYYFTNYGPLGGYYGAVTDPDEIIWEIYRHGPVPASVYANSDWYNCDENSTEDVRYVS 381
Query: 889 ----PIRKNDETCSPY---DLGHAVLLVGYGKQDN-IPYWLVRNSWGPIGP--DEGFFKI 938
D Y ++ H VL++G+G +N YWLV + WG D G KI
Sbjct: 382 LDDYSTASADRPLRHYFASNVNHTVLIIGWGTDENGGDYWLVLDPWGSRRSWCDGGTRKI 441
Query: 939 ERGNNACGIE 948
RG NA IE
Sbjct: 442 ARGVNAYNIE 451
Score = 85.3 bits (211), Expect = 5e-17
Identities = 60/250 (24%), Positives = 93/250 (37%), Gaps = 44/250 (17%)
Query: 10 PVPDAWDWRKKNVTG--------PAGDQADCGSC------WAFSIAGMLEGQYAIKTGKL 55
P P AW W +V G PA C S A M+ G+
Sbjct: 204 PPPAAWSWG--DVGGASFLPAAPPASPGRGCNSSYVEAALAAMMARVMVASNRTDPLGQQ 261
Query: 56 VEFSKSQLVECAKQCSGCDGCFFEPSIEYTHQAGLESEKDY--PYKNANGEKFKCAYDKS 113
S +++C++ GC G F E ++ G+ + Y PY + +G + C +
Sbjct: 262 TFLSARHVLDCSQYGQGCAGGFPEEVGKFAETFGILTTDSYYIPYDSGDGVERACKTRRP 321
Query: 114 KVKLF------TGKDFLHFNGSETMKKILYKYGPL--SVLLNSDLI---HDYNGT----- 157
+ + G + + + +Y++GP+ SV NSD +
Sbjct: 322 SRRYYFTNYGPLGGYYGAVTDPDEIIWEIYRHGPVPASVYANSDWYNCDENSTEDVRYVS 381
Query: 158 ----PIRKNDETCSPY---DLGHAVLLVGYGKQDN-IPYWLVRNSWGPIGP--DEGFFKI 207
D Y ++ H VL++G+G +N YWLV + WG D G KI
Sbjct: 382 LDDYSTASADRPLRHYFASNVNHTVLIIGWGTDENGGDYWLVLDPWGSRRSWCDGGTRKI 441
Query: 208 ERGNNACGIE 217
RG NA IE
Sbjct: 442 ARGVNAYNIE 451
Score = 70.3 bits (172), Expect = 3e-12
Identities = 57/252 (22%), Positives = 95/252 (37%), Gaps = 47/252 (18%)
Query: 375 PVPDAWDWRKKNVTG--------PAGDQAACGSC------WAFSIAGMLEGQYAIKTGKL 420
P P AW W +V G PA C S A M+ G+
Sbjct: 204 PPPAAWSWG--DVGGASFLPAAPPASPGRGCNSSYVEAALAAMMARVMVASNRTDPLGQQ 261
Query: 421 VEFSKSQLVECAKQCSGCGGCDGL-EQPIEYTHQAGLESEKDY--PYRNGNGEKFKCAYD 477
S +++C++ GC G G E+ ++ G+ + Y PY +G+G + C
Sbjct: 262 TFLSARHVLDCSQYGQGCAG--GFPEEVGKFAETFGILTTDSYYIPYDSGDGVERACKTR 319
Query: 478 KSKVK-LFTGKDFL--YF---NGSETMKKILYKYGPLSVGLNSHLIHFYNGTPIRKNDET 531
+ + FT L Y+ + + +Y++GP+ + ++ + ++
Sbjct: 320 RPSRRYYFTNYGPLGGYYGAVTDPDEIIWEIYRHGPVPASVYANSDWYNCDENSTEDVRY 379
Query: 532 CSPYD-----------------LGHAVLLVGYGK-QDDIPYWLARNSWGPIGP--DEGFF 571
S D + H VL++G+G ++ YWL + WG D G
Sbjct: 380 VSLDDYSTASADRPLRHYFASNVNHTVLIIGWGTDENGGDYWLVLDPWGSRRSWCDGGTR 439
Query: 572 KIERGNNACGIE 583
KI RG NA IE
Sbjct: 440 KIARGVNAYNIE 451
Score = 43.7 bits (103), Expect = 5e-04
Identities = 21/51 (41%), Positives = 29/51 (56%), Gaps = 3/51 (5%)
Query: 969 DLGHAVLLVGYGK-QDDIPYWLVRNSWGPIGP--DEGFFKIERGNNACGIE 1016
++ H VL++G+G ++ YWLV + WG D G KI RG NA IE
Sbjct: 401 NVNHTVLIIGWGTDENGGDYWLVLDPWGSRRSWCDGGTRKIARGVNAYNIE 451
>gnl|CDD|240244 PTZ00049, PTZ00049, cathepsin C-like protein; Provisional.
Length = 693
Score = 56.9 bits (137), Expect = 4e-08
Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 4/49 (8%)
Query: 972 HAVLLVGYGKQDD----IPYWLVRNSWGPIGPDEGFFKIERGNNACGIE 1016
HA++LVG+G+++ YW+ RNSWG EG+FKI RG N GIE
Sbjct: 620 HAIVLVGWGEEEINGKLYKYWIGRNSWGKNWGKEGYFKIIRGKNFSGIE 668
Score = 56.5 bits (136), Expect = 6e-08
Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 4/49 (8%)
Query: 173 HAVLLVGYGKQDN----IPYWLVRNSWGPIGPDEGFFKIERGNNACGIE 217
HA++LVG+G+++ YW+ RNSWG EG+FKI RG N GIE
Sbjct: 620 HAIVLVGWGEEEINGKLYKYWIGRNSWGKNWGKEGYFKIIRGKNFSGIE 668
Score = 56.5 bits (136), Expect = 6e-08
Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 4/49 (8%)
Query: 904 HAVLLVGYGKQDN----IPYWLVRNSWGPIGPDEGFFKIERGNNACGIE 948
HA++LVG+G+++ YW+ RNSWG EG+FKI RG N GIE
Sbjct: 620 HAIVLVGWGEEEINGKLYKYWIGRNSWGKNWGKEGYFKIIRGKNFSGIE 668
Score = 56.1 bits (135), Expect = 8e-08
Identities = 24/49 (48%), Positives = 32/49 (65%), Gaps = 4/49 (8%)
Query: 539 HAVLLVGYGKQDD----IPYWLARNSWGPIGPDEGFFKIERGNNACGIE 583
HA++LVG+G+++ YW+ RNSWG EG+FKI RG N GIE
Sbjct: 620 HAIVLVGWGEEEINGKLYKYWIGRNSWGKNWGKEGYFKIIRGKNFSGIE 668
>gnl|CDD|227207 COG4870, COG4870, Cysteine protease [Posttranslational
modification, protein turnover, chaperones].
Length = 372
Score = 49.5 bits (118), Expect = 6e-06
Identities = 49/220 (22%), Positives = 73/220 (33%), Gaps = 29/220 (13%)
Query: 11 VPDAWDWRKKNVTGPAGDQADCGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 70
+P +D R + P DQ GSCWAF+ LE Y +
Sbjct: 99 LPSYFDRRDEGKVSPVKDQGSGGSCWAFATTRSLES-YLNPESAWDFSENNMKNLLGVPY 157
Query: 71 S-GCD------GCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDF 123
G D G + T +G E D PY + + K
Sbjct: 158 EKGFDYTSNDGGNADMSAAYLTEWSGPVYETDDPYSENSY---FSPTNLPVTKHVQEAQI 214
Query: 124 L-HFNGS---ETMKKILYKYGPLSVLLNSDLIHDY-NGTPIRKNDETCSPYDLGHAVLLV 178
+ +K + YG +S + D + P D + GHAVL+V
Sbjct: 215 IPSRKKYLDNGNIKAMFGFYGAVSSSMYIDATNSLGICIPYPYVDSGE---NWGHAVLIV 271
Query: 179 GYG---KQDNIPY-------WLVRNSWGPIGPDEGFFKIE 208
GY +N Y ++++NSWG + G+F I
Sbjct: 272 GYDDSFDINNFKYGPPGDGAFIIKNSWGTNWGENGYFWIS 311
Score = 47.5 bits (113), Expect = 2e-05
Identities = 49/220 (22%), Positives = 74/220 (33%), Gaps = 29/220 (13%)
Query: 742 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 801
+P +D R + P DQ + GSCWAF+ LE Y +
Sbjct: 99 LPSYFDRRDEGKVSPVKDQGSGGSCWAFATTRSLES-YLNPESAWDFSENNMKNLLGVPY 157
Query: 802 S-GCD------GCFFEPSIEYTHQAGLESEKDYPYKNANGEKFKCAYDKSKVKLFTGKDF 854
G D G + T +G E D PY + + K
Sbjct: 158 EKGFDYTSNDGGNADMSAAYLTEWSGPVYETDDPYSENSY---FSPTNLPVTKHVQEAQI 214
Query: 855 L-HFNGS---ETMKKILYKYGPLSVLLNSDLIHDY-NGTPIRKNDETCSPYDLGHAVLLV 909
+ +K + YG +S + D + P D + GHAVL+V
Sbjct: 215 IPSRKKYLDNGNIKAMFGFYGAVSSSMYIDATNSLGICIPYPYVDSGE---NWGHAVLIV 271
Query: 910 GYG---KQDNIPY-------WLVRNSWGPIGPDEGFFKIE 939
GY +N Y ++++NSWG + G+F I
Sbjct: 272 GYDDSFDINNFKYGPPGDGAFIIKNSWGTNWGENGYFWIS 311
Score = 40.6 bits (95), Expect = 0.003
Identities = 46/222 (20%), Positives = 72/222 (32%), Gaps = 32/222 (14%)
Query: 376 VPDAWDWRKKNVTGPAGDQAACGSCWAFSIAGMLEGQYAIKTGKLVEFSKSQLVECAKQC 435
+P +D R + P DQ + GSCWAF+ LE Y +
Sbjct: 99 LPSYFDRRDEGKVSPVKDQGSGGSCWAFATTRSLES-YLNPESAWDFSENNMKNLLGVPY 157
Query: 436 S------GCGGCDGLEQPIEYTHQAGLESEKDYPYRNGNGEKFKCAYDKSKVKLFTGKDF 489
G + T +G E D PY + + + K ++
Sbjct: 158 EKGFDYTSNDGGNADMSAAYLTEWSGPVYETDDPY---SENSYFSPTNLPVTK--HVQEA 212
Query: 490 LYFNG------SETMKKILYKYGPLSVGLNSHLIHFY-NGTPIRKNDETCSPYDLGHAVL 542
+ +K + YG +S + + P D + GHAVL
Sbjct: 213 QIIPSRKKYLDNGNIKAMFGFYGAVSSSMYIDATNSLGICIPYPYVDSGE---NWGHAVL 269
Query: 543 LVGYGKQDDIPY----------WLARNSWGPIGPDEGFFKIE 574
+VGY DI ++ +NSWG + G+F I
Sbjct: 270 IVGYDDSFDINNFKYGPPGDGAFIIKNSWGTNWGENGYFWIS 311
Score = 34.8 bits (80), Expect = 0.20
Identities = 17/47 (36%), Positives = 24/47 (51%), Gaps = 10/47 (21%)
Query: 971 GHAVLLVGYGKQDDIPY----------WLVRNSWGPIGPDEGFFKIE 1007
GHAVL+VGY DI ++++NSWG + G+F I
Sbjct: 265 GHAVLIVGYDDSFDINNFKYGPPGDGAFIIKNSWGTNWGENGYFWIS 311
>gnl|CDD|185641 PTZ00462, PTZ00462, Serine-repeat antigen protein; Provisional.
Length = 1004
Score = 48.5 bits (115), Expect = 2e-05
Identities = 21/41 (51%), Positives = 26/41 (63%), Gaps = 5/41 (12%)
Query: 173 HAVLLVGYGKQDN-----IPYWLVRNSWGPIGPDEGFFKIE 208
HAV +VGYG N YW+VRNSWG DEG+FK++
Sbjct: 723 HAVNIVGYGNYINDEDEKKSYWIVRNSWGKYWGDEGYFKVD 763
Score = 48.5 bits (115), Expect = 2e-05
Identities = 21/41 (51%), Positives = 26/41 (63%), Gaps = 5/41 (12%)
Query: 904 HAVLLVGYGKQDN-----IPYWLVRNSWGPIGPDEGFFKIE 939
HAV +VGYG N YW+VRNSWG DEG+FK++
Sbjct: 723 HAVNIVGYGNYINDEDEKKSYWIVRNSWGKYWGDEGYFKVD 763
Score = 48.1 bits (114), Expect = 2e-05
Identities = 20/41 (48%), Positives = 27/41 (65%), Gaps = 5/41 (12%)
Query: 972 HAVLLVGYG-----KQDDIPYWLVRNSWGPIGPDEGFFKIE 1007
HAV +VGYG + + YW+VRNSWG DEG+FK++
Sbjct: 723 HAVNIVGYGNYINDEDEKKSYWIVRNSWGKYWGDEGYFKVD 763
Score = 46.2 bits (109), Expect = 1e-04
Identities = 19/41 (46%), Positives = 26/41 (63%), Gaps = 5/41 (12%)
Query: 539 HAVLLVGYG-----KQDDIPYWLARNSWGPIGPDEGFFKIE 574
HAV +VGYG + + YW+ RNSWG DEG+FK++
Sbjct: 723 HAVNIVGYGNYINDEDEKKSYWIVRNSWGKYWGDEGYFKVD 763
>gnl|CDD|214853 smart00848, Inhibitor_I29, Cathepsin propeptide inhibitor domain
(I29). This domain is found at the N-terminus of some
C1 peptidases such as Cathepsin L where it acts as a
propeptide. There are also a number of proteins that are
composed solely of multiple copies of this domain such
as the peptidase inhibitor salarin. This family is
classified as I29 by MEROPS. Peptide proteinase
inhibitors can be found as single domain proteins or as
single or multiple domains within proteins; these are
referred to as either simple or compound inhibitors,
respectively. In many cases they are synthesised as part
of a larger precursor protein, either as a prepropeptide
or as an N-terminal domain associated with an inactive
peptidase or zymogen. This domain prevents access of the
substrate to the active site. Removal of the N-terminal
inhibitor domain either by interaction with a second
peptidase or by autocatalytic cleavage activates the
zymogen. Other inhibitors interact direct with
proteinases using a simple noncovalent lock and key
mechanism; while yet others use a conformational
change-based trapping mechanism that depends on their
structural and thermodynamic properties.
Length = 57
Score = 41.5 bits (98), Expect = 5e-05
Identities = 19/57 (33%), Positives = 31/57 (54%), Gaps = 9/57 (15%)
Query: 292 FKAFIVKRGRQYANDEEIKERFEYFKQ-----DGHKKHE----RYGTSEFSDRSPEE 339
F+ + K G+ Y+++EE RF FK+ + H K + G ++FSD +PEE
Sbjct: 1 FEQWKKKHGKSYSSEEEEARRFAIFKENLKKIEEHNKKYEHSYKLGVNQFSDLTPEE 57
Score = 41.5 bits (98), Expect = 5e-05
Identities = 19/57 (33%), Positives = 31/57 (54%), Gaps = 9/57 (15%)
Query: 658 FKAFIVKRGRQYANDEEIKERFEYFKQ-----DGHKKHE----RYGTSEFSDRSPEE 705
F+ + K G+ Y+++EE RF FK+ + H K + G ++FSD +PEE
Sbjct: 1 FEQWKKKHGKSYSSEEEEARRFAIFKENLKKIEEHNKKYEHSYKLGVNQFSDLTPEE 57
>gnl|CDD|219764 pfam08246, Inhibitor_I29, Cathepsin propeptide inhibitor domain
(I29). This domain is found at the N-terminus of some
C1 peptidases such as Cathepsin L where it acts as a
propeptide. There are also a number of proteins that are
composed solely of multiple copies of this domain such
as the peptidase inhibitor salarin. This family is
classified as I29 by MEROPS.
Length = 58
Score = 39.1 bits (92), Expect = 4e-04
Identities = 17/58 (29%), Positives = 29/58 (50%), Gaps = 9/58 (15%)
Query: 292 FKAFIVKRGRQYANDEEIKERFEYFKQ-----DGHKKHE----RYGTSEFSDRSPEEI 340
F+ + K G+ Y ++EE RF+ FK+ + H K G ++F+D + EE
Sbjct: 1 FEDWKKKYGKSYYSEEEELYRFQIFKENLRFIEEHNKKGNVSYTLGLNQFADLTDEEF 58
Score = 39.1 bits (92), Expect = 4e-04
Identities = 17/58 (29%), Positives = 29/58 (50%), Gaps = 9/58 (15%)
Query: 658 FKAFIVKRGRQYANDEEIKERFEYFKQ-----DGHKKHE----RYGTSEFSDRSPEEI 706
F+ + K G+ Y ++EE RF+ FK+ + H K G ++F+D + EE
Sbjct: 1 FEDWKKKYGKSYYSEEEELYRFQIFKENLRFIEEHNKKGNVSYTLGLNQFADLTDEEF 58
>gnl|CDD|240447 cd13432, LDT_IgD_like_2, IgD-like repeat domain of mycobacterial
L,D-transpeptidases. Immunoglobulin-like domain found
in actinobacterial L,D-transpeptidases, including
Mycobacterium tuberculosis LdtMt2, which is a
non-classical transpeptidase that generates 3->3
transpeptide linkages. LdtMt2 is associated with
virulence and resistance to amoxicillin. This domain may
occur in a tandem-repeat arrangement and is found
N-terminal to the catalytic L,D-transpeptidase domain;
this model represents the repeat adjacent to the
catalytic domain.
Length = 99
Score = 29.0 bits (66), Expect = 3.1
Identities = 14/38 (36%), Positives = 16/38 (42%), Gaps = 9/38 (23%)
Query: 357 VADREKVEKMLMEVEKDGPVPDAW--------DWRKKN 386
V DR VEK ++V PV AW WR K
Sbjct: 27 VTDRAAVEKA-LKVTTSPPVEGAWYWLSDREVHWRPKE 63
Score = 29.0 bits (66), Expect = 3.1
Identities = 14/38 (36%), Positives = 16/38 (42%), Gaps = 9/38 (23%)
Query: 723 VADREKVEKMLMEVEKDGPVPDAW--------DWRKKN 752
V DR VEK ++V PV AW WR K
Sbjct: 27 VTDRAAVEKA-LKVTTSPPVEGAWYWLSDREVHWRPKE 63
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.319 0.138 0.433
Gapped
Lambda K H
0.267 0.0764 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 52,786,039
Number of extensions: 5260080
Number of successful extensions: 4485
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4343
Number of HSP's successfully gapped: 77
Length of query: 1026
Length of database: 10,937,602
Length adjustment: 107
Effective length of query: 919
Effective length of database: 6,191,724
Effective search space: 5690194356
Effective search space used: 5690194356
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 64 (28.4 bits)