RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy1727
(240 letters)
>gnl|CDD|215726 pfam00112, Peptidase_C1, Papain family cysteine protease.
Length = 213
Score = 173 bits (441), Expect = 2e-54
Identities = 67/152 (44%), Positives = 78/152 (51%), Gaps = 17/152 (11%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKE-EIRVKIQS 121
LVDCD + GC GGL NAFE I GG+ E DYPY + C K KI+
Sbjct: 53 LVDCDTGNNGCNGGLPDNAFEYIKKN--GGIVTESDYPYTAHDGTCKFKKSNSKYAKIKG 110
Query: 122 YVNVSS-DETEMAKYLVKNGPMAVAINANAM--QFYFGGVSHPLKFLCKGGMDNLDHGVL 178
Y +V DE + L KNGP++VAI+A Q Y GV + C G LDH VL
Sbjct: 111 YGDVPYNDEEALQAALAKNGPVSVAIDAYEDDFQLYKSGVYKHTE--CSG---ELDHAVL 165
Query: 179 IVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
IVGYG PYWI+KNSWG WGE
Sbjct: 166 IVGYGTENGV------PYWIVKNSWGTDWGEN 191
>gnl|CDD|239068 cd02248, Peptidase_C1A, Peptidase C1A subfamily (MEROPS database
nomenclature); composed of cysteine peptidases (CPs)
similar to papain, including the mammalian CPs
(cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain
is an endopeptidase with specific substrate preferences,
primarily for bulky hydrophobic or aromatic residues at
the S2 subsite, a hydrophobic pocket in papain that
accommodates the P2 sidechain of the substrate (the
second residue away from the scissile bond). Most
members of the papain subfamily are endopeptidases. Some
exceptions to this rule can be explained by specific
details of the catalytic domains like the occluding loop
in cathepsin B which confers an additional
carboxydipeptidyl activity and the mini-chain of
cathepsin H resulting in an N-terminal exopeptidase
activity. Papain-like CPs have different functions in
various organisms. Plant CPs are used to mobilize
storage proteins in seeds. Parasitic CPs act
extracellularly to help invade tissues and cells, to
hatch or to evade the host immune system. Mammalian CPs
are primarily lysosomal enzymes with the exception of
cathepsin W, which is retained in the endoplasmic
reticulum. They are responsible for protein degradation
in the lysosome. Papain-like CPs are synthesized as
inactive proenzymes with N-terminal propeptide regions,
which are removed upon activation. In addition to its
inhibitory role, the propeptide is required for proper
folding of the newly synthesized enzyme and its
stabilization in denaturing pH conditions. Residues
within the propeptide region also play a role in the
transport of the proenzyme to lysosomes or acidified
vesicles. Also included in this subfamily are proteins
classified as non-peptidase homologs, which lack
peptidase activity or have missing active site residues.
Length = 210
Score = 165 bits (421), Expect = 2e-51
Identities = 63/151 (41%), Positives = 79/151 (52%), Gaps = 16/151 (10%)
Query: 63 LVDCDKV-DAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
LVDC + GC GG NAFE + + GGL E DYPY G + C N ++ KI
Sbjct: 52 LVDCSTSGNNGCNGGNPDNAFEYVKN---GGLASESDYPYTGKDGTCKYNSSKVGAKITG 108
Query: 122 YVNVS-SDETEMAKYLVKNGPMAVAINAN-AMQFYFGGVSHPLKFLCKGGMDNLDHGVLI 179
Y NV DE + L GP++VAI+A+ + QFY GG+ NL+H VL+
Sbjct: 109 YSNVPPGDEEALKAALANYGPVSVAIDASSSFQFYKGGIYSG----PCCSNTNLNHAVLL 164
Query: 180 VGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VGYG YWI+KNSWG WGEK
Sbjct: 165 VGYGTENGV------DYWIVKNSWGTSWGEK 189
>gnl|CDD|214761 smart00645, Pept_C1, Papain family cysteine protease.
Length = 175
Score = 118 bits (299), Expect = 2e-33
Identities = 57/161 (35%), Positives = 64/161 (39%), Gaps = 51/161 (31%)
Query: 63 LVDCDK-VDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQS 121
LVDC + GC GGL NAFE I GGLE E YPY GS
Sbjct: 53 LVDCSGGGNCGCNGGLPDNAFEYIKKN--GGLETESCYPYTGS----------------- 93
Query: 122 YVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVG 181
VAI+A+ QFY G+ C G LDH VLIVG
Sbjct: 94 ----------------------VAIDASDFQFYKSGIYDHPG--C--GSGTLDHAVLIVG 127
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEKTMPFWIIKNSWG 222
YG + YWI+KNSWG WGE F I +
Sbjct: 128 YGTEVEN----GKDYWIVKNSWGTDWGENGY-FRIARGKNN 163
>gnl|CDD|185513 PTZ00203, PTZ00203, cathepsin L protease; Provisional.
Length = 348
Score = 121 bits (305), Expect = 6e-33
Identities = 67/154 (43%), Positives = 91/154 (59%), Gaps = 18/154 (11%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPY---KGSNRACHLNKEEIRV- 117
+LV CD VD GCGGGLM AFE ++ + G + EK YPY G C N E+
Sbjct: 177 QLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKSYPYVSGNGDVPEC-SNSSELAPG 235
Query: 118 -KIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGMDNLDHG 176
+I YV++ S E MA +L KNGP+++A++A++ Y GV C G + L+HG
Sbjct: 236 ARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSYHSGVLTS----CIG--EQLNHG 289
Query: 177 VLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
VL+VGY + T ++ PYW+IKNSWG WGEK
Sbjct: 290 VLLVGYNM-----TGEV-PYWVIKNSWGEDWGEK 317
>gnl|CDD|240232 PTZ00021, PTZ00021, falcipain-2; Provisional.
Length = 489
Score = 96.0 bits (239), Expect = 8e-23
Identities = 62/157 (39%), Positives = 92/157 (58%), Gaps = 20/157 (12%)
Query: 62 KLVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN-RACHLNKEEIRVKIQ 120
+LVDC + GC GGL+ NAFE +I GGL E DYPY C++++ + + KI+
Sbjct: 317 ELVDCSFKNNGCYGGLIPNAFEDMIEL--GGLCSEDDYPYVSDTPELCNIDRCKEKYKIK 374
Query: 121 SYVNVSSDE-TEMAKYLVKNGPMAVAINANA-MQFYFGGVSHPLKFLCKGGMDNLDHGVL 178
SYV++ D+ E ++L GP++V+I + FY GG+ F + G + +H V+
Sbjct: 375 SYVSIPEDKFKEAIRFL---GPISVSIAVSDDFAFYKGGI-----FDGECG-EEPNHAVI 425
Query: 179 IVGYGVHK-----TKFTHKIQPYWIIKNSWGPHWGEK 210
+VGYG+ + TK K + Y+IIKNSWG WGEK
Sbjct: 426 LVGYGMEEIYNSDTKKMEK-RYYYIIKNSWGESWGEK 461
Score = 40.5 bits (95), Expect = 4e-04
Identities = 16/40 (40%), Positives = 22/40 (55%)
Query: 11 DKLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLKKI 50
LE+V F F+++H K Y T +E +R F NL KI
Sbjct: 161 TNLENVNSFYLFIKEHGKKYQTPDEMQQRYLSFVENLAKI 200
>gnl|CDD|239112 cd02621, Peptidase_C1A_CathepsinC, Cathepsin C; also known as
Dipeptidyl Peptidase I (DPPI), an atypical papain-like
cysteine peptidase with chloride dependency and
dipeptidyl aminopeptidase activity, resulting from its
tetrameric structure which limits substrate access. Each
subunit of the tetramer is composed of three peptides:
the heavy and light chains, which together adopts the
papain fold and forms the catalytic domain; and the
residual propeptide region, which forms a beta barrel
and points towards the substrate's N-terminus. The
subunit composition is the result of the unique
characteristic of procathepsin C maturation involving
the cleavage of the catalytic domain and the
non-autocatalytic excision of an activation peptide
within its propeptide region. By removing N-terminal
dipeptide extensions, cathepsin C activates granule
serine peptidases (granzymes) involved in cell-mediated
apoptosis, inflammation and tissue remodelling.
Loss-of-function mutations in cathepsin C are associated
with Papillon-Lefevre and Haim-Munk syndromes, rare
diseases characterized by hyperkeratosis and early-onset
periodontitis. Cathepsin C is widely expressed in many
tissues with high levels in lung, kidney and placenta.
It is also highly expressed in cytotoxic lymphocytes and
mature myeloid cells.
Length = 243
Score = 91.7 bits (228), Expect = 2e-22
Identities = 52/165 (31%), Positives = 73/165 (44%), Gaps = 27/165 (16%)
Query: 64 VDCDKVDAGCGGGLMSNAFETIISKLGG--GLEGEKDYPYKG-SNRACHLNKEEIR---V 117
+ C + GC GG F ++ K G+ E +PY +R C + E R
Sbjct: 64 LSCSQYSQGCDGG-----FPFLVGKFAEDFGIVTEDYFPYTADDDRPCKASPSECRRYYF 118
Query: 118 KIQSYVN---VSSDETEMAKYLVKNGPMAVAINANA-MQFYFGGVSHPLKF--LCKGGMD 171
+YV ++E EM + +NGP+ VA + FY GV H + G D
Sbjct: 119 SDYNYVGGCYGCTNEDEMKWEIYRNGPIVVAFEVYSDFDFYKEGVYHHTDNDEVSDGDND 178
Query: 172 NL------DHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
N +H VL+VG+G + K YWI+KNSWG WGEK
Sbjct: 179 NFNPFELTNHAVLLVGWGEDEIKGE----KYWIVKNSWGSSWGEK 219
>gnl|CDD|240310 PTZ00200, PTZ00200, cysteine proteinase; Provisional.
Length = 448
Score = 90.5 bits (225), Expect = 5e-21
Identities = 55/154 (35%), Positives = 70/154 (45%), Gaps = 29/154 (18%)
Query: 63 LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEIRVKIQSY 122
LV+CD GC GG A E + +K GL D PY + C + +V I SY
Sbjct: 287 LVNCDTKSQGCSGGYPDTALEYVKNK---GLSSSSDVPYLAKDGKC-VVSSTKKVYIDSY 342
Query: 123 VNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHPLKFLCKGGM------DNLDHG 176
+ V+ + + K LV P V I VS L K G+ +L+H
Sbjct: 343 L-VAKGKDVLNKSLVI-SPTVVYIA----------VSREL-LKYKSGVYNGECGKSLNHA 389
Query: 177 VLIVGYGV-HKTKFTHKIQPYWIIKNSWGPHWGE 209
VL+VG G KTK + YWIIKNSWG WGE
Sbjct: 390 VLLVGEGYDEKTK-----KRYWIIKNSWGTDWGE 418
Score = 35.1 bits (81), Expect = 0.027
Identities = 14/45 (31%), Positives = 21/45 (46%), Gaps = 2/45 (4%)
Query: 11 DKLEHVAM--FNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQIR 53
KLE F F +K+N+ +AT E R FR N +++
Sbjct: 116 PKLEFEVYLEFEEFNKKYNRKHATHAERLNRFLTFRNNYLEVKSH 160
>gnl|CDD|239110 cd02619, Peptidase_C1, C1 Peptidase family (MEROPS database
nomenclature), also referred to as the papain family;
composed of two subfamilies of cysteine peptidases
(CPs), C1A (papain) and C1B (bleomycin hydrolase).
Papain-like enzymes are mostly endopeptidases with some
exceptions like cathepsins B, C, H and X, which are
exopeptidases. Papain-like CPs have different functions
in various organisms. Plant CPs are used to mobilize
storage proteins in seeds while mammalian CPs are
primarily lysosomal enzymes responsible for protein
degradation in the lysosome. Papain-like CPs are
synthesized as inactive proenzymes with N-terminal
propeptide regions, which are removed upon activation.
Bleomycin hydrolase (BH) is a CP that detoxifies
bleomycin by hydrolysis of an amide group. It acts as a
carboxypeptidase on its C-terminus to convert itself
into an aminopeptidase and peptide ligase. BH is found
in all tissues in mammals as well as in many other
eukaryotes. It forms a hexameric ring barrel structure
with the active sites imbedded in the central channel.
Some members of the C1 family are proteins classified as
non-peptidase homologs which lack peptidase activity or
have missing active site residues.
Length = 223
Score = 78.3 bits (193), Expect = 1e-17
Identities = 36/152 (23%), Positives = 64/152 (42%), Gaps = 13/152 (8%)
Query: 66 CDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNKEEI----RVKIQS 121
C ++ C GG +A +++ G + E+DYPY + E +VK++
Sbjct: 59 CLGINGSCDGGGPLSALLKLVALKG--IPPEEDYPYGAESDGEEPKSEAALNAAKVKLKD 116
Query: 122 YVNVSS-DETEMAKYLVKNGPMAVAINANAMQFYFGGVSH--PLKFLCKGGMDNLDHGVL 178
Y V + ++ + L K GP+ + + + +L D H V+
Sbjct: 117 YRRVLKNNIEDIKEALAKGGPVVAGFDVYSGFDRLKEGIIYEEIVYLLYEDGDLGGHAVV 176
Query: 179 IVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
IVGY + + + +I+KNSWG WG+
Sbjct: 177 IVGYDDNYVE----GKGAFIVKNSWGTDWGDN 204
>gnl|CDD|239111 cd02620, Peptidase_C1A_CathepsinB, Cathepsin B group; composed of
cathepsin B and similar proteins, including
tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin
B is a lysosomal papain-like cysteine peptidase which is
expressed in all tissues and functions primarily as an
exopeptidase through its carboxydipeptidyl activity.
Together with other cathepsins, it is involved in the
degradation of proteins, proenzyme activation, Ag
processing, metabolism and apoptosis. Cathepsin B has
been implicated in a number of human diseases such as
cancer, rheumatoid arthritis, osteoporosis and
Alzheimer's disease. The unique carboxydipeptidyl
activity of cathepsin B is attributed to the presence of
an occluding loop in its active site which favors the
binding of the C-termini of substrate proteins. Some
members of this group do not possess the occluding loop.
TIN-Ag is an extracellular matrix basement protein which
was originally identified as a target Ag involved in
anti-tubular basement membrane antibody-mediated
interstitial nephritis. It plays a role in renal
tubulogenesis and is defective in hereditary
tubulointerstitial disorders. TIN-Ag is exclusively
expressed in kidney tissues. .
Length = 236
Score = 64.6 bits (158), Expect = 1e-12
Identities = 46/172 (26%), Positives = 64/172 (37%), Gaps = 38/172 (22%)
Query: 63 LVDCD-KVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLNK--------- 112
L+ C GC GG A++ + + G+ PY H
Sbjct: 58 LLSCCSGCGDGCNGGYPDAAWKYLTTT---GVVTGGCQPYTIPPCGHHPEGPPPCCGTPY 114
Query: 113 -------------EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYF-GGV 158
EE + K +S +V SDET++ K ++ NGP+ A Y+ GV
Sbjct: 115 CTPKCQDGCEKTYEEDKHKGKSAYSVPSDETDIMKEIMTNGPVQAAFTVYEDFLYYKSGV 174
Query: 159 SHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
GG H V I+G+GV PYW+ NSWG WGE
Sbjct: 175 YQHTSGKQLGG-----HAVKIIGWGVENG------VPYWLAANSWGTDWGEN 215
>gnl|CDD|239149 cd02698, Peptidase_C1A_CathepsinX, Cathepsin X; the only
papain-like lysosomal cysteine peptidase exhibiting
carboxymonopeptidase activity. It can also act as a
carboxydipeptidase, like cathepsin B, but has been shown
to preferentially cleave substrates through a
monopeptidyl carboxypeptidase pathway. The propeptide
region of cathepsin X, the shortest among papain-like
peptidases, is covalently attached to the active site
cysteine in the inactive form of the enzyme. Little is
known about the biological function of cathepsin X. Some
studies point to a role in early tumorigenesis. A more
recent study indicates that cathepsin X expression is
restricted to immune cells suggesting a role in
phagocytosis and the regulation of the immune response.
Length = 239
Score = 59.7 bits (145), Expect = 7e-11
Identities = 42/181 (23%), Positives = 67/181 (37%), Gaps = 34/181 (18%)
Query: 49 KIQIRGEGTHLALK---LVDCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDYPYKGSN 105
I +G + L ++DC + C GG +E G+ E PY+ +
Sbjct: 45 NIARKGAWPSVYLSVQVVIDCAGGGS-CHGGDPGGVYEYAHKH---GIPDETCNPYQAKD 100
Query: 106 R---------------ACHLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINA-N 149
C K + Y +VS +M + GP++ I A
Sbjct: 101 GECNPFNRCGTCNPFGECFAIKNYTLYFVSDYGSVSG-RDKMMAEIYARGPISCGIMATE 159
Query: 150 AMQFYFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGE 209
A++ Y GGV ++H + + G+GV YWI++NSWG WGE
Sbjct: 160 ALENYTGGVYKE-----YVQDPLINHIISVAGWGVD-----ENGVEYWIVRNSWGEPWGE 209
Query: 210 K 210
+
Sbjct: 210 R 210
>gnl|CDD|214853 smart00848, Inhibitor_I29, Cathepsin propeptide inhibitor domain
(I29). This domain is found at the N-terminus of some
C1 peptidases such as Cathepsin L where it acts as a
propeptide. There are also a number of proteins that
are composed solely of multiple copies of this domain
such as the peptidase inhibitor salarin. This family is
classified as I29 by MEROPS. Peptide proteinase
inhibitors can be found as single domain proteins or as
single or multiple domains within proteins; these are
referred to as either simple or compound inhibitors,
respectively. In many cases they are synthesised as
part of a larger precursor protein, either as a
prepropeptide or as an N-terminal domain associated
with an inactive peptidase or zymogen. This domain
prevents access of the substrate to the active site.
Removal of the N-terminal inhibitor domain either by
interaction with a second peptidase or by autocatalytic
cleavage activates the zymogen. Other inhibitors
interact direct with proteinases using a simple
noncovalent lock and key mechanism; while yet others
use a conformational change-based trapping mechanism
that depends on their structural and thermodynamic
properties.
Length = 57
Score = 49.5 bits (119), Expect = 2e-08
Identities = 16/33 (48%), Positives = 24/33 (72%)
Query: 19 FNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQ 51
F + +KH KSY+++EE +R IF+ NLKKI+
Sbjct: 1 FEQWKKKHGKSYSSEEEEARRFAIFKENLKKIE 33
>gnl|CDD|219764 pfam08246, Inhibitor_I29, Cathepsin propeptide inhibitor domain
(I29). This domain is found at the N-terminus of some
C1 peptidases such as Cathepsin L where it acts as a
propeptide. There are also a number of proteins that
are composed solely of multiple copies of this domain
such as the peptidase inhibitor salarin. This family is
classified as I29 by MEROPS.
Length = 58
Score = 48.4 bits (116), Expect = 4e-08
Identities = 13/33 (39%), Positives = 22/33 (66%)
Query: 19 FNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQ 51
F + +K+ KSY ++EE R +IF+ NL+ I+
Sbjct: 1 FEDWKKKYGKSYYSEEEELYRFQIFKENLRFIE 33
>gnl|CDD|185641 PTZ00462, PTZ00462, Serine-repeat antigen protein; Provisional.
Length = 1004
Score = 52.0 bits (124), Expect = 1e-07
Identities = 32/99 (32%), Positives = 51/99 (51%), Gaps = 12/99 (12%)
Query: 113 EEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINA-NAMQFYFGGVSHPLKFLCKGGMD 171
E K+ +++ + DE ++ G + I A N + + F G ++ LC G D
Sbjct: 670 EHFHDKMDAFIKIIKDE------IMNKGSVIAYIKAENVLGYEFNGKK--VQNLC--GDD 719
Query: 172 NLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWGPHWGEK 210
DH V IVGYG + K + YWI++NSWG +WG++
Sbjct: 720 TADHAVNIVGYGNYINDEDEK-KSYWIVRNSWGKYWGDE 757
>gnl|CDD|240244 PTZ00049, PTZ00049, cathepsin C-like protein; Provisional.
Length = 693
Score = 50.3 bits (120), Expect = 3e-07
Identities = 33/129 (25%), Positives = 52/129 (40%), Gaps = 34/129 (26%)
Query: 97 KDYPYKGSNRACHLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQF-YF 155
KDY Y G C N + E M + +NGP+ + A+ + Y
Sbjct: 540 KDYNYIGGCYGC---------------NQCNGEKIMMNEIYRNGPIVASFEASPDFYDYA 584
Query: 156 GGV----SHPLKFLCK------------GGMDNLDHGVLIVGYGVHKTKFTHKIQPYWII 199
GV P C G + ++H +++VG+G + + K+ YWI
Sbjct: 585 DGVYYVEDFPHARRCTVDLPKHNGVYNITGWEKVNHAIVLVGWG--EEEINGKLYKYWIG 642
Query: 200 KNSWGPHWG 208
+NSWG +WG
Sbjct: 643 RNSWGKNWG 651
>gnl|CDD|227207 COG4870, COG4870, Cysteine protease [Posttranslational
modification, protein turnover, chaperones].
Length = 372
Score = 46.4 bits (110), Expect = 4e-06
Identities = 25/67 (37%), Positives = 30/67 (44%), Gaps = 7/67 (10%)
Query: 154 YFGGVSHPLKFLCKGGMDNLDHGVLIVGYGVHKTKFTHKIQPY----WIIKNSWGPHWGE 209
G+ P + +N H VLIVGY K P +IIKNSWG +WGE
Sbjct: 247 NSLGICIP--YPYVDSGENWGHAVLIVGYDDSFDINNFKYGPPGDGAFIIKNSWGTNWGE 304
Query: 210 KTMPFWI 216
FWI
Sbjct: 305 NGY-FWI 310
>gnl|CDD|240381 PTZ00364, PTZ00364, dipeptidyl-peptidase I precursor; Provisional.
Length = 548
Score = 45.3 bits (107), Expect = 2e-05
Identities = 39/178 (21%), Positives = 64/178 (35%), Gaps = 35/178 (19%)
Query: 54 GEGTHLALKLV-DCDKVDAGCGGGLMSNAFETIISKLGGGLEGEKDY--PYK---GSNRA 107
G+ T L+ + V DC + GC GG E G+ Y PY G RA
Sbjct: 259 GQQTFLSARHVLDCSQYGQGCAGGF---PEEVGKFAETFGILTTDSYYIPYDSGDGVERA 315
Query: 108 CHLNKEEIR------VKIQSYVNVSSDETEMAKYLVKNGPMAVAINANAMQFYFGGVSHP 161
C + R + Y +D E+ + ++GP+ ++ AN+ + S
Sbjct: 316 CKTRRPSRRYYFTNYGPLGGYYGAVTDPDEIIWEIYRHGPVPASVYANSDWYNCDENSTE 375
Query: 162 LKFLCKGGMD---------------NLDHGVLIVGYGVHKTKFTHKIQPYWIIKNSWG 204
N++H VLI+G+G + YW++ + WG
Sbjct: 376 DVRYVSLDDYSTASADRPLRHYFASNVNHTVLIIGWGTDENG-----GDYWLVLDPWG 428
>gnl|CDD|176025 cd08379, C2D_MCTP_PRT_plant, C2 domain fourth repeat found in
Multiple C2 domain and Transmembrane region Proteins
(MCTP); plant subset. MCTPs are involved in Ca2+
signaling at the membrane. Plant-MCTPs are composed of
a variable N-terminal sequence, four C2 domains, two
transmembrane regions (TMRs), and a short C-terminal
sequence. It is one of four protein classes that are
anchored to membranes via a transmembrane region; the
others being synaptotagmins, extended synaptotagmins,
and ferlins. MCTPs are the only membrane-bound C2 domain
proteins that contain two functional TMRs. MCTPs are
unique in that they bind Ca2+ but not phospholipids. C2
domains fold into an 8-standed beta-sandwich that can
adopt 2 structural arrangements: Type I and Type II,
distinguished by a circular permutation involving their
N- and C-terminal beta strands. Many C2 domains are
Ca2+-dependent membrane-targeting modules that bind a
wide variety of substances including bind phospholipids,
inositol polyphosphates, and intracellular proteins.
Most C2 domain proteins are either signal transduction
enzymes that contain a single C2 domain, such as protein
kinase C, or membrane trafficking proteins which contain
at least two C2 domains, such as synaptotagmin 1.
However, there are a few exceptions to this including
RIM isoforms and some splice variants of piccolo/aczonin
and intersectin which only have a single C2 domain. C2
domains with a calcium binding region have negatively
charged residues, primarily aspartates, that serve as
ligands for calcium ions. This cd contains the fourth C2
repeat, C2D, and has a type-II topology.
Length = 126
Score = 28.1 bits (63), Expect = 2.1
Identities = 14/47 (29%), Positives = 21/47 (44%), Gaps = 7/47 (14%)
Query: 195 PYWIIKNSWGPHWGEKTMPFWIIKNSWGPRWGEQVTKSIYS-SAPVT 240
Y + K +GP W +++S PRW EQ T +Y +T
Sbjct: 26 AYCVAK--YGPKWVR----TRTVEDSSNPRWNEQYTWPVYDPCTVLT 66
>gnl|CDD|211774 TIGR02993, ectoine_eutD, ectoine utilization protein EutD. Members
of this family are putative peptidases or hydrolases
similar to Xaa-Pro aminopeptidase (pfam00557). They
belong to ectoine utilization operons, as found in
Sinorhizobium meliloti 1021 (where it is known to be
induced by ectoine), Mesorhizobium loti, Silicibacter
pomeroyi, Agrobacterium tumefaciens, and Pseudomonas
putida. The exact function is unknown.
Length = 391
Score = 29.1 bits (65), Expect = 2.5
Identities = 20/66 (30%), Positives = 28/66 (42%), Gaps = 19/66 (28%)
Query: 182 YGVHKTKFTHKIQPYWIIKNSWGPHWGEKTMP--------------FWIIKNSWGPRWGE 227
YG+HK T + I S+ P WGE+TM F + W WG
Sbjct: 310 YGIHKDSRTG-----YPIGLSYPPDWGERTMSLRPGDNTVLQPGMTFHFMTGLWMEDWGL 364
Query: 228 QVTKSI 233
++T+SI
Sbjct: 365 EITESI 370
>gnl|CDD|188453 TIGR03938, deacetyl_PgaB, poly-beta-1,6-N-acetyl-D-glucosamine
N-deacetylase PgaB. Two well-characterized systems
produce polysaccharide based on N-acetyl-D-glucosamine
in straight chains with beta-1,6 linkages. These are
encoded by the icaADBC operon in Staphylococcus species,
where the system is designated polysaccharide
intercellular adhesin (PIA), and the pgaABCD operon in
Gram-negative bacteria such as E. coli. Both systems
include a putative polysaccharide deacetylase. The PgaB
protein, described here, contains an additional domain
lacking from its Gram-positive counterpart IcaB
(TIGR03933). Deacetylation by this protein appears
necessary to allow export through the porin PgaA [Cell
envelope, Biosynthesis and degradation of surface
polysaccharides and lipopolysaccharides].
Length = 619
Score = 28.0 bits (63), Expect = 4.6
Identities = 10/34 (29%), Positives = 14/34 (41%), Gaps = 3/34 (8%)
Query: 17 AMFNHFLEKHNKSYATKEEYHKRLRIFRANLKKI 50
A + Y T EY +R+ R +LKK
Sbjct: 158 AATTRAYDPATGRYETDAEYRQRI---RDDLKKS 188
>gnl|CDD|129694 TIGR00606, rad50, rad50. All proteins in this family for which
functions are known are involvedin recombination,
recombinational repair, and/or non-homologous end
joining.They are components of an exonuclease complex
with MRE11 homologs. This family is distantly related to
the SbcC family of bacterial proteins.This family is
based on the phylogenomic analysis of JA Eisen (1999,
Ph.D. Thesis, Stanford University).
Length = 1311
Score = 28.1 bits (62), Expect = 5.6
Identities = 12/53 (22%), Positives = 22/53 (41%), Gaps = 2/53 (3%)
Query: 1 MEATAKPHHHDKLEHVAMFNHFLEKHNKSYATKEEYHKRLRIFRANLKKIQIR 53
M+ H KLE + H + ++ Y K ++ F+ L++ Q R
Sbjct: 1047 MQVLQMKQEHQKLEE--NIDLIKRNHVLALGRQKGYEKEIKHFKKELREPQFR 1097
>gnl|CDD|165867 PLN02223, PLN02223, phosphoinositide phospholipase C.
Length = 537
Score = 27.7 bits (61), Expect = 6.0
Identities = 12/26 (46%), Positives = 14/26 (53%), Gaps = 1/26 (3%)
Query: 205 PHWGEKTMPFWIIKNSWGPRWGEQVT 230
PH EK M + N W P WGE+ T
Sbjct: 447 PH-DEKIMKTTVKNNEWKPTWGEEFT 471
>gnl|CDD|145887 pfam02973, Sialidase, Sialidase, N-terminal domain.
Length = 190
Score = 27.1 bits (60), Expect = 8.0
Identities = 16/80 (20%), Positives = 26/80 (32%), Gaps = 8/80 (10%)
Query: 75 GGLMSNAFETIISKLGGGLEGEKDYPYKGSNRACHLN-KEEIRVKIQSYVNVSSDETEMA 133
GG N I + G L GE Y + K + + IQS ++S+
Sbjct: 4 GGEYENNNINISNGQGYDLSGELGKLYVLEEGTIIMRFKSDKKNSIQSLFSISNSTA--- 60
Query: 134 KYLVKNGPMAVAINANAMQF 153
N + I + +
Sbjct: 61 ----GNEYFHIYITNSEIGL 76
>gnl|CDD|223291 COG0213, DeoA, Thymidine phosphorylase [Nucleotide transport and
metabolism].
Length = 435
Score = 27.2 bits (61), Expect = 8.3
Identities = 12/39 (30%), Positives = 20/39 (51%), Gaps = 5/39 (12%)
Query: 109 HLNKEEIRVKIQSYVNVSSDETEMAKYLVKNGPMAVAIN 147
L+KEEI I YVN + + +++ +L MA+
Sbjct: 16 ALSKEEIEFFINGYVNGTIPDYQISAFL-----MAIFFR 49
>gnl|CDD|219433 pfam07500, TFIIS_M, Transcription factor S-II (TFIIS), central
domain. Transcription elongation by RNA polymerase II
is regulated by the general elongation factor TFIIS.
This factor stimulates RNA polymerase II to transcribe
through regions of DNA that promote the formation of
stalled ternary complexes. TFIIS is composed of three
structural domains, termed I, II, and III. The two
C-terminal domains (II and III), this domain and
pfam01096 are required for transcription activity.
Length = 115
Score = 26.1 bits (58), Expect = 9.2
Identities = 7/32 (21%), Positives = 12/32 (37%), Gaps = 7/32 (21%)
Query: 17 AMFNHFLEKHNKSYATKEEYHKRLRIFRANLK 48
+F + T ++Y + R NLK
Sbjct: 40 ELFKLYGN-------TGKKYKNKYRSLMFNLK 64
>gnl|CDD|199985 TIGR00009, L28, ribosomal protein L28. This model describes
bacterial and chloroplast forms of the 50S ribosomal
protein L28, a polypeptide about 60 amino acids in
length. Mitochondrial homologs differ substantially in
architecture and are not included [Protein synthesis,
Ribosomal proteins: synthesis and modification].
Length = 56
Score = 24.9 bits (55), Expect = 9.4
Identities = 8/27 (29%), Positives = 15/27 (55%)
Query: 37 HKRLRIFRANLKKIQIRGEGTHLALKL 63
K R F+ NL+K ++ +G L++
Sbjct: 23 RKTKRRFKPNLQKKRVWVDGRFKRLRV 49
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.319 0.136 0.430
Gapped
Lambda K H
0.267 0.0764 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 11,901,916
Number of extensions: 1089515
Number of successful extensions: 1039
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1005
Number of HSP's successfully gapped: 39
Length of query: 240
Length of database: 10,937,602
Length adjustment: 94
Effective length of query: 146
Effective length of database: 6,768,326
Effective search space: 988175596
Effective search space used: 988175596
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 57 (25.7 bits)