RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy3960
(351 letters)
>gnl|CDD|239068 cd02248, Peptidase_C1A, Peptidase C1A subfamily (MEROPS database
nomenclature); composed of cysteine peptidases (CPs)
similar to papain, including the mammalian CPs
(cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain
is an endopeptidase with specific substrate preferences,
primarily for bulky hydrophobic or aromatic residues at
the S2 subsite, a hydrophobic pocket in papain that
accommodates the P2 sidechain of the substrate (the
second residue away from the scissile bond). Most
members of the papain subfamily are endopeptidases. Some
exceptions to this rule can be explained by specific
details of the catalytic domains like the occluding loop
in cathepsin B which confers an additional
carboxydipeptidyl activity and the mini-chain of
cathepsin H resulting in an N-terminal exopeptidase
activity. Papain-like CPs have different functions in
various organisms. Plant CPs are used to mobilize
storage proteins in seeds. Parasitic CPs act
extracellularly to help invade tissues and cells, to
hatch or to evade the host immune system. Mammalian CPs
are primarily lysosomal enzymes with the exception of
cathepsin W, which is retained in the endoplasmic
reticulum. They are responsible for protein degradation
in the lysosome. Papain-like CPs are synthesized as
inactive proenzymes with N-terminal propeptide regions,
which are removed upon activation. In addition to its
inhibitory role, the propeptide is required for proper
folding of the newly synthesized enzyme and its
stabilization in denaturing pH conditions. Residues
within the propeptide region also play a role in the
transport of the proenzyme to lysosomes or acidified
vesicles. Also included in this subfamily are proteins
classified as non-peptidase homologs, which lack
peptidase activity or have missing active site residues.
Length = 210
Score = 257 bits (658), Expect = 1e-85
Identities = 98/199 (49%), Positives = 128/199 (64%), Gaps = 5/199 (2%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F T GA+EGAY +K KL LS+Q L+DCS GNNGC+GG +++++
Sbjct: 17 DQGSCGSCWAFSTVGALEGAYAIKTGKLVSLSEQQLVDCSTS-GNNGCNGGNPDNAFEYV 75
Query: 211 MKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSVAI 270
GL ++ DY PY G+D C ++ A +TG+ NV P E+ALK ALA +GPVSVAI
Sbjct: 76 KNGGLASESDY-PYTGKDGTCKYNSSKVGAKITGYSNVPPGDEEALKAALANYGPVSVAI 134
Query: 271 DASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGY 330
DAS SF FY G+Y C S L+HAVL VGYG +G YW VKNSW T WG +GY
Sbjct: 135 DASS-SFQFYKGGIYSGPCC--SNTNLNHAVLLVGYGTENGVDYWIVKNSWGTSWGEKGY 191
Query: 331 VLMSIKDNNCGVMTAPTYV 349
+ ++ N CG+ + +Y
Sbjct: 192 IRIARGSNLCGIASYASYP 210
>gnl|CDD|215726 pfam00112, Peptidase_C1, Papain family cysteine protease.
Length = 213
Score = 254 bits (650), Expect = 2e-84
Identities = 93/202 (46%), Positives = 123/202 (60%), Gaps = 8/202 (3%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWI 210
DQ CGSCW+F GA+EG Y +K KL LS+Q L+DC G NNGC+GG ++++I
Sbjct: 18 DQGQCGSCWAFSAVGALEGRYCIKTGKLVSLSEQQLVDCDTG--NNGCNGGLPDNAFEYI 75
Query: 211 MK-HGLPTQDDYGPYLGQDAYCHI-ANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
K G+ T+ DY PY D C + + A + G+ +V N E+AL+ ALAK+GPVSV
Sbjct: 76 KKNGGIVTESDY-PYTAHDGTCKFKKSNSKYAKIKGYGDVPYNDEEALQAALAKNGPVSV 134
Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQ 328
AIDA + F Y +GVY +C+ LDHAVL VGYG +G PYW VKNSW T WG
Sbjct: 135 AIDAYEDDFQLYKSGVYKHTECSGE---LDHAVLIVGYGTENGVPYWIVKNSWGTDWGEN 191
Query: 329 GYVLMSIKDNNCGVMTAPTYVT 350
GY ++ N CG+ + +Y
Sbjct: 192 GYFRIARGVNECGIASEASYPI 213
>gnl|CDD|214761 smart00645, Pept_C1, Papain family cysteine protease.
Length = 175
Score = 174 bits (444), Expect = 7e-54
Identities = 83/211 (39%), Positives = 104/211 (49%), Gaps = 51/211 (24%)
Query: 144 KGYNSL-LDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGE 202
KG + DQ CGSCW+F TGA+EG Y +K KL LS+Q L+DCS G GN GC+GG
Sbjct: 10 KGAVTPVKDQGQCGSCWAFSATGALEGRYCIKTGKLVSLSEQQLVDCSGG-GNCGCNGGL 68
Query: 203 DFRSYQWIMKH-GLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALA 261
++++I K+ GL T+ Y PY G
Sbjct: 69 PDNAFEYIKKNGGLETESCY-PYTG----------------------------------- 92
Query: 262 KHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYGE--LDGKPYWQVKN 319
SVAIDAS F FY +G+Y C LDHAVL VGYG +GK YW VKN
Sbjct: 93 -----SVAIDAS--DFQFYKSGIYDHPGC--GSGTLDHAVLIVGYGTEVENGKDYWIVKN 143
Query: 320 SWSTYWGNQGYVLMSI-KDNNCGVMTAPTYV 349
SW T WG GY ++ K+N CG+ +
Sbjct: 144 SWGTDWGENGYFRIARGKNNECGIEASVASY 174
>gnl|CDD|185513 PTZ00203, PTZ00203, cathepsin L protease; Provisional.
Length = 348
Score = 125 bits (315), Expect = 4e-33
Identities = 82/232 (35%), Positives = 125/232 (53%), Gaps = 26/232 (11%)
Query: 129 YNKASKD--AIP--VRYEMKG-YNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQ 183
Y KA D A+P V + KG + +Q CGSCW+F G +E + + KL LS+
Sbjct: 116 YRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFSAVGNIESQWAVAGHKLVRLSE 175
Query: 184 QALIDCSWGYGNNGCDGGEDFRSYQWIMKH---GLPTQDDYGPYL---GQDAYCHIANTT 237
Q L+ C + +NGC GG ++++W++++ + T+ Y PY+ G C ++
Sbjct: 176 QQLVSCD--HVDNGCGGGLMLQAFEWVLRNMNGTVFTEKSY-PYVSGNGDVPECSNSSEL 232
Query: 238 AT-ATMTGFVNVTPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDG 296
A A + G+V++ +SE + LAK+GP+S+A+DAS SF Y +GV S G
Sbjct: 233 APGARIDGYVSM-ESSERVMAAWLAKNGPISIAVDAS--SFMSYHSGVL------TSCIG 283
Query: 297 --LDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAP 346
L+H VL VGY PYW +KNSW WG +GYV +++ N C + P
Sbjct: 284 EQLNHGVLLVGYNMTGEVPYWVIKNSWGEDWGEKGYVRVTMGVNACLLTGYP 335
>gnl|CDD|239110 cd02619, Peptidase_C1, C1 Peptidase family (MEROPS database
nomenclature), also referred to as the papain family;
composed of two subfamilies of cysteine peptidases
(CPs), C1A (papain) and C1B (bleomycin hydrolase).
Papain-like enzymes are mostly endopeptidases with some
exceptions like cathepsins B, C, H and X, which are
exopeptidases. Papain-like CPs have different functions
in various organisms. Plant CPs are used to mobilize
storage proteins in seeds while mammalian CPs are
primarily lysosomal enzymes responsible for protein
degradation in the lysosome. Papain-like CPs are
synthesized as inactive proenzymes with N-terminal
propeptide regions, which are removed upon activation.
Bleomycin hydrolase (BH) is a CP that detoxifies
bleomycin by hydrolysis of an amide group. It acts as a
carboxypeptidase on its C-terminus to convert itself
into an aminopeptidase and peptide ligase. BH is found
in all tissues in mammals as well as in many other
eukaryotes. It forms a hexameric ring barrel structure
with the active sites imbedded in the central channel.
Some members of the C1 family are proteins classified as
non-peptidase homologs which lack peptidase activity or
have missing active site residues.
Length = 223
Score = 120 bits (303), Expect = 2e-32
Identities = 62/201 (30%), Positives = 90/201 (44%), Gaps = 15/201 (7%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAV--LSQQALIDCSWGY---GNNGCDGGEDFR 205
+Q GSCW+F + A+E AY +K + LS Q L C+ N CDGG
Sbjct: 14 NQGSRGSCWAFASAYALESAYRIKGGEDEYVDLSPQYLYICANDECLGINGSCDGGGPLS 73
Query: 206 SYQWIMK-HGLPTQDDYGPYLGQD----AYCHIANTTATATMTGFVNVTPNSEDALKLAL 260
+ ++ G+P ++DY PY + A A + + V N+ + +K AL
Sbjct: 74 ALLKLVALKGIPPEEDY-PYGAESDGEEPKSEAALNAAKVKLKDYRRVLKNNIEDIKEAL 132
Query: 261 AKHGPVSVAIDASQKSFSFYVNGVYYDEK--CNNSPDGLDHAVLAVGYG--ELDGKPYWQ 316
AK GPV D +Y + D HAV+ VGY ++GK +
Sbjct: 133 AKGGPVVAGFDVYSGFDRLKEGIIYEEIVYLLYEDGDLGGHAVVIVGYDDNYVEGKGAFI 192
Query: 317 VKNSWSTYWGNQGYVLMSIKD 337
VKNSW T WG+ GY +S +D
Sbjct: 193 VKNSWGTDWGDNGYGRISYED 213
>gnl|CDD|240310 PTZ00200, PTZ00200, cysteine proteinase; Provisional.
Length = 448
Score = 122 bits (309), Expect = 2e-31
Identities = 68/201 (33%), Positives = 103/201 (51%), Gaps = 20/201 (9%)
Query: 151 DQSV-CGSCWSFGTTGAVEGAYYMKHKKLAV-LSQQALIDCSWGYGNNGCDGGEDFRSYQ 208
DQ + CGSCW+F + G+VE Y ++ +V LS+Q L++C + GC GG + +
Sbjct: 251 DQGLNCGSCWAFSSVGSVES-LYKIYRDKSVDLSEQELVNCD--TKSQGCSGGYPDTALE 307
Query: 209 WIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPNSEDALKLALAKHGPVSV 268
++ GL + D PYL +D C +++T + V +D L +L P V
Sbjct: 308 YVKNKGLSSSSDV-PYLAKDGKCVVSSTKKVYIDSYLVA---KGKDVLNKSLVI-SPTVV 362
Query: 269 AIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAV--GYGELDGKPYWQVKNSWSTYWG 326
I S + Y +GV Y+ +C L+HAVL V GY E K YW +KNSW T WG
Sbjct: 363 YIAVS-RELLKYKSGV-YNGECG---KSLNHAVLLVGEGYDEKTKKRYWIIKNSWGTDWG 417
Query: 327 NQGYVLM---SIKDNNCGVMT 344
GY+ + + + CG++T
Sbjct: 418 ENGYMRLERTNEGTDKCGILT 438
>gnl|CDD|240232 PTZ00021, PTZ00021, falcipain-2; Provisional.
Length = 489
Score = 122 bits (307), Expect = 5e-31
Identities = 74/214 (34%), Positives = 113/214 (52%), Gaps = 25/214 (11%)
Query: 142 EMKGYNSLLDQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGG 201
G + DQ CGSCW+F T G VE Y ++ +L LS+Q L+DCS + NNGC GG
Sbjct: 274 LHNGVTPVKDQKNCGSCWAFSTVGVVESQYAIRKNELVSLSEQELVDCS--FKNNGCYGG 331
Query: 202 EDFRSYQWIMK-HGLPTQDDYGPYLGQDA-YCHIANTTATATMTGFVNVTPNSEDALKLA 259
+++ +++ GL ++DDY PY+ C+I + +V++ ED K A
Sbjct: 332 LIPNAFEDMIELGGLCSEDDY-PYVSDTPELCNIDRCKEKYKIKSYVSI---PEDKFKEA 387
Query: 260 LAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG------ELDGKP 313
+ GP+SV+I S F+FY G+ +D +C P +HAV+ VGYG K
Sbjct: 388 IRFLGPISVSIAVSD-DFAFYKGGI-FDGECGEEP---NHAVILVGYGMEEIYNSDTKKM 442
Query: 314 ----YWQVKNSWSTYWGNQGYVLMSIKDNNCGVM 343
Y+ +KNSW WG +G+ + I+ + G+M
Sbjct: 443 EKRYYYIIKNSWGESWGEKGF--IRIETDENGLM 474
>gnl|CDD|239111 cd02620, Peptidase_C1A_CathepsinB, Cathepsin B group; composed of
cathepsin B and similar proteins, including
tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin
B is a lysosomal papain-like cysteine peptidase which is
expressed in all tissues and functions primarily as an
exopeptidase through its carboxydipeptidyl activity.
Together with other cathepsins, it is involved in the
degradation of proteins, proenzyme activation, Ag
processing, metabolism and apoptosis. Cathepsin B has
been implicated in a number of human diseases such as
cancer, rheumatoid arthritis, osteoporosis and
Alzheimer's disease. The unique carboxydipeptidyl
activity of cathepsin B is attributed to the presence of
an occluding loop in its active site which favors the
binding of the C-termini of substrate proteins. Some
members of this group do not possess the occluding loop.
TIN-Ag is an extracellular matrix basement protein which
was originally identified as a target Ag involved in
anti-tubular basement membrane antibody-mediated
interstitial nephritis. It plays a role in renal
tubulogenesis and is defective in hereditary
tubulointerstitial disorders. TIN-Ag is exclusively
expressed in kidney tissues. .
Length = 236
Score = 112 bits (282), Expect = 3e-29
Identities = 56/220 (25%), Positives = 83/220 (37%), Gaps = 39/220 (17%)
Query: 150 LDQSVCGSCWSFGTTGAV--------EGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGG 201
DQ CGSCW+F A G K+ +LS Q L+ C +GC+GG
Sbjct: 20 RDQGNCGSCWAFSAVEAFSDRLCIQSNG------KENVLLSAQDLLSCC-SGCGDGCNGG 72
Query: 202 EDFRSYQWIMKHGLPT------QDDYGPYLGQDAYCHIANTTATATMTGFVNVT------ 249
+++++ G+ T + + T T
Sbjct: 73 YPDAAWKYLTTTGVVTGGCQPYTIPPCGHHPEGPPPCCGTPYCTPKCQDGCEKTYEEDKH 132
Query: 250 --------PNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAV 301
P+ E + + +GPV A + F +Y +GVY + + G HAV
Sbjct: 133 KGKSAYSVPSDETDIMKEIMTNGPVQAAFTVYE-DFLYYKSGVY--QHTSGKQLGG-HAV 188
Query: 302 LAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCG 341
+G+G +G PYW NSW T WG GY + N CG
Sbjct: 189 KIIGWGVENGVPYWLAANSWGTDWGENGYFRILRGSNECG 228
>gnl|CDD|239149 cd02698, Peptidase_C1A_CathepsinX, Cathepsin X; the only
papain-like lysosomal cysteine peptidase exhibiting
carboxymonopeptidase activity. It can also act as a
carboxydipeptidase, like cathepsin B, but has been shown
to preferentially cleave substrates through a
monopeptidyl carboxypeptidase pathway. The propeptide
region of cathepsin X, the shortest among papain-like
peptidases, is covalently attached to the active site
cysteine in the inactive form of the enzyme. Little is
known about the biological function of cathepsin X. Some
studies point to a role in early tumorigenesis. A more
recent study indicates that cathepsin X expression is
restricted to immune cells suggesting a role in
phagocytosis and the regulation of the immune response.
Length = 239
Score = 112 bits (282), Expect = 3e-29
Identities = 55/193 (28%), Positives = 84/193 (43%), Gaps = 25/193 (12%)
Query: 155 CGSCWSFGTTGAVEGAYYMKHK---KLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIM 211
CGSCW+ G+T A+ + K LS Q +IDC+ G C GG+ Y++
Sbjct: 28 CGSCWAHGSTSALADRINIARKGAWPSVYLSVQVVIDCA---GGGSCHGGDPGGVYEYAH 84
Query: 212 KHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTPN-------------SEDALKL 258
KHG+P + PY +D C+ N T G N D +
Sbjct: 85 KHGIP-DETCNPYQAKDGECNPFNRCGTCNPFGECFAIKNYTLYFVSDYGSVSGRDKMMA 143
Query: 259 ALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLDHAVLAVGYG-ELDGKPYWQV 317
+ GP+S I A++ + Y GVY + + + H + G+G + +G YW V
Sbjct: 144 EIYARGPISCGIMATEALEN-YTGGVYKEYVQDPLIN---HIISVAGWGVDENGVEYWIV 199
Query: 318 KNSWSTYWGNQGY 330
+NSW WG +G+
Sbjct: 200 RNSWGEPWGERGW 212
>gnl|CDD|239112 cd02621, Peptidase_C1A_CathepsinC, Cathepsin C; also known as
Dipeptidyl Peptidase I (DPPI), an atypical papain-like
cysteine peptidase with chloride dependency and
dipeptidyl aminopeptidase activity, resulting from its
tetrameric structure which limits substrate access. Each
subunit of the tetramer is composed of three peptides:
the heavy and light chains, which together adopts the
papain fold and forms the catalytic domain; and the
residual propeptide region, which forms a beta barrel
and points towards the substrate's N-terminus. The
subunit composition is the result of the unique
characteristic of procathepsin C maturation involving
the cleavage of the catalytic domain and the
non-autocatalytic excision of an activation peptide
within its propeptide region. By removing N-terminal
dipeptide extensions, cathepsin C activates granule
serine peptidases (granzymes) involved in cell-mediated
apoptosis, inflammation and tissue remodelling.
Loss-of-function mutations in cathepsin C are associated
with Papillon-Lefevre and Haim-Munk syndromes, rare
diseases characterized by hyperkeratosis and early-onset
periodontitis. Cathepsin C is widely expressed in many
tissues with high levels in lung, kidney and placenta.
It is also highly expressed in cytotoxic lymphocytes and
mature myeloid cells.
Length = 243
Score = 105 bits (265), Expect = 9e-27
Identities = 68/226 (30%), Positives = 104/226 (46%), Gaps = 32/226 (14%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLA------VLSQQALIDCSWGYGNNGCDGGEDF 204
+Q CGSC++F + A+E + K +LS Q ++ CS Y + GCDGG F
Sbjct: 22 NQGGCGSCYAFASVYALEARIMIASNKTDPLGQQPILSPQHVLSCS-QY-SQGCDGGFPF 79
Query: 205 RSYQWIMKHGLPTQDDYGPYLGQDAYCHIAN--------TTATATMTGFVNVTPNSEDAL 256
++ G+ T+D + D C A+ + + G T +ED +
Sbjct: 80 LVGKFAEDFGIVTEDYFPYTADDDRPC-KASPSECRRYYFSDYNYVGGCYGCT--NEDEM 136
Query: 257 KLALAKHGPVSVAIDASQKSFSFYVNGVY----YDEKCNNSPDGL------DHAVLAVGY 306
K + ++GP+ VA + F FY GVY DE + D +HAVL VG+
Sbjct: 137 KWEIYRNGPIVVAFEVYS-DFDFYKEGVYHHTDNDEVSDGDNDNFNPFELTNHAVLLVGW 195
Query: 307 GE--LDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVT 350
GE + G+ YW VKNSW + WG +GY + N CG+ + +
Sbjct: 196 GEDEIKGEKYWIVKNSWGSSWGEKGYFKIRRGTNECGIESQAVFAY 241
>gnl|CDD|240244 PTZ00049, PTZ00049, cathepsin C-like protein; Provisional.
Length = 693
Score = 64.6 bits (157), Expect = 2e-11
Identities = 58/277 (20%), Positives = 100/277 (36%), Gaps = 82/277 (29%)
Query: 151 DQSVCGSCWSFGTTGAVE-----GAYYMKHKKLA-----VLSQQALIDCSWGYGNNGCDG 200
+Q +CGSC+ A + KK +LS Q ++ CS + + GC+G
Sbjct: 402 NQLLCGSCYIASQMYAFKRRIEIALTKNLDKKYLNNFDDLLSIQTVLSCS--FYDQGCNG 459
Query: 201 GEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMTGFVNVTP---------- 250
G + + G+P D PY + C + +M G N+
Sbjct: 460 GFPYLVSKMAKLQGIP-LDKVFPYTATEQTCPYQVDQSANSMNGSANLRQINAVFFSSET 518
Query: 251 ------------------------------------NSEDALKLALAKHGPVSVAIDASQ 274
N E + + ++GP+ + +AS
Sbjct: 519 QSDMHADFEAPISSEPARWYAKDYNYIGGCYGCNQCNGEKIMMNEIYRNGPIVASFEASP 578
Query: 275 KSFSFYVNGVYYDE------KCNNSP------------DGLDHAVLAVGYGE--LDGKP- 313
+ Y +GVYY E +C + ++HA++ VG+GE ++GK
Sbjct: 579 DFYD-YADGVYYVEDFPHARRCTVDLPKHNGVYNITGWEKVNHAIVLVGWGEEEINGKLY 637
Query: 314 -YWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYV 349
YW +NSW WG +GY + N G+ + ++
Sbjct: 638 KYWIGRNSWGKNWGKEGYFKIIRGKNFSGIESQSLFI 674
>gnl|CDD|227207 COG4870, COG4870, Cysteine protease [Posttranslational
modification, protein turnover, chaperones].
Length = 372
Score = 58.7 bits (142), Expect = 1e-09
Identities = 50/233 (21%), Positives = 75/233 (32%), Gaps = 40/233 (17%)
Query: 140 RYEMKGYNSLLDQSVCGSCWSFGTTGAVEGAYY---------MKHKKLAVLSQQALIDCS 190
R + + + DQ GSCW+F TT ++E K L +
Sbjct: 105 RRDEGKVSPVKDQGSGGSCWAFATTRSLESYLNPESAWDFSENNMKNLLGVP------YE 158
Query: 191 WGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGPYLGQDAYCHIANTTATATMT--GFVNV 248
G+ DGG S ++ + P + PY + N T + +
Sbjct: 159 KGFDYTSNDGGNADMSAAYLTEWSGPVYETDDPYSENSYFSP-TNLPVTKHVQEAQIIPS 217
Query: 249 TPNSEDALKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKC-----NNSPDGLDHAVLA 303
D + +V S S Y++ C +S + HAVL
Sbjct: 218 RKKYLDNGNIKAMFGFYGAV-------SSSMYIDATNSLGICIPYPYVDSGENWGHAVLI 270
Query: 304 VGYGEL----------DGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAP 346
VGY + G + +KNSW T WG GY +S G A
Sbjct: 271 VGYDDSFDINNFKYGPPGDGAFIIKNSWGTNWGENGYFWISYYYALNGDAEAL 323
>gnl|CDD|185641 PTZ00462, PTZ00462, Serine-repeat antigen protein; Provisional.
Length = 1004
Score = 55.5 bits (133), Expect = 2e-08
Identities = 61/217 (28%), Positives = 84/217 (38%), Gaps = 43/217 (19%)
Query: 151 DQSVCGSCWSFGTTGAVEGAYYMKHKKLAVLSQQALIDCSWGYGNNGCDGGEDFRSY-QW 209
DQ C W F + +E MK + +S + +CS G + CD G + + Q
Sbjct: 549 DQGNCAISWIFASKYHLETIKCMKGYEPHAISALYIANCSKGEHKDRCDEGSNPLEFLQI 608
Query: 210 IMKHG-LPTQDDYGPY----LGQDAYC-----HIANTTATATMTGFVNVTPNSEDA---- 255
I +G LP +Y Y +G+D C H N + PNS D
Sbjct: 609 IEDNGFLPADSNY-LYNYTKVGED--CPDEEDHWMNLLDHGKILNHNKKEPNSLDGKAYR 665
Query: 256 -----------------LKLALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDGLD 298
+K + G V I A + + NG C + D D
Sbjct: 666 AYESEHFHDKMDAFIKIIKDEIMNKGSVIAYIKA-ENVLGYEFNGKKVQNLCGD--DTAD 722
Query: 299 HAVLAVGYG-----ELDGKPYWQVKNSWSTYWGNQGY 330
HAV VGYG E + K YW V+NSW YWG++GY
Sbjct: 723 HAVNIVGYGNYINDEDEKKSYWIVRNSWGKYWGDEGY 759
>gnl|CDD|240381 PTZ00364, PTZ00364, dipeptidyl-peptidase I precursor; Provisional.
Length = 548
Score = 52.2 bits (125), Expect = 2e-07
Identities = 49/229 (21%), Positives = 82/229 (35%), Gaps = 41/229 (17%)
Query: 155 CGSCWSFGTTGAVEGAYYM----KHKKLA---VLSQQALIDCSWGYGNNGCDGGEDFRSY 207
C S + A+ A M + L LS + ++DCS YG GC GG
Sbjct: 232 CNSSYVEAALAAMM-ARVMVASNRTDPLGQQTFLSARHVLDCS-QYGQ-GCAGGFPEEVG 288
Query: 208 QWIMKHGLPTQDDYG-PYLGQDAYCHIANTTATATMTGFVNVTP--------NSEDALKL 258
++ G+ T D Y PY D T + F N P D +
Sbjct: 289 KFAETFGILTTDSYYIPYDSGDGVERACKTRRPSRRYYFTNYGPLGGYYGAVTDPDEIIW 348
Query: 259 ALAKHGPVSVAIDASQKSFSFYVNGVYYDEKCNNSPDG-----------------LDHAV 301
+ +HGPV ++ A+ ++ + ++ S D ++H V
Sbjct: 349 EIYRHGPVPASVYANSDWYN--CDENSTEDVRYVSLDDYSTASADRPLRHYFASNVNHTV 406
Query: 302 LAVGYGELD-GKPYWQVKNSWSTY--WGNQGYVLMSIKDNNCGVMTAPT 347
L +G+G + G YW V + W + W + G ++ N + +
Sbjct: 407 LIIGWGTDENGGDYWLVLDPWGSRRSWCDGGTRKIARGVNAYNIESEVV 455
>gnl|CDD|202517 pfam03051, Peptidase_C1_2, Peptidase C1-like family. This family
is closely related to the Peptidase_C1 family pfam00112,
containing several prokaryotic and eukaryotic
aminopeptidases and bleomycin hydrolases.
Length = 438
Score = 35.4 bits (82), Expect = 0.043
Identities = 15/40 (37%), Positives = 22/40 (55%), Gaps = 2/40 (5%)
Query: 297 LDHAVLAVGYGELDGKPY--WQVKNSWSTYWGNQGYVLMS 334
+ HA++ G E D W+V+NSW G +GY +MS
Sbjct: 359 MTHAMVLTGVDEDDDGKPTKWKVENSWGDDSGKKGYFVMS 398
>gnl|CDD|222715 pfam14371, DUF4412, Domain of unknown function (DUF4412). This
presumed domain is functionally uncharacterized. This
domain family is found in bacteria, archaea and
eukaryotes, and is typically between 75 and 104 amino
acids in length.
Length = 74
Score = 32.3 bits (74), Expect = 0.056
Identities = 12/50 (24%), Positives = 18/50 (36%), Gaps = 7/50 (14%)
Query: 95 TYIGEELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKASKDAIPVRYEMK 144
T +G E + G EK+R+ LWV P++ E
Sbjct: 2 TKLGTETIAGYKCEKYRVTDGSTG---TAELWV----TPDLGFPLKTESI 44
>gnl|CDD|226107 COG3579, PepC, Aminopeptidase C [Amino acid transport and
metabolism].
Length = 444
Score = 32.5 bits (74), Expect = 0.32
Identities = 14/40 (35%), Positives = 21/40 (52%), Gaps = 2/40 (5%)
Query: 297 LDHAVLAVGYGEL-DGKPY-WQVKNSWSTYWGNQGYVLMS 334
+ HA++ G G P W+V+NSW G +GY + S
Sbjct: 361 MTHAMVLTGVDLDETGNPLRWKVENSWGKDVGKKGYFVAS 400
>gnl|CDD|234540 TIGR04318, lacto_ODC_hypo, putative ornithine decarboxylase. In at
least ten species of Lactobacillus, this close homolog
to known ornithine decarboxylase occurs in a three-gene
neighborhood, along with an amino acid permease family
transporter and pyridoxal phosphate-dependent enyzme
from the cystathionine gamma-lyase family. Species
include L. acidophilus, L. amylovorus, L. crispatus, L.
delbrueckii, L. farciminis, L. helveticus, L. johnsonii,
etc. The combination of a decarboxylase with an
antiporter in a two-gene system suggests a
decarboxylation/antiport proton-motive cycle for
transient resistance to acidic conditions. The substrate
for this decarboxylase might be ornithine but is
unknown.
Length = 695
Score = 32.4 bits (74), Expect = 0.34
Identities = 19/48 (39%), Positives = 21/48 (43%), Gaps = 10/48 (20%)
Query: 176 KKLAVLSQQALIDCSWGYGNNGCDGGEDFRSYQWIMKHGLPTQDDYGP 223
K+ L L DC+WG G E F IMKH P Q YGP
Sbjct: 287 DKIGKLCDYILFDCAWG-------GYEQFVP---IMKHLSPLQLTYGP 324
>gnl|CDD|187859 cd09728, Csx1_III-U, CRISPR/Cas system-associated protein Csx1.
CRISPR (Clustered Regularly Interspaced Short
Palindromic Repeats) and associated Cas proteins
comprise a system for heritable host defense by
prokaryotic cells against phage and other foreign DNA;
Protein of this family often fused to HTH domain; Some
proteins could have an additional fusion with
RecB-family nuclease domain; Core domain appears to have
a Rossmann-like fold; loosely associated with CRISPR/Cas
systems; also known as DxTHG family.
Length = 400
Score = 32.0 bits (73), Expect = 0.50
Identities = 16/105 (15%), Positives = 35/105 (33%), Gaps = 9/105 (8%)
Query: 47 CFHAYGIPDATIEPQSVLPDVSDFKVNIYRLFFLRPRFHENEKIRYN---WTYIGEELVN 103
+ + I TI P + + + + +N Y+ + + + + L N
Sbjct: 180 PLNIHEIESETIAPVTEISYLKEKFINKYKDNNKDLKRYFKKALNEGLEKIIKFASALYN 239
Query: 104 GIIL------EKWRLVTSEGEKVSKYSLWVRYNKASKDAIPVRYE 142
G+ L + E + + NK ++ I V+ E
Sbjct: 240 GLPLYLLYLKPSIEELREPLEFKLEIIEIIESNKNQEEEIKVKRE 284
>gnl|CDD|238328 cd00585, Peptidase_C1B, Peptidase C1B subfamily (MEROPS database
nomenclature); composed of eukaryotic bleomycin
hydrolases (BH) and bacterial aminopeptidases C (pepC).
The proteins of this subfamily contain a large insert
relative to the C1A peptidase (papain) subfamily. BH is
a cysteine peptidase that detoxifies bleomycin by
hydrolysis of an amide group. It acts as a
carboxypeptidase on its C-terminus to convert itself
into an aminopeptidase and peptide ligase. BH is found
in all tissues in mammals as well as in many other
eukaryotes. Bleomycin, a glycopeptide derived from the
fungus Streptomyces verticullus, is an effective
anticancer drug due to its ability to induce DNA strand
breaks. Human BH is the major cause of tumor cell
resistance to bleomycin chemotherapy, and is also
genetically linked to Alzheimer's disease. In addition
to its peptidase activity, the yeast BH (Gal6) binds DNA
and acts as a repressor in the Gal4 regulatory system.
BH forms a hexameric ring barrel structure with the
active sites imbedded in the central channel. The
bacterial homolog of BH, called pepC, is a cysteine
aminopeptidase possessing broad specificity. Although
its crystal structure has not been solved, biochemical
analysis shows that pepC also forms a hexamer. .
Length = 437
Score = 31.0 bits (71), Expect = 0.83
Identities = 17/38 (44%), Positives = 23/38 (60%), Gaps = 2/38 (5%)
Query: 299 HAVLAVGYGE-LDGKPY-WQVKNSWSTYWGNQGYVLMS 334
HA++ G DGKP W+V+NSW G +GY +MS
Sbjct: 360 HAMVLTGVDLDEDGKPVKWKVENSWGEKVGKKGYFVMS 397
>gnl|CDD|129242 TIGR00136, gidA, glucose-inhibited division protein A. GidA, the
longer of two forms of GidA-related proteins, appears to
be present in all complete eubacterial genomes so far,
as well as Saccharomyces cerevisiae. A subset of these
organisms have a closely related protein. GidA is absent
in the Archaea. It appears to act with MnmE, in an
alpha2/beta2 heterotetramer, in the
5-carboxymethylaminomethyl modification of uridine 34 in
certain tRNAs. The shorter, related protein, previously
called gid or gidA(S), is now called TrmFO (see model
TIGR00137) [Protein synthesis, tRNA and rRNA base
modification].
Length = 617
Score = 30.8 bits (70), Expect = 1.4
Identities = 15/51 (29%), Positives = 23/51 (45%), Gaps = 7/51 (13%)
Query: 96 YIG---EELVNGIILEKWRLVTSEGEKVSKYSLWVRYNKASKDAIPVRYEM 143
YIG ++LV E +R+ TS E Y L +R + A + E+
Sbjct: 404 YIGVLIDDLVTKGTKEPYRMFTSRAE----YRLLLREDNADFRLTEIGREL 450
>gnl|CDD|237284 PRK13108, PRK13108, prolipoprotein diacylglyceryl transferase;
Reviewed.
Length = 460
Score = 29.2 bits (65), Expect = 4.0
Identities = 12/33 (36%), Positives = 19/33 (57%), Gaps = 1/33 (3%)
Query: 297 LDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQG 329
D A+ AV +G + G+ Y + W TY+G+ G
Sbjct: 56 YDIALWAVPFGLIGGRLY-HLATDWRTYFGDGG 87
>gnl|CDD|215341 PLN02635, PLN02635, disproportionating enzyme.
Length = 538
Score = 29.0 bits (65), Expect = 4.2
Identities = 16/45 (35%), Positives = 18/45 (40%), Gaps = 13/45 (28%)
Query: 200 GGEDFRSYQWIMKHG------LP-------TQDDYGPYLGQDAYC 231
G E FR W+ G LP +D PY GQDA C
Sbjct: 49 GDEAFRFLDWLASTGCSVWQVLPLVPPGRKGGEDGSPYSGQDANC 93
>gnl|CDD|182581 PRK10600, PRK10600, nitrate/nitrite sensor protein NarX;
Provisional.
Length = 569
Score = 28.9 bits (65), Expect = 5.5
Identities = 9/16 (56%), Positives = 12/16 (75%)
Query: 327 NQGYVLMSIKDNNCGV 342
NQ V +S++DN CGV
Sbjct: 497 NQNQVKLSVQDNGCGV 512
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.318 0.134 0.422
Gapped
Lambda K H
0.267 0.0789 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 17,628,175
Number of extensions: 1642845
Number of successful extensions: 1282
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1238
Number of HSP's successfully gapped: 30
Length of query: 351
Length of database: 10,937,602
Length adjustment: 98
Effective length of query: 253
Effective length of database: 6,590,910
Effective search space: 1667500230
Effective search space used: 1667500230
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 59 (26.4 bits)