RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy768
(475 letters)
>gnl|CDD|216503 pfam01435, Peptidase_M48, Peptidase family M48.
Length = 223
Score = 107 bits (270), Expect = 6e-27
Identities = 61/218 (27%), Positives = 86/218 (39%), Gaps = 44/218 (20%)
Query: 227 KRVGAVVKRLIDANKVYMEHNNFKYP-ITIIDDPLINAFVFPDG---RIFMFTGMFQLCQ 282
R+ VV+RL D+ F + ++D P NAF G R+ + TG+
Sbjct: 32 ARLQRVVERLADS-----AGAGFPPWEVYVVDSPQPNAFALGGGKNKRVVVTTGLLL-AL 85
Query: 283 TDDELATVLSHELSHTLLKHVAEKLSNKTFLEILYIVPLMIIWFLLPDLGAIVTQVLLDD 342
T+DELA VL HE+ H +H E +S ++ +LL
Sbjct: 86 TEDELAAVLGHEIGHIKARHSVESMS-----------------------QGLLLNLLLLL 122
Query: 343 LKSIIFELPFEREMETEADESIIF-----ELPFEREMETEADEVGLKL-----MARACYD 392
+ + IF LP+ R+ E EADE G +L +ARA Y
Sbjct: 123 GAAALGGRALGFNANGFLTALGIFLLQLLLLPYSRKQEYEADEAGARLGGDKDLARAGYK 182
Query: 393 VRVAPLFWQKMALKETQDQVGPKM-EEYLSTHPSHENR 429
R A F K+A + G K+ E LSTHP R
Sbjct: 183 PRAAVKFLAKLAAENLSRVSGGKLYPELLSTHPPLVER 220
>gnl|CDD|227122 COG4783, COG4783, Putative Zn-dependent protease, contains TPR
repeats [General function prediction only].
Length = 484
Score = 89.4 bits (222), Expect = 5e-19
Identities = 49/194 (25%), Positives = 84/194 (43%), Gaps = 32/194 (16%)
Query: 251 YPITIIDDPLINAFVFPDGRIFMFTGMFQLCQTDDELATVLSHELSHTLLKHVAEKLSNK 310
+ +++D INAF P G + + TG+ + + ELA V++HE+ H +H+A
Sbjct: 94 FTFFLVNDDSINAFATPGGYVVVNTGLLLTAENESELAGVIAHEIGHVAQRHLA------ 147
Query: 311 TFLEILYIVPLMIIWFLLPDLGAIV-----TQVLLDDLKSIIFELPFEREMETEADESII 365
+E M I +L + A + + A
Sbjct: 148 RSMEQQQRAAPMAIAGMLLGILAALAGADAGMAGI-----------------AGALAGAA 190
Query: 366 -FELPFEREMETEADEVGLKLMARACYDVRVAPLFWQKMALKETQDQVGPKMEEYLSTHP 424
+L F R+ E EAD +G+ + RA YD + P F++++A + G + EYL THP
Sbjct: 191 QGQLNFSRQNEQEADRIGITTLVRAGYDPQGMPEFFERLADQLRY---GGQPPEYLLTHP 247
Query: 425 SHENRANNLESKMK 438
E R +L ++ +
Sbjct: 248 LPEERIADLRNRAE 261
>gnl|CDD|227123 COG4784, COG4784, Putative Zn-dependent protease [General function
prediction only].
Length = 479
Score = 85.7 bits (212), Expect = 6e-18
Identities = 51/206 (24%), Positives = 83/206 (40%), Gaps = 37/206 (17%)
Query: 231 AVVKRLIDANKVYMEHNNFKYPITIIDDPLINAFVFPDGRIFMFTGMFQLCQTDDELATV 290
+V R++ A E+ Y ITI++ P INAF P G +++ G+ L E+A V
Sbjct: 68 RMVARIVGALTAVSENPQQTYRITILNSPNINAFALPGGYLYITRGLLALANDSSEVAAV 127
Query: 291 LSHELSHTLLKHVAEKLSNKTFLEILYIVPLMIIWFLLPDLGAIVTQVLLDDLKSIIFEL 350
L+HE+ H H ++ + I +V +VL D
Sbjct: 128 LAHEMGHVTANHGIQRQQREAAEVI---------------ASRVVAEVLGSDAA------ 166
Query: 351 PFEREMETEADESII----FELPFEREMETEADEVGLKLMARACYDVRVAPLFWQKMALK 406
+++I F R E +AD +G+K++ A YD A F Q MA
Sbjct: 167 ---------GKQALIRGKLRLAQFSRNQELQADAIGIKMLGEAGYDPYAAARFLQSMAAY 217
Query: 407 ETQDQVGPKMEE---YLSTHPSHENR 429
V ++ +L++HP+ R
Sbjct: 218 TDFRSVSGAADQSLDFLASHPNTPQR 243
>gnl|CDD|223575 COG0501, HtpX, Zn-dependent protease with chaperone function
[Posttranslational modification, protein turnover,
chaperones].
Length = 302
Score = 82.7 bits (204), Expect = 2e-17
Identities = 47/201 (23%), Positives = 72/201 (35%), Gaps = 27/201 (13%)
Query: 249 FKYPITIIDDPLINAFVFP----DGRIFMFTGMFQLCQTDDELATVLSHELSHTLLKHVA 304
+ I++ P NAF +GR+ + TG+ L DDEL VL+HEL H +H
Sbjct: 116 HMPEVYILETPQPNAFALGGGPKNGRVVVTTGLLDL-LNDDELEAVLAHELGHIKNRHT- 173
Query: 305 EKLSNKTFLEILYIVPLMIIWFLLPDLGAIVTQVLLDDLKSIIFELPFEREMETEADESI 364
L T +L +++ L + L L ++
Sbjct: 174 --LVRLTLRGLLASAFVLLATLALAAGLLGEAALALLLLLLLLALFLAT----------- 220
Query: 365 IFELPFEREMETEADEVGLKL-----MARACYD-VRVAPLFWQKMALKETQDQVGPKMEE 418
+ L F R+ E EAD KL +A A R++ K + G +++
Sbjct: 221 LLVLAFSRKREYEADRFAAKLTGPEKLASALQKLARLSGRANSKAFI--ASGFSGGRLQA 278
Query: 419 YLSTHPSHENRANNLESKMKE 439
STHP R L
Sbjct: 279 LFSTHPPLAERIAALRQLALT 299
>gnl|CDD|234931 PRK01265, PRK01265, heat shock protein HtpX; Provisional.
Length = 324
Score = 34.7 bits (80), Expect = 0.086
Identities = 23/86 (26%), Positives = 36/86 (41%), Gaps = 11/86 (12%)
Query: 245 EHNNFKYP-ITIIDDPLINAFVFPD---GRIFMFTGMFQLCQTDDELATVLSHELSHTLL 300
++N + P + I D P NAF + G+ T DE+ V HEL H L
Sbjct: 94 KYNGIRVPKVYIADVPFPNAFAYGSPIAGKRIAITLPLLKILNRDEIKAVAGHELGH--L 151
Query: 301 KHVAEKLSNKTFLEILYIVPLMIIWF 326
KH + L + ++P +I +
Sbjct: 152 KH-----RDVELLMAIGLIPTLIYYL 172
>gnl|CDD|234944 PRK01345, PRK01345, heat shock protein HtpX; Provisional.
Length = 317
Score = 32.3 bits (74), Expect = 0.51
Identities = 19/51 (37%), Positives = 29/51 (56%), Gaps = 9/51 (17%)
Query: 253 ITIIDDPLINAFVFPDGR------IFMFTGMFQLCQTDDELATVLSHELSH 297
+ IID+P NAF GR + TG+ Q + +E+A V++HEL+H
Sbjct: 87 VYIIDNPQPNAFA--TGRNPENAAVAATTGLLQRL-SPEEVAGVMAHELAH 134
>gnl|CDD|235186 PRK03982, PRK03982, heat shock protein HtpX; Provisional.
Length = 288
Score = 31.9 bits (73), Expect = 0.71
Identities = 25/105 (23%), Positives = 49/105 (46%), Gaps = 22/105 (20%)
Query: 275 TGMFQLCQTDDELATVLSHELSH-----TLLKHVAEKLSNKTFLEILYIVPLM--IIWFL 327
G+ L +DEL V++HEL+H TL++ +A L+ I+Y+ + +WF
Sbjct: 114 EGILNLL-NEDELEGVIAHELTHIKNRDTLIQTIAATLAGA----IMYLAQWLSWGLWFG 168
Query: 328 LPD---------LGAIVTQVLLDDLKSIIFELPFEREMETEADES 363
+G+++ ++L + + + + R+ E ADE
Sbjct: 169 GGGRDDRNGGNPIGSLLL-IILAPIAATLIQFAISRQREFSADEG 212
>gnl|CDD|183649 PRK12649, PRK12649, putative monovalent cation/H+ antiporter
subunit A; Reviewed.
Length = 789
Score = 31.7 bits (72), Expect = 0.88
Identities = 15/34 (44%), Positives = 23/34 (67%), Gaps = 2/34 (5%)
Query: 317 YIVPLMIIWFLLPDLGAIVTQVLLDDLKSIIFEL 350
Y+V L+ I+ PDL +TQ+L++ L +IIF L
Sbjct: 653 YLVSLLFIYLKAPDLA--LTQILVETLSTIIFLL 684
>gnl|CDD|199913 cd11007, M35_like_1, Peptidase M35-like domain of uncharacterized
proteins. This family contains proteins similar to the
M35 Zn2+-metallopeptidases, also known as the
deuterolysin family, presumably these are bacterial
metalloendopeptidases that have yet to be characterized.
Typically, members of this family of extracellular
peptidases contain a unique zinc-binding motif (the
aspzincin motif), defined by the HExxH + D motif where
an aspartic acid is the third zinc ligand; however,
members of this family do not contain the GTXDXXYG motif
C-terminal to the His zinc ligands that is typical for
the M35 proteases. Deuterolysins are highly active
towards basic nuclear proteins such as histones and
protamines, with a preference for a Lys or Arg residue
in the P1' subsite. MEPs specifically cleave
peptidyl-lysine bonds (-X-Lys-) in proteins and
peptides. Many members of the M35 peptidases display
unusual thermostabilities.
Length = 183
Score = 30.5 bits (69), Expect = 1.2
Identities = 19/81 (23%), Positives = 30/81 (37%), Gaps = 11/81 (13%)
Query: 226 YKRVGAVVKRLIDANKVYMEHNNFKYPITIIDDPLIN--AFVFPDGRI---FMFTGMFQL 280
K++ AV+ L N V ++ K + + AFV I +F +
Sbjct: 62 LKKILAVLNSLTPENFVR--SDSEKDTCGCVPNDGDGEAAFVCGPDTISIHVIFCELGAF 119
Query: 281 C----QTDDELATVLSHELSH 297
D A+ + HELSH
Sbjct: 120 FDAASNGLDSQASTIIHELSH 140
>gnl|CDD|189011 cd09604, M1_APN_5, Peptidase M1 family containing bacterial
Aminopeptidase N. This family contains bacterial
aminopeptidase N (APN; CD13; Alanyl aminopeptidase; EC
3.4.11.2), a Type II integral membrane protease
belonging to the M1 gluzincin family. APN consists of a
small N-terminal cytoplasmic domain, a single
transmembrane domain and a large extracellular
ectodomain that contains the active site. It
preferentially cleaves neutral amino acids from the
N-terminus of oligopeptides and, in higher eukaryotes,
is present in a variety of human tissues and cell types
(leukocyte, fibroblast, endothelial and epithelial
cells). APN expression is dysregulated in inflammatory
diseases such as chronic pain, rheumatoid arthritis,
multiple sclerosis, systemic sclerosis, systemic lupus
erythematosus, polymyositis/dermatomyosytis and
pulmonary sarcoidosis, and is enhanced in tumor cells
such as melanoma, renal, prostate, pancreas, colon,
gastric and thyroid cancers. It is predominantly
expressed on stem cells and on cells of the granulocytic
and monocytic lineages at distinct stages of
differentiation, thus considered a marker of
differentiation. Thus, APN inhibition may lead to the
development of anti-cancer and anti-inflammatory drugs.
APNs are also present in many pathogenic bacteria and
represent potential drug targets, Some APNs have been
used commercially, such as one from Lactococcus lactis
used in the food industry. APN also serves as a receptor
for coronaviruses, although the virus receptor
interaction site seems to be distinct from the enzymatic
site and aminopeptidase activity is not necessary for
viral infection. APNs have also been extensively studied
as putative Cry toxin receptors. Cry1 proteins are
pore-forming toxins that bind to the midgut epithelial
cell membrane of susceptible insect larvae, causing
extensive damage. Several different toxins, including
Cry1Aa, Cry1Ab, Cry1Ac, Cry1Ba, Cry1Ca and Cry1Fa, have
been shown to bind to APNs; however, a direct role of
APN in cytotoxicity has been yet to be firmly
established.
Length = 435
Score = 31.2 bits (71), Expect = 1.4
Identities = 28/122 (22%), Positives = 48/122 (39%), Gaps = 18/122 (14%)
Query: 186 TGRQKFIIVKPNQLNDVTQIAYDNFIEEH----GNQVLPLGHPAYKRVGAVVKRLIDANK 241
G + + N D A FI + G +V P + KR +DA K
Sbjct: 185 GGTKTWEFEAENV-RDFALAASPKFIVDEATVDGIKVRAYYFPEDA---ELAKRYLDAAK 240
Query: 242 VYMEHNN---FKYP---ITIIDDPLINAFVFPDGRIFMFTGMFQLCQTDDELATVLSHEL 295
+E + YP +++++P +P G + + +L L TVL+HE+
Sbjct: 241 KALEFYSELFGPYPYKEFSVVENPFPGGMEYP-GLTLIGGRVLRL---PFILETVLAHEI 296
Query: 296 SH 297
+H
Sbjct: 297 AH 298
>gnl|CDD|179418 PRK02391, PRK02391, heat shock protein HtpX; Provisional.
Length = 296
Score = 30.7 bits (70), Expect = 1.4
Identities = 18/51 (35%), Positives = 26/51 (50%), Gaps = 9/51 (17%)
Query: 253 ITIIDDPLINAFVFPDGR------IFMFTGMFQLCQTDDELATVLSHELSH 297
+ + D + NAF GR + + TG+ + DEL VL+HELSH
Sbjct: 96 VAVADSDVPNAFA--TGRSPKNAVVCVTTGLMRRL-DPDELEAVLAHELSH 143
>gnl|CDD|227430 COG5099, COG5099, RNA-binding protein of the Puf family,
translational repressor [Translation, ribosomal
structure and biogenesis].
Length = 777
Score = 30.9 bits (70), Expect = 2.0
Identities = 19/144 (13%), Positives = 43/144 (29%), Gaps = 33/144 (22%)
Query: 49 QDTIGCILSCDKKYSTCLNFQITGHQDKHLTRVLFHQSANHQLQVFNIRVLRCFHTSQPK 108
Q I + + + + + +++ Q+ NH +Q +C K
Sbjct: 529 QKAIDIVSTDIQ------ISLLVEELRPYCLQLIKDQNGNHVIQ-------KCIEK-FNK 574
Query: 109 NALPGFIYLIFKPILR--------VVGAIVIARISRKWWSKLSPDQKKIILSQIKKHQDK 160
IF I G+ V+ R + + K+ ++ +I +
Sbjct: 575 EKNQF----IFDSINENLYDLSTHRYGSRVVQRCLENC----NSEDKENLVEEIISNSKY 626
Query: 161 IAGITLTSIGLAYIYYFLHLETCP 184
++ G + + L P
Sbjct: 627 LS---QDQYGNYVVQHILDNGAEP 647
>gnl|CDD|235478 PRK05457, PRK05457, heat shock protein HtpX; Provisional.
Length = 284
Score = 30.1 bits (69), Expect = 2.3
Identities = 54/232 (23%), Positives = 76/232 (32%), Gaps = 97/232 (41%)
Query: 255 IIDDPLINAFVFPDGRIFMFTGMFQ----------LCQ--TDDELATVLSHELSH----- 297
I P INAF TG + L Q + DE+ VL+HE+SH
Sbjct: 99 IYHSPEINAFA---------TGASKNNSLVAVSTGLLQNMSRDEVEAVLAHEISHIANGD 149
Query: 298 ----TLLKHVAEKLSNKTFLEILYIVPLMIIWFLLPDLGAIVTQVLLDD---------LK 344
TL++ V N TF + FL + IV + + + +
Sbjct: 150 MVTMTLIQGVV----N-TF-----------VIFLSRIIAQIVDRFVSGNEEGNGIGYFIV 193
Query: 345 SIIFELPFEREMETEADESI---IFELPFEREMETEADEVGLKLMARACYDVRVAPLFWQ 401
SI+ E+ F I I + F R E AD G KL R +
Sbjct: 194 SIVLEIVF----------GILASIIVMWFSRHREFRADAGGAKLAGR------------E 231
Query: 402 KM--ALK--------ETQDQV-------GPKMEEYLSTHPSHENRANNLESK 436
KM AL+ + + + E +HP E R L S
Sbjct: 232 KMIAALQRLKTSYEPQLPGSMAAFGINGKSGLSELFMSHPPLEKRIAALRSG 283
>gnl|CDD|143397 cd07078, ALDH, NAD(P)+ dependent aldehyde dehydrogenase family.
The aldehyde dehydrogenase family (ALDH) of NAD(P)+
dependent enzymes, in general, oxidize a wide range of
endogenous and exogenous aliphatic and aromatic
aldehydes to their corresponding carboxylic acids and
play an important role in detoxification. Besides
aldehyde detoxification, many ALDH isozymes possess
multiple additional catalytic and non-catalytic
functions such as participating in metabolic pathways,
or as binding proteins, or as osmoregulants, to mention
a few. The enzyme has three domains, a NAD(P)+
cofactor-binding domain, a catalytic domain, and a
bridging domain; and the active enzyme is generally
either homodimeric or homotetrameric. The catalytic
mechanism is proposed to involve cofactor binding,
resulting in a conformational change and activation of
an invariant catalytic cysteine nucleophile. The
cysteine and aldehyde substrate form an oxyanion
thiohemiacetal intermediate resulting in hydride
transfer to the cofactor and formation of a
thioacylenzyme intermediate. Hydrolysis of the
thioacylenzyme and release of the carboxylic acid
product occurs, and in most cases, the reduced cofactor
dissociates from the enzyme. The evolutionary
phylogenetic tree of ALDHs appears to have an initial
bifurcation between what has been characterized as the
classical aldehyde dehydrogenases, the ALDH family
(ALDH) and extended family members or aldehyde
dehydrogenase-like (ALDH-like) proteins. The ALDH
proteins are represented by enzymes which share a number
of highly conserved residues necessary for catalysis and
cofactor binding and they include such proteins as
retinal dehydrogenase, 10-formyltetrahydrofolate
dehydrogenase, non-phosphorylating glyceraldehyde
3-phosphate dehydrogenase,
delta(1)-pyrroline-5-carboxylate dehydrogenases,
alpha-ketoglutaric semialdehyde dehydrogenase,
alpha-aminoadipic semialdehyde dehydrogenase, coniferyl
aldehyde dehydrogenase and succinate-semialdehyde
dehydrogenase. Included in this larger group are all
human, Arabidopsis, Tortula, fungal, protozoan, and
Drosophila ALDHs identified in families ALDH1 through
ALDH22 with the exception of families ALDH18, ALDH19,
and ALDH20 which are present in the ALDH-like group.
Length = 432
Score = 29.9 bits (68), Expect = 3.0
Identities = 10/41 (24%), Positives = 23/41 (56%), Gaps = 4/41 (9%)
Query: 128 AIVIARISRKWWSKLSPDQKKIIL----SQIKKHQDKIAGI 164
A+ AR + K W+ L P ++ IL +++ ++++A +
Sbjct: 3 AVAAARAAFKAWAALPPAERAAILRKLADLLEERREELAAL 43
>gnl|CDD|221619 pfam12520, DUF3723, Protein of unknown function (DUF3723). This
family of proteins is found in eukaryotes. Proteins in
this family are typically between 374 and 1069 amino
acids in length. There is a conserved LGF sequence
motif.
Length = 506
Score = 30.0 bits (68), Expect = 3.1
Identities = 7/29 (24%), Positives = 14/29 (48%)
Query: 136 RKWWSKLSPDQKKIILSQIKKHQDKIAGI 164
R+WW++LS K ++K++
Sbjct: 122 RRWWARLSSSNKARRRRLLQKNRRLRNAF 150
>gnl|CDD|204379 pfam10023, DUF2265, Predicted aminopeptidase (DUF2265). Members of
this family of bacterial proteins comprise various
hypothetical proteins and putative aminopeptidases.
Their exact function, has not, as yet, been defined.
Length = 338
Score = 29.1 bits (66), Expect = 4.6
Identities = 12/41 (29%), Positives = 19/41 (46%), Gaps = 15/41 (36%)
Query: 257 DDPLINAFVFPDGRIFMFTGMFQLCQTDDELATVLSHELSH 297
+DPL++ F+ D ELA ++ HEL+H
Sbjct: 151 NDPLLSTFIRYP---------------DGELARLIFHELAH 176
>gnl|CDD|234514 TIGR04231, seadorna_VP5, seadornavirus VP5 protein. This protein
family occurs in the seadornavirus virus group, with
designations VP5 in Banna virus, and VP6 in Kadipiro
virus and Liao ning virus. The function is unassigned.
Length = 505
Score = 29.5 bits (66), Expect = 4.7
Identities = 25/98 (25%), Positives = 41/98 (41%), Gaps = 17/98 (17%)
Query: 209 NFIEEHGNQ----VLPLGHPAYKRVGAVVKRLIDANKVYMEHNNFKYPITIIDDPLINAF 264
+ ++ HGN V PL R +++ L D+N + FK D+ L A+
Sbjct: 259 SRVKMHGNSRFGFVKPLDDAQKSRSTGLIRALDDSNSIR-----FKSIGVGRDEKLSYAY 313
Query: 265 VFPDGRIFMF-----TGMFQLCQTDDELATVLSHELSH 297
DG + + G LC T D + ++ H+L H
Sbjct: 314 ---DGCLLYYIKKIQNGNRGLCVTIDNINKLIEHKLLH 348
>gnl|CDD|179524 PRK03001, PRK03001, M48 family peptidase; Provisional.
Length = 283
Score = 29.2 bits (66), Expect = 4.9
Identities = 16/54 (29%), Positives = 30/54 (55%), Gaps = 11/54 (20%)
Query: 255 IIDDPLINAFVFPDGR------IFMFTGMFQLCQTDDELATVLSHELSHTLLKH 302
+I++ NAF GR + TG+ ++ ++ E+ V++HEL+H +KH
Sbjct: 89 LINEDQPNAFA--TGRNPEHAAVAATTGILRVL-SEREIRGVMAHELAH--VKH 137
>gnl|CDD|213809 TIGR03406, FeS_long_SufT, probable FeS assembly SUF system protein
SufT. The function is unknown for this protein family,
but members are found almost always in operons for the
the SUF system of iron-sulfur cluster biosynthesis. The
SUF system is present elsewhere on the chromosome for
those few species where SUF genes are not adjacent. This
family shares this property of association with the SUF
system with a related family, TIGR02945. TIGR02945
consists largely of a DUF59 domain (see pfamam family
pfam01883), while this protein is about double the
length, with a unique N-terminal domain and DUF59
C-terminal domain. A location immediately downstream of
the cysteine desulfurase gene sufS in many contexts
suggests the gene symbol sufT. Note that some other
homologs of this family and of TIGR02945, but no actual
members of this family, are found in operons associated
with phenylacetic acid (or other ring-hydroxylating)
degradation pathways [Biosynthesis of cofactors,
prosthetic groups, and carriers, Other].
Length = 174
Score = 28.5 bits (64), Expect = 5.1
Identities = 14/41 (34%), Positives = 26/41 (63%), Gaps = 4/41 (9%)
Query: 338 VLLDDLKSIIFELPFEREMETEADESIIFELPFEREMETEA 378
VL++D++ + +P E+E E ++F+ P+ REM +EA
Sbjct: 131 VLVEDVEDKVLAVPNVDEVEVE----LVFDPPWSREMMSEA 167
>gnl|CDD|188992 cd02699, M4, Peptidase M4 family includes thermolysin, pseudolysin,
aureolysin, neutral protease and bacillolysin. M4
Peptidase family belong to the Glu-zincins family and
includes numerous zinc-dependent metallopeptidases that
hydrolyze peptide bonds such as thermolysin (EC
3.4.24.27), pseudolysin (the extracellullar elastase of
Pseudomonas aeruginosa), aureolysin (the extracellular
metalloproteinase from Staphylococcus aureus), neutral
protease from Bacillus cereus as well as fungalysin
(from Aspergillus fumigates) and bacillolysin (EC
3.4.24.28). Typically, the M4 peptidases consist of a
presequence (signal sequence), a propeptide sequence and
a peptidase unit. The presequence is cleaved off during
export while the propeptide has inhibitory and chaperone
functions and facilitates folding. The propeptide
remains attached until the peptidase is secreted and can
be safely activated. All peptidases in this family bind
a single catalytic zinc ion which is tetrahedrally
co-ordinated by three amino acid ligands and a water
molecule that forms the nucleophile on activation during
catalysis. These peptidases are secreted eubacterial
endopeptidases from Gram-positive or Gram-negative
sources that degrade extracellular proteins and peptides
for bacterial nutrition. They contain HEXXH and EXXXD
motifs as part of their active site and are selectively
inhibited by Steptomyces metalloproteinase inhibitor
(SMPI) as well as by phosphoramidon from Streptomyces
tanashiensis. A large number of these enzymes are
implicated as key factors in the pathogenesis of various
diseases, including gastritis, peptic ulcer, gastric
carcinoma, cholera and several types of bacterial
infections, and are therefore important drug targets.
Some enzymes of the family can function at extremes of
temperatures, while some function in organic solvents,
thus rendering them novel targets for biotechnological
applications.
Length = 230
Score = 28.8 bits (65), Expect = 5.5
Identities = 15/49 (30%), Positives = 21/49 (42%), Gaps = 13/49 (26%)
Query: 262 NAF------VFPDGRIFMFTGMFQLCQTDDELATVLSHELSHTLLKHVA 304
NAF VF DG FT + D V++HEL+H + +
Sbjct: 81 NAFWDGSQMVFGDGDGSTFTP---FAVSLD----VVAHELTHGVTDSTS 122
>gnl|CDD|162041 TIGR00795, lctP, L-lactate transport. The Lactate Permease (LctP)
Family (TC 2.A.14) The only characterized member of this
family, from E. coli, appears to catalyze lactate:H+
uptake. Members of this family have 12 probable TMS
[Transport and binding proteins, Carbohydrates, organic
alcohols, and acids].
Length = 530
Score = 29.3 bits (66), Expect = 5.6
Identities = 12/51 (23%), Positives = 24/51 (47%), Gaps = 9/51 (17%)
Query: 110 ALPGFIYLIFKPILRVVGAIVIARIS---------RKWWSKLSPDQKKIIL 151
L GF+Y ++ ++ AI + ++S R +SPD++ +L
Sbjct: 60 FLYGFLYGLWPIAWIIIAAIFLYKLSVKSGAFEIIRSSILSISPDRRIQVL 110
>gnl|CDD|239796 cd04268, ZnMc_MMP_like, Zinc-dependent metalloprotease, MMP_like
subfamily. This group contains matrix metalloproteinases
(MMPs), serralysins, and the astacin_like family of
proteases.
Length = 165
Score = 28.2 bits (63), Expect = 6.3
Identities = 9/36 (25%), Positives = 17/36 (47%), Gaps = 3/36 (8%)
Query: 268 DGRIFMFTGMFQLCQTDDELATVLSHELSHTL-LKH 302
R+++++ + + L HEL H L L+H
Sbjct: 77 LARVYLYSSFVEY--SGARLRNTAEHELGHALGLRH 110
>gnl|CDD|233203 TIGR00946, 2a69, he Auxin Efflux Carrier (AEC) Family. [Transport
and binding proteins, Other].
Length = 321
Score = 28.5 bits (64), Expect = 8.2
Identities = 8/37 (21%), Positives = 17/37 (45%)
Query: 114 FIYLIFKPILRVVGAIVIARISRKWWSKLSPDQKKII 150
Y+I + +L ++ I++ I K + L + I
Sbjct: 2 ITYVILETVLPILVVILLGYILGKRFGILDEEHASGI 38
>gnl|CDD|219022 pfam06426, SATase_N, Serine acetyltransferase, N-terminal. The
N-terminal domain of serine acetyltransferase has a
sequence that is conserved in plants and bacteria.
Length = 105
Score = 27.0 bits (61), Expect = 8.4
Identities = 10/31 (32%), Positives = 16/31 (51%), Gaps = 1/31 (3%)
Query: 288 ATVLSHE-LSHTLLKHVAEKLSNKTFLEILY 317
AT+L+H+ L L +A KL++ T
Sbjct: 25 ATILNHDSLESALAYRLANKLASPTMSADTL 55
>gnl|CDD|224553 COG1638, DctP, TRAP-type C4-dicarboxylate transport system,
periplasmic component [Carbohydrate transport and
metabolism].
Length = 332
Score = 28.4 bits (64), Expect = 9.0
Identities = 6/23 (26%), Positives = 13/23 (56%)
Query: 134 ISRKWWSKLSPDQKKIILSQIKK 156
+S+ +W L + + I+L K+
Sbjct: 242 VSKAFWDSLPEEDQTILLEAAKE 264
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.323 0.139 0.418
Gapped
Lambda K H
0.267 0.0766 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 24,460,594
Number of extensions: 2425202
Number of successful extensions: 2330
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2313
Number of HSP's successfully gapped: 47
Length of query: 475
Length of database: 10,937,602
Length adjustment: 101
Effective length of query: 374
Effective length of database: 6,457,848
Effective search space: 2415235152
Effective search space used: 2415235152
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 61 (27.2 bits)