RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy16995
(165 letters)
>gnl|CDD|201077 pfam00201, UDPGT, UDP-glucoronosyl and UDP-glucosyl transferase.
Length = 500
Score = 81.7 bits (202), Expect = 1e-18
Identities = 42/122 (34%), Positives = 65/122 (53%), Gaps = 11/122 (9%)
Query: 48 VDMLRNISMTFLEHDISIGVPQALTPNMLFTGGMHIKHAKPLPPDLDKYM-SDAPHGVIF 106
+++ S L + + P+ L PNM F GG++ K AKPLP +++ ++ S HGV+
Sbjct: 221 PELMSKASAWLLRNYWDLEFPRPLLPNMEFIGGLNCKPAKPLPQEMEAFVQSSGEHGVVV 280
Query: 107 FSFGTNVRFANMPPYVLNAFVESFSKIKQKILWKTDVEVEVPPNVLVRN-----WFPQAD 161
FS G+ V N+P N + ++I QK+LW+ D P+ L RN W PQ D
Sbjct: 281 FSLGSMVS--NIPEEKANEIASALAQIPQKVLWRFD---GTKPSTLGRNTRLVKWLPQND 335
Query: 162 IL 163
+L
Sbjct: 336 LL 337
>gnl|CDD|223071 PHA03392, egt, ecdysteroid UDP-glucosyltransferase; Provisional.
Length = 507
Score = 79.2 bits (196), Expect = 8e-18
Identities = 33/96 (34%), Positives = 60/96 (62%), Gaps = 5/96 (5%)
Query: 73 PNMLFTGGMHI--KHAKPLPPDLDKYMSDAPHGVIFFSFGTNVRFANMPPYVLNAFVESF 130
P++ + GG+H+ K +PL L+++++++ +GV++ SFG+++ +M L + +F
Sbjct: 265 PSVQYLGGLHLHKKPPQPLDDYLEEFLNNSTNGVVYVSFGSSIDTNDMDNEFLQMLLRTF 324
Query: 131 SKIKQKILWKTDVEVE---VPPNVLVRNWFPQADIL 163
K+ +LWK D EVE +P NVL + WFPQ +L
Sbjct: 325 KKLPYNVLWKYDGEVEAINLPANVLTQKWFPQRAVL 360
>gnl|CDD|233407 TIGR01426, MGT, glycosyltransferase, MGT family. This model
describes the MGT (macroside glycosyltransferase)
subfamily of the UDP-glucuronosyltransferase family.
Members include a number of glucosyl transferases for
macrolide antibiotic inactivation, but also include
transferases of glucose-related sugars for macrolide
antibiotic production [Cellular processes, Toxin
production and resistance].
Length = 392
Score = 40.8 bits (96), Expect = 1e-04
Identities = 23/69 (33%), Positives = 32/69 (46%), Gaps = 9/69 (13%)
Query: 102 HGVIFFSFGTNVRFANMPPYVLNAFVESFSKIKQKIL----WKTDVEV--EVPPNVLVRN 155
V+ S GT F N P + VE+F + ++ D E+PPNV VR
Sbjct: 225 RPVVLISLGT--VFNNQPSF-YRTCVEAFRDLDWHVVLSVGRGVDPADLGELPPNVEVRQ 281
Query: 156 WFPQADILE 164
W PQ +IL+
Sbjct: 282 WVPQLEILK 290
>gnl|CDD|224732 COG1819, COG1819, Glycosyl transferases, related to
UDP-glucuronosyltransferase [Carbohydrate transport and
metabolism / Signal transduction mechanisms].
Length = 406
Score = 33.9 bits (78), Expect = 0.029
Identities = 19/78 (24%), Positives = 33/78 (42%), Gaps = 6/78 (7%)
Query: 90 PPDLDKYMSDAPHGVIFFSFGT---NVRFANMPPYVLNAFVESFSKIKQKILWKTDVEVE 146
+ Y A +++ S GT V + VL A + ++ + D V
Sbjct: 225 AANELPYWIPADRPIVYVSLGTVGNAVELLAI---VLEALADLDVRVIVSLGGARDTLVN 281
Query: 147 VPPNVLVRNWFPQADILE 164
VP NV+V ++ PQ ++L
Sbjct: 282 VPDNVIVADYVPQLELLP 299
>gnl|CDD|99960 cd03784, GT1_Gtf_like, This family includes the Gtfs, a group of
homologous glycosyltransferases involved in the final
stages of the biosynthesis of antibiotics vancomycin and
related chloroeremomycin. Gtfs transfer sugar moieties
from an activated NDP-sugar donor to the oxidatively
cross-linked heptapeptide core of vancomycin group
antibiotics. The core structure is important for the
bioactivity of the antibiotics.
Length = 401
Score = 32.7 bits (75), Expect = 0.063
Identities = 19/102 (18%), Positives = 36/102 (35%), Gaps = 6/102 (5%)
Query: 68 PQALTPNMLFTGGMHIKHAKPLPPDLDKYMSDAPHGVIFFSFGTNVRFANMPPYVLNAFV 127
P +++ G PP A ++ FG+ V P + V
Sbjct: 205 PDWPRFDLVTGYGFRDVPYNGPPPPELWLFLAAGRPPVYVGFGSMVVRD--PEALARLDV 262
Query: 128 ESFSKIKQKIL----WKTDVEVEVPPNVLVRNWFPQADILET 165
E+ + + Q+ + W ++P NV V ++ P +L
Sbjct: 263 EAVATLGQRAILSLGWGGLGAEDLPDNVRVVDFVPHDWLLPR 304
>gnl|CDD|233877 TIGR02466, TIGR02466, conserved hypothetical protein. This family
consists of uncharacterized proteins in Caulobacter
crescentus CB15, Bdellovibrio bacteriovorus HD100,
Synechococcus sp. WH 8102 (2), Silicibacter pomeroyi
DSS-3 (2), and Hyphomonas neptunium ATCC 15444. The
context of nearby genes differs substantially between
members and does point to any specific biological role
[Hypothetical proteins, Conserved].
Length = 201
Score = 29.8 bits (67), Expect = 0.45
Identities = 23/90 (25%), Positives = 33/90 (36%), Gaps = 8/90 (8%)
Query: 69 QALTPNMLFTGGMHIKHAKPLPPDLDKYMSDAPHGV--IFFSFGTNVRFANMPPYVLNA- 125
Q N+L GG H H P Y P I F P + NA
Sbjct: 96 QKAWVNILPQGGTHSPHLHPGSVISGTYYVQTPENCGAIKFEDPRLDDMMAAPMRIPNAK 155
Query: 126 -FVESFSKIKQK----ILWKTDVEVEVPPN 150
V+ F + + +L+++ + EVPPN
Sbjct: 156 RAVQRFVYVPPQEGRVLLFESWLRHEVPPN 185
>gnl|CDD|176918 cd08909, START_STARD13-like, C-terminal lipid-binding START domain
of mammalian STARD13 and related proteins, which also
have an N-terminal Rho GTPase-activating protein
(RhoGAP) domain. This subgroup includes the
steroidogenic acute regulatory protein (StAR)-related
lipid transfer (START) domains of STARD13 (also known as
DLC-2, Arhgap37, and SDCCAG13) and related proteins. It
belongs to the START domain family, and in turn to the
SRPBCC (START/RHO_alpha_C/PITP/Bet_v1/CoxG/CalC) domain
superfamily of proteins that bind hydrophobic ligands.
SRPBCC domains have a deep hydrophobic ligand-binding
pocket. Proteins belonging to this subfamily also have a
RhoGAP domain. The precise function of the START domain
in this subgroup is unclear.
Length = 205
Score = 28.7 bits (64), Expect = 1.3
Identities = 10/16 (62%), Positives = 13/16 (81%)
Query: 138 LWKTDVEVEVPPNVLV 153
LWK VEVE PP+V++
Sbjct: 53 LWKVSVEVEAPPSVVL 68
>gnl|CDD|222868 PHA02547, 55, RNA polymerase sigma factor; Provisional.
Length = 179
Score = 27.4 bits (61), Expect = 2.8
Identities = 13/47 (27%), Positives = 22/47 (46%), Gaps = 3/47 (6%)
Query: 18 FLTNLFYYPKQVALMDKYFKYPGYQSRPPMVDMLRNISMTFLEHDIS 64
F+ + K++A+ KYF + Y DM+ TF++ DI
Sbjct: 105 FVQRIKKEKKEMAIKYKYFLHNVYDEV--DDDMVAIADETFIQ-DIY 148
>gnl|CDD|233159 TIGR00867, deg-1, degenerin. The Epithelial Na+ Channel (ENaC)
Family (TC 1.A.06)The ENaC family consists of sodium
channels from animals and has no recognizable
homologues in other eukaryotes or bacteria. The
vertebrate ENaC proteins from epithelial cells cluster
tightly together on the phylogenetic tree:
voltage-insensitive ENaC homologues are also found in
the brain. Eleven sequenced C. elegans proteins,
including the degenerins, are distantly related to the
vertebrate proteins as well as to each other. At least
some ofthese proteins form part of a
mechano-transducing complex for touch sensitivity.
Other members of the ENaC family, the acid-sensing ion
channels, ASIC1-3,are homo- or hetero-oligomeric
neuronal H+-gated channels that mediate pain sensation
in response to tissue acidosis. The homologous Helix
aspersa(FMRF-amide)-activated Na+ channel is the first
peptide neurotransmitter-gated ionotropic receptor to
be sequenced.Mammalian ENaC is important for the
maintenance of Na+ balance and the regulation of blood
pressure. Three homologous ENaC subunits, a, b and g,
havebeen shown to assemble to form the highly
Na+-selective channel.This model is designed from the
invertebrate members of the ENaC family [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 600
Score = 27.9 bits (62), Expect = 3.3
Identities = 12/38 (31%), Positives = 18/38 (47%), Gaps = 4/38 (10%)
Query: 2 NFWGRLDSLWFAVTDLFLTNLFYYPKQVALMDKYFKYP 39
N + R W A+ L +F Y + L+ KYF+Y
Sbjct: 19 NSFSRA--FWVALF-LICLLMFAY-QAYILISKYFQYE 52
>gnl|CDD|180419 PRK06134, PRK06134, putative FAD-binding dehydrogenase; Reviewed.
Length = 581
Score = 27.8 bits (62), Expect = 3.6
Identities = 10/27 (37%), Positives = 15/27 (55%)
Query: 93 LDKYMSDAPHGVIFFSFGTNVRFANMP 119
+D ++ PH V FF T +RFA+
Sbjct: 95 IDAFLEAGPHMVAFFERHTALRFADGN 121
>gnl|CDD|236446 PRK09279, PRK09279, pyruvate phosphate dikinase; Provisional.
Length = 879
Score = 27.0 bits (61), Expect = 6.3
Identities = 7/7 (100%), Positives = 7/7 (100%)
Query: 106 FFSFGTN 112
FFSFGTN
Sbjct: 763 FFSFGTN 769
>gnl|CDD|176916 cd08907, START_STARD8-like, C-terminal lipid-binding START domain
of mammalian STARD8 and related proteins, which also
have an N-terminal Rho GTPase-activating protein
(RhoGAP) domain. This subgroup includes the
steroidogenic acute regulatory protein (StAR)-related
lipid transfer (START) domains of STARD8 (also known as
deleted in liver cancer 3/DLC3, and Arhgap38) and
related proteins. It belongs to the START domain family,
and in turn to the SRPBCC
(START/RHO_alpha_C/PITP/Bet_v1/CoxG/CalC) domain
superfamily of proteins that bind hydrophobic ligands.
SRPBCC domains have a deep hydrophobic ligand-binding
pocket. Proteins belonging to this subfamily also have a
RhoGAP domain. The precise function of the START domain
in this subgroup is unclear.
Length = 205
Score = 26.4 bits (58), Expect = 7.6
Identities = 9/17 (52%), Positives = 13/17 (76%)
Query: 138 LWKTDVEVEVPPNVLVR 154
LWK EVE PP+V+++
Sbjct: 53 LWKVSTEVEAPPSVVLQ 69
>gnl|CDD|239161 cd02760, MopB_Phenylacetyl-CoA-OR, The MopB_Phenylacetyl-CoA-OR CD
contains the phenylacetyl-CoA:acceptor oxidoreductase,
large subunit (PadB2), and other related proteins. The
phenylacetyl-CoA:acceptor oxidoreductase has been
characterized as a membrane-bound molybdenum-iron-sulfur
enzyme involved in anaerobic metabolism of phenylalanine
in the denitrifying bacterium Thauera aromatica. Members
of this CD belong to the molybdopterin_binding (MopB)
superfamily of proteins.
Length = 760
Score = 26.5 bits (58), Expect = 8.6
Identities = 28/127 (22%), Positives = 42/127 (33%), Gaps = 18/127 (14%)
Query: 25 YPKQVALMDKYFKYPGYQSRPPMVDMLRNISMTFLEHDISIGVPQALTPNMLFTGGMHIK 84
P + M + F P + + RN T + + QAL P L M ++
Sbjct: 444 KPGEDGFMAQGF-NPTDKEHWVVKPTGRNAHRTLVPIVGNSAWSQALGPTQLAW--MFLR 500
Query: 85 HAK-------PLPPDLDKYMSDAPHGVIFFSFGTNV-RFANMPPYVLNAFVESFSKIKQK 136
P PD+ P + F+ T V A P F SF+ + +
Sbjct: 501 EVPLDWKFELPTLPDVWFNYRTNP-AISFWDTATLVDNIAKFP------FTVSFAYTEDE 553
Query: 137 ILWKTDV 143
W DV
Sbjct: 554 TNWMADV 560
>gnl|CDD|220131 pfam09172, DUF1943, Domain of unknown function (DUF1943). Members
of this family adopt a structure consisting of several
large open beta-sheets. Their exact function has not, as
yet, been determined.
Length = 288
Score = 26.1 bits (58), Expect = 8.6
Identities = 15/75 (20%), Positives = 27/75 (36%), Gaps = 2/75 (2%)
Query: 86 AKPLPPDLDKYMSDAPHGVIFFS-FGTNVRFANMPPYVLNAFVESFSKIKQKILWKTDVE 144
K L + S P ++ FG + FA++ + + F + +K+L +
Sbjct: 99 LKILLSEWKSLPSSKPLASVYLKLFGQELAFASIDKETIEQLIRLFKEALKKLLSGVSFQ 158
Query: 145 VEVPPNVL-VRNWFP 158
P L R P
Sbjct: 159 YTKPLLFLEARRILP 173
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.325 0.141 0.453
Gapped
Lambda K H
0.267 0.0663 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 8,909,786
Number of extensions: 821799
Number of successful extensions: 838
Number of sequences better than 10.0: 1
Number of HSP's gapped: 831
Number of HSP's successfully gapped: 21
Length of query: 165
Length of database: 10,937,602
Length adjustment: 90
Effective length of query: 75
Effective length of database: 6,945,742
Effective search space: 520930650
Effective search space used: 520930650
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 55 (25.1 bits)