RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy6422
(182 letters)
>gnl|CDD|221573 pfam12430, ABA_GPCR, Abscisic acid G-protein coupled receptor.
This domain family is found in eukaryotes, and is
typically between 177 and 216 amino acids in length.
This family is part of the abscisic acid (ABA) G-protein
coupled receptor. ABA is a stress hormone in plants.
Length = 99
Score = 94.6 bits (236), Expect = 9e-26
Identities = 50/91 (54%), Positives = 67/91 (73%), Gaps = 3/91 (3%)
Query: 22 LTILQTFFYAISSSKSS--NIIVLVLAQIMGMYFVSSVLLMRMNMPPEYRSIISKVLG-D 78
LT L +FF A+ SS N++VL LA+IMG+YF+S+VLL+R N+P EYRSII+ +LG D
Sbjct: 8 LTTLSSFFRALPGLLSSSSNLLVLFLAEIMGIYFISTVLLLRSNLPAEYRSIITDILGSD 67
Query: 79 LQFNFYHRWFDVIFLLSALSSIVFLPFSYVH 109
L+F FY RWFD IFLLSAL + + L ++
Sbjct: 68 LEFRFYDRWFDKIFLLSALLTAIGLYLAHKS 98
Score = 38.3 bits (90), Expect = 3e-04
Identities = 20/43 (46%), Positives = 31/43 (72%), Gaps = 2/43 (4%)
Query: 135 LLSGFGAFFYAISSSKSS--NIIVLVLAQIMGMYFVSSVDILR 175
+L+ +FF A+ SS N++VL LA+IMG+YF+S+V +LR
Sbjct: 7 VLTTLSSFFRALPGLLSSSSNLLVLFLAEIMGIYFISTVLLLR 49
>gnl|CDD|221626 pfam12537, DUF3735, Protein of unknown function (DUF3735). This
domain family is found in eukaryotes, and is
approximately 70 amino acids in length. There is a
conserved LSG sequence motif. There is a single
completely conserved residue G that may be functionally
important.
Length = 71
Score = 44.9 bits (107), Expect = 5e-07
Identities = 16/25 (64%), Positives = 21/25 (84%)
Query: 117 TIEQGISRIGVIGVTVMALLSGFGA 141
+E +SR+ VIGVT+MA+LSGFGA
Sbjct: 5 YLENLLSRVSVIGVTIMAILSGFGA 29
>gnl|CDD|224819 COG1907, COG1907, Predicted archaeal sugar kinases [General
function prediction only].
Length = 312
Score = 30.8 bits (70), Expect = 0.34
Identities = 9/36 (25%), Positives = 14/36 (38%)
Query: 131 TVMALLSGFGAFFYAISSSKSSNIIVLVLAQIMGMY 166
A S +G Y I S+ + +V L I+
Sbjct: 257 AYGAGQSSWGPTVYGIVDSREAGSVVRKLIDILLEE 292
>gnl|CDD|200431 TIGR04180, EDH_00030, NAD dependent epimerase/dehydratase,
LLPSF_EDH_00030 family. This clade within the NAD
dependent epimerase/dehydratase superfamily (pfam01370)
is characterized by inclusion of its members within a
cassette of seven distinctive enzymes. These include
four genes homologous to the elements of the neuraminic
(sialic) acid biosynthesis cluster (NeuABCD), an
aminotransferase and a nucleotidyltransferase in
addition to the epimerase/dehydratase. Together it is
very likely that these enzymes direct the biosynthesis
of a nine-carbon sugar analagous to CMP-neuraminic acid.
These seven genes form the core of the cassette,
although they are often accompanied by additional genes
that may further modify the product sugar. Although this
cassette is widely distributed in bacteria, the family
nomenclature arises from the instance in Leptospira
interrogans serovar Lai, str. 56601, where it appears as
the 30th gene in the 91-gene lipopolysaccharide
biosynthesis cluster.
Length = 297
Score = 30.0 bits (68), Expect = 0.63
Identities = 23/76 (30%), Positives = 36/76 (47%), Gaps = 18/76 (23%)
Query: 65 PPEYRSIISKVLGDLQFNFYHR----WFDVIFLLSALSSIVFLPFSYVHP------NI-- 112
PPE + I V GD++ R DV+F L+AL +I P+SY+ P N+
Sbjct: 44 PPEVKDKIEVVTGDIRDPDSVRKAMKGCDVVFHLAALIAI---PYSYIAPDSYVDTNVTG 100
Query: 113 ---FLQWTIEQGISRI 125
LQ + G+ ++
Sbjct: 101 TLNVLQAARDLGVEKV 116
>gnl|CDD|188999 cd06461, M2_ACE, Peptidase family M2 Angiotensin converting enzyme
(ACE). Peptidase family M2 Angiotensin converting
enzyme (ACE, EC 3.4.15.1) is a membrane-bound, zinc
dependent dipeptidase that catalyzes the conversion of
the decapeptide angiotensin I to the potent vasopressor
octapeptide angiotensin II, by removing two C-terminal
amino acids. There are two forms of the enzyme in
humans, the ubiquitous somatic ACE and the
sperm-specific germinal ACE, both encoded by the same
gene through transcription from alternative promoters.
Somatic ACE has two tandem active sites with distinct
catalytic properties, whereas germinal ACE, the function
of which is largely unknown, has just a single active
site. Recently, an ACE homolog, ACE2, has been
identified in humans that differs from ACE; it
preferentially removes carboxy-terminal hydrophobic or
basic amino acids and appears to be important in cardiac
function. ACE homologs (also known as members of the M2
gluzincin family) have been found in a wide variety of
species, including those that neither have a
cardiovascular system nor synthesize angiotensin. ACE is
well-known as a key part of the renin-angiotensin system
that regulates blood pressure and ACE inhibitors are
important for the treatment of hypertension.
Length = 562
Score = 29.8 bits (68), Expect = 0.70
Identities = 10/17 (58%), Positives = 10/17 (58%), Gaps = 1/17 (5%)
Query: 92 FLLS-ALSSIVFLPFSY 107
FLL AL I FLPF
Sbjct: 401 FLLKMALDKIAFLPFGL 417
>gnl|CDD|215521 PLN02967, PLN02967, kinase.
Length = 581
Score = 28.5 bits (63), Expect = 2.5
Identities = 18/52 (34%), Positives = 23/52 (44%), Gaps = 1/52 (1%)
Query: 31 AISSSKSSNIIVLVLAQIMGMYFVSSVLLMRMNMPPEYRSI-ISKVLGDLQF 81
A S S I + VL + YF + LL R+I ISK LG + F
Sbjct: 319 AEDSLSKSEINIDVLKEAKMFYFNTHSLLDPTMRSTTLRAIKISKKLGGVIF 370
>gnl|CDD|201774 pfam01401, Peptidase_M2, Angiotensin-converting enzyme. Members of
this family are dipeptidyl carboxydipeptidases (cleave
carboxyl dipeptides) and most notably convert
angiotensin I to angiotensin II. Many members of this
family contain a tandem duplication of the 600 amino
acid peptidase domain, both of these are catalytically
active. Most members are secreted membrane bound
ectoenzymes.
Length = 595
Score = 27.6 bits (61), Expect = 5.2
Identities = 18/58 (31%), Positives = 28/58 (48%), Gaps = 10/58 (17%)
Query: 54 VSSVLLMRMNMPPEYRSI--ISKVLGDLQFNFYHRWFDVIFLLS-ALSSIVFLPFSYV 108
+ VL + ++ P SI +S D + D+ FL+ AL I F+PFSY+
Sbjct: 385 IGDVLALSVSTPKHLHSINLLSSEGNDYES-------DINFLMKMALDKIAFIPFSYL 435
>gnl|CDD|204929 pfam12460, MMS19_N, NER and RNAPII transcription protein n
terminal. This domain family is found in eukaryotes,
and is approximately 60 amino acids in length. MMS19 is
required for both nucleotide excision repair (NER) and
RNA polymerase II (RNAP II) transcription.
Length = 64
Score = 25.2 bits (56), Expect = 5.5
Identities = 11/35 (31%), Positives = 17/35 (48%), Gaps = 2/35 (5%)
Query: 148 SSKSSNIIVLVLAQIMGMYFVSSVDILREAAVQRL 182
+ ++ LV + +G Y S I+R AVQ L
Sbjct: 25 ENGKLTLLDLV--ESLGEYLTSEDPIIRAKAVQLL 57
>gnl|CDD|220381 pfam09753, Use1, Membrane fusion protein Use1. This entry is of a
family of proteins all approximately 300 residues in
length. The proteins have a single C-terminal
trans-membrane domain and a SNARE [soluble NSF
(N-ethylmaleimide-sensitive fusion protein) attachment
protein receptor] domain of approximately 60 residues.
The SNARE domains are essential for membrane fusion and
are conserved from yeasts to humans. Use1 is one of the
three protein subunits that make up the SNARE complex
and it is specifically required for Golgi-endoplasmic
reticulum retrograde transport.
Length = 251
Score = 26.7 bits (59), Expect = 7.6
Identities = 7/30 (23%), Positives = 18/30 (60%)
Query: 33 SSSKSSNIIVLVLAQIMGMYFVSSVLLMRM 62
+SK+S + ++ ++ F+S VL +++
Sbjct: 221 HASKASKCFLWLMIFLVMCIFISMVLFIKI 250
>gnl|CDD|187543 cd05232, UDP_G4E_4_SDR_e, UDP-glucose 4 epimerase, subgroup 4,
extended (e) SDRs. UDP-glucose 4 epimerase (aka
UDP-galactose-4-epimerase), is a homodimeric extended
SDR. It catalyzes the NAD-dependent conversion of
UDP-galactose to UDP-glucose, the final step in Leloir
galactose synthesis. This subgroup is comprised of
bacterial proteins, and includes the Staphylococcus
aureus capsular polysaccharide Cap5N, which may have a
role in the synthesis of UDP-N-acetyl-d-fucosamine. This
subgroup has the characteristic active site tetrad and
NAD-binding motif of the extended SDRs. Extended SDRs
are distinct from classical SDRs. In addition to the
Rossmann fold (alpha/beta folding pattern with a central
beta-sheet) core region typical of all SDRs, extended
SDRs have a less conserved C-terminal extension of
approximately 100 amino acids. Extended SDRs are a
diverse collection of proteins, and include isomerases,
epimerases, oxidoreductases, and lyases; they typically
have a TGXXGXXG cofactor binding motif. SDRs are a
functionally diverse family of oxidoreductases that have
a single domain with a structurally conserved Rossmann
fold, an NAD(P)(H)-binding region, and a structurally
diverse C-terminal region. Sequence identity between
different SDR enzymes is typically in the 15-30% range;
they catalyze a wide range of activities including the
metabolism of steroids, cofactors, carbohydrates,
lipids, aromatic compounds, and amino acids, and act in
redox sensing. Classical SDRs have an TGXXX[AG]XG
cofactor binding motif and a YXXXK active site motif,
with the Tyr residue of the active site motif serving as
a critical catalytic residue (Tyr-151, human
15-hydroxyprostaglandin dehydrogenase numbering). In
addition to the Tyr and Lys, there is often an upstream
Ser and/or an Asn, contributing to the active site;
while substrate binding is in the C-terminal region,
which determines specificity. The standard reaction
mechanism is a 4-pro-S hydride transfer and proton relay
involving the conserved Tyr and Lys, a water molecule
stabilized by Asn, and nicotinamide. Atypical SDRs
generally lack the catalytic residues characteristic of
the SDRs, and their glycine-rich NAD(P)-binding motif is
often different from the forms normally seen in
classical or extended SDRs. Complex (multidomain) SDRs
such as ketoreductase domains of fatty acid synthase
have a GGXGXXG NAD(P)-binding motif and an altered
active site motif (YXXXN). Fungal type ketoacyl
reductases have a TGXXXGX(1-2)G NAD(P)-binding motif.
Length = 303
Score = 26.5 bits (59), Expect = 7.7
Identities = 16/77 (20%), Positives = 32/77 (41%), Gaps = 6/77 (7%)
Query: 10 DLIALFLLELEPLTILQTFFYAISSSKSSNIIVLVLAQIMG----MYFVSSVLLMRMNMP 65
D I L + + TF + S+ +V + + +G + V + LL
Sbjct: 206 DAIYLCISLPKAAN--GTFLVSDGPPVSTAELVDEIRRALGKPTRLLPVPAGLLRFAAKL 263
Query: 66 PEYRSIISKVLGDLQFN 82
R++I ++ G LQ++
Sbjct: 264 LGKRAVIQRLFGSLQYD 280
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.331 0.142 0.409
Gapped
Lambda K H
0.267 0.0756 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 9,499,808
Number of extensions: 918453
Number of successful extensions: 1854
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1819
Number of HSP's successfully gapped: 208
Length of query: 182
Length of database: 10,937,602
Length adjustment: 91
Effective length of query: 91
Effective length of database: 6,901,388
Effective search space: 628026308
Effective search space used: 628026308
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.9 bits)
S2: 56 (25.3 bits)