RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy17341
(232 letters)
>gnl|CDD|149835 pfam08897, DUF1841, Domain of unknown function (DUF1841). This
family of proteins are functionally uncharacterized.
Length = 137
Score = 30.7 bits (70), Expect = 0.26
Identities = 13/48 (27%), Positives = 17/48 (35%), Gaps = 13/48 (27%)
Query: 153 RQFFDEKWSKNRN--------VTYMDWSTQHPELLLASYHRNKEAPNE 192
R+FF + W K + D +HPE YH E P
Sbjct: 7 RRFFCDAWRKYQAGEPLTPLETIAADVILEHPE-----YHALLEDPEA 49
>gnl|CDD|217351 pfam03070, TENA_THI-4, TENA/THI-4/PQQC family. Members of this
family are found in all the three major phyla of life:
archaebacteria, eubacteria, and eukaryotes. In Bacillus
subtilis, TENA is one of a number of proteins that
enhance the expression of extracellular enzymes, such as
alkaline protease, neutral protease and levansucrase.
The THI-4 protein, which is involved in thiamine
biosynthesis, is also a member of this family. The
C-terminal part of these proteins consistently show
significant sequence similarity to TENA proteins. This
similarity was first noted with the Neurospora crassa
THI-4. This family includes bacterial coenzyme PQQ
synthesis protein C or PQQC proteins. Pyrroloquinoline
quinone (PQQ) is the prosthetic group of several
bacterial enzymes,including methanol dehydrogenase of
methylotrophs and the glucose dehydrogenase of a number
of bacteria. PQQC has been found to be required in the
synthesis of PQQ but its function is unclear. The exact
molecular function of members of this family is
uncertain.
Length = 210
Score = 30.0 bits (68), Expect = 0.70
Identities = 12/35 (34%), Positives = 17/35 (48%)
Query: 66 EKLKITLSEEFQHFVARTGRVMERLLAEKSDMYKD 100
E + SEEF+ V R++ LLA SD +
Sbjct: 153 EWIDDYGSEEFRSAVEEAERLLNHLLAYVSDSELE 187
>gnl|CDD|235218 PRK04081, PRK04081, hypothetical protein; Provisional.
Length = 207
Score = 29.7 bits (67), Expect = 1.1
Identities = 21/86 (24%), Positives = 37/86 (43%), Gaps = 12/86 (13%)
Query: 12 SMLSSLGVAPVADVMSSLSSVSANGTPESNNTPDTSILNSYHSAAYLVVELSEDEKLKIT 71
+++ LG VM L +N + N ++S+ S A++++E D KI
Sbjct: 11 AIVGGLGAL----VMVGLVGCGSNS--DQNQKENSSLQASQQQGAFVIIEEQADGSYKIL 64
Query: 72 LSEEF----QHFVARTGRVMERLLAE 93
EE+ V R ER+L++
Sbjct: 65 --EEYPSSETRVVLRDLDGTERVLSQ 88
>gnl|CDD|227918 COG5631, COG5631, Predicted transcription regulator, contains HTH
domain (MarR family) [Transcription].
Length = 199
Score = 29.6 bits (66), Expect = 1.1
Identities = 19/71 (26%), Positives = 29/71 (40%), Gaps = 1/71 (1%)
Query: 12 SMLSSLGVAPVADVMSSLSSVSANGTPESNNTPDTSILNSYHSAAYLVVELSEDEKLKIT 71
S ++S G V + N +S TP I++S H A L ++SE E +
Sbjct: 2 SKINSCGRNHVDQREAMAGDRQMNERRKSLETP-GEIVSSAHLARTLSPDVSEVEYALMR 60
Query: 72 LSEEFQHFVAR 82
E F + R
Sbjct: 61 AYEAFGRWQVR 71
>gnl|CDD|240725 cd12279, RRM_TUT1, RNA recognition motif in speckle targeted
PIP5K1A-regulated poly(A) polymerase (Star-PAP) and
similar proteins. This subfamily corresponds to the
RRM of Star-PAP, also termed RNA-binding motif protein
21 (RBM21), which is a ubiquitously expressed U6
snRNA-specific terminal uridylyltransferase (U6-TUTase)
essential for cell proliferation. Although it belongs
to the well-characterized poly(A) polymerase protein
superfamily, Star-PAP is highly divergent from both,
the poly(A) polymerase (PAP) and the terminal uridylyl
transferase (TUTase), identified within the editing
complexes of trypanosomes. Star-PAP predominantly
localizes at nuclear speckles and catalyzes
RNA-modifying nucleotidyl transferase reactions. It
functions in mRNA biosynthesis and may be regulated by
phosphoinositides. It binds to glutathione
S-transferase (GST)-PIPKIalpha. Star-PAP preferentially
uses ATP as a nucleotide substrate and possesses PAP
activity that is stimulated by PtdIns4,5P2. It contains
an N-terminal C2H2-type zinc finger motif followed by
an RNA recognition motif (RRM), also termed RBD (RNA
binding domain) or RNP (ribonucleoprotein domain), a
split PAP domain linked by a proline-rich region, a PAP
catalytic and core domain, a PAP-associated domain, an
RS repeat, and a nuclear localization signal (NLS). .
Length = 74
Score = 27.4 bits (61), Expect = 2.0
Identities = 10/35 (28%), Positives = 12/35 (34%)
Query: 55 AAYLVVELSEDEKLKITLSEEFQHFVARTGRVMER 89
Y +VE E + LSE RV R
Sbjct: 39 GVYAIVEFDSKEGVDKVLSEPQHTLNGHRLRVRPR 73
>gnl|CDD|178195 PLN02583, PLN02583, cinnamoyl-CoA reductase.
Length = 297
Score = 28.5 bits (64), Expect = 3.0
Identities = 10/26 (38%), Positives = 15/26 (57%)
Query: 130 WSKNQQMAPKDGDEKSSTKLSFCRQF 155
W + KD DE+S + +FCR+F
Sbjct: 134 WRDDNISTQKDVDERSWSDQNFCRKF 159
>gnl|CDD|182134 PRK09897, PRK09897, hypothetical protein; Provisional.
Length = 534
Score = 28.6 bits (64), Expect = 3.4
Identities = 7/25 (28%), Positives = 11/25 (44%)
Query: 165 NVTYMDWSTQHPELLLASYHRNKEA 189
TY++W + + L Y KE
Sbjct: 66 YCTYLEWLQKQEDSHLQRYGVKKET 90
>gnl|CDD|184966 PRK15004, PRK15004, alpha-ribazole phosphatase; Provisional.
Length = 199
Score = 27.7 bits (62), Expect = 4.1
Identities = 9/22 (40%), Positives = 11/22 (50%)
Query: 74 EEFQHFVARTGRVMERLLAEKS 95
E FQ F R R + RL A +
Sbjct: 119 EGFQAFSQRVERFIARLSAFQH 140
>gnl|CDD|220458 pfam09890, DUF2117, Uncharacterized protein conserved in archaea
(DUF2117). This domain, found in various hypothetical
archaeal proteins, has no known function.
Length = 209
Score = 27.6 bits (62), Expect = 4.2
Identities = 9/19 (47%), Positives = 10/19 (52%)
Query: 188 EAPNEPDGVCLIWNTKFKK 206
E P E DG + WN K K
Sbjct: 37 ERPGESDGTIIPWNRKAKT 55
>gnl|CDD|239275 cd02977, ArsC_family, Arsenate Reductase (ArsC) family; composed of
TRX-fold arsenic reductases and similar proteins
including the transcriptional regulator, Spx. ArsC
catalyzes the reduction of arsenate [As(V)] to arsenite
[As(III)], using reducing equivalents derived from
glutathione (GSH) via glutaredoxin (GRX), through a
single catalytic cysteine. This family of predominantly
bacterial enzymes is unrelated to two other families of
arsenate reductases which show similarity to
low-molecular-weight acid phosphatases and
phosphotyrosyl phosphatases. Spx is a general regulator
that exerts negative and positive control over
transcription initiation by binding to the C-terminal
domain of the alpha subunit of RNA polymerase.
Length = 105
Score = 26.7 bits (60), Expect = 4.6
Identities = 16/63 (25%), Positives = 30/63 (47%), Gaps = 7/63 (11%)
Query: 53 HSAAYLVVELSEDEKLKITLS-EEFQHFVARTGRVMERLLAEKSDMYKDYMGLGDADEIG 111
H Y ++ ++ + EE + +A+ G +E L + Y+ +GL D DE+
Sbjct: 22 HGIEYEFIDYLKE-----PPTKEELKELLAKLGLGVEDLFNTRGTPYRK-LGLADKDELS 75
Query: 112 DEK 114
DE+
Sbjct: 76 DEE 78
>gnl|CDD|200071 TIGR01041, ATP_syn_B_arch, ATP synthase archaeal, B subunit.
Archaeal ATP synthase shares extensive sequence
similarity with eukaryotic and prokaryotic V-type
(H+)-ATPases [Energy metabolism, ATP-proton motive force
interconversion].
Length = 458
Score = 27.8 bits (62), Expect = 5.3
Identities = 25/101 (24%), Positives = 41/101 (40%), Gaps = 13/101 (12%)
Query: 10 LDSMLSSLGVAPVADVMSSLSSVSANGTPESNNTPDTSILNSYHSAAY-----------L 58
L L G+ P +V+ SLS + +G E D ++ AAY +
Sbjct: 326 LSRELHRKGIYPPINVLPSLSRLMKDGIGEGKTREDHKDVSDQLYAAYAEGRDLRGLVAI 385
Query: 59 VVE--LSEDEKLKITLSEEFQHFVARTGRVMERLLAEKSDM 97
V E LSE ++ + ++ F+ R G R + E D+
Sbjct: 386 VGEEALSERDRKYLKFADLFERKFVRQGFNENRSIEETLDI 426
>gnl|CDD|227061 COG4717, COG4717, Uncharacterized conserved protein [Function
unknown].
Length = 984
Score = 27.5 bits (61), Expect = 7.2
Identities = 14/43 (32%), Positives = 22/43 (51%), Gaps = 2/43 (4%)
Query: 58 LVVELSEDEKLKITLSEEFQHF-VARTGRVMERLLAEKSDMYK 99
L+ E+ E L + F HF R R++E LLA+ S+ +
Sbjct: 918 LIHEVRTREPLPFIADDIFVHFDDERAKRMLE-LLADLSEGNQ 959
>gnl|CDD|217443 pfam03234, CDC37_N, Cdc37 N terminal kinase binding. Cdc37 is a
molecular chaperone required for the activity of
numerous eukaryotic protein kinases. This domain
corresponds to the N terminal domain which binds
predominantly to protein kinases and is found N terminal
to the Hsp (Heat shocked protein) 90-binding domain
pfam08565. Expression of a construct consisting of only
the N-terminal domain of Saccharomyces pombe Cdc37
results in cellular viability. This indicates that
interactions with the cochaperone Hsp90 may not be
essential for Cdc37 function.
Length = 172
Score = 26.7 bits (59), Expect = 7.4
Identities = 21/92 (22%), Positives = 39/92 (42%)
Query: 8 KELDSMLSSLGVAPVADVMSSLSSVSANGTPESNNTPDTSILNSYHSAAYLVVELSEDEK 67
+ +D +LS L + + + S++ N T + N P+ N + V+ DEK
Sbjct: 63 ERVDKLLSELKEESLDSSQAVMKSLNENFTDKENVEPEQPTYNEMVEDLFDQVKDEVDEK 122
Query: 68 LKITLSEEFQHFVARTGRVMERLLAEKSDMYK 99
L EE Q + + + LL + ++ K
Sbjct: 123 NGAALIEELQKHRDKLKKEQKELLKKLDELEK 154
>gnl|CDD|188875 cd09769, Luminal_IRE1, The Luminal domain, a dimerization domain,
of the Serine/Threonine protein kinase,
Inositol-requiring protein 1. The Luminal domain is a
dimerization domain present in Inositol-requiring
protein 1 (IRE1), a serine/threonine protein kinase
(STK) and a type I transmembrane protein that is
localized in the endoplasmic reticulum (ER). IRE1, also
called Endoplasmic reticulum (ER)-to-nucleus signaling
protein (or ERN), is a kinase receptor that also
contains an endoribonuclease domain in the cytoplasmic
side. It plays roles in the signaling of the unfolded
protein response (UPR), which is activated when protein
misfolding is detected in the ER in order to decrease
the synthesis of new proteins and increase the capacity
of the ER to cope with the stress. IRE1 acts as an ER
stress sensor and is the oldest and most conserved
component of the UPR in eukaryotes. During ER stress,
IRE1 dimerizes through its luminal domain and forms
oligomers, allowing the kinase domain to undergo
trans-autophosphorylation. This leads to a
conformational change that stimulates its
endoribonuclease activity and results in the cleavage of
its mRNA substrate, HAC1 in yeast and Xbp1 in metazoans,
promoting a splicing event that enables translation into
a transcription factor which activates the UPR. Mammals
contain two IRE1 proteins, IRE1alpha (or ERN1) and
IRE1beta (or ERN2). IRE1alpha is expressed in all cells
and tissues while IRE1beta is found only in intestinal
epithelial cells.
Length = 295
Score = 26.9 bits (60), Expect = 9.1
Identities = 7/22 (31%), Positives = 11/22 (50%)
Query: 165 NVTYMDWSTQHPELLLASYHRN 186
NVTY D++ + L S +
Sbjct: 166 NVTYSDYTPNSNDRDLQSQYSK 187
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.316 0.130 0.387
Gapped
Lambda K H
0.267 0.0792 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 11,419,651
Number of extensions: 1014055
Number of successful extensions: 825
Number of sequences better than 10.0: 1
Number of HSP's gapped: 824
Number of HSP's successfully gapped: 18
Length of query: 232
Length of database: 10,937,602
Length adjustment: 94
Effective length of query: 138
Effective length of database: 6,768,326
Effective search space: 934028988
Effective search space used: 934028988
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 57 (25.6 bits)