RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy7380
(72 letters)
>gnl|CDD|238106 cd00182, TBOX, T-box DNA binding domain of the T-box family of
transcriptional regulators. The T-box family is an
ancient group that appears to play a critical role in
development in all animal species. These genes were
uncovered on the basis of similarity to the DNA binding
domain of murine Brachyury (T) gene product, the
defining feature of the family. Common features shared
by T-box family members are DNA-binding and
transcriptional regulatory activity, a role in
development and conserved expression patterns, most of
the known genes in all species being expressed in
mesoderm or mesoderm precursors.
Length = 188
Score = 50.3 bits (121), Expect = 2e-09
Identities = 18/26 (69%), Positives = 22/26 (84%), Gaps = 1/26 (3%)
Query: 10 AYYKE-ITKLKIDSNPFAKGFRDSSR 34
AY + IT+LKID+NPFAKGFRD+ R
Sbjct: 162 AYQNQEITQLKIDNNPFAKGFRDNGR 187
>gnl|CDD|216184 pfam00907, T-box, T-box. The T-box encodes a 180 amino acid domain
that binds to DNA. Genes encoding T-box proteins are
found in a wide range of animals, but not in other
kingdoms such as plants. Family members are all thought
to bind to the DNA consensus sequence TCACACCT. they are
found exclusively in the nucleus, and perform
DNA-binding and transcriptional activation/repression
roles. They are generally required for development of
the specific tissues they are expressed in, and
mutations in T-box genes are implicated in human
conditions such as DiGeorge syndrome and X-linked cleft
palate, which feature malformations.
Length = 182
Score = 47.6 bits (114), Expect = 2e-08
Identities = 19/24 (79%), Positives = 21/24 (87%), Gaps = 1/24 (4%)
Query: 10 AYYK-EITKLKIDSNPFAKGFRDS 32
AY EITKLKID+NPFAKGFRD+
Sbjct: 159 AYQNEEITKLKIDNNPFAKGFRDA 182
>gnl|CDD|214656 smart00425, TBOX, Domain first found in the mice T locus
(Brachyury) protein.
Length = 190
Score = 43.0 bits (102), Expect = 1e-06
Identities = 17/21 (80%), Positives = 19/21 (90%)
Query: 14 EITKLKIDSNPFAKGFRDSSR 34
+ITKLKID+NPFAKGFRD R
Sbjct: 168 KITKLKIDNNPFAKGFRDQGR 188
>gnl|CDD|227725 COG5438, COG5438, Predicted multitransmembrane protein [Function
unknown].
Length = 385
Score = 28.5 bits (64), Expect = 0.23
Identities = 18/71 (25%), Positives = 31/71 (43%), Gaps = 7/71 (9%)
Query: 1 MFNDDNNNQAYYKEITKLKIDSNPFAKGFRDSSRLTEFERQNVLFDRHFMGRILRDEYPP 60
F ++ A EIT +KI SN + +R ++Q++ +IL EY
Sbjct: 32 KFIAESAYNAPKGEITDVKIVSNNSEDKSSNKNRQV-LQKQSIKV------KILNGEYKG 84
Query: 61 DTGKVVNTLYS 71
+ K+ NT+
Sbjct: 85 ELVKINNTVNE 95
>gnl|CDD|237790 PRK14694, PRK14694, putative mercuric reductase; Provisional.
Length = 468
Score = 27.6 bits (61), Expect = 0.50
Identities = 12/29 (41%), Positives = 14/29 (48%)
Query: 35 LTEFERQNVLFDRHFMGRILRDEYPPDTG 63
L + Q V FDR F+G R PP G
Sbjct: 131 LNDGGEQTVHFDRAFIGTGARPAEPPVPG 159
>gnl|CDD|235181 PRK03958, PRK03958, tRNA 2'-O-methylase; Reviewed.
Length = 176
Score = 26.8 bits (60), Expect = 0.83
Identities = 11/32 (34%), Positives = 14/32 (43%), Gaps = 8/32 (25%)
Query: 43 VLFDRHFMGRILRDEY--------PPDTGKVV 66
V DR F G+ L E+ P + GK V
Sbjct: 142 VFLDRLFEGKELEREFEDAKLKVVPQERGKRV 173
>gnl|CDD|216447 pfam01346, FKBP_N, Domain amino terminal to FKBP-type
peptidyl-prolyl isomerase. This family is only found
at the amino terminus of pfam00254. This domain is of
unknown function.
Length = 124
Score = 24.9 bits (55), Expect = 3.5
Identities = 10/39 (25%), Positives = 19/39 (48%), Gaps = 5/39 (12%)
Query: 11 YYKEITKLKIDSNPFAKGFRD-----SSRLTEFERQNVL 44
Y K+ + ++D + +G +D +LT+ E Q L
Sbjct: 36 YLKQQGEEQLDKDALLQGLQDALAGKELKLTDEEIQKAL 74
>gnl|CDD|240358 PTZ00321, PTZ00321, ribosomal protein L11; Provisional.
Length = 342
Score = 24.9 bits (54), Expect = 4.1
Identities = 16/54 (29%), Positives = 26/54 (48%), Gaps = 9/54 (16%)
Query: 13 KEITKLKIDSNPFAKGFRDSSR------LTEFERQNVLFDRHFMGRILRDEYPP 60
+E +KL + + FAK F D ++ + R V FD+ ++ I E PP
Sbjct: 70 QEFSKLGLKAMDFAKSFNDRTKPHFKDDVELIVRIQVYFDKSYLFTI---EPPP 120
>gnl|CDD|220492 pfam09960, DUF2194, Uncharacterized protein conserved in bacteria
(DUF2194). This domain, found in various hypothetical
bacterial proteins, has no known function.
Length = 573
Score = 24.7 bits (54), Expect = 4.9
Identities = 9/29 (31%), Positives = 12/29 (41%)
Query: 12 YKEITKLKIDSNPFAKGFRDSSRLTEFER 40
E+ + IDS+ F G S FE
Sbjct: 144 ILEVKGIYIDSDFFIGGGDLYSIEIPFES 172
>gnl|CDD|236287 PRK08561, rps15p, 30S ribosomal protein S15P; Reviewed.
Length = 151
Score = 24.4 bits (54), Expect = 5.9
Identities = 11/24 (45%), Positives = 14/24 (58%), Gaps = 5/24 (20%)
Query: 51 GRILRDEYP-PD----TGKVVNTL 69
G ILRD+Y PD TGK + +
Sbjct: 51 GIILRDQYGIPDVKLITGKKITEI 74
>gnl|CDD|219797 pfam08326, ACC_central, Acetyl-CoA carboxylase, central region.
The region featured in this family is found in various
eukaryotic acetyl-CoA carboxylases, N-terminal to the
catalytic domain (pfam01039). This enzyme (EC:6.4.1.2)
is involved in the synthesis of long-chain fatty acids,
as it catalyzes the rate-limiting step in this process.
Length = 707
Score = 23.8 bits (52), Expect = 9.2
Identities = 9/18 (50%), Positives = 11/18 (61%)
Query: 54 LRDEYPPDTGKVVNTLYS 71
LRDE D KVV+ + S
Sbjct: 181 LRDENKDDLSKVVDIVLS 198
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.321 0.138 0.412
Gapped
Lambda K H
0.267 0.0751 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 3,722,393
Number of extensions: 277518
Number of successful extensions: 254
Number of sequences better than 10.0: 1
Number of HSP's gapped: 254
Number of HSP's successfully gapped: 13
Length of query: 72
Length of database: 10,937,602
Length adjustment: 42
Effective length of query: 30
Effective length of database: 9,074,734
Effective search space: 272242020
Effective search space used: 272242020
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 53 (24.2 bits)