RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy16701
(272 letters)
>gnl|CDD|239952 cd04516, TBP_eukaryotes, eukaryotic TATA box binding protein (TBP):
Present in archaea and eukaryotes, TBPs are
transcription factors that recognize promoters and
initiate transcription. TBP has been shown to be an
essential component of three different transcription
initiation complexes: SL1, TFIID and TFIIIB, directing
transcription by RNA polymerases I, II and III,
respectively. TBP binds directly to the TATA box
promoter element, where it nucleates polymerase
assembly, thus defining the transcription start site.
TBP's binding in the minor groove induces a dramatic DNA
bending while its own structure barely changes. The
conserved core domain of TBP, which binds to the TATA
box, has a bipartite structure, with intramolecular
symmetry generating a saddle-shaped structure that sits
astride the DNA.
Length = 174
Score = 224 bits (572), Expect = 3e-74
Identities = 86/144 (59%), Positives = 114/144 (79%)
Query: 129 NMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKH 188
N+V T+++GC LDL +I R R +EYNP +F +IM+I P+TT L F SGK++ GAK
Sbjct: 6 NIVATVNLGCKLDLKKIALRARNAEYNPKRFAAVIMRIREPKTTALIFSSGKMVCTGAKS 65
Query: 189 EHDCKLASRKFAKILKQLGHPIKYQGFKIHNIVCTCDVRFPVKLDALHHVHSQFSSYEPE 248
E D KLA+RK+A+I+++LG P K+ FKI NIV +CDV+FP++L+ L H H QFSSYEPE
Sbjct: 66 EDDSKLAARKYARIIQKLGFPAKFTDFKIQNIVGSCDVKFPIRLEGLAHAHKQFSSYEPE 125
Query: 249 LFPGLIYRMVKPRVVLLIFVNGRI 272
LFPGLIYRMVKP++VLLIFV+G+I
Sbjct: 126 LFPGLIYRMVKPKIVLLIFVSGKI 149
Score = 151 bits (384), Expect = 7e-46
Identities = 69/168 (41%), Positives = 104/168 (61%), Gaps = 2/168 (1%)
Query: 40 NMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKH 99
N+V T+++GC LDL +I R R +EYNP +F +IM+I P+TT L F SGK++ GAK
Sbjct: 6 NIVATVNLGCKLDLKKIALRARNAEYNPKRFAAVIMRIREPKTTALIFSSGKMVCTGAKS 65
Query: 100 EHDCKLASRKFAKILKQLGHP-EIIFFFFRNMVGTISVGCPLDLNQIN-SRVRYSEYNPG 157
E D KLA+RK+A+I+++LG P + F +N+VG+ V P+ L + + ++S Y P
Sbjct: 66 EDDSKLAARKYARIIQKLGFPAKFTDFKIQNIVGSCDVKFPIRLEGLAHAHKQFSSYEPE 125
Query: 158 KFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHDCKLASRKFAKILKQ 205
F GLI +++ P+ L F SGK+++ GAK + A IL Q
Sbjct: 126 LFPGLIYRMVKPKIVLLIFVSGKIVLTGAKSREEIYQAFENIYPILLQ 173
>gnl|CDD|177693 PLN00062, PLN00062, TATA-box-binding protein; Provisional.
Length = 179
Score = 213 bits (544), Expect = 6e-70
Identities = 81/144 (56%), Positives = 113/144 (78%)
Query: 129 NMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKH 188
N+V T+++ C LDL +I + R +EYNP +F +IM+I P+TT L F SGK++ GAK
Sbjct: 6 NIVSTVNLDCKLDLKKIALQARNAEYNPKRFAAVIMRIREPKTTALIFASGKMVCTGAKS 65
Query: 189 EHDCKLASRKFAKILKQLGHPIKYQGFKIHNIVCTCDVRFPVKLDALHHVHSQFSSYEPE 248
EHD KLA+RK+A+I+++LG P K++ FKI NIV +CDV+FP++L+ L + H FSSYEPE
Sbjct: 66 EHDSKLAARKYARIIQKLGFPAKFKDFKIQNIVGSCDVKFPIRLEGLAYAHGAFSSYEPE 125
Query: 249 LFPGLIYRMVKPRVVLLIFVNGRI 272
LFPGLIYRM +P++VLLIFV+G+I
Sbjct: 126 LFPGLIYRMKQPKIVLLIFVSGKI 149
Score = 147 bits (373), Expect = 3e-44
Identities = 67/168 (39%), Positives = 102/168 (60%), Gaps = 2/168 (1%)
Query: 40 NMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKH 99
N+V T+++ C LDL +I + R +EYNP +F +IM+I P+TT L F SGK++ GAK
Sbjct: 6 NIVSTVNLDCKLDLKKIALQARNAEYNPKRFAAVIMRIREPKTTALIFASGKMVCTGAKS 65
Query: 100 EHDCKLASRKFAKILKQLGHP-EIIFFFFRNMVGTISVGCPLDLNQIN-SRVRYSEYNPG 157
EHD KLA+RK+A+I+++LG P + F +N+VG+ V P+ L + + +S Y P
Sbjct: 66 EHDSKLAARKYARIIQKLGFPAKFKDFKIQNIVGSCDVKFPIRLEGLAYAHGAFSSYEPE 125
Query: 158 KFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHDCKLASRKFAKILKQ 205
F GLI ++ P+ L F SGK++I GAK + A +L +
Sbjct: 126 LFPGLIYRMKQPKIVLLIFVSGKIVITGAKVREEIYTAFENIYPVLTE 173
>gnl|CDD|238352 cd00652, TBP_TLF, TATA box binding protein (TBP): Present in
archaea and eukaryotes, TBPs are transcription factors
that recognize promoters and initiate transcription. TBP
has been shown to be an essential component of three
different transcription initiation complexes: SL1, TFIID
and TFIIIB, directing transcription by RNA polymerases
I, II and III, respectively. TBP binds directly to the
TATA box promoter element, where it nucleates polymerase
assembly, thus defining the transcription start site.
TBP's binding in the minor groove induces a dramatic DNA
bending while its own structure barely changes. The
conserved core domain of TBP, which binds to the TATA
box, has a bipartite structure, with intramolecular
symmetry generating a saddle-shaped structure that sits
astride the DNA. New members of the TBP family, called
TBP-like proteins (TBLP, TLF, TLP) or TBP-related
factors (TRF1, TRF2,TRP), are similar to the core domain
of TBPs, with identical or chemically similar amino
acids at many equivalent positions, suggesting similar
structure. However, TLFs contain distinct, conserved
amino acids at several positions that distinguish them
from TBP.
Length = 174
Score = 203 bits (520), Expect = 2e-66
Identities = 80/146 (54%), Positives = 113/146 (77%), Gaps = 1/146 (0%)
Query: 128 RNMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAK 187
+N+V T+++GC LDL +I R +EYNP +F G+IM++ P+TT L F SGK++I GAK
Sbjct: 5 QNIVATVNLGCELDLRKIALAARNAEYNPKRFPGVIMRLREPKTTALIFSSGKMVITGAK 64
Query: 188 HEHDCKLASRKFAKILKQLGHPI-KYQGFKIHNIVCTCDVRFPVKLDALHHVHSQFSSYE 246
E D KLA+RK+A+IL++LG P+ K+ FK+ NIV +CD+ FP++L+ L H + +SYE
Sbjct: 65 SEEDAKLAARKYARILQKLGFPVEKFPEFKVQNIVASCDLGFPIRLEELALKHPENASYE 124
Query: 247 PELFPGLIYRMVKPRVVLLIFVNGRI 272
PELFPGLIYRM +P+VVLLIFV+G+I
Sbjct: 125 PELFPGLIYRMDEPKVVLLIFVSGKI 150
Score = 160 bits (407), Expect = 2e-49
Identities = 72/169 (42%), Positives = 105/169 (62%), Gaps = 3/169 (1%)
Query: 40 NMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKH 99
N+V T+++GC LDL +I R +EYNP +F G+IM++ P+TT L F SGK++I GAK
Sbjct: 6 NIVATVNLGCELDLRKIALAARNAEYNPKRFPGVIMRLREPKTTALIFSSGKMVITGAKS 65
Query: 100 EHDCKLASRKFAKILKQLGHPEIIFFFFR--NMVGTISVGCPLDLNQINSRVR-YSEYNP 156
E D KLA+RK+A+IL++LG P F F+ N+V + +G P+ L ++ + + Y P
Sbjct: 66 EEDAKLAARKYARILQKLGFPVEKFPEFKVQNIVASCDLGFPIRLEELALKHPENASYEP 125
Query: 157 GKFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHDCKLASRKFAKILKQ 205
F GLI ++ P+ L F SGK++I GAK D A K ILK+
Sbjct: 126 ELFPGLIYRMDEPKVVLLIFVSGKIVITGAKSREDIYEAVEKIYPILKE 174
Score = 51.4 bits (124), Expect = 4e-08
Identities = 23/61 (37%), Positives = 35/61 (57%), Gaps = 9/61 (14%)
Query: 216 KIHNIVCTCDVRFPVKLD----ALHHVHSQFSSYEPELFPGLIYRMVKPRVVLLIFVNGR 271
KI NIV T V +LD AL +++ Y P+ FPG+I R+ +P+ LIF +G+
Sbjct: 3 KIQNIVAT--VNLGCELDLRKIALAARNAE---YNPKRFPGVIMRLREPKTTALIFSSGK 57
Query: 272 I 272
+
Sbjct: 58 M 58
>gnl|CDD|225012 COG2101, SPT15, TATA-box binding protein (TBP), component of TFIID
and TFIIIB [Transcription].
Length = 185
Score = 173 bits (441), Expect = 2e-54
Identities = 57/146 (39%), Positives = 89/146 (60%), Gaps = 2/146 (1%)
Query: 129 NMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKH 188
N+V ++ +G LDL ++ + +EYNP +F GL+ ++ P+T L F+SGK++ GAK
Sbjct: 12 NIVASVDLGQELDLEEVALDLPGAEYNPEQFPGLVYRLEEPKTAALIFRSGKVVCTGAKS 71
Query: 189 EHDCKLASRKFAKILKQLGHPIKYQ-GFKIHNIVCTCDVRFPVKLDALHHVHSQF-SSYE 246
D A +K AK LK G I ++ K+ NIV + D+ + L+A+ YE
Sbjct: 72 VEDVHRAVKKLAKKLKDGGIDIDFEPEIKVQNIVASADLGVELNLNAIAIGLGLENIEYE 131
Query: 247 PELFPGLIYRMVKPRVVLLIFVNGRI 272
PE FPGL+YR+ +PRVVLL+F +G++
Sbjct: 132 PEQFPGLVYRLDEPRVVLLLFGSGKL 157
Score = 151 bits (383), Expect = 1e-45
Identities = 65/173 (37%), Positives = 97/173 (56%), Gaps = 6/173 (3%)
Query: 40 NMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKH 99
N+V ++ +G LDL ++ + +EYNP +F GL+ ++ P+T L F+SGK++ GAK
Sbjct: 12 NIVASVDLGQELDLEEVALDLPGAEYNPEQFPGLVYRLEEPKTAALIFRSGKVVCTGAKS 71
Query: 100 EHDCKLASRKFAKILKQLGHPEIIF---FFFRNMVGTISVGCPLDLNQINSRV--RYSEY 154
D A +K AK LK G +I F +N+V + +G L+LN I + EY
Sbjct: 72 VEDVHRAVKKLAKKLKDGG-IDIDFEPEIKVQNIVASADLGVELNLNAIAIGLGLENIEY 130
Query: 155 NPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHDCKLASRKFAKILKQLG 207
P +F GL+ ++ PR L F SGKL+I GAK E D + A K L++LG
Sbjct: 131 EPEQFPGLVYRLDEPRVVLLLFGSGKLVITGAKSEEDAEQAVEKIQSRLEELG 183
>gnl|CDD|239954 cd04518, TBP_archaea, archaeal TATA box binding protein (TBP): TBPs
are transcription factors present in archaea and
eukaryotes, that recognize promoters and initiate
transcription. TBP has been shown to be an essential
component of three different transcription initiation
complexes: SL1, TFIID and TFIIIB, directing
transcription by RNA polymerases I, II and III,
respectively. TBP binds directly to the TATA box
promoter element, where it nucleates polymerase
assembly, thus defining the transcription start site.
TBP's binding in the minor groove induces a dramatic DNA
bending while its own structure barely changes. The
conserved core domain of TBP, which binds to the TATA
box, has a bipartite structure, with intramolecular
symmetry generating a saddle-shaped structure that sits
astride the DNA.
Length = 174
Score = 136 bits (345), Expect = 4e-40
Identities = 60/172 (34%), Positives = 97/172 (56%), Gaps = 8/172 (4%)
Query: 40 NMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKH 99
N+V ++ +G LDL ++ + + +EYNP +F GL+ ++ +P+ L F+SGK++ GAK
Sbjct: 6 NIVASVDLGQELDLEKVAAELPNAEYNPDQFPGLVYRLEDPKIAALIFRSGKMVCTGAKS 65
Query: 100 EHDCKLASRKFAKILKQLG-----HPEIIFFFFRNMVGTISVGCPLDLNQINSRVRYSEY 154
D A ++ K LK G PEI +N+V + +G ++L+ I + +EY
Sbjct: 66 VEDLHRAVKEIIKKLKDYGIKVIEKPEIK---VQNIVASADLGREVNLDAIAIGLPNAEY 122
Query: 155 NPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHDCKLASRKFAKILKQL 206
P +F GL+ ++ P+ L F SGK++I GAK E D K A K LK+L
Sbjct: 123 EPEQFPGLVYRLDEPKVVLLLFSSGKMVITGAKSEEDAKRAVEKLLSRLKEL 174
Score = 131 bits (332), Expect = 3e-38
Identities = 54/145 (37%), Positives = 90/145 (62%), Gaps = 2/145 (1%)
Query: 129 NMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKH 188
N+V ++ +G LDL ++ + + +EYNP +F GL+ ++ +P+ L F+SGK++ GAK
Sbjct: 6 NIVASVDLGQELDLEKVAAELPNAEYNPDQFPGLVYRLEDPKIAALIFRSGKMVCTGAKS 65
Query: 189 EHDCKLASRKFAKILKQLGHPIKYQ-GFKIHNIVCTCDVRFPVKLDALHHVHSQFSSYEP 247
D A ++ K LK G + + K+ NIV + D+ V LDA+ + + YEP
Sbjct: 66 VEDLHRAVKEIIKKLKDYGIKVIEKPEIKVQNIVASADLGREVNLDAI-AIGLPNAEYEP 124
Query: 248 ELFPGLIYRMVKPRVVLLIFVNGRI 272
E FPGL+YR+ +P+VVLL+F +G++
Sbjct: 125 EQFPGLVYRLDEPKVVLLLFSSGKM 149
>gnl|CDD|234746 PRK00394, PRK00394, transcription factor; Reviewed.
Length = 179
Score = 135 bits (342), Expect = 2e-39
Identities = 52/146 (35%), Positives = 88/146 (60%), Gaps = 2/146 (1%)
Query: 129 NMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKH 188
N+V + +G LDL ++ + +EYNP +F GL+ ++ +P+ L F+SGK++ GAK
Sbjct: 5 NIVASTDLGQELDLEKVAEDLPNAEYNPEQFPGLVYRLEDPKIAALIFRSGKVVCTGAKS 64
Query: 189 EHDCKLASRKFAKILKQLGHPIKYQ-GFKIHNIVCTCDVRFPVKLDAL-HHVHSQFSSYE 246
D A + K LK+LG + + K+ NIV + D+ + L+A+ + + YE
Sbjct: 65 VEDLHEAVKIIIKKLKELGIKVIDEPEIKVQNIVASADLGVELNLNAIAIGLGLENIEYE 124
Query: 247 PELFPGLIYRMVKPRVVLLIFVNGRI 272
PE FPGL+YR+ P+VV+L+F +G++
Sbjct: 125 PEQFPGLVYRLDDPKVVVLLFGSGKL 150
Score = 130 bits (329), Expect = 1e-37
Identities = 63/175 (36%), Positives = 98/175 (56%), Gaps = 10/175 (5%)
Query: 40 NMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKH 99
N+V + +G LDL ++ + +EYNP +F GL+ ++ +P+ L F+SGK++ GAK
Sbjct: 5 NIVASTDLGQELDLEKVAEDLPNAEYNPEQFPGLVYRLEDPKIAALIFRSGKVVCTGAKS 64
Query: 100 EHDCKLASRKFAKILKQLG-----HPEIIFFFFRNMVGTISVGCPLDLNQI--NSRVRYS 152
D A + K LK+LG PEI +N+V + +G L+LN I +
Sbjct: 65 VEDLHEAVKIIIKKLKELGIKVIDEPEIK---VQNIVASADLGVELNLNAIAIGLGLENI 121
Query: 153 EYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHDCKLASRKFAKILKQLG 207
EY P +F GL+ ++ +P+ L F SGKL+I GAK E D + A K + L++LG
Sbjct: 122 EYEPEQFPGLVYRLDDPKVVVLLFGSGKLVITGAKSEEDAEKAVEKILEKLEELG 176
>gnl|CDD|239953 cd04517, TLF, TBP-like factors (TLF; also called TLP, TRF, TRP),
which are found in most metazoans. TLFs and TBPs have
well-conserved core domains; however, they only share
about 60% similarity. TLFs, like TBPs, interact with
TFIIA and TFIIB, which are part of the basal
transcription machinery. Yet, in contrast to TBPs, TLFs
seem not to interact with the TATA-box and even have a
negative effect on the transcription of TATA-containing
promoters. Recent results indicate that TLFs are
involved in the transcription via TATA-less promoters.
Length = 174
Score = 122 bits (309), Expect = 1e-34
Identities = 54/151 (35%), Positives = 90/151 (59%), Gaps = 2/151 (1%)
Query: 123 IFFFFRNMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLL 182
+ N+V S+ C +DL ++ R EYNP ++ + M++ PR T + SGK+
Sbjct: 1 LDILIVNVVCQFSLRCHIDLRKLALAGRNVEYNP-RYPKVTMRLREPRATASVWSSGKIT 59
Query: 183 ILGAKHEHDCKLASRKFAKILKQLGHP-IKYQGFKIHNIVCTCDVRFPVKLDALHHVHSQ 241
I GA E + K A+R+ A++L++LG +++ F++ N++ TC + FP++LD L +
Sbjct: 60 ITGATSEEEAKQAARRAARLLQKLGFKVVRFSNFRVVNVLATCSMPFPIRLDELAAKNRS 119
Query: 242 FSSYEPELFPGLIYRMVKPRVVLLIFVNGRI 272
+SYEPEL PG++YR+ PR L IF G +
Sbjct: 120 SASYEPELHPGVVYRITGPRATLSIFSTGSV 150
Score = 89.3 bits (222), Expect = 6e-22
Identities = 53/169 (31%), Positives = 93/169 (55%), Gaps = 4/169 (2%)
Query: 40 NMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKH 99
N+V S+ C +DL ++ R EYNP ++ + M++ PR T + SGK+ I GA
Sbjct: 7 NVVCQFSLRCHIDLRKLALAGRNVEYNP-RYPKVTMRLREPRATASVWSSGKITITGATS 65
Query: 100 EHDCKLASRKFAKILKQLGHPEIIFFFFR--NMVGTISVGCPLDLNQINSRVR-YSEYNP 156
E + K A+R+ A++L++LG + F FR N++ T S+ P+ L+++ ++ R + Y P
Sbjct: 66 EEEAKQAARRAARLLQKLGFKVVRFSNFRVVNVLATCSMPFPIRLDELAAKNRSSASYEP 125
Query: 157 GKFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHDCKLASRKFAKILKQ 205
G++ +I PR T F +G + + GA+ D + A K I+ +
Sbjct: 126 ELHPGVVYRITGPRATLSIFSTGSVTVTGARSMEDVREAVEKIYPIVFE 174
>gnl|CDD|201174 pfam00352, TBP, Transcription factor TFIID (or TATA-binding
protein, TBP).
Length = 86
Score = 104 bits (262), Expect = 8e-29
Identities = 37/80 (46%), Positives = 54/80 (67%)
Query: 128 RNMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAK 187
+N+V + +G P+DL +I ++R +EY P +F GL+ ++ P+ T L F SGK++I GAK
Sbjct: 7 QNIVASADLGFPIDLEKIALKLRNAEYEPEQFPGLVYRLREPKATALIFSSGKVVITGAK 66
Query: 188 HEHDCKLASRKFAKILKQLG 207
E D K A RK A ILK+LG
Sbjct: 67 SEEDAKKAVRKIAPILKKLG 86
Score = 104 bits (261), Expect = 1e-28
Identities = 37/79 (46%), Positives = 53/79 (67%)
Query: 40 NMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKH 99
N+V + +G P+DL +I ++R +EY P +F GL+ ++ P+ T L F SGK++I GAK
Sbjct: 8 NIVASADLGFPIDLEKIALKLRNAEYEPEQFPGLVYRLREPKATALIFSSGKVVITGAKS 67
Query: 100 EHDCKLASRKFAKILKQLG 118
E D K A RK A ILK+LG
Sbjct: 68 EEDAKKAVRKIAPILKKLG 86
Score = 91.8 bits (229), Expect = 6e-24
Identities = 24/58 (41%), Positives = 38/58 (65%), Gaps = 1/58 (1%)
Query: 215 FKIHNIVCTCDVRFPVKLDALHHVHSQFSSYEPELFPGLIYRMVKPRVVLLIFVNGRI 272
KI NIV + D+ FP+ L+ + + + YEPE FPGL+YR+ +P+ LIF +G++
Sbjct: 4 IKIQNIVASADLGFPIDLEKIALKL-RNAEYEPEQFPGLVYRLREPKATALIFSSGKV 60
>gnl|CDD|238383 cd00751, thiolase, Thiolase are ubiquitous enzymes that catalyze
the reversible thiolytic cleavage of 3-ketoacyl-CoA into
acyl-CoA and acetyl-CoA, a 2-step reaction involving a
covalent intermediate formed with a catalytic cysteine.
They are found in prokaryotes and eukaryotes (cytosol,
microbodies and mitochondria). There are 2 functional
different classes: thiolase-I (3-ketoacyl-CoA thiolase)
and thiolase-II (acetoacetyl-CoA thiolase). Thiolase-I
can cleave longer fatty acid molecules and plays an
important role in the beta-oxidative degradation of
fatty acids. Thiolase-II has a high substrate
specificity. Although it can cleave acetoacyl-CoA, its
main function is the synthesis of acetoacyl-CoA from two
molecules of acetyl-CoA, which gives it importance in
several biosynthetic pathways.
Length = 386
Score = 37.8 bits (89), Expect = 0.004
Identities = 10/19 (52%), Positives = 14/19 (73%)
Query: 4 KRKNLKYGIGSACIGGGQA 22
KR+ +YG+ + CIGGGQ
Sbjct: 361 KRRGGRYGLATMCIGGGQG 379
>gnl|CDD|180261 PRK05790, PRK05790, putative acyltransferase; Provisional.
Length = 393
Score = 36.3 bits (85), Expect = 0.014
Identities = 10/18 (55%), Positives = 13/18 (72%)
Query: 4 KRKNLKYGIGSACIGGGQ 21
KR+ K G+ + CIGGGQ
Sbjct: 367 KRRGAKKGLATLCIGGGQ 384
>gnl|CDD|218343 pfam04950, DUF663, Protein of unknown function (DUF663). This
family contains several uncharacterized eukaryotic
proteins.
Length = 293
Score = 34.6 bits (80), Expect = 0.036
Identities = 44/177 (24%), Positives = 62/177 (35%), Gaps = 27/177 (15%)
Query: 53 LNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGA--KHEHDCKLA---- 106
LN+ R PG + L +K + P+ F S +L+I+ HEH +
Sbjct: 58 LNEYEEEKRIEGVKPGWYVRLEIKNV-PKELAENFNSKQLIIVFGLLPHEHKMSVVNFKI 116
Query: 107 --SRKFAKILKQLGHPEIIFF--FFRNMVGTISVGCPLDLNQINSRVRYSEYNPGKFH-- 160
R + K LK +I F R I N N + +Y Y P
Sbjct: 117 KKHRWYKKPLKS--KEPLIIQVGFRRFQSQPI-----YSQNTSNDKHKYERYLPPHKTCN 169
Query: 161 -GLIMKILNPRTTCLAFQ-----SGKLLILGAKHEHDCKLASRKFAKILKQLGHPIK 211
I P + LAF+ + I +C +R AK LK GHP K
Sbjct: 170 ATFYGPITFPNSPLLAFKKSFSDTNGFRIAATGCILECD-PNRIIAKKLKLTGHPYK 225
>gnl|CDD|181265 PRK08170, PRK08170, acetyl-CoA acetyltransferase; Provisional.
Length = 426
Score = 33.8 bits (78), Expect = 0.090
Identities = 11/18 (61%), Positives = 13/18 (72%)
Query: 4 KRKNLKYGIGSACIGGGQ 21
KR+ K GI + CIGGGQ
Sbjct: 400 KRRGTKRGIAAICIGGGQ 417
>gnl|CDD|145779 pfam02803, Thiolase_C, Thiolase, C-terminal domain. Thiolase is
reported to be structurally related to beta-ketoacyl
synthase (pfam00109), and also chalcone synthase.
Length = 123
Score = 32.2 bits (74), Expect = 0.095
Identities = 11/19 (57%), Positives = 14/19 (73%)
Query: 4 KRKNLKYGIGSACIGGGQA 22
KR+ KYG+ + CIGGGQ
Sbjct: 98 KRRGGKYGLATLCIGGGQG 116
>gnl|CDD|233642 TIGR01930, AcCoA-C-Actrans, acetyl-CoA acetyltransferases. This
model represents a large family of enzymes which
catalyze the thiolysis of a linear fatty acid CoA (or
acetoacetyl-CoA) using a second CoA molecule to produce
acetyl-CoA and a CoA-ester product two carbons shorter
(or, alternatively, the condensation of two molecules of
acetyl-CoA to produce acetoacetyl-CoA and CoA). This
enzyme is also known as "thiolase", "3-ketoacyl-CoA
thiolase", "beta-ketothiolase" and "Fatty oxidation
complex beta subunit". When catalyzing the degradative
reaction on fatty acids the corresponding EC number is
2.3.1.16. The condensation reaction corresponds to
2.3.1.9. Note that the enzymes which catalyze the
condensation are generally not involved in fatty acid
biosynthesis, which is carried out by a decarboxylating
condensation of acetyl and malonyl esters of acyl
carrier proteins. Rather, this activity may produce
acetoacetyl-CoA for pathways such as IPP biosynthesis in
the absence of sufficient fatty acid oxidation [Fatty
acid and phospholipid metabolism, Other].
Length = 386
Score = 33.4 bits (77), Expect = 0.099
Identities = 10/19 (52%), Positives = 14/19 (73%)
Query: 4 KRKNLKYGIGSACIGGGQA 22
KR+ +YG+ + CIGGGQ
Sbjct: 362 KRRGGRYGLATICIGGGQG 380
>gnl|CDD|181072 PRK07661, PRK07661, acetyl-CoA acetyltransferase; Provisional.
Length = 391
Score = 30.9 bits (70), Expect = 0.73
Identities = 10/17 (58%), Positives = 14/17 (82%)
Query: 4 KRKNLKYGIGSACIGGG 20
KR+N ++GI + CIGGG
Sbjct: 365 KRRNEQFGIVTMCIGGG 381
>gnl|CDD|181626 PRK09052, PRK09052, acetyl-CoA acetyltransferase; Provisional.
Length = 399
Score = 29.6 bits (67), Expect = 1.6
Identities = 9/18 (50%), Positives = 13/18 (72%)
Query: 4 KRKNLKYGIGSACIGGGQ 21
+R NLKYG+ + C+G G
Sbjct: 373 RRTNLKYGMVTMCVGTGM 390
>gnl|CDD|235741 PRK06205, PRK06205, acetyl-CoA acetyltransferase; Provisional.
Length = 404
Score = 29.6 bits (67), Expect = 1.6
Identities = 9/18 (50%), Positives = 14/18 (77%)
Query: 4 KRKNLKYGIGSACIGGGQ 21
+R+ +YG+ + CIGGGQ
Sbjct: 377 QRRQARYGLETMCIGGGQ 394
>gnl|CDD|234886 PRK00996, PRK00996, ribonuclease HIII; Provisional.
Length = 304
Score = 29.1 bits (66), Expect = 2.5
Identities = 13/53 (24%), Positives = 20/53 (37%), Gaps = 2/53 (3%)
Query: 51 LDLNQINS-RVRYSEYNPGK-FHGLIMKILNPRTTCLAFQSGKLLILGAKHEH 101
L I + +Y E G + P T A++SGK++ G E
Sbjct: 8 LSDKLIEKLKEQYEEKLTPSLPPGAVFAAKKPGVTITAYKSGKVVFQGKGAEA 60
Score = 29.1 bits (66), Expect = 2.5
Identities = 13/53 (24%), Positives = 20/53 (37%), Gaps = 2/53 (3%)
Query: 140 LDLNQINS-RVRYSEYNPGK-FHGLIMKILNPRTTCLAFQSGKLLILGAKHEH 190
L I + +Y E G + P T A++SGK++ G E
Sbjct: 8 LSDKLIEKLKEQYEEKLTPSLPPGAVFAAKKPGVTITAYKSGKVVFQGKGAEA 60
>gnl|CDD|215347 PLN02644, PLN02644, acetyl-CoA C-acetyltransferase.
Length = 394
Score = 28.9 bits (65), Expect = 3.1
Identities = 10/19 (52%), Positives = 12/19 (63%)
Query: 4 KRKNLKYGIGSACIGGGQA 22
+ KN KYG+ C GGG A
Sbjct: 367 RSKNGKYGVAGICNGGGGA 385
>gnl|CDD|222691 pfam14337, DUF4393, Domain of unknown function (DUF4393). This
family of proteins is found in bacteria, archaea and
viruses. Proteins in this family are typically between
254 and 285 amino acids in length.
Length = 185
Score = 28.5 bits (64), Expect = 3.3
Identities = 15/42 (35%), Positives = 17/42 (40%), Gaps = 6/42 (14%)
Query: 191 DCKLASRKFAKILKQLGHP----IKYQGFKIHNIVCTCDVRF 228
F ILKQL +KY +NIV T DVR
Sbjct: 51 KNDDVHPSFVDILKQLSPDEARILKY--LYKNNIVPTVDVRL 90
>gnl|CDD|152474 pfam12039, DUF3525, Protein of unknown function (DUF3525). This
family of proteins is functionally uncharacterized. This
protein is found in viruses. Proteins in this family are
about 360 amino acids in length.
Length = 404
Score = 28.9 bits (64), Expect = 3.3
Identities = 15/45 (33%), Positives = 22/45 (48%), Gaps = 10/45 (22%)
Query: 232 LDALHHVHSQFSSYEPELFPGL----------IYRMVKPRVVLLI 266
+D H V SQFS Y P + L +YR+ +P + LL+
Sbjct: 306 IDETHRVLSQFSHYAPRILNELTTWLKGLAENLYRVAEPILELLL 350
>gnl|CDD|180563 PRK06445, PRK06445, acetyl-CoA acetyltransferase; Provisional.
Length = 394
Score = 28.5 bits (64), Expect = 3.4
Identities = 8/16 (50%), Positives = 11/16 (68%)
Query: 6 KNLKYGIGSACIGGGQ 21
K YG+ + C+GGGQ
Sbjct: 370 KGKDYGVATLCVGGGQ 385
>gnl|CDD|143599 cd07575, Xc-1258_like, Xanthomonas campestris XC1258 and related
proteins, members of the nitrilase superfamily (putative
class 13 nitrilases). Uncharacterized subgroup
belonging to a larger nitrilase superfamily comprised of
nitrile- or amide-hydrolyzing enzymes and
amide-condensing enzymes, which depend on a Glu-Lys-Cys
catalytic triad. This superfamily has been classified in
the literature based on global and structure based
sequence analysis into thirteen different enzyme classes
(referred to as 1-13), class 13 represents proteins that
at the time were difficult to place in a distinct
similarity group; this subgroup either represents a new
class or one that was included previously in class 13.
Members of this superfamily generally form homomeric
complexes, the basic building block of which is a
homodimer. XC1258 is a homotetramer.
Length = 252
Score = 28.3 bits (64), Expect = 4.0
Identities = 11/21 (52%), Positives = 16/21 (76%), Gaps = 1/21 (4%)
Query: 210 IKYQGFKIHNIVCTCDVRFPV 230
++Y+G+KI VC D+RFPV
Sbjct: 128 VEYKGWKILLQVCY-DLRFPV 147
>gnl|CDD|179184 PRK00961, PRK00961, H(2)-dependent methylenetetrahydromethanopterin
dehydrogenase; Provisional.
Length = 342
Score = 27.8 bits (62), Expect = 6.0
Identities = 12/23 (52%), Positives = 14/23 (60%)
Query: 96 GAKHEHDCKLASRKFAKILKQLG 118
GA H C + + KFAKI K LG
Sbjct: 169 GAIVTHACTIPTTKFAKIFKDLG 191
Score = 27.8 bits (62), Expect = 6.0
Identities = 12/23 (52%), Positives = 14/23 (60%)
Query: 185 GAKHEHDCKLASRKFAKILKQLG 207
GA H C + + KFAKI K LG
Sbjct: 169 GAIVTHACTIPTTKFAKIFKDLG 191
>gnl|CDD|234114 TIGR03121, one_C_dehyd_A, formylmethanofuran dehydrogenase subunit
A. Members of this largely archaeal protein family are
subunit A of the formylmethanofuran dehydrogenase.
Nomenclature in some bacteria may reflect inclusion of
the formyltransferase described by TIGR03119 as part of
the complex, and therefore call this protein
formyltransferase/hydrolase complex Fhc subunit A. Note
that this model does not distinguish tungsten (FwdA)
from molybdenum-containing (FmdA) forms of this enzyme;
a single gene from this family is expressed
constitutively in Methanobacterium thermoautotrophicum,
which has both tungsten and molybdenum forms and may
work interchangeably.
Length = 556
Score = 27.7 bits (62), Expect = 8.1
Identities = 13/41 (31%), Positives = 18/41 (43%), Gaps = 2/41 (4%)
Query: 205 QLGHPIKYQGFKIHNIVCTCDVRFPVKLDALHHVHSQFSSY 245
LG P Y+ + + V+ P + LH H QF SY
Sbjct: 237 NLGVPGNYE-TTLDTLDAAEGVK-PNRNQVLHLTHVQFHSY 275
>gnl|CDD|218769 pfam05833, FbpA, Fibronectin-binding protein A N-terminus (FbpA).
This family consists of the N-terminal region of the
prokaryotic fibronectin-binding protein. Fibronectin
binding is considered to be an important virulence
factor in streptococcal infections. Fibronectin is a
dimeric glycoprotein that is present in a soluble form
in plasma and extracellular fluids; it is also present
in a fibrillar form on cell surfaces. Both the soluble
and cellular forms of fibronectin may be incorporated
into the extracellular tissue matrix. While fibronectin
has critical roles in eukaryotic cellular processes,
such as adhesion, migration and differentiation, it is
also a substrate for the attachment of bacteria. The
binding of pathogenic Streptococcus pyogenes and
Staphylococcus aureus to epithelial cells via
fibronectin facilitates their internalisation and
systemic spread within the host.
Length = 447
Score = 27.2 bits (61), Expect = 9.5
Identities = 8/24 (33%), Positives = 11/24 (45%), Gaps = 1/24 (4%)
Query: 232 LDALHHVHSQFSSYEPELFPGLIY 255
+DA+ HV S + PG Y
Sbjct: 134 IDAIKHVSPSQSRVRT-ILPGEPY 156
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.327 0.144 0.447
Gapped
Lambda K H
0.267 0.0711 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 13,711,924
Number of extensions: 1284584
Number of successful extensions: 1416
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1377
Number of HSP's successfully gapped: 44
Length of query: 272
Length of database: 10,937,602
Length adjustment: 95
Effective length of query: 177
Effective length of database: 6,723,972
Effective search space: 1190143044
Effective search space used: 1190143044
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.7 bits)
S2: 58 (26.2 bits)