RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy16701
         (272 letters)



>gnl|CDD|239952 cd04516, TBP_eukaryotes, eukaryotic TATA box binding protein (TBP):
           Present in archaea and eukaryotes, TBPs are
           transcription factors that recognize promoters and
           initiate transcription. TBP has been shown to be an
           essential component of three different transcription
           initiation complexes: SL1, TFIID and TFIIIB, directing
           transcription by RNA polymerases I, II and III,
           respectively. TBP binds directly to the TATA box
           promoter element, where it nucleates polymerase
           assembly, thus defining the transcription start site.
           TBP's binding in the minor groove induces a dramatic DNA
           bending while its own structure barely changes. The
           conserved core domain of TBP, which binds to the TATA
           box, has a bipartite structure, with intramolecular
           symmetry generating a saddle-shaped structure that sits
           astride the DNA.
          Length = 174

 Score =  224 bits (572), Expect = 3e-74
 Identities = 86/144 (59%), Positives = 114/144 (79%)

Query: 129 NMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKH 188
           N+V T+++GC LDL +I  R R +EYNP +F  +IM+I  P+TT L F SGK++  GAK 
Sbjct: 6   NIVATVNLGCKLDLKKIALRARNAEYNPKRFAAVIMRIREPKTTALIFSSGKMVCTGAKS 65

Query: 189 EHDCKLASRKFAKILKQLGHPIKYQGFKIHNIVCTCDVRFPVKLDALHHVHSQFSSYEPE 248
           E D KLA+RK+A+I+++LG P K+  FKI NIV +CDV+FP++L+ L H H QFSSYEPE
Sbjct: 66  EDDSKLAARKYARIIQKLGFPAKFTDFKIQNIVGSCDVKFPIRLEGLAHAHKQFSSYEPE 125

Query: 249 LFPGLIYRMVKPRVVLLIFVNGRI 272
           LFPGLIYRMVKP++VLLIFV+G+I
Sbjct: 126 LFPGLIYRMVKPKIVLLIFVSGKI 149



 Score =  151 bits (384), Expect = 7e-46
 Identities = 69/168 (41%), Positives = 104/168 (61%), Gaps = 2/168 (1%)

Query: 40  NMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKH 99
           N+V T+++GC LDL +I  R R +EYNP +F  +IM+I  P+TT L F SGK++  GAK 
Sbjct: 6   NIVATVNLGCKLDLKKIALRARNAEYNPKRFAAVIMRIREPKTTALIFSSGKMVCTGAKS 65

Query: 100 EHDCKLASRKFAKILKQLGHP-EIIFFFFRNMVGTISVGCPLDLNQIN-SRVRYSEYNPG 157
           E D KLA+RK+A+I+++LG P +   F  +N+VG+  V  P+ L  +  +  ++S Y P 
Sbjct: 66  EDDSKLAARKYARIIQKLGFPAKFTDFKIQNIVGSCDVKFPIRLEGLAHAHKQFSSYEPE 125

Query: 158 KFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHDCKLASRKFAKILKQ 205
            F GLI +++ P+   L F SGK+++ GAK   +   A      IL Q
Sbjct: 126 LFPGLIYRMVKPKIVLLIFVSGKIVLTGAKSREEIYQAFENIYPILLQ 173


>gnl|CDD|177693 PLN00062, PLN00062, TATA-box-binding protein; Provisional.
          Length = 179

 Score =  213 bits (544), Expect = 6e-70
 Identities = 81/144 (56%), Positives = 113/144 (78%)

Query: 129 NMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKH 188
           N+V T+++ C LDL +I  + R +EYNP +F  +IM+I  P+TT L F SGK++  GAK 
Sbjct: 6   NIVSTVNLDCKLDLKKIALQARNAEYNPKRFAAVIMRIREPKTTALIFASGKMVCTGAKS 65

Query: 189 EHDCKLASRKFAKILKQLGHPIKYQGFKIHNIVCTCDVRFPVKLDALHHVHSQFSSYEPE 248
           EHD KLA+RK+A+I+++LG P K++ FKI NIV +CDV+FP++L+ L + H  FSSYEPE
Sbjct: 66  EHDSKLAARKYARIIQKLGFPAKFKDFKIQNIVGSCDVKFPIRLEGLAYAHGAFSSYEPE 125

Query: 249 LFPGLIYRMVKPRVVLLIFVNGRI 272
           LFPGLIYRM +P++VLLIFV+G+I
Sbjct: 126 LFPGLIYRMKQPKIVLLIFVSGKI 149



 Score =  147 bits (373), Expect = 3e-44
 Identities = 67/168 (39%), Positives = 102/168 (60%), Gaps = 2/168 (1%)

Query: 40  NMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKH 99
           N+V T+++ C LDL +I  + R +EYNP +F  +IM+I  P+TT L F SGK++  GAK 
Sbjct: 6   NIVSTVNLDCKLDLKKIALQARNAEYNPKRFAAVIMRIREPKTTALIFASGKMVCTGAKS 65

Query: 100 EHDCKLASRKFAKILKQLGHP-EIIFFFFRNMVGTISVGCPLDLNQIN-SRVRYSEYNPG 157
           EHD KLA+RK+A+I+++LG P +   F  +N+VG+  V  P+ L  +  +   +S Y P 
Sbjct: 66  EHDSKLAARKYARIIQKLGFPAKFKDFKIQNIVGSCDVKFPIRLEGLAYAHGAFSSYEPE 125

Query: 158 KFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHDCKLASRKFAKILKQ 205
            F GLI ++  P+   L F SGK++I GAK   +   A      +L +
Sbjct: 126 LFPGLIYRMKQPKIVLLIFVSGKIVITGAKVREEIYTAFENIYPVLTE 173


>gnl|CDD|238352 cd00652, TBP_TLF, TATA box binding protein (TBP): Present in
           archaea and eukaryotes, TBPs are transcription factors
           that recognize promoters and initiate transcription. TBP
           has been shown to be an essential component of three
           different transcription initiation complexes: SL1, TFIID
           and TFIIIB, directing transcription by RNA polymerases
           I, II and III, respectively. TBP binds directly to the
           TATA box promoter element, where it nucleates polymerase
           assembly, thus defining the transcription start site.
           TBP's binding in the minor groove induces a dramatic DNA
           bending while its own structure barely changes. The
           conserved core domain of TBP, which binds to the TATA
           box, has a bipartite structure, with intramolecular
           symmetry generating a saddle-shaped structure that sits
           astride the DNA. New members of the TBP family, called
           TBP-like proteins (TBLP, TLF, TLP) or TBP-related
           factors (TRF1, TRF2,TRP), are similar to the core domain
           of TBPs, with identical or chemically similar amino
           acids at many equivalent positions, suggesting similar
           structure. However, TLFs contain distinct, conserved
           amino acids at several positions that distinguish them
           from TBP.
          Length = 174

 Score =  203 bits (520), Expect = 2e-66
 Identities = 80/146 (54%), Positives = 113/146 (77%), Gaps = 1/146 (0%)

Query: 128 RNMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAK 187
           +N+V T+++GC LDL +I    R +EYNP +F G+IM++  P+TT L F SGK++I GAK
Sbjct: 5   QNIVATVNLGCELDLRKIALAARNAEYNPKRFPGVIMRLREPKTTALIFSSGKMVITGAK 64

Query: 188 HEHDCKLASRKFAKILKQLGHPI-KYQGFKIHNIVCTCDVRFPVKLDALHHVHSQFSSYE 246
            E D KLA+RK+A+IL++LG P+ K+  FK+ NIV +CD+ FP++L+ L   H + +SYE
Sbjct: 65  SEEDAKLAARKYARILQKLGFPVEKFPEFKVQNIVASCDLGFPIRLEELALKHPENASYE 124

Query: 247 PELFPGLIYRMVKPRVVLLIFVNGRI 272
           PELFPGLIYRM +P+VVLLIFV+G+I
Sbjct: 125 PELFPGLIYRMDEPKVVLLIFVSGKI 150



 Score =  160 bits (407), Expect = 2e-49
 Identities = 72/169 (42%), Positives = 105/169 (62%), Gaps = 3/169 (1%)

Query: 40  NMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKH 99
           N+V T+++GC LDL +I    R +EYNP +F G+IM++  P+TT L F SGK++I GAK 
Sbjct: 6   NIVATVNLGCELDLRKIALAARNAEYNPKRFPGVIMRLREPKTTALIFSSGKMVITGAKS 65

Query: 100 EHDCKLASRKFAKILKQLGHPEIIFFFFR--NMVGTISVGCPLDLNQINSRVR-YSEYNP 156
           E D KLA+RK+A+IL++LG P   F  F+  N+V +  +G P+ L ++  +    + Y P
Sbjct: 66  EEDAKLAARKYARILQKLGFPVEKFPEFKVQNIVASCDLGFPIRLEELALKHPENASYEP 125

Query: 157 GKFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHDCKLASRKFAKILKQ 205
             F GLI ++  P+   L F SGK++I GAK   D   A  K   ILK+
Sbjct: 126 ELFPGLIYRMDEPKVVLLIFVSGKIVITGAKSREDIYEAVEKIYPILKE 174



 Score = 51.4 bits (124), Expect = 4e-08
 Identities = 23/61 (37%), Positives = 35/61 (57%), Gaps = 9/61 (14%)

Query: 216 KIHNIVCTCDVRFPVKLD----ALHHVHSQFSSYEPELFPGLIYRMVKPRVVLLIFVNGR 271
           KI NIV T  V    +LD    AL   +++   Y P+ FPG+I R+ +P+   LIF +G+
Sbjct: 3   KIQNIVAT--VNLGCELDLRKIALAARNAE---YNPKRFPGVIMRLREPKTTALIFSSGK 57

Query: 272 I 272
           +
Sbjct: 58  M 58


>gnl|CDD|225012 COG2101, SPT15, TATA-box binding protein (TBP), component of TFIID
           and TFIIIB [Transcription].
          Length = 185

 Score =  173 bits (441), Expect = 2e-54
 Identities = 57/146 (39%), Positives = 89/146 (60%), Gaps = 2/146 (1%)

Query: 129 NMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKH 188
           N+V ++ +G  LDL ++   +  +EYNP +F GL+ ++  P+T  L F+SGK++  GAK 
Sbjct: 12  NIVASVDLGQELDLEEVALDLPGAEYNPEQFPGLVYRLEEPKTAALIFRSGKVVCTGAKS 71

Query: 189 EHDCKLASRKFAKILKQLGHPIKYQ-GFKIHNIVCTCDVRFPVKLDALHHVHSQF-SSYE 246
             D   A +K AK LK  G  I ++   K+ NIV + D+   + L+A+          YE
Sbjct: 72  VEDVHRAVKKLAKKLKDGGIDIDFEPEIKVQNIVASADLGVELNLNAIAIGLGLENIEYE 131

Query: 247 PELFPGLIYRMVKPRVVLLIFVNGRI 272
           PE FPGL+YR+ +PRVVLL+F +G++
Sbjct: 132 PEQFPGLVYRLDEPRVVLLLFGSGKL 157



 Score =  151 bits (383), Expect = 1e-45
 Identities = 65/173 (37%), Positives = 97/173 (56%), Gaps = 6/173 (3%)

Query: 40  NMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKH 99
           N+V ++ +G  LDL ++   +  +EYNP +F GL+ ++  P+T  L F+SGK++  GAK 
Sbjct: 12  NIVASVDLGQELDLEEVALDLPGAEYNPEQFPGLVYRLEEPKTAALIFRSGKVVCTGAKS 71

Query: 100 EHDCKLASRKFAKILKQLGHPEIIF---FFFRNMVGTISVGCPLDLNQINSRV--RYSEY 154
             D   A +K AK LK  G  +I F      +N+V +  +G  L+LN I   +     EY
Sbjct: 72  VEDVHRAVKKLAKKLKDGG-IDIDFEPEIKVQNIVASADLGVELNLNAIAIGLGLENIEY 130

Query: 155 NPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHDCKLASRKFAKILKQLG 207
            P +F GL+ ++  PR   L F SGKL+I GAK E D + A  K    L++LG
Sbjct: 131 EPEQFPGLVYRLDEPRVVLLLFGSGKLVITGAKSEEDAEQAVEKIQSRLEELG 183


>gnl|CDD|239954 cd04518, TBP_archaea, archaeal TATA box binding protein (TBP): TBPs
           are transcription factors present in archaea and
           eukaryotes, that recognize promoters and initiate
           transcription. TBP has been shown to be an essential
           component of three different transcription initiation
           complexes: SL1, TFIID and TFIIIB, directing
           transcription by RNA polymerases I, II and III,
           respectively. TBP binds directly to the TATA box
           promoter element, where it nucleates polymerase
           assembly, thus defining the transcription start site.
           TBP's binding in the minor groove induces a dramatic DNA
           bending while its own structure barely changes. The
           conserved core domain of TBP, which binds to the TATA
           box, has a bipartite structure, with intramolecular
           symmetry generating a saddle-shaped structure that sits
           astride the DNA.
          Length = 174

 Score =  136 bits (345), Expect = 4e-40
 Identities = 60/172 (34%), Positives = 97/172 (56%), Gaps = 8/172 (4%)

Query: 40  NMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKH 99
           N+V ++ +G  LDL ++ + +  +EYNP +F GL+ ++ +P+   L F+SGK++  GAK 
Sbjct: 6   NIVASVDLGQELDLEKVAAELPNAEYNPDQFPGLVYRLEDPKIAALIFRSGKMVCTGAKS 65

Query: 100 EHDCKLASRKFAKILKQLG-----HPEIIFFFFRNMVGTISVGCPLDLNQINSRVRYSEY 154
             D   A ++  K LK  G      PEI     +N+V +  +G  ++L+ I   +  +EY
Sbjct: 66  VEDLHRAVKEIIKKLKDYGIKVIEKPEIK---VQNIVASADLGREVNLDAIAIGLPNAEY 122

Query: 155 NPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHDCKLASRKFAKILKQL 206
            P +F GL+ ++  P+   L F SGK++I GAK E D K A  K    LK+L
Sbjct: 123 EPEQFPGLVYRLDEPKVVLLLFSSGKMVITGAKSEEDAKRAVEKLLSRLKEL 174



 Score =  131 bits (332), Expect = 3e-38
 Identities = 54/145 (37%), Positives = 90/145 (62%), Gaps = 2/145 (1%)

Query: 129 NMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKH 188
           N+V ++ +G  LDL ++ + +  +EYNP +F GL+ ++ +P+   L F+SGK++  GAK 
Sbjct: 6   NIVASVDLGQELDLEKVAAELPNAEYNPDQFPGLVYRLEDPKIAALIFRSGKMVCTGAKS 65

Query: 189 EHDCKLASRKFAKILKQLGHPIKYQ-GFKIHNIVCTCDVRFPVKLDALHHVHSQFSSYEP 247
             D   A ++  K LK  G  +  +   K+ NIV + D+   V LDA+  +    + YEP
Sbjct: 66  VEDLHRAVKEIIKKLKDYGIKVIEKPEIKVQNIVASADLGREVNLDAI-AIGLPNAEYEP 124

Query: 248 ELFPGLIYRMVKPRVVLLIFVNGRI 272
           E FPGL+YR+ +P+VVLL+F +G++
Sbjct: 125 EQFPGLVYRLDEPKVVLLLFSSGKM 149


>gnl|CDD|234746 PRK00394, PRK00394, transcription factor; Reviewed.
          Length = 179

 Score =  135 bits (342), Expect = 2e-39
 Identities = 52/146 (35%), Positives = 88/146 (60%), Gaps = 2/146 (1%)

Query: 129 NMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKH 188
           N+V +  +G  LDL ++   +  +EYNP +F GL+ ++ +P+   L F+SGK++  GAK 
Sbjct: 5   NIVASTDLGQELDLEKVAEDLPNAEYNPEQFPGLVYRLEDPKIAALIFRSGKVVCTGAKS 64

Query: 189 EHDCKLASRKFAKILKQLGHPIKYQ-GFKIHNIVCTCDVRFPVKLDAL-HHVHSQFSSYE 246
             D   A +   K LK+LG  +  +   K+ NIV + D+   + L+A+   +  +   YE
Sbjct: 65  VEDLHEAVKIIIKKLKELGIKVIDEPEIKVQNIVASADLGVELNLNAIAIGLGLENIEYE 124

Query: 247 PELFPGLIYRMVKPRVVLLIFVNGRI 272
           PE FPGL+YR+  P+VV+L+F +G++
Sbjct: 125 PEQFPGLVYRLDDPKVVVLLFGSGKL 150



 Score =  130 bits (329), Expect = 1e-37
 Identities = 63/175 (36%), Positives = 98/175 (56%), Gaps = 10/175 (5%)

Query: 40  NMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKH 99
           N+V +  +G  LDL ++   +  +EYNP +F GL+ ++ +P+   L F+SGK++  GAK 
Sbjct: 5   NIVASTDLGQELDLEKVAEDLPNAEYNPEQFPGLVYRLEDPKIAALIFRSGKVVCTGAKS 64

Query: 100 EHDCKLASRKFAKILKQLG-----HPEIIFFFFRNMVGTISVGCPLDLNQI--NSRVRYS 152
             D   A +   K LK+LG      PEI     +N+V +  +G  L+LN I     +   
Sbjct: 65  VEDLHEAVKIIIKKLKELGIKVIDEPEIK---VQNIVASADLGVELNLNAIAIGLGLENI 121

Query: 153 EYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHDCKLASRKFAKILKQLG 207
           EY P +F GL+ ++ +P+   L F SGKL+I GAK E D + A  K  + L++LG
Sbjct: 122 EYEPEQFPGLVYRLDDPKVVVLLFGSGKLVITGAKSEEDAEKAVEKILEKLEELG 176


>gnl|CDD|239953 cd04517, TLF, TBP-like factors (TLF; also called TLP, TRF, TRP),
           which are found in most metazoans. TLFs and TBPs have
           well-conserved core domains; however, they only share
           about 60% similarity. TLFs, like TBPs, interact with
           TFIIA and TFIIB, which are part of the basal
           transcription machinery. Yet, in contrast to TBPs, TLFs
           seem not to interact with the TATA-box and even have a
           negative effect on the transcription of TATA-containing
           promoters. Recent results indicate that TLFs are
           involved in the transcription via TATA-less promoters.
          Length = 174

 Score =  122 bits (309), Expect = 1e-34
 Identities = 54/151 (35%), Positives = 90/151 (59%), Gaps = 2/151 (1%)

Query: 123 IFFFFRNMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLL 182
           +     N+V   S+ C +DL ++    R  EYNP ++  + M++  PR T   + SGK+ 
Sbjct: 1   LDILIVNVVCQFSLRCHIDLRKLALAGRNVEYNP-RYPKVTMRLREPRATASVWSSGKIT 59

Query: 183 ILGAKHEHDCKLASRKFAKILKQLGHP-IKYQGFKIHNIVCTCDVRFPVKLDALHHVHSQ 241
           I GA  E + K A+R+ A++L++LG   +++  F++ N++ TC + FP++LD L   +  
Sbjct: 60  ITGATSEEEAKQAARRAARLLQKLGFKVVRFSNFRVVNVLATCSMPFPIRLDELAAKNRS 119

Query: 242 FSSYEPELFPGLIYRMVKPRVVLLIFVNGRI 272
            +SYEPEL PG++YR+  PR  L IF  G +
Sbjct: 120 SASYEPELHPGVVYRITGPRATLSIFSTGSV 150



 Score = 89.3 bits (222), Expect = 6e-22
 Identities = 53/169 (31%), Positives = 93/169 (55%), Gaps = 4/169 (2%)

Query: 40  NMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKH 99
           N+V   S+ C +DL ++    R  EYNP ++  + M++  PR T   + SGK+ I GA  
Sbjct: 7   NVVCQFSLRCHIDLRKLALAGRNVEYNP-RYPKVTMRLREPRATASVWSSGKITITGATS 65

Query: 100 EHDCKLASRKFAKILKQLGHPEIIFFFFR--NMVGTISVGCPLDLNQINSRVR-YSEYNP 156
           E + K A+R+ A++L++LG   + F  FR  N++ T S+  P+ L+++ ++ R  + Y P
Sbjct: 66  EEEAKQAARRAARLLQKLGFKVVRFSNFRVVNVLATCSMPFPIRLDELAAKNRSSASYEP 125

Query: 157 GKFHGLIMKILNPRTTCLAFQSGKLLILGAKHEHDCKLASRKFAKILKQ 205
               G++ +I  PR T   F +G + + GA+   D + A  K   I+ +
Sbjct: 126 ELHPGVVYRITGPRATLSIFSTGSVTVTGARSMEDVREAVEKIYPIVFE 174


>gnl|CDD|201174 pfam00352, TBP, Transcription factor TFIID (or TATA-binding
           protein, TBP). 
          Length = 86

 Score =  104 bits (262), Expect = 8e-29
 Identities = 37/80 (46%), Positives = 54/80 (67%)

Query: 128 RNMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAK 187
           +N+V +  +G P+DL +I  ++R +EY P +F GL+ ++  P+ T L F SGK++I GAK
Sbjct: 7   QNIVASADLGFPIDLEKIALKLRNAEYEPEQFPGLVYRLREPKATALIFSSGKVVITGAK 66

Query: 188 HEHDCKLASRKFAKILKQLG 207
            E D K A RK A ILK+LG
Sbjct: 67  SEEDAKKAVRKIAPILKKLG 86



 Score =  104 bits (261), Expect = 1e-28
 Identities = 37/79 (46%), Positives = 53/79 (67%)

Query: 40  NMVGTISVGCPLDLNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGAKH 99
           N+V +  +G P+DL +I  ++R +EY P +F GL+ ++  P+ T L F SGK++I GAK 
Sbjct: 8   NIVASADLGFPIDLEKIALKLRNAEYEPEQFPGLVYRLREPKATALIFSSGKVVITGAKS 67

Query: 100 EHDCKLASRKFAKILKQLG 118
           E D K A RK A ILK+LG
Sbjct: 68  EEDAKKAVRKIAPILKKLG 86



 Score = 91.8 bits (229), Expect = 6e-24
 Identities = 24/58 (41%), Positives = 38/58 (65%), Gaps = 1/58 (1%)

Query: 215 FKIHNIVCTCDVRFPVKLDALHHVHSQFSSYEPELFPGLIYRMVKPRVVLLIFVNGRI 272
            KI NIV + D+ FP+ L+ +     + + YEPE FPGL+YR+ +P+   LIF +G++
Sbjct: 4   IKIQNIVASADLGFPIDLEKIALKL-RNAEYEPEQFPGLVYRLREPKATALIFSSGKV 60


>gnl|CDD|238383 cd00751, thiolase, Thiolase are ubiquitous enzymes that catalyze
           the reversible thiolytic cleavage of 3-ketoacyl-CoA into
           acyl-CoA and acetyl-CoA, a 2-step reaction involving a
           covalent intermediate formed with a catalytic cysteine.
           They are found in prokaryotes and eukaryotes (cytosol,
           microbodies and mitochondria). There are 2 functional
           different classes: thiolase-I (3-ketoacyl-CoA thiolase)
           and thiolase-II (acetoacetyl-CoA thiolase). Thiolase-I
           can cleave longer fatty acid molecules and plays an
           important role in the beta-oxidative degradation of
           fatty acids. Thiolase-II has a high substrate
           specificity. Although it can cleave acetoacyl-CoA, its
           main function is the synthesis of acetoacyl-CoA from two
           molecules of acetyl-CoA, which gives it importance in
           several biosynthetic pathways.
          Length = 386

 Score = 37.8 bits (89), Expect = 0.004
 Identities = 10/19 (52%), Positives = 14/19 (73%)

Query: 4   KRKNLKYGIGSACIGGGQA 22
           KR+  +YG+ + CIGGGQ 
Sbjct: 361 KRRGGRYGLATMCIGGGQG 379


>gnl|CDD|180261 PRK05790, PRK05790, putative acyltransferase; Provisional.
          Length = 393

 Score = 36.3 bits (85), Expect = 0.014
 Identities = 10/18 (55%), Positives = 13/18 (72%)

Query: 4   KRKNLKYGIGSACIGGGQ 21
           KR+  K G+ + CIGGGQ
Sbjct: 367 KRRGAKKGLATLCIGGGQ 384


>gnl|CDD|218343 pfam04950, DUF663, Protein of unknown function (DUF663).  This
           family contains several uncharacterized eukaryotic
           proteins.
          Length = 293

 Score = 34.6 bits (80), Expect = 0.036
 Identities = 44/177 (24%), Positives = 62/177 (35%), Gaps = 27/177 (15%)

Query: 53  LNQINSRVRYSEYNPGKFHGLIMKILNPRTTCLAFQSGKLLILGA--KHEHDCKLA---- 106
           LN+     R     PG +  L +K + P+     F S +L+I+     HEH   +     
Sbjct: 58  LNEYEEEKRIEGVKPGWYVRLEIKNV-PKELAENFNSKQLIIVFGLLPHEHKMSVVNFKI 116

Query: 107 --SRKFAKILKQLGHPEIIFF--FFRNMVGTISVGCPLDLNQINSRVRYSEYNPGKFH-- 160
              R + K LK      +I    F R     I        N  N + +Y  Y P      
Sbjct: 117 KKHRWYKKPLKS--KEPLIIQVGFRRFQSQPI-----YSQNTSNDKHKYERYLPPHKTCN 169

Query: 161 -GLIMKILNPRTTCLAFQ-----SGKLLILGAKHEHDCKLASRKFAKILKQLGHPIK 211
                 I  P +  LAF+     +    I       +C   +R  AK LK  GHP K
Sbjct: 170 ATFYGPITFPNSPLLAFKKSFSDTNGFRIAATGCILECD-PNRIIAKKLKLTGHPYK 225


>gnl|CDD|181265 PRK08170, PRK08170, acetyl-CoA acetyltransferase; Provisional.
          Length = 426

 Score = 33.8 bits (78), Expect = 0.090
 Identities = 11/18 (61%), Positives = 13/18 (72%)

Query: 4   KRKNLKYGIGSACIGGGQ 21
           KR+  K GI + CIGGGQ
Sbjct: 400 KRRGTKRGIAAICIGGGQ 417


>gnl|CDD|145779 pfam02803, Thiolase_C, Thiolase, C-terminal domain.  Thiolase is
           reported to be structurally related to beta-ketoacyl
           synthase (pfam00109), and also chalcone synthase.
          Length = 123

 Score = 32.2 bits (74), Expect = 0.095
 Identities = 11/19 (57%), Positives = 14/19 (73%)

Query: 4   KRKNLKYGIGSACIGGGQA 22
           KR+  KYG+ + CIGGGQ 
Sbjct: 98  KRRGGKYGLATLCIGGGQG 116


>gnl|CDD|233642 TIGR01930, AcCoA-C-Actrans, acetyl-CoA acetyltransferases.  This
           model represents a large family of enzymes which
           catalyze the thiolysis of a linear fatty acid CoA (or
           acetoacetyl-CoA) using a second CoA molecule to produce
           acetyl-CoA and a CoA-ester product two carbons shorter
           (or, alternatively, the condensation of two molecules of
           acetyl-CoA to produce acetoacetyl-CoA and CoA). This
           enzyme is also known as "thiolase", "3-ketoacyl-CoA
           thiolase", "beta-ketothiolase" and "Fatty oxidation
           complex beta subunit". When catalyzing the degradative
           reaction on fatty acids the corresponding EC number is
           2.3.1.16. The condensation reaction corresponds to
           2.3.1.9. Note that the enzymes which catalyze the
           condensation are generally not involved in fatty acid
           biosynthesis, which is carried out by a decarboxylating
           condensation of acetyl and malonyl esters of acyl
           carrier proteins. Rather, this activity may produce
           acetoacetyl-CoA for pathways such as IPP biosynthesis in
           the absence of sufficient fatty acid oxidation [Fatty
           acid and phospholipid metabolism, Other].
          Length = 386

 Score = 33.4 bits (77), Expect = 0.099
 Identities = 10/19 (52%), Positives = 14/19 (73%)

Query: 4   KRKNLKYGIGSACIGGGQA 22
           KR+  +YG+ + CIGGGQ 
Sbjct: 362 KRRGGRYGLATICIGGGQG 380


>gnl|CDD|181072 PRK07661, PRK07661, acetyl-CoA acetyltransferase; Provisional.
          Length = 391

 Score = 30.9 bits (70), Expect = 0.73
 Identities = 10/17 (58%), Positives = 14/17 (82%)

Query: 4   KRKNLKYGIGSACIGGG 20
           KR+N ++GI + CIGGG
Sbjct: 365 KRRNEQFGIVTMCIGGG 381


>gnl|CDD|181626 PRK09052, PRK09052, acetyl-CoA acetyltransferase; Provisional.
          Length = 399

 Score = 29.6 bits (67), Expect = 1.6
 Identities = 9/18 (50%), Positives = 13/18 (72%)

Query: 4   KRKNLKYGIGSACIGGGQ 21
           +R NLKYG+ + C+G G 
Sbjct: 373 RRTNLKYGMVTMCVGTGM 390


>gnl|CDD|235741 PRK06205, PRK06205, acetyl-CoA acetyltransferase; Provisional.
          Length = 404

 Score = 29.6 bits (67), Expect = 1.6
 Identities = 9/18 (50%), Positives = 14/18 (77%)

Query: 4   KRKNLKYGIGSACIGGGQ 21
           +R+  +YG+ + CIGGGQ
Sbjct: 377 QRRQARYGLETMCIGGGQ 394


>gnl|CDD|234886 PRK00996, PRK00996, ribonuclease HIII; Provisional.
          Length = 304

 Score = 29.1 bits (66), Expect = 2.5
 Identities = 13/53 (24%), Positives = 20/53 (37%), Gaps = 2/53 (3%)

Query: 51  LDLNQINS-RVRYSEYNPGK-FHGLIMKILNPRTTCLAFQSGKLLILGAKHEH 101
           L    I   + +Y E        G +     P  T  A++SGK++  G   E 
Sbjct: 8   LSDKLIEKLKEQYEEKLTPSLPPGAVFAAKKPGVTITAYKSGKVVFQGKGAEA 60



 Score = 29.1 bits (66), Expect = 2.5
 Identities = 13/53 (24%), Positives = 20/53 (37%), Gaps = 2/53 (3%)

Query: 140 LDLNQINS-RVRYSEYNPGK-FHGLIMKILNPRTTCLAFQSGKLLILGAKHEH 190
           L    I   + +Y E        G +     P  T  A++SGK++  G   E 
Sbjct: 8   LSDKLIEKLKEQYEEKLTPSLPPGAVFAAKKPGVTITAYKSGKVVFQGKGAEA 60


>gnl|CDD|215347 PLN02644, PLN02644, acetyl-CoA C-acetyltransferase.
          Length = 394

 Score = 28.9 bits (65), Expect = 3.1
 Identities = 10/19 (52%), Positives = 12/19 (63%)

Query: 4   KRKNLKYGIGSACIGGGQA 22
           + KN KYG+   C GGG A
Sbjct: 367 RSKNGKYGVAGICNGGGGA 385


>gnl|CDD|222691 pfam14337, DUF4393, Domain of unknown function (DUF4393).  This
           family of proteins is found in bacteria, archaea and
           viruses. Proteins in this family are typically between
           254 and 285 amino acids in length.
          Length = 185

 Score = 28.5 bits (64), Expect = 3.3
 Identities = 15/42 (35%), Positives = 17/42 (40%), Gaps = 6/42 (14%)

Query: 191 DCKLASRKFAKILKQLGHP----IKYQGFKIHNIVCTCDVRF 228
                   F  ILKQL       +KY     +NIV T DVR 
Sbjct: 51  KNDDVHPSFVDILKQLSPDEARILKY--LYKNNIVPTVDVRL 90


>gnl|CDD|152474 pfam12039, DUF3525, Protein of unknown function (DUF3525).  This
           family of proteins is functionally uncharacterized. This
           protein is found in viruses. Proteins in this family are
           about 360 amino acids in length.
          Length = 404

 Score = 28.9 bits (64), Expect = 3.3
 Identities = 15/45 (33%), Positives = 22/45 (48%), Gaps = 10/45 (22%)

Query: 232 LDALHHVHSQFSSYEPELFPGL----------IYRMVKPRVVLLI 266
           +D  H V SQFS Y P +   L          +YR+ +P + LL+
Sbjct: 306 IDETHRVLSQFSHYAPRILNELTTWLKGLAENLYRVAEPILELLL 350


>gnl|CDD|180563 PRK06445, PRK06445, acetyl-CoA acetyltransferase; Provisional.
          Length = 394

 Score = 28.5 bits (64), Expect = 3.4
 Identities = 8/16 (50%), Positives = 11/16 (68%)

Query: 6   KNLKYGIGSACIGGGQ 21
           K   YG+ + C+GGGQ
Sbjct: 370 KGKDYGVATLCVGGGQ 385


>gnl|CDD|143599 cd07575, Xc-1258_like, Xanthomonas campestris XC1258 and related
           proteins, members of the nitrilase superfamily (putative
           class 13 nitrilases).  Uncharacterized subgroup
           belonging to a larger nitrilase superfamily comprised of
           nitrile- or amide-hydrolyzing enzymes and
           amide-condensing enzymes, which depend on a Glu-Lys-Cys
           catalytic triad. This superfamily has been classified in
           the literature based on global and structure based
           sequence analysis into thirteen different enzyme classes
           (referred to as 1-13), class 13 represents proteins that
           at the time were difficult to place in a distinct
           similarity group; this subgroup either represents a new
           class or one that was included previously in class 13.
           Members of this superfamily generally form homomeric
           complexes, the basic building block of which is a
           homodimer. XC1258 is a homotetramer.
          Length = 252

 Score = 28.3 bits (64), Expect = 4.0
 Identities = 11/21 (52%), Positives = 16/21 (76%), Gaps = 1/21 (4%)

Query: 210 IKYQGFKIHNIVCTCDVRFPV 230
           ++Y+G+KI   VC  D+RFPV
Sbjct: 128 VEYKGWKILLQVCY-DLRFPV 147


>gnl|CDD|179184 PRK00961, PRK00961, H(2)-dependent methylenetetrahydromethanopterin
           dehydrogenase; Provisional.
          Length = 342

 Score = 27.8 bits (62), Expect = 6.0
 Identities = 12/23 (52%), Positives = 14/23 (60%)

Query: 96  GAKHEHDCKLASRKFAKILKQLG 118
           GA   H C + + KFAKI K LG
Sbjct: 169 GAIVTHACTIPTTKFAKIFKDLG 191



 Score = 27.8 bits (62), Expect = 6.0
 Identities = 12/23 (52%), Positives = 14/23 (60%)

Query: 185 GAKHEHDCKLASRKFAKILKQLG 207
           GA   H C + + KFAKI K LG
Sbjct: 169 GAIVTHACTIPTTKFAKIFKDLG 191


>gnl|CDD|234114 TIGR03121, one_C_dehyd_A, formylmethanofuran dehydrogenase subunit
           A.  Members of this largely archaeal protein family are
           subunit A of the formylmethanofuran dehydrogenase.
           Nomenclature in some bacteria may reflect inclusion of
           the formyltransferase described by TIGR03119 as part of
           the complex, and therefore call this protein
           formyltransferase/hydrolase complex Fhc subunit A. Note
           that this model does not distinguish tungsten (FwdA)
           from molybdenum-containing (FmdA) forms of this enzyme;
           a single gene from this family is expressed
           constitutively in Methanobacterium thermoautotrophicum,
           which has both tungsten and molybdenum forms and may
           work interchangeably.
          Length = 556

 Score = 27.7 bits (62), Expect = 8.1
 Identities = 13/41 (31%), Positives = 18/41 (43%), Gaps = 2/41 (4%)

Query: 205 QLGHPIKYQGFKIHNIVCTCDVRFPVKLDALHHVHSQFSSY 245
            LG P  Y+   +  +     V+ P +   LH  H QF SY
Sbjct: 237 NLGVPGNYE-TTLDTLDAAEGVK-PNRNQVLHLTHVQFHSY 275


>gnl|CDD|218769 pfam05833, FbpA, Fibronectin-binding protein A N-terminus (FbpA).
           This family consists of the N-terminal region of the
           prokaryotic fibronectin-binding protein. Fibronectin
           binding is considered to be an important virulence
           factor in streptococcal infections. Fibronectin is a
           dimeric glycoprotein that is present in a soluble form
           in plasma and extracellular fluids; it is also present
           in a fibrillar form on cell surfaces. Both the soluble
           and cellular forms of fibronectin may be incorporated
           into the extracellular tissue matrix. While fibronectin
           has critical roles in eukaryotic cellular processes,
           such as adhesion, migration and differentiation, it is
           also a substrate for the attachment of bacteria. The
           binding of pathogenic Streptococcus pyogenes and
           Staphylococcus aureus to epithelial cells via
           fibronectin facilitates their internalisation and
           systemic spread within the host.
          Length = 447

 Score = 27.2 bits (61), Expect = 9.5
 Identities = 8/24 (33%), Positives = 11/24 (45%), Gaps = 1/24 (4%)

Query: 232 LDALHHVHSQFSSYEPELFPGLIY 255
           +DA+ HV    S     + PG  Y
Sbjct: 134 IDAIKHVSPSQSRVRT-ILPGEPY 156


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.327    0.144    0.447 

Gapped
Lambda     K      H
   0.267   0.0711    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 13,711,924
Number of extensions: 1284584
Number of successful extensions: 1416
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1377
Number of HSP's successfully gapped: 44
Length of query: 272
Length of database: 10,937,602
Length adjustment: 95
Effective length of query: 177
Effective length of database: 6,723,972
Effective search space: 1190143044
Effective search space used: 1190143044
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.7 bits)
S2: 58 (26.2 bits)