Query 016024
Match_columns 396
No_of_seqs 160 out of 445
Neff 2.6
Searched_HMMs 46136
Date Fri Mar 29 03:05:49 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/016024.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/016024hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PLN03162 golden-2 like transcr 99.9 9.3E-24 2E-28 209.9 6.7 63 242-305 230-292 (526)
2 TIGR01557 myb_SHAQKYF myb-like 99.8 1.4E-21 3E-26 148.4 6.4 56 247-302 1-56 (57)
3 PF00249 Myb_DNA-binding: Myb- 97.2 0.00066 1.4E-08 48.6 5.1 48 249-300 1-48 (48)
4 PF14379 Myb_CC_LHEQLE: MYB-CC 96.7 3E-05 6.5E-10 59.4 -5.9 35 324-358 2-45 (51)
5 smart00426 TEA TEA domain. 86.1 1 2.2E-05 36.8 3.7 18 251-268 5-22 (68)
6 PF12776 Myb_DNA-bind_3: Myb/S 61.2 11 0.00024 29.5 3.5 51 251-301 1-63 (96)
7 smart00717 SANT SANT SWI3, AD 60.3 40 0.00087 22.2 5.6 45 250-299 2-46 (49)
8 PF01285 TEA: TEA/ATTS domain 58.6 9.8 0.00021 39.6 3.5 54 245-299 45-112 (431)
9 cd00167 SANT 'SWI3, ADA2, N-Co 54.5 53 0.0011 21.4 5.4 44 251-299 1-44 (45)
10 smart00501 BRIGHT BRIGHT, ARID 47.9 19 0.00041 29.0 2.8 44 255-299 33-83 (93)
11 PF11888 DUF3408: Protein of u 45.0 25 0.00055 30.7 3.4 60 229-301 65-124 (136)
12 TIGR02894 DNA_bind_RsfA transc 38.5 19 0.00041 33.6 1.7 54 242-301 41-94 (161)
13 PF07384 DUF1497: Protein of u 34.7 33 0.00072 27.3 2.2 22 250-271 36-57 (59)
14 KOG3841 TEF-1 and related tran 31.1 45 0.00098 35.3 3.2 55 247-303 74-145 (455)
No 1
>PLN03162 golden-2 like transcription factor; Provisional
Probab=99.89 E-value=9.3e-24 Score=209.89 Aligned_cols=63 Identities=46% Similarity=0.729 Sum_probs=59.3
Q ss_pred CCCCCCCCcccCHHHHHHHHHHHHHhCCCCCCChHHHHhhcCCCCCCHHHHHHHhhhhhhcccC
Q 016024 242 KRSMRAPRMRWTTTLHARFVHAVELLGGHERATPKSVLELMDVKDLTLAHVKSHLQMYRTVKTT 305 (396)
Q Consensus 242 krs~kKpRLrWT~ELH~rFV~AVeqLGG~dkAtPK~ILelMnV~GLTr~hVKSHLQKYRl~k~t 305 (396)
....||+||+||+|||++||+||++|| .++||||+||++|+|+|||++||||||||||+.++.
T Consensus 230 ~~g~KKpRLrWTpELH~rFVeAV~qLG-~dKATPK~ILelMnV~GLTRenVKSHLQKYRl~rk~ 292 (526)
T PLN03162 230 APGKKKAKVDWTPELHRRFVHAVEQLG-VEKAFPSRILELMGVQCLTRHNIASHLQKYRSHRRH 292 (526)
T ss_pred CCCCCCCcccCCHHHHHHHHHHHHHhC-cCccchHHHHHHcCCCCcCHHHHHHHHHHHHHhccc
Confidence 345789999999999999999999999 799999999999999999999999999999998773
No 2
>TIGR01557 myb_SHAQKYF myb-like DNA-binding domain, SHAQKYF class. This model describes a DNA-binding domain restricted to (but common in) plant proteins, many of which also contain a response regulator domain. The domain appears related to the Myb-like DNA-binding domain described by Pfam model pfam00249. It is distinguished in part by a well-conserved motif SH[AL]QKY[RF] at the C-terminal end of the motif.
Probab=99.85 E-value=1.4e-21 Score=148.40 Aligned_cols=56 Identities=55% Similarity=0.844 Sum_probs=54.0
Q ss_pred CCCcccCHHHHHHHHHHHHHhCCCCCCChHHHHhhcCCCCCCHHHHHHHhhhhhhc
Q 016024 247 APRMRWTTTLHARFVHAVELLGGHERATPKSVLELMDVKDLTLAHVKSHLQMYRTV 302 (396)
Q Consensus 247 KpRLrWT~ELH~rFV~AVeqLGG~dkAtPK~ILelMnV~GLTr~hVKSHLQKYRl~ 302 (396)
|+|++||+|+|++||+||+.||+.+.|+||.|+++|++++||+.+|+|||||||+.
T Consensus 1 k~r~~WT~eeh~~Fl~ai~~~G~g~~a~pk~I~~~~~~~~lT~~qV~SH~QKy~~k 56 (57)
T TIGR01557 1 KPRVVWTEDLHDRFLQAVQKLGGPDWATPKRILELMVVDGLTRDQVASHLQKYRLK 56 (57)
T ss_pred CCCCCCCHHHHHHHHHHHHHhCCCcccchHHHHHHcCCCCCCHHHHHHHHHHHHcc
Confidence 68999999999999999999997789999999999999999999999999999985
No 3
>PF00249 Myb_DNA-binding: Myb-like DNA-binding domain; InterPro: IPR014778 The retroviral oncogene v-myb, and its cellular counterpart c-myb, encode nuclear DNA-binding proteins. These belong to the SANT domain family that specifically recognise the sequence YAAC(G/T)G [, ]. In myb, one of the most conserved regions consisting of three tandem repeats has been shown to be involved in DNA-binding [].; PDB: 1X41_A 2XAF_B 2XAG_B 2XAH_B 2UXN_B 2Y48_B 2XAQ_B 2X0L_B 2IW5_B 2XAJ_B ....
Probab=97.24 E-value=0.00066 Score=48.61 Aligned_cols=48 Identities=23% Similarity=0.261 Sum_probs=41.5
Q ss_pred CcccCHHHHHHHHHHHHHhCCCCCCChHHHHhhcCCCCCCHHHHHHHhhhhh
Q 016024 249 RMRWTTTLHARFVHAVELLGGHERATPKSVLELMDVKDLTLAHVKSHLQMYR 300 (396)
Q Consensus 249 RLrWT~ELH~rFV~AVeqLGG~dkAtPK~ILelMnV~GLTr~hVKSHLQKYR 300 (396)
|..||+|=+.+|++||.++|. -.-+.|.+.|+ .+-|..+|++|.++|+
T Consensus 1 r~~Wt~eE~~~l~~~v~~~g~---~~W~~Ia~~~~-~~Rt~~qc~~~~~~~~ 48 (48)
T PF00249_consen 1 RGPWTEEEDEKLLEAVKKYGK---DNWKKIAKRMP-GGRTAKQCRSRYQNLL 48 (48)
T ss_dssp S-SS-HHHHHHHHHHHHHSTT---THHHHHHHHHS-SSSTHHHHHHHHHHHT
T ss_pred CCCCCHHHHHHHHHHHHHhCC---cHHHHHHHHcC-CCCCHHHHHHHHHhhC
Confidence 568999999999999999992 26889999998 8999999999999985
No 4
>PF14379 Myb_CC_LHEQLE: MYB-CC type transfactor, LHEQLE motif
Probab=96.74 E-value=3e-05 Score=59.37 Aligned_cols=35 Identities=20% Similarity=0.298 Sum_probs=29.7
Q ss_pred CCCccccccccCCCCCC----CC-----cchhhcccccccccCC
Q 016024 324 GDTSEDIMFGIQHPRRP----ET-----SSIQQQGRASNGLWSN 358 (396)
Q Consensus 324 gd~SEALrlQiEvQrRL----El-----~sIE~QGRylq~lWsN 358 (396)
..++|||++||||||+| |+ ++||+||||+++|.+.
T Consensus 2 ~~i~EALr~QmEvQrrLhEQLEvQr~Lqlrieaqgkyl~~ilek 45 (51)
T PF14379_consen 2 MQITEALRMQMEVQRRLHEQLEVQRHLQLRIEAQGKYLQSILEK 45 (51)
T ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHH
Confidence 35689999999999998 21 6899999999998875
No 5
>smart00426 TEA TEA domain.
Probab=86.10 E-value=1 Score=36.76 Aligned_cols=18 Identities=17% Similarity=0.510 Sum_probs=16.4
Q ss_pred ccCHHHHHHHHHHHHHhC
Q 016024 251 RWTTTLHARFVHAVELLG 268 (396)
Q Consensus 251 rWT~ELH~rFV~AVeqLG 268 (396)
+|.++|-..|++|++..-
T Consensus 5 vWp~~lE~Af~~aL~~~~ 22 (68)
T smart00426 5 VWSPDIEQAFQEALAIYP 22 (68)
T ss_pred cCcHHHHHHHHHHHHHcC
Confidence 799999999999998774
No 6
>PF12776 Myb_DNA-bind_3: Myb/SANT-like DNA-binding domain; InterPro: IPR024752 This domain, found in a range of uncharacterised proteins, may be related to Myb/SANT-like DNA binding domains.
Probab=61.17 E-value=11 Score=29.48 Aligned_cols=51 Identities=14% Similarity=0.236 Sum_probs=34.4
Q ss_pred ccCHHHHHHHHHHHHHh---CCC-CCCChH-----HHHhhcCC---CCCCHHHHHHHhhhhhh
Q 016024 251 RWTTTLHARFVHAVELL---GGH-ERATPK-----SVLELMDV---KDLTLAHVKSHLQMYRT 301 (396)
Q Consensus 251 rWT~ELH~rFV~AVeqL---GG~-dkAtPK-----~ILelMnV---~GLTr~hVKSHLQKYRl 301 (396)
+||++..+.||+++-+. |.. .....| .|.+.|+- -.+|..+|++|+...|.
T Consensus 1 ~Wt~~~~~~ll~~~~e~~~~g~~~~~~~fk~~~w~~i~~~~~~~~~~~~t~~qlknk~~~lk~ 63 (96)
T PF12776_consen 1 SWTPEMTRFLLDLLIEQINKGNRPTNGGFKKEGWNNIAEEFNEKTGLNYTKKQLKNKWKTLKK 63 (96)
T ss_pred CCChHHHHHHHHHHHHHHHhCCCCCCCCcCHHHHHHHHHHHHHHhCCcccHHHHHHHHHHHHH
Confidence 59999999999998433 433 233333 35555554 45889999999885543
No 7
>smart00717 SANT SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains.
Probab=60.34 E-value=40 Score=22.17 Aligned_cols=45 Identities=11% Similarity=0.175 Sum_probs=33.8
Q ss_pred cccCHHHHHHHHHHHHHhCCCCCCChHHHHhhcCCCCCCHHHHHHHhhhh
Q 016024 250 MRWTTTLHARFVHAVELLGGHERATPKSVLELMDVKDLTLAHVKSHLQMY 299 (396)
Q Consensus 250 LrWT~ELH~rFV~AVeqLGG~dkAtPK~ILelMnV~GLTr~hVKSHLQKY 299 (396)
-.||++=...|+.+|.++| . ..=+.|.+.|+ +=|...|+.+..++
T Consensus 2 ~~Wt~~E~~~l~~~~~~~g-~--~~w~~Ia~~~~--~rt~~~~~~~~~~~ 46 (49)
T smart00717 2 GEWTEEEDELLIELVKKYG-K--NNWEKIAKELP--GRTAEQCRERWNNL 46 (49)
T ss_pred CCCCHHHHHHHHHHHHHHC-c--CCHHHHHHHcC--CCCHHHHHHHHHHH
Confidence 4699999999999999999 1 23466777765 77888887765443
No 8
>PF01285 TEA: TEA/ATTS domain family; InterPro: IPR000818 Transcriptional enhancer activators are nuclear proteins that contain a TEA/ATTSdomain, a DNA-binding region of 66-68 amino acids. The TEA/ATTS domain is found in the N-termini of certain gene regulatory proteins, such as the Simian virus 40 (SV40) enhancer factor TEF-1, yeast trans-acting factor TEC-1 (which is required for TY1 enhancer activity), and the Aspergillus abaA regulatory gene product. SV40 and retroviral enhancers, and those to which TEF-1, TEC-1 and abaA proteins bind, contain GT-IIC sites: the TEA/ATTS domain may therefore recognise and bind such sites. Secondary structure predictions suggest the presence of 3 helices, but have not confirmed the presence of the helix-turn-helix motif characteristic of many DNA-binding proteins: DNA-binding may therefore be effected by a different mechanism [].; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus; PDB: 3L15_B 2HZD_A 3KYS_A 3JUA_C 4EAZ_B.
Probab=58.57 E-value=9.8 Score=39.63 Aligned_cols=54 Identities=19% Similarity=0.268 Sum_probs=29.2
Q ss_pred CCCCCcccCHHHHHHHHHHHHHhC--CCCCCC------------hHHHHhhcCCCCCCHHHHHHHhhhh
Q 016024 245 MRAPRMRWTTTLHARFVHAVELLG--GHERAT------------PKSVLELMDVKDLTLAHVKSHLQMY 299 (396)
Q Consensus 245 ~kKpRLrWT~ELH~rFV~AVeqLG--G~dkAt------------PK~ILelMnV~GLTr~hVKSHLQKY 299 (396)
..+..-+|.++|...|++|+...- |-.|-. -..|...-| +.=|+.+|+||||..
T Consensus 45 ~~~~~~vw~~~~e~af~~al~~~~~~g~~k~~~~~~~~grn~li~~yi~~~tg-~~rt~kqvsshiqvl 112 (431)
T PF01285_consen 45 DGDGEGVWPPDIEQAFQEALAIYPPCGRRKLSDEGKMYGRNELISDYIKLKTG-KTRTRKQVSSHIQVL 112 (431)
T ss_dssp -GGGS--S-HHHHHHHHHHHHHS-SSS---HHHH-----THHHHHHHHHHHHS-----SHHHHHHHHHH
T ss_pred CCCCCCCCCHHHHHHHHHHHHhCCCCCCcccccccccccchhHHHHHHHHHhC-cccchhHHHHHHHHH
Confidence 356788999999999999998763 222200 011111112 346889999999998
No 9
>cd00167 SANT 'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.
Probab=54.52 E-value=53 Score=21.37 Aligned_cols=44 Identities=14% Similarity=0.195 Sum_probs=33.6
Q ss_pred ccCHHHHHHHHHHHHHhCCCCCCChHHHHhhcCCCCCCHHHHHHHhhhh
Q 016024 251 RWTTTLHARFVHAVELLGGHERATPKSVLELMDVKDLTLAHVKSHLQMY 299 (396)
Q Consensus 251 rWT~ELH~rFV~AVeqLGG~dkAtPK~ILelMnV~GLTr~hVKSHLQKY 299 (396)
.||.|=...|+.++.++|- ..=+.|.+.|+ +=|..+|+.+.+++
T Consensus 1 ~Wt~eE~~~l~~~~~~~g~---~~w~~Ia~~~~--~rs~~~~~~~~~~~ 44 (45)
T cd00167 1 PWTEEEDELLLEAVKKYGK---NNWEKIAKELP--GRTPKQCRERWRNL 44 (45)
T ss_pred CCCHHHHHHHHHHHHHHCc---CCHHHHHhHcC--CCCHHHHHHHHHHh
Confidence 4999999999999999992 33567777774 46888888776543
No 10
>smart00501 BRIGHT BRIGHT, ARID (A/T-rich interaction domain) domain. DNA-binding domain containing a helix-turn-helix structure
Probab=47.86 E-value=19 Score=28.98 Aligned_cols=44 Identities=20% Similarity=0.336 Sum_probs=31.7
Q ss_pred HHHHHHHHHHHHhCCCCCCC----hHHHHhhcCCCCC---CHHHHHHHhhhh
Q 016024 255 TLHARFVHAVELLGGHERAT----PKSVLELMDVKDL---TLAHVKSHLQMY 299 (396)
Q Consensus 255 ELH~rFV~AVeqLGG~dkAt----PK~ILelMnV~GL---Tr~hVKSHLQKY 299 (396)
+|++.|. +|..+||.++.+ =+.|.+.||++.- ....++++-+||
T Consensus 33 dL~~Ly~-~V~~~GG~~~v~~~~~W~~Va~~lg~~~~~~~~~~~lk~~Y~k~ 83 (93)
T smart00501 33 DLYRLYR-LVQERGGYDQVTKDKKWKEIARELGIPDTSTSAASSLRKHYERY 83 (93)
T ss_pred cHHHHHH-HHHHccCHHHHcCCCCHHHHHHHhCCCcccchHHHHHHHHHHHH
Confidence 7999998 588999977543 3568889999752 244566666666
No 11
>PF11888 DUF3408: Protein of unknown function (DUF3408); InterPro: IPR021823 This family of proteins are functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 128 to 160 amino acids in length.
Probab=44.95 E-value=25 Score=30.69 Aligned_cols=60 Identities=23% Similarity=0.351 Sum_probs=42.1
Q ss_pred cchhhhccccCCCCCCCCCCCcccCHHHHHHHHHHHHHhCCCCCCChHHHHhhcCCCCCCHHHHHHHhhhhhh
Q 016024 229 GLMRSRFMSRFPAKRSMRAPRMRWTTTLHARFVHAVELLGGHERATPKSVLELMDVKDLTLAHVKSHLQMYRT 301 (396)
Q Consensus 229 g~~r~rf~~~~p~krs~kKpRLrWT~ELH~rFV~AVeqLGG~dkAtPK~ILelMnV~GLTr~hVKSHLQKYRl 301 (396)
.-.+++|+... +-..|++ +-=..|+|++....|..+||. -|.|.|+--.=++-||..|+.
T Consensus 65 ~~Y~~~FL~~~--~~~~R~~-vyI~~e~h~~l~~Iv~~ig~~----------~~si~~yidNIL~~Hle~~~e 124 (136)
T PF11888_consen 65 EDYRETFLKRP--KIKARKG-VYISRETHERLSRIVRVIGER----------KMSISGYIDNILRHHLEEYRE 124 (136)
T ss_pred HHHHHHhCCCC--CCCCCee-eEECHHHHHHHHHHHHHHCCC----------CCcHHHHHHHHHHHHHHHHHH
Confidence 45567787432 2334555 888999999999999999963 345555544556789999975
No 12
>TIGR02894 DNA_bind_RsfA transcription factor, RsfA family. In a subset of endospore-forming members of the Firmcutes, members of this protein family are found, several to a genome. Two very strongly conserved sequences regions are separated by a highly variable linker region. Much of the linker region was excised from the seed alignment for this model. A characterized member is the prespore-specific transcription RsfA from Bacillus subtilis, previously called YwfN, which is controlled by sigma factor F and seems to fine-tune expression of some genes in the sigma-F regulon. A paralog in Bacillus subtilis is designated YlbO.
Probab=38.48 E-value=19 Score=33.63 Aligned_cols=54 Identities=31% Similarity=0.434 Sum_probs=40.7
Q ss_pred CCCCCCCCcccCHHHHHHHHHHHHHhCCCCCCChHHHHhhcCCCCCCHHHHHHHhhhhhh
Q 016024 242 KRSMRAPRMRWTTTLHARFVHAVELLGGHERATPKSVLELMDVKDLTLAHVKSHLQMYRT 301 (396)
Q Consensus 242 krs~kKpRLrWT~ELH~rFV~AVeqLGG~dkAtPK~ILelMnV~GLTr~hVKSHLQKYRl 301 (396)
.|+.-++.+||...+-.++.+||+..- -++-.+++. ...||++.|-+-||.|..
T Consensus 41 ~RTsAACGFRWNs~VRkqY~~~i~~AK-kqRk~~~~~-----~~~ltl~~vI~fLq~l~~ 94 (161)
T TIGR02894 41 NRTAAACGFRWNAYVRKQYEEAIELAK-KQRKELKRE-----AGSLTLQDVISFLQNLKT 94 (161)
T ss_pred cccHHHhcchHHHHHHHHHHHHHHHHH-HHHhccccC-----cccCCHHHHHHHHHHHHh
Confidence 356778999999999999999998764 122212111 367999999999999974
No 13
>PF07384 DUF1497: Protein of unknown function (DUF1497); InterPro: IPR009972 This entry is represented by Bacteriophage r1t, Orf17. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several phage and bacterial proteins of around 59 residues in length. Members of this family seem to be found exclusively in Lactococcus lactis and the bacteriophages that infect this organism. The function of this family is unknown.
Probab=34.66 E-value=33 Score=27.29 Aligned_cols=22 Identities=23% Similarity=0.561 Sum_probs=19.2
Q ss_pred cccCHHHHHHHHHHHHHhCCCC
Q 016024 250 MRWTTTLHARFVHAVELLGGHE 271 (396)
Q Consensus 250 LrWT~ELH~rFV~AVeqLGG~d 271 (396)
-.+..|+|..|-+-|+.|||.+
T Consensus 36 ~kfnqem~aefheri~klggk~ 57 (59)
T PF07384_consen 36 NKFNQEMQAEFHERIKKLGGKN 57 (59)
T ss_pred hHhhHHHHHHHHHHHHHhcccc
Confidence 3578999999999999999854
No 14
>KOG3841 consensus TEF-1 and related transcription factor, TEAD family [Transcription]
Probab=31.14 E-value=45 Score=35.35 Aligned_cols=55 Identities=22% Similarity=0.343 Sum_probs=35.7
Q ss_pred CCCcccCHHHHHHHHHHHHHhCCCCCCChHHHHh--------------hcCC---CCCCHHHHHHHhhhhhhcc
Q 016024 247 APRMRWTTTLHARFVHAVELLGGHERATPKSVLE--------------LMDV---KDLTLAHVKSHLQMYRTVK 303 (396)
Q Consensus 247 KpRLrWT~ELH~rFV~AVeqLGG~dkAtPK~ILe--------------lMnV---~GLTr~hVKSHLQKYRl~k 303 (396)
-.-=+|+++.-+.|.+|+...- .--+=|-||. ..+. +-=|+.+|.||+|.....|
T Consensus 74 daegvWSpdIEqsFqEALaiyp--pcGrrKIilsdegkmyGRNELIarYIKlrtgktRTrKQVSSHIQVlarrk 145 (455)
T KOG3841|consen 74 DAEGVWSPDIEQSFQEALAIYP--PCGRRKIILSDEGKMYGRNELIARYIKLRTGKTRTRKQVSSHIQVLARRK 145 (455)
T ss_pred ccccccChhHHHHHHHHHhhcC--CCCceeEEEccCccccchHHHHHHHHHHhcCCchhHHHHHHHHHHHHHHH
Confidence 4455899999999999998663 0111122222 1222 4578999999999875443
Done!