Query 032204
Match_columns 145
No_of_seqs 99 out of 122
Neff 2.8
Searched_HMMs 46136
Date Fri Mar 29 11:00:24 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/032204.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/032204hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF04690 YABBY: YABBY protein; 100.0 4.6E-36 9.9E-41 238.3 9.1 66 46-118 104-170 (170)
2 PF09011 HMG_box_2: HMG-box do 98.6 1E-07 2.3E-12 63.9 4.7 46 67-112 1-47 (73)
3 cd01390 HMGB-UBF_HMG-box HMGB- 98.3 1.8E-06 3.9E-11 55.1 5.5 44 70-113 1-44 (66)
4 cd00084 HMG-box High Mobility 98.2 4.1E-06 8.8E-11 52.7 5.5 44 70-113 1-44 (66)
5 PF00505 HMG_box: HMG (high mo 98.2 3E-06 6.6E-11 54.6 4.8 43 70-112 1-43 (69)
6 cd01388 SOX-TCF_HMG-box SOX-TC 98.2 4.2E-06 9.2E-11 56.0 5.4 43 71-113 3-45 (72)
7 smart00398 HMG high mobility g 98.2 4.6E-06 1E-10 53.0 5.3 45 69-113 1-45 (70)
8 cd01389 MATA_HMG-box MATA_HMG- 98.2 4.3E-06 9.4E-11 56.5 5.4 45 69-113 1-45 (77)
9 PTZ00199 high mobility group p 98.1 9E-06 2E-10 57.9 5.8 49 65-113 18-68 (94)
10 KOG0381 HMG box-containing pro 97.4 0.00059 1.3E-08 47.0 5.6 46 68-113 21-66 (96)
11 PF06244 DUF1014: Protein of u 97.0 0.0008 1.7E-08 51.8 3.9 50 66-117 71-120 (122)
12 KOG3223 Uncharacterized conser 93.5 0.094 2E-06 44.3 3.8 51 66-118 163-213 (221)
13 COG5648 NHP6B Chromatin-associ 82.0 3.3 7.1E-05 35.0 5.2 47 66-112 67-113 (211)
14 PF08073 CHDNT: CHDNT (NUC034) 74.9 5.9 0.00013 27.2 3.9 37 76-112 15-51 (55)
15 KOG0526 Nucleosome-binding fac 45.6 32 0.00069 33.2 4.3 43 67-111 533-575 (615)
16 PF05047 L51_S25_CI-B8: Mitoch 42.5 23 0.00049 22.2 2.0 18 78-95 2-19 (52)
17 PF06382 DUF1074: Protein of u 41.8 39 0.00083 28.2 3.8 35 75-113 84-118 (183)
18 PF05164 ZapA: Cell division p 31.0 76 0.0016 21.1 3.3 32 79-110 28-59 (89)
19 KOG0527 HMG-box transcription 29.0 1.1E+02 0.0023 27.2 4.7 64 63-134 56-119 (331)
20 PF12876 Cellulase-like: Sugar 26.8 1.1E+02 0.0023 21.0 3.5 29 68-96 31-59 (88)
21 PF04769 MAT_Alpha1: Mating-ty 26.5 1.8E+02 0.0038 24.1 5.3 46 66-115 40-85 (201)
22 KOG4715 SWI/SNF-related matrix 26.5 93 0.002 28.7 3.9 42 72-113 67-108 (410)
23 PF14722 KRAP_IP3R_bind: Ki-ra 25.3 85 0.0018 25.7 3.2 28 77-104 71-98 (160)
24 TIGR02147 Fsuc_second hypothet 25.2 73 0.0016 27.1 2.9 25 75-99 8-32 (271)
25 PF10963 DUF2765: Protein of u 22.9 1.2E+02 0.0027 22.1 3.3 14 82-95 48-61 (83)
26 PRK14126 cell division protein 20.2 2.1E+02 0.0047 20.3 4.1 32 79-110 34-65 (85)
No 1
>PF04690 YABBY: YABBY protein; InterPro: IPR006780 YABBY proteins are a group of plant-specific transcription factors involved in the specification of abaxial polarity in lateral organs such as leaves and floral organs [, ].
Probab=100.00 E-value=4.6e-36 Score=238.32 Aligned_cols=66 Identities=73% Similarity=1.163 Sum_probs=60.5
Q ss_pred CCCCccccccc-cccCCCCCCCCcccCCCCchhhHHHHHHHHHHHhhCCCCCHHHHHHHHHHhhccCCCccccc
Q 032204 46 PTNKATEERVV-NRRESPHSTTPEKRQRVPSAYNQFIKEEIQRIKANNPDISHREAFSTAAKNWAHFPHIHFGL 118 (145)
Q Consensus 46 ~~~~~~~~~~~-~R~~~a~~~p~eK~~R~PSAYN~FmKeEiqRiKa~nP~isHKEaFs~AAkNWa~~P~~~~g~ 118 (145)
..+++.+++++ +| |||||||+|||||+|||+||+|||++||+|+|||||++||+||||+|||||||
T Consensus 104 ~~~~~~pr~~~v~k-------PPEKRqR~psaYn~f~k~ei~rik~~~p~ishkeaFs~aAknW~h~phihfgl 170 (170)
T PF04690_consen 104 SEEEEIPRAPPVNK-------PPEKRQRVPSAYNRFMKEEIQRIKAENPDISHKEAFSAAAKNWAHFPHIHFGL 170 (170)
T ss_pred cccccccccccccC-------CccccCCCchhHHHHHHHHHHHHHhcCCCCCHHHHHHHHHHhhhhCcccccCC
Confidence 44556776665 67 99999999999999999999999999999999999999999999999999997
No 2
>PF09011 HMG_box_2: HMG-box domain; InterPro: IPR015101 This domain is predominantly found in Maelstrom homologue proteins. It has no known function. ; GO: 0005634 nucleus; PDB: 2EQZ_A 1V64_A 2CTO_A 1H5P_A 3TQ6_A 3FGH_A 3TMM_A 1J3X_A 2YRQ_A 1AAB_A ....
Probab=98.57 E-value=1e-07 Score=63.93 Aligned_cols=46 Identities=35% Similarity=0.634 Sum_probs=38.8
Q ss_pred CcccCCCCchhhHHHHHHHHHHHhh-CCCCCHHHHHHHHHHhhccCC
Q 032204 67 PEKRQRVPSAYNQFIKEEIQRIKAN-NPDISHREAFSTAAKNWAHFP 112 (145)
Q Consensus 67 ~eK~~R~PSAYN~FmKeEiqRiKa~-nP~isHKEaFs~AAkNWa~~P 112 (145)
|.|++|.+|||+.||++.+..++.. .+.++.+|+++.++..|+..+
T Consensus 1 p~kpK~~~say~lF~~~~~~~~k~~G~~~~~~~e~~k~~~~~Wk~Ls 47 (73)
T PF09011_consen 1 PKKPKRPPSAYNLFMKEMRKEVKEEGGQKQSFREVMKEISERWKSLS 47 (73)
T ss_dssp SSS--SSSSHHHHHHHHHHHHHHHHT-T-SSHHHHHHHHHHHHHHS-
T ss_pred CcCCCCCCCHHHHHHHHHHHHHHHhcccCCCHHHHHHHHHHHHHhcC
Confidence 5688999999999999999999999 889999999999999999854
No 3
>cd01390 HMGB-UBF_HMG-box HMGB-UBF_HMG-box, class II and III members of the HMG-box superfamily of DNA-binding proteins. These proteins bind the minor groove of DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III members include nucleolar and mitochondrial transcription factors, UBF and mtTF1, which bind four-way DNA junctions.
Probab=98.31 E-value=1.8e-06 Score=55.06 Aligned_cols=44 Identities=30% Similarity=0.471 Sum_probs=40.8
Q ss_pred cCCCCchhhHHHHHHHHHHHhhCCCCCHHHHHHHHHHhhccCCC
Q 032204 70 RQRVPSAYNQFIKEEIQRIKANNPDISHREAFSTAAKNWAHFPH 113 (145)
Q Consensus 70 ~~R~PSAYN~FmKeEiqRiKa~nP~isHKEaFs~AAkNWa~~P~ 113 (145)
++|.+|+|..||++....+++.+|+++..|+.+.++..|+..+.
T Consensus 1 Pkrp~saf~~f~~~~r~~~~~~~p~~~~~~i~~~~~~~W~~ls~ 44 (66)
T cd01390 1 PKRPLSAYFLFSQEQRPKLKKENPDASVTEVTKILGEKWKELSE 44 (66)
T ss_pred CCCCCcHHHHHHHHHHHHHHHHCcCCCHHHHHHHHHHHHHhCCH
Confidence 46789999999999999999999999999999999999998643
No 4
>cd00084 HMG-box High Mobility Group (HMG)-box is found in a variety of eukaryotic chromosomal proteins and transcription factors. HMGs bind to the minor groove of DNA and have been classified by DNA binding preferences. Two phylogenically distinct groups of Class I proteins bind DNA in a sequence specific fashion and contain a single HMG box. One group (SOX-TCF) includes transcription factors, TCF-1, -3, -4; and also SRY and LEF-1, which bind four-way DNA junctions and duplex DNA targets. The second group (MATA) includes fungal mating type gene products MC, MATA1 and Ste11. Class II and III proteins (HMGB-UBF) bind DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III member
Probab=98.21 E-value=4.1e-06 Score=52.74 Aligned_cols=44 Identities=30% Similarity=0.416 Sum_probs=40.8
Q ss_pred cCCCCchhhHHHHHHHHHHHhhCCCCCHHHHHHHHHHhhccCCC
Q 032204 70 RQRVPSAYNQFIKEEIQRIKANNPDISHREAFSTAAKNWAHFPH 113 (145)
Q Consensus 70 ~~R~PSAYN~FmKeEiqRiKa~nP~isHKEaFs~AAkNWa~~P~ 113 (145)
++|-+|+|..|++++...++..+|+++..|+.+.++..|+..+.
T Consensus 1 pkrp~~af~~f~~~~~~~~~~~~~~~~~~~i~~~~~~~W~~l~~ 44 (66)
T cd00084 1 PKRPLSAYFLFSQEHRAEVKAENPGLSVGEISKILGEMWKSLSE 44 (66)
T ss_pred CCCCCcHHHHHHHHHHHHHHHHCcCCCHHHHHHHHHHHHHhCCH
Confidence 46789999999999999999999999999999999999998654
No 5
>PF00505 HMG_box: HMG (high mobility group) box; InterPro: IPR000910 High mobility group (HMG or HMGB) proteins are a family of relatively low molecular weight non-histone components in chromatin. HMG1 (also called HMG-T in fish) and HMG2 are two highly related proteins that bind single-stranded DNA preferentially and unwind double-stranded DNA. Although they have no sequence specificity, they have a high affinity for bent or distorted DNA, and bend linear DNA. HMG1 and HMG2 contain two DNA-binding HMG-box domains (A and B) that show structural and functional differences, and have a long acidic C-terminal domain rich in aspartic and glutamic acid residues. The acidic tail modulates the affinity of the tandem HMG boxes in HMG1 and 2 for a variety of DNA targets. HMG1 and 2 appear to play important architectural roles in the assembly of nucleoprotein complexes in a variety of biological processes, for example V(D)J recombination, the initiation of transcription, and DNA repair []. The profile in this entry describing the HMG-domains is much more general than the signature. In addition to the HMG1 and HMG2 proteins, HMG-domains occur in single or multiple copies in the following protein classes; the SOX family of transcription factors; SRY sex determining region Y protein and related proteins []; LEF1 lymphoid enhancer binding factor 1 []; SSRP recombination signal recognition protein; MTF1 mitochondrial transcription factor 1; UBF1/2 nucleolar transcription factors; Abf2 yeast ARS-binding factor []; and Saccharomyces cerevisiae transcription factors Ixr1, Rox1, Nhp6a, Nhp6b and Spp41.; GO: 0003677 DNA binding; PDB: 1I11_A 1J3C_A 1J3D_A 1WZ6_A 1WGF_A 2D7L_A 1GT0_D 3U2B_C 2CRJ_A 2CS1_A ....
Probab=98.20 E-value=3e-06 Score=54.59 Aligned_cols=43 Identities=33% Similarity=0.613 Sum_probs=38.4
Q ss_pred cCCCCchhhHHHHHHHHHHHhhCCCCCHHHHHHHHHHhhccCC
Q 032204 70 RQRVPSAYNQFIKEEIQRIKANNPDISHREAFSTAAKNWAHFP 112 (145)
Q Consensus 70 ~~R~PSAYN~FmKeEiqRiKa~nP~isHKEaFs~AAkNWa~~P 112 (145)
++|.+|+|..|+++....|++.||+++..|+-+.++..|+..+
T Consensus 1 PkrP~~af~lf~~~~~~~~k~~~p~~~~~~i~~~~~~~W~~l~ 43 (69)
T PF00505_consen 1 PKRPPNAFMLFCKEKRAKLKEENPDLSNKEISKILAQMWKNLS 43 (69)
T ss_dssp SSSS--HHHHHHHHHHHHHHHHSTTSTHHHHHHHHHHHHHCSH
T ss_pred CcCCCCHHHHHHHHHHHHHHHHhcccccccchhhHHHHHhcCC
Confidence 4689999999999999999999999999999999999999843
No 6
>cd01388 SOX-TCF_HMG-box SOX-TCF_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include SRY and its homologs in insects and vertebrates, and transcription factor-like proteins, TCF-1, -3, -4, and LEF-1. They appear to bind the minor groove of the A/T C A A A G/C-motif.
Probab=98.19 E-value=4.2e-06 Score=56.00 Aligned_cols=43 Identities=16% Similarity=0.297 Sum_probs=40.3
Q ss_pred CCCCchhhHHHHHHHHHHHhhCCCCCHHHHHHHHHHhhccCCC
Q 032204 71 QRVPSAYNQFIKEEIQRIKANNPDISHREAFSTAAKNWAHFPH 113 (145)
Q Consensus 71 ~R~PSAYN~FmKeEiqRiKa~nP~isHKEaFs~AAkNWa~~P~ 113 (145)
+|.||||..|+++....|+++||+++..|.-+.++..|+..+.
T Consensus 3 KrP~naf~~F~~~~r~~~~~~~p~~~~~eisk~l~~~Wk~ls~ 45 (72)
T cd01388 3 KRPMNAFMLFSKRHRRKVLQEYPLKENRAISKILGDRWKALSN 45 (72)
T ss_pred CCCCcHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHcCCH
Confidence 6789999999999999999999999999999999999998653
No 7
>smart00398 HMG high mobility group.
Probab=98.18 E-value=4.6e-06 Score=53.04 Aligned_cols=45 Identities=33% Similarity=0.515 Sum_probs=41.6
Q ss_pred ccCCCCchhhHHHHHHHHHHHhhCCCCCHHHHHHHHHHhhccCCC
Q 032204 69 KRQRVPSAYNQFIKEEIQRIKANNPDISHREAFSTAAKNWAHFPH 113 (145)
Q Consensus 69 K~~R~PSAYN~FmKeEiqRiKa~nP~isHKEaFs~AAkNWa~~P~ 113 (145)
+++|.+|+|..|+++....+++.+|+++..|..+.++..|+..+.
T Consensus 1 ~pkrp~~~y~~f~~~~r~~~~~~~~~~~~~~i~~~~~~~W~~l~~ 45 (70)
T smart00398 1 KPKRPMSAFMLFSQENRAKIKAENPDLSNAEISKKLGERWKLLSE 45 (70)
T ss_pred CcCCCCcHHHHHHHHHHHHHHHHCcCCCHHHHHHHHHHHHHcCCH
Confidence 467899999999999999999999999999999999999998654
No 8
>cd01389 MATA_HMG-box MATA_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include the fungal mating type gene products MC, MATA1 and Ste11.
Probab=98.18 E-value=4.3e-06 Score=56.45 Aligned_cols=45 Identities=18% Similarity=0.348 Sum_probs=41.8
Q ss_pred ccCCCCchhhHHHHHHHHHHHhhCCCCCHHHHHHHHHHhhccCCC
Q 032204 69 KRQRVPSAYNQFIKEEIQRIKANNPDISHREAFSTAAKNWAHFPH 113 (145)
Q Consensus 69 K~~R~PSAYN~FmKeEiqRiKa~nP~isHKEaFs~AAkNWa~~P~ 113 (145)
+.+|-|+||-.|+++....|+++||++++.|.-+.++..|+..+.
T Consensus 1 ~~kRP~naf~lf~~~~r~~~~~~~p~~~~~eisk~~g~~Wk~ls~ 45 (77)
T cd01389 1 KIPRPRNAFILYRQDKHAQLKTENPGLTNNEISRIIGRMWRSESP 45 (77)
T ss_pred CCCCCCcHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHhhCCH
Confidence 357899999999999999999999999999999999999998653
No 9
>PTZ00199 high mobility group protein; Provisional
Probab=98.09 E-value=9e-06 Score=57.94 Aligned_cols=49 Identities=27% Similarity=0.441 Sum_probs=44.0
Q ss_pred CCCcccCCCCchhhHHHHHHHHHHHhhCCCCC--HHHHHHHHHHhhccCCC
Q 032204 65 TTPEKRQRVPSAYNQFIKEEIQRIKANNPDIS--HREAFSTAAKNWAHFPH 113 (145)
Q Consensus 65 ~p~eK~~R~PSAYN~FmKeEiqRiKa~nP~is--HKEaFs~AAkNWa~~P~ 113 (145)
+.|.+++|.+|||..||++.-..|+++||+++ ..|..+.++..|+..++
T Consensus 18 kdp~~PKrP~sAY~~F~~~~R~~i~~~~P~~~~~~~evsk~ige~Wk~ls~ 68 (94)
T PTZ00199 18 KDPNAPKRALSAYMFFAKEKRAEIIAENPELAKDVAAVGKMVGEAWNKLSE 68 (94)
T ss_pred CCCCCCCCCCcHHHHHHHHHHHHHHHHCcCCcccHHHHHHHHHHHHHcCCH
Confidence 36778899999999999999999999999987 68899999999998654
No 10
>KOG0381 consensus HMG box-containing protein [General function prediction only]
Probab=97.35 E-value=0.00059 Score=46.99 Aligned_cols=46 Identities=28% Similarity=0.441 Sum_probs=42.0
Q ss_pred cccCCCCchhhHHHHHHHHHHHhhCCCCCHHHHHHHHHHhhccCCC
Q 032204 68 EKRQRVPSAYNQFIKEEIQRIKANNPDISHREAFSTAAKNWAHFPH 113 (145)
Q Consensus 68 eK~~R~PSAYN~FmKeEiqRiKa~nP~isHKEaFs~AAkNWa~~P~ 113 (145)
..++|-+|||..|+.+.-.+||.+||+++..|.-+.+..+|.....
T Consensus 21 ~~pkrp~sa~~~f~~~~~~~~k~~~p~~~~~~v~k~~g~~W~~l~~ 66 (96)
T KOG0381|consen 21 QAPKRPLSAFFLFSSEQRSKIKAENPGLSVGEVAKALGEMWKNLAE 66 (96)
T ss_pred CCCCCCCcHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHhcCCH
Confidence 3678899999999999999999999999999999999999988543
No 11
>PF06244 DUF1014: Protein of unknown function (DUF1014); InterPro: IPR010422 This family consists of several hypothetical eukaryotic proteins of unknown function.
Probab=97.03 E-value=0.0008 Score=51.75 Aligned_cols=50 Identities=26% Similarity=0.519 Sum_probs=44.8
Q ss_pred CCcccCCCCchhhHHHHHHHHHHHhhCCCCCHHHHHHHHHHhhccCCCcccc
Q 032204 66 TPEKRQRVPSAYNQFIKEEIQRIKANNPDISHREAFSTAAKNWAHFPHIHFG 117 (145)
Q Consensus 66 p~eK~~R~PSAYN~FmKeEiqRiKa~nP~isHKEaFs~AAkNWa~~P~~~~g 117 (145)
.|||| .--||..|.-.+|.+||++||++.+-.---..=+.|..+|+|+|-
T Consensus 71 HPErR--~KAAy~afeE~~Lp~lK~E~PgLrlsQ~kq~l~K~w~KSPeNP~N 120 (122)
T PF06244_consen 71 HPERR--MKAAYKAFEERRLPELKEENPGLRLSQYKQMLWKEWQKSPENPFN 120 (122)
T ss_pred Ccchh--HHHHHHHHHHHHhHHHHhhCCCchHHHHHHHHHHHHhcCCCCCcc
Confidence 45554 457999999999999999999999999999999999999999984
No 12
>KOG3223 consensus Uncharacterized conserved protein [Function unknown]
Probab=93.52 E-value=0.094 Score=44.34 Aligned_cols=51 Identities=29% Similarity=0.541 Sum_probs=45.7
Q ss_pred CCcccCCCCchhhHHHHHHHHHHHhhCCCCCHHHHHHHHHHhhccCCCccccc
Q 032204 66 TPEKRQRVPSAYNQFIKEEIQRIKANNPDISHREAFSTAAKNWAHFPHIHFGL 118 (145)
Q Consensus 66 p~eK~~R~PSAYN~FmKeEiqRiKa~nP~isHKEaFs~AAkNWa~~P~~~~g~ 118 (145)
-||||=| -||--|=..++.|||.+||++.|-+-=-+.-+.|..+|.++|.-
T Consensus 163 HPEkRmr--AA~~afEe~~LPrLK~e~P~lrlsQ~Kqll~Kew~KsPDNP~Nq 213 (221)
T KOG3223|consen 163 HPEKRMR--AAFKAFEEARLPRLKKENPGLRLSQYKQLLKKEWQKSPDNPFNQ 213 (221)
T ss_pred ChHHHHH--HHHHHHHHhhchhhhhcCCCccHHHHHHHHHHHHhhCCCChhhH
Confidence 6777655 59999999999999999999999988888899999999999963
No 13
>COG5648 NHP6B Chromatin-associated proteins containing the HMG domain [Chromatin structure and dynamics]
Probab=81.96 E-value=3.3 Score=35.02 Aligned_cols=47 Identities=23% Similarity=0.429 Sum_probs=43.1
Q ss_pred CCcccCCCCchhhHHHHHHHHHHHhhCCCCCHHHHHHHHHHhhccCC
Q 032204 66 TPEKRQRVPSAYNQFIKEEIQRIKANNPDISHREAFSTAAKNWAHFP 112 (145)
Q Consensus 66 p~eK~~R~PSAYN~FmKeEiqRiKa~nP~isHKEaFs~AAkNWa~~P 112 (145)
-|--++|--|||-.|..+.=.+|+..+|+++.-|.=+.+.+.|+...
T Consensus 67 dpN~PKRp~sayf~y~~~~R~ei~~~~p~l~~~e~~k~~~e~WK~Lt 113 (211)
T COG5648 67 DPNGPKRPLSAYFLYSAENRDEIRKENPKLTFGEVGKLLSEKWKELT 113 (211)
T ss_pred CCCCCCCchhHHHHHHHHHHHHHHHhCCCCChHHHHHHHHHHHHhcc
Confidence 46678899999999999999999999999999999999999999853
No 14
>PF08073 CHDNT: CHDNT (NUC034) domain; InterPro: IPR012958 The CHD N-terminal domain is found in PHD/RING fingers and chromo domain-associated helicases [].; GO: 0003677 DNA binding, 0005524 ATP binding, 0008270 zinc ion binding, 0016818 hydrolase activity, acting on acid anhydrides, in phosphorus-containing anhydrides, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=74.91 E-value=5.9 Score=27.18 Aligned_cols=37 Identities=16% Similarity=0.352 Sum_probs=28.1
Q ss_pred hhhHHHHHHHHHHHhhCCCCCHHHHHHHHHHhhccCC
Q 032204 76 AYNQFIKEEIQRIKANNPDISHREAFSTAAKNWAHFP 112 (145)
Q Consensus 76 AYN~FmKeEiqRiKa~nP~isHKEaFs~AAkNWa~~P 112 (145)
.|..|-.-==.-|-+.||++.+-..+.+.+..|+.+-
T Consensus 15 ~yK~Fsq~vRP~l~~~NPk~~~sKl~~l~~AKwrEF~ 51 (55)
T PF08073_consen 15 NYKAFSQHVRPLLAKANPKAPMSKLMMLLQAKWREFQ 51 (55)
T ss_pred HHHHHHHHHHHHHHHHCCCCcHHHHHHHHHHHHHHHH
Confidence 3444443333556789999999999999999999873
No 15
>KOG0526 consensus Nucleosome-binding factor SPN, POB3 subunit [Transcription; Replication, recombination and repair; Chromatin structure and dynamics]
Probab=45.63 E-value=32 Score=33.24 Aligned_cols=43 Identities=26% Similarity=0.494 Sum_probs=39.0
Q ss_pred CcccCCCCchhhHHHHHHHHHHHhhCCCCCHHHHHHHHHHhhccC
Q 032204 67 PEKRQRVPSAYNQFIKEEIQRIKANNPDISHREAFSTAAKNWAHF 111 (145)
Q Consensus 67 ~eK~~R~PSAYN~FmKeEiqRiKa~nP~isHKEaFs~AAkNWa~~ 111 (145)
|-.++|+-|||-.|...+-..||++ +|+.-|.=+.+...|+.-
T Consensus 533 pnapkra~sa~m~w~~~~r~~ik~d--gi~~~dv~kk~g~~wk~m 575 (615)
T KOG0526|consen 533 PNAPKRATSAYMLWLNASRESIKED--GISVGDVAKKAGEKWKQM 575 (615)
T ss_pred CCCCccchhHHHHHHHhhhhhHhhc--CchHHHHHHHHhHHHhhh
Confidence 3467899999999999999999999 999999999999999973
No 16
>PF05047 L51_S25_CI-B8: Mitochondrial ribosomal protein L51 / S25 / CI-B8 domain ; InterPro: IPR007741 Proteins containing this domain are located in the mitochondrion and include ribosomal protein L51, and S25. This domain is also found in mitochondrial NADH-ubiquinone oxidoreductase B8 subunit (CI-B8) 1.6.5.3 from EC. It is not known whether all members of this family form part of the NADH-ubiquinone oxidoreductase and whether they are also all ribosomal proteins.; PDB: 1S3A_A.
Probab=42.51 E-value=23 Score=22.18 Aligned_cols=18 Identities=28% Similarity=0.719 Sum_probs=15.7
Q ss_pred hHHHHHHHHHHHhhCCCC
Q 032204 78 NQFIKEEIQRIKANNPDI 95 (145)
Q Consensus 78 N~FmKeEiqRiKa~nP~i 95 (145)
..|+++.+..|+..||++
T Consensus 2 R~F~~~~lp~l~~~NP~v 19 (52)
T PF05047_consen 2 RDFLKNNLPTLKYHNPQV 19 (52)
T ss_dssp HHHHHHTHHHHHHHSTT-
T ss_pred HhHHHHhHHHHHHHCCCc
Confidence 369999999999999987
No 17
>PF06382 DUF1074: Protein of unknown function (DUF1074); InterPro: IPR024460 This family consists of several proteins which appear to be specific to Insecta. The function of this family is unknown.
Probab=41.80 E-value=39 Score=28.25 Aligned_cols=35 Identities=23% Similarity=0.574 Sum_probs=29.9
Q ss_pred chhhHHHHHHHHHHHhhCCCCCHHHHHHHHHHhhccCCC
Q 032204 75 SAYNQFIKEEIQRIKANNPDISHREAFSTAAKNWAHFPH 113 (145)
Q Consensus 75 SAYN~FmKeEiqRiKa~nP~isHKEaFs~AAkNWa~~P~ 113 (145)
.+|=.||.+ .+..|.+++.+|....||+-|...++
T Consensus 84 naYLNFLRe----FRrkh~~L~p~dlI~~AAraW~rLSe 118 (183)
T PF06382_consen 84 NAYLNFLRE----FRRKHCGLSPQDLIQRAARAWCRLSE 118 (183)
T ss_pred hHHHHHHHH----HHHHccCCCHHHHHHHHHHHHHhCCH
Confidence 578888764 77789999999999999999988644
No 18
>PF05164 ZapA: Cell division protein ZapA; InterPro: IPR007838 This entry a structural domain found in the cell division protein ZapA, as well as in related proteins. This domain has a core structure consisting of two layers alpha/beta, and has a long C-terminal helix that forms dimeric parallel and tetrameric antiparallel coiled coils []. ZapA interacts with FtsZ, where FtsZ is part of a mid-cell cytokinetic structure termed the Z-ring that recruits a hierarchy of fission related proteins early in the bacterial cell cycle. ZapA drives the polymerisation and filament bundling of FtsZ, thereby contributing to the spatio-temporal tuning of the Z-ring.; PDB: 1T3U_B 1W2E_B 3HNW_A.
Probab=31.01 E-value=76 Score=21.15 Aligned_cols=32 Identities=34% Similarity=0.420 Sum_probs=29.0
Q ss_pred HHHHHHHHHHHhhCCCCCHHHHHHHHHHhhcc
Q 032204 79 QFIKEEIQRIKANNPDISHREAFSTAAKNWAH 110 (145)
Q Consensus 79 ~FmKeEiqRiKa~nP~isHKEaFs~AAkNWa~ 110 (145)
.++.+.|..++...|.++...++..||-|-++
T Consensus 28 ~~i~~~i~~~~~~~~~~~~~~~~vlaaLnla~ 59 (89)
T PF05164_consen 28 ELINEKINEIKKKYPKLSPERLAVLAALNLAD 59 (89)
T ss_dssp HHHHHHHHHHCTTCCTSSHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHcCCCCHHHHHHHHHHHHHH
Confidence 57889999999999999999999999988664
No 19
>KOG0527 consensus HMG-box transcription factor [Transcription]
Probab=28.98 E-value=1.1e+02 Score=27.24 Aligned_cols=64 Identities=17% Similarity=0.359 Sum_probs=51.2
Q ss_pred CCCCCcccCCCCchhhHHHHHHHHHHHhhCCCCCHHHHHHHHHHhhccCCCcccccccccCCCCCcchhhhh
Q 032204 63 HSTTPEKRQRVPSAYNQFIKEEIQRIKANNPDISHREAFSTAAKNWAHFPHIHFGLMLEANNQPKLDDASGN 134 (145)
Q Consensus 63 ~~~p~eK~~R~PSAYN~FmKeEiqRiKa~nP~isHKEaFs~AAkNWa~~P~~~~g~~~~~~~~~k~d~~~~~ 134 (145)
..+..++-+|--.||=.|=+.|=++|-.+||+|---|.=|...+.|+.. -+..|..-+|++.--
T Consensus 56 ~k~~~~hIKRPMNAFMVWSq~~RRkma~qnP~mHNSEISK~LG~~WK~L--------se~EKrPFi~EAeRL 119 (331)
T KOG0527|consen 56 DKTSTDRIKRPMNAFMVWSQGQRRKLAKQNPKMHNSEISKRLGAEWKLL--------SEEEKRPFVDEAERL 119 (331)
T ss_pred CCCCccccCCCcchhhhhhHHHHHHHHHhCcchhhHHHHHHHHHHHhhc--------CHhhhccHHHHHHHH
Confidence 4456778889999999999999999999999998899999999999963 344455555665433
No 20
>PF12876 Cellulase-like: Sugar-binding cellulase-like; InterPro: IPR024778 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This entry represents a family of putative cellulase enzymes.; PDB: 3GYC_B.
Probab=26.76 E-value=1.1e+02 Score=20.97 Aligned_cols=29 Identities=28% Similarity=0.453 Sum_probs=21.6
Q ss_pred cccCCCCchhhHHHHHHHHHHHhhCCCCC
Q 032204 68 EKRQRVPSAYNQFIKEEIQRIKANNPDIS 96 (145)
Q Consensus 68 eK~~R~PSAYN~FmKeEiqRiKa~nP~is 96 (145)
+.......+|-.||++=.+.||+.+|..+
T Consensus 31 ~~~~~~~~~~~~~l~~~~~~iR~~dP~~p 59 (88)
T PF12876_consen 31 EWGDPKAEAYAEWLKEAFRWIRAVDPSQP 59 (88)
T ss_dssp -TT-TTSHHHHHHHHHHHHHHHTT-TTS-
T ss_pred cccchhHHHHHHHHHHHHHHHHHhCCCCc
Confidence 33444678999999999999999999753
No 21
>PF04769 MAT_Alpha1: Mating-type protein MAT alpha 1; InterPro: IPR006856 This family includes Saccharomyces cerevisiae (Baker's yeast) mating type protein alpha 1 (P01365 from SWISSPROT). MAT alpha 1 is a transcription activator that activates mating-type alpha-specific genes with the help of the MADS-box containing MCM1 transcription factor, which together bind cooperatively to PQ elements upstream of alpha-specific genes. The MCM1-MATalpha1 complex is required for the proper DNA-bending that is needed for transcriptional activation []. Alpha 1 interacts in vivo with STE12, linking expression of alpha-specific genes to the alpha-pheromone (IPR006742 from INTERPRO) response pathway [].; GO: 0000772 mating pheromone activity, 0003677 DNA binding, 0045895 positive regulation of transcription, mating-type specific, 0005634 nucleus
Probab=26.55 E-value=1.8e+02 Score=24.07 Aligned_cols=46 Identities=22% Similarity=0.329 Sum_probs=37.9
Q ss_pred CCcccCCCCchhhHHHHHHHHHHHhhCCCCCHHHHHHHHHHhhccCCCcc
Q 032204 66 TPEKRQRVPSAYNQFIKEEIQRIKANNPDISHREAFSTAAKNWAHFPHIH 115 (145)
Q Consensus 66 p~eK~~R~PSAYN~FmKeEiqRiKa~nP~isHKEaFs~AAkNWa~~P~~~ 115 (145)
..++++|.-.+|+.|+.== +...|+.+.|++-...+..|...|+-.
T Consensus 40 ~~~~~kr~lN~Fm~FRsyy----~~~~~~~~Qk~~S~~l~~lW~~dp~k~ 85 (201)
T PF04769_consen 40 SPEKAKRPLNGFMAFRSYY----SPIFPPLPQKELSGILTKLWEKDPFKN 85 (201)
T ss_pred cccccccchhHHHHHHHHH----HhhcCCcCHHHHHHHHHHHHhCCccHh
Confidence 5677888888899887643 367889999999999999999987654
No 22
>KOG4715 consensus SWI/SNF-related matrix-associated actin-dependent regulator of chromatin [Chromatin structure and dynamics]
Probab=26.53 E-value=93 Score=28.69 Aligned_cols=42 Identities=19% Similarity=0.402 Sum_probs=37.5
Q ss_pred CCCchhhHHHHHHHHHHHhhCCCCCHHHHHHHHHHhhccCCC
Q 032204 72 RVPSAYNQFIKEEIQRIKANNPDISHREAFSTAAKNWAHFPH 113 (145)
Q Consensus 72 R~PSAYN~FmKeEiqRiKa~nP~isHKEaFs~AAkNWa~~P~ 113 (145)
+-.-.|-+|-+.=-..+|+.||++--=|.=+.++..|.+.|.
T Consensus 67 kpl~pymrySrkvWd~VkA~nPe~kLWeiGK~Ig~mW~dLpd 108 (410)
T KOG4715|consen 67 KPLMPYMRYSRKVWDQVKASNPELKLWEIGKIIGGMWLDLPD 108 (410)
T ss_pred cccchhhHHhhhhhhhhhccCcchHHHHHHHHHHHHHhhCcc
Confidence 345689999999999999999999999999999999999774
No 23
>PF14722 KRAP_IP3R_bind: Ki-ras-induced actin-interacting protein-IP3R-interacting domain
Probab=25.30 E-value=85 Score=25.74 Aligned_cols=28 Identities=21% Similarity=0.470 Sum_probs=22.2
Q ss_pred hhHHHHHHHHHHHhhCCCCCHHHHHHHH
Q 032204 77 YNQFIKEEIQRIKANNPDISHREAFSTA 104 (145)
Q Consensus 77 YN~FmKeEiqRiKa~nP~isHKEaFs~A 104 (145)
...|++.+++||..++|...---+|.+.
T Consensus 71 ~~~FL~aQ~qrm~~E~p~~~l~~RFRQ~ 98 (160)
T PF14722_consen 71 IRVFLEAQKQRMDIENPNLALASRFRQL 98 (160)
T ss_pred HHHHHHHHHHHHHhhCccHHHHHHHHHH
Confidence 4679999999999999996555555543
No 24
>TIGR02147 Fsuc_second hypothetical protein, TIGR02147. This family consists of the 40 members of a paralogous protein family in the rumen anaerobe Fibrobacter succinogenes S85. Member proteins are about 270 residues long and appear to lack signal sequences and transmembrane helices. The only perfectly conserved residue is a glycine in an otherwise poorly conserved region, suggesting members are not enzymes. The family is not characterized.
Probab=25.21 E-value=73 Score=27.14 Aligned_cols=25 Identities=20% Similarity=0.486 Sum_probs=23.5
Q ss_pred chhhHHHHHHHHHHHhhCCCCCHHH
Q 032204 75 SAYNQFIKEEIQRIKANNPDISHRE 99 (145)
Q Consensus 75 SAYN~FmKeEiqRiKa~nP~isHKE 99 (145)
+-|..||++...+-|..+|..|.|+
T Consensus 8 ~dYR~fl~d~ye~rk~~~p~fS~R~ 32 (271)
T TIGR02147 8 TDYRKYLRDYYEERKKTDPAFSWRF 32 (271)
T ss_pred hhHHHHHHHHHHHHhccCcCcCHHH
Confidence 4699999999999999999999997
No 25
>PF10963 DUF2765: Protein of unknown function (DUF2765); InterPro: IPR024406 This family of proteins with no known function is found in phages and suspected prophages.
Probab=22.85 E-value=1.2e+02 Score=22.06 Aligned_cols=14 Identities=29% Similarity=0.567 Sum_probs=9.6
Q ss_pred HHHHHHHHhhCCCC
Q 032204 82 KEEIQRIKANNPDI 95 (145)
Q Consensus 82 KeEiqRiKa~nP~i 95 (145)
|+.+..|-++.||.
T Consensus 48 KeaL~~lle~~PGa 61 (83)
T PF10963_consen 48 KEALKELLEENPGA 61 (83)
T ss_pred HHHHHHHHHHCCCH
Confidence 46677777777775
No 26
>PRK14126 cell division protein ZapA; Provisional
Probab=20.23 E-value=2.1e+02 Score=20.26 Aligned_cols=32 Identities=16% Similarity=0.365 Sum_probs=29.0
Q ss_pred HHHHHHHHHHHhhCCCCCHHHHHHHHHHhhcc
Q 032204 79 QFIKEEIQRIKANNPDISHREAFSTAAKNWAH 110 (145)
Q Consensus 79 ~FmKeEiqRiKa~nP~isHKEaFs~AAkNWa~ 110 (145)
.|+-+.|..|+..+|.++-..+-.+||-|-++
T Consensus 34 ~~vd~km~ei~~~~~~ls~~~iAVLaALNia~ 65 (85)
T PRK14126 34 AIVDDKMRELNEKNPSLDTSKLAVLTAVNVIH 65 (85)
T ss_pred HHHHHHHHHHHHhCCCCCHHHHHHHHHHHHHH
Confidence 58889999999999999999999999998765
Done!