Query 537021.9.peg.950_1 Match_columns 40 No_of_seqs 1 out of 3 Neff 1.0 Searched_HMMs 39220 Date Wed May 25 03:49:30 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i peg_950.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 pfam08408 DNA_pol_B_3 DNA poly 35.5 18 0.00047 18.5 1.2 15 20-34 79-93 (149) 2 pfam03563 Bunya_G2 Bunyavirus 29.5 16 0.0004 18.9 0.1 11 28-38 28-38 (286) 3 PHA00457 inhibitor of host bac 17.7 96 0.0025 14.9 2.2 23 4-26 7-29 (63) 4 pfam10287 DUF2401 Putative sec 11.3 86 0.0022 15.1 0.5 23 2-24 9-38 (234) 5 pfam06702 DUF1193 Protein of u 9.6 59 0.0015 16.0 -0.8 32 3-35 110-142 (221) 6 KOG4723 consensus 8.9 1.7E+02 0.0043 13.7 1.2 24 7-30 28-58 (248) 7 TIGR00034 aroFGH phospho-2-deh 7.8 2.2E+02 0.0057 13.0 2.4 27 2-28 207-233 (348) 8 PHA02131 hypothetical protein 7.7 2.3E+02 0.0058 13.0 2.4 18 20-37 28-45 (70) 9 PRK00882 consensus 7.7 2.3E+02 0.0058 13.0 2.2 17 18-34 253-269 (295) 10 PRK00723 phosphatidylserine de 7.5 2.3E+02 0.0059 12.9 2.3 16 19-34 255-270 (297) No 1 >pfam08408 DNA_pol_B_3 DNA polymerase family B viral insert. This viral domain is found between the exonuclease domain of the DNA polymerase family B (pfam03104) and the pfam00136 domain, connecting the two. Probab=35.54 E-value=18 Score=18.53 Aligned_cols=15 Identities=33% Similarity=0.545 Sum_probs=12.5 Q ss_pred CCEEEEEEECCCEEE Q ss_conf 875999994276159 Q 537021.9.peg.9 20 NSQCFLMFEKDDISI 34 (40) Q Consensus 20 nsqcflmfekddisi 34 (40) ++-+.|+|.|||+++ T Consensus 79 g~~~~lsfgKDDVdL 93 (149) T pfam08408 79 GNIYTLSFGKDDVDL 93 (149) T ss_pred CCEEEEEECCCCCCH T ss_conf 977999834875668 No 2 >pfam03563 Bunya_G2 Bunyavirus glycoprotein G2. Bunyavirus has three genomic segments: small (S), middle-sized (M), and large (L). The S segment encodes the nucleocapsid and a non-structural protein. The M segment codes for two glycoproteins, G1 and G2, and another non-structural protein (NSm). The L segment codes for an RNA polymerase. This family contains the G2 glycoprotein which interacts with the pfam03557 G1 glycoprotein. Probab=29.51 E-value=16 Score=18.88 Aligned_cols=11 Identities=64% Similarity=0.706 Sum_probs=7.5 Q ss_pred ECCCEEEEEEE Q ss_conf 42761599999 Q 537021.9.peg.9 28 EKDDISIIKFR 38 (40) Q Consensus 28 ekddisiikfr 38 (40) -|||||+||-. T Consensus 28 lKDDISmIKS~ 38 (286) T pfam03563 28 LKDDISMIKSE 38 (286) T ss_pred ECCCHHHEEEE T ss_conf 72625431324 No 3 >PHA00457 inhibitor of host bacterial RNA polymerase Probab=17.74 E-value=96 Score=14.88 Aligned_cols=23 Identities=30% Similarity=0.509 Sum_probs=19.2 Q ss_pred CCCCCCCEEEEEEEECCCEEEEE Q ss_conf 11123672677788418759999 Q 537021.9.peg.9 4 DSLQKDNVSFLLHIIGNSQCFLM 26 (40) Q Consensus 4 dslqkdnvsfllhiignsqcflm 26 (40) .||..+|-.|++++-|+++.|-. T Consensus 7 Gsl~~~~kKf~~tveg~~~sfEv 29 (63) T PHA00457 7 GSLSVDEKKFIATVEGSTQSFEV 29 (63) T ss_pred CCCEECCCEEEEEEECCCCEEEE T ss_conf 53152140799999245604886 No 4 >pfam10287 DUF2401 Putative secretory protein (DUF2401). This family of proteins is conserved in fungi. One member is annotated putatively as OPEL, a house-keeping protein, but this could not be confirmed. It contains 5 highly conserved cysteines two of which form a characteristic CGC sequence motif. Probab=11.33 E-value=86 Score=15.12 Aligned_cols=23 Identities=52% Similarity=0.837 Sum_probs=15.6 Q ss_pred CCC--CCCCCCEEEEEEEEC-----CCEEE Q ss_conf 521--112367267778841-----87599 Q 537021.9.peg.9 2 YYD--SLQKDNVSFLLHIIG-----NSQCF 24 (40) Q Consensus 2 yyd--slqkdnvsfllhiig-----nsqcf 24 (40) ||+ +-..||+.||-|.=| +|.|| T Consensus 9 yY~a~~gtadn~tFlnn~Gg~gsGv~s~~~ 38 (234) T pfam10287 9 YYDASSGTSDNLTFLNNAGGTGSGVWSSCF 38 (234) T ss_pred EEECCCCCCCCEEEECCCCCCCCCEECCCC T ss_conf 676698842365886557987652365556 No 5 >pfam06702 DUF1193 Protein of unknown function (DUF1193). This family represents the C-terminus of several hypothetical eukaryotic proteins of unknown function. Family members contain two conserved motifs: DRHHYE and QCC, as well as a number of conserved cysteine residues. Probab=9.65 E-value=59 Score=15.97 Aligned_cols=32 Identities=38% Similarity=0.699 Sum_probs=20.4 Q ss_pred CCCCCC-CCEEEEEEEECCCEEEEEEECCCEEEE Q ss_conf 211123-672677788418759999942761599 Q 537021.9.peg.9 3 YDSLQK-DNVSFLLHIIGNSQCFLMFEKDDISII 35 (40) Q Consensus 3 ydslqk-dnvsfllhiignsqcflmfekddisii 35 (40) |.+.+| .|-+|++|+ .|..-|----+|++||. T Consensus 110 YEtf~~fg~~~~llhl-Dngk~FG~p~~De~sIL 142 (221) T pfam06702 110 YETFEKFGNETFLLHL-DNGRGFGRPSHDELSIL 142 (221) T ss_pred HHHHHHCCCCCEEEEE-ECCCCCCCCCCCHHHHH T ss_conf 6788743887426886-35745689875425566 No 6 >KOG4723 consensus Probab=8.87 E-value=1.7e+02 Score=13.66 Aligned_cols=24 Identities=42% Similarity=0.750 Sum_probs=17.0 Q ss_pred CCCCEEEEEEEE-------CCCEEEEEEECC Q ss_conf 236726777884-------187599999427 Q 537021.9.peg.9 7 QKDNVSFLLHII-------GNSQCFLMFEKD 30 (40) Q Consensus 7 qkdnvsfllhii-------gnsqcflmfekd 30 (40) .+..+|||+|-. ..+-|||-|-|. T Consensus 28 ~eT~gsFl~H~~l~~~Lkan~~~cFlaf~k~ 58 (248) T KOG4723 28 RETPGSFLFHYYLYHALKANESTCFLAFSKT 58 (248) T ss_pred CCCCCEEEHHHHHHHHHHCCCCEEEEEEECC T ss_conf 5588545399999999733773798775155 No 7 >TIGR00034 aroFGH phospho-2-dehydro-3-deoxyheptonate aldolase; InterPro: IPR006219 Members of this group catalyze the first enzymatic reaction of the shikimate pathway. The common (shikimate) pathway links metabolism of carbohydrates to biosynthesis of aromatic amino acids phenylalanine, tyrosine, tryptophan, and derivatives in microorganisms and in plants. In a sequence of seven enzymatic reactions, D-erythrose 4-phosphate (E4P), an intermediate of the pentose phosphate pathway, and phosphoenol pyruvate (PEP), a glycolytic intermediate, are converted to chorismate. The pathway begins with the stereospecific condensation of E4P and PEP to yield 7-phospho 2-dehydro 3-deoxy-D-arabino-heptulosonate (DAHP), catalyzed by 3-deoxy-7-phosphoheptulonate synthase (DAHPS) (EC 2.5.1.54, formerly EC 4.1.2.15). The divalent metal cation requirement of this enzyme can be satisfied by a broad range of metals . A Cys residue in a Cys-X-X-His motif has been identified as part of a metal binding site . In Escherichia coli, the enzyme exists in three isoforms, each specifically inhibited by one of the three aromatic amino acids. DAHP synthetases fall into two classes, class I (represented by this entry) and class II (represented by PIRSF015573 from PIRSF). Class I was believed to be limited to microorganisms and class II to plants. However, a more recent study showed that class II also contains enzymes from a microbial eukaryote and several bacteria . Brick and Woodard proposed that the difference between the two classes lies in their metal ion requirement for activity. Whereas class I requires no metal cation, class II is dependent on a metal cation for activity. However, recently a class I DAHP synthase from Thermotoga maritima has been purified, characterised, and shown to be a metalloenzyme . The three-dimentional structures of DAHP synthases have been determined , , , , . The DAHPS(Phe) monomer is a (beta/alpha)8 barrel with an additional N-terminal beta strand and helices and an extra beta sheet near the C terminus . The active site is located in a cleft at the carboxyl end of the barrel . The allosteric feedback inhibition binding site of DAHPS(Phe) is composed of residues from two adjacent subunits of a tight dimer and is at least 20 angstroms away from the closest active site . The absence of the shikimate pathway in animals makes it an attractive target for nontoxic herbicidal, antimicrobial, and antifungal agents. The nontoxic herbicide glyphosphate competitively inhibits 3-phosphoshikimate 1-carboxyvinyltransferase, the sixth enzymatic reaction of the pathway.; GO: 0003849 3-deoxy-7-phosphoheptulonate synthase activity, 0009073 aromatic amino acid family biosynthetic process. Probab=7.78 E-value=2.2e+02 Score=13.01 Aligned_cols=27 Identities=22% Similarity=0.454 Sum_probs=22.2 Q ss_pred CCCCCCCCCEEEEEEEECCCEEEEEEE Q ss_conf 521112367267778841875999994 Q 537021.9.peg.9 2 YYDSLQKDNVSFLLHIIGNSQCFLMFE 28 (40) Q Consensus 2 yydslqkdnvsfllhiignsqcflmfe 28 (40) ++=|..|+-++-+.+--||+.||+..- T Consensus 207 ~Fls~~k~G~~ai~~T~GN~~~HiILR 233 (348) T TIGR00034 207 YFLSVTKDGQMAIVQTSGNPDGHIILR 233 (348) T ss_pred EEECCCCCCCEEEEECCCCCCCEEEEC T ss_conf 373126877367875267985248861 No 8 >PHA02131 hypothetical protein Probab=7.71 E-value=2.3e+02 Score=13.00 Aligned_cols=18 Identities=28% Similarity=0.724 Sum_probs=12.4 Q ss_pred CCEEEEEEECCCEEEEEE Q ss_conf 875999994276159999 Q 537021.9.peg.9 20 NSQCFLMFEKDDISIIKF 37 (40) Q Consensus 20 nsqcflmfekddisiikf 37 (40) .-.|+.||..|...--.| T Consensus 28 g~~c~imfk~~~v~dctf 45 (70) T PHA02131 28 GISCWIMFKNDQVIDCTF 45 (70) T ss_pred CEEEEEEECCCCEEEEEE T ss_conf 548999973897887056 No 9 >PRK00882 consensus Probab=7.67 E-value=2.3e+02 Score=12.98 Aligned_cols=17 Identities=47% Similarity=0.812 Sum_probs=13.3 Q ss_pred ECCCEEEEEEECCCEEE Q ss_conf 41875999994276159 Q 537021.9.peg.9 18 IGNSQCFLMFEKDDISI 34 (40) Q Consensus 18 ignsqcflmfekddisi 34 (40) .|.|...|+|||+.|.+ T Consensus 253 fGGSTvillFe~~~i~~ 269 (295) T PRK00882 253 FGGSTVILFFEKNKIKI 269 (295) T ss_pred CCCCEEEEEECCCEEEE T ss_conf 89976999944991788 No 10 >PRK00723 phosphatidylserine decarboxylase; Provisional Probab=7.49 E-value=2.3e+02 Score=12.94 Aligned_cols=16 Identities=50% Similarity=0.841 Sum_probs=0.0 Q ss_pred CCCEEEEEEECCCEEE Q ss_conf 1875999994276159 Q 537021.9.peg.9 19 GNSQCFLMFEKDDISI 34 (40) Q Consensus 19 gnsqcflmfekddisi 34 (40) |.|...|+|||+.|.+ T Consensus 255 GGSTvIllFe~~~i~~ 270 (297) T PRK00723 255 GGSTVILFFEKNKIKI 270 (297) T ss_pred CCCEEEEEECCCCEEE T ss_conf 9976999954992689 Done!