Query 537021.9.peg.523_1 Match_columns 38 No_of_seqs 1 out of 3 Neff 1.0 Searched_HMMs 39220 Date Tue May 24 23:03:22 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i peg_523.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 TIGR01357 aroB 3-dehydroquinat 33.1 20 0.00052 18.8 1.1 18 13-30 98-115 (361) 2 TIGR01597 PYST-B Plasmodium yo 29.5 52 0.0013 16.7 2.7 29 2-30 208-236 (274) 3 KOG2145 consensus 18.6 88 0.0022 15.6 2.1 32 2-33 30-71 (397) 4 pfam04532 DUF587 Protein of un 17.5 44 0.0011 17.1 0.4 15 15-29 136-150 (215) 5 pfam08986 DUF1889 Domain of un 17.2 47 0.0012 17.0 0.5 17 3-19 18-34 (119) 6 PRK00944 hypothetical protein; 16.8 61 0.0016 16.4 1.0 13 18-30 62-74 (193) 7 pfam10755 DUF2585 Protein of u 16.6 62 0.0016 16.3 1.0 13 18-30 35-47 (165) 8 COG0337 AroB 3-dehydroquinate 12.8 81 0.0021 15.7 0.8 18 13-30 106-123 (360) 9 KOG0571 consensus 11.5 1.6E+02 0.0042 14.2 2.0 22 4-25 421-444 (543) 10 TIGR02566 cas_Csy3 CRISPR-asso 10.3 48 0.0012 16.9 -1.1 16 17-32 353-368 (370) No 1 >TIGR01357 aroB 3-dehydroquinate synthase; InterPro: IPR016037 The 3-dehydroquinate synthase (4.2.3.4 from EC) domain is present in isolation in various bacterial 3-dehydroquinate synthases and also present as a domain in the pentafunctional AROM polypeptide (P07547 from SWISSPROT) . 3-dehydroquinate (DHQ) synthase catalyses the formation of dehydroquinate (DHQ) and orthophosphate from 3-deoxy-D-arabino heptulosonic 7 phosphate . This reaction is part of the shikimate pathway which is involved in the biosynthesis of aromatic amino acids.; GO: 0003856 3-dehydroquinate synthase activity, 0009073 aromatic amino acid family biosynthetic process. Probab=33.10 E-value=20 Score=18.77 Aligned_cols=18 Identities=33% Similarity=0.685 Sum_probs=15.2 Q ss_pred HHHHHHHHHHHHHHHHHH Q ss_conf 665531105776513465 Q 537021.9.peg.5 13 ALDMTGYVPSHLLRGKVF 30 (38) Q Consensus 13 aldmtgyvpshllrgkvf 30 (38) ..||+|+|-|-.+||=-| T Consensus 98 vgDlaGFvAaty~RGi~~ 115 (361) T TIGR01357 98 VGDLAGFVAATYMRGIRF 115 (361) T ss_pred HHHHHHHHHHHHHHCCCE T ss_conf 787899999999615582 No 2 >TIGR01597 PYST-B Plasmodium yoelii subtelomeric family PYST-B; InterPro: IPR006484 The sequences in this group represent a paralogous family of Plasmodium yoelii genes preferentially located in the subtelomeric regions of the chromosomes. There are no obvious homologs to these genes in any other organism. . Probab=29.49 E-value=52 Score=16.73 Aligned_cols=29 Identities=38% Similarity=0.461 Sum_probs=27.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 88999999986665531105776513465 Q 537021.9.peg.5 2 LDKLIKGVIDRALDMTGYVPSHLLRGKVF 30 (38) Q Consensus 2 ldklikgvidraldmtgyvpshllrgkvf 30 (38) ++|+.+|.+=|.+-|+=.|-|-|..|-|+ T Consensus 208 l~Kl~kkl~vr~~~~~~~v~s~lv~G~vy 236 (274) T TIGR01597 208 LKKLVKKLIVRLVVLILLVLSLLVSGLVY 236 (274) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 31355678899999999999998212899 No 3 >KOG2145 consensus Probab=18.56 E-value=88 Score=15.57 Aligned_cols=32 Identities=47% Similarity=0.698 Sum_probs=23.7 Q ss_pred HHHHHH---------HHHHHHHHHHHHHHHHHHHHHH-HHHH Q ss_conf 889999---------9998666553110577651346-5456 Q 537021.9.peg.5 2 LDKLIK---------GVIDRALDMTGYVPSHLLRGKV-FGYR 33 (38) Q Consensus 2 ldklik---------gvidraldmtgyvpshllrgkv-fgyr 33 (38) .||||+ ..|+|--..||--|+|+||--+ |.-| T Consensus 30 YdklI~~FG~~k~deeli~R~ek~tg~~~h~flRrg~fFshR 71 (397) T KOG2145 30 YDKLIVQFGCSKIDEELIDRFEKLTGKPPHHFLRRGIFFSHR 71 (397) T ss_pred HHHHHHHHCCCCCCHHHHHHHHHHCCCCCHHHHHHCCEEECC T ss_conf 899999957664899999999985189951776512100004 No 4 >pfam04532 DUF587 Protein of unknown function (DUF587). This family consists of the N termini of some human herpesvirus U58 proteins, and some cytomegalovirus UL87 proteins. This region is always found N terminal to the Pfam family UL87 (pfam03043), which has no known function. Probab=17.48 E-value=44 Score=17.09 Aligned_cols=15 Identities=33% Similarity=0.525 Sum_probs=12.2 Q ss_pred HHHHHHHHHHHHHHH Q ss_conf 553110577651346 Q 537021.9.peg.5 15 DMTGYVPSHLLRGKV 29 (38) Q Consensus 15 dmtgyvpshllrgkv 29 (38) -|++|..+|.+||-. T Consensus 136 vm~~~~~tHVfRGLL 150 (215) T pfam04532 136 VMTNHYATHVFRGLL 150 (215) T ss_pred EECCCCHHHHHHHHH T ss_conf 942874258888777 No 5 >pfam08986 DUF1889 Domain of unknown function (DUF1889). This domain is found in a set of hypothetical bacterial proteins. Probab=17.20 E-value=47 Score=16.95 Aligned_cols=17 Identities=53% Similarity=0.757 Sum_probs=12.9 Q ss_pred HHHHHHHHHHHHHHHHH Q ss_conf 89999999866655311 Q 537021.9.peg.5 3 DKLIKGVIDRALDMTGY 19 (38) Q Consensus 3 dklikgvidraldmtgy 19 (38) ||-...|||.|||..|- T Consensus 18 dk~MPavIdKALDFIg~ 34 (119) T pfam08986 18 DKNMPAVIDKALDFIGA 34 (119) T ss_pred CCCCHHHHHHHHHHHCC T ss_conf 57657899999987403 No 6 >PRK00944 hypothetical protein; Provisional Probab=16.81 E-value=61 Score=16.36 Aligned_cols=13 Identities=38% Similarity=0.679 Sum_probs=10.2 Q ss_pred HHHHHHHHHHHHH Q ss_conf 1105776513465 Q 537021.9.peg.5 18 GYVPSHLLRGKVF 30 (38) Q Consensus 18 gyvpshllrgkvf 30 (38) =|.|||++.|-.| T Consensus 62 WYt~SHiIHGflF 74 (193) T PRK00944 62 WYTFSHIIHGFLF 74 (193) T ss_pred CCCHHHHHHHHHH T ss_conf 5668999999999 No 7 >pfam10755 DUF2585 Protein of unknown function (DUF2585). This family is conserved in Proteobacteria. The function is not known. Probab=16.58 E-value=62 Score=16.31 Aligned_cols=13 Identities=46% Similarity=0.971 Sum_probs=10.0 Q ss_pred HHHHHHHHHHHHH Q ss_conf 1105776513465 Q 537021.9.peg.5 18 GYVPSHLLRGKVF 30 (38) Q Consensus 18 gyvpshllrgkvf 30 (38) =|.|||++.|-.| T Consensus 35 WYt~SHiiHGflF 47 (165) T pfam10755 35 WYTPSHIIHGFLF 47 (165) T ss_pred CCCHHHHHHHHHH T ss_conf 5668999999999 No 8 >COG0337 AroB 3-dehydroquinate synthetase [Amino acid transport and metabolism] Probab=12.82 E-value=81 Score=15.73 Aligned_cols=18 Identities=28% Similarity=0.615 Sum_probs=15.1 Q ss_pred HHHHHHHHHHHHHHHHHH Q ss_conf 665531105776513465 Q 537021.9.peg.5 13 ALDMTGYVPSHLLRGKVF 30 (38) Q Consensus 13 aldmtgyvpshllrgkvf 30 (38) ..||+||+.+-.+||--| T Consensus 106 igDlaGF~Aaty~RGv~f 123 (360) T COG0337 106 IGDLAGFAAATYMRGVRF 123 (360) T ss_pred HHHHHHHHHHHHHHCCCE T ss_conf 777778999999708976 No 9 >KOG0571 consensus Probab=11.55 E-value=1.6e+02 Score=14.22 Aligned_cols=22 Identities=36% Similarity=0.649 Sum_probs=17.6 Q ss_pred HHHHHHHHHHHHHHH--HHHHHHH Q ss_conf 999999986665531--1057765 Q 537021.9.peg.5 4 KLIKGVIDRALDMTG--YVPSHLL 25 (38) Q Consensus 4 klikgvidraldmtg--yvpshll 25 (38) ++-|-|.+.|+|-+. |.|.+.| T Consensus 421 r~eK~vlrsafd~~~~pyLP~eil 444 (543) T KOG0571 421 RIEKYVLRSAFDTTEKPYLPDEIL 444 (543) T ss_pred HHHHHHHHHHCCCCCCCCCHHHHH T ss_conf 489999986138757866707888 No 10 >TIGR02566 cas_Csy3 CRISPR-associated protein, Csy3 family; InterPro: IPR013399 CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) is a widespread family of prokaryotic direct repeats with spacers of unique sequence between consecutive repeats. This entry, typified by YPO2463 of Yersinia pestis, is a CRISPR-associated (Cas) entry strictly associated with the Ypest subtype of CRISPR/Cas locus. It is designated Csy3, for CRISPR/Cas Subtype Ypest protein 3.. Probab=10.30 E-value=48 Score=16.91 Aligned_cols=16 Identities=44% Similarity=0.704 Sum_probs=13.6 Q ss_pred HHHHHHHHHHHHHHHH Q ss_conf 3110577651346545 Q 537021.9.peg.5 17 TGYVPSHLLRGKVFGY 32 (38) Q Consensus 17 tgyvpshllrgkvfgy 32 (38) --||-|-|.||-|||- T Consensus 353 LHfv~a~LIRGGVfg~ 368 (370) T TIGR02566 353 LHFVMAVLIRGGVFGE 368 (370) T ss_pred CEEEEEEEEECCCCCC T ss_conf 0023443210320367 Done!