Query 537021.9.peg.603_1 Match_columns 37 No_of_seqs 1 out of 3 Neff 1.0 Searched_HMMs 39220 Date Wed May 25 02:00:20 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i peg_603.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 TIGR01896 cas_AF1879 CRISPR-as 48.1 5.7 0.00015 21.8 0.3 22 14-35 16-39 (295) 2 TIGR01756 LDH_protist lactate 29.5 6.4 0.00016 21.6 -1.9 27 5-37 224-250 (314) 3 pfam12293 DUF3625 Protein of u 23.6 62 0.0016 16.7 2.3 25 6-30 169-199 (242) 4 PRK09275 aspartate aminotransf 22.0 29 0.00074 18.4 0.3 23 1-23 200-236 (531) 5 pfam09248 DUF1965 Domain of un 18.4 53 0.0013 17.1 1.0 16 12-27 24-39 (74) 6 TIGR01239 galT_2 galactose-1-p 18.0 49 0.0012 17.3 0.7 11 4-14 293-303 (502) 7 PRK05753 nucleoside diphosphat 17.0 45 0.0012 17.4 0.4 30 1-33 88-117 (135) 8 COG4343 CRISPR-associated prot 13.8 55 0.0014 17.0 0.1 22 14-35 25-47 (281) 9 smart00694 DysFC Dysferlin dom 12.2 56 0.0014 16.9 -0.2 15 17-32 1-15 (34) 10 pfam06023 DUF911 Archaeal prot 9.8 89 0.0023 16.0 0.1 10 14-23 26-35 (290) No 1 >TIGR01896 cas_AF1879 CRISPR-associated protein, Csa1 family; InterPro: IPR009260 This family consists of several archaeal strongly conserved proteins whose genes are associated with CRISPRs (Clustered, Regularly Interspaced Short Palidromic Repeats). The function of these proteins has not been experimentally determined, but computational analysis has suggested that they may function as nucleases in DNA repair, similar to RecB (IPR004586 from INTERPRO) . . Probab=48.14 E-value=5.7 Score=21.85 Aligned_cols=22 Identities=50% Similarity=0.800 Sum_probs=12.4 Q ss_pred EEECCCCCCEE-EEEEE-CCCCCC Q ss_conf 54133213256-66642-276202 Q 537021.9.peg.6 14 KLRGWQYGEEF-MLRTY-SGESLS 35 (37) Q Consensus 14 klrgwqygeef-mlrty-sgesls 35 (37) .||||.|++-= .=+.| -|-|+| T Consensus 16 ELRGW~y~~PPv~P~~y~~~Ls~S 39 (295) T TIGR01896 16 ELRGWNYNEPPVKPPAYALGLSLS 39 (295) T ss_pred CCCCCCCCCCCCCCCCCCCCCHHH T ss_conf 147654785578898530122144 No 2 >TIGR01756 LDH_protist lactate dehydrogenase; InterPro: IPR011272 This entry represents a family of protist (Trichomonad) lactate dehydrogenases which have apparently evolved from a recent protist malate dehydrogenase ancestor . Lactate dehydrogenase converts the hydroxyl at C-2 of lactate to a carbonyl in the product, pyruvate. The preference of this enzyme for NAD or NADP has not been determined. A critical residue in malate dehydrogenase, arginine-91 (Trichomonas vaginalis numbering) has been mutated to a leucine, eliminating the positive charge which complemented the carboxylate in malate which is absent in lactate. Several other more subtle changes are proposed to make the active site smaller to accommodate the less bulky lactate molecule .. Probab=29.50 E-value=6.4 Score=21.61 Aligned_cols=27 Identities=41% Similarity=0.820 Sum_probs=21.3 Q ss_pred CCCCCEEEEEEECCCCCCEEEEEEECCCCCCCC Q ss_conf 053203455541332132566664227620269 Q 537021.9.peg.6 5 FSPVEAAIIKLRGWQYGEEFMLRTYSGESLSAG 37 (37) Q Consensus 5 fspveaaiiklrgwqygeefmlrtysgeslsag 37 (37) -||+.|++-..|.|-+|. -.||-||-| T Consensus 224 aS~~~Asl~HMkAWLFGT------~~~~~lS~G 250 (314) T TIGR01756 224 ASPVKASLQHMKAWLFGT------KPGEVLSLG 250 (314) T ss_pred CCHHHHHHHHHHHHHCCC------CCCCEEEEC T ss_conf 306789999878761578------877767703 No 3 >pfam12293 DUF3625 Protein of unknown function (DUF3625). This family of proteins is found in bacteria. Proteins in this family are typically between 327 and 360 amino acids in length. Probab=23.61 E-value=62 Score=16.74 Aligned_cols=25 Identities=32% Similarity=0.610 Sum_probs=18.3 Q ss_pred CCCCEEEEEEE------CCCCCCEEEEEEEC Q ss_conf 53203455541------33213256666422 Q 537021.9.peg.6 6 SPVEAAIIKLR------GWQYGEEFMLRTYS 30 (37) Q Consensus 6 spveaaiiklr------gwqygeefmlrtys 30 (37) .|.+|.-+|.. .||||....+||-. T Consensus 169 PP~~akrLk~sG~~d~~aW~~~d~~YvRT~~ 199 (242) T pfam12293 169 PPKGAKRLKTTGGVDTQAWQMGDDLYVRTRL 199 (242) T ss_pred CCCCCEEEEECCCCCCHHHEECCEEEEEECC T ss_conf 9866678773499741300459868997467 No 4 >PRK09275 aspartate aminotransferase; Provisional Probab=21.97 E-value=29 Score=18.38 Aligned_cols=23 Identities=35% Similarity=0.904 Sum_probs=13.2 Q ss_pred CCCCCCCCC---------EEEEEE-----ECCCCCCE Q ss_conf 964105320---------345554-----13321325 Q 537021.9.peg.6 1 MIPIFSPVE---------AAIIKL-----RGWQYGEE 23 (37) Q Consensus 1 mipifspve---------aaiikl-----rgwqygee 23 (37) |.|||+|-- -.++.+ .+|||-++ T Consensus 200 ~~PIFTPYlEIP~L~~Y~l~vV~i~a~~~~~WQy~d~ 236 (531) T PRK09275 200 MTPIFTPYLEIPELPDYDLEVVHISADEKNEWQYPDK 236 (531) T ss_pred ECCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCHH T ss_conf 5677776301677666786289985565458877989 No 5 >pfam09248 DUF1965 Domain of unknown function (DUF1965). Members of this family of fungal domains adopt a structure that consists of an alpha/beta motif. Their exact function has not, as yet, been determined. Probab=18.39 E-value=53 Score=17.10 Aligned_cols=16 Identities=38% Similarity=0.758 Sum_probs=12.4 Q ss_pred EEEEECCCCCCEEEEE Q ss_conf 5554133213256666 Q 537021.9.peg.6 12 IIKLRGWQYGEEFMLR 27 (37) Q Consensus 12 iiklrgwqygeefmlr 27 (37) --|++||-|+.+|--- T Consensus 24 ~Wkv~gw~y~~~fy~t 39 (74) T pfam09248 24 KWKLEGWLYNDIFYTT 39 (74) T ss_pred HCEEEEEEECCEEECC T ss_conf 9689999989998305 No 6 >TIGR01239 galT_2 galactose-1-phosphate uridylyltransferase; InterPro: IPR000766 Galactose-1-phosphate uridyl transferase 2.7.7.12 from EC (galT) catalyzes the transfer of an uridyldiphosphate group on galactose (or glucose) 1-phosphate. During the reaction, the uridyl moiety links to a histidine residue. In the Escherichia coli enzyme, it has been shown that two histidine residues separated by a single proline residue are essential for enzyme activity. On the basis of sequence similarities, two apparently unrelated families seem to exist. Class-I enzymes are found in eukaryotes as well as some bacteria such as E. coli or Streptomyces lividans, while class-II enzymes have been found so far only in bacteria such as Bacillus subtilis or Lactobacillus helveticus .; GO: 0008108 UDP-glucose:hexose-1-phosphate uridylyltransferase activity, 0006012 galactose metabolic process, 0005737 cytoplasm. Probab=18.03 E-value=49 Score=17.27 Aligned_cols=11 Identities=36% Similarity=0.540 Sum_probs=7.9 Q ss_pred CCCCCCEEEEE Q ss_conf 10532034555 Q 537021.9.peg.6 4 IFSPVEAAIIK 14 (37) Q Consensus 4 ifspveaaiik 14 (37) -|+-|+++|+| T Consensus 293 ~yp~V~~Giv~ 303 (502) T TIGR01239 293 DYPDVSAGIVK 303 (502) T ss_pred CCCCCEEEEEE T ss_conf 77972065420 No 7 >PRK05753 nucleoside diphosphate kinase regulator; Provisional Probab=17.01 E-value=45 Score=17.42 Aligned_cols=30 Identities=23% Similarity=0.481 Sum_probs=23.3 Q ss_pred CCCCCCCCCEEEEEEECCCCCCEEEEEEECCCC Q ss_conf 964105320345554133213256666422762 Q 537021.9.peg.6 1 MIPIFSPVEAAIIKLRGWQYGEEFMLRTYSGES 33 (37) Q Consensus 1 mipifspveaaiiklrgwqygeefmlrtysges 33 (37) .|.++||+-+|++ |-.-|+++-..+-+|+. T Consensus 88 kISv~SPiG~ALl---G~~~Gd~i~~~~P~G~~ 117 (135) T PRK05753 88 QVSVLAPVGAALL---GLSVGQSIDWPLPGGKE 117 (135) T ss_pred CCCCCCHHHHHHH---CCCCCCEEEEECCCCCE T ss_conf 2010169999973---89999999998899977 No 8 >COG4343 CRISPR-associated protein, RecB family exonuclease [Defense mechanisms] Probab=13.84 E-value=55 Score=17.00 Aligned_cols=22 Identities=41% Similarity=0.634 Sum_probs=14.4 Q ss_pred EEECCCCCCEE-EEEEECCCCCC Q ss_conf 54133213256-66642276202 Q 537021.9.peg.6 14 KLRGWQYGEEF-MLRTYSGESLS 35 (37) Q Consensus 14 klrgwqygeef-mlrtysgesls 35 (37) .||||.|+|-- --+.|.+-|+| T Consensus 25 ELRGWnw~~PPVkp~~~~~lslS 47 (281) T COG4343 25 ELRGWNWNEPPVKPPRYVRLSLS 47 (281) T ss_pred HHCCCCCCCCCCCCCCCCCCCHH T ss_conf 76487689999888111575498 No 9 >smart00694 DysFC Dysferlin domain, C-terminal region. Domain of unknown function present in yeast peroxisomal proteins, dysferlin, myoferlin and hypothetical proteins. Due to an insertion of a dysferlin domain within a second dysferlin domain we have chosen to predict these domains in two parts: the N-terminal region and the C-terminal region. Probab=12.19 E-value=56 Score=16.95 Aligned_cols=15 Identities=40% Similarity=1.063 Sum_probs=10.2 Q ss_pred CCCCCCEEEEEEECCC Q ss_conf 3321325666642276 Q 537021.9.peg.6 17 GWQYGEEFMLRTYSGE 32 (37) Q Consensus 17 gwqygeefmlrtysge 32 (37) ||||...| -..|+.+ T Consensus 1 gW~Ys~~f-~~~~~~~ 15 (34) T smart00694 1 GWQYSDNF-WANYHKT 15 (34) T ss_pred CCCCCCCC-CHHHCCH T ss_conf 97512467-2664750 No 10 >pfam06023 DUF911 Archaeal protein of unknown function (DUF911). This family consists of several archaeal proteins of unknown function. Probab=9.83 E-value=89 Score=15.98 Aligned_cols=10 Identities=50% Similarity=1.298 Sum_probs=8.0 Q ss_pred EEECCCCCCE Q ss_conf 5413321325 Q 537021.9.peg.6 14 KLRGWQYGEE 23 (37) Q Consensus 14 klrgwqygee 23 (37) .||||.|.+- T Consensus 26 ELRGWnw~~P 35 (290) T pfam06023 26 ELRGWNWNEP 35 (290) T ss_pred HHCCCCCCCC T ss_conf 7669889999 Done!