Query 537021.9.peg.349_1 Match_columns 50 No_of_seqs 1 out of 3 Neff 1.0 Searched_HMMs 39220 Date Tue May 24 23:22:04 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i peg_349.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 KOG3235 consensus 38.2 3.5 9E-05 22.2 -2.1 15 5-19 28-42 (193) 2 TIGR02149 glgA_Coryne glycogen 33.7 12 0.00031 19.5 0.1 21 11-31 148-177 (416) 3 PRK00565 rplV 50S ribosomal pr 33.0 23 0.00058 18.2 1.4 31 20-50 1-33 (111) 4 cd00336 Ribosomal_L22 Ribosoma 32.4 16 0.00041 19.0 0.5 22 29-50 10-31 (105) 5 TIGR01250 pro_imino_pep_2 prol 28.9 18 0.00045 18.8 0.2 31 10-49 106-138 (302) 6 TIGR02144 LysX_arch Lysine bio 25.9 30 0.00077 17.6 1.0 27 14-40 256-282 (289) 7 pfam11328 DUF3130 Protein of u 23.9 29 0.00073 17.7 0.6 21 15-35 30-50 (90) 8 CHL00034 rpl22 ribosomal prote 19.2 39 0.001 17.0 0.5 20 31-50 13-32 (112) 9 pfam00237 Ribosomal_L22 Riboso 19.0 40 0.001 17.0 0.5 21 30-50 9-29 (105) 10 TIGR02841 spore_YyaC putative 15.4 80 0.002 15.5 1.3 25 17-42 113-137 (140) No 1 >KOG3235 consensus Probab=38.16 E-value=3.5 Score=22.24 Aligned_cols=15 Identities=40% Similarity=0.793 Sum_probs=12.7 Q ss_pred EEEEEEEEECCCCCC Q ss_conf 988676212143575 Q 537021.9.peg.3 5 LIKYYFYQLSSWNGN 19 (50) Q Consensus 5 likyyfyqlsswngn 19 (50) +.|||||.++||-+- T Consensus 28 qmkyylyh~lswp~l 42 (193) T KOG3235 28 QMKYYLYHGLSWPQL 42 (193) T ss_pred HHHHHHHHHCCCCCC T ss_conf 478899862266553 No 2 >TIGR02149 glgA_Coryne glycogen synthase, Corynebacterium family; InterPro: IPR011875 This entry describes Corynebacterium glutamicum GlgA and closely related proteins in other species. This enzyme is required for glycogen biosynthesis and appears to replace the distantly related IPR011835 from INTERPRO, family of ADP-glucose type glycogen synthase in Corynebacterium glutamicum, Mycobacterium tuberculosis, Bifidobacterium longum, and Streptomyces coelicolor .. Probab=33.73 E-value=12 Score=19.54 Aligned_cols=21 Identities=48% Similarity=0.817 Sum_probs=13.5 Q ss_pred EEECCCCC---------CEEEEEEECCHHH Q ss_conf 21214357---------5055212113046 Q 537021.9.peg.3 11 YQLSSWNG---------NMEYSGTMRNTSL 31 (50) Q Consensus 11 yqlsswng---------nmeysgtmrntsl 31 (50) ||||||-- -.--||.||+--| T Consensus 148 Y~lSsW~EktA~~aAd~vIAVS~amr~DiL 177 (416) T TIGR02149 148 YKLSSWAEKTAIEAADRVIAVSGAMREDIL 177 (416) T ss_pred CCCCHHHHHHHHHHCCCEEEHHHHCHHHHH T ss_conf 420247888899850406531110335583 No 3 >PRK00565 rplV 50S ribosomal protein L22; Reviewed Probab=32.96 E-value=23 Score=18.21 Aligned_cols=31 Identities=32% Similarity=0.294 Sum_probs=21.0 Q ss_pred EEEEEEECC--HHHHHHHHHHHHHCCCCCCCCC Q ss_conf 055212113--0466776776530563210249 Q 537021.9.peg.3 20 MEYSGTMRN--TSLVTALRYAQHIKGMSVDNAI 50 (50) Q Consensus 20 meysgtmrn--tslvtalryaqhikgmsvdnai 50 (50) ||+....|+ .|.--.-+.+..|+||+|+.|+ T Consensus 1 m~~~A~~k~ir~Sp~K~r~v~~~IrG~~v~eAl 33 (111) T PRK00565 1 MEAKAKARFVRISPRKARLVADLIRGKKVEEAL 33 (111) T ss_pred CEEEEECCCCCCCHHHHHHHHHHHCCCCHHHHH T ss_conf 917987187754889999999998699199999 No 4 >cd00336 Ribosomal_L22 Ribosomal protein L22/L17e. L22 (L17 in eukaryotes) is a core protein of the large ribosomal subunit. It is the only ribosomal protein that interacts with all six domains of 23S rRNA, and is one of the proteins important for directing the proper folding and stabilizing the conformation of 23S rRNA. L22 is the largest protein contributor to the surface of the polypeptide exit channel, the tunnel through which the polypeptide product passes. L22 is also one of six proteins located at the putative translocon binding site on the exterior surface of the ribosome. Probab=32.42 E-value=16 Score=18.97 Aligned_cols=22 Identities=45% Similarity=0.465 Sum_probs=16.9 Q ss_pred HHHHHHHHHHHHHCCCCCCCCC Q ss_conf 0466776776530563210249 Q 537021.9.peg.3 29 TSLVTALRYAQHIKGMSVDNAI 50 (50) Q Consensus 29 tslvtalryaqhikgmsvdnai 50 (50) .|.--.-+.+..|+||+||.|+ T Consensus 10 ~S~kK~~~v~~~IrG~~v~~A~ 31 (105) T cd00336 10 ISPKKARLVARLIRGMSVDEAL 31 (105) T ss_pred CCHHHHHHHHHHHCCCCHHHHH T ss_conf 5789999999998699199999 No 5 >TIGR01250 pro_imino_pep_2 proline-specific peptidases; InterPro: IPR005945 Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes . They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence . Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases . Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base . The geometric orientations of the catalytic residues are similar between families, despite different protein folds . The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) , . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This group of serine peptidase belong to MEROPS peptidase family S33 (clan SC). They are proline iminopeptidase (Prolyl aminopeptidase, 3.4.11.5 from EC), which catalyzes the removal of the N-terminal proline from peptides. This family represents one of two related families of proline iminopeptidase containing the alpha/beta fold. The fine specificities of the various members, including both the range of short peptides from which proline can be removed and whether other amino acids such as alanine can be also removed, may vary among members. One of the members of this family is the tricorn protease (TRI) interacting factor 1 from Thermoplasma acidophilum. Factor 1 (F1) is a 33.5 kDa serine peptidase of the alpha/beta-hydrolase family. Tricorn generates small peptides, which are cleaved by F1 to yield single amino acids , . ; GO: 0016804 prolyl aminopeptidase activity, 0005737 cytoplasm. Probab=28.88 E-value=18 Score=18.76 Aligned_cols=31 Identities=42% Similarity=0.853 Sum_probs=21.4 Q ss_pred EEEE-CCCCCCEEEEEEECCHHHHHHHHH-HHHHCCCCCCCC Q ss_conf 6212-143575055212113046677677-653056321024 Q 537021.9.peg.3 10 FYQL-SSWNGNMEYSGTMRNTSLVTALRY-AQHIKGMSVDNA 49 (50) Q Consensus 10 fyql-sswngnmeysgtmrntslvtalry-aqhikgmsvdna 49 (50) ||-| +||.|-. ...=||+| +||+||.-+.|. T Consensus 106 fyllG~SWGG~L---------A~~Yal~Yh~~~lkglI~ss~ 138 (302) T TIGR01250 106 FYLLGHSWGGLL---------AQEYALKYHGQHLKGLIISSM 138 (302) T ss_pred EEEEECCHHHHH---------HHHHHHHHCCCCCEEEEEECC T ss_conf 789702678999---------999999737898269998556 No 6 >TIGR02144 LysX_arch Lysine biosynthesis enzyme LysX; InterPro: IPR011870 The family of proteins found in this family include the characterised LysX from Thermus thermophilus which is part of a well-organised lysine biosynthesis gene cluster . LysX is believed to carry out an ATP-dependent acylation of the amino group of alpha-aminoadipate in the prokaryotic version of the fungal AAA lysine biosynthesis pathway. No species having a sequence in this family contains the elements of the more common diaminopimelate lysine biosythesis pathway, and none has been shown to be a lysine auxotroph. These sequences have mainly recieved the name of the related enzyme, "ribosomal protein S6 modification protein RimK". RimK has been characterised in Escherichia coli, and acts by ATP-dependent condensation of S6 with glutamate residues .; GO: 0003824 catalytic activity, 0005524 ATP binding, 0009085 lysine biosynthetic process. Probab=25.88 E-value=30 Score=17.60 Aligned_cols=27 Identities=26% Similarity=0.304 Sum_probs=18.6 Q ss_pred CCCCCCEEEEEEECCHHHHHHHHHHHH Q ss_conf 143575055212113046677677653 Q 537021.9.peg.3 14 SSWNGNMEYSGTMRNTSLVTALRYAQH 40 (50) Q Consensus 14 sswngnmeysgtmrntslvtalryaqh 40 (50) .--|++|||-+++|-|..=-|-.-+.| T Consensus 256 ~EVN~~~EFKN~~rvTGvNva~~l~~y 282 (289) T TIGR02144 256 NEVNGVPEFKNTVRVTGVNVAEKLVEY 282 (289) T ss_pred EEECCCCCCCCCEEECCCCHHHHHHHH T ss_conf 735689772785777474537899999 No 7 >pfam11328 DUF3130 Protein of unknown function (DUF3130. This bacterial family of proteins has no known function. Probab=23.91 E-value=29 Score=17.71 Aligned_cols=21 Identities=33% Similarity=0.368 Sum_probs=12.7 Q ss_pred CCCCCEEEEEEECCHHHHHHH Q ss_conf 435750552121130466776 Q 537021.9.peg.3 15 SWNGNMEYSGTMRNTSLVTAL 35 (50) Q Consensus 15 swngnmeysgtmrntslvtal 35 (50) --||||.||-+---.-+-||| T Consensus 30 mK~gNMaYSraNSIn~~RtAl 50 (90) T pfam11328 30 LKDGNMAFSRANSIDQLRSAL 50 (90) T ss_pred CCCCCCCHHHHCCHHHHHHHH T ss_conf 369974034313098999999 No 8 >CHL00034 rpl22 ribosomal protein L22 Probab=19.25 E-value=39 Score=17.01 Aligned_cols=20 Identities=25% Similarity=0.336 Sum_probs=13.7 Q ss_pred HHHHHHHHHHHCCCCCCCCC Q ss_conf 66776776530563210249 Q 537021.9.peg.3 31 LVTALRYAQHIKGMSVDNAI 50 (50) Q Consensus 31 lvtalryaqhikgmsvdnai 50 (50) ---+-..+..|+||+||.|+ T Consensus 13 prK~r~v~~~IRG~~v~~Al 32 (112) T CHL00034 13 PHKVRRVIDQIRGRSYEEAL 32 (112) T ss_pred HHHHHHHHHHHCCCCHHHHH T ss_conf 79999999998699199999 No 9 >pfam00237 Ribosomal_L22 Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes. Probab=19.00 E-value=40 Score=16.96 Aligned_cols=21 Identities=38% Similarity=0.365 Sum_probs=14.8 Q ss_pred HHHHHHHHHHHHCCCCCCCCC Q ss_conf 466776776530563210249 Q 537021.9.peg.3 30 SLVTALRYAQHIKGMSVDNAI 50 (50) Q Consensus 30 slvtalryaqhikgmsvdnai 50 (50) |.--+-+.+..|+||+|+.|+ T Consensus 9 S~kK~r~v~~~Irg~~v~~A~ 29 (105) T pfam00237 9 SPKKARLVADLIRGKSVEEAL 29 (105) T ss_pred CHHHHHHHHHHHCCCCHHHHH T ss_conf 889999999998699199999 No 10 >TIGR02841 spore_YyaC putative sporulation protein YyaC; InterPro: IPR009665 This family consists of several uncharacterised bacterial proteins, which seem to be specific to the orders Clostridia and Bacillales. Family members are typically around 180 residues in length. The function of this family is unknown.. Probab=15.43 E-value=80 Score=15.48 Aligned_cols=25 Identities=36% Similarity=0.551 Sum_probs=20.5 Q ss_pred CCCEEEEEEECCHHHHHHHHHHHHHC Q ss_conf 57505521211304667767765305 Q 537021.9.peg.3 17 NGNMEYSGTMRNTSLVTALRYAQHIK 42 (50) Q Consensus 17 ngnmeysgtmrntslvtalryaqhik 42 (50) .|-|||. -..||-|-+-++-||-|- T Consensus 113 sG~mEf~-VLQNTRL~~VM~mA~~is 137 (140) T TIGR02841 113 SGFMEFF-VLQNTRLSTVMKMADVIS 137 (140) T ss_pred CCCCCHH-HHHHHHHHHHHHHHHHHH T ss_conf 8640101-102127889999999998 Done!