Query gi|255764480|ref|YP_003084343.1| hypothetical protein CLIBASIA_03352 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 57 No_of_seqs 1 out of 3 Neff 1.0 Searched_HMMs 39220 Date Sun May 29 18:23:49 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 255764480.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 TIGR01436 glu_cys_lig_pln glut 38.0 11 0.00029 19.9 0.5 26 6-33 201-226 (446) 2 pfam07350 DUF1479 Protein of u 33.8 16 0.00042 19.2 0.7 28 1-31 322-349 (408) 3 cd01936 Ntn_CA Cephalosporin a 27.9 15 0.00038 19.4 -0.2 21 8-28 426-446 (469) 4 KOG1643 consensus 22.6 24 0.00062 18.4 0.0 16 12-27 220-235 (247) 5 pfam10574 UPF0552 Uncharacteri 15.2 50 0.0013 16.9 0.2 27 26-53 2-35 (224) 6 pfam02668 TauD Taurine catabol 14.5 44 0.0011 17.2 -0.2 22 2-23 93-114 (253) 7 TIGR01835 HMG-CoA-S_prok hydro 12.2 61 0.0016 16.5 -0.0 12 17-28 327-338 (412) 8 TIGR00108 eRF peptide chain re 11.5 78 0.002 16.0 0.3 11 3-13 122-132 (425) 9 pfam08829 AlphaC_N Alpha C pro 10.1 79 0.002 16.0 -0.0 25 13-37 88-112 (172) 10 KOG2204 consensus 9.7 90 0.0023 15.7 0.1 22 16-57 317-338 (625) No 1 >TIGR01436 glu_cys_lig_pln glutamate--cysteine ligase; InterPro: IPR011556 These sequences represent one of two highly dissimilar forms of glutamate--cysteine ligase (gamma-glutamylcysteine synthetase), an enzyme of glutathione biosynthesis. The other group is represented by IPR006335 from INTERPRO. This form is found in plants (with a probable transit peptide), root nodule and other bacteria, but not Escherichia coli and closely related species. ; GO: 0004357 glutamate-cysteine ligase activity, 0006750 glutathione biosynthetic process, 0009507 chloroplast. Probab=38.02 E-value=11 Score=19.93 Aligned_cols=26 Identities=46% Similarity=0.692 Sum_probs=19.6 Q ss_pred ECCCCCCCCCCHHHHHHHCCCEEEEEHH Q ss_conf 0321114482423353206640354112 Q gi|255764480|r 6 CDSRFAEDKPDIQGFLSYGSSTIYHTTA 33 (57) Q Consensus 6 cdsrfaedkpdiqgflsygsstiyhtta 33 (57) ..|-|.|-||. ||+||-|.-=-||-. T Consensus 201 ANSPFlEGkPN--GFlSyRshIWtdTDp 226 (446) T TIGR01436 201 ANSPFLEGKPN--GFLSYRSHIWTDTDP 226 (446) T ss_pred CCCCCCCCCCC--CCCCCCCCCCCCCCC T ss_conf 37866687878--754355354668887 No 2 >pfam07350 DUF1479 Protein of unknown function (DUF1479). This family consists of several hypothetical Enterobacterial proteins, of around 420 residues in length. Members of this family are often known as YbiU. The function of this family is unknown. Probab=33.76 E-value=16 Score=19.20 Aligned_cols=28 Identities=36% Similarity=0.564 Sum_probs=16.2 Q ss_pred CEEEEECCCCCCCCCCHHHHHHHCCCEEEEE Q ss_conf 9147503211144824233532066403541 Q gi|255764480|r 1 MMFWHCDSRFAEDKPDIQGFLSYGSSTIYHT 31 (57) Q Consensus 1 mmfwhcdsrfaedkpdiqgflsygsstiyht 31 (57) ..|||||.-.+-|+-.-+.. -||.+|-- T Consensus 322 ~V~WHcD~iH~V~~~h~g~~---~ssV~Yip 349 (408) T pfam07350 322 YVFWHCDLIHAVDPEHNGKG---DSSVMYIP 349 (408) T ss_pred EEEECCCCCCCCCCCCCCCC---CCEEEEEC T ss_conf 67876775444465568888---66048851 No 3 >cd01936 Ntn_CA Cephalosporin acylase (CA) belongs to a family of beta-lactam acylases that includes penicillin G acylase (PGA) and aculeacin A acylase. PGA and CA are crucial for the production of backbone chemicals like 6-aminopenicillanic acid and 7-aminocephalosporanic acid (7-ACA), which can be used to synthesize semi-synthetic penicillins and cephalosporins, respectively. While both PGA and CA have a conserved Ntn (N-terminal nucleophile) hydrolase fold and the structural similarity at their active sites is very high, their sequence similarity to other Ntn's is low. Probab=27.90 E-value=15 Score=19.36 Aligned_cols=21 Identities=38% Similarity=0.721 Sum_probs=17.0 Q ss_pred CCCCCCCCCHHHHHHHCCCEE Q ss_conf 211144824233532066403 Q gi|255764480|r 8 SRFAEDKPDIQGFLSYGSSTI 28 (57) Q Consensus 8 srfaedkpdiqgflsygssti 28 (57) -+|.+++|.+++.|+||+|+- T Consensus 426 V~f~~~g~~a~~il~yg~S~d 446 (469) T cd01936 426 VEFDDGGPRARGLLTYGQSTD 446 (469) T ss_pred EEECCCCEEEEEEECCCCCCC T ss_conf 996799858999965765899 No 4 >KOG1643 consensus Probab=22.63 E-value=24 Score=18.39 Aligned_cols=16 Identities=56% Similarity=0.877 Sum_probs=13.4 Q ss_pred CCCCCHHHHHHHCCCE Q ss_conf 4482423353206640 Q gi|255764480|r 12 EDKPDIQGFLSYGSST 27 (57) Q Consensus 12 edkpdiqgflsygsst 27 (57) -++|||.|||--|.|- T Consensus 220 ~~~~diDGFLVGGaSL 235 (247) T KOG1643 220 AKKPDIDGFLVGGASL 235 (247) T ss_pred CCCCCCCCEEECCCCC T ss_conf 3654566347757546 No 5 >pfam10574 UPF0552 Uncharacterized protein family UPF0552. This family of proteins has no known function. Probab=15.17 E-value=50 Score=16.93 Aligned_cols=27 Identities=48% Similarity=0.877 Sum_probs=17.3 Q ss_pred CEEEEEHHHHH-------HHHHHCCEEEEECCCCE Q ss_conf 40354112113-------44431000023246871 Q gi|255764480|r 26 STIYHTTALRE-------WSTIWSSEFFMESEDGT 53 (57) Q Consensus 26 stiyhttalre-------wstiwsseffmesedgt 53 (57) |.|||.+||+. |+..|..+-+ ..-+|. T Consensus 2 Sriy~~~aL~nk~v~s~~~~g~w~P~~~-q~G~Gv 35 (224) T pfam10574 2 SRIYQNTALQNKPVHSEKFSGTWDPEVF-QSGDGV 35 (224) T ss_pred CCCCCCCCCCCCCEECCCCCCCCCHHHH-CCCCEE T ss_conf 6200460003885130205775074571-478648 No 6 >pfam02668 TauD Taurine catabolism dioxygenase TauD, TfdA family. This family consists of taurine catabolism dioxygenases of the TauD, TfdA family. TauD from E. coli is a alpha-ketoglutarate-dependent taurine dioxygenase. This enzyme catalyses the oxygenolytic release of sulfite from taurine. TfdA from Burkholderia sp. is a 2,4-dichlorophenoxyacetic acid/alpha-ketoglutarate dioxygenase. TfdA from Alcaligenes eutrophus JMP134 is a 2,4-dichlorophenoxyacetate monooxygenase. Also included are gamma-Butyrobetaine hydroxylase enzymes EC:1.14.11.1. Probab=14.51 E-value=44 Score=17.17 Aligned_cols=22 Identities=32% Similarity=0.620 Sum_probs=18.3 Q ss_pred EEEEECCCCCCCCCCHHHHHHH Q ss_conf 1475032111448242335320 Q gi|255764480|r 2 MFWHCDSRFAEDKPDIQGFLSY 23 (57) Q Consensus 2 mfwhcdsrfaedkpdiqgflsy 23 (57) +.||+|+-|.+..|.++.+... T Consensus 93 ~~wHtD~sy~~~pp~~~~L~c~ 114 (253) T pfam02668 93 LPWHTDLSYLEEPPGIQLLHCL 114 (253) T ss_pred CCCCCCCCCCCCCCCEEEEEEE T ss_conf 7877767887689862789966 No 7 >TIGR01835 HMG-CoA-S_prok hydroxymethylglutaryl-CoA synthase; InterPro: IPR011554 Synonym(s): 3-hydroxy-3-methylglutaryl-coenzyme A synthase, HMG-CoA synthase. Hydroxymethylglutaryl-CoA synthase (2.3.3.10 from EC) catalyses the condensation of acetyl-CoA with acetoacetyl-CoA to produce HMG-CoA and CoA, the second reaction in the mevalonate-dependent isoprenoid biosynthesis pathway. HMG-CoA synthase contains an important catalytic cysteine residue that acts as a nucleophile in the first step of the reaction: the acetylation of the enzyme by acetyl-CoA (its first substrate) to produce an acetyl-enzyme thioester, releasing the reduced coenzyme A. The subsequent nucleophilic attack on acetoacetyl-CoA (its second substrate) leads to the formation of HMG-CoA . HMG-CoA synthase occurs in eukaryotes, archaea and certain bacteria . In vertebrates, there are two isozymes located in different subcellular compartments: a cytosolic form that is the starting point of the mevalonate pathway (leads to cholesterol and other sterolic and isoprenoid compounds), and a mitochondrial form responsible for ketone body biosynthesis. HMG-CoA is also found in other eukaryotes such as insects, plants and fungi . In bacteria, isoprenoid precursors are generally synthesised via an alternative, non-mevalonate pathway, however a number of Gram-positive pathogens utilise a mevalonate pathway involving HMG-CoA synthase that is parallel to that found in eukaryotes , . This entry represents HMG-CoA synthase enzymes found in a limited spectrum of mostly Gram-positive bacteria, which make isopentenyl pyrophosphate (IPP) via the mevalonate pathway.. Probab=12.20 E-value=61 Score=16.51 Aligned_cols=12 Identities=42% Similarity=0.819 Sum_probs=9.2 Q ss_pred HHHHHHHCCCEE Q ss_conf 233532066403 Q gi|255764480|r 17 IQGFLSYGSSTI 28 (57) Q Consensus 17 iqgflsygssti 28 (57) --||+||||..| T Consensus 327 ~Igl~SYGSGav 338 (412) T TIGR01835 327 KIGLFSYGSGAV 338 (412) T ss_pred EEEEEECCCCCE T ss_conf 111001252201 No 8 >TIGR00108 eRF peptide chain release factor eRF/aRF, subunit 1; InterPro: IPR004403 These proteins are translation factors that have been characterised in eukaryotes as the non-GTP-binding subunit of a cytosolic heterodimer that acts as a translation release factor for all three stop codons. Members of this orthologous family are found in Eukarya and Archaea. The name used should be eRF1 for the Archaea and aRF1 for the Eukarya. Alternative names include eRF1, SUP45, omnipotent suppressor protein 1.; GO: 0016149 translation release factor activity codon specific, 0006415 translational termination, 0005737 cytoplasm. Probab=11.51 E-value=78 Score=16.00 Aligned_cols=11 Identities=45% Similarity=1.117 Sum_probs=7.7 Q ss_pred EEEECCCCCCC Q ss_conf 47503211144 Q gi|255764480|r 3 FWHCDSRFAED 13 (57) Q Consensus 3 fwhcdsrfaed 13 (57) .+||||.|--+ T Consensus 122 ~Y~Cds~Fyle 132 (425) T TIGR00108 122 IYHCDSKFYLE 132 (425) T ss_pred EECCCCCCCHH T ss_conf 34018710110 No 9 >pfam08829 AlphaC_N Alpha C protein N terminal. The alpha C protein (ACP) is found in Streptococcus and acts as an invasin which plays a role in the internalisation and translocation of the organism across human epithelial surfaces. Group B Streptococcus is the leading cause of diseases including bacterial pneumonia, sepsis and meningitis. The N terminal of ACP is associated with virulence and forms a beta sandwich and a three helix bundle. Probab=10.07 E-value=79 Score=15.97 Aligned_cols=25 Identities=44% Similarity=0.311 Sum_probs=18.0 Q ss_pred CCCCHHHHHHHCCCEEEEEHHHHHH Q ss_conf 4824233532066403541121134 Q gi|255764480|r 13 DKPDIQGFLSYGSSTIYHTTALREW 37 (57) Q Consensus 13 dkpdiqgflsygsstiyhttalrew 37 (57) -|||-|--.---+-|||..++||.- T Consensus 88 ~K~DGQiDiV~v~lTiYdst~LRdk 112 (172) T pfam08829 88 TKSDGQIDIVSVSLTIYDSTALRDK 112 (172) T ss_pred CCCCCCEEEEEEEEEEECCHHHHHH T ss_conf 3788826799999999612888877 No 10 >KOG2204 consensus Probab=9.69 E-value=90 Score=15.72 Aligned_cols=22 Identities=55% Similarity=0.620 Sum_probs=0.0 Q ss_pred CHHHHHHHCCCEEEEEHHHHHHHHHHCCEEEEECCCCEEEEC Q ss_conf 423353206640354112113444310000232468716319 Q gi|255764480|r 16 DIQGFLSYGSSTIYHTTALREWSTIWSSEFFMESEDGTLHFL 57 (57) Q Consensus 16 diqgflsygsstiyhttalrewstiwsseffmesedgtlhfl 57 (57) +-+|++|-|+| |. +|-||||+. T Consensus 317 ~n~~wasgg~S-IL-------------------aE~gtlhle 338 (625) T KOG2204 317 DNYGWASGGSS-IL-------------------AEFGTLHLE 338 (625) T ss_pred CCCCCCCCCCH-HH-------------------HHCCCEEEE T ss_conf 77541358623-75-------------------515730222 Done!