Query gi|254781189|ref|YP_003065602.1| hypothetical protein CLIBASIA_05480 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 252 No_of_seqs 1 out of 3 Neff 1.0 Searched_HMMs 39220 Date Mon May 30 05:54:19 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254781189.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 pfam08549 SWI-SNF_Ssr4 Fungal 48.4 7.9 0.0002 19.6 1.0 17 207-223 280-296 (669) 2 KOG1253 consensus 33.1 20 0.0005 17.0 1.1 47 121-167 344-399 (525) 3 KOG2886 consensus 32.8 28 0.00071 16.1 1.8 20 56-75 52-75 (209) 4 PRK11280 hypothetical protein; 32.2 19 0.00049 17.1 0.9 25 112-136 128-158 (179) 5 TIGR01758 MDH_euk_cyt malate d 30.4 13 0.00033 18.2 -0.2 36 127-163 21-56 (325) 6 COG3382 Solo B3/4 domain (OB-f 27.4 30 0.00076 15.9 1.2 37 97-133 80-116 (229) 7 COG5017 Uncharacterized conser 23.7 45 0.0011 14.8 1.5 34 101-135 13-50 (161) 8 COG3857 AddB ATP-dependent nuc 22.9 59 0.0015 14.0 4.6 103 138-242 148-261 (1108) 9 PRK11669 pbpG D-alanyl-D-alani 22.7 60 0.0015 14.0 2.4 27 222-248 268-294 (308) 10 TIGR00452 TIGR00452 methyltran 21.3 31 0.00078 15.8 0.3 22 222-243 254-275 (316) 11 pfam07519 Tannase Tannase and 21.2 19 0.00049 17.1 -0.8 23 223-245 392-414 (451) No 1 >pfam08549 SWI-SNF_Ssr4 Fungal domain of unknown function (DUF1750). This is a fungal domain of unknown function. Probab=48.43 E-value=7.9 Score=19.56 Aligned_cols=17 Identities=47% Similarity=0.890 Sum_probs=8.9 Q ss_pred ECCCCCCCCCHHHHHHH Q ss_conf 35688454414455446 Q gi|254781189|r 207 VDPLTPRDISFTQYEKH 223 (252) Q Consensus 207 vdpltprdisftqyekh 223 (252) .|-||||+||...|..| T Consensus 280 ~D~lTPREIS~~RYqqh 296 (669) T pfam08549 280 MDLLTPREVSKMRYQQH 296 (669) T ss_pred HCCCCHHHHHHHHHHHH T ss_conf 20468888889988887 No 2 >KOG1253 consensus Probab=33.14 E-value=20 Score=17.05 Aligned_cols=47 Identities=30% Similarity=0.523 Sum_probs=26.5 Q ss_pred EEEEECCCCCHHEE--CHHHHHHHHHCCHH-------HHHCCHHHHHHHHHHHHHC Q ss_conf 56660289301102--04888877607145-------5422525787765311100 Q gi|254781189|r 121 EVIYNFGGPMYGVI--VPDFIHDLLDIPEE-------KRRLNTSYLTYVDRGLLDV 167 (252) Q Consensus 121 eviynfggpmygvi--vpdfihdlldipee-------krrlntsyltyvdrglldv 167 (252) .--|+.|||||.-- -++|+.++|.+-.+ -.+.--.-|++++.-|.|| T Consensus 344 g~~~~l~GP~W~~PlHd~~fv~~mL~~~k~~~~~~~~t~kri~g~L~~~~eeL~dv 399 (525) T KOG1253 344 GKRLHLGGPMWSGPLHDAEFVTEMLEIAKEVSEDTYGTDKRLSGMLELVDEELPDV 399 (525) T ss_pred CCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCHHHHHEEHHHCCCCCCCCC T ss_conf 52100267544675456799999999864277321220021100020154003677 No 3 >KOG2886 consensus Probab=32.76 E-value=28 Score=16.09 Aligned_cols=20 Identities=50% Similarity=0.793 Sum_probs=15.9 Q ss_pred HHHHHHHHHHHHHHH----HHHHH Q ss_conf 999998866669999----86088 Q gi|254781189|r 56 YLWVIFVSSLSLAIV----AFGVE 75 (252) Q Consensus 56 ylwvifvsslslaiv----afgve 75 (252) -.||-|||+++|+-+ .||+- T Consensus 52 qvWvTFVSg~~L~~sLprh~FG~v 75 (209) T KOG2886 52 QVWVTFVSGYVLASSLPRHQFGVV 75 (209) T ss_pred HHHHHHHHHHHHHHHCCHHHHHHH T ss_conf 246678868999986226677677 No 4 >PRK11280 hypothetical protein; Provisional Probab=32.22 E-value=19 Score=17.09 Aligned_cols=25 Identities=32% Similarity=0.666 Sum_probs=17.1 Q ss_pred HHHHHCCC------EEEEEECCCCCHHEECH Q ss_conf 76520231------35666028930110204 Q gi|254781189|r 112 ACLTQYDR------YEVIYNFGGPMYGVIVP 136 (252) Q Consensus 112 acltqydr------yeviynfggpmygvivp 136 (252) -|-|+||. |.|.|.|+|-.|-+-.+ T Consensus 128 rC~tv~~~~~r~vGYDVTY~y~Gq~ytvRMd 158 (179) T PRK11280 128 RCKTVYDKSEKMLGYDVTYKIGDQQGKIRMD 158 (179) T ss_pred EEEEECCEEEEEEEEEEEEEECCEEEEEEEC T ss_conf 8730210014674038999989998789817 No 5 >TIGR01758 MDH_euk_cyt malate dehydrogenase, NAD-dependent; InterPro: IPR011274 This entry represents the NAD-dependent cytosolic malate dehydrogenase from eukaryotes. The enzyme from pig has been studied by X-ray crystallography ; GO: 0030060 L-malate dehydrogenase activity, 0006108 malate metabolic process. Probab=30.40 E-value=13 Score=18.19 Aligned_cols=36 Identities=36% Similarity=0.454 Sum_probs=26.8 Q ss_pred CCCCHHEECHHHHHHHHHCCHHHHHCCHHHHHHHHHH Q ss_conf 8930110204888877607145542252578776531 Q gi|254781189|r 127 GGPMYGVIVPDFIHDLLDIPEEKRRLNTSYLTYVDRG 163 (252) Q Consensus 127 ggpmygvivpdfihdlldipeekrrlntsyltyvdrg 163 (252) +|-|.|-=-|= |-.|||||+-+.-|+---.--+|-. T Consensus 21 ~G~~lG~dQ~~-~LhLLDIp~a~~~L~Gv~mEL~DcA 56 (325) T TIGR01758 21 RGEMLGKDQPI-ILHLLDIPPAVKVLEGVVMELVDCA 56 (325) T ss_pred CCEEECCCCCE-EECCCCHHHHHHHHHHHHHHHHHHH T ss_conf 67420898600-1002481688875101367665201 No 6 >COG3382 Solo B3/4 domain (OB-fold DNA/RNA-binding) of Phe-aaRS-beta [General function prediction only] Probab=27.40 E-value=30 Score=15.90 Aligned_cols=37 Identities=22% Similarity=0.274 Sum_probs=15.5 Q ss_pred CCCHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHE Q ss_conf 2205678887655557652023135666028930110 Q gi|254781189|r 97 EDSASERLVKATEACACLTQYDRYEVIYNFGGPMYGV 133 (252) Q Consensus 97 edsaserlvkateacacltqydryeviynfggpmygv 133 (252) .-+++|-|.|-----.-+...+----+||+-.--|+| T Consensus 80 ~r~S~EALlrRv~kg~~lp~InpvVD~yNa~Sl~y~v 116 (229) T COG3382 80 TRPSAEALLRRVLKGNSLPRINPVVDIYNAASLKYAV 116 (229) T ss_pred CCCCHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHCC T ss_conf 8972999999987489866656255410577776248 No 7 >COG5017 Uncharacterized conserved protein [Function unknown] Probab=23.68 E-value=45 Score=14.78 Aligned_cols=34 Identities=35% Similarity=0.464 Sum_probs=27.5 Q ss_pred HHHHHHHHHHHHHHHHCCCEEEEEECCC----CCHHEEC Q ss_conf 6788876555576520231356660289----3011020 Q gi|254781189|r 101 SERLVKATEACACLTQYDRYEVIYNFGG----PMYGVIV 135 (252) Q Consensus 101 serlvkateacacltqydryeviynfgg----pmygviv 135 (252) -.|+|+-.| |-|++++---++|-.||. ||-|.-| T Consensus 13 f~rlv~k~e-~~el~~~i~e~lIvQyGn~d~kpvagl~v 50 (161) T COG5017 13 FNRLVLKIE-VLELTELIQEELIVQYGNGDIKPVAGLRV 50 (161) T ss_pred HHHHHHHHH-HHHHHHHHHHHEEEEECCCCCCCCCCCEE T ss_conf 888970498-99999875553365535887444555278 No 8 >COG3857 AddB ATP-dependent nuclease, subunit B [DNA replication, recombination, and repair] Probab=22.87 E-value=59 Score=14.03 Aligned_cols=103 Identities=27% Similarity=0.193 Sum_probs=65.6 Q ss_pred HHHHHHHCCHHHHHC-CH------HHHHH-HHHHHHHCCCCCCCEEECCCCCCCHHHHHHHCHHHHHHHH-HHHHHHEEC Q ss_conf 888776071455422-52------57877-6531110026688635457658517888643878999997-433533235 Q gi|254781189|r 138 FIHDLLDIPEEKRRL-NT------SYLTY-VDRGLLDVRSSETPVVYDNKYRPSAEAMRTICPTKLMKIF-EDTISLYVD 208 (252) Q Consensus 138 fihdlldipeekrrl-nt------sylty-vdrglldvrssetpvvydnkyrpsaeamrticptklmkif-edtislyvd 208 (252) -.||++-|-|+.-+. +. .+|++ .|++..+---..+.+|-|.-+|=|++--|-| +.||+-- +-||+||.| T Consensus 148 kl~dl~liyee~~~~l~~~~l~~ed~l~~lad~~~~s~~L~~~~IvIDGFt~FS~~E~~vI--e~L~~~~~~v~I~lt~d 225 (1108) T COG3857 148 KLHDLSLIYEEFEANLYNNYLDPEDSLSRLADKIKKSEQLKQAAIVIDGFTRFSPEEYRVI--ELLMKKCARVTIGLTAD 225 (1108) T ss_pred HHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCHHHCCCEEEEECCCCCCHHHHHHH--HHHHHCCCEEEEEEECC T ss_conf 3443999999999988746688277899998726331655463499815444788899999--99984185599997337 Q ss_pred C--CCCCCCCHHHHHHHHHHHHHHHHCCCCCHHHHH Q ss_conf 6--884544144554467789879860874235788 Q gi|254781189|r 209 P--LTPRDISFTQYEKHACALVNWLEKGKFNEMSIA 242 (252) Q Consensus 209 p--ltprdisftqyekhacalvnwlekgkfnemsia 242 (252) - .|..-..+.-|....-.+-.-+++++-+.++|. T Consensus 226 ~~~y~~~~~~~~if~~s~~~l~~L~~~a~~~~i~~~ 261 (1108) T COG3857 226 KKAYTKGSEEGNIFRASKKTLFRLLSKAEELNISIE 261 (1108) T ss_pred CHHHCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCH T ss_conf 222105743223989769999999988631466410 No 9 >PRK11669 pbpG D-alanyl-D-alanine endopeptidase; Provisional Probab=22.67 E-value=60 Score=14.01 Aligned_cols=27 Identities=26% Similarity=0.564 Sum_probs=20.1 Q ss_pred HHHHHHHHHHHCCCCCHHHHHHHHHHH Q ss_conf 467789879860874235788887423 Q gi|254781189|r 222 KHACALVNWLEKGKFNEMSIARKAFNR 248 (252) Q Consensus 222 khacalvnwlekgkfnemsiarkafnr 248 (252) +.|-.|.+|+|.||.....-+-+++.+ T Consensus 268 ~da~rl~~w~~~~~~~~~~~~~~~y~~ 294 (308) T PRK11669 268 ADASRLRTWIETGKVTPVPAAALSYKK 294 (308) T ss_pred HHHHHHHHHHHHCCCCCCHHHHHHHHH T ss_conf 999999999986897866178998888 No 10 >TIGR00452 TIGR00452 methyltransferase, putative; InterPro: IPR010017 Methyl transfer from the ubiquitous S-adenosyl-L-methionine (AdoMet) to either nitrogen, oxygen or carbon atoms is frequently employed in diverse organisms ranging from bacteria to plants and mammals. The reaction is catalyzed by methyltransferases (Mtases) and modifies DNA, RNA, proteins and small molecules, such as catechol for regulatory purposes. The various aspects of the role of DNA methylation in prokaryotic restriction-modification systems and in a number of cellular processes in eukaryotes including gene regulation and differentiation is well documented. Three classes of DNA Mtases transfer the methyl group from AdoMet to the target base to form either N-6-methyladenine, or N-4-methylcytosine, or C-5- methylcytosine. In C-5-cytosine Mtases, ten conserved motifs are arranged in the same order . Motif I (a glycine-rich or closely related consensus sequence; FAGxGG in M.HhaI ), shared by other AdoMet-Mtases , is part of the cofactor binding site and motif IV (PCQ) is part of the catalytic site. In contrast, sequence comparison among N-6-adenine and N-4-cytosine Mtases indicated two of the conserved segments , although more conserved segments may be present. One of them corresponds to motif I in C-5-cytosine Mtases, and the other is named (D/N/S)PP(Y/F). Crystal structures are known for a number of Mtases , , , . The cofactor binding sites are almost identical and the essential catalytic amino acids coincide. The comparable protein folding and the existence of equivalent amino acids in similar secondary and tertiary positions indicate that many (if not all) AdoMet-Mtases have a common catalytic domain structure. This permits tertiary structure prediction of other DNA, RNA, protein, and small-molecule AdoMet-Mtases from their amino acid sequences . This is a set of proteobacterial proteins, which have homology in their central region to a large number of methyltransferases active on a variety of substrates. . Probab=21.28 E-value=31 Score=15.82 Aligned_cols=22 Identities=41% Similarity=0.625 Sum_probs=18.0 Q ss_pred HHHHHHHHHHHCCCCCHHHHHH Q ss_conf 4677898798608742357888 Q gi|254781189|r 222 KHACALVNWLEKGKFNEMSIAR 243 (252) Q Consensus 222 khacalvnwlekgkfnemsiar 243 (252) --+.||.||+||--|....|+. T Consensus 254 PSv~aL~~Wl~kvGF~~~~i~~ 275 (316) T TIGR00452 254 PSVSALKNWLEKVGFENVRILD 275 (316) T ss_pred HHHHHHHHHHHHCCCCEEEEEE T ss_conf 2389998888744953147875 No 11 >pfam07519 Tannase Tannase and feruloyl esterase. This family includes fungal tannase and feruloyl esterase. It also includes several bacterial homologues of unknown function. Probab=21.23 E-value=19 Score=17.13 Aligned_cols=23 Identities=30% Similarity=0.577 Sum_probs=18.4 Q ss_pred HHHHHHHHHHCCCCCHHHHHHHH Q ss_conf 67789879860874235788887 Q gi|254781189|r 223 HACALVNWLEKGKFNEMSIARKA 245 (252) Q Consensus 223 hacalvnwlekgkfnemsiarka 245 (252) ---||++|.|+|+--|.-++.+. T Consensus 392 ~l~aL~~WVE~G~aP~~l~at~~ 414 (451) T pfam07519 392 NLTAMVDWVENGNAPSRIVATDA 414 (451) T ss_pred HHHHHHHHHHCCCCCCEEEEEEE T ss_conf 89999999838979880688883 Done!