Query gi|254780682|ref|YP_003065095.1| hypothetical protein CLIBASIA_02845 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 199 No_of_seqs 1 out of 3 Neff 1.0 Searched_HMMs 39220 Date Sun May 29 21:27:27 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780682.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 COG1795 Formaldehyde-activatin 28.3 14 0.00036 17.5 -0.3 31 141-171 45-75 (170) 2 pfam03759 PRONE PRONE (Plant-s 25.2 33 0.00084 15.3 1.1 19 39-57 178-196 (360) 3 TIGR01717 AMP-nucleosdse AMP n 17.8 69 0.0018 13.5 1.5 27 91-123 269-295 (486) 4 pfam10811 DUF2532 Protein of u 17.5 76 0.0019 13.2 2.7 51 3-87 2-52 (158) 5 TIGR02653 Lon_rel_chp conserve 16.3 51 0.0013 14.2 0.5 20 67-86 112-131 (677) 6 KOG3729 consensus 16.1 82 0.0021 13.0 2.0 23 74-96 166-188 (715) 7 TIGR02955 TMAO_TorT TMAO reduc 15.8 66 0.0017 13.6 1.0 27 36-66 224-250 (304) 8 KOG4780 consensus 14.7 89 0.0023 12.8 2.6 41 16-56 340-380 (467) 9 pfam03343 SART-1 SART-1 family 14.6 56 0.0014 14.0 0.4 31 52-83 145-177 (577) 10 pfam10832 DUF2559 Protein of u 13.5 52 0.0013 14.2 -0.1 14 82-95 10-23 (54) No 1 >COG1795 Formaldehyde-activating enzyme nesessary for methanogenesis [Energy production and conversion] Probab=28.25 E-value=14 Score=17.50 Aligned_cols=31 Identities=39% Similarity=0.645 Sum_probs=23.6 Q ss_pred ECCCCCCCCCCCEEECCCCCCCCCCCCCHHC Q ss_conf 0458988887511201125786112330102 Q gi|254780682|r 141 KIPLPNNLKPNVCVKEKKLIPPRKNINNLKD 171 (199) Q Consensus 141 kiplpnnlkpnvcvkekklipprkninnlkd 171 (199) ..||--.|+||+.+|--.||-|+-+|.|.-+ T Consensus 45 htPllavlrPNl~~KP~TLivpkvtI~~~~q 75 (170) T COG1795 45 HTPLLAVLRPNLAVKPPTLIVPKVTIKNMAQ 75 (170) T ss_pred CCCEEEEECCCCCCCCCEEEECCEEEEEHHH T ss_conf 9755888458877899668850425300666 No 2 >pfam03759 PRONE PRONE (Plant-specific Rop nucleotide exchanger). This is a functional guanine exchange factor (GEF) of plant Rho GTPase. Probab=25.21 E-value=33 Score=15.34 Aligned_cols=19 Identities=37% Similarity=0.377 Sum_probs=14.5 Q ss_pred HHHHHHHHHHHHCCCEEEE Q ss_conf 8899999997424672999 Q gi|254780682|r 39 LSHQIHKALMSISSGIILI 57 (199) Q Consensus 39 lshqihkalmsissgiili 57 (199) ..+||+||-|+|.+.+..- T Consensus 178 ~~nQIlKAAmAIN~~vL~E 196 (360) T pfam03759 178 CVNQILKAAMAINSQVLAE 196 (360) T ss_pred HHHHHHHHHHHHHHHHHHC T ss_conf 9999999999987888756 No 3 >TIGR01717 AMP-nucleosdse AMP nucleosidase; InterPro: IPR011271 This entry represents the AMP nucleosidase from proteobacteria but also including sequences from Corynebacterium, a Gram-positive organism. The Escherichia coli protein is the most well studied .. Probab=17.82 E-value=69 Score=13.48 Aligned_cols=27 Identities=41% Similarity=0.621 Sum_probs=19.9 Q ss_pred CCEEEEEECCCCCCCCCCCCCCCCHHHHHHHHH Q ss_conf 505888616665421074433310899999999 Q gi|254780682|r 91 NLQYSLINIPSQNKQESPKNANNNILDHIALLK 123 (199) Q Consensus 91 nlqyslinipsqnkqespknannnildhiallk 123 (199) .---.||||- -.|-||. +|-||+|.|. T Consensus 269 G~GITlvNIG-----VGPSNAK-tITDHLAVLR 295 (486) T TIGR01717 269 GDGITLVNIG-----VGPSNAK-TITDHLAVLR 295 (486) T ss_pred CCCEEEEECC-----CCCCCHH-HHHHHHHHCC T ss_conf 9932899607-----4886212-3541022118 No 4 >pfam10811 DUF2532 Protein of unknown function (DUF2532). This bacterial family of proteins has no known function. Probab=17.47 E-value=76 Score=13.23 Aligned_cols=51 Identities=41% Similarity=0.623 Sum_probs=36.4 Q ss_pred CEEEEEHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCEEEECCHHHHHHHH Q ss_conf 01243015567897865302554512565898878888999999974246729998034455663104521113331899 Q gi|254780682|r 3 STILFLIKHKLKLHKNIMSKGKINESFWYIRTLFPYLSHQIHKALMSISSGIILILSPTDCTANTANILIPSRKIIDYHR 82 (199) Q Consensus 3 stilflikhklklhknimskgkinesfwyirtlfpylshqihkalmsissgiililsptdctantanilipsrkiidyhr 82 (199) +.|.||| ..|.-|+|..|-+|..-|.|-..|. +.-|.-.|||+ -|. T Consensus 2 lii~fli---------lvs~vkvnad~n~iq~~f~~q~eq~-----------~i~lpwsdct~--------------ihk 47 (158) T pfam10811 2 LIICFLI---------LVSAVKVNADFNNIQDNFEYQEEQL-----------TIELPWSDCTE--------------IHK 47 (158) T ss_pred EEEEEEE---------EEEHHHHCCCHHHHHHHHHHHHHHH-----------HEECCHHHHHH--------------HHH T ss_conf 1899770---------1004220564445677776544554-----------01367266689--------------999 Q ss_pred HHHHH Q ss_conf 86552 Q gi|254780682|r 83 LLEQK 87 (199) Q Consensus 83 lleqk 87 (199) +||.| T Consensus 48 ~leek 52 (158) T pfam10811 48 LLEEK 52 (158) T ss_pred HHHHH T ss_conf 99987 No 5 >TIGR02653 Lon_rel_chp conserved hypothetical protein; InterPro: IPR013473 This entry describes a protein family of unknown function, about 690 residues in length, in which some members show C-terminal sequence similarity to IPR008269 from INTERPRO, which is the Lon protease C-terminal proteolytic domain, from MEROPS family S16. However, the annotated catalytic sites of Escherichia coli Lon protease are not conserved in members of this family. Members have a motif GP[RK][GS]TGKS, similar to the ATP-binding P-loop motif GxxGxGK[ST].. Probab=16.29 E-value=51 Score=14.22 Aligned_cols=20 Identities=40% Similarity=0.760 Sum_probs=16.1 Q ss_pred CCEEEECCHHHHHHHHHHHH Q ss_conf 31045211133318998655 Q gi|254780682|r 67 TANILIPSRKIIDYHRLLEQ 86 (199) Q Consensus 67 tanilipsrkiidyhrlleq 86 (199) .-+++||++-|.||.|||.- T Consensus 112 ~K~~~ip~~~v~~ydkLL~G 131 (677) T TIGR02653 112 IKDLLIPERYVRDYDKLLTG 131 (677) T ss_pred CCCCCCCHHHHHHHHHHCCC T ss_conf 42112897898655322068 No 6 >KOG3729 consensus Probab=16.12 E-value=82 Score=13.04 Aligned_cols=23 Identities=13% Similarity=0.112 Sum_probs=10.1 Q ss_pred CHHHHHHHHHHHHHCCCCCEEEE Q ss_conf 11333189986552148505888 Q gi|254780682|r 74 SRKIIDYHRLLEQKKNHNLQYSL 96 (199) Q Consensus 74 srkiidyhrlleqkknhnlqysl 96 (199) .|--+||--+-----+|+..-.+ T Consensus 166 HRSHlDYlliTwIL~~~~Ik~P~ 188 (715) T KOG3729 166 HRSHLDYLLITWILWHFGIKLPH 188 (715) T ss_pred CHHHHHHHHHHHHHHHCCCCCCE T ss_conf 14344689999999827867850 No 7 >TIGR02955 TMAO_TorT TMAO reductase system periplasmic protein TorT; InterPro: IPR014301 Members of this entry are the periplasmic protein TorT that, together with the TorS/TorR histidine kinase/response regulator system, regulates expression of the torCAD operon for trimethylamine N-oxide reductase (TMAO reductase). It appears to bind an inducer for TMAO reductase and shows homology to a periplasmic D-ribose binding protein.. Probab=15.77 E-value=66 Score=13.59 Aligned_cols=27 Identities=33% Similarity=0.559 Sum_probs=15.0 Q ss_pred HHHHHHHHHHHHHHHCCCEEEEECCCCCCCC Q ss_conf 7888899999997424672999803445566 Q gi|254780682|r 36 FPYLSHQIHKALMSISSGIILILSPTDCTAN 66 (199) Q Consensus 36 fpylshqihkalmsissgiililsptdctan 66 (199) =-||||++-..|- -|-| +.||||--+- T Consensus 224 S~YLSH~VYRGLk---R~kv-lfa~~D~MV~ 250 (304) T TIGR02955 224 STYLSHGVYRGLK---RGKV-LFAPTDQMVL 250 (304) T ss_pred EEECCHHHHHHHH---CCCE-EECCCHHHHH T ss_conf 7513556863310---4735-6657526899 No 8 >KOG4780 consensus Probab=14.70 E-value=89 Score=12.82 Aligned_cols=41 Identities=27% Similarity=0.100 Sum_probs=26.4 Q ss_pred HHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEE Q ss_conf 78653025545125658988788889999999742467299 Q gi|254780682|r 16 HKNIMSKGKINESFWYIRTLFPYLSHQIHKALMSISSGIIL 56 (199) Q Consensus 16 hknimskgkinesfwyirtlfpylshqihkalmsissgiil 56 (199) .+++.+--+--.++|--......++||+|++.|-.++.-++ T Consensus 340 ~~~~l~vl~~~~~y~~~Ee~L~~~~~~~~~~~~~~~~~~~v 380 (467) T KOG4780 340 RKKILSVLYHANSYEDGEEILAAESYERHKAQDVCSSSEIV 380 (467) T ss_pred HHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHC T ss_conf 46787622010011327999999987556666511166641 No 9 >pfam03343 SART-1 SART-1 family. SART-1 is a protein involved in cell cycle arrest and pre-mRNA splicing. It has been shown to be a component of U4/U6 x U5 tri-snRNP complex in human, Schizosaccharomyces pombe and Saccharomyces cerevisiae. SART-1 is a known tumour antigen in a range of cancers recognized by T cells. Probab=14.64 E-value=56 Score=13.99 Aligned_cols=31 Identities=16% Similarity=0.238 Sum_probs=16.6 Q ss_pred CCEEEEECCCCCCCCCC--EEEECCHHHHHHHHH Q ss_conf 67299980344556631--045211133318998 Q gi|254780682|r 52 SGIILILSPTDCTANTA--NILIPSRKIIDYHRL 83 (199) Q Consensus 52 sgiililsptdctanta--nilipsrkiidyhrl 83 (199) .++||.|--++...+.. .+|+- -.+.+-.++ T Consensus 145 ~~~ILTLKD~~VL~deee~D~LeN-v~L~e~ek~ 177 (577) T pfam03343 145 EDVILTLKDTGVLEDEDEGDELEN-VELVEKEKD 177 (577) T ss_pred CCEEEEECCCCCCCCCCCCCEEEC-HHHHHHHHH T ss_conf 864887146666766555564431-567788999 No 10 >pfam10832 DUF2559 Protein of unknown function (DUF2559). This family of proteins appear to be restricted to Enterobacteriaceae. The sequences are annotated as yhfG however currently no function is known. Probab=13.55 E-value=52 Score=14.20 Aligned_cols=14 Identities=50% Similarity=0.693 Sum_probs=0.0 Q ss_pred HHHHHHCCCCCEEE Q ss_conf 98655214850588 Q gi|254780682|r 82 RLLEQKKNHNLQYS 95 (199) Q Consensus 82 rlleqkknhnlqys 95 (199) ||.||.+|.|.|-| T Consensus 10 rlweq~rn~nfqas 23 (54) T pfam10832 10 RLWEAQRNQNFQAS 23 (54) T ss_pred HHHHHHHCCCHHHH T ss_conf 99999863246877 Done!