Query gi|254781226|ref|YP_003065639.1| hypothetical protein CLIBASIA_05665 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 129 No_of_seqs 67 out of 69 Neff 3.8 Searched_HMMs 39220 Date Mon May 30 07:27:26 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254781226.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK11020 hypothetical protein; 79.0 3.1 7.9E-05 20.3 3.9 32 94-125 28-59 (118) 2 pfam10231 DUF2315 Uncharacteri 26.3 44 0.0011 13.8 6.5 42 45-86 82-123 (127) 3 pfam08381 DZC Disease resistan 23.6 49 0.0013 13.5 2.0 14 49-62 21-34 (36) 4 cd01811 OASL_repeat1 OASL_repe 21.3 44 0.0011 13.8 1.1 18 91-108 15-32 (80) 5 TIGR02017 hutG_amidohyd N-form 20.5 56 0.0014 13.2 1.5 13 107-119 125-137 (269) 6 pfam08898 DUF1843 Domain of un 17.1 69 0.0018 12.7 2.5 19 100-118 34-52 (53) 7 pfam11460 DUF3007 Protein of u 16.0 74 0.0019 12.5 2.9 42 67-114 55-103 (104) 8 COG5509 Uncharacterized small 14.9 78 0.002 12.3 1.9 16 104-119 32-47 (65) 9 pfam07462 MSP1_C Merozoite sur 11.4 1E+02 0.0026 11.7 2.4 24 96-119 46-69 (574) 10 pfam01481 Arteri_nucleo Arteri 11.2 1E+02 0.0026 11.7 1.7 20 3-22 59-78 (116) No 1 >PRK11020 hypothetical protein; Provisional Probab=78.96 E-value=3.1 Score=20.32 Aligned_cols=32 Identities=31% Similarity=0.399 Sum_probs=26.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 30345556799999999899875422399974 Q gi|254781226|r 94 APEEPMGMTKAEIEALEREIARLKAKPIEELI 125 (129) Q Consensus 94 ~~~~~~~~~~~e~~~~~~e~~~~~~~~~~~~~ 125 (129) ...++|.+...|+++|+++|++|+.+--..|- T Consensus 28 gd~~~i~qf~~E~~~l~k~I~~lk~~q~~~ls 59 (118) T PRK11020 28 GDADKIAQFEKEKATLEKEIARLKEVQSQKLS 59 (118) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 78899999999999999999999889999988 No 2 >pfam10231 DUF2315 Uncharacterized conserved protein (DUF2315). This is a family of small conserved proteins found from worms to humans. The function is not known. Probab=26.30 E-value=44 Score=13.78 Aligned_cols=42 Identities=24% Similarity=0.318 Sum_probs=18.6 Q ss_pred HHHHHHHHHHHCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHH Q ss_conf 999999998738999999999999947999999999999987 Q gi|254781226|r 45 IRESYHQYYLKNKDKYREYKRRYYLKNRDKMREKARQSYRKL 86 (129) Q Consensus 45 Ike~~Keyy~kNKEki~e~~rkYre~NKEKIre~~Re~y~k~ 86 (129) ..+-.+.+-.+|......|+++||..|-.-|---.+.-.+++ T Consensus 82 MseFYK~FLd~n~~~h~~YN~eWYrrNf~i~~la~~V~l~rl 123 (127) T pfam10231 82 MSEFYKEFLDKNWRIHMMYNLEWYRRNFDITTLAAKVQLQRL 123 (127) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999988999999879999988999999999999999 No 3 >pfam08381 DZC Disease resistance/zinc finger/chromosome condensation-like region. This family features a short region, approximately 35 residues in length, which is found near the C-terminus in a number of plant proteins; it is repeated up to three times in some members. These proteins are annotated as being involved in disease resistance and in the regulation of chromosome condensation, and contain domains with varied functions, such as TIR (pfam01582), PH (Smart:SM00233), and FYVE (pfam01363). Probab=23.60 E-value=49 Score=13.48 Aligned_cols=14 Identities=21% Similarity=0.482 Sum_probs=8.5 Q ss_pred HHHHHHHCHHHHHH Q ss_conf 99998738999999 Q gi|254781226|r 49 YHQYYLKNKDKYRE 62 (129) Q Consensus 49 ~Keyy~kNKEki~e 62 (129) ...||.+|.++|-+ T Consensus 21 A~~WW~EN~~rv~~ 34 (36) T pfam08381 21 AQRWWEENKERVYE 34 (36) T ss_pred HHHHHHHHHHHHHH T ss_conf 99999997999986 No 4 >cd01811 OASL_repeat1 OASL_repeat1 (2'-5' oligoadenylate synthetase-like protein) belongs to a family of interferon-induced 2'-5' oligoadenylate synthetases which are important for the antiviral activity of interferons. While each member of this famliy has a conserved N-terminal OAS catalytic domain, only OASL has two tandem ubiquitin-like repeats located at the C-terminus and this CD represents one of those repeats. Probab=21.32 E-value=44 Score=13.77 Aligned_cols=18 Identities=22% Similarity=0.363 Sum_probs=14.7 Q ss_pred HHCCCHHHHHHHHHHHHH Q ss_conf 320303455567999999 Q gi|254781226|r 91 SWIAPEEPMGMTKAEIEA 108 (129) Q Consensus 91 ~~~~~~~~~~~~~~e~~~ 108 (129) -||||-+||-+++.++.- T Consensus 15 l~VnPY~PI~kiK~ki~~ 32 (80) T cd01811 15 LRVNPYSPIRKIKEKIRR 32 (80) T ss_pred EEECCCCHHHHHHHHHHH T ss_conf 996787779999999998 No 5 >TIGR02017 hutG_amidohyd N-formylglutamate amidohydrolase; InterPro: IPR010247 In some species, histidine is converted to via urocanate and then formimino-L-glutamate to glutamate in four steps, where the fourth step is conversion of N-formimino-L-glutamate to L-glutamate and formamide. In others, that pathway from formimino-L-glutamate may differ, with the next enzyme being formiminoglutamate hydrolase (HutF) yielding N-formyl-L-glutamate. This entry represents the enzyme N-formylglutamate deformylase, also called N-formylglutamate amidohydrolase, which then produces glutamate.. Probab=20.48 E-value=56 Score=13.17 Aligned_cols=13 Identities=54% Similarity=0.746 Sum_probs=10.5 Q ss_pred HHHHHHHHHHHHH Q ss_conf 9999899875422 Q gi|254781226|r 107 EALEREIARLKAK 119 (129) Q Consensus 107 ~~~~~e~~~~~~~ 119 (129) .||-.||+||++- T Consensus 125 ~aL~~Ei~RLra~ 137 (269) T TIGR02017 125 AALRAEIERLRAQ 137 (269) T ss_pred HHHHHHHHHHHHH T ss_conf 9999999998843 No 6 >pfam08898 DUF1843 Domain of unknown function (DUF1843). This domain is found at the C-terminus of a family of proteins that are functionally uncharacterized. The presumed domain is about 60 amino acid residues in length and is found independently in some proteins. Probab=17.13 E-value=69 Score=12.66 Aligned_cols=19 Identities=47% Similarity=0.618 Sum_probs=14.0 Q ss_pred HHHHHHHHHHHHHHHHHHH Q ss_conf 5679999999989987542 Q gi|254781226|r 100 GMTKAEIEALEREIARLKA 118 (129) Q Consensus 100 ~~~~~e~~~~~~e~~~~~~ 118 (129) +.......+|+-|||||-. T Consensus 34 ~~l~~Al~~L~~EIarlE~ 52 (53) T pfam08898 34 GELAAALEKLKAEIARLER 52 (53) T ss_pred HHHHHHHHHHHHHHHHHHC T ss_conf 5799999999999998655 No 7 >pfam11460 DUF3007 Protein of unknown function (DUF3007). This is a family of uncharacterized proteins found in bacteria and eukaryotes. Probab=15.96 E-value=74 Score=12.49 Aligned_cols=42 Identities=21% Similarity=0.425 Sum_probs=26.3 Q ss_pred HHHHCHHHHHHHHHHHHHHHH-------HHHHHCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 999479999999999999874-------222320303455567999999998998 Q gi|254781226|r 67 YYLKNRDKMREKARQSYRKLY-------SKDSWIAPEEPMGMTKAEIEALEREIA 114 (129) Q Consensus 67 Yre~NKEKIre~~Re~y~k~~-------sk~~~~~~~~~~~~~~~e~~~~~~e~~ 114 (129) +|.-|.+.-=-..+++|++-+ ..++.. +++..|+.+|-.||. T Consensus 55 fRV~~g~MTy~~Qrk~Ye~a~~~~~~~~lqkR~e------~ls~eE~~~L~~eie 103 (104) T pfam11460 55 FRVVTGKMTYMQQRRDYREAYDALTNEELQKRFD------SLSPEELEALQAELE 103 (104) T ss_pred HHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH------HCCHHHHHHHHHHHC T ss_conf 6450687758999999999999876899999998------599999999999861 No 8 >COG5509 Uncharacterized small protein containing a coiled-coil domain [Function unknown] Probab=14.93 E-value=78 Score=12.34 Aligned_cols=16 Identities=50% Similarity=0.655 Sum_probs=10.6 Q ss_pred HHHHHHHHHHHHHHHH Q ss_conf 9999999899875422 Q gi|254781226|r 104 AEIEALEREIARLKAK 119 (129) Q Consensus 104 ~e~~~~~~e~~~~~~~ 119 (129) .-|-.|.-||+||++- T Consensus 32 eRIalLq~EIeRlkAe 47 (65) T COG5509 32 ERIALLQAEIERLKAE 47 (65) T ss_pred HHHHHHHHHHHHHHHH T ss_conf 9999999999999999 No 9 >pfam07462 MSP1_C Merozoite surface protein 1 (MSP1) C-terminus. This family represents the C-terminal region of merozoite surface protein 1 (MSP1) which are found in a number of Plasmodium species. MSP-1 is a 200-kDa protein expressed on the surface of the P. vivax merozoite. MSP-1 of Plasmodium species is synthesized as a high-molecular-weight precursor and then processed into several fragments. At the time of red cell invasion by the merozoite, only the 19-kDa C-terminal fragment (MSP-119), which contains two epidermal growth factor-like domains, remains on the surface. Antibodies against MSP-119 inhibit merozoite entry into red cells, and immunisation with MSP-119 protects monkeys from challenging infections. Hence, MSP-119 is considered a promising vaccine candidate. Probab=11.44 E-value=1e+02 Score=11.73 Aligned_cols=24 Identities=38% Similarity=0.385 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 345556799999999899875422 Q gi|254781226|r 96 EEPMGMTKAEIEALEREIARLKAK 119 (129) Q Consensus 96 ~~~~~~~~~e~~~~~~e~~~~~~~ 119 (129) +.|.+|+.-..++|..||+.||-+ T Consensus 46 e~I~~m~~~k~~aL~~EI~~LKkk 69 (574) T pfam07462 46 ELITALTETKVNALYLEIAHLKEL 69 (574) T ss_pred HHHHCCCHHHHHHHHHHHHHHHHH T ss_conf 998717999999999999999999 No 10 >pfam01481 Arteri_nucleo Arterivirus nucleocapsid protein. Probab=11.24 E-value=1e+02 Score=11.69 Aligned_cols=20 Identities=35% Similarity=0.617 Sum_probs=0.0 Q ss_pred HHHHCHHHHHHHHHHHHHHH Q ss_conf 23302330144239999999 Q gi|254781226|r 3 RKVLTPEERMLCRREYKRRY 22 (129) Q Consensus 3 r~~~~~~~r~~C~kE~~KkY 22 (129) |-.|++-||.+|..--+..+ T Consensus 59 RHhlt~~Er~lc~~siqtaF 78 (116) T pfam01481 59 RHHLTQQERQLCLSSIQTAF 78 (116) T ss_pred CCCCCHHHHHHHHHHHHHHH T ss_conf 01268989999999999999 Done!