Query gi|254780983|ref|YP_003065396.1| hypothetical protein CLIBASIA_04420 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 55 No_of_seqs 1 out of 3 Neff 1.0 Searched_HMMs 39220 Date Mon May 30 02:24:22 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780983.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 KOG4080 consensus 64.8 8.8 0.00023 20.1 3.5 33 4-36 106-138 (176) 2 KOG2023 consensus 60.1 5.9 0.00015 21.1 1.9 22 5-26 688-709 (885) 3 pfam12356 DUF3643 Protein of u 45.4 23 0.0006 17.8 3.0 42 2-50 81-122 (165) 4 KOG3918 consensus 34.4 31 0.00079 17.1 2.2 27 8-34 44-70 (129) 5 TIGR00284 TIGR00284 dihydropte 31.7 18 0.00047 18.3 0.7 33 3-49 379-411 (529) 6 COG1334 FlaG Uncharacterized f 25.5 44 0.0011 16.2 1.8 33 12-55 52-84 (120) 7 PRK07738 flagellar protein Fla 21.3 54 0.0014 15.8 1.6 22 34-55 56-77 (113) 8 pfam03646 FlaG FlaG protein. A 19.2 60 0.0015 15.5 1.4 32 13-55 42-73 (108) 9 PHA02108 hypothetical protein 18.3 43 0.0011 16.3 0.5 20 14-33 9-30 (48) 10 pfam03584 Herpes_ICP4_N Herpes 18.3 56 0.0014 15.7 1.1 13 3-16 91-103 (175) No 1 >KOG4080 consensus Probab=64.83 E-value=8.8 Score=20.08 Aligned_cols=33 Identities=30% Similarity=0.563 Sum_probs=29.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCE Q ss_conf 689999898899999999999963475432441 Q gi|254780983|r 4 SALCKFCTQTVIGYIAEFMKMMFIKNPYSFKNY 36 (55) Q Consensus 4 salckfctqtvigyiaefmkmmfiknpysfkny 36 (55) ..||-+|-+.|---.+|.-+-|-|++||.++-- T Consensus 106 ~~LC~~Cy~kV~ket~ei~~k~~iq~~~~~e~~ 138 (176) T KOG4080 106 HTLCDYCYAKVHKETSEIKKKMMIQEPYVGEKQ 138 (176) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCC T ss_conf 520799999999999999998731488887867 No 2 >KOG2023 consensus Probab=60.09 E-value=5.9 Score=21.05 Aligned_cols=22 Identities=32% Similarity=0.552 Sum_probs=20.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH Q ss_conf 8999989889999999999996 Q gi|254780983|r 5 ALCKFCTQTVIGYIAEFMKMMF 26 (55) Q Consensus 5 alckfctqtvigyiaefmkmmf 26 (55) .+||+|.+.|++++++|++.+- T Consensus 688 Dltk~c~~~v~p~~~~fl~~lg 709 (885) T KOG2023 688 DLTKACFEHVIPNLADFLPILG 709 (885) T ss_pred HHHHHHHHHCCCHHHHHHHHHH T ss_conf 9998887751521888888875 No 3 >pfam12356 DUF3643 Protein of unknown function (DUF3643). This family of proteins is found in eukaryotes. Proteins in this family are typically between 217 and 4852 amino acids in length. There is a conserved TLA sequence motif. Probab=45.42 E-value=23 Score=17.77 Aligned_cols=42 Identities=29% Similarity=0.413 Sum_probs=26.9 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEEECCCCCCC Q ss_conf 2268999989889999999999996347543244145788733678862 Q gi|254780983|r 2 IDSALCKFCTQTVIGYIAEFMKMMFIKNPYSFKNYQARLEFSIPKDSND 50 (55) Q Consensus 2 idsalckfctqtvigyiaefmkmmfiknpysfknyqarlefsipkdsnd 50 (55) |||.||-+|--. -.|..-.|.-|=|+-| |-+.++ ||..|..| T Consensus 81 vDslLCs~C~I~-Pe~F~~LL~~mGI~~p----~~a~~~--sisDD~K~ 122 (165) T pfam12356 81 VDSLLCSLCYIH-PEYFSLLLVWMGIKLE----PAASSL--SSTDDNKK 122 (165) T ss_pred HHHHHHHHHHCC-HHHHHHHHHHHCCCCC----CHHCCC--CCCCCCCC T ss_conf 999999999617-9999999999626378----321145--44555421 No 4 >KOG3918 consensus Probab=34.37 E-value=31 Score=17.11 Aligned_cols=27 Identities=37% Similarity=0.572 Sum_probs=21.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 998988999999999999634754324 Q gi|254780983|r 8 KFCTQTVIGYIAEFMKMMFIKNPYSFK 34 (55) Q Consensus 8 kfctqtvigyiaefmkmmfiknpysfk 34 (55) ...-||||..|+-.....||.+|+.|- T Consensus 44 DivlQTvisli~~iYgi~~iag~Fk~i 70 (129) T KOG3918 44 DIVLQTVISLIALIYGISFIAGPFKFI 70 (129) T ss_pred HHHHHHHHHHHHHHHHHHHHCCHHHHH T ss_conf 999999999999998579870407762 No 5 >TIGR00284 TIGR00284 dihydropteroate synthase-related protein; InterPro: IPR005236 The proteins of this family have been found so far only in the four archaeal species. The central region of the proteins shows considerable homology to the amino-terminal half of dihydropteroate synthases. . Probab=31.72 E-value=18 Score=18.34 Aligned_cols=33 Identities=48% Similarity=0.582 Sum_probs=25.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEEECCCCCC Q ss_conf 26899998988999999999999634754324414578873367886 Q gi|254780983|r 3 DSALCKFCTQTVIGYIAEFMKMMFIKNPYSFKNYQARLEFSIPKDSN 49 (55) Q Consensus 3 dsalckfctqtvigyiaefmkmmfiknpysfknyqarlefsipkdsn 49 (55) ||+-|+|.+.-+ ++--||||. ||-+||.|||-. T Consensus 379 ~s~K~~~S~~E~----~~A~km~s~----------A~k~nsLPkdig 411 (529) T TIGR00284 379 DSAKSKFSTKEV----AEAAKMISV----------AKKRNSLPKDIG 411 (529) T ss_pred CCCCCHHHHHHH----HHHHHHHHH----------HHHHCCCCCCHH T ss_conf 565322338999----999999999----------986357874100 No 6 >COG1334 FlaG Uncharacterized flagellar protein FlaG [Cell motility and secretion] Probab=25.48 E-value=44 Score=16.24 Aligned_cols=33 Identities=21% Similarity=0.439 Sum_probs=23.7 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCCEEEEEEEECCCCCCCEEEEC Q ss_conf 88999999999999634754324414578873367886236869 Q gi|254780983|r 12 QTVIGYIAEFMKMMFIKNPYSFKNYQARLEFSIPKDSNDLYIKL 55 (55) Q Consensus 12 qtvigyiaefmkmmfiknpysfknyqarlefsipkdsndlyikl 55 (55) +-.+.-|-+|||- .+..|+||+-+++|.+|+|. T Consensus 52 ~~~v~~ink~~k~-----------~nt~l~F~~dd~lg~~vVkI 84 (120) T COG1334 52 ALIVEDINKLLKS-----------LNTHLNFSYDDELGELVVKI 84 (120) T ss_pred HHHHHHHHHHHHH-----------HCCCEEEEEECCCCCEEEEE T ss_conf 9999999999986-----------25760799733259479999 No 7 >PRK07738 flagellar protein FlaG; Provisional Probab=21.25 E-value=54 Score=15.76 Aligned_cols=22 Identities=27% Similarity=0.582 Sum_probs=18.5 Q ss_pred CCEEEEEEEECCCCCCCEEEEC Q ss_conf 4414578873367886236869 Q gi|254780983|r 34 KNYQARLEFSIPKDSNDLYIKL 55 (55) Q Consensus 34 knyqarlefsipkdsndlyikl 55 (55) ...+..|+|++-.+++.+|+|. T Consensus 56 ~~~~~~l~F~~~e~~~~~~VkV 77 (113) T PRK07738 56 EPSQTHLKFELHEKLNEYYVKV 77 (113) T ss_pred HHCCCCEEEEEECCCCCEEEEE T ss_conf 7227753899825668489999 No 8 >pfam03646 FlaG FlaG protein. Although important for flagella the exact function of this protein is unknown. Probab=19.19 E-value=60 Score=15.53 Aligned_cols=32 Identities=25% Similarity=0.538 Sum_probs=22.1 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCEEEEEEEECCCCCCCEEEEC Q ss_conf 8999999999999634754324414578873367886236869 Q gi|254780983|r 13 TVIGYIAEFMKMMFIKNPYSFKNYQARLEFSIPKDSNDLYIKL 55 (55) Q Consensus 13 tvigyiaefmkmmfiknpysfknyqarlefsipkdsndlyikl 55 (55) ..+--+-+||+. .+..|+|++-.+++.+++|. T Consensus 42 ~av~~ln~~~~~-----------~~~~l~F~~de~~~~~vVkV 73 (108) T pfam03646 42 EAVEKLNKFLQS-----------LNTNLEFSVDEDSGRVVVKV 73 (108) T ss_pred HHHHHHHHHHHH-----------CCCCEEEEEECCCCCEEEEE T ss_conf 999999999985-----------49862899856789689999 No 9 >PHA02108 hypothetical protein Probab=18.31 E-value=43 Score=16.29 Aligned_cols=20 Identities=40% Similarity=0.622 Sum_probs=14.0 Q ss_pred HHHHHHHHHH--HHHHCCCCCC Q ss_conf 9999999999--9963475432 Q gi|254780983|r 14 VIGYIAEFMK--MMFIKNPYSF 33 (55) Q Consensus 14 vigyiaefmk--mmfiknpysf 33 (55) -|||||.|-- -|++.|.|.. T Consensus 9 nigyiasf~lfy~my~g~ay~v 30 (48) T PHA02108 9 NIGYIASFGLFYAMYKGDAYFV 30 (48) T ss_pred EEHHHHHHHHHHHHCCCCEEEE T ss_conf 9624543566774004766898 No 10 >pfam03584 Herpes_ICP4_N Herpesvirus ICP4-like protein N-terminal region. The immediate-early protein ICP4 (infected-cell polypeptide 4) is required for efficient transcription of early and late viral genes and is thus essential for productive infection. ICP4 is a large phosphoprotein that binds DNA in a sequence specific manner as a homodimer. ICP4 represses transcription from LAT, ICP4 and ORF-P that have high-affinity a ICP4 binding site that spans the transcription initiation site. ICP4 proteins have two highly conserved regions, this family contains the N-terminal region that contains sites for DNA binding and homodimerization. Probab=18.26 E-value=56 Score=15.68 Aligned_cols=13 Identities=46% Similarity=0.761 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 26899998988999 Q gi|254780983|r 3 DSALCKFCTQTVIG 16 (55) Q Consensus 3 dsalckfctqtvig 16 (55) |.+||+|| |...| T Consensus 91 d~~L~~~c-qk~~~ 103 (175) T pfam03584 91 DQALNHIC-QKFRG 103 (175) T ss_pred HHHHHHHH-HHHCC T ss_conf 89999999-98537 Done!