Query gi|254780951|ref|YP_003065364.1| hypothetical protein CLIBASIA_04250 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 49 No_of_seqs 1 out of 3 Neff 1.0 Searched_HMMs 39220 Date Mon May 30 01:25:18 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780951.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 TIGR01781 Trep_dent_lipo Trepo 90.3 0.12 3.1E-06 28.6 1.2 15 8-24 10-24 (463) 2 pfam09710 Trep_dent_lipo Trepo 82.6 0.56 1.4E-05 25.5 1.1 13 9-23 8-20 (394) 3 PRK11671 mltC murein transglyc 81.5 0.82 2.1E-05 24.7 1.7 23 11-33 8-31 (360) 4 TIGR03525 GldK gliding motilit 79.4 1.1 2.7E-05 24.1 1.7 28 9-36 5-32 (449) 5 pfam02382 RTX RTX N-terminal d 73.3 1.9 5E-05 22.9 1.6 22 4-25 305-326 (653) 6 TIGR03530 GldJ_short gliding m 62.9 4.1 0.00011 21.3 1.6 23 7-29 3-25 (402) 7 pfam07790 DUF1628 Protein of u 56.0 7.4 0.00019 20.1 1.9 19 3-21 4-22 (78) 8 TIGR01667 YCCS_YHJK integral m 44.8 12 0.00031 19.1 1.5 23 2-24 441-465 (724) 9 PRK10802 peptidoglycan-associa 43.2 24 0.00062 17.7 2.8 27 10-36 10-37 (173) 10 TIGR03302 OM_YfiO outer membra 43.2 17 0.00044 18.4 2.1 21 7-27 3-23 (235) 11 TIGR03659 IsdE heme ABC transp 30.8 42 0.0011 16.6 2.4 23 5-27 3-25 (289) 12 TIGR03524 GldJ gliding motilit 30.7 39 0.00098 16.7 2.2 21 5-25 6-26 (559) 13 PRK10866 outer membrane protei 25.6 56 0.0014 16.0 2.3 20 7-26 6-25 (243) 14 pfam05590 DUF769 Xylella fasti 23.0 46 0.0012 16.4 1.5 25 1-25 1-25 (279) 15 pfam11873 DUF3393 Domain of un 22.6 39 0.001 16.7 1.1 15 9-23 4-18 (200) 16 PRK02898 cobalt transport prot 22.5 65 0.0017 15.6 2.2 31 3-33 7-37 (97) No 1 >TIGR01781 Trep_dent_lipo Treponema denticola clustered lipoprotein; InterPro: IPR010103 This entry represents a family of six predicted lipoproteins from a region of about 20 tandemly arranged genes in the Treponema denticola genome. Two other neighbouring genes share the lipoprotein signal peptide region but do not show more extensive homology. The function of this locus is unknown.. Probab=90.31 E-value=0.12 Score=28.60 Aligned_cols=15 Identities=60% Similarity=1.083 Sum_probs=11.5 Q ss_pred HHHHHHHHHHHHHCCCC Q ss_conf 89999761356311363 Q gi|254780951|r 8 TVLMLAISPLLFSCSSK 24 (49) Q Consensus 8 tvlmlaispllfscssk 24 (49) -+||||+ |||||+.. T Consensus 10 FIL~LAv--LLFSC~ke 24 (463) T TIGR01781 10 FILMLAV--LLFSCKKE 24 (463) T ss_pred HHHHHHH--HHHCCCCC T ss_conf 8999999--86047621 No 2 >pfam09710 Trep_dent_lipo Treponema clustered lipoprotein (Trep_dent_lipo). This entry represents a family of six predicted lipoproteins from a region of about 20 tandemly arranged genes in the Treponema denticola genome. Two other neighbouring genes share the lipoprotein signal peptide region but do not show more extensive homology. The function of this locus is unknown. Probab=82.61 E-value=0.56 Score=25.46 Aligned_cols=13 Identities=69% Similarity=1.283 Sum_probs=9.7 Q ss_pred HHHHHHHHHHHHCCC Q ss_conf 999976135631136 Q gi|254780951|r 9 VLMLAISPLLFSCSS 23 (49) Q Consensus 9 vlmlaispllfscss 23 (49) .||||+ ||||||+ T Consensus 8 ILilav--LLFSCsk 20 (394) T pfam09710 8 ILILAV--LLFSCSK 20 (394) T ss_pred HHHHHH--HHHHHHH T ss_conf 999999--9974036 No 3 >PRK11671 mltC murein transglycosylase C; Provisional Probab=81.52 E-value=0.82 Score=24.67 Aligned_cols=23 Identities=52% Similarity=0.713 Sum_probs=15.0 Q ss_pred HHHHHHHHHHC-CCCCCCHHHCCH Q ss_conf 99761356311-363362111201 Q gi|254780951|r 11 MLAISPLLFSC-SSKKGGEKKGAE 33 (49) Q Consensus 11 mlaispllfsc-sskkggekkgae 33 (49) .+-|.|+|+|| ||+++++-..+- T Consensus 8 ~~~~~~~l~~c~~~~~~~~~~~~~ 31 (360) T PRK11671 8 LALIAPLLISCSSSKKGDTYNEAW 31 (360) T ss_pred HHHHHHHHHHHCCCCCCCCCCHHH T ss_conf 999999998710468887633777 No 4 >TIGR03525 GldK gliding motility-associated lipoprotein GldK. Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldK is a lipoprotein linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae. Knockouts of GldK abolish the gliding phenotype. GldK is homologous to GldJ. There is a GldK homolog in Cytophaga hutchinsonii and several other species that has a different, shorter architecture and is represented by a separate model. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Bacteroidetes with members of this protein family appear to have all of the genes associated with gliding motility. Probab=79.36 E-value=1.1 Score=24.12 Aligned_cols=28 Identities=36% Similarity=0.605 Sum_probs=22.1 Q ss_pred HHHHHHHHHHHHCCCCCCCHHHCCHHHC Q ss_conf 9999761356311363362111201110 Q gi|254780951|r 9 VLMLAISPLLFSCSSKKGGEKKGAEKKT 36 (49) Q Consensus 9 vlmlaispllfscsskkggekkgaekkt 36 (49) ++.+++.-+++||+|...||-.|+..+. T Consensus 5 ~~~~~~~~~~~SCg~~~~GELvGv~~~~ 32 (449) T TIGR03525 5 LVFAALVVLVYSCGSGDKGELVGVEGKK 32 (449) T ss_pred HHHHHHHHHHHHCCCCCCCEEEEECCCC T ss_conf 8999999987210589971387314875 No 5 >pfam02382 RTX RTX N-terminal domain. The RTX family of bacterial toxins are a group of cytolysins and cytotoxins. This Pfam family represents the N-terminal domain which is found in association with a glycine-rich repeat domain and hemolysinCabind pfam00353. Probab=73.34 E-value=1.9 Score=22.90 Aligned_cols=22 Identities=45% Similarity=0.601 Sum_probs=17.2 Q ss_pred HHHHHHHHHHHHHHHHHCCCCC Q ss_conf 4657899997613563113633 Q gi|254780951|r 4 RIIGTVLMLAISPLLFSCSSKK 25 (49) Q Consensus 4 riigtvlmlaispllfscsskk 25 (49) -.|..+.|||||||-|---..| T Consensus 305 ~LIaS~v~LAISPLsFl~iAdk 326 (653) T pfam02382 305 GLIASVVSLAISPLSFLGIADK 326 (653) T ss_pred HHHHHHHHHHHCHHHHHHHHHH T ss_conf 9999999997085877768876 No 6 >TIGR03530 GldJ_short gliding motility-associated lipoprotein GldJ. Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldJ is a lipoprotein linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae. Knockouts of GldJ abolish the gliding phenotype. GldJ is homologous to GldK. This model represents the GldJ homolog in Cytophaga hutchinsonii and several other species which is of shorter architecture than that found in Flavobacterium johnsoniae and is represented by a separate model (TIGR03524). Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Bacteroidetes with members of this protein family appear to have all of the genes associated with gliding motility. Probab=62.94 E-value=4.1 Score=21.34 Aligned_cols=23 Identities=26% Similarity=0.600 Sum_probs=17.3 Q ss_pred HHHHHHHHHHHHHHCCCCCCCHH Q ss_conf 78999976135631136336211 Q gi|254780951|r 7 GTVLMLAISPLLFSCSSKKGGEK 29 (49) Q Consensus 7 gtvlmlaispllfscsskkggek 29 (49) -.++.+|.+-+||||+.++|... T Consensus 3 ~~~~~~~~~~~~~~c~~~~~~~~ 25 (402) T TIGR03530 3 HAAMLIAAASILFSCNKTPGYGN 25 (402) T ss_pred HHHHHHHHHHHHHHCCCCCCCCC T ss_conf 01589999999865114778898 No 7 >pfam07790 DUF1628 Protein of unknown function (DUF1628). The sequences making up this family are derived from hypothetical proteins of unknown function expressed by various archaeal species. The region in question is approximately 160 residues long. Probab=55.96 E-value=7.4 Score=20.13 Aligned_cols=19 Identities=42% Similarity=0.693 Sum_probs=15.6 Q ss_pred HHHHHHHHHHHHHHHHHHC Q ss_conf 2465789999761356311 Q gi|254780951|r 3 TRIIGTVLMLAISPLLFSC 21 (49) Q Consensus 3 triigtvlmlaispllfsc 21 (49) .-.||.+||+||.-+|++- T Consensus 4 SpViG~iLmvaitvilaa~ 22 (78) T pfam07790 4 SPVIGVVLMVAITVILAAV 22 (78) T ss_pred CHHHHHHHHHHHHHHHHHH T ss_conf 1799999999999999999 No 8 >TIGR01667 YCCS_YHJK integral membrane protein, YccS/YhfK family; InterPro: IPR010020 This entry identifies a clade of sequences from gamma and beta proteobacteria. These proteins are more than 700 amino acids long and many have been annotated as putative membrane proteins. The gene from Salmonella has been annotated as a putative efflux transporter. The gene from Escherichia coli has been annotated as yccS. The YccS hypothetical equivalog is found in beta and gamma proteobacteria, while the smaller YhfK group is only found in E. coli, Salmonella and Yersinia. . Probab=44.79 E-value=12 Score=19.11 Aligned_cols=23 Identities=52% Similarity=0.595 Sum_probs=17.6 Q ss_pred CHHHHHHHHHHHH--HHHHHHCCCC Q ss_conf 5246578999976--1356311363 Q gi|254780951|r 2 NTRIIGTVLMLAI--SPLLFSCSSK 24 (49) Q Consensus 2 ntriigtvlmlai--spllfscssk 24 (49) ..||||||+=++| ++|.|-..|- T Consensus 441 ~~Ri~GTvvGlv~aG~~L~~~~Psl 465 (724) T TIGR01667 441 VQRIIGTVVGLVIAGVALLFLIPSL 465 (724) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCH T ss_conf 7645778889998757886424755 No 9 >PRK10802 peptidoglycan-associated outer membrane lipoprotein; Provisional Probab=43.20 E-value=24 Score=17.70 Aligned_cols=27 Identities=33% Similarity=0.557 Sum_probs=15.3 Q ss_pred HHHHHHH-HHHHCCCCCCCHHHCCHHHC Q ss_conf 9997613-56311363362111201110 Q gi|254780951|r 10 LMLAISP-LLFSCSSKKGGEKKGAEKKT 36 (49) Q Consensus 10 lmlaisp-llfscsskkggekkgaekkt 36 (49) |+++++- .|..|||++..+...++... T Consensus 10 l~ia~~~l~LaACSS~~~~~~~~~~~~~ 37 (173) T PRK10802 10 LMIALPVMAIAACSSNKNASNDGSEGML 37 (173) T ss_pred HHHHHHHHHHHHCCCCCCCCCCCCCCCC T ss_conf 9999999999855799988877766556 No 10 >TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795). Probab=43.17 E-value=17 Score=18.38 Aligned_cols=21 Identities=48% Similarity=0.559 Sum_probs=15.1 Q ss_pred HHHHHHHHHHHHHHCCCCCCC Q ss_conf 789999761356311363362 Q gi|254780951|r 7 GTVLMLAISPLLFSCSSKKGG 27 (49) Q Consensus 7 gtvlmlaispllfscsskkgg 27 (49) -.++++.+.-+|.|||+++.. T Consensus 3 ~~~~~~~~~~~l~sCs~~~~~ 23 (235) T TIGR03302 3 LLILLLALLLLLAGCSSKKKK 23 (235) T ss_pred HHHHHHHHHHHHHHCCCCCCC T ss_conf 999999999999873899766 No 11 >TIGR03659 IsdE heme ABC transporter, heme-binding protein isdE. This family of ABC substrate-binding proteins is observed primarily in close proximity with proteins localized to the cell wall and bearing the NEAT (NEAr Transporter, pfam05031) heme-binding domain. IsdE has been shown to bind heme and is involved in the process of scavenging heme for the purpose of obtaining iron. Probab=30.75 E-value=42 Score=16.58 Aligned_cols=23 Identities=35% Similarity=0.476 Sum_probs=13.6 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 65789999761356311363362 Q gi|254780951|r 5 IIGTVLMLAISPLLFSCSSKKGG 27 (49) Q Consensus 5 iigtvlmlaispllfscsskkgg 27 (49) |+..++++.+.-.|+.|++++.. T Consensus 3 ~~~~~~~~~~~~~l~~C~~~~~~ 25 (289) T TIGR03659 3 ILSLILLALLSLGLTGCSSSKAE 25 (289) T ss_pred HHHHHHHHHHHHHHHHCCCCCCC T ss_conf 29999999999998432899877 No 12 >TIGR03524 GldJ gliding motility-associated lipoprotein GldJ. Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldJ is a lipoprotein linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae. Knockouts of GldJ abolish the gliding phenotype. GldJ is homologous to GldK. There is a GldJ homolog in Cytophaga hutchinsonii and several other species that has a different, shorter architecture and is represented by a separate model. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Bacteroidetes with members of this protein family appear to have all of the genes associated with gliding motility. Probab=30.74 E-value=39 Score=16.73 Aligned_cols=21 Identities=24% Similarity=0.520 Sum_probs=15.3 Q ss_pred HHHHHHHHHHHHHHHHCCCCC Q ss_conf 657899997613563113633 Q gi|254780951|r 5 IIGTVLMLAISPLLFSCSSKK 25 (49) Q Consensus 5 iigtvlmlaispllfscsskk 25 (49) +.-.+++++|+-+|.||+.++ T Consensus 6 ~~~~~~~~~~~~~~~sC~~~~ 26 (559) T TIGR03524 6 VFKITFIVFVSLLFVSCKKKS 26 (559) T ss_pred HHHHHHHHHHHHHHHHCCCCC T ss_conf 899999999999976425888 No 13 >PRK10866 outer membrane protein assembly complex subunit YfiO; Provisional Probab=25.58 E-value=56 Score=15.97 Aligned_cols=20 Identities=30% Similarity=0.318 Sum_probs=14.5 Q ss_pred HHHHHHHHHHHHHHCCCCCC Q ss_conf 78999976135631136336 Q gi|254780951|r 7 GTVLMLAISPLLFSCSSKKG 26 (49) Q Consensus 7 gtvlmlaispllfscsskkg 26 (49) --+.+++++-+|.+||+.+. T Consensus 6 ~l~~~~~l~l~l~gCs~~~~ 25 (243) T PRK10866 6 YLVAAATLSLFLAGCSGSKE 25 (243) T ss_pred HHHHHHHHHHHHHHCCCCCC T ss_conf 99999999999987689988 No 14 >pfam05590 DUF769 Xylella fastidiosa protein of unknown function (DUF769). This family consists of several uncharacterized hypothetical proteins of unknown function from Xylella fastidiosa, the organism that causes Pierce's disease in plants. Probab=22.95 E-value=46 Score=16.36 Aligned_cols=25 Identities=36% Similarity=0.309 Sum_probs=19.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 9524657899997613563113633 Q gi|254780951|r 1 MNTRIIGTVLMLAISPLLFSCSSKK 25 (49) Q Consensus 1 mntriigtvlmlaispllfscsskk 25 (49) |.-|+--.+-+||.|-||-.|||-- T Consensus 1 Msl~prL~~SllaAslLLagCSsgP 25 (279) T pfam05590 1 MSLHPRLSCSLLAASLLLAGCSSGP 25 (279) T ss_pred CCCCCCHHHHHHHHHHHHHCCCCCC T ss_conf 9866640169999999972378899 No 15 >pfam11873 DUF3393 Domain of unknown function (DUF3393). This domain is functionally uncharacterized. This domain is found in bacteria. This presumed domain is typically between 188 to 206 amino acids in length. This domain is found associated with pfam01464. Probab=22.60 E-value=39 Score=16.69 Aligned_cols=15 Identities=47% Similarity=0.587 Sum_probs=11.3 Q ss_pred HHHHHHHHHHHHCCC Q ss_conf 999976135631136 Q gi|254780951|r 9 VLMLAISPLLFSCSS 23 (49) Q Consensus 9 vlmlaispllfscss 23 (49) +..+.+.|||.+||+ T Consensus 4 l~~~~~~llL~~Cs~ 18 (200) T pfam11873 4 LSLLLILLLLSGCSS 18 (200) T ss_pred EHHHHHHHHHHHCCC T ss_conf 079999999841577 No 16 >PRK02898 cobalt transport protein CbiN; Provisional Probab=22.51 E-value=65 Score=15.65 Aligned_cols=31 Identities=32% Similarity=0.549 Sum_probs=19.6 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCHHHCCH Q ss_conf 2465789999761356311363362111201 Q gi|254780951|r 3 TRIIGTVLMLAISPLLFSCSSKKGGEKKGAE 33 (49) Q Consensus 3 triigtvlmlaispllfscsskkggekkgae 33 (49) --++..|..|++.||+|.-++.-||....|+ T Consensus 7 ~~Ll~~vi~l~v~Pl~~~~~~eFgGaD~~a~ 37 (97) T PRK02898 7 LLLLLLVILLAVAPLFFNKGGEFGGADGQAE 37 (97) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCCCHHHH T ss_conf 4899999999999998436884067326899 Done!