Query gi|254781185|ref|YP_003065598.1| hypothetical protein CLIBASIA_05460 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 42 No_of_seqs 1 out of 3 Neff 1.0 Searched_HMMs 39220 Date Mon May 30 06:06:09 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254781185.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 pfam07273 DUF1439 Protein of u 63.6 5.1 0.00013 21.3 2.1 25 10-34 7-37 (177) 2 PRK01622 OxaA-like protein pre 59.4 6.9 0.00018 20.6 2.2 24 8-31 10-33 (266) 3 TIGR03302 OM_YfiO outer membra 57.9 12 0.00031 19.4 3.2 31 8-38 4-39 (235) 4 TIGR01452 PGP_euk phosphoglyco 56.7 4.1 0.00011 21.7 0.7 25 1-25 233-263 (288) 5 PRK10598 hypothetical protein; 50.0 15 0.00037 19.0 2.6 31 4-34 2-38 (186) 6 pfam07901 DUF1672 Protein of u 49.2 14 0.00036 19.1 2.4 23 8-30 5-27 (304) 7 pfam08139 LPAM_1 Prokaryotic m 48.2 12 0.00031 19.4 2.0 19 8-26 4-22 (26) 8 PRK13731 conjugal transfer sur 46.7 12 0.00032 19.3 1.8 33 1-33 1-36 (243) 9 PRK10866 outer membrane protei 41.3 32 0.00082 17.3 3.2 31 8-38 7-38 (243) 10 TIGR02898 spore_YhcN_YlaJ spor 40.5 19 0.00049 18.4 2.0 24 8-31 2-27 (185) 11 TIGR02917 PEP_TPR_lipo putativ 40.4 29 0.00074 17.5 2.9 30 9-38 9-42 (924) 12 PRK11372 lysozyme inhibitor; P 38.0 19 0.00049 18.4 1.7 17 10-26 7-23 (109) 13 PRK12407 flgH flagellar basal 37.6 20 0.00051 18.3 1.7 16 9-24 4-19 (220) 14 PRK09837 copper/silver efflux 36.1 23 0.00058 18.0 1.8 18 10-27 7-24 (460) 15 TIGR01432 QOXA cytochrome aa3 35.3 5.2 0.00013 21.2 -1.6 14 10-23 2-15 (229) 16 PRK02944 OxaA-like protein pre 35.1 22 0.00057 18.0 1.6 16 8-23 8-23 (255) 17 PRK10718 hypothetical protein; 34.2 17 0.00044 18.6 0.9 15 10-24 7-21 (191) 18 PRK12698 flgH flagellar basal 31.1 30 0.00076 17.4 1.7 16 9-24 4-19 (224) 19 pfam06572 DUF1131 Protein of u 29.8 27 0.00068 17.7 1.3 14 10-23 7-20 (192) 20 pfam10954 DUF2755 Protein of u 29.4 23 0.0006 18.0 0.9 21 4-24 67-87 (100) 21 PRK10525 cytochrome o ubiquino 28.6 40 0.001 16.8 2.0 20 4-23 8-27 (305) 22 pfam11353 DUF3153 Protein of u 28.4 30 0.00076 17.4 1.3 13 10-22 3-15 (210) 23 COG5645 Predicted periplasmic 27.4 32 0.00082 17.3 1.4 17 5-23 3-19 (80) 24 TIGR02722 lp_ uncharacterized 26.7 45 0.0012 16.5 2.0 17 8-24 6-22 (215) 25 PRK09915 putative outer membra 24.1 57 0.0014 16.0 2.1 23 4-26 7-29 (488) 26 PRK02463 OxaA-like protein pre 23.3 71 0.0018 15.5 2.5 15 9-23 12-26 (307) 27 PRK06760 hypothetical protein; 21.9 40 0.001 16.8 1.0 21 5-25 4-25 (223) 28 pfam03304 Mlp Mlp lipoprotein 21.4 53 0.0014 16.1 1.5 17 8-24 4-21 (150) 29 pfam04507 DUF576 Protein of un 21.2 85 0.0022 15.1 2.8 38 3-40 4-44 (257) 30 PRK10397 hypothetical protein; 20.6 55 0.0014 16.1 1.5 15 10-24 5-19 (137) 31 KOG2882 consensus 20.6 12 0.0003 19.5 -2.0 29 2-30 247-281 (306) 32 PRK11548 hypothetical protein; 20.2 47 0.0012 16.4 1.1 12 14-25 13-24 (113) 33 PRK10510 putative outer membra 20.1 90 0.0023 15.0 2.5 30 1-30 1-34 (219) No 1 >pfam07273 DUF1439 Protein of unknown function (DUF1439). This family consists of several hypothetical bacterial proteins of around 190 residues in length. Several members of this family are annotated as being putative lipoproteins and are often known as YceB. The function of this family is unknown. Probab=63.61 E-value=5.1 Score=21.28 Aligned_cols=25 Identities=52% Similarity=0.801 Sum_probs=19.4 Q ss_pred HHHHHHHHHHCC------CCCCHHHHCHHHH Q ss_conf 999999999237------4220100024999 Q gi|254781185|r 10 IIFGLVLVLSGC------SIGEQKKNNSVKK 34 (42) Q Consensus 10 iifglvlvlsgc------sigeqkknnsvkk 34 (42) ++|-+++.|+|| ||.||.-|.-+.+ T Consensus 7 ~~~~l~~~LsgC~~ls~ysISE~Ein~yL~k 37 (177) T pfam07273 7 LILLLVLLLSGCASLSQYSISEAEINQYLQK 37 (177) T ss_pred HHHHHHHHHHCCCCCCCEEECHHHHHHHHHH T ss_conf 9999999971557566252669999999987 No 2 >PRK01622 OxaA-like protein precursor; Validated Probab=59.43 E-value=6.9 Score=20.63 Aligned_cols=24 Identities=42% Similarity=0.581 Sum_probs=16.5 Q ss_pred HHHHHHHHHHHHCCCCCCHHHHCH Q ss_conf 799999999992374220100024 Q gi|254781185|r 8 INIIFGLVLVLSGCSIGEQKKNNS 31 (42) Q Consensus 8 iniifglvlvlsgcsigeqkknns 31 (42) +-...+++++||||+-.+...++| T Consensus 10 ~~~~~~~~~~lsgc~~~~~~~~~~ 33 (266) T PRK01622 10 ISLSLLLVFVLSGCSNAAPIDAHS 33 (266) T ss_pred HHHHHHHHHHHHCCCCCCCCCCCC T ss_conf 999999999995468999988999 No 3 >TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795). Probab=57.94 E-value=12 Score=19.42 Aligned_cols=31 Identities=26% Similarity=0.683 Sum_probs=19.2 Q ss_pred HHHHHHHHHHHHCCCCCCHHH-----HCHHHHHHHH Q ss_conf 799999999992374220100-----0249999999 Q gi|254781185|r 8 INIIFGLVLVLSGCSIGEQKK-----NNSVKKYFNK 38 (42) Q Consensus 8 iniifglvlvlsgcsigeqkk-----nnsvkkyfnk 38 (42) +-+++.+.++|+|||-.+..+ +.+.+..|.+ T Consensus 4 ~~~~~~~~~~l~sCs~~~~~~~~~~~~~~~~~lY~~ 39 (235) T TIGR03302 4 LILLLALLLLLAGCSSKKKKEADPVEERPAEELYNE 39 (235) T ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHH T ss_conf 999999999998738997665555557899999999 No 4 >TIGR01452 PGP_euk phosphoglycolate/pyridoxal phosphate phosphatase family; InterPro: IPR006349 This family of sequences represent 2-phosphoglycolate phosphatase which is limited to the eukaryotic lineage. PGP is an essential enzyme in the glycolate salvage pathway in higher organisms (photorespiration in plants). Phosphoglycolate results from the oxidase activity of RubisCO in the Calvin cycle when concentrations of carbon dioxide are low relative to oxygen. In mammals, PGP is found in many tissues, notably in red blood cells where P-glycolate is and important activator of the hydrolysis of 2,3-bisphosphoglycerate, a major modifier of the oxygen affinity of hemoglobin. The PGPase enzyme described here is a member of the Haloacid dehalogenase superfamily of hydrolase enzymes. Unlike the bacterial PGP equivalog (IPR006346 from INTERPRO), which is a member of class (subfamily) I, this enzyme is a member of class (subfamily) II . These two enzymes have almost certainly arisen from convergent evolution (although these two ancestors may themselves have diverged from a more distant HAD superfamily progenitor). One of the sequences in the group comes from Chlamydomonas reinhardtii, a photosynthetic alga. The enzyme has been purified and characterised and these data are fully consistent with the assignment of function as a PGPase involved in photorespiration . Another, from Homo sapiens, is supported as a member of the family indirectly. Biochemical characterization of partially purified PGP's from various tissues including red blood cells have been performed while one gene for PGP has been localized to chromosome 16p13.3 . The Homo sapiens sequence maps to chromosome 22. There is indeed a related gene on chromosome 16 (and it is expressed, since EST's are found) which shows 46 0dentity and 59603b08ositives by BLAST2 (E=1e-66). Presumably, these two genes are isoforms and have the same catalytic function while being expressed in different tissues and may be differently regulated. The sequence from Caenorhabditis elegans, is only supported by sequence similarity. This family is closely related to a family of bacterial sequences including the Escherichia coli NagD and Bacillus subtilus AraL genes which are characterised by the ability to hydrolyze para-nitrophenylphosphate (pNPPases or NPPases). The Chlamydomonas reinhardtii PGPase does not catalyze this reaction and so presumably these two groups have different functions and substrate specificities. Many of the genes in this family have been annotated as pNPPases due to this association.; GO: 0016791 phosphoric monoester hydrolase activity, 0008152 metabolic process. Probab=56.72 E-value=4.1 Score=21.74 Aligned_cols=25 Identities=52% Similarity=0.723 Sum_probs=16.9 Q ss_pred CCCCHHHHHHHHH------HHHHHHCCCCCC Q ss_conf 9631112799999------999992374220 Q gi|254781185|r 1 MVSKNIGINIIFG------LVLVLSGCSIGE 25 (42) Q Consensus 1 mvskniginiifg------lvlvlsgcsige 25 (42) ||-...--.|.|| -+||||||+-=| T Consensus 233 MvGDRl~TDi~FG~~~Gm~T~LvLsGv~~~E 263 (288) T TIGR01452 233 MVGDRLETDILFGKDGGMKTVLVLSGVSRLE 263 (288) T ss_pred EECCCCHHHHHHCCCCCHHHHHHHHCCHHHH T ss_conf 7517720224532114211345450611466 No 5 >PRK10598 hypothetical protein; Provisional Probab=50.05 E-value=15 Score=19.00 Aligned_cols=31 Identities=29% Similarity=0.398 Sum_probs=20.9 Q ss_pred CHHHHHHHHHHHHHHHCC------CCCCHHHHCHHHH Q ss_conf 111279999999999237------4220100024999 Q gi|254781185|r 4 KNIGINIIFGLVLVLSGC------SIGEQKKNNSVKK 34 (42) Q Consensus 4 kniginiifglvlvlsgc------sigeqkknnsvkk 34 (42) |.+-.-..+.+...|||| ||.||.-|+-..+ T Consensus 2 kk~~l~~~l~l~~llsGC~~l~qYsISEqEin~YL~~ 38 (186) T PRK10598 2 KKFLLAAALLVSGLLVGCNQLTQYTISEQEINQYLAK 38 (186) T ss_pred CHHHHHHHHHHHHHHHCCCCCCCEEECHHHHHHHHHH T ss_conf 1269999999999971435336440079999999987 No 6 >pfam07901 DUF1672 Protein of unknown function (DUF1672). This family is composed of hypothetical bacterial proteins of unknown function. Probab=49.20 E-value=14 Score=19.07 Aligned_cols=23 Identities=39% Similarity=0.667 Sum_probs=16.4 Q ss_pred HHHHHHHHHHHHCCCCCCHHHHC Q ss_conf 79999999999237422010002 Q gi|254781185|r 8 INIIFGLVLVLSGCSIGEQKKNN 30 (42) Q Consensus 8 iniifglvlvlsgcsigeqkknn 30 (42) |-.|+-..|.|+|||.-..+.+. T Consensus 5 i~~il~~~llL~GCs~mn~~~~~ 27 (304) T pfam07901 5 IKLILIATLLLSGCSTTNNESNK 27 (304) T ss_pred HHHHHHHHHHHCCCCCCCCCCCH T ss_conf 99999999997133445554200 No 7 >pfam08139 LPAM_1 Prokaryotic membrane lipoprotein lipid attachment site. In prokaryotes, membrane lipoproteins are synthesized with a precursor signal peptide, which is cleaved by a specific lipoprotein signal peptidase (signal peptidase II). The peptidase recognizes a conserved sequence and cuts upstream of a cysteine residue to which a glyceride-fatty acid lipid is attached. Probab=48.23 E-value=12 Score=19.38 Aligned_cols=19 Identities=21% Similarity=0.560 Sum_probs=14.3 Q ss_pred HHHHHHHHHHHHCCCCCCH Q ss_conf 7999999999923742201 Q gi|254781185|r 8 INIIFGLVLVLSGCSIGEQ 26 (42) Q Consensus 8 iniifglvlvlsgcsigeq 26 (42) +..+++++++|+||+-... T Consensus 4 ~~~l~~~~l~LagCas~~~ 22 (26) T pfam08139 4 LLLLLLALLLLAGCASXXX 22 (26) T ss_pred HHHHHHHHHHHHCCCCCCC T ss_conf 9999999999823320013 No 8 >PRK13731 conjugal transfer surface exclusion protein TraT; Provisional Probab=46.73 E-value=12 Score=19.34 Aligned_cols=33 Identities=33% Similarity=0.484 Sum_probs=23.0 Q ss_pred CCCCHHHHHHHHHHHHHHHCCCCCC---HHHHCHHH Q ss_conf 9631112799999999992374220---10002499 Q gi|254781185|r 1 MVSKNIGINIIFGLVLVLSGCSIGE---QKKNNSVK 33 (42) Q Consensus 1 mvskniginiifglvlvlsgcsige---qkknnsvk 33 (42) |-.|.+-+-.+...+|+||||+--. .|+|-.|+ T Consensus 1 m~~kk~~~~~~~~~~l~lsGC~a~~tai~krnL~Vq 36 (243) T PRK13731 1 MKTKKLMMVALVSSTLALSGCGAMSTAIKKRNLEVK 36 (243) T ss_pred CCHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCEE T ss_conf 933775799999999997156876778773575222 No 9 >PRK10866 outer membrane protein assembly complex subunit YfiO; Provisional Probab=41.30 E-value=32 Score=17.25 Aligned_cols=31 Identities=26% Similarity=0.408 Sum_probs=18.5 Q ss_pred HHHHHHHHHHHHCCCCC-CHHHHCHHHHHHHH Q ss_conf 79999999999237422-01000249999999 Q gi|254781185|r 8 INIIFGLVLVLSGCSIG-EQKKNNSVKKYFNK 38 (42) Q Consensus 8 iniifglvlvlsgcsig-eqkknnsvkkyfnk 38 (42) +-++..+.|.|+|||-. +.....+....|++ T Consensus 7 l~~~~~l~l~l~gCs~~~~~~~~~~~~~~Y~~ 38 (243) T PRK10866 7 LVAAATLSLFLAGCSGSKEEVPDNPPNEIYAT 38 (243) T ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHH T ss_conf 99999999999876899888889999999999 No 10 >TIGR02898 spore_YhcN_YlaJ sporulation lipoprotein, YhcN/YlaJ family; InterPro: IPR014247 This entry contains YhcN and YlaJ, which are predicted lipoproteins that have been detected as spore proteins but not vegetative proteins in Bacillus subtilis. Both appear to be expressed under control of the RNA polymerase sigma-G factor. The YlaJ-like members of this family have a low-complexity, strongly acidic, 40-residue C-terminal domain. . Probab=40.46 E-value=19 Score=18.39 Aligned_cols=24 Identities=33% Similarity=0.664 Sum_probs=17.4 Q ss_pred HHHHHHHHHHH--HCCCCCCHHHHCH Q ss_conf 79999999999--2374220100024 Q gi|254781185|r 8 INIIFGLVLVL--SGCSIGEQKKNNS 31 (42) Q Consensus 8 iniifglvlvl--sgcsigeqkknns 31 (42) +-+|.-+.|+| +||++.+++.+.. T Consensus 2 l~~~ll~~~~l~~tgC~~~~~~~~r~ 27 (185) T TIGR02898 2 LFIILLLLLLLLATGCTNAQKKSGRN 27 (185) T ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCC T ss_conf 68899999999873134100146888 No 11 >TIGR02917 PEP_TPR_lipo putative PEP-CTERM system TPR-repeat lipoprotein; InterPro: IPR014266 This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system , analogous to the LPXTG/sortase system common in Gram-positive bacteria. The proteins in this entry occur in a species if, and only if, a transmembrane histidine kinase (IPR014265 from INTERPRO) and a DNA-binding response regulator (IPR014264 from INTERPRO) also occur. The present of tetratricopeptide repeats (TPR) suggests they may be involved in protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.. Probab=40.45 E-value=29 Score=17.49 Aligned_cols=30 Identities=40% Similarity=0.709 Sum_probs=18.3 Q ss_pred HHHHHHHHHHHCCCCCCHHHH----CHHHHHHHH Q ss_conf 999999999923742201000----249999999 Q gi|254781185|r 9 NIIFGLVLVLSGCSIGEQKKN----NSVKKYFNK 38 (42) Q Consensus 9 niifglvlvlsgcsigeqkkn----nsvkkyfnk 38 (42) -+.|.-.+.|||||-|++... .+-|+|.++ T Consensus 9 ~~~~~a~~lLsaCsGg~~~~~~~l~~~Ak~yl~~ 42 (924) T TIGR02917 9 TILLAAILLLSACSGGDQLSPEELIEEAKKYLQK 42 (924) T ss_pred HHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHC T ss_conf 9999999998630578888968999999999861 No 12 >PRK11372 lysozyme inhibitor; Provisional Probab=37.95 E-value=19 Score=18.37 Aligned_cols=17 Identities=41% Similarity=0.710 Sum_probs=12.5 Q ss_pred HHHHHHHHHHCCCCCCH Q ss_conf 99999999923742201 Q gi|254781185|r 10 IIFGLVLVLSGCSIGEQ 26 (42) Q Consensus 10 iifglvlvlsgcsigeq 26 (42) ++..+.+.|||||.-.| T Consensus 7 l~~~~~llLsGCs~~~~ 23 (109) T PRK11372 7 LIICLPVLLTGCSAYNQ 23 (109) T ss_pred HHHHHHHHHHCCCCCCC T ss_conf 99999999704543221 No 13 >PRK12407 flgH flagellar basal body L-ring protein; Reviewed Probab=37.63 E-value=20 Score=18.29 Aligned_cols=16 Identities=38% Similarity=0.611 Sum_probs=12.4 Q ss_pred HHHHHHHHHHHCCCCC Q ss_conf 9999999999237422 Q gi|254781185|r 9 NIIFGLVLVLSGCSIG 24 (42) Q Consensus 9 niifglvlvlsgcsig 24 (42) -++..++|+|+||+-- T Consensus 4 l~l~~~~l~LsGC~~~ 19 (220) T PRK12407 4 LILTPMVLALCGCESP 19 (220) T ss_pred HHHHHHHHHHHCCCCC T ss_conf 7999999997365888 No 14 >PRK09837 copper/silver efflux system outer membrane protein CusC; Provisional Probab=36.08 E-value=23 Score=18.02 Aligned_cols=18 Identities=33% Similarity=0.715 Sum_probs=13.3 Q ss_pred HHHHHHHHHHCCCCCCHH Q ss_conf 999999999237422010 Q gi|254781185|r 10 IIFGLVLVLSGCSIGEQK 27 (42) Q Consensus 10 iifglvlvlsgcsigeqk 27 (42) +.+.+.+.|+||+.|..- T Consensus 7 l~l~~~l~L~gC~~~P~y 24 (460) T PRK09837 7 LPFCVALALTGCSLAPDY 24 (460) T ss_pred HHHHHHHHHCCCCCCCCC T ss_conf 999999998247789999 No 15 >TIGR01432 QOXA cytochrome aa3 quinol oxidase, subunit II; InterPro: IPR006332 Members of this family catalyze the oxidation of quinol with the concomitant reduction of molecular oxygen to water. This acts as the terminal electron acceptor in the respiratory chain. This subunit contains two transmembrane helices and a large external domain responsible for the binding and oxidation of quinol. QuoX is (presently) only found in Gram-positive bacteria of the Bacillus/Staphylococcus group . Like CyoA, the ubiquinol oxidase found in proteobacteria, the residues responsible for the ligation of Cu(a) and cytochrome c (found in the related cyt. c oxidases) are absent. Unlike CyoA, QoxA is in complex with a subunit I which contains cytochromes similar to the cytochrome c oxidases (as opposed to cytochromes b).; GO: 0016682 oxidoreductase activity acting on diphenols and related substances as donors oxygen as acceptor, 0006122 mitochondrial electron transport ubiquinol to cytochrome c. Probab=35.28 E-value=5.2 Score=21.23 Aligned_cols=14 Identities=50% Similarity=1.049 Sum_probs=10.0 Q ss_pred HHHHHHHHHHCCCC Q ss_conf 99999999923742 Q gi|254781185|r 10 IIFGLVLVLSGCSI 23 (42) Q Consensus 10 iifglvlvlsgcsi 23 (42) ..||.+.+|||||- T Consensus 2 ~Llg~~~llsGCs~ 15 (229) T TIGR01432 2 ALLGVVILLSGCSN 15 (229) T ss_pred HHHEEEEEEECCCC T ss_conf 02226642335766 No 16 >PRK02944 OxaA-like protein precursor; Validated Probab=35.07 E-value=22 Score=18.05 Aligned_cols=16 Identities=31% Similarity=0.810 Sum_probs=12.0 Q ss_pred HHHHHHHHHHHHCCCC Q ss_conf 7999999999923742 Q gi|254781185|r 8 INIIFGLVLVLSGCSI 23 (42) Q Consensus 8 iniifglvlvlsgcsi 23 (42) +-...+++++||||+- T Consensus 8 ~~~~~~~~~~lsgC~~ 23 (255) T PRK02944 8 LAMVVALMAILAGCSE 23 (255) T ss_pred HHHHHHHHHHHHCCCC T ss_conf 9999999999962468 No 17 >PRK10718 hypothetical protein; Provisional Probab=34.19 E-value=17 Score=18.64 Aligned_cols=15 Identities=40% Similarity=0.840 Sum_probs=12.2 Q ss_pred HHHHHHHHHHCCCCC Q ss_conf 999999999237422 Q gi|254781185|r 10 IIFGLVLVLSGCSIG 24 (42) Q Consensus 10 iifglvlvlsgcsig 24 (42) .+.+++|.|+|||-- T Consensus 7 ~l~~~~llLsGCst~ 21 (191) T PRK10718 7 LLLALPLLLTGCSTM 21 (191) T ss_pred HHHHHHHHHHCCCCC T ss_conf 231224764065666 No 18 >PRK12698 flgH flagellar basal body L-ring protein; Reviewed Probab=31.14 E-value=30 Score=17.43 Aligned_cols=16 Identities=44% Similarity=0.839 Sum_probs=12.7 Q ss_pred HHHHHHHHHHHCCCCC Q ss_conf 9999999999237422 Q gi|254781185|r 9 NIIFGLVLVLSGCSIG 24 (42) Q Consensus 9 niifglvlvlsgcsig 24 (42) .+++.+++.|+||+-- T Consensus 4 ~~l~~~~l~L~gCa~~ 19 (224) T PRK12698 4 YILLALALLLAGCSST 19 (224) T ss_pred HHHHHHHHHHHCCCCC T ss_conf 9999999998365688 No 19 >pfam06572 DUF1131 Protein of unknown function (DUF1131). This family consists of several hypothetical bacterial proteins of unknown function. Probab=29.85 E-value=27 Score=17.67 Aligned_cols=14 Identities=43% Similarity=0.940 Sum_probs=12.0 Q ss_pred HHHHHHHHHHCCCC Q ss_conf 99999999923742 Q gi|254781185|r 10 IIFGLVLVLSGCSI 23 (42) Q Consensus 10 iifglvlvlsgcsi 23 (42) .+.++.|.|+|||- T Consensus 7 ~l~~~~llL~GCst 20 (192) T pfam06572 7 FLLALPLLLTGCST 20 (192) T ss_pred HHHHHHHHHHCCCC T ss_conf 68776688606556 No 20 >pfam10954 DUF2755 Protein of unknown function (DUF2755). Some members in this family of proteins are annotated as YaiY however no function is known. The family appears to be restricted to Enterobacteriaceae. Probab=29.38 E-value=23 Score=17.95 Aligned_cols=21 Identities=43% Similarity=0.795 Sum_probs=16.9 Q ss_pred CHHHHHHHHHHHHHHHCCCCC Q ss_conf 111279999999999237422 Q gi|254781185|r 4 KNIGINIIFGLVLVLSGCSIG 24 (42) Q Consensus 4 kniginiifglvlvlsgcsig 24 (42) -..|...||||+-.|.||-|- T Consensus 67 iglgVgTiFgliPFL~Gc~i~ 87 (100) T pfam10954 67 IGLGVGTIFGLIPFLIGCLIF 87 (100) T ss_pred EEECHHHHHHHHHHHHHHHHH T ss_conf 731057899889999988999 No 21 >PRK10525 cytochrome o ubiquinol oxidase subunit II; Provisional Probab=28.60 E-value=40 Score=16.78 Aligned_cols=20 Identities=35% Similarity=0.664 Sum_probs=16.0 Q ss_pred CHHHHHHHHHHHHHHHCCCC Q ss_conf 11127999999999923742 Q gi|254781185|r 4 KNIGINIIFGLVLVLSGCSI 23 (42) Q Consensus 4 kniginiifglvlvlsgcsi 23 (42) ++++.-.+++..+.||||+. T Consensus 8 ~~~~~l~l~~~~~lLsGC~~ 27 (305) T PRK10525 8 KSLGWLSLFAGTVLLSGCNS 27 (305) T ss_pred HHHHHHHHHHHHHHHCCCCC T ss_conf 78999999999998667876 No 22 >pfam11353 DUF3153 Protein of unknown function (DUF3153). This family of proteins with unknown function appear to be restricted to Cyanobacteria. Some members are annotated as membrane proteins however this cannot be confirmed. Probab=28.43 E-value=30 Score=17.44 Aligned_cols=13 Identities=46% Similarity=0.938 Sum_probs=11.4 Q ss_pred HHHHHHHHHHCCC Q ss_conf 9999999992374 Q gi|254781185|r 10 IIFGLVLVLSGCS 22 (42) Q Consensus 10 iifglvlvlsgcs 22 (42) +++++.+.||||- T Consensus 3 vlllltlLLSGCV 15 (210) T pfam11353 3 VLLLLTLLLSGCV 15 (210) T ss_pred HHHHHHHHHCCEE T ss_conf 2999999847818 No 23 >COG5645 Predicted periplasmic lipoprotein [General function prediction only] Probab=27.43 E-value=32 Score=17.26 Aligned_cols=17 Identities=41% Similarity=0.722 Sum_probs=9.7 Q ss_pred HHHHHHHHHHHHHHHCCCC Q ss_conf 1127999999999923742 Q gi|254781185|r 5 NIGINIIFGLVLVLSGCSI 23 (42) Q Consensus 5 niginiifglvlvlsgcsi 23 (42) ||-+-++.. ++||||+- T Consensus 3 ~i~l~l~v~--lllSGC~S 19 (80) T COG5645 3 NILLSLMVL--LLLSGCGS 19 (80) T ss_pred EEHHHHHHH--HHHCCCCE T ss_conf 532789999--98376410 No 24 >TIGR02722 lp_ uncharacterized lipoprotein; InterPro: IPR014094 Members of this protein family are restricted to the Proteobacteria, and all are predicted lipoproteins. In genomes that contain the thiK gene for the salvage enzyme thiamin kinase, the member of this family is encoded nearby.. Probab=26.72 E-value=45 Score=16.50 Aligned_cols=17 Identities=35% Similarity=0.546 Sum_probs=12.8 Q ss_pred HHHHHHHHHHHHCCCCC Q ss_conf 79999999999237422 Q gi|254781185|r 8 INIIFGLVLVLSGCSIG 24 (42) Q Consensus 8 iniifglvlvlsgcsig 24 (42) |-++..++|.|+||+-- T Consensus 6 l~~~~~~Al~L~GC~~~ 22 (215) T TIGR02722 6 LIFVALLALLLSGCVSQ 22 (215) T ss_pred HHHHHHHHHHHHCCCCC T ss_conf 89999999998546887 No 25 >PRK09915 putative outer membrane efflux protein MdtP; Provisional Probab=24.13 E-value=57 Score=16.02 Aligned_cols=23 Identities=22% Similarity=0.502 Sum_probs=16.4 Q ss_pred CHHHHHHHHHHHHHHHCCCCCCH Q ss_conf 11127999999999923742201 Q gi|254781185|r 4 KNIGINIIFGLVLVLSGCSIGEQ 26 (42) Q Consensus 4 kniginiifglvlvlsgcsigeq 26 (42) |.+-+-.+.+..++|+||+.|.. T Consensus 7 ~r~~~~~~l~~~~lLagC~~~p~ 29 (488) T PRK09915 7 SRLLLCSILGSTTLISGCALVRK 29 (488) T ss_pred HHHHHHHHHHHHHHHCCCCCCCC T ss_conf 69999999999999752668999 No 26 >PRK02463 OxaA-like protein precursor; Provisional Probab=23.26 E-value=71 Score=15.51 Aligned_cols=15 Identities=27% Similarity=0.700 Sum_probs=11.2 Q ss_pred HHHHHHHHHHHCCCC Q ss_conf 999999999923742 Q gi|254781185|r 9 NIIFGLVLVLSGCSI 23 (42) Q Consensus 9 niifglvlvlsgcsi 23 (42) -...+++++||||.- T Consensus 12 ~~~~~~~l~LsgC~~ 26 (307) T PRK02463 12 GLALSMLLTLTGCVG 26 (307) T ss_pred HHHHHHHHHHHCCCC T ss_conf 999999999963468 No 27 >PRK06760 hypothetical protein; Provisional Probab=21.86 E-value=40 Score=16.78 Aligned_cols=21 Identities=29% Similarity=0.447 Sum_probs=12.3 Q ss_pred HHHHHHHHHH-HHHHHCCCCCC Q ss_conf 1127999999-99992374220 Q gi|254781185|r 5 NIGINIIFGL-VLVLSGCSIGE 25 (42) Q Consensus 5 niginiifgl-vlvlsgcsige 25 (42) -++|-|.-++ ...||+||.++ T Consensus 4 ~l~i~i~~iis~~~lsaCS~~~ 25 (223) T PRK06760 4 TLTIFMLTILLLISFSACSKKE 25 (223) T ss_pred EEEEHHHHHHHHHHHHHCCCCC T ss_conf 6764999999999840033678 No 28 >pfam03304 Mlp Mlp lipoprotein family. The Mlp (for Multicopy Lipoprotein) family of lipoproteins is found in Borrelia species. This family were previously known as 2.9 lipoprotein genes. These surface expressed genes may represent new candidate vaccinogens for Lyme disease. Members of this family generally are downstream of four ORFs called A,B,C and D that are involved in hemolytic activity. Probab=21.37 E-value=53 Score=16.15 Aligned_cols=17 Identities=53% Similarity=1.048 Sum_probs=12.4 Q ss_pred HHHHHHH-HHHHHCCCCC Q ss_conf 7999999-9999237422 Q gi|254781185|r 8 INIIFGL-VLVLSGCSIG 24 (42) Q Consensus 8 iniifgl-vlvlsgcsig 24 (42) |||.|-| .|.|+||.-- T Consensus 4 INilfclfll~Ln~Cnsn 21 (150) T pfam03304 4 INILFCLFLLMLNGCNSN 21 (150) T ss_pred EHHHHHHHHHHHCCCCCC T ss_conf 149999999998265678 No 29 >pfam04507 DUF576 Protein of unknown function, DUF576. This family contains several uncharacterized staphylococcal proteins. Probab=21.17 E-value=85 Score=15.12 Aligned_cols=38 Identities=34% Similarity=0.592 Sum_probs=22.9 Q ss_pred CCHHHHHHHHH-HHHHHHCCCCC--CHHHHCHHHHHHHHHH Q ss_conf 31112799999-99999237422--0100024999999983 Q gi|254781185|r 3 SKNIGINIIFG-LVLVLSGCSIG--EQKKNNSVKKYFNKLI 40 (42) Q Consensus 3 skniginiifg-lvlvlsgcsig--eqkknnsvkkyfnkli 40 (42) +|-+.+.|.+- |+...+||... |.-|....|+-|+|.. T Consensus 4 ~kk~~L~is~liliifI~Gcg~~~k~dsKe~qIKksF~KtL 44 (257) T pfam04507 4 SKKLALYISVLILIIFISGCGFMNKEDSKEAQIKKSFAKTL 44 (257) T ss_pred HHEEHHHHHHHHHHHHHEECCCCCCCCCHHHHHHHHHHHHH T ss_conf 01003678899786520002235556657889999998776 No 30 >PRK10397 hypothetical protein; Provisional Probab=20.62 E-value=55 Score=16.07 Aligned_cols=15 Identities=33% Similarity=0.689 Sum_probs=11.7 Q ss_pred HHHHHHHHHHCCCCC Q ss_conf 999999999237422 Q gi|254781185|r 10 IIFGLVLVLSGCSIG 24 (42) Q Consensus 10 iifglvlvlsgcsig 24 (42) ++-|.+++|+||.-- T Consensus 5 ~~~g~~l~LsGCa~v 19 (137) T PRK10397 5 AIAGALMLLAGCAEV 19 (137) T ss_pred HHHHHHHHHHCCCCC T ss_conf 677899997353244 No 31 >KOG2882 consensus Probab=20.59 E-value=12 Score=19.47 Aligned_cols=29 Identities=34% Similarity=0.548 Sum_probs=18.8 Q ss_pred CCCHHHHHHHHH------HHHHHHCCCCCCHHHHC Q ss_conf 631112799999------99999237422010002 Q gi|254781185|r 2 VSKNIGINIIFG------LVLVLSGCSIGEQKKNN 30 (42) Q Consensus 2 vskniginiifg------lvlvlsgcsigeqkknn 30 (42) |....--.|.|| .+||||||+.-|.-+++ T Consensus 247 vGDRL~TDIlFG~~~G~~TLLvltGv~~led~~~~ 281 (306) T KOG2882 247 VGDRLDTDILFGKNCGFKTLLVLSGVTTLEDILEA 281 (306) T ss_pred ECCCCHHHHHHHHCCCCCEEEEECCCCCHHHHHHH T ss_conf 76632024567651696159985584718999740 No 32 >PRK11548 hypothetical protein; Provisional Probab=20.24 E-value=47 Score=16.43 Aligned_cols=12 Identities=42% Similarity=0.808 Sum_probs=8.5 Q ss_pred HHHHHHCCCCCC Q ss_conf 999992374220 Q gi|254781185|r 14 LVLVLSGCSIGE 25 (42) Q Consensus 14 lvlvlsgcsige 25 (42) +.+.+||||.-+ T Consensus 13 ~~lllsgCS~~~ 24 (113) T PRK11548 13 LLMLTAGCSTLE 24 (113) T ss_pred HHHHHCCCCCCC T ss_conf 999980115787 No 33 >PRK10510 putative outer membrane lipoprotein; Provisional Probab=20.13 E-value=90 Score=15.00 Aligned_cols=30 Identities=30% Similarity=0.468 Sum_probs=18.4 Q ss_pred CCCCHHHHHHHHHHHHHHHCCCC----CCHHHHC Q ss_conf 96311127999999999923742----2010002 Q gi|254781185|r 1 MVSKNIGINIIFGLVLVLSGCSI----GEQKKNN 30 (42) Q Consensus 1 mvskniginiifglvlvlsgcsi----geqkknn 30 (42) |-.+-.-|-......|+||||.. |||+.++ T Consensus 1 m~~r~~lia~~~a~~l~lsgC~t~p~tg~~~~~~ 34 (219) T PRK10510 1 MKKRVYLIAAIVSGALAVSGCTTNPYTGEREAGK 34 (219) T ss_pred CCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCH T ss_conf 9426899999999998741356789876655326 Done!