Query gi|255764500|ref|YP_003064987.2| hypothetical protein CLIBASIA_02305 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 200 No_of_seqs 1 out of 3 Neff 1.0 Searched_HMMs 39220 Date Sun May 29 22:28:43 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 255764500.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 TIGR03212 uraD_N-term-dom puta 60.4 14 0.00036 17.9 3.9 64 129-192 118-187 (297) 2 COG1342 Predicted DNA-binding 42.6 22 0.00056 16.7 2.6 40 137-178 44-83 (99) 3 pfam12356 DUF3643 Protein of u 41.4 14 0.00036 17.9 1.4 14 124-137 33-46 (165) 4 LOAD_SKIP consensus 40.8 29 0.00075 15.9 5.3 101 51-159 60-162 (168) 5 TIGR01168 YSIRK_signal Gram-po 38.1 17 0.00044 17.3 1.4 20 8-27 12-34 (44) 6 KOG4330 consensus 37.8 33 0.00083 15.6 3.6 26 79-104 135-162 (206) 7 KOG3375 consensus 35.2 35 0.00089 15.4 2.6 46 130-175 119-166 (174) 8 TIGR01181 dTDP_gluc_dehyt dTDP 31.6 22 0.00056 16.7 1.1 32 145-185 149-183 (340) 9 TIGR01172 cysE serine O-acetyl 29.7 44 0.0011 14.8 2.6 47 113-159 14-60 (163) 10 TIGR02852 spore_dpaB dipicolin 29.5 11 0.00029 18.5 -0.7 47 50-98 71-121 (188) 11 pfam09293 RNaseH_C T4 RNase H, 28.7 43 0.0011 14.9 2.2 93 41-134 10-111 (122) 12 PRK04217 hypothetical protein; 27.3 49 0.0012 14.5 2.6 40 137-178 53-92 (110) 13 TIGR01559 squal_synth farnesyl 27.0 35 0.00089 15.4 1.5 46 130-190 208-255 (350) 14 pfam02001 DUF134 Protein of un 26.5 50 0.0013 14.5 2.6 40 137-178 47-86 (100) 15 KOG4504 consensus 25.2 45 0.0011 14.8 1.8 51 15-68 13-67 (370) 16 COG3058 FdhE Uncharacterized p 23.9 56 0.0014 14.1 3.5 49 63-111 67-139 (308) 17 COG2320 GrpB Uncharacterized c 20.7 65 0.0017 13.7 2.1 26 145-170 150-175 (185) No 1 >TIGR03212 uraD_N-term-dom putative urate catabolism protein. This model represents a protein that is predominantly found just upstream of the UraD protein (OHCU decarboxylase) and in a number of instances as a N-terminal fusion with it. UraD itself catalyzes the last step in the catabolism of urate to allantoate. The function of this protein is presently unknown. It shows homology with the pfam01522 polysaccharide deacetylase domain family. Probab=60.45 E-value=14 Score=17.92 Aligned_cols=64 Identities=16% Similarity=0.184 Sum_probs=43.9 Q ss_pred HHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH-----H-HHHHHHHHHHHCCCCCCCHHHC Q ss_conf 99987765144455306898987764077798866554311-----2-2433345433043323311100 Q gi|255764500|r 129 SIIWHRYQHMDYKGKTEQEKRALAKQSRDDFQRYATAQAAN-----Q-KAADMLMLATYGLQYDDSLTKI 192 (200) Q Consensus 129 siiwhryqhmdykgkteqekralakqsrddfqryataqaan-----q-kaadmlmlatyglqyddsltki 192 (200) -|..|-|.|.++.+-++++.|++-.++.+-+++..-..... . ...--|++...|+.|+-++.-- T Consensus 118 Ei~~HGy~~~~~~~l~~~~E~~~i~~~~~~l~~~tG~rP~Gw~s~~~s~~T~~Ll~~~~G~~Y~sd~~~D 187 (297) T TIGR03212 118 EIASHGLRWIDYQDMDEAQEREHIAEAIRLHTEVTGERPLGWYTGRTSPNTRRLVAEEGGFLYDSDSYAD 187 (297) T ss_pred CHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHCCCEEECCCCCC T ss_conf 3775433033766699899999999999999986599988117899996336899995896898988888 No 2 >COG1342 Predicted DNA-binding proteins [General function prediction only] Probab=42.62 E-value=22 Score=16.67 Aligned_cols=40 Identities=35% Similarity=0.440 Sum_probs=32.1 Q ss_pred HCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 144455306898987764077798866554311224333454 Q gi|255764500|r 137 HMDYKGKTEQEKRALAKQSRDDFQRYATAQAANQKAADMLML 178 (200) Q Consensus 137 hmdykgkteqekralakqsrddfqryataqaanqkaadmlml 178 (200) -.||.|-+.+|--+.-.-||..|+|-- ++|-+|.||+|.- T Consensus 44 LvD~~~l~QeeAA~rMgISr~Tfwr~l--~sAR~KvA~aLve 83 (99) T COG1342 44 LVDYEGLTQEEAALRMGISRQTFWRLL--TSARKKVADALVE 83 (99) T ss_pred HHHHHHCCHHHHHHHHCCCHHHHHHHH--HHHHHHHHHHHHC T ss_conf 886861057899998464299999999--9999999999855 No 3 >pfam12356 DUF3643 Protein of unknown function (DUF3643). This family of proteins is found in eukaryotes. Proteins in this family are typically between 217 and 4852 amino acids in length. There is a conserved TLA sequence motif. Probab=41.38 E-value=14 Score=17.87 Aligned_cols=14 Identities=36% Similarity=0.930 Sum_probs=11.1 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 78999999877651 Q gi|255764500|r 124 EEAIASIIWHRYQH 137 (200) Q Consensus 124 eeaiasiiwhryqh 137 (200) -..||+|+||-|+- T Consensus 33 ihcvAsILW~s~~~ 46 (165) T pfam12356 33 LHCVAAILWHSYEL 46 (165) T ss_pred HHHHHHHHHHHHHC T ss_conf 99999999998751 No 4 >LOAD_SKIP consensus Probab=40.78 E-value=29 Score=15.91 Aligned_cols=101 Identities=28% Similarity=0.261 Sum_probs=56.8 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHCCCCCCCCCCHHH--HHHHCCCHHHHHHHHHHHHHHHHCCCHHHHH Q ss_conf 87899878756554898577786565433210100286100247268--7420622456887788666543214178999 Q gi|255764500|r 51 EPRKPTPLDHSDSQHPPVLPNSENNAHGDEPIKKSEKPTFRRNQPVI--ENLRKPTEAEKEKILAARDRYQKTNNEEAIA 128 (200) Q Consensus 51 eprkptpldhsdsqhppvlpnsennahgdepikksekptfrrnqpvi--enlrkpteaekekilaardryqktnneeaia 128 (200) -|||.|.-|..+-.-||...|-.|+.--.-|+.|--.-.-|..|.|- +++-+-.||--.---.||.... . T Consensus 60 p~rKlT~eeq~~WkIPp~ISNWKN~kGyTIpLDKRlaadgRglq~v~iNdkFa~lseAL~~Aer~AREev~--------~ 131 (168) T LOAD_SKIP 60 PPRKVTVKDQNDWKIPPCVSNWKNPKGYTIPLDKRLAADGRGLQDVEINDNFAKLSEALYIAEKKAREEVR--------A 131 (168) T ss_pred CCCCCCHHHHHCCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH--------H T ss_conf 98778987897077998756773888852070577652377877531426799999999999999999999--------9 Q ss_pred HHHHHHHHHCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 9998776514445530689898776407779 Q gi|255764500|r 129 SIIWHRYQHMDYKGKTEQEKRALAKQSRDDF 159 (200) Q Consensus 129 siiwhryqhmdykgkteqekralakqsrddf 159 (200) --.-.+-.-+.-+-+-|+.-|.||.++|.+- T Consensus 132 Rae~~k~~a~kEk~~kEe~LR~lA~~AR~eR 162 (168) T LOAD_SKIP 132 RAELERKLAQKEKEEKEDKLKELAQKAREER 162 (168) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999999999999999999999999985 No 5 >TIGR01168 YSIRK_signal Gram-positive signal peptide, YSIRK family; InterPro: IPR005877 Many surface proteins found in Streptococcus, Staphylococcus, and related lineages share apparently homologous signal sequences. A motif resembling [YF]SIRKxxxGxxS[VIA] appears at the start of the transmembrane domain. The GxxS motif appears perfectly conserved, suggesting a specific function and not just homology. ; GO: 0016020 membrane. Probab=38.08 E-value=17 Score=17.34 Aligned_cols=20 Identities=40% Similarity=0.542 Sum_probs=14.2 Q ss_pred EEEEHH--HHHHHHHHH-HHCCC Q ss_conf 430010--058988977-74135 Q gi|255764500|r 8 VSIKKL--GIVSVLIPV-VLGVS 27 (200) Q Consensus 8 vsikkl--givsvlipv-vlgvs 27 (200) -||+|| |++||+|-. ++-.+ T Consensus 12 ySIRKlsvG~aSV~va~~~f~g~ 34 (44) T TIGR01168 12 YSIRKLSVGVASVLVASTLFFGG 34 (44) T ss_pred EEHHHHHHHHHHHHHHHHHHHCC T ss_conf 21034478899999888760015 No 6 >KOG4330 consensus Probab=37.83 E-value=33 Score=15.62 Aligned_cols=26 Identities=38% Similarity=0.480 Sum_probs=16.7 Q ss_pred CCCHHHCCCCCCCC--CCHHHHHHHCCC Q ss_conf 32101002861002--472687420622 Q gi|255764500|r 79 DEPIKKSEKPTFRR--NQPVIENLRKPT 104 (200) Q Consensus 79 depikksekptfrr--nqpvienlrkpt 104 (200) ..+.++||||+|.- -|-+.|.|-|-- T Consensus 135 s~~mr~~ek~~FTlrqVqmICErllKer 162 (206) T KOG4330 135 SSTMRKSEKPLFTLRQVQMICERLLKER 162 (206) T ss_pred CCHHHCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 3211114432000999999999999999 No 7 >KOG3375 consensus Probab=35.17 E-value=35 Score=15.43 Aligned_cols=46 Identities=39% Similarity=0.406 Sum_probs=28.7 Q ss_pred HHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHH Q ss_conf 9987765144455306898987764077798866554--311224333 Q gi|255764500|r 130 IIWHRYQHMDYKGKTEQEKRALAKQSRDDFQRYATAQ--AANQKAADM 175 (200) Q Consensus 130 iiwhryqhmdykgkteqekralakqsrddfqryataq--aanqkaadm 175 (200) ---.||..|..-|||+|-+.-||.-.----||--+|. -+..||++. T Consensus 119 rA~eRYmklh~aGKTeqAkaDLaRlalir~qReeaa~kre~ekkaa~~ 166 (174) T KOG3375 119 RAKERYMKLHLAGKTEQAKADLARLALIRQQREEAAAKREAEKKAADV 166 (174) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 899999999970441788888999889888888988877878754014 No 8 >TIGR01181 dTDP_gluc_dehyt dTDP-glucose 4,6-dehydratase; InterPro: IPR005888 The conversion of dTDP-glucose into dTDP-4-keto-6-deoxyglucose by Escherichia coli dTDP-glucose 4,6-dehydratase) takes place in the active site in three steps: dehydrogenation to dTDP-4-ketoglucose, dehydration to dTDP-4-ketoglucose-5,6-ene, and rereduction of C6 to the methyl group. The 4,6-dehydratase makes use of tightly bound NAD^+ as the coenzyme for transiently oxidizing the substrate, activating it for the dehydration step . This and other 4,6-dehydratases catalyze the first committed step in all 6-deoxysugar biosynthetic pathways described to date. Numerous 6-deoxysugars are used in bacterial lipopolysaccharide production as well as in the biosynthesis of a diverse array of secondary metabolites.; GO: 0008460 dTDP-glucose 46-dehydratase activity, 0009225 nucleotide-sugar metabolic process. Probab=31.57 E-value=22 Score=16.69 Aligned_cols=32 Identities=44% Similarity=0.593 Sum_probs=20.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHCCCC Q ss_conf 68989877640777988665543112243334543---304332 Q gi|255764500|r 145 EQEKRALAKQSRDDFQRYATAQAANQKAADMLMLA---TYGLQY 185 (200) Q Consensus 145 eqekralakqsrddfqryataqaanqkaadmlmla---tyglqy 185 (200) -.|.-.|+-.|- | .|-.-|+|||.+| ||||.+ T Consensus 149 ftE~tpl~PsSP-----Y----SASKAasD~LVrAy~rTYGLp~ 183 (340) T TIGR01181 149 FTETTPLAPSSP-----Y----SASKAASDLLVRAYHRTYGLPV 183 (340) T ss_pred EEECCCCCCCCC-----H----HHHHHHHHHHHHHHHHHCCCCE T ss_conf 423278877872-----4----5889888789888887548860 No 9 >TIGR01172 cysE serine O-acetyltransferase; InterPro: IPR005881 The biosynthesis of L-cysteine is the predominant way by which inorganic sulphur is incorporated into organic compounds. In this process, the most abundant utilizable source of sulphur, inorganic sulphate, is taken up and reduced to sulphide. Sulphide is used to produce L-cysteine, which serves for protein synthesis or the production of other sulphur-containing organic compounds. Two routes for cysteine biosynthesis in nature have been documented. Serine transacetylase (2.3.1.30 from EC) catalyzes steps in pathway I, the activation of L-serine by acetyl-coenzyme A, yielding O-acetyl-L-serine.; GO: 0009001 serine O-acetyltransferase activity, 0006535 cysteine biosynthetic process from serine, 0005737 cytoplasm. Probab=29.66 E-value=44 Score=14.80 Aligned_cols=47 Identities=19% Similarity=0.077 Sum_probs=33.5 Q ss_pred HHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 86665432141789999998776514445530689898776407779 Q gi|255764500|r 113 AARDRYQKTNNEEAIASIIWHRYQHMDYKGKTEQEKRALAKQSRDDF 159 (200) Q Consensus 113 aardryqktnneeaiasiiwhryqhmdykgkteqekralakqsrddf 159 (200) ||+...+---.-.-+-.+.|||..|.=|+-+-.---|.+..++|--| T Consensus 14 Aa~~~l~vll~YpG~HA~~~yR~aH~Ly~~~~k~lAr~~~~~~r~~t 60 (163) T TIGR01172 14 AARSYLEVLLYYPGLHALWAYRFAHALYKRGFKFLARLISNFIRVLT 60 (163) T ss_pred CCCHHHHHHHHCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHC T ss_conf 60123789772637899999998656762463179999987665530 No 10 >TIGR02852 spore_dpaB dipicolinic acid synthetase, B subunit; InterPro: IPR014214 Members of this entry represent the B subunit of dipicolinic acid synthetase, an enzyme that synthesizes a small molecule that appears to confer heat stability to bacterial endospores such as those of Bacillus subtilis. The A and B subunits are together in what was originally designated the spoVF locus for stage V of endospore formation.. Probab=29.49 E-value=11 Score=18.48 Aligned_cols=47 Identities=40% Similarity=0.561 Sum_probs=30.9 Q ss_pred CCCCCC-CCCCCCCCCCCCCCCCCC---CCCCCCCCHHHCCCCCCCCCCHHHH Q ss_conf 887899-878756554898577786---5654332101002861002472687 Q gi|255764500|r 50 GEPRKP-TPLDHSDSQHPPVLPNSE---NNAHGDEPIKKSEKPTFRRNQPVIE 98 (200) Q Consensus 50 geprkp-tpldhsdsqhppvlpnse---nnahgdepikksekptfrrnqpvie 98 (200) -||--| .|||--= --|.--||- -||+-|.|+--.-|-|.|-++||+- T Consensus 71 AEP~GP~~~LD~mv--iaP~TGNsmsKlANA~tDspVLMAAKA~lRN~~PVvL 121 (188) T TIGR02852 71 AEPFGPKVPLDVMV--IAPLTGNSMSKLANAITDSPVLMAAKATLRNNKPVVL 121 (188) T ss_pred CCCCCCCCCCCEEE--ECCCCCCHHHHHHCCCCCCHHHHHHHHHHHCCCCEEE T ss_conf 54597697533688--7788621389984215897778877877517897898 No 11 >pfam09293 RNaseH_C T4 RNase H, C terminal. Members of this family are found in T4 RNaseH ribonuclease, and adopt a SAM domain-like fold, consisting of a bundle of four/five helices. These residues may have a role in providing a docking site for other proteins or enzymes in the replication fork. Probab=28.71 E-value=43 Score=14.85 Aligned_cols=93 Identities=18% Similarity=0.288 Sum_probs=41.7 Q ss_pred EEECCCCCCCCCCCCCCCCCC-----CCCCCCCCCC-CCCCCCCCCC---HHHCCCCCCCCCCHHHHHHHCCCHHHHHHH Q ss_conf 132057777887899878756-----5548985777-8656543321---010028610024726874206224568877 Q gi|255764500|r 41 PILKDDKNDGEPRKPTPLDHS-----DSQHPPVLPN-SENNAHGDEP---IKKSEKPTFRRNQPVIENLRKPTEAEKEKI 111 (200) Q Consensus 41 pilkddkndgeprkptpldhs-----dsqhppvlpn-sennahgdep---ikksekptfrrnqpvienlrkpteaekeki 111 (200) -|||.|+.||-|---.+=|.- .-..+|+--. -+.-...+.| .-..+.--|.||+-.|.--.-| +.-+++| T Consensus 10 hIlkGD~~DGIPNiLS~Dd~fvt~veG~RQkPiskKkl~~~~~~~~~~~~~~~e~~rny~RN~~LIDL~~iP-~~i~~~I 88 (122) T pfam09293 10 KIIKGDKKDGVASIKVRSDYILTRVEGERAPSIRTKELEAIFDREDPKVLLTEEEYARYDENRELIDFDFIP-DDIADNI 88 (122) T ss_pred HHHCCCCCCCCCCCCCCCCCEECCCCCCCCCCCHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHCCHHCC-HHHHHHH T ss_conf 822678868878746688623215676568984099999998668844348999999997537772811184-9999999 Q ss_pred HHHHHHHHHCCCHHHHHHHHHHH Q ss_conf 88666543214178999999877 Q gi|255764500|r 112 LAARDRYQKTNNEEAIASIIWHR 134 (200) Q Consensus 112 laardryqktnneeaiasiiwhr 134 (200) +..-..+...+...-.--.|=+| T Consensus 89 i~~y~~~~~~~r~kll~YFi~~r 111 (122) T pfam09293 89 IEAYNTYKVPPRGKIYSYFVKNG 111 (122) T ss_pred HHHHHCCCCCCHHHHHHHHHHHH T ss_conf 99986489886215499999999 No 12 >PRK04217 hypothetical protein; Provisional Probab=27.27 E-value=49 Score=14.53 Aligned_cols=40 Identities=33% Similarity=0.375 Sum_probs=31.8 Q ss_pred HCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 144455306898987764077798866554311224333454 Q gi|255764500|r 137 HMDYKGKTEQEKRALAKQSRDDFQRYATAQAANQKAADMLML 178 (200) Q Consensus 137 hmdykgkteqekralakqsrddfqryataqaanqkaadmlml 178 (200) -.||.|.+..|--..-.-||..|||-- ..|.+|-|+.|.- T Consensus 53 L~D~egl~qeeaA~~M~VSR~Tf~ril--~~AR~KvA~aLv~ 92 (110) T PRK04217 53 LVDYEGLTQEEAGKRMGVSRGTVWRAL--TSARKKVAQMLVE 92 (110) T ss_pred HHHHCCCCHHHHHHHCCCCHHHHHHHH--HHHHHHHHHHHHC T ss_conf 873607989999988497699999999--9999999999867 No 13 >TIGR01559 squal_synth farnesyl-diphosphate farnesyltransferase; InterPro: IPR006449 This family of sequences describe farnesyl-diphosphate farnesyltransferase, also known as squalene synthase, as found in eukaryotes. This family is related to phytoene synthases. Tentatively identified archaeal homologs lack the C-terminal predicted transmembrane region universally conserved among members of this family and therefore are not included in this group. These enzymes catalyse the formation of squalene by the reductive dimerisation of two farnesyl diphosphate molecules in a two-step reaction. This reaction occurs at the final branch point of the isoprenoid biosynthesis pathway, and is the first committed step in cholesterol biosynthesis.2 farnesyl diphosphate ---> presqualene diphosphate ---> squalene The human enzyme (P37268 from SWISSPROT) is a membrane-bound monomer which folds as a single domain . A large channel surrounded by five alpha helices runs through the centre of the protein. One end of this channel is exposed to solvent, while the other is completely enclosed and surrounded by hydrophobic residues. The two half reactions occur at different active sites in this channel. The first half reaction occurs at one end of the channel, then the intermediate is thought to move into the enclosed pocket where the second half reaction occurs. ; GO: 0004310 farnesyl-diphosphate farnesyltransferase activity, 0008610 lipid biosynthetic process, 0016021 integral to membrane. Probab=27.02 E-value=35 Score=15.44 Aligned_cols=46 Identities=37% Similarity=0.517 Sum_probs=28.4 Q ss_pred HHHHHHHH--CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHH Q ss_conf 99877651--44455306898987764077798866554311224333454330433233111 Q gi|255764500|r 130 IIWHRYQH--MDYKGKTEQEKRALAKQSRDDFQRYATAQAANQKAADMLMLATYGLQYDDSLT 190 (200) Q Consensus 130 iiwhryqh--mdykgkteqekralakqsrddfqryataqaanqkaadmlmlatyglqyddslt 190 (200) =||++|-. -|++ +-|-++.| -|.-|.-.++-|+-+++-|+|=++|+ T Consensus 208 EIWskYa~~L~Df~-~P~N~~~a--------------l~Clnelv~NAL~H~~D~L~YL~~l~ 255 (350) T TIGR01559 208 EIWSKYAKKLGDFK-KPENSEKA--------------LQCLNELVTNALSHATDCLTYLSSLR 255 (350) T ss_pred CHHHHHHHHHHHHC-CHHHHHHH--------------HHHHHHHHHHHHHCHHHHHHHHCCCC T ss_conf 31666566677512-83678888--------------64368999878404899999850477 No 14 >pfam02001 DUF134 Protein of unknown function DUF134. This family of archaeal proteins has no known function. Probab=26.54 E-value=50 Score=14.45 Aligned_cols=40 Identities=33% Similarity=0.370 Sum_probs=32.3 Q ss_pred HCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 144455306898987764077798866554311224333454 Q gi|255764500|r 137 HMDYKGKTEQEKRALAKQSRDDFQRYATAQAANQKAADMLML 178 (200) Q Consensus 137 hmdykgkteqekralakqsrddfqryataqaanqkaadmlml 178 (200) -+||.|-+..|--.--.-||..|||-- ..|.+|-|+.|.- T Consensus 47 L~D~egl~QeeaA~~MgVSR~Tf~ril--~~ARkKvA~aLv~ 86 (100) T pfam02001 47 LVDYEDYTQEEAAKLMGISRRTVWRLL--TSARKKIADALVE 86 (100) T ss_pred HHHHCCCCHHHHHHHCCCCHHHHHHHH--HHHHHHHHHHHHC T ss_conf 873607989999988497699999999--9999999999867 No 15 >KOG4504 consensus Probab=25.21 E-value=45 Score=14.75 Aligned_cols=51 Identities=29% Similarity=0.339 Sum_probs=37.1 Q ss_pred HHHHHHHHHHCCCCCCCCCCCCCCEEEEEC----CCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 589889777413557767543897311320----577778878998787565548985 Q gi|255764500|r 15 IVSVLIPVVLGVSNCDHSDSQHPPVIPILK----DDKNDGEPRKPTPLDHSDSQHPPV 68 (200) Q Consensus 15 ivsvlipvvlgvsncdhsdsqhppvipilk----ddkndgeprkptpldhsdsqhppv 68 (200) +..+++||.||+- .+-+|..--|-+++ +--+||+|.|-|..+---+.|-|+ T Consensus 13 ll~~s~pv~L~~~---~~~~q~a~g~~v~~~~~~~l~~d~~pk~~~~~~~fCA~s~pn 67 (370) T KOG4504 13 LLGGSKPVNLGRV---RDGPQWADGIIVLKYVDGDLCPDGIPKKSTTIRFFCAESQPN 67 (370) T ss_pred HHCCCCCEEEECC---CCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCEEEECCCCC T ss_conf 8704543376215---787520255488873145538988777656422256404788 No 16 >COG3058 FdhE Uncharacterized protein involved in formate dehydrogenase formation [Posttranslational modification, protein turnover, chaperones] Probab=23.89 E-value=56 Score=14.14 Aligned_cols=49 Identities=35% Similarity=0.548 Sum_probs=24.9 Q ss_pred CCCCCCCC-----CCCCCCCCCCCHHHCCCC---CCCCC----------------CHHHHHHHCCCHHHHHHH Q ss_conf 54898577-----786565433210100286---10024----------------726874206224568877 Q gi|255764500|r 63 SQHPPVLP-----NSENNAHGDEPIKKSEKP---TFRRN----------------QPVIENLRKPTEAEKEKI 111 (200) Q Consensus 63 sqhppvlp-----nsennahgdepikksekp---tfrrn----------------qpvienlrkpteaekeki 111 (200) ++||+..+ +-+-+++|-.|++-++-+ .||.- --+||+|+|-.++|-+-+ T Consensus 67 ~~~P~~~~~~~~~~~~sra~gkpPL~~~~~~r~p~w~~~L~alla~~~~~a~~~~~a~ie~L~ka~~~el~~~ 139 (308) T COG3058 67 SDHPLLMDLTDARLEESRAQGKPPLDISVLPREPHWRKLLMALLAELKPEANAAALAAIEQLEKASEQELESM 139 (308) T ss_pred HCCCCCCCCCHHHHHHHHHCCCCCCCHHHCCCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 2499877776788999873699975576527980799999999986254676899999999998569999999 No 17 >COG2320 GrpB Uncharacterized conserved protein [Function unknown] Probab=20.71 E-value=65 Score=13.75 Aligned_cols=26 Identities=38% Similarity=0.492 Sum_probs=20.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 68989877640777988665543112 Q gi|255764500|r 145 EQEKRALAKQSRDDFQRYATAQAANQ 170 (200) Q Consensus 145 eqekralakqsrddfqryataqaanq 170 (200) ..-||-||.|+.++.++|+.+..+-- T Consensus 150 ~~lKr~laa~~~~e~~~Y~~~K~~~i 175 (185) T COG2320 150 AELKRELAAQEDDEILRYTAAKSAFI 175 (185) T ss_pred HHHHHHHHHHCHHHHHHHHHHHCHHH T ss_conf 99999999856488988876005789 Done!