Query gi|254780919|ref|YP_003065332.1| dTDP-4-dehydrorhamnose 3,5-epimerase [Candidatus Liberibacter asiaticus str. psy62] Match_columns 198 No_of_seqs 134 out of 2337 Neff 5.7 Searched_HMMs 39220 Date Mon May 30 02:16:46 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780919.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 TIGR01221 rmlC dTDP-4-dehydror 100.0 0 0 506.5 8.1 174 1-176 6-181 (181) 2 pfam00908 dTDP_sugar_isom dTDP 100.0 0 0 427.4 12.8 174 1-175 4-177 (177) 3 COG1898 RfbC dTDP-4-dehydrorha 100.0 0 0 379.8 9.6 168 1-174 6-173 (173) 4 pfam05523 FdtA WxcM-like, C-te 99.5 4.1E-14 1E-18 101.7 7.7 119 5-139 2-120 (131) 5 TIGR03404 bicupin_oxalic bicup 96.4 0.004 1E-07 38.5 3.7 103 17-137 49-152 (367) 6 smart00835 Cupin_1 Cupin. This 95.1 0.059 1.5E-06 31.7 5.5 104 13-131 6-111 (146) 7 pfam00190 Cupin_1 Cupin. This 94.5 0.033 8.5E-07 33.2 2.9 108 12-131 8-116 (139) 8 COG2140 Thermophilic glucose-6 94.3 0.073 1.9E-06 31.2 4.3 70 49-127 88-157 (209) 9 LOAD_DSBH consensus 93.9 0.032 8.2E-07 33.3 1.9 60 49-123 6-65 (66) 10 PRK04190 glucose-6-phosphate i 90.3 0.32 8.1E-06 27.5 3.4 77 57-142 90-171 (191) 11 pfam08007 Cupin_4 Cupin superf 85.9 0.75 1.9E-05 25.4 2.9 35 92-130 175-209 (320) 12 COG1917 Uncharacterized conser 84.9 1.6 4.1E-05 23.5 4.2 70 36-123 40-109 (131) 13 pfam07883 Cupin_2 Cupin domain 84.5 1.1 2.9E-05 24.3 3.4 61 49-126 6-66 (70) 14 pfam06560 GPI Glucose-6-phosph 84.2 1.2 3.2E-05 24.1 3.4 86 49-138 57-156 (181) 15 KOG2107 consensus 79.9 1.2 2.9E-05 24.3 1.9 51 65-123 93-143 (179) 16 COG3822 ABC-type sugar transpo 79.5 2.2 5.7E-05 22.7 3.3 119 14-138 46-194 (225) 17 COG3542 Uncharacterized conser 76.4 2.1 5.2E-05 22.9 2.4 103 17-131 20-130 (162) 18 COG4101 Predicted mannose-6-ph 74.9 5 0.00013 20.7 4.0 86 49-147 54-140 (142) 19 PRK10296 DNA-binding transcrip 69.0 6.1 0.00016 20.2 3.4 51 49-116 24-74 (271) 20 COG0662 {ManC} Mannose-6-phosp 61.3 14 0.00036 18.1 4.1 76 49-140 44-120 (127) 21 pfam05694 SBP56 56kDa selenium 56.5 2.7 6.8E-05 22.2 -0.3 12 81-92 42-53 (461) 22 PRK13502 transcriptional activ 44.0 25 0.00064 16.7 3.0 50 55-121 32-81 (282) 23 pfam06172 Cupin_5 Cupin superf 43.7 27 0.0007 16.4 3.2 113 9-133 8-130 (139) 24 TIGR02279 PaaC-3OHAcCoADH 3-hy 42.5 12 0.0003 18.5 1.2 27 48-75 133-159 (508) 25 PRK13501 transcriptional activ 41.8 14 0.00035 18.2 1.4 50 55-121 32-81 (290) 26 TIGR01896 cas_AF1879 CRISPR-as 41.4 9.1 0.00023 19.2 0.4 39 51-94 15-57 (295) 27 pfam09313 DUF1971 Domain of un 38.4 28 0.00072 16.4 2.6 52 69-126 29-80 (82) 28 COG2850 Uncharacterized conser 37.4 18 0.00045 17.5 1.4 31 101-133 184-214 (383) 29 KOG2131 consensus 36.8 16 0.00041 17.7 1.1 87 24-122 181-294 (427) 30 PRK10371 DNA-binding transcrip 36.8 32 0.00082 16.0 2.7 56 49-121 34-89 (302) 31 pfam01238 PMI_typeI Phosphoman 34.1 39 0.001 15.5 2.8 21 96-118 252-272 (373) 32 pfam03079 ARD ARD/ARD' family. 33.6 40 0.001 15.5 3.2 65 52-124 79-143 (157) 33 pfam12470 SUFU_C Suppressor of 31.8 32 0.00082 16.0 2.0 76 50-139 92-172 (201) 34 COG4194 Predicted membrane pro 29.5 37 0.00093 15.7 2.0 34 83-118 290-323 (350) 35 COG1791 Uncharacterized conser 28.6 28 0.00072 16.4 1.2 67 48-122 78-144 (181) 36 pfam07385 DUF1498 Protein of u 26.1 38 0.00097 15.6 1.6 44 88-134 148-191 (225) 37 TIGR01555 phge_rel_HI1409 phag 25.9 36 0.00091 15.8 1.4 36 80-115 145-181 (426) 38 TIGR01150 puhA photosynthetic 25.4 56 0.0014 14.6 2.6 81 44-134 149-230 (264) 39 KOG2435 consensus 24.9 56 0.0014 14.6 2.2 66 11-84 206-271 (323) 40 PRK04231 rpl3p 50S ribosomal p 24.7 29 0.00073 16.3 0.7 47 72-123 43-89 (333) 41 PRK13503 transcriptional activ 23.1 63 0.0016 14.4 3.5 51 55-122 29-79 (278) 42 pfam01050 MannoseP_isomer Mann 22.1 65 0.0017 14.2 4.1 66 66-145 85-151 (151) 43 COG4388 Mu-like prophage I pro 21.2 68 0.0017 14.1 2.1 14 11-24 93-106 (357) 44 TIGR02461 osmo_MPG_phos mannos 20.3 38 0.00096 15.6 0.6 13 101-113 65-77 (248) No 1 >TIGR01221 rmlC dTDP-4-dehydrorhamnose 3,5-epimerase; InterPro: IPR000888 Deoxythymidine diphosphate (dTDP)-4-keto-6-deoxy-d-hexulose 3, 5-epimerase (RmlC, 5.1.3.13 from EC) is involved in the biosynthesis of dTDP-l-rhamnose, which is an essential component of the bacterial cell wall, converting dTDP-4-keto-6-deoxy-D-glucose to dTDP-4-keto-L-rhamnose. The crystal structure of RmlC from Methanobacterium thermoautotrophicum was determined in the presence and absence of a substrate analogue. RmlC is a homodimer comprising a central jelly roll motif, which extends in two directions into longer beta-sheets. Binding of dTDP is stabilized by ionic interactions to the phosphate group and by a combination of ionic and hydrophobic interactions with the base. The active site, which is located in the centre of the jelly roll, is formed by residues that are conserved in all known RmlC sequence homologues. The active site is lined with a number of charged residues and a number of residues with hydrogen-bonding potentials, which together comprise a potential network for substrate binding and catalysis. The active site is also lined with aromatic residues which provide favorable environments for the base moiety of dTDP and potentially for the sugar moiety of the substrate .; GO: 0008830 dTDP-4-dehydrorhamnose 35-epimerase activity, 0009103 lipopolysaccharide biosynthetic process. Probab=100.00 E-value=0 Score=506.50 Aligned_cols=174 Identities=49% Similarity=0.870 Sum_probs=169.0 Q ss_pred CCCCCEEEEECCEEEECCCEEEEEECHHHHHH-CCCCCCEEEEEEEECCCCCEEEECCCCCCCCCHHHHEEECHHHHHHH Q ss_conf 96575399707622037950421108898986-79987413442331252000000015555544012100002222234 Q gi|254780919|r 1 MNINPVRILKTRKFEDSRGWFSQTYSSKLLKE-LGLQDVFVQDNHSFSFDCGTIRGLHFQRPPYAQAKLVRCIAGRIFDI 79 (198) Q Consensus 1 L~I~gv~ii~~~~f~D~RG~f~e~f~~~~~~~-~~~~~~~~Q~n~S~s~~kgvlRGlH~Q~~p~~Q~Klv~ci~G~I~dv 79 (198) |+|+||.||+|++|.|+||+|+|+||.+.|.+ .|.+..|+|+|+|+| .||||||||||.++++|+|||+|+.|+|+|| T Consensus 6 t~IpDv~liEP~VF~D~RGFF~E~yn~~~F~~~~G~~~~FVQdN~S~S-~~GVLRGLHyQ~~~~~QGKLVRv~~GevfDV 84 (181) T TIGR01221 6 TEIPDVLLIEPRVFGDERGFFFETYNQEAFQEAVGIPVRFVQDNHSKS-KKGVLRGLHYQREEHAQGKLVRVLRGEVFDV 84 (181) T ss_pred CCCCCEEEECCEECCCCCCCHHHHHHHHHHHHHCCCCCCEEEECCCCC-CCCCCCCEECCCCCCCCCCEEEEEEEEEEEE T ss_conf 347875885455637888613566424678874288763067552265-6784011010188888872678985458877 Q ss_pred HHHCCCCHHHHHHEEEEEEECCCCCEEEECCCCCEEEEEECCCCEEEEEECCCCCCHHCEECCCCCCCCCCCCCCCCCCC Q ss_conf 33406421333100113662365302333023220345307453389721787671017021688801177478887676 Q gi|254780919|r 80 AVDIRRNSPTYGCWVSLEISANNGLQIYIPTGFAHGFMTLEMNTEVIYKVTDFYSVEHDSGVAWQDKSIDITWPLLDTIL 159 (198) Q Consensus 80 vvDlR~~SpTfgk~~~~~Ls~~~~~~l~IP~G~aHGf~~L~d~~~i~Y~~~~~y~p~~e~~i~~~Dp~l~i~Wp~~~~~~ 159 (198) +||||++|||||||.+.+||++|+++||||+||||||++|||.|+++|+||++|.|++|+||.||||.|+|+||+.... T Consensus 85 AVDlRrnSP~fGkWVGv~Ls~~NKrqLWIP~GFAHGF~VLSd~Aev~YK~Td~Y~Pe~e~gi~WNDP~igIdWPl~~~D- 163 (181) T TIGR01221 85 AVDLRRNSPTFGKWVGVLLSAENKRQLWIPEGFAHGFVVLSDEAEVLYKCTDYYAPEYERGIIWNDPDIGIDWPLEDAD- 163 (181) T ss_pred EEECCCCCCCCCEEEEEEECHHCCEECCCCCCCCCCEEEECCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC- T ss_conf 7886786876466999986614281312589744330440416788600104026865773556773331258865578- Q ss_pred C-EECHHHHCCCCHHHCC Q ss_conf 3-0167884599877818 Q gi|254780919|r 160 P-SVSEKDQNLPFLNQID 176 (198) Q Consensus 160 p-iiS~kD~~~p~l~d~~ 176 (198) | |||+||+++|.|+|++ T Consensus 164 ~~lLS~KD~~~~~l~E~~ 181 (181) T TIGR01221 164 PVLLSEKDRKGPPLAEAE 181 (181) T ss_pred CEEECHHHHCCCCHHHCC T ss_conf 436334350487356369 No 2 >pfam00908 dTDP_sugar_isom dTDP-4-dehydrorhamnose 3,5-epimerase. This family catalyse the isomerisation of dTDP-4-dehydro-6-deoxy -D-glucose with dTDP-4-dehydro-6-deoxy-L-mannose. The EC number of this enzyme is 5.1.3.13. Probab=100.00 E-value=0 Score=427.37 Aligned_cols=174 Identities=51% Similarity=0.956 Sum_probs=168.8 Q ss_pred CCCCCEEEEECCEEEECCCEEEEEECHHHHHHCCCCCCEEEEEEEECCCCCEEEECCCCCCCCCHHHHEEECHHHHHHHH Q ss_conf 96575399707622037950421108898986799874134423312520000000155555440121000022222343 Q gi|254780919|r 1 MNINPVRILKTRKFEDSRGWFSQTYSSKLLKELGLQDVFVQDNHSFSFDCGTIRGLHFQRPPYAQAKLVRCIAGRIFDIA 80 (198) Q Consensus 1 L~I~gv~ii~~~~f~D~RG~f~e~f~~~~~~~~~~~~~~~Q~n~S~s~~kgvlRGlH~Q~~p~~Q~Klv~ci~G~I~dvv 80 (198) ++|+||++++++.|.|+||+|.|+|+.+.+++.+....|+|+|+|+| +||||||||||.+|++|+|||+|++|+|+||+ T Consensus 4 t~I~gv~~i~~~~~~D~RG~f~e~f~~~~~~~~~~~~~~~Q~n~S~S-~kgvlRGlH~q~~p~~q~Klv~ci~G~i~dvv 82 (177) T pfam00908 4 LAIPGVLLIEPRVFGDERGFFSESFNAEEFEEAGGHPDFVQDNHSFS-SRGVLRGLHYQQPPPAQGKLVRCVRGEVFDVA 82 (177) T ss_pred CCCCCEEEEECCEEECCCCCEEEEECHHHHHHHCCCCCCCEEEEEEC-CCCCEEEECEECCCCCCEEEEEEEECEEEEEE T ss_conf 76898799989524879927888816789997489998738998896-69918988502899886249999406698999 Q ss_pred HHCCCCHHHHHHEEEEEEECCCCCEEEECCCCCEEEEEECCCCEEEEEECCCCCCHHCEECCCCCCCCCCCCCCCCCCCC Q ss_conf 34064213331001136623653023330232203453074533897217876710170216888011774788876763 Q gi|254780919|r 81 VDIRRNSPTYGCWVSLEISANNGLQIYIPTGFAHGFMTLEMNTEVIYKVTDFYSVEHDSGVAWQDKSIDITWPLLDTILP 160 (198) Q Consensus 81 vDlR~~SpTfgk~~~~~Ls~~~~~~l~IP~G~aHGf~~L~d~~~i~Y~~~~~y~p~~e~~i~~~Dp~l~i~Wp~~~~~~p 160 (198) ||+|++|||||+|.+++|+++++.+||||+||||||+||+|+|+++|+++++|+|+.|.||+|+||.|+|+||++....| T Consensus 83 vDlR~~S~Tfgk~~~~~L~~~~~~~l~IP~G~aHGf~~L~d~~~v~Y~~~~~y~p~~e~gi~~~Dp~l~i~WP~~~~~~~ 162 (177) T pfam00908 83 VDIRKGSPTFGKWVGVELSAENRRQLWIPEGFAHGFLALSDDAEVLYKVSAYYSPEAERGIRWNDPAIGIDWPLADGPEP 162 (177) T ss_pred EECCCCCCCCCCEEEEEECCCCCCEEEECCCCCEEEEEECCCEEEEEECCCCCCCCCCEEECCCCCHHCCCCCCCCCCCC T ss_conf 98988989888489999767757559968986389998079979999837623866650675899520866888999776 Q ss_pred EECHHHHCCCCHHHC Q ss_conf 016788459987781 Q gi|254780919|r 161 SVSEKDQNLPFLNQI 175 (198) Q Consensus 161 iiS~kD~~~p~l~d~ 175 (198) |||+||+++|+|+|+ T Consensus 163 iiS~kD~~~p~l~~a 177 (177) T pfam00908 163 SLSEKDANAPLLAEA 177 (177) T ss_pred EECHHHCCCCCCCCC T ss_conf 887899199980319 No 3 >COG1898 RfbC dTDP-4-dehydrorhamnose 3,5-epimerase and related enzymes [Cell envelope biogenesis, outer membrane] Probab=100.00 E-value=0 Score=379.78 Aligned_cols=168 Identities=51% Similarity=0.960 Sum_probs=162.3 Q ss_pred CCCCCEEEEECCEEEECCCEEEEEECHHHHHHCCCCCCEEEEEEEECCCCCEEEECCCCCCCCCHHHHEEECHHHHHHHH Q ss_conf 96575399707622037950421108898986799874134423312520000000155555440121000022222343 Q gi|254780919|r 1 MNINPVRILKTRKFEDSRGWFSQTYSSKLLKELGLQDVFVQDNHSFSFDCGTIRGLHFQRPPYAQAKLVRCIAGRIFDIA 80 (198) Q Consensus 1 L~I~gv~ii~~~~f~D~RG~f~e~f~~~~~~~~~~~~~~~Q~n~S~s~~kgvlRGlH~Q~~p~~Q~Klv~ci~G~I~dvv 80 (198) +.|+|++++++++|.|+||+|+|+|+.+++...+.+..|+|+|+|+|+ +||+||||||..| |.|||+|++|+|+||+ T Consensus 6 ~~i~gv~~~~~~~~~D~RG~F~E~~~~~~~~~~~~~~~~~Q~n~S~S~-~GvlRGlHyq~~~--q~klv~~v~G~v~dv~ 82 (173) T COG1898 6 TAIPGVLIIEPKVFGDERGFFTETFKAEEFSTLGLPLDFVQDNHSFSY-PGVLRGLHYQHKP--QGKLVRVVSGKVFDVA 82 (173) T ss_pred CCCCCEEEECCCCCCCCCCEEEEHHHHHHHHHCCCCCCCCCCEEEEEC-CCEEEEEECCCCC--CCEEEEEECCCEEEEE T ss_conf 578951898885446666512000123455421676653661588844-8815778711577--8759998057099999 Q ss_pred HHCCCCHHHHHHEEEEEEECCCCCEEEECCCCCEEEEEECCCCEEEEEECCCCCCHHCEECCCCCCCCCCCCCCCCCCCC Q ss_conf 34064213331001136623653023330232203453074533897217876710170216888011774788876763 Q gi|254780919|r 81 VDIRRNSPTYGCWVSLEISANNGLQIYIPTGFAHGFMTLEMNTEVIYKVTDFYSVEHDSGVAWQDKSIDITWPLLDTILP 160 (198) Q Consensus 81 vDlR~~SpTfgk~~~~~Ls~~~~~~l~IP~G~aHGf~~L~d~~~i~Y~~~~~y~p~~e~~i~~~Dp~l~i~Wp~~~~~~p 160 (198) ||+|++|||||+|..++||++|+++|+||+||||||++|+|+++++|+++++|+|+++++|+||||.|+|+||+.. + T Consensus 83 vDlR~~SpTyg~~~~~~ls~~N~~~l~IP~G~AHGf~~L~d~~~~~y~~~~~Y~p~~~~~i~~nDp~i~i~Wp~~~---~ 159 (173) T COG1898 83 VDLRKDSPTYGKWVGVVLSAENKRQLYIPPGFAHGFQVLSDDAEVVYKVTEEYDPEHERGIPWNDPTIGIDWPLKD---P 159 (173) T ss_pred EECCCCCCCCCEEEEEEECCCCCEEEEECCCCCCEEEECCCCEEEEEEECCEECCCCCCCCCCCCCCCCCCCCCCC---C T ss_conf 9766999870259999964787249994785423059816853999972340475566444578875565876676---3 Q ss_pred EECHHHHCCCCHHH Q ss_conf 01678845998778 Q gi|254780919|r 161 SVSEKDQNLPFLNQ 174 (198) Q Consensus 161 iiS~kD~~~p~l~d 174 (198) ++|+||+++|.|++ T Consensus 160 ~~S~kD~~~p~l~~ 173 (173) T COG1898 160 ILSDKDAAAPLLKE 173 (173) T ss_pred EECHHHHCCCCCCC T ss_conf 50887836885479 No 4 >pfam05523 FdtA WxcM-like, C-terminal. This family includes FdtA from Aneurinibacillus thermoaerophilus, which has been characterized as a dtdp-6-deoxy-3,4-keto-hexulose isomerase. It also includes WxcM from Xanthomonas campestris (pv. campestris). Probab=99.52 E-value=4.1e-14 Score=101.70 Aligned_cols=119 Identities=16% Similarity=0.219 Sum_probs=96.3 Q ss_pred CEEEEECCEEEECCCEEEEEECHHHHHHCCCCCCEEEEEEEECCCCCEEEECCCCCCCCCHHHHEEECHHHHHHHHHHCC Q ss_conf 53997076220379504211088989867998741344233125200000001555554401210000222223433406 Q gi|254780919|r 5 PVRILKTRKFEDSRGWFSQTYSSKLLKELGLQDVFVQDNHSFSFDCGTIRGLHFQRPPYAQAKLVRCIAGRIFDIAVDIR 84 (198) Q Consensus 5 gv~ii~~~~f~D~RG~f~e~f~~~~~~~~~~~~~~~Q~n~S~s~~kgvlRGlH~Q~~p~~Q~Klv~ci~G~I~dvvvDlR 84 (198) ++++++.+.+.|+||.+.-+-..+ .+++.++.+.+....+++..||.|.|+.- ..++.|++|++...+-|-. T Consensus 2 ~~kli~l~~~~D~rG~L~~iE~~~-----~ipF~iKRvf~i~~~~~~~~RG~HAHk~~---~Q~li~l~Gs~~v~~~dg~ 73 (131) T pfam05523 2 NVKIIELPKITDDRGNLTVIEGGK-----HIPFEIKRVYYLYDVPGGEERGGHAHKEL---HQLLIALSGSFEVLLDDGK 73 (131) T ss_pred CEEEEECCCEECCCCCEEEEECCC-----CCCCCEEEEEEEECCCCCCCEEEECCCCE---EEEEEEECCEEEEEEECCC T ss_conf 648994780778997499998676-----48975279999967999961830611320---5999981548999997898 Q ss_pred CCHHHHHHEEEEEEECCCCCEEEECCCCCEEEEEECCCCEEEEEECCCCCCHHCE Q ss_conf 4213331001136623653023330232203453074533897217876710170 Q gi|254780919|r 85 RNSPTYGCWVSLEISANNGLQIYIPTGFAHGFMTLEMNTEVIYKVTDFYSVEHDS 139 (198) Q Consensus 85 ~~SpTfgk~~~~~Ls~~~~~~l~IP~G~aHGf~~L~d~~~i~Y~~~~~y~p~~e~ 139 (198) +...+.|++.+ ..|+|||+.||....++++++++-+||..|++++.. T Consensus 74 -------~~~~~~L~~~~-~gL~ipp~~W~~~~~~s~~sillVlas~~Y~~~DYI 120 (131) T pfam05523 74 -------NKKEILLDSPD-KGLYIPPMIWREMYNFSEGAVLLVLASDYYDEDDYI 120 (131) T ss_pred -------CEEEEEECCCC-CEEEECCCEEEEECCCCCCCEEEEEECCCCCHHHHH T ss_conf -------46999965999-689948987875305799979999908988968831 No 5 >TIGR03404 bicupin_oxalic bicupin, oxalate decarboxylase family. Members of this protein family are defined as bicupins as they have two copies of the cupin domain (pfam00190). Two different known activities for members of this family are oxalate decarboxylase (EC 4.1.1.2) and oxalate oxidase (EC 1.2.3.4), although the latter activity has more often been found in distantly related monocupin (germin) proteins. Probab=96.36 E-value=0.004 Score=38.47 Aligned_cols=103 Identities=23% Similarity=0.378 Sum_probs=70.1 Q ss_pred CCCEEEEEECHHHHHHCCCCCCEEEEEEEECCCCCEEEECCCCCCCCCHHHHEEECHHHHHHHHHHCCCCHHHHHHEEEE Q ss_conf 79504211088989867998741344233125200000001555554401210000222223433406421333100113 Q gi|254780919|r 17 SRGWFSQTYSSKLLKELGLQDVFVQDNHSFSFDCGTIRGLHFQRPPYAQAKLVRCIAGRIFDIAVDIRRNSPTYGCWVSL 96 (198) Q Consensus 17 ~RG~f~e~f~~~~~~~~~~~~~~~Q~n~S~s~~kgvlRGlH~Q~~p~~Q~Klv~ci~G~I~dvvvDlR~~SpTfgk~~~~ 96 (198) +-||-.|.-. .++-....+.-+|..+ .+|.+|-||++. ++.--+++.|++..-+||-. |+.+.- T Consensus 49 ~GGWaRe~tv----~~lP~s~~~AGVnMrL--~~GaiRELHWH~----~AEWayvl~G~~Ritavd~~------G~~~i~ 112 (367) T TIGR03404 49 NGGWAREVTV----RDLPISTAIAGVNMRL--EPGAIRELHWHK----EAEWAYVLYGSCRITAVDEN------GRNYID 112 (367) T ss_pred CCCCEEEEEH----HHCCCHHHHCCEEEEC--CCCCEEEEEECC----CCEEEEEECCEEEEEEECCC------CCEEEE T ss_conf 6871567671----1174402111413430--788245543045----00034365171799998899------976874 Q ss_pred EEECCCCCEEEECCCCCEEEEEECCCCEEEEEE-CCCCCCHH Q ss_conf 662365302333023220345307453389721-78767101 Q gi|254780919|r 97 EISANNGLQIYIPTGFAHGFMTLEMNTEVIYKV-TDFYSVEH 137 (198) Q Consensus 97 ~Ls~~~~~~l~IP~G~aHGf~~L~d~~~i~Y~~-~~~y~p~~ 137 (198) .+.+ +..=|-|+|++|..|.|++.|.++-.- ++.|+... T Consensus 113 dv~~--GDlWyFP~G~pHSiQgl~~g~EFLLvFDdG~FSE~~ 152 (367) T TIGR03404 113 DVGA--GDLWYFPPGIPHSLQGLDEGCEFLLVFDDGNFSEDG 152 (367) T ss_pred CCCC--CCEEECCCCCCCEECCCCCCCEEEEEECCCCCCCCC T ss_conf 2588--886214899973011058885799994588757556 No 6 >smart00835 Cupin_1 Cupin. This family represents the conserved barrel domain of the 'cupin' superfamily ('cupa' is the Latin term for a small barrel). This family contains 11S and 7S plant seed storage proteins, and germins. Plant seed storage proteins provide the major nitrogen source for the developing plant. Probab=95.11 E-value=0.059 Score=31.74 Aligned_cols=104 Identities=18% Similarity=0.203 Sum_probs=65.7 Q ss_pred EEEECCCEEEEEECHHHHHHC-CCCCCEEEEEEEECCCCCEEEECCCCCCCCCHHHHEEECHHHHHHHHHHCCCCHHHHH Q ss_conf 220379504211088989867-9987413442331252000000015555544012100002222234334064213331 Q gi|254780919|r 13 KFEDSRGWFSQTYSSKLLKEL-GLQDVFVQDNHSFSFDCGTIRGLHFQRPPYAQAKLVRCIAGRIFDIAVDIRRNSPTYG 91 (198) Q Consensus 13 ~f~D~RG~f~e~f~~~~~~~~-~~~~~~~Q~n~S~s~~kgvlRGlH~Q~~p~~Q~Klv~ci~G~I~dvvvDlR~~SpTfg 91 (198) .++.+-|++.+. +...+..+ +....... ..+ .+|.++-.|++... +.+++|++|+...-+|+.... T Consensus 6 ~~~~~~G~~~~~-~~~~~P~L~~~~is~~~--~~l--~pg~~~~PH~h~~A---~e~~~V~~G~~~~~~v~~~~~----- 72 (146) T smart00835 6 DFSNEGGRLREA-DPTNFPALNGLGISAAR--VNL--EPGGMLPPHYHPRA---TELLYVVRGEGRVGVVDPNGN----- 72 (146) T ss_pred CCCCCCEEEEEE-CHHHCCCCCCCCEEEEE--EEE--CCCCCCCCEECCCC---CEEEEEEEEEEEEEEECCCCC----- T ss_conf 510898189997-93439140338638999--999--89969266167999---899999963599999869996----- Q ss_pred HEEEEEEECCCCCEEEECCCCCEEEEEECC-CCEEEEEECC Q ss_conf 001136623653023330232203453074-5338972178 Q gi|254780919|r 92 CWVSLEISANNGLQIYIPTGFAHGFMTLEM-NTEVIYKVTD 131 (198) Q Consensus 92 k~~~~~Ls~~~~~~l~IP~G~aHGf~~L~d-~~~i~Y~~~~ 131 (198) +.....|.+ +..++||+|++|-.+..++ ...++.+-++ T Consensus 73 ~~~~~~l~~--Gdv~v~P~g~~h~~~n~g~~~~~~~~f~~~ 111 (146) T smart00835 73 KVYDARLRE--GDVFVVPQGHPHFQVNSGDENLEFVAFNTN 111 (146) T ss_pred EEEEEEECC--CEEEEECCCCEEEEECCCCCCEEEEEEECC T ss_conf 899999738--819998999769999279998899999678 No 7 >pfam00190 Cupin_1 Cupin. This family represents the conserved barrel domain of the 'cupin' superfamily ('cupa' is the Latin term for a small barrel). This family contains 11S and 7S plant seed storage proteins, and germins. Plant seed storage proteins provide the major nitrogen source for the developing plant. Probab=94.46 E-value=0.033 Score=33.18 Aligned_cols=108 Identities=16% Similarity=0.096 Sum_probs=62.4 Q ss_pred CEEEECCCEEEEEECHHHHHHCCCCCCEEEEEEEECCCCCEEEECCCCCCCCCHHHHEEECHHHHHHHHHHCCCCHHHHH Q ss_conf 62203795042110889898679987413442331252000000015555544012100002222234334064213331 Q gi|254780919|r 12 RKFEDSRGWFSQTYSSKLLKELGLQDVFVQDNHSFSFDCGTIRGLHFQRPPYAQAKLVRCIAGRIFDIAVDIRRNSPTYG 91 (198) Q Consensus 12 ~~f~D~RG~f~e~f~~~~~~~~~~~~~~~Q~n~S~s~~kgvlRGlH~Q~~p~~Q~Klv~ci~G~I~dvvvDlR~~SpTfg 91 (198) +.+.-+-|.+.+. +...+.... ........... .++|.++..||+.. -++++++++|+....+|+.-... T Consensus 8 ~~~~~~~G~~~~~-~~~~~p~l~-~~~~~~~~~~~-i~pg~~~~Ph~~~~---a~~i~~V~~G~~~~~~v~~~~~~---- 77 (139) T pfam00190 8 PVFNSEGGRLETA-NPNNLPGLL-GLAGSAVRRDL-IEPGGLLLPHYHPN---ATEILYVLQGRGRVGFVVPGCGF---- 77 (139) T ss_pred CCCCCCCEEEEEE-CCHHCHHHH-HCCCEEEEEEE-ECCCCEECCCCCCC---CCEEEEEEECEEEEEEEECCCCC---- T ss_conf 8400798799997-802273254-24327889865-06897943714799---98899999574999999689995---- Q ss_pred HEEEEEEECCCCCEEEECCCCCEEEEEECC-CCEEEEEECC Q ss_conf 001136623653023330232203453074-5338972178 Q gi|254780919|r 92 CWVSLEISANNGLQIYIPTGFAHGFMTLEM-NTEVIYKVTD 131 (198) Q Consensus 92 k~~~~~Ls~~~~~~l~IP~G~aHGf~~L~d-~~~i~Y~~~~ 131 (198) +....+|.+ +..++||+|++|-.....+ ...+..+.++ T Consensus 78 ~~~~~~l~~--Gdv~~vP~g~~~~~~n~g~~~~~~~~f~~~ 116 (139) T pfam00190 78 RVFHQKLRE--GDVFVVPAGFAHWQYNSGDPGLELVAFDTN 116 (139) T ss_pred EEEEEEECC--CCEEEECCCCEEEEEECCCCCEEEEEEECC T ss_conf 789978507--979998899869999169987899998468 No 8 >COG2140 Thermophilic glucose-6-phosphate isomerase and related metalloenzymes [Carbohydrate transport and metabolism / General function prediction only] Probab=94.26 E-value=0.073 Score=31.23 Aligned_cols=70 Identities=19% Similarity=0.300 Sum_probs=51.1 Q ss_pred CCCEEEECCCCCCCCCHHHHEEECHHHHHHHHHHCCCCHHHHHHEEEEEEECCCCCEEEECCCCCEEEEEECCCCEEEE Q ss_conf 2000000015555544012100002222234334064213331001136623653023330232203453074533897 Q gi|254780919|r 49 DCGTIRGLHFQRPPYAQAKLVRCIAGRIFDIAVDIRRNSPTYGCWVSLEISANNGLQIYIPTGFAHGFMTLEMNTEVIY 127 (198) Q Consensus 49 ~kgvlRGlH~Q~~p~~Q~Klv~ci~G~I~dvvvDlR~~SpTfgk~~~~~Ls~~~~~~l~IP~G~aHGf~~L~d~~~i~Y 127 (198) .+|..|-+|+|- +..+..+.+|++|+....+-+- . |+...+.+ +.+.++|||++.+|--.-..|...+.- T Consensus 88 t~G~~~~~H~Hp-~ade~E~y~vi~G~g~m~v~~~-----~-G~~~v~~~--~~Gd~iyVPp~~gH~t~N~Gd~pLvf~ 157 (209) T COG2140 88 TPGAMRELHYHP-NADEPEIYYVLKGEGRMLVQKP-----E-GEARVIAV--RAGDVIYVPPGYGHYTINTGDEPLVFL 157 (209) T ss_pred CCCCCCCCCCCC-CCCCCCEEEEEECCEEEEEECC-----C-CCEEEEEE--CCCCEEEECCCCCEEEECCCCCCEEEE T ss_conf 688642102189-9986318999916288999758-----9-95799982--589789948976437465899977999 No 9 >LOAD_DSBH consensus Probab=93.90 E-value=0.032 Score=33.28 Aligned_cols=60 Identities=22% Similarity=0.321 Sum_probs=44.0 Q ss_pred CCCEEEECCCCCCCCCHHHHEEECHHHHHHHHHHCCCCHHHHHHEEEEEEECCCCCEEEECCCCCEEEEEECCCC Q ss_conf 200000001555554401210000222223433406421333100113662365302333023220345307453 Q gi|254780919|r 49 DCGTIRGLHFQRPPYAQAKLVRCIAGRIFDIAVDIRRNSPTYGCWVSLEISANNGLQIYIPTGFAHGFMTLEMNT 123 (198) Q Consensus 49 ~kgvlRGlH~Q~~p~~Q~Klv~ci~G~I~dvvvDlR~~SpTfgk~~~~~Ls~~~~~~l~IP~G~aHGf~~L~d~~ 123 (198) .+|.....|++.. ...+++|++|++..++.| ...+.+. .+.+++||+|..|++...++.. T Consensus 6 ~pg~~~~~H~H~~---~~e~~~v~~G~~~~~~~~----------~~~~~~~--~Gd~~~ip~g~~H~~~n~~~~~ 65 (66) T LOAD_DSBH 6 EPGAEFEPHYHPD---ADEILYVLEGEAEVTVGD----------NERFLLE--PGDSIYIPAGVPHRFRNLSDEP 65 (66) T ss_pred CCCCCCCCCCCCC---CCEEEEEEECEEEEEECC----------EEEEEEC--CCCEEEECCCCCEECCCCCCCC T ss_conf 9998778742979---769999998949999999----------7999955--9989998998969289498888 No 10 >PRK04190 glucose-6-phosphate isomerase; Provisional Probab=90.32 E-value=0.32 Score=27.54 Aligned_cols=77 Identities=21% Similarity=0.271 Sum_probs=49.9 Q ss_pred CCCCCCCCHHHHEEECHHHHHHHHHHCCCCHHHHHHEEEEEEECCCCCEEEECCCCCEEEEEECCCCEEEEEE-----CC Q ss_conf 1555554401210000222223433406421333100113662365302333023220345307453389721-----78 Q gi|254780919|r 57 HFQRPPYAQAKLVRCIAGRIFDIAVDIRRNSPTYGCWVSLEISANNGLQIYIPTGFAHGFMTLEMNTEVIYKV-----TD 131 (198) Q Consensus 57 H~Q~~p~~Q~Klv~ci~G~I~dvvvDlR~~SpTfgk~~~~~Ls~~~~~~l~IP~G~aHGf~~L~d~~~i~Y~~-----~~ 131 (198) |||... .++-+-.|++|+-...+-| +. |.+...++.+ +..+||||++||--....+...+.-.| +. T Consensus 90 H~H~~~-~~~E~Y~~l~G~g~~Llq~-----~~-g~~~~~~~~~--G~~vyIPp~~aHrtVN~g~e~L~~~a~~p~daGh 160 (191) T PRK04190 90 HFHAVL-DRAEIYYGLKGKGVMLLQD-----PE-GEARWEEMTP--GTVHYVPPYWAHRSVNTGDEPLVFLACYPADAGH 160 (191) T ss_pred EECCCC-CCCCEEEEEECEEEEEEEC-----CC-CCEEEEEECC--CCEEEECCCEEEEEEECCCCCEEEEEEEECCCCC T ss_conf 105889-9874899981659999985-----99-9899999459--9689939972589870799748999996064365 Q ss_pred CCCCHHCEECC Q ss_conf 76710170216 Q gi|254780919|r 132 FYSVEHDSGVA 142 (198) Q Consensus 132 ~y~p~~e~~i~ 142 (198) .|..-.+.|.. T Consensus 161 dY~~I~~~Gf~ 171 (191) T PRK04190 161 DYGTIAEKGFS 171 (191) T ss_pred CCHHHHHCCCE T ss_conf 52889857972 No 11 >pfam08007 Cupin_4 Cupin superfamily protein. This family contains many hypothetical proteins that belong to the cupin superfamily. Probab=85.90 E-value=0.75 Score=25.42 Aligned_cols=35 Identities=26% Similarity=0.449 Sum_probs=26.8 Q ss_pred HEEEEEEECCCCCEEEECCCCCEEEEEECCCCEEEEEEC Q ss_conf 001136623653023330232203453074533897217 Q gi|254780919|r 92 CWVSLEISANNGLQIYIPTGFAHGFMTLEMNTEVIYKVT 130 (198) Q Consensus 92 k~~~~~Ls~~~~~~l~IP~G~aHGf~~L~d~~~i~Y~~~ 130 (198) -...++|.+ +-+||||+|++|-=.++++ .++|-++ T Consensus 175 p~~d~vLep--GD~LYiPpG~~H~g~a~~~--s~~~SvG 209 (320) T pfam08007 175 PVHDFVLEP--GDMLYLPRGFIHQGVALDD--SLHYSVG 209 (320) T ss_pred CCEEEEECC--CCEEEECCCCCCCCCCCCC--CEEEEEC T ss_conf 505899768--8879858998644302888--3588812 No 12 >COG1917 Uncharacterized conserved protein, contains double-stranded beta-helix domain [Function unknown] Probab=84.91 E-value=1.6 Score=23.51 Aligned_cols=70 Identities=21% Similarity=0.297 Sum_probs=47.5 Q ss_pred CCCEEEEEEEECCCCCEEEECCCCCCCCCHHHHEEECHHHHHHHHHHCCCCHHHHHHEEEEEEECCCCCEEEECCCCCEE Q ss_conf 87413442331252000000015555544012100002222234334064213331001136623653023330232203 Q gi|254780919|r 36 QDVFVQDNHSFSFDCGTIRGLHFQRPPYAQAKLVRCIAGRIFDIAVDIRRNSPTYGCWVSLEISANNGLQIYIPTGFAHG 115 (198) Q Consensus 36 ~~~~~Q~n~S~s~~kgvlRGlH~Q~~p~~Q~Klv~ci~G~I~dvvvDlR~~SpTfgk~~~~~Ls~~~~~~l~IP~G~aHG 115 (198) ...+.+..+++ .+|-.=.+|.+ | .....++|++|++.. .+. -...+|.+ +..++||+|..|+ T Consensus 40 ~~~~~~~~v~~--~~G~~~~~H~h--p-~~~~~~~Vl~G~~~~---~~~--------g~~~~l~~--Gd~i~ip~g~~H~ 101 (131) T COG1917 40 GENLSVVLVTF--EPGAVIPWHTH--P-LGEQTIYVLEGEGTV---QLE--------GEKKELKA--GDVIIIPPGVVHG 101 (131) T ss_pred CCEEEEEEEEE--CCCCCCCCCCC--C-CCCEEEEEEEEEEEE---EEC--------CEEEEECC--CCEEEECCCCCCC T ss_conf 82688999998--58983666029--9-863699999636999---978--------97699458--8589967999511 Q ss_pred EEEECCCC Q ss_conf 45307453 Q gi|254780919|r 116 FMTLEMNT 123 (198) Q Consensus 116 f~~L~d~~ 123 (198) +.+.++.. T Consensus 102 ~~a~~~~~ 109 (131) T COG1917 102 LKAVEDEP 109 (131) T ss_pred EECCCCCC T ss_conf 45179866 No 13 >pfam07883 Cupin_2 Cupin domain. This family represents the conserved barrel domain of the 'cupin' superfamily ('cupa' is the Latin term for a small barrel). Probab=84.50 E-value=1.1 Score=24.35 Aligned_cols=61 Identities=21% Similarity=0.268 Sum_probs=42.6 Q ss_pred CCCEEEECCCCCCCCCHHHHEEECHHHHHHHHHHCCCCHHHHHHEEEEEEECCCCCEEEECCCCCEEEEEECCCCEEE Q ss_conf 200000001555554401210000222223433406421333100113662365302333023220345307453389 Q gi|254780919|r 49 DCGTIRGLHFQRPPYAQAKLVRCIAGRIFDIAVDIRRNSPTYGCWVSLEISANNGLQIYIPTGFAHGFMTLEMNTEVI 126 (198) Q Consensus 49 ~kgvlRGlH~Q~~p~~Q~Klv~ci~G~I~dvvvDlR~~SpTfgk~~~~~Ls~~~~~~l~IP~G~aHGf~~L~d~~~i~ 126 (198) .+|.--.+|.+. . ..+++|++|++... +| ++ .+.|.+ +.++++|+|..|++...++..... T Consensus 6 ~PG~~~~~H~H~--~--~e~~~vl~G~~~~~-~~--------g~--~~~l~~--Gd~~~ip~~~~H~~~N~g~~~~~~ 66 (70) T pfam07883 6 PPGESSPPHRHP--G--EEFFYVLEGEGELT-VD--------GE--EVVLKA--GDSVYFPAGVPHRFRNTGDEPARL 66 (70) T ss_pred CCCCCCCCEECC--C--HHEEEEEEEEEEEE-EC--------CE--EEECCC--CCEEEECCCCEEEEEECCCCCEEE T ss_conf 999859889998--6--63899991279999-98--------88--998689--999998999949999998687999 No 14 >pfam06560 GPI Glucose-6-phosphate isomerase (GPI). This family consists of several bacterial and archaeal glucose-6-phosphate isomerase (GPI) proteins (EC:5.3.1.9). Probab=84.24 E-value=1.2 Score=24.15 Aligned_cols=86 Identities=13% Similarity=0.235 Sum_probs=48.5 Q ss_pred CCCEEEE------CCCCCCC---CCHHHHEEECHHHHHHHHHHCCCCHHHHHHEEEEEEECCCCCEEEECCCCCEEEEEE Q ss_conf 2000000------0155555---440121000022222343340642133310011366236530233302322034530 Q gi|254780919|r 49 DCGTIRG------LHFQRPP---YAQAKLVRCIAGRIFDIAVDIRRNSPTYGCWVSLEISANNGLQIYIPTGFAHGFMTL 119 (198) Q Consensus 49 ~kgvlRG------lH~Q~~p---~~Q~Klv~ci~G~I~dvvvDlR~~SpTfgk~~~~~Ls~~~~~~l~IP~G~aHGf~~L 119 (198) .+|++-+ =|||..+ .+..-+-.|++|+..-.+=+ ++..+-..+..+++ +.+..++|||++||--... T Consensus 57 ~PG~vG~E~~~T~GH~H~~~~~~~~~pEvY~~l~G~g~~LLQ~--~~g~~~~d~~~v~~--~~G~~v~IPP~~aHrtIN~ 132 (181) T pfam06560 57 PPGKIGKEYFKTRGHYHAQVDTRNDRPEVYQVLKGKALMLLQK--PEGGAVDDVRVIEM--EKGTVVYVPPYYGHRTINI 132 (181) T ss_pred CCCCCCCCEEECCCCCCCCCCCCCCCCCEEEEEECEEEEEEEE--CCCCCCEEEEEEEE--CCCCEEEECCCEEEEEECC T ss_conf 8975067350056512587888998762999991789999997--78997115999995--4898899799815999838 Q ss_pred CCCCEEEEE-E----CCCCCCHHC Q ss_conf 745338972-1----787671017 Q gi|254780919|r 120 EMNTEVIYK-V----TDFYSVEHD 138 (198) Q Consensus 120 ~d~~~i~Y~-~----~~~y~p~~e 138 (198) ++...+.-. + +..|.+-.+ T Consensus 133 g~~~L~~~~~~~~~~g~dY~~i~~ 156 (181) T pfam06560 133 GDEPLVMLNWYPADAGSDYDTIAK 156 (181) T ss_pred CCCCEEEEEEEECCCCCCCHHHHH T ss_conf 998479999984754565487885 No 15 >KOG2107 consensus Probab=79.85 E-value=1.2 Score=24.33 Aligned_cols=51 Identities=24% Similarity=0.490 Sum_probs=39.7 Q ss_pred HHHHEEECHHHHHHHHHHCCCCHHHHHHEEEEEEECCCCCEEEECCCCCEEEEEECCCC Q ss_conf 01210000222223433406421333100113662365302333023220345307453 Q gi|254780919|r 65 QAKLVRCIAGRIFDIAVDIRRNSPTYGCWVSLEISANNGLQIYIPTGFAHGFMTLEMNT 123 (198) Q Consensus 65 Q~Klv~ci~G~I~dvvvDlR~~SpTfgk~~~~~Ls~~~~~~l~IP~G~aHGf~~L~d~~ 123 (198) ..-+=+|+.|.-+ .|.|..+ .+|..+-+ +.+-.+++|+|..|-|-+-++|- T Consensus 93 deeiR~il~GtgY---fDVrd~d---d~WIRi~v--ekGDlivlPaGiyHRFTtt~~n~ 143 (179) T KOG2107 93 DEEIRYILEGTGY---FDVRDKD---DQWIRIFV--EKGDLIVLPAGIYHRFTTTPSNY 143 (179) T ss_pred HHHEEEEEECCEE---EEECCCC---CCEEEEEE--ECCCEEEECCCCEEEEECCCHHH T ss_conf 0314778606357---7611478---87799999--43988996686225424271477 No 16 >COG3822 ABC-type sugar transport system, auxiliary component [General function prediction only] Probab=79.54 E-value=2.2 Score=22.67 Aligned_cols=119 Identities=24% Similarity=0.302 Sum_probs=64.4 Q ss_pred EEECCCEEEEEECHHHHHHCCCCCCEEEEEEE---------------ECCCCCEEEECCCCCCC------CCHHHHEEEC Q ss_conf 20379504211088989867998741344233---------------12520000000155555------4401210000 Q gi|254780919|r 14 FEDSRGWFSQTYSSKLLKELGLQDVFVQDNHS---------------FSFDCGTIRGLHFQRPP------YAQAKLVRCI 72 (198) Q Consensus 14 f~D~RG~f~e~f~~~~~~~~~~~~~~~Q~n~S---------------~s~~kgvlRGlH~Q~~p------~~Q~Klv~ci 72 (198) |.-.-|+=..-|....+.+.|+.. |.-.|-| ++ ++|-+--||++-.. .++.+|| |- T Consensus 46 fd~~lgwditdyG~~kf~~~GL~L-Ft~rNg~~~~~~~~~~yaeKiM~v-r~gQvtPmHrH~~k~eDiinrgggtlv-~e 122 (225) T COG3822 46 FDLKLGWDITDYGLNKFAKVGLTL-FTLRNGSAYGMPYGKCYAEKIMHV-RPGQVTPMHRHWRKPEDIINRGGGTLV-VE 122 (225) T ss_pred HHHHHCEEEECCCCCHHHHCCEEE-EEEECCCCCCCCCCCCCHHEEEEE-CCCCCCCCCCCCCCHHHHHHCCCCEEE-EE T ss_conf 543423333001551145406089-999668623575653121015774-268747520266665554206885599-99 Q ss_pred HHHHH---------HHHHHCCCCHHHHHHEEEEEEECCCCCEEEECCCCCEEEEEECCCCEEEEEECCCCCCHHC Q ss_conf 22222---------3433406421333100113662365302333023220345307453389721787671017 Q gi|254780919|r 73 AGRIF---------DIAVDIRRNSPTYGCWVSLEISANNGLQIYIPTGFAHGFMTLEMNTEVIYKVTDFYSVEHD 138 (198) Q Consensus 73 ~G~I~---------dvvvDlR~~SpTfgk~~~~~Ls~~~~~~l~IP~G~aHGf~~L~d~~~i~Y~~~~~y~p~~e 138 (198) -+... +|.|-+...-.|+-.+..+.|++ +.++.+|||..|.|.+=+.. +++=-+|..-+-.++ T Consensus 123 l~~~d~~~~~~~ks~vtv~~dg~r~~~~ag~~lkL~P--GesitL~Pg~~HsFwae~g~-vlvgEvSsvndD~hD 194 (225) T COG3822 123 LWNVDLVEGQDEKSDVTVPVDGCRQTHTAGSQLKLSP--GESITLPPGLYHSFWAEEGG-VLVGEVSSVNDDLHD 194 (225) T ss_pred EECCCCCCCCCCCCCEEECCCCCEEEECCCEEEEECC--CCCEECCCCCEEEEEECCCC-EEEEEEEECCCCCCC T ss_conf 8521355673788773765787677761550688779--97176288740001206786-899987513575542 No 17 >COG3542 Uncharacterized conserved protein [Function unknown] Probab=76.44 E-value=2.1 Score=22.89 Aligned_cols=103 Identities=17% Similarity=0.165 Sum_probs=49.4 Q ss_pred CCCEEEEEECHHHHHHCCCCC--CEEEEEEEECCCCCEEEECCCCCCCCCHHHHEEECHHHHHHHHHHCCCCHHHHHHEE Q ss_conf 795042110889898679987--413442331252000000015555544012100002222234334064213331001 Q gi|254780919|r 17 SRGWFSQTYSSKLLKELGLQD--VFVQDNHSFSFDCGTIRGLHFQRPPYAQAKLVRCIAGRIFDIAVDIRRNSPTYGCWV 94 (198) Q Consensus 17 ~RG~f~e~f~~~~~~~~~~~~--~~~Q~n~S~s~~kgvlRGlH~Q~~p~~Q~Klv~ci~G~I~dvvvDlR~~SpTfgk~~ 94 (198) +-|+|.|+|++++....|.+. .-.-+++-+ .+..+-.||- .. -+.++.-..|.=+.+.+ +. -|+.. T Consensus 20 EGG~y~eT~rs~e~~~~G~~~R~~sT~IYyLL--e~~~~s~~HR--v~--a~eiwHf~ag~pl~~~l-----~~-dG~~~ 87 (162) T COG3542 20 EGGFYRETFRSEEKDAAGGDVRNHSTAIYYLL--EEDNISAWHR--VT--ADEIWHFYAGAPLELHL-----SE-DGGAE 87 (162) T ss_pred CCCCEEEEECCCCCCCCCCCCCCEEEEEEEEE--CCCCCCHHEE--CC--HHHEEEEECCCCEEEEE-----EE-CCCEE T ss_conf 88726530013235578866344037999985--2786110122--34--32357772399659999-----81-79768 Q ss_pred EEEE----ECCCCCEEEECCC-CCEEEEEEC-CCCEEEEEECC Q ss_conf 1366----2365302333023-220345307-45338972178 Q gi|254780919|r 95 SLEI----SANNGLQIYIPTG-FAHGFMTLE-MNTEVIYKVTD 131 (198) Q Consensus 95 ~~~L----s~~~~~~l~IP~G-~aHGf~~L~-d~~~i~Y~~~~ 131 (198) ++.| .....-+..||.| ++-.+.++. +.+.|-..+.- T Consensus 88 s~~LG~d~~~Ge~~Q~vVP~g~w~aS~~~~g~~~tLVgCtVaP 130 (162) T COG3542 88 SFTLGPDLEKGERPQYVVPAGTWWASAVSLGEDYTLVGCTVAP 130 (162) T ss_pred EEEECCCCCCCCEEEEEEECCCEEEEEEECCCCCEEEEEEECC T ss_conf 9995532037845579975784887787138980698877517 No 18 >COG4101 Predicted mannose-6-phosphate isomerase [Carbohydrate transport and metabolism] Probab=74.89 E-value=5 Score=20.66 Aligned_cols=86 Identities=16% Similarity=0.136 Sum_probs=52.4 Q ss_pred CCCEEEECCCCCCCCCHHHHEEECHHHHHHHHHHCCCCHHHHHHEEEEEEECCCCCEEEECCCCCEEEEEECCCCEEEEE Q ss_conf 20000000155555440121000022222343340642133310011366236530233302322034530745338972 Q gi|254780919|r 49 DCGTIRGLHFQRPPYAQAKLVRCIAGRIFDIAVDIRRNSPTYGCWVSLEISANNGLQIYIPTGFAHGFMTLEMNTEVIYK 128 (198) Q Consensus 49 ~kgvlRGlH~Q~~p~~Q~Klv~ci~G~I~dvvvDlR~~SpTfgk~~~~~Ls~~~~~~l~IP~G~aHGf~~L~d~~~i~Y~ 128 (198) -+|.--+-|.+... .--|++++|+.- --||....+..-..-+..+|||+|+-|-=.-+++.....-. T Consensus 54 ~pgAkakaH~H~~h---EtaIYvlsG~ah----------~w~G~rLE~ha~~~pGDf~YiPpgVPHqp~N~S~ep~s~vI 120 (142) T COG4101 54 PPGAKAKAHLHEEH---ETAIYVLSGEAH----------TWYGNRLEEHAEVGPGDFFYIPPGVPHQPANLSTEPLSAVI 120 (142) T ss_pred CCCCCCCCCCCCCC---CEEEEEEECEEE----------EEECCCEEEEEEECCCCEEECCCCCCCCCCCCCCCCEEEEE T ss_conf 89851120002564---078999834044----------31011202467744887698389998764546789767999 Q ss_pred ECCCCCC-HHCEECCCCCCC Q ss_conf 1787671-017021688801 Q gi|254780919|r 129 VTDFYSV-EHDSGVAWQDKS 147 (198) Q Consensus 129 ~~~~y~p-~~e~~i~~~Dp~ 147 (198) .-..-++ +.-..+.|.|+- T Consensus 121 aRsDp~~~Esv~~lpelD~l 140 (142) T COG4101 121 ARSDPNPQESVQLLPELDPL 140 (142) T ss_pred ECCCCCCCCCCEEECCCCCC T ss_conf 82599977683770012456 No 19 >PRK10296 DNA-binding transcriptional regulator ChbR; Provisional Probab=68.98 E-value=6.1 Score=20.17 Aligned_cols=51 Identities=20% Similarity=0.342 Sum_probs=35.1 Q ss_pred CCCEEEECCCCCCCCCHHHHEEECHHHHHHHHHHCCCCHHHHHHEEEEEEECCCCCEEEECCCCCEEE Q ss_conf 20000000155555440121000022222343340642133310011366236530233302322034 Q gi|254780919|r 49 DCGTIRGLHFQRPPYAQAKLVRCIAGRIFDIAVDIRRNSPTYGCWVSLEISANNGLQIYIPTGFAHGF 116 (198) Q Consensus 49 ~kgvlRGlH~Q~~p~~Q~Klv~ci~G~I~dvvvDlR~~SpTfgk~~~~~Ls~~~~~~l~IP~G~aHGf 116 (198) ++-.+.|||+|.. --+++|.+|+....+ | -..+.|.+ |..++||+|..|-. T Consensus 24 ~~~~~~~lH~Hd~----~Ei~yv~~G~~~~~i-n----------~~~~~l~~--Gdli~I~~~~~H~~ 74 (271) T PRK10296 24 KTESVSGLHQHDY----YEFTLVLTGRYYQEI-N----------GKRVLLER--GDFVFIPLGSHHQS 74 (271) T ss_pred CCCCCCCCEECCC----EEEEEEECCEEEEEE-C----------CEEEEEEC--CEEEEECCCCCEEE T ss_conf 8778998866686----899999767489999-9----------99999708--98999829972432 No 20 >COG0662 {ManC} Mannose-6-phosphate isomerase [Carbohydrate transport and metabolism] Probab=61.33 E-value=14 Score=18.08 Aligned_cols=76 Identities=25% Similarity=0.254 Sum_probs=50.3 Q ss_pred CCCEEEECCCCCCCCCHHHHEEECHHHHHHHHHHCCCCHHHHHHEEEEEEECCCCCEEEECCCCCEEEEEECCCCEEEEE Q ss_conf 20000000155555440121000022222343340642133310011366236530233302322034530745338972 Q gi|254780919|r 49 DCGTIRGLHFQRPPYAQAKLVRCIAGRIFDIAVDIRRNSPTYGCWVSLEISANNGLQIYIPTGFAHGFMTLEMNTEVIYK 128 (198) Q Consensus 49 ~kgvlRGlH~Q~~p~~Q~Klv~ci~G~I~dvvvDlR~~SpTfgk~~~~~Ls~~~~~~l~IP~G~aHGf~~L~d~~~i~Y~ 128 (198) ++|---++|.+.. -+..+++++|... ++++ -..++|.+ +.+++||+|..|-..........++- T Consensus 44 ~pg~~~~~~~H~~---~dE~~~Vl~G~g~---v~~~--------~~~~~v~~--gd~~~iP~g~~H~~~N~G~~~L~lie 107 (127) T COG0662 44 KPGEEISLHHHHH---RDEHWYVLEGTGK---VTIG--------GEEVEVKA--GDSVYIPAGTPHRVRNTGKIPLVLIE 107 (127) T ss_pred CCCCCCCCCCCCC---CCEEEEEEEEEEE---EEEC--------CEEEEECC--CCEEEECCCCCEECCCCCCCCEEEEE T ss_conf 6798547322477---2229999955699---9999--------99999748--99999899996833279960369999 Q ss_pred E-CCCCCCHHCEE Q ss_conf 1-78767101702 Q gi|254780919|r 129 V-TDFYSVEHDSG 140 (198) Q Consensus 129 ~-~~~y~p~~e~~ 140 (198) + +..|..+.+.. T Consensus 108 i~~p~~~~e~~~~ 120 (127) T COG0662 108 VQSPPYLGEDDIV 120 (127) T ss_pred EECCCCCCCCEEE T ss_conf 8268816766289 No 21 >pfam05694 SBP56 56kDa selenium binding protein (SBP56). This family consists of several eukaryotic selenium binding proteins as well as three sequences from archaea. The exact function of this protein is unknown although it is thought that SBP56 participates in late stages of intra-Golgi protein transport. The Lotus japonicus homologue of SBP56, LjSBP is thought to have more than one physiological role and can be implicated in controlling the oxidation/reduction status of target proteins, in vesicular Golgi transport. Probab=56.50 E-value=2.7 Score=22.23 Aligned_cols=12 Identities=58% Similarity=0.977 Sum_probs=4.5 Q ss_pred HHCCCCHHHHHH Q ss_conf 340642133310 Q gi|254780919|r 81 VDIRRNSPTYGC 92 (198) Q Consensus 81 vDlR~~SpTfgk 92 (198) ||++++||||+| T Consensus 42 VDvdP~S~TY~q 53 (461) T pfam05694 42 VDVDPESPTYGQ 53 (461) T ss_pred EECCCCCCCCCC T ss_conf 868999998531 No 22 >PRK13502 transcriptional activator RhaR; Provisional Probab=44.01 E-value=25 Score=16.65 Aligned_cols=50 Identities=18% Similarity=0.226 Sum_probs=35.9 Q ss_pred ECCCCCCCCCHHHHEEECHHHHHHHHHHCCCCHHHHHHEEEEEEECCCCCEEEECCCCCEEEEEECC Q ss_conf 0015555544012100002222234334064213331001136623653023330232203453074 Q gi|254780919|r 55 GLHFQRPPYAQAKLVRCIAGRIFDIAVDIRRNSPTYGCWVSLEISANNGLQIYIPTGFAHGFMTLEM 121 (198) Q Consensus 55 GlH~Q~~p~~Q~Klv~ci~G~I~dvvvDlR~~SpTfgk~~~~~Ls~~~~~~l~IP~G~aHGf~~L~d 121 (198) +.|.|. .-.++.|.+|+....+ | -..+.|.+ |..++||||-.|.+.+.++ T Consensus 32 p~H~Hd----~~El~~v~~G~g~~~i-~----------~~~y~l~~--Gdl~lI~p~~~H~~~~~~~ 81 (282) T PRK13502 32 AEHTHE----FCELVMVWRGNGLHVL-N----------ERPYRITR--GDLFYIRAEDKHSYTSVND 81 (282) T ss_pred CCCCCC----EEEEEEEECCEEEEEE-C----------CEEEEECC--CEEEEECCCCEEEEECCCC T ss_conf 986207----1999999847299999-9----------99999529--9899999998581341788 No 23 >pfam06172 Cupin_5 Cupin superfamily (DUF985). Family of uncharacterized proteins found in bacteria and eukaryotes that belongs to the Cupin superfamily. Probab=43.70 E-value=27 Score=16.41 Aligned_cols=113 Identities=14% Similarity=0.079 Sum_probs=58.4 Q ss_pred EECCEEEECCCEEEEEECHHHHHHC----CCCCCEEEEEEEECCCCCEEEECCCCCCCCCHHHHEEECHHHHHHHHHHCC Q ss_conf 7076220379504211088989867----998741344233125200000001555554401210000222223433406 Q gi|254780919|r 9 LKTRKFEDSRGWFSQTYSSKLLKEL----GLQDVFVQDNHSFSFDCGTIRGLHFQRPPYAQAKLVRCIAGRIFDIAVDIR 84 (198) Q Consensus 9 i~~~~f~D~RG~f~e~f~~~~~~~~----~~~~~~~Q~n~S~s~~kgvlRGlH~Q~~p~~Q~Klv~ci~G~I~dvvvDlR 84 (198) +++..|. +-|+|.|+|++...-.. +......-+.+-++ ++-+=.||- -. .+-++.-..|.-+.+.+ +. T Consensus 8 L~L~pHP-EGG~yret~rs~~~~~~~~~~~~R~~~TsIyfLL~--~~~~S~~Hr--v~--sdEiw~~~~G~pl~l~~-~~ 79 (139) T pfam06172 8 LGLEPHP-EGGFYRETYRSPGTFLGNSPLPPRAASTSIYFLLT--AGDVSAFHR--ID--ADEVWHFHAGDPLTLHL-IS 79 (139) T ss_pred CCCCCCC-CCCCEEEEEECCCCCCCCCCCCCCCCEEEEEEEEC--CCCCCEEEE--EC--CCEEEEEECCCCEEEEE-EC T ss_conf 6996189-98568989988884467788887454038999946--999880699--07--87589998799889999-87 Q ss_pred CCHHHHHHEEEEEEECC----CCCEEEECCCCCEEEEEE--CCCCEEEEEECCCC Q ss_conf 42133310011366236----530233302322034530--74533897217876 Q gi|254780919|r 85 RNSPTYGCWVSLEISAN----NGLQIYIPTGFAHGFMTL--EMNTEVIYKVTDFY 133 (198) Q Consensus 85 ~~SpTfgk~~~~~Ls~~----~~~~l~IP~G~aHGf~~L--~d~~~i~Y~~~~~y 133 (198) + -|+...+.|.++ ..-++.||+|.+-|-... .+-+.+...+.=-| T Consensus 80 ~----dg~~~~~~LG~~~~~ge~~q~vVp~G~w~aa~l~~~~~~~Lvsc~VaPGF 130 (139) T pfam06172 80 P----DGELETTVLGLDLAAGERPQYVVPAGVWFASRLESLGGFSLVGCTVAPGF 130 (139) T ss_pred C----CCCEEEEEECCCCCCCCCEEEEECCCEEEEEEECCCCCCEEEEEEECCCC T ss_conf 9----99888999558865587258887799599878879998199997734979 No 24 >TIGR02279 PaaC-3OHAcCoADH 3-hydroxyacyl-CoA dehydrogenase PaaC; InterPro: IPR011967 This 3-hydroxyacyl-CoA dehydrogenase is involved in the degradation of phenylacetic acid, presumably in steps following the opening of the phenyl ring . The sequences included in this entry are all found in possible operons with other related genes such as paaA, paaB, paaD, paaE, paaF and paaN . Some genomes contain these other genes without an apparent paaC in the same operon - possibly in these cases a different dehydrogenase involved in fatty acid degradation may fill in the needed activity. . Probab=42.52 E-value=12 Score=18.51 Aligned_cols=27 Identities=30% Similarity=0.438 Sum_probs=23.3 Q ss_pred CCCCEEEECCCCCCCCCHHHHEEECHHH Q ss_conf 5200000001555554401210000222 Q gi|254780919|r 48 FDCGTIRGLHFQRPPYAQAKLVRCIAGR 75 (198) Q Consensus 48 ~~kgvlRGlH~Q~~p~~Q~Klv~ci~G~ 75 (198) .+++-+=||||= +|-|-.|||-|++|- T Consensus 133 arP~rvaGlHFF-NPAP~MaLVEVVSGL 159 (508) T TIGR02279 133 ARPERVAGLHFF-NPAPVMALVEVVSGL 159 (508) T ss_pred CCCCEEEECCCC-CCCCCCCEEEEECCC T ss_conf 387707511134-865536215662363 No 25 >PRK13501 transcriptional activator RhaR; Provisional Probab=41.82 E-value=14 Score=18.16 Aligned_cols=50 Identities=18% Similarity=0.232 Sum_probs=36.6 Q ss_pred ECCCCCCCCCHHHHEEECHHHHHHHHHHCCCCHHHHHHEEEEEEECCCCCEEEECCCCCEEEEEECC Q ss_conf 0015555544012100002222234334064213331001136623653023330232203453074 Q gi|254780919|r 55 GLHFQRPPYAQAKLVRCIAGRIFDIAVDIRRNSPTYGCWVSLEISANNGLQIYIPTGFAHGFMTLEM 121 (198) Q Consensus 55 GlH~Q~~p~~Q~Klv~ci~G~I~dvvvDlR~~SpTfgk~~~~~Ls~~~~~~l~IP~G~aHGf~~L~d 121 (198) .+|.|. .-.++.|.+|+...++=| ..+.+.+ |..++|+||--|++.+.++ T Consensus 32 ~~H~H~----~~Ei~~v~~G~g~~~i~~-----------~~~~l~~--Gdl~~i~p~~~H~~~~~~~ 81 (290) T PRK13501 32 VEHTHQ----FCEIVIVWRGNGLHVLND-----------HPYRITC--GDVFYIQAADHHSYESVHD 81 (290) T ss_pred CCCCCC----CEEEEEEECCEEEEEECC-----------EEEEEEC--CEEEEECCCCCCCCCCCCC T ss_conf 981358----679999974659999999-----------9999648--9799989998457545778 No 26 >TIGR01896 cas_AF1879 CRISPR-associated protein, Csa1 family; InterPro: IPR009260 This family consists of several archaeal strongly conserved proteins whose genes are associated with CRISPRs (Clustered, Regularly Interspaced Short Palidromic Repeats). The function of these proteins has not been experimentally determined, but computational analysis has suggested that they may function as nucleases in DNA repair, similar to RecB (IPR004586 from INTERPRO) . . Probab=41.40 E-value=9.1 Score=19.16 Aligned_cols=39 Identities=23% Similarity=0.416 Sum_probs=21.8 Q ss_pred CEEEECCCCCCCCCHHHH-EEECHHHHHHHHH---HCCCCHHHHHHEE Q ss_conf 000000155555440121-0000222223433---4064213331001 Q gi|254780919|r 51 GTIRGLHFQRPPYAQAKL-VRCIAGRIFDIAV---DIRRNSPTYGCWV 94 (198) Q Consensus 51 gvlRGlH~Q~~p~~Q~Kl-v~ci~G~I~dvvv---DlR~~SpTfgk~~ 94 (198) -=||||||..||-.=--+ +.. ++-|++. +.+++ +|+|.. T Consensus 15 ~ELRGW~y~~PPv~P~~y~~~L---s~SD~a~g~C~~~Rd--vYLkrV 57 (295) T TIGR01896 15 EELRGWNYNEPPVKPPAYALGL---SLSDVAYGYCPTGRD--VYLKRV 57 (295) T ss_pred CCCCCCCCCCCCCCCCCCCCCC---HHHHHHCCCCCCCCC--EEEEEE T ss_conf 0147654785578898530122---144552676677543--345411 No 27 >pfam09313 DUF1971 Domain of unknown function (DUF1971). Members of this family of functionally uncharacterized domains are predominantly found in bacterial Tellurite resistance protein. Probab=38.45 E-value=28 Score=16.36 Aligned_cols=52 Identities=19% Similarity=0.134 Sum_probs=35.9 Q ss_pred EEECHHHHHHHHHHCCCCHHHHHHEEEEEEECCCCCEEEECCCCCEEEEEECCCCEEE Q ss_conf 0000222223433406421333100113662365302333023220345307453389 Q gi|254780919|r 69 VRCIAGRIFDIAVDIRRNSPTYGCWVSLEISANNGLQIYIPTGFAHGFMTLEMNTEVI 126 (198) Q Consensus 69 v~ci~G~I~dvvvDlR~~SpTfgk~~~~~Ls~~~~~~l~IP~G~aHGf~~L~d~~~i~ 126 (198) ++|++|++.-+..|-+.+ ......+.+. +..-+|||..+|---.++|++.+. T Consensus 29 l~Vl~G~L~f~~~~e~~~-----~~~~~~~~~~-~~~~~i~Pq~wH~Ve~lsdD~~f~ 80 (82) T pfam09313 29 LTVLEGELKFYGLDEDGE-----EPEEEVFIAA-GQPPVIPPQYWHKVEPLSDDTEFQ 80 (82) T ss_pred EEEEEEEEEEEEECCCCC-----CCEEEEEECC-CCCCCCCCCCCEEEEECCCCEEEE T ss_conf 999972799999888998-----4207999569-997431898638988768986998 No 28 >COG2850 Uncharacterized conserved protein [Function unknown] Probab=37.35 E-value=18 Score=17.53 Aligned_cols=31 Identities=29% Similarity=0.330 Sum_probs=21.9 Q ss_pred CCCCEEEECCCCCEEEEEECCCCEEEEEECCCC Q ss_conf 653023330232203453074533897217876 Q gi|254780919|r 101 NNGLQIYIPTGFAHGFMTLEMNTEVIYKVTDFY 133 (198) Q Consensus 101 ~~~~~l~IP~G~aHGf~~L~d~~~i~Y~~~~~y 133 (198) +.+.+||||||++|-=+++ ++ .+.|-+...+ T Consensus 184 epGDiLYiPp~~~H~gvae-~d-c~tySvG~r~ 214 (383) T COG2850 184 EPGDILYIPPGFPHYGVAE-DD-CMTYSVGFRA 214 (383) T ss_pred CCCCEEECCCCCCCCCCCC-CC-CCCEEEECCC T ss_conf 7886542389877678554-45-5634661348 No 29 >KOG2131 consensus Probab=36.85 E-value=16 Score=17.74 Aligned_cols=87 Identities=23% Similarity=0.395 Sum_probs=49.8 Q ss_pred EECHHHHHHCCCC-----CCEEEEEEEECCCCCEEEECC---CCCCCCCHHHHEEECHHHHHHHHHHCC----------- Q ss_conf 1088989867998-----741344233125200000001---555554401210000222223433406----------- Q gi|254780919|r 24 TYSSKLLKELGLQ-----DVFVQDNHSFSFDCGTIRGLH---FQRPPYAQAKLVRCIAGRIFDIAVDIR----------- 84 (198) Q Consensus 24 ~f~~~~~~~~~~~-----~~~~Q~n~S~s~~kgvlRGlH---~Q~~p~~Q~Klv~ci~G~I~dvvvDlR----------- 84 (198) .|.++++.+..+. +.|+ +-.-+|+--++| |....|+ ..+.|+-...+++.- T Consensus 181 ~F~~dwlne~~i~~~~ddyrFv-----y~Gp~gSwtp~HaDVf~s~swS-----~nicG~KrWl~~pP~qe~~l~dr~gn 250 (427) T KOG2131 181 LFAKDWLNEFVIDGESDDYRFV-----YAGPAGSWTPFHADVFHSPSWS-----VNICGRKRWLLYPPEQEQTLADRYGN 250 (427) T ss_pred HHHHCCCCHHHCCCCCCCEEEE-----EECCCCCCCCCCHHHHCCCCCE-----EEEECCEEEEEECHHHHHHHHHHCCC T ss_conf 5543365222134788754689-----8636788786532010377631-----56623056887371776556664057 Q ss_pred --------CCHHHHHHEEEEEEECCCCCEEEECCCCCEEEEEECCC Q ss_conf --------42133310011366236530233302322034530745 Q gi|254780919|r 85 --------RNSPTYGCWVSLEISANNGLQIYIPTGFAHGFMTLEMN 122 (198) Q Consensus 85 --------~~SpTfgk~~~~~Ls~~~~~~l~IP~G~aHGf~~L~d~ 122 (198) +.--+-+.+ .++..+-+.++|||.|.-|-..-|+|. T Consensus 251 lp~~~~~~~ld~~~~~~--lei~Qepge~VFvPsGW~hQV~NL~dT 294 (427) T KOG2131 251 LPLPSWITKLDLFRGPL--LEIFQEPGETVFVPSGWHHQVLNLGDT 294 (427) T ss_pred CCCCCCCCCCCCCCCCH--HHHHCCCCCEEECCCCCCCCCCCCCCE T ss_conf 67764322110135644--444306773365468655211146662 No 30 >PRK10371 DNA-binding transcriptional regulator MelR; Provisional Probab=36.84 E-value=32 Score=16.01 Aligned_cols=56 Identities=11% Similarity=0.059 Sum_probs=40.8 Q ss_pred CCCEEEECCCCCCCCCHHHHEEECHHHHHHHHHHCCCCHHHHHHEEEEEEECCCCCEEEECCCCCEEEEEECC Q ss_conf 2000000015555544012100002222234334064213331001136623653023330232203453074 Q gi|254780919|r 49 DCGTIRGLHFQRPPYAQAKLVRCIAGRIFDIAVDIRRNSPTYGCWVSLEISANNGLQIYIPTGFAHGFMTLEM 121 (198) Q Consensus 49 ~kgvlRGlH~Q~~p~~Q~Klv~ci~G~I~dvvvDlR~~SpTfgk~~~~~Ls~~~~~~l~IP~G~aHGf~~L~d 121 (198) -+.+.-..|+|. +-.+++|++|++. +.|| -..+.|.+ |..++|++|.-|.....++ T Consensus 34 ~p~~m~~~HwH~----~iEi~yv~~G~~~-~~I~----------~~~y~l~~--Gd~iiIns~~lH~~~~~~~ 89 (302) T PRK10371 34 PPHIMPTSHWHG----QVEVNVPFDGDVE-YLIN----------NEKVNINQ--GHITLFWACTPHQLTDTGN 89 (302) T ss_pred CCCCCCCCCCCC----CEEEEEEECCEEE-EEEC----------CEEEEECC--CEEEEECCCCCEEEECCCC T ss_conf 899999886688----7899999815799-9999----------99999769--8699998999876535999 No 31 >pfam01238 PMI_typeI Phosphomannose isomerase type I. This is a family of Phosphomannose isomerase type I enzymes (EC 5.3.1.8). Probab=34.07 E-value=39 Score=15.50 Aligned_cols=21 Identities=10% Similarity=0.507 Sum_probs=16.9 Q ss_pred EEEECCCCCEEEECCCCCEEEEE Q ss_conf 36623653023330232203453 Q gi|254780919|r 96 LEISANNGLQIYIPTGFAHGFMT 118 (198) Q Consensus 96 ~~Ls~~~~~~l~IP~G~aHGf~~ 118 (198) +.|. .+.++|+|+|.-|+|+. T Consensus 252 v~L~--PGeAlfl~Ag~~HAYL~ 272 (373) T pfam01238 252 VKLN--PGEAMFLKANDPHAYLS 272 (373) T ss_pred EEEC--CCCEEEECCCCCEEEEC T ss_conf 7606--98778736999628754 No 32 >pfam03079 ARD ARD/ARD' family. The two acireductone dioxygenase enzymes (ARD and ARD', previously known as E-2 and E-2') from Klebsiella pneumoniae share the same amino acid sequence, but bind different metal ions: ARD binds Ni2+, ARD' binds Fe2+. ARD and ARD' can be experimentally interconverted by removal of the bound metal ion and reconstitution with the appropriate metal ion. The two enzymes share the same substrate, 1,2-dihydroxy-3-keto-5-(methylthio)pentene, but yield different products. ARD' yields the alpha-keto precursor of methionine (and formate), thus forming part of the ubiquitous methionine salvage pathway that converts 5'-methylthioadenosine (MTA) to methionine. This pathway is responsible for the tight control of the concentration of MTA, which is a powerful inhibitor of polyamine biosynthesis and transmethylation reactions. ARD yields methylthiopropanoate, carbon monoxide and formate, and thus prevents the conversion of MTA to methionine. The role of the ARD catalysed Probab=33.55 E-value=40 Score=15.45 Aligned_cols=65 Identities=20% Similarity=0.333 Sum_probs=42.9 Q ss_pred EEEECCCCCCCCCHHHHEEECHHHHHHHHHHCCCCHHHHHHEEEEEEECCCCCEEEECCCCCEEEEEECCCCE Q ss_conf 0000015555544012100002222234334064213331001136623653023330232203453074533 Q gi|254780919|r 52 TIRGLHFQRPPYAQAKLVRCIAGRIFDIAVDIRRNSPTYGCWVSLEISANNGLQIYIPTGFAHGFMTLEMNTE 124 (198) Q Consensus 52 vlRGlH~Q~~p~~Q~Klv~ci~G~I~dvvvDlR~~SpTfgk~~~~~Ls~~~~~~l~IP~G~aHGf~~L~d~~~ 124 (198) .++..-|+..-+...-.=+|+.|+-+ .|+|-. -++|..+.+. .+-.|.||+|.-|=|-.-+++.+ T Consensus 79 ~~~~~F~~EH~H~deEiR~i~~G~g~---fdvr~~---~d~wiri~~~--~GDli~lPag~~HrFtl~~~~~i 143 (157) T pfam03079 79 EKLKKFFEEHLHTDEEIRYIVEGTGY---FDVRDK---DDVWIRVFVE--KGDLISLPAGIYHRFTTTPDNYV 143 (157) T ss_pred HHHHHHHHCCCCCCCEEEEEEEEEEE---EEEECC---CCEEEEEEEC--CCCEEEECCCCCEEEECCCCCCE T ss_conf 99988873438987789999953499---999918---9739999985--89989969998486775899879 No 33 >pfam12470 SUFU_C Suppressor of Fused Gli/Ci N terminal binding domain. This domain family is found in eukaryotes, and is typically between 192 and 219 amino acids in length. The family is found in association with pfam05076. There is a conserved HGRHFT sequence motif. This family is the C terminal domain of the Suppressor of Fused protein (Su(fu)). Su(fu) is a repressor of the Gli and Ci transcription factors of the Hedgehog signalling cascade. It functions by binding these proteins and preventing their translocation to the nucleus. The C terminal domain is only found in eukaryotic Su(fu) proteins; it is not present in bacterial homologues. The C terminal domain binds to the N terminal of Gli/Ci while the N terminal of Su(fu) binds to the C terminal of Gli/Ci. This dual binding mechanism is likely an evolutionary advancement in this signalling cascade which is not present in bacterial homologues. Probab=31.78 E-value=32 Score=16.01 Aligned_cols=76 Identities=14% Similarity=0.137 Sum_probs=49.6 Q ss_pred CCEEEECCCCCCCCCHHHHEEECHHHHHHHHHHCCCCHHHHHHEEEEEEECCCCCEEEECCCCCEEEEEECCC-C----E Q ss_conf 0000000155555440121000022222343340642133310011366236530233302322034530745-3----3 Q gi|254780919|r 50 CGTIRGLHFQRPPYAQAKLVRCIAGRIFDIAVDIRRNSPTYGCWVSLEISANNGLQIYIPTGFAHGFMTLEMN-T----E 124 (198) Q Consensus 50 kgvlRGlH~Q~~p~~Q~Klv~ci~G~I~dvvvDlR~~SpTfgk~~~~~Ls~~~~~~l~IP~G~aHGf~~L~d~-~----~ 124 (198) .-.|-|+|.-..+.+-.=|.-|++|+|. .|++++|. +.+-..-||.+++.-.|.+-+.- + = T Consensus 92 ~~~L~gihLtfn~eAg~lLpLAikgRlr------------HGRHFTFK--~~d~AITfVa~~VtGs~V~~e~PY~a~G~W 157 (201) T pfam12470 92 TRPLEGIELTLAPEAAKLLPLAIRDRIR------------HGRHFTFK--AQDMAITFVSESVTGSIVTREHPYAVLGYW 157 (201) T ss_pred CEEEEEEEEEECHHHHHHHHHHHHHHHH------------CCCCEEEE--CCCEEEEEECCCCEEEEECCCCCCEECCCE T ss_conf 5552016999655367676798641443------------06630575--387789995376147665266884202531 Q ss_pred EEEEECCCCCCHHCE Q ss_conf 897217876710170 Q gi|254780919|r 125 VIYKVTDFYSVEHDS 139 (198) Q Consensus 125 i~Y~~~~~y~p~~e~ 139 (198) ++-+.++.+.++... T Consensus 158 LQILI~~dl~~~M~~ 172 (201) T pfam12470 158 LQILIPNELVPRMLD 172 (201) T ss_pred EEEEECHHHHHHHHH T ss_conf 789814889999999 No 34 >COG4194 Predicted membrane protein [General function prediction only] Probab=29.50 E-value=37 Score=15.69 Aligned_cols=34 Identities=18% Similarity=0.371 Sum_probs=21.0 Q ss_pred CCCCHHHHHHEEEEEEECCCCCEEEECCCCCEEEEE Q ss_conf 064213331001136623653023330232203453 Q gi|254780919|r 83 IRRNSPTYGCWVSLEISANNGLQIYIPTGFAHGFMT 118 (198) Q Consensus 83 lR~~SpTfgk~~~~~Ls~~~~~~l~IP~G~aHGf~~ 118 (198) +|.+ .-|=|-.-+.-+++++ .+|||.+++|||-- T Consensus 290 m~dD-dkyWk~GiiY~n~nDp-rIfV~krfGigyTi 323 (350) T COG4194 290 MRDD-DKYWKAGIIYVNPNDP-RIFVEKRFGIGYTI 323 (350) T ss_pred CCCC-CHHHCCCEEEECCCCC-EEEEEECCCCEEEE T ss_conf 5553-1110465799879998-18887057751699 No 35 >COG1791 Uncharacterized conserved protein, contains double-stranded beta-helix domain [Function unknown] Probab=28.62 E-value=28 Score=16.35 Aligned_cols=67 Identities=19% Similarity=0.229 Sum_probs=44.7 Q ss_pred CCCCEEEECCCCCCCCCHHHHEEECHHHHHHHHHHCCCCHHHHHHEEEEEEECCCCCEEEECCCCCEEEEEECCC Q ss_conf 520000000155555440121000022222343340642133310011366236530233302322034530745 Q gi|254780919|r 48 FDCGTIRGLHFQRPPYAQAKLVRCIAGRIFDIAVDIRRNSPTYGCWVSLEISANNGLQIYIPTGFAHGFMTLEMN 122 (198) Q Consensus 48 ~~kgvlRGlH~Q~~p~~Q~Klv~ci~G~I~dvvvDlR~~SpTfgk~~~~~Ls~~~~~~l~IP~G~aHGf~~L~d~ 122 (198) .++..+|.=-+|..-+...-.-+|+.|.-+ .|+|..+ |+++.+.... +..|-||+|.-|=|-.-++. T Consensus 78 pk~del~akF~~EH~H~d~EvRy~vaG~Gi---F~v~~~d---~~~~~i~c~~--gDLI~vP~gi~HwFtlt~~~ 144 (181) T COG1791 78 PKLDELRAKFLQEHLHTDDEVRYFVAGEGI---FDVHSPD---GKVYQIRCEK--GDLISVPPGIYHWFTLTESP 144 (181) T ss_pred CCHHHHHHHHHHHHCCCCCEEEEEEECCEE---EEEECCC---CCEEEEEECC--CCEEECCCCCEEEEECCCCC T ss_conf 328999999999853678117999936558---9998799---9689999835--98795289843887736798 No 36 >pfam07385 DUF1498 Protein of unknown function (DUF1498). This family consists of several hypothetical bacterial proteins of around 225 residues in length. The function of this family is unknown. Probab=26.07 E-value=38 Score=15.59 Aligned_cols=44 Identities=23% Similarity=0.261 Sum_probs=29.0 Q ss_pred HHHHHEEEEEEECCCCCEEEECCCCCEEEEEECCCCEEEEEECCCCC Q ss_conf 33310011366236530233302322034530745338972178767 Q gi|254780919|r 88 PTYGCWVSLEISANNGLQIYIPTGFAHGFMTLEMNTEVIYKVTDFYS 134 (198) Q Consensus 88 pTfgk~~~~~Ls~~~~~~l~IP~G~aHGf~~L~d~~~i~Y~~~~~y~ 134 (198) .|+.--..+.|.+ +.++.++||.+|.|..-+ ..+++--+|..=+ T Consensus 148 ~t~~aG~~l~L~P--GeSVtL~pg~yH~Fw~e~-G~VLiGEVS~vND 191 (225) T pfam07385 148 VTMKAGGLLKLKP--GQSICLPPGLYHSFWAEG-GFVLIGEVSMVND 191 (225) T ss_pred EEECCCCEEEECC--CCEEECCCCCCEEEECCC-CCEEEEEEECCCC T ss_conf 7888999788799--993667998715675389-8889985343557 No 37 >TIGR01555 phge_rel_HI1409 phage-associated protein, HI1409 family; InterPro: IPR006445 This group describes an uncharacterised family of proteins found in prophage regions of a number of bacterial genomes, including Haemophilus influenzae, Xylella fastidiosa, Salmonella typhi, and Enterococcus faecalis. Distantly related proteins can be found in the prophage-bearing plasmids of Borrelia burgdorferi.. Probab=25.87 E-value=36 Score=15.76 Aligned_cols=36 Identities=28% Similarity=0.309 Sum_probs=25.2 Q ss_pred HHHCCCCHHHHHHEEEEEEECC-CCCEEEECCCCCEE Q ss_conf 3340642133310011366236-53023330232203 Q gi|254780919|r 80 AVDIRRNSPTYGCWVSLEISAN-NGLQIYIPTGFAHG 115 (198) Q Consensus 80 vvDlR~~SpTfgk~~~~~Ls~~-~~~~l~IP~G~aHG 115 (198) ..|-..=|||||++.+|.+... ++..+..-.-+-|. T Consensus 145 ~~D~DvfSpnyGq~~~y~i~g~s~nsrih~~~~v~~S 181 (426) T TIGR01555 145 QIDEDVFSPNYGQIESYEIAGSSNNSRIHLLERVDYS 181 (426) T ss_pred CCCCCCCCCCCCCCCEEEECCCCCCCEEEEECCCCHH T ss_conf 3146778888887405786177654123320011034 No 38 >TIGR01150 puhA photosynthetic reaction center H subunit; InterPro: IPR005652 The photosynthetic apparatus in non-oxygenic bacteria consists of light-harvesting (LH) protein-pigment complexes LH1 and LH2, which use carotenoid and bacteriochlorophyll as primary donors . LH1 acts as the energy collection hub, temporarily storing it before its transfer to the photosynthetic reaction centre (RC) . Electrons are transferred from the primary donor via an intermediate acceptor (bacteriopheophytin) to the primary acceptor (quinine Qa), and finally to the secondary acceptor (quinone Qb), resulting in the formation of ubiquinol QbH2. RC uses the excitation energy to shuffle electrons across the membrane, transferring them via ubiquinol to the cytochrome bc1 complex in order to establish a proton gradient across the membrane, which is used by ATP synthetase to form ATP , , . The core complex is anchored in the cell membrane, consisting of one unit of RC surrounded by LH1; in some species there may be additional subunits . RC consists of three subunits: L (light), M (medium), and H (heavy). Subunits L and M provide the scaffolding for the chromophore, while subunit H contains a cytoplasmic domain . In Blastochloris viridis, there is also a non-membranous tetrahaem cytochrome (4Hcyt) subunit on the periplasmic surface. This entry describes the photosynthetic reaction centre H subunit, which has a single transmembrane helix and a large cytoplasmic domain . The core of the cytoplasmic domain has a PRC-barrel structure.; GO: 0045156 electron transporter transferring electrons within the cyclic electron transport pathway of photosynthesis activity, 0006118 electron transport, 0019684 photosynthesis light reaction, 0030077 plasma membrane light-harvesting complex. Probab=25.40 E-value=56 Score=14.62 Aligned_cols=81 Identities=19% Similarity=0.236 Sum_probs=56.2 Q ss_pred EEECCCCCEEEECCCCCCCCCHHHHEEECHHHHHHHHHHCCCCHHHHHHEEEEEEECCCCCEEEECCCCCEEEEEE-CCC Q ss_conf 3312520000000155555440121000022222343340642133310011366236530233302322034530-745 Q gi|254780919|r 44 HSFSFDCGTIRGLHFQRPPYAQAKLVRCIAGRIFDIAVDIRRNSPTYGCWVSLEISANNGLQIYIPTGFAHGFMTL-EMN 122 (198) Q Consensus 44 ~S~s~~kgvlRGlH~Q~~p~~Q~Klv~ci~G~I~dvvvDlR~~SpTfgk~~~~~Ls~~~~~~l~IP~G~aHGf~~L-~d~ 122 (198) +|+|.---==|||=.+. ++. .+.|+|-|.+||. |..+-++-.++|.+..++..++|=+-..=+.-. ++. T Consensus 149 f~vaagD~dPrGLPV~~-----~d~--~~aGkv~DlWvD~---~e~~~RYLEvel~~~p~~~~LlP~~atri~~k~~s~~ 218 (264) T TIGR01150 149 FSVAAGDKDPRGLPVVA-----ADG--EVAGKVVDLWVDR---PEQLIRYLEVELAAGPARKKLLPLAATRIMVKVKSDR 218 (264) T ss_pred CEECCCCCCCCCCCEEC-----CCE--EEEEEEEEEEEEC---HHHCEEEEEEEEECCCCCCEECCCCCEEEEEEEECCE T ss_conf 52502677887275344-----880--4743798876401---5740154468750788751202411126777876187 Q ss_pred CEEEEEECCCCC Q ss_conf 338972178767 Q gi|254780919|r 123 TEVIYKVTDFYS 134 (198) Q Consensus 123 ~~i~Y~~~~~y~ 134 (198) ..|.-..++.|. T Consensus 219 v~V~si~s~~Fa 230 (264) T TIGR01150 219 VTVNSILSDLFA 230 (264) T ss_pred EEEEEEEHHHHC T ss_conf 899886034414 No 39 >KOG2435 consensus Probab=24.85 E-value=56 Score=14.63 Aligned_cols=66 Identities=21% Similarity=0.185 Sum_probs=40.4 Q ss_pred CCEEEECCCEEEEEECHHHHHHCCCCCCEEEEEEEECCCCCEEEECCCCCCCCCHHHHEEECHHHHHHHHHHCC Q ss_conf 76220379504211088989867998741344233125200000001555554401210000222223433406 Q gi|254780919|r 11 TRKFEDSRGWFSQTYSSKLLKELGLQDVFVQDNHSFSFDCGTIRGLHFQRPPYAQAKLVRCIAGRIFDIAVDIR 84 (198) Q Consensus 11 ~~~f~D~RG~f~e~f~~~~~~~~~~~~~~~Q~n~S~s~~kgvlRGlH~Q~~p~~Q~Klv~ci~G~I~dvvvDlR 84 (198) +++-.|-|-+...++-..++.. ..|-|.++--.|=-|-|+|...-+=+|+..-.+|+|.|.-.-+- T Consensus 206 LrvRGDGRsy~inihte~~~dq--------~wndsys~flft~gGp~wq~~KIPfSKff~t~kGriqDrq~e~n 271 (323) T KOG2435 206 LRVRGDGRSYMINIHTETDFDQ--------RWNDSYSYFLFTRGGPYWQEVKIPFSKFFFTNKGRIQDRQHELN 271 (323) T ss_pred EEEECCCCEEEEEECCCCCHHH--------HCCCCEEEEEECCCCCCEEEEECCHHHHEECCCCCEEECCCCCC T ss_conf 9970598358999647550444--------40662036886279984068855335400145663462103358 No 40 >PRK04231 rpl3p 50S ribosomal protein L3P; Reviewed Probab=24.67 E-value=29 Score=16.30 Aligned_cols=47 Identities=28% Similarity=0.267 Sum_probs=32.7 Q ss_pred CHHHHHHHHHHCCCCHHHHHHEEEEEEECCCCCEEEECCCCCEEEEEECCCC Q ss_conf 0222223433406421333100113662365302333023220345307453 Q gi|254780919|r 72 IAGRIFDIAVDIRRNSPTYGCWVSLEISANNGLQIYIPTGFAHGFMTLEMNT 123 (198) Q Consensus 72 i~G~I~dvvvDlR~~SpTfgk~~~~~Ls~~~~~~l~IP~G~aHGf~~L~d~~ 123 (198) =.|..-.+.+|.|++|+|.|+-...-. +.|=.||...-|+-.-.++. T Consensus 43 KAGMTHvv~~dd~p~S~~~gkEi~~pV-----TViEtPPm~V~gvr~Y~~tp 89 (333) T PRK04231 43 KAGMTHVIMIDDRPNSPTEGKEIFVPV-----TVLETPPLRVAGIRAYEKDP 89 (333) T ss_pred CCCCEEEEEECCCCCCCCCCCEEEEEE-----EEEECCCEEEEEEEEEEECC T ss_conf 257759999648899976893788766-----77506986999999988879 No 41 >PRK13503 transcriptional activator RhaS; Provisional Probab=23.05 E-value=63 Score=14.35 Aligned_cols=51 Identities=16% Similarity=0.168 Sum_probs=33.9 Q ss_pred ECCCCCCCCCHHHHEEECHHHHHHHHHHCCCCHHHHHHEEEEEEECCCCCEEEECCCCCEEEEEECCC Q ss_conf 00155555440121000022222343340642133310011366236530233302322034530745 Q gi|254780919|r 55 GLHFQRPPYAQAKLVRCIAGRIFDIAVDIRRNSPTYGCWVSLEISANNGLQIYIPTGFAHGFMTLEMN 122 (198) Q Consensus 55 GlH~Q~~p~~Q~Klv~ci~G~I~dvvvDlR~~SpTfgk~~~~~Ls~~~~~~l~IP~G~aHGf~~L~d~ 122 (198) -.|++ ..-.++.|.+|+.... +| | ..+.|.+ +..++||||..|.|...++. T Consensus 29 P~H~H----~~~Ei~~v~~G~~~~~-i~--------g--~~~~l~~--Gdl~li~p~~~H~~~~~~~~ 79 (278) T PRK13503 29 PEHHH----DFHEIVIVEHGTGIHV-FN--------G--QPYTISG--GTVCFVRDHDRHLYEHTDNL 79 (278) T ss_pred CCEEC----CCEEEEEEEECEEEEE-EC--------C--EEEEECC--CEEEEECCCCEEEEEECCCC T ss_conf 99876----8689999980349999-99--------9--9999749--99999999983888855885 No 42 >pfam01050 MannoseP_isomer Mannose-6-phosphate isomerase. All of the members of this Pfam entry belong to family 2 of the mannose-6-phosphate isomerases. The type II phosphomannose isomerases are bifunctional enzymes. This Pfam entry covers the isomerase domain. The guanosine diphospho-D-mannose pyrophosphorylase domain is in another Pfam entry, see pfam00483. Probab=22.15 E-value=65 Score=14.25 Aligned_cols=66 Identities=21% Similarity=0.203 Sum_probs=39.9 Q ss_pred HHHEEECHHHHHHHHHHCCCCHHHHHHEEEEEEECCCCCEEEECCCCCEEEEEECC-CCEEEEEECCCCCCHHCEECCCC Q ss_conf 12100002222234334064213331001136623653023330232203453074-53389721787671017021688 Q gi|254780919|r 66 AKLVRCIAGRIFDIAVDIRRNSPTYGCWVSLEISANNGLQIYIPTGFAHGFMTLEM-NTEVIYKVTDFYSVEHDSGVAWQ 144 (198) Q Consensus 66 ~Klv~ci~G~I~dvvvDlR~~SpTfgk~~~~~Ls~~~~~~l~IP~G~aHGf~~L~d-~~~i~Y~~~~~y~p~~e~~i~~~ 144 (198) +-.+++++|..... +| ++ .+.+. .+.+++||+|..|-.....+ +..++=-..+.|--+++. +++. T Consensus 85 sE~W~Vv~G~a~V~-~~--------~~--~~~~~--~gdsi~Ip~g~~Hri~N~~~~~l~iIEVQ~G~~l~EdDI-~R~e 150 (151) T pfam01050 85 AEHWVVVSGTARVT-RG--------EE--VFLLT--ENESTYIPLGAIHRLENPGKIPLELIEVQSGSYLGEDDI-VRFE 150 (151) T ss_pred EEEEEEECCEEEEE-EC--------CE--EEEEC--CCCEEEECCCCCEECCCCCCCCEEEEEEECCCCCCCCCE-EECC T ss_conf 77999984889999-89--------99--99954--898899879992787379998889999860898780465-8746 Q ss_pred C Q ss_conf 8 Q gi|254780919|r 145 D 145 (198) Q Consensus 145 D 145 (198) | T Consensus 151 D 151 (151) T pfam01050 151 D 151 (151) T ss_pred C T ss_conf 9 No 43 >COG4388 Mu-like prophage I protein [General function prediction only] Probab=21.22 E-value=68 Score=14.14 Aligned_cols=14 Identities=29% Similarity=0.608 Sum_probs=7.6 Q ss_pred CCEEEECCCEEEEE Q ss_conf 76220379504211 Q gi|254780919|r 11 TRKFEDSRGWFSQT 24 (198) Q Consensus 11 ~~~f~D~RG~f~e~ 24 (198) .-.|.|+||.+.|. T Consensus 93 ~~~f~derGl~~e~ 106 (357) T COG4388 93 KYVFDDERGLMGEV 106 (357) T ss_pred EEEECCCCCCEEEC T ss_conf 06764566735413 No 44 >TIGR02461 osmo_MPG_phos mannosyl-3-phosphoglycerate phosphatase; InterPro: IPR012813 Members of this family are mannosyl-3-phosphoglycerate phosphatase (3.1.3.70 from EC). It acts sequentially after mannosyl-3-phosphoglycerate synthase (2.4.1.217 from EC) in a two-step pathway of biosynthesis of the compatible solute mannosylglycerate, a typical osmolyte of thermophiles.; GO: 0050531 mannosyl-3-phosphoglycerate phosphatase activity, 0051479 mannosylglycerate biosynthetic process, 0005737 cytoplasm. Probab=20.30 E-value=38 Score=15.61 Aligned_cols=13 Identities=54% Similarity=1.007 Sum_probs=10.2 Q ss_pred CCCCEEEECCCCC Q ss_conf 6530233302322 Q gi|254780919|r 101 NNGLQIYIPTGFA 113 (198) Q Consensus 101 ~~~~~l~IP~G~a 113 (198) +||-+||||+|+- T Consensus 65 ENGgaIyiPrgyf 77 (248) T TIGR02461 65 ENGGAIYIPRGYF 77 (248) T ss_pred ECCCEEECCCCCC T ss_conf 2273320477761 Done!