Query 537021.9.peg.816_1 Match_columns 97 No_of_seqs 103 out of 1329 Neff 5.7 Searched_HMMs 39220 Date Wed May 25 00:31:29 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i peg_816.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 TIGR00608 radc DNA repair prot 100.0 8.2E-43 0 273.5 9.8 97 1-97 125-223 (223) 2 PRK00024 radC DNA repair prote 100.0 4.6E-40 1.1E-44 257.3 12.8 97 1-97 128-224 (224) 3 pfam04002 DUF2466 Protein of u 100.0 1.9E-39 4.9E-44 253.7 12.7 97 1-97 27-123 (123) 4 COG2003 RadC DNA repair protei 100.0 2.4E-39 6E-44 253.1 9.2 97 1-97 128-224 (224) 5 COG1310 Predicted metal-depend 92.4 0.15 3.8E-06 30.5 3.4 42 20-61 48-91 (134) 6 COG1212 KdsB CMP-2-keto-3-deox 82.0 3 7.7E-05 22.8 4.7 54 20-85 43-98 (247) 7 TIGR01736 FGAM_synth_II phosph 74.5 1 2.6E-05 25.7 0.4 70 16-94 469-551 (763) 8 PRK12738 kbaY tagatose-bisphos 72.9 9.2 0.00024 20.0 5.8 49 27-80 86-134 (286) 9 PRK05835 fructose-bisphosphate 68.3 12 0.0003 19.4 6.0 49 28-81 87-135 (307) 10 PRK07709 fructose-bisphosphate 63.8 14 0.00036 18.9 5.9 48 28-80 90-137 (285) 11 PRK12737 gatY tagatose-bisphos 62.3 15 0.00039 18.7 5.8 48 28-80 87-134 (284) 12 cd00947 TBP_aldolase_IIB Tagat 58.0 18 0.00046 18.3 5.8 48 28-80 82-129 (276) 13 PRK05764 aspartate aminotransf 56.3 17 0.00043 18.4 3.8 63 25-92 152-220 (389) 14 PRK13399 fructose-1,6-bisphosp 55.6 20 0.00051 18.0 5.8 49 28-81 88-143 (347) 15 PRK07143 hypothetical protein; 54.0 21 0.00054 17.9 4.9 69 10-78 15-83 (277) 16 PRK06806 fructose-bisphosphate 53.8 21 0.00055 17.9 6.0 49 28-81 87-135 (281) 17 TIGR01481 ccpA catabolite cont 53.2 19 0.00047 18.2 3.5 61 27-93 165-225 (332) 18 smart00232 JAB_MPN JAB/MPN dom 52.8 22 0.00057 17.8 4.4 50 22-71 56-108 (135) 19 PRK08610 fructose-bisphosphate 52.8 22 0.00057 17.8 5.9 49 28-81 90-138 (286) 20 PRK07084 fructose-bisphosphate 51.8 23 0.00059 17.7 5.9 49 28-81 98-146 (321) 21 PRK06801 hypothetical protein; 51.6 23 0.00059 17.6 5.9 49 28-81 87-135 (286) 22 PRK11178 uridine phosphorylase 51.3 15 0.00038 18.8 2.8 39 51-89 123-161 (251) 23 pfam11697 DUF3293 Protein of u 51.2 21 0.00053 17.9 3.5 51 43-96 1-51 (73) 24 COG2820 Udp Uridine phosphoryl 48.7 26 0.00066 17.4 3.9 40 50-89 122-161 (248) 25 PRK07998 gatY putative fructos 48.0 27 0.00068 17.3 6.0 49 28-81 87-135 (283) 26 COG0191 Fba Fructose/tagatose 47.6 27 0.00069 17.3 5.6 49 28-81 88-136 (286) 27 TIGR01241 FtsH_fam ATP-depende 47.3 16 0.00042 18.5 2.5 30 11-45 127-157 (505) 28 TIGR01361 DAHP_synth_Bsub phos 47.2 26 0.00066 17.4 3.5 45 29-73 214-259 (262) 29 pfam01116 F_bP_aldolase Fructo 45.9 29 0.00073 17.1 5.9 48 28-80 85-132 (283) 30 PRK07315 fructose-bisphosphate 44.2 30 0.00078 17.0 5.8 49 27-81 89-137 (293) 31 PRK05450 3-deoxy-manno-octulos 41.5 34 0.00085 16.7 3.5 46 26-82 51-96 (248) 32 PRK08185 hypothetical protein; 41.3 34 0.00086 16.7 5.9 48 28-80 81-128 (283) 33 COG2876 AroA 3-deoxy-D-arabino 41.1 34 0.00087 16.7 3.5 33 29-61 233-266 (286) 34 cd01989 STK_N The N-terminal d 40.8 34 0.00088 16.6 8.3 58 27-84 12-88 (146) 35 cd06403 PB1_Par6 The PB1 domai 40.7 31 0.0008 16.9 3.1 47 29-75 29-75 (80) 36 PRK09195 gatY tagatose-bisphos 40.1 35 0.0009 16.6 5.7 48 28-80 87-134 (284) 37 PRK12857 putative aldolase; Re 39.7 36 0.00091 16.5 5.8 48 28-80 87-134 (284) 38 TIGR02256 ICE_VC0181 conserved 39.5 15 0.00037 18.8 1.2 43 45-87 80-123 (143) 39 pfam09863 DUF2090 Uncharacteri 36.4 40 0.001 16.2 4.7 46 42-87 142-187 (310) 40 cd02203 PurL_repeat1 PurL subu 36.4 11 0.00029 19.5 0.3 16 7-22 210-225 (313) 41 cd02517 CMP-KDO-Synthetase CMP 36.2 35 0.00088 16.6 2.7 45 26-82 50-94 (239) 42 pfam05853 DUF849 Prokaryotic p 36.1 41 0.001 16.2 3.9 49 22-70 20-73 (274) 43 PRK09196 fructose-1,6-bisphosp 33.8 45 0.0011 16.0 5.7 49 28-81 88-143 (347) 44 TIGR01296 asd_B aspartate-semi 33.0 40 0.001 16.3 2.6 33 25-60 268-300 (350) 45 cd04819 PA_2 PA_2: Protease-as 32.4 47 0.0012 15.8 4.9 28 25-53 57-84 (127) 46 PRK05819 deoD purine nucleosid 32.3 47 0.0012 15.8 3.1 39 52-90 123-161 (235) 47 pfam01398 Mov34 Mov34/MPN/PAD- 30.9 42 0.0011 16.1 2.4 31 39-69 84-114 (117) 48 PRK13374 purine nucleoside pho 30.6 48 0.0012 15.8 2.7 40 52-91 124-163 (233) 49 TIGR01162 purE phosphoribosyla 30.4 38 0.00097 16.4 2.2 28 54-81 5-32 (159) 50 TIGR01718 Uridine-psphlse urid 30.3 51 0.0013 15.6 3.0 40 54-93 123-162 (248) 51 TIGR00107 deoD purine nucleosi 28.8 52 0.0013 15.6 2.6 49 44-94 116-164 (234) 52 PRK01213 phosphoribosylformylg 27.7 19 0.00049 18.1 0.3 56 20-77 460-528 (736) 53 cd02039 cytidylyltransferase_l 25.2 63 0.0016 15.1 2.5 38 15-52 2-41 (141) 54 pfam06574 FAD_syn FAD syntheta 24.9 64 0.0016 15.0 3.5 65 13-77 8-79 (158) 55 PRK05627 bifunctional riboflav 24.5 65 0.0017 15.0 2.8 68 10-77 10-84 (303) 56 PRK13600 putative ribosomal pr 24.0 67 0.0017 14.9 3.7 57 12-78 3-59 (84) 57 TIGR03029 EpsG chain length de 24.0 67 0.0017 14.9 2.8 38 46-84 182-219 (274) 58 PRK08637 hypothetical protein; 23.9 67 0.0017 14.9 4.1 35 25-61 130-169 (386) 59 PRK13601 putative ribosomal pr 23.1 70 0.0018 14.8 5.6 50 20-79 7-56 (84) 60 cd03426 CoAse Coenzyme A pyrop 22.2 73 0.0019 14.7 3.0 41 49-90 37-78 (157) 61 cd04821 PA_M28_1_2 PA_M28_1_2: 20.8 77 0.002 14.6 2.4 21 30-50 84-104 (157) No 1 >TIGR00608 radc DNA repair protein RadC; InterPro: IPR001405 RadC is a DNA repair protein found in many bacteria. They share a region of similarity at the C-terminus that contains two perfectly conserved histidine residues.; GO: 0006281 DNA repair. Probab=100.00 E-value=8.2e-43 Score=273.46 Aligned_cols=97 Identities=47% Similarity=0.761 Sum_probs=96.0 Q ss_pred CCCCCCCCEEEEEEEEECCCCEEEECHHHHHHHHHHHCCCEEEEEEECCCC--CCCCCHHHHHHHHHHHHHHHHCCCEEE Q ss_conf 951568877348998540463589329999999997319689999556898--988147789999999999976498899 Q 537021.9.peg.8 1 LFLDKHNILIADEVQSRGTIDHVPVYIREIVQRCLELSATSIILVHNHPSG--NPNPSDADINMTQNIITTLNPLNIIVH 78 (97) Q Consensus 1 l~Ld~~~~li~~e~i~~Gt~~~~~v~~r~i~~~al~~~A~~vilaHNHPsG--~~~PS~~Di~~T~~l~~a~~~l~i~ll 78 (97) ||||+|||+|.++++|.||+|.+.|+|||||+.|+..+|+++|+||||||| +|+||++|+.+|++|.+||+.|||+++ T Consensus 125 lfLD~~nrli~~~~~~~Gt~~~v~v~PREi~keA~k~~A~alI~aHNHPSGGl~p~PS~~D~~~T~~l~~aae~l~i~lL 204 (223) T TIGR00608 125 LFLDLKNRLIAKEVVFIGTVNHVLVHPREIIKEALKLSASALILAHNHPSGGLEPSPSKEDILITERLLKAAELLGIELL 204 (223) T ss_pred EEECCCCCEECCEEEEEEEEEEEEECCCHHHHHHHHHCCCCEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEE T ss_conf 87168541010024565412422366507877887617430255653755298998687779999999988866286241 Q ss_pred EEEEECCCCEEEHHHCCCC Q ss_conf 7999859959974525899 Q 537021.9.peg.8 79 DHIIIGKDAFVSFKGLRII 97 (97) Q Consensus 79 DHiIv~~~~~~S~re~gll 97 (97) ||+|||+++|+||||+|+| T Consensus 205 DHLviG~g~~~Sfre~g~L 223 (223) T TIGR00608 205 DHLVIGKGKYVSFREEGLL 223 (223) T ss_pred EEEEECCCCEECHHCCCCC T ss_conf 1778538741214500689 No 2 >PRK00024 radC DNA repair protein RadC; Reviewed Probab=100.00 E-value=4.6e-40 Score=257.27 Aligned_cols=97 Identities=47% Similarity=0.696 Sum_probs=96.0 Q ss_pred CCCCCCCCEEEEEEEEECCCCEEEECHHHHHHHHHHHCCCEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEE Q ss_conf 95156887734899854046358932999999999731968999955689898814778999999999997649889979 Q 537021.9.peg.8 1 LFLDKHNILIADEVQSRGTIDHVPVYIREIVQRCLELSATSIILVHNHPSGNPNPSDADINMTQNIITTLNPLNIIVHDH 80 (97) Q Consensus 1 l~Ld~~~~li~~e~i~~Gt~~~~~v~~r~i~~~al~~~A~~vilaHNHPsG~~~PS~~Di~~T~~l~~a~~~l~i~llDH 80 (97) +|||++|++|+++.+++||+|++.|+||+||+.||..+|+++|++||||||+++||++|+++|++|+++++.+||+|+|| T Consensus 128 l~Ld~~n~li~~~~i~~Gt~~~~~v~pReI~~~Al~~~A~~iIl~HNHPSG~~~PS~~Di~~T~~l~~a~~~lgI~llDH 207 (224) T PRK00024 128 LFLDTKNRLIADEELFIGTLNSSIVHPREIVKEALKLNAAALILAHNHPSGDPEPSQADILITKRLKEAGELLGIRLLDH 207 (224) T ss_pred EEECCCCCEEEEEEEECCCCCEEEECHHHHHHHHHHHCCCEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEE T ss_conf 99878777556899704788646762899999999827887999954699999989889999999999997689868618 Q ss_pred EEECCCCEEEHHHCCCC Q ss_conf 99859959974525899 Q 537021.9.peg.8 81 IIIGKDAFVSFKGLRII 97 (97) Q Consensus 81 iIv~~~~~~S~re~gll 97 (97) ||||+++||||||+|+| T Consensus 208 iIv~~~~~~S~~e~Gll 224 (224) T PRK00024 208 IIIGDGEYVSFAERGLL 224 (224) T ss_pred EEEECCEEEEHHHCCCC T ss_conf 99949909987735899 No 3 >pfam04002 DUF2466 Protein of unknown function (DUF2466). This family was named initially with reference to the E. coli radC102 mutation which suggested that RadC was involved in repair of DNA lesions. However the relevant mutation has subsequently been shown to be in recG, where radC is in fact an allele of recG. In addition, a personal communication from Claverys, J-P, et al, indicates a total failure of all attempts to characterize a radiation-related function for RadC in Streptococcus pneumoniae, suggesting that it is not involved in repair of DNA lesions, in recombination during transformation, in gene conversion, nor in mismatch repair. Probab=100.00 E-value=1.9e-39 Score=253.65 Aligned_cols=97 Identities=45% Similarity=0.711 Sum_probs=95.8 Q ss_pred CCCCCCCCEEEEEEEEECCCCEEEECHHHHHHHHHHHCCCEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEE Q ss_conf 95156887734899854046358932999999999731968999955689898814778999999999997649889979 Q 537021.9.peg.8 1 LFLDKHNILIADEVQSRGTIDHVPVYIREIVQRCLELSATSIILVHNHPSGNPNPSDADINMTQNIITTLNPLNIIVHDH 80 (97) Q Consensus 1 l~Ld~~~~li~~e~i~~Gt~~~~~v~~r~i~~~al~~~A~~vilaHNHPsG~~~PS~~Di~~T~~l~~a~~~l~i~llDH 80 (97) +|||++|++++++.+++||++++.|+||+||++||..+|+++|++||||||+++||++|+++|++|+++++++||+|+|| T Consensus 27 l~Ld~~~~li~~~~i~~Gt~~~~~v~pReI~~~Al~~~A~~iIl~HNHPSG~~~PS~~D~~~T~~l~~a~~~l~I~llDH 106 (123) T pfam04002 27 LYLDTQNRLIAVEEVSEGTVNSASVHPREIFKRALRLNAAAVILAHNHPSGDPTPSQADIELTKRLKEAGELLGIRLLDH 106 (123) T ss_pred EEECCCCCEEEEEEECCCCCCEEEECHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEE T ss_conf 99879998898999735788467887999999999838986999831899989958879999999999997689879738 Q ss_pred EEECCCCEEEHHHCCCC Q ss_conf 99859959974525899 Q 537021.9.peg.8 81 IIIGKDAFVSFKGLRII 97 (97) Q Consensus 81 iIv~~~~~~S~re~gll 97 (97) +|||+++||||||+|+| T Consensus 107 iIv~~~~~~S~~e~g~l 123 (123) T pfam04002 107 IIIGDGGYFSFAEEGLL 123 (123) T ss_pred EEEECCCEEEHHHCCCC T ss_conf 99969939985534789 No 4 >COG2003 RadC DNA repair proteins [DNA replication, recombination, and repair] Probab=100.00 E-value=2.4e-39 Score=253.07 Aligned_cols=97 Identities=42% Similarity=0.651 Sum_probs=95.9 Q ss_pred CCCCCCCCEEEEEEEEECCCCEEEECHHHHHHHHHHHCCCEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEE Q ss_conf 95156887734899854046358932999999999731968999955689898814778999999999997649889979 Q 537021.9.peg.8 1 LFLDKHNILIADEVQSRGTIDHVPVYIREIVQRCLELSATSIILVHNHPSGNPNPSDADINMTQNIITTLNPLNIIVHDH 80 (97) Q Consensus 1 l~Ld~~~~li~~e~i~~Gt~~~~~v~~r~i~~~al~~~A~~vilaHNHPsG~~~PS~~Di~~T~~l~~a~~~l~i~llDH 80 (97) ||||+||++|+++.+|.||+|++.|+||||+++|+..+|+++|++||||||+|+||++|+.+|++|+++|+.+||+++|| T Consensus 128 L~Ld~qnrlI~~e~lf~GTi~~s~V~PREI~k~Al~~nAaavIlaHNHPSGd~~PS~aD~~iT~rl~~a~~ll~I~vLDH 207 (224) T COG2003 128 LYLDSQNRLIATETLFIGTLNVSEVHPREIFKEALKYNAAAVILAHNHPSGDPTPSRADILITERLKEAGKLLGIRLLDH 207 (224) T ss_pred HHHCCCCCEEEEEEEEEEECCCCEECHHHHHHHHHHHCCHHHHEECCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEE T ss_conf 99547684331003775201450433799999999822135530015899999857788999999999987429331004 Q ss_pred EEECCCCEEEHHHCCCC Q ss_conf 99859959974525899 Q 537021.9.peg.8 81 IIIGKDAFVSFKGLRII 97 (97) Q Consensus 81 iIv~~~~~~S~re~gll 97 (97) +|||+++|+||||+|+| T Consensus 208 iIig~~~y~Sf~E~gll 224 (224) T COG2003 208 IIIGEGLYISFAERGLL 224 (224) T ss_pred EEECCCCEEEHHHHCCC T ss_conf 78648858747761269 No 5 >COG1310 Predicted metal-dependent protease of the PAD1/JAB1 superfamily [General function prediction only] Probab=92.41 E-value=0.15 Score=30.54 Aligned_cols=42 Identities=21% Similarity=0.335 Sum_probs=27.1 Q ss_pred CCEEEECHHH--HHHHHHHHCCCEEEEEEECCCCCCCCCHHHHH Q ss_conf 6358932999--99999973196899995568989881477899 Q 537021.9.peg.8 20 IDHVPVYIRE--IVQRCLELSATSIILVHNHPSGNPNPSDADIN 61 (97) Q Consensus 20 ~~~~~v~~r~--i~~~al~~~A~~vilaHNHPsG~~~PS~~Di~ 61 (97) ......++.. ....+...+-.-+...|-||+|.+.||++|++ T Consensus 48 ~~~~~~~~~~~~~~~~~~~~g~~vvg~yHSHP~~~~~pS~~D~~ 91 (134) T COG1310 48 VEYFEIDPEYSLFYLAAEDAGEVVVGWYHSHPGGPPYPSEADRR 91 (134) T ss_pred CCCEECCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCHHHHH T ss_conf 44136382888899985237984999981798998885987883 No 6 >COG1212 KdsB CMP-2-keto-3-deoxyoctulosonic acid synthetase [Cell envelope biogenesis, outer membrane] Probab=82.05 E-value=3 Score=22.85 Aligned_cols=54 Identities=22% Similarity=0.369 Sum_probs=42.3 Q ss_pred CCEEEE--CHHHHHHHHHHHCCCEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEECC Q ss_conf 635893--299999999973196899995568989881477899999999999764988997999859 Q 537021.9.peg.8 20 IDHVPV--YIREIVQRCLELSATSIILVHNHPSGNPNPSDADINMTQNIITTLNPLNIIVHDHIIIGK 85 (97) Q Consensus 20 ~~~~~v--~~r~i~~~al~~~A~~vilaHNHPsG~~~PS~~Di~~T~~l~~a~~~l~i~llDHiIv~~ 85 (97) ++++.| |--+|+..+-..+...+.-.-||||| |.|+.++++.++.. -|++||.- T Consensus 43 ~~rvvVATDde~I~~av~~~G~~avmT~~~h~SG-----------TdR~~Ev~~~l~~~-~~~iIVNv 98 (247) T COG1212 43 ADRVVVATDDERIAEAVQAFGGEAVMTSKDHQSG-----------TDRLAEVVEKLGLP-DDEIIVNV 98 (247) T ss_pred CCEEEEECCCHHHHHHHHHHCCEEEECCCCCCCC-----------CHHHHHHHHHCCCC-CCEEEEEC T ss_conf 8728997498899999997097899617888980-----------08999999965999-64299986 No 7 >TIGR01736 FGAM_synth_II phosphoribosylformylglycinamidine synthase II; InterPro: IPR010074 Phosphoribosylformylglycinamidine synthase is a single, long polypeptide in most bacteria. Three proteins are required in Bacillus subtilis and many other species. This is the longest of the three and is designated PurL, phosphoribosylformylglycinamidine synthase II, or FGAM synthase II.; GO: 0004642 phosphoribosylformylglycinamidine synthase activity, 0006189 'de novo' IMP biosynthetic process, 0005737 cytoplasm. Probab=74.51 E-value=1 Score=25.67 Aligned_cols=70 Identities=17% Similarity=0.149 Sum_probs=49.1 Q ss_pred EECCCCEEEECHH--------HHHHHHHHHCCCEEEEEEECCCCCCCCCHHHH-----HHHHHHHHHHHHCCCEEEEEEE Q ss_conf 5404635893299--------99999997319689999556898988147789-----9999999999764988997999 Q 537021.9.peg.8 16 SRGTIDHVPVYIR--------EIVQRCLELSATSIILVHNHPSGNPNPSDADI-----NMTQNIITTLNPLNIIVHDHII 82 (97) Q Consensus 16 ~~Gt~~~~~v~~r--------~i~~~al~~~A~~vilaHNHPsG~~~PS~~Di-----~~T~~l~~a~~~l~i~llDHiI 82 (97) +-+....|.++|+ |-.|-....+|.-+.+.=|=--|||+ ++++ +.++-|.+||+.|+++.+ T Consensus 469 ~D~NPr~~~~dPy~Ga~~~vaEa~RNLa~vGA~PLa~vDcLNFGNPE--rPEvywQf~~~v~Gl~daa~~l~~PVV---- 542 (763) T TIGR01736 469 ADCNPRYVYLDPYAGAAAAVAEAYRNLAAVGAEPLAIVDCLNFGNPE--RPEVYWQFVEAVKGLGDAARALGTPVV---- 542 (763) T ss_pred ECCCCCCEEECHHHHHHHHHHHHHHCEEEECCEEEEEEECCCCCCCC--CCHHHHHHHHHHHHHHHHHHHCCCCEE---- T ss_conf 07876603316035588977743112564087548888405689888--703678999999999999986388988---- Q ss_pred ECCCCEEEHHHC Q ss_conf 859959974525 Q 537021.9.peg.8 83 IGKDAFVSFKGL 94 (97) Q Consensus 83 v~~~~~~S~re~ 94 (97) + |=+||..+ T Consensus 543 -g--GNVSLYNE 551 (763) T TIGR01736 543 -G--GNVSLYNE 551 (763) T ss_pred -E--CCEECCCC T ss_conf -2--11000231 No 8 >PRK12738 kbaY tagatose-bisphosphate aldolase; Reviewed Probab=72.88 E-value=9.2 Score=20.01 Aligned_cols=49 Identities=10% Similarity=0.238 Sum_probs=42.8 Q ss_pred HHHHHHHHHHHCCCEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEE Q ss_conf 999999999731968999955689898814778999999999997649889979 Q 537021.9.peg.8 27 IREIVQRCLELSATSIILVHNHPSGNPNPSDADINMTQNIITTLNPLNIIVHDH 80 (97) Q Consensus 27 ~r~i~~~al~~~A~~vilaHNHPsG~~~PS~~Di~~T~~l~~a~~~l~i~llDH 80 (97) --+.+++|+..+-+++.+ .|+..|-++.++.|+++.+.+...|+.+--- T Consensus 86 ~~e~i~~ai~~GftSVM~-----DgS~lp~eeNi~~Tk~vv~~Ah~~gv~VEaE 134 (286) T PRK12738 86 SLDDIRRKVHAGVRSAMI-----DGSHFPFAENVKLVKSVVDFCHSQDCSVEAE 134 (286) T ss_pred CHHHHHHHHHCCCCEEEE-----ECCCCCHHHHHHHHHHHHHHHHHCCCEEEEE T ss_conf 999999999779987987-----3899999999999999999984739978886 No 9 >PRK05835 fructose-bisphosphate aldolase; Provisional Probab=68.29 E-value=12 Score=19.40 Aligned_cols=49 Identities=10% Similarity=0.293 Sum_probs=43.0 Q ss_pred HHHHHHHHHHCCCEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEE Q ss_conf 999999997319689999556898988147789999999999976498899799 Q 537021.9.peg.8 28 REIVQRCLELSATSIILVHNHPSGNPNPSDADINMTQNIITTLNPLNIIVHDHI 81 (97) Q Consensus 28 r~i~~~al~~~A~~vilaHNHPsG~~~PS~~Di~~T~~l~~a~~~l~i~llDHi 81 (97) -+.++.|+..+-+++.+ -|+..|-++.+..|+++.+.+...|+.+---+ T Consensus 87 ~e~~~~ai~~GFsSVM~-----DgS~l~~eeNI~~tk~vve~ah~~gv~VEaEl 135 (307) T PRK05835 87 FESCEKAVKAGFTSVMI-----DASHHAFEENLELTSKVVKMAHNAGVSVEAEL 135 (307) T ss_pred HHHHHHHHHCCCCEEEE-----CCCCCCHHHHHHHHHHHHHHHHHCCCEEEEEE T ss_conf 99999999819987995-----19989999999999999999987098699974 No 10 >PRK07709 fructose-bisphosphate aldolase; Provisional Probab=63.82 E-value=14 Score=18.88 Aligned_cols=48 Identities=15% Similarity=0.331 Sum_probs=42.0 Q ss_pred HHHHHHHHHHCCCEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEE Q ss_conf 99999999731968999955689898814778999999999997649889979 Q 537021.9.peg.8 28 REIVQRCLELSATSIILVHNHPSGNPNPSDADINMTQNIITTLNPLNIIVHDH 80 (97) Q Consensus 28 r~i~~~al~~~A~~vilaHNHPsG~~~PS~~Di~~T~~l~~a~~~l~i~llDH 80 (97) -+..++|+..+-+++.+ -|+..|-++.++.|+++.+.+...|+.+--- T Consensus 90 ~e~i~~ai~~Gf~SVM~-----D~S~l~~eeNi~~Tk~vv~~Ah~~gv~VEaE 137 (285) T PRK07709 90 FEKCKEAIDAGFTSVMI-----DASHHPFEENVETTKKVVEYAHARNVSVEAE 137 (285) T ss_pred HHHHHHHHHCCCCEEEE-----CCCCCCHHHHHHHHHHHHHHHHHCCCEEEEE T ss_conf 99999999729977985-----2998999999999999999998749839997 No 11 >PRK12737 gatY tagatose-bisphosphate aldolase; Reviewed Probab=62.30 E-value=15 Score=18.72 Aligned_cols=48 Identities=8% Similarity=0.234 Sum_probs=41.5 Q ss_pred HHHHHHHHHHCCCEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEE Q ss_conf 99999999731968999955689898814778999999999997649889979 Q 537021.9.peg.8 28 REIVQRCLELSATSIILVHNHPSGNPNPSDADINMTQNIITTLNPLNIIVHDH 80 (97) Q Consensus 28 r~i~~~al~~~A~~vilaHNHPsG~~~PS~~Di~~T~~l~~a~~~l~i~llDH 80 (97) -+.+++|++.+-+++.+ -|+..|-++.+..|+++.+.+...|+.+--- T Consensus 87 ~e~~~~ai~~GftSVM~-----DgS~lp~eeNi~~T~~vv~~Ah~~gv~VEaE 134 (284) T PRK12737 87 LDDIKKKVRAGIRSVMI-----DGSHLSFEENIAIVKEVVEFCHRYDASVEAE 134 (284) T ss_pred HHHHHHHHHCCCCEEEE-----ECCCCCHHHHHHHHHHHHHHHHHCCCEEEEE T ss_conf 99999999819987987-----0999999999999999999864528869996 No 12 >cd00947 TBP_aldolase_IIB Tagatose-1,6-bisphosphate (TBP) aldolase and related Type B Class II aldolases. TBP aldolase is a tetrameric class II aldolase that catalyzes the reversible condensation of dihydroxyacetone phosphate with glyceraldehyde 3-phsophate to produce tagatose 1,6-bisphosphate. There is an absolute requirement for a divalent metal ion, usually zinc, and in addition the enzymes are activated by monovalent cations such as Na+. The type A and type B Class II FBPA's differ in the presence and absence of distinct indels in the sequence that result in differing loop lengths in the structures. Probab=57.98 E-value=18 Score=18.27 Aligned_cols=48 Identities=15% Similarity=0.389 Sum_probs=41.4 Q ss_pred HHHHHHHHHHCCCEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEE Q ss_conf 99999999731968999955689898814778999999999997649889979 Q 537021.9.peg.8 28 REIVQRCLELSATSIILVHNHPSGNPNPSDADINMTQNIITTLNPLNIIVHDH 80 (97) Q Consensus 28 r~i~~~al~~~A~~vilaHNHPsG~~~PS~~Di~~T~~l~~a~~~l~i~llDH 80 (97) -+..++|++.+-+++.+ -|+..|=++.++.|+++.+.+...|+.+--- T Consensus 82 ~e~i~~ai~~Gf~SVM~-----DgS~lp~eeNi~~T~~vv~~ah~~gv~VEaE 129 (276) T cd00947 82 FELIKRAIRAGFSSVMI-----DGSHLPFEENVAKTKEVVELAHAYGVSVEAE 129 (276) T ss_pred HHHHHHHHHCCCCEEEE-----ECCCCCHHHHHHHHHHHHHHHHHHCCEEEEE T ss_conf 99999999839987985-----2898999999999999999999819848998 No 13 >PRK05764 aspartate aminotransferase; Provisional Probab=56.34 E-value=17 Score=18.44 Aligned_cols=63 Identities=14% Similarity=0.227 Sum_probs=35.4 Q ss_pred ECHHHHHHHHHHHCCCEEEEEE-ECCCCCCCCCHHHHHHHHHHHHHHHHCCCEEE-----EEEEECCCCEEEHH Q ss_conf 3299999999973196899995-56898988147789999999999976498899-----79998599599745 Q 537021.9.peg.8 25 VYIREIVQRCLELSATSIILVH-NHPSGNPNPSDADINMTQNIITTLNPLNIIVH-----DHIIIGKDAFVSFK 92 (97) Q Consensus 25 v~~r~i~~~al~~~A~~vilaH-NHPsG~~~PS~~Di~~T~~l~~a~~~l~i~ll-----DHiIv~~~~~~S~r 92 (97) +++-+ ++.++..+...++++. |-|+|...+- + --++|.+.|+.-++-++ .+++..+..+.|+. T Consensus 152 ~d~~~-l~~~~~~~~k~i~l~~P~NPtG~v~s~-~---~l~~l~~~a~~~~i~ii~DEiY~~l~~~~~~~~~~~ 220 (389) T PRK05764 152 LTPEQ-LEAAITPKTKALILNSPSNPTGAVYSK-E---ELEAIADVAVEHDIWVLSDEIYEKLVYDGFEFTSIA 220 (389) T ss_pred CCHHH-HHHHCCCCCEEEEECCCCCCCCCCCCH-H---HHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCHH T ss_conf 89999-998638466499988998998867577-9---999999998863532751013435556898888877 No 14 >PRK13399 fructose-1,6-bisphosphate aldolase; Provisional Probab=55.56 E-value=20 Score=18.03 Aligned_cols=49 Identities=14% Similarity=0.297 Sum_probs=36.6 Q ss_pred HHHHHHHHHHCCCEEEEEEECCCCCCCC-------CHHHHHHHHHHHHHHHHCCCEEEEEE Q ss_conf 9999999973196899995568989881-------47789999999999976498899799 Q 537021.9.peg.8 28 REIVQRCLELSATSIILVHNHPSGNPNP-------SDADINMTQNIITTLNPLNIIVHDHI 81 (97) Q Consensus 28 r~i~~~al~~~A~~vilaHNHPsG~~~P-------S~~Di~~T~~l~~a~~~l~i~llDHi 81 (97) -+..+++++.+-+|+.+ .|+..| -++.++.|+++.+.+...|+.+---+ T Consensus 88 ~e~~~~ai~~GFtSVMi-----DgS~lP~~e~~~sfeeNI~~Tk~vve~Ah~~gv~VEaEl 143 (347) T PRK13399 88 PATCQSAIRSGFTSVMM-----DGSLLADGKTPASYDYNVDVTRRVTEMAHAVGVSVEGEL 143 (347) T ss_pred HHHHHHHHHCCCCEEEE-----CCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCEEEEEC T ss_conf 99999998719974763-----465565667765589999999999999998198899831 No 15 >PRK07143 hypothetical protein; Provisional Probab=53.99 E-value=21 Score=17.87 Aligned_cols=69 Identities=16% Similarity=0.227 Sum_probs=48.6 Q ss_pred EEEEEEEECCCCEEEECHHHHHHHHHHHCCCEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCEEE Q ss_conf 348998540463589329999999997319689999556898988147789999999999976498899 Q 537021.9.peg.8 10 IADEVQSRGTIDHVPVYIREIVQRCLELSATSIILVHNHPSGNPNPSDADINMTQNIITTLNPLNIIVH 78 (97) Q Consensus 10 i~~e~i~~Gt~~~~~v~~r~i~~~al~~~A~~vilaHNHPsG~~~PS~~Di~~T~~l~~a~~~l~i~ll 78 (97) +....++.|.-|.+..-=+.+++.|...+...+++.--||.-.+..+..-+...++=.+.++.+|+..+ T Consensus 15 ~~~svvtIG~FDGvH~GHq~Li~~~~~~~~~~~~~~~~~P~~~~~~~~~~l~~~~eK~~~l~~~Gvd~~ 83 (277) T PRK07143 15 FEKPTFVIGGFESFHLGHLKLYKKALEISGEIVFIVVKNPENLPKNTNENFFDLNSRIQMFANLNIKNV 83 (277) T ss_pred CCCCEEEEECCCCCCHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEE T ss_conf 488599998373426899999999998579967999338643657765347999999999997599999 No 16 >PRK06806 fructose-bisphosphate aldolase; Provisional Probab=53.80 E-value=21 Score=17.85 Aligned_cols=49 Identities=24% Similarity=0.414 Sum_probs=42.7 Q ss_pred HHHHHHHHHHCCCEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEE Q ss_conf 999999997319689999556898988147789999999999976498899799 Q 537021.9.peg.8 28 REIVQRCLELSATSIILVHNHPSGNPNPSDADINMTQNIITTLNPLNIIVHDHI 81 (97) Q Consensus 28 r~i~~~al~~~A~~vilaHNHPsG~~~PS~~Di~~T~~l~~a~~~l~i~llDHi 81 (97) -+..++|++.+-+++.+ -|+..|=++.+..|+++.+.+...|+.+---+ T Consensus 87 ~e~i~~ai~~GfsSVM~-----DgS~l~~eeNi~~Tkevve~Ah~~gv~VEaEl 135 (281) T PRK06806 87 FEKIKEALEIGFTSVMF-----DGSHYPLEENIQKTKEIVELAKQYGATVEAEI 135 (281) T ss_pred HHHHHHHHHCCCCEEEE-----ECCCCCHHHHHHHHHHHHHHHHHCCCEEEEEE T ss_conf 99999999829987996-----09989999999999999999988598699973 No 17 >TIGR01481 ccpA catabolite control protein A; InterPro: IPR006377 Catabolite control protein A is a LacI family global transcriptional regulator found in Gram-positive bacteria. CcpA is involved in repressing carbohydrate utilization genes [ex: alpha-amylase (amyE), acetyl-coenzyme A synthase (acsA)] and in activating genes involved in transporting excess carbon from the cell [e.g. acetate kinase (ackA), alpha-acetolactate synthase (alsS)]. Additionally, disruption of CcpA in Bacillus megaterium, Staphylococcus xylosus, Lactobacillus casei and Lactocacillus pentosus also decreases growth rate, which suggests CcpA is involved in the regulation of other metabolic pathways.; GO: 0030528 transcription regulator activity, 0045449 regulation of transcription. Probab=53.18 E-value=19 Score=18.22 Aligned_cols=61 Identities=18% Similarity=0.209 Sum_probs=53.3 Q ss_pred HHHHHHHHHHHCCCEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEECCCCEEEHHH Q ss_conf 9999999997319689999556898988147789999999999976498899799985995997452 Q 537021.9.peg.8 27 IREIVQRCLELSATSIILVHNHPSGNPNPSDADINMTQNIITTLNPLNIIVHDHIIIGKDAFVSFKG 93 (97) Q Consensus 27 ~r~i~~~al~~~A~~vilaHNHPsG~~~PS~~Di~~T~~l~~a~~~l~i~llDHiIv~~~~~~S~re 93 (97) .+|+....+..++..|-++ ||...-|-.-..-+.-++++++..||.+-|-+|+.. -|||++ T Consensus 165 ~kEav~~l~~~GhK~iAfv----~G~L~~~~N~~~~l~GykE~L~~~gI~~~e~L~~~~--~YsY~~ 225 (332) T TIGR01481 165 TKEAVKELIAKGHKKIAFV----SGSLSDSVNGEVRLEGYKEALKKAGIAYEEDLVIEA--KYSYDA 225 (332) T ss_pred HHHHHHHHHHCCCCEEEEE----ECCCCCCCCCEECCCHHHHHHHHCCCCCCCCEEEEC--CCCCHH T ss_conf 9999999863358268887----367571104520133046788736874142213304--555044 No 18 >smart00232 JAB_MPN JAB/MPN domain. Domain in Jun kinase activation domain binding protein and proteasomal subunits. Domain at Mpr1p and Pad1p N-termini. Domain of unknown function. Probab=52.84 E-value=22 Score=17.76 Aligned_cols=50 Identities=14% Similarity=0.250 Sum_probs=34.4 Q ss_pred EEEECHHHHHH---HHHHHCCCEEEEEEECCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 58932999999---999731968999955689898814778999999999997 Q 537021.9.peg.8 22 HVPVYIREIVQ---RCLELSATSIILVHNHPSGNPNPSDADINMTQNIITTLN 71 (97) Q Consensus 22 ~~~v~~r~i~~---~al~~~A~~vilaHNHPsG~~~PS~~Di~~T~~l~~a~~ 71 (97) ....++.+.+. ++...+..-+-.-|-||...+.||..|++.-...++... T Consensus 56 ~~~~~~~~~~~~~~~~~~~~~~vVGwYhShp~~~~~~S~~Di~~~~~~q~~~~ 108 (135) T smart00232 56 EYDEDYSHLMDEELKKVNKDLEIVGWYHSHPDESPFPSEVDVATHESYQAPWP 108 (135) T ss_pred EEHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHCCC T ss_conf 20152999999999975899806999976788687058738999999983499 No 19 >PRK08610 fructose-bisphosphate aldolase; Reviewed Probab=52.77 E-value=22 Score=17.76 Aligned_cols=49 Identities=12% Similarity=0.305 Sum_probs=41.7 Q ss_pred HHHHHHHHHHCCCEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEE Q ss_conf 999999997319689999556898988147789999999999976498899799 Q 537021.9.peg.8 28 REIVQRCLELSATSIILVHNHPSGNPNPSDADINMTQNIITTLNPLNIIVHDHI 81 (97) Q Consensus 28 r~i~~~al~~~A~~vilaHNHPsG~~~PS~~Di~~T~~l~~a~~~l~i~llDHi 81 (97) -+..++|++.+-+++.+ .|+..|=++.++.|+++.+.+...|+.+---+ T Consensus 90 ~e~~~~ai~~GFtSVM~-----DgS~l~~eeNi~~Tk~vv~~Ah~~gv~VEaEl 138 (286) T PRK08610 90 FEKCKEAIDAGFTSVMI-----DASHSPFEENVATTKKVVEYAHEKGVSVEAEL 138 (286) T ss_pred HHHHHHHHHCCCCEEEE-----CCCCCCHHHHHHHHHHHHHHHHHHCCEEEEEE T ss_conf 99999999719998998-----19989899999999999999987088269975 No 20 >PRK07084 fructose-bisphosphate aldolase; Provisional Probab=51.83 E-value=23 Score=17.67 Aligned_cols=49 Identities=14% Similarity=0.492 Sum_probs=42.5 Q ss_pred HHHHHHHHHHCCCEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEE Q ss_conf 999999997319689999556898988147789999999999976498899799 Q 537021.9.peg.8 28 REIVQRCLELSATSIILVHNHPSGNPNPSDADINMTQNIITTLNPLNIIVHDHI 81 (97) Q Consensus 28 r~i~~~al~~~A~~vilaHNHPsG~~~PS~~Di~~T~~l~~a~~~l~i~llDHi 81 (97) -+.++.|++.+-+++.+ -|+..|=++.+..|+++.+.+...|+.+---+ T Consensus 98 ~e~i~~ai~~GFtSVM~-----DgS~lp~eeNI~~Tk~vv~~Ah~~gv~VEaEl 146 (321) T PRK07084 98 FELCKDCIDSGFSSVMI-----DGSHLPYEENVALTKKVVEYAHQFDVTVEGEL 146 (321) T ss_pred HHHHHHHHHCCCCEEEE-----CCCCCCHHHHHHHHHHHHHHHHHCCCEEEEEE T ss_conf 99999999809987996-----39979999999999999999726399699975 No 21 >PRK06801 hypothetical protein; Provisional Probab=51.65 E-value=23 Score=17.65 Aligned_cols=49 Identities=18% Similarity=0.377 Sum_probs=41.0 Q ss_pred HHHHHHHHHHCCCEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEE Q ss_conf 999999997319689999556898988147789999999999976498899799 Q 537021.9.peg.8 28 REIVQRCLELSATSIILVHNHPSGNPNPSDADINMTQNIITTLNPLNIIVHDHI 81 (97) Q Consensus 28 r~i~~~al~~~A~~vilaHNHPsG~~~PS~~Di~~T~~l~~a~~~l~i~llDHi 81 (97) -+....|++.+-+++.+ -|+..|=++.++.|+++.+.+...|+.+---+ T Consensus 87 ~e~i~~ai~~Gf~SVM~-----DgS~l~~eeNi~~Tk~vve~ah~~gv~VEaEl 135 (286) T PRK06801 87 FEAVVRALRLGFSSVMF-----DGSTLEYEENVRQTREVVKMCHAVGVSVEAEL 135 (286) T ss_pred HHHHHHHHHCCCCEEEE-----ECCCCCHHHHHHHHHHHHHHHHHCCCEEEEEE T ss_conf 99999999829987997-----49989999999999999999988498599996 No 22 >PRK11178 uridine phosphorylase; Provisional Probab=51.31 E-value=15 Score=18.79 Aligned_cols=39 Identities=18% Similarity=0.268 Sum_probs=35.1 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEECCCCEE Q ss_conf 898814778999999999997649889979998599599 Q 537021.9.peg.8 51 GNPNPSDADINMTQNIITTLNPLNIIVHDHIIIGKDAFV 89 (97) Q Consensus 51 G~~~PS~~Di~~T~~l~~a~~~l~i~llDHiIv~~~~~~ 89 (97) ...-|.-+|.+++..|.++++.++++..==++++.++|| T Consensus 123 ~~~~pa~ad~~l~~~l~~aa~~~~~~~~~G~t~S~d~FY 161 (251) T PRK11178 123 PLEFPAVADFECTTALVEAAKSIGATTHVGVTASSDTFY 161 (251) T ss_pred CCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEEECCCCC T ss_conf 766764379999999999999829996586897647778 No 23 >pfam11697 DUF3293 Protein of unknown function (DUF3293). This bacterial family of proteins has no known function. Probab=51.22 E-value=21 Score=17.93 Aligned_cols=51 Identities=16% Similarity=0.130 Sum_probs=35.7 Q ss_pred EEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEECCCCEEEHHHCCC Q ss_conf 999556898988147789999999999976498899799985995997452589 Q 537021.9.peg.8 43 ILVHNHPSGNPNPSDADINMTQNIITTLNPLNIIVHDHIIIGKDAFVSFKGLRI 96 (97) Q Consensus 43 ilaHNHPsG~~~PS~~Di~~T~~l~~a~~~l~i~llDHiIv~~~~~~S~re~gl 96 (97) |-||| |.+...+-.+.....++|.+.+...|.....-+=...++ +..|.++ T Consensus 1 ITA~N-P~s~~~s~~eN~~~n~~L~~~L~~~g~~~~~~~G~~~~~--~w~E~Sf 51 (73) T pfam11697 1 ITAWN-PGSQLLSAEENRRRNAALLRELAALGYPYVPGIGGDPDG--SWPEESF 51 (73) T ss_pred CCCCC-CCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCC--CCCCCEE T ss_conf 90218-898768999999999999999998597169864438998--8457613 No 24 >COG2820 Udp Uridine phosphorylase [Nucleotide transport and metabolism] Probab=48.69 E-value=26 Score=17.37 Aligned_cols=40 Identities=25% Similarity=0.457 Sum_probs=35.6 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEECCCCEE Q ss_conf 9898814778999999999997649889979998599599 Q 537021.9.peg.8 50 SGNPNPSDADINMTQNIITTLNPLNIIVHDHIIIGKDAFV 89 (97) Q Consensus 50 sG~~~PS~~Di~~T~~l~~a~~~l~i~llDHiIv~~~~~~ 89 (97) -+.--|.-+|-.+|..|+++++.++++.+==++++.|.|| T Consensus 122 ~~~~~PAv~d~~~t~al~~aa~~~~~~~~vG~v~S~D~FY 161 (248) T COG2820 122 APEEFPAVADFELTNALVEAAESLGVTVHVGVVASSDAFY 161 (248) T ss_pred CCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEEEEECCCCC T ss_conf 6767888799799999999998569725887776245322 No 25 >PRK07998 gatY putative fructose-1,6-bisphosphate aldolase; Reviewed Probab=47.97 E-value=27 Score=17.30 Aligned_cols=49 Identities=20% Similarity=0.330 Sum_probs=40.8 Q ss_pred HHHHHHHHHHCCCEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEE Q ss_conf 999999997319689999556898988147789999999999976498899799 Q 537021.9.peg.8 28 REIVQRCLELSATSIILVHNHPSGNPNPSDADINMTQNIITTLNPLNIIVHDHI 81 (97) Q Consensus 28 r~i~~~al~~~A~~vilaHNHPsG~~~PS~~Di~~T~~l~~a~~~l~i~llDHi 81 (97) -+..+.|++.+-+++.+ -|+..|=++.++.|+++.+.+...|+.+---+ T Consensus 87 ~e~i~~ai~~GftSVM~-----DgS~l~~eeNi~~Tk~vv~~Ah~~gv~VEaEl 135 (283) T PRK07998 87 FEDVKQAVRAGFTSVMI-----DGAALPFEENIAFTQEAVDFCKSYGVPVEAEL 135 (283) T ss_pred HHHHHHHHHCCCCEEEE-----ECCCCCHHHHHHHHHHHHHHHHHCCCEEEEEE T ss_conf 99999999739988986-----09989999999999999999977699799985 No 26 >COG0191 Fba Fructose/tagatose bisphosphate aldolase [Carbohydrate transport and metabolism] Probab=47.61 E-value=27 Score=17.27 Aligned_cols=49 Identities=14% Similarity=0.339 Sum_probs=42.0 Q ss_pred HHHHHHHHHHCCCEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEE Q ss_conf 999999997319689999556898988147789999999999976498899799 Q 537021.9.peg.8 28 REIVQRCLELSATSIILVHNHPSGNPNPSDADINMTQNIITTLNPLNIIVHDHI 81 (97) Q Consensus 28 r~i~~~al~~~A~~vilaHNHPsG~~~PS~~Di~~T~~l~~a~~~l~i~llDHi 81 (97) -+.+..+++.+=+|+.+ -|+..|=++.+.+|++..+.|...|+.+---+ T Consensus 88 ~~~~~~ai~~GFsSvMi-----DgS~~p~eENi~~tkevv~~ah~~gvsVEaEl 136 (286) T COG0191 88 FEDCKQAIRAGFSSVMI-----DGSHLPFEENIAITKEVVEFAHAYGVSVEAEL 136 (286) T ss_pred HHHHHHHHHCCCCEEEE-----CCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEE T ss_conf 99999997568825875-----38757788899999999999998298189985 No 27 >TIGR01241 FtsH_fam ATP-dependent metallopeptidase HflB; InterPro: IPR005936 Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site . The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This group of metallopeptidases belong to MEROPS peptidase family M41 (FtsH endopeptidase family, clan MA(E)). The predicted active site residues for members of this family and thermolysin, the type example for clan MA, occur in the motif HEXXH. FtsH is a membrane-anchored ATP-dependent protease that degrades misfolded or misassembled membrane proteins as well as a subset of cytoplasmic regulatory proteins. FtsH is a 647-residue protein of 70 kDa, with two putative transmembrane segments towards its N terminus which anchor the protein to the membrane, giving rise to a periplasmic domain of 70 residues and a cytoplasmic segment of 520 residues containing the ATPase and protease domains . ; GO: 0004222 metalloendopeptidase activity, 0030163 protein catabolic process, 0016020 membrane. Probab=47.33 E-value=16 Score=18.55 Aligned_cols=30 Identities=20% Similarity=0.274 Sum_probs=23.7 Q ss_pred EEEEEEEC-CCCEEEECHHHHHHHHHHHCCCEEEEE Q ss_conf 48998540-463589329999999997319689999 Q 537021.9.peg.8 11 ADEVQSRG-TIDHVPVYIREIVQRCLELSATSIILV 45 (97) Q Consensus 11 ~~e~i~~G-t~~~~~v~~r~i~~~al~~~A~~vila 45 (97) .+...|+| +.+++ |++|..| +.||.++|.. T Consensus 127 dFVEMFVGVGASRV----RDLFeqA-K~nAPCIIFI 157 (505) T TIGR01241 127 DFVEMFVGVGASRV----RDLFEQA-KKNAPCIIFI 157 (505) T ss_pred CEEECCCCCCCEEH----HHHHHHH-HHHCCCEEEE T ss_conf 10111205640001----4457999-9718970564 No 28 >TIGR01361 DAHP_synth_Bsub phospho-2-dehydro-3-deoxyheptonate aldolase; InterPro: IPR006268 These sequences are one of at least three types of phospho-2-dehydro-3-deoxyheptonate aldolase (DAHP synthase). This enzyme catalyzes the first of 7 steps in the biosynthesis of chorismate, that last common precursor of all three aromatic amino acids and of PABA, ubiquinone and menaquinone. Some members of this family, including an experimentally characterised member from Bacillus subtilis, are bifunctional, with a chorismate mutase domain N-terminal to this region. The member of this family from Synechocystis PCC 6803, CcmA, was shown to be essential for carboxysome formation. However, no other candidate for this enzyme is present in that species, chorismate biosynthesis does occur, other species having this protein lack carboxysomes but appear to make chorismate, and a requirement of CcmA for carboxysome formation does not prohibit a role in chorismate biosynthesis.; GO: 0016832 aldehyde-lyase activity, 0009073 aromatic amino acid family biosynthetic process. Probab=47.24 E-value=26 Score=17.38 Aligned_cols=45 Identities=13% Similarity=0.296 Sum_probs=34.0 Q ss_pred HHHHHHHHHCCCEEEE-EEECCCCCCCCCHHHHHHHHHHHHHHHHC Q ss_conf 9999999731968999-95568989881477899999999999764 Q 537021.9.peg.8 29 EIVQRCLELSATSIIL-VHNHPSGNPNPSDADINMTQNIITTLNPL 73 (97) Q Consensus 29 ~i~~~al~~~A~~vil-aHNHPsG~~~PS~~Di~~T~~l~~a~~~l 73 (97) -+-++|+..+|.++++ +|+||-.-..=|++-+....++.+..+.+ T Consensus 214 plA~AA~A~GADgl~iEVHp~Pe~ALSD~~Qql~~c~~f~~~~~~~ 259 (262) T TIGR01361 214 PLAKAAIAAGADGLMIEVHPDPEKALSDSKQQLTPCEEFKRLVKEL 259 (262) T ss_pred HHHHHHHHHCCCEEEEEECCCHHHHCCCCCCCCCHHHHHHHHHHHH T ss_conf 9999989757473689866783332078711446678899999985 No 29 >pfam01116 F_bP_aldolase Fructose-bisphosphate aldolase class-II. Probab=45.88 E-value=29 Score=17.11 Aligned_cols=48 Identities=17% Similarity=0.323 Sum_probs=39.2 Q ss_pred HHHHHHHHHHCCCEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEE Q ss_conf 99999999731968999955689898814778999999999997649889979 Q 537021.9.peg.8 28 REIVQRCLELSATSIILVHNHPSGNPNPSDADINMTQNIITTLNPLNIIVHDH 80 (97) Q Consensus 28 r~i~~~al~~~A~~vilaHNHPsG~~~PS~~Di~~T~~l~~a~~~l~i~llDH 80 (97) -+..++|++.+-+++.+ -|+..|=++.++.|+++.+.+...|+.+--- T Consensus 85 ~e~~~~ai~~GftSVM~-----DgS~l~~eeNi~~T~~vv~~Ah~~gv~VEaE 132 (283) T pfam01116 85 FEGILEAIEAGFSSVMI-----DGSHLPFEENIAITKEVVEYAHARGVSVEAE 132 (283) T ss_pred HHHHHHHHHHCCCEEEE-----ECCCCCHHHHHHHHHHHHHHHHHCCCEEEEE T ss_conf 99999999819986986-----3897999999999999999998739848997 No 30 >PRK07315 fructose-bisphosphate aldolase; Provisional Probab=44.21 E-value=30 Score=16.95 Aligned_cols=49 Identities=18% Similarity=0.502 Sum_probs=40.8 Q ss_pred HHHHHHHHHHHCCCEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEE Q ss_conf 9999999997319689999556898988147789999999999976498899799 Q 537021.9.peg.8 27 IREIVQRCLELSATSIILVHNHPSGNPNPSDADINMTQNIITTLNPLNIIVHDHI 81 (97) Q Consensus 27 ~r~i~~~al~~~A~~vilaHNHPsG~~~PS~~Di~~T~~l~~a~~~l~i~llDHi 81 (97) -.++. .|+..+-+++.+ -|+..|-++.++.|+++.+.+...|+.+---+ T Consensus 89 ~~~~~-~ai~~GftSVM~-----DgS~lp~eeNi~~Tk~vv~~Ah~~gv~VEaEl 137 (293) T PRK07315 89 YEDAL-ECIEVGYTSIMF-----DGSHLPVEENLKLAKEVVEKAHAKGISVEAEV 137 (293) T ss_pred CHHHH-HHHHCCCCEEEE-----ECCCCCHHHHHHHHHHHHHHHHHCCCEEEEEE T ss_conf 28999-999829987998-----79989999999999999999987599489985 No 31 >PRK05450 3-deoxy-manno-octulosonate cytidylyltransferase; Provisional Probab=41.54 E-value=34 Score=16.71 Aligned_cols=46 Identities=17% Similarity=0.289 Sum_probs=35.9 Q ss_pred CHHHHHHHHHHHCCCEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEE Q ss_conf 299999999973196899995568989881477899999999999764988997999 Q 537021.9.peg.8 26 YIREIVQRCLELSATSIILVHNHPSGNPNPSDADINMTQNIITTLNPLNIIVHDHII 82 (97) Q Consensus 26 ~~r~i~~~al~~~A~~vilaHNHPsG~~~PS~~Di~~T~~l~~a~~~l~i~llDHiI 82 (97) +-.+|...+-..++..+.-..+|++|. .|+.+|++.++..=-|.+| T Consensus 51 dd~~I~~~~~~~g~~~imTs~~h~~GT-----------dRi~Ea~~~l~~~~~d~II 96 (248) T PRK05450 51 DDERIADAVEAFGGEVVMTSADHPSGT-----------DRIAEAAAKLGLSDDDIVV 96 (248) T ss_pred CCHHHHHHHHCCCCCEEECCCCCCCCH-----------HHHHHHHHHCCCCCCCEEE T ss_conf 885244353156862340464547703-----------8999999850877786899 No 32 >PRK08185 hypothetical protein; Provisional Probab=41.30 E-value=34 Score=16.69 Aligned_cols=48 Identities=19% Similarity=0.377 Sum_probs=38.4 Q ss_pred HHHHHHHHHHCCCEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEE Q ss_conf 99999999731968999955689898814778999999999997649889979 Q 537021.9.peg.8 28 REIVQRCLELSATSIILVHNHPSGNPNPSDADINMTQNIITTLNPLNIIVHDH 80 (97) Q Consensus 28 r~i~~~al~~~A~~vilaHNHPsG~~~PS~~Di~~T~~l~~a~~~l~i~llDH 80 (97) -+..++|++.+=+|+.+ -|+..|=++.++.|+++.+.+...|+.+--- T Consensus 81 ~e~~~~ai~~GFsSVM~-----DgS~lp~eeNi~~Tk~vv~~ah~~gv~VEaE 128 (283) T PRK08185 81 VEDVMRAIRCGFTSVMI-----DGSLLPYEENVALTKEVVELAHKVGVSVEGE 128 (283) T ss_pred HHHHHHHHHCCCCEEEE-----ECCCCCHHHHHHHHHHHHHHHHHCCCEEEEE T ss_conf 99999999829987986-----3898999999999999999998659838998 No 33 >COG2876 AroA 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase [Amino acid transport and metabolism] Probab=41.10 E-value=34 Score=16.67 Aligned_cols=33 Identities=12% Similarity=0.304 Sum_probs=25.4 Q ss_pred HHHHHHHHHCCCEEE-EEEECCCCCCCCCHHHHH Q ss_conf 999999973196899-995568989881477899 Q 537021.9.peg.8 29 EIVQRCLELSATSII-LVHNHPSGNPNPSDADIN 61 (97) Q Consensus 29 ~i~~~al~~~A~~vi-laHNHPsG~~~PS~~Di~ 61 (97) -+.+.|+..+|.+++ =+|+||.+...-|.+-+. T Consensus 233 pla~AA~AaGAdglmiEVHp~P~~AlsD~~Qql~ 266 (286) T COG2876 233 PLAKAAIAAGADGLMIEVHPDPEKALSDAKQQLT 266 (286) T ss_pred HHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCC T ss_conf 8899998616773699964795434576000179 No 34 >cd01989 STK_N The N-terminal domain of Eukaryotic Serine Threonine kinases. The Serine Threonine kinases are enzymes that belong to a very extensive family of proteins which share a conserved catalytic core common with both serine/threonine and tyrosine protein kinases. The N-terminal domain is homologous to the USP family which has a ATP binding fold. The N-terminal domain is predicted to be involved in ATP binding. Probab=40.82 E-value=34 Score=16.64 Aligned_cols=58 Identities=17% Similarity=0.226 Sum_probs=37.4 Q ss_pred HHHHHHHHHHHC---CCEEEEEEECCCCCCCCCH----------------HHHHHHHHHHHHHHHCCCEEEEEEEEC Q ss_conf 999999999731---9689999556898988147----------------789999999999976498899799985 Q 537021.9.peg.8 27 IREIVQRCLELS---ATSIILVHNHPSGNPNPSD----------------ADINMTQNIITTLNPLNIIVHDHIIIG 84 (97) Q Consensus 27 ~r~i~~~al~~~---A~~vilaHNHPsG~~~PS~----------------~Di~~T~~l~~a~~~l~i~llDHiIv~ 84 (97) -+.-++||+..- +..+++.|=||-....|.. +-.++-...++.|..-++.....++-+ T Consensus 12 S~~AL~WAl~n~~~~~~~lvLlHV~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~C~~~~V~~e~~v~e~ 88 (146) T cd01989 12 SKNALKWALDNLATKGQTIVLVHVHPPITSIPSSSGKLEVASAYKQEEDKEAKELLLPYRCFCSRKGVQCEDVVLED 88 (146) T ss_pred HHHHHHHHHHHHHCCCCEEEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEEEEEC T ss_conf 99999999986406698199999716887677765306678899888889999999999999987597289999834 No 35 >cd06403 PB1_Par6 The PB1 domain is an essential part of Par6 protein which in complex with Par3 and aPKC proteins is crucial for establishment of apical-basal polarity of animal cells. The PB1 domain is a modular domain mediating specific protein-protein interactions which play a role in many critical cell processes. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domains, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. Interactions of PB1 domains with other protein domains have been described as noncanonical PB1-interactions. The PB1 domain module is conserved in amoebas, fungi, animals, and plants. The Probab=40.74 E-value=31 Score=16.87 Aligned_cols=47 Identities=15% Similarity=0.289 Sum_probs=36.8 Q ss_pred HHHHHHHHHCCCEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCC Q ss_conf 99999997319689999556898988147789999999999976498 Q 537021.9.peg.8 29 EIVQRCLELSATSIILVHNHPSGNPNPSDADINMTQNIITTLNPLNI 75 (97) Q Consensus 29 ~i~~~al~~~A~~vilaHNHPsG~~~PS~~Di~~T~~l~~a~~~l~i 75 (97) .+++..=...-..+.++...|.|+..|=..|-.+.+.+..|-.++.| T Consensus 29 ~Ll~~~H~l~~~~f~i~Y~DphgDLLPINNDdN~~kAl~sa~PlLRi 75 (80) T cd06403 29 KLLEHLHHIPNVDFLIGYTDPHGDLLPINNDDNFLKALSSANPLLRI 75 (80) T ss_pred HHHHHHCCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHCCCCEEE T ss_conf 99999737889877999757998751613834799998706971468 No 36 >PRK09195 gatY tagatose-bisphosphate aldolase; Reviewed Probab=40.07 E-value=35 Score=16.57 Aligned_cols=48 Identities=10% Similarity=0.286 Sum_probs=40.2 Q ss_pred HHHHHHHHHHCCCEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEE Q ss_conf 99999999731968999955689898814778999999999997649889979 Q 537021.9.peg.8 28 REIVQRCLELSATSIILVHNHPSGNPNPSDADINMTQNIITTLNPLNIIVHDH 80 (97) Q Consensus 28 r~i~~~al~~~A~~vilaHNHPsG~~~PS~~Di~~T~~l~~a~~~l~i~llDH 80 (97) -+..++|++.+-+++.+ -|+..|=++.+..|+++.+.+...|+.+--- T Consensus 87 ~e~i~~ai~~GftSVM~-----DgS~l~~eeNi~~Tk~vv~~Ah~~gv~VEaE 134 (284) T PRK09195 87 FDDIAQKVRSGVRSVMI-----DGSHLPFAQNISLVKEVVDFCHRFDVSVEAE 134 (284) T ss_pred HHHHHHHHHCCCCEEEE-----ECCCCCHHHHHHHHHHHHHHHHHHCCEEEEE T ss_conf 99999999749986886-----3898999999999999999998728818997 No 37 >PRK12857 putative aldolase; Reviewed Probab=39.71 E-value=36 Score=16.54 Aligned_cols=48 Identities=21% Similarity=0.458 Sum_probs=40.9 Q ss_pred HHHHHHHHHHCCCEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEE Q ss_conf 99999999731968999955689898814778999999999997649889979 Q 537021.9.peg.8 28 REIVQRCLELSATSIILVHNHPSGNPNPSDADINMTQNIITTLNPLNIIVHDH 80 (97) Q Consensus 28 r~i~~~al~~~A~~vilaHNHPsG~~~PS~~Di~~T~~l~~a~~~l~i~llDH 80 (97) -+..++|+..+=+++.+ .|+..|=++.++.|+++.+.+...|+.+--- T Consensus 87 ~e~i~~ai~~Gf~SVM~-----DgS~l~~eeNi~~Tk~vv~~ah~~gv~VEaE 134 (284) T PRK12857 87 FEQVMKCIRNGFTSVMI-----DGSKLPLEENIALTKKVVEIAHAVGVSVEAE 134 (284) T ss_pred HHHHHHHHHCCCCEEEE-----ECCCCCHHHHHHHHHHHHHHHHHCCCEEEEE T ss_conf 99999999809987997-----2898999999999999999998708915885 No 38 >TIGR02256 ICE_VC0181 conserved hypothetical protein; InterPro: IPR011952 This uncharacterised protein is found in several proteobacteria, among them Rhizobium sp. (strain NGR234), Vibrio cholerae, Myxococcus xanthus, and Escherichia coli strain ECOR31. In the latter, it is part of an integrative and conjugative element that is readily induced to excise and circularise .. Probab=39.50 E-value=15 Score=18.83 Aligned_cols=43 Identities=26% Similarity=0.344 Sum_probs=27.9 Q ss_pred EEECCCCCC-CCCHHHHHHHHHHHHHHHHCCCEEEEEEEECCCC Q ss_conf 955689898-8147789999999999976498899799985995 Q 537021.9.peg.8 45 VHNHPSGNP-NPSDADINMTQNIITTLNPLNIIVHDHIIIGKDA 87 (97) Q Consensus 45 aHNHPsG~~-~PS~~Di~~T~~l~~a~~~l~i~llDHiIv~~~~ 87 (97) =|-||...| .||--|..-=++...+-+.-.=+=+==+|||+.+ T Consensus 80 WHTHPed~pG~PS~TDr~swrt~i~s~E~raPrP~lllivGrk~ 123 (143) T TIGR02256 80 WHTHPEDQPGEPSYTDRRSWRTIIVSDEARAPRPMLLLIVGRKE 123 (143) T ss_pred CCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCEEEEEECCCC T ss_conf 30588874225875447877500247434689844899844877 No 39 >pfam09863 DUF2090 Uncharacterized protein conserved in bacteria (DUF2090). This domain, found in various prokaryotic carbohydrate kinases, has no known function. Probab=36.43 E-value=40 Score=16.23 Aligned_cols=46 Identities=13% Similarity=0.055 Sum_probs=39.4 Q ss_pred EEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEECCCC Q ss_conf 9999556898988147789999999999976498899799985995 Q 537021.9.peg.8 42 IILVHNHPSGNPNPSDADINMTQNIITTLNPLNIIVHDHIIIGKDA 87 (97) Q Consensus 42 vilaHNHPsG~~~PS~~Di~~T~~l~~a~~~l~i~llDHiIv~~~~ 87 (97) =+|++-||..++.--.+-.+.-++|.++|..-|-+|+=-+|..++. T Consensus 142 K~Lv~yhPdD~~~lr~~Qe~~l~~L~~ac~~~g~elLLEiI~p~~~ 187 (310) T pfam09863 142 KCLVFYHPDDDAALRAEQEAKLKRLYEACRRSGHELLLEVIPPKVG 187 (310) T ss_pred EEEEECCCCCCHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCC T ss_conf 9987538999799999999999999999984798168887257778 No 40 >cd02203 PurL_repeat1 PurL subunit of the formylglycinamide ribonucleotide amidotransferase (FGAR-AT), first repeat. FGAR-AT catalyzes the ATP-dependent conversion of formylglycinamide ribonucleotide (FGAR) and glutamine to formylglycinamidine ribonucleotide (FGAM), ADP, phosphate, and glutamate in the fourth step of the purine biosynthetic pathway. In eukaryotes and Gram-negative bacteria, FGAR-AT is encoded by the purL gene as a multidomain protein with a molecular mass of about 140 kDa. In Gram-positive bacteria and archaea FGAR-AT is a complex of three proteins: PurS, PurL, and PurQ. PurL itself contains two tandem N- and C-terminal domains (four domains altogether). The N-terminal domains bind ATP and are related to the ATP-binding domains of HypE, ThiL, SelD and PurM. Probab=36.42 E-value=11 Score=19.48 Aligned_cols=16 Identities=6% Similarity=0.239 Sum_probs=5.8 Q ss_pred CCEEEEEEEEECCCCE Q ss_conf 8773489985404635 Q 537021.9.peg.8 7 NILIADEVQSRGTIDH 22 (97) Q Consensus 7 ~~li~~e~i~~Gt~~~ 22 (97) +-+.+...++.|++.. T Consensus 210 ~li~~ihD~GaGGl~~ 225 (313) T cd02203 210 GLIVGIQDLGAGGLSS 225 (313) T ss_pred CCEEEEEECCCCCHHH T ss_conf 9868888658850654 No 41 >cd02517 CMP-KDO-Synthetase CMP-KDO synthetase catalyzes the activation of KDO which is an essential component of the lipopolysaccharide. CMP-KDO Synthetase: 3-Deoxy-D-manno-octulosonate cytidylyltransferase (CMP-KDO synthetase) catalyzes the conversion of CTP and 3-deoxy-D-manno-octulosonate into CMP-3-deoxy-D-manno-octulosonate (CMP-KDO) and pyrophosphate. KDO is an essential component of the lipopolysaccharide found in the outer surface of gram-negative eubacteria. It is also a constituent of the capsular polysaccharides of some gram-negative eubacteria. Its presence in the cell wall polysaccharides of green algae and plant were also discovered. However, they have not been found in yeast and animals. The absence of the enzyme in mammalian cells makes it an attractive target molecule for drug design. Probab=36.21 E-value=35 Score=16.62 Aligned_cols=45 Identities=20% Similarity=0.308 Sum_probs=35.3 Q ss_pred CHHHHHHHHHHHCCCEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEE Q ss_conf 299999999973196899995568989881477899999999999764988997999 Q 537021.9.peg.8 26 YIREIVQRCLELSATSIILVHNHPSGNPNPSDADINMTQNIITTLNPLNIIVHDHII 82 (97) Q Consensus 26 ~~r~i~~~al~~~A~~vilaHNHPsG~~~PS~~Di~~T~~l~~a~~~l~i~llDHiI 82 (97) +-.+|...+-..++..+.-.-+|++| |.|+.+|++.++.. -|.+| T Consensus 50 Dd~eI~~~~~~~g~~~imT~~~h~~G-----------TdRi~ea~~~l~~~-~D~ii 94 (239) T cd02517 50 DDERIADAVESFGGKVVMTSPDHPSG-----------TDRIAEVAEKLDAD-DDIVV 94 (239) T ss_pred CCHHHHHHHCCCCCCCCCCCCCCCCC-----------CHHHHHHHHHCCCC-CCEEE T ss_conf 82655533214797401046433562-----------15999999971998-89899 No 42 >pfam05853 DUF849 Prokaryotic protein of unknown function (DUF849). This family consists of several hypothetical prokaryotic proteins with no known function. Probab=36.08 E-value=41 Score=16.19 Aligned_cols=49 Identities=20% Similarity=0.330 Sum_probs=34.5 Q ss_pred EEEECHHHHHHHHHHHCCCEEEEEEECCCC--CCCCCH--HHH-HHHHHHHHHH Q ss_conf 589329999999997319689999556898--988147--789-9999999999 Q 537021.9.peg.8 22 HVPVYIREIVQRCLELSATSIILVHNHPSG--NPNPSD--ADI-NMTQNIITTL 70 (97) Q Consensus 22 ~~~v~~r~i~~~al~~~A~~vilaHNHPsG--~~~PS~--~Di-~~T~~l~~a~ 70 (97) ..++-|-||.+.|..+...|--++|-|.-. +-+||. +.. ++..++++.| T Consensus 20 ~lP~Tp~Eia~~A~~c~~AGAsivH~HvRd~~dG~~s~d~~~y~e~i~~Ir~~~ 73 (274) T pfam05853 20 ALPVTPEEIAEEAVAAAEAGAAIVHLHVRDPEDGRPSQDPELFREVLERIREAV 73 (274) T ss_pred CCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHC T ss_conf 998998999999999997087389988447888990688999999999999878 No 43 >PRK09196 fructose-1,6-bisphosphate aldolase; Reviewed Probab=33.82 E-value=45 Score=15.98 Aligned_cols=49 Identities=18% Similarity=0.404 Sum_probs=36.3 Q ss_pred HHHHHHHHHHCCCEEEEEEECCCCCCCC------C-HHHHHHHHHHHHHHHHCCCEEEEEE Q ss_conf 9999999973196899995568989881------4-7789999999999976498899799 Q 537021.9.peg.8 28 REIVQRCLELSATSIILVHNHPSGNPNP------S-DADINMTQNIITTLNPLNIIVHDHI 81 (97) Q Consensus 28 r~i~~~al~~~A~~vilaHNHPsG~~~P------S-~~Di~~T~~l~~a~~~l~i~llDHi 81 (97) -+.+++|++.+-+++.+= |+..| | ++.++.|+++.+.+...|+.+---+ T Consensus 88 ~e~i~~ai~~GFtSVMiD-----gS~lp~~~~~~sfeeNi~~Tkevve~Ah~~gv~VEaEl 143 (347) T PRK09196 88 PATCQRAIQLGFTSVMMD-----GSLKADGKTPASYEYNVDVTRKVVEMAHACGVSVEGEL 143 (347) T ss_pred HHHHHHHHHCCCCEEEEE-----CCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCEEEEEE T ss_conf 999999986489838973-----65565556777889999999999999987398499960 No 44 >TIGR01296 asd_B aspartate-semialdehyde dehydrogenase; InterPro: IPR005986 Bacteria, plants and fungi metabolise aspartic acid to produce four amino acids - lysine, threonine, methionine and isoleucine - in a series of reactions known as the aspartate pathway. Additionally, several important metabolic intermediates are produced by these reactions, such as diaminopimelic acid, an essential component of bacterial cell wall biosynthesis, and dipicolinic acid, which is involved in sporulation in Gram-positive bacteria. Members of the animal kingdom do not posses this pathway and must therefore acquire these essential amino acids through their diet. Research into improving the metabolic flux through this pathway has the potential to increase the yield of the essential amino acids in important crops, thus improving their nutritional value. Additionally, since the enzymes are not present in animals, inhibitors of them are promising targets for the development of novel antibiotics and herbicides. For more information see . Aspartate-semialdehyde dehydrogenase, the second enzyme in the aspartate pathway, converts aspartyl phosphate to aspartate-semialdehyde, the branch point intermediate between the lysine and threonine/methionine pathways. Based on sequence alignments, the aspartate-semialdehyde dehydrogenase family appears to have two distinct subgroups, one found in most bacteria (Gram-positve and Gram-negative), while the other is found primarily in organisms lacking peptidoglycan (archaea,fungi and some bacteria). Most studies have been performed on enzymes isolated from Gram-negative bacteria , , , . The N-terminal domain forms the active site and NADP-binding pocket, while C-terminal domain is primarily involved in hydrophobic intersubunit contacts. The catalytic mechanism involves the formation of a covalent thioester acyl-enzyme intermediate mediated through nucleophilic attack by an active site cysteine residue on the substrate aspartyl phosphate. Release of inorganic phosphate is followed by hydride transfer from NADPH to yield the product. The recently described archaeal structure suggests that the two subgroups of aspartate semi-aldehyde dehydrogenase share similar structures and have an identical catalytic mechanism, despite their relatively low sequence identity . Unlike the bacterial enzymes, the archaeal enzyme utilised both NAD and NADP as cofactor. This entry represents the subgroup of aspartate-semialdehyde dehydrogenase most commonly found in bacteria. Note that some bacteria, such as Chlamydia, contain the enzyme from the other subgroup (IPR005676 from INTERPRO) instead.; GO: 0004073 aspartate-semialdehyde dehydrogenase activity, 0009086 methionine biosynthetic process, 0009088 threonine biosynthetic process. Probab=33.04 E-value=40 Score=16.26 Aligned_cols=33 Identities=15% Similarity=0.374 Sum_probs=23.5 Q ss_pred ECHHHHHHHHHHHCCCEEEEEEECCCCCCCCCHHHH Q ss_conf 329999999997319689999556898988147789 Q 537021.9.peg.8 25 VYIREIVQRCLELSATSIILVHNHPSGNPNPSDADI 60 (97) Q Consensus 25 v~~r~i~~~al~~~A~~vilaHNHPsG~~~PS~~Di 60 (97) ++|+++-. |..+|.||+++-| ||++.-|++-+- T Consensus 268 ~~~e~~~E--~Lk~ApGV~~~D~-P~~~~Yp~P~~A 300 (350) T TIGR01296 268 ISPEDVRE--VLKNAPGVVVIDD-PSQNLYPTPLEA 300 (350) T ss_pred CCHHHHHH--HHCCCCEEEEEEC-CCCCCCCCCHHE T ss_conf 78889999--7347986799876-888778883101 No 45 >cd04819 PA_2 PA_2: Protease-associated (PA) domain subgroup 2. A subgroup of PA-domain containing proteins. The PA domain is an insert domain in a diverse fraction of proteases. The significance of the PA domain to many of the proteins in which it is inserted is undetermined. It may be a protein-protein interaction domain. At peptidase active sites, the PA domain may participate in substrate binding and/or promoting conformational changes, which influence the stability and accessibility of the site to substrate. Proteins in this group contain a C-terminal RING-finger domain. Proteins into which the PA domain is inserted include the following: i) various signal peptide peptidases: such as hSPPL2a and 2b, ii) various E3 ubiquitin ligases similar to human GRAIL (gene related to anergy in lymphocytes) protein, iii) various proteins containing a RING finger motif such as Arabidopsis ReMembR-H2 protein, iv) EDEM3 (ER-degradation-enhancing mannosidase-like 3 protein), v) various plant vacuola Probab=32.38 E-value=47 Score=15.83 Aligned_cols=28 Identities=21% Similarity=0.274 Sum_probs=22.2 Q ss_pred ECHHHHHHHHHHHCCCEEEEEEECCCCCC Q ss_conf 32999999999731968999955689898 Q 537021.9.peg.8 25 VYIREIVQRCLELSATSIILVHNHPSGNP 53 (97) Q Consensus 25 v~~r~i~~~al~~~A~~vilaHNHPsG~~ 53 (97) ++-+|-+..|...+|.+||+. ||-.|.. T Consensus 57 ~~~~eK~~~A~~aGA~gvvi~-N~v~G~l 84 (127) T cd04819 57 VDRKEKYAKAVAAGAAAFVVV-NTVPGVL 84 (127) T ss_pred CCHHHHHHHHHHCCCEEEEEE-ECCCCCC T ss_conf 028999999997797399999-3798743 No 46 >PRK05819 deoD purine nucleoside phosphorylase; Reviewed Probab=32.27 E-value=47 Score=15.82 Aligned_cols=39 Identities=18% Similarity=0.215 Sum_probs=33.7 Q ss_pred CCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEECCCCEEE Q ss_conf 988147789999999999976498899799985995997 Q 537021.9.peg.8 52 NPNPSDADINMTQNIITTLNPLNIIVHDHIIIGKDAFVS 90 (97) Q Consensus 52 ~~~PS~~Di~~T~~l~~a~~~l~i~llDHiIv~~~~~~S 90 (97) ...|.-+|..+++.+.++++..++...-=++++.+.||. T Consensus 123 ~~~pa~ad~~l~~~l~~~a~~~g~~~~~G~v~S~D~FY~ 161 (235) T PRK05819 123 HDFAPIADFDLLRKAYDAAKEKGITVHVGNVFSADLFYN 161 (235) T ss_pred CCCCCCCCHHHHHHHHHHHHHCCCCEEECCEECCCCCCC T ss_conf 675677999999999999997699677766014575748 No 47 >pfam01398 Mov34 Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has been shown that this domain occurs in prokaryotes. Probab=30.91 E-value=42 Score=16.15 Aligned_cols=31 Identities=16% Similarity=0.093 Sum_probs=23.7 Q ss_pred CCEEEEEEECCCCCCCCCHHHHHHHHHHHHH Q ss_conf 9689999556898988147789999999999 Q 537021.9.peg.8 39 ATSIILVHNHPSGNPNPSDADINMTQNIITT 69 (97) Q Consensus 39 A~~vilaHNHPsG~~~PS~~Di~~T~~l~~a 69 (97) -.-+---|-||+..+.||..|+..-..-++. T Consensus 84 ~~~VGWYhshp~~~~~lS~~dv~tq~~yq~~ 114 (117) T pfam01398 84 EEVVGWYHSHPGYGCWLSAVDVHTQELYQRY 114 (117) T ss_pred CCEEEEEECCCCCCCCCCHHHHHHHHHHHHH T ss_conf 5435688727877875588489989999852 No 48 >PRK13374 purine nucleoside phosphorylase; Provisional Probab=30.61 E-value=48 Score=15.79 Aligned_cols=40 Identities=15% Similarity=0.114 Sum_probs=34.5 Q ss_pred CCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEECCCCEEEH Q ss_conf 9881477899999999999764988997999859959974 Q 537021.9.peg.8 52 NPNPSDADINMTQNIITTLNPLNIIVHDHIIIGKDAFVSF 91 (97) Q Consensus 52 ~~~PS~~Di~~T~~l~~a~~~l~i~llDHiIv~~~~~~S~ 91 (97) ..-|..+|.++...+.++++..+++..-=++++.+.||.- T Consensus 124 ~~~~a~ad~~l~~~~~~~a~~~~~~~~~G~v~S~D~FY~~ 163 (233) T PRK13374 124 YDFAAIADYQLLEKAVETAREKGVRVKVGNVFSSDLFYDP 163 (233) T ss_pred CCCCCCCCHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCC T ss_conf 7667789999999999999974995887674466837189 No 49 >TIGR01162 purE phosphoribosylaminoimidazole carboxylase, catalytic subunit; InterPro: IPR000031 Phosphoribosylaminoimidazole carboxylase is a fusion protein in plants and fungi, but consists of two non-interacting proteins in bacteria, PurK and PurE. PurK, N5-carboxyaminoimidazole ribonucleotide (N5_CAIR) synthetase, catalyzes the conversion of 5-aminoimidazole ribonucleotide (AIR), ATP, and bicarbonate to N5-CAIR, ADP, and Pi. PurE converts N5-CAIR to CAIR, the sixth step of de novo purine biosynthesis. In the presence of high concentrations of bicarbonate, PurE is reported able to convert AIR to CAIR directly and without ATP. Some members of this family contain two copies of this domain . The crystal structure of PurE indicates a unique quaternary structure that confirms the octameric nature of the enzyme .; GO: 0004638 phosphoribosylaminoimidazole carboxylase activity, 0006189 'de novo' IMP biosynthetic process, 0009320 phosphoribosylaminoimidazole carboxylase complex. Probab=30.37 E-value=38 Score=16.37 Aligned_cols=28 Identities=18% Similarity=0.223 Sum_probs=23.1 Q ss_pred CCCHHHHHHHHHHHHHHHHCCCEEEEEE Q ss_conf 8147789999999999976498899799 Q 537021.9.peg.8 54 NPSDADINMTQNIITTLNPLNIIVHDHI 81 (97) Q Consensus 54 ~PS~~Di~~T~~l~~a~~~l~i~llDHi 81 (97) .=|+.|+..-++..+.|+.|||+.-=-+ T Consensus 5 MGS~SD~~~m~~a~~~L~~fgi~~e~~V 32 (159) T TIGR01162 5 MGSDSDLETMKKAAEILEEFGIPYELRV 32 (159) T ss_pred CCCCCCHHHHHHHHHHHHHCCCCEEEEE T ss_conf 3672547899999999985599667898 No 50 >TIGR01718 Uridine-psphlse uridine phosphorylase; InterPro: IPR010058 This entry represents a family of bacterial and archaeal uridine phosphorylases unrelated to the mammalian enzymes of the same name. The Escherichia coli , Salmonella and Klebsiella genes have been characterised. Sequences from Clostridium, Streptomyces, Treponema, Aeropyrum and Pyrobaculum are also included in this family, but it does not include related sequences from Halobacterium, which are more distantly related and represent enzymes with a slightly different substrate specificity. Also distantly related is a clade of archaeal sequences which are related to the DeoD family of inosine phosphorylases (IPR004402 from INTERPRO) as they are to these uridine phosphorylases. This clade includes a characterised protein from Sulfolobus solfataricus which has been miss-named as a methylthioadenosine phosphorylase, but which acts on inosine and guanosine - it is unclear whether uridine has been evaluated as a substrate .; GO: 0004850 uridine phosphorylase activity, 0009166 nucleotide catabolic process, 0005737 cytoplasm. Probab=30.33 E-value=51 Score=15.63 Aligned_cols=40 Identities=20% Similarity=0.307 Sum_probs=32.1 Q ss_pred CCCHHHHHHHHHHHHHHHHCCCEEEEEEEECCCCEEEHHH Q ss_conf 8147789999999999976498899799985995997452 Q 537021.9.peg.8 54 NPSDADINMTQNIITTLNPLNIIVHDHIIIGKDAFVSFKG 93 (97) Q Consensus 54 ~PS~~Di~~T~~l~~a~~~l~i~llDHiIv~~~~~~S~re 93 (97) -|-=+|-.+|.-|++|++.+|++-.==++-..|.||.=.| T Consensus 123 fPAVadf~vt~ALv~AAes~gvrhhvGvvas~D~FY~gqe 162 (248) T TIGR01718 123 FPAVADFVVTAALVKAAESVGVRHHVGVVASKDSFYVGQE 162 (248) T ss_pred CCCCCCHHHHHHHHHHHHHCCCEEEEEEEEECCCCCCCCC T ss_conf 8988844568889998873475121247862145578888 No 51 >TIGR00107 deoD purine nucleoside phosphorylase; InterPro: IPR004402 Purine nucleoside phosphorylase 2.4.2.1 from EC also called inosine phosphorylase (pnp) cleave guanosine or inosine to respective bases and sugar-1-phosphate molecules. These enzymes are involved in the salvage of nucleotides and nucleosides.; GO: 0004731 purine-nucleoside phosphorylase activity, 0006139 nucleobase nucleoside nucleotide and nucleic acid metabolic process. Probab=28.84 E-value=52 Score=15.58 Aligned_cols=49 Identities=22% Similarity=0.135 Sum_probs=39.8 Q ss_pred EEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEECCCCEEEHHHC Q ss_conf 995568989881477899999999999764988997999859959974525 Q 537021.9.peg.8 44 LVHNHPSGNPNPSDADINMTQNIITTLNPLNIIVHDHIIIGKDAFVSFKGL 94 (97) Q Consensus 44 laHNHPsG~~~PS~~Di~~T~~l~~a~~~l~i~llDHiIv~~~~~~S~re~ 94 (97) -.-|||.--.. -+|-++.++-+++++..|+.+.-=.+++.|-||+.+.+ T Consensus 116 ~f~~~P~D~~~--~Adf~L~~~A~~tAk~~g~~~~vG~vF~~D~FY~~~~~ 164 (234) T TIGR00107 116 RFVEVPEDFAA--IADFELVELAYDTAKALGIDVRVGNVFSADAFYSPDKD 164 (234) T ss_pred HCCCCCCCCCC--CCCHHHHHHHHHHHHHCCCCEEECCEECCCCCCCCCHH T ss_conf 05687644334--78989999999999867972776523311113074488 No 52 >PRK01213 phosphoribosylformylglycinamidine synthase II; Provisional Probab=27.65 E-value=19 Score=18.12 Aligned_cols=56 Identities=16% Similarity=0.148 Sum_probs=25.9 Q ss_pred CCEEEECHHHHHHHH--------HHHCCCEEEEEEECCCCCCCCCHHHH-----HHHHHHHHHHHHCCCEE Q ss_conf 635893299999999--------97319689999556898988147789-----99999999997649889 Q 537021.9.peg.8 20 IDHVPVYIREIVQRC--------LELSATSIILVHNHPSGNPNPSDADI-----NMTQNIITTLNPLNIIV 77 (97) Q Consensus 20 ~~~~~v~~r~i~~~a--------l~~~A~~vilaHNHPsG~~~PS~~Di-----~~T~~l~~a~~~l~i~l 77 (97) ..-+..+|..=-+.| +..+|.-+-+.-|=-.|+| .+++. +.-+-+.++|+.+++++ T Consensus 460 p~~~~~dP~~ga~~aV~Ea~rNlva~Ga~plai~d~lnfgnP--e~pe~~~~l~~av~g~~dac~~l~ipv 528 (736) T PRK01213 460 PRYVYLDPYEGAKLAVAEAYRNVAAVGAEPLAITDCLNFGNP--EKPEVMWQFVEAVRGIAEACRALGTPV 528 (736) T ss_pred CHHHCCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHCCCC--CCHHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 515206999999999999997788617861322125323897--787787889999999999999819996 No 53 >cd02039 cytidylyltransferase_like Cytidylyltransferase-like domain.Many ofthese proteins are known to use CTP or ATP and release pyrophosphate. Protein families that contain at least one copy of this domain include citrate lyase ligase, pantoate-beta-alanine ligase, glycerol-3-phosphate cytidyltransferase, ADP-heptose synthase, phosphocholine cytidylyltransferase, lipopolysaccharide core biosynthesis protein KdtB, the bifunctional protein NadR, and a number whose function is unknown. Probab=25.15 E-value=63 Score=15.10 Aligned_cols=38 Identities=18% Similarity=0.256 Sum_probs=27.1 Q ss_pred EEECCCCEEEECHHHHHHHHHHHC--CCEEEEEEECCCCC Q ss_conf 854046358932999999999731--96899995568989 Q 537021.9.peg.8 15 QSRGTIDHVPVYIREIVQRCLELS--ATSIILVHNHPSGN 52 (97) Q Consensus 15 i~~Gt~~~~~v~~r~i~~~al~~~--A~~vilaHNHPsG~ 52 (97) ++-|+-|....-=..+++.|.... -..++...+||... T Consensus 2 i~~G~FDp~H~GH~~li~~a~~~~~~~~~v~~~~~~~~~~ 41 (141) T cd02039 2 LAGGSFDPVHLGHLALIERAKELALDELIVGVFSDPPKKV 41 (141) T ss_pred EECCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCCCCCC T ss_conf 8944648767999999999999789961799748984335 No 54 >pfam06574 FAD_syn FAD synthetase. This family corresponds to the N terminal domain of the bifunctional enzyme riboflavin kinase / FAD synthetase. These enzymes have both ATP:riboflavin 5'-phospho transferase and ATP:FMN-adenylyltransferase activity. They catalyse the 5'-phosphorylation of riboflavin to FMN and the adenylylation of FMN to FAD. This domain is thought to have the flavin mononucleotide (FMN) adenylyltransferase activity. Probab=24.87 E-value=64 Score=15.04 Aligned_cols=65 Identities=20% Similarity=0.293 Sum_probs=42.8 Q ss_pred EEEEECCCCEEEECHHHHHHHHHHHC-----CCEEEEEEECCCCCCCCCHHHHHHH--HHHHHHHHHCCCEE Q ss_conf 99854046358932999999999731-----9689999556898988147789999--99999997649889 Q 537021.9.peg.8 13 EVQSRGTIDHVPVYIREIVQRCLELS-----ATSIILVHNHPSGNPNPSDADINMT--QNIITTLNPLNIIV 77 (97) Q Consensus 13 e~i~~Gt~~~~~v~~r~i~~~al~~~-----A~~vilaHNHPsG~~~PS~~Di~~T--~~l~~a~~~l~i~l 77 (97) ..++.|.-|.+..-=+.|++.+.... .+.++...+||.--..|..+-..+| ++=.+.++.+||.. T Consensus 8 svvtiG~FDGvH~GHq~li~~~~~~a~~~~~~~~viTF~phP~~~l~~~~~~~~l~~~~~K~~~l~~~Gid~ 79 (158) T pfam06574 8 SVVTIGNFDGVHLGHQALIAHAKEIARELGLPSVVVTFEPHPREVFAPDKAPERLTTLREKIELLAELGVDR 79 (158) T ss_pred EEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEECCCHHHHHCCCCCCHHCCCHHHHHHHHHHCCCCE T ss_conf 199997465315899999999999999829978999964987786087666410157999999999669999 No 55 >PRK05627 bifunctional riboflavin kinase/FMN adenylyltransferase; Reviewed Probab=24.51 E-value=65 Score=15.00 Aligned_cols=68 Identities=19% Similarity=0.262 Sum_probs=45.0 Q ss_pred EEEEEEEECCCCEEEECHHHHHHHHHHHC-----CCEEEEEEECCCCCCCCCHHHHHHH--HHHHHHHHHCCCEE Q ss_conf 34899854046358932999999999731-----9689999556898988147789999--99999997649889 Q 537021.9.peg.8 10 IADEVQSRGTIDHVPVYIREIVQRCLELS-----ATSIILVHNHPSGNPNPSDADINMT--QNIITTLNPLNIIV 77 (97) Q Consensus 10 i~~e~i~~Gt~~~~~v~~r~i~~~al~~~-----A~~vilaHNHPsG~~~PS~~Di~~T--~~l~~a~~~l~i~l 77 (97) +.-..++.|.-|.+..-=+.+++.+.... .+.++...+||.--..|..+-..+| ++=.+.++.+||.. T Consensus 10 ~~~svvaIG~FDGvH~GHq~li~~~~~~a~~~~~~s~v~TF~phP~~vl~~~~~~~~l~~~~ek~~ll~~~Gid~ 84 (303) T PRK05627 10 LEGCVLTIGNFDGVHRGHQALLARAREIARERGLPSVVMTFEPHPREFFAPDSAPARLTPLRDKAELLAELGVDY 84 (303) T ss_pred CCCCEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCEEEEEECCCHHHHCCCCCCCCCCCCHHHHHHHHHHCCCCE T ss_conf 998389998582346899999999999999849978999956999998199888612499999999999739987 No 56 >PRK13600 putative ribosomal protein L7Ae-like; Provisional Probab=24.02 E-value=67 Score=14.95 Aligned_cols=57 Identities=23% Similarity=0.330 Sum_probs=45.4 Q ss_pred EEEEEECCCCEEEECHHHHHHHHHHHCCCEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCEEE Q ss_conf 8998540463589329999999997319689999556898988147789999999999976498899 Q 537021.9.peg.8 12 DEVQSRGTIDHVPVYIREIVQRCLELSATSIILVHNHPSGNPNPSDADINMTQNIITTLNPLNIIVH 78 (97) Q Consensus 12 ~e~i~~Gt~~~~~v~~r~i~~~al~~~A~~vilaHNHPsG~~~PS~~Di~~T~~l~~a~~~l~i~ll 78 (97) .|.+++=+.....|-.++-++..=+..+..+++|- ++|..+|.++.+.|+.-+|+.. T Consensus 3 ~ekv~~~na~~~VVG~KQTlKAl~~~~v~~viIA~----------Dad~~l~~~V~~~~~~k~VpV~ 59 (84) T PRK13600 3 KEKVARFNKQHFVVGLKETLKALKKDQVTSLIIAE----------DVEVYLMTRVLSQINQKNIPVS 59 (84) T ss_pred HHHHHHHCCCCEEEEEHHHHHHHHCCCEEEEEEEE----------CCHHHHHHHHHHHHHHCCCCEE T ss_conf 77876624624688629989998549703999972----------3428899999999984599869 No 57 >TIGR03029 EpsG chain length determinant protein tyrosine kinase EpsG. The proteins in this family are homologs of the EpsG protein found in Methylobacillus strain 12S and are generally found in operons with other Eps homologs. The protein is believed to function as the protein tyrosine kinase component of the chain length regulator (along with the transmembrane component EpsF). Probab=24.00 E-value=67 Score=14.94 Aligned_cols=38 Identities=18% Similarity=0.225 Sum_probs=29.4 Q ss_pred EECCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEC Q ss_conf 556898988147789999999999976498899799985 Q 537021.9.peg.8 46 HNHPSGNPNPSDADINMTQNIITTLNPLNIIVHDHIIIG 84 (97) Q Consensus 46 HNHPsG~~~PS~~Di~~T~~l~~a~~~l~i~llDHiIv~ 84 (97) +=-|+|...|...++-...++.+..+.+.-++ |.|||- T Consensus 182 ~VLpaG~~ppnP~eLL~s~~~~~ll~~l~~~y-D~IIiD 219 (274) T TIGR03029 182 SVLPAGAIPPNPQELLARPAFTDLLNKVMGDY-DVVIVD 219 (274) T ss_pred EEEECCCCCCCHHHHHHHHHHHHHHHHHHHHC-CEEEEE T ss_conf 89969999989799873589999999998409-999993 No 58 >PRK08637 hypothetical protein; Provisional Probab=23.87 E-value=67 Score=14.93 Aligned_cols=35 Identities=23% Similarity=0.501 Sum_probs=18.9 Q ss_pred ECHHHHHHHHHHHCC--CEEEEEEECCCCCC---CCCHHHHH Q ss_conf 329999999997319--68999955689898---81477899 Q 537021.9.peg.8 25 VYIREIVQRCLELSA--TSIILVHNHPSGNP---NPSDADIN 61 (97) Q Consensus 25 v~~r~i~~~al~~~A--~~vilaHNHPsG~~---~PS~~Di~ 61 (97) ++.. -+..+|...+ ..+++.||-| .|| .||.++++ T Consensus 130 ~~~~-~~~~~l~~~~~~~~vvll~n~p-hNPTG~~~t~ee~~ 169 (386) T PRK08637 130 FTTD-ALVEALQAAYNKDKVILILNFP-NNPTGYTPTHKEVT 169 (386) T ss_pred CCHH-HHHHHHHHCCCCCCEEEEECCC-CCCCCCCCCHHHHH T ss_conf 7819-9999998457788389998789-99888689999999 No 59 >PRK13601 putative ribosomal protein L7Ae-like; Provisional Probab=23.09 E-value=70 Score=14.84 Aligned_cols=50 Identities=10% Similarity=0.117 Sum_probs=41.8 Q ss_pred CCEEEECHHHHHHHHHHHCCCEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEE Q ss_conf 635893299999999973196899995568989881477899999999999764988997 Q 537021.9.peg.8 20 IDHVPVYIREIVQRCLELSATSIILVHNHPSGNPNPSDADINMTQNIITTLNPLNIIVHD 79 (97) Q Consensus 20 ~~~~~v~~r~i~~~al~~~A~~vilaHNHPsG~~~PS~~Di~~T~~l~~a~~~l~i~llD 79 (97) ...-.|-.+.-++..-+..+..+++|.+ +|..+|+.+.++|+.-+|++.. T Consensus 7 ~~kKvVG~KQT~KAl~~g~a~~V~vA~D----------AD~~v~~~v~~lc~e~~VpV~~ 56 (84) T PRK13601 7 PSKRVVGAKQTLKAIMNCNVLQVYIAKD----------AEEHVTKKIKELCEEKSIKIVY 56 (84) T ss_pred CCCCEECHHHHHHHHHCCCCCEEEEEEC----------CCHHHHHHHHHHHHHCCCCEEE T ss_conf 8555763899999997498338999816----------8688999999999973998698 No 60 >cd03426 CoAse Coenzyme A pyrophosphatase (CoAse), a member of the Nudix hydrolase superfamily, functions to catalyze the elimination of oxidized inactive CoA, which can inhibit CoA-utilizing enzymes. The need of CoAses mainly arises under conditions of oxidative stress. CoAse has a conserved Nudix fold and requires a single divalent cation for catalysis. In addition to a signature Nudix motif G[X5]E[X7]REUXEEXGU, where U is Ile, Leu, or Val, CoAse contains an additional motif upstream called the NuCoA motif (LLTXT(SA)X3RX3GX3FPGG) which is postulated to be involved in CoA recognition. CoA plays a central role in lipid metabolism. It is involved in the initial steps of fatty acid sythesis in the cytosol, in the oxidation of fatty acids and the citric acid cycle in the mitochondria, and in the oxidation of long-chain fatty acids in peroxisomes. CoA has the important role of activating fatty acids for further modification into key biological signalling molecules. Probab=22.15 E-value=73 Score=14.73 Aligned_cols=41 Identities=15% Similarity=0.359 Sum_probs=32.6 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEECC-CCEEE Q ss_conf 8989881477899999999999764988997999859-95997 Q 537021.9.peg.8 49 PSGNPNPSDADINMTQNIITTLNPLNIIVHDHIIIGK-DAFVS 90 (97) Q Consensus 49 PsG~~~PS~~Di~~T~~l~~a~~~l~i~llDHiIv~~-~~~~S 90 (97) |.|..+|+++|..- -.++++.+-+|+..-+--++|. +.+++ T Consensus 37 PGG~~e~~D~~~~~-tAlREt~EEiGl~~~~~~~lg~L~~~~t 78 (157) T cd03426 37 PGGKVDPGDEDPVA-TALREAEEEIGLPPDSVEVLGRLPPYYT 78 (157) T ss_pred CCCCCCCCCCCHHH-HHHHHHHHHHCCCCCCEEEEEECCCCCC T ss_conf 99602766533899-9999999986899113389863487132 No 61 >cd04821 PA_M28_1_2 PA_M28_1_2: Protease-associated (PA) domain, peptidase family M28, subfamily-1, subgroup 2. A subgroup of PA-domain containing proteins belonging to the peptidase family M28. Family M28 contains aminopeptidases and carboxypeptidases, and has co-catalytic zinc ions. The PA domain is an insert domain in a diverse fraction of proteases. The significance of the PA domain to many of the proteins in which it is inserted is undetermined. It may be a protein-protein interaction domain. At peptidase active sites, the PA domain may participate in substrate binding and/or promoting conformational changes, which influence the stability and accessibility of the site to substrate. Proteins into which the PA domain is inserted include the following members of the peptidase family M28: i) prostate-specific membrane antigen (PSMA), ii) yeast aminopeptidase Y, and ii) human TfR (transferrin receptor)1 and human TfR2. The proteins listed above belong to other subgroups; relatively litt Probab=20.82 E-value=77 Score=14.56 Aligned_cols=21 Identities=14% Similarity=0.248 Sum_probs=16.8 Q ss_pred HHHHHHHHCCCEEEEEEECCC Q ss_conf 999999731968999955689 Q 537021.9.peg.8 30 IVQRCLELSATSIILVHNHPS 50 (97) Q Consensus 30 i~~~al~~~A~~vilaHNHPs 50 (97) =++.|.+.+|.++|+.|+-+. T Consensus 84 K~eeAar~GA~g~liIHet~~ 104 (157) T cd04821 84 KYEEAARQGAAGALIVHETEP 104 (157) T ss_pred HHHHHHHCCCCEEEEECCCCC T ss_conf 899998778848999637876 Done!