Query 537021.9.peg.754_1 Match_columns 184 No_of_seqs 84 out of 86 Neff 3.6 Searched_HMMs 39220 Date Tue May 24 23:10:12 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i peg_754.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK09074 consensus 99.8 8.7E-19 2.2E-23 137.4 14.5 151 11-180 998-1149(1149) 2 PRK06637 consensus 99.7 4.6E-15 1.2E-19 114.4 13.5 143 10-169 1033-1178(1182) 3 PRK05673 dnaE DNA polymerase I 99.3 5E-11 1.3E-15 89.4 13.7 146 10-178 989-1141(1143) 4 PRK08744 consensus 99.3 9.2E-11 2.4E-15 87.7 14.8 140 19-178 1039-1187(1195) 5 PRK08957 consensus 99.2 2E-09 5E-14 79.5 13.6 146 11-178 1001-1155(1159) 6 PRK07892 consensus 99.1 1.1E-08 2.8E-13 74.9 14.2 140 11-173 1028-1174(1180) 7 PRK07373 DNA polymerase III su 99.0 2E-08 5E-13 73.3 12.5 140 12-172 290-435 (447) 8 PRK06658 consensus 98.9 1.6E-07 4E-12 67.8 13.8 147 11-177 989-1142(1145) 9 PRK07012 consensus 98.9 1.4E-07 3.6E-12 68.0 13.6 151 11-178 994-1170(1173) 10 PRK06826 dnaE DNA polymerase I 98.7 8.7E-07 2.2E-11 63.1 12.1 142 11-178 1007-1155(1157) 11 PRK08512 consensus 98.5 3.2E-06 8.1E-11 59.6 11.4 137 10-172 1018-1183(1185) 12 PRK06920 dnaE DNA polymerase I 98.5 6.8E-06 1.7E-10 57.6 13.0 139 15-178 958-1105(1107) 13 PRK07374 dnaE DNA polymerase I 98.5 3.2E-06 8.1E-11 59.6 11.2 153 11-181 1012-1169(1171) 14 COG0587 DnaE DNA polymerase II 98.3 5E-06 1.3E-10 58.4 7.7 148 11-177 988-1136(1139) 15 PRK06326 consensus 98.1 8.6E-05 2.2E-09 50.8 11.0 135 16-171 1045-1228(1240) 16 cd04485 DnaE_OBF DnaE_OBF: A s 97.9 4.6E-05 1.2E-09 52.5 6.5 77 10-102 7-83 (84) 17 PRK07279 dnaE DNA polymerase I 97.7 0.00077 2E-08 44.9 10.7 128 15-173 899-1028(1033) 18 PRK05672 dnaE2 error-prone DNA 95.7 0.038 9.6E-07 34.4 6.2 61 39-102 981-1041(1050) 19 PRK07135 dnaE DNA polymerase I 92.5 0.34 8.6E-06 28.5 5.3 38 39-78 934-971 (987) 20 TIGR00021 rpiA ribose 5-phosph 78.0 3.2 8.1E-05 22.5 3.7 35 143-177 181-217 (236) 21 PRK00702 ribose-5-phosphate is 76.9 6.9 0.00018 20.4 5.2 36 142-177 166-201 (222) 22 TIGR02945 SUF_assoc FeS assemb 73.2 5.2 0.00013 21.2 3.8 44 135-178 22-71 (96) 23 COG0120 RpiA Ribose 5-phosphat 71.7 9.3 0.00024 19.6 5.0 106 67-177 74-203 (227) 24 cd04877 ACT_TyrR N-terminal AC 71.3 5.1 0.00013 21.2 3.4 68 108-181 2-71 (74) 25 pfam06026 Rib_5-P_isom_A Ribos 70.9 9.7 0.00025 19.5 4.8 36 142-177 118-153 (172) 26 PRK09977 putative Mg(2+) trans 70.0 10 0.00026 19.4 6.7 76 95-178 135-213 (215) 27 PRK10820 DNA-binding transcrip 65.3 8.8 0.00022 19.8 3.6 21 53-74 213-233 (513) 28 cd01398 RPI_A RPI_A: Ribose 5- 64.3 13 0.00034 18.7 5.4 107 66-177 69-199 (213) 29 cd04901 ACT_3PGDH C-terminal A 54.8 5.7 0.00015 20.9 1.1 56 112-178 8-67 (69) 30 COG2177 FtsX Cell division pro 53.7 19 0.00048 17.7 3.6 59 110-179 92-151 (297) 31 PRK13978 ribose-5-phosphate is 53.1 20 0.00052 17.5 5.5 108 66-177 75-205 (228) 32 TIGR00006 TIGR00006 S-adenosyl 51.5 6.5 0.00017 20.6 1.0 45 100-148 212-257 (323) 33 pfam01883 DUF59 Domain of unkn 51.3 19 0.0005 17.6 3.4 43 136-178 25-73 (76) 34 PRK07218 replication factor A; 46.9 25 0.00064 16.9 3.4 23 134-156 327-349 (430) 35 pfam01336 tRNA_anti OB-fold nu 46.7 26 0.00066 16.9 4.4 54 39-95 20-74 (75) 36 TIGR00383 corA magnesium and c 44.6 4.5 0.00012 21.5 -0.7 76 40-128 51-135 (339) 37 TIGR02803 ExbD_1 TonB system t 40.5 18 0.00045 17.9 1.8 76 65-141 44-119 (122) 38 pfam04954 SIP Siderophore-inte 39.8 33 0.00084 16.2 4.7 18 109-126 5-22 (118) 39 TIGR01213 TIGR01213 conserved 38.0 20 0.0005 17.6 1.7 45 63-115 275-319 (450) 40 TIGR02852 spore_dpaB dipicolin 37.4 21 0.00052 17.5 1.7 35 53-88 102-136 (188) 41 PRK05611 rpmD 50S ribosomal pr 36.9 15 0.00039 18.3 1.0 21 159-179 38-58 (58) 42 cd04903 ACT_LSD C-terminal ACT 36.1 38 0.00096 15.8 5.1 58 112-178 8-69 (71) 43 pfam05399 EVI2A Ectropic viral 34.3 32 0.00082 16.3 2.3 57 12-68 165-222 (227) 44 cd04902 ACT_3PGDH-xct C-termin 33.4 42 0.0011 15.6 2.9 60 112-180 8-71 (73) 45 pfam04530 Viral_Beta_CD Viral 32.8 39 0.001 15.7 2.6 23 149-171 89-111 (122) 46 PRK10260 hypothetical protein; 32.4 28 0.00072 16.6 1.8 83 1-91 108-207 (306) 47 PRK11750 gltB glutamate syntha 31.0 30 0.00076 16.5 1.7 134 33-175 799-966 (1483) 48 COG3040 Blc Bacterial lipocali 30.8 27 0.00069 16.7 1.5 15 20-34 111-125 (174) 49 COG0275 Predicted S-adenosylme 29.0 49 0.0013 15.1 3.5 44 100-146 205-248 (314) 50 cd04492 YhaM_OBF_like YhaM_OBF 28.9 50 0.0013 15.1 3.6 68 14-98 11-78 (83) 51 PRK11790 D-3-phosphoglycerate 28.8 32 0.00081 16.3 1.5 64 106-180 338-408 (409) 52 cd04879 ACT_3PGDH-like ACT_3PG 28.3 51 0.0013 15.0 5.5 59 110-179 6-70 (71) 53 PRK08305 spoVFB dipicolinate s 28.2 32 0.00081 16.3 1.5 18 64-81 117-134 (195) 54 PRK00910 ribB 3,4-dihydroxy-2- 27.1 54 0.0014 14.9 3.4 25 62-88 21-45 (218) 55 TIGR02720 pyruv_oxi_spxB pyruv 27.0 54 0.0014 14.9 2.5 21 52-73 509-529 (577) 56 pfam03927 NapD NapD protein. U 26.7 54 0.0014 14.9 4.2 69 106-183 6-74 (78) 57 cd02953 DsbDgamma DsbD gamma f 22.9 64 0.0016 14.4 4.7 94 63-168 6-103 (104) 58 TIGR02034 CysN sulfate adenyly 22.1 30 0.00076 16.5 0.4 57 44-106 304-366 (411) 59 KOG2364 consensus 21.7 68 0.0017 14.3 2.2 13 63-75 266-278 (433) 60 TIGR02398 gluc_glyc_Psyn gluco 21.3 29 0.00073 16.6 0.1 53 46-124 304-356 (495) 61 PRK10190 hypothetical protein; 21.0 50 0.0013 15.1 1.3 104 1-112 105-231 (310) 62 TIGR00833 actII Transport prot 20.3 57 0.0015 14.7 1.5 60 113-183 419-479 (992) 63 PRK03353 ribB 3,4-dihydroxy-2- 20.2 73 0.0019 14.1 4.1 24 63-88 21-44 (217) No 1 >PRK09074 consensus Probab=99.81 E-value=8.7e-19 Score=137.42 Aligned_cols=151 Identities=28% Similarity=0.443 Sum_probs=135.0 Q ss_pred CCHHCCCCCCCEEEEEEEHHHHHHHHHHHHHHCCCCCEEEEEHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCEEE Q ss_conf 71100286676148985013443332355531027875221058999998787524998289998410016777642698 Q 537021.9.peg.7 11 NRKGKKKPDGNHDFLFLEDCYDHAIEMKLQEAEEDPNFLSLQEQQQLEEEKPRLLREKKPYIIVVSQEDKLSLQRDSIRL 90 (184) Q Consensus 11 ~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~le~~~~~LLe~gkpvii~VsAe~rle~eg~~LRi 90 (184) ++|.++...|+. ++| +...|+||...++-|.++.+.-+ .+|++|.+++|....+.| .++.++++ T Consensus 998 ~~~~~~tk~G~~-maf------------~tleD~tg~~e~vvfp~~~~~~~-~~l~~~~~~~v~g~v~~~--~~~~~~~~ 1061 (1149) T PRK09074 998 SRQERKTKTGNK-MGI------------VQLSDPTGQFEAVLFSEGLAQYR-DLLEPGKAVLITVDAELR--GEGVRLRI 1061 (1149) T ss_pred EEEECCCCCCCE-EEE------------EEEEECCCCEEEEECHHHHHHHH-HHHCCCCEEEEEEEEEEC--CCCEEEEE T ss_conf 988535899997-899------------99997998589999889999999-986279989999999954--99369999 Q ss_pred EECCCHHHHHHHHCCCEEEEEECCCHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCEEEEEECC-CCCCCHHHHHHHH Q ss_conf 650415899973201109997380157899999611221037888713999974898547999668-9707898999874 Q 537021.9.peg.7 91 LWVESLEKKSMEMLDSLRVYLQDRSPLREIHKYFENRQTQNYDKQGKIGLIFLRSQGKIEIEITLD-HYPISPEIAFDLE 169 (184) Q Consensus 91 ~~vepLD~aAa~~~kgLRIfVrD~~pL~sI~s~Le~~~~~~~~G~G~V~Lvli~~~~~~EVEI~Lp-~f~VSPqIagALK 169 (184) ..++||+++.++..+.++|++++....+.+...|+.-. .++.|.+.+++...+...+|++.|+ +|+|||++.++|| T Consensus 1062 ~~~~~l~~~~~~~~~~~~i~~~~~~~~~~~~~~l~~~~---~~~~G~~~v~l~~~~~~~~~~~~l~~~~~v~p~~~~~l~ 1138 (1149) T PRK09074 1062 QTVEPLDDAAARMQKGLRVFLRDADALPSIAARLNRAQ---ARGDGEVSLILILDDGQREVEIELPGRYRVSPQIAGALK 1138 (1149) T ss_pred EEEECHHHHHHHCCCCEEEEEECCCCCHHHHHHHHHHH---HCCCCCEEEEEEECCCCEEEEEECCCCCEECHHHHHHHH T ss_conf 87034999986017846999856435889999999997---169995689999858975899987996101999999998 Q ss_pred HCCCHHEHHHH Q ss_conf 06462111030 Q 537021.9.peg.7 170 HITGITKVVQV 180 (184) Q Consensus 170 sIpGVl~VeEi 180 (184) ++.||.+|+|+ T Consensus 1139 ~~~g~~~v~~~ 1149 (1149) T PRK09074 1139 AVPGVVDVEEV 1149 (1149) T ss_pred HCCCCCCEEEC T ss_conf 58791010339 No 2 >PRK06637 consensus Probab=99.65 E-value=4.6e-15 Score=114.36 Aligned_cols=143 Identities=16% Similarity=0.140 Sum_probs=122.7 Q ss_pred CCCHHCCCCCCCEEEEEEEHHHHHHHHHHHHHHCCCCCEEEEEHH-HHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCE Q ss_conf 871100286676148985013443332355531027875221058-9999987875249982899984100167776426 Q 537021.9.peg.7 10 NNRKGKKKPDGNHDFLFLEDCYDHAIEMKLQEAEEDPNFLSLQEQ-QQLEEEKPRLLREKKPYIIVVSQEDKLSLQRDSI 88 (184) Q Consensus 10 ~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~le~~~~~LLe~gkpvii~VsAe~rle~eg~~L 88 (184) ..||.+.+..|. |.| +.+.|+||..+++.|. ++++..+ .||++++.+++....+.+ ..++++ T Consensus 1033 ~~~q~~~t~~G~--~~~------------~~leD~tG~~e~~vf~~~~~~~~~-~ll~~~~~~vv~g~~~~~--~g~~~~ 1095 (1182) T PRK06637 1033 QKKDSRMSARGR--FVT------------LVLSDPENIFELSIFSEEVLKDYV-HLLDVKSLVVVNCDIVKD--EGGIKL 1095 (1182) T ss_pred EEEEEECCCCCC--EEE------------EEEEECCCCEEEEEECHHHHHHHH-HHHCCCCEEEEEEEEEEC--CCCEEE T ss_conf 661443278886--899------------999968986899995879999999-985559879999998864--894289 Q ss_pred EEEECCCHHHHHHHHCCCEEEEEECCCHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCEEE-EEECC-CCCCCHHHHH Q ss_conf 986504158999732011099973801578999996112210378887139999748985479-99668-9707898999 Q 537021.9.peg.7 89 RLLWVESLEKKSMEMLDSLRVYLQDRSPLREIHKYFENRQTQNYDKQGKIGLIFLRSQGKIEI-EITLD-HYPISPEIAF 166 (184) Q Consensus 89 Ri~~vepLD~aAa~~~kgLRIfVrD~~pL~sI~s~Le~~~~~~~~G~G~V~Lvli~~~~~~EV-EI~Lp-~f~VSPqIag 166 (184) ++..+++||+++.+++++|+||++|..++.+|+.+|...-.....|..+|.+..+..+....+ ++.|| +|+|+||=.. T Consensus 1096 ~a~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~ 1175 (1182) T PRK06637 1096 TAKSFSSIEDAINNKQFELQLYPQNHEELRQIVTLLAARINNEDQSNAKATIYLQSKDVKNFVAKITLPEKFFLQGQDFE 1175 (1182) T ss_pred EEEECCCHHHHHHHHHCCCEEEECCHHHHHHHHHHHHCCCCCCCCCCCCEEEEEECCCCCCEEEEEECCCCCCCCHHHHH T ss_conf 99753889999999731757995576678999987421356777887644999972441001579845860004788999 Q ss_pred HHH Q ss_conf 874 Q 537021.9.peg.7 167 DLE 169 (184) Q Consensus 167 ALK 169 (184) -|+ T Consensus 1176 ~~~ 1178 (1182) T PRK06637 1176 ILK 1178 (1182) T ss_pred HHH T ss_conf 998 No 3 >PRK05673 dnaE DNA polymerase III subunit alpha; Validated Probab=99.35 E-value=5e-11 Score=89.39 Aligned_cols=146 Identities=18% Similarity=0.206 Sum_probs=121.9 Q ss_pred CCCHHCCCCCCCEEEEEEEHHHHHHHHHHHHHHCCCCCEEEEEHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCEE Q ss_conf 87110028667614898501344333235553102787522105899999878752499828999841001677764269 Q 537021.9.peg.7 10 NNRKGKKKPDGNHDFLFLEDCYDHAIEMKLQEAEEDPNFLSLQEQQQLEEEKPRLLREKKPYIIVVSQEDKLSLQRDSIR 89 (184) Q Consensus 10 ~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~le~~~~~LLe~gkpvii~VsAe~rle~eg~~LR 89 (184) .++|.+++..|+. ++| +...|+||+..++.|.++.++-+ .+|.+|+.++|....+.| .++.++. T Consensus 989 ~~~~~~~TkkG~~-maf------------~tleD~tg~~e~vvfp~~~~~~~-~~l~~~~~~~v~G~v~~~--~~~~~l~ 1052 (1143) T PRK05673 989 VSVRVRVTKRGNK-MAI------------VTLEDRSGRIEVMLFSEALEEYR-DLLEEDRIVVVKGQVSFD--DFGLRLT 1052 (1143) T ss_pred EEEEECCCCCCCE-EEE------------EEEEECCCCEEEEECHHHHHHHH-HHHCCCCEEEEEEEEEEE--CCEEEEE T ss_conf 8978511799898-999------------99998998489999799999999-985269879999999960--9868999 Q ss_pred EEECCCHHHHHHHHCCCEEEEEECCCH----HHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCEEEEEECC-CCCC--CH Q ss_conf 865041589997320110999738015----7899999611221037888713999974898547999668-9707--89 Q 537021.9.peg.7 90 LLWVESLEKKSMEMLDSLRVYLQDRSP----LREIHKYFENRQTQNYDKQGKIGLIFLRSQGKIEIEITLD-HYPI--SP 162 (184) Q Consensus 90 i~~vepLD~aAa~~~kgLRIfVrD~~p----L~sI~s~Le~~~~~~~~G~G~V~Lvli~~~~~~EVEI~Lp-~f~V--SP 162 (184) +..++|||++..+..+.++|++..... +..++..|+. ..|+-+|.+.+..+ ..++.+.|+ +|+| |+ T Consensus 1053 ~~~i~~l~~~~~~~~~~~~i~~~~~~~~~~~~~~l~~~l~~-----~~G~~~v~l~~~~~--~~~~~~~l~~~~~v~~~~ 1125 (1143) T PRK05673 1053 AREVMDLDEAREKYARPLRISLDDRQLTDQLLERLKQVLEP-----GPGTVPVHLYYQDG--DAEAELRLGDRWRVSPSD 1125 (1143) T ss_pred EEECCCHHHHHHCCCCCEEEEEECCCCCHHHHHHHHHHHHH-----CCCCCEEEEEEECC--CEEEEEECCCCCEECCCH T ss_conf 99740489998625871799970354798999999999970-----98998699999769--827999869985277799 Q ss_pred HHHHHHHHCCCHHEHH Q ss_conf 8999874064621110 Q 537021.9.peg.7 163 EIAFDLEHITGITKVV 178 (184) Q Consensus 163 qIagALKsIpGVl~Ve 178 (184) ++..+|+.+-|.-.|. T Consensus 1126 ~l~~~l~~l~G~~nv~ 1141 (1143) T PRK05673 1126 ELLGDLKALLGVECVE 1141 (1143) T ss_pred HHHHHHHHHHCCCCEE T ss_conf 9999999850976589 No 4 >PRK08744 consensus Probab=99.34 E-value=9.2e-11 Score=87.73 Aligned_cols=140 Identities=14% Similarity=0.187 Sum_probs=112.3 Q ss_pred CCCEEEEEEEHHHHHHHHHHHHHHCCCCCEEEEEHHHHHHHHHHHHHCCCCEEEEE--EECCCCCCCCCCCEEEEECCCH Q ss_conf 67614898501344333235553102787522105899999878752499828999--8410016777642698650415 Q 537021.9.peg.7 19 DGNHDFLFLEDCYDHAIEMKLQEAEEDPNFLSLQEQQQLEEEKPRLLREKKPYIIV--VSQEDKLSLQRDSIRLLWVESL 96 (184) Q Consensus 19 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~le~~~~~LLe~gkpvii~--VsAe~rle~eg~~LRi~~vepL 96 (184) .|+. ++| +...|.||...++.|.++.+.-+ .+|+.+..++|. |+.+.. ..|.++++..+.+| T Consensus 1039 ~G~~-maf------------~tleD~tg~~e~~vF~~~y~~~~-~~l~~d~~l~v~G~v~~d~~--~~~~~~~~~~~~~~ 1102 (1195) T PRK08744 1039 KGDS-QVF------------VQLEDGRGRVECSAFSDAMAEFG-HLLTRDRILIVKGGLREDEF--NGGYSLRIRQCWDY 1102 (1195) T ss_pred CCCE-EEE------------EEEEECCCCEEEEECHHHHHHHH-HHHCCCCEEEEEEEEEECCC--CCCEEEEEEEECCH T ss_conf 6987-999------------99996897389998789999999-98614988999999860356--89745898550679 Q ss_pred HHHHHHHCCC--EEEEEECCCHHHHHHHHHHHHHHCCCCCCCCEEEEEE--CCCCCEEEEEECC-CCC--CCHHHHHHHH Q ss_conf 8999732011--0999738015789999961122103788871399997--4898547999668-970--7898999874 Q 537021.9.peg.7 97 EKKSMEMLDS--LRVYLQDRSPLREIHKYFENRQTQNYDKQGKIGLIFL--RSQGKIEIEITLD-HYP--ISPEIAFDLE 169 (184) Q Consensus 97 D~aAa~~~kg--LRIfVrD~~pL~sI~s~Le~~~~~~~~G~G~V~Lvli--~~~~~~EVEI~Lp-~f~--VSPqIagALK 169 (184) +++.++..+. |++.+.+...+..+++.|... ..|.++|.+.++ ...+....++.|+ +|+ +|+++.++|| T Consensus 1103 ~~~~~~~~~~l~~~~~~~~~~~~~~l~~~l~~~----~~g~~~v~~~~~~~~~~~~~~~~l~l~~~~~v~~~~~l~~~l~ 1178 (1195) T PRK08744 1103 EQICADHAQRLSLRLDLREKQAWSRIDALLAKH----RPGKTPLRLDLLLRSPSGGVAGMLDLNGSHSVRIDQQLMDSLR 1178 (1195) T ss_pred HHHHHHHHHEEEEEEECCCCHHHHHHHHHHHHC----CCCCCCEEEEEEEECCCCCEEEEEECCCCCCCCCCHHHHHHHH T ss_conf 999986521159998556715899999999833----8999864899999627887579997388575378999999998 Q ss_pred HCCCHHEHH Q ss_conf 064621110 Q 537021.9.peg.7 170 HITGITKVV 178 (184) Q Consensus 170 sIpGVl~Ve 178 (184) +++|+..|+ T Consensus 1179 ~~~g~~~v~ 1187 (1195) T PRK08744 1179 ADPAVRTLK 1187 (1195) T ss_pred HCCCCCEEE T ss_conf 532855289 No 5 >PRK08957 consensus Probab=99.16 E-value=2e-09 Score=79.50 Aligned_cols=146 Identities=16% Similarity=0.209 Sum_probs=115.5 Q ss_pred CCHHCCCCCCCEEEEEEEHHHHHHHHHHHHHHCCCCCEEEEEHHHHHHHHHHHHHCCCCEEEE--EEECCCCCCCCCCCE Q ss_conf 711002866761489850134433323555310278752210589999987875249982899--984100167776426 Q 537021.9.peg.7 11 NRKGKKKPDGNHDFLFLEDCYDHAIEMKLQEAEEDPNFLSLQEQQQLEEEKPRLLREKKPYII--VVSQEDKLSLQRDSI 88 (184) Q Consensus 11 ~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~le~~~~~LLe~gkpvii--~VsAe~rle~eg~~L 88 (184) ++|.+....|+. ++| +...|+||...++-|.++.+.-+ .+|++|..+++ .|+.+++ .++.++ T Consensus 1001 ~~~~~~tkkg~~-maf------------~~leD~tg~~e~vvfp~~~~~~~-~~l~~~~~~~v~G~v~~~~~--~~~~~~ 1064 (1159) T PRK08957 1001 AARVMVTKRGNR-IGI------------CTLDDRSGRLEVMLFTDALEKYQ-HLLEKDRILIVSGQVSFDDF--SGGLKM 1064 (1159) T ss_pred EEEECCCCCCCE-EEE------------EEEEECCCCEEEEECHHHHHHHH-HHHCCCCEEEEEEEEEEECC--CCCEEE T ss_conf 977557899898-899------------99998998689999789999999-98626987999999986057--996269 Q ss_pred EEEECCCHHHHHHHHCCCEEEEEEC----CCHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCEEEEEECC-CCCCCH- Q ss_conf 9865041589997320110999738----0157899999611221037888713999974898547999668-970789- Q 537021.9.peg.7 89 RLLWVESLEKKSMEMLDSLRVYLQD----RSPLREIHKYFENRQTQNYDKQGKIGLIFLRSQGKIEIEITLD-HYPISP- 162 (184) Q Consensus 89 Ri~~vepLD~aAa~~~kgLRIfVrD----~~pL~sI~s~Le~~~~~~~~G~G~V~Lvli~~~~~~EVEI~Lp-~f~VSP- 162 (184) .+..+.||+++..+..+.++|.+.. ...+..++..|+. ...|.-+|.+.+..++ ....+.|+ +|+|+| T Consensus 1065 ~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~l~~~l~~----~~~G~~~v~l~~~~~~--~~~~~~l~~~~~v~~~ 1138 (1159) T PRK08957 1065 TAREVMDIDEAREKYARGLAISLTDRQIDDQLLNRLRQSLEP----HRSGTIPVHLYYQRAD--ARARLRLGATWRVTPS 1138 (1159) T ss_pred EEEECCCHHHHHHHCCCCEEEEEEHHHCCHHHHHHHHHHHHH----CCCCCCEEEEEEECCC--CEEEEECCCCCEECCC T ss_conf 998657689998703870799960331698999999999984----4899950899996698--2699987998436588 Q ss_pred -HHHHHHHHCCCHHEHH Q ss_conf -8999874064621110 Q 537021.9.peg.7 163 -EIAFDLEHITGITKVV 178 (184) Q Consensus 163 -qIagALKsIpGVl~Ve 178 (184) ++..+|+.+-|.-.|. T Consensus 1139 ~~~~~~l~~l~G~~nV~ 1155 (1159) T PRK08957 1139 DRLLNDLRTLLGSEQVE 1155 (1159) T ss_pred HHHHHHHHHHHCCCCEE T ss_conf 99999999961987189 No 6 >PRK07892 consensus Probab=99.07 E-value=1.1e-08 Score=74.91 Aligned_cols=140 Identities=14% Similarity=0.112 Sum_probs=115.1 Q ss_pred CCHHCCCCCCCEEEEEEEHHHHHHHHHHHHHHCCCCCEEEEEHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCEEE Q ss_conf 71100286676148985013443332355531027875221058999998787524998289998410016777642698 Q 537021.9.peg.7 11 NRKGKKKPDGNHDFLFLEDCYDHAIEMKLQEAEEDPNFLSLQEQQQLEEEKPRLLREKKPYIIVVSQEDKLSLQRDSIRL 90 (184) Q Consensus 11 ~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~le~~~~~LLe~gkpvii~VsAe~rle~eg~~LRi 90 (184) ++|.+....|+. |+| +...|+||...++.|.++.++-+ .+|.+|..++|.-.-+.| .+++++.+ T Consensus 1028 ~~~~~~TkkG~~-maf------------~tleD~tG~~e~vvFp~~y~~~~-~~l~~~~~l~v~G~v~~~--~~~~~li~ 1091 (1180) T PRK07892 1028 SVQRRVNKNGNP-WAI------------ATLEDLDGSIEVLFFPQTYSVVG-AQLVEDAVVLVKGRVSRR--DDRISLFA 1091 (1180) T ss_pred EEEECCCCCCCE-EEE------------EEEEECCCCEEEEECHHHHHHHH-HHHCCCCEEEEEEEEEEE--CCEEEEEE T ss_conf 888541789998-999------------99997998689998689999999-986359879999999988--99789999 Q ss_pred EECCCHHHHHHHHCCCEEEEEEC----CCHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCEEEEEECC-CCCC--CHH Q ss_conf 65041589997320110999738----0157899999611221037888713999974898547999668-9707--898 Q 537021.9.peg.7 91 LWVESLEKKSMEMLDSLRVYLQD----RSPLREIHKYFENRQTQNYDKQGKIGLIFLRSQGKIEIEITLD-HYPI--SPE 163 (184) Q Consensus 91 ~~vepLD~aAa~~~kgLRIfVrD----~~pL~sI~s~Le~~~~~~~~G~G~V~Lvli~~~~~~EVEI~Lp-~f~V--SPq 163 (184) ..+.+|+.+.++..+.+.|.+.. ..-+..++..|+. ..|+.+|.+.+..++ ....+.|+ +|+| |++ T Consensus 1092 ~~i~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~l~~~l~~-----~~G~~~v~~~~~~~~--~~~~~~l~~~~~v~~~~~ 1164 (1180) T PRK07892 1092 NDLTVPDLSAAQGGGPLVITLPTRRCTPPKVTALKQVLAR-----HPGTSEVHLRLIGGD--RTTVLRLDDSLRVTPSPA 1164 (1180) T ss_pred EEEECHHHHHHCCCCCEEEEEEHHHCCHHHHHHHHHHHHH-----CCCCCEEEEEEECCC--EEEEEECCCCCEECCCHH T ss_conf 9722545565434773899960320799999999999983-----989975999998799--069998899850677999 Q ss_pred HHHHHHHCCC Q ss_conf 9998740646 Q 537021.9.peg.7 164 IAFDLEHITG 173 (184) Q Consensus 164 IagALKsIpG 173 (184) +..+|+.+-| T Consensus 1165 ~~~~l~~l~G 1174 (1180) T PRK07892 1165 LMGDLKALLG 1174 (1180) T ss_pred HHHHHHHHHC T ss_conf 9999997328 No 7 >PRK07373 DNA polymerase III subunit alpha; Reviewed Probab=98.98 E-value=2e-08 Score=73.32 Aligned_cols=140 Identities=14% Similarity=0.118 Sum_probs=103.8 Q ss_pred CHHCCCCCCCEEEEEEEHHHHHHHHHHHHHHCCCCCEEEEEHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCEEEE Q ss_conf 11002866761489850134433323555310278752210589999987875249982899984100167776426986 Q 537021.9.peg.7 12 RKGKKKPDGNHDFLFLEDCYDHAIEMKLQEAEEDPNFLSLQEQQQLEEEKPRLLREKKPYIIVVSQEDKLSLQRDSIRLL 91 (184) Q Consensus 12 ~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~le~~~~~LLe~gkpvii~VsAe~rle~eg~~LRi~ 91 (184) .+.++.-+|+. ++| ++..|.||..+++-|.+++++-+ .+|+.|..++|....+.| .++.++++. T Consensus 290 vr~~~TKkG~~-MAf------------vtLeD~tG~iEvvvFpe~ye~~~-~~L~~d~ivlv~Gkv~~r--d~~~~liv~ 353 (447) T PRK07373 290 VKKIVTKKGDP-MAF------------LQLEDLSGQSEAVVFPKSYERIQ-ELLQEDSRLIIWGKVDRR--DDQVQLIVE 353 (447) T ss_pred EEEEECCCCCE-EEE------------EEEEECCCCEEEEEEHHHHHHHH-HHHCCCCEEEEEEEEEEE--CCCEEEEEE T ss_conf 99987048965-999------------99997897489998089999998-873379989999999987--893789980 Q ss_pred ECCCHHHHHHHHCCCEEEEEE---CCCHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCEEEEEECC-CCCCC--HHHH Q ss_conf 504158999732011099973---80157899999611221037888713999974898547999668-97078--9899 Q 537021.9.peg.7 92 WVESLEKKSMEMLDSLRVYLQ---DRSPLREIHKYFENRQTQNYDKQGKIGLIFLRSQGKIEIEITLD-HYPIS--PEIA 165 (184) Q Consensus 92 ~vepLD~aAa~~~kgLRIfVr---D~~pL~sI~s~Le~~~~~~~~G~G~V~Lvli~~~~~~EVEI~Lp-~f~VS--PqIa 165 (184) .++|||++-. ..+++... +..-+..+++.|+.......+|.-||.+... +......+.|+ +|+|+ ..+. T Consensus 354 ~i~~l~~~~~---~~i~l~~~~~~~~~~~~~L~~iL~~~~g~~~~G~~PV~l~~~--~~~~~~~l~Lg~~~~V~p~~~ll 428 (447) T PRK07373 354 DAEPIEEVKM---VMVELSPQQAGNIQSQQRLKEILQEQAGDKEKAKVPVIAIIG--DGQDRQFVRLGQQFWVEDDRSAV 428 (447) T ss_pred ECCCHHHHHH---HHEECCHHHCCCHHHHHHHHHHHHHCCCCCCCCCCCEEEEEE--CCCCEEEEECCCCCCCCCCHHHH T ss_conf 0355888866---661468767158789999999998567974578643899998--79827999849867178629999 Q ss_pred HHHHHCC Q ss_conf 9874064 Q 537021.9.peg.7 166 FDLEHIT 172 (184) Q Consensus 166 gALKsIp 172 (184) .+||+.. T Consensus 429 ~~Lr~~~ 435 (447) T PRK07373 429 QALQNAG 435 (447) T ss_pred HHHHHCC T ss_conf 9998589 No 8 >PRK06658 consensus Probab=98.87 E-value=1.6e-07 Score=67.75 Aligned_cols=147 Identities=10% Similarity=0.067 Sum_probs=115.4 Q ss_pred CCHHCCCCCCCEEEEEEEHHHHHHHHHHHHHHCCCCCEEEEEHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCEEE Q ss_conf 71100286676148985013443332355531027875221058999998787524998289998410016777642698 Q 537021.9.peg.7 11 NRKGKKKPDGNHDFLFLEDCYDHAIEMKLQEAEEDPNFLSLQEQQQLEEEKPRLLREKKPYIIVVSQEDKLSLQRDSIRL 90 (184) Q Consensus 11 ~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~le~~~~~LLe~gkpvii~VsAe~rle~eg~~LRi 90 (184) ++|.+....|+. ++| +...|+||...++.|.++.++-+ .+|..|+.++|.-.-+- +.++.++.+ T Consensus 989 ~~~~~~tk~g~~-maf------------~~leD~tg~~e~~~fp~~~~~~~-~~l~~~~~~~v~g~~~~--~~~~~~~~~ 1052 (1145) T PRK06658 989 SVKVIQTKRGAK-MAF------------GVFEDLKGAIDIVVFPESWERHR-AFLLPETVIGVVGRLTF--NREKFSFLV 1052 (1145) T ss_pred EEEECCCCCCCE-EEE------------EEEEECCCCEEEEECHHHHHHHH-HHHCCCCEEEEEEEEEE--ECCEEEEEE T ss_conf 867435799997-899------------99996898689999789999999-98534998999999995--188368998 Q ss_pred EECCCHHHHHHHHCCCEEEEEEC----CCHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCEEEEEECC-CCCC--CHH Q ss_conf 65041589997320110999738----0157899999611221037888713999974898547999668-9707--898 Q 537021.9.peg.7 91 LWVESLEKKSMEMLDSLRVYLQD----RSPLREIHKYFENRQTQNYDKQGKIGLIFLRSQGKIEIEITLD-HYPI--SPE 163 (184) Q Consensus 91 ~~vepLD~aAa~~~kgLRIfVrD----~~pL~sI~s~Le~~~~~~~~G~G~V~Lvli~~~~~~EVEI~Lp-~f~V--SPq 163 (184) ..|.+|++...+..+.+.|.+.. ..-+..++..|.... +..|.-+|.+.+.. ......+.|+ .|+| |++ T Consensus 1053 ~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~l~~~l~~~~--~~~g~~~v~~~~~~--~~~~~~~~l~~~~~v~~~~~ 1128 (1145) T PRK06658 1053 DKVVNIERLSEDKVKEVHIELSNKLNDLQLLNSLKESLSNFE--DNSGFSEVYFHLRE--NDKDYKLKMNSILNFVPDED 1128 (1145) T ss_pred EECCCHHHHHHHHCCCEEEEECCCCCCHHHHHHHHHHHHHCC--CCCCCCEEEEEEEC--CCEEEEEECCCCEEECCCHH T ss_conf 653549999873068389995514389999999999998665--57899559999966--98079998899726767899 Q ss_pred HHHHHHHCCCHHEH Q ss_conf 99987406462111 Q 537021.9.peg.7 164 IAFDLEHITGITKV 177 (184) Q Consensus 164 IagALKsIpGVl~V 177 (184) ..+.||.+-|--.| T Consensus 1129 ~~~~l~~~~G~env 1142 (1145) T PRK06658 1129 FLDKLRRCVIVEDV 1142 (1145) T ss_pred HHHHHHHHHCHHCE T ss_conf 99999987480026 No 9 >PRK07012 consensus Probab=98.86 E-value=1.4e-07 Score=67.98 Aligned_cols=151 Identities=11% Similarity=0.157 Sum_probs=112.5 Q ss_pred CCHHCCCCCCCEEEEEEEHHHHHHHHHHHHHHCCCCCEEEEEHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCEEE Q ss_conf 71100286676148985013443332355531027875221058999998787524998289998410016777642698 Q 537021.9.peg.7 11 NRKGKKKPDGNHDFLFLEDCYDHAIEMKLQEAEEDPNFLSLQEQQQLEEEKPRLLREKKPYIIVVSQEDKLSLQRDSIRL 90 (184) Q Consensus 11 ~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~le~~~~~LLe~gkpvii~VsAe~rle~eg~~LRi 90 (184) ++|.+....|+ ++| +...|+||...++-|.++.++-+ .+|..++.++|.-..+..-...+.++.+ T Consensus 994 ~~~~~~tk~G~--maf------------~tleD~tg~~ev~vfp~~~~~~~-~~l~~~~~l~v~G~v~~d~~~g~~~~~a 1058 (1173) T PRK07012 994 SLRTQMTQRGK--MVI------------ALLDDGTGQCEITVFNEQFEANR-ALFKEDELLIVQGQARNDAFTGGIRFTA 1058 (1173) T ss_pred EEEECCCCCCC--EEE------------EEEEECCCCEEEEECHHHHHHHH-HHHCCCCEEEEEEEEEEECCCCCEEEEE T ss_conf 86878888777--899------------99997998789999789999999-9862198799999999605788658997 Q ss_pred EECCCHHHHHHHHCCCEEEEEECCCHHHHHHHHHHHHH-----------------------HCCCCCCCCEEEEEECCCC Q ss_conf 65041589997320110999738015789999961122-----------------------1037888713999974898 Q 537021.9.peg.7 91 LWVESLEKKSMEMLDSLRVYLQDRSPLREIHKYFENRQ-----------------------TQNYDKQGKIGLIFLRSQG 147 (184) Q Consensus 91 ~~vepLD~aAa~~~kgLRIfVrD~~pL~sI~s~Le~~~-----------------------~~~~~G~G~V~Lvli~~~~ 147 (184) ..+.+|+++..+..+.|+|.+........++..|.... .....|.-+|.+... .. T Consensus 1059 ~~i~~L~~ar~~~~~~l~i~l~~~~~~~~l~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~v~i~~~--~~ 1136 (1173) T PRK07012 1059 ESVMDLERARSRYAQAVRLTMNGNADAAALRRVLEPHVAKPDEGPPAAPAPRGGRDGGRRAQAAIPNGLAVRIVYS--NA 1136 (1173) T ss_pred EECCCHHHHHHHHCCEEEEEECCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCEEEEEEE--CC T ss_conf 3017899999843652699836643078999876554045432331112341000023332036899814999996--79 Q ss_pred CEEEEEECC-CCCCCH--HHHHHHHHCCCHHEHH Q ss_conf 547999668-970789--8999874064621110 Q 537021.9.peg.7 148 KIEIEITLD-HYPISP--EIAFDLEHITGITKVV 178 (184) Q Consensus 148 ~~EVEI~Lp-~f~VSP--qIagALKsIpGVl~Ve 178 (184) ..+.++.|+ +|+|+| ++.++||++-|--.|+ T Consensus 1137 ~~~~~~~l~~~~rV~psd~Ll~~Lk~llG~~~V~ 1170 (1173) T PRK07012 1137 RAQGEMRLGDAWRVKPSDALLAELRAAFGGSVVE 1170 (1173) T ss_pred CEEEEEECCCCCEECCCHHHHHHHHHHHCCCCEE T ss_conf 8279998699763676999999999860977079 No 10 >PRK06826 dnaE DNA polymerase III DnaE; Reviewed Probab=98.65 E-value=8.7e-07 Score=63.13 Aligned_cols=142 Identities=18% Similarity=0.192 Sum_probs=105.2 Q ss_pred CCHHCCCCCCCEEEEEEEHHHHHHHHHHHHHHCCCCCEEEEEHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCEEE Q ss_conf 71100286676148985013443332355531027875221058999998787524998289998410016777642698 Q 537021.9.peg.7 11 NRKGKKKPDGNHDFLFLEDCYDHAIEMKLQEAEEDPNFLSLQEQQQLEEEKPRLLREKKPYIIVVSQEDKLSLQRDSIRL 90 (184) Q Consensus 11 ~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~le~~~~~LLe~gkpvii~VsAe~rle~eg~~LRi 90 (184) ++|.+....|+. ++| +...|+||..+++-|.++.+.-+ .+|.+|.+++|....+.| +++++++.+ T Consensus 1007 ~~k~~~TkkG~~-maf------------~tleD~tG~~e~vvFp~~y~~~~-~~l~~~~~l~v~G~v~~r-~~~~~~li~ 1071 (1157) T PRK06826 1007 EVKRKVTRNNDM-MAF------------ITLEDLYGTIEVIVFPKVYEKYR-SLLNEDEIVLIKGRVSLR-EDEQPKLIC 1071 (1157) T ss_pred EEEECCCCCCCE-EEE------------EEEEECCCCEEEEECHHHHHHHH-HHHCCCCEEEEEEEEEEE-CCCCEEEEE T ss_conf 877531799997-899------------99996898689999889999999-874259889999999960-798458997 Q ss_pred EECCCHHHHHHHHCCCEEEEEECCCH----HHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCEEEEEECC-CCCC--CHH Q ss_conf 65041589997320110999738015----7899999611221037888713999974898547999668-9707--898 Q 537021.9.peg.7 91 LWVESLEKKSMEMLDSLRVYLQDRSP----LREIHKYFENRQTQNYDKQGKIGLIFLRSQGKIEIEITLD-HYPI--SPE 163 (184) Q Consensus 91 ~~vepLD~aAa~~~kgLRIfVrD~~p----L~sI~s~Le~~~~~~~~G~G~V~Lvli~~~~~~EVEI~Lp-~f~V--SPq 163 (184) ..|+||++.... +.+.|.+..... +..+++.|+. -.|.-+|.|.+ .+.. -.+.|+ .|+| |++ T Consensus 1072 ~~v~~l~~~~~~--~~~~i~l~~~~~~~~~~~~Lk~il~~-----~~G~~~V~l~~--~~~~--~~~~L~~~~~V~~~~~ 1140 (1157) T PRK06826 1072 EEIEPLEKINSE--KKLYLRVEDKKDIKLKLKELKEILKK-----HPGNTPVYLYT--EKER--KKFKLDRELWVSLSEE 1140 (1157) T ss_pred EECCCHHHHCCC--CEEEEEECCCCCCHHHHHHHHHHHHH-----CCCCCEEEEEE--CCCC--EEEECCCCCCCCCCHH T ss_conf 513336773326--70799961022788999999999973-----97998599998--6897--7998899874677999 Q ss_pred HHHHHHHCCCHHEHH Q ss_conf 999874064621110 Q 537021.9.peg.7 164 IAFDLEHITGITKVV 178 (184) Q Consensus 164 IagALKsIpGVl~Ve 178 (184) +...|+++-|--.|. T Consensus 1141 l~~~L~~l~G~~~v~ 1155 (1157) T PRK06826 1141 LIDNLRNLFGEENVK 1155 (1157) T ss_pred HHHHHHHHHCCCCEE T ss_conf 999999861975789 No 11 >PRK08512 consensus Probab=98.49 E-value=3.2e-06 Score=59.63 Aligned_cols=137 Identities=18% Similarity=0.202 Sum_probs=104.4 Q ss_pred CCCHHCCCCCCCEEEEEEEHHHHHHHHHHHHHHCCCCCEEEEEHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCEE Q ss_conf 87110028667614898501344333235553102787522105899999878752499828999841001677764269 Q 537021.9.peg.7 10 NNRKGKKKPDGNHDFLFLEDCYDHAIEMKLQEAEEDPNFLSLQEQQQLEEEKPRLLREKKPYIIVVSQEDKLSLQRDSIR 89 (184) Q Consensus 10 ~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~le~~~~~LLe~gkpvii~VsAe~rle~eg~~LR 89 (184) .++|.+.+..|+. |+| +...|+||+..++.|.++.++-+ .++. ..|+++.+..+ .++++++++ T Consensus 1018 ~~~~~~~tk~G~~-~a~------------~~leD~~g~~e~~~F~~~y~~~~-~~~~-~~~~~~~~~~~--~~~~~~~~~ 1080 (1185) T PRK08512 1018 EDIKKKISKKGKK-FGI------------INILDFHGNIELMVFEKNLEELE-ENDL-NKPYAFKVKIS--KEGSFFRLR 1080 (1185) T ss_pred EEEEECCCCCCCE-EEE------------EEEEECCCCEEEEECCHHHHHHH-HHCC-CCCEEEEEEEE--ECCCCEEEE T ss_conf 7867521799898-999------------99984898589998119999998-6432-58669999997--517621563 Q ss_pred EEECCCHHHHHHH------------------------HCCCEEEEEE----CCCHHHHHHHHHHHHHHCCCCCCCCEEEE Q ss_conf 8650415899973------------------------2011099973----80157899999611221037888713999 Q 537021.9.peg.7 90 LLWVESLEKKSME------------------------MLDSLRVYLQ----DRSPLREIHKYFENRQTQNYDKQGKIGLI 141 (184) Q Consensus 90 i~~vepLD~aAa~------------------------~~kgLRIfVr----D~~pL~sI~s~Le~~~~~~~~G~G~V~Lv 141 (184) +..+.+|+++... ..+.+.|.+. ++.-+..|++.|+. -.|+.+|.|. T Consensus 1081 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~l~~~l~~-----h~G~~~v~l~ 1155 (1185) T PRK08512 1081 IKKIMSLEDAKKEKIKTKGYKKDNKQKEENPFTAEPKEFEKLEIVLDLTDLNKEKIYEIYELARN-----HQGKRPLKLK 1155 (1185) T ss_pred HHHHCCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEHHHCCHHHHHHHHHHHHH-----CCCCCEEEEE T ss_conf 35440355442112111001111111113321111145777699963587799999999999985-----8999718999 Q ss_pred EECCCCCEEEEEECC-CCCCCHHHHHHHHHCC Q ss_conf 974898547999668-9707898999874064 Q 537021.9.peg.7 142 FLRSQGKIEIEITLD-HYPISPEIAFDLEHIT 172 (184) Q Consensus 142 li~~~~~~EVEI~Lp-~f~VSPqIagALKsIp 172 (184) +..++. - +.++ .|.||+++.+.+|.+. T Consensus 1156 ~~~~~~--~--~~~~~~~~v~~~~~~~~~~~~ 1183 (1185) T PRK08512 1156 IKSKGE--C--LLYETDFIVSEKIKEEIKGLE 1183 (1185) T ss_pred EECCCE--E--EEEECCCEECHHHHHHHHHHC T ss_conf 954986--8--997435266488998986411 No 12 >PRK06920 dnaE DNA polymerase III DnaE; Reviewed Probab=98.49 E-value=6.8e-06 Score=57.58 Aligned_cols=139 Identities=16% Similarity=0.194 Sum_probs=102.3 Q ss_pred CCCCCCCEEEEEEEHHHHHHHHHHHHHHCCCCCEEEEEHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCEEEEECC Q ss_conf 02866761489850134433323555310278752210589999987875249982899984100167776426986504 Q 537021.9.peg.7 15 KKKPDGNHDFLFLEDCYDHAIEMKLQEAEEDPNFLSLQEQQQLEEEKPRLLREKKPYIIVVSQEDKLSLQRDSIRLLWVE 94 (184) Q Consensus 15 ~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~le~~~~~LLe~gkpvii~VsAe~rle~eg~~LRi~~ve 94 (184) .+..+|+. ++| +...|+||...++-|..+.+.-+ .+|..|..+++.-..+.| .++.++.+..+. T Consensus 958 ~~tk~g~~-maf------------~~leD~tg~~e~vvFp~~~~~~~-~~l~~~~~~~v~G~v~~~--~~~~~~~~~~~~ 1021 (1107) T PRK06920 958 IRTKKGQK-MAF------------ITFCDQNDEMEAVVFPETYIHFS-DKLQEGAIVLVDGTIELR--NHKLQWIVNGLY 1021 (1107) T ss_pred EECCCCCE-EEE------------EEEEECCCCEEEEECHHHHHHHH-HHHCCCCEEEEEEEEEEE--CCCCEEEEEEEE T ss_conf 63688996-999------------99997998689999979999999-985369879999999976--795307988643 Q ss_pred CHHHHHHHH-CCCEEEEEE-----CCCHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCEEEEEECC-CCCCCH--HHH Q ss_conf 158999732-011099973-----80157899999611221037888713999974898547999668-970789--899 Q 537021.9.peg.7 95 SLEKKSMEM-LDSLRVYLQ-----DRSPLREIHKYFENRQTQNYDKQGKIGLIFLRSQGKIEIEITLD-HYPISP--EIA 165 (184) Q Consensus 95 pLD~aAa~~-~kgLRIfVr-----D~~pL~sI~s~Le~~~~~~~~G~G~V~Lvli~~~~~~EVEI~Lp-~f~VSP--qIa 165 (184) |+++..+.. .+..++|++ +..-+..+++.|.. ..|+-+|.|.. .+. .-.+.|+ +|+|+| ++. T Consensus 1022 ~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~l~~~~~~-----~~G~~~v~l~~--~~~--~~~~~l~~~~~v~~~~~~~ 1092 (1107) T PRK06920 1022 PLEEMDAYEEKKDASVYVKLPSQYEKKLLNQVTKILFD-----YSGFAKVLIYY--EKE--HKMVQLSRSLSIHPSEECL 1092 (1107) T ss_pred EHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHHHC-----CCCCCEEEEEE--CCC--CEEEECCCCCEECCCHHHH T ss_conf 06765566644386499981643589999999999970-----98998499998--689--8799879987067799999 Q ss_pred HHHHHCCCHHEHH Q ss_conf 9874064621110 Q 537021.9.peg.7 166 FDLEHITGITKVV 178 (184) Q Consensus 166 gALKsIpGVl~Ve 178 (184) .+|+.+-|--.|. T Consensus 1093 ~~l~~~~G~~nv~ 1105 (1107) T PRK06920 1093 GALREIVGEENVV 1105 (1107) T ss_pred HHHHHHHCCCCEE T ss_conf 9999860976789 No 13 >PRK07374 dnaE DNA polymerase III subunit alpha; Validated Probab=98.48 E-value=3.2e-06 Score=59.64 Aligned_cols=153 Identities=6% Similarity=0.047 Sum_probs=108.9 Q ss_pred CCHHCCCCCCCEEEEEEEHHHHHHHHHHHHHHCCCCCEEEEEHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCEEE Q ss_conf 71100286676148985013443332355531027875221058999998787524998289998410016777642698 Q 537021.9.peg.7 11 NRKGKKKPDGNHDFLFLEDCYDHAIEMKLQEAEEDPNFLSLQEQQQLEEEKPRLLREKKPYIIVVSQEDKLSLQRDSIRL 90 (184) Q Consensus 11 ~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~le~~~~~LLe~gkpvii~VsAe~rle~eg~~LRi 90 (184) ++|......|+. ++| +...|+||...++.|.++.+.-+ .+|..++.++|.-.-+.| ++++.|.+ T Consensus 1012 ~~~~~~Tk~G~~-maf------------~tleD~tg~~evvvFp~~y~~~~-~~l~~~~~l~v~G~v~~~--~~~~~li~ 1075 (1171) T PRK07374 1012 EMRQVTTRKGDR-MAI------------LQLEDLTGSCEAVVFPKTYARLS-DHLMTDTRLLVWASVDRR--DDRVQLII 1075 (1171) T ss_pred EEEECCCCCCCE-EEE------------EEEEECCCCEEEEECHHHHHHHH-HHHCCCCEEEEEEEEEEE--CCEEEEEE T ss_conf 625267789996-899------------99998998689999789999999-985129879999999978--99899995 Q ss_pred EECCCHHHHHHHHCCCEEEEEECCCHHHHHHHHHHHHHHC--CCCCCCCEEEEEECCCCCEEEEEECC-CCCCCH--HHH Q ss_conf 6504158999732011099973801578999996112210--37888713999974898547999668-970789--899 Q 537021.9.peg.7 91 LWVESLEKKSMEMLDSLRVYLQDRSPLREIHKYFENRQTQ--NYDKQGKIGLIFLRSQGKIEIEITLD-HYPISP--EIA 165 (184) Q Consensus 91 ~~vepLD~aAa~~~kgLRIfVrD~~pL~sI~s~Le~~~~~--~~~G~G~V~Lvli~~~~~~EVEI~Lp-~f~VSP--qIa 165 (184) ..+++++++-+-...-..--..|......++..|+..... ..+++=||.+.....+ ...++.|+ .|+|+| ++. T Consensus 1076 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~~~~~~~pv~~~~~~~~--~~~~~~lg~~~~v~~~~~l~ 1153 (1171) T PRK07374 1076 DDCREIDDLRLLLVELDSDQASDIRVQHKLRECLQKHRPERDELGVRIPVVAAVRAGN--SVRYVRLGHQFCVKDAQAAV 1153 (1171) T ss_pred HHHCCHHHHHHHHHHCCHHHCCCHHHHHHHHHHHHHHCCCCCCCCCCCCEEEEEECCC--CEEEEEECCCCEECCHHHHH T ss_conf 4312599888888505753235657899999999862687544688677699997698--36999818865358779999 Q ss_pred HHHHHCCCHHEHHHHH Q ss_conf 9874064621110303 Q 537021.9.peg.7 166 FDLEHITGITKVVQVI 181 (184) Q Consensus 166 gALKsIpGVl~VeEi~ 181 (184) ++||.-.+-..+..++ T Consensus 1154 ~~l~~~~~~~~~~~~~ 1169 (1171) T PRK07374 1154 KALQDKSFTARLSSLL 1169 (1171) T ss_pred HHHHHCCCCEECCCCC T ss_conf 9999656861244544 No 14 >COG0587 DnaE DNA polymerase III, alpha subunit [DNA replication, recombination, and repair] Probab=98.26 E-value=5e-06 Score=58.45 Aligned_cols=148 Identities=20% Similarity=0.225 Sum_probs=103.2 Q ss_pred CCHHCCCCCCCEEEEEEEHHHHHHHHHHHHHHCCCCCEEEEEHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCEEE Q ss_conf 71100286676148985013443332355531027875221058999998787524998289998410016777642698 Q 537021.9.peg.7 11 NRKGKKKPDGNHDFLFLEDCYDHAIEMKLQEAEEDPNFLSLQEQQQLEEEKPRLLREKKPYIIVVSQEDKLSLQRDSIRL 90 (184) Q Consensus 11 ~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~le~~~~~LLe~gkpvii~VsAe~rle~eg~~LRi 90 (184) -|+-+-|+.|+. ++| +...|++|-+.++.|..+.++-+. .+..++.++|.--...+-++.+..+++ T Consensus 988 vr~~~tk~~G~~-~~f------------~tl~D~~g~~e~v~f~~~~~~~~~-~l~~~~~~~v~g~v~~~~~~~~~~~~~ 1053 (1139) T COG0587 988 VRQRPTKAKGNK-MAF------------LTLEDETGILEVVVFPSEYERYRR-LLLEGRLLIVKGKVQRREDGVGHALIL 1053 (1139) T ss_pred EEEEECCCCCCE-EEE------------EEEECCCCCEEEEECHHHHHHHHH-HHCCCCEEEEEEEEEECCCCCCHHHHH T ss_conf 899632678977-999------------998337885799986899999998-741483799999986146664211007 Q ss_pred EECCCHHHHHHHHCCCEEEEEE-CCCHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCEEEEEECCCCCCCHHHHHHHH Q ss_conf 6504158999732011099973-801578999996112210378887139999748985479996689707898999874 Q 537021.9.peg.7 91 LWVESLEKKSMEMLDSLRVYLQ-DRSPLREIHKYFENRQTQNYDKQGKIGLIFLRSQGKIEIEITLDHYPISPEIAFDLE 169 (184) Q Consensus 91 ~~vepLD~aAa~~~kgLRIfVr-D~~pL~sI~s~Le~~~~~~~~G~G~V~Lvli~~~~~~EVEI~Lp~f~VSPqIagALK 169 (184) ..+.+|+....+....+++|++ ++.... ..++.........++|.+.+++....+.....+.|+.|+|+ ++..+|+ T Consensus 1054 ~~~~~l~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~g~~~v~~~~~~~~~~~~~~~~~~~v~-~l~~~l~ 1130 (1139) T COG0587 1054 EDLSPLEEARERVADFLAIYLRLNTSQLD--RLKLLKIKSILRQGKGKTPVILIYQNGDSRNFLRLGELRVS-TLVEALK 1130 (1139) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCCHHHHH--HHHHHHHHHHHCCCCCCCCEEEEECCCCHHHEEECCCEEHH-HHHHHHH T ss_conf 87776888887655431010136733332--55430123332037998537998415650230014514799-9999863 Q ss_pred HCCCHHEH Q ss_conf 06462111 Q 537021.9.peg.7 170 HITGITKV 177 (184) Q Consensus 170 sIpGVl~V 177 (184) +|...| T Consensus 1131 --~~~~~~ 1136 (1139) T COG0587 1131 --DGKGNV 1136 (1139) T ss_pred --CCCCCE T ss_conf --576544 No 15 >PRK06326 consensus Probab=98.08 E-value=8.6e-05 Score=50.77 Aligned_cols=135 Identities=16% Similarity=0.266 Sum_probs=98.9 Q ss_pred CCCCCCEEEEEEEHHHHHHHHHHHHHHCCCCCEEEEEHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCEEEEECCC Q ss_conf 28667614898501344333235553102787522105899999878752499828999841001677764269865041 Q 537021.9.peg.7 16 KKPDGNHDFLFLEDCYDHAIEMKLQEAEEDPNFLSLQEQQQLEEEKPRLLREKKPYIIVVSQEDKLSLQRDSIRLLWVES 95 (184) Q Consensus 16 ~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~le~~~~~LLe~gkpvii~VsAe~rle~eg~~LRi~~vep 95 (184) -++.++..++| ++..|+||+..++.|..+.|+-+ .+|+.|+.+.++..-+.| ++.++|.+++++. T Consensus 1045 ~~~k~~~~maf------------~tleD~tg~~e~vvfp~~y~~~~-~~l~~~~~~~v~g~v~~~--~~~~~l~a~~~~d 1109 (1240) T PRK06326 1045 ISSKEQKKFAL------------LRVSDGVDSYELPIWPDMYAEQQ-DLLEEDRLIYAILVIDKR--SDSLRLSCRWMRD 1109 (1240) T ss_pred EECCCCCEEEE------------EEEEECCCCEEEEECHHHHHHHH-HHHCCCCEEEEEEEEEEC--CCEEEEEEHHHHH T ss_conf 86799986999------------99996898589998749999999-986269889999999833--9868885143110 Q ss_pred HHHHHH---------------H-----------------------------HCCCEEEEEE----CCCHHHHHHHHHHHH Q ss_conf 589997---------------3-----------------------------2011099973----801578999996112 Q 537021.9.peg.7 96 LEKKSM---------------E-----------------------------MLDSLRVYLQ----DRSPLREIHKYFENR 127 (184) Q Consensus 96 LD~aAa---------------~-----------------------------~~kgLRIfVr----D~~pL~sI~s~Le~~ 127 (184) |-..-- + ..+-+.|-++ ++.-+..+++.|+. T Consensus 1110 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~l~~~~~~- 1188 (1240) T PRK06326 1110 LSIVNEEVIYECDEAYDRIKSQKQKMSFTYSSTKKSTKAKSNKMKKNGETSAISPVTISLDLNQLRHSHLCILKKLIRK- 1188 (1240) T ss_pred HHHCCHHHHHHHHHHHHHHCCCHHCCCCHHHHHCCCCCCHHHHHHHHHHHCCCCCEEEEECHHHCCHHHHHHHHHHHHH- T ss_conf 1112377665310133332011011330000000133200002332222124676699956687376899999999984- Q ss_pred HHCCCCCCCCEEEEEECCCCCEEEEEECC-CCCCCHHHHHHHHHC Q ss_conf 21037888713999974898547999668-970789899987406 Q 537021.9.peg.7 128 QTQNYDKQGKIGLIFLRSQGKIEIEITLD-HYPISPEIAFDLEHI 171 (184) Q Consensus 128 ~~~~~~G~G~V~Lvli~~~~~~EVEI~Lp-~f~VSPqIagALKsI 171 (184) -.|.-+|.|++..+. .+-..+.+. .|.|++.+..-++.+ T Consensus 1189 ----h~g~~~~~l~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 1228 (1240) T PRK06326 1189 ----YPGSRTLSLVFTKDN-ERVASISPDADYFVTEDISELLQEI 1228 (1240) T ss_pred ----CCCCCEEEEEEECCC-CEEEEECCCCCEEECCCHHHHHHHH T ss_conf ----899842899994386-3145644665605468659999987 No 16 >cd04485 DnaE_OBF DnaE_OBF: A subfamily of OB folds corresponding to the C-terminal OB-fold nucleic acid binding domain of Thermus aquaticus and Escherichia coli type C replicative DNA polymerase III alpha subunit (DnaE). The DNA polymerase holoenzyme of E. coli contains two copies of this replicative polymerase, each of which copies a different DNA strand. This group also contains Bacillus subtilis DnaE. Replication in B. subtilis and Staphylococcus aureus requires two different type C polymerases, polC and DnaE, both of which are thought to be included in the DNA polymerase holoenzyme. At the B. subtilis replication fork, polC appears to be involved in leading strand synthesis and DnaE in lagging strand synthesis. Probab=97.87 E-value=4.6e-05 Score=52.47 Aligned_cols=77 Identities=13% Similarity=0.153 Sum_probs=65.9 Q ss_pred CCCHHCCCCCCCEEEEEEEHHHHHHHHHHHHHHCCCCCEEEEEHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCEE Q ss_conf 87110028667614898501344333235553102787522105899999878752499828999841001677764269 Q 537021.9.peg.7 10 NNRKGKKKPDGNHDFLFLEDCYDHAIEMKLQEAEEDPNFLSLQEQQQLEEEKPRLLREKKPYIIVVSQEDKLSLQRDSIR 89 (184) Q Consensus 10 ~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~le~~~~~LLe~gkpvii~VsAe~rle~eg~~LR 89 (184) .+.|.+++.+|+. ++| +...|.+|+++++.|+..+++-+ .+|.+|.++++....+.+ .+++++. T Consensus 7 ~~~k~~~tk~G~~-~a~------------~tleD~~g~~e~~~F~~~~~~~~-~~l~~~~~v~i~g~v~~~--~~~~~l~ 70 (84) T cd04485 7 TSVRRRRTKKGKR-MAF------------VTLEDLTGSIEVVVFPETYEKYR-DLLKEDALLLVEGKVERR--DGGLRLI 70 (84) T ss_pred EEEEEEECCCCCE-EEE------------EEEECCCCCEEEEECHHHHHHHH-HHHCCCCEEEEEEEEEEE--CCEEEEE T ss_conf 9989985579998-999------------99943787499999699999988-874469889999999996--9989999 Q ss_pred EEECCCHHHHHHH Q ss_conf 8650415899973 Q 537021.9.peg.7 90 LLWVESLEKKSME 102 (184) Q Consensus 90 i~~vepLD~aAa~ 102 (184) +..+.||+++..+ T Consensus 71 ~~~i~~l~~~~~~ 83 (84) T cd04485 71 AERIEDLEDARER 83 (84) T ss_pred EEEEEEHHHHHHH T ss_conf 9994678998830 No 17 >PRK07279 dnaE DNA polymerase III DnaE; Reviewed Probab=97.72 E-value=0.00077 Score=44.89 Aligned_cols=128 Identities=14% Similarity=0.157 Sum_probs=96.7 Q ss_pred CCCCCCCEEEEEEEHHHHHHHHHHHHHHCCCCCEEEEEHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCEEEEECC Q ss_conf 02866761489850134433323555310278752210589999987875249982899984100167776426986504 Q 537021.9.peg.7 15 KKKPDGNHDFLFLEDCYDHAIEMKLQEAEEDPNFLSLQEQQQLEEEKPRLLREKKPYIIVVSQEDKLSLQRDSIRLLWVE 94 (184) Q Consensus 15 ~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~le~~~~~LLe~gkpvii~VsAe~rle~eg~~LRi~~ve 94 (184) .+..+|+. ++| +...|+||+..++-|.++.+.-+ .+|..|..++|...-+.| .+.+.+.+..++ T Consensus 899 ~~tk~g~~-maf------------~tleD~tg~ie~vvFp~~y~~~~-~~l~~~~~v~v~G~v~~r--~~~~~li~~~i~ 962 (1033) T PRK07279 899 IRTKTGQQ-MAF------------LSVNDTKKKLDVTLFPETYRQYK-DELKEGKFYYLKGKIQER--DGRLQMVLQQIQ 962 (1033) T ss_pred EECCCCCE-EEE------------EEEEECCCCEEEEECHHHHHHHH-HHHCCCCEEEEEEEEEEE--CCEEEEEHHHHH T ss_conf 87798996-999------------99998998689998679999999-985359889999999965--998999876412 Q ss_pred CHHHHHHHHCCCEEEEEECCCHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCEEEEEECCCCC--CCHHHHHHHHHCC Q ss_conf 15899973201109997380157899999611221037888713999974898547999668970--7898999874064 Q 537021.9.peg.7 95 SLEKKSMEMLDSLRVYLQDRSPLREIHKYFENRQTQNYDKQGKIGLIFLRSQGKIEIEITLDHYP--ISPEIAFDLEHIT 172 (184) Q Consensus 95 pLD~aAa~~~kgLRIfVrD~~pL~sI~s~Le~~~~~~~~G~G~V~Lvli~~~~~~EVEI~Lp~f~--VSPqIagALKsIp 172 (184) +. ..+.+.|.+.+..-...++..|.. ..|.-+|.+.+... .++ +.+++|+ .|+++.+.||.+= T Consensus 963 ~~------~~~~~~i~~~~~~~~~~l~~~l~~-----~~G~~~V~~~~~~~---~~~-~~~~~~~v~~~~~l~~~L~~~~ 1027 (1033) T PRK07279 963 EA------ISEKFWILLENHEFDQEISEILGA-----FPGSIPVVIHYQKN---KET-IQSTKIQVEKSEELKEKLRPFV 1027 (1033) T ss_pred HC------CCCCEEEEECCCHHHHHHHHHHHH-----CCCCCCEEEEECCC---CEE-EECCCCEECCCHHHHHHHHHHH T ss_conf 15------168689991640468999999984-----97998189998279---808-8868826895999999999998 Q ss_pred C Q ss_conf 6 Q 537021.9.peg.7 173 G 173 (184) Q Consensus 173 G 173 (184) | T Consensus 1028 ~ 1028 (1033) T PRK07279 1028 L 1028 (1033) T ss_pred H T ss_conf 6 No 18 >PRK05672 dnaE2 error-prone DNA polymerase; Validated Probab=95.70 E-value=0.038 Score=34.41 Aligned_cols=61 Identities=10% Similarity=-0.054 Sum_probs=52.3 Q ss_pred HHHHCCCCCEEEEEHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCEEEEECCCHHHHHHH Q ss_conf 5531027875221058999998787524998289998410016777642698650415899973 Q 537021.9.peg.7 39 LQEAEEDPNFLSLQEQQQLEEEKPRLLREKKPYIIVVSQEDKLSLQRDSIRLLWVESLEKKSME 102 (184) Q Consensus 39 ~~~~~~~~~~~~~~~~~~le~~~~~LLe~gkpvii~VsAe~rle~eg~~LRi~~vepLD~aAa~ 102 (184) +...|+||+..++-|.++.++-+ .+|..|..++|.---+.| .+++++.+.+++||+..-.. T Consensus 981 ~tleD~tG~ievvvFp~~y~~~~-~~l~~~~vllV~G~v~~~--~~~~~lia~~i~~l~~~L~~ 1041 (1050) T PRK05672 981 LTLEDETGMVNVVVWPGLWERQR-REALGARLLLVRGRVQNA--EGVRHLVADRLEDLSPLLGG 1041 (1050) T ss_pred EEEEECCCCEEEEECHHHHHHHH-HHHCCCCEEEEEEEEEEE--CCEEEEEEEEEECHHHHHHH T ss_conf 99986897389999889999999-985229879999999976--99799998340241466623 No 19 >PRK07135 dnaE DNA polymerase III DnaE; Validated Probab=92.46 E-value=0.34 Score=28.54 Aligned_cols=38 Identities=16% Similarity=0.234 Sum_probs=29.5 Q ss_pred HHHHCCCCCEEEEEHHHHHHHHHHHHHCCCCEEEEEEECC Q ss_conf 5531027875221058999998787524998289998410 Q 537021.9.peg.7 39 LQEAEEDPNFLSLQEQQQLEEEKPRLLREKKPYIIVVSQE 78 (184) Q Consensus 39 ~~~~~~~~~~~~~~~~~~le~~~~~LLe~gkpvii~VsAe 78 (184) +...|.||+++++.|++++|+-+. |+.|+.+.++|..- T Consensus 934 ~~l~D~tg~~e~~vf~~~~ekyr~--l~~g~~~~~~v~~~ 971 (987) T PRK07135 934 VIFADDTVEIKIFVNDNNLEKYRL--LKTGKIYEVIIFFK 971 (987) T ss_pred EEEECCCCCEEEEECHHHHHHHHC--CCCCCEEEEEEEEE T ss_conf 999868865899986899998762--64698799999996 No 20 >TIGR00021 rpiA ribose 5-phosphate isomerase A; InterPro: IPR004788 Ribose 5-phosphate isomerase (5.3.1.6 from EC), also known as phosphoriboisomerase, catalyses the conversion of D-ribose 5-phosphate to D-ribulose 5-phosphate in the non-oxidative branch of the pentose phosphate pathway.; GO: 0004751 ribose-5-phosphate isomerase activity, 0009052 pentose-phosphate shunt non-oxidative branch. Probab=78.02 E-value=3.2 Score=22.50 Aligned_cols=35 Identities=17% Similarity=0.373 Sum_probs=29.4 Q ss_pred ECCCCCEEEEEECCC-CCCCHH-HHHHHHHCCCHHEH Q ss_conf 748985479996689-707898-99987406462111 Q 537021.9.peg.7 143 LRSQGKIEIEITLDH-YPISPE-IAFDLEHITGITKV 177 (184) Q Consensus 143 i~~~~~~EVEI~Lp~-f~VSPq-IagALKsIpGVl~V 177 (184) +-|+|..-+++..|+ -.-.|. +..+|++||||++. T Consensus 181 ~TDNGN~IlD~~~~~~~i~dP~~l~~~l~~ipGVVE~ 217 (236) T TIGR00021 181 VTDNGNYILDCHFGKEIIPDPEALEEELKSIPGVVEH 217 (236) T ss_pred EEECCCEEEEEECCCCCCCCHHHHHHHHCCCCCEEEE T ss_conf 9607978999776987578968999996079988850 No 21 >PRK00702 ribose-5-phosphate isomerase A; Provisional Probab=76.89 E-value=6.9 Score=20.41 Aligned_cols=36 Identities=14% Similarity=0.352 Sum_probs=29.5 Q ss_pred EECCCCCEEEEEECCCCCCCHHHHHHHHHCCCHHEH Q ss_conf 974898547999668970789899987406462111 Q 537021.9.peg.7 142 FLRSQGKIEIEITLDHYPISPEIAFDLEHITGITKV 177 (184) Q Consensus 142 li~~~~~~EVEI~Lp~f~VSPqIagALKsIpGVl~V 177 (184) ++.++|..-+++..+...-..++...|+++|||++. T Consensus 166 ~vTDnGN~IlD~~~~~i~dp~~l~~~L~~ipGVVe~ 201 (222) T PRK00702 166 VVTDNGNYILDVHFLRIPDPEALEKELNNIPGVVEN 201 (222) T ss_pred EEEECCCEEEEEECCCCCCHHHHHHHHHCCCCEEEE T ss_conf 285289889992089979999999998579999812 No 22 >TIGR02945 SUF_assoc FeS assembly SUF system protein; InterPro: IPR014291 Iron-sulphur (FeS) clusters are important cofactors for numerous proteins involved in electron transfer, in redox and non-redox catalysis, in gene regulation, and as sensors of oxygen and iron. These functions depend on the various FeS cluster prosthetic groups, the most common being [2Fe-2S] and [4Fe-4S] . FeS cluster assembly is a complex process involving the mobilisation of Fe and S atoms from storage sources, their assembly into [Fe-S] form, their transport to specific cellular locations, and their transfer to recipient apoproteins. So far, three FeS assembly machineries have been identified, which are capable of synthesising all types of [Fe-S] clusters: ISC (iron-sulphur cluster), SUF (sulphur assimilation), and NIF (nitrogen fixation) systems. The ISC system is conserved in eubacteria and eukaryotes (mitochondria), and has broad specificity, targeting general FeS proteins , . It is encoded by the isc operon (iscRSUA-hscBA-fdx-iscX). IscS is a cysteine desulphurase, which obtains S from cysteine (converting it to alanine) and serves as a S donor for FeS cluster assembly. IscU and IscA act as scaffolds to accept S and Fe atoms, assembling clusters and transfering them to recipient apoproteins. HscA is a molecular chaperone and HscB is a co-chaperone. Fdx is a [2Fe-2S]-type ferredoxin. IscR is a transcription factor that regulates expression of the isc operon. IscX (also known as YfhJ) appears to interact with IscS and may function as an Fe donor during cluster assembly . The SUF system is an alternative pathway to the ISC system that operates under iron starvation and oxidative stress. It is found in eubacteria, archaea and eukaryotes (plastids). The SUF system is encoded by the suf operon (sufABCDSE), and the six encoded proteins are arranged into two complexes (SufSE and SufBCD) and one protein (SufA). SufS is a pyridoxal-phosphate (PLP) protein displaying cysteine desulphurase activity. SufE acts as a scaffold protein that accepts S from SufS and donates it to SufA . SufC is an ATPase with an unorthodox ATP-binding cassette (ABC)-like component. No specific functions have been assigned to SufB and SufD. SufA is homologous to IscA , acting as a scaffold protein in which Fe and S atoms are assembled into [FeS] cluster forms, which can then easily be transferred to apoproteins targets. In the NIF system, NifS and NifU are required for the formation of metalloclusters of nitrogenase in Azotobacter vinelandii, and other organisms, as well as in the maturation of other FeS proteins. Nitrogenase catalyses the fixation of nitrogen. It contains a complex cluster, the FeMo cofactor, which contains molybdenum, Fe and S. NifS is a cysteine desulphurase. NifU binds one Fe atom at its N-terminal, assembling an FeS cluster that is transferred to nitrogenase apoproteins . Nif proteins involved in the formation of FeS clusters can also be found in organisms that do not fix nitrogen . This entry is a subset of the larger DUF59 family. Many members of DUF59 are candidate ring hydroxylating complex subunits. However, members of the narrower family defined here are all found as part of the FeS assembly SUF system locus, in a subset of SUF-positive proteobacteria. . Probab=73.19 E-value=5.2 Score=21.15 Aligned_cols=44 Identities=20% Similarity=0.331 Sum_probs=37.3 Q ss_pred CCCEEEEEECCCCCEEEEEEC--CCCCCCH----HHHHHHHHCCCHHEHH Q ss_conf 871399997489854799966--8970789----8999874064621110 Q 537021.9.peg.7 135 QGKIGLIFLRSQGKIEIEITL--DHYPISP----EIAFDLEHITGITKVV 178 (184) Q Consensus 135 ~G~V~Lvli~~~~~~EVEI~L--p~f~VSP----qIagALKsIpGVl~Ve 178 (184) =|-|+=|=+.|++.-.|++.| |++++-. .++.|..+||||-.|+ T Consensus 22 LGLIY~i~i~Dd~~V~I~MTLTaPgCPvAgemP~~V~~aV~~v~GV~~v~ 71 (96) T TIGR02945 22 LGLIYKIEISDDRHVHIDMTLTAPGCPVAGEMPGWVEDAVGQVPGVSSVE 71 (96) T ss_pred CCCEEEEEECCCCEEEEEEEECCCCCCCCCCCHHHHHHHHCCCCCCCEEE T ss_conf 58675887658986788775178568422452268899733388985025 No 23 >COG0120 RpiA Ribose 5-phosphate isomerase [Carbohydrate transport and metabolism] Probab=71.68 E-value=9.3 Score=19.60 Aligned_cols=106 Identities=14% Similarity=0.179 Sum_probs=57.7 Q ss_pred CCCEEEEEEECCCCCCCCCCCEEEEE-CCCHHHHHHHHCCCEEEEEECCC------------------HHHHHHHHHHHH Q ss_conf 99828999841001677764269865-04158999732011099973801------------------578999996112 Q 537021.9.peg.7 67 EKKPYIIVVSQEDKLSLQRDSIRLLW-VESLEKKSMEMLDSLRVYLQDRS------------------PLREIHKYFENR 127 (184) Q Consensus 67 ~gkpvii~VsAe~rle~eg~~LRi~~-vepLD~aAa~~~kgLRIfVrD~~------------------pL~sI~s~Le~~ 127 (184) +-..+-++|..-+-++++..=+-=-. +-.-|+-.|.+.+-+ ||+-|+. +.+.+..+|+.- T Consensus 74 ~~~~lDl~iDGADEvd~~~~lIKGGGgAl~rEKIva~~ak~~-IvIvDesKlV~~LG~fplPVEVip~a~~~v~r~l~~~ 152 (227) T COG0120 74 EVDSLDLAIDGADEVDPNLNLIKGGGGALLREKIVASAAKRF-IVIVDESKLVEVLGKFPLPVEVIPFARSAVLRKLEKL 152 (227) T ss_pred CCCCCCEEEECCCCCCCCCCEECCCHHHHHHHHHHHHHCCEE-EEEEECCCCHHHCCCCCCCEEECHHHHHHHHHHHHHH T ss_conf 367666576464111888876703758988989999744718-9999676015331899868898230799999999980 Q ss_pred -----HHCCCCCCCCEEEEEECCCCCEEEEEECCCCCCCHHHHHHHHHCCCHHEH Q ss_conf -----21037888713999974898547999668970789899987406462111 Q 537021.9.peg.7 128 -----QTQNYDKQGKIGLIFLRSQGKIEIEITLDHYPISPEIAFDLEHITGITKV 177 (184) Q Consensus 128 -----~~~~~~G~G~V~Lvli~~~~~~EVEI~Lp~f~VSPqIagALKsIpGVl~V 177 (184) .+......| -++.++|..-+++..+++.=--.+...|++||||+++ T Consensus 153 g~~~~lR~~~~k~G----pviTDnGN~IlDv~~~~i~dp~~le~~l~~IpGVVE~ 203 (227) T COG0120 153 GGKPTLREGEGKDG----PVITDNGNYILDVHFGRIEDPEELEKELNAIPGVVEN 203 (227) T ss_pred CCCCCCCCCCCCCC----CEEEECCCEEEEECCCCCCCHHHHHHHHHCCCCEEEE T ss_conf 89751235556789----8895289889992079859989999998379888871 No 24 >cd04877 ACT_TyrR N-terminal ACT domain of the TyrR protein. ACT_TyrR: N-terminal ACT domain of the TyrR protein. The TyrR protein of Escherichia coli controls the expression of a group of transcription units (TyrR regulon) whose gene products are involved in the biosynthesis or transport of the aromatic amino acids. Binding to specific DNA sequences known as TyrR boxes, the TyrR protein can either activate or repress transcription at different sigma70 promoters. Its regulatory activity occurs in response to intracellular levels of tyrosine, phenylalanine and tryptophan. The TyrR protein consists of an N-terminal region important for transcription activation with an ATP-independent aromatic amino acid binding site (contained within the ACT domain) and is involved in dimerization; a central region with an ATP binding site, an ATP-dependent aromatic amino acid binding site and is involved in hexamerization; and a helix turn helix DNA binding C-terminal region. In solution, in the absence Probab=71.26 E-value=5.1 Score=21.23 Aligned_cols=68 Identities=16% Similarity=0.175 Sum_probs=39.1 Q ss_pred EEEEECCCHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCEEEEEECC--CCCCCHHHHHHHHHCCCHHEHHHHH Q ss_conf 9997380157899999611221037888713999974898547999668--9707898999874064621110303 Q 537021.9.peg.7 108 RVYLQDRSPLREIHKYFENRQTQNYDKQGKIGLIFLRSQGKIEIEITLD--HYPISPEIAFDLEHITGITKVVQVI 181 (184) Q Consensus 108 RIfVrD~~pL~sI~s~Le~~~~~~~~G~G~V~Lvli~~~~~~EVEI~Lp--~f~VSPqIagALKsIpGVl~VeEi~ 181 (184) |+-+.-.+-+.-.++.|..--..+ +.|.-+--+..+-+=+.-| .|..-.++..+|+.++||.+|..+- T Consensus 2 RLeI~c~DRiGi~~eiL~lL~~~~------IdL~giEi~~~g~IYl~~~~l~f~~~~~Lm~~ir~I~GV~dVktv~ 71 (74) T cd04877 2 RLEITCEDRLGITQEVLDLLVEHN------IDLRGIEIDPKGRIYLNFPTIEFEKLQTLMPEIRRIDGVEDVKTVP 71 (74) T ss_pred EEEEEEECCCHHHHHHHHHHHHCC------CCCEEEEECCCCEEEEECCCCCHHHHHHHHHHHHCCCCCCEEEEEE T ss_conf 389997300014999999999779------9832888726881899778789899999999984789831678861 No 25 >pfam06026 Rib_5-P_isom_A Ribose 5-phosphate isomerase A (phosphoriboisomerase A). This family consists of several ribose 5-phosphate isomerase A or phosphoriboisomerase A (EC:5.3.1.6) from bacteria, eukaryotes and archaea. Probab=70.88 E-value=9.7 Score=19.49 Aligned_cols=36 Identities=14% Similarity=0.353 Sum_probs=28.2 Q ss_pred EECCCCCEEEEEECCCCCCCHHHHHHHHHCCCHHEH Q ss_conf 974898547999668970789899987406462111 Q 537021.9.peg.7 142 FLRSQGKIEIEITLDHYPISPEIAFDLEHITGITKV 177 (184) Q Consensus 142 li~~~~~~EVEI~Lp~f~VSPqIagALKsIpGVl~V 177 (184) ++.++|..-+++..+...--.++...|+++|||++. T Consensus 118 ~iTDnGN~IlD~~~~~i~dp~~l~~~L~~ipGVVe~ 153 (172) T pfam06026 118 VVTDNGNYILDVHFGRIEDPEELEKELKNIPGVVEH 153 (172) T ss_pred EEECCCCEEEEECCCCCCCHHHHHHHHHCCCCEEEE T ss_conf 084699989990289869999999998579999834 No 26 >PRK09977 putative Mg(2+) transport ATPase; Provisional Probab=70.01 E-value=10 Score=19.37 Aligned_cols=76 Identities=14% Similarity=0.180 Sum_probs=45.2 Q ss_pred CHHHHHHHHCCCEEEEEECCCHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCEEEEEEC--C-CCCCCHHHHHHHHHC Q ss_conf 1589997320110999738015789999961122103788871399997489854799966--8-970789899987406 Q 537021.9.peg.7 95 SLEKKSMEMLDSLRVYLQDRSPLREIHKYFENRQTQNYDKQGKIGLIFLRSQGKIEIEITL--D-HYPISPEIAFDLEHI 171 (184) Q Consensus 95 pLD~aAa~~~kgLRIfVrD~~pL~sI~s~Le~~~~~~~~G~G~V~Lvli~~~~~~EVEI~L--p-~f~VSPqIagALKsI 171 (184) +++..-.+-+..+++.+.+ ..+..+-+.++.. +--...+.+..+....++++.+ | +++++ ++...|+.+ T Consensus 135 ~~~~~~~~~~~~~~i~~~~-~~~~~i~~~l~~~------~i~i~~v~i~~~~~~~~v~l~v~~~~~~~~~-dl~~~L~~i 206 (215) T PRK09977 135 QLTFRLMNKNYHLQLTLVN-GNTVSMLDWFKQQ------KIKTDLVSLQENEDHEVVTIDITLHATTSIE-DLLRLLKGM 206 (215) T ss_pred HHHHHHCCCCEEEEEEEEC-CCHHHHHHHHHHC------CCEEEEEEEEECCCCCEEEEEEEECCCCCHH-HHHHHHHCC T ss_conf 9999860565169999935-8277999999984------9758999989828885899999977987499-999998559 Q ss_pred CCHHEHH Q ss_conf 4621110 Q 537021.9.peg.7 172 TGITKVV 178 (184) Q Consensus 172 pGVl~Ve 178 (184) |||..|. T Consensus 207 ~gV~~V~ 213 (215) T PRK09977 207 AGVKGVS 213 (215) T ss_pred CCCEEEE T ss_conf 9935884 No 27 >PRK10820 DNA-binding transcriptional regulator TyrR; Provisional Probab=65.35 E-value=8.8 Score=19.76 Aligned_cols=21 Identities=19% Similarity=0.384 Sum_probs=12.4 Q ss_pred HHHHHHHHHHHHHCCCCEEEEE Q ss_conf 5899999878752499828999 Q 537021.9.peg.7 53 EQQQLEEEKPRLLREKKPYIIV 74 (184) Q Consensus 53 ~~~~le~~~~~LLe~gkpvii~ 74 (184) .+++.+..+ ++-.+..||+|+ T Consensus 213 m~~v~~~a~-r~A~~d~pVLI~ 233 (513) T PRK10820 213 MKHVVEQAR-KLAMLSAPLLIT 233 (513) T ss_pred HHHHHHHHH-HHHCCCCCEEEE T ss_conf 999999999-985989988998 No 28 >cd01398 RPI_A RPI_A: Ribose 5-phosphate isomerase type A (RPI_A) subfamily; RPI catalyzes the reversible conversion of ribose-5-phosphate to ribulose 5-phosphate, the first step of the non-oxidative branch of the pentose phosphate pathway. This reaction leads to the conversion of phosphosugars into glycolysis intermediates, which are precursors for the synthesis of amino acids, vitamins, nucleotides, and cell wall components. In plants, RPI is part of the Calvin cycle as ribulose 5-phosphate is the carbon dioxide receptor in the first dark reaction of photosynthesis. There are two unrelated types of RPIs (A and B), which catalyze the same reaction, at least one type of RPI is present in an organism. RPI_A is more widely distributed than RPI_B in bacteria, eukaryotes, and archaea. Probab=64.29 E-value=13 Score=18.67 Aligned_cols=107 Identities=12% Similarity=0.131 Sum_probs=56.1 Q ss_pred CCCCEEEEEEECCCCCCCCCCCEEEEEC-CCHHHHHHHHCCCEEEEEEC------------------CCHHHHHHHHHHH Q ss_conf 4998289998410016777642698650-41589997320110999738------------------0157899999611 Q 537021.9.peg.7 66 REKKPYIIVVSQEDKLSLQRDSIRLLWV-ESLEKKSMEMLDSLRVYLQD------------------RSPLREIHKYFEN 126 (184) Q Consensus 66 e~gkpvii~VsAe~rle~eg~~LRi~~v-epLD~aAa~~~kgLRIfVrD------------------~~pL~sI~s~Le~ 126 (184) +....+.+++..-+-.+++..-+-=-.. ---|+..|.+.+-. |++-| +.+...+...|+. T Consensus 69 ~~~~~iDi~iDGaDevd~~l~lIKGgGgal~rEKiva~~a~~~-I~i~D~sK~v~~Lg~~plPiEV~p~a~~~v~~~l~~ 147 (213) T cd01398 69 DEVPRLDLAIDGADEVDPDLNLIKGGGGALLREKIVASAAKKF-IVIADESKLVERLGEFPLPVEVVPFAWSYVARELEK 147 (213) T ss_pred HHCCCEEEEECCHHHHCCCCCEEEECCHHHHHHHHHHHHHHEE-EEEECCCCEECCCCCCCEEEEECCCHHHHHHHHHHH T ss_conf 5678300675374664547668970828989899999742209-999660116054799882599820279999999997 Q ss_pred H-----HHCCCCCCCCEEEEEECCCCCEEEEEECCCCCCCHHHHHHHHHCCCHHEH Q ss_conf 2-----21037888713999974898547999668970789899987406462111 Q 537021.9.peg.7 127 R-----QTQNYDKQGKIGLIFLRSQGKIEIEITLDHYPISPEIAFDLEHITGITKV 177 (184) Q Consensus 127 ~-----~~~~~~G~G~V~Lvli~~~~~~EVEI~Lp~f~VSPqIagALKsIpGVl~V 177 (184) . -+.+.+++|+ ++.++|..-+++..+...--.++...|+++|||++. T Consensus 148 lg~~~~lR~~~~k~gp----~iTDnGN~IlD~~~~~i~dp~~l~~~L~~i~GVVe~ 199 (213) T cd01398 148 LGGKPVLREGSGKGGP----VVTDNGNYILDVHFGTIEDPEALEKELKSIPGVVEH 199 (213) T ss_pred CCCCEEEEECCCCCCC----EEECCCCEEEEECCCCCCCHHHHHHHHHCCCCEEEE T ss_conf 6997599326889985----481599889991289968999999998469999834 No 29 >cd04901 ACT_3PGDH C-terminal ACT (regulatory) domain of D-3-Phosphoglycerate Dehydrogenase (3PGDH) found in fungi and bacteria. The C-terminal ACT (regulatory) domain of D-3-Phosphoglycerate Dehydrogenase (3PGDH) found in fungi and bacteria. 3PGDH is an enzyme that belongs to the D-isomer specific, 2-hydroxyacid dehydrogenase family and catalyzes the oxidation of D-3-phosphoglycerate to 3- phosphohydroxypyruvate, which is the first step in the biosynthesis of L-serine, using NAD+ as the oxidizing agent. In Escherichia coli, the SerA 3PGDH is feedback-controlled by the end product L-serine in an allosteric manner. In the homotetrameric enzyme, the interface at adjacent ACT (regulatory) domains couples to create an extended beta-sheet. Each regulatory interface forms two serine-binding sites. The mechanism by which serine transmits inhibition to the active site is postulated to involve the tethering of the regulatory domains together to create a rigid quaternary structure with a solvent- Probab=54.85 E-value=5.7 Score=20.90 Aligned_cols=56 Identities=21% Similarity=0.342 Sum_probs=37.1 Q ss_pred ECCCHHHHHHHHHHHHH----HCCCCCCCCEEEEEECCCCCEEEEEECCCCCCCHHHHHHHHHCCCHHEHH Q ss_conf 38015789999961122----10378887139999748985479996689707898999874064621110 Q 537021.9.peg.7 112 QDRSPLREIHKYFENRQ----TQNYDKQGKIGLIFLRSQGKIEIEITLDHYPISPEIAFDLEHITGITKVV 178 (184) Q Consensus 112 rD~~pL~sI~s~Le~~~----~~~~~G~G~V~Lvli~~~~~~EVEI~Lp~f~VSPqIagALKsIpGVl~Ve 178 (184) +-|..|+.|.+.|.... ..-.+.+|++...++.-+ -.++.++...|+++|||++|. T Consensus 8 NvPG~l~~I~~il~~~~iNI~~m~~~srg~~ay~vidvd-----------~~~~~~~~~~l~~i~gvi~vR 67 (69) T cd04901 8 NVPGVLGQINTILAEHNINIAAQYLQTRGEIGYVVIDID-----------SEVSEELLEALRAIPGTIRVR 67 (69) T ss_pred CCCCHHHHHHHHHHHCCCCHHHHCCCCCCCEEEEEEEEC-----------CCCCHHHHHHHHCCCCEEEEE T ss_conf 988689999999998599979972426897689999907-----------989899999997299859997 No 30 >COG2177 FtsX Cell division protein [Cell division and chromosome partitioning] Probab=53.71 E-value=19 Score=17.69 Aligned_cols=59 Identities=22% Similarity=0.379 Sum_probs=37.8 Q ss_pred EEECCCHHHHHHHHHHHHHHCCCCCCCCEEEEEEC-CCCCEEEEEECCCCCCCHHHHHHHHHCCCHHEHHH Q ss_conf 97380157899999611221037888713999974-89854799966897078989998740646211103 Q 537021.9.peg.7 110 YLQDRSPLREIHKYFENRQTQNYDKQGKIGLIFLR-SQGKIEIEITLDHYPISPEIAFDLEHITGITKVVQ 179 (184) Q Consensus 110 fVrD~~pL~sI~s~Le~~~~~~~~G~G~V~Lvli~-~~~~~EVEI~Lp~f~VSPqIagALKsIpGVl~VeE 179 (184) |+..++++..++..| |.| .+.... +-.+..+=|+...=+-.++++.+|+.+|||-+|+. T Consensus 92 ~~sre~~l~~L~~~l---------g~~--~~~~l~~nPLP~~~vV~~~~p~~~~~i~~~l~~l~gV~~V~~ 151 (297) T COG2177 92 FISREEALKELQPWL---------GFG--ALLMLDENPLPDVFVVTPDDPPQVKAIAAALRDLPGVAEVDD 151 (297) T ss_pred EECHHHHHHHHHHHC---------CCH--HHHCCCCCCCCCEEEEEECCCCCHHHHHHHHHCCCCCEEEEC T ss_conf 767999999999872---------941--420477899986589972897627999999874866423105 No 31 >PRK13978 ribose-5-phosphate isomerase A; Provisional Probab=53.10 E-value=20 Score=17.48 Aligned_cols=108 Identities=15% Similarity=0.114 Sum_probs=56.2 Q ss_pred CCCCEEEEEEECCCCCCCCCCCEEEEE-CCCHHHHHHHHCCCEEEEEECCC------------------HHHHHHHHHHH Q ss_conf 499828999841001677764269865-04158999732011099973801------------------57899999611 Q 537021.9.peg.7 66 REKKPYIIVVSQEDKLSLQRDSIRLLW-VESLEKKSMEMLDSLRVYLQDRS------------------PLREIHKYFEN 126 (184) Q Consensus 66 e~gkpvii~VsAe~rle~eg~~LRi~~-vepLD~aAa~~~kgLRIfVrD~~------------------pL~sI~s~Le~ 126 (184) ++...+-+++.-.+-.+++..-+-=-. +.--|+..|.+.+-.=+-++..+ +...+...|+. T Consensus 75 ~~~~~iDl~iDGADEvd~~lnlIKGgGgal~rEKivA~~ak~~IiI~D~sK~v~~Lg~~~plPVEVvp~a~~~v~~~l~~ 154 (228) T PRK13978 75 NDVDHIDLAIDGADEVDPSLNIIKGGGGALFREKVIDEMASRFVVVVDETKIVQYLGETFKLPVEVDKFNWYHILRKIES 154 (228) T ss_pred CCCCCCCEEEECCCCCCCCCCEEEECCHHHHHHHHHHHCCCCEEEEEECCHHHHHCCCCCCCEEEECCCHHHHHHHHHHH T ss_conf 56873467980886417887889878499999999997038489998365434543883880089903489999999998 Q ss_pred H----HHCCCCCCCCEEEEEECCCCCEEEEEECCCCCCCHHHHHHHHHCCCHHEH Q ss_conf 2----21037888713999974898547999668970789899987406462111 Q 537021.9.peg.7 127 R----QTQNYDKQGKIGLIFLRSQGKIEIEITLDHYPISPEIAFDLEHITGITKV 177 (184) Q Consensus 127 ~----~~~~~~G~G~V~Lvli~~~~~~EVEI~Lp~f~VSPqIagALKsIpGVl~V 177 (184) . ..++.+++++ ++.++|..-+++..+...=--.+...|+++|||++. T Consensus 155 ~~~~~~~~r~~~~~p----~vTDnGN~IlD~~f~~~~dp~~l~~~L~~ipGVVe~ 205 (228) T PRK13978 155 YADIKVERRVNEDVA----FITDNGNYILDCKLPKGIDPYKFHEYLIHLTGVFET 205 (228) T ss_pred CCCCCEEEEECCCCC----EEECCCCEEEEECCCCCCCHHHHHHHHHCCCCEEEE T ss_conf 079954766459996----693199889992289989999999998779999811 No 32 >TIGR00006 TIGR00006 S-adenosyl-methyltransferase MraW; InterPro: IPR002903 This is a family of methyltransferases. Methyltransferases are responsible for the transfer of methyl groups between two molecules.; GO: 0008168 methyltransferase activity. Probab=51.54 E-value=6.5 Score=20.58 Aligned_cols=45 Identities=18% Similarity=0.447 Sum_probs=30.1 Q ss_pred HHHHCCCEEEEEECCCHHHHHHHHHH-HHHHCCCCCCCCEEEEEECCCCC Q ss_conf 97320110999738015789999961-12210378887139999748985 Q 537021.9.peg.7 100 SMEMLDSLRVYLQDRSPLREIHKYFE-NRQTQNYDKQGKIGLIFLRSQGK 148 (184) Q Consensus 100 Aa~~~kgLRIfVrD~~pL~sI~s~Le-~~~~~~~~G~G~V~Lvli~~~~~ 148 (184) |+++=.++||||+|+ |..+...|+ +.+ .-+-+|+.++|...+-.. T Consensus 212 AtrvFQAiRI~vNdE--L~~Le~~L~~~~~--~L~~~Grl~vIsFHSLED 257 (323) T TIGR00006 212 ATRVFQAIRIYVNDE--LEELEEALQQAPN--LLKPGGRLSVISFHSLED 257 (323) T ss_pred HHHHHHHHHHHHHHH--HHHHHHHHHHHHH--HHCCCCEEEEEEHHHHHH T ss_conf 125553567788878--9999999999998--417897189874000567 No 33 >pfam01883 DUF59 Domain of unknown function DUF59. This family includes prokaryotic proteins of unknown function. The family also includes PhaH from Pseudomonas putida. PhaH forms a complex with PhaF, PhaG, and PhaI, which hydroxylates phenylacetic acid to 2-hydroxyphenylacetic acid. So members of this family may all be components of ring hydroxylating complexes. Probab=51.26 E-value=19 Score=17.62 Aligned_cols=43 Identities=19% Similarity=0.422 Sum_probs=27.4 Q ss_pred CCEEEEEECCCCCEEEEEECC--CCCCCH----HHHHHHHHCCCHHEHH Q ss_conf 713999974898547999668--970789----8999874064621110 Q 537021.9.peg.7 136 GKIGLIFLRSQGKIEIEITLD--HYPISP----EIAFDLEHITGITKVV 178 (184) Q Consensus 136 G~V~Lvli~~~~~~EVEI~Lp--~f~VSP----qIagALKsIpGVl~Ve 178 (184) |-|.=+-+.+++...|.+.|. .++..+ +++.|++++|||-+|. T Consensus 25 GlI~~i~v~~~g~v~I~~~lT~~~CP~~~~i~~~i~~~l~~v~gv~~V~ 73 (76) T pfam01883 25 GLVYEVDIDDDGNVKVKMTLTTPGCPLADLIALDVREALLELPGVEDVE 73 (76) T ss_pred CEEEEEEECCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHCCCCCEEEE T ss_conf 5368999857984999999589999837899999999998399940789 No 34 >PRK07218 replication factor A; Provisional Probab=46.91 E-value=25 Score=16.92 Aligned_cols=23 Identities=13% Similarity=0.273 Sum_probs=15.6 Q ss_pred CCCCEEEEEECCCCCEEEEEECC Q ss_conf 88713999974898547999668 Q 537021.9.peg.7 134 KQGKIGLIFLRSQGKIEIEITLD 156 (184) Q Consensus 134 G~G~V~Lvli~~~~~~EVEI~Lp 156 (184) |.--+.+.+.+++|..-+.+-|. T Consensus 327 ~~~Dlr~k~vlDdG~~~~~viln 349 (430) T PRK07218 327 GEDDMRIKAVLDDGTGSVTVMLD 349 (430) T ss_pred CCCCEEEEEEEECCCCEEEEEEC T ss_conf 76440899999789722899976 No 35 >pfam01336 tRNA_anti OB-fold nucleic acid binding domain. This family contains OB-fold domains that bind to nucleic acids. The family includes the anti-codon binding domain of lysyl, aspartyl, and asparaginyl -tRNA synthetases (See pfam00152). Aminoacyl-tRNA synthetases catalyse the addition of an amino acid to the appropriate tRNA molecule EC:6.1.1.-. This family also includes part of RecG helicase involved in DNA repair. Replication factor A is a heterotrimeric complex, that contains a subunit in this family. This domain is also found at the C-terminus of bacterial DNA polymerase III alpha chain. Probab=46.75 E-value=26 Score=16.86 Aligned_cols=54 Identities=11% Similarity=0.042 Sum_probs=40.2 Q ss_pred HHHHCCCCCEEEEEHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCC-CCEEEEECCC Q ss_conf 55310278752210589999987875249982899984100167776-4269865041 Q 537021.9.peg.7 39 LQEAEEDPNFLSLQEQQQLEEEKPRLLREKKPYIIVVSQEDKLSLQR-DSIRLLWVES 95 (184) Q Consensus 39 ~~~~~~~~~~~~~~~~~~le~~~~~LLe~gkpvii~VsAe~rle~eg-~~LRi~~vep 95 (184) +...|.||+..+..+....+... +.|.+|..+.+. ...+...++ ..+.+..+++ T Consensus 20 ~~l~D~tg~i~~~~~~~~~~~~~-~~l~~g~~v~v~--G~v~~~~~~~~~i~~~~i~~ 74 (75) T pfam01336 20 LTLRDGTGSIQVVLFKEEAEKLA-KKLKEGDVVLVT--GKVKKRPGGELELVVEEIEV 74 (75) T ss_pred EEEEECCEEEEEEEEHHHHHHHH-HCCCCCCEEEEE--EEEEECCCCCEEEEEEEEEE T ss_conf 99997993999999523868886-328989899999--99999749989999989997 No 36 >TIGR00383 corA magnesium and cobalt transport protein CorA; InterPro: IPR004488 The CorA transport system is the primary Mg2+ influx system of Salmonella typhimurium and Escherichia coli. It has an unusual membrane topology, with a large, soluble, highly charged periplasmic N-terminal domain with three transmembrane segments in a shorter, hydrophobic C-terminal domain. It has been suggested that the CorA Mg2+ transport system forms the major Mg2+ uptake system in the bacteria and archaea but that some family members may have a function other than Mg2+ transport .; GO: 0015087 cobalt ion transmembrane transporter activity, 0015095 magnesium ion transmembrane transporter activity, 0006824 cobalt ion transport, 0015693 magnesium ion transport, 0016020 membrane. Probab=44.60 E-value=4.5 Score=21.54 Aligned_cols=76 Identities=12% Similarity=0.086 Sum_probs=36.1 Q ss_pred HHHCCCCCEEEEEHHHHHHHHHHHHHCCCCEEEEEEECC------CCCCCCCCCEEEEECCCHHHHHHHHCCC--EEEEE Q ss_conf 531027875221058999998787524998289998410------0167776426986504158999732011--09997 Q 537021.9.peg.7 40 QEAEEDPNFLSLQEQQQLEEEKPRLLREKKPYIIVVSQE------DKLSLQRDSIRLLWVESLEKKSMEMLDS--LRVYL 111 (184) Q Consensus 40 ~~~~~~~~~~~~~~~~~le~~~~~LLe~gkpvii~VsAe------~rle~eg~~LRi~~vepLD~aAa~~~kg--LRIfV 111 (184) ++++.=+......+...-..++||+-|-.-.+++..-+- +.++.+.+++-+ .++ +-+.- T Consensus 51 ~L~~~~~~hp~~ledi~~~~~r~K~~ed~d~~~~~~~~~~~d~e~~~~~~~~v~f~l-------------~~~~L~T~~~ 117 (339) T TIGR00383 51 KLGEFLAIHPLALEDILNSPQRPKVEEDDDHLFIISFFLLEDREEETFETEQVSFIL-------------GKNLLLTIHE 117 (339) T ss_pred HHHHHHCCCHHHHHHHHHCCCCCEEEECCCEEEEEEEECCCCCCCCCCCHHHEEEEE-------------ECCEEEEEEC T ss_conf 998986331444554531565451686288168877630247543431032032444-------------2653797633 Q ss_pred EC-CCHHHHHHHHHHHHH Q ss_conf 38-015789999961122 Q 537021.9.peg.7 112 QD-RSPLREIHKYFENRQ 128 (184) Q Consensus 112 rD-~~pL~sI~s~Le~~~ 128 (184) +. -.++.++++++.+.+ T Consensus 118 ~~~~~~~~~~r~r~~~~~ 135 (339) T TIGR00383 118 REKLPAFDSIRERIRTSQ 135 (339) T ss_pred CCCCHHHHHHHHHHHCCC T ss_conf 741057899888864287 No 37 >TIGR02803 ExbD_1 TonB system transport protein ExbD; InterPro: IPR014170 Members of this family are Gram-negative bacterial inner membrane proteins, generally designated ExbD, related to the TolR IPR014168 from INTERPRO. Members are always found next to a protein designated ExbB (IPR014164 from INTERPRO), which is related to the TolQ family (IPR014163 from INTERPRO). ExbD and ExbB together form a proton channel through which they can harness the proton-motive force to energize TonB, which in turn energizes TonB-dependent receptors in the outer membrane. TonB-dependent receptors with known specificity tend to import siderophores or vitamin B12. A TonB system and Tol-Pal system often will co-exist in a single bacterial genome.. Probab=40.46 E-value=18 Score=17.88 Aligned_cols=76 Identities=13% Similarity=0.130 Sum_probs=49.5 Q ss_pred HCCCCEEEEEEECCCCCCCCCCCEEEEECCCHHHHHHHHCCCEEEEEECCCHHHHHHHHHHHHHHCCCCCCCCEEEE Q ss_conf 24998289998410016777642698650415899973201109997380157899999611221037888713999 Q 537021.9.peg.7 65 LREKKPYIIVVSQEDKLSLQRDSIRLLWVESLEKKSMEMLDSLRVYLQDRSPLREIHKYFENRQTQNYDKQGKIGLI 141 (184) Q Consensus 65 Le~gkpvii~VsAe~rle~eg~~LRi~~vepLD~aAa~~~kgLRIfVrD~~pL~sI~s~Le~~~~~~~~G~G~V~Lv 141 (184) =+|.||+-+++.+..++.--.....-.+..+.=++..+..+.-+||++.++.+.- .+..+.-+..+..|-=+|.|+ T Consensus 44 PrP~kP~~~~~k~d~~~~~G~~~v~~~~l~~~l~~~~~~~k~~~if~~adk~v~y-~~lm~~m~~l~~aGylk~~lv 119 (122) T TIGR02803 44 PRPEKPVYVSLKADLSLAVGNDTVAREALGAALDALTEGDKDTRIFLRADKAVDY-GDLMKVMNLLREAGYLKVALV 119 (122) T ss_pred CCCCCCEEEEECCCCEEEECCCCCCHHHHHHHHHHHCCCCCCEEEEEEECCCCCH-HHHHHHHHHHHHCCCEEEEEE T ss_conf 8999867898717865653454210678999875420354310666631476378-899999998874783478875 No 38 >pfam04954 SIP Siderophore-interacting protein. Probab=39.78 E-value=33 Score=16.20 Aligned_cols=18 Identities=22% Similarity=0.451 Sum_probs=11.4 Q ss_pred EEEECCCHHHHHHHHHHH Q ss_conf 997380157899999611 Q 537021.9.peg.7 109 VYLQDRSPLREIHKYFEN 126 (184) Q Consensus 109 IfVrD~~pL~sI~s~Le~ 126 (184) +++-|+.+|+.|.+.|+. T Consensus 5 llvgDeTAlPAi~~iLe~ 22 (118) T pfam04954 5 LLVGDETALPAIARILEE 22 (118) T ss_pred EEECCCCHHHHHHHHHHH T ss_conf 996440009999999986 No 39 >TIGR01213 TIGR01213 conserved hypothetical protein TIGR01213; InterPro: IPR005912 The members of this family of conserved hypothetical proteins show a low level of similarity to several predicted RNA pseudouridine synthases. All trusted members of this family are archaeal. Several eukaryotic homologs lack N-terminal homology including two CXXC motifs. . Probab=37.95 E-value=20 Score=17.59 Aligned_cols=45 Identities=18% Similarity=0.291 Sum_probs=33.5 Q ss_pred HHHCCCCEEEEEEECCCCCCCCCCCEEEEECCCHHHHHHHHCCCEEEEEECCC Q ss_conf 75249982899984100167776426986504158999732011099973801 Q 537021.9.peg.7 63 RLLREKKPYIIVVSQEDKLSLQRDSIRLLWVESLEKKSMEMLDSLRVYLQDRS 115 (184) Q Consensus 63 ~LLe~gkpvii~VsAe~rle~eg~~LRi~~vepLD~aAa~~~kgLRIfVrD~~ 115 (184) +||=+||||+| | =..++.|-.-.++|++++.....|+-|=|.+-+ T Consensus 275 RMLG~GRPFVl----E----v~~Pr~R~~dl~~l~~eiN~~a~GkGVEv~~~~ 319 (450) T TIGR01213 275 RMLGTGRPFVL----E----VKEPRKRKIDLDELEEEINKGASGKGVEVLGLK 319 (450) T ss_pred ECCCCCCCEEE----E----ECCCCCCCCCHHHHHHHHCCCCCCCEEEEEEEE T ss_conf 01788883599----7----258865888878999995467885348997333 No 40 >TIGR02852 spore_dpaB dipicolinic acid synthetase, B subunit; InterPro: IPR014214 Members of this entry represent the B subunit of dipicolinic acid synthetase, an enzyme that synthesizes a small molecule that appears to confer heat stability to bacterial endospores such as those of Bacillus subtilis. The A and B subunits are together in what was originally designated the spoVF locus for stage V of endospore formation.. Probab=37.44 E-value=21 Score=17.47 Aligned_cols=35 Identities=29% Similarity=0.481 Sum_probs=28.8 Q ss_pred HHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCE Q ss_conf 589999987875249982899984100167776426 Q 537021.9.peg.7 53 EQQQLEEEKPRLLREKKPYIIVVSQEDKLSLQRDSI 88 (184) Q Consensus 53 ~~~~le~~~~~LLe~gkpvii~VsAe~rle~eg~~L 88 (184) -+-+|=+++ --|+-+|||++.||-++-|..++.+| T Consensus 102 DspVLMAAK-A~lRN~~PVvLa~sTNDaLGlNa~NL 136 (188) T TIGR02852 102 DSPVLMAAK-ATLRNNKPVVLAISTNDALGLNAVNL 136 (188) T ss_pred CCHHHHHHH-HHHHCCCCEEEEEECCCCCCHHHHHH T ss_conf 977788778-77517897898750345011328999 No 41 >PRK05611 rpmD 50S ribosomal protein L30; Reviewed Probab=36.87 E-value=15 Score=18.25 Aligned_cols=21 Identities=19% Similarity=0.267 Sum_probs=18.9 Q ss_pred CCCHHHHHHHHHCCCHHEHHH Q ss_conf 078989998740646211103 Q 537021.9.peg.7 159 PISPEIAFDLEHITGITKVVQ 179 (184) Q Consensus 159 ~VSPqIagALKsIpGVl~VeE 179 (184) +-||+++|.|+.+.--++|+| T Consensus 38 ~dtp~irGmi~kV~hLV~VEE 58 (58) T PRK05611 38 EDTPAIRGMINKVSHLVKVEE 58 (58) T ss_pred CCCHHHHHHHHHHHHHEEEEC T ss_conf 899789879996254269849 No 42 >cd04903 ACT_LSD C-terminal ACT domain of the L-serine dehydratase (LSD), iron-sulfur-dependent, beta subunit. The C-terminal ACT domain of the L-serine dehydratase (LSD), iron-sulfur-dependent, beta subunit, found in various bacterial anaerobes such as Clostridium, Bacillis, and Treponema species. These enzymes catalyze the deamination of L-serine, producing pyruvate and ammonia. Unlike the eukaryotic L-serine dehydratase, which requires the pyridoxal-5'-phosphate (PLP) cofactor, the prokaryotic L-serine dehydratase contains an [4Fe-4S] cluster instead of a PLP active site. The LSD alpha and beta subunits of the 'clostridial' enzyme are encoded by the sdhA and sdhB genes. The single subunit bacterial homologs of L-serine dehydratase (LSD1, LSD2, TdcG) present in Escherichia coli, and other enterobacterials, lack the ACT domain described here. Members of this CD belong to the superfamily of ACT regulatory domains. Probab=36.12 E-value=38 Score=15.84 Aligned_cols=58 Identities=14% Similarity=0.332 Sum_probs=36.1 Q ss_pred ECCCHHHHHHHHHHHHHHCCCCCCCCEEE---EEECCCCCEEEEEECC-CCCCCHHHHHHHHHCCCHHEHH Q ss_conf 38015789999961122103788871399---9974898547999668-9707898999874064621110 Q 537021.9.peg.7 112 QDRSPLREIHKYFENRQTQNYDKQGKIGL---IFLRSQGKIEIEITLD-HYPISPEIAFDLEHITGITKVV 178 (184) Q Consensus 112 rD~~pL~sI~s~Le~~~~~~~~G~G~V~L---vli~~~~~~EVEI~Lp-~f~VSPqIagALKsIpGVl~Ve 178 (184) +-|..|..|.+.|... .|.+ .+-.....+.--.-+. .=++++++..+|+.+|||.+|. T Consensus 8 D~PG~Ig~i~~~L~~~---------~INIa~m~v~R~~~g~~A~~ii~~D~~v~~~~l~~i~~~~~I~~v~ 69 (71) T cd04903 8 DKPGAIAKVTSVLADH---------EINIAFMRVSRKEKGDQALMVIEVDQPIDEEVIEEIKKIPNIHQVI 69 (71) T ss_pred CCCCCHHHHHHHHHHC---------CCCCHHEEEECCCCCCEEEEEEECCCCCCHHHHHHHHCCCCCEEEE T ss_conf 8699199999999865---------9680361875158997899999928999999999997599956999 No 43 >pfam05399 EVI2A Ectropic viral integration site 2A protein (EVI2A). This family contains several mammalian ectropic viral integration site 2A (EVI2A) proteins. The function of this protein is unknown although it is thought to be a membrane protein and may function as an oncogene in retrovirus induced myeloid tumours. Probab=34.35 E-value=32 Score=16.27 Aligned_cols=57 Identities=23% Similarity=0.281 Sum_probs=39.2 Q ss_pred CH-HCCCCCCCEEEEEEEHHHHHHHHHHHHHHCCCCCEEEEEHHHHHHHHHHHHHCCC Q ss_conf 11-0028667614898501344333235553102787522105899999878752499 Q 537021.9.peg.7 12 RK-GKKKPDGNHDFLFLEDCYDHAIEMKLQEAEEDPNFLSLQEQQQLEEEKPRLLREK 68 (184) Q Consensus 12 ~~-~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~le~~~~~LLe~g 68 (184) .| ||..||.|-||+--+.-+-++-.---|--+-||+-++.|.+-+|.+.|-+--|.| T Consensus 165 kQ~GKRqPRSNGDfLAs~glWpaesdTWkR~Keltg~nL~mQssGvlta~~erk~eEg 222 (227) T pfam05399 165 KQVGKRQPRSNGDFLASNGLWPAESDTWKRPKELTGSNLVMQSSGVLTAIRERKQEEG 222 (227) T ss_pred HHHCCCCCCCCCCEEECCCCCCCCCCCCCCCHHCCCCCEEEECCCHHHHHHHHHCCCC T ss_conf 8743368887887531267564556632281222676355304332677776521456 No 44 >cd04902 ACT_3PGDH-xct C-terminal ACT (regulatory) domain of D-3-phosphoglycerate dehydrogenase (3PGDH). The C-terminal ACT (regulatory) domain of D-3-phosphoglycerate dehydrogenase (3PGDH), with an extended C-terminal (xct) region from bacteria, archaea, fungi, and plants. 3PGDH is an enzyme that belongs to the D-isomer specific, 2-hydroxyacid dehydrogenase family and catalyzes the oxidation of D-3-phosphoglycerate to 3- phosphohydroxypyruvate, which is the first step in the biosynthesis of L-serine, using NAD+ as the oxidizing agent. In bacteria, 3PGDH is feedback-controlled by the end product L-serine in an allosteric manner. Some 3PGDH enzymes have an additional domain formed by an extended C-terminal region. This additional domain introduces significant asymmetry to the homotetramer. Adjacent ACT (regulatory) domains interact, creating two serine-binding sites, however, this asymmetric arrangement results in the formation of two different and distinct domain interfaces between iden Probab=33.41 E-value=42 Score=15.57 Aligned_cols=60 Identities=17% Similarity=0.225 Sum_probs=35.0 Q ss_pred ECCCHHHHHHHHHHHHHHCCCCCCCCEEEE---EECCCCCEEEEEECC-CCCCCHHHHHHHHHCCCHHEHHHH Q ss_conf 380157899999611221037888713999---974898547999668-970789899987406462111030 Q 537021.9.peg.7 112 QDRSPLREIHKYFENRQTQNYDKQGKIGLI---FLRSQGKIEIEITLD-HYPISPEIAFDLEHITGITKVVQV 180 (184) Q Consensus 112 rD~~pL~sI~s~Le~~~~~~~~G~G~V~Lv---li~~~~~~EVEI~Lp-~f~VSPqIagALKsIpGVl~VeEi 180 (184) +-|..|..|.+.|... .|.+- +-.....+.-=.-+. .=++++++..+|+.+|||.+|.-+ T Consensus 8 D~PGvIg~v~~~Lg~~---------~INIa~m~l~R~~~g~~A~~vi~vD~~v~~~~l~~i~~~~~V~~v~~i 71 (73) T cd04902 8 DRPGVIGKVGTILGEA---------GINIAGMQVGRDEPGGEALMVLSVDEPVPDEVLEELRALPGILSAKVV 71 (73) T ss_pred CCCCCHHHHHHHHHHC---------CCCHHHHHHCCCCCCCEEEEEEEECCCCCHHHHHHHHCCCCCCEEEEE T ss_conf 9899289999999870---------977888432576899878999990899999999999759895289998 No 45 >pfam04530 Viral_Beta_CD Viral Beta C/D like family. Family of ssRNA positive-strand viral proteins. Conserved region found in the Beta C and Beta D transcripts. Probab=32.84 E-value=39 Score=15.74 Aligned_cols=23 Identities=35% Similarity=0.546 Sum_probs=19.3 Q ss_pred EEEEEECCCCCCCHHHHHHHHHC Q ss_conf 47999668970789899987406 Q 537021.9.peg.7 149 IEIEITLDHYPISPEIAFDLEHI 171 (184) Q Consensus 149 ~EVEI~Lp~f~VSPqIagALKsI 171 (184) ..|||+++..+|.|.+..||++- T Consensus 89 NsVeik~~~~PIDP~VIaAIHHw 111 (122) T pfam04530 89 NSVEIKLGPGPIDPEVIAAIHHW 111 (122) T ss_pred CCEEEEECCCCCCHHHHHHHHHH T ss_conf 42678615899898999999886 No 46 >PRK10260 hypothetical protein; Provisional Probab=32.44 E-value=28 Score=16.63 Aligned_cols=83 Identities=18% Similarity=0.216 Sum_probs=40.5 Q ss_pred CEEEEEECCCC----------CHHCCCCCC-CEEEEE-EEH-HHHHHHHHHHHHHCCCCCEEEEEH----HHHHHHHHHH Q ss_conf 91899506887----------110028667-614898-501-344333235553102787522105----8999998787 Q 537021.9.peg.7 1 MRLIFFPKNNN----------RKGKKKPDG-NHDFLF-LED-CYDHAIEMKLQEAEEDPNFLSLQE----QQQLEEEKPR 63 (184) Q Consensus 1 ~~~~~~~~~~~----------~~~~~~~~g-~~~~~~-~~~-~~~~~~~~~~~~~~~~~~~~~~~~----~~~le~~~~~ 63 (184) |||.|||++.| |.|+..|.| ...+.- ..+ +.----.| -.++.+.|..+-.-+ .--|-.-.. T Consensus 108 mRLYYfp~~~~~V~t~PVGIGr~g~~TPlg~~t~I~~K~~~PtW~pp~si-r~e~~~~G~~LP~vvPpGPdNPLG~~Al- 185 (306) T PRK10260 108 MRLYYYPKGTNTVIVLPIGIGQLGKDTPINWTTKVERKKAGPTWTPTAKM-HAEYRAAGEPLPAVVPAGPDNPMGLYAL- 185 (306) T ss_pred HEEEEECCCCCEEEEEEEECCCCCCCCCCCCEEEEEECCCCCCCCCCHHH-HHHHHHCCCCCCCCCCCCCCCCHHHHHH- T ss_conf 24478438998699982443668887887641799972579988898788-8999972998876689999896646777- Q ss_pred HHCCCCEEEEEEECCCCCCCCCCCEEEE Q ss_conf 5249982899984100167776426986 Q 537021.9.peg.7 64 LLREKKPYIIVVSQEDKLSLQRDSIRLL 91 (184) Q Consensus 64 LLe~gkpvii~VsAe~rle~eg~~LRi~ 91 (184) .| |++++| -.-++ +.|+.+|+- T Consensus 186 ~l--~~~YlI--HGTN~--p~gIG~rvS 207 (306) T PRK10260 186 YI--GRLYAI--HGTNA--NFGIGLRVS 207 (306) T ss_pred HH--CCCEEE--ECCCC--CCCCCCCCC T ss_conf 62--784057--77899--875132016 No 47 >PRK11750 gltB glutamate synthase subunit alpha; Provisional Probab=30.96 E-value=30 Score=16.46 Aligned_cols=134 Identities=17% Similarity=0.095 Sum_probs=60.7 Q ss_pred HHHHHHHHHHCCCCCEEEEEHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCEEEEECCCHHHHHHHHC-CCEEEEE Q ss_conf 333235553102787522105899999878752499828999841001677764269865041589997320-1109997 Q 537021.9.peg.7 33 HAIEMKLQEAEEDPNFLSLQEQQQLEEEKPRLLREKKPYIIVVSQEDKLSLQRDSIRLLWVESLEKKSMEML-DSLRVYL 111 (184) Q Consensus 33 ~~~~~~~~~~~~~~~~~~~~~~~~le~~~~~LLe~gkpvii~VsAe~rle~eg~~LRi~~vepLD~aAa~~~-kgLRIfV 111 (184) ..+--+||.+-.+++|..-+.-..+-.++ .|.-|----+-+ +....+-+..|||.++-..+-. .+|-.== T Consensus 799 P~~i~~Lq~A~r~~~y~~yk~y~~~i~~~-------~~~~lR~Ll~~~--~~~~pi~l~eVE~~~~I~krF~tgaMS~Gs 869 (1483) T PRK11750 799 PDVVNTLQKAVQSGDYSDYQEYAKLVNER-------PVATLRDLLALK--PADNPIPLDEVEPAEELFKRFDSAAMSIGA 869 (1483) T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHCC-------CCCCCCCCCCCC--CCCCCCCHHHCCCHHHHHHHCCCCCCCCCC T ss_conf 99999999998649999999999987258-------976611043738--899998968825377777512544466666 Q ss_pred ECCCHHHHHHHHH---HHHHHCCCCCC----------CCEEEEEE--------CCCCCEEEEEE------------CCCC Q ss_conf 3801578999996---11221037888----------71399997--------48985479996------------6897 Q 537021.9.peg.7 112 QDRSPLREIHKYF---ENRQTQNYDKQ----------GKIGLIFL--------RSQGKIEIEIT------------LDHY 158 (184) Q Consensus 112 rD~~pL~sI~s~L---e~~~~~~~~G~----------G~V~Lvli--------~~~~~~EVEI~------------Lp~f 158 (184) -.+.|-+.++-.. -.+-+...+|. +.|.=|-. -.....|++|+ ||++ T Consensus 870 iS~eAHetlA~AmNriG~~SnsGEGGEd~~R~g~~~~S~iKQVASGRFGVt~~YL~~A~eiqIKiAQGAKPGEGGqLPG~ 949 (1483) T PRK11750 870 LSPEAHEALAIAMNRLGGRSNSGEGGEDPARYGTEKNSKIKQVASGRFGVTPHYLVNAEVLQIKVAQGAKPGEGGQLPGD 949 (1483) T ss_pred CCHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHCCCCCCCCEEEEECCCCCCHHHHCCHHHEEHHHHCCCCCCCCCCCCCC T ss_conf 79899999999999848832588888788770898888633750012778988946965621223147899978889886 Q ss_pred CCCHHHHHHHHHCCCHH Q ss_conf 07898999874064621 Q 537021.9.peg.7 159 PISPEIAFDLEHITGIT 175 (184) Q Consensus 159 ~VSPqIagALKsIpGVl 175 (184) +|||.||.+=.+.|||- T Consensus 950 KV~~~IA~~R~stPGV~ 966 (1483) T PRK11750 950 KVNPLIAKLRYSVPGVT 966 (1483) T ss_pred CCCHHHHHHCCCCCCCC T ss_conf 45799998707999978 No 48 >COG3040 Blc Bacterial lipocalin [Cell envelope biogenesis, outer membrane] Probab=30.85 E-value=27 Score=16.73 Aligned_cols=15 Identities=33% Similarity=0.359 Sum_probs=6.1 Q ss_pred CCEEEEEEEHHHHHH Q ss_conf 761489850134433 Q 537021.9.peg.7 20 GNHDFLFLEDCYDHA 34 (184) Q Consensus 20 g~~~~~~~~~~~~~~ 34 (184) |+|.+++++|-|-+| T Consensus 111 g~Y~Vl~~d~eYs~a 125 (174) T COG3040 111 GDYWVLALDPEYSWA 125 (174) T ss_pred CCEEEEEECCCCCEE T ss_conf 537999977875489 No 49 >COG0275 Predicted S-adenosylmethionine-dependent methyltransferase involved in cell envelope biogenesis [Cell envelope biogenesis, outer membrane] Probab=29.05 E-value=49 Score=15.12 Aligned_cols=44 Identities=20% Similarity=0.443 Sum_probs=26.1 Q ss_pred HHHHCCCEEEEEECCCHHHHHHHHHHHHHHCCCCCCCCEEEEEECCC Q ss_conf 97320110999738015789999961122103788871399997489 Q 537021.9.peg.7 100 SMEMLDSLRVYLQDRSPLREIHKYFENRQTQNYDKQGKIGLIFLRSQ 146 (184) Q Consensus 100 Aa~~~kgLRIfVrD~~pL~sI~s~Le~~~~~~~~G~G~V~Lvli~~~ 146 (184) |.++=.++||||+|+ |..+...|+...+ -.+-+|++.+|...+- T Consensus 205 Atr~FQAiRI~VNdE--L~~L~~~L~~a~~-~L~~~GRl~VIsFHSL 248 (314) T COG0275 205 ATRTFQAIRIYVNDE--LEELEEALEAALD-LLKPGGRLAVISFHSL 248 (314) T ss_pred CHHHHHHHEEEEHHH--HHHHHHHHHHHHH-HHCCCCEEEEEEECCH T ss_conf 066563410000567--9999999999999-6188967999995342 No 50 >cd04492 YhaM_OBF_like YhaM_OBF_like: A subfamily of OB folds similar to that found in Bacillus subtilis YhaM and Staphylococcus aureus cmp-binding factor-1 (SaCBF1). Both these proteins are 3'-to-5'exoribonucleases. YhaM requires Mn2+ or Co2+ for activity and is inactive in the presence of Mg2+. YhaM also has a Mn2+ dependent 3'-to-5'single-stranded DNA exonuclease activity. SaCBF is also a double-stranded DNA binding protein, binding specifically to cmp, the replication enhancer found in S. aureus plasmid pT181. Proteins in this group combine an N-terminal OB fold with a C-terminal HD domain. The HD domain is found in metal-dependent phosphohydrolases. Probab=28.95 E-value=50 Score=15.10 Aligned_cols=68 Identities=7% Similarity=0.067 Sum_probs=45.0 Q ss_pred HCCCCCCCEEEEEEEHHHHHHHHHHHHHHCCCCCEEEEEHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCEEEEEC Q ss_conf 00286676148985013443332355531027875221058999998787524998289998410016777642698650 Q 537021.9.peg.7 14 GKKKPDGNHDFLFLEDCYDHAIEMKLQEAEEDPNFLSLQEQQQLEEEKPRLLREKKPYIIVVSQEDKLSLQRDSIRLLWV 93 (184) Q Consensus 14 ~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~le~~~~~LLe~gkpvii~VsAe~rle~eg~~LRi~~v 93 (184) -++.-.|.. |+. |.+.|.||+-.+-....+-+. . +-+++|..|-+ .+.--.=-..+-|++.++ T Consensus 11 ~~~tk~G~~-Yl~------------l~L~D~tG~I~ak~W~~~~~~-~-~~~~~g~~V~v--~G~v~~y~g~~Ql~I~~i 73 (83) T cd04492 11 LRTAKNGKP-YLA------------LTLQDKTGEIEAKLWDASEED-E-EKFKPGDIVHV--KGRVEEYRGRLQLKIQRI 73 (83) T ss_pred HHCCCCCCC-EEE------------EEEECCCCEEEEEECCCCHHH-H-HCCCCCCEEEE--EEEEEEECCCEEEEEEEE T ss_conf 000579991-789------------999878876888885788678-8-30789999999--999988899510899896 Q ss_pred CCHHH Q ss_conf 41589 Q 537021.9.peg.7 94 ESLEK 98 (184) Q Consensus 94 epLD~ 98 (184) .++++ T Consensus 74 ~~~~~ 78 (83) T cd04492 74 RLVTE 78 (83) T ss_pred EECCC T ss_conf 89983 No 51 >PRK11790 D-3-phosphoglycerate dehydrogenase; Provisional Probab=28.76 E-value=32 Score=16.31 Aligned_cols=64 Identities=19% Similarity=0.255 Sum_probs=40.5 Q ss_pred CEEEEE---ECCCHHHHHHHHHHHH----HHCCCCCCCCEEEEEECCCCCEEEEEECCCCCCCHHHHHHHHHCCCHHEHH Q ss_conf 109997---3801578999996112----210378887139999748985479996689707898999874064621110 Q 537021.9.peg.7 106 SLRVYL---QDRSPLREIHKYFENR----QTQNYDKQGKIGLIFLRSQGKIEIEITLDHYPISPEIAFDLEHITGITKVV 178 (184) Q Consensus 106 gLRIfV---rD~~pL~sI~s~Le~~----~~~~~~G~G~V~Lvli~~~~~~EVEI~Lp~f~VSPqIagALKsIpGVl~Ve 178 (184) .-||.+ +-+..|.+|.+.|... .+.-.+.+|++...+++- .-.++.++..+|+.++||+.|. T Consensus 338 ~~Ri~~~h~N~Pg~l~~i~~~l~~~~~NI~~~~n~s~~~~aY~i~D~-----------~~~~~~~~~~~l~~i~~vi~vR 406 (409) T PRK11790 338 GHRLLHIHENRPGVLTAINQIFAEQGINIAAQYLQTDAEIGYVVIDV-----------DADDSEEALEALKAIPGTIRAR 406 (409) T ss_pred CEEEEEEECCCCCHHHHHHHHHHHCCCCHHHHHHCCCCCEEEEEEEC-----------CCCCCHHHHHHHHHCCCEEEEE T ss_conf 70699980798758999999998769998898620677654999978-----------8888599999998489889998 Q ss_pred HH Q ss_conf 30 Q 537021.9.peg.7 179 QV 180 (184) Q Consensus 179 Ei 180 (184) =| T Consensus 407 ~l 408 (409) T PRK11790 407 LL 408 (409) T ss_pred EE T ss_conf 60 No 52 >cd04879 ACT_3PGDH-like ACT_3PGDH-like CD includes the C-terminal ACT (regulatory) domain of D-3-phosphoglycerate dehydrogenase (3PGDH). ACT_3PGDH-like: The ACT_3PGDH-like CD includes the C-terminal ACT (regulatory) domain of D-3-phosphoglycerate dehydrogenase (3PGDH), with or without an extended C-terminal (xct) region found in various bacteria, archaea, fungi, and plants. 3PGDH is an enzyme that belongs to the D-isomer specific, 2-hydroxyacid dehydrogenase family and catalyzes the oxidation of D-3-phosphoglycerate to 3- phosphohydroxypyruvate, which is the first step in the biosynthesis of L-serine, using NAD+ as the oxidizing agent. In bacteria, 3PGDH is feedback controlled by the end product L-serine in an allosteric manner. In the Escherichia coli homotetrameric enzyme, the interface at adjacent ACT (regulatory) domains couples to create an extended beta-sheet. Each regulatory interface forms two serine-binding sites. The mechanism by which serine transmits inhibition to the active Probab=28.27 E-value=51 Score=15.03 Aligned_cols=59 Identities=15% Similarity=0.264 Sum_probs=37.6 Q ss_pred EEECCCHHHHHHHHHHHH------HHCCCCCCCCEEEEEECCCCCEEEEEECCCCCCCHHHHHHHHHCCCHHEHHH Q ss_conf 973801578999996112------2103788871399997489854799966897078989998740646211103 Q 537021.9.peg.7 110 YLQDRSPLREIHKYFENR------QTQNYDKQGKIGLIFLRSQGKIEIEITLDHYPISPEIAFDLEHITGITKVVQ 179 (184) Q Consensus 110 fVrD~~pL~sI~s~Le~~------~~~~~~G~G~V~Lvli~~~~~~EVEI~Lp~f~VSPqIagALKsIpGVl~VeE 179 (184) .-+-|..|..|.+.|... ......++|...+.++.-| =.+++++..+|+.+|||.+|.- T Consensus 6 ~~D~PGvi~~I~~~L~~~~iNI~~m~l~r~~~g~~A~~ii~vD-----------~~v~~~~l~~i~~~~~v~~vr~ 70 (71) T cd04879 6 HKDVPGVIGKVGTILGEHGINIAAMQVGRKEKGGIAYMVLDVD-----------SPVPEEVLEELKALPGIIRVRL 70 (71) T ss_pred ECCCCCHHHHHHHHHHHCCCCHHHHHHCCCCCCCEEEEEEEEC-----------CCCCHHHHHHHHCCCCEEEEEE T ss_conf 4397998999999999859788996412367997589999908-----------9899999999976998089997 No 53 >PRK08305 spoVFB dipicolinate synthase subunit B; Reviewed Probab=28.22 E-value=32 Score=16.31 Aligned_cols=18 Identities=39% Similarity=0.741 Sum_probs=8.7 Q ss_pred HHCCCCEEEEEEECCCCC Q ss_conf 524998289998410016 Q 537021.9.peg.7 64 LLREKKPYIIVVSQEDKL 81 (184) Q Consensus 64 LLe~gkpvii~VsAe~rl 81 (184) .|..||||+|.+|-++.| T Consensus 117 ~Lrn~kPVvia~stNd~l 134 (195) T PRK08305 117 TLRNQKPVVLAISTNDAL 134 (195) T ss_pred HHCCCCCEEEEEECCHHH T ss_conf 862799889998368678 No 54 >PRK00910 ribB 3,4-dihydroxy-2-butanone 4-phosphate synthase; Provisional Probab=27.05 E-value=54 Score=14.90 Aligned_cols=25 Identities=20% Similarity=0.284 Sum_probs=16.6 Q ss_pred HHHHCCCCEEEEEEECCCCCCCCCCCE Q ss_conf 875249982899984100167776426 Q 537021.9.peg.7 62 PRLLREKKPYIIVVSQEDKLSLQRDSI 88 (184) Q Consensus 62 ~~LLe~gkpvii~VsAe~rle~eg~~L 88 (184) .+-|..||||+++.+.. .|.||+=+ T Consensus 21 i~al~~Gk~Viv~Dded--REnEgDlv 45 (218) T PRK00910 21 LQALREGRGVLLLDDED--RENEGDII 45 (218) T ss_pred HHHHHCCCEEEEEECCC--CCCHHCEE T ss_conf 99998899599997899--87614168 No 55 >TIGR02720 pyruv_oxi_spxB pyruvate oxidase; InterPro: IPR014092 Members of this family are examples of pyruvate oxidase (1.2.3.3 from EC), an enzyme with FAD and TPP as cofactors that catalyses the reaction pyruvate + phosphate + O2 + H2O = acetyl phosphate + CO2 + H2O2. It should not be confused with pyruvate dehydrogenase (1.2.2.2 from EC) as in Escherichia coli PoxB, although the E. coli enzyme is closely homologous and has pyruvate oxidase as an alternate name.. Probab=27.04 E-value=54 Score=14.90 Aligned_cols=21 Identities=43% Similarity=0.599 Sum_probs=18.7 Q ss_pred EHHHHHHHHHHHHHCCCCEEEE Q ss_conf 0589999987875249982899 Q 537021.9.peg.7 52 QEQQQLEEEKPRLLREKKPYII 73 (184) Q Consensus 52 ~~~~~le~~~~~LLe~gkpvii 73 (184) |+.+++|..+ +.+-.||||+| T Consensus 509 ~l~~~~e~A~-~~~k~~~pVlI 529 (577) T TIGR02720 509 QLKAVFEEAK-KELKEGKPVLI 529 (577) T ss_pred HHHHHHHHHH-HHHHCCCCEEE T ss_conf 8899999999-99847897899 No 56 >pfam03927 NapD NapD protein. Uncharacterized protein involved in formation of periplasmic nitrate reductase. Probab=26.74 E-value=54 Score=14.86 Aligned_cols=69 Identities=14% Similarity=0.158 Sum_probs=44.5 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCEEEEEECCCCCCCHHHHHHHHHCCCHHEHHHHHCC Q ss_conf 109997380157899999611221037888713999974898547999668970789899987406462111030307 Q 537021.9.peg.7 106 SLRVYLQDRSPLREIHKYFENRQTQNYDKQGKIGLIFLRSQGKIEIEITLDHYPISPEIAFDLEHITGITKVVQVIHN 183 (184) Q Consensus 106 gLRIfVrD~~pL~sI~s~Le~~~~~~~~G~G~V~Lvli~~~~~~EVEI~Lp~f~VSPqIagALKsIpGVl~VeEi~~~ 183 (184) ++=|..+ |..+..+++.|..- .. +.++...+.|.--|-++=+...-.-+...+|+.+|||+.+.=+.|. T Consensus 6 SlVV~~~-Pe~~~~V~~~l~~~------pg--~Eih~~~~~GKiVVtiE~~~~~~~~~~i~~i~~l~GVlsa~lVYh~ 74 (78) T pfam03927 6 SLVVHVR-PERLAEVKAAILAL------PG--AEIHAVSPEGKLVVVLEGESQGAILDTIEAINALEGVLSASLVYHQ 74 (78) T ss_pred EEEEEEC-HHHHHHHHHHHHCC------CC--CEEECCCCCCEEEEEEEECCHHHHHHHHHHHHCCCCEEEEEEEEEE T ss_conf 8999968-77799999999749------99--6886379994299999728827999999998659980489876785 No 57 >cd02953 DsbDgamma DsbD gamma family; DsbD gamma is the C-terminal periplasmic domain of the bacterial protein DsbD. It contains a CXXC motif in a TRX fold and shuttles the reducing potential from the membrane domain (DsbD beta) to the N-terminal periplasmic domain (DsbD alpha). DsbD beta, a transmembrane domain comprising of eight helices, acquires its reducing potential from the cytoplasmic thioredoxin. DsbD alpha transfers the acquired reducing potential from DsbD gamma to target proteins such as the periplasmic protein disulphide isomerases, DsbC and DsbG. This flow of reducing potential from the cytoplasm through DsbD allows DsbC and DsbG to act as isomerases in the oxidizing environment of the bacterial periplasm. DsbD also transfers reducing potential from the cytoplasm to specific reductases in the periplasm which are involved in the maturation of cytochromes. Probab=22.94 E-value=64 Score=14.42 Aligned_cols=94 Identities=16% Similarity=0.115 Sum_probs=44.6 Q ss_pred HHHCCCCEEEEEEECCCCCCCCCCCEEEEECCCHHHHHHHH---CCCEEEEEEC-CCHHHHHHHHHHHHHHCCCCCCCCE Q ss_conf 75249982899984100167776426986504158999732---0110999738-0157899999611221037888713 Q 537021.9.peg.7 63 RLLREKKPYIIVVSQEDKLSLQRDSIRLLWVESLEKKSMEM---LDSLRVYLQD-RSPLREIHKYFENRQTQNYDKQGKI 138 (184) Q Consensus 63 ~LLe~gkpvii~VsAe~rle~eg~~LRi~~vepLD~aAa~~---~kgLRIfVrD-~~pL~sI~s~Le~~~~~~~~G~G~V 138 (184) +.+..||||+|-..|.-= +-=-.+ -..+-+-++.+... -..+++=+.+ ...+..+.+++ +-.|-- T Consensus 6 ~a~~~gKpVlvdf~A~WC--~~Ck~m-~~~~~~~~~v~~~~~~~~v~vkvD~t~~~~~~~~l~~~~--------~v~g~P 74 (104) T cd02953 6 QALAQGKPVFVDFTADWC--VTCKVN-EKVVFSDPEVQAALKKDVVLLRADWTKNDPEITALLKRF--------GVFGPP 74 (104) T ss_pred HHHHCCCCEEEEEECCCC--HHHHHH-HHHCCCCHHHHHHHHCCCCEEECCCCCCCHHHHHHHHHC--------CCCCCC T ss_conf 999829919999986758--888851-142048899999972477357636778998999999983--------995567 Q ss_pred EEEEECCCCCEEEEEECCCCCCCHHHHHHH Q ss_conf 999974898547999668970789899987 Q 537021.9.peg.7 139 GLIFLRSQGKIEIEITLDHYPISPEIAFDL 168 (184) Q Consensus 139 ~Lvli~~~~~~EVEI~Lp~f~VSPqIagAL 168 (184) .+++..+++.. ....+++|.-..++..+| T Consensus 75 t~~f~~~~g~~-~~~~l~G~~~~~~fl~~L 103 (104) T cd02953 75 TYLFYGPGGEP-EPLRLPGFLTADEFLEAL 103 (104) T ss_pred EEEEECCCCCC-CCCEEECCCCHHHHHHHH T ss_conf 79998799986-762612304899999975 No 58 >TIGR02034 CysN sulfate adenylyltransferase, large subunit; InterPro: IPR011779 Metabolic assimilation of sulphur from inorganic sulphate, requires sulphate activation by coupling to a nucleoside, for the production of high-energy nucleoside phosphosulphates. This pathway appears to be similar in all prokaryotic organisms. Activation is first achieved through sulphation of sulphate with ATP by sulphate adenylyltransferase (ATP sulphurylase) to produce 5'-phosphosulphate (APS), coupled by GTP hydrolysis. Subsequently, APS is phosphorylated by an APS kinase to produce 3'-phosphoadenosine-5'-phosphosulphate (PAPS) . In Escherichia coli, ATP sulphurylase is a heterodimer composed of two subunits encoded by cysD and cysN, with APS kinase encoded by cysC. These genes are located in a unidirectionally transcribed gene cluster, and have been shown to be required for the synthesis of sulphur-containing amino acids . Homologous to this E. coli activation pathway are nodPQH gene products found among members of the Rhizobiaceae family. These gene products have been shown to exhibit ATP sulphurase and APS kinase activity, yet are involved in Nod factor sulphation, and sulphation of other macromolecules . With members of the Rhizobiaceae family, nodQ often appears as a fusion of cysN (large subunit of ATP sulphurase) and cysC (APS kinase) , .; GO: 0016772 transferase activity transferring phosphorus-containing groups. Probab=22.13 E-value=30 Score=16.46 Aligned_cols=57 Identities=18% Similarity=0.194 Sum_probs=34.5 Q ss_pred CCCCEEEEEHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCEEEEECC------CHHHHHHHHCCC Q ss_conf 278752210589999987875249982899984100167776426986504------158999732011 Q 537021.9.peg.7 44 EDPNFLSLQEQQQLEEEKPRLLREKKPYIIVVSQEDKLSLQRDSIRLLWVE------SLEKKSMEMLDS 106 (184) Q Consensus 44 ~~~~~~~~~~~~~le~~~~~LLe~gkpvii~VsAe~rle~eg~~LRi~~ve------pLD~aAa~~~kg 106 (184) ...-..+-||.++|=-=.-+-|-||++|.| |.+...++-++..|. .|++.+|+.-.| T Consensus 304 D~~p~~~~~F~a~lVWm~~~~l~PG~~Y~l------k~g~~~v~a~v~~I~~~vdvn~~~~~~A~~L~G 366 (411) T TIGR02034 304 DSAPEVADQFAATLVWMADEPLLPGRSYLL------KLGTRKVRASVTAIKHKVDVNTLEKNAAKELEG 366 (411) T ss_pred CCCCCCCHHHEEEEEEECCCCCCCCCEEEE------EECCEEEEEEEEEEEEEECCCCCCHHHHHHHHC T ss_conf 677870120012566201134588953788------732427758998777997246431022123106 No 59 >KOG2364 consensus Probab=21.66 E-value=68 Score=14.26 Aligned_cols=13 Identities=23% Similarity=0.782 Sum_probs=6.3 Q ss_pred HHHCCCCEEEEEE Q ss_conf 7524998289998 Q 537021.9.peg.7 63 RLLREKKPYIIVV 75 (184) Q Consensus 63 ~LLe~gkpvii~V 75 (184) ++|..|+||++.+ T Consensus 266 RmLG~GRPFvlEl 278 (433) T KOG2364 266 RMLGTGRPFVLEL 278 (433) T ss_pred EECCCCCCEEEEC T ss_conf 7505898359985 No 60 >TIGR02398 gluc_glyc_Psyn glucosylglycerol-phosphate synthase; InterPro: IPR012764 Glucosylglycerol-phosphate synthase catalyzes the key step in the biosynthesis of the osmolyte glucosylglycerol. It is known in several cyanobacteria and in Pseudomonas anguilliseptica. The enzyme is closely related to the alpha,alpha-trehalose-phosphate synthase, likewise involved in osmolyte biosynthesis, of Escherichia coli and many other bacteria. A close homolog from Xanthomonas campestris is excluded from this family.; GO: 0016758 transferase activity transferring hexosyl groups, 0051473 glucosylglycerol biosynthetic process. Probab=21.31 E-value=29 Score=16.59 Aligned_cols=53 Identities=19% Similarity=0.251 Sum_probs=33.5 Q ss_pred CCEEEEEHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCEEEEECCCHHHHHHHHCCCEEEEEECCCHHHHHHHHH Q ss_conf 8752210589999987875249982899984100167776426986504158999732011099973801578999996 Q 537021.9.peg.7 46 PNFLSLQEQQQLEEEKPRLLREKKPYIIVVSQEDKLSLQRDSIRLLWVESLEKKSMEMLDSLRVYLQDRSPLREIHKYF 124 (184) Q Consensus 46 ~~~~~~~~~~~le~~~~~LLe~gkpvii~VsAe~rle~eg~~LRi~~vepLD~aAa~~~kgLRIfVrD~~pL~sI~s~L 124 (184) |+.+.|.--+-|=|++|+|+ ||=.+++|.+ +..+|||||=+--.-|+.+.-|. T Consensus 304 G~~E~L~ayERLLErrpeL~--gkV~Lv~acv------------------------~AA~gM~iY~e~Q~~IE~~vGRI 356 (495) T TIGR02398 304 GIKELLVAYERLLERRPELL--GKVVLVVACV------------------------KAASGMKIYEELQSEIERLVGRI 356 (495) T ss_pred CHHHHHHHHHHHHCCCCCCC--CCEEEEEEEH------------------------HHCCCCEEHHHHHHHHHHHHCCC T ss_conf 41577888887522582305--4134024442------------------------66067772687668899860642 No 61 >PRK10190 hypothetical protein; Provisional Probab=21.04 E-value=50 Score=15.11 Aligned_cols=104 Identities=14% Similarity=0.150 Sum_probs=48.5 Q ss_pred CEEEEEECCCC----------CHHCCCCCCC-EEEEEE-E-HHHHHHHHHHHHHHCCCCCEEEEEH----HHHHHHHHHH Q ss_conf 91899506887----------1100286676-148985-0-1344333235553102787522105----8999998787 Q 537021.9.peg.7 1 MRLIFFPKNNN----------RKGKKKPDGN-HDFLFL-E-DCYDHAIEMKLQEAEEDPNFLSLQE----QQQLEEEKPR 63 (184) Q Consensus 1 ~~~~~~~~~~~----------~~~~~~~~g~-~~~~~~-~-~~~~~~~~~~~~~~~~~~~~~~~~~----~~~le~~~~~ 63 (184) |||.|||++.| |.|+..|.|- .-+..- . -++----.| -+++.+.|-.+-..+ .--|-.-+. T Consensus 105 mRLYYfp~~~~~V~t~PIGIGr~g~~TP~g~~t~i~~K~~nPtW~Pp~si-r~e~~~~G~~LP~vvP~GPdNPLG~~Al- 182 (310) T PRK10190 105 MRLYYYPPDSNTVEVFPIGIGQAGRETPRNWVTTVERKQEAPTWTPTPNT-RREYAKRGESLPAFVPAGPDNPMGLYAI- 182 (310) T ss_pred HEEEEECCCCCEEEEEEEECCCCCCCCCCCCEEEEEECCCCCCCCCCHHH-HHHHHHCCCCCCCCCCCCCCCCHHHHHH- T ss_conf 02378559998799971464658887888754899862569987797778-8999971998877779999895777888- Q ss_pred HHCCCCEEEEEEECCCCCCCCCCCEEEEE------CCCHHHHHHHHCCCEEEEEE Q ss_conf 52499828999841001677764269865------04158999732011099973 Q 537021.9.peg.7 64 LLREKKPYIIVVSQEDKLSLQRDSIRLLW------VESLEKKSMEMLDSLRVYLQ 112 (184) Q Consensus 64 LLe~gkpvii~VsAe~rle~eg~~LRi~~------vepLD~aAa~~~kgLRIfVr 112 (184) .| |++++| -.-++ +-|+.+|+-+ -+-.+..-..+..|-+|-+= T Consensus 183 ~l--~~~YlI--HGTN~--p~gIG~rvS~GCIRl~peDI~~Lf~~V~vGT~V~iI 231 (310) T PRK10190 183 YI--GRLYAI--HGTNA--NFGIGLRVSQGCIRLRNDDIKYLFDNVPVGTRVQII 231 (310) T ss_pred HH--CCCEEE--ECCCC--CCCCCCHHCCCCCCCCHHHHHHHHHCCCCCCEEEEE T ss_conf 62--786068--78899--975021106875144878699998429998869997 No 62 >TIGR00833 actII Transport protein; InterPro: IPR004707 Characterised members of the Resistance-Nodulation-Cell Division (RND) superfamily all probably catalyze substrate efflux via an H+ antiport mechanism. These proteins are found ubiquitously in bacteria, archaea and eukaryotes. This group includes the Streptomyces coelicolor ActII3 protein, which may play a role in drug resistance, and the Mycobacterium tuberculosis MmpL7 protein, which catalyzes export of an outer membrane lipid, phthiocerol dimycocerosate.; GO: 0016021 integral to membrane. Probab=20.30 E-value=57 Score=14.71 Aligned_cols=60 Identities=8% Similarity=0.131 Sum_probs=39.0 Q ss_pred CCCHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCEEEEEECC-CCCCCHHHHHHHHHCCCHHEHHHHHCC Q ss_conf 80157899999611221037888713999974898547999668-970789899987406462111030307 Q 537021.9.peg.7 113 DRSPLREIHKYFENRQTQNYDKQGKIGLIFLRSQGKIEIEITLD-HYPISPEIAFDLEHITGITKVVQVIHN 183 (184) Q Consensus 113 D~~pL~sI~s~Le~~~~~~~~G~G~V~Lvli~~~~~~EVEI~Lp-~f~VSPqIagALKsIpGVl~VeEi~~~ 183 (184) .....+.+.+++ +.+.+.-..+++..| =++.=| .+..=-++..+|+++|||..|+++-|. T Consensus 419 s~~G~~a~~RHF-------p~~~~~P~~~~Ik~D----hD~R~Pa~l~~i~~~~r~I~~~pgi~~V~~~~~~ 479 (992) T TIGR00833 419 SVQGYEAADRHF-------PANSMEPMVVMIKSD----HDLRNPAALADIERLEREIKAVPGIVMVQELQRP 479 (992) T ss_pred HHHHHHHHHCCC-------CCCCCCCEEEEEECC----CCCCCHHHHHHHHHHHHHHHHCCCCEECCCCCCC T ss_conf 678999860137-------666788248998378----8767838899999999998647882223556676 No 63 >PRK03353 ribB 3,4-dihydroxy-2-butanone 4-phosphate synthase; Provisional Probab=20.21 E-value=73 Score=14.07 Aligned_cols=24 Identities=21% Similarity=0.313 Sum_probs=15.7 Q ss_pred HHHCCCCEEEEEEECCCCCCCCCCCE Q ss_conf 75249982899984100167776426 Q 537021.9.peg.7 63 RLLREKKPYIIVVSQEDKLSLQRDSI 88 (184) Q Consensus 63 ~LLe~gkpvii~VsAe~rle~eg~~L 88 (184) +=|..|+||+++.+.. .|.||+=+ T Consensus 21 ~al~~G~~Viv~Dd~d--REnEgDlv 44 (217) T PRK03353 21 DALREGRGVLVLDDED--RENEGDMI 44 (217) T ss_pred HHHHCCCEEEEEECCC--CCCCCCEE T ss_conf 9998799599983899--87640689 Done!