Query gi|254780267|ref|YP_003064680.1| acetyl-CoA carboxylase biotin carboxyl carrier protein subunit [Candidatus Liberibacter asiaticus str. psy62] Match_columns 171 No_of_seqs 136 out of 3810 Neff 8.2 Searched_HMMs 39220 Date Tue May 24 11:09:42 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780267.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK06302 acetyl-CoA carboxylas 100.0 0 0 325.6 16.0 155 8-165 1-155 (155) 2 COG0511 AccB Biotin carboxyl c 100.0 3.4E-35 8.7E-40 234.2 11.6 140 13-166 1-140 (140) 3 PRK07051 hypothetical protein; 100.0 1.1E-32 2.9E-37 218.8 8.3 78 89-166 3-80 (80) 4 TIGR00531 BCCP acetyl-CoA carb 100.0 1.1E-31 2.7E-36 212.9 9.5 159 7-166 1-159 (159) 5 PRK12999 pyruvate carboxylase; 99.9 3E-24 7.6E-29 167.7 9.9 78 82-166 1070-1147(1147) 6 PRK05641 putative acetyl-CoA c 99.9 3.7E-24 9.4E-29 167.1 9.8 72 86-164 76-147 (148) 7 PRK06549 acetyl-CoA carboxylas 99.9 1.8E-23 4.7E-28 162.9 12.8 76 82-164 56-131 (132) 8 PRK09282 pyruvate carboxylase 99.9 2.4E-23 6.1E-28 162.2 11.7 78 81-165 503-580 (580) 9 TIGR02712 urea_carbox urea car 99.9 1.2E-21 3.1E-26 151.9 7.7 79 80-165 1148-1226(1226) 10 PRK08225 acetyl-CoA carboxylas 99.8 6.7E-21 1.7E-25 147.4 7.7 69 90-165 2-70 (70) 11 TIGR01108 oadA oxaloacetate de 99.8 1.4E-20 3.6E-25 145.3 7.4 71 84-161 546-616 (616) 12 PRK05889 putative acetyl-CoA c 99.8 3.2E-20 8.1E-25 143.2 7.1 68 90-164 3-70 (71) 13 TIGR01235 pyruv_carbox pyruvat 99.8 6.2E-20 1.6E-24 141.5 8.4 78 81-165 1092-1169(1169) 14 cd06850 biotinyl_domain The bi 99.8 5.9E-20 1.5E-24 141.6 7.4 67 91-164 1-67 (67) 15 COG1038 PycA Pyruvate carboxyl 99.8 2.5E-19 6.3E-24 137.8 8.6 80 80-166 1070-1149(1149) 16 COG4770 Acetyl/propionyl-CoA c 99.7 3.3E-18 8.3E-23 131.0 7.0 75 85-166 571-645 (645) 17 pfam00364 Biotin_lipoyl Biotin 99.7 1.4E-17 3.5E-22 127.2 6.3 72 91-164 2-73 (73) 18 cd06663 Biotinyl_lipoyl_domain 99.6 1.7E-15 4.2E-20 114.6 6.9 56 109-164 18-73 (73) 19 KOG0369 consensus 99.5 1.3E-14 3.2E-19 109.2 6.5 82 78-166 1095-1176(1176) 20 KOG0238 consensus 99.5 5.8E-15 1.5E-19 111.3 4.5 74 85-165 597-670 (670) 21 cd06849 lipoyl_domain Lipoyl d 99.3 2.3E-12 6E-17 95.5 6.3 56 109-164 19-74 (74) 22 PRK11855 dihydrolipoamide acet 99.3 9.7E-11 2.5E-15 85.6 11.3 61 109-169 136-196 (549) 23 PTZ00144 dihydrolipoamide succ 99.2 4.4E-11 1.1E-15 87.7 7.1 61 109-169 76-136 (430) 24 PRK11855 dihydrolipoamide acet 99.2 5.3E-11 1.4E-15 87.2 6.8 61 109-169 21-81 (549) 25 PRK05704 dihydrolipoamide succ 99.2 6.6E-11 1.7E-15 86.6 6.7 61 109-169 21-81 (406) 26 TIGR01348 PDHac_trf_long pyruv 99.1 5.6E-10 1.4E-14 81.0 9.7 83 87-170 204-286 (655) 27 COG0508 AceF Pyruvate/2-oxoglu 99.1 4.4E-10 1.1E-14 81.7 7.2 61 109-169 21-81 (404) 28 PRK11854 aceF dihydrolipoamide 99.1 1.1E-09 2.8E-14 79.2 9.2 60 109-168 122-181 (630) 29 PRK11856 branched-chain alpha- 99.1 5E-10 1.3E-14 81.3 6.8 57 109-165 20-76 (324) 30 PRK11854 aceF dihydrolipoamide 99.0 8E-10 2.1E-14 80.0 6.8 61 109-169 223-283 (630) 31 TIGR01347 sucB 2-oxoglutarate 99.0 6.5E-10 1.7E-14 80.6 4.1 55 112-166 22-76 (435) 32 TIGR01348 PDHac_trf_long pyruv 98.9 5E-09 1.3E-13 75.2 6.0 60 109-168 19-78 (655) 33 KOG0559 consensus 98.8 4.2E-09 1.1E-13 75.7 4.9 57 111-167 93-149 (457) 34 PRK11578 macrolide transporter 98.3 9.6E-07 2.4E-11 61.4 4.6 74 86-166 35-196 (347) 35 KOG0368 consensus 98.3 2.2E-06 5.7E-11 59.1 6.4 74 87-168 683-756 (2196) 36 PRK03598 hypothetical protein; 98.2 4.3E-06 1.1E-10 57.4 6.0 35 134-168 204-238 (331) 37 PRK10476 multidrug resistance 98.1 1.4E-05 3.4E-10 54.4 6.9 37 133-169 210-246 (348) 38 PRK10559 p-hydroxybenzoic acid 98.1 1.1E-05 2.8E-10 54.9 6.2 73 90-169 48-190 (310) 39 KOG0557 consensus 98.0 1.5E-05 3.7E-10 54.2 6.3 59 112-170 60-119 (470) 40 cd06251 M14_ASTE_ASPA_like_1 A 98.0 4.8E-05 1.2E-09 51.0 8.8 68 87-164 217-286 (287) 41 cd06252 M14_ASTE_ASPA_like_2 A 98.0 4.2E-05 1.1E-09 51.4 8.5 70 87-166 242-315 (316) 42 cd06253 M14_ASTE_ASPA_like_3 A 98.0 5.1E-05 1.3E-09 50.9 8.8 68 87-164 227-297 (298) 43 PRK11556 multidrug efflux syst 97.8 3.7E-05 9.4E-10 51.8 5.7 76 85-167 83-231 (415) 44 PRK09578 periplasmic multidrug 97.8 9.6E-05 2.4E-09 49.2 7.3 75 86-167 60-207 (385) 45 PRK11892 pyruvate dehydrogenas 97.8 8.3E-05 2.1E-09 49.6 6.9 62 108-169 20-82 (464) 46 TIGR02927 SucB_Actino 2-oxoglu 97.8 3.4E-05 8.6E-10 52.0 4.8 74 85-165 131-210 (607) 47 PRK09859 multidrug efflux syst 97.8 0.00012 3.1E-09 48.6 7.5 72 88-166 60-204 (385) 48 TIGR03309 matur_yqeB selenium- 97.7 0.00026 6.6E-09 46.6 7.9 70 87-169 162-231 (256) 49 PRK00624 glycine cleavage syst 97.7 7.7E-05 2E-09 49.8 4.9 58 112-169 39-103 (113) 50 TIGR03077 not_gcvH glycine cle 97.6 7.8E-05 2E-09 49.8 4.8 58 112-169 38-102 (110) 51 PRK09783 copper/silver efflux 97.6 0.00046 1.2E-08 45.1 8.3 72 88-166 120-240 (407) 52 KOG0558 consensus 97.6 4.6E-05 1.2E-09 51.2 3.0 61 109-169 83-143 (474) 53 TIGR01349 PDHac_trf_mito pyruv 97.6 0.00022 5.6E-09 47.0 6.2 55 112-166 149-204 (584) 54 PRK05352 Na(+)-translocating N 97.6 0.00022 5.5E-09 47.1 6.1 57 109-170 43-99 (448) 55 cd06250 M14_PaAOTO_like An unc 97.6 0.00079 2E-08 43.6 8.9 68 87-164 287-358 (359) 56 PRK08225 acetyl-CoA carboxylas 97.5 0.00013 3.2E-09 48.5 4.2 34 133-166 1-34 (70) 57 cd06254 M14_ASTE_ASPA_like_4 A 97.5 0.00067 1.7E-08 44.1 7.5 66 86-161 220-287 (288) 58 PRK09439 glucose-specific PTS 97.4 0.00059 1.5E-08 44.4 6.9 57 109-166 105-169 (169) 59 PRK11578 macrolide transporter 97.4 0.00051 1.3E-08 44.8 6.6 28 88-122 159-186 (347) 60 cd06848 GCS_H Glycine cleavage 97.4 0.00022 5.6E-09 47.0 4.2 40 112-151 37-76 (96) 61 PRK10559 p-hydroxybenzoic acid 97.4 0.00022 5.7E-09 47.0 4.2 34 88-128 153-186 (310) 62 PRK06748 hypothetical protein; 97.4 0.00064 1.6E-08 44.2 6.5 71 90-167 6-77 (84) 63 PRK10476 multidrug resistance 97.4 0.00025 6.3E-09 46.7 4.3 37 88-131 209-245 (348) 64 PRK10255 N-acetyl glucosamine 97.4 0.0012 2.9E-08 42.7 7.6 68 89-167 498-604 (648) 65 TIGR01945 rnfC electron transp 97.3 0.00014 3.4E-09 48.3 2.6 60 108-168 43-103 (444) 66 cd06255 M14_ASTE_ASPA_like_5 A 97.3 0.0011 2.7E-08 42.9 7.2 53 87-147 229-283 (293) 67 TIGR00527 gcvH glycine cleavag 97.3 0.00013 3.3E-09 48.5 2.3 66 103-168 32-108 (132) 68 COG0509 GcvH Glycine cleavage 97.3 0.00023 5.9E-09 46.9 3.4 57 112-168 47-110 (131) 69 COG3608 Predicted deacylase [G 97.3 0.0013 3.4E-08 42.3 7.2 66 89-164 256-324 (331) 70 pfam06898 YqfD Putative stage 97.3 0.0034 8.7E-08 39.8 9.3 57 91-149 191-249 (383) 71 pfam00358 PTS_EIIA_1 phosphoen 97.3 0.00097 2.5E-08 43.1 6.3 26 142-167 84-109 (133) 72 TIGR02927 SucB_Actino 2-oxoglu 97.3 0.00024 6E-09 46.8 3.2 57 112-168 24-80 (607) 73 PRK01202 glycine cleavage syst 97.2 0.00035 8.9E-09 45.8 3.8 57 112-168 45-108 (127) 74 COG4656 RnfC Predicted NADH:ub 97.2 0.00076 1.9E-08 43.8 5.4 57 108-168 45-104 (529) 75 PRK05889 putative acetyl-CoA c 97.2 0.00035 8.9E-09 45.8 3.6 33 133-165 2-34 (71) 76 PRK12784 hypothetical protein; 97.2 0.0013 3.3E-08 42.3 6.6 70 91-167 7-77 (84) 77 PRK09824 beta-glucoside-specif 97.2 0.0016 4E-08 41.9 6.8 55 110-164 562-623 (625) 78 PRK05035 electron transport co 97.2 0.002 5E-08 41.3 7.1 61 108-169 55-116 (725) 79 PRK03598 hypothetical protein; 97.2 0.00056 1.4E-08 44.6 4.3 37 87-130 201-237 (331) 80 PRK13380 glycine cleavage syst 97.1 0.00042 1.1E-08 45.3 3.5 58 112-169 46-110 (132) 81 cd00210 PTS_IIA_glc PTS_IIA, P 97.1 0.00093 2.4E-08 43.2 5.2 29 138-166 75-103 (124) 82 pfam01597 GCV_H Glycine cleava 97.1 0.00052 1.3E-08 44.8 3.6 57 112-168 39-102 (122) 83 PRK05641 putative acetyl-CoA c 97.1 0.00076 2E-08 43.7 4.3 33 134-166 80-112 (148) 84 TIGR00998 8a0101 efflux pump m 97.0 0.00077 2E-08 43.7 3.6 37 90-133 245-282 (379) 85 cd06850 biotinyl_domain The bi 97.0 0.00096 2.4E-08 43.1 4.1 31 136-166 2-32 (67) 86 COG0511 AccB Biotin carboxyl c 97.0 0.001 2.6E-08 43.0 4.2 35 132-166 69-103 (140) 87 COG1566 EmrA Multidrug resista 97.0 0.00084 2.1E-08 43.5 3.7 34 135-168 55-88 (352) 88 PRK09578 periplasmic multidrug 96.9 0.0022 5.6E-08 40.9 5.5 44 123-167 54-97 (385) 89 PRK06549 acetyl-CoA carboxylas 96.9 0.0014 3.5E-08 42.2 4.1 32 135-166 65-96 (132) 90 COG2190 NagE Phosphotransferas 96.8 0.003 7.6E-08 40.2 5.7 57 109-165 90-154 (156) 91 PRK11556 multidrug efflux syst 96.8 0.0025 6.3E-08 40.7 5.1 46 122-168 77-122 (415) 92 pfam04952 AstE_AspA Succinylgl 96.6 0.0091 2.3E-07 37.2 6.8 70 87-166 215-288 (289) 93 PRK09282 pyruvate carboxylase 96.5 0.0032 8.1E-08 40.0 4.0 35 132-166 510-544 (580) 94 pfam05896 NQRA Na(+)-transloca 96.5 0.0036 9.1E-08 39.7 4.2 40 109-149 42-81 (257) 95 TIGR02876 spore_yqfD sporulati 96.5 0.0079 2E-07 37.6 6.0 91 4-127 137-231 (406) 96 TIGR01995 PTS-II-ABC-beta PTS 96.5 0.0036 9.1E-08 39.7 4.1 67 88-165 508-613 (660) 97 TIGR01108 oadA oxaloacetate de 96.5 0.0022 5.5E-08 41.0 2.9 36 131-166 549-584 (616) 98 PRK09859 multidrug efflux syst 96.4 0.007 1.8E-07 37.9 5.3 32 135-166 63-94 (385) 99 TIGR00830 PTBA PTS system, glu 96.4 0.0049 1.3E-07 38.8 4.3 22 145-166 90-111 (129) 100 KOG3373 consensus 96.4 0.0023 6E-08 40.8 2.6 72 96-167 65-150 (172) 101 TIGR01843 type_I_hlyD type I s 96.4 0.0026 6.7E-08 40.5 2.8 35 135-169 45-79 (434) 102 COG4770 Acetyl/propionyl-CoA c 96.2 0.007 1.8E-07 37.9 4.1 32 135-166 577-608 (645) 103 PRK12999 pyruvate carboxylase; 96.1 0.0063 1.6E-07 38.2 3.5 32 135-166 1079-1110(1147) 104 TIGR00998 8a0101 efflux pump m 95.9 0.0064 1.6E-07 38.1 3.0 33 134-166 245-277 (379) 105 TIGR01349 PDHac_trf_mito pyruv 95.8 0.022 5.7E-07 34.8 5.5 56 111-166 20-76 (584) 106 TIGR01235 pyruv_carbox pyruvat 95.8 0.0084 2.1E-07 37.4 3.2 31 135-165 1102-1132(1169) 107 TIGR02645 ARCH_P_rylase putati 95.5 0.0092 2.4E-07 37.2 2.5 22 109-130 456-477 (499) 108 TIGR01843 type_I_hlyD type I s 95.5 0.019 4.8E-07 35.3 3.9 34 88-127 277-310 (434) 109 COG0845 AcrA Membrane-fusion p 95.2 0.074 1.9E-06 31.7 6.2 47 120-167 54-100 (372) 110 PRK07051 hypothetical protein; 95.0 0.023 5.8E-07 34.8 3.1 34 88-128 46-79 (80) 111 COG1038 PycA Pyruvate carboxyl 94.8 0.12 3.1E-06 30.4 6.3 33 134-166 1080-1112(1149) 112 TIGR01936 nqrA NADH:ubiquinone 94.6 0.025 6.4E-07 34.5 2.5 41 110-151 43-83 (466) 113 TIGR02712 urea_carbox urea car 94.5 0.047 1.2E-06 32.9 3.7 32 135-166 1159-1190(1226) 114 COG1566 EmrA Multidrug resista 94.5 0.051 1.3E-06 32.7 3.9 31 89-126 208-238 (352) 115 PRK09824 beta-glucoside-specif 94.4 0.046 1.2E-06 32.9 3.4 70 87-167 474-582 (625) 116 cd06663 Biotinyl_lipoyl_domain 94.2 0.05 1.3E-06 32.7 3.4 33 88-127 41-73 (73) 117 PRK10255 N-acetyl glucosamine 94.1 0.078 2E-06 31.6 4.2 78 89-166 535-648 (648) 118 cd06849 lipoyl_domain Lipoyl d 94.1 0.057 1.4E-06 32.4 3.5 33 88-127 42-74 (74) 119 PTZ00144 dihydrolipoamide succ 94.1 0.055 1.4E-06 32.5 3.3 38 86-130 97-134 (430) 120 COG1726 NqrA Na+-transporting 93.9 0.058 1.5E-06 32.3 3.2 39 110-151 43-83 (447) 121 TIGR00999 8a0102 Membrane Fusi 93.5 0.066 1.7E-06 32.0 2.8 32 135-166 106-137 (284) 122 KOG0238 consensus 93.3 0.089 2.3E-06 31.2 3.3 31 135-165 603-633 (670) 123 pfam00364 Biotin_lipoyl Biotin 93.2 0.066 1.7E-06 32.0 2.4 33 88-127 41-73 (73) 124 PRK09603 DNA-directed RNA poly 93.0 0.12 3.1E-06 30.3 3.6 57 106-162 2612-2721(2890) 125 PRK04350 thymidine phosphoryla 92.4 0.28 7.2E-06 28.2 4.8 20 148-167 456-475 (502) 126 PRK11637 hypothetical protein; 92.2 0.19 4.9E-06 29.2 3.8 43 87-129 313-376 (404) 127 pfam09891 DUF2118 Uncharacteri 92.0 0.32 8.1E-06 27.8 4.7 41 110-150 93-134 (149) 128 PRK09439 glucose-specific PTS 92.0 0.29 7.5E-06 28.1 4.5 69 88-167 19-126 (169) 129 COG2190 NagE Phosphotransferas 91.9 0.17 4.3E-06 29.5 3.2 66 90-166 6-110 (156) 130 TIGR02971 heterocyst_DevB ABC 91.9 0.12 3.1E-06 30.3 2.5 23 108-130 30-53 (363) 131 KOG0368 consensus 91.8 0.51 1.3E-05 26.6 5.6 53 114-166 666-718 (2196) 132 PRK05704 dihydrolipoamide succ 91.8 0.17 4.3E-06 29.5 3.1 36 88-130 44-79 (406) 133 PRK12784 hypothetical protein; 91.2 0.17 4.4E-06 29.4 2.7 38 132-169 4-41 (84) 134 PRK09783 copper/silver efflux 90.8 0.38 9.6E-06 27.4 4.1 30 135-164 123-153 (407) 135 TIGR01995 PTS-II-ABC-beta PTS 90.7 0.2 5.1E-06 29.1 2.6 57 109-165 594-660 (660) 136 TIGR02645 ARCH_P_rylase putati 90.6 0.13 3.3E-06 30.2 1.6 45 125-169 411-479 (499) 137 PRK05305 phosphatidylserine de 90.4 1.5 3.8E-05 23.7 8.3 67 90-164 144-210 (214) 138 PRK02597 DNA-directed RNA poly 90.0 0.22 5.5E-06 28.8 2.3 21 105-125 950-970 (1295) 139 PRK06748 hypothetical protein; 89.8 0.28 7.1E-06 28.2 2.7 40 130-169 1-41 (84) 140 TIGR03327 AMP_phos AMP phospho 89.7 0.75 1.9E-05 25.6 4.8 18 148-165 453-470 (500) 141 PRK11856 branched-chain alpha- 89.6 0.33 8.3E-06 27.8 2.9 32 89-127 44-75 (324) 142 COG0508 AceF Pyruvate/2-oxoglu 89.3 0.32 8.2E-06 27.8 2.7 35 89-130 45-79 (404) 143 TIGR02013 rpoB DNA-directed RN 88.7 0.2 5.1E-06 29.0 1.3 18 108-125 850-867 (1449) 144 PRK06302 acetyl-CoA carboxylas 88.5 0.37 9.4E-06 27.4 2.6 32 90-128 124-155 (155) 145 pfam06898 YqfD Putative stage 88.4 0.49 1.3E-05 26.7 3.2 29 134-162 190-225 (383) 146 TIGR02386 rpoC_TIGR DNA-direct 88.0 2.2 5.6E-05 22.7 6.3 56 110-168 1136-1212(1552) 147 TIGR02876 spore_yqfD sporulati 87.3 0.54 1.4E-05 26.4 2.8 29 135-163 193-228 (406) 148 PRK05820 deoA thymidine phosph 87.0 1.4 3.5E-05 24.0 4.8 21 110-130 385-405 (440) 149 KOG0369 consensus 86.7 0.57 1.4E-05 26.3 2.7 32 134-165 1107-1138(1176) 150 PRK06078 pyrimidine-nucleoside 86.7 1.5 3.8E-05 23.8 4.8 21 110-130 380-400 (434) 151 TIGR02644 Y_phosphoryl pyrimid 86.4 0.6 1.5E-05 26.2 2.6 30 92-130 377-406 (425) 152 TIGR02388 rpoC2_cyan DNA-direc 85.6 0.74 1.9E-05 25.6 2.8 56 107-163 918-991 (1252) 153 TIGR01347 sucB 2-oxoglutarate 85.5 0.53 1.4E-05 26.5 2.0 33 89-128 43-75 (435) 154 KOG0559 consensus 84.5 0.65 1.7E-05 26.0 2.1 36 88-130 114-149 (457) 155 TIGR00830 PTBA PTS system, glu 84.5 0.51 1.3E-05 26.6 1.5 19 109-127 91-109 (129) 156 pfam05896 NQRA Na(+)-transloca 84.2 0.79 2E-05 25.4 2.4 31 137-167 33-63 (257) 157 pfam02666 PS_Dcarbxylase Phosp 84.1 1.5 3.9E-05 23.7 3.8 62 91-162 138-200 (201) 158 cd01134 V_A-ATPase_A V/A-type 82.5 4.1 0.00011 21.1 5.9 57 109-167 52-111 (369) 159 cd00210 PTS_IIA_glc PTS_IIA, P 82.3 0.79 2E-05 25.4 1.8 24 109-132 83-106 (124) 160 KOG2419 consensus 82.0 0.9 2.3E-05 25.1 2.0 31 5-36 712-746 (975) 161 pfam00358 PTS_EIIA_1 phosphoen 81.7 0.86 2.2E-05 25.2 1.8 21 109-129 88-108 (133) 162 pfam07831 PYNP_C Pyrimidine nu 81.0 0.98 2.5E-05 24.9 1.9 29 94-131 29-57 (75) 163 pfam01333 Apocytochr_F_C Apocy 77.8 2 5.1E-05 23.0 2.7 19 107-125 43-61 (118) 164 TIGR01043 ATP_syn_A_arch ATP s 74.7 3.4 8.6E-05 21.6 3.1 58 109-169 120-181 (584) 165 TIGR01730 RND_mfp efflux trans 73.8 4.2 0.00011 21.0 3.5 33 135-167 136-170 (333) 166 COG0739 NlpD Membrane proteins 72.2 3.1 7.9E-05 21.8 2.4 19 109-127 215-233 (277) 167 pfam06572 DUF1131 Protein of u 70.3 9.1 0.00023 19.0 5.6 43 112-154 70-115 (192) 168 CHL00006 consensus 69.8 7.1 0.00018 19.6 3.9 50 107-164 1054-1124(1372) 169 PRK03140 phosphatidylserine de 69.0 5.5 0.00014 20.3 3.1 18 84-101 133-150 (259) 170 PRK11649 hypothetical protein; 68.9 3.8 9.7E-05 21.3 2.3 22 109-130 343-364 (418) 171 COG0213 DeoA Thymidine phospho 67.9 5 0.00013 20.6 2.7 19 111-129 382-400 (435) 172 COG1155 NtpA Archaeal/vacuolar 65.6 11 0.00029 18.4 5.1 56 109-165 119-176 (588) 173 COG4072 Uncharacterized protei 64.7 11 0.00027 18.6 3.9 40 109-148 104-144 (161) 174 PRK10718 hypothetical protein; 63.8 12 0.00031 18.2 5.4 42 112-154 70-114 (191) 175 PRK06148 hypothetical protein; 61.3 6.3 0.00016 19.9 2.3 43 88-130 452-525 (1015) 176 pfam02749 QRPTase_N Quinolinat 60.3 10 0.00026 18.7 3.2 22 146-167 48-69 (88) 177 KOG1668 consensus 58.6 7.5 0.00019 19.5 2.2 35 95-134 172-206 (231) 178 PRK01497 DNA-directed RNA poly 57.3 16 0.00041 17.5 4.9 17 110-126 994-1010(1384) 179 PRK06149 hypothetical protein; 57.1 9.2 0.00023 19.0 2.5 39 88-126 441-493 (972) 180 PRK10334 mechanosensitive chan 53.8 18 0.00047 17.2 4.0 36 112-147 130-165 (285) 181 pfam03829 PTSIIA_gutA PTS syst 51.6 8.2 0.00021 19.3 1.5 25 93-119 93-117 (117) 182 PRK06106 nicotinate-nucleotide 51.4 20 0.00051 16.9 3.5 19 110-128 72-90 (281) 183 PRK10871 nlpD lipoprotein NlpD 51.4 20 0.00051 16.9 6.2 25 144-168 324-348 (374) 184 pfam01551 Peptidase_M23 Peptid 49.3 22 0.00055 16.7 7.2 20 109-128 54-73 (96) 185 PRK04192 V-type ATP synthase s 46.3 21 0.00053 16.8 2.9 40 109-148 121-163 (585) 186 TIGR00078 nadC nicotinate-nucl 43.7 15 0.00039 17.6 1.9 21 109-129 63-83 (276) 187 cd01568 QPRTase_NadC Quinolina 42.0 28 0.00071 16.0 3.9 25 142-166 56-80 (269) 188 PRK08385 nicotinate-nucleotide 41.7 27 0.0007 16.1 2.9 19 147-165 63-81 (279) 189 COG4942 Membrane-bound metallo 41.1 29 0.00074 15.9 5.2 40 124-167 354-393 (420) 190 PRK05848 nicotinate-nucleotide 39.6 31 0.00078 15.8 2.9 19 147-165 61-79 (272) 191 pfam00924 MS_channel Mechanose 38.3 32 0.00082 15.7 3.5 33 113-145 60-92 (203) 192 cd01572 QPRTase Quinolinate ph 38.2 32 0.00082 15.7 3.6 22 144-165 59-80 (268) 193 PRK06096 molybdenum transport 37.5 33 0.00084 15.6 3.2 20 146-165 64-83 (284) 194 PRK06543 nicotinate-nucleotide 36.4 34 0.00087 15.5 3.8 21 109-129 68-88 (281) 195 PRK07896 nicotinate-nucleotide 36.2 34 0.00088 15.5 3.9 20 109-128 78-97 (288) 196 cd01573 modD_like ModD; Quinol 35.0 36 0.00092 15.3 2.9 18 148-165 61-78 (272) 197 PRK09016 quinolinate phosphori 34.3 37 0.00094 15.3 3.9 20 109-128 88-107 (296) 198 PRK08072 nicotinate-nucleotide 34.1 37 0.00095 15.3 3.5 19 110-128 68-86 (277) 199 COG0157 NadC Nicotinate-nucleo 33.5 38 0.00097 15.2 3.4 19 111-129 69-87 (280) 200 CHL00075 rpl21 ribosomal prote 32.1 40 0.001 15.1 3.2 10 151-160 45-54 (110) 201 PRK06559 nicotinate-nucleotide 29.8 34 0.00088 15.5 1.9 26 1-26 1-26 (290) 202 PRK07428 nicotinate-nucleotide 29.8 44 0.0011 14.8 2.8 18 111-128 74-91 (285) 203 PRK06978 nicotinate-nucleotide 27.9 48 0.0012 14.6 3.5 20 109-128 79-98 (288) 204 pfam00166 Cpn10 Chaperonin 10 26.9 45 0.0011 14.8 2.0 11 117-127 56-66 (92) 205 TIGR01000 bacteriocin_acc bact 26.8 37 0.00095 15.3 1.6 24 84-107 330-353 (476) 206 pfam12046 DUF3529 Protein of u 26.3 31 0.0008 15.7 1.1 39 91-129 116-155 (173) 207 PRK05742 nicotinate-nucleotide 26.3 51 0.0013 14.4 3.2 18 111-128 71-88 (277) 208 TIGR02643 T_phosphoryl thymidi 26.0 25 0.00063 16.3 0.6 21 110-130 387-407 (440) 209 KOG3358 consensus 25.9 47 0.0012 14.7 2.0 66 82-160 63-134 (211) 210 PRK10027 cryptic adenine deami 25.7 52 0.0013 14.4 2.7 22 12-33 263-284 (588) 211 CHL00117 rpoC2 RNA polymerase 25.3 53 0.0014 14.3 4.3 17 111-127 410-426 (1350) 212 PRK10929 hypothetical protein; 25.2 53 0.0014 14.3 2.8 32 112-160 936-967 (1109) 213 PRK10377 glucitol/sorbitol-spe 25.1 45 0.0012 14.8 1.8 24 94-119 94-117 (120) 214 PRK02173 DNA-directed RNA poly 24.9 54 0.0014 14.3 5.4 64 91-163 1165-1232(1438) 215 smart00841 Elong-fact-P_C Elon 24.9 27 0.00069 16.1 0.6 14 109-122 35-48 (56) 216 pfam09285 Elong-fact-P_C Elong 24.7 54 0.0014 14.3 2.2 25 85-123 25-49 (56) 217 PRK00405 rpoB DNA-directed RNA 24.6 38 0.00098 15.2 1.3 18 108-125 702-719 (1127) 218 TIGR02303 HpaG-C-term 4-hydrox 24.5 47 0.0012 14.6 1.8 23 117-139 225-249 (249) 219 TIGR01009 rpsC_bact ribosomal 24.3 55 0.0014 14.2 3.0 31 9-40 37-71 (217) 220 CHL00037 petA cytochrome f 24.3 56 0.0014 14.2 5.2 43 120-162 211-263 (320) 221 PRK11281 potassium efflux prot 23.8 57 0.0014 14.2 3.0 32 112-160 938-969 (1107) 222 PRK02693 apocytochrome f; Revi 23.7 57 0.0015 14.1 5.5 30 132-162 218-255 (312) 223 pfam09160 FimH_man-bind FimH, 23.5 57 0.0015 14.1 3.3 35 94-133 99-133 (150) 224 cd05794 S1_EF-P_repeat_2 S1_EF 23.2 58 0.0015 14.1 2.3 15 109-123 35-49 (56) 225 KOG2420 consensus 21.9 47 0.0012 14.6 1.4 16 146-161 367-382 (382) 226 COG4405 Uncharacterized protei 21.9 61 0.0015 14.0 1.9 26 109-134 54-83 (140) 227 pfam10011 DUF2254 Predicted me 21.8 62 0.0016 13.9 2.0 28 138-165 217-244 (369) 228 cd03079 GST_N_Metaxin2 GST_N f 21.8 39 0.001 15.1 0.9 18 101-119 44-61 (74) 229 KOG2362 consensus 21.7 63 0.0016 13.9 2.3 29 90-127 302-330 (336) 230 TIGR02828 TIGR02828 putative m 21.2 44 0.0011 14.8 1.1 10 28-37 85-94 (197) 231 PRK13526 glutamine amidotransf 20.9 65 0.0017 13.8 2.7 21 94-119 120-140 (179) 232 TIGR00259 TIGR00259 conserved 20.4 50 0.0013 14.5 1.2 17 95-111 3-19 (261) 233 cd03443 PaaI_thioesterase PaaI 20.0 68 0.0017 13.7 2.5 38 91-130 58-95 (113) No 1 >PRK06302 acetyl-CoA carboxylase biotin carboxyl carrier protein subunit; Validated Probab=100.00 E-value=0 Score=325.56 Aligned_cols=155 Identities=35% Similarity=0.518 Sum_probs=123.7 Q ss_pred CCHHHHHHHHHHHHHCCCEEEEEEECCEEEEEECCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 39899999999998659708999509969999706776521133202344455422233221122344444544333454 Q gi|254780267|r 8 INLTLIRNLANILNETNLTEVEIDNDGMRIRLLRSPQKDTVTNYYSEDNKNNHSLVGFPPSSTIDNTPPESDLIPLLSPD 87 (171) Q Consensus 8 mdi~~Ik~Li~~~~es~l~eleie~~~~kI~I~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 87 (171) ||+++|++|++++++|+|+||+|+.++++|+|+|+...................... ..............+..... T Consensus 1 Mdl~~IkeLi~~~~~s~l~Elel~~~~~~i~l~k~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~ 77 (155) T PRK06302 1 MDIRKIKKLIELVEESGISELEIKEGEESVRISRNEAAAAAPVAQQYAPPVAAAPAP---AAAAAPAAAAAAEAPAAAAA 77 (155) T ss_pred CCHHHHHHHHHHHHHCCCCEEEEEECCEEEEEEECCCCCCCCCCCCCCCCCCCCCCC---CCCCCCCCCCCCCCCCCCCC T ss_conf 988999999999988898489998599899998378887664455666666678888---88788888887777665545 Q ss_pred CCCEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCCCCCCCCCEEEEEE Q ss_conf 662687046256774036764100012442010434899973772689998998199999848986836997299995 Q gi|254780267|r 88 NYHTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQSVEYGDALLVLE 165 (171) Q Consensus 88 ~~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~ 165 (171) +++.|+|||||||||+|+|+++|||++||+|++||+||||||||+||+|+|+++|+|.+|||+||++|+|||+||+|| T Consensus 78 ~~~~I~SPmVGtFY~sp~P~~~pFV~vGd~V~~Gq~v~iIEaMK~mneI~a~~~G~I~~Ilv~~G~~Veygq~Lf~i~ 155 (155) T PRK06302 78 SGHVVTSPMVGTFYRAPSPDAPPFVEVGDTVKEGQTLCIIEAMKMMNEIEADKSGVVKEILVENGQPVEFGQPLFVIE 155 (155) T ss_pred CCCEEECCCEEEEEECCCCCCCCCCCCCCEECCCCEEEEEEECCCCCEEECCCCCEEEEEEECCCCEECCCCEEEEEC T ss_conf 798786784367987899899974246867248988999984243532406988489999855998763998269869 No 2 >COG0511 AccB Biotin carboxyl carrier protein [Lipid metabolism] Probab=100.00 E-value=3.4e-35 Score=234.17 Aligned_cols=140 Identities=39% Similarity=0.541 Sum_probs=101.1 Q ss_pred HHHHHHHHHHCCCEEEEEEECCEEEEEECCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEE Q ss_conf 99999999865970899950996999970677652113320234445542223322112234444454433345466268 Q gi|254780267|r 13 IRNLANILNETNLTEVEIDNDGMRIRLLRSPQKDTVTNYYSEDNKNNHSLVGFPPSSTIDNTPPESDLIPLLSPDNYHTV 92 (171) Q Consensus 13 Ik~Li~~~~es~l~eleie~~~~kI~I~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~I 92 (171) |++|++++.++++++++++.+++++++.|.... . .............+...............+... .+++.| T Consensus 1 i~~l~~~~~~~~~~~~~v~~~~~~~~~~~~~~~--~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~V 73 (140) T COG0511 1 IKELIKLVVESDGTEFEVKVEEGEVRLSRKTPV--V----QDVPAPAPIEASSPSAAAAQPAASAPAPAPAAA-AGGTQV 73 (140) T ss_pred CCCHHHHEEEECCEEEEEEECCCEEEEEECCCE--E----CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC-CCCCEE T ss_conf 902120225756589999827816787403530--0----134344431024665445566555566666554-678667 Q ss_pred EECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCCCCCCCCCEEEEEEE Q ss_conf 70462567740367641000124420104348999737726899989981999998489868369972999953 Q gi|254780267|r 93 TSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQSVEYGDALLVLEK 166 (171) Q Consensus 93 ~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~ 166 (171) +|||+||||+ +||++||+|++||+|||||||||||+|.||.+|+|.+|+|++||+|+|||+||+|++ T Consensus 74 ~SPm~Gtv~~-------~~V~vGd~V~~Gq~l~IiEAMKmeneI~A~~~G~V~~Ilv~~G~~Ve~G~~L~~I~~ 140 (140) T COG0511 74 TSPMVGTVYK-------PFVEVGDTVKAGQTLAIIEAMKMENEIEAPADGVVKEILVKNGDPVEYGDPLAVIEP 140 (140) T ss_pred ECCCCEEEEE-------EEECCCCEECCCCEEEEEEEEECCCEECCCCCCEEEEEEECCCCCCCCCCEEEEEEC T ss_conf 4787618999-------876579997589999999820015532289996899999448985268998899609 No 3 >PRK07051 hypothetical protein; Validated Probab=99.98 E-value=1.1e-32 Score=218.82 Aligned_cols=78 Identities=33% Similarity=0.625 Sum_probs=76.3 Q ss_pred CCEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCCCCCCCCCEEEEEEE Q ss_conf 626870462567740367641000124420104348999737726899989981999998489868369972999953 Q gi|254780267|r 89 YHTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQSVEYGDALLVLEK 166 (171) Q Consensus 89 ~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~ 166 (171) .+.|+|||+|||||+|+|+++|||++||+|++||+||+||||||||+|+|+++|+|.+|||++|+.|+|||+||+||+ T Consensus 3 ~~~I~sPm~G~fy~~~~p~~~pfV~~Gd~V~~G~~v~iiEaMKm~~~V~A~~~G~V~~ilv~~G~~V~~Gq~L~~Iee 80 (80) T PRK07051 3 QHEIVSPLPGTFYRRPSPDAPPYVEVGDTVAAGDVVGLVEVMKQFSEVEAEVAGRVVRFLVDDGEPVDAGQVLMRIEE 80 (80) T ss_pred CCEEECCCCEEEECCCCCCCCCCCCCCCEECCCCEEEEEEECCCCCEEECCCCCEEEEEEECCCCEECCCCEEEEEEC T ss_conf 977728944789827995999813679998899999999700277467269982999999569898679997899829 No 4 >TIGR00531 BCCP acetyl-CoA carboxylase, biotin carboxyl carrier protein; InterPro: IPR001249 The proteins in this family are a component of the acetyl coenzyme A carboxylase complex (6.4.1.2 from EC) and are involved in the first step in long-chain fatty acid synthesis. In plants this is usually located in the chloroplast. In the first step, biotin carboxylase catalyses the carboxylation of the carrier protein to form an intermediate. Next, the transcarboxylase complex transfers the carboxyl group from the intermediate to acetyl-CoA forming malonyl-CoA. This protein functions in the transfer of CO_2 from one site to another, the biotin binding site locates to the C-terminal of this protein. The biotin is specifically attached to a lysine residue in the sequence AMKM. The structure of the C-terminal domain of the biotin carboxyl carrier (BCC) protein was shown to be a flattened beta-barrel structure comprising two four-stranded beta sheets interrupted by a structural loop forming a thumb structure. The biotinyl-lysine is located on a tight beta-turn on the opposite end of the molecule. The thumb structure has been shown to attached biotin, thus stabilising the structure. ; GO: 0003989 acetyl-CoA carboxylase activity, 0006633 fatty acid biosynthetic process, 0009317 acetyl-CoA carboxylase complex. Probab=99.97 E-value=1.1e-31 Score=212.91 Aligned_cols=159 Identities=33% Similarity=0.537 Sum_probs=125.1 Q ss_pred CCCHHHHHHHHHHHHHCCCEEEEEEECCEEEEEECCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 13989999999999865970899950996999970677652113320234445542223322112234444454433345 Q gi|254780267|r 7 KINLTLIRNLANILNETNLTEVEIDNDGMRIRLLRSPQKDTVTNYYSEDNKNNHSLVGFPPSSTIDNTPPESDLIPLLSP 86 (171) Q Consensus 7 ~mdi~~Ik~Li~~~~es~l~eleie~~~~kI~I~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 86 (171) .||+++|++|++.++++++++|+++.+++.+++.|...............+............. ...+......+...+ T Consensus 1 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~-~p~~~~~~~~p~~~~ 79 (159) T TIGR00531 1 ALNIREIKELIKLIEESGITELELKEEEFELRLSKAAPAAGKSAVQPAAAPVPAPVPAAVPPAA-VPAPAPAAPAPPPAD 79 (159) T ss_pred CCCHHHHHHHHHHHHHCCCCEEEEECCCEEEEEECCCCCCCHHHCCCCCCCCCCCHHHHCCCCC-CCCCCCCCCCCCCCC T ss_conf 9646789999876520585204652475168872211100000001112553100011103110-343211246764110 Q ss_pred CCCCEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCCCCCCCCCEEEEEEE Q ss_conf 46626870462567740367641000124420104348999737726899989981999998489868369972999953 Q gi|254780267|r 87 DNYHTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQSVEYGDALLVLEK 166 (171) Q Consensus 87 ~~~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~ 166 (171) ...+.|.|||+||||++|+|+++|||++||.|++||.+|++|||||+|+|.++..|.|..+++++|++|+||++|+.|++ T Consensus 80 ~~~~~~~~p~~gtfy~~p~p~~~p~~~~g~~~~~g~~~c~~ea~k~~n~~e~~~~g~~~~~~~~~g~~~e~g~~l~~~~~ 159 (159) T TIGR00531 80 EKGHLVRSPMVGTFYRAPSPDAKPFVEVGDKVNKGDGVCIVEAMKLMNEIEAEVAGEVVEILVEDGQPVEYGEPLILIEP 159 (159) T ss_pred CCCCEEECCCCCCEECCCCCCCCCEEEECCEECCCCEEEHHHHHHHHHHHCCCCCCEEEEEEECCCCCCCCCCEEEEEEC T ss_conf 23301004421100027888866402105500246332012455565442002465378997428850103862467619 No 5 >PRK12999 pyruvate carboxylase; Reviewed Probab=99.91 E-value=3e-24 Score=167.69 Aligned_cols=78 Identities=40% Similarity=0.562 Sum_probs=71.4 Q ss_pred CCCCCCCCCEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCCCCCCCCCEE Q ss_conf 33345466268704625677403676410001244201043489997377268999899819999984898683699729 Q gi|254780267|r 82 PLLSPDNYHTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQSVEYGDAL 161 (171) Q Consensus 82 ~~~~~~~~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~V~~gq~L 161 (171) +..+..+...|.|||+|+++ +.+|++||+|++||+|++||||||||+|.||++|+|++|+|+.|+.|+.||+| T Consensus 1070 ~kAd~~np~~VgAPmpG~V~-------kV~Vk~GD~VkkGd~LlvlEAMKMEt~I~Ap~dG~V~~I~V~~Gd~V~~GDLL 1142 (1147) T PRK12999 1070 EKADPGNPGHIGAPMPGSVV-------TVAVKEGDEVKAGDPLLVIEAMKMETTITAPFDGTVKRVLVKAGDQVEAGDLL 1142 (1147) T ss_pred CCCCCCCCCCCCCCCCCEEE-------EEEECCCCEECCCCEEEEEEHHCCCCEECCCCCCEEEEEEECCCCEECCCCEE T ss_conf 66899998813699982589-------99968999878999899985103866043899948989997898868899879 Q ss_pred EEEEE Q ss_conf 99953 Q gi|254780267|r 162 LVLEK 166 (171) Q Consensus 162 ~~i~~ 166 (171) ++|+| T Consensus 1143 v~Iep 1147 (1147) T PRK12999 1143 VELEP 1147 (1147) T ss_pred EEEEC T ss_conf 99729 No 6 >PRK05641 putative acetyl-CoA carboxylase biotin carboxyl carrier protein subunit; Validated Probab=99.91 E-value=3.7e-24 Score=167.15 Aligned_cols=72 Identities=43% Similarity=0.609 Sum_probs=67.4 Q ss_pred CCCCCEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCCCCCCCCCEEEEE Q ss_conf 5466268704625677403676410001244201043489997377268999899819999984898683699729999 Q gi|254780267|r 86 PDNYHTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQSVEYGDALLVL 164 (171) Q Consensus 86 ~~~~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i 164 (171) ..+...|+|||+|+++ +.+|++||+|++||+||+||||||+|+|.||.+|+|.+|+|+.||.|+.||+|++| T Consensus 76 ~~g~~~v~apmpG~v~-------~v~v~~Gd~V~~g~~l~vlEaMKMe~~i~a~~~G~v~~i~v~~g~~V~~g~~l~~i 147 (148) T PRK05641 76 QVGENVVTAPMPGKIL-------KILVKEGDQVKTGQGLLILEAMKMENEIPAPKDGVVKKILVKEGDTVDTGQPLIEL 147 (148) T ss_pred CCCCCEEECCCCEEEE-------EEEECCCCEECCCCEEEEEEHHCCCEEEEECCCEEEEEEEECCCCEECCCCEEEEE T ss_conf 5689979769990899-------99968999987999999999431513999299989989993897998999989995 No 7 >PRK06549 acetyl-CoA carboxylase biotin carboxyl carrier protein subunit; Validated Probab=99.91 E-value=1.8e-23 Score=162.92 Aligned_cols=76 Identities=41% Similarity=0.613 Sum_probs=69.7 Q ss_pred CCCCCCCCCEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCCCCCCCCCEE Q ss_conf 33345466268704625677403676410001244201043489997377268999899819999984898683699729 Q gi|254780267|r 82 PLLSPDNYHTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQSVEYGDAL 161 (171) Q Consensus 82 ~~~~~~~~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~V~~gq~L 161 (171) +.....+...|+|||+|++| +.+|++||+|++||+||+||||||+|+|.||++|+|.+|+|..||.|+.||+| T Consensus 56 ~~~~~~~~~~v~aPm~G~v~-------~i~v~~Gd~V~~g~~l~vlEAMKMe~~i~ap~~G~V~~i~v~~G~~V~~g~~L 128 (132) T PRK06549 56 PVAPAAGADAIPSPMPGTIL-------KVLVAVGDQVTENQPLLILEAMKMENEIVASSAGTITAIHVGPGQVVNPGDGL 128 (132) T ss_pred CCCCCCCCCEECCCCCEEEE-------EEEECCCCEECCCCEEEEEEEHHCCCEEECCCCEEEEEEEECCCCEECCCCEE T ss_conf 87788889998089997999-------99938999987999899998010244105899979989994798998999989 Q ss_pred EEE Q ss_conf 999 Q gi|254780267|r 162 LVL 164 (171) Q Consensus 162 ~~i 164 (171) +.| T Consensus 129 ~~i 131 (132) T PRK06549 129 ITI 131 (132) T ss_pred EEE T ss_conf 994 No 8 >PRK09282 pyruvate carboxylase subunit B; Validated Probab=99.90 E-value=2.4e-23 Score=162.22 Aligned_cols=78 Identities=38% Similarity=0.589 Sum_probs=70.7 Q ss_pred CCCCCCCCCCEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCCCCCCCCCE Q ss_conf 43334546626870462567740367641000124420104348999737726899989981999998489868369972 Q gi|254780267|r 81 IPLLSPDNYHTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQSVEYGDA 160 (171) Q Consensus 81 ~~~~~~~~~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~V~~gq~ 160 (171) .+.....+...|.|||+|+++ +.+|++||+|++||+|++||||||||+|.||++|+|.+|+|+.|+.|+.||+ T Consensus 503 ~~~a~~~~~~~V~APmpG~V~-------~V~Vk~Gd~V~~Gd~L~vlEAMKMEt~I~Ap~~G~V~~I~V~~Gd~V~~gdl 575 (580) T PRK09282 503 AAAAPASAGGPVTAPMAGNIV-------KVLVKEGDKVKEGDVLLILEAMKMETEIRAPIAGTVKEIFVKEGDAVSVGDV 575 (580) T ss_pred CCCCCCCCCCEECCCCCCEEE-------EEEECCCCEECCCCEEEEEEHHCCCCEEECCCCCEEEEEEECCCCEECCCCE T ss_conf 577899999853279982699-------9997899987899989998510474425079994897999789887579998 Q ss_pred EEEEE Q ss_conf 99995 Q gi|254780267|r 161 LLVLE 165 (171) Q Consensus 161 L~~i~ 165 (171) |++|. T Consensus 576 L~~I~ 580 (580) T PRK09282 576 LMTIA 580 (580) T ss_pred EEEEC T ss_conf 99949 No 9 >TIGR02712 urea_carbox urea carboxylase; InterPro: IPR014084 Members of this family are ATP-dependent urea carboxylases, including characterised members from Oleomonas sagaranensis (alpha class Proteobacterium) and yeasts such as Saccharomyces cerevisiae (Baker's yeast). The allophanate hydrolase domain of the yeast enzyme is not included in this entry and is represented by an adjacent gene in Oleomonas sagaranensis. The fusion of urea carboxylase and allophanate hydrolase is designated urea amidolyase. The enzyme from O. sagaranensis was shown to be highly active on acetamide and formamide as well as urea.. Probab=99.85 E-value=1.2e-21 Score=151.85 Aligned_cols=79 Identities=39% Similarity=0.515 Sum_probs=72.6 Q ss_pred CCCCCCCCCCCEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCCCCCCCCC Q ss_conf 44333454662687046256774036764100012442010434899973772689998998199999848986836997 Q gi|254780267|r 80 LIPLLSPDNYHTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQSVEYGD 159 (171) Q Consensus 80 ~~~~~~~~~~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~V~~gq 159 (171) .......++...|.|+|.|+|| +..|.+||.|++||.|.||||||||..|.||.+|+|++|++++|++|..|| T Consensus 1148 ~~~~~~~~g~~~V~s~~~G~~W-------K~~V~~G~~V~~G~~l~i~EaMKmE~~v~Ap~~G~V~~i~~~~G~~V~aG~ 1220 (1226) T TIGR02712 1148 EEEADLPEGAVQVESEVAGNVW-------KVLVEVGDRVEAGQPLVIIEAMKMEMTVSAPVAGKVTKILCKPGDMVDAGD 1220 (1226) T ss_pred CCCCCCCCCCEEEEEECEEEEE-------EEEECCCCEEECCCEEEEEEECCCCEEECCCCCCEEEEEEECCCCCCCCCC T ss_conf 2224365771489850201465-------665212663204873899943572112438988548899736848226786 Q ss_pred EEEEEE Q ss_conf 299995 Q gi|254780267|r 160 ALLVLE 165 (171) Q Consensus 160 ~L~~i~ 165 (171) .|+.|+ T Consensus 1221 ~l~~~~ 1226 (1226) T TIGR02712 1221 ILVVLE 1226 (1226) T ss_pred EEEEEC T ss_conf 378747 No 10 >PRK08225 acetyl-CoA carboxylase biotin carboxyl carrier protein subunit; Validated Probab=99.84 E-value=6.7e-21 Score=147.36 Aligned_cols=69 Identities=41% Similarity=0.590 Sum_probs=66.1 Q ss_pred CEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCCCCCCCCCEEEEEE Q ss_conf 2687046256774036764100012442010434899973772689998998199999848986836997299995 Q gi|254780267|r 90 HTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQSVEYGDALLVLE 165 (171) Q Consensus 90 ~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~ 165 (171) ..|.|||+|++| +.+|++||+|++||+|++||||||+++|.||++|+|++|+++.||.|+.||+|+.|+ T Consensus 2 ~~V~Apm~G~V~-------~v~V~~Gd~V~~G~~l~vlEAMKME~~i~A~~~G~V~~i~v~~Gd~V~~G~~L~~ie 70 (70) T PRK08225 2 TKVYASMAGNVW-------KIVVKVGDTVEEGQDVVILESMKMEIPIVAEEAGTVKKINVQEGDFVNEGDVLLEIE 70 (70) T ss_pred CEEECCCCEEEE-------EEEECCCCEECCCCEEEEEEECCCCCEEECCCCEEEEEEEECCCCEECCCCEEEEEC T ss_conf 878578994999-------999289899899999999971126576847999099999958989989999999979 No 11 >TIGR01108 oadA oxaloacetate decarboxylase alpha subunit; InterPro: IPR005776 This family describes the bacterial oxaloacetate decarboxylase alpha subunit and its equivalents in archaea . The oxaloacetate decarboxylase Na+ pump is the paradigm of the family of Na+ transport decarboxylases that present in bacteria and archaea. It a multi subunit enzyme consisting of a peripheral alpha-subunit and integral membrane subunits beta and gamma. The energy released by the decarboxylation reaction of oxaloacetate is coupled to Na+ ion pumping across the membrane.; GO: 0008948 oxaloacetate decarboxylase activity, 0006814 sodium ion transport. Probab=99.83 E-value=1.4e-20 Score=145.34 Aligned_cols=71 Identities=37% Similarity=0.540 Sum_probs=65.0 Q ss_pred CCCCCCCEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCCCCCCCCCEE Q ss_conf 345466268704625677403676410001244201043489997377268999899819999984898683699729 Q gi|254780267|r 84 LSPDNYHTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQSVEYGDAL 161 (171) Q Consensus 84 ~~~~~~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~V~~gq~L 161 (171) +.......|+|||.|.++ +..|..||+|++||+|+||||||||+||+|+.+|+|.+|+|+.|+.|..||.| T Consensus 546 a~~~a~~~vTaP~aG~i~-------~~~V~~G~~V~~G~vlL~~EAMKME~Ei~A~~aG~V~~i~v~~Gd~V~~gqVL 616 (616) T TIGR01108 546 AKAGAITDVTAPIAGSIV-------KITVSEGDEVKEGEVLLVVEAMKMETEIKAAAAGIVEEILVKKGDKVSPGQVL 616 (616) T ss_pred CCCCCCCCCCCCCCCCEE-------EEEECCCCEECCCCEEEEEECCCCCHHHCCCCCCCEEEEEECCCCEECCCCCC T ss_conf 888989875667777369-------99835787431577588531001112200543570112344578750886679 No 12 >PRK05889 putative acetyl-CoA carboxylase biotin carboxyl carrier protein subunit; Provisional Probab=99.81 E-value=3.2e-20 Score=143.23 Aligned_cols=68 Identities=29% Similarity=0.538 Sum_probs=65.0 Q ss_pred CEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCCCCCCCCCEEEEE Q ss_conf 268704625677403676410001244201043489997377268999899819999984898683699729999 Q gi|254780267|r 90 HTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQSVEYGDALLVL 164 (171) Q Consensus 90 ~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i 164 (171) ..|.|||+|++| +.+|++||+|++||+|+++|||||+++|.||++|+|.+|+|+.||.|+.||+|+.| T Consensus 3 e~V~Apm~G~V~-------~v~V~~Gd~V~~Gd~l~vlEAMKME~~v~A~~~G~V~~i~v~~Gd~V~~Gd~L~~I 70 (71) T PRK05889 3 EDVRAEIVASVL-------EVVVNEGDQIGEGDTLVLLESMKMEIPVLAEVAGTVSKVSVSVGDVIQAGDLIAVI 70 (71) T ss_pred CCCCCCCCEEEE-------EEEECCCCEECCCCEEEEEEHHCCCCEEECCCCCEEEEEEECCCCEECCCCEEEEE T ss_conf 531489984899-------99979999988999999998200457784799909979997899998899999996 No 13 >TIGR01235 pyruv_carbox pyruvate carboxylase; InterPro: IPR005930 In the postabsorptive state, blood glucose concentration is kept constant by a combination of glycogenolysis and gluconeogenesis. During prolonged fasting, when hepatic glycogen is exhausted, gluconeogenesis becomes the only source of plasma glucose. Gluconeogenesis (the production of new glucose molecules) occurs mainly in liver and, to a small extent, in kidneys. Pyruvate carboxylase (6.4.1.1 from EC), a member of the biotin-dependent enzyme family, catalyses the ATP-dependent carboxylation of pyruvate to oxaloacetate, thus playing a crucial role in gluconeogenesis.ATP + pyruvate + HCO3- = ADP + phosphate + oxaloacetate Most well characterised forms of active enzyme consist of four identical subunits arranged in a tetrahedron-like structure. Each subunit contains three functional domains: the biotin carboxylation domain, the transcarboxylation domain and the biotin carboxyl carrier domain.; GO: 0004736 pyruvate carboxylase activity, 0006094 gluconeogenesis, 0005737 cytoplasm. Probab=99.81 E-value=6.2e-20 Score=141.47 Aligned_cols=78 Identities=36% Similarity=0.503 Sum_probs=71.0 Q ss_pred CCCCCCCCCCEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCCCCCCCCCE Q ss_conf 43334546626870462567740367641000124420104348999737726899989981999998489868369972 Q gi|254780267|r 81 IPLLSPDNYHTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQSVEYGDA 160 (171) Q Consensus 81 ~~~~~~~~~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~V~~gq~ 160 (171) ...++..|...|-|||+|++ .+.||+.|+.|++||+|.++||||||.-|.||.||+|++++|++|+.|+.+|+ T Consensus 1092 ~~KA~~~n~~hvGApMpG~v-------~~v~v~~G~~v~kGd~l~~~~AMKME~~v~ap~~G~vk~v~v~~G~~i~~~DL 1164 (1169) T TIGR01235 1092 RRKADKGNPAHVGAPMPGVV-------IEVKVEAGQKVNKGDPLVVLEAMKMETVVQAPKDGTVKEVLVKDGEKIDAKDL 1164 (1169) T ss_pred ECCCCCCCHHHCCCCCCCCE-------EEEEEECCCEECCCCEEEEEECCCCCCEEECCCCCEEEEEEECCCCCCCCCCE T ss_conf 14778887432256899707-------99994138730167848996037632046678997246888358870351211 Q ss_pred EEEEE Q ss_conf 99995 Q gi|254780267|r 161 LLVLE 165 (171) Q Consensus 161 L~~i~ 165 (171) |+.+| T Consensus 1165 L~~le 1169 (1169) T TIGR01235 1165 LLVLE 1169 (1169) T ss_pred EEEEC T ss_conf 35409 No 14 >cd06850 biotinyl_domain The biotinyl-domain or biotin carboxyl carrier protein (BCCP) domain is present in all biotin-dependent enzymes, such as acetyl-CoA carboxylase, pyruvate carboxylase, propionyl-CoA carboxylase, methylcrotonyl-CoA carboxylase, geranyl-CoA carboxylase, oxaloacetate decarboxylase, methylmalonyl-CoA decarboxylase, transcarboxylase and urea amidolyase. This domain functions in transferring CO2 from one subsite to another, allowing carboxylation, decarboxylation, or transcarboxylation. During this process, biotin is covalently attached to a specific lysine. Probab=99.81 E-value=5.9e-20 Score=141.58 Aligned_cols=67 Identities=45% Similarity=0.686 Sum_probs=64.2 Q ss_pred EEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCCCCCCCCCEEEEE Q ss_conf 68704625677403676410001244201043489997377268999899819999984898683699729999 Q gi|254780267|r 91 TVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQSVEYGDALLVL 164 (171) Q Consensus 91 ~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i 164 (171) .|+|||+|+||+ .+|++||+|++||+||+||||||+++|+||.+|+|.+|++++|+.|++||+|+.| T Consensus 1 ~i~ap~~G~v~~-------~~V~~Gd~V~~G~~l~~iE~mK~~~~i~a~~~G~v~~i~~~~G~~V~~G~~L~~i 67 (67) T cd06850 1 EVTAPMPGTVVK-------VLVKEGDKVEAGQPLAVLEAMKMENEVTAPVAGVVKEILVKEGDQVEAGQLLVVI 67 (67) T ss_pred CEECCCCEEEEE-------EEECCCCEECCCCEEEEEEECCCEEEEEECCCEEEEEEEECCCCEECCCCEEEEC T ss_conf 910879889999-------9957999997899999999366119999689999999997898998999999979 No 15 >COG1038 PycA Pyruvate carboxylase [Energy production and conversion] Probab=99.80 E-value=2.5e-19 Score=137.84 Aligned_cols=80 Identities=38% Similarity=0.564 Sum_probs=71.9 Q ss_pred CCCCCCCCCCCEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCCCCCCCCC Q ss_conf 44333454662687046256774036764100012442010434899973772689998998199999848986836997 Q gi|254780267|r 80 LIPLLSPDNYHTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQSVEYGD 159 (171) Q Consensus 80 ~~~~~~~~~~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~V~~gq 159 (171) .....+..+...|-|||+|++ ...+|+.||+||+||+|++||||||+..|.||+||+|.+++|.+|++|+.|| T Consensus 1070 ~r~KAd~~Np~higApmpG~V-------v~v~V~~G~~Vk~Gd~l~~ieAMKMEt~i~Ap~dG~i~~v~V~~gd~i~~gD 1142 (1149) T COG1038 1070 ARRKADPGNPGHIGAPMPGVV-------VEVKVKKGDKVKKGDVLAVIEAMKMETTISAPFDGTVKEVLVKDGDQIDGGD 1142 (1149) T ss_pred EECCCCCCCCCCCCCCCCCCE-------EEEEECCCCCCCCCCEEEEEHHHHHCEEEECCCCCEEEEEEECCCCCCCCCC T ss_conf 001267898112178888726-------9999745873047986533022322224415778667689965887124675 Q ss_pred EEEEEEE Q ss_conf 2999953 Q gi|254780267|r 160 ALLVLEK 166 (171) Q Consensus 160 ~L~~i~~ 166 (171) +|+.+++ T Consensus 1143 LLi~~~~ 1149 (1149) T COG1038 1143 LLVVVEE 1149 (1149) T ss_pred EEEECCC T ss_conf 5997059 No 16 >COG4770 Acetyl/propionyl-CoA carboxylase, alpha subunit [Lipid metabolism] Probab=99.74 E-value=3.3e-18 Score=131.03 Aligned_cols=75 Identities=32% Similarity=0.502 Sum_probs=68.6 Q ss_pred CCCCCCEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCCCCCCCCCEEEEE Q ss_conf 45466268704625677403676410001244201043489997377268999899819999984898683699729999 Q gi|254780267|r 85 SPDNYHTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQSVEYGDALLVL 164 (171) Q Consensus 85 ~~~~~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i 164 (171) .......++|||+|++- ...|+.|+.|.+||+|+++|||||+|.|.||.+|+|.++.|..|++|..|+.|+++ T Consensus 571 ~~~~~~~l~aPMpG~v~-------~v~V~~G~~V~~G~~lvvlEAMKME~~l~A~~dG~V~~v~v~~Gd~V~~g~vLve~ 643 (645) T COG4770 571 AAASSGELLAPMPGTVV-------SVAVKEGQEVSAGDLLVVLEAMKMENTLRAPRDGVVAKLAVAEGDQVAVGTVLVEF 643 (645) T ss_pred CCCCCCCEECCCCCEEE-------EEEECCCCEECCCCEEEEEEEHHCCCCEECCCCCEEEEEEECCCCCCCCCCEEEEE T ss_conf 67787716567896289-------99936897754798678867112256122476857999985589712057269983 Q ss_pred EE Q ss_conf 53 Q gi|254780267|r 165 EK 166 (171) Q Consensus 165 ~~ 166 (171) ++ T Consensus 644 ~~ 645 (645) T COG4770 644 EE 645 (645) T ss_pred CC T ss_conf 69 No 17 >pfam00364 Biotin_lipoyl Biotin-requiring enzyme. This family covers two Prosite entries, the conserved lysine residue binds biotin in one group and lipoic acid in the other. Note that the model may not recognize the Glycine cleavage system H proteins. Probab=99.71 E-value=1.4e-17 Score=127.22 Aligned_cols=72 Identities=39% Similarity=0.584 Sum_probs=68.3 Q ss_pred EEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCCCCCCCCCEEEEE Q ss_conf 68704625677403676410001244201043489997377268999899819999984898683699729999 Q gi|254780267|r 91 TVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQSVEYGDALLVL 164 (171) Q Consensus 91 ~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i 164 (171) .|+.|+.|.++..+. .+.||++||+|++||+||+||+|||+++|+||++|+|.+|++++|+.|.+||+|++| T Consensus 2 ei~~P~lG~~~~~~~--~~w~v~~Gd~V~~g~~l~~iEt~K~~~~v~a~~~G~i~~i~v~~G~~V~~G~~l~~I 73 (73) T pfam00364 2 EIKSPMIGESVKEGT--AEWLVKVGDKVKAGQVLCEVEAMKMEMEIPAPVAGVVKEILVKEGDTVEVGDPLAKI 73 (73) T ss_pred CEECCCCCCCCEECC--EEEEECCCCEECCCCEEEEEEECCCCCEECCCCCEEEEEEEECCCCEECCCCEEEEC T ss_conf 347789879737234--848658989992899899999665352250898889999997998998999999979 No 18 >cd06663 Biotinyl_lipoyl_domains Biotinyl_lipoyl_domains are present in biotin-dependent carboxylases/decarboxylases, the dihydrolipoyl acyltransferase component (E2) of 2-oxo acid dehydrogenases, and the H-protein of the glycine cleavage system (GCS). These domains transport CO2, acyl, or methylamine, respectively, between components of the complex/protein via a biotinyl or lipoyl group, which is covalently attached to a highly conserved lysine residue. Probab=99.60 E-value=1.7e-15 Score=114.59 Aligned_cols=56 Identities=36% Similarity=0.477 Sum_probs=54.0 Q ss_pred CCEECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCCCCCCCCCEEEEE Q ss_conf 10001244201043489997377268999899819999984898683699729999 Q gi|254780267|r 109 DPFVNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQSVEYGDALLVL 164 (171) Q Consensus 109 ~~~V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i 164 (171) +-+|++||+|++||+||+||||||+++|.||++|+|.+|+++.|+.|..||+|++| T Consensus 18 ~~~v~~Gd~V~~Gd~l~~iEa~K~~~ei~ap~~G~v~~i~v~~G~~V~~G~~l~~I 73 (73) T cd06663 18 KWLKKVGDKVKKGDVLAEIEAMKATSDVEAPKSGTVKKVLVKEGTKVEGDTPLVKI 73 (73) T ss_pred EEEECCCCEECCCCEEEEEEEEEEEEEEECCCCEEEEEEEECCCCEECCCCEEEEC T ss_conf 99926999997999999999500569986699989999995898997899999979 No 19 >KOG0369 consensus Probab=99.54 E-value=1.3e-14 Score=109.25 Aligned_cols=82 Identities=33% Similarity=0.455 Sum_probs=73.4 Q ss_pred CCCCCCCCCCCCCEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCCCCCCC Q ss_conf 45443334546626870462567740367641000124420104348999737726899989981999998489868369 Q gi|254780267|r 78 SDLIPLLSPDNYHTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQSVEY 157 (171) Q Consensus 78 ~~~~~~~~~~~~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~V~~ 157 (171) ...-|.....+.+.|.|||+|++. +.-|++|++|++||+||++-||||+.-|.||.+|+|+++++.+|+.|+- T Consensus 1095 ~v~~PkA~~~~~g~igAPMpG~vi-------eikvk~G~kV~Kgqpl~VLSAMKMEmVv~sP~~G~vk~v~v~~g~~~~g 1167 (1176) T KOG0369 1095 IVTRPKADPGVKGHIGAPMPGTVI-------EIKVKEGAKVKKGQPLAVLSAMKMEMVISSPHAGTVKKVHVVQGTKVEG 1167 (1176) T ss_pred HHCCCCCCCCCCCCCCCCCCCCEE-------EEEEECCCEECCCCCEEEEECCEEEEEECCCCCCEEEEEEECCCCCCCC T ss_conf 433776799985653698998469-------9998307510379835761201031366377776035787336860356 Q ss_pred CCEEEEEEE Q ss_conf 972999953 Q gi|254780267|r 158 GDALLVLEK 166 (171) Q Consensus 158 gq~L~~i~~ 166 (171) |+++++||+ T Consensus 1168 ~DL~~~~E~ 1176 (1176) T KOG0369 1168 GDLIVELEH 1176 (1176) T ss_pred CCEEEECCC T ss_conf 645888159 No 20 >KOG0238 consensus Probab=99.53 E-value=5.8e-15 Score=111.29 Aligned_cols=74 Identities=34% Similarity=0.534 Sum_probs=68.4 Q ss_pred CCCCCCEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCCCCCCCCCEEEEE Q ss_conf 45466268704625677403676410001244201043489997377268999899819999984898683699729999 Q gi|254780267|r 85 SPDNYHTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQSVEYGDALLVL 164 (171) Q Consensus 85 ~~~~~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i 164 (171) .......+.|||+|++ ++.||+.||+|++||.+|+++||||+|.++||.+|+|+.+.++.|+.|.-|.+||++ T Consensus 597 ~~~~s~v~~aPMpG~I-------ekv~Vkpgd~V~~Gq~l~Vl~AMKMe~~~~apk~gtvk~v~~~aG~~v~~g~vlv~~ 669 (670) T KOG0238 597 KEDGSGVIVAPMPGII-------EKVLVKPGDKVKEGQELVVLIAMKMEHSLKAPKDGTVKDVKYKAGATVGDGAVLVEF 669 (670) T ss_pred CCCCCCCEECCCCCEE-------EEEECCCHHHHCCCCCEEEEEECCHHHHHHCCCCCCEEEEEEECCCCCCCCCEEEEE T ss_conf 4577885655788704-------466516435531567348999414465641799873156755227633788657982 Q ss_pred E Q ss_conf 5 Q gi|254780267|r 165 E 165 (171) Q Consensus 165 ~ 165 (171) + T Consensus 670 ~ 670 (670) T KOG0238 670 E 670 (670) T ss_pred C T ss_conf 9 No 21 >cd06849 lipoyl_domain Lipoyl domain of the dihydrolipoyl acyltransferase component (E2) of 2-oxo acid dehydrogenases. 2-oxo acid dehydrogenase multienzyme complexes, like pyruvate dehydrogenase (PDH), 2-oxoglutarate dehydrogenase (OGDH) and branched-chain 2-oxo acid dehydrogenase (BCDH), contain at least three different enzymes, 2-oxo acid dehydrogenase (E1), dihydrolipoyl acyltransferase (E2) and dihydrolipoamide dehydrogenase (E3) and play a key role in redox regulation. E2, the central component of the complex, catalyzes the transfer of the acyl group of CoA from E1 to E3 via reductive acetylation of a lipoyl group covalently attached to a lysine residue. Probab=99.33 E-value=2.3e-12 Score=95.46 Aligned_cols=56 Identities=30% Similarity=0.504 Sum_probs=53.5 Q ss_pred CCEECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCCCCCCCCCEEEEE Q ss_conf 10001244201043489997377268999899819999984898683699729999 Q gi|254780267|r 109 DPFVNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQSVEYGDALLVL 164 (171) Q Consensus 109 ~~~V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i 164 (171) +-++++||+|++||+||.+|+|||..+|.||++|+|.+++++.|+.|..|++|..| T Consensus 19 ~w~v~~Gd~V~~gd~l~~vEt~K~~~~i~a~~~G~v~~i~~~~G~~v~~G~~l~~i 74 (74) T cd06849 19 EWLVKEGDSVEEGDVLAEVETDKATVEVEAPAAGVLAKILVEEGDTVPVGQVIAVI 74 (74) T ss_pred EEEECCCCEECCCCEEEEEECCCEEEEEECCCCEEEEEEEECCCCEECCCCEEEEC T ss_conf 99927989997899999998585078410798999999987899998899999979 No 22 >PRK11855 dihydrolipoamide acetyltransferase; Reviewed Probab=99.26 E-value=9.7e-11 Score=85.64 Aligned_cols=61 Identities=34% Similarity=0.525 Sum_probs=57.0 Q ss_pred CCEECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCCCCCCCCCEEEEEEECCC Q ss_conf 1000124420104348999737726899989981999998489868369972999953478 Q gi|254780267|r 109 DPFVNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQSVEYGDALLVLEKTGD 169 (171) Q Consensus 109 ~~~V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~~~~ 169 (171) +-++++||+|++||+||.||.-|.-.+|.|+.+|+|.+|+++.|+.|.-|++|+.|+..++ T Consensus 136 ~W~~k~GD~V~~gd~l~eVETDKa~~Ev~sp~~G~l~ki~v~eGd~v~VG~~ia~i~~~~~ 196 (549) T PRK11855 136 EWLVKVGDTVEEDQSLITVETDKATMEIPSPVAGVVKEIKVKVGDKVSVGSLLFKIEVAGA 196 (549) T ss_pred EEECCCCCEECCCCCEEEEEECCCCEEEECCCCCEEEEEEECCCCCCCCCCEEEEEECCCC T ss_conf 7840789801368725788605531267436587544765058980457987899725777 No 23 >PTZ00144 dihydrolipoamide succinyltransferase; Provisional Probab=99.20 E-value=4.4e-11 Score=87.70 Aligned_cols=61 Identities=25% Similarity=0.318 Sum_probs=57.1 Q ss_pred CCEECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCCCCCCCCCEEEEEEECCC Q ss_conf 1000124420104348999737726899989981999998489868369972999953478 Q gi|254780267|r 109 DPFVNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQSVEYGDALLVLEKTGD 169 (171) Q Consensus 109 ~~~V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~~~~ 169 (171) +=+|++||+|++||+||.||.-|.-.||+||.+|+|.+|+++.|+.|.-|++|..|+-+++ T Consensus 76 ~WlvkeGD~V~~gd~L~EVETDKA~~EVeSp~~G~l~kIlv~eGd~V~VG~~la~I~~~~e 136 (430) T PTZ00144 76 EWKKKVGDYVKVDETIAIVETDKVSVDINSKFSGVLSKIFAEVGDTVLVGKPLCEIDTSGE 136 (430) T ss_pred EEECCCCCEECCCCCEEEEECCCCCEEECCCCCEEEEEEEECCCCEEECCCEEEEEECCCC T ss_conf 9981899986899908999858841796577897999986379998946986999966878 No 24 >PRK11855 dihydrolipoamide acetyltransferase; Reviewed Probab=99.19 E-value=5.3e-11 Score=87.21 Aligned_cols=61 Identities=36% Similarity=0.547 Sum_probs=57.7 Q ss_pred CCEECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCCCCCCCCCEEEEEEECCC Q ss_conf 1000124420104348999737726899989981999998489868369972999953478 Q gi|254780267|r 109 DPFVNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQSVEYGDALLVLEKTGD 169 (171) Q Consensus 109 ~~~V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~~~~ 169 (171) +-+|++||+|++||+||.||.-|--.+|+||++|+|.+|+++.|+.|.-|++|+.|+..++ T Consensus 21 ~W~v~~GD~V~~~d~l~evETDKa~~Evps~~~G~v~~i~v~~Gd~v~vG~~i~~i~~~~~ 81 (549) T PRK11855 21 EWLVKEGDTVEEDQPLVTVETDKATMEIPSPAAGVVKKIKVKVGDTVSVGAPLAVIEAAGA 81 (549) T ss_pred EEEECCCCEECCCCEEEEEECCCCCEEECCCCCEEEEEEEECCCCEECCCCEEEEEECCCC T ss_conf 9995799987899928999817713684388897999996189988469985999956877 No 25 >PRK05704 dihydrolipoamide succinyltransferase; Validated Probab=99.17 E-value=6.6e-11 Score=86.64 Aligned_cols=61 Identities=23% Similarity=0.367 Sum_probs=57.7 Q ss_pred CCEECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCCCCCCCCCEEEEEEECCC Q ss_conf 1000124420104348999737726899989981999998489868369972999953478 Q gi|254780267|r 109 DPFVNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQSVEYGDALLVLEKTGD 169 (171) Q Consensus 109 ~~~V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~~~~ 169 (171) +-+|++||+|++||+||.||.-|.-.+|+||.+|+|.+|+++.|+.|.-|++|..|+..++ T Consensus 21 ~Wlvk~GD~V~~gd~L~eVETDKa~~Ei~s~~~G~l~ki~~~eGd~v~VG~~ia~I~~~~~ 81 (406) T PRK05704 21 TWHKKPGDAVARDEVLVEIETDKVVLEVPAPAAGVLSEILAEEGDTVTVGQVLGRIDEGAA 81 (406) T ss_pred EEECCCCCEECCCCEEEEEECCCCCEEECCCCCEEEEEEEECCCCEECCCCEEEEEECCCC T ss_conf 9981899997899908999838620797488898999983089899668998999826887 No 26 >TIGR01348 PDHac_trf_long pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase; InterPro: IPR006256 This group of sequences are a subset of pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase specifically close by both phylogenetic and percent identity (UPGMA) trees. Members contain two or three copies of the lipoyl-binding domain. Escherichia coli AceF is included in this set while mitochondrial and some other bacterial forms are excluded. dihydrolipoyllysine-residue acetyltransferase E2 component of pyruvate dehydrogenase complex; GO: 0004742 dihydrolipoyllysine-residue acetyltransferase activity, 0006096 glycolysis, 0045254 pyruvate dehydrogenase complex. Probab=99.13 E-value=5.6e-10 Score=81.00 Aligned_cols=83 Identities=31% Similarity=0.453 Sum_probs=69.5 Q ss_pred CCCCEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCCCCCCCCCEEEEEEE Q ss_conf 46626870462567740367641000124420104348999737726899989981999998489868369972999953 Q gi|254780267|r 87 DNYHTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQSVEYGDALLVLEK 166 (171) Q Consensus 87 ~~~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~ 166 (171) .+...|+=|=+|--|. ..+=-+.+|+|||+|.++|.|.-||.-|--.||+|+++|+|+.+-|+-||.|..|+.++.++- T Consensus 204 sg~~~v~vPDiGdn~~-~~~V~evlV~vGDtV~~~QslItLE~DKA~~EvPa~asGvv~~v~v~vGD~v~~G~~il~L~~ 282 (655) T TIGR01348 204 SGVQEVKVPDIGDNIE-KVEVIEVLVKVGDTVSADQSLITLESDKASMEVPAPASGVVKSVKVKVGDSVSTGDLILVLSV 282 (655) T ss_pred CCCEEEECCCCCCCCC-CEEEEEEEECCCCEECCCCCEEEEECCEEEEEECCCCCCEEEEEEEEECCEECCCCEEEEEEE T ss_conf 5741544265377777-502799995437776478867887067147874689852378999865667057888999710 Q ss_pred CCCC Q ss_conf 4788 Q gi|254780267|r 167 TGDN 170 (171) Q Consensus 167 ~~~~ 170 (171) -+++ T Consensus 283 ~~st 286 (655) T TIGR01348 283 AGST 286 (655) T ss_pred CCCC T ss_conf 5677 No 27 >COG0508 AceF Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes [Energy production and conversion] Probab=99.07 E-value=4.4e-10 Score=81.66 Aligned_cols=61 Identities=28% Similarity=0.441 Sum_probs=57.9 Q ss_pred CCEECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCCCCCCCCCEEEEEEECCC Q ss_conf 1000124420104348999737726899989981999998489868369972999953478 Q gi|254780267|r 109 DPFVNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQSVEYGDALLVLEKTGD 169 (171) Q Consensus 109 ~~~V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~~~~ 169 (171) +-++++||+|++||+|+-||.-|...||+||++|+|.+|+++.|+.|.-|++|..|...++ T Consensus 21 ~W~~k~GD~V~~gd~L~eVeTDKa~~EV~ap~~G~l~~i~~~~G~~V~Vg~~I~~i~~~~~ 81 (404) T COG0508 21 EWLKKVGDKVKEGDVLVEVETDKATMEVPAPDAGVLAKILVEEGDTVPVGAVIARIEEEGA 81 (404) T ss_pred EEEECCCCEECCCCCEEEEEECCEEEEECCCCCCEEEEEECCCCCEECCCCEEEEEECCCC T ss_conf 8972699863699806999806533551477785996762058988758985899933777 No 28 >PRK11854 aceF dihydrolipoamide acetyltransferase; Validated Probab=99.07 E-value=1.1e-09 Score=79.22 Aligned_cols=60 Identities=32% Similarity=0.530 Sum_probs=43.0 Q ss_pred CCEECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCCCCCCCCCEEEEEEECC Q ss_conf 100012442010434899973772689998998199999848986836997299995347 Q gi|254780267|r 109 DPFVNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQSVEYGDALLVLEKTG 168 (171) Q Consensus 109 ~~~V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~~~ 168 (171) +-+|++||+|++||+||.+|.-|--.+|+||++|+|.+|+++.|+.|..|++|+.|+..+ T Consensus 122 ~~~~~~Gd~v~~~~~~~~vetdKa~~evps~~~G~~~~i~~~~g~~v~~g~~i~~i~~~~ 181 (630) T PRK11854 122 EILVKVGDTVEAEQSLITVEGDKASMEVPAPFAGTVKEIKVNVGDKVSTGSLIMVFEVAG 181 (630) T ss_pred EEEECCCCEEECCCCEEEEECCCEEEEEECCCCCEEEEECCCCCCEECCCCEEEEECCCC T ss_conf 997247988723881578752524688526888168871268998844797678853678 No 29 >PRK11856 branched-chain alpha-keto acid dehydrogenase subunit E2; Reviewed Probab=99.05 E-value=5e-10 Score=81.30 Aligned_cols=57 Identities=26% Similarity=0.466 Sum_probs=54.2 Q ss_pred CCEECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCCCCCCCCCEEEEEE Q ss_conf 100012442010434899973772689998998199999848986836997299995 Q gi|254780267|r 109 DPFVNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQSVEYGDALLVLE 165 (171) Q Consensus 109 ~~~V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~ 165 (171) .=++++||.|++||+||.||.-|.-++|+||.+|++.+|+++.|+.|.-|++|..|. T Consensus 20 ~W~~~~Gd~V~~g~~l~eVeTDKa~~ei~a~~~G~l~~i~~~~G~~v~VG~~ia~i~ 76 (324) T PRK11856 20 EWLVKEGDTVKKGDPLAEVETDKATVEVESPVAGTLRKILVEEGDVVPVGAPIAVIV 76 (324) T ss_pred EEEECCCCEECCCCEEEEEECCCEEEEEECCCCEEEEEEECCCCCEECCCCEEEEEC T ss_conf 999089999789994999980766689806888399899608979957999899944 No 30 >PRK11854 aceF dihydrolipoamide acetyltransferase; Validated Probab=99.02 E-value=8e-10 Score=80.05 Aligned_cols=61 Identities=30% Similarity=0.485 Sum_probs=57.5 Q ss_pred CCEECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCCCCCCCCCEEEEEEECCC Q ss_conf 1000124420104348999737726899989981999998489868369972999953478 Q gi|254780267|r 109 DPFVNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQSVEYGDALLVLEKTGD 169 (171) Q Consensus 109 ~~~V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~~~~ 169 (171) +-++++||+|++||+||.+|.-|.-.+|.||++|+|.+|+++.|+.|..|++|..|+..+. T Consensus 223 ewlvkvGD~V~~~d~l~evETDKa~~EvpSp~aG~v~~i~v~~Gd~v~vG~~ia~i~~~~~ 283 (630) T PRK11854 223 EVMVKVGDKVEAEQSLITVEGDKASMEVPAPFAGTVKEIKVNVGDKVSTGSLIMRFEVEGA 283 (630) T ss_pred EECCCCCCCCCCCCCCCCCCCCCCCEECCCCCCCCCEEEEECCCCCCCCCCEEEEECCCCC T ss_conf 4224679844668743011257743113588777411564058981456887888426777 No 31 >TIGR01347 sucB 2-oxoglutarate dehydrogenase, E2 component, dihydrolipoamide succinyltransferase; InterPro: IPR006255 These sequences describe the TCA cycle 2-oxoglutarate system E2 component, dihydrolipoamide succinyltransferase. It is closely related to the pyruvate dehydrogenase E2 component, dihydrolipoamide acetyltransferase. Members include the mitochondrial and Gram-negative bacterial forms. Mycobacterial candidates are highly derived, differ in having and extra copy of the lipoyl-binding domain at the N-terminus and are generally excluded from this group of sequences as are all examples of dihydrolipoamide acetyltransferase. ; GO: 0004149 dihydrolipoyllysine-residue succinyltransferase activity, 0006099 tricarboxylic acid cycle, 0045252 oxoglutarate dehydrogenase complex. Probab=98.95 E-value=6.5e-10 Score=80.62 Aligned_cols=55 Identities=31% Similarity=0.473 Sum_probs=53.7 Q ss_pred ECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCCCCCCCCCEEEEEEE Q ss_conf 0124420104348999737726899989981999998489868369972999953 Q gi|254780267|r 112 VNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQSVEYGDALLVLEK 166 (171) Q Consensus 112 V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~ 166 (171) =+|||.|++|..||=||--|...||.||+||+|.+|+.++|+.|.-||.|..|++ T Consensus 22 KkvGD~V~~DE~iveiETDKV~lEV~Sp~dGvl~~~~~~eG~TV~~g~~La~~~E 76 (435) T TIGR01347 22 KKVGDTVKRDELIVEIETDKVVLEVPSPADGVLQEILEKEGDTVTSGQVLAVLEE 76 (435) T ss_pred CCCCCEEECCCEEEEEECCCEEEEEECCCCCHHHHHHCCCCCEECCCEEEEEEEC T ss_conf 6888711227768975207446788288530217430689888505507788823 No 32 >TIGR01348 PDHac_trf_long pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase; InterPro: IPR006256 This group of sequences are a subset of pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase specifically close by both phylogenetic and percent identity (UPGMA) trees. Members contain two or three copies of the lipoyl-binding domain. Escherichia coli AceF is included in this set while mitochondrial and some other bacterial forms are excluded. dihydrolipoyllysine-residue acetyltransferase E2 component of pyruvate dehydrogenase complex; GO: 0004742 dihydrolipoyllysine-residue acetyltransferase activity, 0006096 glycolysis, 0045254 pyruvate dehydrogenase complex. Probab=98.86 E-value=5e-09 Score=75.24 Aligned_cols=60 Identities=28% Similarity=0.496 Sum_probs=57.3 Q ss_pred CCEECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCCCCCCCCCEEEEEEECC Q ss_conf 100012442010434899973772689998998199999848986836997299995347 Q gi|254780267|r 109 DPFVNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQSVEYGDALLVLEKTG 168 (171) Q Consensus 109 ~~~V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~~~ 168 (171) +.+|++||+|.+||.|.-||.-|--.||+|+.+|+|++|-|+.||....|..+..|+-.+ T Consensus 19 evLV~~GD~V~~~Qslit~E~DKA~mEvPss~AG~ik~ikvkVGDt~~~G~~i~~l~~~~ 78 (655) T TIGR01348 19 EVLVKVGDKVEAGQSLITLESDKASMEVPSSKAGIIKEIKVKVGDTLKEGDVIALLEVSA 78 (655) T ss_pred EEEECCCCEECCCCCEEEECCCEEEEEECCCCCCEEEEEEEEECCCCCCCCEEEEECCCC T ss_conf 998688974326874688706712784068999737889754466547787899971665 No 33 >KOG0559 consensus Probab=98.83 E-value=4.2e-09 Score=75.66 Aligned_cols=57 Identities=30% Similarity=0.427 Sum_probs=55.1 Q ss_pred EECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCCCCCCCCCEEEEEEEC Q ss_conf 001244201043489997377268999899819999984898683699729999534 Q gi|254780267|r 111 FVNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQSVEYGDALLVLEKT 167 (171) Q Consensus 111 ~V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~~ 167 (171) +.++||+|++...||-||--|.-.+|.||.+|+|.++||++|+.|+.||.|+.|++. T Consensus 93 lK~~Gd~v~~DE~va~IETDK~tv~V~sP~sGvi~e~lvk~gdtV~~g~~la~i~~g 149 (457) T KOG0559 93 LKKVGDRVNEDEAVAEIETDKTTVEVPSPASGVITELLVKDGDTVTPGQKLAKISPG 149 (457) T ss_pred HHCCCCCCCCCHHHEEEECCCEEEECCCCCCCEEEEEECCCCCCCCCCCEEEEECCC T ss_conf 647464213001311542153233136887642457740789750478645773588 No 34 >PRK11578 macrolide transporter subunit MacA; Provisional Probab=98.29 E-value=9.6e-07 Score=61.37 Aligned_cols=74 Identities=27% Similarity=0.333 Sum_probs=59.5 Q ss_pred CCCCCEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEECC----------------------------------- Q ss_conf 546626870462567740367641000124420104348999737----------------------------------- Q gi|254780267|r 86 PDNYHTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAM----------------------------------- 130 (171) Q Consensus 86 ~~~~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAM----------------------------------- 130 (171) ..+...|.|++.|++ ...+|++||.|++||+|+.|+.= T Consensus 35 a~~~~~i~a~v~G~I-------~~i~v~~Gd~V~kGqvL~~ld~~~~~~~l~~a~a~l~~~~a~l~~a~~~~~~a~~~~~ 107 (347) T PRK11578 35 ALRKVDVGAQVSGQL-------KTLSVAIGDKVKKDQLLGVIDPEQAENQIKEVEATLMELNAQRQQAEAELKLARVTLS 107 (347) T ss_pred EEEEEEEECCCCEEE-------EEEECCCCCEECCCCEEEEECHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 838999974077899-------9999899398889999999874999999999999999999999999999999899999 Q ss_pred --------------------------C------------------------EEEEEECCCCCEEEEEECCCCCCCCCCCE Q ss_conf --------------------------7------------------------26899989981999998489868369972 Q gi|254780267|r 131 --------------------------K------------------------TMNHIVAPCSGKVQDINVKDGQSVEYGDA 160 (171) Q Consensus 131 --------------------------K------------------------m~n~I~a~~~G~I~~i~v~~G~~V~~gq~ 160 (171) + -...|+||++|+|.+.+++.|+.|..|++ T Consensus 108 r~~~L~~~~~is~~~~d~a~~~~~~~~a~l~~~~a~~~~~~a~l~~a~~~l~~t~I~AP~~GvV~~~~~~~G~~v~~~~~ 187 (347) T PRK11578 108 RQQRLAKTQAVSQQDLDTAATEMAVKQAQIGTIDAQIKRNQASLDTAKTNLDYTRIVAPMAGEVTQITTLQGQTVIAAQQ 187 (347) T ss_pred HHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEECCCCCEEEEEEEECCCEEECCCC T ss_conf 99999875762478898878799987645889999999999999999876525699825784486799833988863788 Q ss_pred ---EEEEEE Q ss_conf ---999953 Q gi|254780267|r 161 ---LLVLEK 166 (171) Q Consensus 161 ---L~~i~~ 166 (171) ||.|-. T Consensus 188 ~~~l~~i~~ 196 (347) T PRK11578 188 APNILTLAD 196 (347) T ss_pred CCEEEEECC T ss_conf 741357316 No 35 >KOG0368 consensus Probab=98.28 E-value=2.2e-06 Score=59.14 Aligned_cols=74 Identities=30% Similarity=0.412 Sum_probs=60.3 Q ss_pred CCCCEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCCCCCCCCCEEEEEEE Q ss_conf 46626870462567740367641000124420104348999737726899989981999998489868369972999953 Q gi|254780267|r 87 DNYHTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQSVEYGDALLVLEK 166 (171) Q Consensus 87 ~~~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~ 166 (171) .+...+.||.+|.+. .-+|+-|++|.+||+-+-||.|||-.++.|..+|+|.-| ...|+..+.|..|-+++= T Consensus 683 nDpt~LrsPs~GKLl-------~ylVedG~hv~~Gq~YAeiEvMKMvm~lva~~~G~i~~i-~~~G~~i~aG~vlakL~l 754 (2196) T KOG0368 683 NDPTVLRSPSPGKLL-------QYLVEDGEHVEAGQPYAEIEVMKMVMPLVAKEPGRIQLI-KQEGDAIEAGSVLAKLTL 754 (2196) T ss_pred CCCCEECCCCCCCCE-------EEEECCCCEEECCCEEEEHEEEEEEEEEECCCCCEEEEE-CCCCCCCCCCCEEEEEEC T ss_conf 896433489986205-------899548971443872000120002445630578668983-589973577663788744 Q ss_pred CC Q ss_conf 47 Q gi|254780267|r 167 TG 168 (171) Q Consensus 167 ~~ 168 (171) +. T Consensus 755 Dd 756 (2196) T KOG0368 755 DD 756 (2196) T ss_pred CC T ss_conf 88 No 36 >PRK03598 hypothetical protein; Provisional Probab=98.18 E-value=4.3e-06 Score=57.44 Aligned_cols=35 Identities=23% Similarity=0.441 Sum_probs=30.6 Q ss_pred EEEECCCCCEEEEEECCCCCCCCCCCEEEEEEECC Q ss_conf 89998998199999848986836997299995347 Q gi|254780267|r 134 NHIVAPCSGKVQDINVKDGQSVEYGDALLVLEKTG 168 (171) Q Consensus 134 n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~~~ 168 (171) ..|.||+||+|....++.|+.|..|++||.|-... T Consensus 204 t~I~AP~dGvV~~~~v~~Ge~V~~G~~l~ti~~~~ 238 (331) T PRK03598 204 TTLIAPSDGTILTRAVEPGTMLSAGSTVFTLSLTR 238 (331) T ss_pred CEEECCCCEEEEEEECCCCCEECCCCCEEEEEECC T ss_conf 79978996699989758898848998179999179 No 37 >PRK10476 multidrug resistance protein MdtN; Provisional Probab=98.08 E-value=1.4e-05 Score=54.39 Aligned_cols=37 Identities=22% Similarity=0.302 Sum_probs=32.6 Q ss_pred EEEEECCCCCEEEEEECCCCCCCCCCCEEEEEEECCC Q ss_conf 6899989981999998489868369972999953478 Q gi|254780267|r 133 MNHIVAPCSGKVQDINVKDGQSVEYGDALLVLEKTGD 169 (171) Q Consensus 133 ~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~~~~ 169 (171) ...|.||+||+|.+..++.|+.|..|+|||.|=+.++ T Consensus 210 ~t~I~AP~dG~V~~~~v~~G~~V~pG~pl~~lv~~~~ 246 (348) T PRK10476 210 DTTVRAPFDGRVVGLKVSVGEFAAPMQPIFTLIDTDH 246 (348) T ss_pred CCEEECCCCEEEEEEECCCCCEECCCCEEEEEEECCC T ss_conf 5899778984999886699987789984899996895 No 38 >PRK10559 p-hydroxybenzoic acid efflux subunit AaeA; Provisional Probab=98.06 E-value=1.1e-05 Score=54.91 Aligned_cols=73 Identities=26% Similarity=0.398 Sum_probs=60.1 Q ss_pred CEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEECCC-------------------------------------- Q ss_conf 268704625677403676410001244201043489997377-------------------------------------- Q gi|254780267|r 90 HTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAMK-------------------------------------- 131 (171) Q Consensus 90 ~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAMK-------------------------------------- 131 (171) ..|.+-..|.+ .+.+|+.||+|++||+|+.|+.=- T Consensus 48 v~Vsp~VsG~V-------~eV~V~dnq~VkkGdvL~~ID~~~y~~al~qA~A~la~~qa~~~~~~~e~~R~~~l~~~aiS 120 (310) T PRK10559 48 VAIAPDVSGLI-------TQVNVHDNQLVKKGQVLFTIDQPRYQKALAEAEADVAYYQVLAQEKRQEAGRRNRLGVQAMS 120 (310) T ss_pred EEEECCCCEEE-------EEEECCCCCEECCCCEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 99936674699-------99991794987699889998761799999999999999999999999999999986651005 Q ss_pred --------------------------------EEEEEECCCCCEEEEEECCCCCCCCCCCEEEEEEECCC Q ss_conf --------------------------------26899989981999998489868369972999953478 Q gi|254780267|r 132 --------------------------------TMNHIVAPCSGKVQDINVKDGQSVEYGDALLVLEKTGD 169 (171) Q Consensus 132 --------------------------------m~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~~~~ 169 (171) =...|.||+||.|.+..++.|+.|..||+||.|=+..+ T Consensus 121 ~~~ld~a~~~l~~a~aql~~A~A~l~~A~l~L~~T~I~AP~dG~V~~~~v~~Gq~V~~Gq~l~slV~~~~ 190 (310) T PRK10559 121 REEIDQANNVLQTVLHQLAKAQATRDLAKLDLERTVIRAPADGWVTNLNVYTGEFITRGSTAVALVKKNS 190 (310) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEECCCCCEEEEEECCCCCEECCCCEEEEEEECCC T ss_conf 9999999999999999999999999999988725878179886784455488988569985899996896 No 39 >KOG0557 consensus Probab=98.03 E-value=1.5e-05 Score=54.17 Aligned_cols=59 Identities=22% Similarity=0.410 Sum_probs=55.1 Q ss_pred ECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCC-CCCCCCCEEEEEEECCCC Q ss_conf 01244201043489997377268999899819999984898-683699729999534788 Q gi|254780267|r 112 VNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDG-QSVEYGDALLVLEKTGDN 170 (171) Q Consensus 112 V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G-~~V~~gq~L~~i~~~~~~ 170 (171) -++||+++.||+||-||.-|--..+++.-+|..-+||++.| .-|.-|++|-.|=++.++ T Consensus 60 kKeGdkls~GDvl~EVETDKAtmd~E~~ddGyLAKILi~EGskdvpVGk~Iaiive~e~d 119 (470) T KOG0557 60 KKEGDKLSAGDVLLEVETDKATMDVEAQDDGYLAKILIEEGSKDVPVGKPIAIIVEDEDD 119 (470) T ss_pred ECCCCCCCCCCEEEEEECCCCEEEEEECCCCEEEEEEECCCCCCCCCCCCEEEEECCCCC T ss_conf 225775688865899960451233440367765455413576556688715999526333 No 40 >cd06251 M14_ASTE_ASPA_like_1 A functionally uncharacterized subgroup of the Succinylglutamate desuccinylase (ASTE)/aspartoacylase (ASPA) subfamily which is part of the M14 family of metallocarboxypeptidases. ASTE catalyzes the fifth and last step in arginine catabolism by the arginine succinyltransferase pathway, and aspartoacylase (ASPA, also known as aminoacylase 2, and ACY-2; EC:3.5.1.15) cleaves N-acetyl L-aspartic acid (NAA) into aspartate and acetate. NAA is abundant in the brain, and hydrolysis of NAA by ASPA may help maintain white matter. ASPA is an NAA scavenger in other tissues. Mutations in the gene encoding ASPA cause Canavan disease (CD), a fatal progressive neurodegenerative disorder involving dysmyelination and spongiform degeneration of white matter in children. This enzyme binds zinc which is necessary for activity. Measurement of elevated NAA levels in urine is used in the diagnosis of CD. Probab=98.02 E-value=4.8e-05 Score=51.04 Aligned_cols=68 Identities=29% Similarity=0.305 Sum_probs=56.4 Q ss_pred CCCCEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEECC--CEEEEEECCCCCEEEEEECCCCCCCCCCCEEEEE Q ss_conf 46626870462567740367641000124420104348999737--7268999899819999984898683699729999 Q gi|254780267|r 87 DNYHTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAM--KTMNHIVAPCSGKVQDINVKDGQSVEYGDALLVL 164 (171) Q Consensus 87 ~~~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAM--Km~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i 164 (171) .....|.||-.|.|+ ++++.||.|++||+|+.|--- ....++.||.+|+|...... -.|..|+.|+.| T Consensus 217 ~~~~~v~A~~~Gl~~--------~~~~~G~~V~~Gq~lg~I~dp~g~~~~~v~Ap~dG~v~~~~~~--p~V~~Gd~l~~I 286 (287) T cd06251 217 RSSVWVRAPQGGLLR--------SLVKLGDKVKKGQLLATITDPFGEEEAEVKAPFDGIVIGRNNL--PLVNEGDALFHI 286 (287) T ss_pred CCCEEEECCCCCEEE--------ECCCCCCEECCCCEEEEEECCCCCCEEEEECCCCEEEEEEECC--CCCCCCCEEEEE T ss_conf 586799769998798--------8589889947999899997899994289984898399985079--830799878995 No 41 >cd06252 M14_ASTE_ASPA_like_2 A functionally uncharacterized subgroup of the Succinylglutamate desuccinylase (ASTE)/aspartoacylase (ASPA) subfamily which is part of the M14 family of metallocarboxypeptidases. ASTE catalyzes the fifth and last step in arginine catabolism by the arginine succinyltransferase pathway, and aspartoacylase (ASPA, also known as aminoacylase 2, and ACY-2; EC:3.5.1.15) cleaves N-acetyl L-aspartic acid (NAA) into aspartate and acetate. NAA is abundant in the brain, and hydrolysis of NAA by ASPA may help maintain white matter. ASPA is an NAA scavenger in other tissues. Mutations in the gene encoding ASPA cause Canavan disease (CD), a fatal progressive neurodegenerative disorder involving dysmyelination and spongiform degeneration of white matter in children. This enzyme binds zinc which is necessary for activity. Measurement of elevated NAA levels in urine is used in the diagnosis of CD. Probab=98.02 E-value=4.2e-05 Score=51.39 Aligned_cols=70 Identities=30% Similarity=0.291 Sum_probs=56.8 Q ss_pred CCCCEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEECC----CEEEEEECCCCCEEEEEECCCCCCCCCCCEEE Q ss_conf 46626870462567740367641000124420104348999737----72689998998199999848986836997299 Q gi|254780267|r 87 DNYHTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAM----KTMNHIVAPCSGKVQDINVKDGQSVEYGDALL 162 (171) Q Consensus 87 ~~~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAM----Km~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~ 162 (171) +....|.||--|.|+ +.++.||.|++||+|+.|--. -...+|.||.+|+|.-. ...-.|..|+.|+ T Consensus 242 ~~~~~v~Ap~~Gif~--------~~v~lG~~V~kGq~lg~I~dp~~~g~~~~~v~Ap~dGiV~~~--~~~p~V~~Gd~la 311 (316) T cd06252 242 DARCYVFAPHPGLFE--------PLVDLGDEVSAGQVAGRIHFPERPGRPPLEIRAPDGGVLAAR--RPPGLVRRGDCLA 311 (316) T ss_pred CCCEEEECCCCEEEE--------EECCCCCEECCCCEEEEEECCCCCCCCEEEEEECCCEEEEEE--ECCCCCCCCCEEE T ss_conf 897799769986999--------917889997799999999688779972399990898399995--0798205898689 Q ss_pred EEEE Q ss_conf 9953 Q gi|254780267|r 163 VLEK 166 (171) Q Consensus 163 ~i~~ 166 (171) .|-. T Consensus 312 ~Ia~ 315 (316) T cd06252 312 VLAA 315 (316) T ss_pred EEEC T ss_conf 9955 No 42 >cd06253 M14_ASTE_ASPA_like_3 A functionally uncharacterized subgroup of the Succinylglutamate desuccinylase (ASTE)/aspartoacylase (ASPA) subfamily which is part of the M14 family of metallocarboxypeptidases. ASTE catalyzes the fifth and last step in arginine catabolism by the arginine succinyltransferase pathway, and aspartoacylase (ASPA, also known as aminoacylase 2, and ACY-2; EC:3.5.1.15) cleaves N-acetyl L-aspartic acid (NAA) into aspartate and acetate. NAA is abundant in the brain, and hydrolysis of NAA by ASPA may help maintain white matter. ASPA is an NAA scavenger in other tissues. Mutations in the gene encoding ASPA cause Canavan disease (CD), a fatal progressive neurodegenerative disorder involving dysmyelination and spongiform degeneration of white matter in children. This enzyme binds zinc which is necessary for activity. Measurement of elevated NAA levels in urine is used in the diagnosis of CD. Probab=98.02 E-value=5.1e-05 Score=50.91 Aligned_cols=68 Identities=19% Similarity=0.362 Sum_probs=54.9 Q ss_pred CCCCEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEE-ECC--CEEEEEECCCCCEEEEEECCCCCCCCCCCEEEE Q ss_conf 46626870462567740367641000124420104348999-737--726899989981999998489868369972999 Q gi|254780267|r 87 DNYHTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLII-EAM--KTMNHIVAPCSGKVQDINVKDGQSVEYGDALLV 163 (171) Q Consensus 87 ~~~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~ii-EAM--Km~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~ 163 (171) .+...|.||.-|.|+ +.++.||.|++||+|+.| ... ....+|.||.+|+|..+. ..-.|..|+.|+. T Consensus 227 ~~~~~i~Ap~~Gl~~--------~~~~lG~~V~~G~~lg~I~dp~~G~~~~~v~Ap~dGiv~~~~--~~p~V~~Gd~l~~ 296 (298) T cd06253 227 RDVVYVNAETSGIFV--------PAKHLGDIVKRGDVIGEIVDPLEGEVIEEVIAPCDGILFTLR--EYPLVYEGSLVAR 296 (298) T ss_pred CCEEEEECCCCEEEE--------EECCCCCEECCCCEEEEEECCCCCCCEEEEECCCCEEEEEEC--CCCCCCCCCEEEE T ss_conf 870899659998999--------946889988789999999688899965999928986999811--7881078987999 Q ss_pred E Q ss_conf 9 Q gi|254780267|r 164 L 164 (171) Q Consensus 164 i 164 (171) | T Consensus 297 I 297 (298) T cd06253 297 I 297 (298) T ss_pred E T ss_conf 6 No 43 >PRK11556 multidrug efflux system subunit MdtA; Provisional Probab=97.85 E-value=3.7e-05 Score=51.75 Aligned_cols=76 Identities=29% Similarity=0.394 Sum_probs=59.7 Q ss_pred CCCCCCEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEECCC--------------------------------- Q ss_conf 45466268704625677403676410001244201043489997377--------------------------------- Q gi|254780267|r 85 SPDNYHTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAMK--------------------------------- 131 (171) Q Consensus 85 ~~~~~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAMK--------------------------------- 131 (171) ...+...|.+...|++ ...+|++||.|++||+|+.|+.-. T Consensus 83 ~a~~~v~v~~~VsG~I-------~~i~v~~Gd~VkkGqvLa~iD~~~~~~~l~qA~A~l~~a~A~l~~A~~~~~R~~~L~ 155 (415) T PRK11556 83 TAANTVTVRSRVDGQL-------MALHFQEGQQVKAGDLLAEIDPSQFKVALAQAQGQLAKDQATLANARRDLARYQQLA 155 (415) T ss_pred EEEEEEEEEEEEEEEE-------EEEECCCCCEECCCCEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 8646999987850299-------999858989978999999988357899999999999999989999999999999987 Q ss_pred --------------------------------------EEEEEECCCCCEEEEEECCCCCCCCCCC--EEEEEEEC Q ss_conf --------------------------------------2689998998199999848986836997--29999534 Q gi|254780267|r 132 --------------------------------------TMNHIVAPCSGKVQDINVKDGQSVEYGD--ALLVLEKT 167 (171) Q Consensus 132 --------------------------------------m~n~I~a~~~G~I~~i~v~~G~~V~~gq--~L~~i~~~ 167 (171) =...|+||++|+|...+++.|+.|..|+ +|+.|-.. T Consensus 156 ~~~~iS~~~ld~~~a~~~~A~A~l~~a~a~l~~A~~~L~~t~I~AP~~GvV~~r~v~~G~~V~~G~~~pl~~i~~~ 231 (415) T PRK11556 156 KTNLVSRQELDAQQALVSETEGTIKADEAAVASAQLQLDWSRITAPISGRVGLKQVDVGNQISSGDTTGIVVITQT 231 (415) T ss_pred HCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEECCCCCEEEEEECCCCCEECCCCCCCEEEEECC T ss_conf 6599569999989999999999999999999999875248899826651577688589973777886526999527 No 44 >PRK09578 periplasmic multidrug efflux lipoprotein precursor; Reviewed Probab=97.81 E-value=9.6e-05 Score=49.22 Aligned_cols=75 Identities=20% Similarity=0.314 Sum_probs=58.2 Q ss_pred CCCCCEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEECC----------------C------------------ Q ss_conf 546626870462567740367641000124420104348999737----------------7------------------ Q gi|254780267|r 86 PDNYHTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAM----------------K------------------ 131 (171) Q Consensus 86 ~~~~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAM----------------K------------------ 131 (171) ......|.+...|.+ ...+|++||.|++||+|+.|+.= + T Consensus 60 a~~~v~l~~~v~G~v-------~~v~v~~Gd~V~kGq~La~lD~~~~~a~~~~a~a~l~~A~a~~~~A~~~~~R~~~L~~ 132 (385) T PRK09578 60 AYRQAEVRARVAGIV-------TARTYEEGQEVKQGAVLFRIDPAPLKAARDAAAGALAKAEAAHLAAADKRRRYDDLVR 132 (385) T ss_pred EEEEEEEEEECCEEE-------EEEEECCCCEECCCCEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 879999996644699-------9998079998738988999853469999999999999999999999999999999886 Q ss_pred -------------------------------------EEEEEECCCCCEEEEEECCCCCCCCCCC--EEEEEEEC Q ss_conf -------------------------------------2689998998199999848986836997--29999534 Q gi|254780267|r 132 -------------------------------------TMNHIVAPCSGKVQDINVKDGQSVEYGD--ALLVLEKT 167 (171) Q Consensus 132 -------------------------------------m~n~I~a~~~G~I~~i~v~~G~~V~~gq--~L~~i~~~ 167 (171) =...|.||++|+|.+.+++.|+.|..|+ ||+.|... T Consensus 133 ~~~is~~~~~~a~~~~~~a~a~l~~A~a~l~~A~~~l~~t~i~AP~~G~V~~~~v~~G~~v~~~~~~~l~~i~~~ 207 (385) T PRK09578 133 DRAVSERDHTEAVADERQAKAEVASAKAELARAQLQLDYATVTAPIDGRARRALVTEGALVGQDQATPLTTVEQL 207 (385) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEECCCCCCEEEEEECCCCCEECCCCCCCEEEEEEC T ss_conf 366326789999999999999999999999988875410352389774588888178865668887314999826 No 45 >PRK11892 pyruvate dehydrogenase subunit beta; Provisional Probab=97.80 E-value=8.3e-05 Score=49.59 Aligned_cols=62 Identities=24% Similarity=0.333 Sum_probs=55.2 Q ss_pred CCCEECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCCC-CCCCCCEEEEEEECCC Q ss_conf 4100012442010434899973772689998998199999848986-8369972999953478 Q gi|254780267|r 108 SDPFVNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQ-SVEYGDALLVLEKTGD 169 (171) Q Consensus 108 ~~~~V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~-~V~~gq~L~~i~~~~~ 169 (171) ++=++++||.|+.||+|.=||--|--.|++|--.|+|.+|||.+|. .|..+.|+-.|-..+. T Consensus 20 ~kw~~~~gd~v~~gd~~aeietdkatme~ea~deg~~~kilv~~g~~~v~vn~~ia~~~~~~e 82 (464) T PRK11892 20 AKWLKKEGDKVKSGDVIAEIETDKATMEVEAVDEGTLGKILVPEGTEGVKVNTPIAVLLEEGE 82 (464) T ss_pred EEEECCCCCCCCCCCEEEEEEECCEEEEEEECCCCEEEEEEECCCCCCCCCCCCEEEEECCCC T ss_conf 146446897525786689975065668987506748888986677779755762379612576 No 46 >TIGR02927 SucB_Actino 2-oxoglutarate dehydrogenase, E2 component, dihydrolipoamide succinyltransferase; InterPro: IPR014276 This entry represents the mainly actinobacterial clade of the E2 component of the 2-oxoglutarate dehydrogenase complex involved in the TCA cycle. These proteins have multiple domains including the catalytic domain (IPR001078 from INTERPRO), one or two biotin domains (IPR000089 from INTERPRO) and an E3-component binding domain (IPR004167 from INTERPRO).. Probab=97.80 E-value=3.4e-05 Score=51.98 Aligned_cols=74 Identities=23% Similarity=0.330 Sum_probs=60.0 Q ss_pred CCCCCCEEEECCC------EEEEEECCCCCCCEECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCCCCCCCC Q ss_conf 4546626870462------5677403676410001244201043489997377268999899819999984898683699 Q gi|254780267|r 85 SPDNYHTVTSPMV------GTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQSVEYG 158 (171) Q Consensus 85 ~~~~~~~I~SPmv------Gtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~V~~g 158 (171) .......|.=|-. ||+ ..=+=.|||+|+...+|+=+=--|.-+||.||++|++.+|++.+-+.|+-| T Consensus 131 gsg~aT~v~MPeLGESVTEGTv-------t~WLKaVGD~vEvDEPllEVsTDKVDTEiPSPvaGtlleI~a~EDdtVeVG 203 (607) T TIGR02927 131 GSGEATEVEMPELGESVTEGTV-------TRWLKAVGDTVEVDEPLLEVSTDKVDTEIPSPVAGTLLEILAEEDDTVEVG 203 (607) T ss_pred CCCCCCCCCCCCCCCCCCCCHH-------HHHHHHCCCEEEECCCEEEEECCCCCCCCCCCCCCEEEEEEECCCCEEECC T ss_conf 7646563316446762023137-------878863398776337635642572066668871436888981778864315 Q ss_pred CEEEEEE Q ss_conf 7299995 Q gi|254780267|r 159 DALLVLE 165 (171) Q Consensus 159 q~L~~i~ 165 (171) -.|-+|= T Consensus 204 ~vlA~IG 210 (607) T TIGR02927 204 AVLAKIG 210 (607) T ss_pred CEEEEEC T ss_conf 0665423 No 47 >PRK09859 multidrug efflux system protein MdtE; Provisional Probab=97.79 E-value=0.00012 Score=48.60 Aligned_cols=72 Identities=17% Similarity=0.277 Sum_probs=55.2 Q ss_pred CCCEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEECC------------------------------------- Q ss_conf 6626870462567740367641000124420104348999737------------------------------------- Q gi|254780267|r 88 NYHTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAM------------------------------------- 130 (171) Q Consensus 88 ~~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAM------------------------------------- 130 (171) ....+.+...|.+ ...+|+.||+|++||+|+.|+.- T Consensus 60 ~~~~l~~~VsG~v-------~~~~v~~Gd~VkkGq~La~LD~~~~~~~l~~A~A~l~~A~a~~~~A~~~~~R~~~L~~~~ 132 (385) T PRK09859 60 EVAEIRPQVGGII-------IKRNFIEGDKVNQGDSLYQIDPAPLQAELNSAKGSLAKALSTASNARITFNRQASLLKTN 132 (385) T ss_pred EEEEEEEECCEEE-------EEEECCCCCEEECCCEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 8999981456299-------999838999884898899828479999999999999999999999998999999887658 Q ss_pred ----------------------------------CEEEEEECCCCCEEEEEECCCCCCCCCCC--EEEEEEE Q ss_conf ----------------------------------72689998998199999848986836997--2999953 Q gi|254780267|r 131 ----------------------------------KTMNHIVAPCSGKVQDINVKDGQSVEYGD--ALLVLEK 166 (171) Q Consensus 131 ----------------------------------Km~n~I~a~~~G~I~~i~v~~G~~V~~gq--~L~~i~~ 166 (171) --...|.||++|+|.+..++.|+.|..|+ +|+.+.. T Consensus 133 ~iS~~~~d~a~~~~~~A~A~~~~A~a~l~~a~~~l~~t~i~APf~GvV~~~~v~~G~~V~~~~~~~l~~v~~ 204 (385) T PRK09859 133 YVSRQDYDTARTQLNEAEANVTVAKAAVEQATINLQYANVTSPITGVSGKSSVTVGALVTANQADSLVTVQR 204 (385) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEECCCCCCEEEEECCCCCEECCCCCCEEEEEEC T ss_conf 623547998999999999999999999987553212269987778704103547787764788631699971 No 48 >TIGR03309 matur_yqeB selenium-dependent molybdenum hydroxylase system protein, YqeB family. Members of this protein family are probable accessory proteins for the biosynthesis of enzymes with labile selenium-containing centers, different from selenocysteine-containing proteins. Probab=97.69 E-value=0.00026 Score=46.61 Aligned_cols=70 Identities=20% Similarity=0.349 Sum_probs=60.9 Q ss_pred CCCCEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCCCCCCCCCEEEEEEE Q ss_conf 46626870462567740367641000124420104348999737726899989981999998489868369972999953 Q gi|254780267|r 87 DNYHTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQSVEYGDALLVLEK 166 (171) Q Consensus 87 ~~~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~ 166 (171) .....|.||..|.|+ ..++.||.|++||+|+.+.- .+|.||.+|.+.- ++.+|-.|..|..|..|.| T Consensus 162 ~~ERvi~Ap~~G~~~--------~~~~IGd~V~kG~vig~v~~----~pV~A~i~GilRG-lirdG~~V~~G~Ki~dIDP 228 (256) T TIGR03309 162 THERVLRAPADGIVT--------PTKAIGDSVKKGDVIATVGD----VPVVAPIDGLLRG-LIHEGLTVTEGLKIGDVDP 228 (256) T ss_pred CCCEEEECCCCCCEE--------CHHHCCCEEECCCEEEEECC----EEEEEECCCEEEE-EECCCCCCCCCCEEEEECC T ss_conf 724189758785385--------41441896867968999899----8898765848972-4128978679988998779 Q ss_pred CCC Q ss_conf 478 Q gi|254780267|r 167 TGD 169 (171) Q Consensus 167 ~~~ 169 (171) -++ T Consensus 229 R~~ 231 (256) T TIGR03309 229 RGE 231 (256) T ss_pred CCC T ss_conf 987 No 49 >PRK00624 glycine cleavage system protein H; Provisional Probab=97.66 E-value=7.7e-05 Score=49.81 Aligned_cols=58 Identities=26% Similarity=0.453 Sum_probs=48.3 Q ss_pred ECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCC---CCCCC---CCC-EEEEEEECCC Q ss_conf 0124420104348999737726899989981999998489---86836---997-2999953478 Q gi|254780267|r 112 VNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKD---GQSVE---YGD-ALLVLEKTGD 169 (171) Q Consensus 112 V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~---G~~V~---~gq-~L~~i~~~~~ 169 (171) -++|++|++|+.++.||+-|.-.+|.||++|+|.+++-+- =+.++ ||+ =||+|+.++. T Consensus 39 p~~G~~v~~g~~~~~iEs~K~~~~i~sPvsG~VvevN~~l~~~P~liN~dpy~~GWl~~vk~~d~ 103 (113) T PRK00624 39 PSVGSFCKEGEVLVILESSKSAIEVLSPVSGEVIEVNLALEDDIQPINNAPESEGWFVVVQLDED 103 (113) T ss_pred CCCCCEEECCCEEEEEEECCEEECCCCCCCEEEEEEHHHHHHCHHHHCCCCCCCCEEEEEEECCH T ss_conf 99999884798599999652441121886649998578554498987289777963899998986 No 50 >TIGR03077 not_gcvH glycine cleavage protein H homolog, Chlamydial. The H protein (GcvH) of the glycine cleavage system shuttles the methylamine group of glycine from the P protein to the T protein. Most Chlamydia but lack the P and T proteins, and have a single homolog of GcvH that appears deeply split from canonical GcvH in molecular phylogenetic trees. The protein family modeled here is observed the Chlamydial GcvH homolog, so far always seen as part of a two-gene operon, downstream of a member of the uncharacterized protein family TIGR03076. The function of this protein is unknown. Probab=97.65 E-value=7.8e-05 Score=49.78 Aligned_cols=58 Identities=28% Similarity=0.506 Sum_probs=47.6 Q ss_pred ECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECC---CCCCCC---CCC-EEEEEEECCC Q ss_conf 012442010434899973772689998998199999848---986836---997-2999953478 Q gi|254780267|r 112 VNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVK---DGQSVE---YGD-ALLVLEKTGD 169 (171) Q Consensus 112 V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~---~G~~V~---~gq-~L~~i~~~~~ 169 (171) -++|+.|++|+.+|.||+-|.-.+|.||++|+|.+++-. +=+.++ ||+ =|++||.++. T Consensus 38 p~~G~~v~~g~~~~~vEs~K~~~~i~sPvsG~VvevN~~l~~~P~liN~dpy~~gWl~~vk~~d~ 102 (110) T TIGR03077 38 PSVGSSCKEGEVLVILESSKSAIEVLSPVSGEVIEVNIALEDDTQPINHSPESEGWFVVVQLDED 102 (110) T ss_pred CCCCCEEECCCEEEEEEECEEEECCCCCCCEEEEEEHHHHHHCHHHHCCCCCCCCEEEEEEECCH T ss_conf 89999885798599999651612020787669998668454498987289777965999999986 No 51 >PRK09783 copper/silver efflux system membrane fusion protein CusB; Provisional Probab=97.61 E-value=0.00046 Score=45.06 Aligned_cols=72 Identities=18% Similarity=0.330 Sum_probs=54.9 Q ss_pred CCCEEEECCCEEEEEECCCCCCCEE-CCCCCEEECCEEEEEECCC----------------------------------- Q ss_conf 6626870462567740367641000-1244201043489997377----------------------------------- Q gi|254780267|r 88 NYHTVTSPMVGTAYLASSPGSDPFV-NKGNLVVEGQTLLIIEAMK----------------------------------- 131 (171) Q Consensus 88 ~~~~I~SPmvGtfy~~psP~~~~~V-~vGd~Vk~G~~l~iiEAMK----------------------------------- 131 (171) ....|.+-.-|.+ +.-|| .+||.|++||+|+-|-+=- T Consensus 120 ~~~~v~~r~~G~I-------e~L~v~~~G~~V~kGq~L~~lYSPel~~AQ~EyL~~~~~~~~~~~~~aa~~rL~llg~~~ 192 (407) T PRK09783 120 QYAIVQARAAGFI-------DKVYPLTVGDKVQKGTPLLDLTIPDWVEAQSEYLLLRETGGTATQTEGILERLRLAGMPE 192 (407) T ss_pred CEEEEEECCCCEE-------EEEEECCCCCEECCCCEEEEEECHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCH T ss_conf 3289971035268-------898742578787389788999388899999999999872797268899999999859999 Q ss_pred -------------EEEEEECCCCCEEEEEECCCCCCCCCCCEEEEEEE Q ss_conf -------------26899989981999998489868369972999953 Q gi|254780267|r 132 -------------TMNHIVAPCSGKVQDINVKDGQSVEYGDALLVLEK 166 (171) Q Consensus 132 -------------m~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~ 166 (171) --.+|.||++|+|.+..+..|+.|..|++||+|-- T Consensus 193 ~~I~~l~~~~~~~~~~ti~AP~sGvV~~~~v~~G~~V~~G~~Lf~IaD 240 (407) T PRK09783 193 ADIRRLIATQKIQTRFTLKAPIDGVITAFDLRAGMNIAKDNVVAKIQG 240 (407) T ss_pred HHHHHHHHCCCCCCEEEEECCCCCEEEEEECCCCCEECCCCEEEEEEC T ss_conf 999999863887733899768880799974478956679980699964 No 52 >KOG0558 consensus Probab=97.59 E-value=4.6e-05 Score=51.18 Aligned_cols=61 Identities=25% Similarity=0.300 Sum_probs=56.8 Q ss_pred CCEECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCCCCCCCCCEEEEEEECCC Q ss_conf 1000124420104348999737726899989981999998489868369972999953478 Q gi|254780267|r 109 DPFVNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQSVEYGDALLVLEKTGD 169 (171) Q Consensus 109 ~~~V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~~~~ 169 (171) +=||++||.|++=|.||=+.+-|--.+|+|-++|+|++|+-+.++..-.|+||+.++--++ T Consensus 83 eWfVKEGDtVeqFd~lCEVQSDKAsvtItsRydG~v~ki~h~~ddia~VGk~Lvd~eve~~ 143 (474) T KOG0558 83 EWFVKEGDTVEQFDPLCEVQSDKASVTITSRYDGKVKKIYHSPDDIAKVGKPLVDLEVEDS 143 (474) T ss_pred EEHHHCCCCHHHHCCHHHCCCCCCEEEEEEEECCEEEEEEECCHHHHHHCCCEEEEEECCC T ss_conf 1003069738874222302046524899864065677874073346673861155661157 No 53 >TIGR01349 PDHac_trf_mito pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase; InterPro: IPR006257 This group of sequences represent one of several closely related clades of the dihydrolipoamide acetyltransferase subunit of the pyruvate dehydrogenase complex. It includes sequences from mitochondria and from alpha and beta branches of the proteobacteria, as well as from some other bacteria, but not the Gram-positive bacteria.; GO: 0004742 dihydrolipoyllysine-residue acetyltransferase activity, 0006090 pyruvate metabolic process, 0045254 pyruvate dehydrogenase complex. Probab=97.57 E-value=0.00022 Score=47.01 Aligned_cols=55 Identities=24% Similarity=0.308 Sum_probs=48.6 Q ss_pred ECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCCC-CCCCCCEEEEEEE Q ss_conf 012442010434899973772689998998199999848986-8369972999953 Q gi|254780267|r 112 VNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQ-SVEYGDALLVLEK 166 (171) Q Consensus 112 V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~-~V~~gq~L~~i~~ 166 (171) =|+||++++||.|+=||.-|--....-.-.|..-+|||-.|. -|.-||||+.|=+ T Consensus 149 KK~G~~~~~GD~~AE~ETDKAT~~F~~~~~GYLAKILVP~GT~~~~~~~P~CIIV~ 204 (584) T TIGR01349 149 KKVGDKLKEGDLLAEVETDKATIDFEVEDEGYLAKILVPEGTRDVQLNTPLCIIVK 204 (584) T ss_pred CCCCCCCCCCCEEEEEECCCEEECCEECCCCEEEEEECCCCCCCCCCCCCEEEEEC T ss_conf 34576223375255542252022214635761899872888756637886489972 No 54 >PRK05352 Na(+)-translocating NADH-quinone reductase subunit A; Provisional Probab=97.56 E-value=0.00022 Score=47.06 Aligned_cols=57 Identities=32% Similarity=0.384 Sum_probs=39.0 Q ss_pred CCEECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCCCCCCCCCEEEEEEECCCC Q ss_conf 10001244201043489997377268999899819999984898683699729999534788 Q gi|254780267|r 109 DPFVNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQSVEYGDALLVLEKTGDN 170 (171) Q Consensus 109 ~~~V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~~~~~ 170 (171) +.+|++||+|++||+|..=... ....+.||++|+|++|.- |+.=.. .=+.|+.+++. T Consensus 43 kl~VkeGD~V~~Gq~Lf~dK~~-~~v~~~SPvSG~V~~I~r--G~kR~i--lsIvIe~d~d~ 99 (448) T PRK05352 43 TMKVKEGDKVKKGQPLFEDKKN-PGVKFTAPASGTVSAINR--GEKRVL--QSVVIEVEGDE 99 (448) T ss_pred CEEECCCCEECCCCEEEECCCC-CCEEEECCCCEEEEEEEC--CCCCCE--EEEEEEECCCC T ss_conf 5685579997479855653898-962696889829952205--887413--58999966874 No 55 >cd06250 M14_PaAOTO_like An uncharacterized subgroup of the Succinylglutamate desuccinylase (ASTE)/aspartoacylase (ASPA) subfamily which is part of the the M14 family of metallocarboxypeptidases. This subgroup includes Pseudomonas aeruginosa AotO and related proteins. ASTE catalyzes the fifth and last step in arginine catabolism by the arginine succinyltransferase pathway, and aspartoacylase (ASPA, also known as aminoacylase 2, and ACY-2; EC:3.5.1.15) cleaves N-acetyl L-aspartic acid (NAA) into aspartate and acetate. NAA is abundant in the brain, and hydrolysis of NAA by ASPA may help maintain white matter. ASPA is an NAA scavenger in other tissues. Mutations in the gene encoding ASPA cause Canavan disease (CD), a fatal progressive neurodegenerative disorder involving dysmyelination and spongiform degeneration of white matter in children. This enzyme binds zinc which is necessary for activity. Measurement of elevated NAA levels in urine is used in the diagnosis of CD. The gene encoding Probab=97.56 E-value=0.00079 Score=43.65 Aligned_cols=68 Identities=22% Similarity=0.182 Sum_probs=53.1 Q ss_pred CCCCEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEECC-C---EEEEEECCCCCEEEEEECCCCCCCCCCCEEE Q ss_conf 46626870462567740367641000124420104348999737-7---2689998998199999848986836997299 Q gi|254780267|r 87 DNYHTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAM-K---TMNHIVAPCSGKVQDINVKDGQSVEYGDALL 162 (171) Q Consensus 87 ~~~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAM-K---m~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~ 162 (171) .+...|.||..|.|. ..++.||.|++||+|+.|--- . -..+|.||.+|.|-...- .-.|..|++|+ T Consensus 287 ~~~~~v~Ap~~Gi~~--------~~~~~G~~V~~G~~la~I~dP~~~~~~~~~i~Ap~~Gii~~~~~--~p~V~~G~~l~ 356 (359) T cd06250 287 AGVEMLYAPAGGMVV--------YRAAPGDWVEAGDVLAEILDPLGDGVGPVEIRAPTDGLLFARAS--RRFVRAGDELA 356 (359) T ss_pred CCCEEEECCCCEEEE--------ECCCCCCEECCCCEEEEEECCCCCCCCEEEEECCCCCEEEEEEC--CCCCCCCCCEE T ss_conf 366799569986898--------72799999779998999948999987548997599949998523--86067898046 Q ss_pred EE Q ss_conf 99 Q gi|254780267|r 163 VL 164 (171) Q Consensus 163 ~i 164 (171) .| T Consensus 357 ~I 358 (359) T cd06250 357 KI 358 (359) T ss_pred EE T ss_conf 61 No 56 >PRK08225 acetyl-CoA carboxylase biotin carboxyl carrier protein subunit; Validated Probab=97.50 E-value=0.00013 Score=48.50 Aligned_cols=34 Identities=38% Similarity=0.583 Sum_probs=25.2 Q ss_pred EEEEECCCCCEEEEEECCCCCCCCCCCEEEEEEE Q ss_conf 6899989981999998489868369972999953 Q gi|254780267|r 133 MNHIVAPCSGKVQDINVKDGQSVEYGDALLVLEK 166 (171) Q Consensus 133 ~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~ 166 (171) |.+|.||..|+|.+++|+.||.|+.||+|+.+|- T Consensus 1 m~~V~Apm~G~V~~v~V~~Gd~V~~G~~l~vlEA 34 (70) T PRK08225 1 MTKVYASMAGNVWKIVVKVGDTVEEGQDVVILES 34 (70) T ss_pred CCEEECCCCEEEEEEEECCCCEECCCCEEEEEEE T ss_conf 9878578994999999289899899999999971 No 57 >cd06254 M14_ASTE_ASPA_like_4 A functionally uncharacterized subgroup of the Succinylglutamate desuccinylase (ASTE)/aspartoacylase (ASPA) subfamily which is part of the M14 family of metallocarboxypeptidases. ASTE catalyzes the fifth and last step in arginine catabolism by the arginine succinyltransferase pathway, and aspartoacylase (ASPA, also known as aminoacylase 2, and ACY-2; EC:3.5.1.15) cleaves N-acetyl L-aspartic acid (NAA) into aspartate and acetate. NAA is abundant in the brain, and hydrolysis of NAA by ASPA may help maintain white matter. ASPA is an NAA scavenger in other tissues. Mutations in the gene encoding ASPA cause Canavan disease (CD), a fatal progressive neurodegenerative disorder involving dysmyelination and spongiform degeneration of white matter in children. This enzyme binds zinc which is necessary for activity. Measurement of elevated NAA levels in urine is used in the diagnosis of CD. Probab=97.46 E-value=0.00067 Score=44.07 Aligned_cols=66 Identities=33% Similarity=0.429 Sum_probs=49.2 Q ss_pred CCCCCEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEECC--CEEEEEECCCCCEEEEEECCCCCCCCCCCEE Q ss_conf 546626870462567740367641000124420104348999737--7268999899819999984898683699729 Q gi|254780267|r 86 PDNYHTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAM--KTMNHIVAPCSGKVQDINVKDGQSVEYGDAL 161 (171) Q Consensus 86 ~~~~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAM--Km~n~I~a~~~G~I~~i~v~~G~~V~~gq~L 161 (171) ..+...|.||--|.|+ +.++.||.|++||+|+.|--- ..-.+|.||++|.|.-+. ..-.|..|++| T Consensus 220 ~~~~~~v~A~~~Gl~~--------~~v~~G~~V~~Gq~lg~I~dp~g~~~~~v~Ap~dGiv~~~r--~~p~V~~Gd~l 287 (288) T cd06254 220 IDDVYYVTSPASGLWY--------PFVKAGDTVQKGALLGYVTDYFGNVIAEYRAPFDGVVLYNT--ATLPVRKGDPL 287 (288) T ss_pred ECCCEEEECCCCEEEE--------EECCCCCEECCCCEEEEEECCCCCCEEEEEECCCEEEEECC--CCCCCCCCCCC T ss_conf 4377799659984999--------93688999768999999968999961899938985999813--78710799857 No 58 >PRK09439 glucose-specific PTS system component; Provisional Probab=97.43 E-value=0.00059 Score=44.45 Aligned_cols=57 Identities=16% Similarity=0.328 Sum_probs=35.9 Q ss_pred CCEECCCCCEEECCEEEEEEC-------CCEEEEEECCCCCEEEEEECCCCCCCCCC-CEEEEEEE Q ss_conf 100012442010434899973-------77268999899819999984898683699-72999953 Q gi|254780267|r 109 DPFVNKGNLVVEGQTLLIIEA-------MKTMNHIVAPCSGKVQDINVKDGQSVEYG-DALLVLEK 166 (171) Q Consensus 109 ~~~V~vGd~Vk~G~~l~iiEA-------MKm~n~I~a~~~G~I~~i~v~~G~~V~~g-q~L~~i~~ 166 (171) ..+|+.||+|++||.|+-+.- .-...+|.-.....+..+....|. |..| +++++|++ T Consensus 105 ~~~v~~Gd~Vk~G~~L~~~D~~~i~~~g~~~~tpvvitN~~~~~~~~~~~g~-v~aGe~~ii~v~K 169 (169) T PRK09439 105 KRIAEEGQRVKVGDPIIEFDLELLEEKAKSTLTPVVISNMDEIKELIKLSGS-VTAGETPVIRVKK 169 (169) T ss_pred EEEEECCCEECCCCEEEEECHHHHHHCCCCCCEEEEEECCHHHCEEECCCCC-EECCCCEEEEEEC T ss_conf 6999276998899999998699998639998219999885673246426770-8719957999969 No 59 >PRK11578 macrolide transporter subunit MacA; Provisional Probab=97.43 E-value=0.00051 Score=44.80 Aligned_cols=28 Identities=25% Similarity=0.485 Sum_probs=15.4 Q ss_pred CCCEEEECCCEEEEEECCCCCCCEECCCCCEEECC Q ss_conf 66268704625677403676410001244201043 Q gi|254780267|r 88 NYHTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQ 122 (171) Q Consensus 88 ~~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~ 122 (171) ....|+||+-|++- ..+++.|+.|-.|+ T Consensus 159 ~~t~I~AP~~GvV~-------~~~~~~G~~v~~~~ 186 (347) T PRK11578 159 DYTRIVAPMAGEVT-------QITTLQGQTVIAAQ 186 (347) T ss_pred HCCEEECCCCCEEE-------EEEEECCCEEECCC T ss_conf 25699825784486-------79983398886378 No 60 >cd06848 GCS_H Glycine cleavage H-protein. Glycine cleavage H-proteins are part of the glycine cleavage system (GCS) found in bacteria, archea and the mitochondria of eukaryotes. GCS is a multienzyme complex consisting of 4 different components (P-, H-, T- and L-proteins) which catalyzes the oxidative cleavage of glycine. The H-protein shuttles the methylamine group of glycine from the P-protein (glycine dehydrogenase) to the T-protein (aminomethyltransferase) via a lipoyl group, attached to a completely conserved lysine residue. Probab=97.38 E-value=0.00022 Score=47.02 Aligned_cols=40 Identities=25% Similarity=0.396 Sum_probs=36.4 Q ss_pred ECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCC Q ss_conf 0124420104348999737726899989981999998489 Q gi|254780267|r 112 VNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKD 151 (171) Q Consensus 112 V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~ 151 (171) .++|++|++|+.++.||+-|.-.+|.||++|+|.+++-+- T Consensus 37 p~~g~~v~~g~~~~~iEs~k~~~~i~sPv~G~vv~~N~~l 76 (96) T cd06848 37 PEVGTEVKKGDPFGSVESVKAASDLYSPVSGEVVEVNEAL 76 (96) T ss_pred CCCCCCCCCCCEEEEEEECCEEEEEECCCCEEEEEEHHHH T ss_conf 7999701389859999963357887377166999837965 No 61 >PRK10559 p-hydroxybenzoic acid efflux subunit AaeA; Provisional Probab=97.38 E-value=0.00022 Score=46.97 Aligned_cols=34 Identities=18% Similarity=0.296 Sum_probs=24.2 Q ss_pred CCCEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEE Q ss_conf 66268704625677403676410001244201043489997 Q gi|254780267|r 88 NYHTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIE 128 (171) Q Consensus 88 ~~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiE 128 (171) +...|.||+-|++= .-.+++|+.|..|++|+-|- T Consensus 153 ~~T~I~AP~dG~V~-------~~~v~~Gq~V~~Gq~l~slV 186 (310) T PRK10559 153 ERTVIRAPADGWVT-------NLNVYTGEFITRGSTAVALV 186 (310) T ss_pred HCCEEECCCCCEEE-------EEECCCCCEECCCCEEEEEE T ss_conf 25878179886784-------45548898856998589999 No 62 >PRK06748 hypothetical protein; Validated Probab=97.37 E-value=0.00064 Score=44.21 Aligned_cols=71 Identities=27% Similarity=0.403 Sum_probs=65.1 Q ss_pred CEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEECC-CEEEEEECCCCCEEEEEECCCCCCCCCCCEEEEEEEC Q ss_conf 26870462567740367641000124420104348999737-7268999899819999984898683699729999534 Q gi|254780267|r 90 HTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAM-KTMNHIVAPCSGKVQDINVKDGQSVEYGDALLVLEKT 167 (171) Q Consensus 90 ~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAM-Km~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~~ 167 (171) ..|.||.-|.+ ++.||..++.|-+=..|+.|+-| +-.-.|.-.++|.|.-+.|+.||.|..+..|+.++.+ T Consensus 6 e~IyS~~~G~V-------ekifi~e~syVyEWEkL~~I~~~dg~le~v~vGisG~I~~v~Ve~Gq~i~~~~lL~~vedD 77 (84) T PRK06748 6 EGVYSPCYGKV-------EKLFVTESSYVYEWEKLALIETIDKQKVEIKVGISGYIESLEVVEGQAIADQKLLITVRDD 77 (84) T ss_pred HHHCCCCCCEE-------EEEEECCCCEEEEEEEEEEEEECCCCEEEEEEEEEEEEEEEEEECCCEECCCCEEEEEECC T ss_conf 32336663088-------9999858753786423457750698289999821427889886228465377389998714 No 63 >PRK10476 multidrug resistance protein MdtN; Provisional Probab=97.36 E-value=0.00025 Score=46.71 Aligned_cols=37 Identities=19% Similarity=0.194 Sum_probs=27.3 Q ss_pred CCCEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEECCC Q ss_conf 66268704625677403676410001244201043489997377 Q gi|254780267|r 88 NYHTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAMK 131 (171) Q Consensus 88 ~~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAMK 131 (171) +...|+||+-|++.. -.+++|+.|..|++|+.|-.+. T Consensus 209 ~~t~I~AP~dG~V~~-------~~v~~G~~V~pG~pl~~lv~~~ 245 (348) T PRK10476 209 EDTTVRAPFDGRVVG-------LKVSVGEFAAPMQPIFTLIDTD 245 (348) T ss_pred HCCEEECCCCEEEEE-------EECCCCCEECCCCEEEEEEECC T ss_conf 558997789849998-------8669998778998489999689 No 64 >PRK10255 N-acetyl glucosamine specific PTS system components IIABC; Provisional Probab=97.35 E-value=0.0012 Score=42.66 Aligned_cols=68 Identities=21% Similarity=0.375 Sum_probs=51.5 Q ss_pred CCEEEECCCEEEEEECCCCCCCEECCCCCEE----ECCEEEEEECCCEEEEEECCCCCEEEEE----------------- Q ss_conf 6268704625677403676410001244201----0434899973772689998998199999----------------- Q gi|254780267|r 89 YHTVTSPMVGTAYLASSPGSDPFVNKGNLVV----EGQTLLIIEAMKTMNHIVAPCSGKVQDI----------------- 147 (171) Q Consensus 89 ~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk----~G~~l~iiEAMKm~n~I~a~~~G~I~~i----------------- 147 (171) ...+.||+.|.+. |+=+|-|.|= -||-++|.- -...|.||+||+|..+ T Consensus 498 ~~~~~aP~~G~vi--------~l~~V~D~vFa~~~mG~G~aI~P---~~g~v~aP~dG~i~~vf~T~HA~gi~~~~G~Ei 566 (648) T PRK10255 498 IAELVSPITGDVV--------ALDQVPDEAFASKAVGDGVAVKP---TDKIVVSPAAGTIVKIFNTNHAFCLETEKGAEI 566 (648) T ss_pred EEEEECCCCCEEE--------ECCCCCCHHHHHCCCCCCEEEEC---CCCEEECCCCCEEEEECCCCCEEEEECCCCCEE T ss_conf 2589755785288--------70108675553145578569945---899598068857999716885799984898189 Q ss_pred ------------------ECCCCCCCCCCCEEEEEEEC Q ss_conf ------------------84898683699729999534 Q gi|254780267|r 148 ------------------NVKDGQSVEYGDALLVLEKT 167 (171) Q Consensus 148 ------------------~v~~G~~V~~gq~L~~i~~~ 167 (171) +|+.||.|..||+|++++.+ T Consensus 567 LiHiGiDTV~L~G~gF~~~v~~Gd~Vk~G~~l~~~D~~ 604 (648) T PRK10255 567 VVHMGIDTVALEGKGFKRLVEEGAQVSAGQPILEMDLD 604 (648) T ss_pred EEEEEECCEEECCCCCEEEECCCCEECCCCEEEEECHH T ss_conf 99954452421780534883485996799988997499 No 65 >TIGR01945 rnfC electron transport complex, RnfABCDGE type, C subunit; InterPro: IPR010208 The six subunit complex RnfABCDGE in Rhodobacter capsulatus encodes an apparent NADH oxidoreductase responsible for electron transport to nitrogenase, necessary for nitrogen fixation . A closely related complex in Escherichia coli, RsxABCDGE (Reducer of SoxR), reduces the 2Fe-2S-containing superoxide sensor SoxR, active as a transcription factor when oxidized . This family of putative NADH oxidoreductase complexes exists in many of the same species as the related NQR, a Na(+)-translocating NADH-quinone reductase (IPR003667 from INTERPRO), but is distinct. This entry describes the C subunit.; GO: 0051539 4 iron 4 sulfur cluster binding, 0006118 electron transport, 0016020 membrane. Probab=97.33 E-value=0.00014 Score=48.31 Aligned_cols=60 Identities=27% Similarity=0.401 Sum_probs=43.8 Q ss_pred CCCEECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECC-CCCCCCCCCEEEEEEECC Q ss_conf 4100012442010434899973772689998998199999848-986836997299995347 Q gi|254780267|r 108 SDPFVNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVK-DGQSVEYGDALLVLEKTG 168 (171) Q Consensus 108 ~~~~V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~-~G~~V~~gq~L~~i~~~~ 168 (171) ..|.|++||+|.+||.|.-=++.= +-+|-||+||+|.+|--- .+-.=.+-.+=+.|++|+ T Consensus 43 ~~p~V~~GD~VLkGq~Ia~~~G~~-sap~HaPtSG~v~~I~~~~~pH~sGlp~~~i~I~~Dg 103 (444) T TIGR01945 43 AEPIVKVGDKVLKGQLIAKADGFV-SAPIHAPTSGTVVAIEERVVPHASGLPVPAIVIEPDG 103 (444) T ss_pred CCCEECCCCEEECCCEECCCCCEE-EEEEECCCCCEEEEEEEEECCCCCCCCCCEEEECCCC T ss_conf 773002786520661100677407-8700078110688741031157888874458867789 No 66 >cd06255 M14_ASTE_ASPA_like_5 A functionally uncharacterized subgroup of the Succinylglutamate desuccinylase (ASTE)/aspartoacylase (ASPA) subfamily which is part of the M14 family of metallocarboxypeptidases. ASTE catalyzes the fifth and last step in arginine catabolism by the arginine succinyltransferase pathway, and aspartoacylase (ASPA, also known as aminoacylase 2, and ACY-2; EC:3.5.1.15) cleaves N-acetyl L-aspartic acid (NAA) into aspartate and acetate. NAA is abundant in the brain, and hydrolysis of NAA by ASPA may help maintain white matter. ASPA is an NAA scavenger in other tissues. Mutations in the gene encoding ASPA cause Canavan disease (CD), a fatal progressive neurodegenerative disorder involving dysmyelination and spongiform degeneration of white matter in children. This enzyme binds zinc which is necessary for activity. Measurement of elevated NAA levels in urine is used in the diagnosis of CD. Probab=97.33 E-value=0.0011 Score=42.89 Aligned_cols=53 Identities=23% Similarity=0.192 Sum_probs=43.3 Q ss_pred CCCCEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEECC--CEEEEEECCCCCEEEEE Q ss_conf 46626870462567740367641000124420104348999737--72689998998199999 Q gi|254780267|r 87 DNYHTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAM--KTMNHIVAPCSGKVQDI 147 (171) Q Consensus 87 ~~~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAM--Km~n~I~a~~~G~I~~i 147 (171) .....|.||..|.|| +.++.||.|++||+|+.|--- ....+|.||.+|+|.-+ T Consensus 229 ~~~~~v~A~~~Gl~~--------~~v~~G~~V~kGq~lg~I~dp~G~~~~~v~Ap~dGivi~~ 283 (293) T cd06255 229 VSRDWVAAIHGGLFE--------PSVPAGDTIPAGQPLGRVVDLYGAEVLEASPPRDGIVIGI 283 (293) T ss_pred ECCEEEECCCCEEEE--------EECCCCCEECCCCEEEEEECCCCCCEEEEECCCCCEEEEE T ss_conf 124599759985999--------8378899977999999998999996489983999699980 No 67 >TIGR00527 gcvH glycine cleavage system H protein; InterPro: IPR002930 This is a family of glycine cleavage H-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in eukaryotes. A lipoyl group is attached to a completely conserved lysine residue. The H protein shuttles the methylamine group of glycine from the P protein to the T protein.; GO: 0006546 glycine catabolic process, 0005960 glycine cleavage complex. Probab=97.31 E-value=0.00013 Score=48.46 Aligned_cols=66 Identities=23% Similarity=0.335 Sum_probs=51.5 Q ss_pred ECCCCCCCEE---CCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECC--------CCCCCCCCCEEEEEEECC Q ss_conf 0367641000---12442010434899973772689998998199999848--------986836997299995347 Q gi|254780267|r 103 ASSPGSDPFV---NKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVK--------DGQSVEYGDALLVLEKTG 168 (171) Q Consensus 103 ~psP~~~~~V---~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~--------~G~~V~~gq~L~~i~~~~ 168 (171) ..+=|.-.|| +||++|+.||.++.||+-|--.+|-||++|+|.+|+-. |-+|-+-.-=||+||.++ T Consensus 32 ~~~LGD~vFVeLPEvG~~v~~~~~~G~vESVKaaSelY~Pv~Gtv~e~N~~L~d~PelvN~~Py~~eGWl~kv~~~d 108 (132) T TIGR00527 32 QDELGDIVFVELPEVGAEVSAGESLGSVESVKAASELYAPVDGTVVEVNEALEDSPELVNEDPYGEEGWLIKVKLSD 108 (132) T ss_pred HHCCCCEEEEECCCCCCEEECCCCEEEEEEECHHHHCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCCEEEEEECCC T ss_conf 33069668862788477762389701441000033303898744998538766778741688877872089984178 No 68 >COG0509 GcvH Glycine cleavage system H protein (lipoate-binding) [Amino acid transport and metabolism] Probab=97.29 E-value=0.00023 Score=46.91 Aligned_cols=57 Identities=26% Similarity=0.513 Sum_probs=46.5 Q ss_pred ECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECC---CCCCCC---CCC-EEEEEEECC Q ss_conf 012442010434899973772689998998199999848---986836---997-299995347 Q gi|254780267|r 112 VNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVK---DGQSVE---YGD-ALLVLEKTG 168 (171) Q Consensus 112 V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~---~G~~V~---~gq-~L~~i~~~~ 168 (171) -++|++|++|+.+|+||+-|--.+|.||++|+|.+++=+ +=+.|+ ||. =||+|++++ T Consensus 47 pe~G~~v~~g~~~~~vESvKaasdvyaPvsGeVvevN~~l~~~PeliN~dPyg~gWi~klk~~d 110 (131) T COG0509 47 PEVGAEVKAGESLAVVESVKAASDVYAPVSGEVVEVNEALVDSPELINSDPYGEGWIVKLKPAD 110 (131) T ss_pred CCCCCEECCCCEEEEEEEEEEECCCCCCCCEEEEEECHHHHCCHHHHCCCCCCCCEEEEEEECC T ss_conf 8998711179758999851032025278851699865444449466436877885089996378 No 69 >COG3608 Predicted deacylase [General function prediction only] Probab=97.28 E-value=0.0013 Score=42.28 Aligned_cols=66 Identities=27% Similarity=0.331 Sum_probs=50.5 Q ss_pred CCEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEECC---CEEEEEECCCCCEEEEEECCCCCCCCCCCEEEEE Q ss_conf 626870462567740367641000124420104348999737---7268999899819999984898683699729999 Q gi|254780267|r 89 YHTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAM---KTMNHIVAPCSGKVQDINVKDGQSVEYGDALLVL 164 (171) Q Consensus 89 ~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAM---Km~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i 164 (171) ...+.||--|.| .+.|+.||+|++||+|++|--+ ...-+|.|+++|+|...... -.|+.|+.|..+ T Consensus 256 ~~~i~Ap~~G~v--------~~~v~lGd~VeaG~~la~i~~~~~~~~~~eirA~~~G~i~~~r~~--~~v~~Gdl~~~v 324 (331) T COG3608 256 DEMIRAPAGGLV--------EFLVDLGDKVEAGDVLATIHDPPLGEGEAEIRAPVSGIIIARRSL--RLVQPGDLLKVV 324 (331) T ss_pred CCEEECCCCCEE--------EEEECCCCCCCCCCEEEEEECCCCCCCCEEEECCCCCEEEEEEEC--CCCCCCCEEEEE T ss_conf 525656888548--------774167880217876799736888875258972788269987403--436788766223 No 70 >pfam06898 YqfD Putative stage IV sporulation protein YqfD. This family consists of several putative bacterial stage IV sporulation (SpoIV) proteins. YqfD of Bacillus subtilis is known to be essential for efficient sporulation although its exact function is unknown. Probab=97.28 E-value=0.0034 Score=39.78 Aligned_cols=57 Identities=16% Similarity=0.131 Sum_probs=37.0 Q ss_pred EEEECCCEEEEEECCCCCCCEECCCCCEEECCEE--EEEECCCEEEEEECCCCCEEEEEEC Q ss_conf 6870462567740367641000124420104348--9997377268999899819999984 Q gi|254780267|r 91 TVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTL--LIIEAMKTMNHIVAPCSGKVQDINV 149 (171) Q Consensus 91 ~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l--~iiEAMKm~n~I~a~~~G~I~~i~v 149 (171) .|-|-.-|++.+-=.-...|.|++||.|++||+| ++++--.-...| ..+|.|..-.. T Consensus 191 ~iVA~k~GvI~~i~v~~G~p~Vk~GD~VkkGqiLVsG~i~~e~~~~~V--~A~G~V~a~t~ 249 (383) T pfam06898 191 NIVAKKDGIITRMYVTKGTAVVKVGDVVKKGDILVSGQIGKEGNEYEV--HAKGEVLAETW 249 (383) T ss_pred CEEECCCCEEEEEEECCCEEEECCCCEECCCCEEEEEEECCCCCCEEE--CCCEEEEEEEE T ss_conf 658999979999996477177658998778989996356578981697--77529999999 No 71 >pfam00358 PTS_EIIA_1 phosphoenolpyruvate-dependent sugar phosphotransferase system, EIIA 1. Probab=97.26 E-value=0.00097 Score=43.11 Aligned_cols=26 Identities=31% Similarity=0.529 Sum_probs=19.4 Q ss_pred CEEEEEECCCCCCCCCCCEEEEEEEC Q ss_conf 19999984898683699729999534 Q gi|254780267|r 142 GKVQDINVKDGQSVEYGDALLVLEKT 167 (171) Q Consensus 142 G~I~~i~v~~G~~V~~gq~L~~i~~~ 167 (171) |.==+++|+.||.|..||+|+++... T Consensus 84 G~gF~~~v~~Gd~V~~G~~L~~~D~~ 109 (133) T pfam00358 84 GEGFESHVEEGDRVKQGDKLLEFDLP 109 (133) T ss_pred CCCEEEEECCCCEECCCCEEEEECHH T ss_conf 96318997589999899999998599 No 72 >TIGR02927 SucB_Actino 2-oxoglutarate dehydrogenase, E2 component, dihydrolipoamide succinyltransferase; InterPro: IPR014276 This entry represents the mainly actinobacterial clade of the E2 component of the 2-oxoglutarate dehydrogenase complex involved in the TCA cycle. These proteins have multiple domains including the catalytic domain (IPR001078 from INTERPRO), one or two biotin domains (IPR000089 from INTERPRO) and an E3-component binding domain (IPR004167 from INTERPRO).. Probab=97.25 E-value=0.00024 Score=46.85 Aligned_cols=57 Identities=25% Similarity=0.425 Sum_probs=53.4 Q ss_pred ECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCCCCCCCCCEEEEEEECC Q ss_conf 012442010434899973772689998998199999848986836997299995347 Q gi|254780267|r 112 VNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQSVEYGDALLVLEKTG 168 (171) Q Consensus 112 V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~~~ 168 (171) =.+||+|+...+|+=+=--|.-+||.||++|++.+|-+++-+.|+-|-.|-+|=+.+ T Consensus 24 KavGDTVE~DEPLLEVsTDKVDTEiPSPaaGVlleI~a~EDdTVevGg~lAiIGe~g 80 (607) T TIGR02927 24 KAVGDTVEVDEPLLEVSTDKVDTEIPSPAAGVLLEIKAEEDDTVEVGGVLAIIGEEG 80 (607) T ss_pred HCCCCEEEECCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCEEEECCEEEEECCCC T ss_conf 106887710476210015620666797603326677736778023050889863854 No 73 >PRK01202 glycine cleavage system protein H; Provisional Probab=97.23 E-value=0.00035 Score=45.81 Aligned_cols=57 Identities=28% Similarity=0.513 Sum_probs=45.5 Q ss_pred ECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECC---CCCCC---CCCC-EEEEEEECC Q ss_conf 012442010434899973772689998998199999848---98683---6997-299995347 Q gi|254780267|r 112 VNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVK---DGQSV---EYGD-ALLVLEKTG 168 (171) Q Consensus 112 V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~---~G~~V---~~gq-~L~~i~~~~ 168 (171) -.+|++|++|++++.||+-|.-.+|.||++|+|.+++-+ +=+.+ -||+ =|++|++++ T Consensus 45 p~~G~~v~~g~~~~~vEs~K~~~~i~sPvsG~Vv~vN~~l~~~P~liN~dpy~~GWl~~i~~~d 108 (127) T PRK01202 45 PEVGDTVKAGESFGVVESVKAASDVYAPVSGEVVEVNEALEDSPELVNSDPYGEGWLFKLKPSD 108 (127) T ss_pred CCCCCEEECCCEEEEEEECCEECCCCCCCCEEEEEECHHHHHCHHHHCCCCCCCCEEEEEEECC T ss_conf 8999887179878999974602111288520899864654529386448988896799999899 No 74 >COG4656 RnfC Predicted NADH:ubiquinone oxidoreductase, subunit RnfC [Energy production and conversion] Probab=97.21 E-value=0.00076 Score=43.77 Aligned_cols=57 Identities=28% Similarity=0.373 Sum_probs=45.6 Q ss_pred CCCEECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEE---CCCCCCCCCCCEEEEEEECC Q ss_conf 41000124420104348999737726899989981999998---48986836997299995347 Q gi|254780267|r 108 SDPFVNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDIN---VKDGQSVEYGDALLVLEKTG 168 (171) Q Consensus 108 ~~~~V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~---v~~G~~V~~gq~L~~i~~~~ 168 (171) ..++|++||+|.+||.|.-=+- .+-++-||++|+|.+|. +-+|..- --+.+.|++|+ T Consensus 45 ~~~~Vkvgd~V~~GQ~l~~~~g--~~~~vHaP~sG~V~~I~~~~~~~~sgl--~~~~I~i~~d~ 104 (529) T COG4656 45 GILLVKVGDKVLKGQPLTRGEG--IMLPVHAPTSGTVTAIEPANFPHPSGL--KELSIVIQGDG 104 (529) T ss_pred CCEEEEECCEEECCCEEECCCC--CEEEEECCCCCEEEEEEECCCCCCCCC--CCEEEEECCCC T ss_conf 3448830888850864203678--446542798731335422256885567--53389975751 No 75 >PRK05889 putative acetyl-CoA carboxylase biotin carboxyl carrier protein subunit; Provisional Probab=97.21 E-value=0.00035 Score=45.82 Aligned_cols=33 Identities=27% Similarity=0.494 Sum_probs=20.5 Q ss_pred EEEEECCCCCEEEEEECCCCCCCCCCCEEEEEE Q ss_conf 689998998199999848986836997299995 Q gi|254780267|r 133 MNHIVAPCSGKVQDINVKDGQSVEYGDALLVLE 165 (171) Q Consensus 133 ~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~ 165 (171) -.+|.||..|+|.+|+|+.||.|+.||+|+.+| T Consensus 2 ae~V~Apm~G~V~~v~V~~Gd~V~~Gd~l~vlE 34 (71) T PRK05889 2 AEDVRAEIVASVLEVVVNEGDQIGEGDTLVLLE 34 (71) T ss_pred CCCCCCCCCEEEEEEEECCCCEECCCCEEEEEE T ss_conf 653148998489999979999988999999998 No 76 >PRK12784 hypothetical protein; Provisional Probab=97.21 E-value=0.0013 Score=42.32 Aligned_cols=70 Identities=26% Similarity=0.397 Sum_probs=64.5 Q ss_pred EEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEECC-CEEEEEECCCCCEEEEEECCCCCCCCCCCEEEEEEEC Q ss_conf 6870462567740367641000124420104348999737-7268999899819999984898683699729999534 Q gi|254780267|r 91 TVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAM-KTMNHIVAPCSGKVQDINVKDGQSVEYGDALLVLEKT 167 (171) Q Consensus 91 ~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAM-Km~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~~ 167 (171) .|.||.-|.+ ++.||..++.|-+=..|+.|+.| +-.-.|.-.++|.|..+.|+.||.|..|..|+.++.+ T Consensus 7 ~iyS~c~G~V-------ekifi~e~SyVYEWEkL~~I~~~dg~le~v~vGiSG~I~~v~Ve~Gq~i~~~tlL~~~edD 77 (84) T PRK12784 7 EICSSYEGKV-------EEIFVNESSYVYEWEKLMMIRKNNGELEKVAVGISGNIRLVNVVVGQQIHTDTLLVRLEDD 77 (84) T ss_pred HHCCCCCCEE-------EEEEECCCCEEEEEEEEEEEEECCCCEEEEEEEEEEEEEEEEEECCCEECCCCEEEEEECC T ss_conf 3447663188-------8999858753786423457750698289999820116899986228543477189998603 No 77 >PRK09824 beta-glucoside-specific PTS system components IIABC; Provisional Probab=97.19 E-value=0.0016 Score=41.87 Aligned_cols=55 Identities=20% Similarity=0.321 Sum_probs=23.6 Q ss_pred CEECCCCCEEECCEEEEEECCCE-------EEEEECCCCCEEEEEECCCCCCCCCCCEEEEE Q ss_conf 00012442010434899973772-------68999899819999984898683699729999 Q gi|254780267|r 110 PFVNKGNLVVEGQTLLIIEAMKT-------MNHIVAPCSGKVQDINVKDGQSVEYGDALLVL 164 (171) Q Consensus 110 ~~V~vGd~Vk~G~~l~iiEAMKm-------~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i 164 (171) .+|+.||+|++||.|+-.+.-+. ..++.-..+-....+...++..|+.|++|+.+ T Consensus 562 ~~v~~g~~V~~g~~l~~~d~~~i~~~g~~~~~~viitN~~~~~~v~~~~~~~v~~g~~~~~v 623 (625) T PRK09824 562 AHVNVGDKVNTGDRLISFDIPAIREAGFDLTTPVLISNSDDFTDVLPHGTAQISAGEPLLSI 623 (625) T ss_pred EEECCCCEECCCCEEEEECHHHHHHCCCCCCCEEEEECCHHCCCEEECCCCCCCCCCEEEEE T ss_conf 67467899679998999749999862999830899963044154363579704589954787 No 78 >PRK05035 electron transport complex protein RnfC; Provisional Probab=97.17 E-value=0.002 Score=41.25 Aligned_cols=61 Identities=25% Similarity=0.400 Sum_probs=43.8 Q ss_pred CCCEECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCC-CCCCCCCCEEEEEEECCC Q ss_conf 41000124420104348999737726899989981999998489-868369972999953478 Q gi|254780267|r 108 SDPFVNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKD-GQSVEYGDALLVLEKTGD 169 (171) Q Consensus 108 ~~~~V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~-G~~V~~gq~L~~i~~~~~ 169 (171) ..|.|++||+|.+||.|+--+. -+.-+|.||+||+|..|--.. .-+-....+=++|++|+. T Consensus 55 ~~p~VkvGD~VlkGQ~I~~~~g-~vsapVHAptSGtV~aIe~~~~~HpsG~~~~~VvIe~DG~ 116 (725) T PRK05035 55 GELLVSVGDRVLKGQPLTQGDG-RMSLPVHAPTSGTVTAIEPRPTAHPSGLAELCVVIEADGE 116 (725) T ss_pred CCCCCCCCCEECCCCEEEECCC-CEEEEEECCCCEEEEEEEECCCCCCCCCCCCEEEEECCCC T ss_conf 7141478999768887454698-4586056799829987501215799888554599924886 No 79 >PRK03598 hypothetical protein; Provisional Probab=97.16 E-value=0.00056 Score=44.55 Aligned_cols=37 Identities=22% Similarity=0.304 Sum_probs=29.4 Q ss_pred CCCCEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEECC Q ss_conf 46626870462567740367641000124420104348999737 Q gi|254780267|r 87 DNYHTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAM 130 (171) Q Consensus 87 ~~~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAM 130 (171) -+...|+||+-|++.. -++++|+.|..|++|+.|--+ T Consensus 201 L~~t~I~AP~dGvV~~-------~~v~~Ge~V~~G~~l~ti~~~ 237 (331) T PRK03598 201 LQDTTLIAPSDGTILT-------RAVEPGTMLSAGSTVFTLSLT 237 (331) T ss_pred HHCCEEECCCCEEEEE-------EECCCCCEECCCCCEEEEEEC T ss_conf 6067997899669998-------975889884899817999917 No 80 >PRK13380 glycine cleavage system protein H; Provisional Probab=97.14 E-value=0.00042 Score=45.30 Aligned_cols=58 Identities=21% Similarity=0.389 Sum_probs=46.7 Q ss_pred ECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECC---CCCCC---CCCC-EEEEEEECCC Q ss_conf 012442010434899973772689998998199999848---98683---6997-2999953478 Q gi|254780267|r 112 VNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVK---DGQSV---EYGD-ALLVLEKTGD 169 (171) Q Consensus 112 V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~---~G~~V---~~gq-~L~~i~~~~~ 169 (171) -++|++|++|+.++.||+-|.-.+|.||++|+|.+++-. +=+.+ -||+ =||+|++++. T Consensus 46 p~~g~~v~~g~~~~~iEs~K~~~~i~sPvsG~VvevN~~l~~~P~liN~dPy~~GWl~~ik~~d~ 110 (132) T PRK13380 46 PELGTKVEKGKEVATLESVKAASPVPMPLDGEVVEVNDALEDSPELVNEDPYGEGWFFRFKPADP 110 (132) T ss_pred CCCCCEEECCCEEEEEEECCCCCCCCCCCCCEEEEECHHHHHCHHHHCCCCCCCCEEEEEEECCH T ss_conf 99998751798899999734001112772525888534345395865279888967999998998 No 81 >cd00210 PTS_IIA_glc PTS_IIA, PTS system, glucose/sucrose specific IIA subunit. The bacterial phosphoenolpyruvate: sugar phosphotransferase system (PTS) is a multi-protein system involved in the regulation of a variety of metabolic and transcriptional processes. This family is one of four structurally and functionally distinct group IIA PTS system cytoplasmic enzymes, necessary for the uptake of carbohydrates across the cytoplasmic membrane and their phosphorylation. Probab=97.13 E-value=0.00093 Score=43.22 Aligned_cols=29 Identities=31% Similarity=0.450 Sum_probs=21.6 Q ss_pred CCCCCEEEEEECCCCCCCCCCCEEEEEEE Q ss_conf 89981999998489868369972999953 Q gi|254780267|r 138 APCSGKVQDINVKDGQSVEYGDALLVLEK 166 (171) Q Consensus 138 a~~~G~I~~i~v~~G~~V~~gq~L~~i~~ 166 (171) ....|.==+.+|+.||.|..||+|+++.. T Consensus 75 V~L~G~gF~~~v~~Gd~V~~G~~L~~~D~ 103 (124) T cd00210 75 VKLNGEGFTSHVEEGQRVKQGDKLLEFDL 103 (124) T ss_pred CCCCCCCEEEEECCCCEECCCCEEEEECH T ss_conf 01489531899748899989999999859 No 82 >pfam01597 GCV_H Glycine cleavage H-protein. This is a family of glycine cleavage H-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in eukaryotes. A lipoyl group is attached to a completely conserved lysine residue. The H protein shuttles the methylamine group of glycine from the P protein to the T protein. Probab=97.09 E-value=0.00052 Score=44.78 Aligned_cols=57 Identities=28% Similarity=0.452 Sum_probs=46.5 Q ss_pred ECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECC---CCCCC---CCCC-EEEEEEECC Q ss_conf 012442010434899973772689998998199999848---98683---6997-299995347 Q gi|254780267|r 112 VNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVK---DGQSV---EYGD-ALLVLEKTG 168 (171) Q Consensus 112 V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~---~G~~V---~~gq-~L~~i~~~~ 168 (171) -++|++|++||.++.||+-|.-.++.||++|+|.+++-. +=+.| -||+ =||+|++++ T Consensus 39 p~~g~~v~~g~~~~~iEs~K~v~~i~sPisG~Vv~vN~~l~~~P~liN~dpy~~gWl~~i~~~d 102 (122) T pfam01597 39 PEVGTKVKKGESLGAVESVKAASEVYAPVSGEVVEVNEKLEDNPGLINKDPYEDGWIAKLKPSN 102 (122) T ss_pred CCCCCEEECCCEEEEEEECCCCCCEECCCCEEEEEEHHHHHHCHHHHCCCCCCCCEEEEEEECC T ss_conf 9999985169888999973656414366342799875534219386438988895699999899 No 83 >PRK05641 putative acetyl-CoA carboxylase biotin carboxyl carrier protein subunit; Validated Probab=97.07 E-value=0.00076 Score=43.74 Aligned_cols=33 Identities=45% Similarity=0.739 Sum_probs=31.0 Q ss_pred EEEECCCCCEEEEEECCCCCCCCCCCEEEEEEE Q ss_conf 899989981999998489868369972999953 Q gi|254780267|r 134 NHIVAPCSGKVQDINVKDGQSVEYGDALLVLEK 166 (171) Q Consensus 134 n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~ 166 (171) +.|+||..|+|.+|+|+.||.|+.||+|+.+|- T Consensus 80 ~~v~apmpG~v~~v~v~~Gd~V~~g~~l~vlEa 112 (148) T PRK05641 80 NVVTAPMPGKILKILVKEGDQVKTGQGLLILEA 112 (148) T ss_pred CEEECCCCEEEEEEEECCCCEECCCCEEEEEEH T ss_conf 979769990899999689999879999999994 No 84 >TIGR00998 8a0101 efflux pump membrane protein; InterPro: IPR005694 emrA and emrB confer resistance to carbonylcyanide m-chlorophenylhydrazone, nalidixic acid, and a number of other toxic compounds. EmrB encodes a highly hydrophobic 56.2-kDa peptide, with 14 potential alpha-helices to span the inner membrane. EmrA encodes a putative 42.7-kDa peptide containing a single hydrophobic domain and a large C-terminal hydrophilic domain.; GO: 0015238 drug transporter activity, 0006855 multidrug transport, 0009276 1-2nm peptidoglycan-based cell wall. Probab=96.98 E-value=0.00077 Score=43.73 Aligned_cols=37 Identities=24% Similarity=0.384 Sum_probs=19.1 Q ss_pred CEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEE-EEECCCEE Q ss_conf 268704625677403676410001244201043489-99737726 Q gi|254780267|r 90 HTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLL-IIEAMKTM 133 (171) Q Consensus 90 ~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~-iiEAMKm~ 133 (171) ..|.||+-|.+=+ =+|++||.|+.||+|+ +|.+-+|. T Consensus 245 T~iraP~dG~Va~-------r~vqvG~~vS~g~p~ma~V~~~~~y 282 (379) T TIGR00998 245 TEIRAPFDGYVAR-------RKVQVGQVVSPGQPLMAVVPAEQMY 282 (379) T ss_pred CEEECCCCCEEEE-------EECCCCCEECCCCCEEEEECCCCEE T ss_conf 5643672205877-------6307886115897257778454048 No 85 >cd06850 biotinyl_domain The biotinyl-domain or biotin carboxyl carrier protein (BCCP) domain is present in all biotin-dependent enzymes, such as acetyl-CoA carboxylase, pyruvate carboxylase, propionyl-CoA carboxylase, methylcrotonyl-CoA carboxylase, geranyl-CoA carboxylase, oxaloacetate decarboxylase, methylmalonyl-CoA decarboxylase, transcarboxylase and urea amidolyase. This domain functions in transferring CO2 from one subsite to another, allowing carboxylation, decarboxylation, or transcarboxylation. During this process, biotin is covalently attached to a specific lysine. Probab=96.98 E-value=0.00096 Score=43.14 Aligned_cols=31 Identities=45% Similarity=0.696 Sum_probs=22.8 Q ss_pred EECCCCCEEEEEECCCCCCCCCCCEEEEEEE Q ss_conf 9989981999998489868369972999953 Q gi|254780267|r 136 IVAPCSGKVQDINVKDGQSVEYGDALLVLEK 166 (171) Q Consensus 136 I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~ 166 (171) |.||..|+|.+++|+.||.|+.||+|+.||- T Consensus 2 i~ap~~G~v~~~~V~~Gd~V~~G~~l~~iE~ 32 (67) T cd06850 2 VTAPMPGTVVKVLVKEGDKVEAGQPLAVLEA 32 (67) T ss_pred EECCCCEEEEEEEECCCCEECCCCEEEEEEE T ss_conf 1087988999999579999978999999993 No 86 >COG0511 AccB Biotin carboxyl carrier protein [Lipid metabolism] Probab=96.98 E-value=0.001 Score=43.01 Aligned_cols=35 Identities=26% Similarity=0.414 Sum_probs=32.0 Q ss_pred EEEEEECCCCCEEEEEECCCCCCCCCCCEEEEEEE Q ss_conf 26899989981999998489868369972999953 Q gi|254780267|r 132 TMNHIVAPCSGKVQDINVKDGQSVEYGDALLVLEK 166 (171) Q Consensus 132 m~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~ 166 (171) -.+.|+||..|++.+++|+.||.|..||.|..||- T Consensus 69 ~~~~V~SPm~Gtv~~~~V~vGd~V~~Gq~l~IiEA 103 (140) T COG0511 69 GGTQVTSPMVGTVYKPFVEVGDTVKAGQTLAIIEA 103 (140) T ss_pred CCCEEECCCCEEEEEEEECCCCEECCCCEEEEEEE T ss_conf 78667478761899987657999758999999982 No 87 >COG1566 EmrA Multidrug resistance efflux pump [Defense mechanisms] Probab=96.97 E-value=0.00084 Score=43.48 Aligned_cols=34 Identities=38% Similarity=0.614 Sum_probs=25.6 Q ss_pred EEECCCCCEEEEEECCCCCCCCCCCEEEEEEECC Q ss_conf 9998998199999848986836997299995347 Q gi|254780267|r 135 HIVAPCSGKVQDINVKDGQSVEYGDALLVLEKTG 168 (171) Q Consensus 135 ~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~~~ 168 (171) .|.+.++|.|.+++|+|.+.|..||+||+|+|.. T Consensus 55 ~Iap~VsG~V~eV~V~dnq~Vk~Gd~L~~iD~~~ 88 (352) T COG1566 55 PIAPQVSGRVTEVNVKDNQLVKKGDVLFRIDPRD 88 (352) T ss_pred EECCCCCEEEEEEEECCCCEECCCCEEEEECCHH T ss_conf 9837673589999916898851897689989578 No 88 >PRK09578 periplasmic multidrug efflux lipoprotein precursor; Reviewed Probab=96.92 E-value=0.0022 Score=40.95 Aligned_cols=44 Identities=20% Similarity=0.302 Sum_probs=25.8 Q ss_pred EEEEEECCCEEEEEECCCCCEEEEEECCCCCCCCCCCEEEEEEEC Q ss_conf 489997377268999899819999984898683699729999534 Q gi|254780267|r 123 TLLIIEAMKTMNHIVAPCSGKVQDINVKDGQSVEYGDALLVLEKT 167 (171) Q Consensus 123 ~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~~ 167 (171) ..+.++|-. .-+|.+.++|.|.+++|+.||.|..||+|++|++. T Consensus 54 ~~G~v~a~~-~v~l~~~v~G~v~~v~v~~Gd~V~kGq~La~lD~~ 97 (385) T PRK09578 54 LPGRLDAYR-QAEVRARVAGIVTARTYEEGQEVKQGAVLFRIDPA 97 (385) T ss_pred EEEEEEEEE-EEEEEEECCEEEEEEEECCCCEECCCCEEEEECCH T ss_conf 999999879-99999664469999980799987389889998534 No 89 >PRK06549 acetyl-CoA carboxylase biotin carboxyl carrier protein subunit; Validated Probab=96.87 E-value=0.0014 Score=42.22 Aligned_cols=32 Identities=31% Similarity=0.487 Sum_probs=29.8 Q ss_pred EEECCCCCEEEEEECCCCCCCCCCCEEEEEEE Q ss_conf 99989981999998489868369972999953 Q gi|254780267|r 135 HIVAPCSGKVQDINVKDGQSVEYGDALLVLEK 166 (171) Q Consensus 135 ~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~ 166 (171) .|.||..|+|.+|+|+.||.|..||+|+.+|- T Consensus 65 ~v~aPm~G~v~~i~v~~Gd~V~~g~~l~vlEA 96 (132) T PRK06549 65 AIPSPMPGTILKVLVAVGDQVTENQPLLILEA 96 (132) T ss_pred EECCCCCEEEEEEEECCCCEECCCCEEEEEEE T ss_conf 98089997999999389999879998999980 No 90 >COG2190 NagE Phosphotransferase system IIA components [Carbohydrate transport and metabolism] Probab=96.84 E-value=0.003 Score=40.16 Aligned_cols=57 Identities=23% Similarity=0.401 Sum_probs=33.8 Q ss_pred CCEECCCCCEEECCEEEEEECCC-------EEEEEECCCCCEE-EEEECCCCCCCCCCCEEEEEE Q ss_conf 10001244201043489997377-------2689998998199-999848986836997299995 Q gi|254780267|r 109 DPFVNKGNLVVEGQTLLIIEAMK-------TMNHIVAPCSGKV-QDINVKDGQSVEYGDALLVLE 165 (171) Q Consensus 109 ~~~V~vGd~Vk~G~~l~iiEAMK-------m~n~I~a~~~G~I-~~i~v~~G~~V~~gq~L~~i~ 165 (171) ..+|+.||+|++||.|+-..--. +..+|.=...... .-+....+..|..|+.|+.+- T Consensus 90 ~~~v~~Gd~Vk~Gd~Li~fDl~~Ik~~~~s~itpvVvtN~~~~~~~~~~~~~~~v~~g~~~~~~~ 154 (156) T COG2190 90 ESLVKEGDKVKAGDPLLEFDLDLIKAKGYSTITPVVVTNMDDFKKIVKLSGGGEVKAGETLLLVI 154 (156) T ss_pred EEEEECCCEECCCCEEEEECHHHHHHCCCCCEEEEEEECCHHCEEEEEECCCCCEECCCEEEEEE T ss_conf 78750799881598989997999963487860269998203500167613676240487438987 No 91 >PRK11556 multidrug efflux system subunit MdtA; Provisional Probab=96.81 E-value=0.0025 Score=40.66 Aligned_cols=46 Identities=20% Similarity=0.344 Sum_probs=36.8 Q ss_pred CEEEEEECCCEEEEEECCCCCEEEEEECCCCCCCCCCCEEEEEEECC Q ss_conf 34899973772689998998199999848986836997299995347 Q gi|254780267|r 122 QTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQSVEYGDALLVLEKTG 168 (171) Q Consensus 122 ~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~~~ 168 (171) ..++.|+|-. .-.|.+.++|+|.+|+|+.||.|..||.|++|++.. T Consensus 77 ~~~Gtv~a~~-~v~v~~~VsG~I~~i~v~~Gd~VkkGqvLa~iD~~~ 122 (415) T PRK11556 77 TGLGTVTAAN-TVTVRSRVDGQLMALHFQEGQQVKAGDLLAEIDPSQ 122 (415) T ss_pred EEEEEEEEEE-EEEEEEEEEEEEEEEECCCCCEECCCCEEEEECCHH T ss_conf 9999998646-999987850299999858989978999999988357 No 92 >pfam04952 AstE_AspA Succinylglutamate desuccinylase / Aspartoacylase family. This family includes Succinylglutamate desuccinylase EC:3.1.-.- that catalyses the fifth and last step in arginine catabolism by the arginine succinyltransferase pathway. The family also include aspartoacylase EC:3.5.1.15 which cleaves acylaspartate into a fatty acid and aspartate. Mutations in human aspartoacylase (ACY-2) lead to Canavan disease. This family is probably structurally related to pfam00246 (Bateman A pers. obs.). Probab=96.61 E-value=0.0091 Score=37.19 Aligned_cols=70 Identities=26% Similarity=0.347 Sum_probs=54.3 Q ss_pred CCCCEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEE---EEECC-CEEEEEECCCCCEEEEEECCCCCCCCCCCEEE Q ss_conf 466268704625677403676410001244201043489---99737-72689998998199999848986836997299 Q gi|254780267|r 87 DNYHTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLL---IIEAM-KTMNHIVAPCSGKVQDINVKDGQSVEYGDALL 162 (171) Q Consensus 87 ~~~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~---iiEAM-Km~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~ 162 (171) .....+.||..|.|+ +.++.||.|++||.+. ++.-. ....++.||.+|.|.-+ ...-.|..|+.|+ T Consensus 215 ~~~~~~~a~~~Gl~~--------~~~~~g~~V~~G~~l~~~~i~d~~~~~~~~~~ap~dG~i~~~--~~~~~v~~G~~l~ 284 (289) T pfam04952 215 DRPRDIRAEAAGLVE--------PHVNLGDDVEAGDLLPGGHVFDSLGGESIEYRAPDTGIIVFI--NEAAYVGKGAALA 284 (289) T ss_pred CCCEECCCCCCEEEE--------EECCCCCCEECCCCCCCCEEECCCCCCEEEEEECCCEEEEEE--CCCCEECCCCEEE T ss_conf 364130377646999--------942789966278641773021699993699992899999995--6866456897589 Q ss_pred EEEE Q ss_conf 9953 Q gi|254780267|r 163 VLEK 166 (171) Q Consensus 163 ~i~~ 166 (171) .|.+ T Consensus 285 ~i~~ 288 (289) T pfam04952 285 LVAK 288 (289) T ss_pred EEEC T ss_conf 9961 No 93 >PRK09282 pyruvate carboxylase subunit B; Validated Probab=96.53 E-value=0.0032 Score=39.99 Aligned_cols=35 Identities=37% Similarity=0.646 Sum_probs=30.9 Q ss_pred EEEEEECCCCCEEEEEECCCCCCCCCCCEEEEEEE Q ss_conf 26899989981999998489868369972999953 Q gi|254780267|r 132 TMNHIVAPCSGKVQDINVKDGQSVEYGDALLVLEK 166 (171) Q Consensus 132 m~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~ 166 (171) -..+|.||..|.|.+|+|+.||.|..||+|+.||- T Consensus 510 ~~~~V~APmpG~V~~V~Vk~Gd~V~~Gd~L~vlEA 544 (580) T PRK09282 510 AGGPVTAPMAGNIVKVLVKEGDKVKEGDVLLILEA 544 (580) T ss_pred CCCEECCCCCCEEEEEEECCCCEECCCCEEEEEEH T ss_conf 99853279982699999789998789998999851 No 94 >pfam05896 NQRA Na(+)-translocating NADH-quinone reductase subunit A (NQRA). This family consists of several bacterial Na(+)-translocating NADH-quinone reductase subunit A (NQRA) proteins. The Na(+)-translocating NADH: ubiquinone oxidoreductase (Na(+)-NQR) generates an electrochemical Na(+) potential driven by aerobic respiration. Probab=96.53 E-value=0.0036 Score=39.68 Aligned_cols=40 Identities=33% Similarity=0.384 Sum_probs=31.1 Q ss_pred CCEECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEEC Q ss_conf 10001244201043489997377268999899819999984 Q gi|254780267|r 109 DPFVNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINV 149 (171) Q Consensus 109 ~~~V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v 149 (171) +.+|++||+|+.||+|+.=-..-= .-.+||++|+|.+|.- T Consensus 42 kl~VkeGD~Vk~G~pLF~dK~np~-i~f~SPvsG~V~~I~R 81 (257) T pfam05896 42 KMLVKEGDKVKAGQPLFEDKKNPG-VKFTAPASGTVVAINR 81 (257) T ss_pred EEEEECCCEEECCCEEEEEECCCC-CEEECCCCEEEEEECC T ss_conf 699834999855875799736988-4685678738950105 No 95 >TIGR02876 spore_yqfD sporulation protein YqfD; InterPro: IPR010690 This family consists of several putative bacterial stage IV sporulation (SpoIV) proteins. YqfD of Bacillus subtilis (P54469 from SWISSPROT) is known to be essential for efficient sporulation although its exact function is unknown .. Probab=96.52 E-value=0.0079 Score=37.58 Aligned_cols=91 Identities=16% Similarity=0.227 Sum_probs=52.4 Q ss_pred CCCCCC-HHHHH-HHHHHHHHCCCEEEEEEECCEEEEEECCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 632139-89999-9999998659708999509969999706776521133202344455422233221122344444544 Q gi|254780267|r 4 KKQKIN-LTLIR-NLANILNETNLTEVEIDNDGMRIRLLRSPQKDTVTNYYSEDNKNNHSLVGFPPSSTIDNTPPESDLI 81 (171) Q Consensus 4 ~~~~md-i~~Ik-~Li~~~~es~l~eleie~~~~kI~I~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 81 (171) ++..|| .+.|+ +|++-+++=-+..++++...+.|++....... . . T Consensus 137 ~k~~~Ds~~~~~~~L~~~~~ei~Wv~v~~~Gt~l~~kvvEk~~~k------------i---------------------~ 183 (406) T TIGR02876 137 KKFSIDSVYKLERKLLDRVPEIMWVGVRVRGTKLVIKVVEKQEPK------------I---------------------K 183 (406) T ss_pred CCCCCCCHHHHHHHHHHCCCCEEEEEEEEECCEEEEEEEECCCCC------------C---------------------C T ss_conf 203688988999999845994899999985309999998213332------------4---------------------4 Q ss_pred CCCCCCCCCEEEECCCEEEEEECCCCCCCEECCCCCEEECCEE--EEE Q ss_conf 3334546626870462567740367641000124420104348--999 Q gi|254780267|r 82 PLLSPDNYHTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTL--LII 127 (171) Q Consensus 82 ~~~~~~~~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l--~ii 127 (171) +........-|-|=+-|.+++-=.-...|.|+.||.|++||.| |.+ T Consensus 184 ~~~~~~ep~niVAkk~G~i~~~~v~~G~~~Vk~GD~VkkGd~Li~G~~ 231 (406) T TIGR02876 184 EVKKKAEPRNIVAKKDGVIKRVYVTSGEAVVKKGDVVKKGDLLISGIL 231 (406) T ss_pred CCCCCCCCCCEEEECCEEEEEEEEECCEEEECCCCEECCCCEEEEECC T ss_conf 568767772326633408888655111648548887657717871110 No 96 >TIGR01995 PTS-II-ABC-beta PTS system, beta-glucoside-specific IIABC component; InterPro: IPR011297 The phosphoenolpyruvate-dependent sugar phosphotransferase system (PTS) , is a major carbohydrate transport system in bacteria. The PTS catalyses the phosphorylation of incoming sugar substrates and coupled with translocation across the cell membrane, makes the PTS a link between the uptake and metabolism of sugars. The general mechanism of the PTS is the following: a phosphoryl group from phosphoenolpyruvate (PEP) is transferred via a signal transduction pathway, to enzyme I (EI) which in turn transfers it to a phosphoryl carrier, the histidine protein (HPr). Phospho-HPr then transfers the phosphoryl group to a sugar-specific permease, a membrane-bound complex known as enzyme 2 (EII), which transports the sugar to the cell. EII consists of at least three structurally distinct domains IIA, IIB and IIC . These can either be fused together in a single polypeptide chain or exist as two or three interactive chains, formerly called enzymes II (EII) and III (EIII). The first domain (IIA or EIIA) carries the first permease-specific phosphorylation site, a histidine which is phosphorylated by phospho-HPr. The second domain (IIB or EIIB) is phosphorylated by phospho-IIA on a cysteinyl or histidyl residue, depending on the sugar transported. Finally, the phosphoryl group is transferred from the IIB domain to the sugar substrate concomitantly with the sugar uptake processed by the IIC domain. This third domain (IIC or EIIC) forms the translocation channel and the specific substrate-binding site. An additional transmembrane domain IID, homologous to IIC, can be found in some PTSs, e.g. for mannose , , , . This entry represents a family of PTS enzyme II proteins in which all three domains are found in the same polypeptide chain and which appear to have a broad specificity for beta-glucosides including salicin (beta-D-glucose-1-salicylate) and arbutin (hydroquinone-O-beta-D-glucopyranoside) . These are distinct from the closely related sucrose-specific and trehalose-specific PTS transporters.; GO: 0008982 protein-N(PI)-phosphohistidine-sugar phosphotransferase activity, 0009401 phosphoenolpyruvate-dependent sugar phosphotransferase system. Probab=96.50 E-value=0.0036 Score=39.67 Aligned_cols=67 Identities=27% Similarity=0.468 Sum_probs=49.0 Q ss_pred CCCEEEECCCEEEEEECCCCCCCEECCCCCE----EECCEEEEEECCCEEEEEECCCCCEEEEE---------------- Q ss_conf 6626870462567740367641000124420----10434899973772689998998199999---------------- Q gi|254780267|r 88 NYHTVTSPMVGTAYLASSPGSDPFVNKGNLV----VEGQTLLIIEAMKTMNHIVAPCSGKVQDI---------------- 147 (171) Q Consensus 88 ~~~~I~SPmvGtfy~~psP~~~~~V~vGd~V----k~G~~l~iiEAMKm~n~I~a~~~G~I~~i---------------- 147 (171) ..++|.||+.|+.. ++=+|-|+| --|+-++|+ --+-+|.||+||+|..+ T Consensus 508 ~~~~~~~P~~Ge~~--------~L~eV~D~vFSsG~mG~G~Ai~---P~eG~~~AP~dG~v~~VF~T~HAiGi~~~nG~E 576 (660) T TIGR01995 508 RKEELYAPVAGEVV--------PLNEVPDEVFSSGAMGKGVAIL---PSEGEVVAPVDGTVTAVFPTKHAIGIRSDNGVE 576 (660) T ss_pred HHHHCCCCEEEEEE--------ECCCCCCHHHHHHHCCCCEEEE---ECCCEEECCCCCEEEEEECCCCEEEECCCCCEE T ss_conf 13320576013467--------5056884123022306845897---158778823377899997486227412689728 Q ss_pred -------------------ECCCCCCCCCCCEEEEEE Q ss_conf -------------------848986836997299995 Q gi|254780267|r 148 -------------------NVKDGQSVEYGDALLVLE 165 (171) Q Consensus 148 -------------------~v~~G~~V~~gq~L~~i~ 165 (171) +|+.||.|..||+|.+++ T Consensus 577 ~LIHvGIDTV~L~G~~Fe~~V~~Gd~v~~Gq~L~~~D 613 (660) T TIGR01995 577 ILIHVGIDTVELKGEYFEILVKVGDKVKAGQLLLTFD 613 (660) T ss_pred EEEEEEEEEEEECCEEEEEEEEECCEEEECCCCCEEE T ss_conf 9998412345654501222145667875066312210 No 97 >TIGR01108 oadA oxaloacetate decarboxylase alpha subunit; InterPro: IPR005776 This family describes the bacterial oxaloacetate decarboxylase alpha subunit and its equivalents in archaea . The oxaloacetate decarboxylase Na+ pump is the paradigm of the family of Na+ transport decarboxylases that present in bacteria and archaea. It a multi subunit enzyme consisting of a peripheral alpha-subunit and integral membrane subunits beta and gamma. The energy released by the decarboxylation reaction of oxaloacetate is coupled to Na+ ion pumping across the membrane.; GO: 0008948 oxaloacetate decarboxylase activity, 0006814 sodium ion transport. Probab=96.49 E-value=0.0022 Score=41.01 Aligned_cols=36 Identities=33% Similarity=0.657 Sum_probs=32.3 Q ss_pred CEEEEEECCCCCEEEEEECCCCCCCCCCCEEEEEEE Q ss_conf 726899989981999998489868369972999953 Q gi|254780267|r 131 KTMNHIVAPCSGKVQDINVKDGQSVEYGDALLVLEK 166 (171) Q Consensus 131 Km~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~ 166 (171) +--.+|+||..|.|.+|.|.+|+.|..||.|+.||- T Consensus 549 ~a~~~vTaP~aG~i~~~~V~~G~~V~~G~vlL~~EA 584 (616) T TIGR01108 549 GAITDVTAPIAGSIVKITVSEGDEVKEGEVLLVVEA 584 (616) T ss_pred CCCCCCCCCCCCCEEEEEECCCCEECCCCEEEEEEC T ss_conf 989875667777369998357874315775885310 No 98 >PRK09859 multidrug efflux system protein MdtE; Provisional Probab=96.44 E-value=0.007 Score=37.90 Aligned_cols=32 Identities=25% Similarity=0.430 Sum_probs=18.0 Q ss_pred EEECCCCCEEEEEECCCCCCCCCCCEEEEEEE Q ss_conf 99989981999998489868369972999953 Q gi|254780267|r 135 HIVAPCSGKVQDINVKDGQSVEYGDALLVLEK 166 (171) Q Consensus 135 ~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~ 166 (171) .|.+.++|+|.+++|+.||.|..||+|++|++ T Consensus 63 ~l~~~VsG~v~~~~v~~Gd~VkkGq~La~LD~ 94 (385) T PRK09859 63 EIRPQVGGIIIKRNFIEGDKVNQGDSLYQIDP 94 (385) T ss_pred EEEEECCEEEEEEECCCCCEEECCCEEEEECC T ss_conf 99814562999998389998848988998284 No 99 >TIGR00830 PTBA PTS system, glucose subfamily, IIA component; InterPro: IPR001127 The phosphoenolpyruvate-dependent sugar phosphotransferase system (PTS) , is a major carbohydrate transport system in bacteria. The PTS catalyses the phosphorylation of incoming sugar substrates and coupled with translocation across the cell membrane, makes the PTS a link between the uptake and metabolism of sugars. The general mechanism of the PTS is the following: a phosphoryl group from phosphoenolpyruvate (PEP) is transferred via a signal transduction pathway, to enzyme I (EI) which in turn transfers it to a phosphoryl carrier, the histidine protein (HPr). Phospho-HPr then transfers the phosphoryl group to a sugar-specific permease, a membrane-bound complex known as enzyme 2 (EII), which transports the sugar to the cell. EII consists of at least three structurally distinct domains IIA, IIB and IIC . These can either be fused together in a single polypeptide chain or exist as two or three interactive chains, formerly called enzymes II (EII) and III (EIII). The first domain (IIA or EIIA) carries the first permease-specific phosphorylation site, a histidine which is phosphorylated by phospho-HPr. The second domain (IIB or EIIB) is phosphorylated by phospho-IIA on a cysteinyl or histidyl residue, depending on the sugar transported. Finally, the phosphoryl group is transferred from the IIB domain to the sugar substrate concomitantly with the sugar uptake processed by the IIC domain. This third domain (IIC or EIIC) forms the translocation channel and the specific substrate-binding site. An additional transmembrane domain IID, homologous to IIC, can be found in some PTSs, e.g. for mannose , , , . ; GO: 0005351 sugar:hydrogen ion symporter activity, 0006810 transport, 0009401 phosphoenolpyruvate-dependent sugar phosphotransferase system, 0016020 membrane. Probab=96.39 E-value=0.0049 Score=38.84 Aligned_cols=22 Identities=32% Similarity=0.524 Sum_probs=15.4 Q ss_pred EEEECCCCCCCCCCCEEEEEEE Q ss_conf 9998489868369972999953 Q gi|254780267|r 145 QDINVKDGQSVEYGDALLVLEK 166 (171) Q Consensus 145 ~~i~v~~G~~V~~gq~L~~i~~ 166 (171) =+-+|+.||.|+.||+|+++.. T Consensus 90 F~~~v~~G~~V~~Gd~l~efD~ 111 (129) T TIGR00830 90 FTSHVEEGDKVKKGDPLLEFDL 111 (129) T ss_pred EEEEEEECCEECCCCEEEEECH T ss_conf 4787643878717977999566 No 100 >KOG3373 consensus Probab=96.38 E-value=0.0023 Score=40.78 Aligned_cols=72 Identities=28% Similarity=0.401 Sum_probs=51.8 Q ss_pred CCEEE----EEECCCCCCCEE---CCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCCC---CCC----CCCEE Q ss_conf 62567----740367641000---12442010434899973772689998998199999848986---836----99729 Q gi|254780267|r 96 MVGTA----YLASSPGSDPFV---NKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQ---SVE----YGDAL 161 (171) Q Consensus 96 mvGtf----y~~psP~~~~~V---~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~---~V~----~gq~L 161 (171) -+||+ |.++.=+.-.|| ++|..|.+||.+|.+|+-|--.+|.||++|+|.+|+-.-.+ .|+ -+-=| T Consensus 65 ~vgtvGIt~~A~~~LGdvv~veLPe~Gt~vskgds~gavESVKaaSeIysp~sGeVtEiNe~l~EnPGlvN~Sp~e~GWl 144 (172) T KOG3373 65 GVGTVGITDFAQEHLGDVVYVELPEVGTEVSKGDSFGAVESVKAASEIYSPVSGEVTEINEKLEENPGLVNESPEEDGWL 144 (172) T ss_pred CEEEECHHHHHHHHCCCEEEEECCCCCCCCCCCCCEEEEEEHHHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEE T ss_conf 72575333555652276699976888972236760114420134565307677527882452344888545785558538 Q ss_pred EEEEEC Q ss_conf 999534 Q gi|254780267|r 162 LVLEKT 167 (171) Q Consensus 162 ~~i~~~ 167 (171) ++++.+ T Consensus 145 ~k~kls 150 (172) T KOG3373 145 IKMKLS 150 (172) T ss_pred EEEEEC T ss_conf 999858 No 101 >TIGR01843 type_I_hlyD type I secretion membrane fusion protein, HlyD family; InterPro: IPR010129 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). Type I secretion is an ABC transporter that exports proteins, without cleavage of any signal sequence, from the cytosol to extracellular medium across both inner and outer membranes. The secretion signal is found in the C terminus of the transported protein. This entry represents the adaptor protein between the ATP-binding cassette (ABC) protein of the inner membrane and the outer membrane protein, and is called the membrane fusion protein. This entry selects a group of sequences closely related to HlyD; it is defined narrowly and excludes, for example, colicin V secretion protein CvaA and multidrug efflux proteins.; GO: 0015031 protein transport, 0009276 1-2nm peptidoglycan-based cell wall, 0016021 integral to membrane. Probab=96.37 E-value=0.0026 Score=40.48 Aligned_cols=35 Identities=29% Similarity=0.414 Sum_probs=27.6 Q ss_pred EEECCCCCEEEEEECCCCCCCCCCCEEEEEEECCC Q ss_conf 99989981999998489868369972999953478 Q gi|254780267|r 135 HIVAPCSGKVQDINVKDGQSVEYGDALLVLEKTGD 169 (171) Q Consensus 135 ~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~~~~ 169 (171) .|-+--.|+|.+|+|++||.|..||+|+.++..+. T Consensus 45 ~vQhlegGi~~~I~V~EG~~V~~Gq~L~~Ld~t~~ 79 (434) T TIGR01843 45 VVQHLEGGIVREILVREGDRVKAGQVLVELDATSV 79 (434) T ss_pred EEECCCCCEEEEEEECCCCEECCCCCCEEECHHHH T ss_conf 99668653441355225864203770135413578 No 102 >COG4770 Acetyl/propionyl-CoA carboxylase, alpha subunit [Lipid metabolism] Probab=96.16 E-value=0.007 Score=37.90 Aligned_cols=32 Identities=47% Similarity=0.761 Sum_probs=30.0 Q ss_pred EEECCCCCEEEEEECCCCCCCCCCCEEEEEEE Q ss_conf 99989981999998489868369972999953 Q gi|254780267|r 135 HIVAPCSGKVQDINVKDGQSVEYGDALLVLEK 166 (171) Q Consensus 135 ~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~ 166 (171) .+.||..|+|..+.|+.|+.|+.||+|+++|- T Consensus 577 ~l~aPMpG~v~~v~V~~G~~V~~G~~lvvlEA 608 (645) T COG4770 577 ELLAPMPGTVVSVAVKEGQEVSAGDLLVVLEA 608 (645) T ss_pred CEECCCCCEEEEEEECCCCEECCCCEEEEEEE T ss_conf 16567896289999368977547986788671 No 103 >PRK12999 pyruvate carboxylase; Reviewed Probab=96.09 E-value=0.0063 Score=38.17 Aligned_cols=32 Identities=50% Similarity=0.792 Sum_probs=29.1 Q ss_pred EEECCCCCEEEEEECCCCCCCCCCCEEEEEEE Q ss_conf 99989981999998489868369972999953 Q gi|254780267|r 135 HIVAPCSGKVQDINVKDGQSVEYGDALLVLEK 166 (171) Q Consensus 135 ~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~ 166 (171) +|.||..|.|.+|+|+.||.|..||+|+.+|- T Consensus 1079 ~VgAPmpG~V~kV~Vk~GD~VkkGd~LlvlEA 1110 (1147) T PRK12999 1079 HIGAPMPGSVVTVAVKEGDEVKAGDPLLVIEA 1110 (1147) T ss_pred CCCCCCCCEEEEEEECCCCEECCCCEEEEEEH T ss_conf 13699982589999689998789998999851 No 104 >TIGR00998 8a0101 efflux pump membrane protein; InterPro: IPR005694 emrA and emrB confer resistance to carbonylcyanide m-chlorophenylhydrazone, nalidixic acid, and a number of other toxic compounds. EmrB encodes a highly hydrophobic 56.2-kDa peptide, with 14 potential alpha-helices to span the inner membrane. EmrA encodes a putative 42.7-kDa peptide containing a single hydrophobic domain and a large C-terminal hydrophilic domain.; GO: 0015238 drug transporter activity, 0006855 multidrug transport, 0009276 1-2nm peptidoglycan-based cell wall. Probab=95.94 E-value=0.0064 Score=38.15 Aligned_cols=33 Identities=33% Similarity=0.437 Sum_probs=20.6 Q ss_pred EEEECCCCCEEEEEECCCCCCCCCCCEEEEEEE Q ss_conf 899989981999998489868369972999953 Q gi|254780267|r 134 NHIVAPCSGKVQDINVKDGQSVEYGDALLVLEK 166 (171) Q Consensus 134 n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~ 166 (171) ..|.||+||.|.+..|..|+.|..|||||.|=| T Consensus 245 T~iraP~dG~Va~r~vqvG~~vS~g~p~ma~V~ 277 (379) T TIGR00998 245 TEIRAPFDGYVARRKVQVGQVVSPGQPLMAVVP 277 (379) T ss_pred CEEECCCCCEEEEEECCCCCEECCCCCEEEEEC T ss_conf 564367220587763078861158972577784 No 105 >TIGR01349 PDHac_trf_mito pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase; InterPro: IPR006257 This group of sequences represent one of several closely related clades of the dihydrolipoamide acetyltransferase subunit of the pyruvate dehydrogenase complex. It includes sequences from mitochondria and from alpha and beta branches of the proteobacteria, as well as from some other bacteria, but not the Gram-positive bacteria.; GO: 0004742 dihydrolipoyllysine-residue acetyltransferase activity, 0006090 pyruvate metabolic process, 0045254 pyruvate dehydrogenase complex. Probab=95.84 E-value=0.022 Score=34.84 Aligned_cols=56 Identities=25% Similarity=0.316 Sum_probs=51.9 Q ss_pred EECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCCCC-CCCCCEEEEEEE Q ss_conf 00124420104348999737726899989981999998489868-369972999953 Q gi|254780267|r 111 FVNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQS-VEYGDALLVLEK 166 (171) Q Consensus 111 ~V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~-V~~gq~L~~i~~ 166 (171) ..|+||+|+.||+||=||--|--.++++.=+|.+-+|||..|.. |.-|.||-++=. T Consensus 20 ~KKeGDkv~~GDViaEIETDKAtm~~e~~~eGyLAKILv~eGTkdvpvn~pIAV~v~ 76 (584) T TIGR01349 20 LKKEGDKVKPGDVIAEIETDKATMEFEAQEEGYLAKILVPEGTKDVPVNKPIAVLVE 76 (584) T ss_pred EECCCCEECCCCEEEEEECCCCEEEEEECCCCEEEEEEECCCCCCCCCCCEEEEEEC T ss_conf 411377716887698874584267457626857888865247857635861689606 No 106 >TIGR01235 pyruv_carbox pyruvate carboxylase; InterPro: IPR005930 In the postabsorptive state, blood glucose concentration is kept constant by a combination of glycogenolysis and gluconeogenesis. During prolonged fasting, when hepatic glycogen is exhausted, gluconeogenesis becomes the only source of plasma glucose. Gluconeogenesis (the production of new glucose molecules) occurs mainly in liver and, to a small extent, in kidneys. Pyruvate carboxylase (6.4.1.1 from EC), a member of the biotin-dependent enzyme family, catalyses the ATP-dependent carboxylation of pyruvate to oxaloacetate, thus playing a crucial role in gluconeogenesis.ATP + pyruvate + HCO3- = ADP + phosphate + oxaloacetate Most well characterised forms of active enzyme consist of four identical subunits arranged in a tetrahedron-like structure. Each subunit contains three functional domains: the biotin carboxylation domain, the transcarboxylation domain and the biotin carboxyl carrier domain.; GO: 0004736 pyruvate carboxylase activity, 0006094 gluconeogenesis, 0005737 cytoplasm. Probab=95.84 E-value=0.0084 Score=37.43 Aligned_cols=31 Identities=48% Similarity=0.781 Sum_probs=29.1 Q ss_pred EEECCCCCEEEEEECCCCCCCCCCCEEEEEE Q ss_conf 9998998199999848986836997299995 Q gi|254780267|r 135 HIVAPCSGKVQDINVKDGQSVEYGDALLVLE 165 (171) Q Consensus 135 ~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~ 165 (171) .|-||..|+|.+|+|+.|+.|+.||+|+.++ T Consensus 1102 hvGApMpG~v~~v~v~~G~~v~kGd~l~~~~ 1132 (1169) T TIGR01235 1102 HVGAPMPGVVIEVKVEAGQKVNKGDPLVVLE 1132 (1169) T ss_pred HCCCCCCCCEEEEEEECCCEECCCCEEEEEE T ss_conf 2256899707999941387301678489960 No 107 >TIGR02645 ARCH_P_rylase putative thymidine phosphorylase; InterPro: IPR013466 Proteins in this entry are closely related to characterised examples of thymidine phosphorylase (2.4.2.4 from EC) and pyrimidine nucleoside phosphorylase (2.4.2.2 from EC). Most examples are found in the archaea, but other examples are found in bacteria such as Legionella pneumophila (strain Paris) and Rhodopseudomonas palustris CGA009.; GO: 0009032 thymidine phosphorylase activity. Probab=95.53 E-value=0.0092 Score=37.17 Aligned_cols=22 Identities=32% Similarity=0.382 Sum_probs=13.8 Q ss_pred CCEECCCCCEEECCEEEEEECC Q ss_conf 1000124420104348999737 Q gi|254780267|r 109 DPFVNKGNLVVEGQTLLIIEAM 130 (171) Q Consensus 109 ~~~V~vGd~Vk~G~~l~iiEAM 130 (171) .-|||+||+||+||+|..|-|- T Consensus 456 ~lhvK~G~~Vk~GdPL~TIyA~ 477 (499) T TIGR02645 456 ELHVKVGDKVKKGDPLYTIYAE 477 (499) T ss_pred EEEEEECCEEECCCCCEEEECC T ss_conf 8886546772038701588505 No 108 >TIGR01843 type_I_hlyD type I secretion membrane fusion protein, HlyD family; InterPro: IPR010129 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). Type I secretion is an ABC transporter that exports proteins, without cleavage of any signal sequence, from the cytosol to extracellular medium across both inner and outer membranes. The secretion signal is found in the C terminus of the transported protein. This entry represents the adaptor protein between the ATP-binding cassette (ABC) protein of the inner membrane and the outer membrane protein, and is called the membrane fusion protein. This entry selects a group of sequences closely related to HlyD; it is defined narrowly and excludes, for example, colicin V secretion protein CvaA and multidrug efflux proteins.; GO: 0015031 protein transport, 0009276 1-2nm peptidoglycan-based cell wall, 0016021 integral to membrane. Probab=95.48 E-value=0.019 Score=35.31 Aligned_cols=34 Identities=29% Similarity=0.368 Sum_probs=16.8 Q ss_pred CCCEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEE Q ss_conf 6626870462567740367641000124420104348999 Q gi|254780267|r 88 NYHTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLII 127 (171) Q Consensus 88 ~~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~ii 127 (171) .-..|+||.-|+++.- .+.-+|--|..|++|+.| T Consensus 277 ~~~~irsPvdG~v~~l------~~~T~GgVv~pG~~lm~I 310 (434) T TIGR01843 277 QRLIIRSPVDGTVQSL------KVHTVGGVVQPGETLMEI 310 (434) T ss_pred HHHEEECCCCEEEEEE------EEEEECCEECCCCCEEEE T ss_conf 3211442545199999------998405443278617898 No 109 >COG0845 AcrA Membrane-fusion protein [Cell envelope biogenesis, outer membrane] Probab=95.17 E-value=0.074 Score=31.68 Aligned_cols=47 Identities=28% Similarity=0.343 Sum_probs=37.6 Q ss_pred ECCEEEEEECCCEEEEEECCCCCEEEEEECCCCCCCCCCCEEEEEEEC Q ss_conf 043489997377268999899819999984898683699729999534 Q gi|254780267|r 120 EGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQSVEYGDALLVLEKT 167 (171) Q Consensus 120 ~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~~ 167 (171) .-...+.+++ .-...+.+...|.|.+|+|+.||.|..||+|+.++.+ T Consensus 54 ~~~~~G~v~~-~~~~~v~~~~~G~v~~i~v~~G~~Vk~Gq~L~~ld~~ 100 (372) T COG0845 54 AVRAPGRVEA-TRSVEVLARVAGIVAEILVKEGDRVKKGQLLARLDPS 100 (372) T ss_pred EEEEEEEEEE-EEEEEEECCCCCEEEEEEECCCCEEECCCEEEEECCH T ss_conf 4787569986-1347874036747989983689888359748995532 No 110 >PRK07051 hypothetical protein; Validated Probab=94.98 E-value=0.023 Score=34.77 Aligned_cols=34 Identities=29% Similarity=0.468 Sum_probs=30.3 Q ss_pred CCCEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEE Q ss_conf 66268704625677403676410001244201043489997 Q gi|254780267|r 88 NYHTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIE 128 (171) Q Consensus 88 ~~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiE 128 (171) -.+.|+||..|++ .+.+|+.||.|..||+|..|| T Consensus 46 m~~~V~A~~~G~V-------~~ilv~~G~~V~~Gq~L~~Ie 79 (80) T PRK07051 46 QFSEVEAEVAGRV-------VRFLVDDGEPVDAGQVLMRIE 79 (80) T ss_pred CCCEEECCCCCEE-------EEEEECCCCEECCCCEEEEEE T ss_conf 7746726998299-------999956989867999789982 No 111 >COG1038 PycA Pyruvate carboxylase [Energy production and conversion] Probab=94.77 E-value=0.12 Score=30.41 Aligned_cols=33 Identities=45% Similarity=0.715 Sum_probs=29.6 Q ss_pred EEEECCCCCEEEEEECCCCCCCCCCCEEEEEEE Q ss_conf 899989981999998489868369972999953 Q gi|254780267|r 134 NHIVAPCSGKVQDINVKDGQSVEYGDALLVLEK 166 (171) Q Consensus 134 n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~ 166 (171) +.|-||..|+|.+++|..|+.|..||+|..||- T Consensus 1080 ~higApmpG~Vv~v~V~~G~~Vk~Gd~l~~ieA 1112 (1149) T COG1038 1080 GHIGAPMPGVVVEVKVKKGDKVKKGDVLAVIEA 1112 (1149) T ss_pred CCCCCCCCCCEEEEEECCCCCCCCCCEEEEEHH T ss_conf 121788887269999745873047986533022 No 112 >TIGR01936 nqrA NADH:ubiquinone oxidoreductase, Na(+)-translocating, A subunit; InterPro: IPR008703 This family consists of several bacterial Na^+-translocating NADH-quinone reductase subunit A (NQRA) proteins. The Na^+-translocating NADH: ubiquinone oxidoreductase (Na^+-NQR) generates an electrochemical Na^+ potential driven by aerobic respiration .; GO: 0016655 oxidoreductase activity acting on NADH or NADPH quinone or similar compound as acceptor, 0006814 sodium ion transport. Probab=94.58 E-value=0.025 Score=34.52 Aligned_cols=41 Identities=29% Similarity=0.291 Sum_probs=30.6 Q ss_pred CEECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCC Q ss_conf 000124420104348999737726899989981999998489 Q gi|254780267|r 110 PFVNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKD 151 (171) Q Consensus 110 ~~V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~ 151 (171) .-|.+||+|++||+|+-=-+--- .-++||+||+|.+|.--+ T Consensus 43 ~~v~~GD~V~~G~~L~~~K~~P~-v~fTsPvsG~V~ai~RG~ 83 (466) T TIGR01936 43 MKVRPGDKVKAGQVLFEDKKNPG-VKFTSPVSGEVVAINRGA 83 (466) T ss_pred EEEEECCEEEECCHHHHHCCCCC-CEEECCCCCEEEEEECCC T ss_conf 67700770100222333026898-568617775699974189 No 113 >TIGR02712 urea_carbox urea carboxylase; InterPro: IPR014084 Members of this family are ATP-dependent urea carboxylases, including characterised members from Oleomonas sagaranensis (alpha class Proteobacterium) and yeasts such as Saccharomyces cerevisiae (Baker's yeast). The allophanate hydrolase domain of the yeast enzyme is not included in this entry and is represented by an adjacent gene in Oleomonas sagaranensis. The fusion of urea carboxylase and allophanate hydrolase is designated urea amidolyase. The enzyme from O. sagaranensis was shown to be highly active on acetamide and formamide as well as urea.. Probab=94.50 E-value=0.047 Score=32.87 Aligned_cols=32 Identities=28% Similarity=0.514 Sum_probs=29.7 Q ss_pred EEECCCCCEEEEEECCCCCCCCCCCEEEEEEE Q ss_conf 99989981999998489868369972999953 Q gi|254780267|r 135 HIVAPCSGKVQDINVKDGQSVEYGDALLVLEK 166 (171) Q Consensus 135 ~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~ 166 (171) .|.|++.|.+=+++|+.|+.|+.||+|+.||- T Consensus 1159 ~V~s~~~G~~WK~~V~~G~~V~~G~~l~i~Ea 1190 (1226) T TIGR02712 1159 QVESEVAGNVWKVLVEVGDRVEAGQPLVIIEA 1190 (1226) T ss_pred EEEEECEEEEEEEEECCCCEEECCCEEEEEEE T ss_conf 89850201465665212663204873899943 No 114 >COG1566 EmrA Multidrug resistance efflux pump [Defense mechanisms] Probab=94.48 E-value=0.051 Score=32.67 Aligned_cols=31 Identities=23% Similarity=0.388 Sum_probs=16.4 Q ss_pred CCEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEE Q ss_conf 62687046256774036764100012442010434899 Q gi|254780267|r 89 YHTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLI 126 (171) Q Consensus 89 ~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~i 126 (171) ...|+||+-|.+ ..-.+.+|+.|..|+.|+. T Consensus 208 ~T~IrAP~dG~V-------~~~~v~~G~~V~~G~~l~a 238 (352) T COG1566 208 RTVIRAPVDGYV-------TNLSVRVGQYVSAGTPLMA 238 (352) T ss_pred CCEEECCCCCEE-------EEECCCCCCEECCCCCEEE T ss_conf 789987789669-------7003557875048973499 No 115 >PRK09824 beta-glucoside-specific PTS system components IIABC; Provisional Probab=94.35 E-value=0.046 Score=32.92 Aligned_cols=70 Identities=21% Similarity=0.509 Sum_probs=53.1 Q ss_pred CCCCEEEECCCEEEEEECCCCCCCEECCCCCE----EECCEEEEEECCCEEEEEECCCCCEEEEE--------------- Q ss_conf 46626870462567740367641000124420----10434899973772689998998199999--------------- Q gi|254780267|r 87 DNYHTVTSPMVGTAYLASSPGSDPFVNKGNLV----VEGQTLLIIEAMKTMNHIVAPCSGKVQDI--------------- 147 (171) Q Consensus 87 ~~~~~I~SPmvGtfy~~psP~~~~~V~vGd~V----k~G~~l~iiEAMKm~n~I~a~~~G~I~~i--------------- 147 (171) .....|.||+.|.+. ++=+|.|.| --|+-++|.- -...|.||++|+|..+ T Consensus 474 ~~~~~i~sp~~G~~~--------~l~~v~D~~Fa~~~~G~g~ai~p---~~g~v~aP~~g~v~~~~~t~HAig~~~~~G~ 542 (625) T PRK09824 474 PEQGGICSPMTGEIV--------PLIHVADTTFASGLLGKGIAILP---SVGEVRSPVAGRIASLFATLHAIGIESDDGV 542 (625) T ss_pred CCCCEEECCCCCEEE--------ECCCCCCHHHCCCCCCCCEEEEC---CCCEEECCCCCEEEEECCCCCEEEEECCCCC T ss_conf 666459555771587--------71318665540367779669956---9996980798089897478967899728981 Q ss_pred --------------------ECCCCCCCCCCCEEEEEEEC Q ss_conf --------------------84898683699729999534 Q gi|254780267|r 148 --------------------NVKDGQSVEYGDALLVLEKT 167 (171) Q Consensus 148 --------------------~v~~G~~V~~gq~L~~i~~~ 167 (171) +|+.||.|..||+|++.+.+ T Consensus 543 e~liHiGidTV~l~G~~F~~~v~~g~~V~~g~~l~~~d~~ 582 (625) T PRK09824 543 EILIHVGIDTVKLDGKFFSAHVNVGDKVNTGDRLISFDIP 582 (625) T ss_pred EEEEEECCCCCCCCCCCCEEEECCCCEECCCCEEEEECHH T ss_conf 8999966165010891541674678996799989997499 No 116 >cd06663 Biotinyl_lipoyl_domains Biotinyl_lipoyl_domains are present in biotin-dependent carboxylases/decarboxylases, the dihydrolipoyl acyltransferase component (E2) of 2-oxo acid dehydrogenases, and the H-protein of the glycine cleavage system (GCS). These domains transport CO2, acyl, or methylamine, respectively, between components of the complex/protein via a biotinyl or lipoyl group, which is covalently attached to a highly conserved lysine residue. Probab=94.21 E-value=0.05 Score=32.72 Aligned_cols=33 Identities=27% Similarity=0.367 Sum_probs=28.2 Q ss_pred CCCEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEE Q ss_conf 6626870462567740367641000124420104348999 Q gi|254780267|r 88 NYHTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLII 127 (171) Q Consensus 88 ~~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~ii 127 (171) -.+.|.||.-|++ .+.+++.||.|+.||+||.| T Consensus 41 ~~~ei~ap~~G~v-------~~i~v~~G~~V~~G~~l~~I 73 (73) T cd06663 41 ATSDVEAPKSGTV-------KKVLVKEGTKVEGDTPLVKI 73 (73) T ss_pred EEEEEECCCCEEE-------EEEEECCCCEECCCCEEEEC T ss_conf 5699866999899-------99995898997899999979 No 117 >PRK10255 N-acetyl glucosamine specific PTS system components IIABC; Provisional Probab=94.12 E-value=0.078 Score=31.55 Aligned_cols=78 Identities=23% Similarity=0.318 Sum_probs=52.0 Q ss_pred CCEEEECCCEEEEEE-CCC---------C------------------CCCEECCCCCEEECCEEEEEECC-------CEE Q ss_conf 626870462567740-367---------6------------------41000124420104348999737-------726 Q gi|254780267|r 89 YHTVTSPMVGTAYLA-SSP---------G------------------SDPFVNKGNLVVEGQTLLIIEAM-------KTM 133 (171) Q Consensus 89 ~~~I~SPmvGtfy~~-psP---------~------------------~~~~V~vGd~Vk~G~~l~iiEAM-------Km~ 133 (171) ...|.||.-|++-.- |+. | =+++|+.||+||+||.|+-.+-- -+- T Consensus 535 ~g~v~aP~dG~i~~vf~T~HA~gi~~~~G~EiLiHiGiDTV~L~G~gF~~~v~~Gd~Vk~G~~l~~~D~~~i~~~g~~~~ 614 (648) T PRK10255 535 DKIVVSPAAGTIVKIFNTNHAFCLETEKGAEIVVHMGIDTVALEGKGFKRLVEEGAQVSAGQPILEMDLDYLNANARSMI 614 (648) T ss_pred CCEEECCCCCEEEEECCCCCEEEEECCCCCEEEEEEEECCEEECCCCCEEEECCCCEECCCCEEEEECHHHHHHCCCCCC T ss_conf 99598068857999716885799984898189999544524217805348834859967999889974999985099986 Q ss_pred EEEECCCCCEEEEEECCCCCCCCCC-CEEEEEEE Q ss_conf 8999899819999984898683699-72999953 Q gi|254780267|r 134 NHIVAPCSGKVQDINVKDGQSVEYG-DALLVLEK 166 (171) Q Consensus 134 n~I~a~~~G~I~~i~v~~G~~V~~g-q~L~~i~~ 166 (171) .+|.=..+....++-+.....|..| ++|++|++ T Consensus 615 tpviitN~~~~~~i~~~~~g~v~aGe~~i~~i~k 648 (648) T PRK10255 615 SPVVCSNIDDFSGLIIKAQGHVVAGQTPLYEIKK 648 (648) T ss_pred CEEEEECCHHCCCEEECCCCCEECCCCEEEEEEC T ss_conf 3799965413052487057847518830799719 No 118 >cd06849 lipoyl_domain Lipoyl domain of the dihydrolipoyl acyltransferase component (E2) of 2-oxo acid dehydrogenases. 2-oxo acid dehydrogenase multienzyme complexes, like pyruvate dehydrogenase (PDH), 2-oxoglutarate dehydrogenase (OGDH) and branched-chain 2-oxo acid dehydrogenase (BCDH), contain at least three different enzymes, 2-oxo acid dehydrogenase (E1), dihydrolipoyl acyltransferase (E2) and dihydrolipoamide dehydrogenase (E3) and play a key role in redox regulation. E2, the central component of the complex, catalyzes the transfer of the acyl group of CoA from E1 to E3 via reductive acetylation of a lipoyl group covalently attached to a lysine residue. Probab=94.10 E-value=0.057 Score=32.38 Aligned_cols=33 Identities=27% Similarity=0.414 Sum_probs=27.4 Q ss_pred CCCEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEE Q ss_conf 6626870462567740367641000124420104348999 Q gi|254780267|r 88 NYHTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLII 127 (171) Q Consensus 88 ~~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~ii 127 (171) -...|.||--|++. +.+++.||.|+.|++||+| T Consensus 42 ~~~~i~a~~~G~v~-------~i~~~~G~~v~~G~~l~~i 74 (74) T cd06849 42 ATVEVEAPAAGVLA-------KILVEEGDTVPVGQVIAVI 74 (74) T ss_pred EEEEEECCCCEEEE-------EEEECCCCEECCCCEEEEC T ss_conf 07841079899999-------9987899998899999979 No 119 >PTZ00144 dihydrolipoamide succinyltransferase; Provisional Probab=94.07 E-value=0.055 Score=32.46 Aligned_cols=38 Identities=24% Similarity=0.317 Sum_probs=31.2 Q ss_pred CCCCCEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEECC Q ss_conf 546626870462567740367641000124420104348999737 Q gi|254780267|r 86 PDNYHTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAM 130 (171) Q Consensus 86 ~~~~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAM 130 (171) +.-...|.||.-|++ .+.++++||.|+.|++||.|++- T Consensus 97 DKA~~EVeSp~~G~l-------~kIlv~eGd~V~VG~~la~I~~~ 134 (430) T PTZ00144 97 DKVSVDINSKFSGVL-------SKIFAEVGDTVLVGKPLCEIDTS 134 (430) T ss_pred CCCCEEECCCCCEEE-------EEEEECCCCEEECCCEEEEEECC T ss_conf 884179657789799-------99863799989469869999668 No 120 >COG1726 NqrA Na+-transporting NADH:ubiquinone oxidoreductase, subunit NqrA [Energy production and conversion] Probab=93.89 E-value=0.058 Score=32.34 Aligned_cols=39 Identities=36% Similarity=0.397 Sum_probs=29.4 Q ss_pred CEECCCCCEEECCEEEEEECCCEE--EEEECCCCCEEEEEECCC Q ss_conf 000124420104348999737726--899989981999998489 Q gi|254780267|r 110 PFVNKGNLVVEGQTLLIIEAMKTM--NHIVAPCSGKVQDINVKD 151 (171) Q Consensus 110 ~~V~vGd~Vk~G~~l~iiEAMKm~--n~I~a~~~G~I~~i~v~~ 151 (171) ..|++||.|++||+|.-= |-+ .-++||+||+|.+|.--+ T Consensus 43 mkV~~gD~VkkGq~LfEd---Kknpgv~~TapvsG~V~aI~RG~ 83 (447) T COG1726 43 MKVREGDAVKKGQVLFED---KKNPGVVFTAPVSGKVTAIHRGE 83 (447) T ss_pred CEECCCCEEECCCEEEEC---CCCCCEEEECCCCCEEEEEECCC T ss_conf 153347722026531121---56898589456782588864465 No 121 >TIGR00999 8a0102 Membrane Fusion Protein cluster 2 protein; InterPro: IPR005695 Gram-negative bacteria have evolved transport complexes that export macromolecules and toxic substances such as heavy metals across the two membranes of the cell envelope in a single energy coupled step. The process requires (1) a cytoplasmic membrane export system, (2) a membrane fusion protein (MFP), and (3) an outer membrane factor. Phylogenetic analyses reveal that the MFPs cluster in accordance with the type of cytoplasmic membrane transport systems with which they function - proteins in this family are associated with reisistance to heavy metal poisoning. ; GO: 0046873 metal ion transmembrane transporter activity, 0030001 metal ion transport. Probab=93.45 E-value=0.066 Score=31.98 Aligned_cols=32 Identities=19% Similarity=0.292 Sum_probs=15.9 Q ss_pred EEECCCCCEEEEEECCCCCCCCCCCEEEEEEE Q ss_conf 99989981999998489868369972999953 Q gi|254780267|r 135 HIVAPCSGKVQDINVKDGQSVEYGDALLVLEK 166 (171) Q Consensus 135 ~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~ 166 (171) ++.||++|.|.+=.|.-|+.|..|+.||+|-. T Consensus 106 evrsP~~G~i~~k~v~~G~~v~~~~~~f~~~~ 137 (284) T TIGR00999 106 EVRSPFDGYIVEKSVTLGDYVAPQAELFRVAD 137 (284) T ss_pred HHCCCCCCEEEEEEEECCCCCCCCCCEEEEEC T ss_conf 21188687899997411720068973377764 No 122 >KOG0238 consensus Probab=93.31 E-value=0.089 Score=31.20 Aligned_cols=31 Identities=42% Similarity=0.626 Sum_probs=28.6 Q ss_pred EEECCCCCEEEEEECCCCCCCCCCCEEEEEE Q ss_conf 9998998199999848986836997299995 Q gi|254780267|r 135 HIVAPCSGKVQDINVKDGQSVEYGDALLVLE 165 (171) Q Consensus 135 ~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~ 165 (171) .+.||..|.|++++|+.|+.|..||.|+.++ T Consensus 603 v~~aPMpG~Iekv~Vkpgd~V~~Gq~l~Vl~ 633 (670) T KOG0238 603 VIVAPMPGIIEKVLVKPGDKVKEGQELVVLI 633 (670) T ss_pred CEECCCCCEEEEEECCCHHHHCCCCCEEEEE T ss_conf 5655788704466516435531567348999 No 123 >pfam00364 Biotin_lipoyl Biotin-requiring enzyme. This family covers two Prosite entries, the conserved lysine residue binds biotin in one group and lipoic acid in the other. Note that the model may not recognize the Glycine cleavage system H proteins. Probab=93.15 E-value=0.066 Score=32.00 Aligned_cols=33 Identities=24% Similarity=0.393 Sum_probs=28.2 Q ss_pred CCCEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEE Q ss_conf 6626870462567740367641000124420104348999 Q gi|254780267|r 88 NYHTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLII 127 (171) Q Consensus 88 ~~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~ii 127 (171) -...|.||..|++. +.+++.||.|+.||+|+.| T Consensus 41 ~~~~v~a~~~G~i~-------~i~v~~G~~V~~G~~l~~I 73 (73) T pfam00364 41 MEMEIPAPVAGVVK-------EILVKEGDTVEVGDPLAKI 73 (73) T ss_pred CCCEECCCCCEEEE-------EEEECCCCEECCCCEEEEC T ss_conf 35225089888999-------9997998998999999979 No 124 >PRK09603 DNA-directed RNA polymerase subunit beta/beta'; Reviewed Probab=92.95 E-value=0.12 Score=30.33 Aligned_cols=57 Identities=26% Similarity=0.332 Sum_probs=44.2 Q ss_pred CCCCCEECCCCCEEECCEEEE---------------------EECC--C-EEEEEECCCCCEEE---------------- Q ss_conf 764100012442010434899---------------------9737--7-26899989981999---------------- Q gi|254780267|r 106 PGSDPFVNKGNLVVEGQTLLI---------------------IEAM--K-TMNHIVAPCSGKVQ---------------- 145 (171) Q Consensus 106 P~~~~~V~vGd~Vk~G~~l~i---------------------iEAM--K-m~n~I~a~~~G~I~---------------- 145 (171) +++-.+|+.||.|++||+|+- +||- | -...|.|+.+|+|. T Consensus 2612 ~~a~l~V~dG~~V~~GdiLakl~~e~~kt~DIt~GLPRV~ELfEAR~~KPK~~ailae~~G~V~~g~~~~~~~~iii~~~ 2691 (2890) T PRK09603 2612 PKTSIAISDGSSVEQAEVLAKIPKATVKSRDITGGLPRVSELFEARKPKPKDVAILSEVDGIVSFGKPIRNKEHIIVTSK 2691 (2890) T ss_pred CCCEEEECCCCEECCCCEEEEECCCCCCCCCCCCCCCCHHHHHHHCCCCCCCCEEEECCCCEEEECCCCCCCEEEEEECC T ss_conf 99789966999960587689710221567866488860598775134688874389536518995653478526999769 Q ss_pred -------------EEECCCCCCCCCCCEEE Q ss_conf -------------99848986836997299 Q gi|254780267|r 146 -------------DINVKDGQSVEYGDALL 162 (171) Q Consensus 146 -------------~i~v~~G~~V~~gq~L~ 162 (171) .++|.+|+.|..|++|. T Consensus 2692 dg~~~ey~i~~g~~l~V~~Gd~V~~Ge~Lt 2721 (2890) T PRK09603 2692 DGRSMDYFVDKGKQILVHADEFVHAGEAMT 2721 (2890) T ss_pred CCCEEEEECCCCCEEEECCCCEEECCCCCC T ss_conf 994777754899877767998870688823 No 125 >PRK04350 thymidine phosphorylase; Provisional Probab=92.39 E-value=0.28 Score=28.16 Aligned_cols=20 Identities=30% Similarity=0.405 Sum_probs=11.8 Q ss_pred ECCCCCCCCCCCEEEEEEEC Q ss_conf 84898683699729999534 Q gi|254780267|r 148 NVKDGQSVEYGDALLVLEKT 167 (171) Q Consensus 148 ~v~~G~~V~~gq~L~~i~~~ 167 (171) +++-|+.|+.|||||+|--+ T Consensus 456 ~~KvGd~V~kGdpL~tIhA~ 475 (502) T PRK04350 456 HKKIGDKVKKGDPLYTIYAE 475 (502) T ss_pred EECCCCEECCCCEEEEEECC T ss_conf 60487996799849999669 No 126 >PRK11637 hypothetical protein; Provisional Probab=92.20 E-value=0.19 Score=29.16 Aligned_cols=43 Identities=23% Similarity=0.292 Sum_probs=26.1 Q ss_pred CCCCEEEECCCEEEEEE-CCCC--------------------CCCEECCCCCEEECCEEEEEEC Q ss_conf 46626870462567740-3676--------------------4100012442010434899973 Q gi|254780267|r 87 DNYHTVTSPMVGTAYLA-SSPG--------------------SDPFVNKGNLVVEGQTLLIIEA 129 (171) Q Consensus 87 ~~~~~I~SPmvGtfy~~-psP~--------------------~~~~V~vGd~Vk~G~~l~iiEA 129 (171) ..+..|.|..-|++..+ +.+| +..+|++||.|+.||+|+.+=. T Consensus 313 ~~Gt~V~Av~~G~Vv~a~~~~gyG~~ViIdHG~g~~TlYah~s~l~v~~Gq~V~~Gq~Ig~vG~ 376 (404) T PRK11637 313 SEGTEVKAIADGRVILADWLQGYGLVVVVEHGKGDMSLYGYNQSALVSVGAQVRAGQPIALVGS 376 (404) T ss_pred CCCCEEEEECCEEEEEEEECCCCCCEEEEECCCCCEEECCCCCCCCCCCCCEECCCCEEEEEEC T ss_conf 9998054101769999114088885799986994657152889588899799899996987326 No 127 >pfam09891 DUF2118 Uncharacterized protein conserved in archaea (DUF2118). This domain, found in various hypothetical archaeal proteins, has no known function. Probab=92.03 E-value=0.32 Score=27.83 Aligned_cols=41 Identities=29% Similarity=0.454 Sum_probs=35.7 Q ss_pred CEECCCCCEEECCEEEEEECCCEE-EEEECCCCCEEEEEECC Q ss_conf 000124420104348999737726-89998998199999848 Q gi|254780267|r 110 PFVNKGNLVVEGQTLLIIEAMKTM-NHIVAPCSGKVQDINVK 150 (171) Q Consensus 110 ~~V~vGd~Vk~G~~l~iiEAMKm~-n~I~a~~~G~I~~i~v~ 150 (171) |++.+||.|.+||.|+.+-.=|-+ .-+.||++|+|.-++-. T Consensus 93 ~i~~~G~rV~~g~~lA~v~T~KGevR~~~spv~G~Vv~i~e~ 134 (149) T pfam09891 93 PIVDEGDRVLKGDRLAAVTTRKGEVRYVRSPVEGTVVFINEI 134 (149) T ss_pred EECCCCCEEECCCEEEEEEECCCEEEEECCCCCEEEEEEEEC T ss_conf 872656477218648999816841799449976899999815 No 128 >PRK09439 glucose-specific PTS system component; Provisional Probab=91.97 E-value=0.29 Score=28.05 Aligned_cols=69 Identities=26% Similarity=0.460 Sum_probs=46.1 Q ss_pred CCCEEEECCCEEEEEECCCCCCCEECCCCCE----EECCEEEEEECCCEEEEEECCCCCEEEEE---------------- Q ss_conf 6626870462567740367641000124420----10434899973772689998998199999---------------- Q gi|254780267|r 88 NYHTVTSPMVGTAYLASSPGSDPFVNKGNLV----VEGQTLLIIEAMKTMNHIVAPCSGKVQDI---------------- 147 (171) Q Consensus 88 ~~~~I~SPmvGtfy~~psP~~~~~V~vGd~V----k~G~~l~iiEAMKm~n~I~a~~~G~I~~i---------------- 147 (171) +...|.||+-|.+. ++-++-|.| --||-++|+= -.+.|.||++|+|..+ T Consensus 19 ~~i~i~AP~~G~vi--------~L~~V~D~vFa~k~mGdG~AI~P---~~~~v~AP~~G~V~~vf~T~HAigi~~~~G~e 87 (169) T PRK09439 19 GTIEIIAPLSGEIV--------NIEDVPDVVFAEKIVGDGIAIKP---TGNKIVAPVDGTIGKIFETNHAFSIESDSGVE 87 (169) T ss_pred CCEEEEECCCCEEE--------EHHHCCCHHHHCCCCCCEEEEEC---CCCEEECCCCEEEEEECCCCCEEEEECCCCCE T ss_conf 84899964771698--------80049774860578247489986---79989918895999977889389999289989 Q ss_pred -------------------ECCCCCCCCCCCEEEEEEEC Q ss_conf -------------------84898683699729999534 Q gi|254780267|r 148 -------------------NVKDGQSVEYGDALLVLEKT 167 (171) Q Consensus 148 -------------------~v~~G~~V~~gq~L~~i~~~ 167 (171) +|+.||.|..||+|+++... T Consensus 88 iLIHiGiDTV~L~G~gF~~~v~~Gd~Vk~G~~L~~~D~~ 126 (169) T PRK09439 88 LFVHFGIDTVELKGEGFKRIAEEGQRVKVGDPIIEFDLE 126 (169) T ss_pred EEEEECCCCEECCCCCEEEEEECCCEECCCCEEEEECHH T ss_conf 999976240233896726999276998899999998699 No 129 >COG2190 NagE Phosphotransferase system IIA components [Carbohydrate transport and metabolism] Probab=91.93 E-value=0.17 Score=29.50 Aligned_cols=66 Identities=32% Similarity=0.505 Sum_probs=39.5 Q ss_pred CEEEECCCEEEEEECCCCCCCEECCCCCE----EECCEEEEEECCCEEEEEECCCCCEEEEE------------------ Q ss_conf 26870462567740367641000124420----10434899973772689998998199999------------------ Q gi|254780267|r 90 HTVTSPMVGTAYLASSPGSDPFVNKGNLV----VEGQTLLIIEAMKTMNHIVAPCSGKVQDI------------------ 147 (171) Q Consensus 90 ~~I~SPmvGtfy~~psP~~~~~V~vGd~V----k~G~~l~iiEAMKm~n~I~a~~~G~I~~i------------------ 147 (171) ..|.||+.|.+. ++-+|-|.| -.||-++|. --.++|.||++|+|..| T Consensus 6 ~~i~sP~~G~vv--------~Ls~VpD~vFs~k~mGdGiAI~---P~~g~vvAPvdG~v~~iFpTkHAigi~t~~GvEiL 74 (156) T COG2190 6 EEIYSPLSGEVV--------PLSDVPDPVFSEKMVGDGVAIK---PSEGEVVAPVDGTVVLIFPTKHAIGIETDEGVEIL 74 (156) T ss_pred EEEECCCCCEEE--------ECHHCCCHHHHCCCCCCCEEEE---CCCCEEEECCCCEEEEEEECCCEEEEECCCCCEEE T ss_conf 799764772488--------8104874676415634867994---68995981268589998517847999838995999 Q ss_pred -----------------ECCCCCCCCCCCEEEEEEE Q ss_conf -----------------8489868369972999953 Q gi|254780267|r 148 -----------------NVKDGQSVEYGDALLVLEK 166 (171) Q Consensus 148 -----------------~v~~G~~V~~gq~L~~i~~ 166 (171) +|+.||.|..||+|+++.- T Consensus 75 iHiGiDTV~L~GegF~~~v~~Gd~Vk~Gd~Li~fDl 110 (156) T COG2190 75 IHIGIDTVKLNGEGFESLVKEGDKVKAGDPLLEFDL 110 (156) T ss_pred EEECEEEEEECCCCEEEEEECCCEECCCCEEEEECH T ss_conf 995640586889324787507998815989899979 No 130 >TIGR02971 heterocyst_DevB ABC exporter membrane fusion protein, DevB family; InterPro: IPR014315 Members of this protein are found mostly in the Cyanobacteria, but also in the Planctomycetes. DevB from Anabaena sp. (strain PCC 7120) is partially characterised as a membrane fusion protein of the DevBCA ABC exporter, probably a glycolipid exporter required for heterocyst formation. Most Cyanobacteria have one member only, but Nostoc sp. which has seven members.. Probab=91.92 E-value=0.12 Score=30.33 Aligned_cols=23 Identities=26% Similarity=0.395 Sum_probs=20.3 Q ss_pred CCCE-ECCCCCEEECCEEEEEECC Q ss_conf 4100-0124420104348999737 Q gi|254780267|r 108 SDPF-VNKGNLVVEGQTLLIIEAM 130 (171) Q Consensus 108 ~~~~-V~vGd~Vk~G~~l~iiEAM 130 (171) ++-| |++||.|++||+|+++|.. T Consensus 30 ~~LlPV~eGD~V~~Gq~lA~Ld~~ 53 (363) T TIGR02971 30 KKLLPVAEGDRVKAGQVLAELDSR 53 (363) T ss_pred HHHCCCCCCCCCCCCCEEEEECCC T ss_conf 550677888701157536764570 No 131 >KOG0368 consensus Probab=91.81 E-value=0.51 Score=26.56 Aligned_cols=53 Identities=23% Similarity=0.327 Sum_probs=45.3 Q ss_pred CCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCCCCCCCCCEEEEEEE Q ss_conf 24420104348999737726899989981999998489868369972999953 Q gi|254780267|r 114 KGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQSVEYGDALLVLEK 166 (171) Q Consensus 114 vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~ 166 (171) .|-++.-|---|.+|.--==.-+.||..|+..+.+|++|+.|+.||+--+||- T Consensus 666 ~~~rltIdn~t~~fe~enDpt~LrsPs~GKLl~ylVedG~hv~~Gq~YAeiEv 718 (2196) T KOG0368 666 DGYRLTIDNNTCLFEKENDPTVLRSPSPGKLLQYLVEDGEHVEAGQPYAEIEV 718 (2196) T ss_pred CEEEEEECCEEEEEECCCCCCEECCCCCCCCEEEEECCCCEEECCCEEEEHEE T ss_conf 33799888847897526896433489986205899548971443872000120 No 132 >PRK05704 dihydrolipoamide succinyltransferase; Validated Probab=91.77 E-value=0.17 Score=29.48 Aligned_cols=36 Identities=28% Similarity=0.343 Sum_probs=28.7 Q ss_pred CCCEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEECC Q ss_conf 6626870462567740367641000124420104348999737 Q gi|254780267|r 88 NYHTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAM 130 (171) Q Consensus 88 ~~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAM 130 (171) -...|.||.-|++ .+.++++||.|..|++||+|+.- T Consensus 44 a~~Ei~s~~~G~l-------~ki~~~eGd~v~VG~~ia~I~~~ 79 (406) T PRK05704 44 VVLEVPAPAAGVL-------SEILAEEGDTVTVGQVLGRIDEG 79 (406) T ss_pred CCEEECCCCCEEE-------EEEEECCCCEECCCCEEEEEECC T ss_conf 2079748889899-------99830898996689989998268 No 133 >PRK12784 hypothetical protein; Provisional Probab=91.21 E-value=0.17 Score=29.45 Aligned_cols=38 Identities=26% Similarity=0.412 Sum_probs=31.4 Q ss_pred EEEEEECCCCCEEEEEECCCCCCCCCCCEEEEEEECCC Q ss_conf 26899989981999998489868369972999953478 Q gi|254780267|r 132 TMNHIVAPCSGKVQDINVKDGQSVEYGDALLVLEKTGD 169 (171) Q Consensus 132 m~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~~~~ 169 (171) +.-+|.||+-|+|.+|++..++-|--=.+||.|+..+- T Consensus 4 ~ie~iyS~c~G~Vekifi~e~SyVYEWEkL~~I~~~dg 41 (84) T PRK12784 4 RMEEICSSYEGKVEEIFVNESSYVYEWEKLMMIRKNNG 41 (84) T ss_pred EHHHHCCCCCCEEEEEEECCCCEEEEEEEEEEEEECCC T ss_conf 23434476631888999858753786423457750698 No 134 >PRK09783 copper/silver efflux system membrane fusion protein CusB; Provisional Probab=90.81 E-value=0.38 Score=27.39 Aligned_cols=30 Identities=27% Similarity=0.379 Sum_probs=15.0 Q ss_pred EEECCCCCEEEEEEC-CCCCCCCCCCEEEEE Q ss_conf 999899819999984-898683699729999 Q gi|254780267|r 135 HIVAPCSGKVQDINV-KDGQSVEYGDALLVL 164 (171) Q Consensus 135 ~I~a~~~G~I~~i~v-~~G~~V~~gq~L~~i 164 (171) .|.+-++|.|.++++ ..|+.|..||+|+.| T Consensus 123 ~v~~r~~G~Ie~L~v~~~G~~V~kGq~L~~l 153 (407) T PRK09783 123 IVQARAAGFIDKVYPLTVGDKVQKGTPLLDL 153 (407) T ss_pred EEEECCCCEEEEEEECCCCCEECCCCEEEEE T ss_conf 9971035268898742578787389788999 No 135 >TIGR01995 PTS-II-ABC-beta PTS system, beta-glucoside-specific IIABC component; InterPro: IPR011297 The phosphoenolpyruvate-dependent sugar phosphotransferase system (PTS) , is a major carbohydrate transport system in bacteria. The PTS catalyses the phosphorylation of incoming sugar substrates and coupled with translocation across the cell membrane, makes the PTS a link between the uptake and metabolism of sugars. The general mechanism of the PTS is the following: a phosphoryl group from phosphoenolpyruvate (PEP) is transferred via a signal transduction pathway, to enzyme I (EI) which in turn transfers it to a phosphoryl carrier, the histidine protein (HPr). Phospho-HPr then transfers the phosphoryl group to a sugar-specific permease, a membrane-bound complex known as enzyme 2 (EII), which transports the sugar to the cell. EII consists of at least three structurally distinct domains IIA, IIB and IIC . These can either be fused together in a single polypeptide chain or exist as two or three interactive chains, formerly called enzymes II (EII) and III (EIII). The first domain (IIA or EIIA) carries the first permease-specific phosphorylation site, a histidine which is phosphorylated by phospho-HPr. The second domain (IIB or EIIB) is phosphorylated by phospho-IIA on a cysteinyl or histidyl residue, depending on the sugar transported. Finally, the phosphoryl group is transferred from the IIB domain to the sugar substrate concomitantly with the sugar uptake processed by the IIC domain. This third domain (IIC or EIIC) forms the translocation channel and the specific substrate-binding site. An additional transmembrane domain IID, homologous to IIC, can be found in some PTSs, e.g. for mannose , , , . This entry represents a family of PTS enzyme II proteins in which all three domains are found in the same polypeptide chain and which appear to have a broad specificity for beta-glucosides including salicin (beta-D-glucose-1-salicylate) and arbutin (hydroquinone-O-beta-D-glucopyranoside) . These are distinct from the closely related sucrose-specific and trehalose-specific PTS transporters.; GO: 0008982 protein-N(PI)-phosphohistidine-sugar phosphotransferase activity, 0009401 phosphoenolpyruvate-dependent sugar phosphotransferase system. Probab=90.69 E-value=0.2 Score=29.07 Aligned_cols=57 Identities=28% Similarity=0.332 Sum_probs=39.9 Q ss_pred CCEECCCCCEEECCEEEEEECCCE-------EEEEECCCCCEEEEEECCCC--C-CCCCCCEEEEEE Q ss_conf 100012442010434899973772-------68999899819999984898--6-836997299995 Q gi|254780267|r 109 DPFVNKGNLVVEGQTLLIIEAMKT-------MNHIVAPCSGKVQDINVKDG--Q-SVEYGDALLVLE 165 (171) Q Consensus 109 ~~~V~vGd~Vk~G~~l~iiEAMKm-------~n~I~a~~~G~I~~i~v~~G--~-~V~~gq~L~~i~ 165 (171) +.+|++||+||+||.|+-++--|. ...|.=..+=.-.+|...|- + .|..+|.|+.++ T Consensus 594 e~~V~~Gd~v~~Gq~L~~~D~d~I~~AGYd~~t~vvVTN~~~~~~v~~~~~~~ek~V~~~d~ll~~~ 660 (660) T TIGR01995 594 EILVKVGDKVKAGQLLLTFDLDKIKEAGYDPTTPVVVTNSKDFLDVITTDKENEKTVTKEDVLLEVI 660 (660) T ss_pred EEEEEECCEEEECCCCCEEEHHHHHHCCCCCCEEEEEECCHHHCCCEECCCCCCEEECCCCCEEEEC T ss_conf 2214566787506631221068996178982034898422202553220377751660556235538 No 136 >TIGR02645 ARCH_P_rylase putative thymidine phosphorylase; InterPro: IPR013466 Proteins in this entry are closely related to characterised examples of thymidine phosphorylase (2.4.2.4 from EC) and pyrimidine nucleoside phosphorylase (2.4.2.2 from EC). Most examples are found in the archaea, but other examples are found in bacteria such as Legionella pneumophila (strain Paris) and Rhodopseudomonas palustris CGA009.; GO: 0009032 thymidine phosphorylase activity. Probab=90.63 E-value=0.13 Score=30.20 Aligned_cols=45 Identities=33% Similarity=0.364 Sum_probs=38.1 Q ss_pred EEEECCCEEEEEECCCCCEEEEE------------------------ECCCCCCCCCCCEEEEEEECCC Q ss_conf 99973772689998998199999------------------------8489868369972999953478 Q gi|254780267|r 125 LIIEAMKTMNHIVAPCSGKVQDI------------------------NVKDGQSVEYGDALLVLEKTGD 169 (171) Q Consensus 125 ~iiEAMKm~n~I~a~~~G~I~~i------------------------~v~~G~~V~~gq~L~~i~~~~~ 169 (171) |=+|+-+.-..|.|+.||.|.+| |++-|+.|+.|+|||+|=-..+ T Consensus 411 ~~ie~G~~~~DI~A~~DG~Vt~IDN~~i~~IAr~AGAP~DKgAGv~lhvK~G~~Vk~GdPL~TIyA~~e 479 (499) T TIGR02645 411 DDIEVGKYTADIHAETDGYVTEIDNKRITRIARLAGAPNDKGAGVELHVKVGDKVKKGDPLYTIYAESE 479 (499) T ss_pred CCCCCCCEEEEEEECCCCEEHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEECCEEECCCCCEEEECCCC T ss_conf 765555325778736886150423889999998718887676843888654677203870158850541 No 137 >PRK05305 phosphatidylserine decarboxylase; Provisional Probab=90.38 E-value=1.5 Score=23.73 Aligned_cols=67 Identities=18% Similarity=0.189 Sum_probs=46.2 Q ss_pred CEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCCCCCCCCCEEEEE Q ss_conf 268704625677403676410001244201043489997377268999899819999984898683699729999 Q gi|254780267|r 90 HTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQSVEYGDALLVL 164 (171) Q Consensus 90 ~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i 164 (171) ..+.....|.+=|+= -.++++||+|++||.++.|---- -.+|.-|.+ .+++|+.||.|..|+-++.- T Consensus 144 ~i~~~qIaG~~ARrI----v~~~~~g~~l~~G~r~G~I~fGS-rvdl~lP~~---~~i~V~~G~~V~aGeTvia~ 210 (214) T PRK05305 144 EIGVVQIAGLVARRI----VCYVEEGDELERGERFGLIRFGS-RVDVYLPLG---TEPLVSVGQKVIAGETVLAR 210 (214) T ss_pred EEEEEECCCHHHHHE----EECCCCCCEEECCCEEEEEECCC-CEEEEECCC---CEEEECCCCEEECCEEEEEE T ss_conf 899999253213224----62267897885377773796087-279993799---83877769999867879996 No 138 >PRK02597 DNA-directed RNA polymerase subunit beta'; Provisional Probab=90.01 E-value=0.22 Score=28.84 Aligned_cols=21 Identities=43% Similarity=0.672 Sum_probs=12.3 Q ss_pred CCCCCCEECCCCCEEECCEEE Q ss_conf 676410001244201043489 Q gi|254780267|r 105 SPGSDPFVNKGNLVVEGQTLL 125 (171) Q Consensus 105 sP~~~~~V~vGd~Vk~G~~l~ 125 (171) +|++-.++.-||-|++||+|+ T Consensus 950 s~gA~ihv~~GdiV~~GDtLv 970 (1295) T PRK02597 950 SPGAVLHVRDGDLVQRGDNLA 970 (1295) T ss_pred CCCCEEEEECCCEEECCCEEE T ss_conf 999689842687971497678 No 139 >PRK06748 hypothetical protein; Validated Probab=89.76 E-value=0.28 Score=28.20 Aligned_cols=40 Identities=30% Similarity=0.498 Sum_probs=32.3 Q ss_pred CCE-EEEEECCCCCEEEEEECCCCCCCCCCCEEEEEEECCC Q ss_conf 772-6899989981999998489868369972999953478 Q gi|254780267|r 130 MKT-MNHIVAPCSGKVQDINVKDGQSVEYGDALLVLEKTGD 169 (171) Q Consensus 130 MKm-~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~~~~ 169 (171) ||+ .-+|.||+-|+|.+|++..++-|--=.+||.|+..+- T Consensus 1 m~t~ie~IyS~~~G~Vekifi~e~syVyEWEkL~~I~~~dg 41 (84) T PRK06748 1 MKTVIEGVYSPCYGKVEKLFVTESSYVYEWEKLALIETIDK 41 (84) T ss_pred CCEEHHHHCCCCCCEEEEEEECCCCEEEEEEEEEEEEECCC T ss_conf 92022323366630889999858753786423457750698 No 140 >TIGR03327 AMP_phos AMP phosphorylase. This enzyme family is found, so far, strictly in the Archaea, and only in those with a type III Rubisco enzyme. Most of the members previously were annotated as thymidine phosphorylase, or DeoA. The AMP metabolized by this enzyme may be produced by ADP-dependent sugar kinases. Probab=89.66 E-value=0.75 Score=25.58 Aligned_cols=18 Identities=39% Similarity=0.584 Sum_probs=9.0 Q ss_pred ECCCCCCCCCCCEEEEEE Q ss_conf 848986836997299995 Q gi|254780267|r 148 NVKDGQSVEYGDALLVLE 165 (171) Q Consensus 148 ~v~~G~~V~~gq~L~~i~ 165 (171) +++-|+.|+.||||++|- T Consensus 453 ~~k~Gd~V~~GdpL~tih 470 (500) T TIGR03327 453 HVKVGEKVKKGDPLYTIY 470 (500) T ss_pred ECCCCCEECCCCEEEEEE T ss_conf 436879877999399997 No 141 >PRK11856 branched-chain alpha-keto acid dehydrogenase subunit E2; Reviewed Probab=89.58 E-value=0.33 Score=27.77 Aligned_cols=32 Identities=31% Similarity=0.515 Sum_probs=27.4 Q ss_pred CCEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEE Q ss_conf 626870462567740367641000124420104348999 Q gi|254780267|r 89 YHTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLII 127 (171) Q Consensus 89 ~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~ii 127 (171) ...|.||.-|++ .+.++++||.|..|++||+| T Consensus 44 ~~ei~a~~~G~l-------~~i~~~~G~~v~VG~~ia~i 75 (324) T PRK11856 44 TVEVESPVAGTL-------RKILVEEGDVVPVGAPIAVI 75 (324) T ss_pred EEEEECCCCEEE-------EEEECCCCCEECCCCEEEEE T ss_conf 689806888399-------89960897995799989994 No 142 >COG0508 AceF Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes [Energy production and conversion] Probab=89.33 E-value=0.32 Score=27.82 Aligned_cols=35 Identities=26% Similarity=0.347 Sum_probs=28.8 Q ss_pred CCEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEECC Q ss_conf 626870462567740367641000124420104348999737 Q gi|254780267|r 89 YHTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAM 130 (171) Q Consensus 89 ~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAM 130 (171) ...|.||--|+. .+.++++||.|..|++||+|+.= T Consensus 45 ~~EV~ap~~G~l-------~~i~~~~G~~V~Vg~~I~~i~~~ 79 (404) T COG0508 45 TMEVPAPDAGVL-------AKILVEEGDTVPVGAVIARIEEE 79 (404) T ss_pred EEEECCCCCCEE-------EEEECCCCCEECCCCEEEEEECC T ss_conf 355147778599-------67620589887589858999337 No 143 >TIGR02013 rpoB DNA-directed RNA polymerase, beta subunit; InterPro: IPR010243 DNA-directed RNA polymerases 2.7.7.6 from EC (also known as DNA-dependent RNA polymerases) are responsible for the polymerisation of ribonucleotides into a sequence complementary to the template DNA. In eukaryotes, there are three different forms of DNA-directed RNA polymerases transcribing different sets of genes. Most RNA polymerases are multimeric enzymes and are composed of a variable number of subunits. The core RNA polymerase complex consists of five subunits (two alpha, one beta, one beta-prime and one omega) and is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme . The core RNA polymerase complex forms a "crab claw"-like structure with an internal channel running along the full length . The key functional sites of the enzyme, as defined by mutational and cross-linking analysis, are located on the inner wall of this channel. RNA synthesis follows after the attachment of RNA polymerase to a specific site, the promoter, on the template DNA strand. The RNA synthesis process continues until a termination sequence is reached. The RNA product, which is synthesised in the 5' to 3'direction, is known as the primary transcript. Eukaryotic nuclei contain three distinct types of RNA polymerases that differ in the RNA they synthesise: RNA polymerase I: located in the nucleoli, synthesises precursors of most ribosomal RNAs. RNA polymerase II: occurs in the nucleoplasm, synthesises mRNA precursors. RNA polymerase III: also occurs in the nucleoplasm, synthesises the precursors of 5S ribosomal RNA, the tRNAs, and a variety of other small nuclear and cytosolic RNAs. Eukaryotic cells are also known to contain separate mitochondrial and chloroplast RNA polymerases. Eukaryotic RNA polymerases, whose molecular masses vary in size from 500 to 700 kD, contain two non-identical large (>100 kDa) subunits and an array of up to 12 different small (less than 50 kDa) subunits. This entry describes orthologues of the beta subunit of bacterial RNA polymerase. The core enzyme consists of two alpha chains, one beta chain, and one beta' subunit.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006350 transcription. Probab=88.69 E-value=0.2 Score=29.04 Aligned_cols=18 Identities=33% Similarity=0.547 Sum_probs=14.8 Q ss_pred CCCEECCCCCEEECCEEE Q ss_conf 410001244201043489 Q gi|254780267|r 108 SDPFVNKGNLVVEGQTLL 125 (171) Q Consensus 108 ~~~~V~vGd~Vk~G~~l~ 125 (171) -.|+|++||+|.+||+|+ T Consensus 850 Q~PiV~~GDrV~~GdvlA 867 (1449) T TIGR02013 850 QRPIVSVGDRVEAGDVLA 867 (1449) T ss_pred CEEECCCCCEECCCCEEE T ss_conf 535501486810212773 No 144 >PRK06302 acetyl-CoA carboxylase biotin carboxyl carrier protein subunit; Validated Probab=88.54 E-value=0.37 Score=27.44 Aligned_cols=32 Identities=28% Similarity=0.468 Sum_probs=16.5 Q ss_pred CEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEE Q ss_conf 268704625677403676410001244201043489997 Q gi|254780267|r 90 HTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIE 128 (171) Q Consensus 90 ~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiE 128 (171) +.|.|+.-|++ .+.+|+-|+-|.-||.|..|| T Consensus 124 neI~a~~~G~I-------~~Ilv~~G~~Veygq~Lf~i~ 155 (155) T PRK06302 124 NEIEADKSGVV-------KEILVENGQPVEFGQPLFVIE 155 (155) T ss_pred CEEECCCCCEE-------EEEEECCCCEECCCCEEEEEC T ss_conf 32406988489-------999855998763998269869 No 145 >pfam06898 YqfD Putative stage IV sporulation protein YqfD. This family consists of several putative bacterial stage IV sporulation (SpoIV) proteins. YqfD of Bacillus subtilis is known to be essential for efficient sporulation although its exact function is unknown. Probab=88.40 E-value=0.49 Score=26.69 Aligned_cols=29 Identities=38% Similarity=0.527 Sum_probs=24.9 Q ss_pred EEEECCCCCEEEEEEC-------CCCCCCCCCCEEE Q ss_conf 8999899819999984-------8986836997299 Q gi|254780267|r 134 NHIVAPCSGKVQDINV-------KDGQSVEYGDALL 162 (171) Q Consensus 134 n~I~a~~~G~I~~i~v-------~~G~~V~~gq~L~ 162 (171) ..|.|..+|+|.++++ +.||.|..||.|+ T Consensus 190 ~~iVA~k~GvI~~i~v~~G~p~Vk~GD~VkkGqiLV 225 (383) T pfam06898 190 RNIVAKKDGIITRMYVTKGTAVVKVGDVVKKGDILV 225 (383) T ss_pred CCEEECCCCEEEEEEECCCEEEECCCCEECCCCEEE T ss_conf 665899997999999647717765899877898999 No 146 >TIGR02386 rpoC_TIGR DNA-directed RNA polymerase, beta' subunit; InterPro: IPR012754 DNA-directed RNA polymerases 2.7.7.6 from EC (also known as DNA-dependent RNA polymerases) are responsible for the polymerisation of ribonucleotides into a sequence complementary to the template DNA. In eukaryotes, there are three different forms of DNA-directed RNA polymerases transcribing different sets of genes. Most RNA polymerases are multimeric enzymes and are composed of a variable number of subunits. The core RNA polymerase complex consists of five subunits (two alpha, one beta, one beta-prime and one omega) and is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme . The core RNA polymerase complex forms a "crab claw"-like structure with an internal channel running along the full length . The key functional sites of the enzyme, as defined by mutational and cross-linking analysis, are located on the inner wall of this channel. RNA synthesis follows after the attachment of RNA polymerase to a specific site, the promoter, on the template DNA strand. The RNA synthesis process continues until a termination sequence is reached. The RNA product, which is synthesised in the 5' to 3'direction, is known as the primary transcript. Eukaryotic nuclei contain three distinct types of RNA polymerases that differ in the RNA they synthesise: RNA polymerase I: located in the nucleoli, synthesises precursors of most ribosomal RNAs. RNA polymerase II: occurs in the nucleoplasm, synthesises mRNA precursors. RNA polymerase III: also occurs in the nucleoplasm, synthesises the precursors of 5S ribosomal RNA, the tRNAs, and a variety of other small nuclear and cytosolic RNAs. Eukaryotic cells are also known to contain separate mitochondrial and chloroplast RNA polymerases. Eukaryotic RNA polymerases, whose molecular masses vary in size from 500 to 700 kD, contain two non-identical large (>100 kDa) subunits and an array of up to 12 different small (less than 50 kDa) subunits. This entry represents the beta-prime subunit, RpoC, found in most bacteria. It excludes some, mainly cyanobacterial, species where RpoC is replaced by two homologous proteins that include an additional domain. One arm of the "claw" is predominantly formed by this subunit, the other being predominantly formed by the beta subunit. The active site of the enzyme is defined by three invariant aspartate residues within the beta-prime subunit .; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006350 transcription. Probab=88.01 E-value=2.2 Score=22.72 Aligned_cols=56 Identities=25% Similarity=0.401 Sum_probs=32.3 Q ss_pred CEECCCCCEEECCEEEEEECCCEEEEEECCCCCEEE---------------------EEECCCCCCCCCCCEEEEEEECC Q ss_conf 000124420104348999737726899989981999---------------------99848986836997299995347 Q gi|254780267|r 110 PFVNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQ---------------------DINVKDGQSVEYGDALLVLEKTG 168 (171) Q Consensus 110 ~~V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~---------------------~i~v~~G~~V~~gq~L~~i~~~~ 168 (171) .-|.-||+|-+|..|.+=- |-+ .+.+...|++. .|+|++|+.|+.||+|.+++|.. T Consensus 1136 ~~V~~~Q~V~KG~~l~~~~--~Ge-~~v~~~~G~~~~~~~~f~i~~ee~~~~V~~G~~i~V~~g~~V~~G~~~aefDP~~ 1212 (1552) T TIGR02386 1136 LKVLDSQKVIKGENLVVDK--KGE-EIVVSEEGRIKIEDKKFLIVEEEEVVPVKLGTKILVEDGDAVKEGQRIAEFDPYN 1212 (1552) T ss_pred EEEECCCEEECCCEEEECC--CCC-EEEECCCCEEEEECCCEEEECCCEEEECCCCCEEEEECCCCCCCCCEEEEECCCC T ss_conf 5640573553371367658--887-6770788878850585776403215642656688861586205887578844777 No 147 >TIGR02876 spore_yqfD sporulation protein YqfD; InterPro: IPR010690 This family consists of several putative bacterial stage IV sporulation (SpoIV) proteins. YqfD of Bacillus subtilis (P54469 from SWISSPROT) is known to be essential for efficient sporulation although its exact function is unknown .. Probab=87.31 E-value=0.54 Score=26.45 Aligned_cols=29 Identities=38% Similarity=0.587 Sum_probs=24.8 Q ss_pred EEECCCCCEEEEEEC-------CCCCCCCCCCEEEE Q ss_conf 999899819999984-------89868369972999 Q gi|254780267|r 135 HIVAPCSGKVQDINV-------KDGQSVEYGDALLV 163 (171) Q Consensus 135 ~I~a~~~G~I~~i~v-------~~G~~V~~gq~L~~ 163 (171) .|.|-.+|+|.++++ +.||.|..||.|+. T Consensus 193 niVAkk~G~i~~~~v~~G~~~Vk~GD~VkkGd~Li~ 228 (406) T TIGR02876 193 NIVAKKDGVIKRVYVTSGEAVVKKGDVVKKGDLLIS 228 (406) T ss_pred CEEEECCEEEEEEEEECCEEEECCCCEECCCCEEEE T ss_conf 326633408888655111648548887657717871 No 148 >PRK05820 deoA thymidine phosphorylase; Reviewed Probab=87.01 E-value=1.4 Score=23.95 Aligned_cols=21 Identities=24% Similarity=0.259 Sum_probs=13.3 Q ss_pred CEECCCCCEEECCEEEEEECC Q ss_conf 000124420104348999737 Q gi|254780267|r 110 PFVNKGNLVVEGQTLLIIEAM 130 (171) Q Consensus 110 ~~V~vGd~Vk~G~~l~iiEAM 130 (171) -++++||+|++||+|+.|-+- T Consensus 385 l~~k~Gd~V~kGdpl~~iya~ 405 (440) T PRK05820 385 LHARLGDRVDAGEPLATLHAD 405 (440) T ss_pred EECCCCCEECCCCEEEEEECC T ss_conf 851588996799949999749 No 149 >KOG0369 consensus Probab=86.71 E-value=0.57 Score=26.31 Aligned_cols=32 Identities=47% Similarity=0.717 Sum_probs=27.1 Q ss_pred EEEECCCCCEEEEEECCCCCCCCCCCEEEEEE Q ss_conf 89998998199999848986836997299995 Q gi|254780267|r 134 NHIVAPCSGKVQDINVKDGQSVEYGDALLVLE 165 (171) Q Consensus 134 n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~ 165 (171) ..|-||..|+|.+|-|..|+-|..||||+.+- T Consensus 1107 g~igAPMpG~vieikvk~G~kV~Kgqpl~VLS 1138 (1176) T KOG0369 1107 GHIGAPMPGTVIEIKVKEGAKVKKGQPLAVLS 1138 (1176) T ss_pred CCCCCCCCCCEEEEEEECCCEECCCCCEEEEE T ss_conf 65369899846999983075103798357612 No 150 >PRK06078 pyrimidine-nucleoside phosphorylase; Reviewed Probab=86.69 E-value=1.5 Score=23.78 Aligned_cols=21 Identities=29% Similarity=0.287 Sum_probs=13.0 Q ss_pred CEECCCCCEEECCEEEEEECC Q ss_conf 000124420104348999737 Q gi|254780267|r 110 PFVNKGNLVVEGQTLLIIEAM 130 (171) Q Consensus 110 ~~V~vGd~Vk~G~~l~iiEAM 130 (171) -++++||+|++||+||.|-|= T Consensus 380 l~~k~Gd~V~kGdpL~tiya~ 400 (434) T PRK06078 380 LRKKVGDSVKKGESLVTIYAN 400 (434) T ss_pred EECCCCCEECCCCEEEEEECC T ss_conf 725695995799949999748 No 151 >TIGR02644 Y_phosphoryl pyrimidine-nucleoside phosphorylase; InterPro: IPR000053 Two highly similar activities are represented in this group: thymidine phosphorylase (TP, gene deoA, 2.4.2.4 from EC) and pyrimidine-nucleoside phosphorylase (PyNP, gene pdp, 2.4.2.2 from EC). Both are dimeric enzymes that function in the salvage pathway to catalyse the reversible phosphorolysis of pyrimidine nucleosides to the free base and sugar moieties. In the case of thymidine phosphorylase, thymidine (and to a lesser extent, 2'-deoxyuridine) is lysed to produce thymine (or uracil) and 2'-deoxyribose-1-phosphate. Pyrimidine-nucleoside phosphorylase performs the analogous reaction on thymidine (to produce the same products) and uridine (to produce uracil and ribose-1-phosphate). PyNP is typically the only pyrimidine nucleoside phosphorylase encoded by Gram positive bacteria, while eukaryotes and proteobacteria encode two: TP, and the unrelated uridine phosphorylase. In humans, TP was originally characterised as platelet-derived endothelial cell growth factor and gliostatin . Structurally, the enzymes are homodimers, each composed of a rigid all alpha-helix lobe and a mixed alpha-helix/beta-sheet lobe, which are connected by a flexible hinge , . Prior to substrate binding, the lobes are separated by a large cleft. A functional active site and subsequent catalysis occurs upon closing of the cleft. The active site, composed of a phosphate binding site and a (deoxy)ribonucleotide binding site within the cleft region, is highly conserved between the two enzymes of this group. Active site residues (Escherichia coli DeoA numbering) include the phosphate binding Lys84 and Ser86 (close to a glycine-rich loop), Ser113, and Thr123, and the pyrimidine nucleoside-binding Arg171, Ser186, and Lys190. Sequence comparison between the active site residues for both enzymes reveals only one difference , which has been proposed to partially mediate substrate specificity. In TP, position 111 is a methionine, while the analogous position in PyNP is lysine. It should be noted that the uncharacterised archaeal members of this family differ in a number of respects from either of the characterised activities. The residue at position 108 is lysine, indicating the activity might be PyNP-like (though the determinants of substrate specificity have not been fully elucidated). Position 171 is glutamate (negative charge side chain) rather than arginine (positive charge side chain). In addition, a large loop that may "lock in" the substrates within the active site is much smaller than in the characterised members. It is not clear what effect these and other differences have on activity and specificity.; GO: 0006206 pyrimidine base metabolic process. Probab=86.35 E-value=0.6 Score=26.15 Aligned_cols=30 Identities=13% Similarity=0.141 Sum_probs=23.6 Q ss_pred EEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEECC Q ss_conf 870462567740367641000124420104348999737 Q gi|254780267|r 92 VTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAM 130 (171) Q Consensus 92 I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAM 130 (171) ---|.+|.+.+ .++||+|++||+|+.|-+= T Consensus 377 ~ID~~aGi~l~---------~k~Gd~V~~Gd~~~tlY~~ 406 (425) T TIGR02644 377 KIDHEAGIYLH---------KKVGDKVKKGDPIATLYSS 406 (425) T ss_pred CCCCCCCEEEE---------CCCCCEEECCCEEEEEECC T ss_conf 30011355676---------5668753237579998628 No 152 >TIGR02388 rpoC2_cyan DNA-directed RNA polymerase, beta'' subunit; InterPro: IPR012756 DNA-directed RNA polymerases 2.7.7.6 from EC (also known as DNA-dependent RNA polymerases) are responsible for the polymerisation of ribonucleotides into a sequence complementary to the template DNA. In eukaryotes, there are three different forms of DNA-directed RNA polymerases transcribing different sets of genes. Most RNA polymerases are multimeric enzymes and are composed of a variable number of subunits. The core RNA polymerase complex consists of five subunits (two alpha, one beta, one beta-prime and one omega) and is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme . The core RNA polymerase complex forms a "crab claw"-like structure with an internal channel running along the full length . The key functional sites of the enzyme, as defined by mutational and cross-linking analysis, are located on the inner wall of this channel. RNA synthesis follows after the attachment of RNA polymerase to a specific site, the promoter, on the template DNA strand. The RNA synthesis process continues until a termination sequence is reached. The RNA product, which is synthesised in the 5' to 3'direction, is known as the primary transcript. Eukaryotic nuclei contain three distinct types of RNA polymerases that differ in the RNA they synthesise: RNA polymerase I: located in the nucleoli, synthesises precursors of most ribosomal RNAs. RNA polymerase II: occurs in the nucleoplasm, synthesises mRNA precursors. RNA polymerase III: also occurs in the nucleoplasm, synthesises the precursors of 5S ribosomal RNA, the tRNAs, and a variety of other small nuclear and cytosolic RNAs. Eukaryotic cells are also known to contain separate mitochondrial and chloroplast RNA polymerases. Eukaryotic RNA polymerases, whose molecular masses vary in size from 500 to 700 kD, contain two non-identical large (>100 kDa) subunits and an array of up to 12 different small (less than 50 kDa) subunits. The family consists of the product of the rpoC2 gene, a subunit of DNA-directed RNA polymerase of cyanobacteria and chloroplasts. RpoC2 corresponds largely to the C-terminal region of the RpoC (the beta' subunit) of other bacteria. Members of this family are designated beta'' in chloroplasts/plastids, and beta' (confusingly) in Cyanobacteria, where RpoC1 is called beta' in chloroplasts/plastids and gamma in Cyanobacteria. We prefer to name this family beta'', after its organellar members, to emphasize that this RpoC1 and RpoC2 together replace RpoC in other bacteria.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006350 transcription. Probab=85.64 E-value=0.74 Score=25.61 Aligned_cols=56 Identities=32% Similarity=0.526 Sum_probs=27.5 Q ss_pred CCCCEECCCCCEEECCEEEE----EECCCEEEEEE-C-CCCCEEEEE------------ECCCCCCCCCCCEEEE Q ss_conf 64100012442010434899----97377268999-8-998199999------------8489868369972999 Q gi|254780267|r 107 GSDPFVNKGNLVVEGQTLLI----IEAMKTMNHIV-A-PCSGKVQDI------------NVKDGQSVEYGDALLV 163 (171) Q Consensus 107 ~~~~~V~vGd~Vk~G~~l~i----iEAMKm~n~I~-a-~~~G~I~~i------------~v~~G~~V~~gq~L~~ 163 (171) +++|-||.||.|-.||.|.- -|+-+++ +|. + +..-+|.++ .+++||.|.-||-|+. T Consensus 918 g~kp~VK~Gdl~v~G~~l~~Gv~~~~~g~~~-~v~~~~~~~~v~lR~~RPY~VSPgaVL~i~dGDLV~RGDnL~L 991 (1252) T TIGR02388 918 GAKPKVKTGDLVVAGDELAKGVKAEESGEVL-EVKNADDNYAVILRIARPYRVSPGAVLHIEDGDLVQRGDNLVL 991 (1252) T ss_pred CCCCCCCCCCEEECCCHHCCCCCCCCCCEEE-EECCCCCCCEEEEECCCCCEECCCCEEEEECCCEEEECCCEEE T ss_conf 7886124676541761212773444662056-7315888507999617884347865477545856651785589 No 153 >TIGR01347 sucB 2-oxoglutarate dehydrogenase, E2 component, dihydrolipoamide succinyltransferase; InterPro: IPR006255 These sequences describe the TCA cycle 2-oxoglutarate system E2 component, dihydrolipoamide succinyltransferase. It is closely related to the pyruvate dehydrogenase E2 component, dihydrolipoamide acetyltransferase. Members include the mitochondrial and Gram-negative bacterial forms. Mycobacterial candidates are highly derived, differ in having and extra copy of the lipoyl-binding domain at the N-terminus and are generally excluded from this group of sequences as are all examples of dihydrolipoamide acetyltransferase. ; GO: 0004149 dihydrolipoyllysine-residue succinyltransferase activity, 0006099 tricarboxylic acid cycle, 0045252 oxoglutarate dehydrogenase complex. Probab=85.52 E-value=0.53 Score=26.48 Aligned_cols=33 Identities=30% Similarity=0.477 Sum_probs=27.0 Q ss_pred CCEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEE Q ss_conf 6268704625677403676410001244201043489997 Q gi|254780267|r 89 YHTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIE 128 (171) Q Consensus 89 ~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiE 128 (171) ...|.||.-|+. .+...+.||+|..||+|++|+ T Consensus 43 ~lEV~Sp~dGvl-------~~~~~~eG~TV~~g~~La~~~ 75 (435) T TIGR01347 43 VLEVPSPADGVL-------QEILEKEGDTVTSGQVLAVLE 75 (435) T ss_pred EEEEECCCCCHH-------HHHHCCCCCEECCCEEEEEEE T ss_conf 678828853021-------743068988850550778882 No 154 >KOG0559 consensus Probab=84.48 E-value=0.65 Score=25.96 Aligned_cols=36 Identities=31% Similarity=0.409 Sum_probs=30.0 Q ss_pred CCCEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEECC Q ss_conf 6626870462567740367641000124420104348999737 Q gi|254780267|r 88 NYHTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAM 130 (171) Q Consensus 88 ~~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAM 130 (171) -...|.||-.|++ .+-+|+.||+|..||.|..|+-- T Consensus 114 ~tv~V~sP~sGvi-------~e~lvk~gdtV~~g~~la~i~~g 149 (457) T KOG0559 114 TTVEVPSPASGVI-------TELLVKDGDTVTPGQKLAKISPG 149 (457) T ss_pred EEEECCCCCCCEE-------EEEECCCCCCCCCCCEEEEECCC T ss_conf 2331368876424-------57740789750478645773588 No 155 >TIGR00830 PTBA PTS system, glucose subfamily, IIA component; InterPro: IPR001127 The phosphoenolpyruvate-dependent sugar phosphotransferase system (PTS) , is a major carbohydrate transport system in bacteria. The PTS catalyses the phosphorylation of incoming sugar substrates and coupled with translocation across the cell membrane, makes the PTS a link between the uptake and metabolism of sugars. The general mechanism of the PTS is the following: a phosphoryl group from phosphoenolpyruvate (PEP) is transferred via a signal transduction pathway, to enzyme I (EI) which in turn transfers it to a phosphoryl carrier, the histidine protein (HPr). Phospho-HPr then transfers the phosphoryl group to a sugar-specific permease, a membrane-bound complex known as enzyme 2 (EII), which transports the sugar to the cell. EII consists of at least three structurally distinct domains IIA, IIB and IIC . These can either be fused together in a single polypeptide chain or exist as two or three interactive chains, formerly called enzymes II (EII) and III (EIII). The first domain (IIA or EIIA) carries the first permease-specific phosphorylation site, a histidine which is phosphorylated by phospho-HPr. The second domain (IIB or EIIB) is phosphorylated by phospho-IIA on a cysteinyl or histidyl residue, depending on the sugar transported. Finally, the phosphoryl group is transferred from the IIB domain to the sugar substrate concomitantly with the sugar uptake processed by the IIC domain. This third domain (IIC or EIIC) forms the translocation channel and the specific substrate-binding site. An additional transmembrane domain IID, homologous to IIC, can be found in some PTSs, e.g. for mannose , , , . ; GO: 0005351 sugar:hydrogen ion symporter activity, 0006810 transport, 0009401 phosphoenolpyruvate-dependent sugar phosphotransferase system, 0016020 membrane. Probab=84.45 E-value=0.51 Score=26.60 Aligned_cols=19 Identities=32% Similarity=0.515 Sum_probs=12.7 Q ss_pred CCEECCCCCEEECCEEEEE Q ss_conf 1000124420104348999 Q gi|254780267|r 109 DPFVNKGNLVVEGQTLLII 127 (171) Q Consensus 109 ~~~V~vGd~Vk~G~~l~ii 127 (171) ..+|+.||+|++||.|+-. T Consensus 91 ~~~v~~G~~V~~Gd~l~ef 109 (129) T TIGR00830 91 TSHVEEGDKVKKGDPLLEF 109 (129) T ss_pred EEEEEECCEECCCCEEEEE T ss_conf 7876438787179779995 No 156 >pfam05896 NQRA Na(+)-translocating NADH-quinone reductase subunit A (NQRA). This family consists of several bacterial Na(+)-translocating NADH-quinone reductase subunit A (NQRA) proteins. The Na(+)-translocating NADH: ubiquinone oxidoreductase (Na(+)-NQR) generates an electrochemical Na(+) potential driven by aerobic respiration. Probab=84.17 E-value=0.79 Score=25.44 Aligned_cols=31 Identities=26% Similarity=0.323 Sum_probs=17.3 Q ss_pred ECCCCCEEEEEECCCCCCCCCCCEEEEEEEC Q ss_conf 9899819999984898683699729999534 Q gi|254780267|r 137 VAPCSGKVQDINVKDGQSVEYGDALLVLEKT 167 (171) Q Consensus 137 ~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~~ 167 (171) .+++-|..=+.+|+.||.|..|||||.=|.+ T Consensus 33 p~Df~glkPkl~VkeGD~Vk~G~pLF~dK~n 63 (257) T pfam05896 33 GEDYVGMKPKMLVKEGDKVKAGQPLFEDKKN 63 (257) T ss_pred CCCCCCCCEEEEEECCCEEECCCEEEEEECC T ss_conf 6014887326998349998558757997369 No 157 >pfam02666 PS_Dcarbxylase Phosphatidylserine decarboxylase. This is a family of phosphatidylserine decarboxylases, EC:4.1.1.65. These enzymes catalyse the reaction: Phosphatidyl-L-serine <= phosphatidylethanolamine + CO2. Phosphatidylserine decarboxylase plays a central role in the biosynthesis of aminophospholipids by converting phosphatidylserine to phosphatidylethanolamine. Probab=84.05 E-value=1.5 Score=23.69 Aligned_cols=62 Identities=26% Similarity=0.374 Sum_probs=38.3 Q ss_pred EEEECCCEEEEEECCCCCCCEE-CCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCCCCCCCCCEEE Q ss_conf 6870462567740367641000-12442010434899973772689998998199999848986836997299 Q gi|254780267|r 91 TVTSPMVGTAYLASSPGSDPFV-NKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQSVEYGDALL 162 (171) Q Consensus 91 ~I~SPmvGtfy~~psP~~~~~V-~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~ 162 (171) .|.+-++|.+- .+. +.|+.|++||.++.+.= .=-..+.-|. +.+.++.++.|+.|..||.|. T Consensus 138 ~Vga~~v~~I~--------~~~~~~g~~v~kG~e~G~f~f-GStV~l~f~~-~~~~~~~v~~g~~V~~Ge~ig 200 (201) T pfam02666 138 QVGALNVGSIV--------LYFKEEGDEVKKGDELGYFKF-GSTVVLLFPK-GTIFNFNVKVGQKVKAGETIG 200 (201) T ss_pred EECCCCCCEEE--------EEECCCCCEEECCCEEEEEEC-CCEEEEEEEC-CCCCCEEECCCCEEEEEEECC T ss_conf 95530136789--------997158979845748807962-8809999959-985221278999998363627 No 158 >cd01134 V_A-ATPase_A V/A-type ATP synthase catalytic subunit A. These ATPases couple ATP hydrolysis to the build up of a H+ gradient, but V-type ATPases do not catalyze the reverse reaction. The Vacuolar (V-type) ATPase is found in the membranes of vacuoles, the golgi apparatus and in other coated vesicles in eukaryotes. Archaea have a protein which is similar in sequence to V-ATPases, but functions like an F-ATPase (called A-ATPase). A similar protein is also found in a few bacteria. Probab=82.53 E-value=4.1 Score=21.07 Aligned_cols=57 Identities=28% Similarity=0.395 Sum_probs=35.2 Q ss_pred CCEECCCCCEEECCEEEEE-ECCCEEEEEECC--CCCEEEEEECCCCCCCCCCCEEEEEEEC Q ss_conf 1000124420104348999-737726899989--9819999984898683699729999534 Q gi|254780267|r 109 DPFVNKGNLVVEGQTLLII-EAMKTMNHIVAP--CSGKVQDINVKDGQSVEYGDALLVLEKT 167 (171) Q Consensus 109 ~~~V~vGd~Vk~G~~l~ii-EAMKm~n~I~a~--~~G~I~~i~v~~G~~V~~gq~L~~i~~~ 167 (171) .|.|++||+|+.||+++.+ |---+.|-|-.| ..|+|..|...-.-.| .+.+..++.. T Consensus 52 ~p~v~~Gd~V~~GDi~G~V~E~~~i~HkImvpp~~~G~v~~i~~~g~y~v--~~~~~~~~~~ 111 (369) T cd01134 52 KPLVKVGDHVTGGDILGTVPENSLIEHKIMVPPRVRGTVTYIAPAGDYTV--DDVILEVEFD 111 (369) T ss_pred CCCCCCCCEECCCCEEEEEECCCCEEEEEECCCCCCEEEEEEECCCCCEE--EEEEEEECCC T ss_conf 00267799962786799985578367778648987379999915898524--6569997068 No 159 >cd00210 PTS_IIA_glc PTS_IIA, PTS system, glucose/sucrose specific IIA subunit. The bacterial phosphoenolpyruvate: sugar phosphotransferase system (PTS) is a multi-protein system involved in the regulation of a variety of metabolic and transcriptional processes. This family is one of four structurally and functionally distinct group IIA PTS system cytoplasmic enzymes, necessary for the uptake of carbohydrates across the cytoplasmic membrane and their phosphorylation. Probab=82.29 E-value=0.79 Score=25.44 Aligned_cols=24 Identities=25% Similarity=0.367 Sum_probs=19.0 Q ss_pred CCEECCCCCEEECCEEEEEECCCE Q ss_conf 100012442010434899973772 Q gi|254780267|r 109 DPFVNKGNLVVEGQTLLIIEAMKT 132 (171) Q Consensus 109 ~~~V~vGd~Vk~G~~l~iiEAMKm 132 (171) +.+|+.||+|++||.|+-+..=++ T Consensus 83 ~~~v~~Gd~V~~G~~L~~~D~~~i 106 (124) T cd00210 83 TSHVEEGQRVKQGDKLLEFDLPAI 106 (124) T ss_pred EEEECCCCEECCCCEEEEECHHHH T ss_conf 899748899989999999859999 No 160 >KOG2419 consensus Probab=82.05 E-value=0.9 Score=25.09 Aligned_cols=31 Identities=16% Similarity=0.290 Sum_probs=16.7 Q ss_pred CCCCC----HHHHHHHHHHHHHCCCEEEEEEECCEE Q ss_conf 32139----899999999998659708999509969 Q gi|254780267|r 5 KQKIN----LTLIRNLANILNETNLTEVEIDNDGMR 36 (171) Q Consensus 5 ~~~md----i~~Ik~Li~~~~es~l~eleie~~~~k 36 (171) .+||| -++|+-+|++++ -+..|..+-.++|| T Consensus 712 GkK~dS~~Sak~I~pFi~Ff~-lnm~ev~~p~~~FK 746 (975) T KOG2419 712 GKKMDSVESAKQIPPFIEFFK-LNMAEVKYPLKHFK 746 (975) T ss_pred CCCCCCHHHHHHCCHHHHHHH-CCHHHHCCCCCCCH T ss_conf 656576454542625786654-24354328644420 No 161 >pfam00358 PTS_EIIA_1 phosphoenolpyruvate-dependent sugar phosphotransferase system, EIIA 1. Probab=81.66 E-value=0.86 Score=25.21 Aligned_cols=21 Identities=29% Similarity=0.539 Sum_probs=16.0 Q ss_pred CCEECCCCCEEECCEEEEEEC Q ss_conf 100012442010434899973 Q gi|254780267|r 109 DPFVNKGNLVVEGQTLLIIEA 129 (171) Q Consensus 109 ~~~V~vGd~Vk~G~~l~iiEA 129 (171) +.+|+.||+|++||.|+-+.. T Consensus 88 ~~~v~~Gd~V~~G~~L~~~D~ 108 (133) T pfam00358 88 ESHVEEGDRVKQGDKLLEFDL 108 (133) T ss_pred EEEECCCCEECCCCEEEEECH T ss_conf 899758999989999999859 No 162 >pfam07831 PYNP_C Pyrimidine nucleoside phosphorylase C-terminal domain. This domain is found at the C-terminal end of the large alpha/beta domain making up various pyrimidine nucleoside phosphorylases. It has slightly different conformations in different members of this family. For example, in pyrimidine nucleoside phosphorylase (PYNP) there is an added three-stranded anti-parallel beta sheet as compared to other members of the family, such as E. coli thymidine phosphorylase (TP). The domain contains an alpha/ beta hammerhead fold and residues in this domain seem to be important in formation of the homodimer. Probab=81.01 E-value=0.98 Score=24.87 Aligned_cols=29 Identities=28% Similarity=0.288 Sum_probs=22.1 Q ss_pred ECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEECCC Q ss_conf 04625677403676410001244201043489997377 Q gi|254780267|r 94 SPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAMK 131 (171) Q Consensus 94 SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAMK 131 (171) -|.+|.. -++++||+|++||+||.|-+-. T Consensus 29 D~~vGi~---------l~~k~Gd~V~~g~pl~~i~~~~ 57 (75) T pfam07831 29 DPGAGIY---------LHKKVGDKVKKGDPLATIYAND 57 (75) T ss_pred CHHHHHH---------EECCCCCEECCCCEEEEEECCC T ss_conf 7112254---------0246899966999399997799 No 163 >pfam01333 Apocytochr_F_C Apocytochrome F, C-terminal. This is a sub-family of cytochrome C. See pfam00034. Probab=77.82 E-value=2 Score=22.97 Aligned_cols=19 Identities=32% Similarity=0.478 Sum_probs=12.6 Q ss_pred CCCCEECCCCCEEECCEEE Q ss_conf 6410001244201043489 Q gi|254780267|r 107 GSDPFVNKGNLVVEGQTLL 125 (171) Q Consensus 107 ~~~~~V~vGd~Vk~G~~l~ 125 (171) |-+..|++||.|+.||.|- T Consensus 43 GpeliV~~G~~V~~dqpLT 61 (118) T pfam01333 43 GPELIVSEGQTVKADQPLT 61 (118) T ss_pred CCEEEECCCCEEECCCCCC T ss_conf 9748973898886698444 No 164 >TIGR01043 ATP_syn_A_arch ATP synthase archaeal, A subunit; InterPro: IPR005726 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (F-, V- and A-ATPases contain rotary motors) and in the type of ions they transport , . F-ATPases (F1F0-ATPases) in mitochondria, chloroplasts and bacterial plasma membranes are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases) are primarily found in eukaryotic vacuoles, catalysing ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases) are found in Archaea and function like F-ATPases. P-ATPases (E1E2-ATPases) are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. A-ATPases (or A1A0-ATPase) (3.6.3.14 from EC) are found exclusively in Archaea and display a close resemblance in structure and subunit composition with V-ATPases, although their function in both ATP synthesis and ATP hydrolysis is closer to that of F-ATPases . A-ATPases are composed of two linked complexes: the A1 complex consisting of seven subunits contains the catalytic core that synthesizes/hydrolyses ATP, while the A0 complex consisting of at least two subunits forms the membrane-spanning pore . The rotary motor in A-ATPases is composed of only two subunits, the stator subunit I and the rotor subunit C . A-ATPases may have arisen as an adaptation to the different cellular needs and the more extreme environmental conditions faced by Archaeal species. This entry represents the alpha subunit from the A1 complex of A-ATPases. The A1 complex contains three copies each of subunits alpha (or A) and beta (or B) (IPR005724 from INTERPRO), both of which form the headpiece that participates in nucleotide binding. However, only the alpha subunit is catalytic, the beta subunit being regulatory in function . More information about this protein can be found at Protein of the Month: ATP Synthases .; GO: 0016820 hydrolase activity acting on acid anhydrides catalyzing transmembrane movement of substances, 0015986 ATP synthesis coupled proton transport, 0016021 integral to membrane, 0016469 proton-transporting two-sector ATPase complex. Probab=74.72 E-value=3.4 Score=21.60 Aligned_cols=58 Identities=26% Similarity=0.485 Sum_probs=41.0 Q ss_pred CCEECCCCCEEECCEEEEE-ECCCEEEEEE--CCCCC-EEEEEECCCCCCCCCCCEEEEEEECCC Q ss_conf 1000124420104348999-7377268999--89981-999998489868369972999953478 Q gi|254780267|r 109 DPFVNKGNLVVEGQTLLII-EAMKTMNHIV--APCSG-KVQDINVKDGQSVEYGDALLVLEKTGD 169 (171) Q Consensus 109 ~~~V~vGd~Vk~G~~l~ii-EAMKm~n~I~--a~~~G-~I~~i~v~~G~~V~~gq~L~~i~~~~~ 169 (171) .|.|+.||+|+.||+|+.+ |.--++|.|- ..+.| +|.+|. .|+.-= .+.+..++.+|+ T Consensus 120 ~P~v~~Gd~V~~Gd~~G~V~ET~~i~h~ilvPp~~~gg~~~~~~--~G~ftV-~d~i~~~~~~G~ 181 (584) T TIGR01043 120 KPTVKEGDKVEGGDIIGVVQETSLIEHKILVPPNVEGGEIVEIA--EGDFTV-EDTIAVVEKDGE 181 (584) T ss_pred CCEECCCCEECCCCEEEEECCCCCEEEEEEECCCCCCCEEEEEE--CCCEEE-EEEEEEEECCCC T ss_conf 22122686434761587763765033477735745787577872--673146-556899822788 No 165 >TIGR01730 RND_mfp efflux transporter, RND family, MFP subunit; InterPro: IPR006143 Gram-negative bacteria produce a number of proteins which are secreted into the growth medium by a mechanism that does not require a cleaved N-terminal signal sequence. These proteins, while having different functions, require the help of two or more proteins for their secretion across the cell envelope. These secretion proteins include members belonging to the ABC transporter family (see the relevant entry IPR003439 from INTERPRO) and a protein belonging to a family which includes the following members , , , , : Hemolysin secretion protein D from Escherichia coli. Lactococcin A secretion protein lcnD from Lactococcus lactis. RTX-I toxin determinant D from Actinobacillus pleuropneumoniae. Calmodulin-sensitive adenylate cyclase-hemolysin (cyclolysin) cyaD from Bordetella pertussis. Colicin Vsecretion protein cvaA from Escherichia coli. Proteases secretion protein prtE from Erwinia chrysanthemi. Alkaline protease secretion protein aprE from Pseudomonas aeruginosa. Multidrug resistance protein A from Escherichia coli. The secretion proteins are evolutionary related and consist of from 390 to 480 amino acid residues. They seem to be anchored in the inner membrane by a N-terminal transmembrane region. Their exact role in the secretion process is not yet known.; GO: 0008565 protein transporter activity, 0009306 protein secretion, 0016020 membrane. Probab=73.78 E-value=4.2 Score=21.00 Aligned_cols=33 Identities=33% Similarity=0.414 Sum_probs=20.6 Q ss_pred EEECCCCCEEEEEECCCCCCCCCC--CEEEEEEEC Q ss_conf 999899819999984898683699--729999534 Q gi|254780267|r 135 HIVAPCSGKVQDINVKDGQSVEYG--DALLVLEKT 167 (171) Q Consensus 135 ~I~a~~~G~I~~i~v~~G~~V~~g--q~L~~i~~~ 167 (171) .+.+|++|.|....+..|+.|..| ++|+.+... T Consensus 136 ~~~~p~~g~~~~~~~~~g~~~~~~~~~~~~~~~~~ 170 (333) T TIGR01730 136 EIRAPFDGTIGRRLVEVGEYVSAGQATPLLTLVDL 170 (333) T ss_pred EEECCCCCEEEEEECCCCCCCCCCCCCEEEEEEEC T ss_conf 55234343011111022331135554114676414 No 166 >COG0739 NlpD Membrane proteins related to metalloendopeptidases [Cell envelope biogenesis, outer membrane] Probab=72.18 E-value=3.1 Score=21.82 Aligned_cols=19 Identities=26% Similarity=0.392 Sum_probs=12.0 Q ss_pred CCEECCCCCEEECCEEEEE Q ss_conf 1000124420104348999 Q gi|254780267|r 109 DPFVNKGNLVVEGQTLLII 127 (171) Q Consensus 109 ~~~V~vGd~Vk~G~~l~ii 127 (171) ...|++||+|++||+|+.+ T Consensus 215 ~~~V~~G~~V~~G~~Ig~~ 233 (277) T COG0739 215 SILVKEGQKVKAGQVIGYV 233 (277) T ss_pred CCCCCCCCEEECCCEEEEC T ss_conf 0131589788469789972 No 167 >pfam06572 DUF1131 Protein of unknown function (DUF1131). This family consists of several hypothetical bacterial proteins of unknown function. Probab=70.29 E-value=9.1 Score=18.99 Aligned_cols=43 Identities=28% Similarity=0.309 Sum_probs=37.0 Q ss_pred ECCCCCEEECCEEEEEECCC---EEEEEECCCCCEEEEEECCCCCC Q ss_conf 01244201043489997377---26899989981999998489868 Q gi|254780267|r 112 VNKGNLVVEGQTLLIIEAMK---TMNHIVAPCSGKVQDINVKDGQS 154 (171) Q Consensus 112 V~vGd~Vk~G~~l~iiEAMK---m~n~I~a~~~G~I~~i~v~~G~~ 154 (171) +.-|-+-+.|+++-+.|||| ...+|.-+-.|+|.+|-|.|-+. T Consensus 70 lRsGM~t~nG~iv~~fqAmk~d~V~lvI~G~~kGtV~RI~V~D~di 115 (192) T pfam06572 70 LRSGMKTSNGQIVRYFQAMKGDQVKLVINGQPKGTVSRIDVMDSDI 115 (192) T ss_pred ECCCCCCCCCEEEEEEEEECCCCEEEEEECCCCCEEEEEEEECCCC T ss_conf 6365520598199998760688179999668886499999855775 No 168 >CHL00006 consensus Probab=69.79 E-value=7.1 Score=19.63 Aligned_cols=50 Identities=18% Similarity=0.264 Sum_probs=32.5 Q ss_pred CCCCEECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEE---------------------EECCCCCCCCCCCEEEEE Q ss_conf 6410001244201043489997377268999899819999---------------------984898683699729999 Q gi|254780267|r 107 GSDPFVNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQD---------------------INVKDGQSVEYGDALLVL 164 (171) Q Consensus 107 ~~~~~V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~---------------------i~v~~G~~V~~gq~L~~i 164 (171) +..++|..|+-+.+|+-++ ....++.||.|.. +++.+|+.|.-|+.|+.+ T Consensus 1054 ~~~~~VsLGqfI~eg~~I~--------~~~~~~~SGQIi~I~~~~ivIRlakPYLaS~GA~vhv~~GdlI~eGDtLvtL 1124 (1372) T CHL00006 1054 ETSTIISLGQFICENVCIA--------KNGPHLKSGQVIIVQVDSVVIRSAKPYLATPGATVHGHYGEILYEGDTLVTF 1124 (1372) T ss_pred CCCCEECCCCEEECCCEEC--------CCCCCCCCCCEEEEECCEEEEEECCCEECCCCCEEEECCCCEECCCCEEEEE T ss_conf 3562513765564372771--------7987787741799957879999514236389968981588887689857998 No 169 >PRK03140 phosphatidylserine decarboxylase; Provisional Probab=68.98 E-value=5.5 Score=20.34 Aligned_cols=18 Identities=50% Similarity=0.934 Sum_probs=10.2 Q ss_pred CCCCCCCEEEECCCEEEE Q ss_conf 345466268704625677 Q gi|254780267|r 84 LSPDNYHTVTSPMVGTAY 101 (171) Q Consensus 84 ~~~~~~~~I~SPmvGtfy 101 (171) ..+.++|.+-||.-|++- T Consensus 133 LsP~DYHR~H~P~~g~v~ 150 (259) T PRK03140 133 LSPSHYHRIHSPISGTVT 150 (259) T ss_pred ECHHHCCCEECCCCEEEE T ss_conf 783460215356761999 No 170 >PRK11649 hypothetical protein; Provisional Probab=68.88 E-value=3.8 Score=21.30 Aligned_cols=22 Identities=18% Similarity=0.294 Sum_probs=13.3 Q ss_pred CCEECCCCCEEECCEEEEEECC Q ss_conf 1000124420104348999737 Q gi|254780267|r 109 DPFVNKGNLVVEGQTLLIIEAM 130 (171) Q Consensus 109 ~~~V~vGd~Vk~G~~l~iiEAM 130 (171) +..|+.||+|+.||+++.+-+= T Consensus 343 ~~~V~~Gq~V~~Gq~Ig~vGsT 364 (418) T PRK11649 343 KLLVKPGQKVKRGDRIALSGNT 364 (418) T ss_pred CCCCCCCCEECCCCEEEEECCC T ss_conf 4777994999989989802688 No 171 >COG0213 DeoA Thymidine phosphorylase [Nucleotide transport and metabolism] Probab=67.85 E-value=5 Score=20.59 Aligned_cols=19 Identities=32% Similarity=0.330 Sum_probs=15.2 Q ss_pred EECCCCCEEECCEEEEEEC Q ss_conf 0012442010434899973 Q gi|254780267|r 111 FVNKGNLVVEGQTLLIIEA 129 (171) Q Consensus 111 ~V~vGd~Vk~G~~l~iiEA 129 (171) ..+.||+|++|++|+.|-| T Consensus 382 ~kk~ge~Vk~Gd~l~tiya 400 (435) T COG0213 382 HKKLGEKVKKGDPLATIYA 400 (435) T ss_pred EECCCCEECCCCEEEEEEC T ss_conf 8317983026974799862 No 172 >COG1155 NtpA Archaeal/vacuolar-type H+-ATPase subunit A [Energy production and conversion] Probab=65.57 E-value=11 Score=18.40 Aligned_cols=56 Identities=20% Similarity=0.304 Sum_probs=35.4 Q ss_pred CCEECCCCCEEECCEEEEEECCCEEEEEE--CCCCCEEEEEECCCCCCCCCCCEEEEEE Q ss_conf 10001244201043489997377268999--8998199999848986836997299995 Q gi|254780267|r 109 DPFVNKGNLVVEGQTLLIIEAMKTMNHIV--APCSGKVQDINVKDGQSVEYGDALLVLE 165 (171) Q Consensus 109 ~~~V~vGd~Vk~G~~l~iiEAMKm~n~I~--a~~~G~I~~i~v~~G~~V~~gq~L~~i~ 165 (171) .|-|++||+|..||+|+.+.---.++.|- -+++|.+..+.++.|+--= ++.+.+++ T Consensus 119 ~P~~~~Gd~V~~GdvlGtV~Et~~i~~imvpp~~~~~~v~~i~~~G~ytv-~d~ia~v~ 176 (588) T COG1155 119 VPAVKKGDTVYPGDVLGTVQETSLITHRIMVPPGVSGKVTWIAEEGEYTV-EDVIATVS 176 (588) T ss_pred CCCCCCCCEECCCCEEEEECCCCCEEEEEECCCCCCEEEEEEECCCCCEE-EEEEEEEE T ss_conf 34666698752685489860677357998378877628999955887156-67999995 No 173 >COG4072 Uncharacterized protein conserved in archaea [Function unknown] Probab=64.66 E-value=11 Score=18.61 Aligned_cols=40 Identities=25% Similarity=0.392 Sum_probs=32.7 Q ss_pred CCEECCCCCEEECCEEEEEECCCEEE-EEECCCCCEEEEEE Q ss_conf 10001244201043489997377268-99989981999998 Q gi|254780267|r 109 DPFVNKGNLVVEGQTLLIIEAMKTMN-HIVAPCSGKVQDIN 148 (171) Q Consensus 109 ~~~V~vGd~Vk~G~~l~iiEAMKm~n-~I~a~~~G~I~~i~ 148 (171) -|.+.+|++|.+||.++-+-.-|-+. =+.+|++|+|.-|. T Consensus 104 tpIaDvG~RvrkGd~~AAvttRkG~vryv~~P~~g~Vvyi~ 144 (161) T COG4072 104 TPIADVGNRVRKGDPFAAVTTRKGEVRYVKPPVPGTVVYID 144 (161) T ss_pred EEEECCCCHHCCCCCEEEEEECCCCEEEECCCCCCEEEEEE T ss_conf 99601154100788405887314526882599982899995 No 174 >PRK10718 hypothetical protein; Provisional Probab=63.77 E-value=12 Score=18.19 Aligned_cols=42 Identities=29% Similarity=0.374 Sum_probs=34.4 Q ss_pred ECCCCCEEECCEEEEEECCC---EEEEEECCCCCEEEEEECCCCCC Q ss_conf 01244201043489997377---26899989981999998489868 Q gi|254780267|r 112 VNKGNLVVEGQTLLIIEAMK---TMNHIVAPCSGKVQDINVKDGQS 154 (171) Q Consensus 112 V~vGd~Vk~G~~l~iiEAMK---m~n~I~a~~~G~I~~i~v~~G~~ 154 (171) +.-|-+-..|+++-+.|||| ....|.-+ .|+|.+|-|.|-+. T Consensus 70 lRsGM~t~~G~iv~~fqA~k~d~V~lvi~G~-kgtV~rI~V~D~~i 114 (191) T PRK10718 70 LRSGMKTANGNVVRFFEAMKGDQVAMVINGQ-QGTVSRIDVLDSDI 114 (191) T ss_pred ECCCCEECCCEEEEEEEECCCCEEEEEEECC-CCEEEEEEEECCCC T ss_conf 6365521498499988541788189999668-86399999855554 No 175 >PRK06148 hypothetical protein; Provisional Probab=61.32 E-value=6.3 Score=19.94 Aligned_cols=43 Identities=14% Similarity=0.156 Sum_probs=27.8 Q ss_pred CCCEEEECCCEEEEEE-------------------------------CCCCCCCEECCCCCEEECCEEEEEECC Q ss_conf 6626870462567740-------------------------------367641000124420104348999737 Q gi|254780267|r 88 NYHTVTSPMVGTAYLA-------------------------------SSPGSDPFVNKGNLVVEGQTLLIIEAM 130 (171) Q Consensus 88 ~~~~I~SPmvGtfy~~-------------------------------psP~~~~~V~vGd~Vk~G~~l~iiEAM 130 (171) .+..|.||+.|+++.+ -+|+...-.++||+|++|+.+|-+-+- T Consensus 452 ~gT~V~AP~dG~V~~~~n~~~~l~yGg~iiL~h~~~~~~~F~tLyGhl~~~~~~~l~~G~~i~~G~~ig~~G~~ 525 (1015) T PRK06148 452 AGTPVYAPLAGTVRSVEIEAVPLGYGGLIALEHEPPGCPPFYTLWGHLAHEAVSRLKPGDRLAAGELVGHMGDA 525 (1015) T ss_pred CCCCEECCCCCEEEEEECCCCCCCCCCEEEEEECCCCCCCEEEEECCCCHHHHHCCCCCCCCCCCCEEEECCCC T ss_conf 99804466586699962476555677548998425888750565235772553126888850467712342665 No 176 >pfam02749 QRPTase_N Quinolinate phosphoribosyl transferase, N-terminal domain. Quinolinate phosphoribosyl transferase (QPRTase) or nicotinate-nucleotide pyrophosphorylase EC:2.4.2.19 is involved in the de novo synthesis of NAD in both prokaryotes and eukaryotes. It catalyses the reaction of quinolinic acid with 5-phosphoribosyl-1-pyrophosphate (PRPP) in the presence of Mg2+ to give rise to nicotinic acid mononucleotide (NaMN), pyrophosphate and carbon dioxide. The QA substrate is bound between the C-terminal domain of one subunit, and the N-terminal domain of the other. The N-terminal domain has an alpha/beta hammerhead fold. Probab=60.33 E-value=10 Score=18.66 Aligned_cols=22 Identities=45% Similarity=0.664 Sum_probs=13.9 Q ss_pred EEECCCCCCCCCCCEEEEEEEC Q ss_conf 9984898683699729999534 Q gi|254780267|r 146 DINVKDGQSVEYGDALLVLEKT 167 (171) Q Consensus 146 ~i~v~~G~~V~~gq~L~~i~~~ 167 (171) ..++.||+.|+.|+.|++|+=+ T Consensus 48 ~~~~~dG~~v~~g~~i~~i~G~ 69 (88) T pfam02749 48 EWLVKDGERVEAGDVILEIEGP 69 (88) T ss_pred EEECCCCCEECCCCEEEEEEEC T ss_conf 9996889991799999999988 No 177 >KOG1668 consensus Probab=58.64 E-value=7.5 Score=19.51 Aligned_cols=35 Identities=23% Similarity=0.141 Sum_probs=28.3 Q ss_pred CCCEEEEEECCCCCCCEECCCCCEEECCEEEEEECCCEEE Q ss_conf 4625677403676410001244201043489997377268 Q gi|254780267|r 95 PMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAMKTMN 134 (171) Q Consensus 95 PmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAMKm~n 134 (171) -|-|.+|-+ --||-+|.-+|+=|+.|+||.-|.-. T Consensus 172 ~~~gl~wga-----sklvpvGygikKlqi~~vveddkvs~ 206 (231) T KOG1668 172 EMDGLVWGA-----SKLVPVGYGIKKLQIQCVVEDDKVSI 206 (231) T ss_pred HHCCCCCCC-----CCCCCCCCCEEEEEEEEEEECCCCCC T ss_conf 311101043-----11133555403688999998375461 No 178 >PRK01497 DNA-directed RNA polymerase subunit beta'; Provisional Probab=57.29 E-value=16 Score=17.50 Aligned_cols=17 Identities=35% Similarity=0.454 Sum_probs=9.4 Q ss_pred CEECCCCCEEECCEEEE Q ss_conf 00012442010434899 Q gi|254780267|r 110 PFVNKGNLVVEGQTLLI 126 (171) Q Consensus 110 ~~V~vGd~Vk~G~~l~i 126 (171) .++..|+.|++|++++- T Consensus 994 l~v~~~~~v~~~~vi~~ 1010 (1384) T PRK01497 994 LLVKDGDAVKAGQVLAE 1010 (1384) T ss_pred EEECCCCEEECCEEEEE T ss_conf 99637867742418985 No 179 >PRK06149 hypothetical protein; Provisional Probab=57.14 E-value=9.2 Score=18.96 Aligned_cols=39 Identities=28% Similarity=0.342 Sum_probs=27.2 Q ss_pred CCCEEEECCCEEEEEECCC--------------CCCCEECCCCCEEECCEEEE Q ss_conf 6626870462567740367--------------64100012442010434899 Q gi|254780267|r 88 NYHTVTSPMVGTAYLASSP--------------GSDPFVNKGNLVVEGQTLLI 126 (171) Q Consensus 88 ~~~~I~SPmvGtfy~~psP--------------~~~~~V~vGd~Vk~G~~l~i 126 (171) .+..|.||+-|.+.+.... |..+-+++|++|++||.|+- T Consensus 441 agTpV~AP~~G~V~~~~~g~~~l~~~~~~~~L~g~~~~~~~g~~i~~G~~ig~ 493 (972) T PRK06149 441 AGTAVAAPFAGEIGRDGDGRLTLRGDAWELHLTGVEPAVADGAAVEAGQPLGS 493 (972) T ss_pred CCCEEECCCCCEEEECCCCEEEEECCCEEEEEECCCCCCCCCCEEECCCCCCC T ss_conf 99805466576699716964898517548998256776677675206770033 No 180 >PRK10334 mechanosensitive channel MscS; Provisional Probab=53.81 E-value=18 Score=17.15 Aligned_cols=36 Identities=17% Similarity=0.227 Sum_probs=32.4 Q ss_pred ECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEE Q ss_conf 012442010434899973772689998998199999 Q gi|254780267|r 112 VNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDI 147 (171) Q Consensus 112 V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i 147 (171) .++||.|+.|..-|.+|.+.+++...-..||+..-| T Consensus 130 f~vGD~I~i~g~~G~V~~I~lr~T~lrt~Dg~~v~I 165 (285) T PRK10334 130 FRAGEYVDLGGVAGTVLSVQIFSTTMRTADGKIIVI 165 (285) T ss_pred CCCCCEEEECCEEEEEEEEEEEEEEEECCCCCEEEE T ss_conf 147976999999999999886799998689978996 No 181 >pfam03829 PTSIIA_gutA PTS system glucitol/sorbitol-specific IIA component. Probab=51.57 E-value=8.2 Score=19.25 Aligned_cols=25 Identities=16% Similarity=0.302 Sum_probs=19.7 Q ss_pred EECCCEEEEEECCCCCCCEECCCCCEE Q ss_conf 704625677403676410001244201 Q gi|254780267|r 93 TSPMVGTAYLASSPGSDPFVNKGNLVV 119 (171) Q Consensus 93 ~SPmvGtfy~~psP~~~~~V~vGd~Vk 119 (171) .++++|++|+.+. ..|-+++|++++ T Consensus 93 e~~lpG~I~v~~~--~~p~i~vG~~I~ 117 (117) T pfam03829 93 EAELPGTIHVEGK--PPPDIKVGSTIK 117 (117) T ss_pred CCCCCCEEEECCC--CCCCCCCCCCCC T ss_conf 7676986898789--899788889939 No 182 >PRK06106 nicotinate-nucleotide pyrophosphorylase; Provisional Probab=51.41 E-value=20 Score=16.92 Aligned_cols=19 Identities=21% Similarity=0.373 Sum_probs=11.9 Q ss_pred CEECCCCCEEECCEEEEEE Q ss_conf 0001244201043489997 Q gi|254780267|r 110 PFVNKGNLVVEGQTLLIIE 128 (171) Q Consensus 110 ~~V~vGd~Vk~G~~l~iiE 128 (171) .+++.|+.|++||+|+-++ T Consensus 72 ~~~~DG~~v~~g~~i~~i~ 90 (281) T PRK06106 72 RHLPDGAAVAPGDVIATIS 90 (281) T ss_pred EECCCCCEECCCCEEEEEE T ss_conf 9858898813898689999 No 183 >PRK10871 nlpD lipoprotein NlpD; Provisional Probab=51.37 E-value=20 Score=16.91 Aligned_cols=25 Identities=24% Similarity=0.417 Sum_probs=17.4 Q ss_pred EEEEECCCCCCCCCCCEEEEEEECC Q ss_conf 9999848986836997299995347 Q gi|254780267|r 144 VQDINVKDGQSVEYGDALLVLEKTG 168 (171) Q Consensus 144 I~~i~v~~G~~V~~gq~L~~i~~~~ 168 (171) -.+++|+.||.|..||.+-.+-.++ T Consensus 324 n~~~lVkeg~~V~~Gq~Ia~~G~tg 348 (374) T PRK10871 324 NDTMLVREQQEVKAGQKIATMGSTG 348 (374) T ss_pred CCCCCCCCCCEECCCCEEEEECCCC T ss_conf 6626678889988999898643789 No 184 >pfam01551 Peptidase_M23 Peptidase family M23. Members of this family are zinc metallopeptidases with a range of specificities. The peptidase family M23 is included in this family, these are Gly-Gly endopeptidases. Peptidase family M23 are also endopeptidases. This family also includes some bacterial lipoproteins, for which no proteolytic activity has been demonstrated. This family also includes leukocyte cell-derived chemotaxin 2 (LECT2) proteins. LECT2 is a liver-specific protein which is thought to be linked to hepatocyte growth although the exact function of this protein is unknown. Probab=49.30 E-value=22 Score=16.71 Aligned_cols=20 Identities=25% Similarity=0.312 Sum_probs=13.9 Q ss_pred CCEECCCCCEEECCEEEEEE Q ss_conf 10001244201043489997 Q gi|254780267|r 109 DPFVNKGNLVVEGQTLLIIE 128 (171) Q Consensus 109 ~~~V~vGd~Vk~G~~l~iiE 128 (171) ...|++||.|++||.|+.+- T Consensus 54 ~~~V~~G~~V~~G~~IG~~G 73 (96) T pfam01551 54 KILVKVGQRVKAGQVIGTVG 73 (96) T ss_pred EEEECCCCCHHHCCEEEEEC T ss_conf 67795898901399999867 No 185 >PRK04192 V-type ATP synthase subunit A; Provisional Probab=46.27 E-value=21 Score=16.79 Aligned_cols=40 Identities=28% Similarity=0.443 Sum_probs=27.2 Q ss_pred CCEECCCCCEEECCEEEEEEC-CCEEEEEEC--CCCCEEEEEE Q ss_conf 100012442010434899973-772689998--9981999998 Q gi|254780267|r 109 DPFVNKGNLVVEGQTLLIIEA-MKTMNHIVA--PCSGKVQDIN 148 (171) Q Consensus 109 ~~~V~vGd~Vk~G~~l~iiEA-MKm~n~I~a--~~~G~I~~i~ 148 (171) .|.+++||.|..||+++.+.- -=+.|.|-. ...|+|..|. T Consensus 121 ~P~~~~Gd~v~~Gdilg~V~Et~~i~H~imvPp~~~g~v~~i~ 163 (585) T PRK04192 121 KPTVKVGDKVVAGDVLGTVQETGSIEHKIMVPPGVSGTIKEIA 163 (585) T ss_pred CCCCCCCCCCCCCCEEEEEECCCCCCCCEECCCCCCCEEEEEE T ss_conf 1113558741578657885046653341106988873389970 No 186 >TIGR00078 nadC nicotinate-nucleotide pyrophosphorylase; InterPro: IPR004393 Nicotinate-nucleotide pyrophosphorylase (2.4.2.19 from EC), also known as quinolinate phosphoribosyltransferase (decarboxylating), catalyses the conversion of nicotinate D-ribonucleotide, pyrophosphate and carbon dioxide into pyridine-2,3-dicarboxylate and 5-phospho-alpha-D-ribose 1-diphosphate. This enzyme is a type II phosphoribosyltransferase which provides the de novo source of nicotinate mononucleotide (NAMN) for NAD biosynthesis in both prokaryotes and eukaryotes , . Structural studies have shown that the active form of this enzyme is a homodimer with an unsual fold , , . The N-terminal forms a mixed alpha/beta domain, while the C-terminal region forms a multi-stranded, open alpha/beta barrel. The active site is found at the C-terminal ends of the beta strands of the alpha/beta barrel, and is bordered by the N-terminal domain of the second subunit of the dimer. It contains several conserved charged residues thought to be important determinants of substrate binding and catalysis. ; GO: 0004514 nicotinate-nucleotide diphosphorylase (carboxylating) activity, 0019363 pyridine nucleotide biosynthetic process. Probab=43.65 E-value=15 Score=17.63 Aligned_cols=21 Identities=24% Similarity=0.403 Sum_probs=15.5 Q ss_pred CCEECCCCCEEECCEEEEEEC Q ss_conf 100012442010434899973 Q gi|254780267|r 109 DPFVNKGNLVVEGQTLLIIEA 129 (171) Q Consensus 109 ~~~V~vGd~Vk~G~~l~iiEA 129 (171) ..+++.|+.|+.||+|+-||- T Consensus 63 ~~~~~DG~~~~~G~~v~~~~G 83 (276) T TIGR00078 63 EWLVKDGDEVEAGEVVAEVEG 83 (276) T ss_pred EEECCCCCEECCCCEEEEEEC T ss_conf 410378877447738999974 No 187 >cd01568 QPRTase_NadC Quinolinate phosphoribosyl transferase (QAPRTase or QPRTase), also called nicotinate-nucleotide pyrophosphorylase, is involved in the de novo synthesis of NAD in both prokaryotes and eukaryotes. It catalyses the reaction of quinolinic acid (QA) with 5-phosphoribosyl-1-pyrophosphate (PRPP) in the presence of Mg2+ to produce nicotinic acid mononucleotide (NAMN), pyrophosphate and carbon dioxide. QPRTase functions as a homodimer with two active sites, each formed by the C-terminal region of one subunit and the N-terminal region of the other. Probab=42.03 E-value=28 Score=16.03 Aligned_cols=25 Identities=44% Similarity=0.539 Sum_probs=15.3 Q ss_pred CEEEEEECCCCCCCCCCCEEEEEEE Q ss_conf 1999998489868369972999953 Q gi|254780267|r 142 GKVQDINVKDGQSVEYGDALLVLEK 166 (171) Q Consensus 142 G~I~~i~v~~G~~V~~gq~L~~i~~ 166 (171) |.-...++.+|+.|..|+.++.++= T Consensus 56 ~~~~~~~~~dG~~v~~g~~i~~i~G 80 (269) T cd01568 56 GIEVEWLVKDGDRVEAGQVLLEVEG 80 (269) T ss_pred CEEEEEECCCCCCCCCCCEEEEEEE T ss_conf 8199998788887379968999997 No 188 >PRK08385 nicotinate-nucleotide pyrophosphorylase; Provisional Probab=41.75 E-value=27 Score=16.08 Aligned_cols=19 Identities=32% Similarity=0.725 Sum_probs=8.6 Q ss_pred EECCCCCCCCCCCEEEEEE Q ss_conf 9848986836997299995 Q gi|254780267|r 147 INVKDGQSVEYGDALLVLE 165 (171) Q Consensus 147 i~v~~G~~V~~gq~L~~i~ 165 (171) .++.+|+.|+.|+++++++ T Consensus 63 ~~~~dG~~v~~g~~i~~i~ 81 (279) T PRK08385 63 VKKRDGETVKAGEVILELK 81 (279) T ss_pred EECCCCCEECCCCEEEEEE T ss_conf 9928898906998999999 No 189 >COG4942 Membrane-bound metallopeptidase [Cell division and chromosome partitioning] Probab=41.12 E-value=29 Score=15.94 Aligned_cols=40 Identities=20% Similarity=0.321 Sum_probs=25.5 Q ss_pred EEEEECCCEEEEEECCCCCEEEEEECCCCCCCCCCCEEEEEEEC Q ss_conf 89997377268999899819999984898683699729999534 Q gi|254780267|r 124 LLIIEAMKTMNHIVAPCSGKVQDINVKDGQSVEYGDALLVLEKT 167 (171) Q Consensus 124 l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~i~~~ 167 (171) |.||+.-+-.+.|-+.. ..|+|..|+.|..|+|+-.+=.+ T Consensus 354 vvIldhG~gy~slyg~~----~~i~v~~G~~V~AGepIa~~G~s 393 (420) T COG4942 354 VVILDHGGGYHSLYGGN----QSILVNPGQFVKAGEPIALVGSS 393 (420) T ss_pred EEEEECCCCCEEEECCC----CEEEECCCCEEECCCCHHHCCCC T ss_conf 99997488537886166----42210689775569715322677 No 190 >PRK05848 nicotinate-nucleotide pyrophosphorylase; Provisional Probab=39.59 E-value=31 Score=15.79 Aligned_cols=19 Identities=37% Similarity=0.691 Sum_probs=8.7 Q ss_pred EECCCCCCCCCCCEEEEEE Q ss_conf 9848986836997299995 Q gi|254780267|r 147 INVKDGQSVEYGDALLVLE 165 (171) Q Consensus 147 i~v~~G~~V~~gq~L~~i~ 165 (171) .++.+|+.|..|+++++++ T Consensus 61 ~~~~dG~~v~~g~~i~~i~ 79 (272) T PRK05848 61 WTIKDGESFKKGDTLMEIR 79 (272) T ss_pred EEECCCCEECCCCEEEEEE T ss_conf 9908898906999999999 No 191 >pfam00924 MS_channel Mechanosensitive ion channel. Two members of this protein family from M. jannaschii have been functionally characterized. Both proteins form mechanosensitive (MS) ion channels upon reconstitution into liposomes and functional examination by the patch-clamp technique. Therefore this family are likely to also be MS channel proteins. Probab=38.26 E-value=32 Score=15.67 Aligned_cols=33 Identities=12% Similarity=0.228 Sum_probs=14.8 Q ss_pred CCCCCEEECCEEEEEECCCEEEEEECCCCCEEE Q ss_conf 124420104348999737726899989981999 Q gi|254780267|r 113 NKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQ 145 (171) Q Consensus 113 ~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~ 145 (171) ++||.|+-|+.-+.+|.+.+.....-..+|.+. T Consensus 60 ~vGD~I~i~~~~G~V~~I~~r~T~irt~d~~~v 92 (203) T pfam00924 60 KIGDWIEIGDVEGTVEDIGIRSTTIRTFDGRLV 92 (203) T ss_pred ECCCEEEECCEEEEEEEEECCEEEEEECCCCEE T ss_conf 248689999988999997657899993699899 No 192 >cd01572 QPRTase Quinolinate phosphoribosyl transferase (QAPRTase or QPRTase), also called nicotinate-nucleotide pyrophosphorylase, is involved in the de novo synthesis of NAD in both prokaryotes and eukaryotes. It catalyses the reaction of quinolinic acid (QA) with 5-phosphoribosyl-1-pyrophosphate (PRPP) in the presence of Mg2+ to produce nicotinic acid mononucleotide (NAMN), pyrophosphate and carbon dioxide. QPRTase functions as a homodimer with two active sites, each formed by the C-terminal region of one subunit and the N-terminal region of the other. Probab=38.21 E-value=32 Score=15.66 Aligned_cols=22 Identities=41% Similarity=0.498 Sum_probs=10.5 Q ss_pred EEEEECCCCCCCCCCCEEEEEE Q ss_conf 9999848986836997299995 Q gi|254780267|r 144 VQDINVKDGQSVEYGDALLVLE 165 (171) Q Consensus 144 I~~i~v~~G~~V~~gq~L~~i~ 165 (171) -...++++|+.|..|+.++.++ T Consensus 59 ~~~~~~~dG~~v~~g~~i~~i~ 80 (268) T cd01572 59 EVEWLVKDGDRVEPGQVLATVE 80 (268) T ss_pred EEEEECCCCCCCCCCCEEEEEE T ss_conf 9999858898807998999999 No 193 >PRK06096 molybdenum transport protein ModD; Provisional Probab=37.54 E-value=33 Score=15.60 Aligned_cols=20 Identities=30% Similarity=0.448 Sum_probs=10.9 Q ss_pred EEECCCCCCCCCCCEEEEEE Q ss_conf 99848986836997299995 Q gi|254780267|r 146 DINVKDGQSVEYGDALLVLE 165 (171) Q Consensus 146 ~i~v~~G~~V~~gq~L~~i~ 165 (171) ..++.||+.|..||.|++++ T Consensus 64 ~~~~~dG~~v~~G~~i~~i~ 83 (284) T PRK06096 64 DDAVSDGSQANAGQRLIRAQ 83 (284) T ss_pred EEEECCCCCCCCCCEEEEEE T ss_conf 99918898827998999999 No 194 >PRK06543 nicotinate-nucleotide pyrophosphorylase; Provisional Probab=36.39 E-value=34 Score=15.49 Aligned_cols=21 Identities=19% Similarity=0.284 Sum_probs=14.8 Q ss_pred CCEECCCCCEEECCEEEEEEC Q ss_conf 100012442010434899973 Q gi|254780267|r 109 DPFVNKGNLVVEGQTLLIIEA 129 (171) Q Consensus 109 ~~~V~vGd~Vk~G~~l~iiEA 129 (171) ..+++.|+.|++||+++.++. T Consensus 68 ~~~~~dG~~v~~g~~i~~i~G 88 (281) T PRK06543 68 ELAVADGERFEAGDILATVTG 88 (281) T ss_pred EEEECCCCCCCCCCEEEEEEE T ss_conf 998089988179989999973 No 195 >PRK07896 nicotinate-nucleotide pyrophosphorylase; Provisional Probab=36.24 E-value=34 Score=15.47 Aligned_cols=20 Identities=35% Similarity=0.375 Sum_probs=13.4 Q ss_pred CCEECCCCCEEECCEEEEEE Q ss_conf 10001244201043489997 Q gi|254780267|r 109 DPFVNKGNLVVEGQTLLIIE 128 (171) Q Consensus 109 ~~~V~vGd~Vk~G~~l~iiE 128 (171) ..+++.|+.|++||+||-++ T Consensus 78 ~~~~~DG~~v~~G~~i~~i~ 97 (288) T PRK07896 78 LDRVEDGARVPPGQALLTVT 97 (288) T ss_pred EEECCCCCEECCCCEEEEEE T ss_conf 99868898907998999999 No 196 >cd01573 modD_like ModD; Quinolinate phosphoribosyl transferase (QAPRTase or QPRTase) present in some modABC operons in bacteria, which are involved in molybdate transport. In general, QPRTases are part of the de novo synthesis pathway of NAD in both prokaryotes and eukaryotes. They catalyse the reaction of quinolinic acid (QA) with 5-phosphoribosyl-1-pyrophosphate (PRPP) in the presence of Mg2+ to produce nicotinic acid mononucleotide (NAMN), pyrophosphate and carbon dioxide. Probab=35.00 E-value=36 Score=15.35 Aligned_cols=18 Identities=33% Similarity=0.420 Sum_probs=8.1 Q ss_pred ECCCCCCCCCCCEEEEEE Q ss_conf 848986836997299995 Q gi|254780267|r 148 NVKDGQSVEYGDALLVLE 165 (171) Q Consensus 148 ~v~~G~~V~~gq~L~~i~ 165 (171) ++.+|+.|..|+.|++|+ T Consensus 61 ~~~dG~~v~~g~~i~~i~ 78 (272) T cd01573 61 AAASGSRVAAGAVLLEAE 78 (272) T ss_pred ECCCCCEECCCCEEEEEE T ss_conf 948798907998999999 No 197 >PRK09016 quinolinate phosphoribosyltransferase; Validated Probab=34.33 E-value=37 Score=15.28 Aligned_cols=20 Identities=25% Similarity=0.361 Sum_probs=13.4 Q ss_pred CCEECCCCCEEECCEEEEEE Q ss_conf 10001244201043489997 Q gi|254780267|r 109 DPFVNKGNLVVEGQTLLIIE 128 (171) Q Consensus 109 ~~~V~vGd~Vk~G~~l~iiE 128 (171) ..+++.|+.|++||+||-|+ T Consensus 88 ~~~~~DG~~v~~G~~i~~i~ 107 (296) T PRK09016 88 EWHVDDGDKITANQTLCELT 107 (296) T ss_pred EEECCCCCCCCCCCEEEEEE T ss_conf 99879898838999999999 No 198 >PRK08072 nicotinate-nucleotide pyrophosphorylase; Provisional Probab=34.10 E-value=37 Score=15.26 Aligned_cols=19 Identities=21% Similarity=0.461 Sum_probs=12.6 Q ss_pred CEECCCCCEEECCEEEEEE Q ss_conf 0001244201043489997 Q gi|254780267|r 110 PFVNKGNLVVEGQTLLIIE 128 (171) Q Consensus 110 ~~V~vGd~Vk~G~~l~iiE 128 (171) .+++.|+.|++||+++-++ T Consensus 68 ~~~~dG~~v~~g~~i~~i~ 86 (277) T PRK08072 68 LHKKDGDLVEKGEIIATVQ 86 (277) T ss_pred EECCCCCEECCCCEEEEEE T ss_conf 9828998816998689998 No 199 >COG0157 NadC Nicotinate-nucleotide pyrophosphorylase [Coenzyme metabolism] Probab=33.51 E-value=38 Score=15.20 Aligned_cols=19 Identities=37% Similarity=0.531 Sum_probs=13.5 Q ss_pred EECCCCCEEECCEEEEEEC Q ss_conf 0012442010434899973 Q gi|254780267|r 111 FVNKGNLVVEGQTLLIIEA 129 (171) Q Consensus 111 ~V~vGd~Vk~G~~l~iiEA 129 (171) +++.|+.|+.||+|+-++. T Consensus 69 ~~~DG~~v~~g~~i~~~~G 87 (280) T COG0157 69 LVKDGDRVKPGDVLAEIEG 87 (280) T ss_pred ECCCCCEECCCCEEEEEEC T ss_conf 7288888079978999956 No 200 >CHL00075 rpl21 ribosomal protein L21 Probab=32.13 E-value=40 Score=15.06 Aligned_cols=10 Identities=10% Similarity=0.295 Sum_probs=3.3 Q ss_pred CCCCCCCCCE Q ss_conf 9868369972 Q gi|254780267|r 151 DGQSVEYGDA 160 (171) Q Consensus 151 ~G~~V~~gq~ 160 (171) +|+.+..|+| T Consensus 45 ~~~~~~iG~P 54 (110) T CHL00075 45 NESEILIGKP 54 (110) T ss_pred CCCCEEECCC T ss_conf 8997799983 No 201 >PRK06559 nicotinate-nucleotide pyrophosphorylase; Provisional Probab=29.82 E-value=34 Score=15.48 Aligned_cols=26 Identities=8% Similarity=0.014 Sum_probs=18.0 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHCCCE Q ss_conf 98763213989999999999865970 Q gi|254780267|r 1 MTDKKQKINLTLIRNLANILNETNLT 26 (171) Q Consensus 1 M~~~~~~mdi~~Ik~Li~~~~es~l~ 26 (171) |+-....++-..|+++++..=+.|+. T Consensus 1 ~~~~~~~l~~~~i~~~i~~aL~ED~~ 26 (290) T PRK06559 1 MSIISTDLTPFQIDDTLKAALREDVH 26 (290) T ss_pred CCCCCCCCCHHHHHHHHHHHHHCCCC T ss_conf 97645899999999999999972799 No 202 >PRK07428 nicotinate-nucleotide pyrophosphorylase; Provisional Probab=29.81 E-value=44 Score=14.82 Aligned_cols=18 Identities=33% Similarity=0.399 Sum_probs=11.3 Q ss_pred EECCCCCEEECCEEEEEE Q ss_conf 001244201043489997 Q gi|254780267|r 111 FVNKGNLVVEGQTLLIIE 128 (171) Q Consensus 111 ~V~vGd~Vk~G~~l~iiE 128 (171) +++.|+.|++||+|+-++ T Consensus 74 ~~~dG~~v~~g~~i~~i~ 91 (285) T PRK07428 74 TVAEGEACESGQVVAEIE 91 (285) T ss_pred ECCCCCEECCCCEEEEEE T ss_conf 848898534897899999 No 203 >PRK06978 nicotinate-nucleotide pyrophosphorylase; Provisional Probab=27.86 E-value=48 Score=14.62 Aligned_cols=20 Identities=15% Similarity=0.137 Sum_probs=14.2 Q ss_pred CCEECCCCCEEECCEEEEEE Q ss_conf 10001244201043489997 Q gi|254780267|r 109 DPFVNKGNLVVEGQTLLIIE 128 (171) Q Consensus 109 ~~~V~vGd~Vk~G~~l~iiE 128 (171) ..+++.||.|++||++|-++ T Consensus 79 ~~~~~DG~~v~~G~~i~~i~ 98 (288) T PRK06978 79 QWRYREGDRMTADSTVCELE 98 (288) T ss_pred EEECCCCCEECCCCEEEEEE T ss_conf 99838898917999999999 No 204 >pfam00166 Cpn10 Chaperonin 10 Kd subunit. This family contains GroES and Gp31-like chaperonins. Gp31 is a functional co-chaperonin that is required for the folding and assembly of Gp23, a major capsid protein, during phage morphogenesis. Probab=26.86 E-value=45 Score=14.77 Aligned_cols=11 Identities=27% Similarity=0.268 Sum_probs=6.2 Q ss_pred CEEECCEEEEE Q ss_conf 20104348999 Q gi|254780267|r 117 LVVEGQTLLII 127 (171) Q Consensus 117 ~Vk~G~~l~ii 127 (171) .|+.||.+++= T Consensus 56 ~vk~GD~Vl~~ 66 (92) T pfam00166 56 EVKVGDKVLFP 66 (92) T ss_pred CCCCCCEEEEC T ss_conf 55419999987 No 205 >TIGR01000 bacteriocin_acc bacteriocin secretion accessory protein; InterPro: IPR005696 Bacteriocins are antibacterial proteinaceous compounds produced by bacteria. In Gram-positive bacteria, they are divided into four classes. Within the class II, constituted by non-modified peptides produced mainly by lactic acid bacteria, bacteriocins of the subclass IIa, such as mesentericin Y105, are of particular interest. They are active against the foodborne pathogen Listeria monocytogenes and share a similar primary structure, with a conserved N-terminal motif (YGNGV). Subclass IIa bacteriocins induce membrane permeabilization of sensitive strains, but their target specificity and their molecular mode of action remain elusive. This family of proteins is involved in the secretion of such bacteriocins although the mechanism of this process is not well understood. ; GO: 0005215 transporter activity, 0006810 transport. Probab=26.84 E-value=37 Score=15.28 Aligned_cols=24 Identities=13% Similarity=0.122 Sum_probs=17.8 Q ss_pred CCCCCCCEEEECCCEEEEEECCCC Q ss_conf 345466268704625677403676 Q gi|254780267|r 84 LSPDNYHTVTSPMVGTAYLASSPG 107 (171) Q Consensus 84 ~~~~~~~~I~SPmvGtfy~~psP~ 107 (171) ........|+||+-|+.|+.|.-. T Consensus 330 ~~~~~~~~ikAp~dGvLHl~~e~~ 353 (476) T TIGR01000 330 KEDSQKGVIKAPEDGVLHLNPETK 353 (476) T ss_pred HHHCCCCEEEECCCCEEEECCCCC T ss_conf 521225448721363785061444 No 206 >pfam12046 DUF3529 Protein of unknown function (DUF3529). This family of proteins is functionally uncharacterized. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 176 to 190 amino acids in length. Probab=26.29 E-value=31 Score=15.74 Aligned_cols=39 Identities=18% Similarity=0.201 Sum_probs=19.4 Q ss_pred EEEECCCEEEEEECCCC-CCCEECCCCCEEECCEEEEEEC Q ss_conf 68704625677403676-4100012442010434899973 Q gi|254780267|r 91 TVTSPMVGTAYLASSPG-SDPFVNKGNLVVEGQTLLIIEA 129 (171) Q Consensus 91 ~I~SPmvGtfy~~psP~-~~~~V~vGd~Vk~G~~l~iiEA 129 (171) ...||+.|.||.+.+-- +...++.-..-..+...+.++| T Consensus 116 ~lLsPlAG~~Yw~kA~R~E~v~lkl~~~~~~~~s~l~v~a 155 (173) T pfam12046 116 VLLSPLAGLYYWQRASRPEQVELKLVEADDDGGSDLRVRA 155 (173) T ss_pred HHHHHHHHHHHHHCCCCCCEEEEEEEECCCCCCCEEEEEE T ss_conf 9973224442333278750189999846899873699996 No 207 >PRK05742 nicotinate-nucleotide pyrophosphorylase; Provisional Probab=26.28 E-value=51 Score=14.44 Aligned_cols=18 Identities=33% Similarity=0.416 Sum_probs=12.1 Q ss_pred EECCCCCEEECCEEEEEE Q ss_conf 001244201043489997 Q gi|254780267|r 111 FVNKGNLVVEGQTLLIIE 128 (171) Q Consensus 111 ~V~vGd~Vk~G~~l~iiE 128 (171) +++.|+.|++||+++-++ T Consensus 71 ~~~dG~~v~~g~~i~~i~ 88 (277) T PRK05742 71 QVADGDRVEANQVLFHLE 88 (277) T ss_pred EECCCCCCCCCCEEEEEE T ss_conf 838898705886899999 No 208 >TIGR02643 T_phosphoryl thymidine phosphorylase; InterPro: IPR013465 Thymidine phosphorylase (alternate name: pyrimidine phosphorylase) 2.4.2.4 from EC is involved, in E. coli and other Proteobacteria, in (deoxy)nucleotide degradation. It is often encoded in an operon together with a deoxyribose-phosphate aldolase, a phosphopentomutase and a purine nucleoside phosphorylase. In many other lineages, the corresponding enzyme is designated pyrimidine-nucleoside phosphorylase (2.4.2.2 from EC); the naming convention here follows standard literature practice.; GO: 0009032 thymidine phosphorylase activity. Probab=26.02 E-value=25 Score=16.33 Aligned_cols=21 Identities=24% Similarity=0.441 Sum_probs=18.1 Q ss_pred CEECCCCCEEECCEEEEEECC Q ss_conf 000124420104348999737 Q gi|254780267|r 110 PFVNKGNLVVEGQTLLIIEAM 130 (171) Q Consensus 110 ~~V~vGd~Vk~G~~l~iiEAM 130 (171) -+...|++|.+|++|.+|.|- T Consensus 387 ~l~~lG~~V~~~~plA~vHA~ 407 (440) T TIGR02643 387 DLLPLGEKVEKGEPLAVVHAA 407 (440) T ss_pred HHHHHHHHHCCCCCEEEEECC T ss_conf 898764430489806888438 No 209 >KOG3358 consensus Probab=25.92 E-value=47 Score=14.68 Aligned_cols=66 Identities=27% Similarity=0.411 Sum_probs=39.4 Q ss_pred CCCCCCCCCEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEECCCE-----EEEEECCCCCEEEEEEC-CCCCCC Q ss_conf 333454662687046256774036764100012442010434899973772-----68999899819999984-898683 Q gi|254780267|r 82 PLLSPDNYHTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAMKT-----MNHIVAPCSGKVQDINV-KDGQSV 155 (171) Q Consensus 82 ~~~~~~~~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAMKm-----~n~I~a~~~G~I~~i~v-~~G~~V 155 (171) +..++.+.+.+.-|-.|+. |+-||-|+.||++-+ +-|+. -+-..||.+|--+--.+ ++|+ = T Consensus 63 ~~~dD~NSyW~Ik~~~~~~-----------c~rG~pikcG~~iRL-~H~~TgknLHSHhf~sPlSgnqEVSafG~dge-g 129 (211) T KOG3358 63 EGVDDSNSYWRIKPVSGTT-----------CERGDPIKCGQTIRL-THLKTGKNLHSHHFTSPLSGNQEVSAFGEDGE-G 129 (211) T ss_pred CCCCCCCCEEEEECCCCCC-----------CCCCCCCCCCCEEEE-EEEECCCCHHHCCCCCCCCCCEEEEEECCCCC-C T ss_conf 3444676507984477876-----------668983025871787-88633653011145887789815775045578-8 Q ss_pred CCCCE Q ss_conf 69972 Q gi|254780267|r 156 EYGDA 160 (171) Q Consensus 156 ~~gq~ 160 (171) ++|+. T Consensus 130 DtgD~ 134 (211) T KOG3358 130 DTGDH 134 (211) T ss_pred CCCCC T ss_conf 86663 No 210 >PRK10027 cryptic adenine deaminase; Provisional Probab=25.68 E-value=52 Score=14.38 Aligned_cols=22 Identities=27% Similarity=0.341 Sum_probs=9.0 Q ss_pred HHHHHHHHHHHCCCEEEEEEEC Q ss_conf 9999999998659708999509 Q gi|254780267|r 12 LIRNLANILNETNLTEVEIDND 33 (171) Q Consensus 12 ~Ik~Li~~~~es~l~eleie~~ 33 (171) +++.|+.++.+.+...+-+-.+ T Consensus 263 ~l~~l~~~i~~~~~~~~~~~TD 284 (588) T PRK10027 263 NLNALAPLINEFNSPQCMLCTD 284 (588) T ss_pred HHHHHHHHHHCCCCCCEEEECC T ss_conf 4999998862368655379849 No 211 >CHL00117 rpoC2 RNA polymerase beta'' subunit; Reviewed Probab=25.33 E-value=53 Score=14.34 Aligned_cols=17 Identities=24% Similarity=0.280 Sum_probs=7.4 Q ss_pred EECCCCCEEECCEEEEE Q ss_conf 00124420104348999 Q gi|254780267|r 111 FVNKGNLVVEGQTLLII 127 (171) Q Consensus 111 ~V~vGd~Vk~G~~l~ii 127 (171) ||+-|+.|+.+|+|+=+ T Consensus 410 ~V~n~q~V~s~QvIAEi 426 (1350) T CHL00117 410 LVQNDQYVESEQVIAEI 426 (1350) T ss_pred EEECCCEEECCCEEEEE T ss_conf 99899587516189997 No 212 >PRK10929 hypothetical protein; Provisional Probab=25.24 E-value=53 Score=14.33 Aligned_cols=32 Identities=19% Similarity=0.340 Sum_probs=20.2 Q ss_pred ECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCCCCCCCCCE Q ss_conf 0124420104348999737726899989981999998489868369972 Q gi|254780267|r 112 VNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQSVEYGDA 160 (171) Q Consensus 112 V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~V~~gq~ 160 (171) |.+||.|.-|+ .+|+|.+|.+..-..++++-. T Consensus 936 iRiGD~VTi~~-----------------~sGtVskIriRATTi~d~Drk 967 (1109) T PRK10929 936 IRIGDTVTIRD-----------------LTGSVTKINTRATTISDWDRK 967 (1109) T ss_pred CCCCCEEEECC-----------------CCCEEEEEEEEEEEEECCCCC T ss_conf 12476689878-----------------454477788643453447775 No 213 >PRK10377 glucitol/sorbitol-specific PTS system component IIA; Provisional Probab=25.11 E-value=45 Score=14.76 Aligned_cols=24 Identities=17% Similarity=0.239 Sum_probs=16.9 Q ss_pred ECCCEEEEEECCCCCCCEECCCCCEE Q ss_conf 04625677403676410001244201 Q gi|254780267|r 94 SPMVGTAYLASSPGSDPFVNKGNLVV 119 (171) Q Consensus 94 SPmvGtfy~~psP~~~~~V~vGd~Vk 119 (171) +.++|++|.... .+|-+++|++++ T Consensus 94 aelPG~i~v~~~--~~p~i~~G~~i~ 117 (120) T PRK10377 94 AEFPGTVHVAGP--VPDDIAPGSVLK 117 (120) T ss_pred CCCCCEEEECCC--CCCCCCCCCEEE T ss_conf 677867998678--998667888899 No 214 >PRK02173 DNA-directed RNA polymerase subunit beta'; Provisional Probab=24.94 E-value=54 Score=14.29 Aligned_cols=64 Identities=25% Similarity=0.243 Sum_probs=38.6 Q ss_pred EEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEE----ECCCEEEEEECCCCCEEEEEECCCCCCCCCCCEEEE Q ss_conf 6870462567740367641000124420104348999----737726899989981999998489868369972999 Q gi|254780267|r 91 TVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLII----EAMKTMNHIVAPCSGKVQDINVKDGQSVEYGDALLV 163 (171) Q Consensus 91 ~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~ii----EAMKm~n~I~a~~~G~I~~i~v~~G~~V~~gq~L~~ 163 (171) .+.|+.-|++..-.. ++.++.+..|+.+-+. ..-+...+..-|.+ ..+.|.+||.|..||.|.. T Consensus 1165 a~i~~~~g~v~~i~~------~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~i~~~---~~~~v~~gd~v~~G~~l~~ 1232 (1438) T PRK02173 1165 AKISPYYGIITKIKD------LKEKNAANKGFLITIEYKTSKNEKATHIIRVEQS---QKLRVKVGDKVIPGQKLVE 1232 (1438) T ss_pred CEEEEECCEEEEEEC------CCCCCCCCCCCEEEEEEECCCCCCEEEEEECCCC---CEEEECCCCEECCCCCCCC T ss_conf 557741567878630------3666554455137898874789824799975479---8756679999767998478 No 215 >smart00841 Elong-fact-P_C Elongation factor P, C-terminal. These nucleic acid binding domains are predominantly found in elongation factor P, where they adopt an OB-fold, with five beta-strands forming a beta-barrel in a Greek-key topology PUBMED:15210970. Probab=24.86 E-value=27 Score=16.12 Aligned_cols=14 Identities=21% Similarity=0.439 Sum_probs=10.9 Q ss_pred CCEECCCCCEEECC Q ss_conf 10001244201043 Q gi|254780267|r 109 DPFVNKGNLVVEGQ 122 (171) Q Consensus 109 ~~~V~vGd~Vk~G~ 122 (171) |+||+.||+|+... T Consensus 35 P~FI~~Gd~I~IdT 48 (56) T smart00841 35 PLFINEGDKIKVDT 48 (56) T ss_pred CCCCCCCCEEEEEC T ss_conf 87403899999989 No 216 >pfam09285 Elong-fact-P_C Elongation factor P, C-terminal. Members of this family of nucleic acid binding domains are predominantly found in elongation factor P, where they adopt an OB-fold, with five beta-strands forming a beta-barrel in a Greek-key topology. Probab=24.72 E-value=54 Score=14.27 Aligned_cols=25 Identities=12% Similarity=0.187 Sum_probs=16.4 Q ss_pred CCCCCCEEEECCCEEEEEECCCCCCCEECCCCCEEECCE Q ss_conf 454662687046256774036764100012442010434 Q gi|254780267|r 85 SPDNYHTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQT 123 (171) Q Consensus 85 ~~~~~~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~ 123 (171) .-+++..|.- |+||+.||+|+.... T Consensus 25 ~letG~~i~V--------------P~FI~~Gd~I~VdT~ 49 (56) T pfam09285 25 TLETGAEVQV--------------PLFIEEGEKIKVDTR 49 (56) T ss_pred EECCCCEEEC--------------CCCCCCCCEEEEECC T ss_conf 9838989983--------------874048999999899 No 217 >PRK00405 rpoB DNA-directed RNA polymerase subunit beta; Reviewed Probab=24.58 E-value=38 Score=15.18 Aligned_cols=18 Identities=33% Similarity=0.567 Sum_probs=13.4 Q ss_pred CCCEECCCCCEEECCEEE Q ss_conf 410001244201043489 Q gi|254780267|r 108 SDPFVNKGNLVVEGQTLL 125 (171) Q Consensus 108 ~~~~V~vGd~Vk~G~~l~ 125 (171) -.|+|+.||.|++||.++ T Consensus 702 Q~PlV~~~~~v~~~~~ia 719 (1127) T PRK00405 702 QRPIVKVGDRVEKGDVLA 719 (1127) T ss_pred EEEEEECCCEEECCCEEC T ss_conf 403773398883387814 No 218 >TIGR02303 HpaG-C-term 4-hydroxyphenylacetate degradation bifunctional isomerase/decarboxylase, C-terminal subunit; InterPro: IPR012684 This entry represents one of two subunits/domains of the bifunctional isomerase/decarboxylase involved in 4-hydroxyphenylacetate degradation . In Escherichia coli and some other species, this enzyme is encoded by a single polypeptide containing both this domain and the closely related N-terminal domain (IPR012686 from INTERPRO). In other species such as Pasteurella multocida, these domains are found as two separate proteins (usually as tandem genes). Together, these domains carry out the decarboxylation of 5-oxopent-3-ene-1,2,5-tricarboxylic acid (OPET) to 2-hydroxy-2,4-diene-1,7-dioate (HHDD) and the subsequent isomerization to 2-oxohept-3-ene-1,7-dioate (OHED).; GO: 0008704 5-carboxymethyl-2-hydroxymuconate delta-isomerase activity, 0018800 5-oxopent-3-ene-125-tricarboxylate decarboxylase activity. Probab=24.50 E-value=47 Score=14.64 Aligned_cols=23 Identities=39% Similarity=0.408 Sum_probs=14.9 Q ss_pred CEEECCEE-EEEECC-CEEEEEECC Q ss_conf 20104348-999737-726899989 Q gi|254780267|r 117 LVVEGQTL-LIIEAM-KTMNHIVAP 139 (171) Q Consensus 117 ~Vk~G~~l-~iiEAM-Km~n~I~a~ 139 (171) .||.||++ |=||-. +++|+|.++ T Consensus 225 dv~pGD~v~~eiEGvG~L~N~i~~E 249 (249) T TIGR02303 225 DVKPGDVVRLEIEGVGKLENPIVSE 249 (249) T ss_pred CCCCCCEEEEEECCCCCEECCCCCC T ss_conf 2355878999770026210311059 No 219 >TIGR01009 rpsC_bact ribosomal protein S3; InterPro: IPR005704 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites , . About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many of ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome , . Ribosomal protein S3 is one of the proteins from the small ribosomal subunit. This family describes the bacterial type of ribosomal protein S3 and also and many chloroplast forms. Chloroplast and mitochondrial forms have large, variable inserts between conserved N-terminal and C-terminal domains. ; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0015935 small ribosomal subunit. Probab=24.29 E-value=55 Score=14.22 Aligned_cols=31 Identities=32% Similarity=0.582 Sum_probs=24.8 Q ss_pred CHHHHHHHH-HHHH--HCCCEEEEEEECCEE-EEEE Q ss_conf 989999999-9998--659708999509969-9997 Q gi|254780267|r 9 NLTLIRNLA-NILN--ETNLTEVEIDNDGMR-IRLL 40 (171) Q Consensus 9 di~~Ik~Li-~~~~--es~l~eleie~~~~k-I~I~ 40 (171) |+ .|+++| +.++ ..+++++||+...-+ |+|. T Consensus 37 D~-KIR~~i~k~l~~~~a~iS~veIeR~~~~~v~V~ 71 (217) T TIGR01009 37 DL-KIRNLIKKELKKKNAGISDVEIERTADKKVRVT 71 (217) T ss_pred HH-HHHHHHHHHCCCCCCCCCEEEEEECCCCEEEEE T ss_conf 38-999987321133543313279981699788999 No 220 >CHL00037 petA cytochrome f Probab=24.29 E-value=56 Score=14.22 Aligned_cols=43 Identities=26% Similarity=0.369 Sum_probs=25.0 Q ss_pred ECCEEEEEECCCEEEEEE--CCCCCE-EE-------EEECCCCCCCCCCCEEE Q ss_conf 043489997377268999--899819-99-------99848986836997299 Q gi|254780267|r 120 EGQTLLIIEAMKTMNHIV--APCSGK-VQ-------DINVKDGQSVEYGDALL 162 (171) Q Consensus 120 ~G~~l~iiEAMKm~n~I~--a~~~G~-I~-------~i~v~~G~~V~~gq~L~ 162 (171) .|.+--|...-|-..+|. -..+|. +. +++|+.|+.|..||||- T Consensus 211 ~G~i~~I~~~ekgg~~itI~~~~dG~~v~~~iP~Gpel~V~eG~~vk~dqpLT 263 (320) T CHL00037 211 AGIVSKIIRKEKGGYEITIVDASDGRQVVDIIPPGPELLVSEGESIKLDQPLT 263 (320) T ss_pred CCEEEEEEECCCCCEEEEEEECCCCCEEEEECCCCCEEEECCCCEEECCCCCC T ss_conf 72788877657895699998658997888744999758972798885698555 No 221 >PRK11281 potassium efflux protein KefA; Provisional Probab=23.79 E-value=57 Score=14.16 Aligned_cols=32 Identities=25% Similarity=0.447 Sum_probs=18.8 Q ss_pred ECCCCCEEECCEEEEEECCCEEEEEECCCCCEEEEEECCCCCCCCCCCE Q ss_conf 0124420104348999737726899989981999998489868369972 Q gi|254780267|r 112 VNKGNLVVEGQTLLIIEAMKTMNHIVAPCSGKVQDINVKDGQSVEYGDA 160 (171) Q Consensus 112 V~vGd~Vk~G~~l~iiEAMKm~n~I~a~~~G~I~~i~v~~G~~V~~gq~ 160 (171) |.+||.|.-| +.+|+|.+|.+..-..++++-. T Consensus 938 iRiGD~vTi~-----------------~~sGtVskIriRATTi~d~Drk 969 (1107) T PRK11281 938 VRIGDTVTIG-----------------TFSGTVSKIRIRATTITDFDRK 969 (1107) T ss_pred CCCCCEEEEC-----------------CCCCEEEEEEEEEEEEECCCCC T ss_conf 1247668987-----------------8454477788643443447775 No 222 >PRK02693 apocytochrome f; Reviewed Probab=23.73 E-value=57 Score=14.15 Aligned_cols=30 Identities=33% Similarity=0.481 Sum_probs=19.0 Q ss_pred EEEEEECCCCCEEE--------EEECCCCCCCCCCCEEE Q ss_conf 26899989981999--------99848986836997299 Q gi|254780267|r 132 TMNHIVAPCSGKVQ--------DINVKDGQSVEYGDALL 162 (171) Q Consensus 132 m~n~I~a~~~G~I~--------~i~v~~G~~V~~gq~L~ 162 (171) ....|. ..+|... +++|+.||.|..||||- T Consensus 218 ~~v~I~-t~~G~~v~~~iP~Gp~liV~~G~~v~~~qpLT 255 (312) T PRK02693 218 YVVTIT-TEDGEVVTETIPAGPELIVKEGDAVEAGAPLT 255 (312) T ss_pred EEEEEE-CCCCCEEEEECCCCCEEEECCCCEEECCCCCC T ss_conf 799998-89999998852999858972798885698666 No 223 >pfam09160 FimH_man-bind FimH, mannose binding. Members of this family adopt a secondary structure consisting of a beta sandwich, with nine strands arranged in two sheets in a Greek key topology. They are predominantly found in bacterial mannose-specific adhesins, since they are capable of binding to D-mannose. Probab=23.52 E-value=57 Score=14.13 Aligned_cols=35 Identities=20% Similarity=0.348 Sum_probs=26.5 Q ss_pred ECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEECCCEE Q ss_conf 0462567740367641000124420104348999737726 Q gi|254780267|r 94 SPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAMKTM 133 (171) Q Consensus 94 SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAMKm~ 133 (171) -|++=.+|+.|.|.+ =|-.|++||.|+.|+--|.- T Consensus 99 ~P~p~klYLtp~~~a-----gGvvI~~G~liAtl~m~k~~ 133 (150) T pfam09160 99 KPLPVKLYLTPVSGA-----GGVVIKAGELIATLYVHKFN 133 (150) T ss_pred CCCCEEEEEEECCCC-----CEEEEECCCEEEEEEEEEEC T ss_conf 554578999754878-----80899479889999999841 No 224 >cd05794 S1_EF-P_repeat_2 S1_EF-P_repeat_2: Translation elongation factor P (EF-P), S1-like RNA-binding domain, repeat 1. EF-P stimulates the peptidyltransferase activity in the prokaryotic 70S ribosome. EF-P enhances the synthesis of certain dipeptides with N-formylmethionyl-tRNA and puromycine in vitro. EF-P binds to both the 30S and 50S ribosomal subunits. EF-P binds near the streptomycine binding site of the 16S rRNA in the 30S subunit. EF-P interacts with domains 2 and 5 of the 23S rRNA. The L16 ribosomal protein of the 50S or its N-terminal fragment are required for EF-P mediated peptide bond synthesis, whereas L11, L15, and L7/L12 are not required in this reaction, suggesting that EF-P may function at a different ribosomal site than most other translation factors. EF-P is essential for cell viability and is required for protein synthesis. EF-P is mainly present in bacteria. The EF-P homologs in archaea and eukaryotes are the initiation factors aIF5A and eIF5A, respectively. EF-P Probab=23.20 E-value=58 Score=14.09 Aligned_cols=15 Identities=13% Similarity=0.310 Sum_probs=11.3 Q ss_pred CCEECCCCCEEECCE Q ss_conf 100012442010434 Q gi|254780267|r 109 DPFVNKGNLVVEGQT 123 (171) Q Consensus 109 ~~~V~vGd~Vk~G~~ 123 (171) |.||+.||+|+.... T Consensus 35 P~FI~~Gd~I~VdT~ 49 (56) T cd05794 35 PLFIKEGEKIKVDTR 49 (56) T ss_pred CCCCCCCCEEEEECC T ss_conf 874048999999899 No 225 >KOG2420 consensus Probab=21.93 E-value=47 Score=14.63 Aligned_cols=16 Identities=38% Similarity=0.754 Sum_probs=10.6 Q ss_pred EEECCCCCCCCCCCEE Q ss_conf 9984898683699729 Q gi|254780267|r 146 DINVKDGQSVEYGDAL 161 (171) Q Consensus 146 ~i~v~~G~~V~~gq~L 161 (171) +.-+..||.|.+||.| T Consensus 367 kf~~~~gq~vr~ge~l 382 (382) T KOG2420 367 KFDIKAGQKVRVGESL 382 (382) T ss_pred EEEEECCCEEECCCCC T ss_conf 4534138665415569 No 226 >COG4405 Uncharacterized protein conserved in bacteria [Function unknown] Probab=21.88 E-value=61 Score=13.98 Aligned_cols=26 Identities=27% Similarity=0.464 Sum_probs=18.3 Q ss_pred CCEECCCCCE----EECCEEEEEECCCEEE Q ss_conf 1000124420----1043489997377268 Q gi|254780267|r 109 DPFVNKGNLV----VEGQTLLIIEAMKTMN 134 (171) Q Consensus 109 ~~~V~vGd~V----k~G~~l~iiEAMKm~n 134 (171) +||..+|+.+ +.||..|+|+--|++- T Consensus 54 ep~p~vG~~~iv~d~~g~a~C~I~~t~V~~ 83 (140) T COG4405 54 EPYPTVGEFVIVLDGRGQARCVIRLTKVFL 83 (140) T ss_pred CCCCCCCCEEEEECCCCCEEEEEEEEEEEE T ss_conf 678888746999858998778999978999 No 227 >pfam10011 DUF2254 Predicted membrane protein (DUF2254). Members of this family of bacterial proteins comprises various hypothetical and putative membrane proteins. Their exact function, has not, as yet, been defined. Probab=21.84 E-value=62 Score=13.92 Aligned_cols=28 Identities=25% Similarity=0.141 Sum_probs=16.4 Q ss_pred CCCCCEEEEEECCCCCCCCCCCEEEEEE Q ss_conf 8998199999848986836997299995 Q gi|254780267|r 138 APCSGKVQDINVKDGQSVEYGDALLVLE 165 (171) Q Consensus 138 a~~~G~I~~i~v~~G~~V~~gq~L~~i~ 165 (171) |...+.+..+.+..|+.|..|+||+.+- T Consensus 217 A~~~d~~i~l~~~pG~fV~~g~pl~~v~ 244 (369) T pfam10011 217 AEEHDGVIYLLVRPGDFVTPGTPLARVW 244 (369) T ss_pred HHHCCCEEEEEECCCCEECCCCEEEEEE T ss_conf 9987978999827987368997699981 No 228 >cd03079 GST_N_Metaxin2 GST_N family, Metaxin subfamily, Metaxin 2; a metaxin 1 binding protein identified through a yeast two-hybrid system using metaxin 1 as the bait. Metaxin 2 shares sequence similarity with metaxin 1 but does not contain a C-terminal mitochondrial outer membrane signal-anchor domain. It associates with mitochondrial membranes through its interaction with metaxin 1, which is a component of the mitochondrial preprotein import complex of the outer membrane. The biological function of metaxin 2 is unknown. It is likely that it also plays a role in protein translocation into the mitochondria. However, this has not been experimentally validated. In a recent proteomics study, it has been shown that metaxin 2 is overexpressed in response to lipopolysaccharide-induced liver injury. Probab=21.83 E-value=39 Score=15.14 Aligned_cols=18 Identities=39% Similarity=0.870 Sum_probs=12.0 Q ss_pred EEECCCCCCCEECCCCCEE Q ss_conf 7403676410001244201 Q gi|254780267|r 101 YLASSPGSDPFVNKGNLVV 119 (171) Q Consensus 101 y~~psP~~~~~V~vGd~Vk 119 (171) |.||+ |.-||+++|..|= T Consensus 44 ~MSPs-GkvPfi~~g~~vv 61 (74) T cd03079 44 FMSPS-GKVPFIRVGNQIV 61 (74) T ss_pred CCCCC-CCCCEEEECCEEE T ss_conf 11899-8776687578856 No 229 >KOG2362 consensus Probab=21.66 E-value=63 Score=13.90 Aligned_cols=29 Identities=28% Similarity=0.511 Sum_probs=18.6 Q ss_pred CEEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEE Q ss_conf 26870462567740367641000124420104348999 Q gi|254780267|r 90 HTVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLII 127 (171) Q Consensus 90 ~~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~ii 127 (171) ..-.||..|..| -.+++| .++.||++-++ T Consensus 302 ~~~~sp~fGv~~--------~~~~~g-~I~vGd~Vyv~ 330 (336) T KOG2362 302 VHMGSPLFGVYA--------GLVNEG-TIKVGDTVYVL 330 (336) T ss_pred CCCCCCCCCEEE--------CCCCCC-EEEECCEEEEE T ss_conf 347885430342--------354432-58738889997 No 230 >TIGR02828 TIGR02828 putative membrane fusion protein; InterPro: IPR014193 Members of this entry show similarity to those in IPR005695 from INTERPRO, the membrane fusion protein (MFP) cluster 2 family, which is linked to RND transport systems.. Probab=21.18 E-value=44 Score=14.81 Aligned_cols=10 Identities=0% Similarity=0.531 Sum_probs=3.4 Q ss_pred EEEEECCEEE Q ss_conf 9995099699 Q gi|254780267|r 28 VEIDNDGMRI 37 (171) Q Consensus 28 leie~~~~kI 37 (171) |.+++-+..+ T Consensus 85 l~~s~~~~el 94 (197) T TIGR02828 85 LQVSDKDLEL 94 (197) T ss_pred EEECCCCCCC T ss_conf 9762889752 No 231 >PRK13526 glutamine amidotransferase subunit PdxT; Provisional Probab=20.91 E-value=65 Score=13.80 Aligned_cols=21 Identities=29% Similarity=0.331 Sum_probs=14.4 Q ss_pred ECCCEEEEEECCCCCCCEECCCCCEE Q ss_conf 04625677403676410001244201 Q gi|254780267|r 94 SPMVGTAYLASSPGSDPFVNKGNLVV 119 (171) Q Consensus 94 SPmvGtfy~~psP~~~~~V~vGd~Vk 119 (171) .|..++|-||| -+.++|+.|+ T Consensus 120 ~~~~avFIRAP-----~I~~vg~~V~ 140 (179) T PRK13526 120 KNITGVFIRAP-----KFIVVGNQVD 140 (179) T ss_pred CCEEEEEECCC-----CEEECCCCCE T ss_conf 95369998275-----1888089958 No 232 >TIGR00259 TIGR00259 conserved hypothetical protein TIGR00259; InterPro: IPR005137 Photosystem I (PSI) is a large protein complex embedded within the photosynthetic thylakoid membrane. It consists of 11 subunits, ~100 chlorophyll a molecules, 2 phylloquinones, and 3 Fe4S4-clusters. The three dimensional structure of the PSI complex has been resolved at 2.5 A , which allows the precise localisation of each cofactor. PSI together with photosystem II (PSII) catalyses the light-induced steps in oxygenic photosynthesis - a process found in cyanobacteria, eukaryotic algae (e.g. red algae, green algae) and higher plants. To date, three thylakoid proteins involved in the stable accumulation of PSI have been identified: BtpA , Ycf3 , , and Ycf4 (IPR003359 from INTERPRO) . Because translation of the psaA and psaB mRNAs encoding the two reaction centre polypeptides, of PSI and PSII respectively, is not affected in mutant strains lacking functional ycf3 and ycf4, the products of these two genes appear to act at a post-translational step of PSI biosynthesis. These gene products are therefore involved either in the stabilisation or in the assembly of the PSI complex. However, their exact roles remain unknown. The BtpA protein appears to act at the level of PSI stabilisation . It is an extrinsic membrane protein located on the cytoplasmic side of the thylakoid membrane , . Homologs of BtpA are found in the crenarchaeota and euryarchaeota, where their function remains unknown. The Ycf4 protein is firmly associated with the thylakoid membrane, presumably through a transmembrane domain . Ycf4 co-fractionates with a protein complex larger than PSI upon sucrose density gradient centrifugation of solubilised thylakoids . The Ycf3 protein is loosely associated with the thylakoid membrane and can be released from the membrane with sodium carbonate. This suggests that Ycf3 is not part of a stable complex and that it probably interacts transiently with its partners . Ycf3 contains a number of tetratrico peptide repeats (TPR, IPR001440 from INTERPRO); TPR is a structural motif present in a wide range of proteins, which mediates proteinprotein interactions. . Probab=20.38 E-value=50 Score=14.47 Aligned_cols=17 Identities=41% Similarity=0.804 Sum_probs=8.9 Q ss_pred CCCEEEEEECCCCCCCE Q ss_conf 46256774036764100 Q gi|254780267|r 95 PMVGTAYLASSPGSDPF 111 (171) Q Consensus 95 PmvGtfy~~psP~~~~~ 111 (171) |..|.+|+.|=|+++-| T Consensus 3 ~vIg~vHL~pLPGs~~~ 19 (261) T TIGR00259 3 PVIGVVHLLPLPGSPSF 19 (261) T ss_pred CEEEEEECCCCCCCCCC T ss_conf 76899853687639763 No 233 >cd03443 PaaI_thioesterase PaaI_thioesterase is a tetrameric acyl-CoA thioesterase with a hot dog fold and one of several proteins responsible for phenylacetic acid (PA) degradation in bacteria. Although orthologs of PaaI exist in archaea and eukaryotes, their function has not been determined. Sequence similarity between PaaI, E. coli medium chain acyl-CoA thioesterase II, and human thioesterase III suggests they all belong to the same thioesterase superfamily. The conserved fold present in these thioesterases is referred to as an asymmetric hot dog fold, similar to those of 4-hydroxybenzoyl-CoA thioesterase (4HBT) and the beta-hydroxydecanoyl-ACP dehydratases (FabA/FabZ). Probab=20.04 E-value=68 Score=13.69 Aligned_cols=38 Identities=24% Similarity=0.440 Sum_probs=26.8 Q ss_pred EEEECCCEEEEEECCCCCCCEECCCCCEEECCEEEEEECC Q ss_conf 6870462567740367641000124420104348999737 Q gi|254780267|r 91 TVTSPMVGTAYLASSPGSDPFVNKGNLVVEGQTLLIIEAM 130 (171) Q Consensus 91 ~I~SPmvGtfy~~psP~~~~~V~vGd~Vk~G~~l~iiEAM 130 (171) .++.-| -+=|++|.|. .++.-++.-++.|..++++++- T Consensus 58 ~vt~~l-~i~fl~p~~~-~~l~~~a~v~~~g~~~~~~~~~ 95 (113) T cd03443 58 AVTVDL-NVNYLRPARG-GDLTARARVVKLGRRLAVVEVE 95 (113) T ss_pred EEEEEE-EEEEECCCCC-CEEEEEEEEEEECCCEEEEEEE T ss_conf 076215-7999757899-7699999999958889999999 Done!