Query gi|254780386|ref|YP_003064799.1| Type I secretion membrane fusion protein, HlyD [Candidatus Liberibacter asiaticus str. psy62] Match_columns 440 No_of_seqs 809 out of 6523 Neff 10.7 Searched_HMMs 39220 Date Sun May 29 18:57:11 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780386.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 TIGR01843 type_I_hlyD type I s 100.0 0 0 547.4 17.1 426 14-439 1-432 (434) 2 PRK03598 hypothetical protein; 100.0 1.7E-43 0 308.6 17.9 319 16-419 4-328 (331) 3 TIGR01000 bacteriocin_acc bact 100.0 8.3E-43 0 304.0 14.9 417 10-433 14-475 (476) 4 PRK10476 multidrug resistance 100.0 5.6E-42 0 298.5 19.1 318 18-419 14-346 (348) 5 TIGR00998 8a0101 efflux pump m 100.0 2.3E-38 5.9E-43 274.4 11.7 355 16-415 4-379 (379) 6 PRK10559 p-hydroxybenzoic acid 100.0 2.4E-33 6.1E-38 240.9 18.3 266 16-421 8-291 (310) 7 COG1566 EmrA Multidrug resista 100.0 2.6E-33 6.6E-38 240.6 16.4 283 55-421 52-344 (352) 8 PRK11556 multidrug efflux syst 100.0 1.9E-30 4.8E-35 221.5 20.2 271 12-421 2-319 (415) 9 PRK11578 macrolide transporter 100.0 2.6E-30 6.6E-35 220.6 17.6 258 38-421 21-281 (347) 10 PRK09859 multidrug efflux syst 99.9 3.3E-26 8.3E-31 193.3 18.3 236 43-420 49-298 (385) 11 PRK09578 periplasmic multidrug 99.9 7.7E-26 2E-30 190.8 19.8 239 42-420 50-301 (385) 12 TIGR02971 heterocyst_DevB ABC 99.9 4.7E-29 1.2E-33 212.2 3.0 329 44-415 1-363 (363) 13 PRK09783 copper/silver efflux 99.9 6E-24 1.5E-28 178.2 18.1 118 288-421 206-323 (407) 14 TIGR00999 8a0102 Membrane Fusi 99.8 2E-17 5.1E-22 134.6 11.9 117 287-420 102-218 (284) 15 COG0845 AcrA Membrane-fusion p 99.7 4.3E-16 1.1E-20 125.8 17.4 76 34-110 45-121 (372) 16 pfam00529 HlyD HlyD family sec 99.6 9.1E-15 2.3E-19 116.9 10.0 56 56-111 2-57 (304) 17 TIGR01730 RND_mfp efflux trans 99.4 7.6E-12 1.9E-16 97.4 11.1 242 40-421 11-259 (333) 18 PRK11637 hypothetical protein; 97.5 1.3E-05 3.4E-10 55.7 -0.5 21 406-426 382-402 (404) 19 PRK10476 multidrug resistance 97.5 0.0018 4.6E-08 41.5 10.2 123 4-131 3-132 (348) 20 TIGR01843 type_I_hlyD type I s 97.3 6.9E-05 1.8E-09 50.9 1.0 348 11-375 1-388 (434) 21 PRK07051 hypothetical protein; 97.3 0.00069 1.8E-08 44.3 5.6 43 47-89 38-80 (80) 22 PRK08225 acetyl-CoA carboxylas 97.1 0.0013 3.3E-08 42.4 5.6 32 57-88 39-70 (70) 23 PRK05641 putative acetyl-CoA c 97.0 0.0016 4E-08 41.9 5.3 32 289-321 79-110 (148) 24 PRK12999 pyruvate carboxylase; 97.0 0.0013 3.4E-08 42.3 4.9 32 289-321 1077-1108(1147) 25 PRK09282 pyruvate carboxylase 97.0 0.0015 3.9E-08 41.9 5.2 33 288-321 510-542 (580) 26 PRK08225 acetyl-CoA carboxylas 97.0 0.0013 3.3E-08 42.4 4.6 30 291-321 3-32 (70) 27 pfam06898 YqfD Putative stage 96.9 0.0071 1.8E-07 37.5 8.2 27 59-85 192-225 (383) 28 cd06850 biotinyl_domain The bi 96.9 0.0013 3.4E-08 42.3 4.1 31 56-86 36-66 (67) 29 PRK06549 acetyl-CoA carboxylas 96.9 0.0015 3.9E-08 42.0 4.4 31 290-321 64-94 (132) 30 PRK05889 putative acetyl-CoA c 96.9 0.0018 4.5E-08 41.5 4.6 31 57-87 40-70 (71) 31 COG0511 AccB Biotin carboxyl c 96.9 0.0022 5.5E-08 41.0 5.1 33 288-321 69-101 (140) 32 cd06663 Biotinyl_lipoyl_domain 96.9 0.0027 6.8E-08 40.3 5.5 40 48-87 34-73 (73) 33 PRK09824 beta-glucoside-specif 96.8 0.015 3.9E-07 35.3 9.3 15 291-305 478-492 (625) 34 PRK05641 putative acetyl-CoA c 96.8 0.0019 4.9E-08 41.3 4.6 32 58-89 81-112 (148) 35 COG0511 AccB Biotin carboxyl c 96.8 0.0018 4.6E-08 41.5 4.5 32 57-88 71-102 (140) 36 COG4942 Membrane-bound metallo 96.8 3.8E-05 9.6E-10 52.7 -4.2 12 291-302 298-309 (420) 37 PRK05889 putative acetyl-CoA c 96.8 0.0015 3.9E-08 42.0 3.9 30 291-321 4-33 (71) 38 KOG0559 consensus 96.8 0.002 5.1E-08 41.2 4.5 43 48-90 107-149 (457) 39 TIGR03007 pepcterm_ChnLen poly 96.7 0.00022 5.6E-09 47.6 -1.0 60 16-78 11-70 (510) 40 cd06850 biotinyl_domain The bi 96.6 0.0053 1.3E-07 38.4 5.5 29 292-321 2-30 (67) 41 PRK06302 acetyl-CoA carboxylas 96.5 0.0066 1.7E-07 37.7 5.7 33 289-321 79-117 (155) 42 pfam00364 Biotin_lipoyl Biotin 96.5 0.0054 1.4E-07 38.3 5.1 40 48-87 34-73 (73) 43 TIGR02712 urea_carbox urea car 96.5 0.0048 1.2E-07 38.6 4.7 33 288-321 1156-1188(1226) 44 PTZ00144 dihydrolipoamide succ 96.4 0.0082 2.1E-07 37.1 5.6 43 48-90 92-134 (430) 45 PRK06549 acetyl-CoA carboxylas 96.4 0.0069 1.8E-07 37.6 5.2 36 54-89 61-96 (132) 46 TIGR02876 spore_yqfD sporulati 96.3 0.0035 8.9E-08 39.6 3.0 32 56-87 191-229 (406) 47 cd06849 lipoyl_domain Lipoyl d 96.3 0.0092 2.3E-07 36.8 5.0 39 49-87 36-74 (74) 48 PTZ00144 dihydrolipoamide succ 96.2 0.0079 2E-07 37.2 4.4 40 53-92 54-99 (430) 49 PRK05704 dihydrolipoamide succ 96.2 0.013 3.2E-07 35.8 5.5 43 48-90 37-79 (406) 50 TIGR01347 sucB 2-oxoglutarate 96.2 0.0046 1.2E-07 38.8 3.1 41 49-89 36-76 (435) 51 TIGR01235 pyruv_carbox pyruvat 96.2 0.0052 1.3E-07 38.4 3.4 43 287-334 1098-1140(1169) 52 TIGR02169 SMC_prok_A chromosom 96.1 9.2E-05 2.4E-09 50.1 -5.9 25 340-368 1073-1097(1202) 53 COG4770 Acetyl/propionyl-CoA c 96.1 0.0092 2.4E-07 36.7 4.3 33 288-321 574-606 (645) 54 PRK11855 dihydrolipoamide acet 96.0 0.016 4E-07 35.2 5.4 42 49-90 38-79 (549) 55 COG0508 AceF Pyruvate/2-oxoglu 96.0 0.017 4.2E-07 35.1 5.4 44 48-91 37-80 (404) 56 PRK10255 N-acetyl glucosamine 96.0 0.022 5.5E-07 34.3 5.9 16 290-305 499-514 (648) 57 PRK11856 branched-chain alpha- 96.0 0.019 5E-07 34.6 5.7 41 48-88 36-76 (324) 58 TIGR01108 oadA oxaloacetate de 96.0 0.0068 1.7E-07 37.6 3.3 33 288-321 550-582 (616) 59 COG1038 PycA Pyruvate carboxyl 96.0 0.0075 1.9E-07 37.3 3.4 33 288-321 1078-1110(1149) 60 pfam00529 HlyD HlyD family sec 95.9 0.00084 2.1E-08 43.7 -1.6 110 299-412 182-303 (304) 61 PRK09439 glucose-specific PTS 95.9 0.034 8.6E-07 33.0 6.4 18 288-305 19-36 (169) 62 PRK11854 aceF dihydrolipoamide 95.6 0.024 6.2E-07 33.9 4.8 42 49-90 36-77 (630) 63 TIGR01348 PDHac_trf_long pyruv 95.5 0.021 5.3E-07 34.4 4.3 43 49-91 36-78 (655) 64 PRK09783 copper/silver efflux 95.5 0.026 6.8E-07 33.7 4.7 56 39-94 103-160 (407) 65 TIGR01348 PDHac_trf_long pyruv 95.5 0.028 7E-07 33.6 4.7 28 62-89 12-39 (655) 66 cd06663 Biotinyl_lipoyl_domain 95.4 0.017 4.3E-07 35.0 3.5 29 61-89 10-38 (73) 67 PRK05704 dihydrolipoamide succ 95.3 0.018 4.5E-07 34.9 3.3 31 62-92 14-44 (406) 68 COG0508 AceF Pyruvate/2-oxoglu 95.1 0.019 4.9E-07 34.6 3.1 31 62-92 14-44 (404) 69 TIGR01995 PTS-II-ABC-beta PTS 95.1 0.029 7.4E-07 33.4 4.0 19 287-305 507-525 (660) 70 TIGR00998 8a0101 efflux pump m 95.1 0.028 7.1E-07 33.5 3.8 119 13-142 4-140 (379) 71 PRK10559 p-hydroxybenzoic acid 95.1 0.038 9.6E-07 32.7 4.5 80 20-110 18-108 (310) 72 COG0845 AcrA Membrane-fusion p 95.1 0.029 7.5E-07 33.4 4.0 129 279-421 167-297 (372) 73 PRK11854 aceF dihydrolipoamide 95.1 0.056 1.4E-06 31.5 5.3 41 50-90 140-180 (630) 74 COG3608 Predicted deacylase [G 95.0 0.087 2.2E-06 30.3 6.2 34 289-324 256-289 (331) 75 PRK09510 tolA cell envelope in 95.0 0.17 4.4E-06 28.3 7.6 43 24-80 21-63 (402) 76 PRK11855 dihydrolipoamide acet 95.0 0.055 1.4E-06 31.6 5.0 42 49-90 153-194 (549) 77 PRK11637 hypothetical protein; 95.0 0.0018 4.5E-08 41.6 -2.6 12 362-373 321-332 (404) 78 cd06849 lipoyl_domain Lipoyl d 94.9 0.031 7.8E-07 33.3 3.6 29 61-89 11-39 (74) 79 COG1038 PycA Pyruvate carboxyl 94.8 0.036 9.2E-07 32.8 3.8 16 73-88 679-694 (1149) 80 PRK11892 pyruvate dehydrogenas 94.8 0.023 5.9E-07 34.1 2.8 29 62-90 14-42 (464) 81 KOG0238 consensus 94.7 0.025 6.3E-07 33.9 2.7 32 290-322 602-633 (670) 82 cd00210 PTS_IIA_glc PTS_IIA, P 94.7 0.17 4.4E-06 28.3 7.0 19 70-88 84-102 (124) 83 KOG0369 consensus 94.7 0.03 7.6E-07 33.4 3.0 33 288-321 1105-1137(1176) 84 PRK03598 hypothetical protein; 94.6 0.048 1.2E-06 31.9 4.0 89 10-110 2-104 (331) 85 TIGR01349 PDHac_trf_mito pyruv 94.6 0.031 7.9E-07 33.2 3.0 27 63-89 12-38 (584) 86 pfam00358 PTS_EIIA_1 phosphoen 94.6 0.21 5.2E-06 27.7 7.1 17 289-305 3-19 (133) 87 PRK11856 branched-chain alpha- 94.5 0.041 1.1E-06 32.4 3.5 31 61-91 12-42 (324) 88 cd06250 M14_PaAOTO_like An unc 94.3 0.078 2E-06 30.6 4.6 30 291-322 291-320 (359) 89 TIGR02168 SMC_prok_B chromosom 94.1 0.00071 1.8E-08 44.2 -6.2 198 91-288 295-497 (1191) 90 COG4770 Acetyl/propionyl-CoA c 94.1 0.093 2.4E-06 30.1 4.5 10 6-15 290-299 (645) 91 PRK09039 hypothetical protein; 94.1 0.033 8.5E-07 33.0 2.2 43 94-136 53-95 (343) 92 TIGR00830 PTBA PTS system, glu 94.0 0.058 1.5E-06 31.4 3.3 36 58-93 76-115 (129) 93 PRK07051 hypothetical protein; 93.6 0.032 8.2E-07 33.1 1.5 34 288-321 2-41 (80) 94 TIGR02712 urea_carbox urea car 92.8 0.039 9.9E-07 32.6 0.9 41 280-321 1185-1225(1226) 95 cd06252 M14_ASTE_ASPA_like_2 A 92.4 0.46 1.2E-05 25.4 5.9 30 291-322 246-275 (316) 96 pfam00364 Biotin_lipoyl Biotin 92.2 0.087 2.2E-06 30.2 2.0 21 69-89 18-38 (73) 97 cd06254 M14_ASTE_ASPA_like_4 A 92.1 0.36 9.2E-06 26.1 5.1 31 290-322 224-254 (288) 98 COG1566 EmrA Multidrug resista 92.0 0.23 5.7E-06 27.5 3.9 65 68-132 71-137 (352) 99 cd06253 M14_ASTE_ASPA_like_3 A 91.9 0.45 1.1E-05 25.5 5.4 31 290-322 230-260 (298) 100 pfam05896 NQRA Na(+)-transloca 91.8 0.14 3.5E-06 28.9 2.7 27 61-87 34-60 (257) 101 cd06255 M14_ASTE_ASPA_like_5 A 91.6 0.32 8.2E-06 26.5 4.4 30 291-322 233-262 (293) 102 COG2190 NagE Phosphotransferas 91.6 0.26 6.7E-06 27.0 3.9 18 71-88 92-109 (156) 103 TIGR02063 RNase_R ribonuclease 91.6 0.36 9.3E-06 26.1 4.6 53 318-370 167-223 (755) 104 pfam09726 Macoilin Transmembra 91.6 0.0074 1.9E-07 37.4 -4.0 24 98-121 410-433 (680) 105 PRK05352 Na(+)-translocating N 91.5 0.095 2.4E-06 30.0 1.6 28 61-88 35-62 (448) 106 KOG1029 consensus 91.2 0.012 3.1E-07 36.0 -3.2 10 333-342 741-750 (1118) 107 KOG0161 consensus 91.0 0.0096 2.5E-07 36.6 -3.9 186 92-277 990-1184(1930) 108 PRK11642 exoribonuclease R; Pr 91.0 0.57 1.5E-05 24.8 5.1 75 290-367 641-724 (813) 109 cd06251 M14_ASTE_ASPA_like_1 A 91.0 0.59 1.5E-05 24.7 5.2 31 290-322 220-250 (287) 110 COG0539 RpsA Ribosomal protein 90.6 1.1 2.9E-05 22.8 8.2 12 404-415 486-497 (541) 111 pfam07888 CALCOCO1 Calcium bin 90.5 0.052 1.3E-06 31.7 -0.5 15 3-17 25-39 (546) 112 TIGR01347 sucB 2-oxoglutarate 90.4 0.067 1.7E-06 31.0 -0.1 29 61-89 11-39 (435) 113 PRK09039 hypothetical protein; 90.3 0.18 4.6E-06 28.1 2.1 25 105-129 78-102 (343) 114 TIGR00717 rpsA ribosomal prote 90.0 1.3 3.2E-05 22.5 8.2 114 283-415 191-343 (534) 115 pfam01551 Peptidase_M23 Peptid 90.0 0.8 2E-05 23.8 5.2 51 289-349 13-64 (96) 116 PRK10871 nlpD lipoprotein NlpD 89.9 0.32 8.2E-06 26.4 3.1 52 290-349 285-336 (374) 117 KOG0459 consensus 89.9 1.3 3.3E-05 22.4 7.8 69 298-371 320-389 (501) 118 TIGR02645 ARCH_P_rylase putati 89.7 0.21 5.4E-06 27.6 2.1 36 289-324 419-477 (499) 119 PRK13729 conjugal transfer pil 89.3 0.6 1.5E-05 24.7 4.1 40 87-126 62-101 (474) 120 PRK05305 phosphatidylserine de 89.2 0.43 1.1E-05 25.6 3.3 42 305-349 161-202 (214) 121 KOG0369 consensus 89.1 0.48 1.2E-05 25.3 3.5 10 340-349 1126-1135(1176) 122 TIGR02645 ARCH_P_rylase putati 88.4 0.15 3.8E-06 28.7 0.5 20 69-88 456-475 (499) 123 KOG0559 consensus 88.4 0.27 6.8E-06 27.0 1.8 53 29-89 59-111 (457) 124 PRK05054 exoribonuclease II; P 88.3 1.3 3.4E-05 22.3 5.4 16 296-313 565-580 (644) 125 KOG0250 consensus 88.3 0.012 3.1E-07 35.9 -5.1 10 37-46 109-118 (1074) 126 COG3883 Uncharacterized protei 88.1 0.21 5.3E-06 27.7 1.1 15 18-32 3-17 (265) 127 PRK11649 hypothetical protein; 88.0 0.96 2.4E-05 23.3 4.5 51 290-349 303-353 (418) 128 PRK07899 rpsA 30S ribosomal pr 87.4 1.9 4.8E-05 21.3 8.0 50 35-84 28-86 (484) 129 COG3064 TolA Membrane protein 86.6 2.1 5.4E-05 21.0 6.5 49 17-79 12-62 (387) 130 COG4372 Uncharacterized protei 86.3 0.026 6.7E-07 33.7 -4.3 24 405-428 434-458 (499) 131 COG4372 Uncharacterized protei 85.7 0.013 3.4E-07 35.7 -6.1 36 102-137 96-131 (499) 132 KOG0250 consensus 85.6 0.11 2.9E-06 29.5 -1.4 10 292-301 495-504 (1074) 133 KOG0977 consensus 85.5 0.17 4.5E-06 28.2 -0.5 25 230-254 308-332 (546) 134 KOG0368 consensus 85.3 1.4 3.5E-05 22.2 4.1 28 59-87 725-752 (2196) 135 COG1196 Smc Chromosome segrega 85.2 0.016 4.1E-07 35.1 -5.9 202 87-288 225-428 (1163) 136 TIGR02828 TIGR02828 putative m 84.8 0.99 2.5E-05 23.2 3.2 59 309-372 41-103 (197) 137 PRK09578 periplasmic multidrug 84.1 0.42 1.1E-05 25.7 1.0 31 323-354 268-298 (385) 138 PRK09859 multidrug efflux syst 84.0 0.37 9.5E-06 26.0 0.7 57 53-114 69-126 (385) 139 COG1726 NqrA Na+-transporting 83.3 0.65 1.6E-05 24.4 1.7 24 65-88 38-61 (447) 140 TIGR02386 rpoC_TIGR DNA-direct 83.2 1.8 4.6E-05 21.5 3.9 47 42-90 1159-1211(1552) 141 KOG1853 consensus 82.3 0.86 2.2E-05 23.6 2.0 23 275-297 175-197 (333) 142 PRK04950 putative solute/DNA c 82.0 3.3 8.3E-05 19.7 5.2 49 334-414 158-206 (212) 143 pfam00261 Tropomyosin Tropomyo 82.0 0.026 6.6E-07 33.7 -5.9 23 95-117 9-31 (237) 144 pfam01333 Apocytochr_F_C Apocy 81.8 2 5.1E-05 21.1 3.7 22 289-310 2-23 (118) 145 TIGR02644 Y_phosphoryl pyrimid 81.8 1.6 4.2E-05 21.7 3.3 36 288-323 340-405 (425) 146 pfam00261 Tropomyosin Tropomyo 81.6 0.035 8.9E-07 32.9 -5.4 47 93-139 14-60 (237) 147 KOG0557 consensus 81.3 1.4 3.6E-05 22.2 2.8 29 63-91 51-79 (470) 148 PRK00087 4-hydroxy-3-methylbut 81.1 3.5 9E-05 19.5 8.3 114 284-416 472-621 (670) 149 PRK10361 DNA recombination pro 81.0 1.1 2.9E-05 22.8 2.2 29 309-339 236-264 (475) 150 KOG0980 consensus 81.0 0.051 1.3E-06 31.8 -4.7 194 90-287 354-549 (980) 151 PRK10676 DNA-binding transcrip 80.8 3.6 9.2E-05 19.5 8.7 51 361-417 201-253 (262) 152 TIGR01936 nqrA NADH:ubiquinone 80.7 0.86 2.2E-05 23.6 1.5 33 55-87 28-60 (466) 153 TIGR03309 matur_yqeB selenium- 80.6 2.3 5.8E-05 20.8 3.6 55 289-349 164-218 (256) 154 PRK12269 bifunctional cytidyla 80.2 3.8 9.6E-05 19.3 9.1 114 291-416 492-635 (863) 155 pfam07831 PYNP_C Pyrimidine nu 79.9 1.4 3.6E-05 22.2 2.4 27 61-89 29-55 (75) 156 KOG1029 consensus 79.7 0.17 4.3E-06 28.3 -2.4 30 341-370 711-742 (1118) 157 PRK13806 rpsA 30S ribosomal pr 79.5 4 0.0001 19.2 8.0 110 289-416 288-434 (489) 158 PRK04350 thymidine phosphoryla 78.8 2 5E-05 21.2 2.9 20 288-307 416-435 (502) 159 COG1579 Zn-ribbon protein, pos 78.6 0.41 1E-05 25.7 -0.7 18 332-349 205-222 (239) 160 PRK11281 potassium efflux prot 78.2 0.06 1.5E-06 31.3 -5.0 10 103-112 94-103 (1107) 161 TIGR02927 SucB_Actino 2-oxoglu 78.1 3 7.7E-05 20.0 3.6 32 58-89 180-211 (607) 162 COG1579 Zn-ribbon protein, pos 78.0 0.36 9.2E-06 26.1 -1.1 46 99-144 15-60 (239) 163 TIGR01945 rnfC electron transp 77.8 1 2.6E-05 23.1 1.1 25 65-89 40-64 (444) 164 COG0739 NlpD Membrane proteins 77.4 3.3 8.5E-05 19.7 3.7 54 288-349 172-225 (277) 165 KOG0558 consensus 77.0 2.2 5.6E-05 20.9 2.7 43 49-91 100-142 (474) 166 PRK06078 pyrimidine-nucleoside 76.9 2.4 6.1E-05 20.6 2.8 24 287-310 333-356 (434) 167 PRK05820 deoA thymidine phosph 76.4 2.6 6.6E-05 20.4 2.9 23 288-310 339-361 (440) 168 pfam01576 Myosin_tail_1 Myosin 76.1 0.1 2.6E-06 29.8 -4.4 190 91-280 135-326 (859) 169 PRK06106 nicotinate-nucleotide 75.6 4 0.0001 19.1 3.7 27 64-90 66-92 (281) 170 COG0157 NadC Nicotinate-nucleo 75.0 4.2 0.00011 19.0 3.7 25 66-90 64-88 (280) 171 TIGR00078 nadC nicotinate-nucl 74.7 3.6 9.3E-05 19.4 3.3 26 65-90 59-84 (276) 172 KOG0994 consensus 74.6 0.043 1.1E-06 32.3 -6.6 15 97-111 1235-1249(1758) 173 pfam10174 Cast RIM-binding pro 73.4 0.23 5.8E-06 27.5 -3.1 175 91-265 235-411 (774) 174 pfam02749 QRPTase_N Quinolinat 73.4 4.4 0.00011 18.9 3.5 23 68-90 47-69 (88) 175 pfam01601 Corona_S2 Coronaviru 73.4 3.5 9E-05 19.5 3.0 13 310-322 371-383 (609) 176 PRK09835 sensor kinase CusS; P 72.8 5.3 0.00013 18.3 3.7 29 5-33 2-30 (482) 177 TIGR01107 Na_K_ATPase_bet Na+/ 72.6 4 0.0001 19.1 3.1 42 2-43 18-69 (317) 178 pfam12128 DUF3584 Protein of u 72.6 0.13 3.3E-06 29.1 -4.5 23 90-112 601-623 (1192) 179 PRK06299 rpsA 30S ribosomal pr 72.6 6 0.00015 18.0 8.4 37 44-80 22-70 (556) 180 pfam02666 PS_Dcarbxylase Phosp 72.2 4.3 0.00011 18.9 3.2 22 291-313 83-104 (201) 181 TIGR03327 AMP_phos AMP phospho 72.1 3.8 9.7E-05 19.3 2.9 35 288-322 413-470 (500) 182 PRK06978 nicotinate-nucleotide 71.3 5.9 0.00015 18.0 3.7 27 64-90 74-100 (288) 183 cd04089 eRF3_II eRF3_II: domai 70.9 6.1 0.00016 17.9 3.7 25 339-371 23-47 (82) 184 PRK12784 hypothetical protein; 70.3 6.7 0.00017 17.6 4.3 36 291-327 7-42 (84) 185 KOG0971 consensus 69.7 0.59 1.5E-05 24.7 -1.7 21 91-111 228-248 (1243) 186 TIGR02161 napC_nirT periplasmi 69.2 5.3 0.00014 18.3 3.1 32 5-38 5-36 (185) 187 COG3402 Uncharacterized conser 69.0 7.1 0.00018 17.5 4.0 15 2-16 3-17 (161) 188 pfam07076 DUF1344 Protein of u 68.9 4.2 0.00011 19.0 2.5 50 362-420 24-75 (81) 189 PRK02693 apocytochrome f; Revi 68.8 6.6 0.00017 17.7 3.5 24 286-309 193-216 (312) 190 COG4656 RnfC Predicted NADH:ub 68.7 2.8 7.1E-05 20.2 1.6 25 65-89 42-66 (529) 191 TIGR02013 rpoB DNA-directed RN 68.7 2.8 7.1E-05 20.2 1.6 66 307-397 1142-1207(1449) 192 cd03698 eRF3_II_like eRF3_II_l 68.5 7 0.00018 17.5 3.6 42 310-352 25-67 (83) 193 PRK09016 quinolinate phosphori 68.2 6 0.00015 18.0 3.2 26 65-90 84-109 (296) 194 KOG2735 consensus 68.2 6.1 0.00016 17.9 3.3 38 3-41 110-147 (466) 195 PRK06748 hypothetical protein; 68.1 7.5 0.00019 17.3 4.2 35 291-326 7-41 (84) 196 KOG0933 consensus 67.3 0.065 1.7E-06 31.1 -7.0 41 74-114 663-704 (1174) 197 pfam04225 OapA Opacity-associa 67.2 1.4 3.6E-05 22.2 -0.2 44 311-354 6-55 (85) 198 PRK10617 cytochrome c-type pro 66.9 4.3 0.00011 18.9 2.3 32 6-39 15-46 (200) 199 pfam10481 Cenp-F_N Cenp-F N-te 66.9 2.5 6.3E-05 20.6 1.0 19 93-111 17-35 (288) 200 PRK09629 bifunctional thiosulf 66.7 5.5 0.00014 18.2 2.8 80 290-376 406-496 (610) 201 PRK06543 nicotinate-nucleotide 66.7 6.7 0.00017 17.6 3.2 28 63-90 62-89 (281) 202 PRK07896 nicotinate-nucleotide 65.6 8.3 0.00021 17.0 4.0 80 11-90 8-99 (288) 203 pfam04111 APG6 Autophagy prote 65.5 6.8 0.00017 17.6 3.0 14 296-309 166-179 (356) 204 pfam05557 MAD Mitotic checkpoi 65.5 5 0.00013 18.5 2.4 30 265-294 511-540 (722) 205 pfam08206 OB_RNB Ribonuclease 64.8 8.6 0.00022 16.9 3.7 45 322-370 14-58 (58) 206 pfam00038 Filament Intermediat 64.5 5.5 0.00014 18.2 2.4 29 93-121 60-88 (312) 207 pfam05667 DUF812 Protein of un 64.2 0.16 4E-06 28.5 -5.5 20 271-290 454-473 (536) 208 cd03695 CysN_NodQ_II CysN_NodQ 64.2 8.8 0.00022 16.9 4.2 41 312-353 27-68 (81) 209 cd01568 QPRTase_NadC Quinolina 63.3 3.7 9.4E-05 19.4 1.4 28 63-90 54-81 (269) 210 COG0213 DeoA Thymidine phospho 63.1 6.9 0.00018 17.6 2.7 22 288-309 335-356 (435) 211 pfam06447 consensus 63.0 9.3 0.00024 16.7 3.6 33 406-439 300-332 (407) 212 cd01573 modD_like ModD; Quinol 62.1 8.4 0.00021 17.0 3.0 22 69-90 59-80 (272) 213 KOG0976 consensus 61.7 0.31 7.8E-06 26.6 -4.4 11 412-422 887-897 (1265) 214 PRK05035 electron transport co 61.6 4.5 0.00012 18.8 1.6 26 64-89 51-76 (725) 215 cd01572 QPRTase Quinolinate ph 61.4 8.5 0.00022 16.9 2.9 27 64-90 56-82 (268) 216 PRK08072 nicotinate-nucleotide 61.3 9.9 0.00025 16.5 3.7 27 64-90 62-88 (277) 217 PRK06096 molybdenum transport 60.8 7.9 0.0002 17.2 2.7 25 66-90 61-85 (284) 218 PRK10803 hypothetical protein; 60.7 10 0.00026 16.4 4.6 16 28-43 16-31 (262) 219 pfam04952 AstE_AspA Succinylgl 60.5 10 0.00026 16.4 3.9 29 290-320 218-246 (289) 220 PRK07428 nicotinate-nucleotide 59.9 4.9 0.00012 18.6 1.5 22 69-90 72-93 (285) 221 TIGR02162 torC trimethylamine- 59.2 2.4 6.2E-05 20.6 -0.2 33 5-38 3-35 (394) 222 PRK05848 nicotinate-nucleotide 58.9 9.7 0.00025 16.6 2.9 23 68-90 59-81 (272) 223 PRK00809 hypothetical protein; 58.6 11 0.00028 16.2 5.6 37 335-373 28-64 (144) 224 KOG0804 consensus 58.6 10 0.00025 16.5 2.9 32 58-90 291-323 (493) 225 COG1463 Ttg2C ABC-type transpo 58.0 11 0.00029 16.1 6.0 23 15-37 8-30 (359) 226 CHL00117 rpoC2 RNA polymerase 58.0 11 0.00029 16.1 5.3 20 69-88 408-427 (1350) 227 cd03693 EF1_alpha_II EF1_alpha 57.3 12 0.00029 16.1 4.3 25 339-371 28-52 (91) 228 KOG2264 consensus 56.9 12 0.0003 16.0 6.1 51 99-149 91-141 (907) 229 pfam10574 UPF0552 Uncharacteri 56.8 12 0.0003 16.0 5.5 43 328-375 136-180 (224) 230 PRK13454 F0F1 ATP synthase sub 56.5 12 0.00029 16.1 2.9 31 7-39 23-54 (181) 231 pfam11488 Lge1 Transcriptional 56.4 12 0.0003 16.0 2.9 31 84-114 167-197 (234) 232 TIGR00527 gcvH glycine cleavag 56.2 5.3 0.00014 18.3 1.2 12 360-371 69-80 (132) 233 pfam09891 DUF2118 Uncharacteri 56.1 12 0.00031 15.9 5.6 37 308-348 71-107 (149) 234 PRK03481 consensus 56.1 9.2 0.00023 16.7 2.3 26 291-317 145-170 (322) 235 PRK08385 nicotinate-nucleotide 56.0 12 0.00031 15.9 3.4 25 66-90 59-83 (279) 236 KOG4360 consensus 55.1 11 0.00027 16.3 2.5 25 178-202 277-301 (596) 237 PRK01734 consensus 54.8 7.7 0.0002 17.3 1.8 24 291-315 155-178 (294) 238 pfam05103 DivIVA DivIVA protei 54.7 8.9 0.00023 16.8 2.1 36 87-122 18-53 (131) 239 TIGR02097 yccV hemimethylated 54.2 13 0.00033 15.7 3.3 48 341-396 3-61 (103) 240 TIGR00008 infA translation ini 54.0 13 0.00033 15.7 3.9 41 324-369 24-67 (69) 241 KOG0963 consensus 53.7 0.59 1.5E-05 24.7 -4.1 22 279-300 426-447 (629) 242 pfam11119 DUF2633 Protein of u 53.3 13 0.00032 15.8 2.7 30 10-39 3-32 (59) 243 KOG3814 consensus 53.3 13 0.00034 15.6 3.0 35 4-38 176-210 (531) 244 PRK04722 consensus 53.2 13 0.00034 15.6 3.8 22 291-313 142-163 (280) 245 CHL00037 petA cytochrome f 52.9 14 0.00035 15.6 3.7 25 286-310 200-224 (320) 246 KOG4674 consensus 52.6 0.5 1.3E-05 25.2 -4.6 15 406-420 1745-1759(1822) 247 COG3061 OapA Cell envelope opa 51.3 12 0.0003 16.0 2.3 53 310-371 162-220 (242) 248 COG3206 GumC Uncharacterized p 50.7 15 0.00037 15.4 5.5 34 18-51 26-59 (458) 249 pfam11623 DUF3252 Protein of u 50.7 15 0.00037 15.4 2.9 29 342-373 2-30 (53) 250 smart00787 Spc7 Spc7 kinetocho 50.4 6.9 0.00018 17.6 1.0 10 72-81 49-58 (312) 251 PRK06148 hypothetical protein; 49.6 15 0.00039 15.3 3.0 60 288-349 452-514 (1015) 252 KOG0018 consensus 49.6 2.1 5.2E-05 21.1 -1.8 10 348-357 580-589 (1141) 253 PRK05742 nicotinate-nucleotide 49.6 13 0.00034 15.7 2.3 25 66-90 66-90 (277) 254 PRK00044 psd phosphatidylserin 49.6 12 0.00032 15.9 2.2 24 291-315 145-168 (291) 255 pfam10168 Nup88 Nuclear pore c 49.5 8.4 0.00021 17.0 1.3 23 42-64 457-479 (717) 256 TIGR01394 TypA_BipA GTP-bindin 49.0 16 0.0004 15.2 4.9 52 35-86 223-284 (609) 257 PRK06559 nicotinate-nucleotide 48.9 14 0.00035 15.5 2.3 21 70-90 77-97 (290) 258 PRK03934 phosphatidylserine de 48.8 12 0.00031 15.9 2.0 21 291-312 131-151 (267) 259 pfam05911 DUF869 Plant protein 48.8 3 7.7E-05 20.0 -1.1 16 273-288 295-310 (767) 260 pfam05545 FixQ Cbb3-type cytoc 48.7 16 0.0004 15.2 2.7 20 18-37 12-31 (49) 261 COG3785 Uncharacterized conser 48.7 15 0.00038 15.3 2.5 50 339-396 12-71 (116) 262 pfam12389 Peptidase_M73 Camely 48.4 16 0.0004 15.2 2.6 51 24-81 18-70 (199) 263 CHL00071 tufA elongation facto 48.3 16 0.00041 15.1 9.6 58 295-355 232-292 (409) 264 PRK09838 periplasmic copper-bi 48.2 16 0.00041 15.1 4.6 56 296-354 44-100 (114) 265 TIGR00922 nusG transcription t 47.9 16 0.00041 15.1 2.5 27 341-372 139-165 (193) 266 pfam04849 HAP1_N HAP1 N-termin 47.8 14 0.00035 15.6 2.2 16 86-101 61-77 (307) 267 KOG1899 consensus 47.7 1.6 4.1E-05 21.8 -2.6 11 362-372 418-428 (861) 268 pfam04156 IncA IncA protein. C 47.4 16 0.00042 15.0 3.7 33 92-124 84-116 (186) 269 cd04467 S1_aIF5A S1_aIF5A: Arc 46.9 17 0.00043 15.0 2.7 22 330-351 27-48 (57) 270 KOG0964 consensus 46.1 1.1 2.8E-05 22.9 -3.7 32 269-300 479-510 (1200) 271 PRK05124 cysN sulfate adenylyl 45.7 17 0.00045 14.9 11.5 105 309-420 269-403 (475) 272 pfam08614 ATG16 Autophagy prot 44.8 13 0.00032 15.8 1.6 39 97-135 77-115 (194) 273 PRK11281 potassium efflux prot 44.1 0.13 3.4E-06 29.0 -8.6 26 95-120 79-104 (1107) 274 PRK09174 F0F1 ATP synthase sub 43.9 19 0.00048 14.7 3.0 31 6-38 44-75 (204) 275 pfam08317 Spc7 Spc7 kinetochor 43.6 9 0.00023 16.8 0.7 21 185-205 178-198 (322) 276 cd03696 selB_II selB_II: this 43.5 19 0.00048 14.7 4.6 40 312-352 27-67 (83) 277 pfam07423 DUF1510 Protein of u 43.4 14 0.00037 15.4 1.7 36 3-39 2-37 (214) 278 pfam12325 TMF_TATA_bd TATA ele 43.4 15 0.00039 15.3 1.8 37 94-130 17-53 (121) 279 TIGR03017 EpsF chain length de 43.2 19 0.00049 14.6 5.4 27 24-52 21-47 (444) 280 pfam11604 CusF_Ec Copper bindi 43.1 19 0.00049 14.6 5.0 54 298-354 1-55 (61) 281 TIGR02728 spore_gerQ spore coa 42.6 19 0.0005 14.6 4.6 64 337-400 9-74 (82) 282 KOG3925 consensus 42.3 19 0.00049 14.6 2.2 97 274-394 163-264 (371) 283 COG2433 Uncharacterized conser 42.3 20 0.0005 14.5 2.7 10 341-350 541-550 (652) 284 TIGR03593 yidC_nterm membrane 42.0 20 0.0005 14.5 2.2 23 18-40 2-24 (366) 285 PRK09173 F0F1 ATP synthase sub 41.9 20 0.00051 14.5 3.1 18 14-31 3-20 (159) 286 CHL00006 consensus 41.4 20 0.00052 14.4 5.1 20 69-88 405-424 (1372) 287 PRK11519 tyrosine kinase; Prov 41.2 20 0.00052 14.4 4.4 28 23-52 37-64 (720) 288 PRK10512 selenocysteinyl-tRNA- 41.1 21 0.00052 14.4 4.6 11 76-86 248-258 (615) 289 cd00516 PRTase_typeII Phosphor 41.0 21 0.00053 14.4 3.5 28 64-91 48-75 (281) 290 cd04471 S1_RNase_R S1_RNase_R: 40.9 21 0.00053 14.4 4.1 17 295-313 4-20 (83) 291 TIGR00531 BCCP acetyl-CoA carb 40.8 21 0.00053 14.4 3.1 34 284-317 77-110 (159) 292 TIGR00575 dnlj DNA ligase, NAD 40.5 21 0.00053 14.3 2.9 13 293-305 287-299 (706) 293 PRK04778 septation ring format 40.2 6.6 0.00017 17.7 -0.4 13 23-35 9-21 (569) 294 pfam02706 Wzz Chain length det 40.1 21 0.00054 14.3 5.6 33 20-54 19-51 (139) 295 PRK10629 hypothetical protein; 40.0 21 0.00054 14.3 4.6 77 13-95 7-88 (127) 296 PRK00226 greA transcription el 39.7 15 0.00038 15.3 1.3 31 266-296 49-79 (157) 297 PRK11144 modC molybdate transp 39.0 22 0.00056 14.2 7.4 47 361-417 294-347 (352) 298 TIGR02977 phageshock_pspA phag 38.8 22 0.00057 14.2 2.5 12 216-227 168-179 (222) 299 PRK00723 phosphatidylserine de 37.7 23 0.00059 14.1 2.5 20 291-311 168-187 (297) 300 PRK07193 fliF flagellar MS-rin 37.6 23 0.00059 14.1 2.3 32 5-36 16-49 (550) 301 PRK08717 consensus 37.4 23 0.00059 14.0 3.0 36 1-36 1-39 (311) 302 COG0688 Psd Phosphatidylserine 37.0 24 0.0006 14.0 2.0 39 291-332 126-164 (239) 303 PRK00624 glycine cleavage syst 36.6 24 0.00061 14.0 2.7 35 362-396 66-103 (113) 304 PRK00882 consensus 36.3 21 0.00055 14.3 1.7 20 291-311 167-186 (295) 305 COG4093 Uncharacterized protei 35.7 25 0.00063 13.9 2.4 26 11-36 7-32 (338) 306 PRK04599 consensus 35.7 18 0.00047 14.7 1.3 25 291-316 144-168 (285) 307 cd01324 cbb3_Oxidase_CcoQ Cyto 35.6 25 0.00063 13.8 2.5 19 19-37 14-32 (48) 308 pfam06818 Fez1 Fez1. This fami 35.3 7 0.00018 17.5 -0.9 16 97-112 13-28 (199) 309 TIGR02163 napH_ ferredoxin-typ 35.1 25 0.00065 13.8 2.8 30 13-42 116-145 (263) 310 TIGR03077 not_gcvH glycine cle 34.6 26 0.00066 13.8 2.6 36 361-396 64-102 (110) 311 COG4325 Predicted membrane pro 34.5 21 0.00053 14.4 1.4 35 54-88 227-279 (464) 312 pfam03264 Cytochrom_NNT NapC/N 34.5 26 0.00066 13.7 3.2 21 17-37 6-26 (174) 313 PRK03140 phosphatidylserine de 34.3 26 0.00066 13.7 1.9 21 291-312 140-160 (259) 314 CHL00078 rpl5 ribosomal protei 34.0 26 0.00067 13.7 2.6 21 402-424 145-165 (180) 315 PRK13168 rumA 23S rRNA 5-methy 33.9 26 0.00067 13.7 4.3 43 45-87 27-70 (440) 316 TIGR02231 TIGR02231 conserved 33.7 5.5 0.00014 18.2 -1.7 34 319-352 395-429 (558) 317 PRK05529 cell division protein 33.6 27 0.00068 13.6 2.4 20 77-96 84-103 (255) 318 PRK03992 proteasome-activating 33.3 5.1 0.00013 18.4 -1.9 14 290-304 161-174 (390) 319 TIGR01043 ATP_syn_A_arch ATP s 33.2 26 0.00067 13.7 1.7 29 307-336 124-152 (584) 320 KOG4403 consensus 32.6 28 0.00071 13.5 3.0 30 8-37 205-234 (575) 321 PRK06676 rpsA 30S ribosomal pr 32.3 28 0.00072 13.5 7.3 62 289-356 189-250 (390) 322 PRK08515 flgA flagellar basal 32.0 28 0.00072 13.5 4.3 53 309-369 170-222 (229) 323 PRK12736 elongation factor Tu; 31.9 28 0.00072 13.5 9.1 57 295-354 222-281 (394) 324 PRK13415 flagella biosynthesis 31.9 28 0.00073 13.5 5.0 34 53-91 115-149 (216) 325 COG0557 VacB Exoribonuclease R 31.7 25 0.00065 13.8 1.4 23 296-318 278-300 (706) 326 pfam09671 Spore_GerQ Spore coa 31.4 29 0.00074 13.4 4.0 63 337-399 11-75 (81) 327 PRK09731 putative general secr 31.3 27 0.0007 13.6 1.6 29 12-40 32-60 (178) 328 PRK10920 putative uroporphyrin 30.5 30 0.00077 13.3 7.2 22 16-37 35-56 (389) 329 pfam02699 YajC Preprotein tran 30.3 30 0.00077 13.3 3.9 25 338-371 34-58 (83) 330 TIGR03545 conserved hypothetic 30.2 30 0.00077 13.3 1.6 20 15-34 3-22 (554) 331 PRK12735 elongation factor Tu; 30.1 30 0.00078 13.3 9.0 56 297-355 226-284 (396) 332 TIGR00996 Mtu_fam_mce virulenc 29.9 31 0.00078 13.2 6.3 11 65-75 60-71 (304) 333 COG5349 Uncharacterized protei 29.9 31 0.00078 13.2 2.3 30 3-32 74-103 (126) 334 PRK13183 psbN photosystem II r 29.8 31 0.00079 13.2 2.5 25 17-41 10-34 (46) 335 COG4682 Predicted membrane pro 29.4 31 0.0008 13.2 2.8 36 5-40 1-38 (128) 336 pfam00575 S1 S1 RNA binding do 29.3 31 0.0008 13.2 3.4 47 296-356 8-63 (74) 337 TIGR03021 pilP_fam type IV pil 28.8 19 0.00048 14.7 0.4 29 318-346 71-99 (119) 338 pfam11356 Pilus_PilP Type IV p 28.8 19 0.00048 14.6 0.4 29 318-346 60-88 (108) 339 cd04461 S1_Rrp5_repeat_hs8_sc7 28.7 32 0.00082 13.1 3.2 25 287-313 9-33 (83) 340 cd05685 S1_Tex S1_Tex: The C-t 28.5 32 0.00083 13.1 2.9 56 296-356 4-59 (68) 341 TIGR02794 tolA_full protein To 28.4 32 0.00083 13.1 3.0 15 341-356 377-391 (452) 342 TIGR01144 ATP_synt_b ATP synth 28.4 12 0.0003 16.0 -0.7 10 97-106 25-34 (147) 343 COG1545 Predicted nucleic-acid 27.9 33 0.00085 13.0 5.8 50 362-416 66-121 (140) 344 pfam10011 DUF2254 Predicted me 27.7 33 0.00085 13.0 2.8 33 290-322 195-244 (369) 345 pfam02468 PsbN Photosystem II 27.6 34 0.00086 13.0 2.1 26 16-41 6-31 (43) 346 PRK12317 elongation factor 1-a 27.6 34 0.00086 13.0 9.6 119 298-419 241-388 (426) 347 pfam10828 DUF2570 Protein of u 27.5 34 0.00086 13.0 2.7 19 19-37 4-22 (110) 348 pfam06638 Strabismus Strabismu 27.4 34 0.00086 13.0 4.0 26 13-38 163-188 (507) 349 TIGR00878 purM phosphoribosylf 26.9 35 0.00088 12.9 2.3 26 65-90 161-186 (338) 350 CHL00020 psbN photosystem II p 26.8 35 0.00088 12.9 1.6 26 16-41 6-31 (43) 351 PRK05338 rplS 50S ribosomal pr 26.7 35 0.00089 12.9 3.9 36 336-371 13-50 (112) 352 PRK13380 glycine cleavage syst 26.5 35 0.00089 12.9 2.7 36 360-395 71-109 (132) 353 PRK02655 psbI photosystem II r 26.3 25 0.00063 13.9 0.6 23 17-39 4-26 (38) 354 PRK10698 phage shock protein P 26.3 32 0.00081 13.1 1.2 13 97-109 34-46 (222) 355 TIGR00739 yajC preprotein tran 26.2 36 0.00091 12.8 3.0 27 337-372 33-59 (86) 356 pfam01245 Ribosomal_L19 Riboso 26.1 36 0.00091 12.8 5.3 72 336-416 13-86 (113) 357 PRK00049 elongation factor Tu; 26.1 36 0.00091 12.8 8.1 120 297-419 227-368 (397) 358 pfam01597 GCV_H Glycine cleava 25.9 36 0.00092 12.8 2.8 34 362-395 66-102 (122) 359 CHL00182 tatC Sec-independent 25.8 36 0.00092 12.8 2.5 20 4-23 94-113 (239) 360 CHL00024 psbI photosystem II p 25.7 34 0.00086 13.0 1.2 23 17-39 4-26 (36) 361 KOG4807 consensus 25.7 0.54 1.4E-05 25.0 -8.0 31 313-344 496-526 (593) 362 pfam09962 DUF2196 Uncharacteri 25.3 37 0.00094 12.7 4.1 36 337-373 4-39 (62) 363 PRK01318 putative inner membra 25.3 37 0.00094 12.7 6.5 23 18-40 4-26 (565) 364 TIGR00849 gutA PTS system, glu 25.2 37 0.00094 12.7 2.8 63 301-371 48-110 (129) 365 PRK01294 lipase chaperone; Pro 25.2 37 0.00094 12.7 3.3 23 16-38 4-26 (339) 366 PRK05585 yajC preprotein trans 24.6 38 0.00097 12.6 3.4 26 338-372 50-75 (107) 367 PRK07400 30S ribosomal protein 24.4 38 0.00098 12.6 4.9 55 289-358 193-256 (314) 368 KOG2419 consensus 24.3 38 0.00098 12.6 2.5 22 405-426 949-970 (975) 369 PRK01202 glycine cleavage syst 24.2 39 0.00099 12.6 2.9 35 361-395 71-108 (127) 370 cd05697 S1_Rrp5_repeat_hs5 S1_ 24.2 39 0.00099 12.6 4.0 17 296-314 4-20 (69) 371 PRK02749 photosystem I reactio 24.1 39 0.00099 12.6 3.0 43 341-397 2-45 (71) 372 TIGR02902 spore_lonB ATP-depen 23.8 39 0.001 12.5 2.0 13 404-416 463-475 (532) 373 PRK01885 greB transcription el 23.7 38 0.00097 12.6 1.2 30 267-296 52-81 (159) 374 PRK00010 rplE 50S ribosomal pr 23.5 40 0.001 12.5 3.2 20 403-424 146-165 (179) 375 COG3096 MukB Uncharacterized p 23.4 31 0.00079 13.2 0.7 18 353-371 728-745 (1480) 376 PRK08582 hypothetical protein; 22.7 41 0.0011 12.4 3.3 56 292-356 5-64 (139) 377 COG3871 Uncharacterized stress 22.7 41 0.0011 12.4 2.0 44 326-370 45-89 (145) 378 pfam10498 IFT57 Intra-flagella 22.6 41 0.0011 12.4 2.3 16 307-322 315-330 (355) 379 PRK00276 infA translation init 22.4 42 0.0011 12.3 2.8 58 296-369 9-69 (72) 380 pfam09787 Golgin_A5 Golgin sub 22.0 7.5 0.00019 17.3 -2.7 18 96-113 115-132 (508) 381 PRK08059 general stress protei 21.9 43 0.0011 12.3 4.4 71 289-374 3-82 (119) 382 PRK08559 nusG transcription an 21.8 43 0.0011 12.3 5.6 33 335-372 88-120 (153) 383 cd06848 GCS_H Glycine cleavage 21.6 43 0.0011 12.2 3.1 12 363-374 65-76 (96) 384 COG4072 Uncharacterized protei 21.5 44 0.0011 12.2 6.0 12 310-321 85-96 (161) 385 COG3745 CpaB Flp pilus assembl 21.4 44 0.0011 12.2 3.8 17 66-82 49-65 (276) 386 PRK06569 F0F1 ATP synthase sub 21.4 44 0.0011 12.2 3.1 30 7-38 2-32 (157) 387 TIGR01416 Rieske_proteo ubiqui 21.4 39 0.00099 12.5 0.8 63 17-90 3-71 (192) 388 cd05691 S1_RPS1_repeat_ec6 S1_ 21.3 44 0.0011 12.2 2.8 46 296-355 4-58 (73) 389 PRK09841 cryptic autophosphory 21.2 44 0.0011 12.2 5.5 32 19-52 33-64 (726) 390 TIGR02388 rpoC2_cyan DNA-direc 21.2 44 0.0011 12.2 3.2 59 282-349 871-930 (1252) 391 COG4148 ModC ABC-type molybdat 21.1 44 0.0011 12.2 5.6 47 361-417 294-347 (352) 392 cd07021 Clp_protease_NfeD_like 20.8 35 0.0009 12.8 0.5 41 294-337 28-68 (178) 393 cd01134 V_A-ATPase_A V/A-type 20.8 45 0.0011 12.1 2.1 19 71-89 54-72 (369) 394 TIGR01204 bioW 6-carboxyhexano 20.6 45 0.0012 12.1 1.8 52 312-373 168-220 (238) 395 TIGR02830 spore_III_AG stage I 20.6 43 0.0011 12.2 0.9 14 29-42 14-27 (193) 396 pfam02532 PsbI Photosystem II 20.5 43 0.0011 12.2 0.9 22 17-38 4-25 (36) 397 TIGR03185 DNA_S_dndD DNA sulfu 20.3 46 0.0012 12.1 3.1 32 405-436 599-630 (650) 398 TIGR01069 mutS2 MutS2 family p 20.3 46 0.0012 12.1 2.2 15 330-344 706-720 (834) 399 KOG3809 consensus 20.2 46 0.0012 12.1 1.3 10 9-18 27-36 (583) 400 PRK10921 twin-arginine protein 20.0 47 0.0012 12.0 1.9 24 3-26 93-116 (255) 401 PRK08563 DNA-directed RNA poly 20.0 47 0.0012 12.0 3.7 15 403-417 134-148 (182) 402 cd04473 S1_RecJ_like S1_RecJ_l 20.0 47 0.0012 12.0 4.5 58 287-357 11-68 (77) No 1 >TIGR01843 type_I_hlyD type I secretion membrane fusion protein, HlyD family; InterPro: IPR010129 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). Type I secretion is an ABC transporter that exports proteins, without cleavage of any signal sequence, from the cytosol to extracellular medium across both inner and outer membranes. The secretion signal is found in the C terminus of the transported protein. This entry represents the adaptor protein between the ATP-binding cassette (ABC) protein of the inner membrane and the outer membrane protein, and is called the membrane fusion protein. This entry selects a group of sequences closely related to HlyD; it is defined narrowly and excludes, for example, colicin V secretion protein CvaA and multidrug efflux proteins.; GO: 0015031 protein transport, 0009276 1-2nm peptidoglycan-based cell wall, 0016021 integral to membrane. Probab=100.00 E-value=0 Score=547.44 Aligned_cols=426 Identities=19% Similarity=0.277 Sum_probs=379.5 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHEEEEEEEEEEEEEEEECCCEEEEECCCCCEEEEEECCCCCEEECCCEEEEECHHHHH Q ss_conf 61689999999999999998764067125899979999538738984688868999985789893369789999648998 Q gi|254780386|r 14 PASLKRMTLLSIIGIIGLITWSILLPIEISVSSSGEILNEDNVVEIKSPFSGIIKKFQVENGSQILQGTPLLTFEDIETT 93 (440) Q Consensus 14 p~~~~~~~~~~~~~~~~~~~w~~~~~~~~~v~~~G~i~~~~~~~~I~s~~~G~V~~i~VkeGd~VkkGq~L~~ld~~~~~ 93 (440) +++.|+++|+++++|++|++||+|+++|.++.|+|.|+|+++++.|||+++|+|.+|+|+|||.|++||+|++||.+..+ T Consensus 1 ~~~~r~~~~~~~~~v~~~~~WA~~~~ld~va~A~G~V~~~g~~k~vQhlegGi~~~I~V~EG~~V~~Gq~L~~Ld~t~~~ 80 (434) T TIGR01843 1 SRFARLITWLIALLVVIFLLWAAFAPLDSVATAQGKVVPSGNVKVVQHLEGGIVREILVREGDRVKAGQVLVELDATSVE 80 (434) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHCEEEEEEEEEEECCCEEEEECCCCCEEEEEEECCCCEECCCCCCEEECHHHHH T ss_conf 93279889999999999999888761025899866999707477996686534413552258642037701354135789 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCCCH-HHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCC Q ss_conf 889998743433233210012333322211-45554223568888-7765546788654333655420011222112210 Q gi|254780386|r 94 DLIDLKKVSIKNLRCHIDTEKSALDFLNNK-YDVLEKAFESREEQ-LLDFLGKYHSTCSKVFNYGISSVYRSLKMKLTRM 171 (440) Q Consensus 94 ~~l~~~~~~l~~l~~~l~~~~~~l~~~~~~-~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 171 (440) +++.....++..+++.....++.++..... ...-.........- ........-..+...+..+...+..++.....+. T Consensus 81 A~~~~~~~~~~~l~~~~~R~~Ae~~~~~~~~~~~~~~~~~~~~~p~~r~~~~~~~~~~~~~f~~~~~~l~~~~~~~~~~~ 160 (434) T TIGR01843 81 ADAAELESQVLALEAEVARLRAEINSEEASVIEFPDILLSAIDDPFERRAVAELVKGQQSLFESRKSALRAQLESILAQI 160 (434) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHCCCCHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999999999999832325754467577620114120371289999999999999998889999999999 Q ss_pred HHHHHHCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 02232101122322222210233332221012222221110000111002466653101223322222222222100012 Q gi|254780386|r 172 RSIYLNRLDLENKSSLLRALLSSHQKDLTMMKNLFEKNAVSQNDLNQQERIVYKSTLELRENIHEQKNMLRHINELYDEA 251 (440) Q Consensus 172 ~~~~~~~~~l~~~~~~l~~~~~~~~~el~~~~~l~~~~~~~~~~l~~~~~~~~~~~~el~~~~~~~~~~~~~l~~~~~~~ 251 (440) .....+...++.+...++.++...++++...++|.+++..++.++.+++++..+...++....++......++.+...+. T Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~L~~~g~~~~~~~L~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 240 (434) T TIGR01843 161 KQLEAELEGLQAQLQALRQQLELISEELEALRKLKEKGLVSRLELLELERERAELQGELARLEAELEVLKAQIDELQLKL 240 (434) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999878999999999999863066334321678999999999999999987999999999999999999 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCEEECCCCCCCCCCCCCCCCCCCCCEEEE Q ss_conf 23332112332221222222221222332212222212345554116873121201222124445542103355650477 Q gi|254780386|r 252 NLEFANYLKEISRDLEQNQRTFADEMSKLTILEKTREQKTILSPIAGTIVYNQSFSSSNYAQQSQPLMKIVPHSKLTYIR 331 (440) Q Consensus 252 ~~~~~~~~~e~~~~l~e~~~~l~~~~~~l~~l~~~l~~~~I~AP~~G~V~~l~~~~~G~~v~~G~~l~~Ivp~~~~l~v~ 331 (440) ........+++.+++.+++.++.++++++..+.+++.+..||||+||+|..+.++|.|+||+||.+||.|||.++++.|| T Consensus 241 ~~~~~~~~~e~~~~l~~A~~~l~~~~~~l~~a~~~~~~~~irsPvdG~v~~l~~~T~GgVv~pG~~lm~IVP~~~~l~iE 320 (434) T TIGR01843 241 QQLEQTFREEVLEELTEAQARLAELREELSKARDRLQRLIIRSPVDGTVQSLKVHTVGGVVQPGETLMEIVPEDDPLEIE 320 (434) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEEECCCCEEEEEEEEEEECCEECCCCCEEEEECCCCCEEEE T ss_conf 88999999999999999999999999999999987532114425451999999984054432786178983399965899 Q ss_pred EEECHHHHHHCCCCCCEEEEEEECCCCCCCEEEEEEEEEECCCCCCCCCC-CCCEEEEEEEEECHHHCC---CCCCEECC Q ss_conf 53262231101589831799981288754439999999713344467776-551189999993501125---67851523 Q gi|254780386|r 332 AKVTPKQIQHVKKGYTATVRFPHYADIREKKFKAIIEKIDPIISQQDNNL-QTQEYYEVILKITDLAFS---DNNIELRN 407 (440) Q Consensus 332 ~~v~~~di~~i~~Gq~v~i~~~a~~~~~~~~~~G~V~~Is~~~~~~~~~~-~~~~~y~v~i~~~~~~~~---~~~~~L~~ 407 (440) |+|.+.||++|++||+|.|+|+|||+.+||.+.|+|.+||||+..++... ....||.|+|.++..++. +.++.|.| T Consensus 321 a~~~~~Dig~v~~Gq~a~ik~~AF~~~ryG~~~G~V~~iS~D~~~DE~~~~~~~~yY~~ri~~~~~~~~k~~~~~~~l~P 400 (434) T TIGR01843 321 AKLSNKDIGFVHVGQPAEIKFSAFPYTRYGLLNGKVKSISPDTVEDERQGRQQGLYYRVRISIDENTLGKAGGKGLELSP 400 (434) T ss_pred EEECCCCCCEEECCCEEEEEEECCCCCCCCCEEEEEEEECCCCHHHHHCCCCCCCEEEEEEEECCCEECCCCCCCCEEEC T ss_conf 99735465345558716999963686033518999998627511444247876766679999732001365776601406 Q ss_pred CCEEEEEEEECEEEHHHHHHHHHHHHHHHHHC Q ss_conf 77569999922054799999999989988731 Q gi|254780386|r 408 GFPAEVLFTAKNTTLAQEIIHPITDNWPKIFE 439 (440) Q Consensus 408 Gm~~~~~I~~~~~~~~~~~~~p~~~~~~~~f~ 439 (440) ||+|+|.|+||+||+|+|||+||++++.++|| T Consensus 401 GM~v~a~i~TG~Rt~~~YLl~P~~~~~~~AlR 432 (434) T TIGR01843 401 GMPVEADIKTGERTVIEYLLKPITDSVQEALR 432 (434) T ss_pred CCEEEEEEEECCCHHHHHHHHHHHHHHHHHCC T ss_conf 85379999817323378877789998876437 No 2 >PRK03598 hypothetical protein; Provisional Probab=100.00 E-value=1.7e-43 Score=308.58 Aligned_cols=319 Identities=15% Similarity=0.218 Sum_probs=216.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHEEE-EEEEEEEEEEEEECCCEEEEECCCCCEEEEEECCCCCEEECCCEEEEECHHHHHH Q ss_conf 689999999999999998764067-1258999799995387389846888689999857898933697899996489988 Q gi|254780386|r 16 SLKRMTLLSIIGIIGLITWSILLP-IEISVSSSGEILNEDNVVEIKSPFSGIIKKFQVENGSQILQGTPLLTFEDIETTD 94 (440) Q Consensus 16 ~~~~~~~~~~~~~~~~~~w~~~~~-~~~~v~~~G~i~~~~~~~~I~s~~~G~V~~i~VkeGd~VkkGq~L~~ld~~~~~~ 94 (440) ++.+++++++++.++++.|.++.. -+......|.|.. ....|.+.++|+|.+|+|++||.|++||+|++||+.+.+. T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ve~--~~V~vs~~v~G~V~~v~V~eGd~Vk~Gq~La~LD~~~~~~ 81 (331) T PRK03598 4 PVVIGLAVVALVAVGAGGWWWYQSRQDNGLTLYGNVDI--RTVNLSFRVGGRLASLAVDEGDAVKAGQVLGELDHAPYEN 81 (331) T ss_pred HHHHHHHHHHHHHHHHHHHHHEEECCCCCEEEEEEEEE--EEEEECCCCCEEEEEEECCCCCEECCCCEEEEECCHHHHH T ss_conf 38999999999999999765214057886035799999--9999913277599999868979877998899988089999 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHH Q ss_conf 89998743433233210012333322211455542235688887765546788654333655420011222112210022 Q gi|254780386|r 95 LIDLKKVSIKNLRCHIDTEKSALDFLNNKYDVLEKAFESREEQLLDFLGKYHSTCSKVFNYGISSVYRSLKMKLTRMRSI 174 (440) Q Consensus 95 ~l~~~~~~l~~l~~~l~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 174 (440) .+...+.++...++++...+..... .++..... T Consensus 82 ~l~~a~a~l~~a~a~l~~~~~~~~~----------------~ei~~a~a------------------------------- 114 (331) T PRK03598 82 ALMQAKAGVSVAQAQLDLMLAGYRD----------------EEIAQARA------------------------------- 114 (331) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCH----------------HHHHHHHH------------------------------- T ss_conf 9999999999999999998720125----------------68999999------------------------------- Q ss_pred HHHCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 32101122322222210233332221012222221110000111002466653101223322222222222100012233 Q gi|254780386|r 175 YLNRLDLENKSSLLRALLSSHQKDLTMMKNLFEKNAVSQNDLNQQERIVYKSTLELRENIHEQKNMLRHINELYDEANLE 254 (440) Q Consensus 175 ~~~~~~l~~~~~~l~~~~~~~~~el~~~~~l~~~~~~~~~~l~~~~~~~~~~~~el~~~~~~~~~~~~~l~~~~~~~~~~ 254 (440) .....+..+...+..+++.+.|.+.+..++.++.+.+..+......+......+...... T Consensus 115 ---------~~~~a~a~l~~a~~~~~R~~~L~~~g~is~~~ld~a~~~~~~A~a~l~~a~~~l~~~~~~----------- 174 (331) T PRK03598 115 ---------AVKQAQAAYDYAQNFYNRQQGLWPSGTISANDLENARSSRDQAQATLKSAQDKLSQYREG----------- 174 (331) T ss_pred ---------HHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC----------- T ss_conf ---------999999999999999999998987799578899999999999999999999999998743----------- Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCEEECCCCCCCCCCCCCCCCCCCCCEEEEEEE Q ss_conf 32112332221222222221222332212222212345554116873121201222124445542103355650477532 Q gi|254780386|r 255 FANYLKEISRDLEQNQRTFADEMSKLTILEKTREQKTILSPIAGTIVYNQSFSSSNYAQQSQPLMKIVPHSKLTYIRAKV 334 (440) Q Consensus 255 ~~~~~~e~~~~l~e~~~~l~~~~~~l~~l~~~l~~~~I~AP~~G~V~~l~~~~~G~~v~~G~~l~~Ivp~~~~l~v~~~v 334 (440) -...++...+..+...+..+..++..|+++.|+||+||+|....+ .+|++|++|+|+++|+ +.+.+||+++| T Consensus 175 ------~~~~~i~~a~a~l~~a~a~l~~a~~~L~~t~I~AP~dGvV~~~~v-~~Ge~V~~G~~l~ti~-~~~~l~v~~~v 246 (331) T PRK03598 175 ------NREEDIAQAKASLEQAQAQLAQAELNLQDTTLIAPSDGTILTRAV-EPGTMLSAGSTVFTLS-LTRPVWVRAYV 246 (331) T ss_pred ------CCHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEECCCCEEEEEEEC-CCCCEECCCCCEEEEE-ECCCEEEEEEE T ss_conf ------123469988999999999999999876067997899669998975-8898848998179999-17945999997 Q ss_pred CHHHHHHCCCCCCEEEEEEECCCCCCCEEEEEEEEEECCCCCCCCCCCC-----CEEEEEEEEECHHHCCCCCCEECCCC Q ss_conf 6223110158983179998128875443999999971334446777655-----11899999935011256785152377 Q gi|254780386|r 335 TPKQIQHVKKGYTATVRFPHYADIREKKFKAIIEKIDPIISQQDNNLQT-----QEYYEVILKITDLAFSDNNIELRNGF 409 (440) Q Consensus 335 ~~~di~~i~~Gq~v~i~~~a~~~~~~~~~~G~V~~Is~~~~~~~~~~~~-----~~~y~v~i~~~~~~~~~~~~~L~~Gm 409 (440) ||.|+++|++||+|.|++|+||+. .|+|+|++|||.+...+.+..+ .-+|+|+|.++++ +..|+||| T Consensus 247 ~E~~i~~v~~Gq~v~i~~da~p~~---~f~G~V~~Isp~a~~tp~~~~t~~~rt~~v~~V~i~i~~~-----~~~LrpGM 318 (331) T PRK03598 247 DERNLGQAQPGRKVLVYTDGRPDK---PYHGQIGFVSPTAEFTPKTVETPDLRTDLVYRLRIVVTDA-----DDALRQGM 318 (331) T ss_pred CHHHHHHCCCCCEEEEEEECCCCC---EEEEEEEEECCCCCCCCCCCCCCCCEEEEEEEEEEEECCC-----CCCCCCCC T ss_conf 789995288999899999469996---8999999985644367875556652035799999995698-----77669998 Q ss_pred EEEEEEEECE Q ss_conf 5699999220 Q gi|254780386|r 410 PAEVLFTAKN 419 (440) Q Consensus 410 ~~~~~I~~~~ 419 (440) +++|.|.++. T Consensus 319 ~v~V~i~~~~ 328 (331) T PRK03598 319 PVTVRFADEA 328 (331) T ss_pred EEEEEECCCC T ss_conf 8999987788 No 3 >TIGR01000 bacteriocin_acc bacteriocin secretion accessory protein; InterPro: IPR005696 Bacteriocins are antibacterial proteinaceous compounds produced by bacteria. In Gram-positive bacteria, they are divided into four classes. Within the class II, constituted by non-modified peptides produced mainly by lactic acid bacteria, bacteriocins of the subclass IIa, such as mesentericin Y105, are of particular interest. They are active against the foodborne pathogen Listeria monocytogenes and share a similar primary structure, with a conserved N-terminal motif (YGNGV). Subclass IIa bacteriocins induce membrane permeabilization of sensitive strains, but their target specificity and their molecular mode of action remain elusive. This family of proteins is involved in the secretion of such bacteriocins although the mechanism of this process is not well understood. ; GO: 0005215 transporter activity, 0006810 transport. Probab=100.00 E-value=8.3e-43 Score=304.02 Aligned_cols=417 Identities=13% Similarity=0.179 Sum_probs=273.3 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHEEEEEEEEEEEEEEEECCCEEEEECCCCCEEEEEECCCCCEEECCCEEEEECH Q ss_conf 35686168999999999999999876406712589997999953873898468886899998578989336978999964 Q gi|254780386|r 10 KRNHPASLKRMTLLSIIGIIGLITWSILLPIEISVSSSGEILNEDNVVEIKSPFSGIIKKFQVENGSQILQGTPLLTFED 89 (440) Q Consensus 10 ~~~~p~~~~~~~~~~~~~~~~~~~w~~~~~~~~~v~~~G~i~~~~~~~~I~s~~~G~V~~i~VkeGd~VkkGq~L~~ld~ 89 (440) .|-+..|..++|++++++||++++|++||+.++||.++|.|.|...+..|||+..-.|.+-+.+||..|||||+|+.++. T Consensus 14 ~~Ry~NFs~l~I~P~~lLLv~~~lFs~fAkkEivi~~~g~~~p~~~~~~iQst~n~~I~~N~L~Enk~VKKG~~Lv~y~~ 93 (476) T TIGR01000 14 QKRYHNFSTLVIVPIFLLLVFLVLFSLFAKKEIVIRSTGSIEPAKVIAKIQSTSNNAIKENYLKENKFVKKGDLLVVYTN 93 (476) T ss_pred CCHHHCCHHHHHHHHHHHHHHHHHHHCCCEEEEEEEEEEEECCCCCCHHHCCCCCCHHHHHHHHHCCEEECCCEEEEEEC T ss_conf 10211012689999999999999972201078898411266143201212288421456513320978833878999717 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCC---CCCCCCCCCHHHHH-----HHHHHHHHHHHHHHHHHHHH------HHHHHHHH Q ss_conf 899888999874343323321001---23333222114555-----42235688887765546788------65433365 Q gi|254780386|r 90 IETTDLIDLKKVSIKNLRCHIDTE---KSALDFLNNKYDVL-----EKAFESREEQLLDFLGKYHS------TCSKVFNY 155 (440) Q Consensus 90 ~~~~~~l~~~~~~l~~l~~~l~~~---~~~l~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~------~~~~~~~~ 155 (440) ...+.+......++..|..+...+ +..+..-..+.... +..+.....+...+...... ........ T Consensus 94 ~~~~~~~~~~~~q~~~L~~q~~~L~~l~~Si~~g~n~F~~~D~FGY~~~l~~Y~~Q~~sl~~~~~q~~~~~~~~~~~~~~ 173 (476) T TIGR01000 94 TNEENQKQELEQQLDNLKDQKKSLDTLKESIENGKNQFPTTDSFGYENLLKGYLAQVESLTSETQQQNDKSATQNEAAEK 173 (476) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 43577899999999999999999999998643487757888725589999999999999999998778889999999999 Q ss_pred HHHHCCCCCCCCCCCCHHHHHHC------CCCCCCCCCCCCHHHHHHHHHHCCCC-CCCCC---CCCCCCCCC------- Q ss_conf 54200112221122100223210------11223222222102333322210122-22221---110000111------- Q gi|254780386|r 156 GISSVYRSLKMKLTRMRSIYLNR------LDLENKSSLLRALLSSHQKDLTMMKN-LFEKN---AVSQNDLNQ------- 218 (440) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~------~~l~~~~~~l~~~~~~~~~el~~~~~-l~~~~---~~~~~~l~~------- 218 (440) ..+.+...+......+.....-. ..+......+....+.++.++..... +..+. ..++....+ T Consensus 174 ~~~~l~~~i~~~~~~l~~yq~~~~Ai~~~~~~~~~~~~~~s~Y~~Y~~q~~~~~~dlknqknPdetakqaa~~~k~~vK~ 253 (476) T TIGR01000 174 TKAQLDQQISKTQQKLQNYQALKNAISSGTKVSNSSKPYLSLYENYQAQLKSAEDDLKNQKNPDETAKQAAKDQKEQVKS 253 (476) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEECCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHCHHHHHHH T ss_conf 99999888866899999999999999808812245631367668799999988877640468314676542132688999 Q ss_pred -CHHHHHHHHHCCCCCCCCCCCCCCCC----------CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf -00246665310122332222222222----------2100012233321123322212222222212223322122222 Q gi|254780386|r 219 -QERIVYKSTLELRENIHEQKNMLRHI----------NELYDEANLEFANYLKEISRDLEQNQRTFADEMSKLTILEKTR 287 (440) Q Consensus 219 -~~~~~~~~~~el~~~~~~~~~~~~~l----------~~~~~~~~~~~~~~~~e~~~~l~e~~~~l~~~~~~l~~l~~~l 287 (440) .-..+.....+|+..++..+.+.+.+ ....+++.....+.+..+.+++.++...+.+++..+..+.... T Consensus 254 ~~la~~~~QId~L~~~~~~y~~Q~a~~~~~~~s~~~~~s~~~k~~~LK~Q~L~~~~~~~~~~~q~l~~~~~~~k~~~~~~ 333 (476) T TIGR01000 254 QILATIQQQIDQLKKSIASYQVQKAGLTKSDASKNYASSQSSKLAQLKEQQLAKVKQELTDLNQKLLELESKIKSLKEDS 333 (476) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 99874886599999998876444420247787521224524888878988899999999999987999887778765212 Q ss_pred CCCCCCCCCEEEEECCEEECCCCCCCCCCCCCCCCCC---CCCEEEEEEECHHHHHHCCCCCCEEEEEEECCCCCCCEEE Q ss_conf 1234555411687312120122212444554210335---5650477532622311015898317999812887544399 Q gi|254780386|r 288 EQKTILSPIAGTIVYNQSFSSSNYAQQSQPLMKIVPH---SKLTYIRAKVTPKQIQHVKKGYTATVRFPHYADIREKKFK 364 (440) Q Consensus 288 ~~~~I~AP~~G~V~~l~~~~~G~~v~~G~~l~~Ivp~---~~~l~v~~~v~~~di~~i~~Gq~v~i~~~a~~~~~~~~~~ 364 (440) .+..|+||.||++|-......-..|+.|++|+.|+|. ...+.|++|||..||..||+||.|+++++- +....-.++ T Consensus 334 ~~~~ikAp~dGvLHl~~e~~g~~~~~~GT~~A~IYP~l~~~~~~~~~~y~p~~d~~~iK~Gq~vRF~~~~-n~~~p~ild 412 (476) T TIGR01000 334 QKGVIKAPEDGVLHLNPETKGIKYVPKGTLIAQIYPILKKERKLKVTAYIPSKDISGIKVGQKVRFTVTQ-NVPKPIILD 412 (476) T ss_pred CCCEEEECCCCEEEECCCCCCHHHCCCCCHHHHHCCCCCCCEEEEEEEECCHHHHHHCCCCCEEEEEEEC-CCCCCEEEE T ss_conf 2544872136378506144450203465237776152023504788786062233322327427788634-799855884 Q ss_pred EEEEEEECCCCCCCCCCCCCEEEEEEEEECHHHCCCCCCEECCCCEEEEEEEECEEEHHHHHHHHHHHH Q ss_conf 999997133444677765511899999935011256785152377569999922054799999999989 Q gi|254780386|r 365 AIIEKIDPIISQQDNNLQTQEYYEVILKITDLAFSDNNIELRNGFPAEVLFTAKNTTLAQEIIHPITDN 433 (440) Q Consensus 365 G~V~~Is~~~~~~~~~~~~~~~y~v~i~~~~~~~~~~~~~L~~Gm~~~~~I~~~~~~~~~~~~~p~~~~ 433 (440) |+|+.||..|+.. ..++||.|...+.=.. +....||+||+|...+|+|++|||+|++|-|+|. T Consensus 413 G~i~~Is~~pt~~----~~G~FY~v~a~~~i~~--~~~~~lRYGl~G~~~~I~GkKTyf~Yy~D~~Ln~ 475 (476) T TIGR01000 413 GTITSISSAPTAT----KKGNFYKVTATIKITK--NQAEKLRYGLTGKIALITGKKTYFNYYKDKLLNK 475 (476) T ss_pred EEECCCCCCCCEE----CCCCEEEEEEEEECCH--HHHHCCCCCCCEEEEEEEEEEEHHHHHHHHHCCC T ss_conf 1143535640101----1672699998873584--3342043133136999974462355667776179 No 4 >PRK10476 multidrug resistance protein MdtN; Provisional Probab=100.00 E-value=5.6e-42 Score=298.47 Aligned_cols=318 Identities=17% Similarity=0.181 Sum_probs=206.6 Q ss_pred HHHHHHHHHHHHHHHHHHHEEEEEEEEEEEEEEEECCCEEEEECCCCCEEEEEECCCCCEEECCCEEEEECHHHHHHHHH Q ss_conf 99999999999999987640671258999799995387389846888689999857898933697899996489988899 Q gi|254780386|r 18 KRMTLLSIIGIIGLITWSILLPIEISVSSSGEILNEDNVVEIKSPFSGIIKKFQVENGSQILQGTPLLTFEDIETTDLID 97 (440) Q Consensus 18 ~~~~~~~~~~~~~~~~w~~~~~~~~~v~~~G~i~~~~~~~~I~s~~~G~V~~i~VkeGd~VkkGq~L~~ld~~~~~~~l~ 97 (440) ++++++++++.++++.|.++.......+-.+.|. ++...|.+.++|.|.+|+|++||.|++||+|++||+.+.+..++ T Consensus 14 ~~i~~~~~~~~~~~~~~~~~~~~~~~~TddA~V~--~~iv~Is~~VsG~V~~V~V~egq~VkkGq~L~~LD~~~~~~~v~ 91 (348) T PRK10476 14 KVIALAIVALALVALVYVIWRTDSAPSTDDAYID--ADVVHVASEVGGRIVELAVTENQAVKKGDLLFRIDPRPYELTVA 91 (348) T ss_pred EEHHHHHHHHHHHHHHHHHEECCCEEECCCEEEE--EEEEEEECCCCEEEEEEEECCCCEECCCCEEEEECCHHHHHHHH T ss_conf 2278999999999987651324640143625999--79999934344599999946999864897899977689999999 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHH Q ss_conf 98743433233210012333322211455542235688887765546788654333655420011222112210022321 Q gi|254780386|r 98 LKKVSIKNLRCHIDTEKSALDFLNNKYDVLEKAFESREEQLLDFLGKYHSTCSKVFNYGISSVYRSLKMKLTRMRSIYLN 177 (440) Q Consensus 98 ~~~~~l~~l~~~l~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 177 (440) +.+..+...++++...+..+........ ....+ T Consensus 92 ~A~A~l~~a~a~l~~~~~~~~~~~a~~~-------~a~a~---------------------------------------- 124 (348) T PRK10476 92 QAQADLALADAQLDTTQRSVDAERSNAA-------SANEQ---------------------------------------- 124 (348) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHH---------------------------------------- T ss_conf 9999999999999999878888898999-------99999---------------------------------------- Q ss_pred CCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 01122322222210233332221012222221110000111002466653101223322222222222100012233321 Q gi|254780386|r 178 RLDLENKSSLLRALLSSHQKDLTMMKNLFEKNAVSQNDLNQQERIVYKSTLELRENIHEQKNMLRHINELYDEANLEFAN 257 (440) Q Consensus 178 ~~~l~~~~~~l~~~~~~~~~el~~~~~l~~~~~~~~~~l~~~~~~~~~~~~el~~~~~~~~~~~~~l~~~~~~~~~~~~~ 257 (440) ....+..++..+.++++.+.|.+++..++.++...+..+......+.....+....... T Consensus 125 -------~~~A~a~~~~a~~~~~R~~~L~~~g~vS~~~ld~a~~~~~~A~a~l~~a~~q~~~a~~~-------------- 183 (348) T PRK10476 125 -------VERARANAKLATRTLERLEPLLAKGYVSAQQVDQARTAQRDAEVSLNQALLQAQAAAAA-------------- 183 (348) T ss_pred -------HHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------------- T ss_conf -------99999999999999999998876699799999999999999999999999999999875-------------- Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCEEECCCCCCCCCCCCCCCCCCCCCEEEEEEECHH Q ss_conf 12332221222222221222332212222212345554116873121201222124445542103355650477532622 Q gi|254780386|r 258 YLKEISRDLEQNQRTFADEMSKLTILEKTREQKTILSPIAGTIVYNQSFSSSNYAQQSQPLMKIVPHSKLTYIRAKVTPK 337 (440) Q Consensus 258 ~~~e~~~~l~e~~~~l~~~~~~l~~l~~~l~~~~I~AP~~G~V~~l~~~~~G~~v~~G~~l~~Ivp~~~~l~v~~~v~~~ 337 (440) .......+..+...+..|..++.+|+++.|+||+||+|...++ .+|++|.+|+||++|+ +.+.+||+++|+|. T Consensus 184 -----~~~~~~~~a~v~~a~a~L~~A~~~L~~t~I~AP~dG~V~~~~v-~~G~~V~pG~pl~~lv-~~~~~wV~An~~Et 256 (348) T PRK10476 184 -----VGGVDALVAQRAAREAALAIAELHLEDTTVRAPFDGRVVGLKV-SVGEFAAPMQPIFTLI-DTDHWYAIANFRET 256 (348) T ss_pred -----HCCHHHHHHHHHHHHHHHHHHHHHHHCCEEECCCCEEEEEEEC-CCCCEECCCCEEEEEE-ECCCEEEEEEECHH T ss_conf -----4015668999999999999999878558997789849998866-9998778998489999-68957999993277 Q ss_pred HHHHCCCCCCEEEEEEECCCCCCCEEEEEEEEEECCCCCCC---------CCCCCC------EEEEEEEEECHHHCCCCC Q ss_conf 31101589831799981288754439999999713344467---------776551------189999993501125678 Q gi|254780386|r 338 QIQHVKKGYTATVRFPHYADIREKKFKAIIEKIDPIISQQD---------NNLQTQ------EYYEVILKITDLAFSDNN 402 (440) Q Consensus 338 di~~i~~Gq~v~i~~~a~~~~~~~~~~G~V~~Is~~~~~~~---------~~~~~~------~~y~v~i~~~~~~~~~~~ 402 (440) ++++|++||+|.|++|+||+. .|.|+|.+|++.+.+.. ..+..+ .-++|+|.++++. . T Consensus 257 ~l~~v~~Gq~v~I~~Da~p~~---~f~G~V~sI~~g~~~~~~~~~~~~lp~~natgn~~Kv~QRvPVrI~~d~~~----~ 329 (348) T PRK10476 257 DLKNIRVGDCATVYSMIDRGR---PIEGKVDSIGWGVLPDDGGNVPRGLPYVPRSINWVRVAQRFPVRIMLDKPD----P 329 (348) T ss_pred HHCCCCCCCEEEEEEECCCCC---EEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEEEEEEECCCC----C T ss_conf 825279989899999768997---899999998878778100555445899799987688878985999988997----1 Q ss_pred CEECCCCEEEEEEEECE Q ss_conf 51523775699999220 Q gi|254780386|r 403 IELRNGFPAEVLFTAKN 419 (440) Q Consensus 403 ~~L~~Gm~~~~~I~~~~ 419 (440) ..|+|||+++|.|.||. T Consensus 330 ~~lr~G~s~~v~i~tg~ 346 (348) T PRK10476 330 ELFRIGASAVVELRPGA 346 (348) T ss_pred CCCCCCCEEEEEEECCC T ss_conf 01478887899997899 No 5 >TIGR00998 8a0101 efflux pump membrane protein; InterPro: IPR005694 emrA and emrB confer resistance to carbonylcyanide m-chlorophenylhydrazone, nalidixic acid, and a number of other toxic compounds. EmrB encodes a highly hydrophobic 56.2-kDa peptide, with 14 potential alpha-helices to span the inner membrane. EmrA encodes a putative 42.7-kDa peptide containing a single hydrophobic domain and a large C-terminal hydrophilic domain.; GO: 0015238 drug transporter activity, 0006855 multidrug transport, 0009276 1-2nm peptidoglycan-based cell wall. Probab=100.00 E-value=2.3e-38 Score=274.36 Aligned_cols=355 Identities=15% Similarity=0.225 Sum_probs=241.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHEEEEEEEEEEEEEEEECCCEEEEEC-CCCCEEEEEECCCCCEEECCCEEEEECHHHHHH Q ss_conf 6899999999999999987640671258999799995387389846-888689999857898933697899996489988 Q gi|254780386|r 16 SLKRMTLLSIIGIIGLITWSILLPIEISVSSSGEILNEDNVVEIKS-PFSGIIKKFQVENGSQILQGTPLLTFEDIETTD 94 (440) Q Consensus 16 ~~~~~~~~~~~~~~~~~~w~~~~~~~~~v~~~G~i~~~~~~~~I~s-~~~G~V~~i~VkeGd~VkkGq~L~~ld~~~~~~ 94 (440) +..+++++++++++++++|-++.-.+...+=. ....++...|.| .++|.|.+|++.++|.||.||+|++||++.+.. T Consensus 4 ~~~l~vv~Li~~a~a~~~y~~~~~~~~~~T~d--AYV~a~~~~v~sF~VsG~v~~v~~~d~d~Vk~GD~Lv~LD~t~a~~ 81 (379) T TIGR00998 4 FLLLLVVLLIVVALAYAIYWFLVLRDYESTDD--AYVKANQLQVSSFQVSGSVIEVNVDDTDYVKQGDVLVRLDPTDAEL 81 (379) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHH--HHCCCCCEEEEEEEEEEEEEEEEECCCCCCEECCEEEEECCCHHHH T ss_conf 67999999999999999999988740112114--6212773023246573248999743788420311566767716899 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHH Q ss_conf 89998743433233210012333322211455542235688887765546788654333655420011222112210022 Q gi|254780386|r 95 LIDLKKVSIKNLRCHIDTEKSALDFLNNKYDVLEKAFESREEQLLDFLGKYHSTCSKVFNYGISSVYRSLKMKLTRMRSI 174 (440) Q Consensus 95 ~l~~~~~~l~~l~~~l~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 174 (440) .++..+..|..+..+...+....+.++.-+.. .+... .-+.... T Consensus 82 ~~~~Ae~~LAa~~rq~~~l~~~~~~l~~GLHi---sLsak---------------------------------kL~v~s~ 125 (379) T TIGR00998 82 ALAKAEANLAALVRQTKQLEITVQQLQAGLHI---SLSAK---------------------------------KLKVESL 125 (379) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCH---HHHHH---------------------------------HHHHHHH T ss_conf 99999999999999888873116776356400---02456---------------------------------7889999 Q ss_pred HHHCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCC---CCCCCCHHHHHHHHHCCCCCCCCCC--CCCCCCCCCCC Q ss_conf 321011223222222102333322210122222211100---0011100246665310122332222--22222221000 Q gi|254780386|r 175 YLNRLDLENKSSLLRALLSSHQKDLTMMKNLFEKNAVSQ---NDLNQQERIVYKSTLELRENIHEQK--NMLRHINELYD 249 (440) Q Consensus 175 ~~~~~~l~~~~~~l~~~~~~~~~el~~~~~l~~~~~~~~---~~l~~~~~~~~~~~~el~~~~~~~~--~~~~~l~~~~~ 249 (440) .......+....+.+......+-.+.+...|..++..+. +++......+....+.|+..+.+.. ...+...+-.. T Consensus 126 ~~~~~qar~~l~qae~~~~~~~~dl~Rr~~L~~~G~I~~RkFE~l~~ar~a~~~a~A~L~aa~~~I~evT~~~R~~~G~~ 205 (379) T TIGR00998 126 KIKLEQAREKLKQAEEKIKLVKLDLRRRVPLLKKGLISRRKFEELDHARKALLSAKAALNAAIQEINEVTEIKRAKKGQL 205 (379) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999998767876565321535348898478998754443476899999999878899854334567899999999 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCEEECCCCCCCCCCCCCCCCCCCCCEE Q ss_conf 12233321123322212222222212223322122222123455541168731212012221244455421033556504 Q gi|254780386|r 250 EANLEFANYLKEISRDLEQNQRTFADEMSKLTILEKTREQKTILSPIAGTIVYNQSFSSSNYAQQSQPLMKIVPHSKLTY 329 (440) Q Consensus 250 ~~~~~~~~~~~e~~~~l~e~~~~l~~~~~~l~~l~~~l~~~~I~AP~~G~V~~l~~~~~G~~v~~G~~l~~Ivp~~~~l~ 329 (440) .+...+ -....+..+.+..+.++.++..+++.+.-.|++|.|+||+||+|....+ ++|++|+||+|||+|| +...|| T Consensus 206 ~An~Al-~~~~~L~~qpasLeeev~~A~~~~~~A~L~L~rT~iraP~dG~Va~r~v-qvG~~vS~g~p~ma~V-~~~~~y 282 (379) T TIGR00998 206 NANQAL-VRGTPLKKQPASLEEEVQEAKERLKKAWLALKRTEIRAPFDGYVARRKV-QVGQVVSPGQPLMAVV-PAEQMY 282 (379) T ss_pred HHHHHH-HCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEECCCCCEEEEEEC-CCCCEECCCCCEEEEE-CCCCEE T ss_conf 998741-1244204334107899999999999998740185643672205877630-7886115897257778-454048 Q ss_pred EEEEECHHHHHHCCCCCCEEEEEEECCCCCCCEEEEEEEEEECCCCCCCCC---------CC------CCEEEEEEEEEC Q ss_conf 775326223110158983179998128875443999999971334446777---------65------511899999935 Q gi|254780386|r 330 IRAKVTPKQIQHVKKGYTATVRFPHYADIREKKFKAIIEKIDPIISQQDNN---------LQ------TQEYYEVILKIT 394 (440) Q Consensus 330 v~~~v~~~di~~i~~Gq~v~i~~~a~~~~~~~~~~G~V~~Is~~~~~~~~~---------~~------~~~~y~v~i~~~ 394 (440) |+|++.|+++..|++||+|+|++|.||.+. .|.|+|+.|++...+++++ +. ...-+||+|++| T Consensus 283 V~ANFkETqL~~vR~G~~v~i~~DlYg~dv--~F~GkV~gi~~GvLPDDtG~~f~llP~qnatGNWiKvVQR~PV~I~lD 360 (379) T TIGR00998 283 VEANFKETQLKNVRIGQPVTIRSDLYGSDV--VFEGKVTGISMGVLPDDTGSAFSLLPAQNATGNWIKVVQRLPVRIKLD 360 (379) T ss_pred EEEEEECCCCCCCCCCCCEEEEEEECCCCC--EEEEEEEECCCCCCCCCHHHHHHCCCCCCCCCCEEEEEEEEEEEEEEC T ss_conf 987520132057457872589985438980--797488422663356740455410443125545289989742688626 Q ss_pred HHHCCCCCCEECCCCEEEEEE Q ss_conf 011256785152377569999 Q gi|254780386|r 395 DLAFSDNNIELRNGFPAEVLF 415 (440) Q Consensus 395 ~~~~~~~~~~L~~Gm~~~~~I 415 (440) +..+... .||-||++.|.| T Consensus 361 ~~~L~~~--pLRiGlS~~v~i 379 (379) T TIGR00998 361 KKELDEH--PLRIGLSAEVEI 379 (379) T ss_pred HHHHHCC--CCEECCCEEEEC T ss_conf 2554006--651216326739 No 6 >PRK10559 p-hydroxybenzoic acid efflux subunit AaeA; Provisional Probab=100.00 E-value=2.4e-33 Score=240.91 Aligned_cols=266 Identities=13% Similarity=0.224 Sum_probs=166.0 Q ss_pred HHHHHHHHHHHHH---HHHHHHHHEEEEEEEEEEEEEEEECCCEEEEECCCCCEEEEEECCCCCEEECCCEEEEECHHHH Q ss_conf 6899999999999---9999876406712589997999953873898468886899998578989336978999964899 Q gi|254780386|r 16 SLKRMTLLSIIGI---IGLITWSILLPIEISVSSSGEILNEDNVVEIKSPFSGIIKKFQVENGSQILQGTPLLTFEDIET 92 (440) Q Consensus 16 ~~~~~~~~~~~~~---~~~~~w~~~~~~~~~v~~~G~i~~~~~~~~I~s~~~G~V~~i~VkeGd~VkkGq~L~~ld~~~~ 92 (440) +.|+.+.++++++ .++..|.++..... +-.+.|. +++..|.|.++|.|.+++|++||.|+|||+|++||+.+. T Consensus 8 ~~r~~i~i~~v~~~~~~~~~~w~~~~~~p~--TddAyV~--a~iv~Vsp~VsG~V~eV~V~dnq~VkkGdvL~~ID~~~y 83 (310) T PRK10559 8 ISRTAITVVLVILAFIAIFNAWVYYTESPW--TRDARFS--ADVVAIAPDVSGLITQVNVHDNQLVKKGQVLFTIDQPRY 83 (310) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCC--CCCEEEE--EEEEEEECCCCEEEEEEECCCCCEECCCCEEEEECCHHH T ss_conf 999999999999999999999986268851--6773898--579999366746999999179498769988999876179 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCH Q ss_conf 88899987434332332100123333222114555422356888877655467886543336554200112221122100 Q gi|254780386|r 93 TDLIDLKKVSIKNLRCHIDTEKSALDFLNNKYDVLEKAFESREEQLLDFLGKYHSTCSKVFNYGISSVYRSLKMKLTRMR 172 (440) Q Consensus 93 ~~~l~~~~~~l~~l~~~l~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 172 (440) +..+++.+..+...++.+...+ .+ . .+...+..+ T Consensus 84 ~~al~qA~A~la~~qa~~~~~~---------------------~e---------~-------~R~~~l~~~--------- 117 (310) T PRK10559 84 QKALAEAEADVAYYQVLAQEKR---------------------QE---------A-------GRRNRLGVQ--------- 117 (310) T ss_pred HHHHHHHHHHHHHHHHHHHHHH---------------------HH---------H-------HHHHHHHHH--------- T ss_conf 9999999999999999999999---------------------99---------9-------999986651--------- Q ss_pred HHHHHCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 22321011223222222102333322210122222211100001110024666531012233222222222221000122 Q gi|254780386|r 173 SIYLNRLDLENKSSLLRALLSSHQKDLTMMKNLFEKNAVSQNDLNQQERIVYKSTLELRENIHEQKNMLRHINELYDEAN 252 (440) Q Consensus 173 ~~~~~~~~l~~~~~~l~~~~~~~~~el~~~~~l~~~~~~~~~~l~~~~~~~~~~~~el~~~~~~~~~~~~~l~~~~~~~~ 252 (440) ..++.++...+..+.... T Consensus 118 -------------------------------------aiS~~~ld~a~~~l~~a~------------------------- 135 (310) T PRK10559 118 -------------------------------------AMSREEIDQANNVLQTVL------------------------- 135 (310) T ss_pred -------------------------------------HCCHHHHHHHHHHHHHHH------------------------- T ss_conf -------------------------------------005999999999999999------------------------- Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCEEECCCCCCCCCCCCCCCCCCCCCEEEEE Q ss_conf 33321123322212222222212223322122222123455541168731212012221244455421033556504775 Q gi|254780386|r 253 LEFANYLKEISRDLEQNQRTFADEMSKLTILEKTREQKTILSPIAGTIVYNQSFSSSNYAQQSQPLMKIVPHSKLTYIRA 332 (440) Q Consensus 253 ~~~~~~~~e~~~~l~e~~~~l~~~~~~l~~l~~~l~~~~I~AP~~G~V~~l~~~~~G~~v~~G~~l~~Ivp~~~~l~v~~ 332 (440) .++...+..+..++-+|+++.|+||+||+|..+.+ .+|++|.+|+++++|+ +.+++||+| T Consensus 136 ------------------aql~~A~A~l~~A~l~L~~T~I~AP~dG~V~~~~v-~~Gq~V~~Gq~l~slV-~~~~~wV~A 195 (310) T PRK10559 136 ------------------HQLAKAQATRDLAKLDLERTVIRAPADGWVTNLNV-YTGEFITRGSTAVALV-KKNSFYVLA 195 (310) T ss_pred ------------------HHHHHHHHHHHHHHHHHHCCEEECCCCCEEEEEEC-CCCCEECCCCEEEEEE-ECCCEEEEE T ss_conf ------------------99999999999999887258781798867844554-8898856998589999-689669999 Q ss_pred EECHHHHHHCCCCCCEEEEEEECCCCCCCEEEEEEEEEECCCCCCCCC---------CCCC------EEEEEEEEECHHH Q ss_conf 326223110158983179998128875443999999971334446777---------6551------1899999935011 Q gi|254780386|r 333 KVTPKQIQHVKKGYTATVRFPHYADIREKKFKAIIEKIDPIISQQDNN---------LQTQ------EYYEVILKITDLA 397 (440) Q Consensus 333 ~v~~~di~~i~~Gq~v~i~~~a~~~~~~~~~~G~V~~Is~~~~~~~~~---------~~~~------~~y~v~i~~~~~~ 397 (440) ++.|+++.+|++||+|.|. +.+|.. .|.|+|.+|++.+....+. +..+ .-++|+|.+|+. T Consensus 196 NfkETqL~~ir~Gq~v~I~-~~~p~~---~f~G~V~si~~g~~~~~s~~~~~~LP~~NatgnwvKv~QRvPVrI~lD~~- 270 (310) T PRK10559 196 YMEETKLEGVRPGYRAEIT-PLGSNR---VLKGTVDSVAAGVTNASSTRDSKGMATIDSNLEWVRLAQRVPVRIRLDNQ- 270 (310) T ss_pred EECHHHCCCCCCCCEEEEE-EECCCC---EEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEEEEEEECCC- T ss_conf 7052212538999979999-917998---89999998874446632367523389968988748677874179997799- Q ss_pred CCCCCCEECCCCEEEEEEEECEEE Q ss_conf 256785152377569999922054 Q gi|254780386|r 398 FSDNNIELRNGFPAEVLFTAKNTT 421 (440) Q Consensus 398 ~~~~~~~L~~Gm~~~~~I~~~~~~ 421 (440) ++..|+.||+++|.|.|+..+ T Consensus 271 ---~~~~l~~G~sa~V~i~t~~~~ 291 (310) T PRK10559 271 ---QGNLWPAGTTATVVITGKQDR 291 (310) T ss_pred ---CCCCCCCCCEEEEEEECCCCC T ss_conf ---834126888799999388876 No 7 >COG1566 EmrA Multidrug resistance efflux pump [Defense mechanisms] Probab=100.00 E-value=2.6e-33 Score=240.64 Aligned_cols=283 Identities=18% Similarity=0.260 Sum_probs=197.8 Q ss_pred CEEEEECCCCCEEEEEECCCCCEEECCCEEEEECHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 73898468886899998578989336978999964899888999874343323321001233332221145554223568 Q gi|254780386|r 55 NVVEIKSPFSGIIKKFQVENGSQILQGTPLLTFEDIETTDLIDLKKVSIKNLRCHIDTEKSALDFLNNKYDVLEKAFESR 134 (440) Q Consensus 55 ~~~~I~s~~~G~V~~i~VkeGd~VkkGq~L~~ld~~~~~~~l~~~~~~l~~l~~~l~~~~~~l~~~~~~~~~~~~~~~~~ 134 (440) ++..|.+-++|.|.+|+|++++.|++||+|++||+.+.+..+++.+..+......+...+..+...+......+.. T Consensus 52 ~vv~Iap~VsG~V~eV~V~dnq~Vk~Gd~L~~iD~~~y~~al~qAea~la~a~~~~~~~~a~~~~~~A~i~~a~a~---- 127 (352) T COG1566 52 DVVPIAPQVSGRVTEVNVKDNQLVKKGDVLFRIDPRDYRAALEQAEAALAAAEAQLRNLRAQLASAQALIAQAEAQ---- 127 (352) T ss_pred EEEEECCCCCEEEEEEEECCCCEECCCCEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---- T ss_conf 7999837673589999916898851897689989578999999999999999999998987888888899999989---- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 88877655467886543336554200112221122100223210112232222221023333222101222222111000 Q gi|254780386|r 135 EEQLLDFLGKYHSTCSKVFNYGISSVYRSLKMKLTRMRSIYLNRLDLENKSSLLRALLSSHQKDLTMMKNLFEKNAVSQN 214 (440) Q Consensus 135 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~l~~~~~~~~~el~~~~~l~~~~~~~~~ 214 (440) .+...+.++.+...|...+..+.. T Consensus 128 --------------------------------------------------------~l~~a~~~~~R~~~L~~~g~vs~~ 151 (352) T COG1566 128 --------------------------------------------------------DLDQAQNELERRAELAQRGVVSRE 151 (352) T ss_pred --------------------------------------------------------HHHHHHHHHHHHHHHHHCCCCCHH T ss_conf --------------------------------------------------------999989888999988756744399 Q ss_pred CCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 01110024666531012233222222222221000122333211233222122222222122233221222221234555 Q gi|254780386|r 215 DLNQQERIVYKSTLELRENIHEQKNMLRHINELYDEANLEFANYLKEISRDLEQNQRTFADEMSKLTILEKTREQKTILS 294 (440) Q Consensus 215 ~l~~~~~~~~~~~~el~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~l~e~~~~l~~~~~~l~~l~~~l~~~~I~A 294 (440) ++.............+....+.. ......+.......+.++..+...+..++-.|+++.|+| T Consensus 152 ~~~~a~~a~~~A~A~~~~a~~~~------------------~~~~~~l~~~~~~~~~~v~~a~a~~~~A~l~L~~T~IrA 213 (352) T COG1566 152 ELDRARAALQAAEAALAAAQAAQ------------------KQNLALLESEVSGAQAQVASAEAALDQAKLDLERTVIRA 213 (352) T ss_pred HHHHHHHHHHHHHHHHHHHHHHH------------------HHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCEEEC T ss_conf 99988889999988887768888------------------887887742101121689999987998998864789987 Q ss_pred CCEEEEECCEEECCCCCCCCCCCCCCCCCCCCCEEEEEEECHHHHHHCCCCCCEEEEEEECCCCCCCEEEEEEEEEECCC Q ss_conf 41168731212012221244455421033556504775326223110158983179998128875443999999971334 Q gi|254780386|r 295 PIAGTIVYNQSFSSSNYAQQSQPLMKIVPHSKLTYIRAKVTPKQIQHVKKGYTATVRFPHYADIREKKFKAIIEKIDPII 374 (440) Q Consensus 295 P~~G~V~~l~~~~~G~~v~~G~~l~~Ivp~~~~l~v~~~v~~~di~~i~~Gq~v~i~~~a~~~~~~~~~~G~V~~Is~~~ 374 (440) |+||+|..+.. .+|++|.+|+|||.+|| .+.+||.+++.|+++.++++||+|+|++|+||.. +.|.|+|..|+|.+ T Consensus 214 P~dG~V~~~~v-~~G~~V~~G~~l~alVp-~~~~yV~AnFkETqL~~~r~Gq~a~I~~da~~~~--~~~~G~v~~i~~~t 289 (352) T COG1566 214 PVDGYVTNLSV-RVGQYVSAGTPLMALVP-LDSFYVVANFKETQLARVRPGQPAEITLDAYPGN--GVVEGIVEGIAPAT 289 (352) T ss_pred CCCCEEEEECC-CCCCEECCCCCEEEEEC-CCCEEEEEEEEEEECCCCCCCCEEEEEEECCCCC--EEEEEEEEEECCCC T ss_conf 78966970035-57875048973499940-4226999520011347445897089999737984--58999998744786 Q ss_pred CCCC----CCCCCC------EEEEEEEEECHHHCCCCCCEECCCCEEEEEEEECEEE Q ss_conf 4467----776551------1899999935011256785152377569999922054 Q gi|254780386|r 375 SQQD----NNLQTQ------EYYEVILKITDLAFSDNNIELRNGFPAEVLFTAKNTT 421 (440) Q Consensus 375 ~~~~----~~~~~~------~~y~v~i~~~~~~~~~~~~~L~~Gm~~~~~I~~~~~~ 421 (440) .... ..+..+ .-.+|+|.|+... .....|++||++.+.+-++... T Consensus 290 g~~fsllp~~natgN~tkvvQRvPVrI~ld~~~--~~~~~l~~g~sv~v~vd~~~~~ 344 (352) T COG1566 290 GSAFSLLPAQNATGNWTKVVQRVPVRIELDPEP--LDRHLLRGGLSVVVVVDTSTSD 344 (352) T ss_pred CCCCCCCCCCCCCCCEEEEEEEEEEEEEECCCC--HHCCCCCCCEEEEEEECCCCCC T ss_conf 530015787467888899997665899966773--1025655723799995788761 No 8 >PRK11556 multidrug efflux system subunit MdtA; Provisional Probab=99.98 E-value=1.9e-30 Score=221.55 Aligned_cols=271 Identities=12% Similarity=0.172 Sum_probs=180.0 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHEE------------------------------------------EEEEEEEEEEE Q ss_conf 686168999999999999999876406------------------------------------------71258999799 Q gi|254780386|r 12 NHPASLKRMTLLSIIGIIGLITWSILL------------------------------------------PIEISVSSSGE 49 (440) Q Consensus 12 ~~p~~~~~~~~~~~~~~~~~~~w~~~~------------------------------------------~~~~~v~~~G~ 49 (440) +.++.+||+++++++..+++..|.|.. .+...+.+.|+ T Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~V~~~~v~~~~~~~~l~~~Gt 81 (415) T PRK11556 2 KGSYKFRWVIVIVVVIAAIAAFWFWHGRSDSQTAAPGATAQAQQTPAGGRRGMRSGPLAPVQAATATEQAVPRYLTGLGT 81 (415) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEEEECEEEEEEEE T ss_conf 84115589999999999999998740477777767665566777888877667889987779999878761569999999 Q ss_pred EEECCCEEEEECCCCCEEEEEECCCCCEEECCCEEEEECHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHH Q ss_conf 99538738984688868999985789893369789999648998889998743433233210012333322211455542 Q gi|254780386|r 50 ILNEDNVVEIKSPFSGIIKKFQVENGSQILQGTPLLTFEDIETTDLIDLKKVSIKNLRCHIDTEKSALDFLNNKYDVLEK 129 (440) Q Consensus 50 i~~~~~~~~I~s~~~G~V~~i~VkeGd~VkkGq~L~~ld~~~~~~~l~~~~~~l~~l~~~l~~~~~~l~~~~~~~~~~~~ 129 (440) |.|.. ...|.+.++|+|.+++|++||.|++||+|++||+.+.+.++.+.+.++...++.++..+ T Consensus 82 v~a~~-~v~v~~~VsG~I~~i~v~~Gd~VkkGqvLa~iD~~~~~~~l~qA~A~l~~a~A~l~~A~--------------- 145 (415) T PRK11556 82 VTAAN-TVTVRSRVDGQLMALHFQEGQQVKAGDLLAEIDPSQFKVALAQAQGQLAKDQATLANAR--------------- 145 (415) T ss_pred EEEEE-EEEEEEEEEEEEEEEECCCCCEECCCCEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHH--------------- T ss_conf 98646-99998785029999985898997899999998835789999999999999998999999--------------- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCC Q ss_conf 23568888776554678865433365542001122211221002232101122322222210233332221012222221 Q gi|254780386|r 130 AFESREEQLLDFLGKYHSTCSKVFNYGISSVYRSLKMKLTRMRSIYLNRLDLENKSSLLRALLSSHQKDLTMMKNLFEKN 209 (440) Q Consensus 130 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~l~~~~~~~~~el~~~~~l~~~~ 209 (440) .++++.+.|.+++ T Consensus 146 -------------------------------------------------------------------~~~~R~~~L~~~~ 158 (415) T PRK11556 146 -------------------------------------------------------------------RDLARYQQLAKTN 158 (415) T ss_pred -------------------------------------------------------------------HHHHHHHHHHHCC T ss_conf -------------------------------------------------------------------9999999987659 Q ss_pred CCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 11000011100246665310122332222222222210001223332112332221222222221222332212222212 Q gi|254780386|r 210 AVSQNDLNQQERIVYKSTLELRENIHEQKNMLRHINELYDEANLEFANYLKEISRDLEQNQRTFADEMSKLTILEKTREQ 289 (440) Q Consensus 210 ~~~~~~l~~~~~~~~~~~~el~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~l~e~~~~l~~~~~~l~~l~~~l~~ 289 (440) ..++.++...+..+...... +...+..+..++.+|++ T Consensus 159 ~iS~~~ld~~~a~~~~A~A~-------------------------------------------l~~a~a~l~~A~~~L~~ 195 (415) T PRK11556 159 LVSRQELDAQQALVSETEGT-------------------------------------------IKADEAAVASAQLQLDW 195 (415) T ss_pred CCCHHHHHHHHHHHHHHHHH-------------------------------------------HHHHHHHHHHHHHHHCC T ss_conf 95699999899999999999-------------------------------------------99999999999875248 Q ss_pred CCCCCCCEEEEECCEEECCCCCCCCCC--CCCCCCCCCCCEEEEEEECHHHHHHCCCCCCE--EEEEEECCCCCCC-EEE Q ss_conf 345554116873121201222124445--54210335565047753262231101589831--7999812887544-399 Q gi|254780386|r 290 KTILSPIAGTIVYNQSFSSSNYAQQSQ--PLMKIVPHSKLTYIRAKVTPKQIQHVKKGYTA--TVRFPHYADIREK-KFK 364 (440) Q Consensus 290 ~~I~AP~~G~V~~l~~~~~G~~v~~G~--~l~~Ivp~~~~l~v~~~v~~~di~~i~~Gq~v--~i~~~a~~~~~~~-~~~ 364 (440) +.|+||+||+|....+ ..|++|.+|+ ++++|. +.+.+++.+.||+.|++.|..|+.+ .+.+++++....+ .+. T Consensus 196 t~I~AP~~GvV~~r~v-~~G~~V~~G~~~pl~~i~-~~~~l~v~~~vpe~di~~v~~g~~~g~~~~v~~~~~~~~~~~~~ 273 (415) T PRK11556 196 SRITAPISGRVGLKQV-DVGNQISSGDTTGIVVIT-QTHPIDLVFTLPESDIATVVQAQKAGKPLVVEAWDRTNSKKLSE 273 (415) T ss_pred CEEECCCCCEEEEEEC-CCCCEECCCCCCCEEEEE-CCCCEEEEEEECHHHHHHHHHHHHCCCCEEEEEECCCCCCEEEE T ss_conf 8998266515776885-899737778865269995-27848999997799999987654338845899863678715777 Q ss_pred EEEEEEECCCCCCCCCCCCCEEEEEEEEECHHHCCCCCCEECCCCEEEEEEEECEEE Q ss_conf 999997133444677765511899999935011256785152377569999922054 Q gi|254780386|r 365 AIIEKIDPIISQQDNNLQTQEYYEVILKITDLAFSDNNIELRNGFPAEVLFTAKNTT 421 (440) Q Consensus 365 G~V~~Is~~~~~~~~~~~~~~~y~v~i~~~~~~~~~~~~~L~~Gm~~~~~I~~~~~~ 421 (440) |+|..|++.....+ ..|.+++.+++ .+..|+|||++++.|.++... T Consensus 274 g~v~~i~~~~d~~t------~t~~v~~~~~n-----~~~~L~pG~~v~v~i~~~~~~ 319 (415) T PRK11556 274 GTLLSLDNQIDATT------GTIKLKARFNN-----QDDALFPNQFVNARMLVDTEQ 319 (415) T ss_pred EEEEEECCCCCCCC------EEEEEEEEECC-----CCCCCCCCCEEEEEEECCCCC T ss_conf 89996436224774------08999999538-----766538998899999736758 No 9 >PRK11578 macrolide transporter subunit MacA; Provisional Probab=99.97 E-value=2.6e-30 Score=220.63 Aligned_cols=258 Identities=16% Similarity=0.241 Sum_probs=173.6 Q ss_pred EEEEEEEEEEEEEEECCCEEEEECCCCCEEEEEECCCCCEEECCCEEEEECHHHHHHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 67125899979999538738984688868999985789893369789999648998889998743433233210012333 Q gi|254780386|r 38 LPIEISVSSSGEILNEDNVVEIKSPFSGIIKKFQVENGSQILQGTPLLTFEDIETTDLIDLKKVSIKNLRCHIDTEKSAL 117 (440) Q Consensus 38 ~~~~~~v~~~G~i~~~~~~~~I~s~~~G~V~~i~VkeGd~VkkGq~L~~ld~~~~~~~l~~~~~~l~~l~~~l~~~~~~l 117 (440) ..+..++.++|+|.|.. ...|.++++|+|.+++|++||.|+|||+|++||+.+.+.++.+.+..+..+++.+...+..+ T Consensus 21 ~~i~~~v~~~G~v~a~~-~~~i~a~v~G~I~~i~v~~Gd~V~kGqvL~~ld~~~~~~~l~~a~a~l~~~~a~l~~a~~~~ 99 (347) T PRK11578 21 GDLQQSVLATGKLDALR-KVDVGAQVSGQLKTLSVAIGDKVKKDQLLGVIDPEQAENQIKEVEATLMELNAQRQQAEAEL 99 (347) T ss_pred EEEEEEEEEEEEEEEEE-EEEEECCCCEEEEEEECCCCCEECCCCEEEEECHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 87259999999999838-99997407789999998993988899999998749999999999999999999999999999 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCHHHHHHH Q ss_conf 32221145554223568888776554678865433365542001122211221002232101122322222210233332 Q gi|254780386|r 118 DFLNNKYDVLEKAFESREEQLLDFLGKYHSTCSKVFNYGISSVYRSLKMKLTRMRSIYLNRLDLENKSSLLRALLSSHQK 197 (440) Q Consensus 118 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~l~~~~~~~~~ 197 (440) +. .+. T Consensus 100 ~~---------------------------------------------------------------------------a~~ 104 (347) T PRK11578 100 KL---------------------------------------------------------------------------ARV 104 (347) T ss_pred HH---------------------------------------------------------------------------HHH T ss_conf 99---------------------------------------------------------------------------899 Q ss_pred HHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 22101222222111000011100246665310122332222222222210001223332112332221222222221222 Q gi|254780386|r 198 DLTMMKNLFEKNAVSQNDLNQQERIVYKSTLELRENIHEQKNMLRHINELYDEANLEFANYLKEISRDLEQNQRTFADEM 277 (440) Q Consensus 198 el~~~~~l~~~~~~~~~~l~~~~~~~~~~~~el~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~l~e~~~~l~~~~ 277 (440) ++++.+.|.+.+..++.++...+..+.....++ ...+.++...+ T Consensus 105 ~~~r~~~L~~~~~is~~~~d~a~~~~~~~~a~l------------------------------------~~~~a~~~~~~ 148 (347) T PRK11578 105 TLSRQQRLAKTQAVSQQDLDTAATEMAVKQAQI------------------------------------GTIDAQIKRNQ 148 (347) T ss_pred HHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHH------------------------------------HHHHHHHHHHH T ss_conf 999999998757624788988787999876458------------------------------------89999999999 Q ss_pred CCCCCCCCCCCCCCCCCCCEEEEECCEEECCCCCCCCCCC---CCCCCCCCCCEEEEEEECHHHHHHCCCCCCEEEEEEE Q ss_conf 3322122222123455541168731212012221244455---4210335565047753262231101589831799981 Q gi|254780386|r 278 SKLTILEKTREQKTILSPIAGTIVYNQSFSSSNYAQQSQP---LMKIVPHSKLTYIRAKVTPKQIQHVKKGYTATVRFPH 354 (440) Q Consensus 278 ~~l~~l~~~l~~~~I~AP~~G~V~~l~~~~~G~~v~~G~~---l~~Ivp~~~~l~v~~~v~~~di~~i~~Gq~v~i~~~a 354 (440) ..+..++..++.+.|+||+||+|..... ..|++|.+|++ |++|. +.+.++|.+.||+.|+..|++||+|.|++++ T Consensus 149 a~l~~a~~~l~~t~I~AP~~GvV~~~~~-~~G~~v~~~~~~~~l~~i~-~~~~l~v~~~V~e~d~~~i~~G~~v~v~~~~ 226 (347) T PRK11578 149 ASLDTAKTNLDYTRIVAPMAGEVTQITT-LQGQTVIAAQQAPNILTLA-DMSTMLVKAQVSEADVIHLKPGQKAWFTVLG 226 (347) T ss_pred HHHHHHHHHHHCCEEECCCCCEEEEEEE-ECCCEEECCCCCCEEEEEC-CCCCEEEECCCCHHHHHHCCCCCEEEEEEEC T ss_conf 9999998765256998257844867998-3398886378874135731-6861799816667788633579769999976 Q ss_pred CCCCCCCEEEEEEEEEECCCCCCCCCCCCCEEEEEEEEECHHHCCCCCCEECCCCEEEEEEEECEEE Q ss_conf 2887544399999997133444677765511899999935011256785152377569999922054 Q gi|254780386|r 355 YADIREKKFKAIIEKIDPIISQQDNNLQTQEYYEVILKITDLAFSDNNIELRNGFPAEVLFTAKNTT 421 (440) Q Consensus 355 ~~~~~~~~~~G~V~~Is~~~~~~~~~~~~~~~y~v~i~~~~~~~~~~~~~L~~Gm~~~~~I~~~~~~ 421 (440) +|.. .|.|+|..|+|.+...+ ...+|.|++.++++ +..|+|||++++.|.+..+. T Consensus 227 ~~~~---~~~G~v~~I~~~~~~~~----~~~~~~v~i~~~~~-----~~~l~~G~~~~v~i~~~~~~ 281 (347) T PRK11578 227 DPLT---RYEGVLKDILPTPEKVN----DAIFYYARFEVPNP-----NGLLRLDMTAQVHIQLTDVK 281 (347) T ss_pred CCCC---EEEEEEEEEECCCCCCC----CEEEEEEEEECCCC-----CCCCCCCCEEEEEEEECCCC T ss_conf 9995---99999999966103433----30799999852582-----01367895799999966868 No 10 >PRK09859 multidrug efflux system protein MdtE; Provisional Probab=99.95 E-value=3.3e-26 Score=193.27 Aligned_cols=236 Identities=15% Similarity=0.196 Sum_probs=156.1 Q ss_pred EEEEEEEEEECCCEEEEECCCCCEEEEEECCCCCEEECCCEEEEECHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCC Q ss_conf 89997999953873898468886899998578989336978999964899888999874343323321001233332221 Q gi|254780386|r 43 SVSSSGEILNEDNVVEIKSPFSGIIKKFQVENGSQILQGTPLLTFEDIETTDLIDLKKVSIKNLRCHIDTEKSALDFLNN 122 (440) Q Consensus 43 ~v~~~G~i~~~~~~~~I~s~~~G~V~~i~VkeGd~VkkGq~L~~ld~~~~~~~l~~~~~~l~~l~~~l~~~~~~l~~~~~ 122 (440) ....+|++.|.. ...|.+.++|+|.+++|++||.|++||+|++||+.+.+..++..+..+...++.+...+. T Consensus 49 ~~~~~G~v~~~~-~~~l~~~VsG~v~~~~v~~Gd~VkkGq~La~LD~~~~~~~l~~A~A~l~~A~a~~~~A~~------- 120 (385) T PRK09859 49 LSELPGRTVPYE-VAEIRPQVGGIIIKRNFIEGDKVNQGDSLYQIDPAPLQAELNSAKGSLAKALSTASNARI------- 120 (385) T ss_pred EEEEEEEEEEEE-EEEEEEECCEEEEEEECCCCCEEECCCEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHH------- T ss_conf 899879999628-999981456299999838999884898899828479999999999999999999999998------- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCHHHHHHHHHHCC Q ss_conf 14555422356888877655467886543336554200112221122100223210112232222221023333222101 Q gi|254780386|r 123 KYDVLEKAFESREEQLLDFLGKYHSTCSKVFNYGISSVYRSLKMKLTRMRSIYLNRLDLENKSSLLRALLSSHQKDLTMM 202 (440) Q Consensus 123 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~l~~~~~~~~~el~~~ 202 (440) ++++. T Consensus 121 ---------------------------------------------------------------------------~~~R~ 125 (385) T PRK09859 121 ---------------------------------------------------------------------------TFNRQ 125 (385) T ss_pred ---------------------------------------------------------------------------HHHHH T ss_conf ---------------------------------------------------------------------------99999 Q ss_pred CCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 22222211100001110024666531012233222222222221000122333211233222122222222122233221 Q gi|254780386|r 203 KNLFEKNAVSQNDLNQQERIVYKSTLELRENIHEQKNMLRHINELYDEANLEFANYLKEISRDLEQNQRTFADEMSKLTI 282 (440) Q Consensus 203 ~~l~~~~~~~~~~l~~~~~~~~~~~~el~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~l~e~~~~l~~~~~~l~~ 282 (440) +.|++++..++.++...+..+.... ..+...+..+.. T Consensus 126 ~~L~~~~~iS~~~~d~a~~~~~~A~-------------------------------------------A~~~~A~a~l~~ 162 (385) T PRK09859 126 ASLLKTNYVSRQDYDTARTQLNEAE-------------------------------------------ANVTVAKAAVEQ 162 (385) T ss_pred HHHHHCCCCCHHHHHHHHHHHHHHH-------------------------------------------HHHHHHHHHHHH T ss_conf 9887658623547998999999999-------------------------------------------999999999987 Q ss_pred CCCCCCCCCCCCCCEEEEECCEEECCCCCCCCCC--CCCCCCCCCCCEEEEEE------------ECHHHHHHCCCCCCE Q ss_conf 2222212345554116873121201222124445--54210335565047753------------262231101589831 Q gi|254780386|r 283 LEKTREQKTILSPIAGTIVYNQSFSSSNYAQQSQ--PLMKIVPHSKLTYIRAK------------VTPKQIQHVKKGYTA 348 (440) Q Consensus 283 l~~~l~~~~I~AP~~G~V~~l~~~~~G~~v~~G~--~l~~Ivp~~~~l~v~~~------------v~~~di~~i~~Gq~v 348 (440) ++..|+++.|+||+||+|....+ ..|++|.+|+ +++++. +.+++++.+. +++.++..++.|++| T Consensus 163 a~~~l~~t~i~APf~GvV~~~~v-~~G~~V~~~~~~~l~~v~-~~~~v~v~~~~~~~~~l~~~~~v~~~~~~~v~~~~~v 240 (385) T PRK09859 163 ATINLQYANVTSPITGVSGKSSV-TVGALVTANQADSLVTVQ-RLDPIYVDLTQSVQDFLRMKEEVASGQIKQVQGSTPV 240 (385) T ss_pred HHHHHCCEEEECCCCCCEEEEEC-CCCCEECCCCCCEEEEEE-CCCCCEEEEECCHHHHHHHHHHHHCCHHHHCCCCCEE T ss_conf 55321226998777870410354-778776478863169997-1676589961378899888877501105531579529 Q ss_pred EEEEEECCCCCCCEEEEEEEEEECCCCCCCCCCCCCEEEEEEEEECHHHCCCCCCEECCCCEEEEEEEECEE Q ss_conf 799981288754439999999713344467776551189999993501125678515237756999992205 Q gi|254780386|r 349 TVRFPHYADIREKKFKAIIEKIDPIISQQDNNLQTQEYYEVILKITDLAFSDNNIELRNGFPAEVLFTAKNT 420 (440) Q Consensus 349 ~i~~~a~~~~~~~~~~G~V~~Is~~~~~~~~~~~~~~~y~v~i~~~~~~~~~~~~~L~~Gm~~~~~I~~~~~ 420 (440) .+.+++.+. -.+.|+|..|+|.+...+. .|.|++.+++ ++..|+|||++++.|..+.+ T Consensus 241 ~~~~~~~~~---~~~~g~v~~~~p~~d~~t~------t~~v~~~~~n-----~~~~l~pGm~~~v~i~~~~~ 298 (385) T PRK09859 241 QLNLENGKR---YSQTGTLKFSDPTVDETTG------SVTLRAIFPN-----PNGDLLPGMYVTALVDEGSR 298 (385) T ss_pred EEEECCCCC---EEEEEEEEEECCCCCCCCC------CEEEEEEECC-----CCCCCCCCCEEEEEEECCCC T ss_conf 999658982---5777889971464336666------4899999248-----76556899679999834765 No 11 >PRK09578 periplasmic multidrug efflux lipoprotein precursor; Reviewed Probab=99.95 E-value=7.7e-26 Score=190.79 Aligned_cols=239 Identities=14% Similarity=0.168 Sum_probs=155.8 Q ss_pred EEEEEEEEEEECCCEEEEECCCCCEEEEEECCCCCEEECCCEEEEECHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 58999799995387389846888689999857898933697899996489988899987434332332100123333222 Q gi|254780386|r 42 ISVSSSGEILNEDNVVEIKSPFSGIIKKFQVENGSQILQGTPLLTFEDIETTDLIDLKKVSIKNLRCHIDTEKSALDFLN 121 (440) Q Consensus 42 ~~v~~~G~i~~~~~~~~I~s~~~G~V~~i~VkeGd~VkkGq~L~~ld~~~~~~~l~~~~~~l~~l~~~l~~~~~~l~~~~ 121 (440) .++...|+|.+... ..|.+.++|+|.+++|++||.|++||+|++||+.+.+..++..+..+...++.+...+. T Consensus 50 ~~~~~~G~v~a~~~-v~l~~~v~G~v~~v~v~~Gd~V~kGq~La~lD~~~~~a~~~~a~a~l~~A~a~~~~A~~------ 122 (385) T PRK09578 50 MTVELPGRLDAYRQ-AEVRARVAGIVTARTYEEGQEVKQGAVLFRIDPAPLKAARDAAAGALAKAEAAHLAAAD------ 122 (385) T ss_pred EEEEEEEEEEEEEE-EEEEEECCEEEEEEEECCCCEECCCCEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHH------ T ss_conf 69999999998799-99996644699999807999873898899985346999999999999999999999999------ Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCHHHHHHHHHHC Q ss_conf 11455542235688887765546788654333655420011222112210022321011223222222102333322210 Q gi|254780386|r 122 NKYDVLEKAFESREEQLLDFLGKYHSTCSKVFNYGISSVYRSLKMKLTRMRSIYLNRLDLENKSSLLRALLSSHQKDLTM 201 (440) Q Consensus 122 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~l~~~~~~~~~el~~ 201 (440) ++++ T Consensus 123 ----------------------------------------------------------------------------~~~R 126 (385) T PRK09578 123 ----------------------------------------------------------------------------KRRR 126 (385) T ss_pred ----------------------------------------------------------------------------HHHH T ss_conf ----------------------------------------------------------------------------9999 Q ss_pred CCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 12222221110000111002466653101223322222222222100012233321123322212222222212223322 Q gi|254780386|r 202 MKNLFEKNAVSQNDLNQQERIVYKSTLELRENIHEQKNMLRHINELYDEANLEFANYLKEISRDLEQNQRTFADEMSKLT 281 (440) Q Consensus 202 ~~~l~~~~~~~~~~l~~~~~~~~~~~~el~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~l~e~~~~l~~~~~~l~ 281 (440) .+.|.+++..++.++...+...... +.++...+.++. T Consensus 127 ~~~L~~~~~is~~~~~~a~~~~~~a-------------------------------------------~a~l~~A~a~l~ 163 (385) T PRK09578 127 YDDLVRDRAVSERDHTEAVADERQA-------------------------------------------KAEVASAKAELA 163 (385) T ss_pred HHHHHHCCCCCHHHHHHHHHHHHHH-------------------------------------------HHHHHHHHHHHH T ss_conf 9998863663267899999999999-------------------------------------------999999999999 Q ss_pred CCCCCCCCCCCCCCCEEEEECCEEECCCCCCCCCC--CCCCCCCCCCCEEEEEEECHHHHHHCCCC-----------CCE Q ss_conf 12222212345554116873121201222124445--54210335565047753262231101589-----------831 Q gi|254780386|r 282 ILEKTREQKTILSPIAGTIVYNQSFSSSNYAQQSQ--PLMKIVPHSKLTYIRAKVTPKQIQHVKKG-----------YTA 348 (440) Q Consensus 282 ~l~~~l~~~~I~AP~~G~V~~l~~~~~G~~v~~G~--~l~~Ivp~~~~l~v~~~v~~~di~~i~~G-----------q~v 348 (440) .++.+|+++.|+||+||+|....+ ..|++|.+|+ |+++|. +.+.+++.+.+|+.++..++.| ..+ T Consensus 164 ~A~~~l~~t~i~AP~~G~V~~~~v-~~G~~v~~~~~~~l~~i~-~~~~~~v~~~~p~~~v~~~~~~~~~g~~~~~~~~~~ 241 (385) T PRK09578 164 RAQLQLDYATVTAPIDGRARRALV-TEGALVGQDQATPLTTVE-QLDPIYVNFSQPAADVEALRRAVKSGRATGIAQQDV 241 (385) T ss_pred HHHHHHHCCEECCCCCCEEEEEEC-CCCCEECCCCCCCEEEEE-ECCEEEEEEECCHHHHHHHHHHHHCCCCCCCCCCCE T ss_conf 888754103523897745888881-788656688873149998-266399999517999999987654143112356753 Q ss_pred EEEEEECCCCCCCEEEEEEEEEECCCCCCCCCCCCCEEEEEEEEECHHHCCCCCCEECCCCEEEEEEEECEE Q ss_conf 799981288754439999999713344467776551189999993501125678515237756999992205 Q gi|254780386|r 349 TVRFPHYADIREKKFKAIIEKIDPIISQQDNNLQTQEYYEVILKITDLAFSDNNIELRNGFPAEVLFTAKNT 420 (440) Q Consensus 349 ~i~~~a~~~~~~~~~~G~V~~Is~~~~~~~~~~~~~~~y~v~i~~~~~~~~~~~~~L~~Gm~~~~~I~~~~~ 420 (440) .+++...+...| .+.|+|..|++.....+ ..|.|++.++++ +..|+|||++++.|.++.. T Consensus 242 ~v~~~~~~g~~~-~~~g~v~~~~~~vd~~t------rt~~v~~~~~np-----~~~L~pGm~~~v~i~~~~~ 301 (385) T PRK09578 242 AVTLVRADGSEY-PLKGKLLFSDLAVDPST------DTVAMRALFPNP-----ERELLPGAYVRIALDRAVD 301 (385) T ss_pred EEEEECCCCCCC-CEEEEEEEEECCCCCCC------CEEEEEEECCCC-----CCCCCCCCEEEEEEECCCC T ss_conf 799992699723-14689998532314776------529999965787-----5545899689999954766 No 12 >TIGR02971 heterocyst_DevB ABC exporter membrane fusion protein, DevB family; InterPro: IPR014315 Members of this protein are found mostly in the Cyanobacteria, but also in the Planctomycetes. DevB from Anabaena sp. (strain PCC 7120) is partially characterised as a membrane fusion protein of the DevBCA ABC exporter, probably a glycolipid exporter required for heterocyst formation. Most Cyanobacteria have one member only, but Nostoc sp. which has seven members.. Probab=99.95 E-value=4.7e-29 Score=212.22 Aligned_cols=329 Identities=17% Similarity=0.245 Sum_probs=239.2 Q ss_pred EEEEEEEEECCCEEEEECCCC-----CEEEEEE-CCCCCEEECCCEEEEECHHH-HHHHHHHHHHHHHHHHHH-HHCCCC Q ss_conf 999799995387389846888-----6899998-57898933697899996489-988899987434332332-100123 Q gi|254780386|r 44 VSSSGEILNEDNVVEIKSPFS-----GIIKKFQ-VENGSQILQGTPLLTFEDIE-TTDLIDLKKVSIKNLRCH-IDTEKS 115 (440) Q Consensus 44 v~~~G~i~~~~~~~~I~s~~~-----G~V~~i~-VkeGd~VkkGq~L~~ld~~~-~~~~l~~~~~~l~~l~~~-l~~~~~ 115 (440) |.+.|++.|.|++..|.+|.+ -+|.++| |++||.|++||+|+.||+.. ..+.++..+.+++..+.+ +...+. T Consensus 1 vtALGRLeP~G~v~~v~aP~~Gf~g~~Ri~~LlPV~eGD~V~~Gq~lA~Ld~~~~r~a~l~~A~~qL~~A~~~~~A~~ra 80 (363) T TIGR02971 1 VTALGRLEPEGEVVAVAAPSSGFAGTDRIKKLLPVAEGDRVKAGQVLAELDSRPERRAELDVARTQLDEAKARILAQVRA 80 (363) T ss_pred CCCCCCCCCCCCEEEEECCCCCCCCCCHHHHHCCCCCCCCCCCCCEEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 92344005678668841478888775315550677888701157536764570579999999998999999999999987 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCHHHHH Q ss_conf 33322211455542235688887765546788654333655420011222112210022321011223222222102333 Q gi|254780386|r 116 ALDFLNNKYDVLEKAFESREEQLLDFLGKYHSTCSKVFNYGISSVYRSLKMKLTRMRSIYLNRLDLENKSSLLRALLSSH 195 (440) Q Consensus 116 ~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~l~~~~~~~ 195 (440) ........ ...+.. .+ ...........+.......+..++.++... T Consensus 81 g~k~~~~a-----------~~~~kq-----ae------------------t~~~~~~~~~~~~aA~~A~l~rl~~e~~~A 126 (363) T TIGR02971 81 GAKKGEIA-----------AQRAKQ-----AE------------------TVARAAAKLVKDVAAQQATLNRLEAELETA 126 (363) T ss_pred HCCHHHHH-----------HHHHHH-----HH------------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 21634899-----------998776-----63------------------089999888888742014699999999987 Q ss_pred HHHHHCCCCCCCCCC----CCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC-CCCCCCCC Q ss_conf 322210122222211----1000011100246665310122332222222222210001223332112332-22122222 Q gi|254780386|r 196 QKDLTMMKNLFEKNA----VSQNDLNQQERIVYKSTLELRENIHEQKNMLRHINELYDEANLEFANYLKEI-SRDLEQNQ 270 (440) Q Consensus 196 ~~el~~~~~l~~~~~----~~~~~l~~~~~~~~~~~~el~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~-~~~l~e~~ 270 (440) +.++.+++++..++. .+.+.|....-.+.....+|.+..+........+.+.+.+.+..+....+++ ...+...+ T Consensus 127 ~re~~Ry~~L~~~GARqRLvS~~~lD~~~L~~~~~~~~L~~a~A~~~~~q~TL~~~~~~~~~a~~~~~e~v~~~dv~l~~ 206 (363) T TIGR02971 127 QRELDRYRSLFRDGARQRLVSASDLDSKKLKLRTAEEELEEAKASLSRTQETLAEQIDEARAALASIAEEVRETDVDLAQ 206 (363) T ss_pred HHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 88999999996488210147600333489999999999999987688778999999998899999999877773199997 Q ss_pred CCCC--CCCCCCCCCCCCCCCCCCCC----CCEEEEECCEEECCCCCCCCCCCCCCCCCCCCCEEEEEEECHHHHHHCC- Q ss_conf 2221--22233221222221234555----4116873121201222124445542103355650477532622311015- Q gi|254780386|r 271 RTFA--DEMSKLTILEKTREQKTILS----PIAGTIVYNQSFSSSNYAQQSQPLMKIVPHSKLTYIRAKVTPKQIQHVK- 343 (440) Q Consensus 271 ~~l~--~~~~~l~~l~~~l~~~~I~A----P~~G~V~~l~~~~~G~~v~~G~~l~~Ivp~~~~l~v~~~v~~~di~~i~- 343 (440) .++. ...+.++.++..++.+.||| |.+|+|+.++. ..|+.+. -.+|+.|. +++.|++-|.|+++||++|+ T Consensus 207 aev~llsa~~~~~~A~~~le~t~~~aRLvaP~~G~vLKi~~-~~Ge~~~-~~Gil~~G-~T~~M~aVAEVYe~Di~Rv~G 283 (363) T TIGR02971 207 AEVKLLSAAEAVQQAEALLELTYVKARLVAPIDGRVLKIHA-REGEVIG-SEGILELG-DTSQMYAVAEVYETDINRVRG 283 (363) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEECC-CCCCCCC-CHHHHEEC-CCCCCEEEEEEEECCHHHCEE T ss_conf 37899999999999999987676777630789876887557-2276578-30341102-678877999886221000010 Q ss_pred --CCCCEEEEEE--ECCCCCCCEEEEEEE---EEECCCCCC------CCCCCCCEEEEEEEEEC-HHHCCCCCCEECCCC Q ss_conf --8983179998--128875443999999---971334446------77765511899999935-011256785152377 Q gi|254780386|r 344 --KGYTATVRFP--HYADIREKKFKAIIE---KIDPIISQQ------DNNLQTQEYYEVILKIT-DLAFSDNNIELRNGF 409 (440) Q Consensus 344 --~Gq~v~i~~~--a~~~~~~~~~~G~V~---~Is~~~~~~------~~~~~~~~~y~v~i~~~-~~~~~~~~~~L~~Gm 409 (440) +||+|+|+.. ||+ |++.|+|. .|++..... +...-.-.+++|+|.+| ..... ...=..|| T Consensus 284 Wv~GQ~A~i~S~NgAl~----~pL~G~V~sea~vg~~vak~~v~~~dP~aD~DaRVvEVkI~lDnp~~S~--~v~rlt~l 357 (363) T TIGR02971 284 WVVGQRATITSTNGALS----GPLRGTVRSEARVGSLVAKNDVLSTDPAADVDARVVEVKIRLDNPADSE--RVSRLTNL 357 (363) T ss_pred EECCCCEEEEECCCCCC----CCCCCCEEECCEEEHHEEHHHCCCCCCCHHHCCEEEEEEEEECCCCCCH--HHHCCCCC T ss_conf 20375179986288987----2027544220000000110001788871333843899998626853522--44122046 Q ss_pred EEEEEE Q ss_conf 569999 Q gi|254780386|r 410 PAEVLF 415 (440) Q Consensus 410 ~~~~~I 415 (440) .|+|.| T Consensus 358 QV~VaI 363 (363) T TIGR02971 358 QVDVAI 363 (363) T ss_pred EEEEEC T ss_conf 135629 No 13 >PRK09783 copper/silver efflux system membrane fusion protein CusB; Provisional Probab=99.93 E-value=6e-24 Score=178.17 Aligned_cols=118 Identities=15% Similarity=0.177 Sum_probs=103.4 Q ss_pred CCCCCCCCCEEEEECCEEECCCCCCCCCCCCCCCCCCCCCEEEEEEECHHHHHHCCCCCCEEEEEEECCCCCCCEEEEEE Q ss_conf 12345554116873121201222124445542103355650477532622311015898317999812887544399999 Q gi|254780386|r 288 EQKTILSPIAGTIVYNQSFSSSNYAQQSQPLMKIVPHSKLTYIRAKVTPKQIQHVKKGYTATVRFPHYADIREKKFKAII 367 (440) Q Consensus 288 ~~~~I~AP~~G~V~~l~~~~~G~~v~~G~~l~~Ivp~~~~l~v~~~v~~~di~~i~~Gq~v~i~~~a~~~~~~~~~~G~V 367 (440) .+..|+||+||+|...++ ..|++|.+|++|++|+ |.+.+||++.||+.|++.|++||+|.|++++||.. .|.|+| T Consensus 206 ~~~ti~AP~sGvV~~~~v-~~G~~V~~G~~Lf~Ia-DLs~vWv~a~V~E~d~~~vk~Gq~v~v~~~a~Pg~---~f~G~V 280 (407) T PRK09783 206 TRFTLKAPIDGVITAFDL-RAGMNIAKDNVVAKIQ-GMDPVWVTAAIPESIAWLVKDASQFTLTVPARPDK---TLTIRK 280 (407) T ss_pred CEEEEECCCCCEEEEEEC-CCCCEECCCCEEEEEE-CCCEEEEEEEECHHHHHHCCCCCEEEEEECCCCCC---CEEEEE T ss_conf 338997688807999744-7895667998069996-47728999996278686456898899983268996---125899 Q ss_pred EEEECCCCCCCCCCCCCEEEEEEEEECHHHCCCCCCEECCCCEEEEEEEECEEE Q ss_conf 997133444677765511899999935011256785152377569999922054 Q gi|254780386|r 368 EKIDPIISQQDNNLQTQEYYEVILKITDLAFSDNNIELRNGFPAEVLFTAKNTT 421 (440) Q Consensus 368 ~~Is~~~~~~~~~~~~~~~y~v~i~~~~~~~~~~~~~L~~Gm~~~~~I~~~~~~ 421 (440) .+|+|.....+.+ ++|++.+++ ++..|+|||.+++.|.++.+. T Consensus 281 ~~I~P~vD~~TRT------~~VRv~l~N-----~d~~LkPGMfa~v~i~~~~~~ 323 (407) T PRK09783 281 WTLLPGVDAATRT------LQLRLEVDN-----ADEALKPGMNAWLQLNTASEP 323 (407) T ss_pred EEEECCCCCCCCE------EEEEEEECC-----CCCCCCCCCEEEEEECCCCCC T ss_conf 9980455777754------999999549-----987308998899998248886 No 14 >TIGR00999 8a0102 Membrane Fusion Protein cluster 2 protein; InterPro: IPR005695 Gram-negative bacteria have evolved transport complexes that export macromolecules and toxic substances such as heavy metals across the two membranes of the cell envelope in a single energy coupled step. The process requires (1) a cytoplasmic membrane export system, (2) a membrane fusion protein (MFP), and (3) an outer membrane factor. Phylogenetic analyses reveal that the MFPs cluster in accordance with the type of cytoplasmic membrane transport systems with which they function - proteins in this family are associated with reisistance to heavy metal poisoning. ; GO: 0046873 metal ion transmembrane transporter activity, 0030001 metal ion transport. Probab=99.75 E-value=2e-17 Score=134.62 Aligned_cols=117 Identities=22% Similarity=0.354 Sum_probs=98.2 Q ss_pred CCCCCCCCCCEEEEECCEEECCCCCCCCCCCCCCCCCCCCCEEEEEEECHHHHHHCCCCCCEEEEEEECCCCCCCEEEEE Q ss_conf 21234555411687312120122212444554210335565047753262231101589831799981288754439999 Q gi|254780386|r 287 REQKTILSPIAGTIVYNQSFSSSNYAQQSQPLMKIVPHSKLTYIRAKVTPKQIQHVKKGYTATVRFPHYADIREKKFKAI 366 (440) Q Consensus 287 l~~~~I~AP~~G~V~~l~~~~~G~~v~~G~~l~~Ivp~~~~l~v~~~v~~~di~~i~~Gq~v~i~~~a~~~~~~~~~~G~ 366 (440) +.-+.+|||+||.|+.-.+ +.|++|.+|++++.|. +.+.+||++.||+.|.+.|+.|+++.|.+..- ...|+|+ T Consensus 102 ~sy~evrsP~~G~i~~k~v-~~G~~v~~~~~~f~~~-~~~~~~v~~~v~~~~~~~i~~G~~a~v~~~~~----g~~~~~r 175 (284) T TIGR00999 102 LSYAEVRSPFDGYIVEKSV-TLGDYVAPQAELFRVA-DLGAVWVEAEVPAKYVSRIKKGSKATVLLEND----GKKLPAR 175 (284) T ss_pred EHHHHHCCCCCCEEEEEEE-ECCCCCCCCCCEEEEE-CCCEEEEEEEECHHHHHHHHCCCEEEEEECCC----CCEEEEE T ss_conf 0022211886878999974-1172006897337776-47169999997111366662698048988368----9622169 Q ss_pred EEEEECCCCCCCCCCCCCEEEEEEEEECHHHCCCCCCEECCCCEEEEEEEECEE Q ss_conf 999713344467776551189999993501125678515237756999992205 Q gi|254780386|r 367 IEKIDPIISQQDNNLQTQEYYEVILKITDLAFSDNNIELRNGFPAEVLFTAKNT 420 (440) Q Consensus 367 V~~Is~~~~~~~~~~~~~~~y~v~i~~~~~~~~~~~~~L~~Gm~~~~~I~~~~~ 420 (440) |.+|+|.....+. .++|++.+++ ++..|+|||.+++.+.+... T Consensus 176 V~~v~p~v~~~~R------~~kVr~~~~n-----~~~~lk~G~fV~v~~~~~~~ 218 (284) T TIGR00999 176 VDYVGPEVDKSSR------TAKVRVLLKN-----ENLKLKPGLFVKVEVETKIE 218 (284) T ss_pred EEEEEEEECCCCC------EEEEEEEEEC-----CCCEEECCCCEEEEEECCCC T ss_conf 9997001366535------8999999978-----98646037633899730223 No 15 >COG0845 AcrA Membrane-fusion protein [Cell envelope biogenesis, outer membrane] Probab=99.74 E-value=4.3e-16 Score=125.75 Aligned_cols=76 Identities=14% Similarity=0.267 Sum_probs=60.9 Q ss_pred HHHEEEEEEEEEEEEEEEECCCEEEEECCCCCEEEEEECCCCCEEECCCEEEEECH-HHHHHHHHHHHHHHHHHHHHH Q ss_conf 76406712589997999953873898468886899998578989336978999964-899888999874343323321 Q gi|254780386|r 34 WSILLPIEISVSSSGEILNEDNVVEIKSPFSGIIKKFQVENGSQILQGTPLLTFED-IETTDLIDLKKVSIKNLRCHI 110 (440) Q Consensus 34 w~~~~~~~~~v~~~G~i~~~~~~~~I~s~~~G~V~~i~VkeGd~VkkGq~L~~ld~-~~~~~~l~~~~~~l~~l~~~l 110 (440) |..+.+++..+.+.|.+.| .....|.++.+|.|.+++|++||.|++||+|+.+|+ ...+..+...+.++...++.+ T Consensus 45 ~~~~~~~~~~~~~~G~v~~-~~~~~v~~~~~G~v~~i~v~~G~~Vk~Gq~L~~ld~~~~~~~~~~~~~~~~~~~~a~~ 121 (372) T COG0845 45 KVKPAELSAAVRAPGRVEA-TRSVEVLARVAGIVAEILVKEGDRVKKGQLLARLDPSAVLQAALDQAEAQLARAQALL 121 (372) T ss_pred EEEECCCCCEEEEEEEEEE-EEEEEEECCCCCEEEEEEECCCCEEECCCEEEEECCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 8641122214787569986-1347874036747989983689888359748995532678888999999999999987 No 16 >pfam00529 HlyD HlyD family secretion protein. Probab=99.59 E-value=9.1e-15 Score=116.87 Aligned_cols=56 Identities=18% Similarity=0.373 Sum_probs=48.5 Q ss_pred EEEEECCCCCEEEEEECCCCCEEECCCEEEEECHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 38984688868999985789893369789999648998889998743433233210 Q gi|254780386|r 56 VVEIKSPFSGIIKKFQVENGSQILQGTPLLTFEDIETTDLIDLKKVSIKNLRCHID 111 (440) Q Consensus 56 ~~~I~s~~~G~V~~i~VkeGd~VkkGq~L~~ld~~~~~~~l~~~~~~l~~l~~~l~ 111 (440) +..|.|.++|+|.+++|++||.|++||+|++||+...+.+++..+.++...++.+. T Consensus 2 ~v~V~~~vsG~V~~i~V~~Gd~VkkGq~L~~lD~~~~~~~~~~a~a~l~~a~a~~~ 57 (304) T pfam00529 2 VAEVQPQVSGIVIRILVKEGDRVKAGDVLVRLDPTPYQAALDRAEAELAAAQADAQ 57 (304) T ss_pred EEEECCCCCEEEEEEECCCCCEECCCCEEEEECCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 66881789879999986890998899989999868999999999999999999999 No 17 >TIGR01730 RND_mfp efflux transporter, RND family, MFP subunit; InterPro: IPR006143 Gram-negative bacteria produce a number of proteins which are secreted into the growth medium by a mechanism that does not require a cleaved N-terminal signal sequence. These proteins, while having different functions, require the help of two or more proteins for their secretion across the cell envelope. These secretion proteins include members belonging to the ABC transporter family (see the relevant entry IPR003439 from INTERPRO) and a protein belonging to a family which includes the following members , , , , : Hemolysin secretion protein D from Escherichia coli. Lactococcin A secretion protein lcnD from Lactococcus lactis. RTX-I toxin determinant D from Actinobacillus pleuropneumoniae. Calmodulin-sensitive adenylate cyclase-hemolysin (cyclolysin) cyaD from Bordetella pertussis. Colicin Vsecretion protein cvaA from Escherichia coli. Proteases secretion protein prtE from Erwinia chrysanthemi. Alkaline protease secretion protein aprE from Pseudomonas aeruginosa. Multidrug resistance protein A from Escherichia coli. The secretion proteins are evolutionary related and consist of from 390 to 480 amino acid residues. They seem to be anchored in the inner membrane by a N-terminal transmembrane region. Their exact role in the secretion process is not yet known.; GO: 0008565 protein transporter activity, 0009306 protein secretion, 0016020 membrane. Probab=99.38 E-value=7.6e-12 Score=97.39 Aligned_cols=242 Identities=19% Similarity=0.287 Sum_probs=163.5 Q ss_pred EEEEEEEEEEEEECCCEEEEECCCCCEEEEEECCCCCEEECCCEEEEECHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 12589997999953873898468886899998578989336978999964899888999874343323321001233332 Q gi|254780386|r 40 IEISVSSSGEILNEDNVVEIKSPFSGIIKKFQVENGSQILQGTPLLTFEDIETTDLIDLKKVSIKNLRCHIDTEKSALDF 119 (440) Q Consensus 40 ~~~~v~~~G~i~~~~~~~~I~s~~~G~V~~i~VkeGd~VkkGq~L~~ld~~~~~~~l~~~~~~l~~l~~~l~~~~~~l~~ 119 (440) +.......|.+.+.... .+.+..+|.+..+.+++|+.|++|++|+.+|+...+..++....++...++.+...+... T Consensus 11 ~~~~~~~~~~~~~~~~~-~~~~~~~g~~~~~~~~~g~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 87 (333) T TIGR01730 11 LALSLTFPGSLEALDEA-ELAPEVAGKLTKLLVREGQKVKKGQLLARLDDDDYRLALDAALAQLAAAEAQLELAEVSL-- 87 (333) T ss_pred CCCCCEEEEEECCCCHH-HHHHHCCCEEEEEECCCCCCCCCCCEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHH-- T ss_conf 12220000332010001-001110440022110036420136245765545788888877777888888998888888-- Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCHHHHHHHHH Q ss_conf 22114555422356888877655467886543336554200112221122100223210112232222221023333222 Q gi|254780386|r 120 LNNKYDVLEKAFESREEQLLDFLGKYHSTCSKVFNYGISSVYRSLKMKLTRMRSIYLNRLDLENKSSLLRALLSSHQKDL 199 (440) Q Consensus 120 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~l~~~~~~~~~el 199 (440) T Consensus 88 -------------------------------------------------------------------------------- 87 (333) T TIGR01730 88 -------------------------------------------------------------------------------- 87 (333) T ss_pred -------------------------------------------------------------------------------- T ss_conf -------------------------------------------------------------------------------- Q ss_pred HCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 10122222211100001110024666531012233222222222221000122333211233222122222222122233 Q gi|254780386|r 200 TMMKNLFEKNAVSQNDLNQQERIVYKSTLELRENIHEQKNMLRHINELYDEANLEFANYLKEISRDLEQNQRTFADEMSK 279 (440) Q Consensus 200 ~~~~~l~~~~~~~~~~l~~~~~~~~~~~~el~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~e~~~~l~e~~~~l~~~~~~ 279 (440) .....+..........+...... .............. T Consensus 88 ~~~~~l~~~~~~~~~~~~~~~~~-------------------------------------------~~~~~~~~~~~~~~ 124 (333) T TIGR01730 88 ERAERLLKRSAVSQADLDDAKAA-------------------------------------------LEAARADLEAAKAS 124 (333) T ss_pred HHHHHHHHHHCCCHHHHHHHHHH-------------------------------------------HHHHHHHHHHHHHH T ss_conf 88888775301105677778888-------------------------------------------88777666666667 Q ss_pred CCCCCCCCCCCCCCCCCEEEEECCEEECCCCCCCCC--CCCCCCCCCCCCEEEEEEECHHHHHHCCCC-----CCEEEEE Q ss_conf 221222221234555411687312120122212444--554210335565047753262231101589-----8317999 Q gi|254780386|r 280 LTILEKTREQKTILSPIAGTIVYNQSFSSSNYAQQS--QPLMKIVPHSKLTYIRAKVTPKQIQHVKKG-----YTATVRF 352 (440) Q Consensus 280 l~~l~~~l~~~~I~AP~~G~V~~l~~~~~G~~v~~G--~~l~~Ivp~~~~l~v~~~v~~~di~~i~~G-----q~v~i~~ 352 (440) +...+..+..+.|+||+||+|....+ ..|.++.+| .+++++. +.+.+.+...+|+.+...++.| +...+.+ T Consensus 125 ~~~~~~~~~~~~~~~p~~g~~~~~~~-~~g~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 202 (333) T TIGR01730 125 LASARLNLSYTEIRAPFDGTIGRRLV-EVGEYVSAGQATPLLTLV-DLDPLEADFSVPERDLPQLRRGAPGGWQKLTVVL 202 (333) T ss_pred HHHHHHHHHHCEEECCCCCEEEEEEC-CCCCCCCCCCCCEEEEEE-ECCEEEEEEECCHHHHHHHHCCCCCCCCCCEEEE T ss_conf 88888754301552343430111110-223311355541146764-1441689985133455444124444543311465 Q ss_pred EECCCCCCCEEEEEEEEEECCCCCCCCCCCCCEEEEEEEEECHHHCCCCCCEECCCCEEEEEEEECEEE Q ss_conf 812887544399999997133444677765511899999935011256785152377569999922054 Q gi|254780386|r 353 PHYADIREKKFKAIIEKIDPIISQQDNNLQTQEYYEVILKITDLAFSDNNIELRNGFPAEVLFTAKNTT 421 (440) Q Consensus 353 ~a~~~~~~~~~~G~V~~Is~~~~~~~~~~~~~~~y~v~i~~~~~~~~~~~~~L~~Gm~~~~~I~~~~~~ 421 (440) +.+|.. ...+.|++..+++....... .|.+.+.+++ .+..|.|||.+.+.+..+.+. T Consensus 203 ~~~~~~-~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~-----~~~~~~~g~~~~~~~~~~~~~ 259 (333) T TIGR01730 203 DALPGT-GEEFKGKLRFIDPRVDAGTG------TLRVRALFPN-----PDGALLPGMFGRVTLLLGERS 259 (333) T ss_pred ECCCCC-CEEEEEEEEEECCCCCCCCC------EEEEEEEECC-----CCCCCCCCCEEEEEEEECCCC T ss_conf 136765-21445544431123465542------3799998537-----654201562478998403566 No 18 >PRK11637 hypothetical protein; Provisional Probab=97.53 E-value=1.3e-05 Score=55.70 Aligned_cols=21 Identities=10% Similarity=0.004 Sum_probs=10.8 Q ss_pred CCCCEEEEEEEECEEEHHHHH Q ss_conf 237756999992205479999 Q gi|254780386|r 406 RNGFPAEVLFTAKNTTLAQEI 426 (440) Q Consensus 406 ~~Gm~~~~~I~~~~~~~~~~~ 426 (440) -|++..++......-.++.|| T Consensus 382 g~~LhFEir~~g~~vnP~~wL 402 (404) T PRK11637 382 RPSLYFEIRRQGQAVNPQPWL 402 (404) T ss_pred CCEEEEEEEECCEEECHHHHH T ss_conf 965899999899984758862 No 19 >PRK10476 multidrug resistance protein MdtN; Provisional Probab=97.51 E-value=0.0018 Score=41.50 Aligned_cols=123 Identities=10% Similarity=0.116 Sum_probs=70.5 Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHE-EEEEEEEEEEEEEEE-----CCCEEEEECCCCCEEEEEECCCCCE Q ss_conf 3457633568616899999999999999987640-671258999799995-----3873898468886899998578989 Q gi|254780386|r 4 FMWKNLKRNHPASLKRMTLLSIIGIIGLITWSIL-LPIEISVSSSGEILN-----EDNVVEIKSPFSGIIKKFQVENGSQ 77 (440) Q Consensus 4 ~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~w~~~-~~~~~~v~~~G~i~~-----~~~~~~I~s~~~G~V~~i~VkeGd~ 77 (440) ..-.|-+++-.+.+.++++++.++++++..|... .+...-..+.|.+++ +|.+..|--..+ =.|+.||. T Consensus 3 ~~~~p~~~~r~~~i~~~~~~~~~~~~~~~~~~~~~~~~TddA~V~~~iv~Is~~VsG~V~~V~V~eg-----q~VkkGq~ 77 (348) T PRK10476 3 MAGTPKKAPRAKVIALAIVALALVALVYVIWRTDSAPSTDDAYIDADVVHVASEVGGRIVELAVTEN-----QAVKKGDL 77 (348) T ss_pred CCCCCCCCCCCEEHHHHHHHHHHHHHHHHHEECCCEEECCCEEEEEEEEEEECCCCEEEEEEEECCC-----CEECCCCE T ss_conf 6789998741122789999999999876513246401436259997999993434459999994699-----98648978 Q ss_pred EECCC-EEEEECHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHH Q ss_conf 33697-8999964899888999874343323321001233332221145554223 Q gi|254780386|r 78 ILQGT-PLLTFEDIETTDLIDLKKVSIKNLRCHIDTEKSALDFLNNKYDVLEKAF 131 (440) Q Consensus 78 VkkGq-~L~~ld~~~~~~~l~~~~~~l~~l~~~l~~~~~~l~~~~~~~~~~~~~~ 131 (440) .-.=| .-+++.-.++++.+...+.++...+..+...+........+....+..+ T Consensus 78 L~~LD~~~~~~~v~~A~A~l~~a~a~l~~~~~~~~~~~a~~~~a~a~~~~A~a~~ 132 (348) T PRK10476 78 LFRIDPRPYELTVAQAQADLALADAQLDTTQRSVDAERSNAASANEQVERARANA 132 (348) T ss_pred EEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9997768999999999999999999999998788888989999999999999999 No 20 >TIGR01843 type_I_hlyD type I secretion membrane fusion protein, HlyD family; InterPro: IPR010129 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). Type I secretion is an ABC transporter that exports proteins, without cleavage of any signal sequence, from the cytosol to extracellular medium across both inner and outer membranes. The secretion signal is found in the C terminus of the transported protein. This entry represents the adaptor protein between the ATP-binding cassette (ABC) protein of the inner membrane and the outer membrane protein, and is called the membrane fusion protein. This entry selects a group of sequences closely related to HlyD; it is defined narrowly and excludes, for example, colicin V secretion protein CvaA and multidrug efflux proteins.; GO: 0015031 protein transport, 0009276 1-2nm peptidoglycan-based cell wall, 0016021 integral to membrane. Probab=97.32 E-value=6.9e-05 Score=50.94 Aligned_cols=348 Identities=13% Similarity=0.132 Sum_probs=154.4 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHEEEE---EEEEEEEEEEEE-----CCCEEEEECCCCCEEEEEECCCCCEEECCC Q ss_conf 568616899999999999999987640671---258999799995-----387389846888689999857898933697 Q gi|254780386|r 11 RNHPASLKRMTLLSIIGIIGLITWSILLPI---EISVSSSGEILN-----EDNVVEIKSPFSGIIKKFQVENGSQILQGT 82 (440) Q Consensus 11 ~~~p~~~~~~~~~~~~~~~~~~~w~~~~~~---~~~v~~~G~i~~-----~~~~~~I~s~~~G~V~~i~VkeGd~VkkGq 82 (440) +++++...+++..++++++++.+|+-+-.+ ...|..+|.... .|-+..|-=.+|- .|+.||.-=.=| T Consensus 1 ~~~~r~~~~~~~~~v~~~~~WA~~~~ld~va~A~G~V~~~g~~k~vQhlegGi~~~I~V~EG~-----~V~~Gq~L~~Ld 75 (434) T TIGR01843 1 SRFARLITWLIALLVVIFLLWAAFAPLDSVATAQGKVVPSGNVKVVQHLEGGIVREILVREGD-----RVKAGQVLVELD 75 (434) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHCEEEEEEEEEEECCCEEEEECCCCCEEEEEEECCCC-----EECCCCCCEEEC T ss_conf 932798899999999999998887610258998669997074779966865344135522586-----420377013541 Q ss_pred EE-EEECHHHHHHHHHHHHHHHHHHHHHHHCCCCC-CCCCCCHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 89-99964899888999874343323321001233-3322211455542235-688887765546788654333655420 Q gi|254780386|r 83 PL-LTFEDIETTDLIDLKKVSIKNLRCHIDTEKSA-LDFLNNKYDVLEKAFE-SREEQLLDFLGKYHSTCSKVFNYGISS 159 (440) Q Consensus 83 ~L-~~ld~~~~~~~l~~~~~~l~~l~~~l~~~~~~-l~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (440) .+ ++=|-.....+...++.....+++.+...... ...-...........+ .....+..........+...+...+.. T Consensus 76 ~t~~~A~~~~~~~~~~~l~~~~~R~~Ae~~~~~~~~~~~~~~~~~~~~~p~~r~~~~~~~~~~~~~f~~~~~~l~~~~~~ 155 (434) T TIGR01843 76 ATSVEADAAELESQVLALEAEVARLRAEINSEEASVIEFPDILLSAIDDPFERRAVAELVKGQQSLFESRKSALRAQLES 155 (434) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHCCCCHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 35789999999999999999999999998323257544675776201141203712899999999999999988899999 Q ss_pred CCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCC Q ss_conf 01122211221002232101122322222210233332221012222221110000111002466653101223322222 Q gi|254780386|r 160 VYRSLKMKLTRMRSIYLNRLDLENKSSLLRALLSSHQKDLTMMKNLFEKNAVSQNDLNQQERIVYKSTLELRENIHEQKN 239 (440) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~l~~~~~~~~~el~~~~~l~~~~~~~~~~l~~~~~~~~~~~~el~~~~~~~~~ 239 (440) +..+++....++..+......++.+...+..++.....-.+.----.-.....+.+..+.+.++.....++.....+... T Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~L~~~g~~~~~~~L~~e~~~~~~~~~~~~~~~~~~~~~~~~~~ 235 (434) T TIGR01843 156 ILAQIKQLEAELEGLQAQLQALRQQLELISEELEALRKLKEKGLVSRLELLELERERAELQGELARLEAELEVLKAQIDE 235 (434) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999999998789999999999998630663343216789999999999999999879999999999999 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC---------CCCCCCCCCCCCCEEEEECCEEECCCC Q ss_conf 22222210001223332112332221222222221222332212---------222212345554116873121201222 Q gi|254780386|r 240 MLRHINELYDEANLEFANYLKEISRDLEQNQRTFADEMSKLTIL---------EKTREQKTILSPIAGTIVYNQSFSSSN 310 (440) Q Consensus 240 ~~~~l~~~~~~~~~~~~~~~~e~~~~l~e~~~~l~~~~~~l~~l---------~~~l~~~~I~AP~~G~V~~l~~~~~G~ 310 (440) ...++.........+..+...+.+..|.+.+.++.....++... =..+.-.+ +.|+| .+|+ T Consensus 236 ~~~~~~~~~~~~~~e~~~~l~~A~~~l~~~~~~l~~a~~~~~~~~irsPvdG~v~~l~~~T----~GgVv------~pG~ 305 (434) T TIGR01843 236 LQLKLQQLEQTFREEVLEELTEAQARLAELREELSKARDRLQRLIIRSPVDGTVQSLKVHT----VGGVV------QPGE 305 (434) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEEECCCCEEEEEEEEEE----ECCEE------CCCC T ss_conf 9999988999999999999999999999999999999987532114425451999999984----05443------2786 Q ss_pred ----CCCCCCCCC---CCCCCC-----CCEEEEEE---ECHHHHHHCCCCCCEEEEEEECCCCC-----CCEEEEEEEEE Q ss_conf ----124445542---103355-----65047753---26223110158983179998128875-----44399999997 Q gi|254780386|r 311 ----YAQQSQPLM---KIVPHS-----KLTYIRAK---VTPKQIQHVKKGYTATVRFPHYADIR-----EKKFKAIIEKI 370 (440) Q Consensus 311 ----~v~~G~~l~---~Ivp~~-----~~l~v~~~---v~~~di~~i~~Gq~v~i~~~a~~~~~-----~~~~~G~V~~I 370 (440) .|+..+++. .|.|.+ -...+++. +|-.-.|.|+ |--+.|..|++-+.. ..+|.++| +| T Consensus 306 ~lm~IVP~~~~l~iEa~~~~~Dig~v~~Gq~a~ik~~AF~~~ryG~~~-G~V~~iS~D~~~DE~~~~~~~~yY~~ri-~~ 383 (434) T TIGR01843 306 TLMEIVPEDDPLEIEAKLSNKDIGFVHVGQPAEIKFSAFPYTRYGLLN-GKVKSISPDTVEDERQGRQQGLYYRVRI-SI 383 (434) T ss_pred CEEEEECCCCCEEEEEEECCCCCCEEECCCEEEEEEECCCCCCCCCEE-EEEEEECCCCHHHHHCCCCCCCEEEEEE-EE T ss_conf 178983399965899997354653455587169999636860335189-9999862751144424787676667999-97 Q ss_pred ECCCC Q ss_conf 13344 Q gi|254780386|r 371 DPIIS 375 (440) Q Consensus 371 s~~~~ 375 (440) ..... T Consensus 384 ~~~~~ 388 (434) T TIGR01843 384 DENTL 388 (434) T ss_pred CCCEE T ss_conf 32001 No 21 >PRK07051 hypothetical protein; Validated Probab=97.26 E-value=0.00069 Score=44.25 Aligned_cols=43 Identities=21% Similarity=0.428 Sum_probs=37.0 Q ss_pred EEEEEECCCEEEEECCCCCEEEEEECCCCCEEECCCEEEEECH Q ss_conf 7999953873898468886899998578989336978999964 Q gi|254780386|r 47 SGEILNEDNVVEIKSPFSGIIKKFQVENGSQILQGTPLLTFED 89 (440) Q Consensus 47 ~G~i~~~~~~~~I~s~~~G~V~~i~VkeGd~VkkGq~L~~ld~ 89 (440) -+.|....-...|.|+.+|+|.+|+|++||.|..||+|+++++ T Consensus 38 v~iiEaMKm~~~V~A~~~G~V~~ilv~~G~~V~~Gq~L~~Iee 80 (80) T PRK07051 38 VGLVEVMKQFSEVEAEVAGRVVRFLVDDGEPVDAGQVLMRIEE 80 (80) T ss_pred EEEEEECCCCCEEECCCCCEEEEEEECCCCEECCCCEEEEEEC T ss_conf 9999700277467269982999999569898679997899829 No 22 >PRK08225 acetyl-CoA carboxylase biotin carboxyl carrier protein subunit; Validated Probab=97.09 E-value=0.0013 Score=42.43 Aligned_cols=32 Identities=31% Similarity=0.516 Sum_probs=22.0 Q ss_pred EEEECCCCCEEEEEECCCCCEEECCCEEEEEC Q ss_conf 89846888689999857898933697899996 Q gi|254780386|r 57 VEIKSPFSGIIKKFQVENGSQILQGTPLLTFE 88 (440) Q Consensus 57 ~~I~s~~~G~V~~i~VkeGd~VkkGq~L~~ld 88 (440) ..|.||.+|+|.+++|++||.|..||+|+.++ T Consensus 39 ~~i~A~~~G~V~~i~v~~Gd~V~~G~~L~~ie 70 (70) T PRK08225 39 IPIVAEEAGTVKKINVQEGDFVNEGDVLLEIE 70 (70) T ss_pred CEEECCCCEEEEEEEECCCCEECCCCEEEEEC T ss_conf 76847999099999958989989999999979 No 23 >PRK05641 putative acetyl-CoA carboxylase biotin carboxyl carrier protein subunit; Validated Probab=97.00 E-value=0.0016 Score=41.90 Aligned_cols=32 Identities=16% Similarity=0.341 Sum_probs=27.2 Q ss_pred CCCCCCCCEEEEECCEEECCCCCCCCCCCCCCC Q ss_conf 234555411687312120122212444554210 Q gi|254780386|r 289 QKTILSPIAGTIVYNQSFSSSNYAQQSQPLMKI 321 (440) Q Consensus 289 ~~~I~AP~~G~V~~l~~~~~G~~v~~G~~l~~I 321 (440) ...|+||+.|.|+.+.+ ..|+.|..|++|+.| T Consensus 79 ~~~v~apmpG~v~~v~v-~~Gd~V~~g~~l~vl 110 (148) T PRK05641 79 ENVVTAPMPGKILKILV-KEGDQVKTGQGLLIL 110 (148) T ss_pred CCEEECCCCEEEEEEEE-CCCCEECCCCEEEEE T ss_conf 99797699908999996-899998799999999 No 24 >PRK12999 pyruvate carboxylase; Reviewed Probab=96.99 E-value=0.0013 Score=42.32 Aligned_cols=32 Identities=22% Similarity=0.401 Sum_probs=27.6 Q ss_pred CCCCCCCCEEEEECCEEECCCCCCCCCCCCCCC Q ss_conf 234555411687312120122212444554210 Q gi|254780386|r 289 QKTILSPIAGTIVYNQSFSSSNYAQQSQPLMKI 321 (440) Q Consensus 289 ~~~I~AP~~G~V~~l~~~~~G~~v~~G~~l~~I 321 (440) -..|.||+.|.|+.+.+ .+|+.|..|++|+.| T Consensus 1077 p~~VgAPmpG~V~kV~V-k~GD~VkkGd~Llvl 1108 (1147) T PRK12999 1077 PGHIGAPMPGSVVTVAV-KEGDEVKAGDPLLVI 1108 (1147) T ss_pred CCCCCCCCCCEEEEEEE-CCCCEECCCCEEEEE T ss_conf 88136999825899996-899987899989998 No 25 >PRK09282 pyruvate carboxylase subunit B; Validated Probab=96.98 E-value=0.0015 Score=41.92 Aligned_cols=33 Identities=18% Similarity=0.326 Sum_probs=28.2 Q ss_pred CCCCCCCCCEEEEECCEEECCCCCCCCCCCCCCC Q ss_conf 1234555411687312120122212444554210 Q gi|254780386|r 288 EQKTILSPIAGTIVYNQSFSSSNYAQQSQPLMKI 321 (440) Q Consensus 288 ~~~~I~AP~~G~V~~l~~~~~G~~v~~G~~l~~I 321 (440) .-..|.||+.|.|..+.+ ++|+.|..|++|+.| T Consensus 510 ~~~~V~APmpG~V~~V~V-k~Gd~V~~Gd~L~vl 542 (580) T PRK09282 510 AGGPVTAPMAGNIVKVLV-KEGDKVKEGDVLLIL 542 (580) T ss_pred CCCEECCCCCCEEEEEEE-CCCCEECCCCEEEEE T ss_conf 998532799826999997-899987899989998 No 26 >PRK08225 acetyl-CoA carboxylase biotin carboxyl carrier protein subunit; Validated Probab=96.96 E-value=0.0013 Score=42.44 Aligned_cols=30 Identities=10% Similarity=0.220 Sum_probs=18.3 Q ss_pred CCCCCCEEEEECCEEECCCCCCCCCCCCCCC Q ss_conf 4555411687312120122212444554210 Q gi|254780386|r 291 TILSPIAGTIVYNQSFSSSNYAQQSQPLMKI 321 (440) Q Consensus 291 ~I~AP~~G~V~~l~~~~~G~~v~~G~~l~~I 321 (440) +|+||++|.|+.+.+ ..|+.|.+|++|+.| T Consensus 3 ~V~Apm~G~V~~v~V-~~Gd~V~~G~~l~vl 32 (70) T PRK08225 3 KVYASMAGNVWKIVV-KVGDTVEEGQDVVIL 32 (70) T ss_pred EEECCCCEEEEEEEE-CCCCEECCCCEEEEE T ss_conf 785789949999992-898998999999999 No 27 >pfam06898 YqfD Putative stage IV sporulation protein YqfD. This family consists of several putative bacterial stage IV sporulation (SpoIV) proteins. YqfD of Bacillus subtilis is known to be essential for efficient sporulation although its exact function is unknown. Probab=96.93 E-value=0.0071 Score=37.49 Aligned_cols=27 Identities=30% Similarity=0.465 Sum_probs=13.1 Q ss_pred EECCCCCEEEEEEC-------CCCCEEECCCEEE Q ss_conf 84688868999985-------7898933697899 Q gi|254780386|r 59 IKSPFSGIIKKFQV-------ENGSQILQGTPLL 85 (440) Q Consensus 59 I~s~~~G~V~~i~V-------keGd~VkkGq~L~ 85 (440) |-|.-+|+|.+++| ++||.|++||+|+ T Consensus 192 iVA~k~GvI~~i~v~~G~p~Vk~GD~VkkGqiLV 225 (383) T pfam06898 192 IVAKKDGIITRMYVTKGTAVVKVGDVVKKGDILV 225 (383) T ss_pred EEECCCCEEEEEEECCCEEEECCCCEECCCCEEE T ss_conf 5899997999999647717765899877898999 No 28 >cd06850 biotinyl_domain The biotinyl-domain or biotin carboxyl carrier protein (BCCP) domain is present in all biotin-dependent enzymes, such as acetyl-CoA carboxylase, pyruvate carboxylase, propionyl-CoA carboxylase, methylcrotonyl-CoA carboxylase, geranyl-CoA carboxylase, oxaloacetate decarboxylase, methylmalonyl-CoA decarboxylase, transcarboxylase and urea amidolyase. This domain functions in transferring CO2 from one subsite to another, allowing carboxylation, decarboxylation, or transcarboxylation. During this process, biotin is covalently attached to a specific lysine. Probab=96.88 E-value=0.0013 Score=42.34 Aligned_cols=31 Identities=29% Similarity=0.589 Sum_probs=13.1 Q ss_pred EEEEECCCCCEEEEEECCCCCEEECCCEEEE Q ss_conf 3898468886899998578989336978999 Q gi|254780386|r 56 VVEIKSPFSGIIKKFQVENGSQILQGTPLLT 86 (440) Q Consensus 56 ~~~I~s~~~G~V~~i~VkeGd~VkkGq~L~~ 86 (440) ...|.||.+|+|.++++++||.|..||+|+. T Consensus 36 ~~~i~a~~~G~v~~i~~~~G~~V~~G~~L~~ 66 (67) T cd06850 36 ENEVTAPVAGVVKEILVKEGDQVEAGQLLVV 66 (67) T ss_pred EEEEEECCCEEEEEEEECCCCEECCCCEEEE T ss_conf 1999968999999999789899899999997 No 29 >PRK06549 acetyl-CoA carboxylase biotin carboxyl carrier protein subunit; Validated Probab=96.88 E-value=0.0015 Score=41.99 Aligned_cols=31 Identities=29% Similarity=0.484 Sum_probs=23.5 Q ss_pred CCCCCCCEEEEECCEEECCCCCCCCCCCCCCC Q ss_conf 34555411687312120122212444554210 Q gi|254780386|r 290 KTILSPIAGTIVYNQSFSSSNYAQQSQPLMKI 321 (440) Q Consensus 290 ~~I~AP~~G~V~~l~~~~~G~~v~~G~~l~~I 321 (440) ..|+||+.|+|+.+.+ ..|+.|..|++|+-| T Consensus 64 ~~v~aPm~G~v~~i~v-~~Gd~V~~g~~l~vl 94 (132) T PRK06549 64 DAIPSPMPGTILKVLV-AVGDQVTENQPLLIL 94 (132) T ss_pred CEECCCCCEEEEEEEE-CCCCEECCCCEEEEE T ss_conf 9980899979999993-899998799989999 No 30 >PRK05889 putative acetyl-CoA carboxylase biotin carboxyl carrier protein subunit; Provisional Probab=96.86 E-value=0.0018 Score=41.52 Aligned_cols=31 Identities=19% Similarity=0.369 Sum_probs=19.0 Q ss_pred EEEECCCCCEEEEEECCCCCEEECCCEEEEE Q ss_conf 8984688868999985789893369789999 Q gi|254780386|r 57 VEIKSPFSGIIKKFQVENGSQILQGTPLLTF 87 (440) Q Consensus 57 ~~I~s~~~G~V~~i~VkeGd~VkkGq~L~~l 87 (440) ..|.||.+|+|.+++|++||.|..||+|+.+ T Consensus 40 ~~v~A~~~G~V~~i~v~~Gd~V~~Gd~L~~I 70 (71) T PRK05889 40 IPVLAEVAGTVSKVSVSVGDVIQAGDLIAVI 70 (71) T ss_pred CEEECCCCCEEEEEEECCCCEECCCCEEEEE T ss_conf 7784799909979997899998899999996 No 31 >COG0511 AccB Biotin carboxyl carrier protein [Lipid metabolism] Probab=96.86 E-value=0.0022 Score=40.96 Aligned_cols=33 Identities=21% Similarity=0.271 Sum_probs=25.9 Q ss_pred CCCCCCCCCEEEEECCEEECCCCCCCCCCCCCCC Q ss_conf 1234555411687312120122212444554210 Q gi|254780386|r 288 EQKTILSPIAGTIVYNQSFSSSNYAQQSQPLMKI 321 (440) Q Consensus 288 ~~~~I~AP~~G~V~~l~~~~~G~~v~~G~~l~~I 321 (440) .-..|+||+.|+++..-+ .+|+.|..|++|+-| T Consensus 69 ~~~~V~SPm~Gtv~~~~V-~vGd~V~~Gq~l~Ii 101 (140) T COG0511 69 GGTQVTSPMVGTVYKPFV-EVGDTVKAGQTLAII 101 (140) T ss_pred CCCEEECCCCEEEEEEEE-CCCCEECCCCEEEEE T ss_conf 786674787618999876-579997589999999 No 32 >cd06663 Biotinyl_lipoyl_domains Biotinyl_lipoyl_domains are present in biotin-dependent carboxylases/decarboxylases, the dihydrolipoyl acyltransferase component (E2) of 2-oxo acid dehydrogenases, and the H-protein of the glycine cleavage system (GCS). These domains transport CO2, acyl, or methylamine, respectively, between components of the complex/protein via a biotinyl or lipoyl group, which is covalently attached to a highly conserved lysine residue. Probab=96.85 E-value=0.0027 Score=40.35 Aligned_cols=40 Identities=30% Similarity=0.520 Sum_probs=33.7 Q ss_pred EEEEECCCEEEEECCCCCEEEEEECCCCCEEECCCEEEEE Q ss_conf 9999538738984688868999985789893369789999 Q gi|254780386|r 48 GEILNEDNVVEIKSPFSGIIKKFQVENGSQILQGTPLLTF 87 (440) Q Consensus 48 G~i~~~~~~~~I~s~~~G~V~~i~VkeGd~VkkGq~L~~l 87 (440) ..+....-...|.||.+|+|.++++++||.|..|++|+.+ T Consensus 34 ~~iEa~K~~~ei~ap~~G~v~~i~v~~G~~V~~G~~l~~I 73 (73) T cd06663 34 AEIEAMKATSDVEAPKSGTVKKVLVKEGTKVEGDTPLVKI 73 (73) T ss_pred EEEEEEEEEEEEECCCCEEEEEEEECCCCEECCCCEEEEC T ss_conf 9999500569986699989999995898997899999979 No 33 >PRK09824 beta-glucoside-specific PTS system components IIABC; Provisional Probab=96.84 E-value=0.015 Score=35.26 Aligned_cols=15 Identities=40% Similarity=0.445 Sum_probs=7.8 Q ss_pred CCCCCCEEEEECCEE Q ss_conf 455541168731212 Q gi|254780386|r 291 TILSPIAGTIVYNQS 305 (440) Q Consensus 291 ~I~AP~~G~V~~l~~ 305 (440) .|.||++|.|..+.. T Consensus 478 ~i~sp~~G~~~~l~~ 492 (625) T PRK09824 478 GICSPMTGEIVPLIH 492 (625) T ss_pred EEECCCCCEEEECCC T ss_conf 595557715877131 No 34 >PRK05641 putative acetyl-CoA carboxylase biotin carboxyl carrier protein subunit; Validated Probab=96.83 E-value=0.0019 Score=41.29 Aligned_cols=32 Identities=34% Similarity=0.574 Sum_probs=25.9 Q ss_pred EEECCCCCEEEEEECCCCCEEECCCEEEEECH Q ss_conf 98468886899998578989336978999964 Q gi|254780386|r 58 EIKSPFSGIIKKFQVENGSQILQGTPLLTFED 89 (440) Q Consensus 58 ~I~s~~~G~V~~i~VkeGd~VkkGq~L~~ld~ 89 (440) .|.||.+|.|.+++|++||.|++||+|+.|+. T Consensus 81 ~v~apmpG~v~~v~v~~Gd~V~~g~~l~vlEa 112 (148) T PRK05641 81 VVTAPMPGKILKILVKEGDQVKTGQGLLILEA 112 (148) T ss_pred EEECCCCEEEEEEEECCCCEECCCCEEEEEEH T ss_conf 79769990899999689999879999999994 No 35 >COG0511 AccB Biotin carboxyl carrier protein [Lipid metabolism] Probab=96.83 E-value=0.0018 Score=41.47 Aligned_cols=32 Identities=31% Similarity=0.473 Sum_probs=21.2 Q ss_pred EEEECCCCCEEEEEECCCCCEEECCCEEEEEC Q ss_conf 89846888689999857898933697899996 Q gi|254780386|r 57 VEIKSPFSGIIKKFQVENGSQILQGTPLLTFE 88 (440) Q Consensus 57 ~~I~s~~~G~V~~i~VkeGd~VkkGq~L~~ld 88 (440) ..|.||..|++-+++|++||.|++||.|+.++ T Consensus 71 ~~V~SPm~Gtv~~~~V~vGd~V~~Gq~l~IiE 102 (140) T COG0511 71 TQVTSPMVGTVYKPFVEVGDTVKAGQTLAIIE 102 (140) T ss_pred CEEECCCCEEEEEEEECCCCEECCCCEEEEEE T ss_conf 66747876189998765799975899999998 No 36 >COG4942 Membrane-bound metallopeptidase [Cell division and chromosome partitioning] Probab=96.83 E-value=3.8e-05 Score=52.70 Aligned_cols=12 Identities=25% Similarity=0.556 Sum_probs=7.0 Q ss_pred CCCCCCEEEEEC Q ss_conf 455541168731 Q gi|254780386|r 291 TILSPIAGTIVY 302 (440) Q Consensus 291 ~I~AP~~G~V~~ 302 (440) .+.-|++|.|.. T Consensus 298 ~l~~PV~G~il~ 309 (420) T COG4942 298 QLAWPVTGRILR 309 (420) T ss_pred CCCCCCCCHHHH T ss_conf 768887870787 No 37 >PRK05889 putative acetyl-CoA carboxylase biotin carboxyl carrier protein subunit; Provisional Probab=96.80 E-value=0.0015 Score=41.99 Aligned_cols=30 Identities=7% Similarity=0.165 Sum_probs=19.0 Q ss_pred CCCCCCEEEEECCEEECCCCCCCCCCCCCCC Q ss_conf 4555411687312120122212444554210 Q gi|254780386|r 291 TILSPIAGTIVYNQSFSSSNYAQQSQPLMKI 321 (440) Q Consensus 291 ~I~AP~~G~V~~l~~~~~G~~v~~G~~l~~I 321 (440) .|+||+.|.|+.+.+ +.|+.|..|++|+.| T Consensus 4 ~V~Apm~G~V~~v~V-~~Gd~V~~Gd~l~vl 33 (71) T PRK05889 4 DVRAEIVASVLEVVV-NEGDQIGEGDTLVLL 33 (71) T ss_pred CCCCCCCEEEEEEEE-CCCCEECCCCEEEEE T ss_conf 314899848999997-999998899999999 No 38 >KOG0559 consensus Probab=96.79 E-value=0.002 Score=41.19 Aligned_cols=43 Identities=30% Similarity=0.501 Sum_probs=38.0 Q ss_pred EEEEECCCEEEEECCCCCEEEEEECCCCCEEECCCEEEEECHH Q ss_conf 9999538738984688868999985789893369789999648 Q gi|254780386|r 48 GEILNEDNVVEIKSPFSGIIKKFQVENGSQILQGTPLLTFEDI 90 (440) Q Consensus 48 G~i~~~~~~~~I~s~~~G~V~~i~VkeGd~VkkGq~L~~ld~~ 90 (440) ..|..+.....|-||.+|+|.+++|++||.|+.|+.|+.++.. T Consensus 107 a~IETDK~tv~V~sP~sGvi~e~lvk~gdtV~~g~~la~i~~g 149 (457) T KOG0559 107 AEIETDKTTVEVPSPASGVITELLVKDGDTVTPGQKLAKISPG 149 (457) T ss_pred EEEECCCEEEECCCCCCCEEEEEECCCCCCCCCCCEEEEECCC T ss_conf 1542153233136887642457740789750478645773588 No 39 >TIGR03007 pepcterm_ChnLen polysaccharide chain length determinant protein, PEP-CTERM locus subfamily; InterPro: IPR014345 Members of this entry belong to the family of polysaccharide chain length determinant proteins. They are found in species that encode the PEP-CTERM/exosortase system and are predicted to act in protein sorting in a number of Gram-negative bacteria, they are also found near the epsH homologue that is the putative exosortase gene.. Probab=96.69 E-value=0.00022 Score=47.57 Aligned_cols=60 Identities=8% Similarity=0.263 Sum_probs=45.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHEEEEEEEEEEEEEEEECCCEEEEECCCCCEEEEEECCCCCEE Q ss_conf 689999999999999998764067125899979999538738984688868999985789893 Q gi|254780386|r 16 SLKRMTLLSIIGIIGLITWSILLPIEISVSSSGEILNEDNVVEIKSPFSGIIKKFQVENGSQI 78 (440) Q Consensus 16 ~~~~~~~~~~~~~~~~~~w~~~~~~~~~v~~~G~i~~~~~~~~I~s~~~G~V~~i~VkeGd~V 78 (440) |.++++.++++.++++++|.+.+-++..-.++++|-.++++ .|.+...|. .|-....|.| T Consensus 11 w~~rw~~~~~Aw~V~~~gw~~v~~lP~~Y~a~a~VYVdTqs-vlrpLl~G~--a~~P~~~q~i 70 (510) T TIGR03007 11 WRRRWLFVAVAWVVMIVGWVVVYVLPDRYEASARVYVDTQS-VLRPLLKGL--AVTPNVDQKI 70 (510) T ss_pred HHHHHHHHHHHHHHHHHHHHHHEECCCCCEEEEEEEEECCC-HHHHHCCCC--CCCCCHHHHH T ss_conf 99999999999999998767121406740143688884320-165641667--3438678898 No 40 >cd06850 biotinyl_domain The biotinyl-domain or biotin carboxyl carrier protein (BCCP) domain is present in all biotin-dependent enzymes, such as acetyl-CoA carboxylase, pyruvate carboxylase, propionyl-CoA carboxylase, methylcrotonyl-CoA carboxylase, geranyl-CoA carboxylase, oxaloacetate decarboxylase, methylmalonyl-CoA decarboxylase, transcarboxylase and urea amidolyase. This domain functions in transferring CO2 from one subsite to another, allowing carboxylation, decarboxylation, or transcarboxylation. During this process, biotin is covalently attached to a specific lysine. Probab=96.60 E-value=0.0053 Score=38.37 Aligned_cols=29 Identities=24% Similarity=0.455 Sum_probs=18.9 Q ss_pred CCCCCEEEEECCEEECCCCCCCCCCCCCCC Q ss_conf 555411687312120122212444554210 Q gi|254780386|r 292 ILSPIAGTIVYNQSFSSSNYAQQSQPLMKI 321 (440) Q Consensus 292 I~AP~~G~V~~l~~~~~G~~v~~G~~l~~I 321 (440) |+||+.|+|+.+.+ ..|+.|..|++|+.| T Consensus 2 i~ap~~G~v~~~~V-~~Gd~V~~G~~l~~i 30 (67) T cd06850 2 VTAPMPGTVVKVLV-KEGDKVEAGQPLAVL 30 (67) T ss_pred EECCCCEEEEEEEE-CCCCEECCCCEEEEE T ss_conf 10879889999995-799999789999999 No 41 >PRK06302 acetyl-CoA carboxylase biotin carboxyl carrier protein subunit; Validated Probab=96.55 E-value=0.0066 Score=37.73 Aligned_cols=33 Identities=21% Similarity=0.296 Sum_probs=22.4 Q ss_pred CCCCCCCCEEEEECC------EEECCCCCCCCCCCCCCC Q ss_conf 234555411687312------120122212444554210 Q gi|254780386|r 289 QKTILSPIAGTIVYN------QSFSSSNYAQQSQPLMKI 321 (440) Q Consensus 289 ~~~I~AP~~G~V~~l------~~~~~G~~v~~G~~l~~I 321 (440) ...|+||.=|+...- .+.++|+.|..|+.|+-| T Consensus 79 ~~~I~SPmVGtFY~sp~P~~~pFV~vGd~V~~Gq~v~iI 117 (155) T PRK06302 79 GHVVTSPMVGTFYRAPSPDAPPFVEVGDTVKEGQTLCII 117 (155) T ss_pred CCEEECCCEEEEEECCCCCCCCCCCCCCEECCCCEEEEE T ss_conf 987867843679878998999742468672489889999 No 42 >pfam00364 Biotin_lipoyl Biotin-requiring enzyme. This family covers two Prosite entries, the conserved lysine residue binds biotin in one group and lipoic acid in the other. Note that the model may not recognize the Glycine cleavage system H proteins. Probab=96.51 E-value=0.0054 Score=38.28 Aligned_cols=40 Identities=28% Similarity=0.506 Sum_probs=34.1 Q ss_pred EEEEECCCEEEEECCCCCEEEEEECCCCCEEECCCEEEEE Q ss_conf 9999538738984688868999985789893369789999 Q gi|254780386|r 48 GEILNEDNVVEIKSPFSGIIKKFQVENGSQILQGTPLLTF 87 (440) Q Consensus 48 G~i~~~~~~~~I~s~~~G~V~~i~VkeGd~VkkGq~L~~l 87 (440) ..+....-...|.||.+|+|.++++++||.|..||+|+.+ T Consensus 34 ~~iEt~K~~~~v~a~~~G~i~~i~v~~G~~V~~G~~l~~I 73 (73) T pfam00364 34 CEVEAMKMEMEIPAPVAGVVKEILVKEGDTVEVGDPLAKI 73 (73) T ss_pred EEEEECCCCCEECCCCCEEEEEEEECCCCEECCCCEEEEC T ss_conf 9999665352250898889999997998998999999979 No 43 >TIGR02712 urea_carbox urea carboxylase; InterPro: IPR014084 Members of this family are ATP-dependent urea carboxylases, including characterised members from Oleomonas sagaranensis (alpha class Proteobacterium) and yeasts such as Saccharomyces cerevisiae (Baker's yeast). The allophanate hydrolase domain of the yeast enzyme is not included in this entry and is represented by an adjacent gene in Oleomonas sagaranensis. The fusion of urea carboxylase and allophanate hydrolase is designated urea amidolyase. The enzyme from O. sagaranensis was shown to be highly active on acetamide and formamide as well as urea.. Probab=96.48 E-value=0.0048 Score=38.62 Aligned_cols=33 Identities=21% Similarity=0.257 Sum_probs=28.4 Q ss_pred CCCCCCCCCEEEEECCEEECCCCCCCCCCCCCCC Q ss_conf 1234555411687312120122212444554210 Q gi|254780386|r 288 EQKTILSPIAGTIVYNQSFSSSNYAQQSQPLMKI 321 (440) Q Consensus 288 ~~~~I~AP~~G~V~~l~~~~~G~~v~~G~~l~~I 321 (440) ....|.|++.|.||++.+ ..|+.|.+|++|+.| T Consensus 1156 g~~~V~s~~~G~~WK~~V-~~G~~V~~G~~l~i~ 1188 (1226) T TIGR02712 1156 GAVQVESEVAGNVWKVLV-EVGDRVEAGQPLVII 1188 (1226) T ss_pred CCEEEEEECEEEEEEEEE-CCCCEEECCCEEEEE T ss_conf 714898502014656652-126632048738999 No 44 >PTZ00144 dihydrolipoamide succinyltransferase; Provisional Probab=96.43 E-value=0.0082 Score=37.10 Aligned_cols=43 Identities=30% Similarity=0.522 Sum_probs=35.0 Q ss_pred EEEEECCCEEEEECCCCCEEEEEECCCCCEEECCCEEEEECHH Q ss_conf 9999538738984688868999985789893369789999648 Q gi|254780386|r 48 GEILNEDNVVEIKSPFSGIIKKFQVENGSQILQGTPLLTFEDI 90 (440) Q Consensus 48 G~i~~~~~~~~I~s~~~G~V~~i~VkeGd~VkkGq~L~~ld~~ 90 (440) -+|..+.-...|.||.+|+|.+++|++||.|.-|++|+.++.. T Consensus 92 ~EVETDKA~~EVeSp~~G~l~kIlv~eGd~V~VG~~la~I~~~ 134 (430) T PTZ00144 92 AIVETDKVSVDINSKFSGVLSKIFAEVGDTVLVGKPLCEIDTS 134 (430) T ss_pred EEEECCCCCEEECCCCCEEEEEEEECCCCEEECCCEEEEEECC T ss_conf 9998588417965778979999863799989469869999668 No 45 >PRK06549 acetyl-CoA carboxylase biotin carboxyl carrier protein subunit; Validated Probab=96.42 E-value=0.0069 Score=37.59 Aligned_cols=36 Identities=36% Similarity=0.531 Sum_probs=29.1 Q ss_pred CCEEEEECCCCCEEEEEECCCCCEEECCCEEEEECH Q ss_conf 873898468886899998578989336978999964 Q gi|254780386|r 54 DNVVEIKSPFSGIIKKFQVENGSQILQGTPLLTFED 89 (440) Q Consensus 54 ~~~~~I~s~~~G~V~~i~VkeGd~VkkGq~L~~ld~ 89 (440) +....|.+|..|.|.+++|++||.|++||.|+.|+. T Consensus 61 ~~~~~v~aPm~G~v~~i~v~~Gd~V~~g~~l~vlEA 96 (132) T PRK06549 61 AGADAIPSPMPGTILKVLVAVGDQVTENQPLLILEA 96 (132) T ss_pred CCCCEECCCCCEEEEEEEECCCCEECCCCEEEEEEE T ss_conf 889998089997999999389999879998999980 No 46 >TIGR02876 spore_yqfD sporulation protein YqfD; InterPro: IPR010690 This family consists of several putative bacterial stage IV sporulation (SpoIV) proteins. YqfD of Bacillus subtilis (P54469 from SWISSPROT) is known to be essential for efficient sporulation although its exact function is unknown .. Probab=96.28 E-value=0.0035 Score=39.57 Aligned_cols=32 Identities=25% Similarity=0.376 Sum_probs=23.3 Q ss_pred EEEEECCCCCEEEEEEC-------CCCCEEECCCEEEEE Q ss_conf 38984688868999985-------789893369789999 Q gi|254780386|r 56 VVEIKSPFSGIIKKFQV-------ENGSQILQGTPLLTF 87 (440) Q Consensus 56 ~~~I~s~~~G~V~~i~V-------keGd~VkkGq~L~~l 87 (440) ...|-|.-+|+|.++|| +.||.|++||+|+.= T Consensus 191 p~niVAkk~G~i~~~~v~~G~~~Vk~GD~VkkGd~Li~G 229 (406) T TIGR02876 191 PRNIVAKKDGVIKRVYVTSGEAVVKKGDVVKKGDLLISG 229 (406) T ss_pred CCCEEEECCEEEEEEEEECCEEEECCCCEECCCCEEEEE T ss_conf 723266334088886551116485488876577178711 No 47 >cd06849 lipoyl_domain Lipoyl domain of the dihydrolipoyl acyltransferase component (E2) of 2-oxo acid dehydrogenases. 2-oxo acid dehydrogenase multienzyme complexes, like pyruvate dehydrogenase (PDH), 2-oxoglutarate dehydrogenase (OGDH) and branched-chain 2-oxo acid dehydrogenase (BCDH), contain at least three different enzymes, 2-oxo acid dehydrogenase (E1), dihydrolipoyl acyltransferase (E2) and dihydrolipoamide dehydrogenase (E3) and play a key role in redox regulation. E2, the central component of the complex, catalyzes the transfer of the acyl group of CoA from E1 to E3 via reductive acetylation of a lipoyl group covalently attached to a lysine residue. Probab=96.25 E-value=0.0092 Score=36.76 Aligned_cols=39 Identities=26% Similarity=0.535 Sum_probs=28.3 Q ss_pred EEEECCCEEEEECCCCCEEEEEECCCCCEEECCCEEEEE Q ss_conf 999538738984688868999985789893369789999 Q gi|254780386|r 49 EILNEDNVVEIKSPFSGIIKKFQVENGSQILQGTPLLTF 87 (440) Q Consensus 49 ~i~~~~~~~~I~s~~~G~V~~i~VkeGd~VkkGq~L~~l 87 (440) .+....-...|.|+.+|+|.++++++||.|+.|++|+.+ T Consensus 36 ~vEt~K~~~~i~a~~~G~v~~i~~~~G~~v~~G~~l~~i 74 (74) T cd06849 36 EVETDKATVEVEAPAAGVLAKILVEEGDTVPVGQVIAVI 74 (74) T ss_pred EEECCCEEEEEECCCCEEEEEEEECCCCEECCCCEEEEC T ss_conf 998585078410798999999987899998899999979 No 48 >PTZ00144 dihydrolipoamide succinyltransferase; Provisional Probab=96.18 E-value=0.0079 Score=37.19 Aligned_cols=40 Identities=15% Similarity=0.293 Sum_probs=30.6 Q ss_pred CCCEEEEECC------CCCEEEEEECCCCCEEECCCEEEEECHHHH Q ss_conf 3873898468------886899998578989336978999964899 Q gi|254780386|r 53 EDNVVEIKSP------FSGIIKKFQVENGSQILQGTPLLTFEDIET 92 (440) Q Consensus 53 ~~~~~~I~s~------~~G~V~~i~VkeGd~VkkGq~L~~ld~~~~ 92 (440) ....++|.-| ..|.|.+|+|++||.|++||+|++++...+ T Consensus 54 ~~~i~~ikMP~LGetm~EG~Iv~WlvkeGD~V~~gd~L~EVETDKA 99 (430) T PTZ00144 54 SKDIKTIKVPRLGDSISEGTLSEWKKKVGDYVKVDETIAIVETDKV 99 (430) T ss_pred CCCCEEEECCCCCCCCEEEEEEEEECCCCCEECCCCCEEEEECCCC T ss_conf 6786498798899982069997998189998689990899985884 No 49 >PRK05704 dihydrolipoamide succinyltransferase; Validated Probab=96.18 E-value=0.013 Score=35.81 Aligned_cols=43 Identities=23% Similarity=0.413 Sum_probs=37.6 Q ss_pred EEEEECCCEEEEECCCCCEEEEEECCCCCEEECCCEEEEECHH Q ss_conf 9999538738984688868999985789893369789999648 Q gi|254780386|r 48 GEILNEDNVVEIKSPFSGIIKKFQVENGSQILQGTPLLTFEDI 90 (440) Q Consensus 48 G~i~~~~~~~~I~s~~~G~V~~i~VkeGd~VkkGq~L~~ld~~ 90 (440) -+|..+.-...|.|+.+|+|.++++++||.|.-|++|+.++.. T Consensus 37 ~eVETDKa~~Ei~s~~~G~l~ki~~~eGd~v~VG~~ia~I~~~ 79 (406) T PRK05704 37 VEIETDKVVLEVPAPAAGVLSEILAEEGDTVTVGQVLGRIDEG 79 (406) T ss_pred EEEECCCCCEEECCCCCEEEEEEEECCCCEECCCCEEEEEECC T ss_conf 9998386207974888989999830898996689989998268 No 50 >TIGR01347 sucB 2-oxoglutarate dehydrogenase, E2 component, dihydrolipoamide succinyltransferase; InterPro: IPR006255 These sequences describe the TCA cycle 2-oxoglutarate system E2 component, dihydrolipoamide succinyltransferase. It is closely related to the pyruvate dehydrogenase E2 component, dihydrolipoamide acetyltransferase. Members include the mitochondrial and Gram-negative bacterial forms. Mycobacterial candidates are highly derived, differ in having and extra copy of the lipoyl-binding domain at the N-terminus and are generally excluded from this group of sequences as are all examples of dihydrolipoamide acetyltransferase. ; GO: 0004149 dihydrolipoyllysine-residue succinyltransferase activity, 0006099 tricarboxylic acid cycle, 0045252 oxoglutarate dehydrogenase complex. Probab=96.17 E-value=0.0046 Score=38.78 Aligned_cols=41 Identities=27% Similarity=0.532 Sum_probs=35.9 Q ss_pred EEEECCCEEEEECCCCCEEEEEECCCCCEEECCCEEEEECH Q ss_conf 99953873898468886899998578989336978999964 Q gi|254780386|r 49 EILNEDNVVEIKSPFSGIIKKFQVENGSQILQGTPLLTFED 89 (440) Q Consensus 49 ~i~~~~~~~~I~s~~~G~V~~i~VkeGd~VkkGq~L~~ld~ 89 (440) .|+++.=+..|.||.+|+|.+|..+||+.|.-||+|+.|.. T Consensus 36 eiETDKV~lEV~Sp~dGvl~~~~~~eG~TV~~g~~La~~~E 76 (435) T TIGR01347 36 EIETDKVVLEVPSPADGVLQEILEKEGDTVTSGQVLAVLEE 76 (435) T ss_pred EEECCCEEEEEECCCCCHHHHHHCCCCCEECCCEEEEEEEC T ss_conf 75207446788288530217430689888505507788823 No 51 >TIGR01235 pyruv_carbox pyruvate carboxylase; InterPro: IPR005930 In the postabsorptive state, blood glucose concentration is kept constant by a combination of glycogenolysis and gluconeogenesis. During prolonged fasting, when hepatic glycogen is exhausted, gluconeogenesis becomes the only source of plasma glucose. Gluconeogenesis (the production of new glucose molecules) occurs mainly in liver and, to a small extent, in kidneys. Pyruvate carboxylase (6.4.1.1 from EC), a member of the biotin-dependent enzyme family, catalyses the ATP-dependent carboxylation of pyruvate to oxaloacetate, thus playing a crucial role in gluconeogenesis.ATP + pyruvate + HCO3- = ADP + phosphate + oxaloacetate Most well characterised forms of active enzyme consist of four identical subunits arranged in a tetrahedron-like structure. Each subunit contains three functional domains: the biotin carboxylation domain, the transcarboxylation domain and the biotin carboxyl carrier domain.; GO: 0004736 pyruvate carboxylase activity, 0006094 gluconeogenesis, 0005737 cytoplasm. Probab=96.15 E-value=0.0052 Score=38.41 Aligned_cols=43 Identities=12% Similarity=0.298 Sum_probs=33.0 Q ss_pred CCCCCCCCCCEEEEECCEEECCCCCCCCCCCCCCCCCCCCCEEEEEEE Q ss_conf 212345554116873121201222124445542103355650477532 Q gi|254780386|r 287 REQKTILSPIAGTIVYNQSFSSSNYAQQSQPLMKIVPHSKLTYIRAKV 334 (440) Q Consensus 287 l~~~~I~AP~~G~V~~l~~~~~G~~v~~G~~l~~Ivp~~~~l~v~~~v 334 (440) -+...|-||+.|+|..+.+ ..|+-|..|++|+.| +.|.-|.-| T Consensus 1098 ~n~~hvGApMpG~v~~v~v-~~G~~v~kGd~l~~~----~AMKME~~v 1140 (1169) T TIGR01235 1098 GNPAHVGAPMPGVVIEVKV-EAGQKVNKGDPLVVL----EAMKMETVV 1140 (1169) T ss_pred CCHHHCCCCCCCCEEEEEE-ECCCEECCCCEEEEE----ECCCCCCEE T ss_conf 8743225689970799994-138730167848996----037632046 No 52 >TIGR02169 SMC_prok_A chromosome segregation protein SMC; InterPro: IPR011891 The SMC (structural maintenance of chromosomes) family of proteins, exist in virtually all organisms including both bacteria and archaea. The SMC proteins are essential for successful chromosome transmission during replication and segregation of the genome in all organisms and form three types of heterodimer (SMC1SMC3, SMC2SMC4, SMC5SMC6), which are core components of large multiprotein complexes. The best known complexes are cohesin, which is responsible for sister-chromatid cohesion, and condensin, which is required for full chromosome condensation in mitosis. SMCs are generally present as single proteins in bacteria, and as at least six distinct proteins in eukaryotes. The proteins range in size from approximately 110 to 170 kDa, and share a five-domain structure, with globular N- and C-terminal (IPR003395 from INTERPRO) domains separated by a long (circa 100 nm or 900 residues) coiled coil segment in the centre of which is a globular ''hinge'' domain, characterised by a set of four highly conserved glycine residues that are typical of flexible regions in a protein. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif (XXXXD, where X is any hydrophobic residue), and a LSGG motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases . All SMC proteins appear to form dimers, either forming homodimers with themselves, as in the case of prokaryotic SMC proteins, or heterodimers between different but related SMC proteins. The dimers are arranged in an antiparallel alignment. This orientation brings the N- and C-terminal globular domains (from either different or identical protamers) together, which unites an ATP binding site (Walker A motif) within the N-terminal domain with a Walker B motif (DA box) within the C-terminal domain, to form a potentially functional ATPase. Protein interaction and microscopy data suggest that SMC dimers form a ring-like structure which might embrace DNA molecules. Non-SMC subunits associate with the SMC amino- and carboxy-terminal domains. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. SMCs share not only sequence similarity but also structural similarity with ABC proteins. SMC proteins function together with other proteins in a range of chromosomal transactions, including chromosome condensation, sister-chromatid cohesion, recombination, DNA repair and epigenetic silencing of gene expression . This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by IPR011890 from INTERPRO. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent .. Probab=96.07 E-value=9.2e-05 Score=50.09 Aligned_cols=25 Identities=20% Similarity=0.251 Sum_probs=12.1 Q ss_pred HHCCCCCCEEEEEEECCCCCCCEEEEEEE Q ss_conf 10158983179998128875443999999 Q gi|254780386|r 340 QHVKKGYTATVRFPHYADIREKKFKAIIE 368 (440) Q Consensus 340 ~~i~~Gq~v~i~~~a~~~~~~~~~~G~V~ 368 (440) +.+.||=.+.+.++. |.+ +|.|=|+ T Consensus 1073 ~~LSP~G~g~L~Le~-PdD---PF~GGl~ 1097 (1202) T TIGR02169 1073 AELSPGGTGELILEN-PDD---PFAGGLE 1097 (1202) T ss_pred HHHCCCCCEEEECCC-CCC---CCCCCCE T ss_conf 853889824654358-887---4368717 No 53 >COG4770 Acetyl/propionyl-CoA carboxylase, alpha subunit [Lipid metabolism] Probab=96.07 E-value=0.0092 Score=36.75 Aligned_cols=33 Identities=18% Similarity=0.328 Sum_probs=28.1 Q ss_pred CCCCCCCCCEEEEECCEEECCCCCCCCCCCCCCC Q ss_conf 1234555411687312120122212444554210 Q gi|254780386|r 288 EQKTILSPIAGTIVYNQSFSSSNYAQQSQPLMKI 321 (440) Q Consensus 288 ~~~~I~AP~~G~V~~l~~~~~G~~v~~G~~l~~I 321 (440) ..-.+.||++|+|..+.+ +.|+.|..|++|+.+ T Consensus 574 ~~~~l~aPMpG~v~~v~V-~~G~~V~~G~~lvvl 606 (645) T COG4770 574 SSGELLAPMPGTVVSVAV-KEGQEVSAGDLLVVL 606 (645) T ss_pred CCCCEECCCCCEEEEEEE-CCCCEECCCCEEEEE T ss_conf 877165678962899993-689775479867886 No 54 >PRK11855 dihydrolipoamide acetyltransferase; Reviewed Probab=96.05 E-value=0.016 Score=35.21 Aligned_cols=42 Identities=31% Similarity=0.577 Sum_probs=27.7 Q ss_pred EEEECCCEEEEECCCCCEEEEEECCCCCEEECCCEEEEECHH Q ss_conf 999538738984688868999985789893369789999648 Q gi|254780386|r 49 EILNEDNVVEIKSPFSGIIKKFQVENGSQILQGTPLLTFEDI 90 (440) Q Consensus 49 ~i~~~~~~~~I~s~~~G~V~~i~VkeGd~VkkGq~L~~ld~~ 90 (440) .|..+.-...|-||.+|+|.+|+|.+||.|+-|++|+.++.. T Consensus 38 evETDKa~~Evps~~~G~v~~i~v~~Gd~v~vG~~i~~i~~~ 79 (549) T PRK11855 38 TVETDKATMEIPSPAAGVVKKIKVKVGDTVSVGAPLAVIEAA 79 (549) T ss_pred EEECCCCCEEECCCCCEEEEEEEECCCCEECCCCEEEEEECC T ss_conf 998177136843888979999961899884699859999568 No 55 >COG0508 AceF Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes [Energy production and conversion] Probab=96.03 E-value=0.017 Score=35.06 Aligned_cols=44 Identities=23% Similarity=0.430 Sum_probs=37.7 Q ss_pred EEEEECCCEEEEECCCCCEEEEEECCCCCEEECCCEEEEECHHH Q ss_conf 99995387389846888689999857898933697899996489 Q gi|254780386|r 48 GEILNEDNVVEIKSPFSGIIKKFQVENGSQILQGTPLLTFEDIE 91 (440) Q Consensus 48 G~i~~~~~~~~I~s~~~G~V~~i~VkeGd~VkkGq~L~~ld~~~ 91 (440) -+|..+.-+..|-||.+|+|.+|++++||.|.-|++|+.++... T Consensus 37 ~eVeTDKa~~EV~ap~~G~l~~i~~~~G~~V~Vg~~I~~i~~~~ 80 (404) T COG0508 37 VEVETDKATMEVPAPDAGVLAKILVEEGDTVPVGAVIARIEEEG 80 (404) T ss_pred EEEEECCEEEEECCCCCCEEEEEECCCCCEECCCCEEEEEECCC T ss_conf 99980653355147778599676205898875898589993377 No 56 >PRK10255 N-acetyl glucosamine specific PTS system components IIABC; Provisional Probab=96.00 E-value=0.022 Score=34.29 Aligned_cols=16 Identities=31% Similarity=0.573 Sum_probs=10.4 Q ss_pred CCCCCCCEEEEECCEE Q ss_conf 3455541168731212 Q gi|254780386|r 290 KTILSPIAGTIVYNQS 305 (440) Q Consensus 290 ~~I~AP~~G~V~~l~~ 305 (440) .++.||++|.|..+.. T Consensus 499 ~~~~aP~~G~vi~l~~ 514 (648) T PRK10255 499 AELVSPITGDVVALDQ 514 (648) T ss_pred EEEECCCCCEEEECCC T ss_conf 5897557852887010 No 57 >PRK11856 branched-chain alpha-keto acid dehydrogenase subunit E2; Reviewed Probab=95.99 E-value=0.019 Score=34.59 Aligned_cols=41 Identities=29% Similarity=0.590 Sum_probs=36.4 Q ss_pred EEEEECCCEEEEECCCCCEEEEEECCCCCEEECCCEEEEEC Q ss_conf 99995387389846888689999857898933697899996 Q gi|254780386|r 48 GEILNEDNVVEIKSPFSGIIKKFQVENGSQILQGTPLLTFE 88 (440) Q Consensus 48 G~i~~~~~~~~I~s~~~G~V~~i~VkeGd~VkkGq~L~~ld 88 (440) -.|..+.-...|.||.+|+|.++++++||.|.-|++|+.+. T Consensus 36 ~eVeTDKa~~ei~a~~~G~l~~i~~~~G~~v~VG~~ia~i~ 76 (324) T PRK11856 36 AEVETDKATVEVESPVAGTLRKILVEEGDVVPVGAPIAVIV 76 (324) T ss_pred EEEECCCEEEEEECCCCEEEEEEECCCCCEECCCCEEEEEC T ss_conf 99980766689806888399899608979957999899944 No 58 >TIGR01108 oadA oxaloacetate decarboxylase alpha subunit; InterPro: IPR005776 This family describes the bacterial oxaloacetate decarboxylase alpha subunit and its equivalents in archaea . The oxaloacetate decarboxylase Na+ pump is the paradigm of the family of Na+ transport decarboxylases that present in bacteria and archaea. It a multi subunit enzyme consisting of a peripheral alpha-subunit and integral membrane subunits beta and gamma. The energy released by the decarboxylation reaction of oxaloacetate is coupled to Na+ ion pumping across the membrane.; GO: 0008948 oxaloacetate decarboxylase activity, 0006814 sodium ion transport. Probab=95.98 E-value=0.0068 Score=37.61 Aligned_cols=33 Identities=24% Similarity=0.393 Sum_probs=28.2 Q ss_pred CCCCCCCCCEEEEECCEEECCCCCCCCCCCCCCC Q ss_conf 1234555411687312120122212444554210 Q gi|254780386|r 288 EQKTILSPIAGTIVYNQSFSSSNYAQQSQPLMKI 321 (440) Q Consensus 288 ~~~~I~AP~~G~V~~l~~~~~G~~v~~G~~l~~I 321 (440) .-..|.||+.|.|+.+.+ +.|+.|..|+.|+-| T Consensus 550 a~~~vTaP~aG~i~~~~V-~~G~~V~~G~vlL~~ 582 (616) T TIGR01108 550 AITDVTAPIAGSIVKITV-SEGDEVKEGEVLLVV 582 (616) T ss_pred CCCCCCCCCCCCEEEEEE-CCCCEECCCCEEEEE T ss_conf 898756677773699983-578743157758853 No 59 >COG1038 PycA Pyruvate carboxylase [Energy production and conversion] Probab=95.95 E-value=0.0075 Score=37.33 Aligned_cols=33 Identities=18% Similarity=0.284 Sum_probs=25.6 Q ss_pred CCCCCCCCCEEEEECCEEECCCCCCCCCCCCCCC Q ss_conf 1234555411687312120122212444554210 Q gi|254780386|r 288 EQKTILSPIAGTIVYNQSFSSSNYAQQSQPLMKI 321 (440) Q Consensus 288 ~~~~I~AP~~G~V~~l~~~~~G~~v~~G~~l~~I 321 (440) +-..|-||+.|+|..+.+ ..|+-|.+|++|+.| T Consensus 1078 Np~higApmpG~Vv~v~V-~~G~~Vk~Gd~l~~i 1110 (1149) T COG1038 1078 NPGHIGAPMPGVVVEVKV-KKGDKVKKGDVLAVI 1110 (1149) T ss_pred CCCCCCCCCCCCEEEEEE-CCCCCCCCCCEEEEE T ss_conf 811217888872699997-458730479865330 No 60 >pfam00529 HlyD HlyD family secretion protein. Probab=95.92 E-value=0.00084 Score=43.71 Aligned_cols=110 Identities=15% Similarity=0.053 Sum_probs=66.7 Q ss_pred EEECCEEECCCCCCCCCCCCCCCCCCCCCEEEEEEECHHHHHHCCCCCCEEEEEEECCCCCCCEEEEEEEEEECC--CCC Q ss_conf 873121201222124445542103355650477532622311015898317999812887544399999997133--444 Q gi|254780386|r 299 TIVYNQSFSSSNYAQQSQPLMKIVPHSKLTYIRAKVTPKQIQHVKKGYTATVRFPHYADIREKKFKAIIEKIDPI--ISQ 376 (440) Q Consensus 299 ~V~~l~~~~~G~~v~~G~~l~~Ivp~~~~l~v~~~v~~~di~~i~~Gq~v~i~~~a~~~~~~~~~~G~V~~Is~~--~~~ 376 (440) .+..... ..|.++.+|.+++.++ +....+|.+.+.+.++..+.+|+.+.++.++|+...+ .+.|.+..|..- +.. T Consensus 182 ~~~~~~~-~~~~~~~~~~~~~~~v-~~~~~~v~~~~~~~~~~~~~~n~~~~l~~~~~~~~~~-~~~~~~~~~~vp~~a~~ 258 (304) T pfam00529 182 EAQLRSV-QVGEEVGVGSPLMGVV-PFNDVEVDANLGETQLRLVRPGPRAPLTGGVYVLVVL-TEGGVVAAILMPTVAVF 258 (304) T ss_pred CCCCCEE-EEECCCCCCCCEEEEE-ECCCCCCCCCCEEEEEEEEECCCCCEEECCEEEEEEE-ECCCCCCCEECCCCCCC T ss_conf 4456327-8701247887278999-6242322455027999999639986687566646999-64845353005862334 Q ss_pred CC----------CCCCCCEEEEEEEEECHHHCCCCCCEECCCCEEE Q ss_conf 67----------7765511899999935011256785152377569 Q gi|254780386|r 377 QD----------NNLQTQEYYEVILKITDLAFSDNNIELRNGFPAE 412 (440) Q Consensus 377 ~~----------~~~~~~~~y~v~i~~~~~~~~~~~~~L~~Gm~~~ 412 (440) .. +........+|+|.++.. +..-...|.||+.+. T Consensus 259 ~~~~~~~~v~~~~~~~v~~~~pV~i~~~~~-~~~~~~gl~~G~~vv 303 (304) T pfam00529 259 RLLPVEAQVAVVNIDKVVQGVPVEIGLDPA-FNQSTTPLVPGEVVI 303 (304) T ss_pred CCCCCCCEEEEECCCCEEEEEEEEEEEECC-CCHHCCCCCCCCEEE T ss_conf 568887448876787789968999998278-333126989998897 No 61 >PRK09439 glucose-specific PTS system component; Provisional Probab=95.86 E-value=0.034 Score=33.00 Aligned_cols=18 Identities=28% Similarity=0.525 Sum_probs=12.7 Q ss_pred CCCCCCCCCEEEEECCEE Q ss_conf 123455541168731212 Q gi|254780386|r 288 EQKTILSPIAGTIVYNQS 305 (440) Q Consensus 288 ~~~~I~AP~~G~V~~l~~ 305 (440) ...+|.||++|.|..+.. T Consensus 19 ~~i~i~AP~~G~vi~L~~ 36 (169) T PRK09439 19 GTIEIIAPLSGEIVNIED 36 (169) T ss_pred CCEEEEECCCCEEEEHHH T ss_conf 848999647716988004 No 62 >PRK11854 aceF dihydrolipoamide acetyltransferase; Validated Probab=95.58 E-value=0.024 Score=33.94 Aligned_cols=42 Identities=24% Similarity=0.544 Sum_probs=34.9 Q ss_pred EEEECCCEEEEECCCCCEEEEEECCCCCEEECCCEEEEECHH Q ss_conf 999538738984688868999985789893369789999648 Q gi|254780386|r 49 EILNEDNVVEIKSPFSGIIKKFQVENGSQILQGTPLLTFEDI 90 (440) Q Consensus 49 ~i~~~~~~~~I~s~~~G~V~~i~VkeGd~VkkGq~L~~ld~~ 90 (440) +|..+.-...|-||.+|+|.+|.|++||.|+.|++|+.|+.. T Consensus 36 ~vetDKa~~evps~~~G~~~~i~v~~gd~v~~g~~l~~i~~~ 77 (630) T PRK11854 36 TVEGDKASMEVPSPQAGVVKEIKVKVGDKVDTGALIMIFESA 77 (630) T ss_pred EEECCCCCEECCCCCCCEEEEEEECCCCCCCCCCEEEEEECC T ss_conf 998366233666888838999985689812789868997057 No 63 >TIGR01348 PDHac_trf_long pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase; InterPro: IPR006256 This group of sequences are a subset of pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase specifically close by both phylogenetic and percent identity (UPGMA) trees. Members contain two or three copies of the lipoyl-binding domain. Escherichia coli AceF is included in this set while mitochondrial and some other bacterial forms are excluded. dihydrolipoyllysine-residue acetyltransferase E2 component of pyruvate dehydrogenase complex; GO: 0004742 dihydrolipoyllysine-residue acetyltransferase activity, 0006096 glycolysis, 0045254 pyruvate dehydrogenase complex. Probab=95.52 E-value=0.021 Score=34.39 Aligned_cols=43 Identities=23% Similarity=0.390 Sum_probs=36.2 Q ss_pred EEEECCCEEEEECCCCCEEEEEECCCCCEEECCCEEEEECHHH Q ss_conf 9995387389846888689999857898933697899996489 Q gi|254780386|r 49 EILNEDNVVEIKSPFSGIIKKFQVENGSQILQGTPLLTFEDIE 91 (440) Q Consensus 49 ~i~~~~~~~~I~s~~~G~V~~i~VkeGd~VkkGq~L~~ld~~~ 91 (440) +|+.+..+-.|=|+..|+|++|.||.||..+.|++++.|+... T Consensus 36 t~E~DKA~mEvPss~AG~ik~ikvkVGDt~~~G~~i~~l~~~~ 78 (655) T TIGR01348 36 TLESDKASMEVPSSKAGIIKEIKVKVGDTLKEGDVIALLEVSA 78 (655) T ss_pred EECCCEEEEEECCCCCCEEEEEEEEECCCCCCCCEEEEECCCC T ss_conf 8706712784068999737889754466547787899971665 No 64 >PRK09783 copper/silver efflux system membrane fusion protein CusB; Provisional Probab=95.49 E-value=0.026 Score=33.69 Aligned_cols=56 Identities=20% Similarity=0.341 Sum_probs=45.7 Q ss_pred EEEEEEEEEEEEEECCC-EEEEECCCCCEEEEEEC-CCCCEEECCCEEEEECHHHHHH Q ss_conf 71258999799995387-38984688868999985-7898933697899996489988 Q gi|254780386|r 39 PIEISVSSSGEILNEDN-VVEIKSPFSGIIKKFQV-ENGSQILQGTPLLTFEDIETTD 94 (440) Q Consensus 39 ~~~~~v~~~G~i~~~~~-~~~I~s~~~G~V~~i~V-keGd~VkkGq~L~~ld~~~~~~ 94 (440) .+..++.+.|+|..+.+ ...|.+.++|.|++++| ..||.|+|||+|+.+-..+.-. T Consensus 103 ~l~~~i~~~G~v~~dE~~~~~v~~r~~G~Ie~L~v~~~G~~V~kGq~L~~lYSPel~~ 160 (407) T PRK09783 103 PLTFAQSFPANVSYNEYQYAIVQARAAGFIDKVYPLTVGDKVQKGTPLLDLTIPDWVE 160 (407) T ss_pred CCCCEEEEEEEEEECCCCEEEEEECCCCEEEEEEECCCCCEECCCCEEEEEECHHHHH T ss_conf 3332189989998777732899710352688987425787873897889993888999 No 65 >TIGR01348 PDHac_trf_long pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase; InterPro: IPR006256 This group of sequences are a subset of pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase specifically close by both phylogenetic and percent identity (UPGMA) trees. Members contain two or three copies of the lipoyl-binding domain. Escherichia coli AceF is included in this set while mitochondrial and some other bacterial forms are excluded. dihydrolipoyllysine-residue acetyltransferase E2 component of pyruvate dehydrogenase complex; GO: 0004742 dihydrolipoyllysine-residue acetyltransferase activity, 0006096 glycolysis, 0045254 pyruvate dehydrogenase complex. Probab=95.45 E-value=0.028 Score=33.57 Aligned_cols=28 Identities=25% Similarity=0.380 Sum_probs=22.2 Q ss_pred CCCCEEEEEECCCCCEEECCCEEEEECH Q ss_conf 8886899998578989336978999964 Q gi|254780386|r 62 PFSGIIKKFQVENGSQILQGTPLLTFED 89 (440) Q Consensus 62 ~~~G~V~~i~VkeGd~VkkGq~L~~ld~ 89 (440) ...|.|.+|+|+.||.|++||.|++|+. T Consensus 12 ~~~g~v~evLV~~GD~V~~~Qslit~E~ 39 (655) T TIGR01348 12 IEEGEVIEVLVKVGDKVEAGQSLITLES 39 (655) T ss_pred CCCEEEEEEEECCCCEECCCCCEEEECC T ss_conf 7751389998688974326874688706 No 66 >cd06663 Biotinyl_lipoyl_domains Biotinyl_lipoyl_domains are present in biotin-dependent carboxylases/decarboxylases, the dihydrolipoyl acyltransferase component (E2) of 2-oxo acid dehydrogenases, and the H-protein of the glycine cleavage system (GCS). These domains transport CO2, acyl, or methylamine, respectively, between components of the complex/protein via a biotinyl or lipoyl group, which is covalently attached to a highly conserved lysine residue. Probab=95.41 E-value=0.017 Score=35.00 Aligned_cols=29 Identities=21% Similarity=0.296 Sum_probs=20.3 Q ss_pred CCCCCEEEEEECCCCCEEECCCEEEEECH Q ss_conf 68886899998578989336978999964 Q gi|254780386|r 61 SPFSGIIKKFQVENGSQILQGTPLLTFED 89 (440) Q Consensus 61 s~~~G~V~~i~VkeGd~VkkGq~L~~ld~ 89 (440) +...|.|.+++|++||.|++||+|+.++. T Consensus 10 ~~~eg~I~~~~v~~Gd~V~~Gd~l~~iEa 38 (73) T cd06663 10 HLGDGTVVKWLKKVGDKVKKGDVLAEIEA 38 (73) T ss_pred CCEEEEEEEEEECCCCEECCCCEEEEEEE T ss_conf 83079999999269999979999999995 No 67 >PRK05704 dihydrolipoamide succinyltransferase; Validated Probab=95.29 E-value=0.018 Score=34.88 Aligned_cols=31 Identities=13% Similarity=0.211 Sum_probs=26.2 Q ss_pred CCCCEEEEEECCCCCEEECCCEEEEECHHHH Q ss_conf 8886899998578989336978999964899 Q gi|254780386|r 62 PFSGIIKKFQVENGSQILQGTPLLTFEDIET 92 (440) Q Consensus 62 ~~~G~V~~i~VkeGd~VkkGq~L~~ld~~~~ 92 (440) ...|.|.+|+|++||.|++||+|++++...+ T Consensus 14 m~Eg~I~~Wlvk~GD~V~~gd~L~eVETDKa 44 (406) T PRK05704 14 VTEATIATWHKKPGDAVARDEVLVEIETDKV 44 (406) T ss_pred CEEEEEEEEECCCCCEECCCCEEEEEECCCC T ss_conf 3079998998189999789990899983862 No 68 >COG0508 AceF Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes [Energy production and conversion] Probab=95.14 E-value=0.019 Score=34.63 Aligned_cols=31 Identities=19% Similarity=0.302 Sum_probs=26.3 Q ss_pred CCCCEEEEEECCCCCEEECCCEEEEECHHHH Q ss_conf 8886899998578989336978999964899 Q gi|254780386|r 62 PFSGIIKKFQVENGSQILQGTPLLTFEDIET 92 (440) Q Consensus 62 ~~~G~V~~i~VkeGd~VkkGq~L~~ld~~~~ 92 (440) ...|.|.+|+|++||.|++||+|++++.... T Consensus 14 ~~EG~I~~W~~k~GD~V~~gd~L~eVeTDKa 44 (404) T COG0508 14 MTEGTIVEWLKKVGDKVKEGDVLVEVETDKA 44 (404) T ss_pred CEEEEEEEEEECCCCEECCCCCEEEEEECCE T ss_conf 1367885897269986369980699980653 No 69 >TIGR01995 PTS-II-ABC-beta PTS system, beta-glucoside-specific IIABC component; InterPro: IPR011297 The phosphoenolpyruvate-dependent sugar phosphotransferase system (PTS) , is a major carbohydrate transport system in bacteria. The PTS catalyses the phosphorylation of incoming sugar substrates and coupled with translocation across the cell membrane, makes the PTS a link between the uptake and metabolism of sugars. The general mechanism of the PTS is the following: a phosphoryl group from phosphoenolpyruvate (PEP) is transferred via a signal transduction pathway, to enzyme I (EI) which in turn transfers it to a phosphoryl carrier, the histidine protein (HPr). Phospho-HPr then transfers the phosphoryl group to a sugar-specific permease, a membrane-bound complex known as enzyme 2 (EII), which transports the sugar to the cell. EII consists of at least three structurally distinct domains IIA, IIB and IIC . These can either be fused together in a single polypeptide chain or exist as two or three interactive chains, formerly called enzymes II (EII) and III (EIII). The first domain (IIA or EIIA) carries the first permease-specific phosphorylation site, a histidine which is phosphorylated by phospho-HPr. The second domain (IIB or EIIB) is phosphorylated by phospho-IIA on a cysteinyl or histidyl residue, depending on the sugar transported. Finally, the phosphoryl group is transferred from the IIB domain to the sugar substrate concomitantly with the sugar uptake processed by the IIC domain. This third domain (IIC or EIIC) forms the translocation channel and the specific substrate-binding site. An additional transmembrane domain IID, homologous to IIC, can be found in some PTSs, e.g. for mannose , , , . This entry represents a family of PTS enzyme II proteins in which all three domains are found in the same polypeptide chain and which appear to have a broad specificity for beta-glucosides including salicin (beta-D-glucose-1-salicylate) and arbutin (hydroquinone-O-beta-D-glucopyranoside) . These are distinct from the closely related sucrose-specific and trehalose-specific PTS transporters.; GO: 0008982 protein-N(PI)-phosphohistidine-sugar phosphotransferase activity, 0009401 phosphoenolpyruvate-dependent sugar phosphotransferase system. Probab=95.12 E-value=0.029 Score=33.41 Aligned_cols=19 Identities=21% Similarity=0.450 Sum_probs=10.8 Q ss_pred CCCCCCCCCCEEEEECCEE Q ss_conf 2123455541168731212 Q gi|254780386|r 287 REQKTILSPIAGTIVYNQS 305 (440) Q Consensus 287 l~~~~I~AP~~G~V~~l~~ 305 (440) +...+|.||+.|.+..++. T Consensus 507 ~~~~~~~~P~~Ge~~~L~e 525 (660) T TIGR01995 507 LRKEELYAPVAGEVVPLNE 525 (660) T ss_pred HHHHHCCCCEEEEEEECCC T ss_conf 2133205760134675056 No 70 >TIGR00998 8a0101 efflux pump membrane protein; InterPro: IPR005694 emrA and emrB confer resistance to carbonylcyanide m-chlorophenylhydrazone, nalidixic acid, and a number of other toxic compounds. EmrB encodes a highly hydrophobic 56.2-kDa peptide, with 14 potential alpha-helices to span the inner membrane. EmrA encodes a putative 42.7-kDa peptide containing a single hydrophobic domain and a large C-terminal hydrophilic domain.; GO: 0015238 drug transporter activity, 0006855 multidrug transport, 0009276 1-2nm peptidoglycan-based cell wall. Probab=95.11 E-value=0.028 Score=33.54 Aligned_cols=119 Identities=13% Similarity=0.078 Sum_probs=64.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHH-H---------HHE--EEEEEEE-EEEEEEEECCCEEEEECCCCCEEEEEECCCCCEE- Q ss_conf 861689999999999999998-7---------640--6712589-9979999538738984688868999985789893- Q gi|254780386|r 13 HPASLKRMTLLSIIGIIGLIT-W---------SIL--LPIEISV-SSSGEILNEDNVVEIKSPFSGIIKKFQVENGSQI- 78 (440) Q Consensus 13 ~p~~~~~~~~~~~~~~~~~~~-w---------~~~--~~~~~~v-~~~G~i~~~~~~~~I~s~~~G~V~~i~VkeGd~V- 78 (440) -|...-.+++++.+.+.+|.+ . +|. =.++++. .++|.|+ .|-.-- .=+||.||.. T Consensus 4 ~~~l~vv~Li~~a~a~~~y~~~~~~~~~~T~dAYV~a~~~~v~sF~VsG~v~------~v~~~d-----~d~Vk~GD~Lv 72 (379) T TIGR00998 4 FLLLLVVLLIVVALAYAIYWFLVLRDYESTDDAYVKANQLQVSSFQVSGSVI------EVNVDD-----TDYVKQGDVLV 72 (379) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHCCCCCEEEEEEEEEEEEE------EEEECC-----CCCCEECCEEE T ss_conf 6799999999999999999998874011211462127730232465732489------997437-----88420311566 Q ss_pred ECCCEEEEECHHHHHHHHHHHHHHHHHHHHHHHCCCC----CCCCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 3697899996489988899987434332332100123----333222114555422356888877655 Q gi|254780386|r 79 LQGTPLLTFEDIETTDLIDLKKVSIKNLRCHIDTEKS----ALDFLNNKYDVLEKAFESREEQLLDFL 142 (440) Q Consensus 79 kkGq~L~~ld~~~~~~~l~~~~~~l~~l~~~l~~~~~----~l~~~~~~~~~~~~~~~~~~~~~~~~~ 142 (440) +=++.=+++.-.++...+.....+...+....+.++. .+....-..+.....+++.+..+.... T Consensus 73 ~LD~t~a~~~~~~Ae~~LAa~~rq~~~l~~~~~~l~~GLHisLsakkL~v~s~~~~~~qar~~l~qae 140 (379) T TIGR00998 73 RLDPTDAELALAKAEANLAALVRQTKQLEITVQQLQAGLHISLSAKKLKVESLKIKLEQAREKLKQAE 140 (379) T ss_pred EECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 76771689999999999999999888873116776356400024567889999999999999987678 No 71 >PRK10559 p-hydroxybenzoic acid efflux subunit AaeA; Provisional Probab=95.10 E-value=0.038 Score=32.67 Aligned_cols=80 Identities=15% Similarity=0.084 Sum_probs=35.5 Q ss_pred HHHHHHHHHHHHHH-----HHHEEEEE-----EEEEEEEEEEECCCEEEEECCCCCEEEEEECCCCCEEECCCE-EEEEC Q ss_conf 99999999999998-----76406712-----589997999953873898468886899998578989336978-99996 Q gi|254780386|r 20 MTLLSIIGIIGLIT-----WSILLPIE-----ISVSSSGEILNEDNVVEIKSPFSGIIKKFQVENGSQILQGTP-LLTFE 88 (440) Q Consensus 20 ~~~~~~~~~~~~~~-----w~~~~~~~-----~~v~~~G~i~~~~~~~~I~s~~~G~V~~i~VkeGd~VkkGq~-L~~ld 88 (440) +++.+++++..+.. |.--+.+. ++..++|.|. .|--.. .=.|+.||..-.=|. -++.. T Consensus 18 v~~~~~~~~~~w~~~~~~p~TddAyV~a~iv~Vsp~VsG~V~------eV~V~d-----nq~VkkGdvL~~ID~~~y~~a 86 (310) T PRK10559 18 VILAFIAIFNAWVYYTESPWTRDARFSADVVAIAPDVSGLIT------QVNVHD-----NQLVKKGQVLFTIDQPRYQKA 86 (310) T ss_pred HHHHHHHHHHHHHHHCCCCCCCCEEEEEEEEEEECCCCEEEE------EEECCC-----CCEECCCCEEEEECCHHHHHH T ss_conf 999999999999862688516773898579999366746999------999179-----498769988999876179999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH Q ss_conf 4899888999874343323321 Q gi|254780386|r 89 DIETTDLIDLKKVSIKNLRCHI 110 (440) Q Consensus 89 ~~~~~~~l~~~~~~l~~l~~~l 110 (440) -.++++.+...+..+...+.+. T Consensus 87 l~qA~A~la~~qa~~~~~~~e~ 108 (310) T PRK10559 87 LAEAEADVAYYQVLAQEKRQEA 108 (310) T ss_pred HHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999999999999999999 No 72 >COG0845 AcrA Membrane-fusion protein [Cell envelope biogenesis, outer membrane] Probab=95.10 E-value=0.029 Score=33.38 Aligned_cols=129 Identities=21% Similarity=0.243 Sum_probs=96.9 Q ss_pred CCCCCCCCCCCCCCCCCCEEEEECCEEECCCCCCCCCCCCCCCCCCC--CCEEEEEEECHHHHHHCCCCCCEEEEEEECC Q ss_conf 32212222212345554116873121201222124445542103355--6504775326223110158983179998128 Q gi|254780386|r 279 KLTILEKTREQKTILSPIAGTIVYNQSFSSSNYAQQSQPLMKIVPHS--KLTYIRAKVTPKQIQHVKKGYTATVRFPHYA 356 (440) Q Consensus 279 ~l~~l~~~l~~~~I~AP~~G~V~~l~~~~~G~~v~~G~~l~~Ivp~~--~~l~v~~~v~~~di~~i~~Gq~v~i~~~a~~ 356 (440) .+..++..+.++.++||++|++..... ..|.++.+++.+..+.... ..+.+...+++.+...++.|+.+.+.+++++ T Consensus 167 ~~~~~~~~~~~~~i~a~~~g~~~~~~~-~~g~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~ 245 (372) T COG0845 167 ALASAKLNLEYTRITAPISGVIGARLV-RVGQLVSAGQALATIADLDPAAALWVLAAVLERDLLAVKVGQKVTVTLAAGP 245 (372) T ss_pred HHHHHHHHHHHHHCCCCCCCCHHHCCC-CCCCEECCCCHHHHHHHHHHHHHEEEHHHHHHHHHHHHHCCCEEEEEECCCC T ss_conf 999999987554211762441000004-6442103564045676431312100126546777875115886999962777 Q ss_pred CCCCCEEEEEEEEEECCCCCCCCCCCCCEEEEEEEEECHHHCCCCCCEECCCCEEEEEEEECEEE Q ss_conf 87544399999997133444677765511899999935011256785152377569999922054 Q gi|254780386|r 357 DIREKKFKAIIEKIDPIISQQDNNLQTQEYYEVILKITDLAFSDNNIELRNGFPAEVLFTAKNTT 421 (440) Q Consensus 357 ~~~~~~~~G~V~~Is~~~~~~~~~~~~~~~y~v~i~~~~~~~~~~~~~L~~Gm~~~~~I~~~~~~ 421 (440) .. .+.|++..|++.+.....+ +.+.+.++.+. ...|+|||.+.+.|..+... T Consensus 246 ~~---~~~~~~~~i~~~~~~~~~~------~~~~~~~~~~~----~~~l~~g~~~~~~i~~~~~~ 297 (372) T COG0845 246 DQ---LFAGTVEFISPTVDATTRT------VTVRAEVPNPN----GDALKPGMFVEVRIEVGDGK 297 (372) T ss_pred CE---EECEEEEEECCCCCCCCCE------EEEEEEECCCC----CCCCCCCEEEEEEEECCCCC T ss_conf 50---4120578850564675415------99999984787----65313755999999537757 No 73 >PRK11854 aceF dihydrolipoamide acetyltransferase; Validated Probab=95.07 E-value=0.056 Score=31.52 Aligned_cols=41 Identities=24% Similarity=0.595 Sum_probs=31.1 Q ss_pred EEECCCEEEEECCCCCEEEEEECCCCCEEECCCEEEEECHH Q ss_conf 99538738984688868999985789893369789999648 Q gi|254780386|r 50 ILNEDNVVEIKSPFSGIIKKFQVENGSQILQGTPLLTFEDI 90 (440) Q Consensus 50 i~~~~~~~~I~s~~~G~V~~i~VkeGd~VkkGq~L~~ld~~ 90 (440) |........|-||.+|+|.+|+|++||.|..|++|+.++.. T Consensus 140 vetdKa~~evps~~~G~~~~i~~~~g~~v~~g~~i~~i~~~ 180 (630) T PRK11854 140 VEGDKASMEVPAPFAGTVKEIKVNVGDKVSTGSLIMVFEVA 180 (630) T ss_pred EECCCEEEEEECCCCCEEEEECCCCCCEECCCCEEEEECCC T ss_conf 75252468852688816887126899884479767885367 No 74 >COG3608 Predicted deacylase [General function prediction only] Probab=95.03 E-value=0.087 Score=30.25 Aligned_cols=34 Identities=15% Similarity=0.237 Sum_probs=27.9 Q ss_pred CCCCCCCCEEEEECCEEECCCCCCCCCCCCCCCCCC Q ss_conf 234555411687312120122212444554210335 Q gi|254780386|r 289 QKTILSPIAGTIVYNQSFSSSNYAQQSQPLMKIVPH 324 (440) Q Consensus 289 ~~~I~AP~~G~V~~l~~~~~G~~v~~G~~l~~Ivp~ 324 (440) -.-|+||.+|.|..+ ...|+-|.+|+.|+.|.+. T Consensus 256 ~~~i~Ap~~G~v~~~--v~lGd~VeaG~~la~i~~~ 289 (331) T COG3608 256 DEMIRAPAGGLVEFL--VDLGDKVEAGDVLATIHDP 289 (331) T ss_pred CCEEECCCCCEEEEE--ECCCCCCCCCCEEEEEECC T ss_conf 525656888548774--1678802178767997368 No 75 >PRK09510 tolA cell envelope integrity inner membrane protein TolA; Provisional Probab=94.98 E-value=0.17 Score=28.26 Aligned_cols=43 Identities=23% Similarity=0.294 Sum_probs=24.3 Q ss_pred HHHHHHHHHHHHHEEEEEEEEEEEEEEEECCCEEEEECCCCCEEEEEECCCCCEEEC Q ss_conf 999999999876406712589997999953873898468886899998578989336 Q gi|254780386|r 24 SIIGIIGLITWSILLPIEISVSSSGEILNEDNVVEIKSPFSGIIKKFQVENGSQILQ 80 (440) Q Consensus 24 ~~~~~~~~~~w~~~~~~~~~v~~~G~i~~~~~~~~I~s~~~G~V~~i~VkeGd~Vkk 80 (440) +-++||++++|++|...- ...+.| ..|.+|.-|.|-+|-.|.. T Consensus 21 LHiiLi~lLIwgS~~~~~-e~~ggG-------------~gG~~I~AVmVDpgaV~qq 63 (402) T PRK09510 21 LHIILFALLIWSSFDENV-EASGGG-------------GGGSVIDAVMVDPGAVVEQ 63 (402) T ss_pred HHHHHHHHHHHHHCCCCC-CCCCCC-------------CCCCEEEEEEECCHHHHHH T ss_conf 999999999980011132-146899-------------9877323788780899999 No 76 >PRK11855 dihydrolipoamide acetyltransferase; Reviewed Probab=94.96 E-value=0.055 Score=31.55 Aligned_cols=42 Identities=26% Similarity=0.526 Sum_probs=34.0 Q ss_pred EEEECCCEEEEECCCCCEEEEEECCCCCEEECCCEEEEECHH Q ss_conf 999538738984688868999985789893369789999648 Q gi|254780386|r 49 EILNEDNVVEIKSPFSGIIKKFQVENGSQILQGTPLLTFEDI 90 (440) Q Consensus 49 ~i~~~~~~~~I~s~~~G~V~~i~VkeGd~VkkGq~L~~ld~~ 90 (440) .|..+.-...|.|+.+|+|.+++|++||.|.-|++|+.++.. T Consensus 153 eVETDKa~~Ev~sp~~G~l~ki~v~eGd~v~VG~~ia~i~~~ 194 (549) T PRK11855 153 TVETDKATMEIPSPVAGVVKEIKVKVGDKVSVGSLLFKIEVA 194 (549) T ss_pred EEEECCCCEEEECCCCCEEEEEEECCCCCCCCCCEEEEEECC T ss_conf 886055312674365875447650589804579878997257 No 77 >PRK11637 hypothetical protein; Provisional Probab=94.95 E-value=0.0018 Score=41.55 Aligned_cols=12 Identities=8% Similarity=-0.158 Sum_probs=6.3 Q ss_pred EEEEEEEEEECC Q ss_conf 399999997133 Q gi|254780386|r 362 KFKAIIEKIDPI 373 (440) Q Consensus 362 ~~~G~V~~Is~~ 373 (440) .+.|+|.+.++. T Consensus 321 v~~G~Vv~a~~~ 332 (404) T PRK11637 321 IADGRVILADWL 332 (404) T ss_pred ECCEEEEEEEEC T ss_conf 017699991140 No 78 >cd06849 lipoyl_domain Lipoyl domain of the dihydrolipoyl acyltransferase component (E2) of 2-oxo acid dehydrogenases. 2-oxo acid dehydrogenase multienzyme complexes, like pyruvate dehydrogenase (PDH), 2-oxoglutarate dehydrogenase (OGDH) and branched-chain 2-oxo acid dehydrogenase (BCDH), contain at least three different enzymes, 2-oxo acid dehydrogenase (E1), dihydrolipoyl acyltransferase (E2) and dihydrolipoamide dehydrogenase (E3) and play a key role in redox regulation. E2, the central component of the complex, catalyzes the transfer of the acyl group of CoA from E1 to E3 via reductive acetylation of a lipoyl group covalently attached to a lysine residue. Probab=94.91 E-value=0.031 Score=33.27 Aligned_cols=29 Identities=28% Similarity=0.407 Sum_probs=19.2 Q ss_pred CCCCCEEEEEECCCCCEEECCCEEEEECH Q ss_conf 68886899998578989336978999964 Q gi|254780386|r 61 SPFSGIIKKFQVENGSQILQGTPLLTFED 89 (440) Q Consensus 61 s~~~G~V~~i~VkeGd~VkkGq~L~~ld~ 89 (440) +...|.|.+++|++||.|++||+|+.++. T Consensus 11 ~~~eg~i~~w~v~~Gd~V~~gd~l~~vEt 39 (74) T cd06849 11 SMTEGTIVEWLVKEGDSVEEGDVLAEVET 39 (74) T ss_pred CCCEEEEEEEEECCCCEECCCCEEEEEEC T ss_conf 85069999999279899978999999985 No 79 >COG1038 PycA Pyruvate carboxylase [Energy production and conversion] Probab=94.81 E-value=0.036 Score=32.78 Aligned_cols=16 Identities=19% Similarity=0.262 Sum_probs=6.9 Q ss_pred CCCCEEECCCEEEEEC Q ss_conf 7898933697899996 Q gi|254780386|r 73 ENGSQILQGTPLLTFE 88 (440) Q Consensus 73 keGd~VkkGq~L~~ld 88 (440) -.||....+-....|| T Consensus 679 YTGDildp~r~kY~L~ 694 (1149) T COG1038 679 YTGDILDPGRKKYTLD 694 (1149) T ss_pred ECCCCCCCCCCCCCHH T ss_conf 3056678776444489 No 80 >PRK11892 pyruvate dehydrogenase subunit beta; Provisional Probab=94.81 E-value=0.023 Score=34.10 Aligned_cols=29 Identities=17% Similarity=0.287 Sum_probs=24.6 Q ss_pred CCCCEEEEEECCCCCEEECCCEEEEECHH Q ss_conf 88868999985789893369789999648 Q gi|254780386|r 62 PFSGIIKKFQVENGSQILQGTPLLTFEDI 90 (440) Q Consensus 62 ~~~G~V~~i~VkeGd~VkkGq~L~~ld~~ 90 (440) .+.|.+.+++||+||.|+.||+|++++.. T Consensus 14 m~~g~l~kw~~~~gd~v~~gd~~aeietd 42 (464) T PRK11892 14 MEEGTLAKWLKKEGDKVKSGDVIAEIETD 42 (464) T ss_pred CCCCCCEEEECCCCCCCCCCCEEEEEEEC T ss_conf 45771214644689752578668997506 No 81 >KOG0238 consensus Probab=94.71 E-value=0.025 Score=33.89 Aligned_cols=32 Identities=19% Similarity=0.357 Sum_probs=25.8 Q ss_pred CCCCCCCEEEEECCEEECCCCCCCCCCCCCCCC Q ss_conf 345554116873121201222124445542103 Q gi|254780386|r 290 KTILSPIAGTIVYNQSFSSSNYAQQSQPLMKIV 322 (440) Q Consensus 290 ~~I~AP~~G~V~~l~~~~~G~~v~~G~~l~~Iv 322 (440) ..+.||+.|+|..+.+ .+|+.|..|++++-+. T Consensus 602 ~v~~aPMpG~Iekv~V-kpgd~V~~Gq~l~Vl~ 633 (670) T KOG0238 602 GVIVAPMPGIIEKVLV-KPGDKVKEGQELVVLI 633 (670) T ss_pred CCEECCCCCEEEEEEC-CCHHHHCCCCCEEEEE T ss_conf 8565578870446651-6435531567348999 No 82 >cd00210 PTS_IIA_glc PTS_IIA, PTS system, glucose/sucrose specific IIA subunit. The bacterial phosphoenolpyruvate: sugar phosphotransferase system (PTS) is a multi-protein system involved in the regulation of a variety of metabolic and transcriptional processes. This family is one of four structurally and functionally distinct group IIA PTS system cytoplasmic enzymes, necessary for the uptake of carbohydrates across the cytoplasmic membrane and their phosphorylation. Probab=94.70 E-value=0.17 Score=28.25 Aligned_cols=19 Identities=42% Similarity=0.735 Sum_probs=8.1 Q ss_pred EECCCCCEEECCCEEEEEC Q ss_conf 9857898933697899996 Q gi|254780386|r 70 FQVENGSQILQGTPLLTFE 88 (440) Q Consensus 70 i~VkeGd~VkkGq~L~~ld 88 (440) .+|++||.|++||+|+++| T Consensus 84 ~~v~~Gd~V~~G~~L~~~D 102 (124) T cd00210 84 SHVEEGQRVKQGDKLLEFD 102 (124) T ss_pred EEECCCCEECCCCEEEEEC T ss_conf 9974889998999999985 No 83 >KOG0369 consensus Probab=94.66 E-value=0.03 Score=33.35 Aligned_cols=33 Identities=21% Similarity=0.366 Sum_probs=28.0 Q ss_pred CCCCCCCCCEEEEECCEEECCCCCCCCCCCCCCC Q ss_conf 1234555411687312120122212444554210 Q gi|254780386|r 288 EQKTILSPIAGTIVYNQSFSSSNYAQQSQPLMKI 321 (440) Q Consensus 288 ~~~~I~AP~~G~V~~l~~~~~G~~v~~G~~l~~I 321 (440) ..-.|-||+.|+|..+.+ ..|+-|..|+||+.+ T Consensus 1105 ~~g~igAPMpG~vieikv-k~G~kV~Kgqpl~VL 1137 (1176) T KOG0369 1105 VKGHIGAPMPGTVIEIKV-KEGAKVKKGQPLAVL 1137 (1176) T ss_pred CCCCCCCCCCCCEEEEEE-ECCCEECCCCCEEEE T ss_conf 856536989984699998-307510379835761 No 84 >PRK03598 hypothetical protein; Provisional Probab=94.63 E-value=0.048 Score=31.95 Aligned_cols=89 Identities=9% Similarity=0.053 Sum_probs=45.3 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHH--------EEEE-----EEEEEEEEEEEECCCEEEEECCCCCEEEEEECCCCC Q ss_conf 356861689999999999999998764--------0671-----258999799995387389846888689999857898 Q gi|254780386|r 10 KRNHPASLKRMTLLSIIGIIGLITWSI--------LLPI-----EISVSSSGEILNEDNVVEIKSPFSGIIKKFQVENGS 76 (440) Q Consensus 10 ~~~~p~~~~~~~~~~~~~~~~~~~w~~--------~~~~-----~~~v~~~G~i~~~~~~~~I~s~~~G~V~~i~VkeGd 76 (440) +|+-. +.-.++.++.+++.++..|.. .+.+ ++....+|+| ..|--..|- .|+.|| T Consensus 2 kk~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ve~~~V~vs~~v~G~V------~~v~V~eGd-----~Vk~Gq 69 (331) T PRK03598 2 KKPVV-IGLAVVALVAVGAGGWWWYQSRQDNGLTLYGNVDIRTVNLSFRVGGRL------ASLAVDEGD-----AVKAGQ 69 (331) T ss_pred CCHHH-HHHHHHHHHHHHHHHHHHEEECCCCCEEEEEEEEEEEEEECCCCCEEE------EEEECCCCC-----EECCCC T ss_conf 83389-999999999999997652140578860357999999999913277599------999868979-----877998 Q ss_pred EEECCCE-EEEECHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9336978-999964899888999874343323321 Q gi|254780386|r 77 QILQGTP-LLTFEDIETTDLIDLKKVSIKNLRCHI 110 (440) Q Consensus 77 ~VkkGq~-L~~ld~~~~~~~l~~~~~~l~~l~~~l 110 (440) .+-.=|. -+...-.++++.+...+.++..+++.. T Consensus 70 ~La~LD~~~~~~~l~~a~a~l~~a~a~l~~~~~~~ 104 (331) T PRK03598 70 VLGELDHAPYENALMQAKAGVSVAQAQLDLMLAGY 104 (331) T ss_pred EEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 89998808999999999999999999999987201 No 85 >TIGR01349 PDHac_trf_mito pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase; InterPro: IPR006257 This group of sequences represent one of several closely related clades of the dihydrolipoamide acetyltransferase subunit of the pyruvate dehydrogenase complex. It includes sequences from mitochondria and from alpha and beta branches of the proteobacteria, as well as from some other bacteria, but not the Gram-positive bacteria.; GO: 0004742 dihydrolipoyllysine-residue acetyltransferase activity, 0006090 pyruvate metabolic process, 0045254 pyruvate dehydrogenase complex. Probab=94.62 E-value=0.031 Score=33.24 Aligned_cols=27 Identities=19% Similarity=0.373 Sum_probs=23.8 Q ss_pred CCCEEEEEECCCCCEEECCCEEEEECH Q ss_conf 886899998578989336978999964 Q gi|254780386|r 63 FSGIIKKFQVENGSQILQGTPLLTFED 89 (440) Q Consensus 63 ~~G~V~~i~VkeGd~VkkGq~L~~ld~ 89 (440) ..|-|.+|.+|+||.|+.||+|++++. T Consensus 12 ~~Gnla~W~KKeGDkv~~GDViaEIET 38 (584) T TIGR01349 12 TEGNLAKWLKKEGDKVKPGDVIAEIET 38 (584) T ss_pred CCCCCEEEEECCCCEECCCCEEEEEEC T ss_conf 656210244113777168876988745 No 86 >pfam00358 PTS_EIIA_1 phosphoenolpyruvate-dependent sugar phosphotransferase system, EIIA 1. Probab=94.56 E-value=0.21 Score=27.75 Aligned_cols=17 Identities=41% Similarity=0.579 Sum_probs=11.9 Q ss_pred CCCCCCCCEEEEECCEE Q ss_conf 23455541168731212 Q gi|254780386|r 289 QKTILSPIAGTIVYNQS 305 (440) Q Consensus 289 ~~~I~AP~~G~V~~l~~ 305 (440) ...|.||++|.+..+.. T Consensus 3 ~~~i~aP~~G~vi~L~~ 19 (133) T pfam00358 3 DETLASPITGEIVPLDQ 19 (133) T ss_pred CCEEEECCCCEEEEHHH T ss_conf 66896027827988456 No 87 >PRK11856 branched-chain alpha-keto acid dehydrogenase subunit E2; Reviewed Probab=94.55 E-value=0.041 Score=32.39 Aligned_cols=31 Identities=26% Similarity=0.415 Sum_probs=26.5 Q ss_pred CCCCCEEEEEECCCCCEEECCCEEEEECHHH Q ss_conf 6888689999857898933697899996489 Q gi|254780386|r 61 SPFSGIIKKFQVENGSQILQGTPLLTFEDIE 91 (440) Q Consensus 61 s~~~G~V~~i~VkeGd~VkkGq~L~~ld~~~ 91 (440) +-..|.|.+|+|++||.|++||+|++++... T Consensus 12 ~m~eg~i~~W~~~~Gd~V~~g~~l~eVeTDK 42 (324) T PRK11856 12 GMTEGEIVEWLVKEGDTVKKGDPLAEVETDK 42 (324) T ss_pred CCCEEEEEEEEECCCCEECCCCEEEEEECCC T ss_conf 9517999999908999978999499998076 No 88 >cd06250 M14_PaAOTO_like An uncharacterized subgroup of the Succinylglutamate desuccinylase (ASTE)/aspartoacylase (ASPA) subfamily which is part of the the M14 family of metallocarboxypeptidases. This subgroup includes Pseudomonas aeruginosa AotO and related proteins. ASTE catalyzes the fifth and last step in arginine catabolism by the arginine succinyltransferase pathway, and aspartoacylase (ASPA, also known as aminoacylase 2, and ACY-2; EC:3.5.1.15) cleaves N-acetyl L-aspartic acid (NAA) into aspartate and acetate. NAA is abundant in the brain, and hydrolysis of NAA by ASPA may help maintain white matter. ASPA is an NAA scavenger in other tissues. Mutations in the gene encoding ASPA cause Canavan disease (CD), a fatal progressive neurodegenerative disorder involving dysmyelination and spongiform degeneration of white matter in children. This enzyme binds zinc which is necessary for activity. Measurement of elevated NAA levels in urine is used in the diagnosis of CD. The gene encoding Probab=94.34 E-value=0.078 Score=30.57 Aligned_cols=30 Identities=20% Similarity=0.540 Sum_probs=26.2 Q ss_pred CCCCCCEEEEECCEEECCCCCCCCCCCCCCCC Q ss_conf 45554116873121201222124445542103 Q gi|254780386|r 291 TILSPIAGTIVYNQSFSSSNYAQQSQPLMKIV 322 (440) Q Consensus 291 ~I~AP~~G~V~~l~~~~~G~~v~~G~~l~~Iv 322 (440) -|+||.+|++... ...|+.|.+|+.|++|+ T Consensus 291 ~v~Ap~~Gi~~~~--~~~G~~V~~G~~la~I~ 320 (359) T cd06250 291 MLYAPAGGMVVYR--AAPGDWVEAGDVLAEIL 320 (359) T ss_pred EEECCCCEEEEEC--CCCCCEECCCCEEEEEE T ss_conf 9956998689872--79999977999899994 No 89 >TIGR02168 SMC_prok_B chromosome segregation protein SMC; InterPro: IPR011890 The SMC (structural maintenance of chromosomes) family of proteins, exist in virtually all organisms including both bacteria and archaea. The SMC proteins are essential for successful chromosome transmission during replication and segregation of the genome in all organisms and form three types of heterodimer (SMC1SMC3, SMC2SMC4, SMC5SMC6), which are core components of large multiprotein complexes. The best known complexes are cohesin, which is responsible for sister-chromatid cohesion, and condensin, which is required for full chromosome condensation in mitosis. SMCs are generally present as single proteins in bacteria, and as at least six distinct proteins in eukaryotes. The proteins range in size from approximately 110 to 170 kDa, and share a five-domain structure, with globular N- and C-terminal (IPR003395 from INTERPRO) domains separated by a long (circa 100 nm or 900 residues) coiled coil segment in the centre of which is a globular ''hinge'' domain, characterised by a set of four highly conserved glycine residues that are typical of flexible regions in a protein. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif (XXXXD, where X is any hydrophobic residue), and a LSGG motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases . All SMC proteins appear to form dimers, either forming homodimers with themselves, as in the case of prokaryotic SMC proteins, or heterodimers between different but related SMC proteins. The dimers are arranged in an antiparallel alignment. This orientation brings the N- and C-terminal globular domains (from either different or identical protamers) together, which unites an ATP binding site (Walker A motif) within the N-terminal domain with a Walker B motif (DA box) within the C-terminal domain, to form a potentially functional ATPase. Protein interaction and microscopy data suggest that SMC dimers form a ring-like structure which might embrace DNA molecules. Non-SMC subunits associate with the SMC amino- and carboxy-terminal domains. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. SMCs share not only sequence similarity but also structural similarity with ABC proteins. SMC proteins function together with other proteins in a range of chromosomal transactions, including chromosome condensation, sister-chromatid cohesion, recombination, DNA repair and epigenetic silencing of gene expression . The smc gene is often associated with scpB (IPR005234 from INTERPRO) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle , , .; GO: 0005515 protein binding, 0005524 ATP binding, 0005694 chromosome. Probab=94.13 E-value=0.00071 Score=44.20 Aligned_cols=198 Identities=17% Similarity=0.229 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHCCCCCCCCC Q ss_conf 99888999874343323321001233332221145554223568888776554678865433--3655420011222112 Q gi|254780386|r 91 ETTDLIDLKKVSIKNLRCHIDTEKSALDFLNNKYDVLEKAFESREEQLLDFLGKYHSTCSKV--FNYGISSVYRSLKMKL 168 (440) Q Consensus 91 ~~~~~l~~~~~~l~~l~~~l~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~ 168 (440) .....+..++.++..+..++.........+..++.............+.....+........ +...+........... T Consensus 295 ~~~~~i~~le~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 374 (1191) T TIGR02168 295 ELKAEISRLEQQIQILKERLENLEQQLEELEEQLEELESKLDELEEELAELEEKLEELKEEVESLEAELEELEAELEELE 374 (1191) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999999999989888999999999988748999999999999999999999999999999999999999 Q ss_pred CCCHHHHHHCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCC Q ss_conf 21002232101122322222210233332221012222221110000111002466653101223322222222222100 Q gi|254780386|r 169 TRMRSIYLNRLDLENKSSLLRALLSSHQKDLTMMKNLFEKNAVSQNDLNQQERIVYKSTLELRENIHEQKNMLRHINELY 248 (440) Q Consensus 169 ~~~~~~~~~~~~l~~~~~~l~~~~~~~~~el~~~~~l~~~~~~~~~~l~~~~~~~~~~~~el~~~~~~~~~~~~~l~~~~ 248 (440) ............+......+..+......++..++...+.......++.+....+.....+......+....+....... T Consensus 375 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~l 454 (1191) T TIGR02168 375 SRLEELEEELETLRSEVAELEEQLASLNNELERLEARLERLEERRERLQQEIEELLKKLEEAEAELKELQEELAELEEEL 454 (1191) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 89999999999999999999999999999999999999999999999999999998769999999999997788999999 Q ss_pred CCCCCCCCCCCCC---CCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 0122333211233---222122222222122233221222221 Q gi|254780386|r 249 DEANLEFANYLKE---ISRDLEQNQRTFADEMSKLTILEKTRE 288 (440) Q Consensus 249 ~~~~~~~~~~~~e---~~~~l~e~~~~l~~~~~~l~~l~~~l~ 288 (440) .++.......... ++.++.+.+..+.....++..+..++. T Consensus 455 ~~l~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~l~~~~~r~~ 497 (1191) T TIGR02168 455 EELQEELERLEEALEELREELEEAEQALDAAKRELAQLQARLD 497 (1191) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999999999999999999999999999999999999999 No 90 >COG4770 Acetyl/propionyl-CoA carboxylase, alpha subunit [Lipid metabolism] Probab=94.07 E-value=0.093 Score=30.05 Aligned_cols=10 Identities=30% Similarity=0.498 Sum_probs=4.2 Q ss_pred CCCCCCCCCH Q ss_conf 5763356861 Q gi|254780386|r 6 WKNLKRNHPA 15 (440) Q Consensus 6 ~~~~~~~~p~ 15 (440) +--|+=-||. T Consensus 290 NTRLQVEHPV 299 (645) T COG4770 290 NTRLQVEHPV 299 (645) T ss_pred ECCEECCCCC T ss_conf 2202025641 No 91 >PRK09039 hypothetical protein; Validated Probab=94.05 E-value=0.033 Score=33.04 Aligned_cols=43 Identities=16% Similarity=0.166 Sum_probs=19.9 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 8899987434332332100123333222114555422356888 Q gi|254780386|r 94 DLIDLKKVSIKNLRCHIDTEKSALDFLNNKYDVLEKAFESREE 136 (440) Q Consensus 94 ~~l~~~~~~l~~l~~~l~~~~~~l~~~~~~~~~~~~~~~~~~~ 136 (440) ..++.+..++.++...+..++.....+..+...+...+..... T Consensus 53 ~~L~~L~~qia~L~~~L~Ler~~~~~L~~~l~~L~~~L~~~~~ 95 (343) T PRK09039 53 KALDRLNAQIAELADLLSLERQGNQDLQDSVAQLRASLSLAEA 95 (343) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999999999999876565545699999999999887799 No 92 >TIGR00830 PTBA PTS system, glucose subfamily, IIA component; InterPro: IPR001127 The phosphoenolpyruvate-dependent sugar phosphotransferase system (PTS) , is a major carbohydrate transport system in bacteria. The PTS catalyses the phosphorylation of incoming sugar substrates and coupled with translocation across the cell membrane, makes the PTS a link between the uptake and metabolism of sugars. The general mechanism of the PTS is the following: a phosphoryl group from phosphoenolpyruvate (PEP) is transferred via a signal transduction pathway, to enzyme I (EI) which in turn transfers it to a phosphoryl carrier, the histidine protein (HPr). Phospho-HPr then transfers the phosphoryl group to a sugar-specific permease, a membrane-bound complex known as enzyme 2 (EII), which transports the sugar to the cell. EII consists of at least three structurally distinct domains IIA, IIB and IIC . These can either be fused together in a single polypeptide chain or exist as two or three interactive chains, formerly called enzymes II (EII) and III (EIII). The first domain (IIA or EIIA) carries the first permease-specific phosphorylation site, a histidine which is phosphorylated by phospho-HPr. The second domain (IIB or EIIB) is phosphorylated by phospho-IIA on a cysteinyl or histidyl residue, depending on the sugar transported. Finally, the phosphoryl group is transferred from the IIB domain to the sugar substrate concomitantly with the sugar uptake processed by the IIC domain. This third domain (IIC or EIIC) forms the translocation channel and the specific substrate-binding site. An additional transmembrane domain IID, homologous to IIC, can be found in some PTSs, e.g. for mannose , , , . ; GO: 0005351 sugar:hydrogen ion symporter activity, 0006810 transport, 0009401 phosphoenolpyruvate-dependent sugar phosphotransferase system, 0016020 membrane. Probab=93.97 E-value=0.058 Score=31.43 Aligned_cols=36 Identities=28% Similarity=0.409 Sum_probs=24.4 Q ss_pred EEECCCCC----EEEEEECCCCCEEECCCEEEEECHHHHH Q ss_conf 98468886----8999985789893369789999648998 Q gi|254780386|r 58 EIKSPFSG----IIKKFQVENGSQILQGTPLLTFEDIETT 93 (440) Q Consensus 58 ~I~s~~~G----~V~~i~VkeGd~VkkGq~L~~ld~~~~~ 93 (440) +|+=.-.| -|=+.+|++||.|++||.|+++|....+ T Consensus 76 TV~L~G~GPGELEvF~~~v~~G~~V~~Gd~l~efD~~~i~ 115 (129) T TIGR00830 76 TVKLNGEGPGELEVFTSHVEEGDKVKKGDPLLEFDLPAIK 115 (129) T ss_pred EEEECCCCCCEEEEEEEEEEECCEECCCCEEEEECHHHHH T ss_conf 5887874896131247876438787179779995669996 No 93 >PRK07051 hypothetical protein; Validated Probab=93.64 E-value=0.032 Score=33.13 Aligned_cols=34 Identities=18% Similarity=0.255 Sum_probs=21.3 Q ss_pred CCCCCCCCCEEEEECC------EEECCCCCCCCCCCCCCC Q ss_conf 1234555411687312------120122212444554210 Q gi|254780386|r 288 EQKTILSPIAGTIVYN------QSFSSSNYAQQSQPLMKI 321 (440) Q Consensus 288 ~~~~I~AP~~G~V~~l------~~~~~G~~v~~G~~l~~I 321 (440) ....|.||.-|++..- .+...|+.|.+|+.|+-| T Consensus 2 ~~~~I~sPm~G~fy~~~~p~~~pfV~~Gd~V~~G~~v~ii 41 (80) T PRK07051 2 AQHEIVSPLPGTFYRRPSPDAPPYVEVGDTVAAGDVVGLV 41 (80) T ss_pred CCCEEECCCCEEEECCCCCCCCCCCCCCCEECCCCEEEEE T ss_conf 9977728944789827995999813679998899999999 No 94 >TIGR02712 urea_carbox urea carboxylase; InterPro: IPR014084 Members of this family are ATP-dependent urea carboxylases, including characterised members from Oleomonas sagaranensis (alpha class Proteobacterium) and yeasts such as Saccharomyces cerevisiae (Baker's yeast). The allophanate hydrolase domain of the yeast enzyme is not included in this entry and is represented by an adjacent gene in Oleomonas sagaranensis. The fusion of urea carboxylase and allophanate hydrolase is designated urea amidolyase. The enzyme from O. sagaranensis was shown to be highly active on acetamide and formamide as well as urea.. Probab=92.85 E-value=0.039 Score=32.59 Aligned_cols=41 Identities=20% Similarity=0.364 Sum_probs=28.3 Q ss_pred CCCCCCCCCCCCCCCCCEEEEECCEEECCCCCCCCCCCCCCC Q ss_conf 221222221234555411687312120122212444554210 Q gi|254780386|r 280 LTILEKTREQKTILSPIAGTIVYNQSFSSSNYAQQSQPLMKI 321 (440) Q Consensus 280 l~~l~~~l~~~~I~AP~~G~V~~l~~~~~G~~v~~G~~l~~I 321 (440) |-.+|.---...|.||.+|+|..+.. ++|+.|.+|+.|+-| T Consensus 1185 l~i~EaMKmE~~v~Ap~~G~V~~i~~-~~G~~V~aG~~l~~~ 1225 (1226) T TIGR02712 1185 LVIIEAMKMEMTVSAPVAGKVTKILC-KPGDMVDAGDILVVL 1225 (1226) T ss_pred EEEEEECCCCEEECCCCCCEEEEEEE-CCCCCCCCCCEEEEE T ss_conf 89994357211243898854889973-684822678637874 No 95 >cd06252 M14_ASTE_ASPA_like_2 A functionally uncharacterized subgroup of the Succinylglutamate desuccinylase (ASTE)/aspartoacylase (ASPA) subfamily which is part of the M14 family of metallocarboxypeptidases. ASTE catalyzes the fifth and last step in arginine catabolism by the arginine succinyltransferase pathway, and aspartoacylase (ASPA, also known as aminoacylase 2, and ACY-2; EC:3.5.1.15) cleaves N-acetyl L-aspartic acid (NAA) into aspartate and acetate. NAA is abundant in the brain, and hydrolysis of NAA by ASPA may help maintain white matter. ASPA is an NAA scavenger in other tissues. Mutations in the gene encoding ASPA cause Canavan disease (CD), a fatal progressive neurodegenerative disorder involving dysmyelination and spongiform degeneration of white matter in children. This enzyme binds zinc which is necessary for activity. Measurement of elevated NAA levels in urine is used in the diagnosis of CD. Probab=92.41 E-value=0.46 Score=25.43 Aligned_cols=30 Identities=13% Similarity=0.108 Sum_probs=25.6 Q ss_pred CCCCCCEEEEECCEEECCCCCCCCCCCCCCCC Q ss_conf 45554116873121201222124445542103 Q gi|254780386|r 291 TILSPIAGTIVYNQSFSSSNYAQQSQPLMKIV 322 (440) Q Consensus 291 ~I~AP~~G~V~~l~~~~~G~~v~~G~~l~~Iv 322 (440) -|+||.+|.+.. . ...|+.|..|+.|..|+ T Consensus 246 ~v~Ap~~Gif~~-~-v~lG~~V~kGq~lg~I~ 275 (316) T cd06252 246 YVFAPHPGLFEP-L-VDLGDEVSAGQVAGRIH 275 (316) T ss_pred EEECCCCEEEEE-E-CCCCCEECCCCEEEEEE T ss_conf 997699869999-1-78899977999999996 No 96 >pfam00364 Biotin_lipoyl Biotin-requiring enzyme. This family covers two Prosite entries, the conserved lysine residue binds biotin in one group and lipoic acid in the other. Note that the model may not recognize the Glycine cleavage system H proteins. Probab=92.18 E-value=0.087 Score=30.25 Aligned_cols=21 Identities=24% Similarity=0.408 Sum_probs=12.8 Q ss_pred EEECCCCCEEECCCEEEEECH Q ss_conf 998578989336978999964 Q gi|254780386|r 69 KFQVENGSQILQGTPLLTFED 89 (440) Q Consensus 69 ~i~VkeGd~VkkGq~L~~ld~ 89 (440) +++|++||.|++||+|+.++. T Consensus 18 ~w~v~~Gd~V~~g~~l~~iEt 38 (73) T pfam00364 18 EWLVKVGDKVKAGQVLCEVEA 38 (73) T ss_pred EEEECCCCEECCCCEEEEEEE T ss_conf 486589899928998999996 No 97 >cd06254 M14_ASTE_ASPA_like_4 A functionally uncharacterized subgroup of the Succinylglutamate desuccinylase (ASTE)/aspartoacylase (ASPA) subfamily which is part of the M14 family of metallocarboxypeptidases. ASTE catalyzes the fifth and last step in arginine catabolism by the arginine succinyltransferase pathway, and aspartoacylase (ASPA, also known as aminoacylase 2, and ACY-2; EC:3.5.1.15) cleaves N-acetyl L-aspartic acid (NAA) into aspartate and acetate. NAA is abundant in the brain, and hydrolysis of NAA by ASPA may help maintain white matter. ASPA is an NAA scavenger in other tissues. Mutations in the gene encoding ASPA cause Canavan disease (CD), a fatal progressive neurodegenerative disorder involving dysmyelination and spongiform degeneration of white matter in children. This enzyme binds zinc which is necessary for activity. Measurement of elevated NAA levels in urine is used in the diagnosis of CD. Probab=92.14 E-value=0.36 Score=26.13 Aligned_cols=31 Identities=16% Similarity=0.101 Sum_probs=26.1 Q ss_pred CCCCCCCEEEEECCEEECCCCCCCCCCCCCCCC Q ss_conf 345554116873121201222124445542103 Q gi|254780386|r 290 KTILSPIAGTIVYNQSFSSSNYAQQSQPLMKIV 322 (440) Q Consensus 290 ~~I~AP~~G~V~~l~~~~~G~~v~~G~~l~~Iv 322 (440) .-|+||.+|.+.. .+ ..|+.|.+|++|++|+ T Consensus 224 ~~v~A~~~Gl~~~-~v-~~G~~V~~Gq~lg~I~ 254 (288) T cd06254 224 YYVTSPASGLWYP-FV-KAGDTVQKGALLGYVT 254 (288) T ss_pred EEEECCCCEEEEE-EC-CCCCEECCCCEEEEEE T ss_conf 7996599849999-36-8899976899999996 No 98 >COG1566 EmrA Multidrug resistance efflux pump [Defense mechanisms] Probab=91.98 E-value=0.23 Score=27.48 Aligned_cols=65 Identities=11% Similarity=0.205 Sum_probs=45.0 Q ss_pred EEEECCCCCEEEC-CCEEEEECHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCH-HHHHHHHHH Q ss_conf 9998578989336-9789999648998889998743433233210012333322211-455542235 Q gi|254780386|r 68 KKFQVENGSQILQ-GTPLLTFEDIETTDLIDLKKVSIKNLRCHIDTEKSALDFLNNK-YDVLEKAFE 132 (440) Q Consensus 68 ~~i~VkeGd~Vkk-Gq~L~~ld~~~~~~~l~~~~~~l~~l~~~l~~~~~~l~~~~~~-~~~~~~~~~ 132 (440) ..=+|+.||..-. .+.-+++.-.++.+.+...+..+..+++++...++.++..... +.....+.. T Consensus 71 dnq~Vk~Gd~L~~iD~~~y~~al~qAea~la~a~~~~~~~~a~~~~~~A~i~~a~a~~l~~a~~~~~ 137 (352) T COG1566 71 DNQLVKKGDVLFRIDPRDYRAALEQAEAALAAAEAQLRNLRAQLASAQALIAQAEAQDLDQAQNELE 137 (352) T ss_pred CCCEECCCCEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 8988518976899895789999999999999999999989878888888999999899999898889 No 99 >cd06253 M14_ASTE_ASPA_like_3 A functionally uncharacterized subgroup of the Succinylglutamate desuccinylase (ASTE)/aspartoacylase (ASPA) subfamily which is part of the M14 family of metallocarboxypeptidases. ASTE catalyzes the fifth and last step in arginine catabolism by the arginine succinyltransferase pathway, and aspartoacylase (ASPA, also known as aminoacylase 2, and ACY-2; EC:3.5.1.15) cleaves N-acetyl L-aspartic acid (NAA) into aspartate and acetate. NAA is abundant in the brain, and hydrolysis of NAA by ASPA may help maintain white matter. ASPA is an NAA scavenger in other tissues. Mutations in the gene encoding ASPA cause Canavan disease (CD), a fatal progressive neurodegenerative disorder involving dysmyelination and spongiform degeneration of white matter in children. This enzyme binds zinc which is necessary for activity. Measurement of elevated NAA levels in urine is used in the diagnosis of CD. Probab=91.95 E-value=0.45 Score=25.50 Aligned_cols=31 Identities=13% Similarity=0.126 Sum_probs=26.2 Q ss_pred CCCCCCCEEEEECCEEECCCCCCCCCCCCCCCC Q ss_conf 345554116873121201222124445542103 Q gi|254780386|r 290 KTILSPIAGTIVYNQSFSSSNYAQQSQPLMKIV 322 (440) Q Consensus 290 ~~I~AP~~G~V~~l~~~~~G~~v~~G~~l~~Iv 322 (440) .-|+||.+|.+.. . ...|+.|.+|++|.+|+ T Consensus 230 ~~i~Ap~~Gl~~~-~-~~lG~~V~~G~~lg~I~ 260 (298) T cd06253 230 VYVNAETSGIFVP-A-KHLGDIVKRGDVIGEIV 260 (298) T ss_pred EEEECCCCEEEEE-E-CCCCCEECCCCEEEEEE T ss_conf 8996599989999-4-68899887899999996 No 100 >pfam05896 NQRA Na(+)-translocating NADH-quinone reductase subunit A (NQRA). This family consists of several bacterial Na(+)-translocating NADH-quinone reductase subunit A (NQRA) proteins. The Na(+)-translocating NADH: ubiquinone oxidoreductase (Na(+)-NQR) generates an electrochemical Na(+) potential driven by aerobic respiration. Probab=91.83 E-value=0.14 Score=28.94 Aligned_cols=27 Identities=26% Similarity=0.540 Sum_probs=23.1 Q ss_pred CCCCCEEEEEECCCCCEEECCCEEEEE Q ss_conf 688868999985789893369789999 Q gi|254780386|r 61 SPFSGIIKKFQVENGSQILQGTPLLTF 87 (440) Q Consensus 61 s~~~G~V~~i~VkeGd~VkkGq~L~~l 87 (440) +-.-|+.=++.|+|||.|++||+|+.= T Consensus 34 ~Df~glkPkl~VkeGD~Vk~G~pLF~d 60 (257) T pfam05896 34 EDYVGMKPKMLVKEGDKVKAGQPLFED 60 (257) T ss_pred CCCCCCCEEEEEECCCEEECCCEEEEE T ss_conf 014887326998349998558757997 No 101 >cd06255 M14_ASTE_ASPA_like_5 A functionally uncharacterized subgroup of the Succinylglutamate desuccinylase (ASTE)/aspartoacylase (ASPA) subfamily which is part of the M14 family of metallocarboxypeptidases. ASTE catalyzes the fifth and last step in arginine catabolism by the arginine succinyltransferase pathway, and aspartoacylase (ASPA, also known as aminoacylase 2, and ACY-2; EC:3.5.1.15) cleaves N-acetyl L-aspartic acid (NAA) into aspartate and acetate. NAA is abundant in the brain, and hydrolysis of NAA by ASPA may help maintain white matter. ASPA is an NAA scavenger in other tissues. Mutations in the gene encoding ASPA cause Canavan disease (CD), a fatal progressive neurodegenerative disorder involving dysmyelination and spongiform degeneration of white matter in children. This enzyme binds zinc which is necessary for activity. Measurement of elevated NAA levels in urine is used in the diagnosis of CD. Probab=91.62 E-value=0.32 Score=26.47 Aligned_cols=30 Identities=17% Similarity=0.204 Sum_probs=25.5 Q ss_pred CCCCCCEEEEECCEEECCCCCCCCCCCCCCCC Q ss_conf 45554116873121201222124445542103 Q gi|254780386|r 291 TILSPIAGTIVYNQSFSSSNYAQQSQPLMKIV 322 (440) Q Consensus 291 ~I~AP~~G~V~~l~~~~~G~~v~~G~~l~~Iv 322 (440) -|+||.+|.+.. . ...|+.|..|+.|.+|. T Consensus 233 ~v~A~~~Gl~~~-~-v~~G~~V~kGq~lg~I~ 262 (293) T cd06255 233 WVAAIHGGLFEP-S-VPAGDTIPAGQPLGRVV 262 (293) T ss_pred EEECCCCEEEEE-E-CCCCCEECCCCEEEEEE T ss_conf 997599859998-3-78899977999999998 No 102 >COG2190 NagE Phosphotransferase system IIA components [Carbohydrate transport and metabolism] Probab=91.60 E-value=0.26 Score=27.05 Aligned_cols=18 Identities=39% Similarity=0.835 Sum_probs=7.2 Q ss_pred ECCCCCEEECCCEEEEEC Q ss_conf 857898933697899996 Q gi|254780386|r 71 QVENGSQILQGTPLLTFE 88 (440) Q Consensus 71 ~VkeGd~VkkGq~L~~ld 88 (440) +|++||.|++||+|+++| T Consensus 92 ~v~~Gd~Vk~Gd~Li~fD 109 (156) T COG2190 92 LVKEGDKVKAGDPLLEFD 109 (156) T ss_pred EEECCCEECCCCEEEEEC T ss_conf 750799881598989997 No 103 >TIGR02063 RNase_R ribonuclease R; InterPro: IPR011805 This family consists of an exoribonuclease, ribonuclease R, also called VacB. It is one of the eight exoribonucleases reported in Escherichia coli and is broadly distributed throughout the bacteria. In E. coli, double mutants of this protein and polynucleotide phosphorylase are not viable. Not included in this entry are shorter, divergent forms from the Chlamydiae and the Campylobacterales (including Helicobacter pylori) and Leptospira interrogans , .; GO: 0003723 RNA binding, 0004518 nuclease activity. Probab=91.58 E-value=0.36 Score=26.10 Aligned_cols=53 Identities=26% Similarity=0.400 Sum_probs=42.6 Q ss_pred CCCCCCCCCC-EEEEEEECHH--HHHHCCCCCCEEEEEEECCC-CCCCEEEEEEEEE Q ss_conf 4210335565-0477532622--31101589831799981288-7544399999997 Q gi|254780386|r 318 LMKIVPHSKL-TYIRAKVTPK--QIQHVKKGYTATVRFPHYAD-IREKKFKAIIEKI 370 (440) Q Consensus 318 l~~Ivp~~~~-l~v~~~v~~~--di~~i~~Gq~v~i~~~a~~~-~~~~~~~G~V~~I 370 (440) +.-++|++.. +.-+.+||+. ++...+.|+.|.+.++.||. ...+...|+|..| T Consensus 167 ~~fv~p~D~~~~~~~i~~~~~~~~~~~~~~G~~V~~ei~~~p~~~~~~~~~G~v~~i 223 (755) T TIGR02063 167 IGFVVPDDKRKIYQDIFIPPESVQILGAKEGDKVLVEITKYPDKKRNRPAEGKVVEI 223 (755) T ss_pred EEEEEECCCCCCCCCCEECCCCCCCCCCCCCCEEEEEEEEECCCCCCCCCEEEEEEE T ss_conf 899820687321424211644310167646868999988617886678625999998 No 104 >pfam09726 Macoilin Transmembrane protein. This entry is a highly conserved protein present in eukaryotes. Probab=91.55 E-value=0.0074 Score=37.37 Aligned_cols=24 Identities=17% Similarity=0.206 Sum_probs=9.2 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 987434332332100123333222 Q gi|254780386|r 98 LKKVSIKNLRCHIDTEKSALDFLN 121 (440) Q Consensus 98 ~~~~~l~~l~~~l~~~~~~l~~~~ 121 (440) .++..++.++++|...++.-.++. T Consensus 410 rLE~dlKkLraELq~~Rq~E~ELR 433 (680) T pfam09726 410 RLEQDIKKLQAELQQARQNESELR 433 (680) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999999998787799999 No 105 >PRK05352 Na(+)-translocating NADH-quinone reductase subunit A; Provisional Probab=91.54 E-value=0.095 Score=29.99 Aligned_cols=28 Identities=21% Similarity=0.561 Sum_probs=23.5 Q ss_pred CCCCCEEEEEECCCCCEEECCCEEEEEC Q ss_conf 6888689999857898933697899996 Q gi|254780386|r 61 SPFSGIIKKFQVENGSQILQGTPLLTFE 88 (440) Q Consensus 61 s~~~G~V~~i~VkeGd~VkkGq~L~~ld 88 (440) +-.-|..-++.||+||.|++||+|+.=. T Consensus 35 ~df~gl~pkl~VkeGD~V~~Gq~Lf~dK 62 (448) T PRK05352 35 EEYVGLKPTMKVKEGDKVKKGQPLFEDK 62 (448) T ss_pred CCCCCCCCCEEECCCCEECCCCEEEECC T ss_conf 1049999856855799974798556538 No 106 >KOG1029 consensus Probab=91.24 E-value=0.012 Score=35.97 Aligned_cols=10 Identities=0% Similarity=0.129 Sum_probs=3.9 Q ss_pred EECHHHHHHC Q ss_conf 3262231101 Q gi|254780386|r 333 KVTPKQIQHV 342 (440) Q Consensus 333 ~v~~~di~~i 342 (440) ++|+.++-+| T Consensus 741 WFPenyvEki 750 (1118) T KOG1029 741 WFPENYVEKI 750 (1118) T ss_pred CCCHHHHHHC T ss_conf 6728888644 No 107 >KOG0161 consensus Probab=91.01 E-value=0.0096 Score=36.63 Aligned_cols=186 Identities=13% Similarity=0.133 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---------HHHHHHHCCC Q ss_conf 9888999874343323321001233332221145554223568888776554678865433---------3655420011 Q gi|254780386|r 92 TTDLIDLKKVSIKNLRCHIDTEKSALDFLNNKYDVLEKAFESREEQLLDFLGKYHSTCSKV---------FNYGISSVYR 162 (440) Q Consensus 92 ~~~~l~~~~~~l~~l~~~l~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~ 162 (440) ....+..+...+...+.....+......+...++.++..++.......+..........++ +......+.. T Consensus 990 lEe~~~~l~~~l~~~eek~~~l~k~~~kle~~l~~le~~le~e~~~r~e~Ek~~rkle~el~~~~e~~~~~~~~~~el~~ 1069 (1930) T KOG0161 990 LEERIRELQDDLQAEEEKAKSLNKAKAKLEQQLDDLEVTLEREKRIRMELEKAKRKLEGELKDLQESIEELKKQKEELDN 1069 (1930) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999989989999999998999999889999999999999999998889999999887888766579999988999999 Q ss_pred CCCCCCCCCHHHHHHCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 22211221002232101122322222210233332221012222221110000111002466653101223322222222 Q gi|254780386|r 163 SLKMKLTRMRSIYLNRLDLENKSSLLRALLSSHQKDLTMMKNLFEKNAVSQNDLNQQERIVYKSTLELRENIHEQKNMLR 242 (440) Q Consensus 163 ~~~~~~~~~~~~~~~~~~l~~~~~~l~~~~~~~~~el~~~~~l~~~~~~~~~~l~~~~~~~~~~~~el~~~~~~~~~~~~ 242 (440) ++.....++..+.....+.......+...+..++..+..+....+........+....+.+.....++...+.+...... T Consensus 1070 ~l~kke~El~~l~~k~e~e~~~~~~l~k~i~eL~~~i~el~e~le~er~~r~K~ek~r~dL~~ele~l~~~Lee~~~~t~ 1149 (1930) T KOG0161 1070 QLKKKESELSQLQSKLEDEQAEVAQLQKQIKELEARIKELEEELEAERASRAKAERQRRDLSEELEELKEELEEQGGTTA 1149 (1930) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 98888978999998865799999998999999999999999889999999999999999999999999999998760499 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 22210001223332112332221222222221222 Q gi|254780386|r 243 HINELYDEANLEFANYLKEISRDLEQNQRTFADEM 277 (440) Q Consensus 243 ~l~~~~~~~~~~~~~~~~e~~~~l~e~~~~l~~~~ 277 (440) ...+.......++....+.++.........+.+++ T Consensus 1150 ~q~e~~~k~e~e~~~l~~~leee~~~~e~~~~~lr 1184 (1930) T KOG0161 1150 AQLELNKKREAEVQKLRRDLEEETLDHEAQIEELR 1184 (1930) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99998887799999999999999870799999999 No 108 >PRK11642 exoribonuclease R; Provisional Probab=90.98 E-value=0.57 Score=24.80 Aligned_cols=75 Identities=12% Similarity=0.087 Sum_probs=43.5 Q ss_pred CCCCCCCEEEEECCEEECCCCCCCCCCC----CCCCCCCCCCEEE-----EEEECHHHHHHCCCCCCEEEEEEECCCCCC Q ss_conf 3455541168731212012221244455----4210335565047-----753262231101589831799981288754 Q gi|254780386|r 290 KTILSPIAGTIVYNQSFSSSNYAQQSQP----LMKIVPHSKLTYI-----RAKVTPKQIQHVKKGYTATVRFPHYADIRE 360 (440) Q Consensus 290 ~~I~AP~~G~V~~l~~~~~G~~v~~G~~----l~~Ivp~~~~l~v-----~~~v~~~di~~i~~Gq~v~i~~~a~~~~~~ 360 (440) ..|-..++|+|..+. .-|=||.-... |+.|-.-.+++|. ...|-+.--...+.|+.|+|++...+... T Consensus 641 ~~iGe~f~G~IsgVt--~fG~FVeL~~~~vEGLVhvs~L~dD~Y~yd~~~~~LvG~rtg~~y~lGD~V~V~v~~Vd~~~- 717 (813) T PRK11642 641 DQVGNVFKGVIASVT--GFGFFVRLDDLFIDGLVHVSSLDNDYYRFDQVGQRLIGESSGQTYRLGDRVEVRVEAVNMDE- 717 (813) T ss_pred HHCCCEEEEEEEEEE--EEEEEEEECCCCEEEEEECCCCCCCEEEECCCCCEEEEEECCCEEECCCEEEEEEEEECHHH- T ss_conf 744997889996076--10089995488637888801288831898335788986507978826997999999974213- Q ss_pred CEEEEEE Q ss_conf 4399999 Q gi|254780386|r 361 KKFKAII 367 (440) Q Consensus 361 ~~~~G~V 367 (440) +.+.-.+ T Consensus 718 ~qIDfeL 724 (813) T PRK11642 718 RKIDFSL 724 (813) T ss_pred CEEEEEE T ss_conf 7355898 No 109 >cd06251 M14_ASTE_ASPA_like_1 A functionally uncharacterized subgroup of the Succinylglutamate desuccinylase (ASTE)/aspartoacylase (ASPA) subfamily which is part of the M14 family of metallocarboxypeptidases. ASTE catalyzes the fifth and last step in arginine catabolism by the arginine succinyltransferase pathway, and aspartoacylase (ASPA, also known as aminoacylase 2, and ACY-2; EC:3.5.1.15) cleaves N-acetyl L-aspartic acid (NAA) into aspartate and acetate. NAA is abundant in the brain, and hydrolysis of NAA by ASPA may help maintain white matter. ASPA is an NAA scavenger in other tissues. Mutations in the gene encoding ASPA cause Canavan disease (CD), a fatal progressive neurodegenerative disorder involving dysmyelination and spongiform degeneration of white matter in children. This enzyme binds zinc which is necessary for activity. Measurement of elevated NAA levels in urine is used in the diagnosis of CD. Probab=90.97 E-value=0.59 Score=24.71 Aligned_cols=31 Identities=16% Similarity=0.154 Sum_probs=26.0 Q ss_pred CCCCCCCEEEEECCEEECCCCCCCCCCCCCCCC Q ss_conf 345554116873121201222124445542103 Q gi|254780386|r 290 KTILSPIAGTIVYNQSFSSSNYAQQSQPLMKIV 322 (440) Q Consensus 290 ~~I~AP~~G~V~~l~~~~~G~~v~~G~~l~~Iv 322 (440) .-|+||.+|.+... ...|+.|.+|+.|.+|+ T Consensus 220 ~~v~A~~~Gl~~~~--~~~G~~V~~Gq~lg~I~ 250 (287) T cd06251 220 VWVRAPQGGLLRSL--VKLGDKVKKGQLLATIT 250 (287) T ss_pred EEEECCCCCEEEEC--CCCCCEECCCCEEEEEE T ss_conf 79976999879885--89889947999899997 No 110 >COG0539 RpsA Ribosomal protein S1 [Translation, ribosomal structure and biogenesis] Probab=90.61 E-value=1.1 Score=22.79 Aligned_cols=12 Identities=25% Similarity=0.277 Sum_probs=5.2 Q ss_pred EECCCCEEEEEE Q ss_conf 152377569999 Q gi|254780386|r 404 ELRNGFPAEVLF 415 (440) Q Consensus 404 ~L~~Gm~~~~~I 415 (440) .+.+|-.++|.| T Consensus 486 ~~~~gd~v~a~v 497 (541) T COG0539 486 VLKVGDEVEAVV 497 (541) T ss_pred HCCCCCEEEEEE T ss_conf 235798899999 No 111 >pfam07888 CALCOCO1 Calcium binding and coiled-coil domain (CALCOCO1) like. Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region. Probab=90.49 E-value=0.052 Score=31.73 Aligned_cols=15 Identities=7% Similarity=0.018 Sum_probs=10.8 Q ss_pred CCCCCCCCCCCCHHH Q ss_conf 334576335686168 Q gi|254780386|r 3 HFMWKNLKRNHPASL 17 (440) Q Consensus 3 ~~~~~~~~~~~p~~~ 17 (440) |+--+|..+|||+=| T Consensus 25 HYTLt~~i~ph~kDW 39 (546) T pfam07888 25 HYTIPSGHDWSSRDW 39 (546) T ss_pred EEECCCCCCCCCCCC T ss_conf 552588786798761 No 112 >TIGR01347 sucB 2-oxoglutarate dehydrogenase, E2 component, dihydrolipoamide succinyltransferase; InterPro: IPR006255 These sequences describe the TCA cycle 2-oxoglutarate system E2 component, dihydrolipoamide succinyltransferase. It is closely related to the pyruvate dehydrogenase E2 component, dihydrolipoamide acetyltransferase. Members include the mitochondrial and Gram-negative bacterial forms. Mycobacterial candidates are highly derived, differ in having and extra copy of the lipoyl-binding domain at the N-terminus and are generally excluded from this group of sequences as are all examples of dihydrolipoamide acetyltransferase. ; GO: 0004149 dihydrolipoyllysine-residue succinyltransferase activity, 0006099 tricarboxylic acid cycle, 0045252 oxoglutarate dehydrogenase complex. Probab=90.37 E-value=0.067 Score=31.02 Aligned_cols=29 Identities=14% Similarity=0.310 Sum_probs=26.3 Q ss_pred CCCCCEEEEEECCCCCEEECCCEEEEECH Q ss_conf 68886899998578989336978999964 Q gi|254780386|r 61 SPFSGIIKKFQVENGSQILQGTPLLTFED 89 (440) Q Consensus 61 s~~~G~V~~i~VkeGd~VkkGq~L~~ld~ 89 (440) |-..|.|.+|.-+.||.|++|++|++|+. T Consensus 11 SvtegTva~W~KkvGD~V~~DE~iveiET 39 (435) T TIGR01347 11 SVTEGTVAEWHKKVGDTVKRDELIVEIET 39 (435) T ss_pred CCCCCHHHHHCCCCCCEEECCCEEEEEEC T ss_conf 35533055410688871122776897520 No 113 >PRK09039 hypothetical protein; Validated Probab=90.26 E-value=0.18 Score=28.12 Aligned_cols=25 Identities=12% Similarity=0.246 Sum_probs=8.5 Q ss_pred HHHHHHHCCCCCCCCCCCHHHHHHH Q ss_conf 3233210012333322211455542 Q gi|254780386|r 105 NLRCHIDTEKSALDFLNNKYDVLEK 129 (440) Q Consensus 105 ~l~~~l~~~~~~l~~~~~~~~~~~~ 129 (440) .+..++...+..+.....+.+.++. T Consensus 78 ~L~~~l~~L~~~L~~~~~e~~~L~~ 102 (343) T PRK09039 78 DLQDSVAQLRASLSLAEAERSRLQA 102 (343) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 6999999999998877999999999 No 114 >TIGR00717 rpsA ribosomal protein S1; InterPro: IPR000110 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites , . About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many of ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome , . Ribosomal protein S1 contains the S1 domain that has been found in a large number of RNA-associated proteins. S1 is a prominent component of the Escherichia coli ribosome and is most probably required for translation of most, if not all, natural mRNAs in E. coli in vivo . It has been suggested that S1 is a RNA-binding protein helping polynucleotide phosphorylase (PNPase, known to be phylogenetically related to S1) to degrade mRNA, or helper molecule involved in other RNase activities . Unique among ribosomal proteins, the primary structure of S1 contains four repeating homologous stretches in the central and terminal region of the molecule. S1 is organised into at least two distinct domains; a ribosome-binding domain at the N-terminal region and a nucleic acid-binding domain at the C-terminal region . There may be a flexible region between the two domains permitting free movement of the domains relative to each other. ; GO: 0003723 RNA binding, 0003735 structural constituent of ribosome, 0006412 translation, 0005840 ribosome. Probab=90.04 E-value=1.3 Score=22.48 Aligned_cols=114 Identities=11% Similarity=0.145 Sum_probs=63.7 Q ss_pred CCCCCCCCCCCCCCEEEEECCEEECCC--CCCCCCCCCCCCCCCCCCEEEEEEECHHHHH---------HCCCCCCEEEE Q ss_conf 222221234555411687312120122--2124445542103355650477532622311---------01589831799 Q gi|254780386|r 283 LEKTREQKTILSPIAGTIVYNQSFSSS--NYAQQSQPLMKIVPHSKLTYIRAKVTPKQIQ---------HVKKGYTATVR 351 (440) Q Consensus 283 l~~~l~~~~I~AP~~G~V~~l~~~~~G--~~v~~G~~l~~Ivp~~~~l~v~~~v~~~di~---------~i~~Gq~v~i~ 351 (440) .+.-++...+-.++.|+|-.++. -| .+|..|- |++.++-+|++ .++.||.+.|+ T Consensus 191 r~e~le~l~~g~~v~GvVk~iT~--fGGv~Fvdlgg-------------~dGLLH~~d~sWkr~~~P~e~~~~G~~~~Vk 255 (534) T TIGR00717 191 REELLEKLKEGEVVKGVVKNITD--FGGVAFVDLGG-------------VDGLLHITDMSWKRVKHPSEYVKVGQEVKVK 255 (534) T ss_pred HHHHHHHCCCCCEEEEEEEEECC--CCCEEEEECCC-------------EEECCCHHHCCCCCCCCCHHHCCCCCEEEEE T ss_conf 98887615558576108888889--87499998164-------------3320350121478878834640079889999 Q ss_pred EEECCCCC------------------------CCEEEEEEEEEECCCCCCCCCCCCCEEEEEEEEECHHHCC----CCCC Q ss_conf 98128875------------------------4439999999713344467776551189999993501125----6785 Q gi|254780386|r 352 FPHYADIR------------------------EKKFKAIIEKIDPIISQQDNNLQTQEYYEVILKITDLAFS----DNNI 403 (440) Q Consensus 352 ~~a~~~~~------------------------~~~~~G~V~~Is~~~~~~~~~~~~~~~y~v~i~~~~~~~~----~~~~ 403 (440) +-+|+..+ ...++|+|+.|.+.-..-+-..+- +-.+-++..+.. +++. T Consensus 256 v~~~D~~~~Ri~LsLK~~~~dPW~~i~~~~~~G~~~~G~Vt~~~dyGvFvEi~eGi----EGlvH~SE~sW~kk~~~P~~ 331 (534) T TIGR00717 256 VIKFDKEKKRISLSLKQLKEDPWEEIEKKFPVGDKIKGRVTNLTDYGVFVEIEEGI----EGLVHVSEISWVKKNSHPSK 331 (534) T ss_pred EEEECCCCCEEEEEEECCCCCCCHHHHHHCCCCCEEEEEEEEECCCEEEEEECCCC----EEEECCEEEECCCCCCCHHH T ss_conf 99652657607787120588830899983889978899999883770799923894----23423144405788558158 Q ss_pred EECCCCEEEEEE Q ss_conf 152377569999 Q gi|254780386|r 404 ELRNGFPAEVLF 415 (440) Q Consensus 404 ~L~~Gm~~~~~I 415 (440) -+.-|-.+++.| T Consensus 332 ~v~~Gd~Ve~~v 343 (534) T TIGR00717 332 VVKIGDEVEVMV 343 (534) T ss_pred CEECCCEEEEEE T ss_conf 054387999999 No 115 >pfam01551 Peptidase_M23 Peptidase family M23. Members of this family are zinc metallopeptidases with a range of specificities. The peptidase family M23 is included in this family, these are Gly-Gly endopeptidases. Peptidase family M23 are also endopeptidases. This family also includes some bacterial lipoproteins, for which no proteolytic activity has been demonstrated. This family also includes leukocyte cell-derived chemotaxin 2 (LECT2) proteins. LECT2 is a liver-specific protein which is thought to be linked to hepatocyte growth although the exact function of this protein is unknown. Probab=90.02 E-value=0.8 Score=23.80 Aligned_cols=51 Identities=16% Similarity=0.208 Sum_probs=25.2 Q ss_pred CCCCCCCCEEEEECCEEECCCCCCCCCCCCCCCCCCCCC-EEEEEEECHHHHHHCCCCCCEE Q ss_conf 234555411687312120122212444554210335565-0477532622311015898317 Q gi|254780386|r 289 QKTILSPIAGTIVYNQSFSSSNYAQQSQPLMKIVPHSKL-TYIRAKVTPKQIQHVKKGYTAT 349 (440) Q Consensus 289 ~~~I~AP~~G~V~~l~~~~~G~~v~~G~~l~~Ivp~~~~-l~v~~~v~~~di~~i~~Gq~v~ 349 (440) .+.|+||.+|+|...... .+ .|.. +.|- ..+. ..+.+.+.. ..|++|+.|+ T Consensus 13 G~~V~A~~~G~V~~~~~~-~~----~G~~-v~i~-h~~g~~~~Y~hl~~---~~V~~G~~V~ 64 (96) T pfam01551 13 GTPVYAAADGVVVFAGYL-GG----YGNL-VIID-HGNGYETLYAHLSK---ILVKVGQRVK 64 (96) T ss_pred CCEEEEEECEEEEEEEEC-CC----CCEE-EEEE-ECCCCEEEECCCEE---EEECCCCCHH T ss_conf 998996236799998983-99----8859-9999-38987699836446---7795898901 No 116 >PRK10871 nlpD lipoprotein NlpD; Provisional Probab=89.90 E-value=0.32 Score=26.44 Aligned_cols=52 Identities=15% Similarity=0.290 Sum_probs=23.5 Q ss_pred CCCCCCCEEEEECCEEECCCCCCCCCCCCCCCCCCCCCEEEEEEECHHHHHHCCCCCCEE Q ss_conf 345554116873121201222124445542103355650477532622311015898317 Q gi|254780386|r 290 KTILSPIAGTIVYNQSFSSSNYAQQSQPLMKIVPHSKLTYIRAKVTPKQIQHVKKGYTAT 349 (440) Q Consensus 290 ~~I~AP~~G~V~~l~~~~~G~~v~~G~~l~~Ivp~~~~l~v~~~v~~~di~~i~~Gq~v~ 349 (440) .-|+|.-+|.|...- ...-.=|.-| |+ +-+.-++.+|=. .+--.|+.||.|+ T Consensus 285 ~pV~Aaa~G~VvyaG----~~l~gyGnli--iI-kH~~~~lsaYah-n~~~lVkeg~~V~ 336 (374) T PRK10871 285 QAIIATADGRVVYAG----NALRGYGNLI--II-KHNDDYLSAYAH-NDTMLVREQQEVK 336 (374) T ss_pred CEEEECCCEEEEEEC----CCCCCCCEEE--EE-ECCCCCEEEEEC-CCCCCCCCCCEEC T ss_conf 817853783899924----7878776289--99-849986786167-6626678889988 No 117 >KOG0459 consensus Probab=89.87 E-value=1.3 Score=22.39 Aligned_cols=69 Identities=17% Similarity=0.205 Sum_probs=45.3 Q ss_pred EEEECCEEECCCCCCCCCCCCCCCCCCCCCEEEEE-EECHHHHHHCCCCCCEEEEEEECCCCCCCEEEEEEEEEE Q ss_conf 68731212012221244455421033556504775-326223110158983179998128875443999999971 Q gi|254780386|r 298 GTIVYNQSFSSSNYAQQSQPLMKIVPHSKLTYIRA-KVTPKQIQHVKKGYTATVRFPHYADIREKKFKAIIEKID 371 (440) Q Consensus 298 G~V~~l~~~~~G~~v~~G~~l~~Ivp~~~~l~v~~-~v~~~di~~i~~Gq~v~i~~~a~~~~~~~~~~G~V~~Is 371 (440) |+|..-.+ . .+-+..|++|+ +.|+...+.|.. +=...++..+.+|..|+|++-.... ....+|.|-..+ T Consensus 320 GTvv~GKv-E-sGsi~kg~~lv-vMPnk~~veV~~I~~dd~E~~~~~pGenvk~rlkgiee--edi~~GfiL~~~ 389 (501) T KOG0459 320 GTVVGGKV-E-SGSIKKGQQLV-VMPNKTNVEVLGIYSDDVETDRVAPGENVKLRLKGIEE--EDISPGFILCSP 389 (501) T ss_pred CEEEEEEE-C-CCCEECCCEEE-ECCCCCCEEEEEEECCCCEEEECCCCCCEEEEECCCCH--HHCCCCEEEECC T ss_conf 52788786-0-26030598479-72588625789875165201001588515899645335--424673487068 No 118 >TIGR02645 ARCH_P_rylase putative thymidine phosphorylase; InterPro: IPR013466 Proteins in this entry are closely related to characterised examples of thymidine phosphorylase (2.4.2.4 from EC) and pyrimidine nucleoside phosphorylase (2.4.2.2 from EC). Most examples are found in the archaea, but other examples are found in bacteria such as Legionella pneumophila (strain Paris) and Rhodopseudomonas palustris CGA009.; GO: 0009032 thymidine phosphorylase activity. Probab=89.72 E-value=0.21 Score=27.65 Aligned_cols=36 Identities=14% Similarity=0.230 Sum_probs=27.8 Q ss_pred CCCCCCCCEEEEECCEE-----------------------ECCCCCCCCCCCCCCCCCC Q ss_conf 23455541168731212-----------------------0122212444554210335 Q gi|254780386|r 289 QKTILSPIAGTIVYNQS-----------------------FSSSNYAQQSQPLMKIVPH 324 (440) Q Consensus 289 ~~~I~AP~~G~V~~l~~-----------------------~~~G~~v~~G~~l~~Ivp~ 324 (440) +..|.|+.||+|+.+.. ...|+-|..|+||++|+-+ T Consensus 419 ~~DI~A~~DG~Vt~IDN~~i~~IAr~AGAP~DKgAGv~lhvK~G~~Vk~GdPL~TIyA~ 477 (499) T TIGR02645 419 TADIHAETDGYVTEIDNKRITRIARLAGAPNDKGAGVELHVKVGDKVKKGDPLYTIYAE 477 (499) T ss_pred EEEEEECCCCEEHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEECCEEECCCCCEEEECC T ss_conf 57787368861504238899999987188876768438886546772038701588505 No 119 >PRK13729 conjugal transfer pilus assembly protein TraB; Provisional Probab=89.28 E-value=0.6 Score=24.66 Aligned_cols=40 Identities=18% Similarity=0.245 Sum_probs=23.9 Q ss_pred ECHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHH Q ss_conf 9648998889998743433233210012333322211455 Q gi|254780386|r 87 FEDIETTDLIDLKKVSIKNLRCHIDTEKSALDFLNNKYDV 126 (440) Q Consensus 87 ld~~~~~~~l~~~~~~l~~l~~~l~~~~~~l~~~~~~~~~ 126 (440) +++...+..+...+..+..++.++..++.+++.+...... T Consensus 62 Ft~k~~qsAlt~qQ~~~~~lek~l~~l~~~l~~~~~~~~~ 101 (474) T PRK13729 62 FDDKVRQHATTEMQVTAAQMQKQYEEIRRELDVLNKQRGD 101 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 4367788799999999999999999999999999876562 No 120 >PRK05305 phosphatidylserine decarboxylase; Provisional Probab=89.16 E-value=0.43 Score=25.64 Aligned_cols=42 Identities=7% Similarity=-0.001 Sum_probs=32.5 Q ss_pred EECCCCCCCCCCCCCCCCCCCCCEEEEEEECHHHHHHCCCCCCEE Q ss_conf 201222124445542103355650477532622311015898317 Q gi|254780386|r 305 SFSSSNYAQQSQPLMKIVPHSKLTYIRAKVTPKQIQHVKKGYTAT 349 (440) Q Consensus 305 ~~~~G~~v~~G~~l~~Ivp~~~~l~v~~~v~~~di~~i~~Gq~v~ 349 (440) +..+|+.+..|+.+.-|- -.+ .|+.++|..---.|++||+|. T Consensus 161 ~~~~g~~l~~G~r~G~I~-fGS--rvdl~lP~~~~i~V~~G~~V~ 202 (214) T PRK05305 161 YVEEGDELERGERFGLIR-FGS--RVDVYLPLGTEPLVSVGQKVI 202 (214) T ss_pred CCCCCCEEECCCEEEEEE-CCC--CEEEEECCCCEEEECCCCEEE T ss_conf 267897885377773796-087--279993799838777699998 No 121 >KOG0369 consensus Probab=89.14 E-value=0.48 Score=25.28 Aligned_cols=10 Identities=40% Similarity=0.488 Sum_probs=4.2 Q ss_pred HHCCCCCCEE Q ss_conf 1015898317 Q gi|254780386|r 340 QHVKKGYTAT 349 (440) Q Consensus 340 ~~i~~Gq~v~ 349 (440) .+|+.||+.- T Consensus 1126 ~kV~Kgqpl~ 1135 (1176) T KOG0369 1126 AKVKKGQPLA 1135 (1176) T ss_pred CEECCCCCEE T ss_conf 5103798357 No 122 >TIGR02645 ARCH_P_rylase putative thymidine phosphorylase; InterPro: IPR013466 Proteins in this entry are closely related to characterised examples of thymidine phosphorylase (2.4.2.4 from EC) and pyrimidine nucleoside phosphorylase (2.4.2.2 from EC). Most examples are found in the archaea, but other examples are found in bacteria such as Legionella pneumophila (strain Paris) and Rhodopseudomonas palustris CGA009.; GO: 0009032 thymidine phosphorylase activity. Probab=88.43 E-value=0.15 Score=28.66 Aligned_cols=20 Identities=30% Similarity=0.615 Sum_probs=12.3 Q ss_pred EEECCCCCEEECCCEEEEEC Q ss_conf 99857898933697899996 Q gi|254780386|r 69 KFQVENGSQILQGTPLLTFE 88 (440) Q Consensus 69 ~i~VkeGd~VkkGq~L~~ld 88 (440) +++||-||+||+||+|+++- T Consensus 456 ~lhvK~G~~Vk~GdPL~TIy 475 (499) T TIGR02645 456 ELHVKVGDKVKKGDPLYTIY 475 (499) T ss_pred EEEEEECCEEECCCCCEEEE T ss_conf 88865467720387015885 No 123 >KOG0559 consensus Probab=88.36 E-value=0.27 Score=27.00 Aligned_cols=53 Identities=11% Similarity=0.219 Sum_probs=34.2 Q ss_pred HHHHHHHHEEEEEEEEEEEEEEEECCCEEEEECCCCCEEEEEECCCCCEEECCCEEEEECH Q ss_conf 9999876406712589997999953873898468886899998578989336978999964 Q gi|254780386|r 29 IGLITWSILLPIEISVSSSGEILNEDNVVEIKSPFSGIIKKFQVENGSQILQGTPLLTFED 89 (440) Q Consensus 29 ~~~~~w~~~~~~~~~v~~~G~i~~~~~~~~I~s~~~G~V~~i~VkeGd~VkkGq~L~~ld~ 89 (440) +.++.|.++.+-+.+ .+. .| ..-.|...|.+..++-+.||.|.+++.+++++. T Consensus 59 ~~~~~~~~~~~ts~v-tv~---vP----~faESiteG~l~~~lK~~Gd~v~~DE~va~IET 111 (457) T KOG0559 59 TRLVLWARVRSTSVV-TVE---VP----PFAESITEGDLAQWLKKVGDRVNEDEAVAEIET 111 (457) T ss_pred CHHHHHEEEECCCEE-EEE---CC----CCCCCCCCCHHHHHHHCCCCCCCCCHHHEEEEC T ss_conf 102200015325504-774---68----851213424199996474642130013115421 No 124 >PRK05054 exoribonuclease II; Provisional Probab=88.33 E-value=1.3 Score=22.31 Aligned_cols=16 Identities=13% Similarity=0.079 Sum_probs=9.1 Q ss_pred CEEEEECCEEECCCCCCC Q ss_conf 116873121201222124 Q gi|254780386|r 296 IAGTIVYNQSFSSSNYAQ 313 (440) Q Consensus 296 ~~G~V~~l~~~~~G~~v~ 313 (440) ++|+|..+. .-|-+|. T Consensus 565 f~g~Is~vt--~fG~fV~ 580 (644) T PRK05054 565 FAAEIIDIS--RGGMRVR 580 (644) T ss_pred EEEEEEEEE--ECCEEEE T ss_conf 789997598--1778999 No 125 >KOG0250 consensus Probab=88.29 E-value=0.012 Score=35.93 Aligned_cols=10 Identities=30% Similarity=0.358 Sum_probs=4.1 Q ss_pred EEEEEEEEEE Q ss_conf 0671258999 Q gi|254780386|r 37 LLPIEISVSS 46 (440) Q Consensus 37 ~~~~~~~v~~ 46 (440) .|.|.++.+- T Consensus 109 ~A~IsItL~N 118 (1074) T KOG0250 109 SAKISITLSN 118 (1074) T ss_pred CEEEEEEEEC T ss_conf 1489999825 No 126 >COG3883 Uncharacterized protein conserved in bacteria [Function unknown] Probab=88.13 E-value=0.21 Score=27.71 Aligned_cols=15 Identities=13% Similarity=0.386 Sum_probs=7.3 Q ss_pred HHHHHHHHHHHHHHH Q ss_conf 999999999999999 Q gi|254780386|r 18 KRMTLLSIIGIIGLI 32 (440) Q Consensus 18 ~~~~~~~~~~~~~~~ 32 (440) +.++|.++++++++. T Consensus 3 kk~~~a~~~s~v~~s 17 (265) T COG3883 3 KKILLAVLLSLVIIS 17 (265) T ss_pred HHHHHHHHHHHHHHH T ss_conf 489999999999999 No 127 >PRK11649 hypothetical protein; Provisional Probab=88.05 E-value=0.96 Score=23.29 Aligned_cols=51 Identities=20% Similarity=0.237 Sum_probs=23.6 Q ss_pred CCCCCCCEEEEECCEEECCCCCCCCCCCCCCCCCCCCCEEEEEEECHHHHHHCCCCCCEE Q ss_conf 345554116873121201222124445542103355650477532622311015898317 Q gi|254780386|r 290 KTILSPIAGTIVYNQSFSSSNYAQQSQPLMKIVPHSKLTYIRAKVTPKQIQHVKKGYTAT 349 (440) Q Consensus 290 ~~I~AP~~G~V~~l~~~~~G~~v~~G~~l~~Ivp~~~~l~v~~~v~~~di~~i~~Gq~v~ 349 (440) +.|.|+-||+|..... . |.+ |--|+ | ....-+...|-.-..+. |++||.|+ T Consensus 303 TPV~A~~dG~V~~ag~-~-~~y---G~~V~-I--~H~~~~~T~Y~HLs~~~-V~~Gq~V~ 353 (418) T PRK11649 303 TPVLSVGDGEVVVAKR-S-GAA---GNYVA-I--RHGRQYTTRYMHLRKLL-VKPGQKVK 353 (418) T ss_pred CEEEEECCEEEEEEEE-C-CCC---CCEEE-E--ECCCCEEEEEECCCCCC-CCCCCEEC T ss_conf 9898524879999788-1-899---97799-9--81993099961768477-79949999 No 128 >PRK07899 rpsA 30S ribosomal protein S1; Reviewed Probab=87.43 E-value=1.9 Score=21.33 Aligned_cols=50 Identities=14% Similarity=0.250 Sum_probs=25.3 Q ss_pred HHEEEEEEEEEEEEEEEECCC---EEEEECCCCCEEEE--EE----CCCCCEEECCCEE Q ss_conf 640671258999799995387---38984688868999--98----5789893369789 Q gi|254780386|r 35 SILLPIEISVSSSGEILNEDN---VVEIKSPFSGIIKK--FQ----VENGSQILQGTPL 84 (440) Q Consensus 35 ~~~~~~~~~v~~~G~i~~~~~---~~~I~s~~~G~V~~--i~----VkeGd~VkkGq~L 84 (440) .++..+...-.+.|+|+.-.. ...|.....|+|.. +. +...+.|+.||.+ T Consensus 28 ~s~k~~~eGdIV~GtVv~I~~d~VlVDIG~KsEGiIP~~E~s~~~d~d~~e~vkvGDeI 86 (484) T PRK07899 28 KTIKYFNDGDIVEGTVVKVDRDEVLLDIGYKTEGVIPSRELSIKHDVDPNEVVSVGDEV 86 (484) T ss_pred HHHHCCCCCCEEEEEEEEEECCEEEEEECCCEEEEEEHHHHCCCCCCCCCCCCCCCCEE T ss_conf 78735889988999999997898999826870478799994667577845457899999 No 129 >COG3064 TolA Membrane protein involved in colicin uptake [Cell envelope biogenesis, outer membrane] Probab=86.56 E-value=2.1 Score=21.02 Aligned_cols=49 Identities=16% Similarity=0.375 Sum_probs=23.6 Q ss_pred HHHHHHHHHH--HHHHHHHHHHEEEEEEEEEEEEEEEECCCEEEEECCCCCEEEEEECCCCCEEE Q ss_conf 8999999999--99999987640671258999799995387389846888689999857898933 Q gi|254780386|r 17 LKRMTLLSII--GIIGLITWSILLPIEISVSSSGEILNEDNVVEIKSPFSGIIKKFQVENGSQIL 79 (440) Q Consensus 17 ~~~~~~~~~~--~~~~~~~w~~~~~~~~~v~~~G~i~~~~~~~~I~s~~~G~V~~i~VkeGd~Vk 79 (440) .+.+||.+++ ++|++++|++|... +...| .|. -|.+|.-|.|.+|-.|. T Consensus 12 ~~AiiiS~vLH~iLfalLIwgS~~~~---~e~~~----gG~-------gg~vi~AVmVDpgav~q 62 (387) T COG3064 12 KRAIIISAVLHIILFALLIWGSLDET---IEASG----GGG-------GGSVIDAVMVDPGAVVQ 62 (387) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHC---CCCCC----CCC-------CCCEEEEEEECCHHHHH T ss_conf 23776999999999999998300220---35568----888-------76400046767278999 No 130 >COG4372 Uncharacterized protein conserved in bacteria with the myosin-like domain [Function unknown] Probab=86.29 E-value=0.026 Score=33.71 Aligned_cols=24 Identities=21% Similarity=0.077 Sum_probs=14.0 Q ss_pred ECCCCEEEEEEEEC-EEEHHHHHHH Q ss_conf 52377569999922-0547999999 Q gi|254780386|r 405 LRNGFPAEVLFTAK-NTTLAQEIIH 428 (440) Q Consensus 405 L~~Gm~~~~~I~~~-~~~~~~~~~~ 428 (440) =+||..+-+-+.-| -.+|+.|+.- T Consensus 434 rr~gI~~p~q~~dg~~~t~~~f~~~ 458 (499) T COG4372 434 RRPGIPAPAQNSDGAAPTWSTFGWC 458 (499) T ss_pred CCCCCCCCCCCCCCCCCCHHHHHHH T ss_conf 1788986422455666303018888 No 131 >COG4372 Uncharacterized protein conserved in bacteria with the myosin-like domain [Function unknown] Probab=85.69 E-value=0.013 Score=35.67 Aligned_cols=36 Identities=6% Similarity=0.169 Sum_probs=12.3 Q ss_pred HHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 343323321001233332221145554223568888 Q gi|254780386|r 102 SIKNLRCHIDTEKSALDFLNNKYDVLEKAFESREEQ 137 (440) Q Consensus 102 ~l~~l~~~l~~~~~~l~~~~~~~~~~~~~~~~~~~~ 137 (440) +....+.+.+..+.+++....+....+........+ T Consensus 96 ~k~~~e~er~~~~~El~~~r~e~~~v~~~~~~a~~n 131 (499) T COG4372 96 EKRAAETEREAARSELQKARQEREAVRQELAAARQN 131 (499) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 899998788888889998877899999999999988 No 132 >KOG0250 consensus Probab=85.58 E-value=0.11 Score=29.47 Aligned_cols=10 Identities=10% Similarity=0.225 Sum_probs=3.7 Q ss_pred CCCCCEEEEE Q ss_conf 5554116873 Q gi|254780386|r 292 ILSPIAGTIV 301 (440) Q Consensus 292 I~AP~~G~V~ 301 (440) -+-|..-.|+ T Consensus 495 P~GPlG~~Vt 504 (1074) T KOG0250 495 PKGPLGKYVT 504 (1074) T ss_pred CCCCCCCEEE T ss_conf 9887654367 No 133 >KOG0977 consensus Probab=85.49 E-value=0.17 Score=28.22 Aligned_cols=25 Identities=12% Similarity=0.259 Sum_probs=9.1 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 1223322222222222100012233 Q gi|254780386|r 230 LRENIHEQKNMLRHINELYDEANLE 254 (440) Q Consensus 230 l~~~~~~~~~~~~~l~~~~~~~~~~ 254 (440) ++..+.+++.....+...++.+... T Consensus 308 Lr~klselE~~n~~L~~~I~dL~~q 332 (546) T KOG0977 308 LRAKLSELESRNSALEKRIEDLEYQ 332 (546) T ss_pred HHHHHCCCCCCCHHHHHHHHHHHHH T ss_conf 1100113012573589898898865 No 134 >KOG0368 consensus Probab=85.27 E-value=1.4 Score=22.23 Aligned_cols=28 Identities=21% Similarity=0.334 Sum_probs=13.6 Q ss_pred EECCCCCEEEEEECCCCCEEECCCEEEEE Q ss_conf 84688868999985789893369789999 Q gi|254780386|r 59 IKSPFSGIIKKFQVENGSQILQGTPLLTF 87 (440) Q Consensus 59 I~s~~~G~V~~i~VkeGd~VkkGq~L~~l 87 (440) +.++++|+|. ..+++|+...+|++|+.| T Consensus 725 lva~~~G~i~-~i~~~G~~i~aG~vlakL 752 (2196) T KOG0368 725 LVAKEPGRIQ-LIKQEGDAIEAGSVLAKL 752 (2196) T ss_pred EECCCCCEEE-EECCCCCCCCCCCEEEEE T ss_conf 6305786689-835899735776637887 No 135 >COG1196 Smc Chromosome segregation ATPases [Cell division and chromosome partitioning] Probab=85.16 E-value=0.016 Score=35.12 Aligned_cols=202 Identities=15% Similarity=0.179 Sum_probs=0.0 Q ss_pred ECHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHCCCCC Q ss_conf 9648998889998743433233210012333322211455542235688887765546788654333655--42001122 Q gi|254780386|r 87 FEDIETTDLIDLKKVSIKNLRCHIDTEKSALDFLNNKYDVLEKAFESREEQLLDFLGKYHSTCSKVFNYG--ISSVYRSL 164 (440) Q Consensus 87 ld~~~~~~~l~~~~~~l~~l~~~l~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~ 164 (440) ++..-....+.....++..+...+......++.+..........+......+.+........+....... +..+.... T Consensus 225 ~~~~~~~~~~~~~~~~l~~~~~~l~~~~~~~~~~~~~~~~~~~~i~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~le~~~ 304 (1163) T COG1196 225 LELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEI 304 (1163) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99899999999999999999999998899999999999999999999998999999999998799999999999999999 Q ss_pred CCCCCCCHHHHHHCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCC Q ss_conf 21122100223210112232222221023333222101222222111000011100246665310122332222222222 Q gi|254780386|r 165 KMKLTRMRSIYLNRLDLENKSSLLRALLSSHQKDLTMMKNLFEKNAVSQNDLNQQERIVYKSTLELRENIHEQKNMLRHI 244 (440) Q Consensus 165 ~~~~~~~~~~~~~~~~l~~~~~~l~~~~~~~~~el~~~~~l~~~~~~~~~~l~~~~~~~~~~~~el~~~~~~~~~~~~~l 244 (440) .....+.................+...+...+.++.....+..+.......+.............+.............- T Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~ 384 (1163) T COG1196 305 SLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREE 384 (1163) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999998768899888888899999999998876558999999999999988789988766776389999999999 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 21000122333211233222122222222122233221222221 Q gi|254780386|r 245 NELYDEANLEFANYLKEISRDLEQNQRTFADEMSKLTILEKTRE 288 (440) Q Consensus 245 ~~~~~~~~~~~~~~~~e~~~~l~e~~~~l~~~~~~l~~l~~~l~ 288 (440) .................+..++......+..+..++..+..++. T Consensus 385 ~~~~~~~~~~~~~~l~~l~~~i~~l~~~~~~~~~~~~~~~~~~~ 428 (1163) T COG1196 385 LAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELK 428 (1163) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999878999999999999999999989999998888 No 136 >TIGR02828 TIGR02828 putative membrane fusion protein; InterPro: IPR014193 Members of this entry show similarity to those in IPR005695 from INTERPRO, the membrane fusion protein (MFP) cluster 2 family, which is linked to RND transport systems.. Probab=84.85 E-value=0.99 Score=23.19 Aligned_cols=59 Identities=19% Similarity=0.236 Sum_probs=41.7 Q ss_pred CCCCCCCCCCCCCCCCCCCEEEEEEECHH---HH-HHCCCCCCEEEEEEECCCCCCCEEEEEEEEEEC Q ss_conf 22124445542103355650477532622---31-101589831799981288754439999999713 Q gi|254780386|r 309 SNYAQQSQPLMKIVPHSKLTYIRAKVTPK---QI-QHVKKGYTATVRFPHYADIREKKFKAIIEKIDP 372 (440) Q Consensus 309 G~~v~~G~~l~~Ivp~~~~l~v~~~v~~~---di-~~i~~Gq~v~i~~~a~~~~~~~~~~G~V~~Is~ 372 (440) .+-|..|+||+.|+ |+=++.+-+.|+.. +. ..+.+|+.+++.+..+|+ .++|+|.++.. T Consensus 41 ~G~v~eG~Pl~Kiv-Dnl~~e~Llvi~~dsl~~~qqal~eg~e~~l~~s~~~~----el~arv~r~~~ 103 (197) T TIGR02828 41 DGEVKEGQPLVKIV-DNLDLEILLVISKDSLAESQQALLEGREVKLQVSDKDL----ELKARVVRLKK 103 (197) T ss_pred ECCCCCCCCCEEEE-ECCCHHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCCC----CCEEEEEEEEE T ss_conf 25521499427998-34895573200034442210002578624897628897----52033899740 No 137 >PRK09578 periplasmic multidrug efflux lipoprotein precursor; Reviewed Probab=84.05 E-value=0.42 Score=25.67 Aligned_cols=31 Identities=13% Similarity=0.234 Sum_probs=21.6 Q ss_pred CCCCCEEEEEEECHHHHHHCCCCCCEEEEEEE Q ss_conf 35565047753262231101589831799981 Q gi|254780386|r 323 PHSKLTYIRAKVTPKQIQHVKKGYTATVRFPH 354 (440) Q Consensus 323 p~~~~l~v~~~v~~~di~~i~~Gq~v~i~~~a 354 (440) |....+.+.+.++..+ ..+++||.|++.+.. T Consensus 268 ~~trt~~v~~~~~np~-~~L~pGm~~~v~i~~ 298 (385) T PRK09578 268 PSTDTVAMRALFPNPE-RELLPGAYVRIALDR 298 (385) T ss_pred CCCCEEEEEEECCCCC-CCCCCCCEEEEEEEC T ss_conf 7765299999657875-545899689999954 No 138 >PRK09859 multidrug efflux system protein MdtE; Provisional Probab=84.03 E-value=0.37 Score=26.02 Aligned_cols=57 Identities=14% Similarity=0.092 Sum_probs=32.7 Q ss_pred CCCEEEEECCCCCEEEEEECCCCCEEECCCE-EEEECHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 3873898468886899998578989336978-9999648998889998743433233210012 Q gi|254780386|r 53 EDNVVEIKSPFSGIIKKFQVENGSQILQGTP-LLTFEDIETTDLIDLKKVSIKNLRCHIDTEK 114 (440) Q Consensus 53 ~~~~~~I~s~~~G~V~~i~VkeGd~VkkGq~-L~~ld~~~~~~~l~~~~~~l~~l~~~l~~~~ 114 (440) .|.+..+.-..|.. |+.||..-.=|. -++..-.++++.+...+..+...+..++..+ T Consensus 69 sG~v~~~~v~~Gd~-----VkkGq~La~LD~~~~~~~l~~A~A~l~~A~a~~~~A~~~~~R~~ 126 (385) T PRK09859 69 GGIIIKRNFIEGDK-----VNQGDSLYQIDPAPLQAELNSAKGSLAKALSTASNARITFNRQA 126 (385) T ss_pred CEEEEEEECCCCCE-----EECCCEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 62999998389998-----84898899828479999999999999999999999998999999 No 139 >COG1726 NqrA Na+-transporting NADH:ubiquinone oxidoreductase, subunit NqrA [Energy production and conversion] Probab=83.31 E-value=0.65 Score=24.43 Aligned_cols=24 Identities=21% Similarity=0.429 Sum_probs=18.9 Q ss_pred CEEEEEECCCCCEEECCCEEEEEC Q ss_conf 689999857898933697899996 Q gi|254780386|r 65 GIIKKFQVENGSQILQGTPLLTFE 88 (440) Q Consensus 65 G~V~~i~VkeGd~VkkGq~L~~ld 88 (440) |.=-.+.|++||.|++||+|++.. T Consensus 38 gmrp~mkV~~gD~VkkGq~LfEdK 61 (447) T COG1726 38 GMRPSMKVREGDAVKKGQVLFEDK 61 (447) T ss_pred CCCCCCEECCCCEEECCCEEEECC T ss_conf 888741533477220265311215 No 140 >TIGR02386 rpoC_TIGR DNA-directed RNA polymerase, beta' subunit; InterPro: IPR012754 DNA-directed RNA polymerases 2.7.7.6 from EC (also known as DNA-dependent RNA polymerases) are responsible for the polymerisation of ribonucleotides into a sequence complementary to the template DNA. In eukaryotes, there are three different forms of DNA-directed RNA polymerases transcribing different sets of genes. Most RNA polymerases are multimeric enzymes and are composed of a variable number of subunits. The core RNA polymerase complex consists of five subunits (two alpha, one beta, one beta-prime and one omega) and is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme . The core RNA polymerase complex forms a "crab claw"-like structure with an internal channel running along the full length . The key functional sites of the enzyme, as defined by mutational and cross-linking analysis, are located on the inner wall of this channel. RNA synthesis follows after the attachment of RNA polymerase to a specific site, the promoter, on the template DNA strand. The RNA synthesis process continues until a termination sequence is reached. The RNA product, which is synthesised in the 5' to 3'direction, is known as the primary transcript. Eukaryotic nuclei contain three distinct types of RNA polymerases that differ in the RNA they synthesise: RNA polymerase I: located in the nucleoli, synthesises precursors of most ribosomal RNAs. RNA polymerase II: occurs in the nucleoplasm, synthesises mRNA precursors. RNA polymerase III: also occurs in the nucleoplasm, synthesises the precursors of 5S ribosomal RNA, the tRNAs, and a variety of other small nuclear and cytosolic RNAs. Eukaryotic cells are also known to contain separate mitochondrial and chloroplast RNA polymerases. Eukaryotic RNA polymerases, whose molecular masses vary in size from 500 to 700 kD, contain two non-identical large (>100 kDa) subunits and an array of up to 12 different small (less than 50 kDa) subunits. This entry represents the beta-prime subunit, RpoC, found in most bacteria. It excludes some, mainly cyanobacterial, species where RpoC is replaced by two homologous proteins that include an additional domain. One arm of the "claw" is predominantly formed by this subunit, the other being predominantly formed by the beta subunit. The active site of the enzyme is defined by three invariant aspartate residues within the beta-prime subunit .; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006350 transcription. Probab=83.19 E-value=1.8 Score=21.48 Aligned_cols=47 Identities=32% Similarity=0.437 Sum_probs=30.2 Q ss_pred EEEEEEEEEEECCC------EEEEECCCCCEEEEEECCCCCEEECCCEEEEECHH Q ss_conf 58999799995387------38984688868999985789893369789999648 Q gi|254780386|r 42 ISVSSSGEILNEDN------VVEIKSPFSGIIKKFQVENGSQILQGTPLLTFEDI 90 (440) Q Consensus 42 ~~v~~~G~i~~~~~------~~~I~s~~~G~V~~i~VkeGd~VkkGq~L~~ld~~ 90 (440) +++.-.|++..... ...+-+...|. .|.|++|+.|+.|++|+++|.- T Consensus 1159 ~v~~~~G~~~~~~~~f~i~~ee~~~~V~~G~--~i~V~~g~~V~~G~~~aefDP~ 1211 (1552) T TIGR02386 1159 IVVSEEGRIKIEDKKFLIVEEEEVVPVKLGT--KILVEDGDAVKEGQRIAEFDPY 1211 (1552) T ss_pred EEECCCCEEEEECCCEEEECCCEEEECCCCC--EEEEECCCCCCCCCEEEEECCC T ss_conf 7707888788505857764032156426566--8886158620588757884477 No 141 >KOG1853 consensus Probab=82.32 E-value=0.86 Score=23.60 Aligned_cols=23 Identities=26% Similarity=0.395 Sum_probs=9.1 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCE Q ss_conf 22233221222221234555411 Q gi|254780386|r 275 DEMSKLTILEKTREQKTILSPIA 297 (440) Q Consensus 275 ~~~~~l~~l~~~l~~~~I~AP~~ 297 (440) +++..|..-+.+-++..|.+|.+ T Consensus 175 dlrqelavr~kq~E~pR~~~Pss 197 (333) T KOG1853 175 DLRQELAVRTKQTERPRIVEPSS 197 (333) T ss_pred HHHHHHHHHHHHCCCCCCCCCCC T ss_conf 88898888876024777678640 No 142 >PRK04950 putative solute/DNA competence effector; Provisional Probab=82.04 E-value=3.3 Score=19.74 Aligned_cols=49 Identities=18% Similarity=0.270 Sum_probs=35.9 Q ss_pred ECHHHHHHCCCCCCEEEEEEECCCCCCCEEEEEEEEEECCCCCCCCCCCCCEEEEEEEEECHHHCCCCCCEECCCCEEEE Q ss_conf 26223110158983179998128875443999999971334446777655118999999350112567851523775699 Q gi|254780386|r 334 VTPKQIQHVKKGYTATVRFPHYADIREKKFKAIIEKIDPIISQQDNNLQTQEYYEVILKITDLAFSDNNIELRNGFPAEV 413 (440) Q Consensus 334 v~~~di~~i~~Gq~v~i~~~a~~~~~~~~~~G~V~~Is~~~~~~~~~~~~~~~y~v~i~~~~~~~~~~~~~L~~Gm~~~~ 413 (440) ++..++..+++||.|+|++-. ..+.|+|..|+-+ + |.| +|-.||.+.| T Consensus 158 ~~~~~~~~L~vgq~VkVk~Gk------~~~~Ati~ei~Kd-----~---------V~V------------qL~sGl~~kV 205 (212) T PRK04950 158 TPVVDISELTVGQAVKVKAGK------SPMDATVLEINKD-----D---------VRV------------QLDSGLSMKV 205 (212) T ss_pred CCCCCHHHHCCCCEEEEEECC------CCCCEEEEEEECC-----C---------EEE------------EECCCCEEEE T ss_conf 786769993259889996068------8777589998258-----7---------799------------9279968998 Q ss_pred E Q ss_conf 9 Q gi|254780386|r 414 L 414 (440) Q Consensus 414 ~ 414 (440) . T Consensus 206 ~ 206 (212) T PRK04950 206 R 206 (212) T ss_pred E T ss_conf 5 No 143 >pfam00261 Tropomyosin Tropomyosin. Probab=82.00 E-value=0.026 Score=33.74 Aligned_cols=23 Identities=4% Similarity=0.133 Sum_probs=7.7 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 89998743433233210012333 Q gi|254780386|r 95 LIDLKKVSIKNLRCHIDTEKSAL 117 (440) Q Consensus 95 ~l~~~~~~l~~l~~~l~~~~~~l 117 (440) .++..+..+..+...+......+ T Consensus 9 eld~~~~~~~~~~~~l~e~~~~~ 31 (237) T pfam00261 9 ELDKAEERLEEAQEKLEEAEKRA 31 (237) T ss_pred HHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999999999999 No 144 >pfam01333 Apocytochr_F_C Apocytochrome F, C-terminal. This is a sub-family of cytochrome C. See pfam00034. Probab=81.82 E-value=2 Score=21.15 Aligned_cols=22 Identities=18% Similarity=0.107 Sum_probs=13.5 Q ss_pred CCCCCCCCEEEEECCEEECCCC Q ss_conf 2345554116873121201222 Q gi|254780386|r 289 QKTILSPIAGTIVYNQSFSSSN 310 (440) Q Consensus 289 ~~~I~AP~~G~V~~l~~~~~G~ 310 (440) ++...||.+|+|..+.....|+ T Consensus 2 Nnv~~as~~G~I~~I~~~ekg~ 23 (118) T pfam01333 2 NNVYNASAAGTITKITRNEKGG 23 (118) T ss_pred CCEEECCCCEEEEEEEECCCCC T ss_conf 8547166770899977567885 No 145 >TIGR02644 Y_phosphoryl pyrimidine-nucleoside phosphorylase; InterPro: IPR000053 Two highly similar activities are represented in this group: thymidine phosphorylase (TP, gene deoA, 2.4.2.4 from EC) and pyrimidine-nucleoside phosphorylase (PyNP, gene pdp, 2.4.2.2 from EC). Both are dimeric enzymes that function in the salvage pathway to catalyse the reversible phosphorolysis of pyrimidine nucleosides to the free base and sugar moieties. In the case of thymidine phosphorylase, thymidine (and to a lesser extent, 2'-deoxyuridine) is lysed to produce thymine (or uracil) and 2'-deoxyribose-1-phosphate. Pyrimidine-nucleoside phosphorylase performs the analogous reaction on thymidine (to produce the same products) and uridine (to produce uracil and ribose-1-phosphate). PyNP is typically the only pyrimidine nucleoside phosphorylase encoded by Gram positive bacteria, while eukaryotes and proteobacteria encode two: TP, and the unrelated uridine phosphorylase. In humans, TP was originally characterised as platelet-derived endothelial cell growth factor and gliostatin . Structurally, the enzymes are homodimers, each composed of a rigid all alpha-helix lobe and a mixed alpha-helix/beta-sheet lobe, which are connected by a flexible hinge , . Prior to substrate binding, the lobes are separated by a large cleft. A functional active site and subsequent catalysis occurs upon closing of the cleft. The active site, composed of a phosphate binding site and a (deoxy)ribonucleotide binding site within the cleft region, is highly conserved between the two enzymes of this group. Active site residues (Escherichia coli DeoA numbering) include the phosphate binding Lys84 and Ser86 (close to a glycine-rich loop), Ser113, and Thr123, and the pyrimidine nucleoside-binding Arg171, Ser186, and Lys190. Sequence comparison between the active site residues for both enzymes reveals only one difference , which has been proposed to partially mediate substrate specificity. In TP, position 111 is a methionine, while the analogous position in PyNP is lysine. It should be noted that the uncharacterised archaeal members of this family differ in a number of respects from either of the characterised activities. The residue at position 108 is lysine, indicating the activity might be PyNP-like (though the determinants of substrate specificity have not been fully elucidated). Position 171 is glutamate (negative charge side chain) rather than arginine (positive charge side chain). In addition, a large loop that may "lock in" the substrates within the active site is much smaller than in the characterised members. It is not clear what effect these and other differences have on activity and specificity.; GO: 0006206 pyrimidine base metabolic process. Probab=81.75 E-value=1.6 Score=21.72 Aligned_cols=36 Identities=8% Similarity=0.162 Sum_probs=24.8 Q ss_pred CCCCCCCCCEEEEECCEE------------------------------ECCCCCCCCCCCCCCCCC Q ss_conf 123455541168731212------------------------------012221244455421033 Q gi|254780386|r 288 EQKTILSPIAGTIVYNQS------------------------------FSSSNYAQQSQPLMKIVP 323 (440) Q Consensus 288 ~~~~I~AP~~G~V~~l~~------------------------------~~~G~~v~~G~~l~~Ivp 323 (440) .-..|+|+.+|+|..++. ...|+.|..|++|++|+. T Consensus 340 ~~~~v~A~~~G~i~~~~a~~~G~a~~~LGAGR~~k~D~ID~~aGi~l~~k~Gd~V~~Gd~~~tlY~ 405 (425) T TIGR02644 340 YKEEVKAEKSGYISEIDAEELGLAAVDLGAGRARKEDKIDHEAGIYLHKKVGDKVKKGDPIATLYS 405 (425) T ss_pred CEEEEECCCCCEEEECCHHHHHHHHHHHCCCCCCCCCCCCCCCCEEEECCCCCEEECCCEEEEEEC T ss_conf 202330156613651078899899986158876989630011355676566875323757999862 No 146 >pfam00261 Tropomyosin Tropomyosin. Probab=81.58 E-value=0.035 Score=32.89 Aligned_cols=47 Identities=17% Similarity=0.323 Sum_probs=17.5 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 88899987434332332100123333222114555422356888877 Q gi|254780386|r 93 TDLIDLKKVSIKNLRCHIDTEKSALDFLNNKYDVLEKAFESREEQLL 139 (440) Q Consensus 93 ~~~l~~~~~~l~~l~~~l~~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 139 (440) ...+...+..+......+......+..+...+..++..+......+. T Consensus 14 ~~~~~~~~~~l~e~~~~~~~~e~e~~~l~~~i~~le~el~~~e~~l~ 60 (237) T pfam00261 14 EERLEEAQEKLEEAEKRAEKAEAEVASLNRRIQLLEEDLERSEERLA 60 (237) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999999999999999999999999999999899999 No 147 >KOG0557 consensus Probab=81.28 E-value=1.4 Score=22.20 Aligned_cols=29 Identities=24% Similarity=0.357 Sum_probs=20.5 Q ss_pred CCCEEEEEECCCCCEEECCCEEEEECHHH Q ss_conf 88689999857898933697899996489 Q gi|254780386|r 63 FSGIIKKFQVENGSQILQGTPLLTFEDIE 91 (440) Q Consensus 63 ~~G~V~~i~VkeGd~VkkGq~L~~ld~~~ 91 (440) ..|-|..|.-||||.+.+||+|++++... T Consensus 51 eeGnIvsW~kKeGdkls~GDvl~EVETDK 79 (470) T KOG0557 51 EEGNIVSWKKKEGDKLSAGDVLLEVETDK 79 (470) T ss_pred CCCCEEEEEECCCCCCCCCCEEEEEECCC T ss_conf 57842667522577568886589996045 No 148 >PRK00087 4-hydroxy-3-methylbut-2-enyl diphosphate reductase/S1 RNA-binding domain protein; Reviewed Probab=81.11 E-value=3.5 Score=19.52 Aligned_cols=114 Identities=11% Similarity=0.097 Sum_probs=56.8 Q ss_pred CCCCCCCCCCCCCEEEEECCEEECCCCCCCCCCCCCCCCCCCCCEEEEEEECHHHHH---------HCCCCCCEEEEEEE Q ss_conf 222212345554116873121201222124445542103355650477532622311---------01589831799981 Q gi|254780386|r 284 EKTREQKTILSPIAGTIVYNQSFSSSNYAQQSQPLMKIVPHSKLTYIRAKVTPKQIQ---------HVKKGYTATVRFPH 354 (440) Q Consensus 284 ~~~l~~~~I~AP~~G~V~~l~~~~~G~~v~~G~~l~~Ivp~~~~l~v~~~v~~~di~---------~i~~Gq~v~i~~~a 354 (440) +.-++....-..+.|+|..+. .-|.||.-|- ++..++.+|++ .+++||.|.+++-. T Consensus 472 ~~~~~~l~~G~~v~G~V~~i~--~~GaFvdlgG-------------vdGLvHiSelSw~rv~~p~~~~~~G~~v~vkIl~ 536 (670) T PRK00087 472 EETWNSLEEGDVVEGEVKRLT--DFGAFVDIGG-------------VDGLLHVSEISWGRVEKPSDVLKVGDEIKVYILE 536 (670) T ss_pred HHHHHHCCCCCEEEEEEEEEE--CCCEEEEECC-------------EEEEEEHHHCCCCCCCCHHHHCCCCCEEEEEEEE T ss_conf 999974889988999999997--3407999779-------------8999984344767778989962789779999999 Q ss_pred CCCCC------------------------CCEEEEEEEEEECCCCCCCCCCCCCEEEEEEEEECHHHCC---CCCCEECC Q ss_conf 28875------------------------4439999999713344467776551189999993501125---67851523 Q gi|254780386|r 355 YADIR------------------------EKKFKAIIEKIDPIISQQDNNLQTQEYYEVILKITDLAFS---DNNIELRN 407 (440) Q Consensus 355 ~~~~~------------------------~~~~~G~V~~Is~~~~~~~~~~~~~~~y~v~i~~~~~~~~---~~~~~L~~ 407 (440) ++..+ ...+.|+|++|.+.-..-+-..+ ..-.|.+++.+.. ++...+.. T Consensus 537 iD~e~~rIsLs~K~l~~dPw~~~~~~~~~G~~v~g~V~~i~~fGaFVel~~g----veGlih~sels~~~~~~~~~~~~~ 612 (670) T PRK00087 537 IDKENKKLSLSLKKLLPDPWENVEEKYPVGSIVLGKVVRIAPFGAFVELEPG----VDGLVHISEISWKRIDKPEDVLSE 612 (670) T ss_pred ECCCCCEEEEEECCCCCCCHHHHHHHCCCCCEEEEEEEEEECCEEEEEECCC----CEEEEEHHHCCCCCCCCHHHCCCC T ss_conf 8388788987634377894788886489969999999999677079995799----789989899484324897774899 Q ss_pred CCEEEEEEE Q ss_conf 775699999 Q gi|254780386|r 408 GFPAEVLFT 416 (440) Q Consensus 408 Gm~~~~~I~ 416 (440) |-.+.|.|. T Consensus 613 Gd~v~~kIl 621 (670) T PRK00087 613 GEEVKAKIL 621 (670) T ss_pred CCEEEEEEE T ss_conf 998999999 No 149 >PRK10361 DNA recombination protein RmuC; Provisional Probab=81.01 E-value=1.1 Score=22.82 Aligned_cols=29 Identities=24% Similarity=0.279 Sum_probs=16.9 Q ss_pred CCCCCCCCCCCCCCCCCCCEEEEEEECHHHH Q ss_conf 2212444554210335565047753262231 Q gi|254780386|r 309 SNYAQQSQPLMKIVPHSKLTYIRAKVTPKQI 339 (440) Q Consensus 309 G~~v~~G~~l~~Ivp~~~~l~v~~~v~~~di 339 (440) |....|. -|+. .|++..++|+++||=... T Consensus 236 g~r~rPD-vIV~-LP~~k~iVIDSKvsL~AY 264 (475) T PRK10361 236 RSRMQPD-VIVR-LPQGKDVVIDAKMTLVAY 264 (475) T ss_pred CCCCCCC-EEEE-CCCCCEEEEECCCCCHHH T ss_conf 8822897-6898-789975888627884169 No 150 >KOG0980 consensus Probab=80.95 E-value=0.051 Score=31.81 Aligned_cols=194 Identities=10% Similarity=0.036 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHCCCCCCCC Q ss_conf 899888999874343323321001233332221145554223568888776554678865--433365542001122211 Q gi|254780386|r 90 IETTDLIDLKKVSIKNLRCHIDTEKSALDFLNNKYDVLEKAFESREEQLLDFLGKYHSTC--SKVFNYGISSVYRSLKMK 167 (440) Q Consensus 90 ~~~~~~l~~~~~~l~~l~~~l~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~ 167 (440) .+++..+...+.++..+..++...+...+....+.+ .+.....++.....+....+ .+.........+...+.. T Consensus 354 ~ear~~~~q~~~ql~~le~~~~e~q~~~qe~~~e~e----qLr~elaql~a~r~q~eka~~~~ee~e~~~l~~e~ry~kl 429 (980) T KOG0980 354 EEARRRIEQYENQLLALEGELQEQQREAQENREEQE----QLRNELAQLLASRTQLEKAQVLVEEAENKALAAENRYEKL 429 (980) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999998887888889982999999799999----9999999999998888998888876888889999999999 Q ss_pred CCCCHHHHHHCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCC Q ss_conf 22100223210112232222221023333222101222222111000011100246665310122332222222222210 Q gi|254780386|r 168 LTRMRSIYLNRLDLENKSSLLRALLSSHQKDLTMMKNLFEKNAVSQNDLNQQERIVYKSTLELRENIHEQKNMLRHINEL 247 (440) Q Consensus 168 ~~~~~~~~~~~~~l~~~~~~l~~~~~~~~~el~~~~~l~~~~~~~~~~l~~~~~~~~~~~~el~~~~~~~~~~~~~l~~~ 247 (440) ......+.....++-.+......+++..+.....+............++............+....++..++....+... T Consensus 430 kek~t~l~~~h~~lL~K~~di~kQle~~~~s~~~~~~~~~~L~d~le~~~~~~~~~~~K~e~~~~~le~l~~El~~l~~e 509 (980) T KOG0980 430 KEKYTELRQEHADLLRKYDDIQKQLESAEQSIDDVEEENTNLNDQLEELQRAAGRAETKTESQAKALESLRQELALLLIE 509 (980) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999998789999999889869999999887899999998788788885587999999999999987799 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 0012233321123322212222222212223322122222 Q gi|254780386|r 248 YDEANLEFANYLKEISRDLEQNQRTFADEMSKLTILEKTR 287 (440) Q Consensus 248 ~~~~~~~~~~~~~e~~~~l~e~~~~l~~~~~~l~~l~~~l 287 (440) ..+++..+.+..+....++.+....+.+.......+..++ T Consensus 510 ~~~lq~~~~~~~qs~~~~~~~l~~~l~~KD~~~~~~~~~~ 549 (980) T KOG0980 510 LEELQRTLSNLAQSHNNQLAQLEDLLKQKDRLAAELVARE 549 (980) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999987655788888899999998870678888988878 No 151 >PRK10676 DNA-binding transcriptional regulator ModE; Provisional Probab=80.79 E-value=3.6 Score=19.45 Aligned_cols=51 Identities=16% Similarity=0.205 Sum_probs=29.2 Q ss_pred CEEEEEEEEEECCCCCCCCC--CCCCEEEEEEEEECHHHCCCCCCEECCCCEEEEEEEE Q ss_conf 43999999971334446777--6551189999993501125678515237756999992 Q gi|254780386|r 361 KKFKAIIEKIDPIISQQDNN--LQTQEYYEVILKITDLAFSDNNIELRNGFPAEVLFTA 417 (440) Q Consensus 361 ~~~~G~V~~Is~~~~~~~~~--~~~~~~y~v~i~~~~~~~~~~~~~L~~Gm~~~~~I~~ 417 (440) ..|.|+|.+|...+...+-. -.++...-..++ . .....|.+|..+.|.|.. T Consensus 201 N~l~GtV~~I~~G~vnaEV~l~l~~G~tl~A~it--~----~~~l~L~~G~~v~A~fKA 253 (262) T PRK10676 201 NQLPGTISHIERGAEQCEVLMALPDGQTLCATVP--N----NEATSLQQGQNVTAYFNA 253 (262) T ss_pred CEEEEEEEEEEECCCCEEEEEEECCCCEEEEEEC--C----HHHCCCCCCCEEEEEECC T ss_conf 3676899999877841599999589999999976--7----776788899989999940 No 152 >TIGR01936 nqrA NADH:ubiquinone oxidoreductase, Na(+)-translocating, A subunit; InterPro: IPR008703 This family consists of several bacterial Na^+-translocating NADH-quinone reductase subunit A (NQRA) proteins. The Na^+-translocating NADH: ubiquinone oxidoreductase (Na^+-NQR) generates an electrochemical Na^+ potential driven by aerobic respiration .; GO: 0016655 oxidoreductase activity acting on NADH or NADPH quinone or similar compound as acceptor, 0006814 sodium ion transport. Probab=80.67 E-value=0.86 Score=23.61 Aligned_cols=33 Identities=24% Similarity=0.422 Sum_probs=27.4 Q ss_pred CEEEEECCCCCEEEEEECCCCCEEECCCEEEEE Q ss_conf 738984688868999985789893369789999 Q gi|254780386|r 55 NVVEIKSPFSGIIKKFQVENGSQILQGTPLLTF 87 (440) Q Consensus 55 ~~~~I~s~~~G~V~~i~VkeGd~VkkGq~L~~l 87 (440) .+..+-.+-.|..-.+.|++||.|++|++|++. T Consensus 28 ~va~~~~~f~g~~p~~~v~~GD~V~~G~~L~~~ 60 (466) T TIGR01936 28 TVAVLGEDFEGLRPKMKVRPGDKVKAGQVLFED 60 (466) T ss_pred EEEEECCCCCCCCCEEEEEECCEEEECCHHHHH T ss_conf 688636788786650677007701002223330 No 153 >TIGR03309 matur_yqeB selenium-dependent molybdenum hydroxylase system protein, YqeB family. Members of this protein family are probable accessory proteins for the biosynthesis of enzymes with labile selenium-containing centers, different from selenocysteine-containing proteins. Probab=80.57 E-value=2.3 Score=20.78 Aligned_cols=55 Identities=11% Similarity=0.184 Sum_probs=39.9 Q ss_pred CCCCCCCCEEEEECCEEECCCCCCCCCCCCCCCCCCCCCEEEEEEECHHHHHHCCCCCCEE Q ss_conf 2345554116873121201222124445542103355650477532622311015898317 Q gi|254780386|r 289 QKTILSPIAGTIVYNQSFSSSNYAQQSQPLMKIVPHSKLTYIRAKVTPKQIQHVKKGYTAT 349 (440) Q Consensus 289 ~~~I~AP~~G~V~~l~~~~~G~~v~~G~~l~~Ivp~~~~l~v~~~v~~~di~~i~~Gq~v~ 349 (440) ...|+||.+|++... ...|+.|..|+.+..+ +.. -|.|.++-.-.+-|+.|-+|. T Consensus 164 ERvi~Ap~~G~~~~~--~~IGd~V~kG~vig~v--~~~--pV~A~i~GilRGlirdG~~V~ 218 (256) T TIGR03309 164 ERVLRAPADGIVTPT--KAIGDSVKKGDVIATV--GDV--PVVAPIDGLLRGLIHEGLTVT 218 (256) T ss_pred CEEEECCCCCCEECH--HHCCCEEECCCEEEEE--CCE--EEEEECCCEEEEEECCCCCCC T ss_conf 418975878538541--4418968679689998--998--898765848972412897867 No 154 >PRK12269 bifunctional cytidylate kinase/ribosomal protein S1; Provisional Probab=80.18 E-value=3.8 Score=19.32 Aligned_cols=114 Identities=14% Similarity=0.180 Sum_probs=54.4 Q ss_pred CCCCCCEEEEECCEEECCCCCCCCCC--CCCCCCCCCCCEEEEEEECHHHHHHCCCCCCEEEEEEECCCCC--------- Q ss_conf 45554116873121201222124445--5421033556504775326223110158983179998128875--------- Q gi|254780386|r 291 TILSPIAGTIVYNQSFSSSNYAQQSQ--PLMKIVPHSKLTYIRAKVTPKQIQHVKKGYTATVRFPHYADIR--------- 359 (440) Q Consensus 291 ~I~AP~~G~V~~l~~~~~G~~v~~G~--~l~~Ivp~~~~l~v~~~v~~~di~~i~~Gq~v~i~~~a~~~~~--------- 359 (440) ..-.-+.|+|..+. .-|.||.-|- -|+.|- +.+ |-...=| + .-+++||.+++++-.|+..+ T Consensus 492 ~eG~vv~G~VKnit--dyGaFvDLGGvDGLLHit-DiS--W~Rv~hP-s--e~~~vG~~i~vkVlk~D~e~~riSLglKQ 563 (863) T PRK12269 492 HIEDSVSGVVKSFT--SFGAFIDLGGFDGLLHVN-DMS--WGHVARP-R--EFVKKGQTIELKVIRLDQAEKRINLSLKH 563 (863) T ss_pred CCCCEEEEEEEECC--CCEEEEECCCCCEEEEEE-CCC--CCCCCCH-H--HHHCCCCEEEEEEEEECCCCCEEEECCCC T ss_conf 36877889998315--640589617710146621-113--5788998-9--97127977999998525654514311321 Q ss_pred ---------------CCEEEEEEEEEECCCCCCCCCCCCCEEEEEEEEECHHHCCCCCC----EECCCCEEEEEEE Q ss_conf ---------------44399999997133444677765511899999935011256785----1523775699999 Q gi|254780386|r 360 ---------------EKKFKAIIEKIDPIISQQDNNLQTQEYYEVILKITDLAFSDNNI----ELRNGFPAEVLFT 416 (440) Q Consensus 360 ---------------~~~~~G~V~~Is~~~~~~~~~~~~~~~y~v~i~~~~~~~~~~~~----~L~~Gm~~~~~I~ 416 (440) ...+.|+|++|.+.-..-+-..+ +.-.|.++..++.+++. -+..|-.+++.|. T Consensus 564 l~~dPW~~i~~ky~vg~~v~gkVtnitdyGaFVEle~G----iEGLvHvSEmsWtkk~~~P~k~v~~GdeVev~VL 635 (863) T PRK12269 564 FQPDPWLEFENKFGVNDVVKGRVTKIADFGAFIELAEG----IEGLAHISEFSWVKKTSKPSDMVKIGDEVECMIL 635 (863) T ss_pred CCCCCHHHHHHHCCCCCEEEEEEEEEECCCEEEEECCC----CEEEEEEHHHCCCCCCCCHHHCCCCCCEEEEEEE T ss_conf 77897678886388999899999997446569996389----2578870250202367997980768977999999 No 155 >pfam07831 PYNP_C Pyrimidine nucleoside phosphorylase C-terminal domain. This domain is found at the C-terminal end of the large alpha/beta domain making up various pyrimidine nucleoside phosphorylases. It has slightly different conformations in different members of this family. For example, in pyrimidine nucleoside phosphorylase (PYNP) there is an added three-stranded anti-parallel beta sheet as compared to other members of the family, such as E. coli thymidine phosphorylase (TP). The domain contains an alpha/ beta hammerhead fold and residues in this domain seem to be important in formation of the homodimer. Probab=79.93 E-value=1.4 Score=22.20 Aligned_cols=27 Identities=30% Similarity=0.514 Sum_probs=21.3 Q ss_pred CCCCCEEEEEECCCCCEEECCCEEEEECH Q ss_conf 68886899998578989336978999964 Q gi|254780386|r 61 SPFSGIIKKFQVENGSQILQGTPLLTFED 89 (440) Q Consensus 61 s~~~G~V~~i~VkeGd~VkkGq~L~~ld~ 89 (440) .+.-|+ .+.++.||.|++||+|+++.. T Consensus 29 D~~vGi--~l~~k~Gd~V~~g~pl~~i~~ 55 (75) T pfam07831 29 DPGAGI--YLHKKVGDKVKKGDPLATIYA 55 (75) T ss_pred CHHHHH--HEECCCCCEECCCCEEEEEEC T ss_conf 711225--402468999669993999977 No 156 >KOG1029 consensus Probab=79.68 E-value=0.17 Score=28.32 Aligned_cols=30 Identities=7% Similarity=-0.095 Sum_probs=14.6 Q ss_pred HCCCCCCEEEEEE--ECCCCCCCEEEEEEEEE Q ss_conf 0158983179998--12887544399999997 Q gi|254780386|r 341 HVKKGYTATVRFP--HYADIREKKFKAIIEKI 370 (440) Q Consensus 341 ~i~~Gq~v~i~~~--a~~~~~~~~~~G~V~~I 370 (440) .+.+|+-+.|... +-|.=-+|.|.|++-+. T Consensus 711 sf~pGDII~V~esq~aEPGWlaGel~gktGWF 742 (1118) T KOG1029 711 SFEPGDIIIVFESQAAEPGWLAGELRGKTGWF 742 (1118) T ss_pred CCCCCCEEEEEHHCCCCCCCCCCEECCCCCCC T ss_conf 12688779973210578540001132655767 No 157 >PRK13806 rpsA 30S ribosomal protein S1; Provisional Probab=79.51 E-value=4 Score=19.18 Aligned_cols=110 Identities=13% Similarity=0.087 Sum_probs=54.6 Q ss_pred CCCCCCCCEEEEECCEEECCCCCCCCCCCCCCCCCCCCCEEEEEEECHHHHH----------HCCCCCCEEEEEEECC-- Q ss_conf 2345554116873121201222124445542103355650477532622311----------0158983179998128-- Q gi|254780386|r 289 QKTILSPIAGTIVYNQSFSSSNYAQQSQPLMKIVPHSKLTYIRAKVTPKQIQ----------HVKKGYTATVRFPHYA-- 356 (440) Q Consensus 289 ~~~I~AP~~G~V~~l~~~~~G~~v~~G~~l~~Ivp~~~~l~v~~~v~~~di~----------~i~~Gq~v~i~~~a~~-- 356 (440) ++.+-.-+.|.|..+. .-|.+|. |-| . |+..|+.++++ .+++|+.|.+++-..+ T Consensus 288 k~~vG~~v~g~V~~i~--~fGaFVe-------l~~---g--ieGlvHiSelSw~~~~~~p~~~~~~Gd~v~vkIl~iD~e 353 (489) T PRK13806 288 RLEADQVVEGKVVRLA--PFGAFVE-------VLP---G--VEGLVHISEMSWTRRVNKPEEVVAPGDTVSVKIKELDPA 353 (489) T ss_pred HCCCCCEEEEEEEECC--CCCEEEE-------ECC---C--EEEEEEEHHCCCCCCCCCHHHHCCCCCEEEEEEEEECCC T ss_conf 3679983799999724--5406999-------638---8--079998044255345789899606687299999997143 Q ss_pred ----------------------CCCCCEEEEEEEEEECCCCCCCCCCCCCEEEEEEEEECHHHCCCC---CCEECCCCEE Q ss_conf ----------------------875443999999971334446777655118999999350112567---8515237756 Q gi|254780386|r 357 ----------------------DIREKKFKAIIEKIDPIISQQDNNLQTQEYYEVILKITDLAFSDN---NIELRNGFPA 411 (440) Q Consensus 357 ----------------------~~~~~~~~G~V~~Is~~~~~~~~~~~~~~~y~v~i~~~~~~~~~~---~~~L~~Gm~~ 411 (440) +.....++|+|..|......-+-... ....+..++.+.... ...+.+|-.+ T Consensus 354 ~rrisLs~Kq~~~nPw~~~~~~~~~G~~v~g~V~~i~~fG~fV~l~~g----i~Glih~sdls~~~~~~~~~~~k~Gd~V 429 (489) T PRK13806 354 KRRISLSLRDAEGDPWATAAERFAPGTTVTGTVEKRAQFGLFVNIAPG----VTGLLPESIIKSSPQSKSLAKLNAGDSV 429 (489) T ss_pred CCEEEEEHHHCCCCCHHHHHHHCCCCCEEEEEEEEEECCEEEEECCCC----CEEEEEHHHCCCCCCCCHHHHCCCCCEE T ss_conf 378996567534695889987636898799999997634899975899----0899998986756442738628999989 Q ss_pred EEEEE Q ss_conf 99999 Q gi|254780386|r 412 EVLFT 416 (440) Q Consensus 412 ~~~I~ 416 (440) +|.|. T Consensus 430 ~~~Il 434 (489) T PRK13806 430 TLTVR 434 (489) T ss_pred EEEEE T ss_conf 99999 No 158 >PRK04350 thymidine phosphorylase; Provisional Probab=78.83 E-value=2 Score=21.20 Aligned_cols=20 Identities=15% Similarity=0.212 Sum_probs=14.2 Q ss_pred CCCCCCCCCEEEEECCEEEC Q ss_conf 12345554116873121201 Q gi|254780386|r 288 EQKTILSPIAGTIVYNQSFS 307 (440) Q Consensus 288 ~~~~I~AP~~G~V~~l~~~~ 307 (440) -...|+||.+|+|..++... T Consensus 416 ~~~~v~A~~~G~V~~ID~~~ 435 (502) T PRK04350 416 HTYDVLAPRDGYVTAIDNRR 435 (502) T ss_pred EEEEEECCCCEEEEEECHHH T ss_conf 57999769984999851499 No 159 >COG1579 Zn-ribbon protein, possibly nucleic acid-binding [General function prediction only] Probab=78.64 E-value=0.41 Score=25.75 Aligned_cols=18 Identities=17% Similarity=0.268 Sum_probs=9.2 Q ss_pred EEECHHHHHHCCCCCCEE Q ss_conf 532622311015898317 Q gi|254780386|r 332 AKVTPKQIQHVKKGYTAT 349 (440) Q Consensus 332 ~~v~~~di~~i~~Gq~v~ 349 (440) +.+|..-++.|..+..+. T Consensus 205 m~l~~~~~~~V~~~d~iv 222 (239) T COG1579 205 MKLPSQTLSKVRKKDEIV 222 (239) T ss_pred EEECHHHHHHHHCCCCCC T ss_conf 451688998874489975 No 160 >PRK11281 potassium efflux protein KefA; Provisional Probab=78.19 E-value=0.06 Score=31.30 Aligned_cols=10 Identities=0% Similarity=0.361 Sum_probs=3.4 Q ss_pred HHHHHHHHHC Q ss_conf 4332332100 Q gi|254780386|r 103 IKNLRCHIDT 112 (440) Q Consensus 103 l~~l~~~l~~ 112 (440) +.+++.+++. T Consensus 94 ~~~~~~~l~~ 103 (1107) T PRK11281 94 LRQAQAELTA 103 (1107) T ss_pred HHHHHHHHHH T ss_conf 9999999985 No 161 >TIGR02927 SucB_Actino 2-oxoglutarate dehydrogenase, E2 component, dihydrolipoamide succinyltransferase; InterPro: IPR014276 This entry represents the mainly actinobacterial clade of the E2 component of the 2-oxoglutarate dehydrogenase complex involved in the TCA cycle. These proteins have multiple domains including the catalytic domain (IPR001078 from INTERPRO), one or two biotin domains (IPR000089 from INTERPRO) and an E3-component binding domain (IPR004167 from INTERPRO).. Probab=78.07 E-value=3 Score=19.96 Aligned_cols=32 Identities=28% Similarity=0.444 Sum_probs=29.3 Q ss_pred EEECCCCCEEEEEECCCCCEEECCCEEEEECH Q ss_conf 98468886899998578989336978999964 Q gi|254780386|r 58 EIKSPFSGIIKKFQVENGSQILQGTPLLTFED 89 (440) Q Consensus 58 ~I~s~~~G~V~~i~VkeGd~VkkGq~L~~ld~ 89 (440) .|=||++|+|.+|.+.|-|.|.-|-+|+++=| T Consensus 180 EiPSPvaGtlleI~a~EDdtVeVG~vlA~IGD 211 (607) T TIGR02927 180 EIPSPVAGTLLEILAEEDDTVEVGAVLAKIGD 211 (607) T ss_pred CCCCCCCCEEEEEEECCCCEEECCCEEEEECC T ss_conf 66887143688898177886431506654237 No 162 >COG1579 Zn-ribbon protein, possibly nucleic acid-binding [General function prediction only] Probab=77.95 E-value=0.36 Score=26.11 Aligned_cols=46 Identities=24% Similarity=0.270 Sum_probs=16.3 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 8743433233210012333322211455542235688887765546 Q gi|254780386|r 99 KKVSIKNLRCHIDTEKSALDFLNNKYDVLEKAFESREEQLLDFLGK 144 (440) Q Consensus 99 ~~~~l~~l~~~l~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 144 (440) +..++..+.......+..+.....+.+.....+...+..+.+...+ T Consensus 15 lD~e~~rl~~~~~~~~~~l~k~~~e~e~~~~~~~~~~~e~e~le~q 60 (239) T COG1579 15 LDLEKDRLEPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQ 60 (239) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9989888777541059999999999999999999999999999989 No 163 >TIGR01945 rnfC electron transport complex, RnfABCDGE type, C subunit; InterPro: IPR010208 The six subunit complex RnfABCDGE in Rhodobacter capsulatus encodes an apparent NADH oxidoreductase responsible for electron transport to nitrogenase, necessary for nitrogen fixation . A closely related complex in Escherichia coli, RsxABCDGE (Reducer of SoxR), reduces the 2Fe-2S-containing superoxide sensor SoxR, active as a transcription factor when oxidized . This family of putative NADH oxidoreductase complexes exists in many of the same species as the related NQR, a Na(+)-translocating NADH-quinone reductase (IPR003667 from INTERPRO), but is distinct. This entry describes the C subunit.; GO: 0051539 4 iron 4 sulfur cluster binding, 0006118 electron transport, 0016020 membrane. Probab=77.79 E-value=1 Score=23.14 Aligned_cols=25 Identities=20% Similarity=0.327 Sum_probs=19.7 Q ss_pred CEEEEEECCCCCEEECCCEEEEECH Q ss_conf 6899998578989336978999964 Q gi|254780386|r 65 GIIKKFQVENGSQILQGTPLLTFED 89 (440) Q Consensus 65 G~V~~i~VkeGd~VkkGq~L~~ld~ 89 (440) |.=-+-.|+.||.|.|||+|+.-+. T Consensus 40 G~p~~p~V~~GD~VLkGq~Ia~~~G 64 (444) T TIGR01945 40 GAPAEPIVKVGDKVLKGQLIAKADG 64 (444) T ss_pred CCCCCCEECCCCEEECCCEECCCCC T ss_conf 8777730027865206611006774 No 164 >COG0739 NlpD Membrane proteins related to metalloendopeptidases [Cell envelope biogenesis, outer membrane] Probab=77.42 E-value=3.3 Score=19.66 Aligned_cols=54 Identities=19% Similarity=0.349 Sum_probs=26.3 Q ss_pred CCCCCCCCCEEEEECCEEECCCCCCCCCCCCCCCCCCCCCEEEEEEECHHHHHHCCCCCCEE Q ss_conf 12345554116873121201222124445542103355650477532622311015898317 Q gi|254780386|r 288 EQKTILSPIAGTIVYNQSFSSSNYAQQSQPLMKIVPHSKLTYIRAKVTPKQIQHVKKGYTAT 349 (440) Q Consensus 288 ~~~~I~AP~~G~V~~l~~~~~G~~v~~G~~l~~Ivp~~~~l~v~~~v~~~di~~i~~Gq~v~ 349 (440) ..+.|+|+.+|+|..... ..+.+ |-.+ ++--.+ =+...|-.-. --.|+.||.|+ T Consensus 172 ~gtpV~A~~~G~V~~~~~-~~~~y---Gn~v--~i~H~~-g~~t~Y~Hl~-~~~V~~G~~V~ 225 (277) T COG0739 172 EGTPVYAAADGVVVFAGN-GGWGY---GNLV--IIKHGD-GYVTVYAHLS-SILVKEGQKVK 225 (277) T ss_pred CCCEEEECCCEEEEEECC-CCCCC---CEEE--EEECCC-CEEEEEECCC-CCCCCCCCEEE T ss_conf 998678516818999704-76667---7089--997489-6588996065-01315897884 No 165 >KOG0558 consensus Probab=76.97 E-value=2.2 Score=20.87 Aligned_cols=43 Identities=26% Similarity=0.388 Sum_probs=31.2 Q ss_pred EEEECCCEEEEECCCCCEEEEEECCCCCEEECCCEEEEECHHH Q ss_conf 9995387389846888689999857898933697899996489 Q gi|254780386|r 49 EILNEDNVVEIKSPFSGIIKKFQVENGSQILQGTPLLTFEDIE 91 (440) Q Consensus 49 ~i~~~~~~~~I~s~~~G~V~~i~VkeGd~VkkGq~L~~ld~~~ 91 (440) +|.++....+|.++-.|+|+++|-+.+|..+-|++|+.++-.. T Consensus 100 EVQSDKAsvtItsRydG~v~ki~h~~ddia~VGk~Lvd~eve~ 142 (474) T KOG0558 100 EVQSDKASVTITSRYDGKVKKIYHSPDDIAKVGKPLVDLEVED 142 (474) T ss_pred HCCCCCCEEEEEEEECCEEEEEEECCHHHHHHCCCEEEEEECC T ss_conf 0204652489986406567787407334667386115566115 No 166 >PRK06078 pyrimidine-nucleoside phosphorylase; Reviewed Probab=76.90 E-value=2.4 Score=20.63 Aligned_cols=24 Identities=13% Similarity=-0.090 Sum_probs=16.9 Q ss_pred CCCCCCCCCCEEEEECCEEECCCC Q ss_conf 212345554116873121201222 Q gi|254780386|r 287 REQKTILSPIAGTIVYNQSFSSSN 310 (440) Q Consensus 287 l~~~~I~AP~~G~V~~l~~~~~G~ 310 (440) .....|+||.+|+|..++....|. T Consensus 333 ~~~~~i~A~~~G~V~~id~~~iG~ 356 (434) T PRK06078 333 KYQIEVPAKESGYISEIVADEIGL 356 (434) T ss_pred CEEEEEECCCCEEEEEECHHHHHH T ss_conf 868999738865999954899999 No 167 >PRK05820 deoA thymidine phosphorylase; Reviewed Probab=76.35 E-value=2.6 Score=20.42 Aligned_cols=23 Identities=9% Similarity=0.001 Sum_probs=15.9 Q ss_pred CCCCCCCCCEEEEECCEEECCCC Q ss_conf 12345554116873121201222 Q gi|254780386|r 288 EQKTILSPIAGTIVYNQSFSSSN 310 (440) Q Consensus 288 ~~~~I~AP~~G~V~~l~~~~~G~ 310 (440) ....|+||.+|+|..++....|. T Consensus 339 ~~~~i~A~~~G~v~~id~~~iG~ 361 (440) T PRK05820 339 VIKPVYADRSGFLSAMDTRALGM 361 (440) T ss_pred EEEEEECCCCEEEEEECHHHHHH T ss_conf 68999728966999853799999 No 168 >pfam01576 Myosin_tail_1 Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament. Probab=76.08 E-value=0.1 Score=29.80 Aligned_cols=190 Identities=15% Similarity=0.140 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHCCCCCCCCC Q ss_conf 99888999874343323321001233332221145554223568888776554678865433--3655420011222112 Q gi|254780386|r 91 ETTDLIDLKKVSIKNLRCHIDTEKSALDFLNNKYDVLEKAFESREEQLLDFLGKYHSTCSKV--FNYGISSVYRSLKMKL 168 (440) Q Consensus 91 ~~~~~l~~~~~~l~~l~~~l~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~ 168 (440) .....++.+......+..........++.+...++........................... ....+..+........ T Consensus 135 el~~q~e~l~~~~~~lek~~~~l~~e~~~l~~~l~~~~~~k~~~e~~~~~le~~l~el~~k~~e~~~~~~el~~~~~~l~ 214 (859) T pfam01576 135 ELSEQIEQLQKQKAKAEKEKSQLQAEVDDLLAQLDQIAKAKLNAEKKAKQLESQLSELQVKLDELQRQLNDLTSQKSRLQ 214 (859) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999998899999999999999997788877779877888999998899999999988877888998998889 Q ss_pred CCCHHHHHHCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCC Q ss_conf 21002232101122322222210233332221012222221110000111002466653101223322222222222100 Q gi|254780386|r 169 TRMRSIYLNRLDLENKSSLLRALLSSHQKDLTMMKNLFEKNAVSQNDLNQQERIVYKSTLELRENIHEQKNMLRHINELY 248 (440) Q Consensus 169 ~~~~~~~~~~~~l~~~~~~l~~~~~~~~~el~~~~~l~~~~~~~~~~l~~~~~~~~~~~~el~~~~~~~~~~~~~l~~~~ 248 (440) .....+.............+.........++..++...+........+......+......+...+.+.......+.... T Consensus 215 ~e~~~l~~~lee~e~~~~~l~~~~~~l~~qlee~~~~lee~~~~~~~l~~~l~~le~e~~~l~~~lee~~~~~~~le~~l 294 (859) T pfam01576 215 SENSDLTRQLEEAEAQVSNLSKLKSQLESQLEEAKRSLEEESRERANLQAQLRQLEHDLDSLREQLEEESEAKAELERQL 294 (859) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 88999998777789999998888999999999999888999999999999999999999999998999999999999999 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 01223332112332221222222221222332 Q gi|254780386|r 249 DEANLEFANYLKEISRDLEQNQRTFADEMSKL 280 (440) Q Consensus 249 ~~~~~~~~~~~~e~~~~l~e~~~~l~~~~~~l 280 (440) ......+......+...+......+.+....+ T Consensus 295 ~~~~~el~~~~~k~e~e~~~~~~e~Ee~~k~l 326 (859) T pfam01576 295 SKANAEIQQWRSKFESEGALRAEELEELKKKL 326 (859) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 98677899999989888887788899999999 No 169 >PRK06106 nicotinate-nucleotide pyrophosphorylase; Provisional Probab=75.64 E-value=4 Score=19.12 Aligned_cols=27 Identities=15% Similarity=0.255 Sum_probs=22.8 Q ss_pred CCEEEEEECCCCCEEECCCEEEEECHH Q ss_conf 868999985789893369789999648 Q gi|254780386|r 64 SGIIKKFQVENGSQILQGTPLLTFEDI 90 (440) Q Consensus 64 ~G~V~~i~VkeGd~VkkGq~L~~ld~~ 90 (440) +++.-++++++|+.|++||+++++... T Consensus 66 ~~i~~~~~~~DG~~v~~g~~i~~i~G~ 92 (281) T PRK06106 66 PEIEMRRHLPDGAAVAPGDVIATISGP 92 (281) T ss_pred CCEEEEEECCCCCEECCCCEEEEEEEC T ss_conf 965999985889881389868999947 No 170 >COG0157 NadC Nicotinate-nucleotide pyrophosphorylase [Coenzyme metabolism] Probab=75.02 E-value=4.2 Score=18.99 Aligned_cols=25 Identities=24% Similarity=0.381 Sum_probs=20.7 Q ss_pred EEEEEECCCCCEEECCCEEEEECHH Q ss_conf 8999985789893369789999648 Q gi|254780386|r 66 IIKKFQVENGSQILQGTPLLTFEDI 90 (440) Q Consensus 66 ~V~~i~VkeGd~VkkGq~L~~ld~~ 90 (440) +-..++|++|+.|++||+|+++... T Consensus 64 i~~~~~~~DG~~v~~g~~i~~~~G~ 88 (280) T COG0157 64 IEIQWLVKDGDRVKPGDVLAEIEGP 88 (280) T ss_pred EEEEEECCCCCEECCCCEEEEEECC T ss_conf 6898872888880799789999565 No 171 >TIGR00078 nadC nicotinate-nucleotide pyrophosphorylase; InterPro: IPR004393 Nicotinate-nucleotide pyrophosphorylase (2.4.2.19 from EC), also known as quinolinate phosphoribosyltransferase (decarboxylating), catalyses the conversion of nicotinate D-ribonucleotide, pyrophosphate and carbon dioxide into pyridine-2,3-dicarboxylate and 5-phospho-alpha-D-ribose 1-diphosphate. This enzyme is a type II phosphoribosyltransferase which provides the de novo source of nicotinate mononucleotide (NAMN) for NAD biosynthesis in both prokaryotes and eukaryotes , . Structural studies have shown that the active form of this enzyme is a homodimer with an unsual fold , , . The N-terminal forms a mixed alpha/beta domain, while the C-terminal region forms a multi-stranded, open alpha/beta barrel. The active site is found at the C-terminal ends of the beta strands of the alpha/beta barrel, and is bordered by the N-terminal domain of the second subunit of the dimer. It contains several conserved charged residues thought to be important determinants of substrate binding and catalysis. ; GO: 0004514 nicotinate-nucleotide diphosphorylase (carboxylating) activity, 0019363 pyridine nucleotide biosynthetic process. Probab=74.71 E-value=3.6 Score=19.43 Aligned_cols=26 Identities=19% Similarity=0.404 Sum_probs=22.2 Q ss_pred CEEEEEECCCCCEEECCCEEEEECHH Q ss_conf 68999985789893369789999648 Q gi|254780386|r 65 GIIKKFQVENGSQILQGTPLLTFEDI 90 (440) Q Consensus 65 G~V~~i~VkeGd~VkkGq~L~~ld~~ 90 (440) |.=...++++|+.|++||+++++... T Consensus 59 ~~~~~~~~~DG~~~~~G~~v~~~~G~ 84 (276) T TIGR00078 59 GVQVEWLVKDGDEVEAGEVVAEVEGP 84 (276) T ss_pred CEEEEEECCCCCEECCCCEEEEEECC T ss_conf 17984103788774477389999746 No 172 >KOG0994 consensus Probab=74.63 E-value=0.043 Score=32.30 Aligned_cols=15 Identities=7% Similarity=0.151 Sum_probs=4.8 Q ss_pred HHHHHHHHHHHHHHH Q ss_conf 998743433233210 Q gi|254780386|r 97 DLKKVSIKNLRCHID 111 (440) Q Consensus 97 ~~~~~~l~~l~~~l~ 111 (440) +.++.++..+...|. T Consensus 1235 ~~lr~~l~~~~e~L~ 1249 (1758) T KOG0994 1235 ESLRRQLQALTEDLP 1249 (1758) T ss_pred HHHHHHHHHHHHHHH T ss_conf 999999999875415 No 173 >pfam10174 Cast RIM-binding protein of the cytomatrix active zone. This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains. Probab=73.42 E-value=0.23 Score=27.47 Aligned_cols=175 Identities=10% Similarity=0.073 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHCCCCCCCCC Q ss_conf 99888999874343323321001233332221145554223568888776554678865433--3655420011222112 Q gi|254780386|r 91 ETTDLIDLKKVSIKNLRCHIDTEKSALDFLNNKYDVLEKAFESREEQLLDFLGKYHSTCSKV--FNYGISSVYRSLKMKL 168 (440) Q Consensus 91 ~~~~~l~~~~~~l~~l~~~l~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~ 168 (440) +....+...+..+..+...+...+..+.....+.+.+...+......+............++ ....+..+...+.... T Consensus 235 ek~~~~~~~e~~l~~le~ei~~l~~~l~~~~~e~~~l~~elE~~~~~l~~~k~~~~~~~~el~~~~~el~~lq~~le~l~ 314 (774) T pfam10174 235 EKDTKIKEFEKMLEKAENEIYRLQSRCDTSEADRNRLDKEVEAERSALAAMKAKCDRAAQELSRKKTELLGLQTELETLA 314 (774) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 88878999999999989999999998888776899999999999999999887799999889999999999999999998 Q ss_pred CCCHHHHHHCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCC Q ss_conf 21002232101122322222210233332221012222221110000111002466653101223322222222222100 Q gi|254780386|r 169 TRMRSIYLNRLDLENKSSLLRALLSSHQKDLTMMKNLFEKNAVSQNDLNQQERIVYKSTLELRENIHEQKNMLRHINELY 248 (440) Q Consensus 169 ~~~~~~~~~~~~l~~~~~~l~~~~~~~~~el~~~~~l~~~~~~~~~~l~~~~~~~~~~~~el~~~~~~~~~~~~~l~~~~ 248 (440) ............+.............++.++..++..++........+......+......+...+......+....... T Consensus 315 ~e~~e~~~~le~lq~el~~~e~e~~~lq~eie~L~~eLe~~~~~l~k~~~~le~lqee~~~l~~ele~l~~~l~~~e~el 394 (774) T pfam10174 315 NQDSDMRQHLDKLKEDLTRAEQEKAILQTEVDALRYELERKHNTLTKKTASLQAAQEEKATYAGEIEDMRDRYEKTERKL 394 (774) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 89999999999999999989999998899999999999999989999999999988899999999999999999999999 Q ss_pred CCCCCCCCCCCCCCCCC Q ss_conf 01223332112332221 Q gi|254780386|r 249 DEANLEFANYLKEISRD 265 (440) Q Consensus 249 ~~~~~~~~~~~~e~~~~ 265 (440) ..+...+......+... T Consensus 395 ~~lq~e~e~L~~~l~~~ 411 (774) T pfam10174 395 RVLQKKIENLQETFRRK 411 (774) T ss_pred HHHHHHHHHHHHHHHHH T ss_conf 99999999999999989 No 174 >pfam02749 QRPTase_N Quinolinate phosphoribosyl transferase, N-terminal domain. Quinolinate phosphoribosyl transferase (QPRTase) or nicotinate-nucleotide pyrophosphorylase EC:2.4.2.19 is involved in the de novo synthesis of NAD in both prokaryotes and eukaryotes. It catalyses the reaction of quinolinic acid with 5-phosphoribosyl-1-pyrophosphate (PRPP) in the presence of Mg2+ to give rise to nicotinic acid mononucleotide (NaMN), pyrophosphate and carbon dioxide. The QA substrate is bound between the C-terminal domain of one subunit, and the N-terminal domain of the other. The N-terminal domain has an alpha/beta hammerhead fold. Probab=73.39 E-value=4.4 Score=18.88 Aligned_cols=23 Identities=22% Similarity=0.440 Sum_probs=20.0 Q ss_pred EEEECCCCCEEECCCEEEEECHH Q ss_conf 99985789893369789999648 Q gi|254780386|r 68 KKFQVENGSQILQGTPLLTFEDI 90 (440) Q Consensus 68 ~~i~VkeGd~VkkGq~L~~ld~~ 90 (440) -.+++++|+.|++|++++++... T Consensus 47 v~~~~~dG~~v~~g~~i~~i~G~ 69 (88) T pfam02749 47 VEWLVKDGERVEAGDVILEIEGP 69 (88) T ss_pred EEEECCCCCEECCCCEEEEEEEC T ss_conf 99996889991799999999988 No 175 >pfam01601 Corona_S2 Coronavirus S2 glycoprotein. The coronavirus spike glycoprotein forms the characteristic 'corona' after which the group is named. The Spike glycoprotein is translated as a large polypeptide that is subsequently cleaved to S1 pfam01600 and S2. Probab=73.38 E-value=3.5 Score=19.52 Aligned_cols=13 Identities=8% Similarity=0.404 Sum_probs=6.9 Q ss_pred CCCCCCCCCCCCC Q ss_conf 2124445542103 Q gi|254780386|r 310 NYAQQSQPLMKIV 322 (440) Q Consensus 310 ~~v~~G~~l~~Iv 322 (440) ++-..|..|++|. T Consensus 371 gFCGnG~Hi~Sl~ 383 (609) T pfam01601 371 GFCGNGTHIFSLV 383 (609) T ss_pred CEECCCCEEEEEC T ss_conf 5206984798742 No 176 >PRK09835 sensor kinase CusS; Provisional Probab=72.83 E-value=5.3 Score=18.35 Aligned_cols=29 Identities=14% Similarity=0.228 Sum_probs=19.8 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 45763356861689999999999999998 Q gi|254780386|r 5 MWKNLKRNHPASLKRMTLLSIIGIIGLIT 33 (440) Q Consensus 5 ~~~~~~~~~p~~~~~~~~~~~~~~~~~~~ 33 (440) -.||++||..-..|+.+|+++++++++++ T Consensus 2 ~~~~~~R~~SL~~RLtl~~~l~~lv~~~~ 30 (482) T PRK09835 2 VSKPFQRPFSLATRLTFFISLATIAAFFA 30 (482) T ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 98889996129999999999999999999 No 177 >TIGR01107 Na_K_ATPase_bet Na+/K+ ATPase, beta subunit; InterPro: IPR000402 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (F-, V- and A-ATPases contain rotary motors) and in the type of ions they transport , . F-ATPases (F1F0-ATPases) in mitochondria, chloroplasts and bacterial plasma membranes are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases) are primarily found in eukaryotic vacuoles, catalysing ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases) are found in Archaea and function like F-ATPases. P-ATPases (E1E2-ATPases) are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. P-ATPases (sometime known as E1-E2 ATPases) (3.6.3.- from EC) are found in bacteria and in a number of eukaryotic plasma membranes and organelles . P-ATPases function to transport a variety of different compounds, including ions and phospholipids, across a membrane using ATP hydrolysis for energy. There are many different classes of P-ATPases, each of which transports a specific type of ion: H^+, Na^+, K^+, Mg^2+, Ca^2+, Ag^+ and Ag^2+, Zn^2+, Co^2+, Pb^2+, Ni^2+, Cd^2+, Cu^+ and Cu^2+. P-ATPases can be composed of one or two polypeptides, and can usually assume two main conformations called E1 and E2. This entry represents the beta subunit found in the P-type cation exchange ATPases located in the plasma membranes of animal cells. These P-ATPases include both H^+/K^+-ATPases (3.6.3.10 from EC) and Na^+/K^+-ATPases (3.6.3.9 from EC), which belong to the IIC subfamily of ATPases , . These ATPases catalyse the hydrolysis of ATP coupled with the exchange of cations, pumping one cation out of the cell (H^+ or Na^+) in exchange for K^+. These ATPases contain an alpha subunit (IPR005775 from INTERPRO) that is the catalytic component, and a glycosylated beta subunit that regulates the number of sodium pumps transported to the plasma membrane through the assembly of alpha/beta heterodimers. The beta subunit has three highly conserved disulphide bonds within the extracellular domain that stabilize the alpha subunit, the alpha/beta interaction, and the catalytic activity of the alpha subunit . Different beta isoforms exist, permitting greater regulatory control. An example of a H^+/K^+-ATPase is the gastric pump responsible for acid secretion in the stomach, transporting protons from the cytoplasm of parietal cells to create a large pH gradient in exchange for the internalization of potassium ions, using ATP hydrolysis to drive the pump . More information about this protein can be found at Protein of the Month: ATP Synthases . ; GO: 0005391 sodium:potassium-exchanging ATPase activity, 0006813 potassium ion transport, 0006814 sodium ion transport, 0016020 membrane. Probab=72.63 E-value=4 Score=19.13 Aligned_cols=42 Identities=21% Similarity=0.326 Sum_probs=28.2 Q ss_pred CCCCCCC-----CCCCCCHHHHHHHHH-----HHHHHHHHHHHHHEEEEEEE Q ss_conf 6334576-----335686168999999-----99999999987640671258 Q gi|254780386|r 2 HHFMWKN-----LKRNHPASLKRMTLL-----SIIGIIGLITWSILLPIEIS 43 (440) Q Consensus 2 ~~~~~~~-----~~~~~p~~~~~~~~~-----~~~~~~~~~~w~~~~~~~~~ 43 (440) .+|.|-| |-|+.-+|.+|+++. ++++++++-+|..+-.|+.- T Consensus 18 K~F~WNp~~~efLGRTg~SW~~ILLFYl~FYG~LAglF~~Ti~VmLlT~~d~ 69 (317) T TIGR01107 18 KKFIWNPEKKEFLGRTGGSWFKILLFYLVFYGCLAGLFIGTIQVMLLTISDF 69 (317) T ss_pred EEEEECCCCCEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 0354668985223711233555788889999999999999999986103667 No 178 >pfam12128 DUF3584 Protein of unknown function (DUF3584). This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. There are two conserved sequence motifs: GKT and YLP. Probab=72.58 E-value=0.13 Score=29.09 Aligned_cols=23 Identities=4% Similarity=0.086 Sum_probs=8.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 89988899987434332332100 Q gi|254780386|r 90 IETTDLIDLKKVSIKNLRCHIDT 112 (440) Q Consensus 90 ~~~~~~l~~~~~~l~~l~~~l~~ 112 (440) .+.+..+...+..+.......+. T Consensus 601 ~~l~~~l~~~~~~l~~~~~~~~~ 623 (1192) T pfam12128 601 TELRERLQQAEEALQSAVAKQKQ 623 (1192) T ss_pred HHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999999999999 No 179 >PRK06299 rpsA 30S ribosomal protein S1; Reviewed Probab=72.58 E-value=6 Score=17.98 Aligned_cols=37 Identities=19% Similarity=0.308 Sum_probs=16.8 Q ss_pred EEEEEEEEECCC---EEEEECCCCCEEEE---------EECCCCCEEEC Q ss_conf 999799995387---38984688868999---------98578989336 Q gi|254780386|r 44 VSSSGEILNEDN---VVEIKSPFSGIIKK---------FQVENGSQILQ 80 (440) Q Consensus 44 v~~~G~i~~~~~---~~~I~s~~~G~V~~---------i~VkeGd~Vkk 80 (440) -...|+|+.-.. ...|.....|+|.. ..++.||.|+. T Consensus 22 ~iV~G~Vv~i~~~~v~VdiG~K~eG~I~~~E~~~~~~~~~~~vGd~i~v 70 (556) T PRK06299 22 SIVKGTVVAIDKDYVLVDVGLKSEGRIPLEEFKNDQGELEVKVGDEVEV 70 (556) T ss_pred CEEEEEEEEEECCEEEEEECCCCEEEEEHHHHCCCCCCCCCCCCCEEEE T ss_conf 9899999999799999996888357879999347767746799999999 No 180 >pfam02666 PS_Dcarbxylase Phosphatidylserine decarboxylase. This is a family of phosphatidylserine decarboxylases, EC:4.1.1.65. These enzymes catalyse the reaction: Phosphatidyl-L-serine <= phosphatidylethanolamine + CO2. Phosphatidylserine decarboxylase plays a central role in the biosynthesis of aminophospholipids by converting phosphatidylserine to phosphatidylethanolamine. Probab=72.21 E-value=4.3 Score=18.93 Aligned_cols=22 Identities=14% Similarity=0.167 Sum_probs=12.1 Q ss_pred CCCCCCEEEEECCEEECCCCCCC Q ss_conf 45554116873121201222124 Q gi|254780386|r 291 TILSPIAGTIVYNQSFSSSNYAQ 313 (440) Q Consensus 291 ~I~AP~~G~V~~l~~~~~G~~v~ 313 (440) ..+||++|.|..... -+|.+.. T Consensus 83 r~hsPv~G~v~~~~~-~~G~~~~ 104 (201) T pfam02666 83 RNHAPVDGTVKEVRH-IPGKLYP 104 (201) T ss_pred EEECCCCEEEEEEEE-ECCCEEE T ss_conf 487673459999999-4994023 No 181 >TIGR03327 AMP_phos AMP phosphorylase. This enzyme family is found, so far, strictly in the Archaea, and only in those with a type III Rubisco enzyme. Most of the members previously were annotated as thymidine phosphorylase, or DeoA. The AMP metabolized by this enzyme may be produced by ADP-dependent sugar kinases. Probab=72.11 E-value=3.8 Score=19.30 Aligned_cols=35 Identities=20% Similarity=0.249 Sum_probs=22.4 Q ss_pred CCCCCCCCCEEEEECCEEE-----------------------CCCCCCCCCCCCCCCC Q ss_conf 1234555411687312120-----------------------1222124445542103 Q gi|254780386|r 288 EQKTILSPIAGTIVYNQSF-----------------------SSSNYAQQSQPLMKIV 322 (440) Q Consensus 288 ~~~~I~AP~~G~V~~l~~~-----------------------~~G~~v~~G~~l~~Iv 322 (440) ....|.||.+|+|..++.. +.|+.|..|+||++|. T Consensus 413 ~~~~v~A~~~G~V~~id~~~i~~~a~~aGap~D~~aGi~l~~k~Gd~V~~GdpL~tih 470 (500) T TIGR03327 413 YTYTITAPTDGYVTDIDNKAITQIAREAGAPNDKGAGVYLHVKVGEKVKKGDPLYTIY 470 (500) T ss_pred EEEEEECCCCEEEEEEEHHHHHHHHHHHCCCCCCCCCEEEECCCCCEECCCCEEEEEE T ss_conf 5799972898699985069999999986899886457698436879877999399997 No 182 >PRK06978 nicotinate-nucleotide pyrophosphorylase; Provisional Probab=71.29 E-value=5.9 Score=18.02 Aligned_cols=27 Identities=11% Similarity=0.271 Sum_probs=22.1 Q ss_pred CCEEEEEECCCCCEEECCCEEEEECHH Q ss_conf 868999985789893369789999648 Q gi|254780386|r 64 SGIIKKFQVENGSQILQGTPLLTFEDI 90 (440) Q Consensus 64 ~G~V~~i~VkeGd~VkkGq~L~~ld~~ 90 (440) +++--.+++++|+.|++|++++++... T Consensus 74 ~~i~v~~~~~DG~~v~~G~~i~~i~G~ 100 (288) T PRK06978 74 PSIEVQWRYREGDRMTADSTVCELEGP 100 (288) T ss_pred CCEEEEEECCCCCEECCCCEEEEEEEC T ss_conf 972999983889891799999999988 No 183 >cd04089 eRF3_II eRF3_II: domain II of the eukaryotic class II release factor (eRF3). In eukaryotes, translation termination is mediated by two interacting release factors, eRF1 and eRF3, which act as class I and II factors, respectively. eRF1 functions as an omnipotent release factor, decoding all three stop codons and triggering the release of the nascent peptide catalyzed by the ribsome. eRF3 is a GTPase, which enhances the termination efficiency by stimulating the eRF1 activity in a GTP-dependent manner. Sequence comparison of class II release factors with elongation factors shows that eRF3 is more similar to eEF1alpha whereas prokaryote RF3 is more similar to EF-G, implying that their precise function may differ. Only eukaryote RF3s are found in this group. Saccharomyces cerevisiae eRF3 (Sup35p) is a translation termination factor which is divided into three regions N, M and a C-terminal eEF1a-like region essential for translation termination. Sup35NM is a non-pathogenic prion-li Probab=70.88 E-value=6.1 Score=17.90 Aligned_cols=25 Identities=20% Similarity=0.131 Sum_probs=11.3 Q ss_pred HHHCCCCCCEEEEEEECCCCCCCEEEEEEEEEE Q ss_conf 110158983179998128875443999999971 Q gi|254780386|r 339 IQHVKKGYTATVRFPHYADIREKKFKAIIEKID 371 (440) Q Consensus 339 i~~i~~Gq~v~i~~~a~~~~~~~~~~G~V~~Is 371 (440) -+.|++|+++.+ +|.. ..++|.+|- T Consensus 23 sG~v~~Gd~v~i----~P~~----~~~~Vk~I~ 47 (82) T cd04089 23 SGTIKKGDKLLV----MPNK----TQVEVLSIY 47 (82) T ss_pred EEEECCCCEEEE----ECCC----CEEEEEEEE T ss_conf 677829999999----5899----889999999 No 184 >PRK12784 hypothetical protein; Provisional Probab=70.27 E-value=6.7 Score=17.64 Aligned_cols=36 Identities=22% Similarity=0.182 Sum_probs=28.2 Q ss_pred CCCCCCEEEEECCEEECCCCCCCCCCCCCCCCCCCCC Q ss_conf 4555411687312120122212444554210335565 Q gi|254780386|r 291 TILSPIAGTIVYNQSFSSSNYAQQSQPLMKIVPHSKL 327 (440) Q Consensus 291 ~I~AP~~G~V~~l~~~~~G~~v~~G~~l~~Ivp~~~~ 327 (440) .|.||.-|.|..+.+ +.+.+|-...+|+.|-..+.. T Consensus 7 ~iyS~c~G~Vekifi-~e~SyVYEWEkL~~I~~~dg~ 42 (84) T PRK12784 7 EICSSYEGKVEEIFV-NESSYVYEWEKLMMIRKNNGE 42 (84) T ss_pred HHCCCCCCEEEEEEE-CCCCEEEEEEEEEEEEECCCC T ss_conf 344766318889998-587537864234577506982 No 185 >KOG0971 consensus Probab=69.73 E-value=0.59 Score=24.68 Aligned_cols=21 Identities=14% Similarity=0.078 Sum_probs=10.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHH Q ss_conf 998889998743433233210 Q gi|254780386|r 91 ETTDLIDLKKVSIKNLRCHID 111 (440) Q Consensus 91 ~~~~~l~~~~~~l~~l~~~l~ 111 (440) ..+.++..+...++.++.+.. T Consensus 228 eLr~QvrdLtEkLetlR~kR~ 248 (1243) T KOG0971 228 ELRAQVRDLTEKLETLRLKRA 248 (1243) T ss_pred HHHHHHHHHHHHHHHHHHHHH T ss_conf 899999999999999874533 No 186 >TIGR02161 napC_nirT periplasmic nitrate (or nitrite) reductase c-type cytochrome, NapC/NirT family; InterPro: IPR011885 This entry contains NapC, a predicted membrane-anchored four-heme c-type cytochrome that forms one component of the periplasmic nitrate reductase along with NapA, NapB, NapD, NapE, and NapF subunits. A single known exception at this time is NirT, which is instead a component of a nitrite reductase. This family excludes TorC subunits of trimethylamine N-oxide (TMAO) reductases .; GO: 0020037 heme binding, 0006118 electron transport, 0016021 integral to membrane. Probab=69.16 E-value=5.3 Score=18.31 Aligned_cols=32 Identities=28% Similarity=0.426 Sum_probs=22.7 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHEE Q ss_conf 4576335686168999999999999999876406 Q gi|254780386|r 5 MWKNLKRNHPASLKRMTLLSIIGIIGLITWSILL 38 (440) Q Consensus 5 ~~~~~~~~~p~~~~~~~~~~~~~~~~~~~w~~~~ 38 (440) .|+-+.+|+ .+.+.+++++.++.|++.|+.|- T Consensus 5 ~~~~~~~PS--~~~~G~l~~~~~~~g~~~W~gFN 36 (185) T TIGR02161 5 FWKVLRRPS--RLALGLLLLVGFVGGIVFWGGFN 36 (185) T ss_pred HHHHHCCCH--HHHHHHHHHHHHHHHHHHHHHHH T ss_conf 899850734--68899999999999998765687 No 187 >COG3402 Uncharacterized conserved protein [Function unknown] Probab=69.05 E-value=7.1 Score=17.47 Aligned_cols=15 Identities=13% Similarity=-0.186 Sum_probs=11.9 Q ss_pred CCCCCCCCCCCCCHH Q ss_conf 633457633568616 Q gi|254780386|r 2 HHFMWKNLKRNHPAS 16 (440) Q Consensus 2 ~~~~~~~~~~~~p~~ 16 (440) +.||.|.-+|..+.+ T Consensus 3 ~~~m~~~s~k~~~vw 17 (161) T COG3402 3 ENPMNPLSKKVLTVW 17 (161) T ss_pred CCCCCCCCCCCHHHH T ss_conf 664465785301078 No 188 >pfam07076 DUF1344 Protein of unknown function (DUF1344). This family consists of several short, hypothetical bacterial proteins of around 80 residues in length. Members of this family are found in Rhizobium, Agrobacterium and Brucella species. The function of this family is unknown. Probab=68.88 E-value=4.2 Score=19.00 Aligned_cols=50 Identities=12% Similarity=0.038 Sum_probs=22.7 Q ss_pred EEEEEEEEEECCCCCCCCCCCCCEEEEEEEEECHHHCCCCCCEECCCCEEEE--EEEECEE Q ss_conf 3999999971334446777655118999999350112567851523775699--9992205 Q gi|254780386|r 362 KFKAIIEKIDPIISQQDNNLQTQEYYEVILKITDLAFSDNNIELRNGFPAEV--LFTAKNT 420 (440) Q Consensus 362 ~~~G~V~~Is~~~~~~~~~~~~~~~y~v~i~~~~~~~~~~~~~L~~Gm~~~~--~I~~~~~ 420 (440) ..+|+|.+|++...+-+-. .+..|.+.-+++-. .|.|||.+-+ ..+.|+| T Consensus 24 d~Eg~I~~Vd~~~~tiTLd--DGKtY~lp~efd~~-------~L~~G~kVvV~Y~~~dGkr 75 (81) T pfam07076 24 DAEGTIASVDKDSLSITLD--DGKTYKLPEEFDFE-------ALKPGMKVVVFYDEVDGKR 75 (81) T ss_pred HCCCEEEEECCCCCEEEEC--CCCEEECCCCCCCC-------CCCCCCEEEEEEECCCCEE T ss_conf 3001345545776379956--99554189755700-------1378988999998317807 No 189 >PRK02693 apocytochrome f; Reviewed Probab=68.78 E-value=6.6 Score=17.71 Aligned_cols=24 Identities=17% Similarity=0.177 Sum_probs=14.8 Q ss_pred CCCCCCCCCCCEEEEECCEEECCC Q ss_conf 221234555411687312120122 Q gi|254780386|r 286 TREQKTILSPIAGTIVYNQSFSSS 309 (440) Q Consensus 286 ~l~~~~I~AP~~G~V~~l~~~~~G 309 (440) .-+++...||.+|+|..+.....| T Consensus 193 kSNN~~~~as~~G~I~~I~~~e~G 216 (312) T PRK02693 193 KSNNNVFTASAAGTITSIETGEDG 216 (312) T ss_pred CCCCCEECCCCCEEEEEEEECCCC T ss_conf 455725536677278887766888 No 190 >COG4656 RnfC Predicted NADH:ubiquinone oxidoreductase, subunit RnfC [Energy production and conversion] Probab=68.74 E-value=2.8 Score=20.20 Aligned_cols=25 Identities=32% Similarity=0.470 Sum_probs=19.3 Q ss_pred CEEEEEECCCCCEEECCCEEEEECH Q ss_conf 6899998578989336978999964 Q gi|254780386|r 65 GIIKKFQVENGSQILQGTPLLTFED 89 (440) Q Consensus 65 G~V~~i~VkeGd~VkkGq~L~~ld~ 89 (440) |.=-.++|++||.|.+||+|.+-+. T Consensus 42 g~~~~~~Vkvgd~V~~GQ~l~~~~g 66 (529) T COG4656 42 GAPGILLVKVGDKVLKGQPLTRGEG 66 (529) T ss_pred CCCCCEEEEECCEEECCCEEECCCC T ss_conf 9863448830888850864203678 No 191 >TIGR02013 rpoB DNA-directed RNA polymerase, beta subunit; InterPro: IPR010243 DNA-directed RNA polymerases 2.7.7.6 from EC (also known as DNA-dependent RNA polymerases) are responsible for the polymerisation of ribonucleotides into a sequence complementary to the template DNA. In eukaryotes, there are three different forms of DNA-directed RNA polymerases transcribing different sets of genes. Most RNA polymerases are multimeric enzymes and are composed of a variable number of subunits. The core RNA polymerase complex consists of five subunits (two alpha, one beta, one beta-prime and one omega) and is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme . The core RNA polymerase complex forms a "crab claw"-like structure with an internal channel running along the full length . The key functional sites of the enzyme, as defined by mutational and cross-linking analysis, are located on the inner wall of this channel. RNA synthesis follows after the attachment of RNA polymerase to a specific site, the promoter, on the template DNA strand. The RNA synthesis process continues until a termination sequence is reached. The RNA product, which is synthesised in the 5' to 3'direction, is known as the primary transcript. Eukaryotic nuclei contain three distinct types of RNA polymerases that differ in the RNA they synthesise: RNA polymerase I: located in the nucleoli, synthesises precursors of most ribosomal RNAs. RNA polymerase II: occurs in the nucleoplasm, synthesises mRNA precursors. RNA polymerase III: also occurs in the nucleoplasm, synthesises the precursors of 5S ribosomal RNA, the tRNAs, and a variety of other small nuclear and cytosolic RNAs. Eukaryotic cells are also known to contain separate mitochondrial and chloroplast RNA polymerases. Eukaryotic RNA polymerases, whose molecular masses vary in size from 500 to 700 kD, contain two non-identical large (>100 kDa) subunits and an array of up to 12 different small (less than 50 kDa) subunits. This entry describes orthologues of the beta subunit of bacterial RNA polymerase. The core enzyme consists of two alpha chains, one beta chain, and one beta' subunit.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006350 transcription. Probab=68.72 E-value=2.8 Score=20.21 Aligned_cols=66 Identities=15% Similarity=0.151 Sum_probs=37.1 Q ss_pred CCCCCCCCCCCCCCCCCCCCCEEEEEEECHHHHHHCCCCCCEEEEEEECCCCCCCEEEEEEEEEECCCCCCCCCCCCCEE Q ss_conf 12221244455421033556504775326223110158983179998128875443999999971334446777655118 Q gi|254780386|r 307 SSSNYAQQSQPLMKIVPHSKLTYIRAKVTPKQIQHVKKGYTATVRFPHYADIREKKFKAIIEKIDPIISQQDNNLQTQEY 386 (440) Q Consensus 307 ~~G~~v~~G~~l~~Ivp~~~~l~v~~~v~~~di~~i~~Gq~v~i~~~a~~~~~~~~~~G~V~~Is~~~~~~~~~~~~~~~ 386 (440) +.|+-++||- .-.|.+||...- ||++|++= ...+. =+|.|++|=|.-. .+....|. T Consensus 1142 ~~GDeLppGV----------~~~VkVYiA~KR--KiqvGDKM-------AGRHG--NKGvvSkIlP~ED-MPfL~DGt-- 1197 (1449) T TIGR02013 1142 EEGDELPPGV----------NKLVKVYIAQKR--KIQVGDKM-------AGRHG--NKGVVSKILPIED-MPFLEDGT-- 1197 (1449) T ss_pred CCCCCCCCCC----------EEEEEEEEEECC--CCCCCCCC-------CCCCC--CCEEEEEECCCCC-CCCCCCCC-- T ss_conf 1788787652----------368899888612--05888843-------55557--8415655245247-78695438-- Q ss_pred EEEEEEECHHH Q ss_conf 99999935011 Q gi|254780386|r 387 YEVILKITDLA 397 (440) Q Consensus 387 y~v~i~~~~~~ 397 (440) +|.|-|+... T Consensus 1198 -PVDivLNPLG 1207 (1449) T TIGR02013 1198 -PVDIVLNPLG 1207 (1449) T ss_pred -EEEEEECCCC T ss_conf -7778978878 No 192 >cd03698 eRF3_II_like eRF3_II_like: domain similar to domain II of the eukaryotic class II release factor (eRF3). In eukaryotes, translation termination is mediated by two interacting release factors, eRF1 and eRF3, which act as class I and II factors, respectively. eRF1 functions as an omnipotent release factor, decoding all three stop codons and triggering the release of the nascent peptide catalyzed by the ribsome. eRF3 is a GTPase, which enhances the termination efficiency by stimulating the eRF1 activity in a GTP-dependent manner. Sequence comparison of class II release factors with elongation factors shows that eRF3 is more similar to eEF1alpha whereas prokaryote RF3 is more similar to EF-G, implying that their precise function may differ. Only eukaryote RF3s are found in this group. Saccharomyces cerevisiae eRF3 (Sup35p) is a translation termination factor which is divided into three regions N, M and a C-terminal eEF1a-like region essential for translation termination. Sup35NM Probab=68.53 E-value=7 Score=17.52 Aligned_cols=42 Identities=12% Similarity=0.302 Sum_probs=17.3 Q ss_pred CCCCCCCCCCCCCCCCCCEEEE-EEECHHHHHHCCCCCCEEEEE Q ss_conf 2124445542103355650477-532622311015898317999 Q gi|254780386|r 310 NYAQQSQPLMKIVPHSKLTYIR-AKVTPKQIQHVKKGYTATVRF 352 (440) Q Consensus 310 ~~v~~G~~l~~Ivp~~~~l~v~-~~v~~~di~~i~~Gq~v~i~~ 352 (440) +.+..|+.+. +.|......|. ..+...++.....|+.|.+++ T Consensus 25 G~i~~Gd~v~-i~P~~~~~~VksI~~~~~~~~~A~aG~~V~l~L 67 (83) T cd03698 25 GSIQKGDTLL-VMPSKESVEVKSIYVDDEEVDYAVAGENVRLKL 67 (83) T ss_pred CEECCCCEEE-EECCCCEEEEEEEEECCEEECEECCCCEEEEEE T ss_conf 5872899999-978998899999999991729888999799998 No 193 >PRK09016 quinolinate phosphoribosyltransferase; Validated Probab=68.20 E-value=6 Score=17.99 Aligned_cols=26 Identities=15% Similarity=0.363 Sum_probs=21.8 Q ss_pred CEEEEEECCCCCEEECCCEEEEECHH Q ss_conf 68999985789893369789999648 Q gi|254780386|r 65 GIIKKFQVENGSQILQGTPLLTFEDI 90 (440) Q Consensus 65 G~V~~i~VkeGd~VkkGq~L~~ld~~ 90 (440) .+--++++++|+.|++|++++++... T Consensus 84 ~v~~~~~~~DG~~v~~G~~i~~i~G~ 109 (296) T PRK09016 84 DVTIEWHVDDGDKITANQTLCELTGP 109 (296) T ss_pred CEEEEEECCCCCCCCCCCEEEEEEEC T ss_conf 73999987989883899999999976 No 194 >KOG2735 consensus Probab=68.19 E-value=6.1 Score=17.91 Aligned_cols=38 Identities=18% Similarity=0.321 Sum_probs=23.8 Q ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHEEEEE Q ss_conf 334576335686168999999999999999876406712 Q gi|254780386|r 3 HFMWKNLKRNHPASLKRMTLLSIIGIIGLITWSILLPIE 41 (440) Q Consensus 3 ~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~w~~~~~~~ 41 (440) .|-+-|+-||||.++|++.-+.++.++ ++.+..|..+| T Consensus 110 afpngpF~RPHPa~WR~vfg~SViYll-~L~FlLFQ~~~ 147 (466) T KOG2735 110 AFPNGPFIRPHPALWRIVFGVSVIYLL-FLQFLLFQTFD 147 (466) T ss_pred HCCCCCCCCCCCHHHHHHHHHHHHHHH-HHHHHHHHHHH T ss_conf 378999788871499998776999999-99999972157 No 195 >PRK06748 hypothetical protein; Validated Probab=68.05 E-value=7.5 Score=17.34 Aligned_cols=35 Identities=23% Similarity=0.222 Sum_probs=27.7 Q ss_pred CCCCCCEEEEECCEEECCCCCCCCCCCCCCCCCCCC Q ss_conf 455541168731212012221244455421033556 Q gi|254780386|r 291 TILSPIAGTIVYNQSFSSSNYAQQSQPLMKIVPHSK 326 (440) Q Consensus 291 ~I~AP~~G~V~~l~~~~~G~~v~~G~~l~~Ivp~~~ 326 (440) .|.||.-|+|..+.+ +.+.+|-...+|+.|-..+. T Consensus 7 ~IyS~~~G~Vekifi-~e~syVyEWEkL~~I~~~dg 41 (84) T PRK06748 7 GVYSPCYGKVEKLFV-TESSYVYEWEKLALIETIDK 41 (84) T ss_pred HHCCCCCCEEEEEEE-CCCCEEEEEEEEEEEEECCC T ss_conf 233666308899998-58753786423457750698 No 196 >KOG0933 consensus Probab=67.33 E-value=0.065 Score=31.10 Aligned_cols=41 Identities=10% Similarity=0.177 Sum_probs=22.5 Q ss_pred CCCEEECCCEEEEECHH-HHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 89893369789999648-998889998743433233210012 Q gi|254780386|r 74 NGSQILQGTPLLTFEDI-ETTDLIDLKKVSIKNLRCHIDTEK 114 (440) Q Consensus 74 eGd~VkkGq~L~~ld~~-~~~~~l~~~~~~l~~l~~~l~~~~ 114 (440) -|-.-+.+++|..|.+. +.+.+++..+.++..++.++.... T Consensus 663 GGs~~~~a~~L~~l~~l~~~~~~~~~~q~el~~le~eL~~le 704 (1174) T KOG0933 663 GGSRSKGADLLRQLQKLKQAQKELRAIQKELEALERELKSLE 704 (1174) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 887887643789879899988999999989999999999878 No 197 >pfam04225 OapA Opacity-associated protein A LysM-like domain. This family includes the Haemophilus influenzae opacity-associated protein. This protein is required for efficient nasopharyngeal mucosal colonisation, and its expression is associated with a distinctive transparent colony phenotype. OapA is thought to be a secreted protein, and its expression exhibits high-frequency phase variation. This is a LysM-like domain. Probab=67.21 E-value=1.4 Score=22.19 Aligned_cols=44 Identities=11% Similarity=0.246 Sum_probs=30.2 Q ss_pred CCCCCCCCCCCCCCC----CCEE--EEEEECHHHHHHCCCCCCEEEEEEE Q ss_conf 124445542103355----6504--7753262231101589831799981 Q gi|254780386|r 311 YAQQSQPLMKIVPHS----KLTY--IRAKVTPKQIQHVKKGYTATVRFPH 354 (440) Q Consensus 311 ~v~~G~~l~~Ivp~~----~~l~--v~~~v~~~di~~i~~Gq~v~i~~~a 354 (440) .|++|++|++|.-+. +.++ +.+.-+...+.++++||.+.+++++ T Consensus 6 ~V~~GdTLs~iFr~~~L~~sDv~~~~~~~g~~k~Ls~Lk~Gq~v~~~~n~ 55 (85) T pfam04225 6 QVPKGDTLMQVFRDNNLPISDVNAMAKVEGADKPLSNIKSGQLVRIKLNA 55 (85) T ss_pred EECCCCCHHHHHHHCCCCHHHHHHHHHCCCCCCCHHHCCCCCEEEEEECC T ss_conf 80799859999988299989999998611567805445899999999999 No 198 >PRK10617 cytochrome c-type protein NapC; Provisional Probab=66.90 E-value=4.3 Score=18.91 Aligned_cols=32 Identities=31% Similarity=0.354 Sum_probs=19.9 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHEEE Q ss_conf 5763356861689999999999999998764067 Q gi|254780386|r 6 WKNLKRNHPASLKRMTLLSIIGIIGLITWSILLP 39 (440) Q Consensus 6 ~~~~~~~~p~~~~~~~~~~~~~~~~~~~w~~~~~ 39 (440) |.-|++|+ .+.+.+++++.+++|+++|+.|.. T Consensus 15 w~~~~~ps--~~~lg~l~~iG~~~Gii~w~gf~~ 46 (200) T PRK10617 15 WKWWRTPS--RLALGTLLLIGFVGGIIFWGGFNT 46 (200) T ss_pred HHHHCCCH--HHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99975815--779999999999999999999999 No 199 >pfam10481 Cenp-F_N Cenp-F N-terminal domain. Mitosin or centromere-associated protein-F (Cenp-F) is found bound across the centromere as one of the proteins of the outer layer of the kinetochore. Most of the kinetochore/centromere functions appear to depend upon binding of the C-terminal par to f the molecule, whereas the N-terminal part, here, may be a cytoplasmic player in controlling the function of microtubules and dynein. Probab=66.87 E-value=2.5 Score=20.57 Aligned_cols=19 Identities=11% Similarity=0.052 Sum_probs=7.9 Q ss_pred HHHHHHHHHHHHHHHHHHH Q ss_conf 8889998743433233210 Q gi|254780386|r 93 TDLIDLKKVSIKNLRCHID 111 (440) Q Consensus 93 ~~~l~~~~~~l~~l~~~l~ 111 (440) -..+.+++.+++.|..+.. T Consensus 17 LqKI~elE~QlekLkKE~q 35 (288) T pfam10481 17 LQKIQELEGQLDKLKKEKQ 35 (288) T ss_pred HHHHHHHHHHHHHHHHHHH T ss_conf 9999999999999989988 No 200 >PRK09629 bifunctional thiosulfate sulfurtransferase/phosphatidylserine decarboxylase; Provisional Probab=66.73 E-value=5.5 Score=18.24 Aligned_cols=80 Identities=18% Similarity=0.257 Sum_probs=48.5 Q ss_pred CCCCCCCEEEEECCEEECCCCCCCCCC---CCCCCC--------CCCCCEEEEEEECHHHHHHCCCCCCEEEEEEECCCC Q ss_conf 345554116873121201222124445---542103--------355650477532622311015898317999812887 Q gi|254780386|r 290 KTILSPIAGTIVYNQSFSSSNYAQQSQ---PLMKIV--------PHSKLTYIRAKVTPKQIQHVKKGYTATVRFPHYADI 358 (440) Q Consensus 290 ~~I~AP~~G~V~~l~~~~~G~~v~~G~---~l~~Iv--------p~~~~l~v~~~v~~~di~~i~~Gq~v~i~~~a~~~~ 358 (440) ..|.||.||.|..+-....|..+++-- ++..++ +-.+..++..|+++.|.-+|+-=..-+++ . T Consensus 406 ~~ivSP~DG~isq~G~I~~~~l~QAKG~~Ysl~~LLggd~~~a~~f~~G~F~TIYLsP~DYHRiHmP~~G~l~--~---- 479 (610) T PRK09629 406 GAILSPADGAISQLGPIDHGRIFQAKGHSFSVLELLGGDPKLSAPFMGGEFATVYLSPKDYHRVHMPLAGTLR--E---- 479 (610) T ss_pred CCEECCCCCCEEEEEEECCCCEEEECCCCCCHHHHHCCCHHHHHHCCCCEEEEEEECCCCCCEEECCCCCEEE--E---- T ss_conf 8478788862231037458958985899556999849598787113597189999786663306156785798--9---- Q ss_pred CCCEEEEEEEEEECCCCC Q ss_conf 544399999997133444 Q gi|254780386|r 359 REKKFKAIIEKIDPIISQ 376 (440) Q Consensus 359 ~~~~~~G~V~~Is~~~~~ 376 (440) .-.++|..-+|.|.+.. T Consensus 480 -~~yvPG~LfSVNp~t~~ 496 (610) T PRK09629 480 -MVYVPGRIFSVNQTTAE 496 (610) T ss_pred -EEEECCCCCCCCHHHHH T ss_conf -99947766531889984 No 201 >PRK06543 nicotinate-nucleotide pyrophosphorylase; Provisional Probab=66.68 E-value=6.7 Score=17.63 Aligned_cols=28 Identities=21% Similarity=0.247 Sum_probs=23.5 Q ss_pred CCCEEEEEECCCCCEEECCCEEEEECHH Q ss_conf 8868999985789893369789999648 Q gi|254780386|r 63 FSGIIKKFQVENGSQILQGTPLLTFEDI 90 (440) Q Consensus 63 ~~G~V~~i~VkeGd~VkkGq~L~~ld~~ 90 (440) -.|.--.+++++|+.|++|++++.+... T Consensus 62 ~~~~~~~~~~~dG~~v~~g~~i~~i~G~ 89 (281) T PRK06543 62 DPAITVELAVADGERFEAGDILATVTGS 89 (281) T ss_pred CCCEEEEEEECCCCCCCCCCEEEEEEEC T ss_conf 9975999980899881799899999736 No 202 >PRK07896 nicotinate-nucleotide pyrophosphorylase; Provisional Probab=65.62 E-value=8.3 Score=17.03 Aligned_cols=80 Identities=13% Similarity=0.117 Sum_probs=40.7 Q ss_pred CCCCHHHHHHHHHHHHHHHHHH-HHHHEEEEEEEEEEEEEEEECCCEEEEE-----------CCCCCEEEEEECCCCCEE Q ss_conf 5686168999999999999999-8764067125899979999538738984-----------688868999985789893 Q gi|254780386|r 11 RNHPASLKRMTLLSIIGIIGLI-TWSILLPIEISVSSSGEILNEDNVVEIK-----------SPFSGIIKKFQVENGSQI 78 (440) Q Consensus 11 ~~~p~~~~~~~~~~~~~~~~~~-~w~~~~~~~~~v~~~G~i~~~~~~~~I~-----------s~~~G~V~~i~VkeGd~V 78 (440) -.+|..++.++-..+.==++++ =+.+.+-++....+++.+....+-..-- ....|.--++++++|+.| T Consensus 8 ~l~p~~i~~~I~~aL~ED~~~g~DlTt~~li~~~~~~~a~i~ake~gvl~G~~~a~~vf~~l~~~~~~~v~~~~~DG~~v 87 (288) T PRK07896 8 GLEPDEARAVIARALDEDLRYGPDVTTVATVPADAVATASMVTREAGVVAGLDVALLVLDEVLGTDGYRVLDRVEDGARV 87 (288) T ss_pred CCCHHHHHHHHHHHHHHCCCCCCCCCCHHHCCCCCEEEEEEEECCCEEEECHHHHHHHHHHHCCCCCEEEEEECCCCCEE T ss_conf 97999999999999982099998836573488887599999968997997289999999984789984999986889890 Q ss_pred ECCCEEEEECHH Q ss_conf 369789999648 Q gi|254780386|r 79 LQGTPLLTFEDI 90 (440) Q Consensus 79 kkGq~L~~ld~~ 90 (440) ++||+++++... T Consensus 88 ~~G~~i~~i~G~ 99 (288) T PRK07896 88 PPGQALLTVTAP 99 (288) T ss_pred CCCCEEEEEEEC T ss_conf 799899999967 No 203 >pfam04111 APG6 Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either associated with the membrane or in a retrieval step of the carboxypeptidase Y sorting pathway. Probab=65.53 E-value=6.8 Score=17.63 Aligned_cols=14 Identities=7% Similarity=0.079 Sum_probs=5.3 Q ss_pred CEEEEECCEEECCC Q ss_conf 11687312120122 Q gi|254780386|r 296 IAGTIVYNQSFSSS 309 (440) Q Consensus 296 ~~G~V~~l~~~~~G 309 (440) .||.+-.+|..-.| T Consensus 166 hdG~fGTINglRLG 179 (356) T pfam04111 166 HSGPFATINGLRLG 179 (356) T ss_pred ECCCCEEECCCCCC T ss_conf 54883546464027 No 204 >pfam05557 MAD Mitotic checkpoint protein. This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in yeast and higher eukaryotes. In S.cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated. Probab=65.47 E-value=5 Score=18.47 Aligned_cols=30 Identities=17% Similarity=0.151 Sum_probs=14.0 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 122222222122233221222221234555 Q gi|254780386|r 265 DLEQNQRTFADEMSKLTILEKTREQKTILS 294 (440) Q Consensus 265 ~l~e~~~~l~~~~~~l~~l~~~l~~~~I~A 294 (440) ++.+.+.....++.++..++.++++..+++ T Consensus 511 ei~~Le~E~~rLr~e~~~LE~~le~~~L~g 540 (722) T pfam05557 511 KIETLERERDRLRQEKSLLEMKLEHLCLQG 540 (722) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 989999899999999999999999987536 No 205 >pfam08206 OB_RNB Ribonuclease B OB domain. This family includes the N-terminal OB domain found in ribonuclease B proteins in one or two copies. Probab=64.81 E-value=8.6 Score=16.93 Aligned_cols=45 Identities=20% Similarity=0.302 Sum_probs=34.4 Q ss_pred CCCCCCEEEEEEECHHHHHHCCCCCCEEEEEEECCCCCCCEEEEEEEEE Q ss_conf 3355650477532622311015898317999812887544399999997 Q gi|254780386|r 322 VPHSKLTYIRAKVTPKQIQHVKKGYTATVRFPHYADIREKKFKAIIEKI 370 (440) Q Consensus 322 vp~~~~l~v~~~v~~~di~~i~~Gq~v~i~~~a~~~~~~~~~~G~V~~I 370 (440) +|++.. =+.|||+.+.+..--|+.|.+++.. ....+.-.|+|..| T Consensus 14 ~~d~~~--~DifI~~~~l~~A~~GD~V~v~v~~--~~~~~r~eG~vv~I 58 (58) T pfam08206 14 IPDDEE--DDIFIPPEQMKKAMHGDRVLVRITK--GDRRGRREGRIVRI 58 (58) T ss_pred EECCCC--CCEEECHHHHCCCCCCCEEEEEEEC--CCCCCCCCEEEEEC T ss_conf 868998--8999998997456599999999953--88899836999989 No 206 >pfam00038 Filament Intermediate filament protein. Probab=64.49 E-value=5.5 Score=18.23 Aligned_cols=29 Identities=17% Similarity=0.117 Sum_probs=9.8 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 88899987434332332100123333222 Q gi|254780386|r 93 TDLIDLKKVSIKNLRCHIDTEKSALDFLN 121 (440) Q Consensus 93 ~~~l~~~~~~l~~l~~~l~~~~~~l~~~~ 121 (440) +..++........+..++...+..++.+. T Consensus 60 r~~ld~~~~ek~~le~e~~~l~~e~e~~k 88 (312) T pfam00038 60 RKQLDELTNERARLQLEIDNLRLAAEDFR 88 (312) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999889999998711898999999 No 207 >pfam05667 DUF812 Protein of unknown function (DUF812). This family consists of several eukaryotic proteins of unknown function. Probab=64.22 E-value=0.16 Score=28.52 Aligned_cols=20 Identities=0% Similarity=-0.009 Sum_probs=9.4 Q ss_pred CCCCCCCCCCCCCCCCCCCC Q ss_conf 22212223322122222123 Q gi|254780386|r 271 RTFADEMSKLTILEKTREQK 290 (440) Q Consensus 271 ~~l~~~~~~l~~l~~~l~~~ 290 (440) ....+++.+++.+..+|.|+ T Consensus 454 ~DTR~LQkeiN~~~~~L~Rt 473 (536) T pfam05667 454 SDTRSLQKEINNITGKLDRT 473 (536) T ss_pred HHHHHHHHHHHHHHHHHHHH T ss_conf 99999999998888888407 No 208 >cd03695 CysN_NodQ_II CysN_NodQ_II: This subfamily represents the domain II of the large subunit of ATP sulfurylase (ATPS): CysN or the N-terminal portion of NodQ, found mainly in proteobacteria and homologous to the domain II of EF-Tu. Escherichia coli ATPS consists of CysN and a smaller subunit CysD and CysN. ATPS produces adenosine-5'-phosphosulfate (APS) from ATP and sulfate, coupled with GTP hydrolysis. In the subsequent reaction APS is phosphorylated by an APS kinase (CysC), to produce 3'-phosphoadenosine-5'-phosphosulfate (PAPS) for use in amino acid (aa) biosynthesis. The Rhizobiaceae group (alpha-proteobacteria) appears to carry out the same chemistry for the sufation of a nodulation factor. In Rhizobium meliloti, a the hererodimeric complex comprised of NodP and NodQ appears to possess both ATPS and APS kinase activities. The N and C termini of NodQ correspond to CysN and CysC, respectively. Other eubacteria, Archaea, and eukaryotes use a different ATP sulfurylase, which sho Probab=64.22 E-value=8.8 Score=16.85 Aligned_cols=41 Identities=10% Similarity=0.266 Sum_probs=20.7 Q ss_pred CCCCCCCCCCCCCCCCEEE-EEEECHHHHHHCCCCCCEEEEEE Q ss_conf 2444554210335565047-75326223110158983179998 Q gi|254780386|r 312 AQQSQPLMKIVPHSKLTYI-RAKVTPKQIQHVKKGYTATVRFP 353 (440) Q Consensus 312 v~~G~~l~~Ivp~~~~l~v-~~~v~~~di~~i~~Gq~v~i~~~ 353 (440) +..|+.|. |.|++..-.| ...+...+......|+.|.|.++ T Consensus 27 v~~GD~v~-i~Psg~~a~Vk~I~~~~~~~~~A~aG~~v~l~L~ 68 (81) T cd03695 27 IRVGDEVV-VLPSGKTSRVKSIETFDGELDEAGAGESVTLTLE 68 (81) T ss_pred EECCCEEE-ECCCCCCEEEEEEEECCCCCCEECCCCEEEEEEC T ss_conf 83899999-6899981899899958998248879982899978 No 209 >cd01568 QPRTase_NadC Quinolinate phosphoribosyl transferase (QAPRTase or QPRTase), also called nicotinate-nucleotide pyrophosphorylase, is involved in the de novo synthesis of NAD in both prokaryotes and eukaryotes. It catalyses the reaction of quinolinic acid (QA) with 5-phosphoribosyl-1-pyrophosphate (PRPP) in the presence of Mg2+ to produce nicotinic acid mononucleotide (NAMN), pyrophosphate and carbon dioxide. QPRTase functions as a homodimer with two active sites, each formed by the C-terminal region of one subunit and the N-terminal region of the other. Probab=63.29 E-value=3.7 Score=19.37 Aligned_cols=28 Identities=29% Similarity=0.493 Sum_probs=23.8 Q ss_pred CCCEEEEEECCCCCEEECCCEEEEECHH Q ss_conf 8868999985789893369789999648 Q gi|254780386|r 63 FSGIIKKFQVENGSQILQGTPLLTFEDI 90 (440) Q Consensus 63 ~~G~V~~i~VkeGd~VkkGq~L~~ld~~ 90 (440) .+|.--.+++++|+.|++|++++++... T Consensus 54 ~~~~~~~~~~~dG~~v~~g~~i~~i~G~ 81 (269) T cd01568 54 LDGIEVEWLVKDGDRVEAGQVLLEVEGP 81 (269) T ss_pred CCCEEEEEECCCCCCCCCCCEEEEEEEC T ss_conf 5981999987888873799689999977 No 210 >COG0213 DeoA Thymidine phosphorylase [Nucleotide transport and metabolism] Probab=63.14 E-value=6.9 Score=17.57 Aligned_cols=22 Identities=5% Similarity=-0.012 Sum_probs=13.2 Q ss_pred CCCCCCCCCEEEEECCEEECCC Q ss_conf 1234555411687312120122 Q gi|254780386|r 288 EQKTILSPIAGTIVYNQSFSSS 309 (440) Q Consensus 288 ~~~~I~AP~~G~V~~l~~~~~G 309 (440) -...|.|..+|+|..+.....| T Consensus 335 ~~~~v~A~~~G~v~~id~~~ig 356 (435) T COG0213 335 YTAEVKAQTSGYVSEIDARAIG 356 (435) T ss_pred EEEEEECCCCEEEEEECHHHHH T ss_conf 5888864676268762558999 No 211 >pfam06447 consensus Probab=63.04 E-value=9.3 Score=16.71 Aligned_cols=33 Identities=12% Similarity=0.082 Sum_probs=18.4 Q ss_pred CCCCEEEEEEEECEEEHHHHHHHHHHHHHHHHHC Q ss_conf 2377569999922054799999999989988731 Q gi|254780386|r 406 RNGFPAEVLFTAKNTTLAQEIIHPITDNWPKIFE 439 (440) Q Consensus 406 ~~Gm~~~~~I~~~~~~~~~~~~~p~~~~~~~~f~ 439 (440) +.||.+.+.-.-|+- +-.-++.=++..|.++|. T Consensus 300 k~GirG~vV~k~G~~-l~~A~~aG~l~G~g~~~~ 332 (407) T pfam06447 300 KNGIRGRLVSRNGQI-IAKAGGAGFLQGIGGGFS 332 (407) T ss_pred CCCCCCEEECCCHHH-HHHHHHHHHHHHHHHHHH T ss_conf 589775187465599-999999888888887641 No 212 >cd01573 modD_like ModD; Quinolinate phosphoribosyl transferase (QAPRTase or QPRTase) present in some modABC operons in bacteria, which are involved in molybdate transport. In general, QPRTases are part of the de novo synthesis pathway of NAD in both prokaryotes and eukaryotes. They catalyse the reaction of quinolinic acid (QA) with 5-phosphoribosyl-1-pyrophosphate (PRPP) in the presence of Mg2+ to produce nicotinic acid mononucleotide (NAMN), pyrophosphate and carbon dioxide. Probab=62.06 E-value=8.4 Score=17.01 Aligned_cols=22 Identities=27% Similarity=0.416 Sum_probs=19.6 Q ss_pred EEECCCCCEEECCCEEEEECHH Q ss_conf 9985789893369789999648 Q gi|254780386|r 69 KFQVENGSQILQGTPLLTFEDI 90 (440) Q Consensus 69 ~i~VkeGd~VkkGq~L~~ld~~ 90 (440) .+++++|+.|++|++++++... T Consensus 59 ~~~~~dG~~v~~g~~i~~i~G~ 80 (272) T cd01573 59 DLAAASGSRVAAGAVLLEAEGP 80 (272) T ss_pred EEECCCCCEECCCCEEEEEEEC T ss_conf 9994879890799899999977 No 213 >KOG0976 consensus Probab=61.65 E-value=0.31 Score=26.60 Aligned_cols=11 Identities=27% Similarity=0.289 Sum_probs=4.9 Q ss_pred EEEEEECEEEH Q ss_conf 99999220547 Q gi|254780386|r 412 EVLFTAKNTTL 422 (440) Q Consensus 412 ~~~I~~~~~~~ 422 (440) ++.|+-.-++. T Consensus 887 nvliIK~~~r~ 897 (1265) T KOG0976 887 NVLIIKDLERL 897 (1265) T ss_pred EEEEECCHHHH T ss_conf 14455121535 No 214 >PRK05035 electron transport complex protein RnfC; Provisional Probab=61.59 E-value=4.5 Score=18.78 Aligned_cols=26 Identities=27% Similarity=0.448 Sum_probs=21.7 Q ss_pred CCEEEEEECCCCCEEECCCEEEEECH Q ss_conf 86899998578989336978999964 Q gi|254780386|r 64 SGIIKKFQVENGSQILQGTPLLTFED 89 (440) Q Consensus 64 ~G~V~~i~VkeGd~VkkGq~L~~ld~ 89 (440) -|.-....|+.||.|.+||+|.+-+. T Consensus 51 iGap~~p~VkvGD~VlkGQ~I~~~~g 76 (725) T PRK05035 51 IGAEGELLVSVGDRVLKGQPLTQGDG 76 (725) T ss_pred CCCCCCCCCCCCCEECCCCEEEECCC T ss_conf 99877141478999768887454698 No 215 >cd01572 QPRTase Quinolinate phosphoribosyl transferase (QAPRTase or QPRTase), also called nicotinate-nucleotide pyrophosphorylase, is involved in the de novo synthesis of NAD in both prokaryotes and eukaryotes. It catalyses the reaction of quinolinic acid (QA) with 5-phosphoribosyl-1-pyrophosphate (PRPP) in the presence of Mg2+ to produce nicotinic acid mononucleotide (NAMN), pyrophosphate and carbon dioxide. QPRTase functions as a homodimer with two active sites, each formed by the C-terminal region of one subunit and the N-terminal region of the other. Probab=61.45 E-value=8.5 Score=16.95 Aligned_cols=27 Identities=30% Similarity=0.453 Sum_probs=23.2 Q ss_pred CCEEEEEECCCCCEEECCCEEEEECHH Q ss_conf 868999985789893369789999648 Q gi|254780386|r 64 SGIIKKFQVENGSQILQGTPLLTFEDI 90 (440) Q Consensus 64 ~G~V~~i~VkeGd~VkkGq~L~~ld~~ 90 (440) +|.--.+++++|+.|++|++++++... T Consensus 56 ~~~~~~~~~~dG~~v~~g~~i~~i~G~ 82 (268) T cd01572 56 PGIEVEWLVKDGDRVEPGQVLATVEGP 82 (268) T ss_pred CCCEEEEECCCCCCCCCCCEEEEEEEC T ss_conf 986999985889880799899999975 No 216 >PRK08072 nicotinate-nucleotide pyrophosphorylase; Provisional Probab=61.33 E-value=9.9 Score=16.51 Aligned_cols=27 Identities=15% Similarity=0.243 Sum_probs=22.6 Q ss_pred CCEEEEEECCCCCEEECCCEEEEECHH Q ss_conf 868999985789893369789999648 Q gi|254780386|r 64 SGIIKKFQVENGSQILQGTPLLTFEDI 90 (440) Q Consensus 64 ~G~V~~i~VkeGd~VkkGq~L~~ld~~ 90 (440) ++.--.+++++|+.|++||+++++... T Consensus 62 ~~~~~~~~~~dG~~v~~g~~i~~i~G~ 88 (277) T PRK08072 62 ERIEVELHKKDGDLVEKGEIIATVQGP 88 (277) T ss_pred CCEEEEEECCCCCEECCCCEEEEEEEC T ss_conf 965999982899881699868999806 No 217 >PRK06096 molybdenum transport protein ModD; Provisional Probab=60.84 E-value=7.9 Score=17.19 Aligned_cols=25 Identities=24% Similarity=0.244 Sum_probs=20.8 Q ss_pred EEEEEECCCCCEEECCCEEEEECHH Q ss_conf 8999985789893369789999648 Q gi|254780386|r 66 IIKKFQVENGSQILQGTPLLTFEDI 90 (440) Q Consensus 66 ~V~~i~VkeGd~VkkGq~L~~ld~~ 90 (440) .--.+++++|+.|++|++++++... T Consensus 61 ~~v~~~~~dG~~v~~G~~i~~i~G~ 85 (284) T PRK06096 61 LTIDDAVSDGSQANAGQRLIRAQGN 85 (284) T ss_pred CEEEEEECCCCCCCCCCEEEEEEEC T ss_conf 9999991889882799899999956 No 218 >PRK10803 hypothetical protein; Provisional Probab=60.72 E-value=10 Score=16.44 Aligned_cols=16 Identities=19% Similarity=0.256 Sum_probs=9.5 Q ss_pred HHHHHHHHHEEEEEEE Q ss_conf 9999987640671258 Q gi|254780386|r 28 IIGLITWSILLPIEIS 43 (440) Q Consensus 28 ~~~~~~w~~~~~~~~~ 43 (440) +++...|++|++-++. T Consensus 16 ~~~~~~~~a~a~apv~ 31 (262) T PRK10803 16 VGIAAPWAAFAQAPIS 31 (262) T ss_pred HHHHHHHHHCCCCCHH T ss_conf 9986078770344376 No 219 >pfam04952 AstE_AspA Succinylglutamate desuccinylase / Aspartoacylase family. This family includes Succinylglutamate desuccinylase EC:3.1.-.- that catalyses the fifth and last step in arginine catabolism by the arginine succinyltransferase pathway. The family also include aspartoacylase EC:3.5.1.15 which cleaves acylaspartate into a fatty acid and aspartate. Mutations in human aspartoacylase (ACY-2) lead to Canavan disease. This family is probably structurally related to pfam00246 (Bateman A pers. obs.). Probab=60.48 E-value=10 Score=16.42 Aligned_cols=29 Identities=14% Similarity=0.130 Sum_probs=22.8 Q ss_pred CCCCCCCEEEEECCEEECCCCCCCCCCCCCC Q ss_conf 3455541168731212012221244455421 Q gi|254780386|r 290 KTILSPIAGTIVYNQSFSSSNYAQQSQPLMK 320 (440) Q Consensus 290 ~~I~AP~~G~V~~l~~~~~G~~v~~G~~l~~ 320 (440) ..++||.+|.+.. . ...|+.|..|+.+.. T Consensus 218 ~~~~a~~~Gl~~~-~-~~~g~~V~~G~~l~~ 246 (289) T pfam04952 218 RDIRAEAAGLVEP-H-VNLGDDVEAGDLLPG 246 (289) T ss_pred EECCCCCCEEEEE-E-CCCCCCEECCCCCCC T ss_conf 1303776469999-4-278996627864177 No 220 >PRK07428 nicotinate-nucleotide pyrophosphorylase; Provisional Probab=59.94 E-value=4.9 Score=18.58 Aligned_cols=22 Identities=18% Similarity=0.167 Sum_probs=19.4 Q ss_pred EEECCCCCEEECCCEEEEECHH Q ss_conf 9985789893369789999648 Q gi|254780386|r 69 KFQVENGSQILQGTPLLTFEDI 90 (440) Q Consensus 69 ~i~VkeGd~VkkGq~L~~ld~~ 90 (440) ..++++|+.|++|++++++... T Consensus 72 ~~~~~dG~~v~~g~~i~~i~G~ 93 (285) T PRK07428 72 TPTVAEGEACESGQVVAEIEGP 93 (285) T ss_pred EEECCCCCEECCCCEEEEEEEC T ss_conf 9884889853489789999987 No 221 >TIGR02162 torC trimethylamine-N-oxide reductase c-type cytochrome TorC; InterPro: IPR009154 This family includes TorC, a pentahemic c-type cytochrome subunit of periplasmic reductases for trimethylamine-N-oxide (TMAO). The N-terminal half is closely related to tetrahemic NapC (or NirT) subunits of periplasmic nitrate (or nitrite) reductases; some species have both TMAO and nitrate reductase complexes .; GO: 0005506 iron ion binding, 0009055 electron carrier activity, 0020037 heme binding, 0006118 electron transport, 0009276 1-2nm peptidoglycan-based cell wall, 0016021 integral to membrane. Probab=59.22 E-value=2.4 Score=20.61 Aligned_cols=33 Identities=27% Similarity=0.422 Sum_probs=22.1 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHEE Q ss_conf 4576335686168999999999999999876406 Q gi|254780386|r 5 MWKNLKRNHPASLKRMTLLSIIGIIGLITWSILL 38 (440) Q Consensus 5 ~~~~~~~~~p~~~~~~~~~~~~~~~~~~~w~~~~ 38 (440) ||+-|.||+-+ +-+++++++-+++|+++|.-|- T Consensus 3 ~w~~~~rPS~k-~slGvL~~~G~iaG~~~w~~F~ 35 (394) T TIGR02162 3 IWKAFTRPSAK-ISLGVLVVIGFIAGAILWQGFN 35 (394) T ss_pred HHHHHHCHHHH-HHHHHHHHHHHHEEEEEECCCC T ss_conf 13444040588-9999999850530212453431 No 222 >PRK05848 nicotinate-nucleotide pyrophosphorylase; Provisional Probab=58.93 E-value=9.7 Score=16.59 Aligned_cols=23 Identities=13% Similarity=0.351 Sum_probs=20.1 Q ss_pred EEEECCCCCEEECCCEEEEECHH Q ss_conf 99985789893369789999648 Q gi|254780386|r 68 KKFQVENGSQILQGTPLLTFEDI 90 (440) Q Consensus 68 ~~i~VkeGd~VkkGq~L~~ld~~ 90 (440) -.+++++|+.|++|++++++... T Consensus 59 v~~~~~dG~~v~~g~~i~~i~G~ 81 (272) T PRK05848 59 CVWTIKDGESFKKGDTLMEIRGD 81 (272) T ss_pred EEEEECCCCEECCCCEEEEEEEC T ss_conf 99990889890699999999977 No 223 >PRK00809 hypothetical protein; Provisional Probab=58.65 E-value=11 Score=16.21 Aligned_cols=37 Identities=19% Similarity=0.229 Sum_probs=23.7 Q ss_pred CHHHHHHCCCCCCEEEEEEECCCCCCCEEEEEEEEEECC Q ss_conf 622311015898317999812887544399999997133 Q gi|254780386|r 335 TPKQIQHVKKGYTATVRFPHYADIREKKFKAIIEKIDPI 373 (440) Q Consensus 335 ~~~di~~i~~Gq~v~i~~~a~~~~~~~~~~G~V~~Is~~ 373 (440) ...-+.+|++|+.+.+.+..+.+.+ .+.+.+.-|... T Consensus 28 ~k~~~~kikpGD~li~Yv~~~~g~~--~~~~~~vg~~eV 64 (144) T PRK00809 28 YKNTISKVKPGDKLIIYVMKYKGAE--RLPGKIVGIYEV 64 (144) T ss_pred HHHHHHHCCCCCEEEEEEECCCCCC--CCCCEEEEEEEE T ss_conf 6668850799988999984455884--157369999996 No 224 >KOG0804 consensus Probab=58.63 E-value=10 Score=16.49 Aligned_cols=32 Identities=16% Similarity=0.093 Sum_probs=17.5 Q ss_pred EEECCCCCEEEEEECCCCCEEECCC-EEEEECHH Q ss_conf 9846888689999857898933697-89999648 Q gi|254780386|r 58 EIKSPFSGIIKKFQVENGSQILQGT-PLLTFEDI 90 (440) Q Consensus 58 ~I~s~~~G~V~~i~VkeGd~VkkGq-~L~~ld~~ 90 (440) .+++...|.+...- .+|+.-.+++ ..+.+..+ T Consensus 291 l~~~~~dGklve~~-~~~~~~~~~~~~~~~~~~s 323 (493) T KOG0804 291 LPQSKTDGKLVESS-TEGDDSRKDDCDSLELEYS 323 (493) T ss_pred CCCCCCCCCEEEEC-CCCCCCCCCCCCEEEEECC T ss_conf 43567787368840-5554122567453776334 No 225 >COG1463 Ttg2C ABC-type transport system involved in resistance to organic solvents, periplasmic component [Secondary metabolites biosynthesis, transport, and catabolism] Probab=58.00 E-value=11 Score=16.14 Aligned_cols=23 Identities=13% Similarity=0.387 Sum_probs=11.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHE Q ss_conf 16899999999999999987640 Q gi|254780386|r 15 ASLKRMTLLSIIGIIGLITWSIL 37 (440) Q Consensus 15 ~~~~~~~~~~~~~~~~~~~w~~~ 37 (440) .++..++++.++.+++|++|+.. T Consensus 8 ~~v~~f~~~~l~~~~~~~~~~~~ 30 (359) T COG1463 8 VLVGLFLLLGLLAVLLFVLWLAT 30 (359) T ss_pred HHHHHHHHHHHHHHHHHHHHHHC T ss_conf 89999999999999999998610 No 226 >CHL00117 rpoC2 RNA polymerase beta'' subunit; Reviewed Probab=57.97 E-value=11 Score=16.14 Aligned_cols=20 Identities=10% Similarity=0.112 Sum_probs=18.1 Q ss_pred EEECCCCCEEECCCEEEEEC Q ss_conf 99857898933697899996 Q gi|254780386|r 69 KFQVENGSQILQGTPLLTFE 88 (440) Q Consensus 69 ~i~VkeGd~VkkGq~L~~ld 88 (440) -++|++|+.|+++|+|+++. T Consensus 408 lL~V~n~q~V~s~QvIAEi~ 427 (1350) T CHL00117 408 LLLVQNDQYVESEQVIAEIR 427 (1350) T ss_pred EEEEECCCEEECCCEEEEEE T ss_conf 89998995875161899970 No 227 >cd03693 EF1_alpha_II EF1_alpha_II: this family represents the domain II of elongation factor 1-alpha (EF-1a) that is found in archaea and all eukaryotic lineages. EF-1A is very abundant in the cytosol, where it is involved in the GTP-dependent binding of aminoacyl-tRNAs to the A site of the ribosomes in the second step of translation from mRNAs to proteins. Both domain II of EF1A and domain IV of IF2/eIF5B have been implicated in recognition of the 3'-ends of tRNA. More than 61% of eukaryotic elongation factor 1A (eEF-1A) in cells is estimated to be associated with actin cytoskeleton. The binding of eEF1A to actin is a noncanonical function that may link two distinct cellular processes, cytoskeleton organization and gene expression. Probab=57.33 E-value=12 Score=16.07 Aligned_cols=25 Identities=12% Similarity=0.154 Sum_probs=11.5 Q ss_pred HHHCCCCCCEEEEEEECCCCCCCEEEEEEEEEE Q ss_conf 110158983179998128875443999999971 Q gi|254780386|r 339 IQHVKKGYTATVRFPHYADIREKKFKAIIEKID 371 (440) Q Consensus 339 i~~i~~Gq~v~i~~~a~~~~~~~~~~G~V~~Is 371 (440) -+.|++|+.+.+ +|.. ..++|.+|- T Consensus 28 ~G~i~~gd~v~i----~P~~----~~~~VksI~ 52 (91) T cd03693 28 TGVLKPGMVVTF----APAG----VTGEVKSVE 52 (91) T ss_pred ECEEECCCEEEE----CCCC----CEEEEEEEE T ss_conf 177857999772----7864----337999999 No 228 >KOG2264 consensus Probab=56.87 E-value=12 Score=16.02 Aligned_cols=51 Identities=14% Similarity=0.249 Sum_probs=24.9 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 874343323321001233332221145554223568888776554678865 Q gi|254780386|r 99 KKVSIKNLRCHIDTEKSALDFLNNKYDVLEKAFESREEQLLDFLGKYHSTC 149 (440) Q Consensus 99 ~~~~l~~l~~~l~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 149 (440) ...++.+++.+.+.++++++.++..++.+++.+...+.++..+.....+.+ T Consensus 91 Vs~EL~ele~krqel~seI~~~n~kiEelk~~i~~~q~eL~~Lk~~ieqaq 141 (907) T KOG2264 91 VSLELTELEVKRQELNSEIEEINTKIEELKRLIPQKQLELSALKGEIEQAQ 141 (907) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 879999889888887747999999999999887776778998770799999 No 229 >pfam10574 UPF0552 Uncharacterized protein family UPF0552. This family of proteins has no known function. Probab=56.76 E-value=12 Score=16.01 Aligned_cols=43 Identities=9% Similarity=0.106 Sum_probs=30.8 Q ss_pred EEEEEEECHHHHHH--CCCCCCEEEEEEECCCCCCCEEEEEEEEEECCCC Q ss_conf 04775326223110--1589831799981288754439999999713344 Q gi|254780386|r 328 TYIRAKVTPKQIQH--VKKGYTATVRFPHYADIREKKFKAIIEKIDPIIS 375 (440) Q Consensus 328 l~v~~~v~~~di~~--i~~Gq~v~i~~~a~~~~~~~~~~G~V~~Is~~~~ 375 (440) -.+..++|+.++.+ +.+|+.|+++-- ..|.|-..+..++..+. T Consensus 136 ~~faFW~~E~e~~~~ele~G~evRlKT~-----GDgpFi~SlaKlD~~tv 180 (224) T pfam10574 136 ETFAFWAPESEMDKTELELGQEVRLKTR-----GDGPFIFSLAKMDSGTV 180 (224) T ss_pred CEEEEECCHHHHHCCCCCCCCEEEEEEC-----CCCCCEEEEEEECCCCE T ss_conf 5576652677731014136875899851-----78741444432025715 No 230 >PRK13454 F0F1 ATP synthase subunit B'; Provisional Probab=56.55 E-value=12 Score=16.08 Aligned_cols=31 Identities=13% Similarity=0.107 Sum_probs=17.3 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHH-HHHEEE Q ss_conf 763356861689999999999999998-764067 Q gi|254780386|r 7 KNLKRNHPASLKRMTLLSIIGIIGLIT-WSILLP 39 (440) Q Consensus 7 ~~~~~~~p~~~~~~~~~~~~~~~~~~~-w~~~~~ 39 (440) |+|. +..|.--+.|++++|++.|++ |-++.| T Consensus 23 PQld--~~~f~sQiFWl~i~F~ily~vl~k~~lP 54 (181) T PRK13454 23 PQLD--FDTFPNQIFWLLVTLVAIYFVLTRVALP 54 (181) T ss_pred CCCC--CHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999--0240789999999999999999999998 No 231 >pfam11488 Lge1 Transcriptional regulatory protein LGE1. This family of proteins is conserved in yeasts where it is involved in the ubiquitination of histones H2A and H2B. This ubiquitination step is a vital one in the regulation of the transcriptional activity of RNA polymerase II. In S. cerevisiae, Rad6 and Bre1 are present in a complex, also containing Lge1, that is required for H2B ubiquitination. Bre1 is the H2B ubiquitin ligase that interacts with acidic activators, such as Gal4, and recruits Rad6 and its binding partner Lge1 to target promoters. In S. pombe the equivalent protein to Lge1 appears to be Shf1. Probab=56.45 E-value=12 Score=16.03 Aligned_cols=31 Identities=10% Similarity=0.288 Sum_probs=16.1 Q ss_pred EEEECHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 9999648998889998743433233210012 Q gi|254780386|r 84 LLTFEDIETTDLIDLKKVSIKNLRCHIDTEK 114 (440) Q Consensus 84 L~~ld~~~~~~~l~~~~~~l~~l~~~l~~~~ 114 (440) |+.+|+...+..+++.-.++.++...|..++ T Consensus 167 Im~idd~~t~e~le~~~~e~~~~d~~L~~~~ 197 (234) T pfam11488 167 IMDIDDEPTQEKLEERYKELEELDQKLEELQ 197 (234) T ss_pred EEECCCCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 1105673148899998766888899999999 No 232 >TIGR00527 gcvH glycine cleavage system H protein; InterPro: IPR002930 This is a family of glycine cleavage H-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in eukaryotes. A lipoyl group is attached to a completely conserved lysine residue. The H protein shuttles the methylamine group of glycine from the P protein to the T protein.; GO: 0006546 glycine catabolic process, 0005960 glycine cleavage complex. Probab=56.24 E-value=5.3 Score=18.31 Aligned_cols=12 Identities=0% Similarity=0.127 Sum_probs=5.6 Q ss_pred CCEEEEEEEEEE Q ss_conf 443999999971 Q gi|254780386|r 360 EKKFKAIIEKID 371 (440) Q Consensus 360 ~~~~~G~V~~Is 371 (440) |-++.|+|..|- T Consensus 69 Y~Pv~Gtv~e~N 80 (132) T TIGR00527 69 YAPVDGTVVEVN 80 (132) T ss_pred CCCCCCEEEEEC T ss_conf 389874499853 No 233 >pfam09891 DUF2118 Uncharacterized protein conserved in archaea (DUF2118). This domain, found in various hypothetical archaeal proteins, has no known function. Probab=56.09 E-value=12 Score=15.94 Aligned_cols=37 Identities=16% Similarity=0.152 Sum_probs=17.9 Q ss_pred CCCCCCCCCCCCCCCCCCCCEEEEEEECHHHHHHCCCCCCE Q ss_conf 22212444554210335565047753262231101589831 Q gi|254780386|r 308 SSNYAQQSQPLMKIVPHSKLTYIRAKVTPKQIQHVKKGYTA 348 (440) Q Consensus 308 ~G~~v~~G~~l~~Ivp~~~~l~v~~~v~~~di~~i~~Gq~v 348 (440) .|-++.+|+.|+.+ | -..+.+ ++-...-.+|..|... T Consensus 71 ~~~~lkkGt~l~lv-p-veGy~v--~~i~~~G~rV~~g~~l 107 (149) T pfam09891 71 TGIYLKKGTELKLV-P-AEGYKV--YPIVDEGDRVLKGDRL 107 (149) T ss_pred EEEEECCCCEEEEE-E-ECCEEE--EEECCCCCEEECCCEE T ss_conf 79998699999999-8-074699--8872656477218648 No 234 >PRK03481 consensus Probab=56.07 E-value=9.2 Score=16.75 Aligned_cols=26 Identities=8% Similarity=0.029 Sum_probs=15.7 Q ss_pred CCCCCCEEEEECCEEECCCCCCCCCCC Q ss_conf 455541168731212012221244455 Q gi|254780386|r 291 TILSPIAGTIVYNQSFSSSNYAQQSQP 317 (440) Q Consensus 291 ~I~AP~~G~V~~l~~~~~G~~v~~G~~ 317 (440) .+-+|.||++....+ -+|..-+.... T Consensus 145 R~H~P~dg~l~~~~~-ipG~LfsVNp~ 170 (322) T PRK03481 145 RVHMPCNGILREMIY-VPGDLFSVNHL 170 (322) T ss_pred EEECCCCCEEEEEEE-CCCCCEEECHH T ss_conf 586356858999998-78773316889 No 235 >PRK08385 nicotinate-nucleotide pyrophosphorylase; Provisional Probab=55.97 E-value=12 Score=15.92 Aligned_cols=25 Identities=12% Similarity=0.319 Sum_probs=20.6 Q ss_pred EEEEEECCCCCEEECCCEEEEECHH Q ss_conf 8999985789893369789999648 Q gi|254780386|r 66 IIKKFQVENGSQILQGTPLLTFEDI 90 (440) Q Consensus 66 ~V~~i~VkeGd~VkkGq~L~~ld~~ 90 (440) .-..+++++|+.|++|++++++... T Consensus 59 ~~~~~~~~dG~~v~~g~~i~~i~G~ 83 (279) T PRK08385 59 VAVEVKKRDGETVKAGEVILELKGN 83 (279) T ss_pred CEEEEECCCCCEECCCCEEEEEEEC T ss_conf 8899992889890699899999987 No 236 >KOG4360 consensus Probab=55.08 E-value=11 Score=16.34 Aligned_cols=25 Identities=20% Similarity=0.286 Sum_probs=8.4 Q ss_pred CCCCCCCCCCCCCHHHHHHHHHHCC Q ss_conf 0112232222221023333222101 Q gi|254780386|r 178 RLDLENKSSLLRALLSSHQKDLTMM 202 (440) Q Consensus 178 ~~~l~~~~~~l~~~~~~~~~el~~~ 202 (440) ..+++.+.+.....+...+.+|..+ T Consensus 277 ~~EleDkyAE~m~~~~EaeeELk~l 301 (596) T KOG4360 277 LEELEDKYAECMQMLHEAEEELKCL 301 (596) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9998888999999889889999863 No 237 >PRK01734 consensus Probab=54.78 E-value=7.7 Score=17.27 Aligned_cols=24 Identities=8% Similarity=0.014 Sum_probs=15.7 Q ss_pred CCCCCCEEEEECCEEECCCCCCCCC Q ss_conf 4555411687312120122212444 Q gi|254780386|r 291 TILSPIAGTIVYNQSFSSSNYAQQS 315 (440) Q Consensus 291 ~I~AP~~G~V~~l~~~~~G~~v~~G 315 (440) .+-+|+||+|..... -+|..-+.. T Consensus 155 R~H~P~~g~l~~~~~-ipG~L~sVn 178 (294) T PRK01734 155 RVHMPCDATLRKMIY-VPGDLFSVN 178 (294) T ss_pred EEECCCCCEEEEEEE-CCCCCCCCC T ss_conf 364355768989998-698554509 No 238 >pfam05103 DivIVA DivIVA protein. The Bacillus subtilis divIVA1 mutation causes misplacement of the septum during cell division, resulting in the formation of small, circular, anucleate mini-cells. Inactivation of divIVA produces a mini-cell phenotype, whereas overproduction of DivIVA results in a filamentation phenotype. These proteins appear to contain coiled-coils. Probab=54.66 E-value=8.9 Score=16.82 Aligned_cols=36 Identities=14% Similarity=0.196 Sum_probs=14.7 Q ss_pred ECHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCC Q ss_conf 964899888999874343323321001233332221 Q gi|254780386|r 87 FEDIETTDLIDLKKVSIKNLRCHIDTEKSALDFLNN 122 (440) Q Consensus 87 ld~~~~~~~l~~~~~~l~~l~~~l~~~~~~l~~~~~ 122 (440) +|..+..+-++..-..+..+..+...++..+..+.. T Consensus 18 Yd~~EVD~FLd~va~e~e~l~~en~~L~~~~~~l~~ 53 (131) T pfam05103 18 YDPDEVDEFLDQVIKDYEALYKENEELKEEIERLEE 53 (131) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999999999999999999999999999999 No 239 >TIGR02097 yccV hemimethylated DNA binding domain; InterPro: IPR011722 This entry describes the small protein from E. coli YccV and its homologs in other Proteobacteria. YccV is now described as a hemimethylated DNA binding protein . The model entry describes a domain in longer eukaryotic proteins.. Probab=54.15 E-value=13 Score=15.73 Aligned_cols=48 Identities=13% Similarity=0.429 Sum_probs=35.4 Q ss_pred HCCCCCCEEEEEEECCCCCCCEEEEEEEEEECCCCCCCC----------CC-CCCEEEEEEEEECHH Q ss_conf 015898317999812887544399999997133444677----------76-551189999993501 Q gi|254780386|r 341 HVKKGYTATVRFPHYADIREKKFKAIIEKIDPIISQQDN----------NL-QTQEYYEVILKITDL 396 (440) Q Consensus 341 ~i~~Gq~v~i~~~a~~~~~~~~~~G~V~~Is~~~~~~~~----------~~-~~~~~y~v~i~~~~~ 396 (440) +.++||.|+=++ || |.|.|..|+|....... .. ..++||.|-++-++. T Consensus 3 ~F~iG~~vrH~~--fg------yrgVv~d~DP~~~~~~~w~~~~~~~~~p~G~~qPfY~vl~e~~~~ 61 (103) T TIGR02097 3 KFRIGQVVRHKL--FG------YRGVVIDVDPEYANTEEWLEAIPVEIRPRGKDQPFYHVLVEDDEG 61 (103) T ss_pred CCCCCCEEEEEE--CC------CCEEEECCCCHHHCCCHHHHHHHHHCCCCCCCCCCEEEEEECCCC T ss_conf 202576754123--68------544886658010068633787643001256578972688864899 No 240 >TIGR00008 infA translation initiation factor IF-1; InterPro: IPR004368 This family consists of translation initiation factor IF-1 as found in bacteria and chloroplasts. This protein, about 70 residues in length, consists largely of an S1 RNA binding domain (IPR003029 from INTERPRO).; GO: 0003743 translation initiation factor activity, 0006413 translational initiation. Probab=53.98 E-value=13 Score=15.71 Aligned_cols=41 Identities=15% Similarity=0.108 Sum_probs=27.9 Q ss_pred CCCCEEEEEEECHH---HHHHCCCCCCEEEEEEECCCCCCCEEEEEEEE Q ss_conf 55650477532622---31101589831799981288754439999999 Q gi|254780386|r 324 HSKLTYIRAKVTPK---QIQHVKKGYTATVRFPHYADIREKKFKAIIEK 369 (440) Q Consensus 324 ~~~~l~v~~~v~~~---di~~i~~Gq~v~i~~~a~~~~~~~~~~G~V~~ 369 (440) -++.-.|.|+|+-. ..-+|=+|+.|.|-+..|+.. .|+|++ T Consensus 24 LeN~~~v~a~iSGk~r~~~iriLpGD~V~ve~SpYdl~-----~GRI~y 67 (69) T TIGR00008 24 LENGHEVLAHISGKIRKNYIRILPGDKVKVELSPYDLT-----RGRIIY 67 (69) T ss_pred ECCCCEEEEEECCEEECCEEEECCCCEEEEEECCCCCC-----CCCEEE T ss_conf 47997898997103413517554798789987136487-----750877 No 241 >KOG0963 consensus Probab=53.65 E-value=0.59 Score=24.68 Aligned_cols=22 Identities=18% Similarity=0.273 Sum_probs=10.3 Q ss_pred CCCCCCCCCCCCCCCCCCEEEE Q ss_conf 3221222221234555411687 Q gi|254780386|r 279 KLTILEKTREQKTILSPIAGTI 300 (440) Q Consensus 279 ~l~~l~~~l~~~~I~AP~~G~V 300 (440) -+..+++.+.......|-.|.- T Consensus 426 ~i~klE~dl~~~~~~~~~~~~~ 447 (629) T KOG0963 426 LIAKLEQDLLKVQVSPPAEGAT 447 (629) T ss_pred HHHHHHHHHHHCCCCCCCCCCC T ss_conf 9998876564114689987630 No 242 >pfam11119 DUF2633 Protein of unknown function (DUF2633). This family is conserved largely in the Bacillaceae family. Several members are named as YfgG. The function is not known. Probab=53.32 E-value=13 Score=15.79 Aligned_cols=30 Identities=20% Similarity=0.305 Sum_probs=18.9 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHEEE Q ss_conf 356861689999999999999998764067 Q gi|254780386|r 10 KRNHPASLKRMTLLSIIGIIGLITWSILLP 39 (440) Q Consensus 10 ~~~~p~~~~~~~~~~~~~~~~~~~w~~~~~ 39 (440) +|++.+-.++++++.+++|+|=+++++.+- T Consensus 3 ~r~~~~mt~iVLLISFiilFgRl~Y~aI~a 32 (59) T pfam11119 3 RKKNGRMTKIVLLISFIIFFGRLIYSAINA 32 (59) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 776661256899999999999999987747 No 243 >KOG3814 consensus Probab=53.26 E-value=13 Score=15.64 Aligned_cols=35 Identities=20% Similarity=0.328 Sum_probs=25.3 Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHEE Q ss_conf 34576335686168999999999999999876406 Q gi|254780386|r 4 FMWKNLKRNHPASLKRMTLLSIIGIIGLITWSILL 38 (440) Q Consensus 4 ~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~w~~~~ 38 (440) |+-++.-.-+..|.-+.+++++++++.|..|.++. T Consensus 176 f~Rk~~A~mPRvf~~RAlll~LV~~~~fayWLFYi 210 (531) T KOG3814 176 FFRKAMADMPRVFVVRALLLVLVFLIVFAYWLFYI 210 (531) T ss_pred HHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 82321046745389999999999999999999885 No 244 >PRK04722 consensus Probab=53.22 E-value=13 Score=15.63 Aligned_cols=22 Identities=14% Similarity=0.170 Sum_probs=10.4 Q ss_pred CCCCCCEEEEECCEEECCCCCCC Q ss_conf 45554116873121201222124 Q gi|254780386|r 291 TILSPIAGTIVYNQSFSSSNYAQ 313 (440) Q Consensus 291 ~I~AP~~G~V~~l~~~~~G~~v~ 313 (440) .+-+|.||.+...... +|..-+ T Consensus 142 R~H~P~dg~~~~~~~i-pG~L~s 163 (280) T PRK04722 142 RVHMPWTGTLRETVHV-PGRLFS 163 (280) T ss_pred EEECCCCEEEEEEEEC-CCCCEE T ss_conf 5726764499999985-985231 No 245 >CHL00037 petA cytochrome f Probab=52.94 E-value=14 Score=15.61 Aligned_cols=25 Identities=8% Similarity=-0.003 Sum_probs=14.7 Q ss_pred CCCCCCCCCCCEEEEECCEEECCCC Q ss_conf 2212345554116873121201222 Q gi|254780386|r 286 TREQKTILSPIAGTIVYNQSFSSSN 310 (440) Q Consensus 286 ~l~~~~I~AP~~G~V~~l~~~~~G~ 310 (440) .-+++...||.+|+|..+.....|+ T Consensus 200 KSNN~~~~as~~G~i~~I~~~ekgg 224 (320) T CHL00037 200 KSNNTVYNATAAGIVSKIIRKEKGG 224 (320) T ss_pred CCCCCEECCCCCCEEEEEEECCCCC T ss_conf 4668266055772788877657895 No 246 >KOG4674 consensus Probab=52.61 E-value=0.5 Score=25.15 Aligned_cols=15 Identities=20% Similarity=0.085 Sum_probs=7.2 Q ss_pred CCCCEEEEEEEECEE Q ss_conf 237756999992205 Q gi|254780386|r 406 RNGFPAEVLFTAKNT 420 (440) Q Consensus 406 ~~Gm~~~~~I~~~~~ 420 (440) --||.+..++..++. T Consensus 1745 ~~~~~~e~d~~k~kE 1759 (1822) T KOG4674 1745 EQGTDVENDTKKKKE 1759 (1822) T ss_pred CCCCCCCCCCCCCCC T ss_conf 135655554102456 No 247 >COG3061 OapA Cell envelope opacity-associated protein A [Cell envelope biogenesis, outer membrane] Probab=51.35 E-value=12 Score=16.00 Aligned_cols=53 Identities=15% Similarity=0.233 Sum_probs=33.7 Q ss_pred CCCCCCCCCCCCCCCCC----CEEEEEEE--CHHHHHHCCCCCCEEEEEEECCCCCCCEEEEEEEEEE Q ss_conf 21244455421033556----50477532--6223110158983179998128875443999999971 Q gi|254780386|r 310 NYAQQSQPLMKIVPHSK----LTYIRAKV--TPKQIQHVKKGYTATVRFPHYADIREKKFKAIIEKID 371 (440) Q Consensus 310 ~~v~~G~~l~~Ivp~~~----~l~v~~~v--~~~di~~i~~Gq~v~i~~~a~~~~~~~~~~G~V~~Is 371 (440) =.|..|.+|+.+.-|++ +++.-+.| .-.-.+.++-||.|+|++++ .|.|+... T Consensus 162 y~V~~G~TLaQlFRdn~LpitDVnAMakveGagkpLSnlkaGq~Vki~~na---------qG~Vt~L~ 220 (242) T COG3061 162 YTVPQGKTLAQLFRDNNLPITDVNAMAKVEGAGKPLSNLKAGQKVKISLNA---------QGRVTELR 220 (242) T ss_pred EEECCCCCHHHHHHCCCCCHHHHHHHHHHCCCCCCHHHCCCCCEEEEEECC---------CCCEEEEE T ss_conf 871687659998760699857767877530478702221477779999747---------66457899 No 248 >COG3206 GumC Uncharacterized protein involved in exopolysaccharide biosynthesis [Cell envelope biogenesis, outer membrane] Probab=50.74 E-value=15 Score=15.38 Aligned_cols=34 Identities=9% Similarity=0.007 Sum_probs=20.1 Q ss_pred HHHHHHHHHHHHHHHHHHHEEEEEEEEEEEEEEE Q ss_conf 9999999999999998764067125899979999 Q gi|254780386|r 18 KRMTLLSIIGIIGLITWSILLPIEISVSSSGEIL 51 (440) Q Consensus 18 ~~~~~~~~~~~~~~~~w~~~~~~~~~v~~~G~i~ 51 (440) ++++.++++++++.++|+++++......+...|. T Consensus 26 ~~il~~~~~~~~~a~~~a~~~~p~Y~a~a~l~i~ 59 (458) T COG3206 26 WLILLVAALVVGLAVLYAFLAPPIYEADAQLLVD 59 (458) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCEEEEEEEEE T ss_conf 9999999999999999997088621688899984 No 249 >pfam11623 DUF3252 Protein of unknown function (DUF3252). This family of proteins has no known function. Some members are annotated as Ssl0352 however this cannot be confirmed. Currently there is no known function. Probab=50.74 E-value=15 Score=15.38 Aligned_cols=29 Identities=10% Similarity=0.160 Sum_probs=23.2 Q ss_pred CCCCCCEEEEEEECCCCCCCEEEEEEEEEECC Q ss_conf 15898317999812887544399999997133 Q gi|254780386|r 342 VKKGYTATVRFPHYADIREKKFKAIIEKIDPI 373 (440) Q Consensus 342 i~~Gq~v~i~~~a~~~~~~~~~~G~V~~Is~~ 373 (440) |-||+.|+|+= |.+.|-.|.|-|.+||.. T Consensus 2 ilPG~~V~V~n---~~~~Y~gy~G~VQRvsd~ 30 (53) T pfam11623 2 ILPGMTVKVKN---PNDIYYGYEGQVQRVSDG 30 (53) T ss_pred CCCCCEEEEEC---CCCCCHHCEEEEEEECCC T ss_conf 25997899868---877600204899994199 No 250 >smart00787 Spc7 Spc7 kinetochore protein. This domain is found in cell division proteins which are required for kinetochore-spindle association. Probab=50.42 E-value=6.9 Score=17.56 Aligned_cols=10 Identities=10% Similarity=0.238 Sum_probs=5.4 Q ss_pred CCCCCEEECC Q ss_conf 5789893369 Q gi|254780386|r 72 VENGSQILQG 81 (440) Q Consensus 72 VkeGd~VkkG 81 (440) +.-.|.|.+| T Consensus 49 ~sl~d~v~A~ 58 (312) T smart00787 49 CSLDQYVVAG 58 (312) T ss_pred CCHHHHHHHH T ss_conf 8899999971 No 251 >PRK06148 hypothetical protein; Provisional Probab=49.62 E-value=15 Score=15.27 Aligned_cols=60 Identities=13% Similarity=0.194 Sum_probs=25.2 Q ss_pred CCCCCCCCCEEEEECCEEECCCCCCCCCC-CCCCCCCCC-CCEE-EEEEECHHHHHHCCCCCCEE Q ss_conf 12345554116873121201222124445-542103355-6504-77532622311015898317 Q gi|254780386|r 288 EQKTILSPIAGTIVYNQSFSSSNYAQQSQ-PLMKIVPHS-KLTY-IRAKVTPKQIQHVKKGYTAT 349 (440) Q Consensus 288 ~~~~I~AP~~G~V~~l~~~~~G~~v~~G~-~l~~Ivp~~-~~l~-v~~~v~~~di~~i~~Gq~v~ 349 (440) ..+.|.||++|.|...... .|. ..-|- -++.=.+++ ..++ .....+..-+.++++|+++. T Consensus 452 ~gT~V~AP~dG~V~~~~n~-~~~-l~yGg~iiL~h~~~~~~~F~tLyGhl~~~~~~~l~~G~~i~ 514 (1015) T PRK06148 452 AGTPVYAPLAGTVRSVEIE-AVP-LGYGGLIALEHEPPGCPPFYTLWGHLAHEAVSRLKPGDRLA 514 (1015) T ss_pred CCCCEECCCCCEEEEEECC-CCC-CCCCCEEEEEECCCCCCCEEEEECCCCHHHHHCCCCCCCCC T ss_conf 9980446658669996247-655-56775489984258887505652357725531268888504 No 252 >KOG0018 consensus Probab=49.61 E-value=2.1 Score=21.07 Aligned_cols=10 Identities=0% Similarity=-0.190 Sum_probs=5.1 Q ss_pred EEEEEEECCC Q ss_conf 1799981288 Q gi|254780386|r 348 ATVRFPHYAD 357 (440) Q Consensus 348 v~i~~~a~~~ 357 (440) |++.+|+.+| T Consensus 580 ~rlv~Dvi~y 589 (1141) T KOG0018 580 VRLVIDVINY 589 (1141) T ss_pred EEEEEEECCC T ss_conf 0799873278 No 253 >PRK05742 nicotinate-nucleotide pyrophosphorylase; Provisional Probab=49.60 E-value=13 Score=15.68 Aligned_cols=25 Identities=20% Similarity=0.391 Sum_probs=20.6 Q ss_pred EEEEEECCCCCEEECCCEEEEECHH Q ss_conf 8999985789893369789999648 Q gi|254780386|r 66 IIKKFQVENGSQILQGTPLLTFEDI 90 (440) Q Consensus 66 ~V~~i~VkeGd~VkkGq~L~~ld~~ 90 (440) .--.+.+++|+.|++|++++++... T Consensus 66 ~~~~~~~~dG~~v~~g~~i~~i~G~ 90 (277) T PRK05742 66 VAVHWQVADGDRVEANQVLFHLEGP 90 (277) T ss_pred CEEEEEECCCCCCCCCCEEEEEEEC T ss_conf 0999983889870588689999977 No 254 >PRK00044 psd phosphatidylserine decarboxylase; Reviewed Probab=49.56 E-value=12 Score=15.85 Aligned_cols=24 Identities=13% Similarity=0.090 Sum_probs=15.2 Q ss_pred CCCCCCEEEEECCEEECCCCCCCCC Q ss_conf 4555411687312120122212444 Q gi|254780386|r 291 TILSPIAGTIVYNQSFSSSNYAQQS 315 (440) Q Consensus 291 ~I~AP~~G~V~~l~~~~~G~~v~~G 315 (440) .+-||.||+|..... -+|..-+.. T Consensus 145 R~H~P~~g~i~~~~~-ipG~l~sVn 168 (291) T PRK00044 145 RVHMPCDGTLREMIY-VPGDLFSVN 168 (291) T ss_pred EEECCCCCEEEEEEE-ECCCCCCCC T ss_conf 142675667989999-697379889 No 255 >pfam10168 Nup88 Nuclear pore component. Nup88 can be divided into two structural domains; the N-terminal two-thirds of the protein has no obvious structural motifs but is the region for binding to Nup98, one of the components of the nuclear pore. the C-terminal end is a predicted coiled-coil domain. Nup88 is overexpressed in tumour cells. Probab=49.46 E-value=8.4 Score=16.99 Aligned_cols=23 Identities=9% Similarity=0.151 Sum_probs=9.6 Q ss_pred EEEEEEEEEEECCCEEEEECCCC Q ss_conf 58999799995387389846888 Q gi|254780386|r 42 ISVSSSGEILNEDNVVEIKSPFS 64 (440) Q Consensus 42 ~~v~~~G~i~~~~~~~~I~s~~~ 64 (440) +...++|.++...-....+.+.. T Consensus 457 i~ll~~g~visl~Ll~~~~~~~p 479 (717) T pfam10168 457 ICITGTYECISLPLLIDAQPLSP 479 (717) T ss_pred EEEECCCCEEEEECCCCCCCCCC T ss_conf 99945883897330014556788 No 256 >TIGR01394 TypA_BipA GTP-binding protein TypA; InterPro: IPR006298 This bacterial (and Arabidopsis) protein, termed TypA or BipA, is a GTP-binding protein. It is phosphorylated on a tyrosine residue under some cellular conditions. Mutants show altered regulation of some pathways, but the precise function is unknown.; GO: 0005525 GTP binding, 0005622 intracellular. Probab=48.98 E-value=16 Score=15.20 Aligned_cols=52 Identities=21% Similarity=0.172 Sum_probs=38.1 Q ss_pred HHEEEEEEEEEEEEEEEECCCEEEEEC----CCCCEEEEEECCCC------CEEECCCEEEE Q ss_conf 640671258999799995387389846----88868999985789------89336978999 Q gi|254780386|r 35 SILLPIEISVSSSGEILNEDNVVEIKS----PFSGIIKKFQVENG------SQILQGTPLLT 86 (440) Q Consensus 35 ~~~~~~~~~v~~~G~i~~~~~~~~I~s----~~~G~V~~i~VkeG------d~VkkGq~L~~ 86 (440) .|+++|-+.=.-+|+|.+...+..+.. ...|+|.+++..+| |.+++||+.+- T Consensus 223 ~y~GRI~~GRv~~G~vk~Gq~V~~~~~d~g~~~~~ri~~L~~f~GL~R~~~d~A~AGDIvAv 284 (609) T TIGR01394 223 EYLGRIAIGRVHRGTVKKGQQVALMKRDDGTIENGRISKLLGFEGLERVEIDEAGAGDIVAV 284 (609) T ss_pred CCCCEEEEEEEEECEECCCCEEEEEECCCCEEEEEEEEEEEECCCCEEEECCCCCCCCEEEE T ss_conf 67766999987505654686479872469689777764542015711000455798778999 No 257 >PRK06559 nicotinate-nucleotide pyrophosphorylase; Provisional Probab=48.87 E-value=14 Score=15.55 Aligned_cols=21 Identities=19% Similarity=0.366 Sum_probs=18.4 Q ss_pred EECCCCCEEECCCEEEEECHH Q ss_conf 985789893369789999648 Q gi|254780386|r 70 FQVENGSQILQGTPLLTFEDI 90 (440) Q Consensus 70 i~VkeGd~VkkGq~L~~ld~~ 90 (440) ..|++|+.|++|++++++... T Consensus 77 ~~~~dG~~v~~G~~i~~i~G~ 97 (290) T PRK06559 77 HQFKDGDRLTSGDLVLEIIGS 97 (290) T ss_pred CCCCCCCEECCCCEEEEEEEC T ss_conf 336889994799999999956 No 258 >PRK03934 phosphatidylserine decarboxylase; Provisional Probab=48.81 E-value=12 Score=15.89 Aligned_cols=21 Identities=10% Similarity=0.059 Sum_probs=10.8 Q ss_pred CCCCCCEEEEECCEEECCCCCC Q ss_conf 4555411687312120122212 Q gi|254780386|r 291 TILSPIAGTIVYNQSFSSSNYA 312 (440) Q Consensus 291 ~I~AP~~G~V~~l~~~~~G~~v 312 (440) .+-+|++|.|...... +|..- T Consensus 131 R~H~P~~g~i~~~~~i-pG~l~ 151 (267) T PRK03934 131 RYHAPCDLEILSARYI-PGKLY 151 (267) T ss_pred EEECCCCCEEEEEEEE-CCCCC T ss_conf 6970458879789993-88236 No 259 >pfam05911 DUF869 Plant protein of unknown function (DUF869). This family consists of a number of sequences found in Arabidopsis thaliana, Oryza sativa and Lycopersicon esculentum (Tomato). The function of this family is unknown. Probab=48.79 E-value=3 Score=19.97 Aligned_cols=16 Identities=44% Similarity=0.621 Sum_probs=6.4 Q ss_pred CCCCCCCCCCCCCCCC Q ss_conf 2122233221222221 Q gi|254780386|r 273 FADEMSKLTILEKTRE 288 (440) Q Consensus 273 l~~~~~~l~~l~~~l~ 288 (440) ......++..++.+++ T Consensus 295 ~AktaskLs~~eaQl~ 310 (767) T pfam05911 295 YAKTASKLSVLEAQLE 310 (767) T ss_pred HHHHHHHHHHHHHHHH T ss_conf 8998765688999887 No 260 >pfam05545 FixQ Cbb3-type cytochrome oxidase component FixQ. This family consists of several Cbb3-type cytochrome oxidase components (FixQ/CcoQ). FixQ is found in nitrogen fixing bacteria. Since nitrogen fixation is an energy-consuming process, effective symbioses depend on operation of a respiratory chain with a high affinity for O2, closely coupled to ATP production. This requirement is fulfilled by a special three-subunit terminal oxidase (cytochrome terminal oxidase cbb3), which was first identified in Bradyrhizobium japonicum as the product of the fixNOQP operon. Probab=48.70 E-value=16 Score=15.17 Aligned_cols=20 Identities=25% Similarity=0.466 Sum_probs=14.0 Q ss_pred HHHHHHHHHHHHHHHHHHHE Q ss_conf 99999999999999987640 Q gi|254780386|r 18 KRMTLLSIIGIIGLITWSIL 37 (440) Q Consensus 18 ~~~~~~~~~~~~~~~~w~~~ 37 (440) .+.+++++++|+|.++|++- T Consensus 12 ~~~~l~~~~~Figiv~wa~~ 31 (49) T pfam05545 12 SWGTLVLFVFFIGIVFWAYS 31 (49) T ss_pred HHHHHHHHHHHHHHHHHHCC T ss_conf 99999999999999998728 No 261 >COG3785 Uncharacterized conserved protein [Function unknown] Probab=48.67 E-value=15 Score=15.32 Aligned_cols=50 Identities=16% Similarity=0.435 Sum_probs=35.0 Q ss_pred HHHCCCCCCEEEEEEECCCCCCCEEEEEEEEEECCCCCCCC----------CCCCCEEEEEEEEECHH Q ss_conf 11015898317999812887544399999997133444677----------76551189999993501 Q gi|254780386|r 339 IQHVKKGYTATVRFPHYADIREKKFKAIIEKIDPIISQQDN----------NLQTQEYYEVILKITDL 396 (440) Q Consensus 339 i~~i~~Gq~v~i~~~a~~~~~~~~~~G~V~~Is~~~~~~~~----------~~~~~~~y~v~i~~~~~ 396 (440) .++..+||.|+=++ || |.|.|..|+|.-...+. ....++||.+.++=++. T Consensus 12 ~aKF~IGQvVRHrl--fp------frGVV~DvDPeyanteew~~~ip~~~rp~rdqPfYHllaEnde~ 71 (116) T COG3785 12 AAKFGIGQVVRHRL--FP------FRGVVFDVDPEYANTEEWPDEIPVNIRPLRDQPFYHLLAENDET 71 (116) T ss_pred HHHCCHHHHHHHHC--CC------CCEEEEECCCCCCCCCCCHHHCCCCCCCCCCCCCEEEEEECCCC T ss_conf 75302204221010--35------43378833834146863844540014644458610144206886 No 262 >pfam12389 Peptidase_M73 Camelysin metallo-endopeptidase. Probab=48.43 E-value=16 Score=15.16 Aligned_cols=51 Identities=14% Similarity=0.135 Sum_probs=28.5 Q ss_pred HHHHHHHHHHHHHEEEEEEE--EEEEEEEEECCCEEEEECCCCCEEEEEECCCCCEEECC Q ss_conf 99999999987640671258--99979999538738984688868999985789893369 Q gi|254780386|r 24 SIIGIIGLITWSILLPIEIS--VSSSGEILNEDNVVEIKSPFSGIIKKFQVENGSQILQG 81 (440) Q Consensus 24 ~~~~~~~~~~w~~~~~~~~~--v~~~G~i~~~~~~~~I~s~~~G~V~~i~VkeGd~VkkG 81 (440) +-+.+||-+.||+|.-..++ .-+.|++=..-++.. +|.=-..|+||.+.+. T Consensus 18 LGlsLiGGGTfAyFsD~e~snnTFAAGTLDL~~~p~~-------ii~v~nLKPGD~i~r~ 70 (199) T pfam12389 18 LGAALIGGGTFAFFSDKEVSNNTFAAGTLDLALDPKT-------LVDIKDLKPGDSVEKE 70 (199) T ss_pred HHHHEECCCEEEEECCCEEECCEEEEEEEECCCCCCE-------EEECCCCCCCCCEEEE T ss_conf 5431645723799704145522564004420368741-------8871358998631146 No 263 >CHL00071 tufA elongation factor Tu Probab=48.34 E-value=16 Score=15.14 Aligned_cols=58 Identities=12% Similarity=0.123 Sum_probs=33.1 Q ss_pred CCEEEEECCEEECCCCCCCCCCCCCCCCC--CCCCEEE-EEEECHHHHHHCCCCCCEEEEEEEC Q ss_conf 41168731212012221244455421033--5565047-7532622311015898317999812 Q gi|254780386|r 295 PIAGTIVYNQSFSSSNYAQQSQPLMKIVP--HSKLTYI-RAKVTPKQIQHVKKGYTATVRFPHY 355 (440) Q Consensus 295 P~~G~V~~l~~~~~G~~v~~G~~l~~Ivp--~~~~l~v-~~~v~~~di~~i~~Gq~v~i~~~a~ 355 (440) |-.|+|..-.+ ..+.+..|+.+. |+| +.....| ...+....+.....||.|.+.+... T Consensus 232 ~G~GtVv~G~v--~sG~v~~Gd~v~-i~~~~~~~~~~VksI~~~~~~~~~a~aG~~v~l~L~gi 292 (409) T CHL00071 232 TGRGTVATGRI--ERGTVKVGDTVE-IVGLRETRTTTVTGLEMFQKTLDEGLAGDNVGILLRGI 292 (409) T ss_pred CCCEEEEEEEE--EECCCCCCCEEE-EEECCCCCEEEEEEEEECCCCCCEECCCCEEEEEECCC T ss_conf 89788999999--145634999899-99769986079999998895988887998899997599 No 264 >PRK09838 periplasmic copper-binding protein; Provisional Probab=48.20 E-value=16 Score=15.12 Aligned_cols=56 Identities=16% Similarity=0.291 Sum_probs=35.2 Q ss_pred CEEEEECCEEECCCCCCCCCCCCCCCCCCCCCEEEEEEECHH-HHHHCCCCCCEEEEEEE Q ss_conf 116873121201222124445542103355650477532622-31101589831799981 Q gi|254780386|r 296 IAGTIVYNQSFSSSNYAQQSQPLMKIVPHSKLTYIRAKVTPK-QIQHVKKGYTATVRFPH 354 (440) Q Consensus 296 ~~G~V~~l~~~~~G~~v~~G~~l~~Ivp~~~~l~v~~~v~~~-di~~i~~Gq~v~i~~~a 354 (440) -.|+|..++. ..|.+.=.=.||-.| +-..+.-...++.. ++..+++|++|.+.|.- T Consensus 44 ~~G~V~~ID~-~~~kVTI~H~pI~aL--~WPaMTM~F~v~~~~~l~~lk~Gd~V~F~Fvq 100 (114) T PRK09838 44 ATGVVKGIDL-ESKKITIHHEPIPAV--NWPEMTMRFTITPQTKMSEIKTGDKVAFNFVQ 100 (114) T ss_pred EEEEEEEEEC-CCCEEEEECCCCCCC--CCCCCEEEEECCCHHHHHCCCCCCEEEEEEEE T ss_conf 3479999826-799799825861306--99863576333895676059999989999996 No 265 >TIGR00922 nusG transcription termination/antitermination factor NusG; InterPro: IPR001062 Bacterial transcription antitermination protein, nusG, is a component of the transcription complex and interacts with the termination factor Rho and RNA polymerase , . NusG is a bacterial transcriptional elongation factor involved in transcription termination and anti-termination .; GO: 0003711 transcription elongation regulator activity, 0006355 regulation of transcription DNA-dependent. Probab=47.89 E-value=16 Score=15.09 Aligned_cols=27 Identities=19% Similarity=0.263 Sum_probs=20.8 Q ss_pred HCCCCCCEEEEEEECCCCCCCEEEEEEEEEEC Q ss_conf 01589831799981288754439999999713 Q gi|254780386|r 341 HVKKGYTATVRFPHYADIREKKFKAIIEKIDP 372 (440) Q Consensus 341 ~i~~Gq~v~i~~~a~~~~~~~~~~G~V~~Is~ 372 (440) ...+|..|+|. |+ |+ -.|+|+|..|++ T Consensus 139 ~fE~Ge~Vrv~-dG-PF---~~F~G~Veev~~ 165 (193) T TIGR00922 139 DFEVGEQVRVN-DG-PF---ANFTGTVEEVDY 165 (193) T ss_pred CCCCCCEEEEE-CC-CC---CCCCEEEEEEEH T ss_conf 23579888980-38-88---885147988802 No 266 >pfam04849 HAP1_N HAP1 N-terminal conserved region. This family represents an N-terminal conserved region found in several huntingtin-associated protein 1 (HAP1) homologues. HAP1 binds to huntingtin in a polyglutamine repeat-length-dependent manner. However, its possible role in the pathogenesis of Huntington's disease is unclear. This family also includes a similar N-terminal conserved region from hypothetical protein products of ALS2CR3 genes found in the human juvenile amyotrophic lateral sclerosis critical region 2q33-2q34. Probab=47.82 E-value=14 Score=15.57 Aligned_cols=16 Identities=38% Similarity=0.663 Sum_probs=5.9 Q ss_pred EECHHHH-HHHHHHHHH Q ss_conf 9964899-888999874 Q gi|254780386|r 86 TFEDIET-TDLIDLKKV 101 (440) Q Consensus 86 ~ld~~~~-~~~l~~~~~ 101 (440) .++|.++ ...+++.+. T Consensus 61 Ty~Di~avt~LLEEKEk 77 (307) T pfam04849 61 TYNDIDAVTRLLEEKER 77 (307) T ss_pred CHHHHHHHHHHHHHHHH T ss_conf 40019999999998754 No 267 >KOG1899 consensus Probab=47.74 E-value=1.6 Score=21.77 Aligned_cols=11 Identities=0% Similarity=-0.139 Sum_probs=4.8 Q ss_pred EEEEEEEEEEC Q ss_conf 39999999713 Q gi|254780386|r 362 KFKAIIEKIDP 372 (440) Q Consensus 362 ~~~G~V~~Is~ 372 (440) .+.|.-..+++ T Consensus 418 ~~~g~~~~~~s 428 (861) T KOG1899 418 GNEGANHNYSS 428 (861) T ss_pred CCCCCCCCCCC T ss_conf 77653345676 No 268 >pfam04156 IncA IncA protein. Chlamydia trachomatis is an obligate intracellular bacterium that develops within a parasitophorous vacuole termed an inclusion. The inclusion is non-fusogenic with lysosomes but intercepts lipids from a host cell exocytic pathway. Initiation of chlamydial development is concurrent with modification of the inclusion membrane by a set of C. trachomatis-encoded proteins collectively designated Incs. One of these Incs, IncA, is functionally associated with the homotypic fusion of inclusions. This family probably includes members of the wider Inc family rather than just IncA. Probab=47.45 E-value=16 Score=15.05 Aligned_cols=33 Identities=15% Similarity=0.175 Sum_probs=12.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHH Q ss_conf 988899987434332332100123333222114 Q gi|254780386|r 92 TTDLIDLKKVSIKNLRCHIDTEKSALDFLNNKY 124 (440) Q Consensus 92 ~~~~l~~~~~~l~~l~~~l~~~~~~l~~~~~~~ 124 (440) .+..+.....++.++..........+....... T Consensus 84 ~~~~l~~l~~el~~~~~~~~~~~~~l~~~~~~~ 116 (186) T pfam04156 84 LKQQLSELQEELEDLEERIAELESELEDLKEDL 116 (186) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHH T ss_conf 999999999989989876888744759788789 No 269 >cd04467 S1_aIF5A S1_aIF5A: Archaeal translation Initiation Factor 5A (aIF5A), S1-like RNA-binding domain. aIF5A is a homolog of eukaryotic eIF5A. IF5A is the only protein known to have the unusual amino acid hypusine. Hypusine is a post-translationally modified lysine and is essential for IF5A function. In yeast, eIF5A interacts with components of the 80S ribosome and translation elongation factors 2 (eEF2) in a hypusine-dependent manner. This C-terminal S1 domain resembles the cold-shock domain which binds RNA. Moreover, IF5A prefers binding to the actively translating ribosome. This evidence suggests that IF5A plays a role in translation elongation instead of translation initiation as previously proposed. Probab=46.88 E-value=17 Score=14.99 Aligned_cols=22 Identities=5% Similarity=0.099 Sum_probs=12.8 Q ss_pred EEEEECHHHHHHCCCCCCEEEE Q ss_conf 7753262231101589831799 Q gi|254780386|r 330 IRAKVTPKQIQHVKKGYTATVR 351 (440) Q Consensus 330 v~~~v~~~di~~i~~Gq~v~i~ 351 (440) +++.||+.--+++++|..|.+. T Consensus 27 fe~~ipee~~~~l~~G~eVeY~ 48 (57) T cd04467 27 FEVPIPEEIKDKLEPGKEVEYW 48 (57) T ss_pred EEEECCHHHHHCCCCCCEEEEE T ss_conf 7765887887246799749883 No 270 >KOG0964 consensus Probab=46.11 E-value=1.1 Score=22.91 Aligned_cols=32 Identities=19% Similarity=0.020 Sum_probs=17.9 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEE Q ss_conf 22222122233221222221234555411687 Q gi|254780386|r 269 NQRTFADEMSKLTILEKTREQKTILSPIAGTI 300 (440) Q Consensus 269 ~~~~l~~~~~~l~~l~~~l~~~~I~AP~~G~V 300 (440) .+..+..+.+.+...+..|..+.=++-..|+. T Consensus 479 l~~~i~~~~~dl~~~~~~L~~~~~r~v~nGi~ 510 (1200) T KOG0964 479 LRSLIANLEEDLSRAEKNLRATMNRSVANGID 510 (1200) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHH T ss_conf 99899889999999999998754601442357 No 271 >PRK05124 cysN sulfate adenylyltransferase subunit 1; Provisional Probab=45.68 E-value=17 Score=14.87 Aligned_cols=105 Identities=11% Similarity=0.080 Sum_probs=59.1 Q ss_pred CCCCCCCCCCCCCCCCCCCEEEE-EEECHHHHHHCCCCCCEEEEEEECCC--------------CCCCEEEEEEEEEECC Q ss_conf 22124445542103355650477-53262231101589831799981288--------------7544399999997133 Q gi|254780386|r 309 SNYAQQSQPLMKIVPHSKLTYIR-AKVTPKQIQHVKKGYTATVRFPHYAD--------------IREKKFKAIIEKIDPI 373 (440) Q Consensus 309 G~~v~~G~~l~~Ivp~~~~l~v~-~~v~~~di~~i~~Gq~v~i~~~a~~~--------------~~~~~~~G~V~~Is~~ 373 (440) ++.+..|+.|. +.|+...-.|. ......+......||.|.++++.--+ .....|.++|.+.++. T Consensus 269 sG~i~~GD~V~-vlPsg~~a~Vk~I~~~~~~~~~A~aG~sV~l~L~deiDIsRGdVl~~~~~~p~~s~~F~A~Ivwm~~~ 347 (475) T PRK05124 269 SGVVKVGDRVK-VLPSGKESNVARIVTFDGDLEEAFAGEAITLVLEDEIDISRGDLLVAADEAPPAAQHASADVVWMAEQ 347 (475) T ss_pred ECEECCCCEEE-ECCCCCEEEEEEEEECCCCCCCCCCCCEEEEEEECCCCCCCCCEEECCCCCCCCCCEEEEEEEEECCC T ss_conf 33271799899-93899758989999658652543899879999603446899838973899867025889999995487 Q ss_pred CCCCCCCCCCCEEEEEEE-------E-------ECHHHCCC-CCCEECCCCEEEEEEEECEE Q ss_conf 444677765511899999-------9-------35011256-78515237756999992205 Q gi|254780386|r 374 ISQQDNNLQTQEYYEVIL-------K-------ITDLAFSD-NNIELRNGFPAEVLFTAKNT 420 (440) Q Consensus 374 ~~~~~~~~~~~~~y~v~i-------~-------~~~~~~~~-~~~~L~~Gm~~~~~I~~~~~ 420 (440) +.... ..|.+.+ . ++..+... ....|..+-.|.+.|.+.+. T Consensus 348 pL~~G------~~y~lk~~t~~v~a~i~~I~~kiD~~T~~~~~~~~L~~Ndia~V~i~~~~p 403 (475) T PRK05124 348 PLQPG------QSYDIKIAGKKTRARVDAIRYQVDINTLTQRDVENLPLNGIGLVELTFDEP 403 (475) T ss_pred CCCCC------CEEEEEECCCEEEEEEEEEEEEECCCCCCCCCCCCCCCCCEEEEEEEECCC T ss_conf 67799------868999847499899987347885786671787711899889999998990 No 272 >pfam08614 ATG16 Autophagy protein 16 (ATG16). Autophagy is a ubiquitous intracellular degradation system for eukaryotic cells. During autophagy, cytoplasmic components are enclosed in autophagosomes and delivered to lysosomes/vacuoles. ATG16 (also known as Apg16) has been shown to be bind to Apg5 and is required for the function of the Apg12p-Apg5p conjugate in the yeast autophagy pathway. Probab=44.79 E-value=13 Score=15.82 Aligned_cols=39 Identities=15% Similarity=0.194 Sum_probs=12.2 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 998743433233210012333322211455542235688 Q gi|254780386|r 97 DLKKVSIKNLRCHIDTEKSALDFLNNKYDVLEKAFESRE 135 (440) Q Consensus 97 ~~~~~~l~~l~~~l~~~~~~l~~~~~~~~~~~~~~~~~~ 135 (440) -..+.++..+..........+..+......+...+.... T Consensus 77 ~~lqeEL~el~r~~ge~~q~l~~~n~~l~~l~~~l~~~~ 115 (194) T pfam08614 77 AKLREELTELHKKRGELAQRLLLLNDELEQLRREIQQLE 115 (194) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999999768999999999999999999999889 No 273 >PRK11281 potassium efflux protein KefA; Provisional Probab=44.12 E-value=0.13 Score=28.99 Aligned_cols=26 Identities=12% Similarity=0.019 Sum_probs=10.6 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 89998743433233210012333322 Q gi|254780386|r 95 LIDLKKVSIKNLRCHIDTEKSALDFL 120 (440) Q Consensus 95 ~l~~~~~~l~~l~~~l~~~~~~l~~~ 120 (440) ..++++..++..-.++...+.+++.+ T Consensus 79 ~~~~l~k~i~~aP~~~~~~~~~l~~l 104 (1107) T PRK11281 79 ETVQLKQQLAQAPEKLRQAQAELTAL 104 (1107) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 99999999984489999999999855 No 274 >PRK09174 F0F1 ATP synthase subunit B'; Validated Probab=43.86 E-value=19 Score=14.69 Aligned_cols=31 Identities=10% Similarity=0.071 Sum_probs=16.0 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHH-HHHEE Q ss_conf 5763356861689999999999999998-76406 Q gi|254780386|r 6 WKNLKRNHPASLKRMTLLSIIGIIGLIT-WSILL 38 (440) Q Consensus 6 ~~~~~~~~p~~~~~~~~~~~~~~~~~~~-w~~~~ 38 (440) -|+|. +-.|.--+.|++|.|.+.+++ |-++. T Consensus 44 ~Pqld--~~t~~sQifWL~I~F~~lY~~~sk~~l 75 (204) T PRK09174 44 FPPFD--STHYASQLLWLAITFGLFYLFLSRVIL 75 (204) T ss_pred CCCCC--HHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99998--345089999999999999999999976 No 275 >pfam08317 Spc7 Spc7 kinetochore protein. This domain is found in cell division proteins which are required for kinetochore-spindle association. Probab=43.55 E-value=9 Score=16.81 Aligned_cols=21 Identities=19% Similarity=0.395 Sum_probs=8.1 Q ss_pred CCCCCCHHHHHHHHHHCCCCC Q ss_conf 222221023333222101222 Q gi|254780386|r 185 SSLLRALLSSHQKDLTMMKNL 205 (440) Q Consensus 185 ~~~l~~~~~~~~~el~~~~~l 205 (440) ...+......+..++..++.. T Consensus 178 ~p~l~~~~~~L~~e~~~l~~~ 198 (322) T pfam08317 178 KPKLRKKLQALKEEIANLRQL 198 (322) T ss_pred HHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999999999999973 No 276 >cd03696 selB_II selB_II: this subfamily represents the domain of elongation factor SelB, homologous to domain II of EF-Tu. SelB may function by replacing EF-Tu. In prokaryotes, the incorporation of selenocysteine as the 21st amino acid, encoded by TGA, requires several elements: SelC is the tRNA itself, SelD acts as a donor of reduced selenium, SelA modifies a serine residue on SelC into selenocysteine, and SelB is a selenocysteine-specific translation elongation factor. 3' or 5' non-coding elements of mRNA have been found as probable structures for directing selenocysteine incorporation. Probab=43.49 E-value=19 Score=14.65 Aligned_cols=40 Identities=20% Similarity=0.353 Sum_probs=15.1 Q ss_pred CCCCCCCCCCCCCCCCEEEE-EEECHHHHHHCCCCCCEEEEE Q ss_conf 24445542103355650477-532622311015898317999 Q gi|254780386|r 312 AQQSQPLMKIVPHSKLTYIR-AKVTPKQIQHVKKGYTATVRF 352 (440) Q Consensus 312 v~~G~~l~~Ivp~~~~l~v~-~~v~~~di~~i~~Gq~v~i~~ 352 (440) +..|+.+. |.|.+....|. ..+...+......|+.|.+.+ T Consensus 27 i~~gd~l~-i~P~~~~~~VksI~~~~~~~~~a~aG~~v~i~L 67 (83) T cd03696 27 VKVGDKVE-ILPLGEETRVRSIQVHGKDVEEAKAGDRVALNL 67 (83) T ss_pred CCCCCEEE-ECCCCCEEEEEEEEECCCEECEECCCCEEEEEE T ss_conf 56999999-978773389999998995968897998799997 No 277 >pfam07423 DUF1510 Protein of unknown function (DUF1510). This family consists of several hypothetical bacterial proteins of around 200 residues in length. The function of this family is unknown. Probab=43.43 E-value=14 Score=15.45 Aligned_cols=36 Identities=14% Similarity=0.304 Sum_probs=18.1 Q ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHEEE Q ss_conf 3345763356861689999999999999998764067 Q gi|254780386|r 3 HFMWKNLKRNHPASLKRMTLLSIIGIIGLITWSILLP 39 (440) Q Consensus 3 ~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~w~~~~~ 39 (440) .|..+.-+|..-..+.++|.+++++ |+++.|..|++ T Consensus 2 Rf~~r~k~Rk~n~vLNiaI~iV~ll-Iiiva~~lf~~ 37 (214) T pfam07423 2 RFEQRQKRRKINRVLNIAIGIVVVL-IIIVAYQLFFP 37 (214) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHEEEC T ss_conf 1677777764534557999999999-99976640235 No 278 >pfam12325 TMF_TATA_bd TATA element modulatory factor 1 TATA binding. This is the C-terminal conserved coiled coil region of a family of TATA element modulatory factor 1 proteins conserved in eukaryotes. The proteins bind to the TATA element of some RNA polymerase II promoters and repress their activity. by competing with the binding of TATA binding protein. TMF1_TATA_bd is the most conserved part of the TMFs. TMFs are evolutionarily conserved golgins that bind Rab6, a ubiquitous ras-like GTP-binding Golgi protein, and contribute to Golgi organisation in animal and plant cells. The Rab6-binding domain appears to be the same region as this C-terminal family. Probab=43.35 E-value=15 Score=15.28 Aligned_cols=37 Identities=16% Similarity=0.258 Sum_probs=18.1 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHH Q ss_conf 8899987434332332100123333222114555422 Q gi|254780386|r 94 DLIDLKKVSIKNLRCHIDTEKSALDFLNNKYDVLEKA 130 (440) Q Consensus 94 ~~l~~~~~~l~~l~~~l~~~~~~l~~~~~~~~~~~~~ 130 (440) ..++...+++..++.++...+.++..+..+.+.+..+ T Consensus 17 ~lve~l~s~lr~lE~E~~~lk~el~~l~~~r~~~~~E 53 (121) T pfam12325 17 QLVERLSSTLRRLEGELASLKDELARLEAERDEARQE 53 (121) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999999869989999999999999999999999 No 279 >TIGR03017 EpsF chain length determinant protein EpsF. Sequences in this family of proteins are members of the chain length determinant family (pfam02706) which includes the wzc protein from E.coli. This family of proteins are homologous to the EpsF protein of the methanolan biosynthesis operon of Methylobacillus species strain 12S. The distribution of this protein appears to be restricted to a subset of exopolysaccharide operons containing a syntenic grouping of genes including a variant of the EpsH exosortase protein. Exosortase has been proposed to be involved in the targetting and processing of proteins containing the PEP-CTERM domain to the exopolysaccharide layer. Probab=43.18 E-value=19 Score=14.62 Aligned_cols=27 Identities=19% Similarity=0.285 Sum_probs=12.0 Q ss_pred HHHHHHHHHHHHHEEEEEEEEEEEEEEEE Q ss_conf 99999999987640671258999799995 Q gi|254780386|r 24 SIIGIIGLITWSILLPIEISVSSSGEILN 52 (440) Q Consensus 24 ~~~~~~~~~~w~~~~~~~~~v~~~G~i~~ 52 (440) +++++++.++|+++.| .+-.++++|.. T Consensus 21 ~~~~~~~~~~~~~~~p--~~Y~a~a~l~v 47 (444) T TIGR03017 21 LLITVTTTAVVSLLLP--KEYTATTSVVL 47 (444) T ss_pred HHHHHHHHHHHHHHCC--CEEEEEEEEEE T ss_conf 9999999999997089--73699899998 No 280 >pfam11604 CusF_Ec Copper binding periplasmic protein CusF. CusF is a periplasmic protein involved in copper and silver resistance in Escherichia coil. CusF forms a five-stranded beta-barrel OB fold. Cu(I) binds to H36, M47 and M49 which are conserved residues in the protein. Probab=43.12 E-value=19 Score=14.61 Aligned_cols=54 Identities=13% Similarity=0.228 Sum_probs=30.2 Q ss_pred EEEECCEEECCCCCCCCCCCCCCCCCCCCCEEEEEEEC-HHHHHHCCCCCCEEEEEEE Q ss_conf 68731212012221244455421033556504775326-2231101589831799981 Q gi|254780386|r 298 GTIVYNQSFSSSNYAQQSQPLMKIVPHSKLTYIRAKVT-PKQIQHVKKGYTATVRFPH 354 (440) Q Consensus 298 G~V~~l~~~~~G~~v~~G~~l~~Ivp~~~~l~v~~~v~-~~di~~i~~Gq~v~i~~~a 354 (440) |+|..+.. ..|.+.=.=.+|-.+ +-..+.-...+. ..++..+++|+.|.+.+.- T Consensus 1 G~V~~vd~-~~~~iti~H~~I~~l--~wPaMTM~F~v~~~~~l~~l~~G~~V~F~~~~ 55 (61) T pfam11604 1 GVVKSVDA-EAGKVTISHGPIPEL--GWPAMTMDFAVADPADLAGLKPGDKVRFEFEK 55 (61) T ss_pred CEEEEECC-CCCEEEEECCCCCCC--CCCCEEEEEECCCHHHHHCCCCCCEEEEEEEE T ss_conf 97999807-899799965673226--98754888783796687449999989999997 No 281 >TIGR02728 spore_gerQ spore coat protein GerQ; InterPro: IPR014099 Members of this protein family are the spore coat protein GerQ of endospore-forming Firmicutes (low GC Gram-positive bacteria). This protein is cross-linked by a spore coat-associated transglutaminase.. Probab=42.58 E-value=19 Score=14.56 Aligned_cols=64 Identities=19% Similarity=0.261 Sum_probs=49.1 Q ss_pred HHHHHCCCCCCEEEEEEECCCCCC--CEEEEEEEEEECCCCCCCCCCCCCEEEEEEEEECHHHCCC Q ss_conf 231101589831799981288754--4399999997133444677765511899999935011256 Q gi|254780386|r 337 KQIQHVKKGYTATVRFPHYADIRE--KKFKAIIEKIDPIISQQDNNLQTQEYYEVILKITDLAFSD 400 (440) Q Consensus 337 ~di~~i~~Gq~v~i~~~a~~~~~~--~~~~G~V~~Is~~~~~~~~~~~~~~~y~v~i~~~~~~~~~ 400 (440) ..|-++.-|..++|..+==+...+ ..|.|.|..=++|-.--....++..|....|.+|-.+|.+ T Consensus 9 ENILR~NRG~~aT~Y~Tf~N~~~~~~~~FRG~iE~AgRDHi~~~d~~s~~RyllL~v~lDY~TFD~ 74 (82) T TIGR02728 9 ENILRMNRGKTATVYMTFENSKEWAAKVFRGVIEEAGRDHIVISDPKSGRRYLLLMVYLDYVTFDE 74 (82) T ss_pred EEEEECCCCCEEEEEEEEECCCCCCCEEECCEECCCCCCEEEEECCCCCCEEEEEEEEEEEEECCC T ss_conf 100011688615789885088000422202212046677588815999824898888753332267 No 282 >KOG3925 consensus Probab=42.35 E-value=19 Score=14.57 Aligned_cols=97 Identities=15% Similarity=0.169 Sum_probs=47.8 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCC-EEEEECCEEECCCC--CCCCCCCCCCCCCCCCCEEEEEEECHHHHHHCC-CCCCEE Q ss_conf 12223322122222123455541-16873121201222--124445542103355650477532622311015-898317 Q gi|254780386|r 274 ADEMSKLTILEKTREQKTILSPI-AGTIVYNQSFSSSN--YAQQSQPLMKIVPHSKLTYIRAKVTPKQIQHVK-KGYTAT 349 (440) Q Consensus 274 ~~~~~~l~~l~~~l~~~~I~AP~-~G~V~~l~~~~~G~--~v~~G~~l~~Ivp~~~~l~v~~~v~~~di~~i~-~Gq~v~ 349 (440) +.+..-|... ..+.+.+. .++ -|+|..... ..|. ++..| |.=+++|+ ++.|. +|..|+ T Consensus 163 ag~lppLd~~-hh~~q~e~-~~YREGvvi~~~~-~~g~~~~in~g------------L~k~v~I~---~~~v~~p~~Rvt 224 (371) T KOG3925 163 AGLLPPLDTP-HHLRQDEK-FRYREGVVIKRKS-PAGHGYIINAG------------LKKEVYID---SKAVEAPGTRVT 224 (371) T ss_pred CCCCCCCCCH-HHHHHHHH-HCCCCCEEECCCC-CCCCCCEECCC------------CEEEEEEC---CHHHCCCCCEEE T ss_conf 1367998866-77645333-1311434643678-77776430234------------30237865---500037762478 Q ss_pred EEEEECCCCCCCEEEEEEEEEECCCCCCCCCCCCCEE-EEEEEEEC Q ss_conf 9998128875443999999971334446777655118-99999935 Q gi|254780386|r 350 VRFPHYADIREKKFKAIIEKIDPIISQQDNNLQTQEY-YEVILKIT 394 (440) Q Consensus 350 i~~~a~~~~~~~~~~G~V~~Is~~~~~~~~~~~~~~~-y~v~i~~~ 394 (440) |.+.+- ..+-+++.|+|. ||.... ...|..+ |.|++.-. T Consensus 225 V~l~aq-~~~~~t~~g~IV--Sp~~pr---~~tGlywGY~VR~ass 264 (371) T KOG3925 225 VRLNAQ-TVTCDTLRGKIV--SPQDPR---EETGLYWGYKVRLASS 264 (371) T ss_pred EECCCC-CCHHHHHCCCCC--CCCCCC---CCCCCCCCEEEEHHHH T ss_conf 753776-750341115134--887751---0048644146554665 No 283 >COG2433 Uncharacterized conserved protein [Function unknown] Probab=42.32 E-value=20 Score=14.53 Aligned_cols=10 Identities=30% Similarity=0.375 Sum_probs=4.0 Q ss_pred HCCCCCCEEE Q ss_conf 0158983179 Q gi|254780386|r 341 HVKKGYTATV 350 (440) Q Consensus 341 ~i~~Gq~v~i 350 (440) .|+.|+-+-+ T Consensus 541 gik~GDvi~v 550 (652) T COG2433 541 GIKEGDVILV 550 (652) T ss_pred CCCCCCEEEE T ss_conf 4466838999 No 284 >TIGR03593 yidC_nterm membrane protein insertase, YidC/Oxa1 family, N-terminal domain. Essentially all bacteria have a member of the YidC family, whose C-terminal domain is modeled by TIGR03592. The two copies are found in endospore-forming bacteria such as Bacillus subtilis appear redundant during vegetative growth, although the member designated spoIIIJ (stage III sporulation protein J) has a distinct role in spore formation. YidC, its mitochondrial homolog Oxa1, and its chloroplast homolog direct insertion into the bacterial/organellar inner (or only) membrane. This model describes an N-terminal sequence region, including a large periplasmic domain lacking in YidC members from Gram-positive species. The multifunctional YidC protein acts both with and independently of the Sec system. Probab=41.96 E-value=20 Score=14.55 Aligned_cols=23 Identities=13% Similarity=0.317 Sum_probs=15.7 Q ss_pred HHHHHHHHHHHHHHHHHHHEEEE Q ss_conf 99999999999999987640671 Q gi|254780386|r 18 KRMTLLSIIGIIGLITWSILLPI 40 (440) Q Consensus 18 ~~~~~~~~~~~~~~~~w~~~~~~ 40 (440) |++++.++++++++++|.+|.+. T Consensus 2 kn~lia~~Ls~~ili~~~~~~~~ 24 (366) T TIGR03593 2 RRLILAIALSFVIFLLWQAWQVD 24 (366) T ss_pred CHHHHHHHHHHHHHHHHHHHCCC T ss_conf 10369999999999999997077 No 285 >PRK09173 F0F1 ATP synthase subunit B; Validated Probab=41.90 E-value=20 Score=14.49 Aligned_cols=18 Identities=6% Similarity=0.093 Sum_probs=8.7 Q ss_pred CHHHHHHHHHHHHHHHHH Q ss_conf 616899999999999999 Q gi|254780386|r 14 PASLKRMTLLSIIGIIGL 31 (440) Q Consensus 14 p~~~~~~~~~~~~~~~~~ 31 (440) |.||..+.++++++++.+ T Consensus 3 ~~FW~~I~fvi~~~ll~~ 20 (159) T PRK09173 3 ATFWAFVGLVLFLALVVY 20 (159) T ss_pred CHHHHHHHHHHHHHHHHH T ss_conf 129999999999999999 No 286 >CHL00006 consensus Probab=41.44 E-value=20 Score=14.44 Aligned_cols=20 Identities=10% Similarity=0.092 Sum_probs=18.2 Q ss_pred EEECCCCCEEECCCEEEEEC Q ss_conf 99857898933697899996 Q gi|254780386|r 69 KFQVENGSQILQGTPLLTFE 88 (440) Q Consensus 69 ~i~VkeGd~VkkGq~L~~ld 88 (440) -++|+.|+.|+++|+|+++. T Consensus 405 lL~V~n~q~V~s~QvIAEi~ 424 (1372) T CHL00006 405 FLLVQNDQYVESEQVIAEIR 424 (1372) T ss_pred EEEEECCCEEECCCEEEEEE T ss_conf 89998995885052899982 No 287 >PRK11519 tyrosine kinase; Provisional Probab=41.24 E-value=20 Score=14.42 Aligned_cols=28 Identities=4% Similarity=0.062 Sum_probs=11.8 Q ss_pred HHHHHHHHHHHHHHEEEEEEEEEEEEEEEE Q ss_conf 999999999987640671258999799995 Q gi|254780386|r 23 LSIIGIIGLITWSILLPIEISVSSSGEILN 52 (440) Q Consensus 23 ~~~~~~~~~~~w~~~~~~~~~v~~~G~i~~ 52 (440) ++++++++.++|.++++--. .+++.|.. T Consensus 37 ~~~i~~~~a~~y~~~~tP~Y--~A~a~v~I 64 (720) T PRK11519 37 ITAVFALCAVVYTFFATPIY--SADALVQI 64 (720) T ss_pred HHHHHHHHHHHHHHCCCCEE--EEEEEEEE T ss_conf 99999999999995189847--88899999 No 288 >PRK10512 selenocysteinyl-tRNA-specific translation factor; Provisional Probab=41.06 E-value=21 Score=14.41 Aligned_cols=11 Identities=45% Similarity=0.474 Sum_probs=5.7 Q ss_pred CEEECCCEEEE Q ss_conf 89336978999 Q gi|254780386|r 76 SQILQGTPLLT 86 (440) Q Consensus 76 d~VkkGq~L~~ 86 (440) +.|+.||+|+. T Consensus 248 ~~i~RGd~L~~ 258 (615) T PRK10512 248 EQINRGDWLLA 258 (615) T ss_pred HHCCCCCEEEC T ss_conf 23178668923 No 289 >cd00516 PRTase_typeII Phosphoribosyltransferase (PRTase) type II; This family contains two enzymes that play an important role in NAD production by either allowing quinolinic acid (QA) , quinolinate phosphoribosyl transferase (QAPRTase), or nicotinic acid (NA), nicotinate phosphoribosyltransferase (NAPRTase), to be used in the synthesis of NAD. QAPRTase catalyses the reaction of quinolinic acid (QA) with 5-phosphoribosyl-1-pyrophosphate (PRPP) in the presence of Mg2+ to produce nicotinic acid mononucleotide (NAMN), pyrophosphate and carbon dioxide, an important step in the de novo synthesis of NAD. NAPRTase catalyses a similar reaction leading to NAMN and pyrophosphate, using nicotinic acid an PPRP as substrates, used in the NAD salvage pathway. Probab=40.96 E-value=21 Score=14.40 Aligned_cols=28 Identities=32% Similarity=0.506 Sum_probs=24.0 Q ss_pred CCEEEEEECCCCCEEECCCEEEEECHHH Q ss_conf 8689999857898933697899996489 Q gi|254780386|r 64 SGIIKKFQVENGSQILQGTPLLTFEDIE 91 (440) Q Consensus 64 ~G~V~~i~VkeGd~VkkGq~L~~ld~~~ 91 (440) ++......+++|+.|++|++++++...- T Consensus 48 ~~~~~~~~~~eG~~v~~~~~il~i~G~~ 75 (281) T cd00516 48 PGPLVILAVPEGTVVEPGEPLLTIEGPA 75 (281) T ss_pred CCCEEEEECCCCCEECCCCEEEEEEECH T ss_conf 9847999779999926998899999839 No 290 >cd04471 S1_RNase_R S1_RNase_R: RNase R C-terminal S1 domain. RNase R is a processive 3' to 5' exoribonuclease, which is a homolog of RNase II. RNase R degrades RNA with secondary structure having a 3' overhang of at least 7 nucleotides. RNase R and PNPase play an important role in the degradation of RNA with extensive secondary structure, such as rRNA, tRNA, and certain mRNA which contains repetitive extragenic palindromic sequences. The C-terminal S1 domain binds ssRNA. Probab=40.90 E-value=21 Score=14.39 Aligned_cols=17 Identities=18% Similarity=0.145 Sum_probs=10.5 Q ss_pred CCEEEEECCEEECCCCCCC Q ss_conf 4116873121201222124 Q gi|254780386|r 295 PIAGTIVYNQSFSSSNYAQ 313 (440) Q Consensus 295 P~~G~V~~l~~~~~G~~v~ 313 (440) .++|+|..++. -|-+|. T Consensus 4 ~f~g~I~~v~~--~GiFV~ 20 (83) T cd04471 4 EFDGVISGVTS--FGLFVE 20 (83) T ss_pred EEEEEEEEEEE--EEEEEE T ss_conf 99999998983--069999 No 291 >TIGR00531 BCCP acetyl-CoA carboxylase, biotin carboxyl carrier protein; InterPro: IPR001249 The proteins in this family are a component of the acetyl coenzyme A carboxylase complex (6.4.1.2 from EC) and are involved in the first step in long-chain fatty acid synthesis. In plants this is usually located in the chloroplast. In the first step, biotin carboxylase catalyses the carboxylation of the carrier protein to form an intermediate. Next, the transcarboxylase complex transfers the carboxyl group from the intermediate to acetyl-CoA forming malonyl-CoA. This protein functions in the transfer of CO_2 from one site to another, the biotin binding site locates to the C-terminal of this protein. The biotin is specifically attached to a lysine residue in the sequence AMKM. The structure of the C-terminal domain of the biotin carboxyl carrier (BCC) protein was shown to be a flattened beta-barrel structure comprising two four-stranded beta sheets interrupted by a structural loop forming a thumb structure. The biotinyl-lysine is located on a tight beta-turn on the opposite end of the molecule. The thumb structure has been shown to attached biotin, thus stabilising the structure. ; GO: 0003989 acetyl-CoA carboxylase activity, 0006633 fatty acid biosynthetic process, 0009317 acetyl-CoA carboxylase complex. Probab=40.83 E-value=21 Score=14.38 Aligned_cols=34 Identities=15% Similarity=0.240 Sum_probs=17.5 Q ss_pred CCCCCCCCCCCCCEEEEECCEEECCCCCCCCCCC Q ss_conf 2222123455541168731212012221244455 Q gi|254780386|r 284 EKTREQKTILSPIAGTIVYNQSFSSSNYAQQSQP 317 (440) Q Consensus 284 ~~~l~~~~I~AP~~G~V~~l~~~~~G~~v~~G~~ 317 (440) ..+..-..+++|.-|+...-..-..+.++..|+. T Consensus 77 ~~~~~~~~~~~p~~gtfy~~p~p~~~p~~~~g~~ 110 (159) T TIGR00531 77 PADEKGHLVRSPMVGTFYRAPSPDAKPFVEVGDK 110 (159) T ss_pred CCCCCCCEEECCCCCCEECCCCCCCCCEEEECCE T ss_conf 1102330100442110002788886640210550 No 292 >TIGR00575 dnlj DNA ligase, NAD-dependent; InterPro: IPR001679 DNA ligase (polydeoxyribonucleotide synthase) is the enzyme that joins two DNA fragments by catalyzing the formation of an internucleotide ester bond between phosphate and deoxyribose. It is active during DNA replication, DNA repair and DNA recombination. There are two forms of DNA ligase: one requires ATP (6.5.1.1 from EC), the other NAD (6.5.1.2 from EC). This family is predominantly composed of NAD-dependent bacterial DNA ligases. They are proteins of about 75 to 85 Kd whose sequence is well conserved , . They also show similarity to yicF, an Escherichia coli hypothetical protein of 63 Kd.; GO: 0003911 DNA ligase (NAD+) activity, 0006260 DNA replication, 0006281 DNA repair. Probab=40.47 E-value=21 Score=14.35 Aligned_cols=13 Identities=23% Similarity=0.238 Sum_probs=6.9 Q ss_pred CCCCEEEEECCEE Q ss_conf 5541168731212 Q gi|254780386|r 293 LSPIAGTIVYNQS 305 (440) Q Consensus 293 ~AP~~G~V~~l~~ 305 (440) ---+||+|..++. T Consensus 287 ~y~IDGvV~Kvd~ 299 (706) T TIGR00575 287 PYEIDGVVVKVDD 299 (706) T ss_pred CCCCCCEEEEECC T ss_conf 8730437898378 No 293 >PRK04778 septation ring formation regulator EzrA; Provisional Probab=40.22 E-value=6.6 Score=17.71 Aligned_cols=13 Identities=23% Similarity=0.253 Sum_probs=4.7 Q ss_pred HHHHHHHHHHHHH Q ss_conf 9999999999876 Q gi|254780386|r 23 LSIIGIIGLITWS 35 (440) Q Consensus 23 ~~~~~~~~~~~w~ 35 (440) +++++++++++|. T Consensus 9 iivlvii~~~~~~ 21 (569) T PRK04778 9 IVVIIIIAYLVGY 21 (569) T ss_pred HHHHHHHHHHHHH T ss_conf 9999999999999 No 294 >pfam02706 Wzz Chain length determinant protein. This family includes proteins involved in lipopolysaccharide (lps) biosynthesis. This family comprises the whole length of chain length determinant protein (or wzz protein) that confers a modal distribution of chain length on the O-antigen component of lps. This region is also found as part of bacterial tyrosine kinases. Probab=40.11 E-value=21 Score=14.31 Aligned_cols=33 Identities=15% Similarity=0.333 Sum_probs=14.6 Q ss_pred HHHHHHHHHHHHHHHHHEEEEEEEEEEEEEEEECC Q ss_conf 99999999999998764067125899979999538 Q gi|254780386|r 20 MTLLSIIGIIGLITWSILLPIEISVSSSGEILNED 54 (440) Q Consensus 20 ~~~~~~~~~~~~~~w~~~~~~~~~v~~~G~i~~~~ 54 (440) +++++++++++.++|+++.+-.. .++..|.+.. T Consensus 19 i~~~~~~~~~~~~~~~~~~~p~Y--~A~a~i~v~~ 51 (139) T pfam02706 19 IILVTLLFALLAAAYAFLAPPKY--TATAQILVPQ 51 (139) T ss_pred HHHHHHHHHHHHHHHHHCCCCCE--EEEEEEEEEC T ss_conf 99999999999999984388867--9999999825 No 295 >PRK10629 hypothetical protein; Provisional Probab=39.95 E-value=21 Score=14.29 Aligned_cols=77 Identities=21% Similarity=0.254 Sum_probs=38.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHEEEEEEEEEE----EEEEEECCCEEEEECCCCCE-EEEEECCCCCEEECCCEEEEE Q ss_conf 8616899999999999999987640671258999----79999538738984688868-999985789893369789999 Q gi|254780386|r 13 HPASLKRMTLLSIIGIIGLITWSILLPIEISVSS----SGEILNEDNVVEIKSPFSGI-IKKFQVENGSQILQGTPLLTF 87 (440) Q Consensus 13 ~p~~~~~~~~~~~~~~~~~~~w~~~~~~~~~v~~----~G~i~~~~~~~~I~s~~~G~-V~~i~VkeGd~VkkGq~L~~l 87 (440) +++...+++.+.++..+.+++|..+..-+.+... .|.-.|+|=..--.=-..|+ ++.|-- +.|.+|+++ T Consensus 7 ~~r~~~w~~~~~~ll~~~~l~~~~~~~~es~l~ir~~~qg~~~pdgf~~~q~Lda~GI~~KSIt~------en~~lLIRf 80 (127) T PRK10629 7 SLRQLAWILAVLLLLLALLLAWSALRQQESTLAIRAVQQGASLPDGFSVYHHLDANGIHFKSITP------ENDSLLITF 80 (127) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEEEEEECCCCCCCCCCCHHHHHHHHCCCCEEEECC------CCCEEEEEE T ss_conf 47788999999999999999999997332115875546898799823489988766973256521------598089997 Q ss_pred CHHHHHHH Q ss_conf 64899888 Q gi|254780386|r 88 EDIETTDL 95 (440) Q Consensus 88 d~~~~~~~ 95 (440) ++++.+.. T Consensus 81 ~s~eqQl~ 88 (127) T PRK10629 81 DSPEQSAA 88 (127) T ss_pred CCHHHHHH T ss_conf 99899999 No 296 >PRK00226 greA transcription elongation factor GreA; Reviewed Probab=39.73 E-value=15 Score=15.35 Aligned_cols=31 Identities=10% Similarity=0.144 Sum_probs=20.5 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 2222222212223322122222123455541 Q gi|254780386|r 266 LEQNQRTFADEMSKLTILEKTREQKTILSPI 296 (440) Q Consensus 266 l~e~~~~l~~~~~~l~~l~~~l~~~~I~AP~ 296 (440) ...++.+...+..++..++..|.++.|..|. T Consensus 49 Y~aAke~q~~~e~rI~~Le~~L~~A~iid~~ 79 (157) T PRK00226 49 YHAAKEEQGFIEGRIRELEDKLSNAEVIDPT 79 (157) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 9999999999999999999999850145866 No 297 >PRK11144 modC molybdate transporter ATP-binding protein; Provisional Probab=39.03 E-value=22 Score=14.20 Aligned_cols=47 Identities=15% Similarity=0.135 Sum_probs=28.7 Q ss_pred CEEEEEEEEEECCCCCCCCCCCCCEEEEEEEEECHHHC-------CCCCCEECCCCEEEEEEEE Q ss_conf 43999999971334446777655118999999350112-------5678515237756999992 Q gi|254780386|r 361 KKFKAIIEKIDPIISQQDNNLQTQEYYEVILKITDLAF-------SDNNIELRNGFPAEVLFTA 417 (440) Q Consensus 361 ~~~~G~V~~Is~~~~~~~~~~~~~~~y~v~i~~~~~~~-------~~~~~~L~~Gm~~~~~I~~ 417 (440) ..|+|+|..|-+.. ..+.|.+.+....+ .-....|.||+++-|.|+. T Consensus 294 N~l~~~V~~i~~~g----------~~v~v~l~~gg~~l~A~IT~~s~~~L~L~~G~~V~a~iKa 347 (352) T PRK11144 294 NILRAKVVEIYDDN----------GQVEVKLEVGGKTLWARITPWARDELALKPGQWVYAQIKS 347 (352) T ss_pred CEEEEEEEEEEECC----------CEEEEEEEECCCEEEEEECHHHHHHCCCCCCCEEEEEEEE T ss_conf 18999999999879----------9899999959958999978899986799999999999988 No 298 >TIGR02977 phageshock_pspA phage shock protein A; InterPro: IPR014319 Members of this protein are the phage shock protein PspA, from the phage shock operon. PspA appears to maintain the proton motive force under stress conditions that include over expression of certain phage secretins, heat shock, ethanol and protein export defects.. Probab=38.80 E-value=22 Score=14.18 Aligned_cols=12 Identities=33% Similarity=0.266 Sum_probs=4.0 Q ss_pred CCCCHHHHHHHH Q ss_conf 111002466653 Q gi|254780386|r 216 LNQQERIVYKST 227 (440) Q Consensus 216 l~~~~~~~~~~~ 227 (440) +...++.+.... T Consensus 168 Fe~~ERrvD~lE 179 (222) T TIGR02977 168 FEQLERRVDELE 179 (222) T ss_pred HHHHHHHHHHHH T ss_conf 999988799998 No 299 >PRK00723 phosphatidylserine decarboxylase; Provisional Probab=37.73 E-value=23 Score=14.07 Aligned_cols=20 Identities=15% Similarity=0.180 Sum_probs=11.7 Q ss_pred CCCCCCEEEEECCEEECCCCC Q ss_conf 455541168731212012221 Q gi|254780386|r 291 TILSPIAGTIVYNQSFSSSNY 311 (440) Q Consensus 291 ~I~AP~~G~V~~l~~~~~G~~ 311 (440) .+-+|+||+|...... +|.+ T Consensus 168 R~H~P~~G~i~~~~~I-~G~l 187 (297) T PRK00723 168 RFHFPDDGICEETRKI-NGYY 187 (297) T ss_pred EEECCCCEEEEEEEEE-CCEE T ss_conf 3753566189899996-8745 No 300 >PRK07193 fliF flagellar MS-ring protein; Reviewed Probab=37.58 E-value=23 Score=14.05 Aligned_cols=32 Identities=22% Similarity=0.385 Sum_probs=16.7 Q ss_pred CCCCCCCCCCHHHHHHHH--HHHHHHHHHHHHHH Q ss_conf 457633568616899999--99999999998764 Q gi|254780386|r 5 MWKNLKRNHPASLKRMTL--LSIIGIIGLITWSI 36 (440) Q Consensus 5 ~~~~~~~~~p~~~~~~~~--~~~~~~~~~~~w~~ 36 (440) .|++|....++.+-+++. +++++++++++|+. T Consensus 16 ~w~~~~~~~~~k~~~i~~~a~~v~~~~~l~~~~~ 49 (550) T PRK07193 16 KWSPFRLRGNRKLILLALLALLVAAAIVLSLWRS 49 (550) T ss_pred HHHHCCCCCCCEEHHHHHHHHHHHHHHHHHHHHC T ss_conf 9985042567230479999999999999986417 No 301 >PRK08717 consensus Probab=37.41 E-value=23 Score=14.04 Aligned_cols=36 Identities=11% Similarity=0.283 Sum_probs=21.4 Q ss_pred CCCCCCCCCCCC--CCHHHHHHHHHHHH-HHHHHHHHHH Q ss_conf 963345763356--86168999999999-9999998764 Q gi|254780386|r 1 MHHFMWKNLKRN--HPASLKRMTLLSII-GIIGLITWSI 36 (440) Q Consensus 1 ~~~~~~~~~~~~--~p~~~~~~~~~~~~-~~~~~~~w~~ 36 (440) |.+|-.|.++.. ||+|+-..+.+.++ .++.++.|.. T Consensus 1 ~~~~~~~~f~~~~l~pkyw~~wl~~~~l~~l~slLP~~~ 39 (311) T PRK08717 1 MDKYPKPEFSLSLLHPKYWGVWLGFGLLAILVNLLPYKV 39 (311) T ss_pred CCCCCCCCCCHHHHCHHHHHHHHHHHHHHHHHHHCCHHH T ss_conf 998899985026508057999999999999999787999 No 302 >COG0688 Psd Phosphatidylserine decarboxylase [Lipid metabolism] Probab=37.02 E-value=24 Score=14.01 Aligned_cols=39 Identities=10% Similarity=0.137 Sum_probs=23.4 Q ss_pred CCCCCCEEEEECCEEECCCCCCCCCCCCCCCCCCCCCEEEEE Q ss_conf 455541168731212012221244455421033556504775 Q gi|254780386|r 291 TILSPIAGTIVYNQSFSSSNYAQQSQPLMKIVPHSKLTYIRA 332 (440) Q Consensus 291 ~I~AP~~G~V~~l~~~~~G~~v~~G~~l~~Ivp~~~~l~v~~ 332 (440) ..+||+||+|..+.. -+|.+......- +...|....+.+ T Consensus 126 r~haP~~G~i~~~~~-~~G~~~~v~~~~--~~~~NER~~~~i 164 (239) T COG0688 126 RNHAPVDGTIIEVRY-VPGKFFSANLDK--AFTENERNSVLI 164 (239) T ss_pred EEECCCCCEEEEEEE-ECCCEECCCHHH--HHCCCCEEEEEE T ss_conf 274797879999999-889253227004--541561699999 No 303 >PRK00624 glycine cleavage system protein H; Provisional Probab=36.57 E-value=24 Score=13.95 Aligned_cols=35 Identities=9% Similarity=0.159 Sum_probs=16.6 Q ss_pred EEEEEEEEEECCCCCCCC---CCCCCEEEEEEEEECHH Q ss_conf 399999997133444677---76551189999993501 Q gi|254780386|r 362 KFKAIIEKIDPIISQQDN---NLQTQEYYEVILKITDL 396 (440) Q Consensus 362 ~~~G~V~~Is~~~~~~~~---~~~~~~~y~v~i~~~~~ 396 (440) ++.|+|..|-.....++. ....+.-..+++++++. T Consensus 66 PvsG~VvevN~~l~~~P~liN~dpy~~GWl~~vk~~d~ 103 (113) T PRK00624 66 PVSGEVIEVNLALEDDIQPINNAPESEGWFVVVQLDED 103 (113) T ss_pred CCCEEEEEEHHHHHHCHHHHCCCCCCCCEEEEEEECCH T ss_conf 86649998578554498987289777963899998986 No 304 >PRK00882 consensus Probab=36.34 E-value=21 Score=14.28 Aligned_cols=20 Identities=10% Similarity=0.008 Sum_probs=11.5 Q ss_pred CCCCCCEEEEECCEEECCCCC Q ss_conf 455541168731212012221 Q gi|254780386|r 291 TILSPIAGTIVYNQSFSSSNY 311 (440) Q Consensus 291 ~I~AP~~G~V~~l~~~~~G~~ 311 (440) .+-+|++|+|...... +|.+ T Consensus 167 R~H~P~~G~v~~~~~I-~G~l 186 (295) T PRK00882 167 RFHFIDSGICSETKKI-KGFY 186 (295) T ss_pred EEECCCCEEEEEEEEE-CCEE T ss_conf 4852566399999998-9864 No 305 >COG4093 Uncharacterized protein conserved in bacteria [Function unknown] Probab=35.68 E-value=25 Score=13.86 Aligned_cols=26 Identities=23% Similarity=0.226 Sum_probs=16.9 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 56861689999999999999998764 Q gi|254780386|r 11 RNHPASLKRMTLLSIIGIIGLITWSI 36 (440) Q Consensus 11 ~~~p~~~~~~~~~~~~~~~~~~~w~~ 36 (440) -+.+.+.|+++|+++++.+.+.+|++ T Consensus 7 a~~~~~rkr~~wl~i~ivv~~g~ySa 32 (338) T COG4093 7 APQSATRKRLFWLVIAIVVLIGAYSA 32 (338) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHCC T ss_conf 89986432234899999998877501 No 306 >PRK04599 consensus Probab=35.66 E-value=18 Score=14.73 Aligned_cols=25 Identities=12% Similarity=0.097 Sum_probs=17.3 Q ss_pred CCCCCCEEEEECCEEECCCCCCCCCC Q ss_conf 45554116873121201222124445 Q gi|254780386|r 291 TILSPIAGTIVYNQSFSSSNYAQQSQ 316 (440) Q Consensus 291 ~I~AP~~G~V~~l~~~~~G~~v~~G~ 316 (440) .+-+|+||.|..... -+|..-+... T Consensus 144 R~H~P~~g~i~~~~~-ipG~L~sVnp 168 (285) T PRK04599 144 RVHMPCDGTLRQMIY-VPGDLFSVNP 168 (285) T ss_pred CEECCCCCEEEEEEE-CCCCCCCCCH T ss_conf 163455668979998-5886666697 No 307 >cd01324 cbb3_Oxidase_CcoQ Cytochrome cbb oxidase CcoQ. Cytochrome cbb3 oxidase, the terminal oxidase in the respiratory chains of proteobacteria, is a multi-chain transmembrane protein located in the cell membrane. Like other cytochrome oxidases, it catalyzes the reduction of O2 and simultaneously pumps protons across the membrane. Found exclusively in proteobacteria, cbb3 is believed to be a modern enzyme that has evolved independently to perform a specialized function in microaerobic energy metabolism. The cbb3 operon contains four genes (ccoNOQP or fixNOQP), with ccoN coding for subunit I. Instead of a CuA-containing subunit II analogous to other cytochrome oxidases, cbb3 utilizes subunits ccoO and ccoP, which contain one and two hemes, respectively, to transfer electrons to the binuclear center. ccoQ, the fourth subunit, is a single transmembrane helix protein. It has been shown to protect the core complex from proteolytic degradation by serine proteases. See cd00919, cd01322 Probab=35.57 E-value=25 Score=13.85 Aligned_cols=19 Identities=16% Similarity=0.382 Sum_probs=12.7 Q ss_pred HHHHHHHHHHHHHHHHHHE Q ss_conf 9999999999999987640 Q gi|254780386|r 19 RMTLLSIIGIIGLITWSIL 37 (440) Q Consensus 19 ~~~~~~~~~~~~~~~w~~~ 37 (440) +.+++.+++|+|.++|++. T Consensus 14 WgLl~~~~~Figvv~wa~~ 32 (48) T cd01324 14 WGLLYLALFFLGVVVWAFR 32 (48) T ss_pred HHHHHHHHHHHHHHHHHCC T ss_conf 9999999999999875058 No 308 >pfam06818 Fez1 Fez1. This family represents the eukaryotic Fez1 protein. Fez1 contains a leucine-zipper region with similarity to the DNA-binding domain of the cAMP-responsive activating-transcription factor 5. There is evidence that Fez1 inhibits cancer cell growth through regulation of mitosis, and that its alterations result in abnormal cell growth. Note that some family members contain more than one copy of this region. Probab=35.31 E-value=7 Score=17.52 Aligned_cols=16 Identities=19% Similarity=0.293 Sum_probs=5.3 Q ss_pred HHHHHHHHHHHHHHHC Q ss_conf 9987434332332100 Q gi|254780386|r 97 DLKKVSIKNLRCHIDT 112 (440) Q Consensus 97 ~~~~~~l~~l~~~l~~ 112 (440) ..++.++....+++.. T Consensus 13 sLLKqQLkesq~e~~~ 28 (199) T pfam06818 13 SLLKQQLKESQAELAQ 28 (199) T ss_pred HHHHHHHHHHHHHHHH T ss_conf 9999988999999987 No 309 >TIGR02163 napH_ ferredoxin-type protein, NapH/MauN family; InterPro: IPR011886 Most members of this family are the NapH protein, found next to NapG,in operons that encode the periplasmic nitrate reductase. Some species with this reductase lack NapC but accomplish electron transfer to NapAB in some other manner, likely to involve NapH, NapG, and/or some other protein. A few members of this protein are designated MauN and are found in methylamine utilization operons in species that appear to lack a periplasmic nitrate reductase.; GO: 0005506 iron ion binding, 0006118 electron transport. Probab=35.07 E-value=25 Score=13.80 Aligned_cols=30 Identities=17% Similarity=0.329 Sum_probs=19.2 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHEEEEEE Q ss_conf 861689999999999999998764067125 Q gi|254780386|r 13 HPASLKRMTLLSIIGIIGLITWSILLPIEI 42 (440) Q Consensus 13 ~p~~~~~~~~~~~~~~~~~~~w~~~~~~~~ 42 (440) +-||+=++.++++.++.|+++|-++.|+.. T Consensus 116 ~~RY~~L~~~LllS~~~G~l~wE~~nPVg~ 145 (263) T TIGR02163 116 NLRYWVLVLVLLLSFLSGLLIWEWVNPVGL 145 (263) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHCHHHH T ss_conf 404999999999999970677767442778 No 310 >TIGR03077 not_gcvH glycine cleavage protein H homolog, Chlamydial. The H protein (GcvH) of the glycine cleavage system shuttles the methylamine group of glycine from the P protein to the T protein. Most Chlamydia but lack the P and T proteins, and have a single homolog of GcvH that appears deeply split from canonical GcvH in molecular phylogenetic trees. The protein family modeled here is observed the Chlamydial GcvH homolog, so far always seen as part of a two-gene operon, downstream of a member of the uncharacterized protein family TIGR03076. The function of this protein is unknown. Probab=34.64 E-value=26 Score=13.75 Aligned_cols=36 Identities=6% Similarity=0.136 Sum_probs=15.5 Q ss_pred CEEEEEEEEEECCCCCCCCC---CCCCEEEEEEEEECHH Q ss_conf 43999999971334446777---6551189999993501 Q gi|254780386|r 361 KKFKAIIEKIDPIISQQDNN---LQTQEYYEVILKITDL 396 (440) Q Consensus 361 ~~~~G~V~~Is~~~~~~~~~---~~~~~~y~v~i~~~~~ 396 (440) -++.|+|..|-.....++.- ...+.-..+++++++. T Consensus 64 sPvsG~VvevN~~l~~~P~liN~dpy~~gWl~~vk~~d~ 102 (110) T TIGR03077 64 SPVSGEVIEVNIALEDDTQPINHSPESEGWFVVVQLDED 102 (110) T ss_pred CCCCEEEEEEHHHHHHCHHHHCCCCCCCCEEEEEEECCH T ss_conf 787669998668454498987289777965999999986 No 311 >COG4325 Predicted membrane protein [Function unknown] Probab=34.50 E-value=21 Score=14.37 Aligned_cols=35 Identities=23% Similarity=0.333 Sum_probs=18.5 Q ss_pred CCEEEEECCCCCEEEE------------------EECCCCCEEECCCEEEEEC Q ss_conf 8738984688868999------------------9857898933697899996 Q gi|254780386|r 54 DNVVEIKSPFSGIIKK------------------FQVENGSQILQGTPLLTFE 88 (440) Q Consensus 54 ~~~~~I~s~~~G~V~~------------------i~VkeGd~VkkGq~L~~ld 88 (440) ...+.|.++.+|.+.. +++..||.|..|++|..+- T Consensus 227 ~~a~~Ira~~sGyLq~ID~~~L~k~A~a~~~~Ihl~~r~Gdfvv~g~~L~~~~ 279 (464) T COG4325 227 ADAVPIRAPHSGYLQTIDVDDLAKKAAASRYTIHLVTRVGDFVVAGGLLGWCW 279 (464) T ss_pred CCCCEEECCCCCCEEEEEHHHHHHHHHHHCEEEEEEECCCCEECCCCCEEEEE T ss_conf 65643233886606896378999999863128999953755353897189995 No 312 >pfam03264 Cytochrom_NNT NapC/NirT cytochrome c family, N-terminal region. Within the NapC/NirT family of cytochrome c proteins, some members, such as NapC and NirT, bind four haem groups, while others, such as TorC, bind five haems. This family aligns the common N-terminal region that contains four haem-binding C-X(2)-CH motifs. Probab=34.45 E-value=26 Score=13.73 Aligned_cols=21 Identities=24% Similarity=0.393 Sum_probs=9.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHE Q ss_conf 899999999999999987640 Q gi|254780386|r 17 LKRMTLLSIIGIIGLITWSIL 37 (440) Q Consensus 17 ~~~~~~~~~~~~~~~~~w~~~ 37 (440) +.+++++++.+++|+++|..+ T Consensus 6 ~~~~~~~~~g~~~G~~~~~~~ 26 (174) T pfam03264 6 LSLGVLLLVGFLLGVVFWGGF 26 (174) T ss_pred HHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999999999999999 No 313 >PRK03140 phosphatidylserine decarboxylase; Provisional Probab=34.29 E-value=26 Score=13.71 Aligned_cols=21 Identities=29% Similarity=0.355 Sum_probs=12.7 Q ss_pred CCCCCCEEEEECCEEECCCCCC Q ss_conf 4555411687312120122212 Q gi|254780386|r 291 TILSPIAGTIVYNQSFSSSNYA 312 (440) Q Consensus 291 ~I~AP~~G~V~~l~~~~~G~~v 312 (440) .+-+|.||+|...... +|..- T Consensus 140 R~H~P~~g~v~~~~~i-~G~l~ 160 (259) T PRK03140 140 RIHSPISGTVTEQFVL-GRKSY 160 (259) T ss_pred CEECCCCEEEEEEEEE-CCCCC T ss_conf 1535676199999997-99467 No 314 >CHL00078 rpl5 ribosomal protein L5 Probab=34.00 E-value=26 Score=13.68 Aligned_cols=21 Identities=14% Similarity=0.199 Sum_probs=11.4 Q ss_pred CCEECCCCEEEEEEEECEEEHHH Q ss_conf 85152377569999922054799 Q gi|254780386|r 402 NIELRNGFPAEVLFTAKNTTLAQ 424 (440) Q Consensus 402 ~~~L~~Gm~~~~~I~~~~~~~~~ 424 (440) .+.-.+||.++ |+|--++--+ T Consensus 145 ~~~~~~G~~It--ivTta~td~e 165 (180) T CHL00078 145 KIDQIRGMDIS--IVTTAKTDEE 165 (180) T ss_pred CCCCCCCCEEE--EEECCCCHHH T ss_conf 45787887589--9917799899 No 315 >PRK13168 rumA 23S rRNA 5-methyluridine methyltransferase; Reviewed Probab=33.89 E-value=26 Score=13.67 Aligned_cols=43 Identities=19% Similarity=0.213 Sum_probs=20.8 Q ss_pred EEEEEEEECCCEEEEECCCCCEEEEEEC-CCCCEEECCCEEEEE Q ss_conf 9979999538738984688868999985-789893369789999 Q gi|254780386|r 45 SSSGEILNEDNVVEIKSPFSGIIKKFQV-ENGSQILQGTPLLTF 87 (440) Q Consensus 45 ~~~G~i~~~~~~~~I~s~~~G~V~~i~V-keGd~VkkGq~L~~l 87 (440) .|.|.....+.+.-|....+|-+.++.+ +.....-.|.++-.+ T Consensus 27 ~G~Gv~r~~g~vvFV~galPGE~V~v~I~~~kk~~~~a~v~eil 70 (440) T PRK13168 27 DGRGVARHNGKTVFIEGALPGETVEVQVTERKKQYARAKVVRIL 70 (440) T ss_pred CCEEEEEECCEEEEECCCCCCCEEEEEEEEEECCEEEEEEEEEE T ss_conf 96499998999999899999989999999962887999999992 No 316 >TIGR02231 TIGR02231 conserved hypothetical protein; InterPro: IPR011935 This family consists of proteins over 500 amino acids long in Caenorhabditis elegans and several bacteria (Pseudomonas aeruginosa, Nostoc sp. PCC 7120, Leptospira interrogans, etc.). The function is unknown.. Probab=33.65 E-value=5.5 Score=18.22 Aligned_cols=34 Identities=12% Similarity=0.209 Sum_probs=23.7 Q ss_pred CCCCCCCCCEE-EEEEECHHHHHHCCCCCCEEEEE Q ss_conf 21033556504-77532622311015898317999 Q gi|254780386|r 319 MKIVPHSKLTY-IRAKVTPKQIQHVKKGYTATVRF 352 (440) Q Consensus 319 ~~Ivp~~~~l~-v~~~v~~~di~~i~~Gq~v~i~~ 352 (440) +.|.|-...+| =.+||-...+..|.+|+.+++.| T Consensus 395 ~pLLPG~~~~fr~~~FVg~~~l~~v~pGe~f~l~l 429 (558) T TIGR02231 395 FPLLPGKVNIFRDNGFVGRSHLENVAPGERFELSL 429 (558) T ss_pred CEECCCCEEEEECCEEEEECCCCCCCCCCEEEEEC T ss_conf 30068632333376358800002479997279831 No 317 >PRK05529 cell division protein FtsQ; Provisional Probab=33.58 E-value=27 Score=13.64 Aligned_cols=20 Identities=15% Similarity=0.144 Sum_probs=14.0 Q ss_pred EEECCCEEEEECHHHHHHHH Q ss_conf 93369789999648998889 Q gi|254780386|r 77 QILQGTPLLTFEDIETTDLI 96 (440) Q Consensus 77 ~VkkGq~L~~ld~~~~~~~l 96 (440) .+..|++|+.+|.......+ T Consensus 84 ~~~~G~pL~~vd~~~v~~~l 103 (255) T PRK05529 84 RDQFGKPLPLVDPETVRKKL 103 (255) T ss_pred CCCCCCEEEEECHHHHHHHH T ss_conf 58789851388699999987 No 318 >PRK03992 proteasome-activating nucleotidase; Provisional Probab=33.33 E-value=5.1 Score=18.44 Aligned_cols=14 Identities=29% Similarity=0.601 Sum_probs=6.7 Q ss_pred CCCCCCCEEEEECCE Q ss_conf 345554116873121 Q gi|254780386|r 290 KTILSPIAGTIVYNQ 304 (440) Q Consensus 290 ~~I~AP~~G~V~~l~ 304 (440) .-|..| .|+...-. T Consensus 161 ~Gi~pP-kGvLLyGP 174 (390) T PRK03992 161 VGIEPP-KGVLLYGP 174 (390) T ss_pred CCCCCC-CEEEEECC T ss_conf 699999-72786898 No 319 >TIGR01043 ATP_syn_A_arch ATP synthase archaeal, A subunit; InterPro: IPR005726 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (F-, V- and A-ATPases contain rotary motors) and in the type of ions they transport , . F-ATPases (F1F0-ATPases) in mitochondria, chloroplasts and bacterial plasma membranes are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases) are primarily found in eukaryotic vacuoles, catalysing ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases) are found in Archaea and function like F-ATPases. P-ATPases (E1E2-ATPases) are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. A-ATPases (or A1A0-ATPase) (3.6.3.14 from EC) are found exclusively in Archaea and display a close resemblance in structure and subunit composition with V-ATPases, although their function in both ATP synthesis and ATP hydrolysis is closer to that of F-ATPases . A-ATPases are composed of two linked complexes: the A1 complex consisting of seven subunits contains the catalytic core that synthesizes/hydrolyses ATP, while the A0 complex consisting of at least two subunits forms the membrane-spanning pore . The rotary motor in A-ATPases is composed of only two subunits, the stator subunit I and the rotor subunit C . A-ATPases may have arisen as an adaptation to the different cellular needs and the more extreme environmental conditions faced by Archaeal species. This entry represents the alpha subunit from the A1 complex of A-ATPases. The A1 complex contains three copies each of subunits alpha (or A) and beta (or B) (IPR005724 from INTERPRO), both of which form the headpiece that participates in nucleotide binding. However, only the alpha subunit is catalytic, the beta subunit being regulatory in function . More information about this protein can be found at Protein of the Month: ATP Synthases .; GO: 0016820 hydrolase activity acting on acid anhydrides catalyzing transmembrane movement of substances, 0015986 ATP synthesis coupled proton transport, 0016021 integral to membrane, 0016469 proton-transporting two-sector ATPase complex. Probab=33.18 E-value=26 Score=13.68 Aligned_cols=29 Identities=14% Similarity=0.146 Sum_probs=11.7 Q ss_pred CCCCCCCCCCCCCCCCCCCCCEEEEEEECH Q ss_conf 122212444554210335565047753262 Q gi|254780386|r 307 SSSNYAQQSQPLMKIVPHSKLTYIRAKVTP 336 (440) Q Consensus 307 ~~G~~v~~G~~l~~Ivp~~~~l~v~~~v~~ 336 (440) ..|+.|..|+-|.. |+.+..+.-.+.||+ T Consensus 124 ~~Gd~V~~Gd~~G~-V~ET~~i~h~ilvPp 152 (584) T TIGR01043 124 KEGDKVEGGDIIGV-VQETSLIEHKILVPP 152 (584) T ss_pred CCCCEECCCCEEEE-ECCCCCEEEEEEECC T ss_conf 26864347615877-637650334777357 No 320 >KOG4403 consensus Probab=32.62 E-value=28 Score=13.54 Aligned_cols=30 Identities=17% Similarity=0.020 Sum_probs=17.3 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHHHE Q ss_conf 633568616899999999999999987640 Q gi|254780386|r 8 NLKRNHPASLKRMTLLSIIGIIGLITWSIL 37 (440) Q Consensus 8 ~~~~~~p~~~~~~~~~~~~~~~~~~~w~~~ 37 (440) |+.++...|.+=+++.+++.+...++|.++ T Consensus 205 pp~~~~~n~~KD~iLv~lili~v~gcw~ay 234 (575) T KOG4403 205 PPYKTNHNWTKDFILVVLILIGVGGCWFAY 234 (575) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 875788631356899999999860555454 No 321 >PRK06676 rpsA 30S ribosomal protein S1; Reviewed Probab=32.28 E-value=28 Score=13.50 Aligned_cols=62 Identities=16% Similarity=0.117 Sum_probs=29.0 Q ss_pred CCCCCCCCEEEEECCEEECCCCCCCCCCCCCCCCCCCCCEEEEEEECHHHHHHCCCCCCEEEEEEECC Q ss_conf 23455541168731212012221244455421033556504775326223110158983179998128 Q gi|254780386|r 289 QKTILSPIAGTIVYNQSFSSSNYAQQSQPLMKIVPHSKLTYIRAKVTPKQIQHVKKGYTATVRFPHYA 356 (440) Q Consensus 289 ~~~I~AP~~G~V~~l~~~~~G~~v~~G~~l~~Ivp~~~~l~v~~~v~~~di~~i~~Gq~v~i~~~a~~ 356 (440) ...+-.-++|+|..+. .-|.+|.-|. +--+++-.+--|-. + ..=-..+++||.+.+++-.++ T Consensus 189 ~l~~Gdvv~G~V~~i~--~~G~FV~lgg-v~glvh~sels~~~--~-~~~~~~~k~G~~v~~~Vl~id 250 (390) T PRK06676 189 SLKEGDVVEGKVARLT--DFGAFVDIGG-VDGLVHISELSHER--V-NKPSEVVSVGQEVEVKVLSID 250 (390) T ss_pred HCCCCCEEEEEEEEEE--CCCEEEECCC-EEEEEEECCCCHHH--H-CCHHHHHCCCCEEEEEEEEEE T ss_conf 3468984789999994--5636998077-67999961057333--3-598883047975799999994 No 322 >PRK08515 flgA flagellar basal body P-ring biosynthesis protein FlgA; Reviewed Probab=31.99 E-value=28 Score=13.47 Aligned_cols=53 Identities=9% Similarity=0.055 Sum_probs=19.3 Q ss_pred CCCCCCCCCCCCCCCCCCCEEEEEEECHHHHHHCCCCCCEEEEEEECCCCCCCEEEEEEEE Q ss_conf 2212444554210335565047753262231101589831799981288754439999999 Q gi|254780386|r 309 SNYAQQSQPLMKIVPHSKLTYIRAKVTPKQIQHVKKGYTATVRFPHYADIREKKFKAIIEK 369 (440) Q Consensus 309 G~~v~~G~~l~~Ivp~~~~l~v~~~v~~~di~~i~~Gq~v~i~~~a~~~~~~~~~~G~V~~ 369 (440) -.+|..|+.+.-++ .+..+.|.+.--. ...=..|+.++|+= . ....+.|+|+. T Consensus 170 p~lVkkGd~V~iv~-~~g~i~v~~~g~A--Le~G~~Gd~IrVkN---~--S~Kvv~a~V~g 222 (229) T PRK08515 170 LILVRKGDIINGVL-SEGGVSIEISLKA--LEDGNLGDIIQAKN---K--SNKILKAKVLG 222 (229) T ss_pred CCEECCCCEEEEEE-ECCCEEEEEEEEE--HHCCCCCCEEEEEC---C--CCCEEEEEEEE T ss_conf 62773799999999-3698999999983--30877798899994---8--89999999995 No 323 >PRK12736 elongation factor Tu; Reviewed Probab=31.93 E-value=28 Score=13.46 Aligned_cols=57 Identities=14% Similarity=0.123 Sum_probs=31.3 Q ss_pred CCEEEEECCEEECCCCCCCCCCCCCCCCCCCCCEE--EE-EEECHHHHHHCCCCCCEEEEEEE Q ss_conf 41168731212012221244455421033556504--77-53262231101589831799981 Q gi|254780386|r 295 PIAGTIVYNQSFSSSNYAQQSQPLMKIVPHSKLTY--IR-AKVTPKQIQHVKKGYTATVRFPH 354 (440) Q Consensus 295 P~~G~V~~l~~~~~G~~v~~G~~l~~Ivp~~~~l~--v~-~~v~~~di~~i~~Gq~v~i~~~a 354 (440) +-.|+|..-.+ ..+.+..|+.+. |+|..+... |. ..+..........|+.|.+.+.. T Consensus 222 ~G~GtVvtG~V--~sG~i~~Gd~v~-i~~~~~~~~~~VrsI~~~~~~v~~a~aG~~v~l~L~g 281 (394) T PRK12736 222 TGRGTVVTGRV--ERGTVKVGDEVE-IVGIKETQKTTVTGVEMFRKLLDEGQAGDNVGVLLRG 281 (394) T ss_pred CCCEEEEEEEE--EECEEECCCEEE-EEECCCCCEEEEEEEEECCCCCCCCCCCCEEEEEECC T ss_conf 89758999998--014686599899-9707998079999873678372480476789999768 No 324 >PRK13415 flagella biosynthesis protein FliZ; Provisional Probab=31.88 E-value=28 Score=13.46 Aligned_cols=34 Identities=18% Similarity=0.175 Sum_probs=13.9 Q ss_pred CCCEEEEEC-CCCCEEEEEECCCCCEEECCCEEEEECHHH Q ss_conf 387389846-888689999857898933697899996489 Q gi|254780386|r 53 EDNVVEIKS-PFSGIIKKFQVENGSQILQGTPLLTFEDIE 91 (440) Q Consensus 53 ~~~~~~I~s-~~~G~V~~i~VkeGd~VkkGq~L~~ld~~~ 91 (440) .|++..||- .+|++| +.|-.||.| |+|-++||.+ T Consensus 115 lG~nrSvQlvkvG~~l--~vvGVGe~i---qLLkeI~de~ 149 (216) T PRK13415 115 VGQNRSVQLIKVGNRV--LVVGVGESI---QLLKEIEDEK 149 (216) T ss_pred CCCCCEEEEEEECCEE--EEEECCCHH---HHHHHCCCHH T ss_conf 5788657899976889--999547257---8998619999 No 325 >COG0557 VacB Exoribonuclease R [Transcription] Probab=31.74 E-value=25 Score=13.80 Aligned_cols=23 Identities=22% Similarity=0.160 Sum_probs=11.0 Q ss_pred CEEEEECCEEECCCCCCCCCCCC Q ss_conf 11687312120122212444554 Q gi|254780386|r 296 IAGTIVYNQSFSSSNYAQQSQPL 318 (440) Q Consensus 296 ~~G~V~~l~~~~~G~~v~~G~~l 318 (440) -.|.+..+.+..+.-+|++|++| T Consensus 278 ~g~~~LgVhIADVs~yV~~~s~L 300 (706) T COG0557 278 NGGYRLGVHIADVSHYVQPGSAL 300 (706) T ss_pred CCEEEEEEEEECHHHHCCCCCHH T ss_conf 99089999994445415998746 No 326 >pfam09671 Spore_GerQ Spore coat protein (Spore_GerQ). Members of this protein family are the spore coat protein GerQ of endospore-forming Firmicutes (low GC Gram-positive bacteria). This protein is cross-linked by a spore coat-associated transglutaminase. Probab=31.43 E-value=29 Score=13.41 Aligned_cols=63 Identities=17% Similarity=0.269 Sum_probs=45.5 Q ss_pred HHHHHCCCCCCEEEEEEECCCCC--CCEEEEEEEEEECCCCCCCCCCCCCEEEEEEEEECHHHCC Q ss_conf 23110158983179998128875--4439999999713344467776551189999993501125 Q gi|254780386|r 337 KQIQHVKKGYTATVRFPHYADIR--EKKFKAIIEKIDPIISQQDNNLQTQEYYEVILKITDLAFS 399 (440) Q Consensus 337 ~di~~i~~Gq~v~i~~~a~~~~~--~~~~~G~V~~Is~~~~~~~~~~~~~~~y~v~i~~~~~~~~ 399 (440) ..|-++..|..+++.++=-+..+ ...|.|.|..-+++-.--....++..|..-.|.++-.++. T Consensus 11 eNILRlN~GK~~T~y~Tfenn~eWnakvf~G~iE~AGRDhiiisdp~tg~RyLl~mvyldyvtFd 75 (81) T pfam09671 11 ENILRLNRGKQATVYMTFENNSEWGAQSFRGIIEAAGRDHIVISEPQSGKRYLLLMIYLDYVTFD 75 (81) T ss_pred HHHHHHCCCCEEEEEEEECCCHHHHHHHEEEEHHHCCCCEEEECCCCCCCEEEEEEEEEEEEECC T ss_conf 98997627854899999758667623310334555276449972798785887668656547625 No 327 >PRK09731 putative general secretion pathway protein YghD; Provisional Probab=31.27 E-value=27 Score=13.57 Aligned_cols=29 Identities=10% Similarity=-0.038 Sum_probs=15.8 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHEEEE Q ss_conf 68616899999999999999987640671 Q gi|254780386|r 12 NHPASLKRMTLLSIIGIIGLITWSILLPI 40 (440) Q Consensus 12 ~~p~~~~~~~~~~~~~~~~~~~w~~~~~~ 40 (440) -+||=-+++.....+.+++++.|..+.|+ T Consensus 32 rspREqrLv~~~gavLll~i~YwgiWqP~ 60 (178) T PRK09731 32 RSPREKGMLLAAVVFLFSVGYYVLIWQPL 60 (178) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 98899999998799999999999988369 No 328 >PRK10920 putative uroporphyrinogen III C-methyltransferase; Provisional Probab=30.46 E-value=30 Score=13.30 Aligned_cols=22 Identities=5% Similarity=-0.198 Sum_probs=8.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHE Q ss_conf 6899999999999999987640 Q gi|254780386|r 16 SLKRMTLLSIIGIIGLITWSIL 37 (440) Q Consensus 16 ~~~~~~~~~~~~~~~~~~w~~~ 37 (440) ...++..++|++.+++++|.|| T Consensus 35 ~g~~l~~lai~l~la~g~g~y~ 56 (389) T PRK10920 35 TALILSAVAIAIALAAGAGLYG 56 (389) T ss_pred CHHHHHHHHHHHHHHHHHHHHH T ss_conf 4199999999999999999999 No 329 >pfam02699 YajC Preprotein translocase subunit. See. Probab=30.34 E-value=30 Score=13.29 Aligned_cols=25 Identities=24% Similarity=0.295 Sum_probs=13.3 Q ss_pred HHHHCCCCCCEEEEEEECCCCCCCEEEEEEEEEE Q ss_conf 3110158983179998128875443999999971 Q gi|254780386|r 338 QIQHVKKGYTATVRFPHYADIREKKFKAIIEKID 371 (440) Q Consensus 338 di~~i~~Gq~v~i~~~a~~~~~~~~~~G~V~~Is 371 (440) -++.+++|++|.- .|.+-|+|.+|+ T Consensus 34 m~~~L~~Gd~VvT---------~gGi~G~V~~i~ 58 (83) T pfam02699 34 MLSSLKKGDEVVT---------IGGIHGKIVKVD 58 (83) T ss_pred HHHHCCCCCEEEE---------CCCEEEEEEEEE T ss_conf 9985789999998---------997199999997 No 330 >TIGR03545 conserved hypothetical protein TIGR03545. This model represents a relatively rare but broadly distributed uncharacterized protein family, distributed in 1-2 percent of bacterial genomes, all of which have outer membranes. In many of these genomes, it is part of a two-gene pair. Probab=30.19 E-value=30 Score=13.29 Aligned_cols=20 Identities=10% Similarity=0.195 Sum_probs=9.8 Q ss_pred HHHHHHHHHHHHHHHHHHHH Q ss_conf 16899999999999999987 Q gi|254780386|r 15 ASLKRMTLLSIIGIIGLITW 34 (440) Q Consensus 15 ~~~~~~~~~~~~~~~~~~~w 34 (440) ||..++.++++++++++++| T Consensus 3 RW~Gli~f~~v~a~I~~~~~ 22 (554) T TIGR03545 3 RWPGLIAFLIVVAVILALLY 22 (554) T ss_pred EEHHHHHHHHHHHHHHHHHH T ss_conf 12338999999999999999 No 331 >PRK12735 elongation factor Tu; Reviewed Probab=30.07 E-value=30 Score=13.26 Aligned_cols=56 Identities=16% Similarity=0.190 Sum_probs=31.2 Q ss_pred EEEEECCEEECCCCCCCCCCCCCCCC--CCCCCEEEE-EEECHHHHHHCCCCCCEEEEEEEC Q ss_conf 16873121201222124445542103--355650477-532622311015898317999812 Q gi|254780386|r 297 AGTIVYNQSFSSSNYAQQSQPLMKIV--PHSKLTYIR-AKVTPKQIQHVKKGYTATVRFPHY 355 (440) Q Consensus 297 ~G~V~~l~~~~~G~~v~~G~~l~~Iv--p~~~~l~v~-~~v~~~di~~i~~Gq~v~i~~~a~ 355 (440) .|+|..-.+ ..+.+..|+.+. |+ |+.....|. ..+..........|+.|.+.+... T Consensus 226 ~GtVVtGtV--~sG~i~~Gd~v~-i~~~~~~~~~~V~sIq~~~~~v~~a~aG~~v~l~L~gi 284 (396) T PRK12735 226 RGTVVTGRV--ERGIVKVGDEVE-IVGIKDTQKTTVTGVEMFRKLLDEGQAGDNVGVLLRGT 284 (396) T ss_pred CCEEEEEEE--EEEEECCCCEEE-EEECCCCCEEEEEEEEECCCCCCEECCCCEEEEEECCC T ss_conf 715999898--121562799899-97269984699999998670802714887899994799 No 332 >TIGR00996 Mtu_fam_mce virulence factor Mce family protein; InterPro: IPR005693 Mycobacterial species are usually slender, curved rods with a unique cell wall of complex waxes and glycolipids. They are resistant to acids, alkalis and dehydration, and are very slow to grow in vitro. The human pathogenic Mycobacteria (Mycobacterium tuberculosis and Mycobacterium leprae) are becoming resistant to conventional treatments and, together with HIV-related diseases, are fast posing a global health threat. An essential requirement, particularly of M. tuberculosis, is to gain entrance to, and to resist, the hostile intra-cellular environment of epithelial cells . The genome of M. tuberculosis contains four mammalian cell entry (mce) operons , which are widely distributed in both pathogenic and non-pathogenic mycobacteria suggesting that the presence of these putative virulence genes is not an indicator for the pathogenicity of the bacilli. At the 5' end of the transcriptional unit are two genes that have evolved from a tandem duplication, and whose products resemble YrbE, a conserved hypothetical protein found in Escherichia. coli, Haemophilus influenzae and Porphyra purpurea. All of the YrbE proteins, including the eight from M. tuberculosis, are probable integral membrane proteins with six TM alpha helices. The next six genes in each operon, the mce genes, are related, their products ranging in size from 275 to 564 amino acid residues. The corresponding protein sequences contain a number of highly conserved motifs that define a 24- member family with a common organization. Twenty of these proteins have a strongly hydrophobic segment at the NH2-terminal end that could span the lipid bilayer whereas the remaining four, all of which correspond to the seventh gene in their respective operons, mce1E to mce4E, are probably lipoprotein precursors. In all 24 cases the COOH-terminal domain of the mce proteins is predicted to be exposed on the external face of the cytoplasmic membrane . The ability to gain entry and resist the antimicrobial intracellular environment of mammalian cells is an essential virulence property of M. tuberculosis. This property is conferred by Mce1A, the third gene of operon 1, which when expressed in Escherichia coli conferred the ability to invade HeLa cells. The recombinant protein when used to coat latex spheres also promoted their uptake into HeLa cells. N-terminus deletion constructs of Mce1A identified a domain located between amino acid positions 106 and 163 that was needed for this cell uptake activity. Mce1A contains hydrophobic stretches at the N-terminus predictive of a signal sequence, and colloidal gold immunoelectron microscopy indicated that the corresponding native protein is expressed on the surface of M. tuberculosis. Recombinant Mce2A, which had the highest level of identity (67%) to Mce1A, was unable to promote the association of microspheres with HeLa cells and an mce-deletion mutant in Mycobacterium bovis greatly impaired the ability of the microbe to infect epithelial cells in vitro. Although the exact function of Mce1A is still unknown, it appears to serve as an effector molecule expressed on the surface of M. tuberculosis that is capable of eliciting plasma membrane perturbations in non-phagocytic mammalian cells . The distribution of the mce operons in both pathogenic and non-pathogenic mycobacteria suggests that the presence of these putative virulence genes is not an indicator for the pathogenicity of the bacilli - it may be that pathogenicity is determined by their expression , . The members of this family represent all 24 genes associated with the four mammalian cell entry operons of M. tuberculosis and their homologs in other Actinomycetales.; GO: 0009405 pathogenesis. Probab=29.92 E-value=31 Score=13.24 Aligned_cols=11 Identities=18% Similarity=0.591 Sum_probs=5.8 Q ss_pred CEEEEEE-CCCC Q ss_conf 6899998-5789 Q gi|254780386|r 65 GIIKKFQ-VENG 75 (440) Q Consensus 65 G~V~~i~-VkeG 75 (440) |.|..|- |..| T Consensus 60 G~V~~i~~i~~~ 71 (304) T TIGR00996 60 GKVTAIEEIPRG 71 (304) T ss_pred EEEEEEEECCCC T ss_conf 056432122788 No 333 >COG5349 Uncharacterized protein conserved in bacteria [Function unknown] Probab=29.90 E-value=31 Score=13.24 Aligned_cols=30 Identities=17% Similarity=0.348 Sum_probs=21.6 Q ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 334576335686168999999999999999 Q gi|254780386|r 3 HFMWKNLKRNHPASLKRMTLLSIIGIIGLI 32 (440) Q Consensus 3 ~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~ 32 (440) -|||-+..-+.|.|.+..+|+.+.++.++. T Consensus 74 ~~l~~e~~~~~p~W~hl~vw~P~~~~~sL~ 103 (126) T COG5349 74 LALWVEVTLNPPLWVHLLVWLPLTLALSLL 103 (126) T ss_pred HHHHHHHCCCCCCHHHHHHHHHHHHHHHHH T ss_conf 888642201797300358888899999998 No 334 >PRK13183 psbN photosystem II reaction center protein N; Provisional Probab=29.78 E-value=31 Score=13.23 Aligned_cols=25 Identities=16% Similarity=0.284 Sum_probs=17.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHEEEEE Q ss_conf 8999999999999999876406712 Q gi|254780386|r 17 LKRMTLLSIIGIIGLITWSILLPIE 41 (440) Q Consensus 17 ~~~~~~~~~~~~~~~~~w~~~~~~~ 41 (440) +.+.+..+++++.|+.+|.+|+|-. T Consensus 10 ~~I~i~~ll~~iTgy~iYtaFGppS 34 (46) T PRK13183 10 LAITILAVLLALTGFGIYTAFGPPS 34 (46) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCH T ss_conf 9999999999876666444228853 No 335 >COG4682 Predicted membrane protein [Function unknown] Probab=29.40 E-value=31 Score=13.19 Aligned_cols=36 Identities=14% Similarity=0.121 Sum_probs=21.3 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHH--HHHHHEEEE Q ss_conf 457633568616899999999999999--987640671 Q gi|254780386|r 5 MWKNLKRNHPASLKRMTLLSIIGIIGL--ITWSILLPI 40 (440) Q Consensus 5 ~~~~~~~~~p~~~~~~~~~~~~~~~~~--~~w~~~~~~ 40 (440) |.|...++++++.-.....++++++.+ ++|-+..+. T Consensus 1 m~~~~~~~s~af~~va~~~lv~G~vvyLiGLWna~~qL 38 (128) T COG4682 1 MDPKMEIVSKAFTIVAWLALVVGAVVYLVGLWNACMQL 38 (128) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHH T ss_conf 98410111799999999999999999998621143433 No 336 >pfam00575 S1 S1 RNA binding domain. The S1 domain occurs in a wide range of RNA associated proteins. It is structurally similar to cold shock protein which binds nucleic acids. The S1 domain has an OB-fold structure. Probab=29.35 E-value=31 Score=13.18 Aligned_cols=47 Identities=13% Similarity=0.195 Sum_probs=22.4 Q ss_pred CEEEEECCEEECCCCCCCCCCCCCCCCCCCCCEEEEEEECHHHHH---------HCCCCCCEEEEEEECC Q ss_conf 116873121201222124445542103355650477532622311---------0158983179998128 Q gi|254780386|r 296 IAGTIVYNQSFSSSNYAQQSQPLMKIVPHSKLTYIRAKVTPKQIQ---------HVKKGYTATVRFPHYA 356 (440) Q Consensus 296 ~~G~V~~l~~~~~G~~v~~G~~l~~Ivp~~~~l~v~~~v~~~di~---------~i~~Gq~v~i~~~a~~ 356 (440) +.|+|..+. .-|.+|.-|..+- ++||..+++ .+++||.+.+++...+ T Consensus 8 v~G~V~~i~--~~G~fV~l~~~~~------------g~v~~s~ls~~~~~~~~~~~~~G~~v~~~V~~vd 63 (74) T pfam00575 8 VKGTVTRVT--KGGAFVDLGNGVE------------GFIPISEISDDRVEDPDEVLKVGDEVEVKVLKVD 63 (74) T ss_pred EEEEEEEEE--CCEEEEEECCCCE------------EEEEHHHCCCCCCCCHHHHHCCCCEEEEEEEEEE T ss_conf 999999998--7759999789979------------9998446136555677683059999999999999 No 337 >TIGR03021 pilP_fam type IV pilus biogenesis protein PilP. Members of this protein family are found in type IV pilus biogenesis loci and include proteins designated PilP. Probab=28.83 E-value=19 Score=14.65 Aligned_cols=29 Identities=10% Similarity=0.071 Sum_probs=13.5 Q ss_pred CCCCCCCCCCEEEEEEECHHHHHHCCCCC Q ss_conf 42103355650477532622311015898 Q gi|254780386|r 318 LMKIVPHSKLTYIRAKVTPKQIQHVKKGY 346 (440) Q Consensus 318 l~~Ivp~~~~l~v~~~v~~~di~~i~~Gq 346 (440) |.+|.-.+..|.....+|.-..-.++.|+ T Consensus 71 v~eI~G~~~~l~A~l~l~~G~~~dv~~Gd 99 (119) T TIGR03021 71 VVEIFGRGGRLTATLRLPGGREVDVQVGD 99 (119) T ss_pred EEEEEEECCEEEEEEEECCCCEEEEECCC T ss_conf 99997038808999992899589860477 No 338 >pfam11356 Pilus_PilP Type IV pilus biogenesis. Type IV pili are required for auto-agglutination, twitching motility, biofilm formation, adherence and DNA uptake during transformation. PilP is an inner membrane protein, required for pilus expression and transformation. PilP interacts with PilQ which suggests that the two proteins may have coordinated activity in functions such as pilus extrusion/retraction. Probab=28.79 E-value=19 Score=14.64 Aligned_cols=29 Identities=14% Similarity=0.066 Sum_probs=14.9 Q ss_pred CCCCCCCCCCEEEEEEECHHHHHHCCCCC Q ss_conf 42103355650477532622311015898 Q gi|254780386|r 318 LMKIVPHSKLTYIRAKVTPKQIQHVKKGY 346 (440) Q Consensus 318 l~~Ivp~~~~l~v~~~v~~~di~~i~~Gq 346 (440) +.+|.-.+..|.....+|.-..-.++.|+ T Consensus 60 v~eI~G~~~~l~A~l~l~~G~~~dv~~Gd 88 (108) T pfam11356 60 VVEIFGRGGGLTATLRLPGGREVDVKVGD 88 (108) T ss_pred EEEEEEECCCEEEEEEECCCCEEEEECCC T ss_conf 99998038818999991899489860387 No 339 >cd04461 S1_Rrp5_repeat_hs8_sc7 S1_Rrp5_repeat_hs8_sc7: Rrp5 Homo sapiens S1 repeat 8 (hs8) and Saccharomyces cerevisiae S1 repeat 7 (sc7)-like domains. Rrp5 is a trans-acting factor important for biogenesis of both the 40S and 60S eukaryotic ribosomal subunits. Rrp5 has two distinct regions, an N-terminal region containing tandemly repeated S1 RNA-binding domains (12 S1 repeats in S. cerevisiae Rrp5 and 14 S1 repeats in H. sapiens Rrp5) and a C-terminal region containing tetratricopeptide repeat (TPR) motifs thought to be involved in protein-protein interactions. Mutational studies have shown that each region represents a specific functional domain. Deletions within the S1-containing region inhibit pre-rRNA processing at either site A3 or A2, whereas deletions within the TPR region confer an inability to support cleavage of A0-A2. This CD includes H. sapiens S1 repeat 8 and S. cerevisiae S1 repeat 7. Rrp5 is found in eukaryotes but not in prokaryotes or archaea. Probab=28.67 E-value=32 Score=13.10 Aligned_cols=25 Identities=4% Similarity=-0.088 Sum_probs=13.1 Q ss_pred CCCCCCCCCCEEEEECCEEECCCCCCC Q ss_conf 212345554116873121201222124 Q gi|254780386|r 287 REQKTILSPIAGTIVYNQSFSSSNYAQ 313 (440) Q Consensus 287 l~~~~I~AP~~G~V~~l~~~~~G~~v~ 313 (440) ++...+=.-+.|+|..+. .-|-+|. T Consensus 9 ~~~lk~G~iv~G~V~~i~--~~G~FV~ 33 (83) T cd04461 9 FSDLKPGMVVHGYVRNIT--PYGVFVE 33 (83) T ss_pred HHHCCCCCEEEEEEEEEE--CCEEEEE T ss_conf 778799999999999992--7819999 No 340 >cd05685 S1_Tex S1_Tex: The C-terminal S1 domain of a transcription accessory factor called Tex, which has been characterized in Bordetella pertussis and Pseudomonas aeruginosa. The tex gene is essential in Bortella pertusis and is named for its role in toxin expression. Tex has two functional domains, an N-terminal domain homologous to the Escherichia coli maltose repression protein, which is a poorly defined transcriptional factor, and a C-terminal S1 RNA-binding domain. Tex is found in prokaryotes, eukaryotes, and archaea. Probab=28.48 E-value=32 Score=13.08 Aligned_cols=56 Identities=9% Similarity=0.037 Sum_probs=24.3 Q ss_pred CEEEEECCEEECCCCCCCCCCCCCCCCCCCCCEEEEEEECHHHHHHCCCCCCEEEEEEECC Q ss_conf 1168731212012221244455421033556504775326223110158983179998128 Q gi|254780386|r 296 IAGTIVYNQSFSSSNYAQQSQPLMKIVPHSKLTYIRAKVTPKQIQHVKKGYTATVRFPHYA 356 (440) Q Consensus 296 ~~G~V~~l~~~~~G~~v~~G~~l~~Ivp~~~~l~v~~~v~~~di~~i~~Gq~v~i~~~a~~ 356 (440) ++|+|..+. .-|.+|.-|..+--++|- +.+- .-++ ..--..+++||.|++++...+ T Consensus 4 v~G~V~~i~--~fG~FV~l~~~~~GLvhi-sels-~~~~-~~~~~~~~vGd~v~~kVi~id 59 (68) T cd05685 4 LEGVVTNVT--DFGAFVDIGVKQDGLIHI-SKMA-DRFV-SHPSDVVSVGDIVEVKVISID 59 (68) T ss_pred EEEEEEEEE--CEEEEEEECCCCEEEEEC-HHHC-CCCC-CCHHHCCCCCCEEEEEEEEEE T ss_conf 999999996--528999914997899981-8976-7876-798685799999999999998 No 341 >TIGR02794 tolA_full protein TolA; InterPro: IPR014161 TolA couples the inner membrane complex of itself with TolQ and TolR to the outer membrane complex of TolB and OprL (also called Pal). Most of the length of the protein consists of low-complexity sequence that may differ in both length and composition from one species to another, complicating efforts to discriminate TolA (the most divergent gene in the tol-pal system) from paralogs such as TonB. Selection of members of the seed alignment and criteria for setting scoring cut-offs are based largely on conserved operon structure. The Tol-Pal complex is required for maintaining outer membrane integrity, and is also involved in transport (uptake) of colicins and filamentous DNA, and implicated in pathogenesis. Transport is energized by the proton motive force. TolA is an inner membrane protein that interacts with periplasmic TolB and with outer membrane porins OmpC, PhoE and LamB.. Probab=28.44 E-value=32 Score=13.08 Aligned_cols=15 Identities=33% Similarity=0.379 Sum_probs=9.8 Q ss_pred HCCCCCCEEEEEEECC Q ss_conf 0158983179998128 Q gi|254780386|r 341 HVKKGYTATVRFPHYA 356 (440) Q Consensus 341 ~i~~Gq~v~i~~~a~~ 356 (440) .+. |..|+|++--.| T Consensus 377 s~~-g~~c~~~i~l~p 391 (452) T TIGR02794 377 SFR-GKTCTVRIRLAP 391 (452) T ss_pred HCC-CCEEEEEEEECC T ss_conf 017-988999888748 No 342 >TIGR01144 ATP_synt_b ATP synthase F0, B subunit; InterPro: IPR005864 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (F-, V- and A-ATPases contain rotary motors) and in the type of ions they transport , . F-ATPases (F1F0-ATPases) in mitochondria, chloroplasts and bacterial plasma membranes are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases) are primarily found in eukaryotic vacuoles, catalysing ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases) are found in Archaea and function like F-ATPases. P-ATPases (E1E2-ATPases) are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. F-ATPases (also known as F1F0-ATPase, or H(+)-transporting two-sector ATPase) (3.6.3.14 from EC) are composed of two linked complexes: the F1 ATPase complex is the catalytic core and is composed of 5 subunits (alpha, beta, gamma, delta, epsilon), while the F0 ATPase complex is the membrane-embedded proton channel that is composed of at least 3 subunits (A-C), nine in mitochondria (A-G, F6, F8). Both the F1 and F0 complexes are rotary motors that are coupled back-to-back. In the F1 complex, the central gamma subunit forms the rotor inside the cylinder made of the alpha(3)beta(3) subunits, while in the F0 complex, the ring-shaped C subunits forms the rotor. The two rotors rotate in opposite directions, but the F0 rotor is usually stronger, using the force from the proton gradient to push the F1 rotor in reverse in order to drive ATP synthesis . These ATPases can also work in reverse to hydrolyse ATP to create a proton gradient. This entry represents subunit B from the F0 complex in F-ATPases found in bacterial plasma membranes. The B subunits are part of the peripheral stalk that links the F1 and F0 complexes together, and which acts as a stator to prevent certain subunits from rotating with the central rotary element. The peripheral stalk differs in subunit composition between mitochondrial, chloroplast and bacterial F-ATPases. In bacterial F-ATPases, the peripheral stalk is composed of one copy of the delta subunit (homologous to OSCP in mitochondria), and two copies of subunit B . More information about this protein can be found at Protein of the Month: ATP Synthases .; GO: 0046933 hydrogen ion transporting ATP synthase activity rotational mechanism, 0046961 hydrogen ion transporting ATPase activity rotational mechanism, 0015986 ATP synthesis coupled proton transport, 0016469 proton-transporting two-sector ATPase complex. Probab=28.35 E-value=12 Score=16.03 Aligned_cols=10 Identities=10% Similarity=0.195 Sum_probs=3.3 Q ss_pred HHHHHHHHHH Q ss_conf 9987434332 Q gi|254780386|r 97 DLKKVSIKNL 106 (440) Q Consensus 97 ~~~~~~l~~l 106 (440) +..+..+..- T Consensus 25 ~~R~~~IA~~ 34 (147) T TIGR01144 25 ETRQKKIADD 34 (147) T ss_pred HHHHHHHHHH T ss_conf 9889985988 No 343 >COG1545 Predicted nucleic-acid-binding protein containing a Zn-ribbon [General function prediction only] Probab=27.88 E-value=33 Score=13.01 Aligned_cols=50 Identities=14% Similarity=0.200 Sum_probs=21.6 Q ss_pred EEEEEEEEEECCCCCCCCCCCCCEEEEEEEEECHHH-----CCC-CCCEECCCCEEEEEEE Q ss_conf 399999997133444677765511899999935011-----256-7851523775699999 Q gi|254780386|r 362 KFKAIIEKIDPIISQQDNNLQTQEYYEVILKITDLA-----FSD-NNIELRNGFPAEVLFT 416 (440) Q Consensus 362 ~~~G~V~~Is~~~~~~~~~~~~~~~y~v~i~~~~~~-----~~~-~~~~L~~Gm~~~~~I~ 416 (440) .++-+|.++++..... ..+|.-..|+++... +.+ ...++..||.|.+... T Consensus 66 V~t~Tv~~~~~~~~~~-----~~P~viaiV~l~~~~~i~~~i~~~dp~~v~iGm~V~~v~~ 121 (140) T COG1545 66 VETYTVVYVKPPGFSL-----EEPYVIAIVELEEGGRILGQLVDVDPDDVEIGMKVEAVFR 121 (140) T ss_pred EEEEEEEEECCCCCCC-----CCCEEEEEEEECCCCEEEEEEECCCCCCCCCCCEEEEEEE T ss_conf 9999999517888666-----8998999999278870699998068564507999999998 No 344 >pfam10011 DUF2254 Predicted membrane protein (DUF2254). Members of this family of bacterial proteins comprises various hypothetical and putative membrane proteins. Their exact function, has not, as yet, been defined. Probab=27.71 E-value=33 Score=12.99 Aligned_cols=33 Identities=15% Similarity=0.345 Sum_probs=23.6 Q ss_pred CCCCCCCEEEEECCEE-----------------ECCCCCCCCCCCCCCCC Q ss_conf 3455541168731212-----------------01222124445542103 Q gi|254780386|r 290 KTILSPIAGTIVYNQS-----------------FSSSNYAQQSQPLMKIV 322 (440) Q Consensus 290 ~~I~AP~~G~V~~l~~-----------------~~~G~~v~~G~~l~~Iv 322 (440) ..|+|+.+|+|..++. ..+|++|.+|+||+.+. T Consensus 195 ~~i~a~~sGYlq~Id~~~L~~~A~~~d~~i~l~~~pG~fV~~g~pl~~v~ 244 (369) T pfam10011 195 TPIYAPRSGYLQHIDYDALLELAEEHDGVIYLLVRPGDFVTPGTPLARVW 244 (369) T ss_pred EEEECCCCEEEEEECHHHHHHHHHHCCCEEEEEECCCCEECCCCEEEEEE T ss_conf 58866984489985599999999987978999827987368997699981 No 345 >pfam02468 PsbN Photosystem II reaction centre N protein (psbN). This is a family of small proteins encoded on the chloroplast genome. psbN is involved in photosystem II during photosynthesis, but its exact role is unknown. Probab=27.60 E-value=34 Score=12.98 Aligned_cols=26 Identities=15% Similarity=0.212 Sum_probs=18.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHEEEEE Q ss_conf 68999999999999999876406712 Q gi|254780386|r 16 SLKRMTLLSIIGIIGLITWSILLPIE 41 (440) Q Consensus 16 ~~~~~~~~~~~~~~~~~~w~~~~~~~ 41 (440) .+.+++..+++++.|+.+|.+|++-. T Consensus 6 ~~~I~i~~lli~~TgyaiYtaFGppS 31 (43) T pfam02468 6 LLSIFISALLLSITGYSIYTAFGPPS 31 (43) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCH T ss_conf 19999999999875231222048834 No 346 >PRK12317 elongation factor 1-alpha; Reviewed Probab=27.57 E-value=34 Score=12.98 Aligned_cols=119 Identities=11% Similarity=0.117 Sum_probs=60.2 Q ss_pred EEEECCEEECCCCCCCCCCCCCCCCCCCCCEEEE-EEECHHHHHHCCCCCCEEEEEEECCC----------------CCC Q ss_conf 6873121201222124445542103355650477-53262231101589831799981288----------------754 Q gi|254780386|r 298 GTIVYNQSFSSSNYAQQSQPLMKIVPHSKLTYIR-AKVTPKQIQHVKKGYTATVRFPHYAD----------------IRE 360 (440) Q Consensus 298 G~V~~l~~~~~G~~v~~G~~l~~Ivp~~~~l~v~-~~v~~~di~~i~~Gq~v~i~~~a~~~----------------~~~ 360 (440) |+|..-.+ .++.+..|+.+. +.|.+....|. ..+...++.....|+.|.+.+...+. ... T Consensus 241 Gtvv~G~v--~sG~i~~Gd~v~-i~Ps~~~~~VksI~~~~~~v~~a~aG~~v~l~L~~i~~~dI~rG~Vl~~~~~~~~~~ 317 (426) T PRK12317 241 GTVPVGRV--ETGVLKVGDKVV-FMPAGKVGEVKSIEMHHEELPQAEPGDNIGFNVRGVGKNDIKRGDVAGHTDNPPTVA 317 (426) T ss_pred EEEEEEEE--CCCCCCCCCEEE-EECCCCCEEEEEEEECCCEEEEEECCCEEEEEEECCCHHCCCCCCEEECCCCCCCEE T ss_conf 07998684--438543799999-967998658976886694788995898699998367442277864894799998430 Q ss_pred CEEEEEEEEEECCCCCCCCCCCCCE--EEEEEEEE-------CHHHC---CCCCCEECCCCEEEEEEEECE Q ss_conf 4399999997133444677765511--89999993-------50112---567851523775699999220 Q gi|254780386|r 361 KKFKAIIEKIDPIISQQDNNLQTQE--YYEVILKI-------TDLAF---SDNNIELRNGFPAEVLFTAKN 419 (440) Q Consensus 361 ~~~~G~V~~Is~~~~~~~~~~~~~~--~y~v~i~~-------~~~~~---~~~~~~L~~Gm~~~~~I~~~~ 419 (440) ..|.+.|.+.+....-..+-..... ...+.+++ +..+. ......|.+|-.+.+.+.+.+ T Consensus 318 ~~F~A~i~~l~~~~~i~~G~~~vlh~~t~~~~~~i~~I~~~id~~tg~~~~~~~~~l~~gd~a~v~l~~~~ 388 (426) T PRK12317 318 EEFTAQIVVLQHPSAITVGYTPVFHAHTAQVACTFEELVKKLDPRTGQVVEENPQFIKTGDAAIVKIKPTK 388 (426) T ss_pred EEEEEEEEEECCCCCCCCCCEEEEEECCCEEEEEEEEEEEEECCCCCCCCCCCCCCCCCCCEEEEEEEECC T ss_conf 08999999974898678996799995366899999998999827877551358546089998999999898 No 347 >pfam10828 DUF2570 Protein of unknown function (DUF2570). This is a family of proteins with unknown function. Probab=27.47 E-value=34 Score=12.97 Aligned_cols=19 Identities=26% Similarity=0.293 Sum_probs=11.2 Q ss_pred HHHHHHHHHHHHHHHHHHE Q ss_conf 9999999999999987640 Q gi|254780386|r 19 RMTLLSIIGIIGLITWSIL 37 (440) Q Consensus 19 ~~~~~~~~~~~~~~~w~~~ 37 (440) ++.....+.++|+++|+++ T Consensus 4 ~i~~~l~~lilgL~gwlw~ 22 (110) T pfam10828 4 YIKATLAALILGLGGWSWY 22 (110) T ss_pred HHHHHHHHHHHHHHHHHHH T ss_conf 8999999999999999999 No 348 >pfam06638 Strabismus Strabismus protein. This family consists of several strabismus (STB) or Van Gogh-like (VANGL) proteins 1 and 2. The exact function of this family is unknown. It is thought, however that STB1 gene and STB2 may be potent tumour suppressor gene candidates. Probab=27.42 E-value=34 Score=12.96 Aligned_cols=26 Identities=15% Similarity=0.126 Sum_probs=17.2 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHEE Q ss_conf 86168999999999999999876406 Q gi|254780386|r 13 HPASLKRMTLLSIIGIIGLITWSILL 38 (440) Q Consensus 13 ~p~~~~~~~~~~~~~~~~~~~w~~~~ 38 (440) +..++-+..++++++++.+..|.+++ T Consensus 163 PRif~fRa~ll~Lv~l~~~syWLFY~ 188 (507) T pfam06638 163 PRIFVFRALLLVLVFLFLFSYWLFYG 188 (507) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHE T ss_conf 72189999999999999999987613 No 349 >TIGR00878 purM phosphoribosylformylglycinamidine cyclo-ligase; InterPro: IPR004733 The purine biosynthetic pathway in procaryotes enlists eleven enzymes, six of which use ATP. Enzymes 5 and 6 of this pathway, formylglycinamide ribonucleotide (FGAR) amidotransferase (PurL) and aminoimidazole ribonucleotide (AIR) synthetase (PurM) utilise ATP to activate the oxygen of an amide within their substrate toward nucleophilic attack by a nitrogen. AIR synthetase uses the product of PurL, formylglycinamidine ribonucleotide (FGAM) and ATP to make AIR, ADP and P(i).; GO: 0004641 phosphoribosylformylglycinamidine cyclo-ligase activity, 0006189 'de novo' IMP biosynthetic process, 0005737 cytoplasm. Probab=26.88 E-value=35 Score=12.90 Aligned_cols=26 Identities=19% Similarity=0.447 Sum_probs=16.1 Q ss_pred CEEEEEECCCCCEEECCCEEEEECHH Q ss_conf 68999985789893369789999648 Q gi|254780386|r 65 GIIKKFQVENGSQILQGTPLLTFEDI 90 (440) Q Consensus 65 G~V~~i~VkeGd~VkkGq~L~~ld~~ 90 (440) |+|++=.|=.|+.|+.||+|+=|..+ T Consensus 161 G~Vekd~iitG~~v~~GDv~igl~SS 186 (338) T TIGR00878 161 GVVEKDEIITGEKVKPGDVLIGLGSS 186 (338) T ss_pred EEEEECCCCCCCCCCCCCEEEEECCC T ss_conf 99962127882320678789983178 No 350 >CHL00020 psbN photosystem II protein N Probab=26.82 E-value=35 Score=12.89 Aligned_cols=26 Identities=4% Similarity=0.006 Sum_probs=17.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHEEEEE Q ss_conf 68999999999999999876406712 Q gi|254780386|r 16 SLKRMTLLSIIGIIGLITWSILLPIE 41 (440) Q Consensus 16 ~~~~~~~~~~~~~~~~~~w~~~~~~~ 41 (440) .+.++++.+++.+.++.+|.+|++-. T Consensus 6 ~~~i~i~~ll~~~TgyaiYtaFGpps 31 (43) T CHL00020 6 LVAIFISGLLVSFTGYALYTAFGQPS 31 (43) T ss_pred HHHHHHHHHHHHHHHHEEHHCCCCHH T ss_conf 17999999999975210001148815 No 351 >PRK05338 rplS 50S ribosomal protein L19; Provisional Probab=26.65 E-value=35 Score=12.87 Aligned_cols=36 Identities=19% Similarity=0.240 Sum_probs=26.1 Q ss_pred HHHHHHCCCCCCEEEEEEECCCC--CCCEEEEEEEEEE Q ss_conf 22311015898317999812887--5443999999971 Q gi|254780386|r 336 PKQIQHVKKGYTATVRFPHYADI--REKKFKAIIEKID 371 (440) Q Consensus 336 ~~di~~i~~Gq~v~i~~~a~~~~--~~~~~~G~V~~Is 371 (440) ..|++.+++|+.|.|.+--.... +.-.|.|.|..+- T Consensus 13 k~~iP~f~~GDtV~V~~~i~EG~KeRiQ~feGvvI~~~ 50 (112) T PRK05338 13 RKDIPEFRPGDTVRVHVKVVEGNKERIQAFEGVVIARR 50 (112) T ss_pred CCCCCCCCCCCEEEEEEEEEECCCEEEEEEEEEEEEEE T ss_conf 44699859999999999999689638875689999973 No 352 >PRK13380 glycine cleavage system protein H; Provisional Probab=26.54 E-value=35 Score=12.86 Aligned_cols=36 Identities=8% Similarity=0.089 Sum_probs=16.3 Q ss_pred CCEEEEEEEEEECCCCCCCCCC---CCCEEEEEEEEECH Q ss_conf 4439999999713344467776---55118999999350 Q gi|254780386|r 360 EKKFKAIIEKIDPIISQQDNNL---QTQEYYEVILKITD 395 (440) Q Consensus 360 ~~~~~G~V~~Is~~~~~~~~~~---~~~~~y~v~i~~~~ 395 (440) +-++.|+|..+-.....++..- ..+.-..++|++++ T Consensus 71 ~sPvsG~VvevN~~l~~~P~liN~dPy~~GWl~~ik~~d 109 (132) T PRK13380 71 PMPLDGEVVEVNDALEDSPELVNEDPYGEGWFFRFKPAD 109 (132) T ss_pred CCCCCCEEEEECHHHHHCHHHHCCCCCCCCEEEEEEECC T ss_conf 277252588853434539586527988896799999899 No 353 >PRK02655 psbI photosystem II reaction center I protein I; Provisional Probab=26.31 E-value=25 Score=13.86 Aligned_cols=23 Identities=17% Similarity=0.377 Sum_probs=16.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHEEE Q ss_conf 89999999999999998764067 Q gi|254780386|r 17 LKRMTLLSIIGIIGLITWSILLP 39 (440) Q Consensus 17 ~~~~~~~~~~~~~~~~~w~~~~~ 39 (440) .++..|.++++|+++.++.+++. T Consensus 4 LKi~VY~vV~fFv~lFiFGfLs~ 26 (38) T PRK02655 4 LKISVYIVVFFFVGIFVFGFLSS 26 (38) T ss_pred EEEEHHHHHHHHHHHHHHHHCCC T ss_conf 88733778999999998863058 No 354 >PRK10698 phage shock protein PspA; Provisional Probab=26.27 E-value=32 Score=13.13 Aligned_cols=13 Identities=8% Similarity=0.125 Sum_probs=4.4 Q ss_pred HHHHHHHHHHHHH Q ss_conf 9987434332332 Q gi|254780386|r 97 DLKKVSIKNLRCH 109 (440) Q Consensus 97 ~~~~~~l~~l~~~ 109 (440) .+.+..+...+.. T Consensus 34 ~eme~~l~~~r~~ 46 (222) T PRK10698 34 QEMEDTLVEVRST 46 (222) T ss_pred HHHHHHHHHHHHH T ss_conf 9999999999999 No 355 >TIGR00739 yajC preprotein translocase, YajC subunit; InterPro: IPR003849 This entry describes proteins of unknown function.. Probab=26.19 E-value=36 Score=12.82 Aligned_cols=27 Identities=26% Similarity=0.265 Sum_probs=18.2 Q ss_pred HHHHHCCCCCCEEEEEEECCCCCCCEEEEEEEEEEC Q ss_conf 231101589831799981288754439999999713 Q gi|254780386|r 337 KQIQHVKKGYTATVRFPHYADIREKKFKAIIEKIDP 372 (440) Q Consensus 337 ~di~~i~~Gq~v~i~~~a~~~~~~~~~~G~V~~Is~ 372 (440) .=+..+++|++|.. .|.+.|+|..|+- T Consensus 33 kl~~~L~KGd~V~T---------~gGi~G~V~~i~e 59 (86) T TIGR00739 33 KLIESLKKGDKVLT---------IGGIIGTVTKIAE 59 (86) T ss_pred HHHHCCCCCCEEEE---------CCCEEEEEEEEEC T ss_conf 88852799778998---------3883899988523 No 356 >pfam01245 Ribosomal_L19 Ribosomal protein L19. Probab=26.11 E-value=36 Score=12.81 Aligned_cols=72 Identities=10% Similarity=0.048 Sum_probs=38.1 Q ss_pred HHHHHHCCCCCCEEEEEEECCCCC--CCEEEEEEEEEECCCCCCCCCCCCCEEEEEEEEECHHHCCCCCCEECCCCEEEE Q ss_conf 223110158983179998128875--443999999971334446777655118999999350112567851523775699 Q gi|254780386|r 336 PKQIQHVKKGYTATVRFPHYADIR--EKKFKAIIEKIDPIISQQDNNLQTQEYYEVILKITDLAFSDNNIELRNGFPAEV 413 (440) Q Consensus 336 ~~di~~i~~Gq~v~i~~~a~~~~~--~~~~~G~V~~Is~~~~~~~~~~~~~~~y~v~i~~~~~~~~~~~~~L~~Gm~~~~ 413 (440) ..|++.+++|+.|.|.+--..... .-.|.|.|..+..... ...|.|+=-+..... ..-++|.+=+-..+ T Consensus 13 k~~ip~f~~GDtv~V~~~i~Eg~k~RiQ~FeGvvI~~~~~G~--------~~tftvRki~~g~GV-Er~fpl~SP~i~~I 83 (113) T pfam01245 13 KKDLPEFRVGDTVRVHVKIKEGNKERIQVFEGVVIARRGRGL--------NSTFTVRKISQGVGV-ERVFPLHSPRIDSI 83 (113) T ss_pred CCCCCCCCCCCEEEEEEEEEECCEEEEEEEEEEEEEEECCCC--------CEEEEEEEECCCCCE-EEEEECCCCCCCEE T ss_conf 245998499999999999984881688899999999957887--------416999983168758-99998779974679 Q ss_pred EEE Q ss_conf 999 Q gi|254780386|r 414 LFT 416 (440) Q Consensus 414 ~I~ 416 (440) .|+ T Consensus 84 eV~ 86 (113) T pfam01245 84 EVV 86 (113) T ss_pred EEE T ss_conf 999 No 357 >PRK00049 elongation factor Tu; Reviewed Probab=26.09 E-value=36 Score=12.80 Aligned_cols=120 Identities=12% Similarity=0.104 Sum_probs=50.8 Q ss_pred EEEEECCEEECCCCCCCCCCCCCCCCC--CCCCEEEE-EEECHHHHHHCCCCCCEEEEEEECCCC--------------- Q ss_conf 168731212012221244455421033--55650477-532622311015898317999812887--------------- Q gi|254780386|r 297 AGTIVYNQSFSSSNYAQQSQPLMKIVP--HSKLTYIR-AKVTPKQIQHVKKGYTATVRFPHYADI--------------- 358 (440) Q Consensus 297 ~G~V~~l~~~~~G~~v~~G~~l~~Ivp--~~~~l~v~-~~v~~~di~~i~~Gq~v~i~~~a~~~~--------------- 358 (440) .|+|..-.+ ..+.+..|+.|. |+| +.....|. ..+....+.....|+.|.+.+...+.. T Consensus 227 ~GtVVtGtv--~sG~i~~Gd~v~-i~~~~~~~~~~VksIq~~~~~v~~a~aG~~v~i~L~gi~~~~I~rG~vL~~p~~~~ 303 (397) T PRK00049 227 RGTVVTGRV--ERGIIKVGEEVE-IVGIRDTQKTTVTGVEMFRKLLDEGQAGDNVGLLLRGIKREDVERGQVLAKPGSIT 303 (397) T ss_pred CCEEEEEEE--EEEEECCCCEEE-EEECCCCCEEEEEEEEECCEEEEEECCCCEEEEEECCCCHHHCCCCCEEECCCCCC T ss_conf 727999898--000560799899-96069884799999996270702635887799997799889857601995699888 Q ss_pred CCCEEEEEEEEEECCCC-C-CCCCCC-CCEEEEEEEEECH-HHCCCCCCEECCCCEEEEEEEECE Q ss_conf 54439999999713344-4-677765-5118999999350-112567851523775699999220 Q gi|254780386|r 359 REKKFKAIIEKIDPIIS-Q-QDNNLQ-TQEYYEVILKITD-LAFSDNNIELRNGFPAEVLFTAKN 419 (440) Q Consensus 359 ~~~~~~G~V~~Is~~~~-~-~~~~~~-~~~~y~v~i~~~~-~~~~~~~~~L~~Gm~~~~~I~~~~ 419 (440) ....|.++|.-.+..-. . .+-... ...+|--.+.+.. ..+......++||..+.|.|...+ T Consensus 304 p~~~f~A~v~vL~~~~~~~h~pi~~gy~~~lh~~t~~v~~~i~l~~~~e~l~~Gd~a~v~i~l~~ 368 (397) T PRK00049 304 PHTKFEAEVYVLSKEEGGRHTPFFNGYRPQFYFRTTDVTGVIELPEGVEMVMPGDNVKMTVELIA 368 (397) T ss_pred CCEEEEEEEEEECCCCCCCCCCCCCCCEEEEEEEECEEEEEEEECCCCCEECCCCEEEEEEEECC T ss_conf 77299999999715457765653378269999850569999993689716358998999999898 No 358 >pfam01597 GCV_H Glycine cleavage H-protein. This is a family of glycine cleavage H-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in eukaryotes. A lipoyl group is attached to a completely conserved lysine residue. The H protein shuttles the methylamine group of glycine from the P protein to the T protein. Probab=25.93 E-value=36 Score=12.78 Aligned_cols=34 Identities=6% Similarity=0.059 Sum_probs=16.4 Q ss_pred EEEEEEEEEECCCCCCCCC---CCCCEEEEEEEEECH Q ss_conf 3999999971334446777---655118999999350 Q gi|254780386|r 362 KFKAIIEKIDPIISQQDNN---LQTQEYYEVILKITD 395 (440) Q Consensus 362 ~~~G~V~~Is~~~~~~~~~---~~~~~~y~v~i~~~~ 395 (440) ++.|+|..+-.....++.. ...+.-..+++++.+ T Consensus 66 PisG~Vv~vN~~l~~~P~liN~dpy~~gWl~~i~~~d 102 (122) T pfam01597 66 PVSGEVVEVNEKLEDNPGLINKDPYEDGWIAKLKPSN 102 (122) T ss_pred CCCEEEEEEHHHHHHCHHHHCCCCCCCCEEEEEEECC T ss_conf 6342799875534219386438988895699999899 No 359 >CHL00182 tatC Sec-independent translocase component C; Provisional Probab=25.79 E-value=36 Score=12.77 Aligned_cols=20 Identities=10% Similarity=0.222 Sum_probs=9.1 Q ss_pred CCCCCCCCCCCHHHHHHHHH Q ss_conf 34576335686168999999 Q gi|254780386|r 4 FMWKNLKRNHPASLKRMTLL 23 (440) Q Consensus 4 ~~~~~~~~~~p~~~~~~~~~ 23 (440) |..|-|.++..++.+.++.. T Consensus 94 Fi~PgL~~~Ekr~~~~~~~~ 113 (239) T CHL00182 94 FILPGLTKKERKIILPLLIG 113 (239) T ss_pred HHHHHCCHHHHHHHHHHHHH T ss_conf 97621473458999999999 No 360 >CHL00024 psbI photosystem II protein I Probab=25.68 E-value=34 Score=12.96 Aligned_cols=23 Identities=22% Similarity=0.307 Sum_probs=15.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHEEE Q ss_conf 89999999999999998764067 Q gi|254780386|r 17 LKRMTLLSIIGIIGLITWSILLP 39 (440) Q Consensus 17 ~~~~~~~~~~~~~~~~~w~~~~~ 39 (440) .+++.|.++++|+++.++.+++. T Consensus 4 LKifVYtvV~fFvslFiFGfLsn 26 (36) T CHL00024 4 LKLFVYTVVIFFVSLFIFGFLSN 26 (36) T ss_pred EEEHHHHHHHHHHHHHHHHHHCC T ss_conf 87407999999999998863147 No 361 >KOG4807 consensus Probab=25.66 E-value=0.54 Score=24.96 Aligned_cols=31 Identities=19% Similarity=0.180 Sum_probs=20.9 Q ss_pred CCCCCCCCCCCCCCCEEEEEEECHHHHHHCCC Q ss_conf 44455421033556504775326223110158 Q gi|254780386|r 313 QQSQPLMKIVPHSKLTYIRAKVTPKQIQHVKK 344 (440) Q Consensus 313 ~~G~~l~~Ivp~~~~l~v~~~v~~~di~~i~~ 344 (440) ..|.|+..=- +.-.+.|-..|.+++|..+|. T Consensus 496 GtGsplaqgk-dayELEVLLRVKEsEiQYLKq 526 (593) T KOG4807 496 GTGSPLAQGK-DAYELEVLLRVKESEIQYLKQ 526 (593) T ss_pred CCCCCCCCCC-CHHHHHHHHHHHHHHHHHHHH T ss_conf 8888534575-126678888722989999999 No 362 >pfam09962 DUF2196 Uncharacterized conserved protein (DUF2196). This domain, found in various hypothetical bacterial proteins, has no known function. Probab=25.35 E-value=37 Score=12.71 Aligned_cols=36 Identities=14% Similarity=0.178 Sum_probs=25.4 Q ss_pred HHHHHCCCCCCEEEEEEECCCCCCCEEEEEEEEEECC Q ss_conf 2311015898317999812887544399999997133 Q gi|254780386|r 337 KQIQHVKKGYTATVRFPHYADIREKKFKAIIEKIDPI 373 (440) Q Consensus 337 ~di~~i~~Gq~v~i~~~a~~~~~~~~~~G~V~~Is~~ 373 (440) .....|++|..|.|-+.. +..+...-.|.|..|=.. T Consensus 4 ~~R~~I~~G~~V~IVlK~-DQ~tG~lt~G~V~~iLT~ 39 (62) T pfam09962 4 TTRSNIKPGLEVDIVLKQ-DQRTGKLTEGIVKDILTN 39 (62) T ss_pred CCCCCCCCCCEEEEEECC-CCCCCCEEEEEHHHHHCC T ss_conf 340536899989999816-568896865784453328 No 363 >PRK01318 putative inner membrane protein translocase component YidC; Provisional Probab=25.26 E-value=37 Score=12.70 Aligned_cols=23 Identities=26% Similarity=0.602 Sum_probs=14.9 Q ss_pred HHHHHHHHHHHHHHHHHHHEEEE Q ss_conf 99999999999999987640671 Q gi|254780386|r 18 KRMTLLSIIGIIGLITWSILLPI 40 (440) Q Consensus 18 ~~~~~~~~~~~~~~~~w~~~~~~ 40 (440) |++++.+++.++++++|.+|.+- T Consensus 4 Kn~lLaivLs~~vL~gw~~~~~~ 26 (565) T PRK01318 4 KRLILAIALSFVVLLGWQYFFPD 26 (565) T ss_pred HHHHHHHHHHHHHHHHHHHHCCC T ss_conf 88999999999999999997389 No 364 >TIGR00849 gutA PTS system, glucitol/sorbitol-specific IIA component; InterPro: IPR004716 The phosphoenolpyruvate-dependent sugar phosphotransferase system (PTS) , is a major carbohydrate transport system in bacteria. The PTS catalyses the phosphorylation of incoming sugar substrates and coupled with translocation across the cell membrane, makes the PTS a link between the uptake and metabolism of sugars. The general mechanism of the PTS is the following: a phosphoryl group from phosphoenolpyruvate (PEP) is transferred via a signal transduction pathway, to enzyme I (EI) which in turn transfers it to a phosphoryl carrier, the histidine protein (HPr). Phospho-HPr then transfers the phosphoryl group to a sugar-specific permease, a membrane-bound complex known as enzyme 2 (EII), which transports the sugar to the cell. EII consists of at least three structurally distinct domains IIA, IIB and IIC . These can either be fused together in a single polypeptide chain or exist as two or three interactive chains, formerly called enzymes II (EII) and III (EIII). The first domain (IIA or EIIA) carries the first permease-specific phosphorylation site, a histidine which is phosphorylated by phospho-HPr. The second domain (IIB or EIIB) is phosphorylated by phospho-IIA on a cysteinyl or histidyl residue, depending on the sugar transported. Finally, the phosphoryl group is transferred from the IIB domain to the sugar substrate concomitantly with the sugar uptake processed by the IIC domain. This third domain (IIC or EIIC) forms the translocation channel and the specific substrate-binding site. An additional transmembrane domain IID, homologous to IIC, can be found in some PTSs, e.g. for mannose , , , . The Man family is unique in several respects among PTS permease families. It is the only PTS family in which members possess a IID protein. It is the only PTS family in which the IIB constituent is phosphorylated on a histidyl rather than a cysteyl residue. Its permease members exhibit broad specificity for a range of sugars, rather than being specific for just one or a few sugars. This family consists only of glucitol-specific transporters, and occur both in Gram-negative and Gram-positive bacteria. The system in Escherichia coli consists of a IIA protein, and a IIBC protein. This family is specific for the IIA component.; GO: 0008982 protein-N(PI)-phosphohistidine-sugar phosphotransferase activity, 0009401 phosphoenolpyruvate-dependent sugar phosphotransferase system, 0005737 cytoplasm. Probab=25.21 E-value=37 Score=12.70 Aligned_cols=63 Identities=16% Similarity=0.057 Sum_probs=33.8 Q ss_pred ECCEEECCCCCCCCCCCCCCCCCCCCCEEEEEEECHHHHHHCCCCCCEEEEEEECCCCCCCEEEEEEEEEE Q ss_conf 31212012221244455421033556504775326223110158983179998128875443999999971 Q gi|254780386|r 301 VYNQSFSSSNYAQQSQPLMKIVPHSKLTYIRAKVTPKQIQHVKKGYTATVRFPHYADIREKKFKAIIEKID 371 (440) Q Consensus 301 ~~l~~~~~G~~v~~G~~l~~Ivp~~~~l~v~~~v~~~di~~i~~Gq~v~i~~~a~~~~~~~~~~G~V~~Is 371 (440) +.... ..|+.+.||..++ | +...+-|+|==....=+....|- ++++||..+ ...|+|+|.=-. T Consensus 48 ~~~~e-~kgG~l~pG~~f~-i--g~~~ypvTAVG~VAe~NLr~LGH-iTvrFdG~~---~aefpGtvhV~g 110 (129) T TIGR00849 48 HNIKE-VKGGTLKPGQVFS-I--GEIAYPVTAVGDVAEKNLRELGH-ITVRFDGSN---EAEFPGTVHVEG 110 (129) T ss_pred HCCCC-CCCCEECCCCEEE-E--CCEECCEEEECHHHHHCCHHCCE-EEEEECCCC---CCCCCCEEEECC T ss_conf 30322-3676006864343-5--77531045310477532142480-788876888---887871068469 No 365 >PRK01294 lipase chaperone; Provisional Probab=25.20 E-value=37 Score=12.70 Aligned_cols=23 Identities=26% Similarity=0.092 Sum_probs=10.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHEE Q ss_conf 68999999999999999876406 Q gi|254780386|r 16 SLKRMTLLSIIGIIGLITWSILL 38 (440) Q Consensus 16 ~~~~~~~~~~~~~~~~~~w~~~~ 38 (440) ..+..+|+++.++++.++|.++. T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~ 26 (339) T PRK01294 4 MRKVALYLIIGAAAVLAVWYWLG 26 (339) T ss_pred HHHHHHHHHHHHHHHHHHHHHHC T ss_conf 89999999999999999999806 No 366 >PRK05585 yajC preprotein translocase subunit YajC; Validated Probab=24.62 E-value=38 Score=12.63 Aligned_cols=26 Identities=12% Similarity=0.201 Sum_probs=13.7 Q ss_pred HHHHCCCCCCEEEEEEECCCCCCCEEEEEEEEEEC Q ss_conf 31101589831799981288754439999999713 Q gi|254780386|r 338 QIQHVKKGYTATVRFPHYADIREKKFKAIIEKIDP 372 (440) Q Consensus 338 di~~i~~Gq~v~i~~~a~~~~~~~~~~G~V~~Is~ 372 (440) -++.+++|++|.- .|.+-|+|..|+. T Consensus 50 m~~~L~~Gd~VvT---------~gGi~G~I~~v~d 75 (107) T PRK05585 50 MLSSLAKGDEVVT---------NGGIIGKVTKVSE 75 (107) T ss_pred HHHHCCCCCEEEE---------CCCCEEEEEEEEC T ss_conf 9984589999998---------9985899999979 No 367 >PRK07400 30S ribosomal protein S1; Reviewed Probab=24.37 E-value=38 Score=12.60 Aligned_cols=55 Identities=9% Similarity=0.071 Sum_probs=34.2 Q ss_pred CCCCCCCCEEEEECCEEECCCCCCCCCCCCCCCCCCCCCEEEEEEECHHHHHH---------CCCCCCEEEEEEECCCC Q ss_conf 23455541168731212012221244455421033556504775326223110---------15898317999812887 Q gi|254780386|r 289 QKTILSPIAGTIVYNQSFSSSNYAQQSQPLMKIVPHSKLTYIRAKVTPKQIQH---------VKKGYTATVRFPHYADI 358 (440) Q Consensus 289 ~~~I~AP~~G~V~~l~~~~~G~~v~~G~~l~~Ivp~~~~l~v~~~v~~~di~~---------i~~Gq~v~i~~~a~~~~ 358 (440) ...+-.-+.|+|..+. .-|.+|.-|. |+..++-++++. +++||.|.+++-.++.. T Consensus 193 ~l~vG~vv~G~V~~I~--~fGaFVdigG-------------vdGLlHiSeiSw~rv~~p~dv~kvGd~V~vkVi~iD~e 256 (314) T PRK07400 193 RLEVGEVVVGTVRGIK--PYGAFIDIGG-------------VSGLLHISEISHEHIETPHSVFNVNDQMKVMIIDLDAE 256 (314) T ss_pred CCCCCCEEEEEEEEEE--CEEEEEEECC-------------CEEEEEHHHCCCCCCCCHHHHCCCCCEEEEEEEEECCC T ss_conf 0889988999999996--4088999789-------------18999814616544489878267999999999997388 No 368 >KOG2419 consensus Probab=24.30 E-value=38 Score=12.59 Aligned_cols=22 Identities=14% Similarity=0.258 Sum_probs=13.8 Q ss_pred ECCCCEEEEEEEECEEEHHHHH Q ss_conf 5237756999992205479999 Q gi|254780386|r 405 LRNGFPAEVLFTAKNTTLAQEI 426 (440) Q Consensus 405 L~~Gm~~~~~I~~~~~~~~~~~ 426 (440) .+-||..-+.|.+.+.+-+..+ T Consensus 949 VkvGm~iGv~i~~~~p~~~e~~ 970 (975) T KOG2419 949 VKVGMQIGVSIKVDDPKKLERI 970 (975) T ss_pred HHHHHHHCEECCCCCCCHHHHC T ss_conf 9877761411368995222201 No 369 >PRK01202 glycine cleavage system protein H; Provisional Probab=24.18 E-value=39 Score=12.57 Aligned_cols=35 Identities=11% Similarity=0.127 Sum_probs=16.1 Q ss_pred CEEEEEEEEEECCCCCCCCCC---CCCEEEEEEEEECH Q ss_conf 439999999713344467776---55118999999350 Q gi|254780386|r 361 KKFKAIIEKIDPIISQQDNNL---QTQEYYEVILKITD 395 (440) Q Consensus 361 ~~~~G~V~~Is~~~~~~~~~~---~~~~~y~v~i~~~~ 395 (440) -++.|+|..|-.....++.-- ..+.-..+++++.+ T Consensus 71 sPvsG~Vv~vN~~l~~~P~liN~dpy~~GWl~~i~~~d 108 (127) T PRK01202 71 APVSGEVVEVNEALEDSPELVNSDPYGEGWLFKLKPSD 108 (127) T ss_pred CCCCEEEEEECHHHHHCHHHHCCCCCCCCEEEEEEECC T ss_conf 88520899864654529386448988896799999899 No 370 >cd05697 S1_Rrp5_repeat_hs5 S1_Rrp5_repeat_hs5: Rrp5 is a trans-acting factor important for biogenesis of both the 40S and 60S eukaryotic ribosomal subunits. Rrp5 has two distinct regions, an N-terminal region containing tandemly repeated S1 RNA-binding domains (12 S1 repeats in Saccharomyces cerevisiae Rrp5 and 14 S1 repeats in Homo sapiens Rrp5) and a C-terminal region containing tetratricopeptide repeat (TPR) motifs thought to be involved in protein-protein interactions. Mutational studies have shown that each region represents a specific functional domain. Deletions within the S1-containing region inhibit pre-rRNA processing at either site A3 or A2, whereas deletions within the TPR region confer an inability to support cleavage of A0-A2. This CD includes H. sapiens S1 repeat 5 (hs5) and S. cerevisiae S1 repeat 5 (sc5). Rrp5 is found in eukaryotes but not in prokaryotes or archaea. Probab=24.16 E-value=39 Score=12.57 Aligned_cols=17 Identities=24% Similarity=0.305 Sum_probs=9.9 Q ss_pred CEEEEECCEEECCCCCCCC Q ss_conf 1168731212012221244 Q gi|254780386|r 296 IAGTIVYNQSFSSSNYAQQ 314 (440) Q Consensus 296 ~~G~V~~l~~~~~G~~v~~ 314 (440) +.|+|..+. .-|.+|.- T Consensus 4 v~G~V~~i~--~fG~fV~l 20 (69) T cd05697 4 VKGTIRKLR--PSGIFVKL 20 (69) T ss_pred EEEEEEEEC--CCEEEEEE T ss_conf 999999973--84279995 No 371 >PRK02749 photosystem I reaction center subunit IV; Provisional Probab=24.12 E-value=39 Score=12.56 Aligned_cols=43 Identities=14% Similarity=0.220 Sum_probs=27.9 Q ss_pred HCCCCCCEEEEE-EECCCCCCCEEEEEEEEEECCCCCCCCCCCCCEEEEEEEEECHHH Q ss_conf 015898317999-812887544399999997133444677765511899999935011 Q gi|254780386|r 341 HVKKGYTATVRF-PHYADIREKKFKAIIEKIDPIISQQDNNLQTQEYYEVILKITDLA 397 (440) Q Consensus 341 ~i~~Gq~v~i~~-~a~~~~~~~~~~G~V~~Is~~~~~~~~~~~~~~~y~v~i~~~~~~ 397 (440) -|+.|.+|+|.- .+|=++. -|+|.+|+.. +--|+|.|++++.. T Consensus 2 ai~rGskVrIlR~ESYWyne----vGtVasvd~s----------gi~YPV~VRF~kvN 45 (71) T PRK02749 2 AISRGDKVRILRPESYWYNE----VGTVASVDKS----------GIKYPVIVRFDKVN 45 (71) T ss_pred CCCCCCEEEECCCCCEEECC----CCEEEEECCC----------CCCCCEEEEEEEEE T ss_conf 64368898873644442126----5259998157----------96500799976641 No 372 >TIGR02902 spore_lonB ATP-dependent protease LonB; InterPro: IPR014251 This entry represents LonB, a paralog of the ATP-dependent protease La (LonA, IPR004815 from INTERPRO). LonB proteins are unassigned peptidases belonging to the MEROPS peptidase family S16 (lon protease family, clan SJ) and are found strictly, and almost universally, in endospore-forming bacteria. This protease was shown, in Bacillus subtilis, to be expressed specifically in the forespore during sporulation, under control of sigmaF . The lonB gene, despite being located immediately upstream of lonA, was shown to be monocistronic. LonB appears to be involved in the post-translation control of sigmaH, but lonB mutation did not produce an obvious sporulation defect under the conditions tested . Note that additional paralogs of LonA and LonB occur in the Clostridium lineage and these are excluded from this entry. . Probab=23.80 E-value=39 Score=12.52 Aligned_cols=13 Identities=8% Similarity=0.258 Sum_probs=7.6 Q ss_pred EECCCCEEEEEEE Q ss_conf 1523775699999 Q gi|254780386|r 404 ELRNGFPAEVLFT 416 (440) Q Consensus 404 ~L~~Gm~~~~~I~ 416 (440) +=+--||+++++. T Consensus 463 dn~vAmTGEISL~ 475 (532) T TIGR02902 463 DNKVAMTGEISLN 475 (532) T ss_pred CCCEEEEEEEEEE T ss_conf 7721113338640 No 373 >PRK01885 greB transcription elongation factor GreB; Reviewed Probab=23.68 E-value=38 Score=12.63 Aligned_cols=30 Identities=10% Similarity=0.124 Sum_probs=19.9 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 222222212223322122222123455541 Q gi|254780386|r 267 EQNQRTFADEMSKLTILEKTREQKTILSPI 296 (440) Q Consensus 267 ~e~~~~l~~~~~~l~~l~~~l~~~~I~AP~ 296 (440) ..++.+...+..++..++..|.++.|.-|. T Consensus 52 ~aAke~q~~le~rI~~Le~~L~~A~iid~~ 81 (159) T PRK01885 52 IYGKKRLREIDRRVRFLTKRLENLKVVDYS 81 (159) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 999999999999999999998733035766 No 374 >PRK00010 rplE 50S ribosomal protein L5; Validated Probab=23.45 E-value=40 Score=12.48 Aligned_cols=20 Identities=15% Similarity=0.182 Sum_probs=10.9 Q ss_pred CEECCCCEEEEEEEECEEEHHH Q ss_conf 5152377569999922054799 Q gi|254780386|r 403 IELRNGFPAEVLFTAKNTTLAQ 424 (440) Q Consensus 403 ~~L~~Gm~~~~~I~~~~~~~~~ 424 (440) ++-.+||.++ |+|--++--+ T Consensus 146 ~~~~~G~~It--ivTtA~t~~e 165 (179) T PRK00010 146 IDKIRGMDIT--IVTTAKTDEE 165 (179) T ss_pred CCCCCCCEEE--EEECCCCHHH T ss_conf 5688897589--9947699899 No 375 >COG3096 MukB Uncharacterized protein involved in chromosome partitioning [Cell division and chromosome partitioning] Probab=23.41 E-value=31 Score=13.20 Aligned_cols=18 Identities=6% Similarity=-0.071 Sum_probs=7.6 Q ss_pred EECCCCCCCEEEEEEEEEE Q ss_conf 8128875443999999971 Q gi|254780386|r 353 PHYADIREKKFKAIIEKID 371 (440) Q Consensus 353 ~a~~~~~~~~~~G~V~~Is 371 (440) +-+|.+-| -+.|-+...+ T Consensus 728 ~dCPeDLY-LIEGDp~aFD 745 (1480) T COG3096 728 TDCPEDLY-LIEGDPQSFD 745 (1480) T ss_pred CCCCCHHE-EECCCCCCCC T ss_conf 04960223-4248975211 No 376 >PRK08582 hypothetical protein; Provisional Probab=22.71 E-value=41 Score=12.39 Aligned_cols=56 Identities=18% Similarity=0.290 Sum_probs=30.8 Q ss_pred CCCCCEEEEECCEEECCCCCCCCCC---CCCCCCCCCCCEEEEEEECHHHH-HHCCCCCCEEEEEEECC Q ss_conf 5554116873121201222124445---54210335565047753262231-10158983179998128 Q gi|254780386|r 292 ILSPIAGTIVYNQSFSSSNYAQQSQ---PLMKIVPHSKLTYIRAKVTPKQI-QHVKKGYTATVRFPHYA 356 (440) Q Consensus 292 I~AP~~G~V~~l~~~~~G~~v~~G~---~l~~Ivp~~~~l~v~~~v~~~di-~~i~~Gq~v~i~~~a~~ 356 (440) +=.-+.|+|..+. .-|.||.-+. -|+.|- .-+.-+| .++ ..+++||.|++++-..+ T Consensus 5 vG~iv~G~V~~I~--~fGaFV~l~~g~~GLvHIS-eis~~~V------~~i~d~lkvGd~V~vKVi~id 64 (139) T PRK08582 5 VGSKLQGKVTGIT--NFGAFVELPEGKTGLVHIS-EVADNYV------KDINDHLSVGDEVEVKVLNVE 64 (139) T ss_pred CCCEEEEEEEEEE--CCEEEEEECCCCEEEEEHH-HHCCCCC------CCHHHHCCCCCEEEEEEEEEC T ss_conf 7879999998996--4178999569966788736-6256656------899995789999999999888 No 377 >COG3871 Uncharacterized stress protein (general stress protein 26) [General function prediction only] Probab=22.66 E-value=41 Score=12.38 Aligned_cols=44 Identities=16% Similarity=0.232 Sum_probs=22.8 Q ss_pred CCEEEEEEECHHHHHHCCCCCCEEEEEEECCCC-CCCEEEEEEEEE Q ss_conf 650477532622311015898317999812887-544399999997 Q gi|254780386|r 326 KLTYIRAKVTPKQIQHVKKGYTATVRFPHYADI-REKKFKAIIEKI 370 (440) Q Consensus 326 ~~l~v~~~v~~~di~~i~~Gq~v~i~~~a~~~~-~~~~~~G~V~~I 370 (440) .++|.-..-...-+-.|+.+..|.|-|- ++.. -|-.+.|+..-+ T Consensus 45 ~tiwf~T~kds~~v~eik~n~~v~v~~~-~~~~~~fv~v~Gtael~ 89 (145) T COG3871 45 GTIWFFTNKDSRKVEEIKKNPKVCVLFG-YDDHDAFVEVSGTAELV 89 (145) T ss_pred CCEEEECCCCHHHHHHHHHCCCEEEEEE-CCCCCCEEEEEEEEEEE T ss_conf 4078633675388898750880799996-38986169998778720 No 378 >pfam10498 IFT57 Intra-flagellar transport protein 57. Eukaryotic cilia and flagella are specialized organelles found at the periphery of cells of diverse organisms. Intra-flagellar transport (IFT) is required for the assembly and maintenance of eukaryotic cilia and flagella, and consists of the bidirectional movement of large protein particles between the base and the distal tip of the organelle. IFT particles contain multiple copies of two distinct protein complexes, A and B, which contain at least 6 and 11 protein subunits. IFT57 is part of complex B but is not, however, required for the core subunits to stay associated. This protein is known as Huntington-interacting protein-1 in humans. Probab=22.56 E-value=41 Score=12.37 Aligned_cols=16 Identities=25% Similarity=0.374 Sum_probs=9.6 Q ss_pred CCCCCCCCCCCCCCCC Q ss_conf 1222124445542103 Q gi|254780386|r 307 SSSNYAQQSQPLMKIV 322 (440) Q Consensus 307 ~~G~~v~~G~~l~~Iv 322 (440) ..|..++-|+||+.|- T Consensus 315 erg~~mtD~sPl~~IK 330 (355) T pfam10498 315 ERGASMSDGSPLVKIK 330 (355) T ss_pred HHCCCCCCCCHHHHHH T ss_conf 8524567887799999 No 379 >PRK00276 infA translation initiation factor IF-1; Validated Probab=22.37 E-value=42 Score=12.34 Aligned_cols=58 Identities=17% Similarity=0.176 Sum_probs=32.8 Q ss_pred CEEEEECCEEECCCCCCCCCCCCCCCCCCCCCEEEEEEECHH---HHHHCCCCCCEEEEEEECCCCCCCEEEEEEEE Q ss_conf 116873121201222124445542103355650477532622---31101589831799981288754439999999 Q gi|254780386|r 296 IAGTIVYNQSFSSSNYAQQSQPLMKIVPHSKLTYIRAKVTPK---QIQHVKKGYTATVRFPHYADIREKKFKAIIEK 369 (440) Q Consensus 296 ~~G~V~~l~~~~~G~~v~~G~~l~~Ivp~~~~l~v~~~v~~~---di~~i~~Gq~v~i~~~a~~~~~~~~~~G~V~~ 369 (440) ++|+|...- +++.. -+. .++...|.++++-. .-=+|-+|+.|.|-++.|+-. .|+|++ T Consensus 9 ~~G~V~e~l---pn~~F-----~V~---Leng~~v~a~~sGKmR~~~Iril~GD~V~vE~spYDlt-----kGRIv~ 69 (72) T PRK00276 9 MEGTVLETL---PNAMF-----RVE---LENGHEVLAHISGKMRKNYIRILPGDKVTVELSPYDLT-----KGRITY 69 (72) T ss_pred EEEEEEEEC---CCCEE-----EEE---ECCCCEEEEEECHHEEEEEEEECCCCEEEEEECCCCCC-----CEEEEE T ss_conf 999999985---99889-----999---78999999997413110169975899899998866799-----578999 No 380 >pfam09787 Golgin_A5 Golgin subfamily A member 5. Members of this family of proteins are involved in maintaining Golgi structure. They stimulate the formation of Golgi stacks and ribbons, and are involved in intra-Golgi retrograde transport. Two main interactions have been characterized: one with RAB1A that has been activated by GTP-binding and another with isoform CASP of CUTL1. Probab=22.03 E-value=7.5 Score=17.31 Aligned_cols=18 Identities=6% Similarity=0.281 Sum_probs=7.1 Q ss_pred HHHHHHHHHHHHHHHHCC Q ss_conf 999874343323321001 Q gi|254780386|r 96 IDLKKVSIKNLRCHIDTE 113 (440) Q Consensus 96 l~~~~~~l~~l~~~l~~~ 113 (440) +.+...++..++.+++.. T Consensus 115 l~e~~~El~~lk~~le~l 132 (508) T pfam09787 115 LQEAAQELRELKSELEDL 132 (508) T ss_pred HHHHHHHHHHHHHHHHHH T ss_conf 887778763478999987 No 381 >PRK08059 general stress protein 13; Validated Probab=21.88 E-value=43 Score=12.28 Aligned_cols=71 Identities=14% Similarity=0.135 Sum_probs=39.5 Q ss_pred CCCCCCCCEEEEECCEEECCCCCCCCCCCCCCCCCCCCCEEEEEEECHHHH---------HHCCCCCCEEEEEEECCCCC Q ss_conf 234555411687312120122212444554210335565047753262231---------10158983179998128875 Q gi|254780386|r 289 QKTILSPIAGTIVYNQSFSSSNYAQQSQPLMKIVPHSKLTYIRAKVTPKQI---------QHVKKGYTATVRFPHYADIR 359 (440) Q Consensus 289 ~~~I~AP~~G~V~~l~~~~~G~~v~~G~~l~~Ivp~~~~l~v~~~v~~~di---------~~i~~Gq~v~i~~~a~~~~~ 359 (440) ...+-.-+.|+|..+. .-|.+|.-+.. +++.|+-+++ ..+++||.|.+++-..+.. T Consensus 3 k~kvGdiv~G~V~~I~--~fG~FV~l~~g------------v~GLihiSEls~~~v~~~~~~~~vGd~V~vkVi~iD~e- 67 (119) T PRK08059 3 QYEVGVVVTGKVTGIQ--DYGAFVALDEE------------TQGLVHISEITHGFVKDIHDFLKVGDEVKVKVLSVDEE- 67 (119) T ss_pred CCCCCCEEEEEEEEEE--CEEEEEEEECC------------CEEEEEHHHHCCCCCCCHHHHHCCCCEEEEEEEEEECC- T ss_conf 7899999999999997--40489999439------------53788878824566679999705899899999999899- Q ss_pred CCEEEEEEEEEECCC Q ss_conf 443999999971334 Q gi|254780386|r 360 EKKFKAIIEKIDPII 374 (440) Q Consensus 360 ~~~~~G~V~~Is~~~ 374 (440) .+.+.=.+..+.+++ T Consensus 68 ~~ri~LSlK~l~~~P 82 (119) T PRK08059 68 HGKISLSIRATEEAP 82 (119) T ss_pred CCEEEEECCCCCCCC T ss_conf 999987503576483 No 382 >PRK08559 nusG transcription antitermination protein NusG; Validated Probab=21.84 E-value=43 Score=12.27 Aligned_cols=33 Identities=18% Similarity=0.160 Sum_probs=24.7 Q ss_pred CHHHHHHCCCCCCEEEEEEECCCCCCCEEEEEEEEEEC Q ss_conf 62231101589831799981288754439999999713 Q gi|254780386|r 335 TPKQIQHVKKGYTATVRFPHYADIREKKFKAIIEKIDP 372 (440) Q Consensus 335 ~~~di~~i~~Gq~v~i~~~a~~~~~~~~~~G~V~~Is~ 372 (440) |...+..+++|+.|+|.-- | |..+.|+|.+|+. T Consensus 88 ~~~~~~~i~~G~~V~v~~G--p---fkg~~a~V~~Vd~ 120 (153) T PRK08559 88 PKPIVEGIKEGDIVELIAG--P---FKGEKARVVRVDE 120 (153) T ss_pred CCCCCCCCCCCCEEEEECC--C---CCCCCEEEEEECC T ss_conf 5765234689999999135--7---6996179999816 No 383 >cd06848 GCS_H Glycine cleavage H-protein. Glycine cleavage H-proteins are part of the glycine cleavage system (GCS) found in bacteria, archea and the mitochondria of eukaryotes. GCS is a multienzyme complex consisting of 4 different components (P-, H-, T- and L-proteins) which catalyzes the oxidative cleavage of glycine. The H-protein shuttles the methylamine group of glycine from the P-protein (glycine dehydrogenase) to the T-protein (aminomethyltransferase) via a lipoyl group, attached to a completely conserved lysine residue. Probab=21.58 E-value=43 Score=12.24 Aligned_cols=12 Identities=0% Similarity=0.202 Sum_probs=5.9 Q ss_pred EEEEEEEEECCC Q ss_conf 999999971334 Q gi|254780386|r 363 FKAIIEKIDPII 374 (440) Q Consensus 363 ~~G~V~~Is~~~ 374 (440) +.|+|..+-+.. T Consensus 65 v~G~vv~~N~~l 76 (96) T cd06848 65 VSGEVVEVNEAL 76 (96) T ss_pred CCEEEEEEHHHH T ss_conf 166999837965 No 384 >COG4072 Uncharacterized protein conserved in archaea [Function unknown] Probab=21.46 E-value=44 Score=12.22 Aligned_cols=12 Identities=25% Similarity=0.545 Sum_probs=6.4 Q ss_pred CCCCCCCCCCCC Q ss_conf 212444554210 Q gi|254780386|r 310 NYAQQSQPLMKI 321 (440) Q Consensus 310 ~~v~~G~~l~~I 321 (440) .++.+|+.++.| T Consensus 85 ~ylkkGd~ll~i 96 (161) T COG4072 85 TYLKKGDELLLI 96 (161) T ss_pred EEECCCCEEEEE T ss_conf 996379889997 No 385 >COG3745 CpaB Flp pilus assembly protein CpaB [Intracellular trafficking and secretion] Probab=21.43 E-value=44 Score=12.22 Aligned_cols=17 Identities=6% Similarity=0.278 Sum_probs=7.5 Q ss_pred EEEEEECCCCCEEECCC Q ss_conf 89999857898933697 Q gi|254780386|r 66 IIKKFQVENGSQILQGT 82 (440) Q Consensus 66 ~V~~i~VkeGd~VkkGq 82 (440) .|....++-|+.....+ T Consensus 49 lVA~~~L~~G~~L~~d~ 65 (276) T COG3745 49 LVAAVDLPVGQRLSADQ 65 (276) T ss_pred EEEECCCCCCCCCCCCC T ss_conf 99931455677325142 No 386 >PRK06569 F0F1 ATP synthase subunit B'; Validated Probab=21.40 E-value=44 Score=12.21 Aligned_cols=30 Identities=7% Similarity=0.030 Sum_probs=14.8 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHH-HHHEE Q ss_conf 763356861689999999999999998-76406 Q gi|254780386|r 7 KNLKRNHPASLKRMTLLSIIGIIGLIT-WSILL 38 (440) Q Consensus 7 ~~~~~~~p~~~~~~~~~~~~~~~~~~~-w~~~~ 38 (440) |+|. +-.|..-+.|+++.|.+.+++ |-++. T Consensus 2 PQld--~~~~~sQifWL~itF~~ly~~~sk~~l 32 (157) T PRK06569 2 PQFD--IATYYSQIFWLIVTFGLLYIFVYKFIT 32 (157) T ss_pred CCCC--HHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9998--224089999999999999999999986 No 387 >TIGR01416 Rieske_proteo ubiquinol-cytochrome c reductase, iron-sulfur subunit; InterPro: IPR006317 These sequences represent the Proteobacterial and mitochondrial type of the Rieske [2Fe-2S] iron-sulphur subunit as found in ubiquinol-cytochrome c reductase. Not included in this group are the Rieske iron-sulphur protein as found in the cytochrome b6-f complex of the Cyanobacteria and chloroplasts. Most members of this family have a recognizable twin-arginine translocation (tat) signal sequence (DeltaPh-dependent translocation in chloroplast) for transport across the membrane with the 2Fe-2S group already bound. These signal sequences include a motif resembling RRxFLK before the transmembrane helix. ; GO: 0008121 ubiquinol-cytochrome-c reductase activity, 0006118 electron transport. Probab=21.37 E-value=39 Score=12.55 Aligned_cols=63 Identities=11% Similarity=0.185 Sum_probs=28.4 Q ss_pred HHHHHHHHHHHH----HHHHHHHHEEEEEEEEE--EEEEEEECCCEEEEECCCCCEEEEEECCCCCEEECCCEEEEECHH Q ss_conf 899999999999----99998764067125899--979999538738984688868999985789893369789999648 Q gi|254780386|r 17 LKRMTLLSIIGI----IGLITWSILLPIEISVS--SSGEILNEDNVVEIKSPFSGIIKKFQVENGSQILQGTPLLTFEDI 90 (440) Q Consensus 17 ~~~~~~~~~~~~----~~~~~w~~~~~~~~~v~--~~G~i~~~~~~~~I~s~~~G~V~~i~VkeGd~VkkGq~L~~ld~~ 90 (440) .|.+|++....+ .++.+|.++..++-.-. +-| ..-.+.|.+...|....+. =+|.++.-+.-+ T Consensus 3 RRdFL~~at~~vGAVGa~~~a~Pfi~~m~PsA~~~AaG----A~~eVDvs~~~~Gq~~tv~-------WRGKPvfi~~Rt 71 (192) T TIGR01416 3 RRDFLYAATGAVGAVGAAAAAVPFIDSMNPSASALAAG----APTEVDVSKIQEGQQLTVE-------WRGKPVFILNRT 71 (192) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHCC----CCEEEECCCCCCCCEEEEE-------ECCCEEEEEECC T ss_conf 15788999999999988777413553038886998578----8548862577988678887-------728079997089 No 388 >cd05691 S1_RPS1_repeat_ec6 S1_RPS1_repeat_ec6: Ribosomal protein S1 (RPS1) domain. RPS1 is a component of the small ribosomal subunit thought to be involved in the recognition and binding of mRNA's during translation initiation. The bacterial RPS1 domain architecture consists of 4-6 tandem S1 domains. In some bacteria, the tandem S1 array is located C-terminal to a 4-hydroxy-3-methylbut-2-enyl diphosphate reductase (HMBPP reductase) domain. While RPS1 is found primarily in bacteria, proteins with tandem RPS1-like domains have been identified in plants and humans, however these lack the N-terminal HMBPP reductase domain. This CD includes S1 repeat 6 (ec6) of the Escherichia coli RPS1. Autoantibodies to double-stranded DNA from patients with systemic lupus erythematosus cross-react with the human RPS1 homolog. Probab=21.35 E-value=44 Score=12.21 Aligned_cols=46 Identities=7% Similarity=0.186 Sum_probs=23.5 Q ss_pred CEEEEECCEEECCCCCCCCCCCCCCCCCCCCCEEEEEEECHHHHHH---------CCCCCCEEEEEEEC Q ss_conf 1168731212012221244455421033556504775326223110---------15898317999812 Q gi|254780386|r 296 IAGTIVYNQSFSSSNYAQQSQPLMKIVPHSKLTYIRAKVTPKQIQH---------VKKGYTATVRFPHY 355 (440) Q Consensus 296 ~~G~V~~l~~~~~G~~v~~G~~l~~Ivp~~~~l~v~~~v~~~di~~---------i~~Gq~v~i~~~a~ 355 (440) +.|.|..+. .-|.+|.-|.. ++++|+-++++. +++|+.|..++... T Consensus 4 V~G~V~~i~--~~G~fV~l~~~------------v~G~ih~sels~~~~~~~~~~~~vGd~v~akIi~i 58 (73) T cd05691 4 VTGKVTEVD--AKGATVKLGDG------------VEGFLRAAELSRDRVEDATERFKVGDEVEAKITNV 58 (73) T ss_pred EEEEEEEEE--CCCEEEEECCC------------CEEEEEHHHCCCCHHHCHHHCCCCCCEEEEEEEEE T ss_conf 999999997--78689992899------------69999989978211329254389999999999999 No 389 >PRK09841 cryptic autophosphorylating protein tyrosine kinase Etk; Provisional Probab=21.21 E-value=44 Score=12.19 Aligned_cols=32 Identities=6% Similarity=-0.010 Sum_probs=13.0 Q ss_pred HHHHHHHHHHHHHHHHHHEEEEEEEEEEEEEEEE Q ss_conf 9999999999999987640671258999799995 Q gi|254780386|r 19 RMTLLSIIGIIGLITWSILLPIEISVSSSGEILN 52 (440) Q Consensus 19 ~~~~~~~~~~~~~~~w~~~~~~~~~v~~~G~i~~ 52 (440) +++.++++++++.++|+++++-- -.++++|.. T Consensus 33 ~Ii~v~~~~~~la~~y~~~~tPi--Y~Ata~v~I 64 (726) T PRK09841 33 FIISVTALFTLIAVAYSLLSTPI--YQADTLVQV 64 (726) T ss_pred HHHHHHHHHHHHHHHHHHHCCCE--EEEEEEEEE T ss_conf 99999999999999999756984--788899999 No 390 >TIGR02388 rpoC2_cyan DNA-directed RNA polymerase, beta'' subunit; InterPro: IPR012756 DNA-directed RNA polymerases 2.7.7.6 from EC (also known as DNA-dependent RNA polymerases) are responsible for the polymerisation of ribonucleotides into a sequence complementary to the template DNA. In eukaryotes, there are three different forms of DNA-directed RNA polymerases transcribing different sets of genes. Most RNA polymerases are multimeric enzymes and are composed of a variable number of subunits. The core RNA polymerase complex consists of five subunits (two alpha, one beta, one beta-prime and one omega) and is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme . The core RNA polymerase complex forms a "crab claw"-like structure with an internal channel running along the full length . The key functional sites of the enzyme, as defined by mutational and cross-linking analysis, are located on the inner wall of this channel. RNA synthesis follows after the attachment of RNA polymerase to a specific site, the promoter, on the template DNA strand. The RNA synthesis process continues until a termination sequence is reached. The RNA product, which is synthesised in the 5' to 3'direction, is known as the primary transcript. Eukaryotic nuclei contain three distinct types of RNA polymerases that differ in the RNA they synthesise: RNA polymerase I: located in the nucleoli, synthesises precursors of most ribosomal RNAs. RNA polymerase II: occurs in the nucleoplasm, synthesises mRNA precursors. RNA polymerase III: also occurs in the nucleoplasm, synthesises the precursors of 5S ribosomal RNA, the tRNAs, and a variety of other small nuclear and cytosolic RNAs. Eukaryotic cells are also known to contain separate mitochondrial and chloroplast RNA polymerases. Eukaryotic RNA polymerases, whose molecular masses vary in size from 500 to 700 kD, contain two non-identical large (>100 kDa) subunits and an array of up to 12 different small (less than 50 kDa) subunits. The family consists of the product of the rpoC2 gene, a subunit of DNA-directed RNA polymerase of cyanobacteria and chloroplasts. RpoC2 corresponds largely to the C-terminal region of the RpoC (the beta' subunit) of other bacteria. Members of this family are designated beta'' in chloroplasts/plastids, and beta' (confusingly) in Cyanobacteria, where RpoC1 is called beta' in chloroplasts/plastids and gamma in Cyanobacteria. We prefer to name this family beta'', after its organellar members, to emphasize that this RpoC1 and RpoC2 together replace RpoC in other bacteria.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006350 transcription. Probab=21.19 E-value=44 Score=12.19 Aligned_cols=59 Identities=15% Similarity=0.104 Sum_probs=24.4 Q ss_pred CCCCCCCCCCCCCCCEEEEECCEEECCCCCCCCCCCCCCCCCCCCCEEEEEEECHH-HHHHCCCCCCEE Q ss_conf 12222212345554116873121201222124445542103355650477532622-311015898317 Q gi|254780386|r 282 ILEKTREQKTILSPIAGTIVYNQSFSSSNYAQQSQPLMKIVPHSKLTYIRAKVTPK-QIQHVKKGYTAT 349 (440) Q Consensus 282 ~l~~~l~~~~I~AP~~G~V~~l~~~~~G~~v~~G~~l~~Ivp~~~~l~v~~~v~~~-di~~i~~Gq~v~ 349 (440) ..-.-+.++.|.+-..|+|..+.. ..|+.|. .++ +..+.+.+.|+ -. +-+.||.|+.|. T Consensus 871 ~pG~V~A~T~I~~Ke~G~v~g~~~-~~~E~~R---R~l-v~r~~D~~~~~----~~g~kp~VK~Gdl~v 930 (1252) T TIGR02388 871 KPGAVVARTQILCKEAGVVRGIKK-GEGESIR---RVL-VERNSDRLKVN----IKGAKPKVKTGDLVV 930 (1252) T ss_pred CCCCEEEEEEECCCCCCEEEECCC-CCCCCEE---EEE-EEECCCEEEEE----ECCCCCCCCCCCEEE T ss_conf 677558730000156766850334-6776135---899-97257648984----057886124676541 No 391 >COG4148 ModC ABC-type molybdate transport system, ATPase component [Inorganic ion transport and metabolism] Probab=21.15 E-value=44 Score=12.18 Aligned_cols=47 Identities=11% Similarity=0.140 Sum_probs=28.7 Q ss_pred CEEEEEEEEEECCCCCCCCCCCCCEEEEEEEEECHH-------HCCCCCCEECCCCEEEEEEEE Q ss_conf 439999999713344467776551189999993501-------125678515237756999992 Q gi|254780386|r 361 KKFKAIIEKIDPIISQQDNNLQTQEYYEVILKITDL-------AFSDNNIELRNGFPAEVLFTA 417 (440) Q Consensus 361 ~~~~G~V~~Is~~~~~~~~~~~~~~~y~v~i~~~~~-------~~~~~~~~L~~Gm~~~~~I~~ 417 (440) ..++|+|+.|+.. ... ..|.+.+... ........|+||+.+-|-|++ T Consensus 294 NiLp~~v~~i~~~---~~~-------V~v~ld~~g~~l~Arit~~srd~L~l~~G~~v~AqIKs 347 (352) T COG4148 294 NILPGKVVGIEDD---DGQ-------VDVQLDCGGKTLWARITPWARDELALKPGQWVYAQIKS 347 (352) T ss_pred HCCCEEEEEEECC---CCC-------EEEEEECCCCEEEEEECHHHHHHHCCCCCCEEEEEEEE T ss_conf 2154069999648---982-------79999639958887705756776456889857999998 No 392 >cd07021 Clp_protease_NfeD_like Nodulation formation efficiency D (NfeD) is a membrane-bound ClpP-class protease. Nodulation formation efficiency D (NfeD; stomatin operon partner protein, STOPP; DUF107) is a member of membrane-anchored ClpP-class proteases. Currently, more than 300 NfeD homologs have been identified - all of which are bacterial or archaeal in origin. Majority of these genomes have been shown to possess operons containing a homologous NfeD/stomatin gene pair, causing NfeD to be previously named STOPP (stomatin operon partner protein). NfeD homologs can be divided into two groups: long and short forms. Long-form homologs have a putative ClpP-class serine protease domain while the short form homologs do not. Downstream from the ClpP-class domain is the so-called NfeD or DUF107 domain. N-terminal region of the NfeD homolog PH1510 (1510-N or PH1510-N) from Pyrococcus horikoshii has been shown to possess serine protease activity and has a Ser-Lys catalytic dyad, preferentiall Probab=20.84 E-value=35 Score=12.84 Aligned_cols=41 Identities=5% Similarity=0.054 Sum_probs=28.6 Q ss_pred CCCEEEEECCEEECCCCCCCCCCCCCCCCCCCCCEEEEEEECHH Q ss_conf 54116873121201222124445542103355650477532622 Q gi|254780386|r 294 SPIAGTIVYNQSFSSSNYAQQSQPLMKIVPHSKLTYIRAKVTPK 337 (440) Q Consensus 294 AP~~G~V~~l~~~~~G~~v~~G~~l~~Ivp~~~~l~v~~~v~~~ 337 (440) .+.+.+|..++ ++|+.|.++..|...+ ...+.-|.+||+.. T Consensus 28 ~~a~~ivl~id--TpGG~v~~~~~I~~~I-~~~~~pvv~~V~~~ 68 (178) T cd07021 28 EGADAVVLDID--TPGGRVDSALEIVDLI-LNSPIPTIAYVNDR 68 (178) T ss_pred CCCCEEEEEEE--CCCCCHHHHHHHHHHH-HHCCCCEEEEECCC T ss_conf 89978999997--9998689999999999-84899999999992 No 393 >cd01134 V_A-ATPase_A V/A-type ATP synthase catalytic subunit A. These ATPases couple ATP hydrolysis to the build up of a H+ gradient, but V-type ATPases do not catalyze the reverse reaction. The Vacuolar (V-type) ATPase is found in the membranes of vacuoles, the golgi apparatus and in other coated vesicles in eukaryotes. Archaea have a protein which is similar in sequence to V-ATPases, but functions like an F-ATPase (called A-ATPase). A similar protein is also found in a few bacteria. Probab=20.77 E-value=45 Score=12.13 Aligned_cols=19 Identities=26% Similarity=0.323 Sum_probs=12.7 Q ss_pred ECCCCCEEECCCEEEEECH Q ss_conf 8578989336978999964 Q gi|254780386|r 71 QVENGSQILQGTPLLTFED 89 (440) Q Consensus 71 ~VkeGd~VkkGq~L~~ld~ 89 (440) .|+.||.|+.||+|-.... T Consensus 54 ~v~~Gd~V~~GDi~G~V~E 72 (369) T cd01134 54 LVKVGDHVTGGDILGTVPE 72 (369) T ss_pred CCCCCCEECCCCEEEEEEC T ss_conf 2677999627867999855 No 394 >TIGR01204 bioW 6-carboxyhexanoate--CoA ligase; InterPro: IPR005499 This family contains the enzyme 6-carboxyhexanoate--CoA ligase 6.2.1.14 from EC. This enzyme is involved in the first step of biotin synthesis, where it converts pimelate into pimeloyl-CoA . The enzyme requires magnesium as a cofactor and forms a homodimer .; GO: 0009102 biotin biosynthetic process. Probab=20.65 E-value=45 Score=12.11 Aligned_cols=52 Identities=12% Similarity=0.052 Sum_probs=35.8 Q ss_pred CCCCCCCCCCCCCCCCEEEEEEECHHHHHHCCCCCCEEEEEEECCCCCCC-EEEEEEEEEECC Q ss_conf 24445542103355650477532622311015898317999812887544-399999997133 Q gi|254780386|r 312 AQQSQPLMKIVPHSKLTYIRAKVTPKQIQHVKKGYTATVRFPHYADIREK-KFKAIIEKIDPI 373 (440) Q Consensus 312 v~~G~~l~~Ivp~~~~l~v~~~v~~~di~~i~~Gq~v~i~~~a~~~~~~~-~~~G~V~~Is~~ 373 (440) ...|..|+.|==++++=||+.||+...|+.+++- |-.++| ..-|+|.+||.. T Consensus 168 ~~~p~vvAELCWSDDPdYvTGYVsgkeiGYvRIT----------plKe~G~e~GgRvfFvs~~ 220 (238) T TIGR01204 168 ARSPAVVAELCWSDDPDYVTGYVSGKEIGYVRIT----------PLKEKGTEEGGRVFFVSDK 220 (238) T ss_pred HCCCCEEEEECCCCCCCCEEEEEECCEECCEECC----------CCCCCCCCCCCEEEEECCC T ss_conf 2489649872148888812013527452446403----------2111689999668996085 No 395 >TIGR02830 spore_III_AG stage III sporulation protein AG; InterPro: IPR014195 This entry represents the stage III sporulation protein AG, which is encoded in a spore formation operon: spoIIIAABCDEFGH under the control of sigma G . A comparative genome analysis of all sequenced genomes of Firmicutes shows that the proteins are strictly conserved among the sub-set of endospore-forming species.. Probab=20.64 E-value=43 Score=12.25 Aligned_cols=14 Identities=14% Similarity=0.045 Sum_probs=8.0 Q ss_pred HHHHHHHHEEEEEE Q ss_conf 99998764067125 Q gi|254780386|r 29 IGLITWSILLPIEI 42 (440) Q Consensus 29 ~~~~~w~~~~~~~~ 42 (440) +.|++|+.|+..+. T Consensus 14 ~~ll~~s~~F~~~~ 27 (193) T TIGR02830 14 LLLLLVSSFFSSSE 27 (193) T ss_pred HHHHHHHHHCCCCC T ss_conf 99999987617856 No 396 >pfam02532 PsbI Photosystem II reaction centre I protein (PSII 4.8 kDa protein). This family consists of various Photosystem II (PSII) reaction centre I proteins or PSII 4.8 kDa proteins, PsbI, from the chloroplast genome of many plants and Cyanobacteria. PsbI is a small, integral membrane component of PSII the role of which is not clear. Synechocystis mutants lacking PsbI have 20-30% loss of PSII activity however the PSII complex is not destabilized. Probab=20.48 E-value=43 Score=12.24 Aligned_cols=22 Identities=23% Similarity=0.332 Sum_probs=14.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHEE Q ss_conf 8999999999999999876406 Q gi|254780386|r 17 LKRMTLLSIIGIIGLITWSILL 38 (440) Q Consensus 17 ~~~~~~~~~~~~~~~~~w~~~~ 38 (440) .++..|.++++|+++.++.+++ T Consensus 4 LKi~Vy~vVifFv~lFiFgfLs 25 (36) T pfam02532 4 LKIFVYTVVIFFVSLFIFGFLS 25 (36) T ss_pred EEEEHHHHHHHHHHHHHHHHHC T ss_conf 7740588999999999874204 No 397 >TIGR03185 DNA_S_dndD DNA sulfur modification protein DndD. This model describes the DndB protein encoded by an operon associated with a sulfur-containing modification to DNA. The operon is sporadically distributed in bacteria, much like some restriction enzyme operons. DndD is described as a putative ATPase. The small number of examples known so far include species from among the Firmicutes, Actinomycetes, Proteobacteria, and Cyanobacteria. Probab=20.31 E-value=46 Score=12.07 Aligned_cols=32 Identities=6% Similarity=0.042 Sum_probs=12.5 Q ss_pred ECCCCEEEEEEEECEEEHHHHHHHHHHHHHHH Q ss_conf 52377569999922054799999999989988 Q gi|254780386|r 405 LRNGFPAEVLFTAKNTTLAQEIIHPITDNWPK 436 (440) Q Consensus 405 L~~Gm~~~~~I~~~~~~~~~~~~~p~~~~~~~ 436 (440) ..|-++-+|.|-.-+.=|=.-+..-+.+.+++ T Consensus 599 y~P~~s~QVIlLsTdtEi~~~~~~~l~~~I~k 630 (650) T TIGR03185 599 YFPKASHQVLLLSTDEEVDEKHYNLLKPFISH 630 (650) T ss_pred HCCCCCCEEEEECCHHHCCHHHHHHHHHHHCC T ss_conf 48540573899725154178999999887322 No 398 >TIGR01069 mutS2 MutS2 family protein; InterPro: IPR005747 Mismatch repair contributes to the overall fidelity of DNA replication and is essential for combating the adverse effects of damage to the genome. It involves the correction of mismatched base pairs that have been missed by the proofreading element of the DNA polymerase complex. The post-replicative Mismatch Repair System (MMRS) of Escherichia coli involves MutS (Mutator S), MutL and MutH proteins, and acts to correct point mutations or small insertion/deletion loops produced during DNA replication . MutS and MutL are involved in preventing recombination between partially homologous DNA sequences. The assembly of MMRS is initiated by MutS, which recognises and binds to mispaired nucleotides and allows further action of MutL and MutH to eliminate a portion of newly synthesized DNA strand containing the mispaired base . MutS can also collaborate with methyltransferases in the repair of O(6)-methylguanine damage, which would otherwise pair with thymine during replication to create an O(6)mG:T mismatch . MutS exists as a dimer, where the two monomers have different conformations and form a heterodimer at the structural level . Only one monomer recognises the mismatch specifically and has ADP bound. Non-specific major groove DNA-binding domains from both monomers embrace the DNA in a clamp-like structure. Mismatch binding induces ATP uptake and a conformational change in the MutS protein, resulting in a clamp that translocates on DNA. MutS is a modular protein with a complex structure , and is composed of: N-terminal mismatch-recognition domain, which is similar in structure to tRNA endonuclease. Connector domain, which is similar in structure to Holliday junction resolvase ruvC. Core domain, which is composed of two separate subdomains that join together to form a helical bundle; from within the core domain, two helices act as levers that extend towards (but do not touch) the DNA. Clamp domain, which is inserted between the two subdomains of the core domain at the top of the lever helices; the clamp domain has a beta-sheet structure. ATPase domain (connected to the core domain), which has a classical Walker A motif. HTH (helix-turn-helix) domain, which is involved in dimer contacts. Homologues of MutS have been found in many species including eukaryotes (MSH 1, 2, 3, 4, 5, and 6 proteins), archaea and bacteria, and together these proteins have been grouped into the MutS family. Although many of these proteins have similar activities to the E. coli MutS, there is significant diversity of function among the MutS family members. This diversity is even seen within species, where many species encode multiple MutS homologues with distinct functions . Inter-species homologues may have arisen through frequent ancient horizontal gene transfer of MutS (and MutL) from bacteria to archaea and eukaryotes via endosymbiotic ancestors of mitochondria and chloroplasts . This entry represents a family of MutS proteins.; GO: 0003684 damaged DNA binding, 0005524 ATP binding, 0045005 maintenance of fidelity during DNA-dependent DNA replication. Probab=20.29 E-value=46 Score=12.06 Aligned_cols=15 Identities=27% Similarity=0.601 Sum_probs=8.8 Q ss_pred EEEEECHHHHHHCCC Q ss_conf 775326223110158 Q gi|254780386|r 330 IRAKVTPKQIQHVKK 344 (440) Q Consensus 330 v~~~v~~~di~~i~~ 344 (440) ..+.|+..|+.+|.. T Consensus 706 ~~m~v~~~~l~~~~~ 720 (834) T TIGR01069 706 LRMKVKESELEKINK 720 (834) T ss_pred EEEEECHHHHHHHHC T ss_conf 044313788887414 No 399 >KOG3809 consensus Probab=20.22 E-value=46 Score=12.05 Aligned_cols=10 Identities=20% Similarity=-0.001 Sum_probs=4.7 Q ss_pred CCCCCCHHHH Q ss_conf 3356861689 Q gi|254780386|r 9 LKRNHPASLK 18 (440) Q Consensus 9 ~~~~~p~~~~ 18 (440) |.+|+.+|+. T Consensus 27 LskPPFrfl~ 36 (583) T KOG3809 27 LSKPPFRFLV 36 (583) T ss_pred HCCCCHHHHH T ss_conf 4489658999 No 400 >PRK10921 twin-arginine protein translocation system subunit TatC; Provisional Probab=20.04 E-value=47 Score=12.03 Aligned_cols=24 Identities=13% Similarity=0.146 Sum_probs=13.6 Q ss_pred CCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 334576335686168999999999 Q gi|254780386|r 3 HFMWKNLKRNHPASLKRMTLLSII 26 (440) Q Consensus 3 ~~~~~~~~~~~p~~~~~~~~~~~~ 26 (440) .|+.|-|.+...++...++....+ T Consensus 93 ~Fi~PgL~~~Ek~~~~~~~~~s~~ 116 (255) T PRK10921 93 AFIAPALYKHERRLVVPLLVSSSL 116 (255) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 996222318899999999999999 No 401 >PRK08563 DNA-directed RNA polymerase subunit E'; Provisional Probab=20.02 E-value=47 Score=12.02 Aligned_cols=15 Identities=20% Similarity=0.093 Sum_probs=8.8 Q ss_pred CEECCCCEEEEEEEE Q ss_conf 515237756999992 Q gi|254780386|r 403 IELRNGFPAEVLFTA 417 (440) Q Consensus 403 ~~L~~Gm~~~~~I~~ 417 (440) ..++.|-.+.+.|+. T Consensus 134 ~~i~~Gd~VR~RIv~ 148 (182) T PRK08563 134 RTLKVGDVVRARIVA 148 (182) T ss_pred EEECCCCEEEEEEEE T ss_conf 498289989999999 No 402 >cd04473 S1_RecJ_like S1_RecJ_like: The S1 domain of the archaea-specific RecJ-like exonuclease. The function of this family is not fully understood. In Escherichia coli, RecJ degrades single-stranded DNA in the 5'-3' direction and participates in homologous recombination and mismatch repair. Probab=20.00 E-value=47 Score=12.02 Aligned_cols=58 Identities=10% Similarity=0.117 Sum_probs=32.7 Q ss_pred CCCCCCCCCCEEEEECCEEECCCCCCCCCCCCCCCCCCCCCEEEEEEECHHHHHHCCCCCCEEEEEEECCC Q ss_conf 21234555411687312120122212444554210335565047753262231101589831799981288 Q gi|254780386|r 287 REQKTILSPIAGTIVYNQSFSSSNYAQQSQPLMKIVPHSKLTYIRAKVTPKQIQHVKKGYTATVRFPHYAD 357 (440) Q Consensus 287 l~~~~I~AP~~G~V~~l~~~~~G~~v~~G~~l~~Ivp~~~~l~v~~~v~~~di~~i~~Gq~v~i~~~a~~~ 357 (440) ++..++=.-+.|+|..+. .-|.||.-+ |..+.|.=...+ ....++|+.+.+++...+. T Consensus 11 ~~dlevG~~y~G~V~~v~--~fGaFV~l~-------~~~~GLiHiS~l----~~~~~vGD~V~VkV~~I~~ 68 (77) T cd04473 11 MEDLEVGKLYKGKVNGVA--KYGVFVDLN-------DHVRGLIHRSNL----LRDYEVGDEVIVQVTDIPE 68 (77) T ss_pred CCCCCCCCEEEEEEEEEE--ECCEEEECC-------CCCEEEEEEHHC----CCCCCCCCEEEEEEEEECC T ss_conf 303523999999998896--628899926-------998796893166----6878999999999999999 Done!