Query gi|254780970|ref|YP_003065383.1| phosphoribosylformylglycinamidine synthase II [Candidatus Liberibacter asiaticus str. psy62] Match_columns 737 No_of_seqs 308 out of 2682 Neff 6.7 Searched_HMMs 39220 Date Mon May 30 04:29:20 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780970.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 TIGR01736 FGAM_synth_II phosph 100.0 0 0 1965.0 47.3 717 16-732 1-763 (763) 2 PRK01213 phosphoribosylformylg 100.0 0 0 1773.1 60.5 728 5-734 5-736 (736) 3 COG0046 PurL Phosphoribosylfor 100.0 0 0 1685.4 54.9 715 9-726 17-743 (743) 4 PRK05297 phosphoribosylformylg 100.0 0 0 1436.0 54.0 718 5-730 168-990 (1294) 5 TIGR01735 FGAM_synt phosphorib 100.0 0 0 1378.4 38.0 719 5-727 183-1076(1401) 6 KOG1907 consensus 100.0 0 0 1118.3 40.2 722 5-729 184-1011(1320) 7 TIGR01857 FGAM-synthase phosph 100.0 0 0 1094.4 23.3 697 4-733 178-970 (1279) 8 cd02203 PurL_repeat1 PurL subu 100.0 0 0 851.6 20.2 304 36-362 1-313 (313) 9 cd02204 PurL_repeat2 PurL subu 100.0 0 0 487.9 23.7 263 436-702 1-264 (264) 10 TIGR01739 tegu_FGAM_synt herpe 100.0 0 0 457.2 35.9 593 119-732 216-978 (1313) 11 cd02193 PurL Formylglycinamide 100.0 0 0 468.8 15.6 259 84-344 1-272 (272) 12 cd02193 PurL Formylglycinamide 100.0 0 0 432.3 21.4 251 449-701 2-271 (272) 13 TIGR03267 methan_mark_2 putati 100.0 0 0 425.8 15.5 305 43-369 10-322 (323) 14 TIGR03267 methan_mark_2 putati 100.0 0 0 382.0 23.5 295 397-711 1-305 (323) 15 TIGR01736 FGAM_synth_II phosph 100.0 0 0 375.2 19.9 305 53-365 417-749 (763) 16 cd02204 PurL_repeat2 PurL subu 100.0 0 0 346.7 15.2 257 74-344 1-264 (264) 17 cd02192 PurM-like3 AIR synthas 100.0 2.8E-45 0 329.8 12.7 265 55-343 18-283 (283) 18 PRK01213 phosphoribosylformylg 100.0 4.8E-44 0 320.9 16.3 289 64-367 429-722 (736) 19 cd02203 PurL_repeat1 PurL subu 100.0 1.4E-42 0 310.8 16.6 260 448-711 26-305 (313) 20 cd06061 PurM-like1 AIR synthas 100.0 2.8E-40 7E-45 295.0 13.1 274 54-344 12-298 (298) 21 cd02192 PurM-like3 AIR synthas 100.0 2.9E-37 7.5E-42 274.4 18.4 252 427-701 28-283 (283) 22 COG0046 PurL Phosphoribosylfor 100.0 8.9E-37 2.3E-41 271.1 20.1 302 414-722 63-381 (743) 23 cd00396 PurM-like AIR (aminoim 100.0 8.5E-39 2.2E-43 284.9 8.5 215 86-343 2-222 (222) 24 PRK05731 thiamine monophosphat 100.0 6.4E-37 1.6E-41 272.1 15.2 277 64-359 18-310 (320) 25 cd00396 PurM-like AIR (aminoim 100.0 1.5E-35 3.8E-40 262.7 17.4 213 449-701 1-222 (222) 26 cd02197 HypE HypE (Hydrogenase 100.0 4.5E-35 1.1E-39 259.4 16.7 271 53-344 6-293 (293) 27 cd02194 ThiL ThiL (Thiamine-mo 100.0 3.1E-35 7.9E-40 260.5 15.1 261 63-345 15-291 (291) 28 cd06061 PurM-like1 AIR synthas 100.0 3.2E-34 8.2E-39 253.6 20.0 256 428-702 24-298 (298) 29 cd02195 SelD Selenophosphate s 100.0 3.3E-33 8.5E-38 246.6 20.0 246 430-702 35-287 (287) 30 cd02195 SelD Selenophosphate s 100.0 1.5E-34 3.7E-39 255.9 11.6 275 44-344 6-287 (287) 31 PRK05731 thiamine monophosphat 100.0 1.2E-32 3E-37 242.8 20.4 264 425-709 17-303 (320) 32 cd02691 PurM-like2 AIR synthas 100.0 3.8E-33 9.6E-38 246.2 17.0 287 56-358 20-317 (346) 33 cd02194 ThiL ThiL (Thiamine-mo 100.0 2.1E-31 5.4E-36 234.2 20.1 252 428-703 18-291 (291) 34 COG0309 HypE Hydrogenase matur 100.0 2.7E-31 7E-36 233.5 17.4 280 66-365 32-332 (339) 35 cd02197 HypE HypE (Hydrogenase 100.0 1E-30 2.5E-35 229.6 19.5 257 427-702 19-293 (293) 36 TIGR01379 thiL thiamine-monoph 100.0 1.4E-29 3.6E-34 221.7 20.1 261 424-705 15-311 (336) 37 TIGR02124 hypE hydrogenase exp 100.0 9.7E-31 2.5E-35 229.7 13.8 290 53-365 13-338 (345) 38 PRK00943 selenophosphate synth 100.0 1.6E-29 4.2E-34 221.3 19.0 252 434-705 49-337 (347) 39 COG2144 Selenophosphate synthe 100.0 6.8E-29 1.7E-33 217.0 18.2 270 419-712 27-302 (324) 40 COG2144 Selenophosphate synthe 100.0 3.5E-29 8.9E-34 219.0 16.3 296 45-368 13-317 (324) 41 TIGR02124 hypE hydrogenase exp 100.0 5.3E-27 1.4E-31 204.1 14.8 260 425-706 31-317 (345) 42 cd02691 PurM-like2 AIR synthas 99.9 2E-26 5.1E-31 200.1 17.1 291 427-734 29-338 (346) 43 PRK00943 selenophosphate synth 99.9 7.4E-27 1.9E-31 203.1 11.9 282 48-348 17-338 (347) 44 COG0611 ThiL Thiamine monophos 99.9 3.7E-25 9.3E-30 191.5 18.8 262 416-703 8-292 (317) 45 PRK05297 phosphoribosylformylg 99.9 9.3E-26 2.4E-30 195.5 15.6 285 70-368 674-981 (1294) 46 pfam02769 AIRS_C AIR synthase 99.9 2.5E-25 6.3E-30 192.6 10.5 149 199-354 1-149 (150) 47 COG0309 HypE Hydrogenase matur 99.9 9.2E-25 2.3E-29 188.7 13.3 264 427-705 31-311 (339) 48 TIGR01379 thiL thiamine-monoph 99.9 3.3E-24 8.4E-29 184.9 12.6 282 61-359 15-325 (336) 49 COG0611 ThiL Thiamine monophos 99.9 2E-20 5.1E-25 159.0 15.3 271 64-358 18-305 (317) 50 cd02196 PurM PurM (Aminoimidaz 99.9 4.3E-20 1.1E-24 156.7 15.8 222 113-344 54-296 (297) 51 pfam02769 AIRS_C AIR synthase 99.9 2.7E-20 6.8E-25 158.1 14.5 138 567-706 1-143 (150) 52 cd02196 PurM PurM (Aminoimidaz 99.8 1E-18 2.6E-23 147.3 17.1 257 434-703 4-297 (297) 53 TIGR00476 selD selenide, water 99.8 4.8E-19 1.2E-23 149.5 14.6 252 434-702 57-371 (372) 54 PRK05385 phosphoribosylaminoim 99.8 1.7E-17 4.5E-22 138.8 17.2 265 63-349 30-335 (338) 55 TIGR01735 FGAM_synt phosphorib 99.8 5.7E-18 1.5E-22 142.1 10.7 385 290-712 171-609 (1401) 56 TIGR00878 purM phosphoribosylf 99.8 4.5E-18 1.2E-22 142.8 9.9 247 435-704 43-337 (338) 57 TIGR00878 purM phosphoribosylf 99.8 1.6E-16 4E-21 132.3 17.5 261 61-347 31-338 (338) 58 COG1973 HypE Hydrogenase matur 99.7 2.6E-17 6.5E-22 137.7 11.1 265 69-359 108-394 (449) 59 PRK05385 phosphoribosylaminoim 99.7 2.4E-16 6.1E-21 131.0 15.5 260 433-705 38-333 (338) 60 COG0709 SelD Selenophosphate s 99.7 1E-16 2.6E-21 133.6 12.1 249 433-705 48-338 (346) 61 TIGR01739 tegu_FGAM_synt herpe 99.7 1.7E-15 4.3E-20 125.2 17.0 280 67-369 655-968 (1313) 62 COG0150 PurM Phosphoribosylami 99.6 4.5E-14 1.1E-18 115.5 16.9 260 433-705 39-335 (345) 63 COG0150 PurM Phosphoribosylami 99.6 1.9E-14 4.7E-19 118.1 11.5 269 62-349 30-337 (345) 64 COG1973 HypE Hydrogenase matur 99.6 6E-14 1.5E-18 114.6 12.4 256 430-705 105-381 (449) 65 COG0709 SelD Selenophosphate s 99.6 3.4E-13 8.6E-18 109.5 14.8 276 47-348 16-339 (346) 66 KOG1907 consensus 99.5 1.8E-13 4.7E-18 111.3 13.2 163 199-369 836-1004(1320) 67 TIGR00476 selD selenide, water 99.5 3.1E-13 7.8E-18 109.7 10.2 269 64-344 41-371 (372) 68 KOG0237 consensus 99.4 2.5E-12 6.3E-17 103.5 12.4 263 431-704 464-769 (788) 69 pfam00586 AIRS AIR synthase re 99.4 1.5E-13 3.8E-18 111.9 3.0 92 434-530 1-94 (95) 70 KOG0237 consensus 99.4 1.9E-12 4.8E-17 104.3 8.2 228 113-349 523-772 (788) 71 pfam00586 AIRS AIR synthase re 99.4 1.1E-12 2.7E-17 106.1 6.1 92 72-167 1-95 (95) 72 KOG3939 consensus 99.3 2.8E-10 7.2E-15 89.4 15.8 252 436-705 2-293 (312) 73 TIGR01857 FGAM-synthase phosph 99.1 3.9E-11 9.9E-16 95.3 2.6 347 320-709 193-598 (1279) 74 KOG3939 consensus 96.9 0.019 4.8E-07 35.8 10.0 258 74-350 2-296 (312) 75 PRK07226 fructose-bisphosphate 84.6 0.78 2E-05 24.7 2.5 49 470-528 96-144 (266) 76 PRK08227 aldolase; Validated 83.7 0.95 2.4E-05 24.1 2.6 70 448-528 102-172 (291) 77 TIGR02033 D-hydantoinase dihyd 82.3 0.94 2.4E-05 24.1 2.2 35 432-468 397-431 (466) 78 TIGR00608 radc DNA repair prot 77.7 0.83 2.1E-05 24.5 0.7 25 273-299 79-103 (223) 79 TIGR01950 SoxR redox-sensitive 74.4 5.1 0.00013 19.1 4.0 44 9-52 51-96 (142) 80 cd00958 DhnA Class I fructose- 71.2 4.5 0.00011 19.5 3.1 56 464-529 73-128 (235) 81 TIGR02479 FliA_WhiG RNA polyme 71.0 4.4 0.00011 19.6 3.0 23 680-702 181-203 (227) 82 TIGR02634 xylF D-xylose ABC tr 70.7 7.2 0.00018 18.1 5.2 129 141-279 65-206 (307) 83 TIGR01034 metK S-adenosylmethi 68.9 7.2 0.00018 18.1 3.7 101 392-497 201-313 (393) 84 TIGR02103 pullul_strch alpha-1 68.0 6.3 0.00016 18.5 3.3 48 486-538 390-452 (945) 85 COG4008 Predicted metal-bindin 65.3 7.4 0.00019 18.0 3.2 10 482-491 11-20 (153) 86 PRK06108 aspartate aminotransf 65.3 6.5 0.00017 18.4 2.9 32 498-529 166-197 (382) 87 TIGR02356 adenyl_thiF thiazole 62.1 4.7 0.00012 19.3 1.7 43 267-312 27-71 (210) 88 PRK08175 aminotransferase; Val 61.9 6.3 0.00016 18.5 2.3 31 498-529 172-203 (395) 89 PRK07878 molybdopterin biosynt 61.6 8.1 0.00021 17.7 2.8 44 267-311 48-91 (392) 90 PRK07367 consensus 61.4 3.5 9E-05 20.2 1.0 32 498-529 169-200 (385) 91 TIGR01949 AroFGH_arch predicte 58.5 4.9 0.00012 19.2 1.3 51 479-538 98-148 (259) 92 pfam07141 Phage_term_sma Putat 58.4 6.4 0.00016 18.4 1.9 30 23-57 23-52 (174) 93 PRK07550 hypothetical protein; 56.6 3.5 9E-05 20.2 0.3 37 498-536 172-208 (387) 94 PRK07568 aspartate aminotransf 56.5 4.1 0.0001 19.8 0.6 94 430-529 108-201 (396) 95 pfam04539 Sigma70_r3 Sigma-70 56.1 10 0.00026 17.1 2.6 21 22-42 11-31 (78) 96 PRK07212 consensus 55.5 6.5 0.00016 18.4 1.5 92 430-530 111-202 (378) 97 TIGR02885 spore_sigF RNA polym 55.1 10 0.00026 17.0 2.5 22 500-523 36-57 (231) 98 TIGR01833 HMG-CoA-S_euk hydrox 54.1 11 0.00028 16.8 2.5 69 285-361 8-87 (461) 99 cd01459 vWA_copine_like VWA Co 54.0 11 0.00028 16.9 2.4 18 464-481 109-126 (254) 100 PRK06852 aldolase; Validated 53.9 7.2 0.00018 18.1 1.5 60 463-528 111-172 (303) 101 PRK09148 aminotransferase; Val 53.7 6.2 0.00016 18.5 1.2 19 511-529 187-205 (406) 102 PRK05957 aspartate aminotransf 53.4 11 0.00029 16.7 2.5 32 498-529 168-199 (389) 103 TIGR02187 GlrX_arch Glutaredox 53.2 4.1 0.0001 19.8 0.2 65 16-80 1-73 (237) 104 pfam04227 Indigoidine_A Indigo 51.9 8.7 0.00022 17.5 1.7 15 323-337 156-170 (293) 105 COG5616 Predicted integral mem 51.6 14 0.00035 16.1 2.7 121 104-231 6-129 (152) 106 TIGR02978 phageshock_pspC phag 51.4 4.7 0.00012 19.3 0.3 19 143-164 12-30 (128) 107 PRK06056 consensus 51.2 8.1 0.00021 17.7 1.4 32 498-529 176-207 (402) 108 PRK12399 tagatose 1,6-diphosph 50.7 16 0.00041 15.7 3.1 82 446-531 81-163 (324) 109 COG2860 Predicted membrane pro 50.6 3.3 8.4E-05 20.4 -0.6 23 104-127 39-62 (209) 110 PRK05973 replicative DNA helic 50.4 16 0.00041 15.7 2.8 22 197-222 60-81 (237) 111 pfam07302 AroM AroM protein. T 50.3 11 0.00028 16.8 2.0 61 455-530 149-209 (221) 112 PRK07778 consensus 49.0 6.3 0.00016 18.5 0.6 32 498-529 170-201 (386) 113 COG1191 FliA DNA-directed RNA 49.0 17 0.00043 15.5 3.3 13 511-523 58-70 (247) 114 pfam04002 DUF2466 Protein of u 48.3 6 0.00015 18.6 0.4 56 460-526 48-103 (123) 115 COG1830 FbaB DhnA-type fructos 48.1 17 0.00044 15.5 3.4 42 481-528 107-148 (265) 116 cd01121 Sms Sms (bacterial rad 48.0 12 0.00029 16.7 1.8 29 322-350 187-215 (372) 117 TIGR00416 sms DNA repair prote 47.9 8 0.0002 17.8 1.0 47 322-372 225-273 (481) 118 TIGR01924 rsbW_low_gc anti-sig 47.7 6.5 0.00017 18.4 0.5 38 437-482 20-57 (161) 119 TIGR00853 pts-lac PTS system, 46.5 16 0.0004 15.8 2.3 73 257-341 46-124 (142) 120 PRK05597 molybdopterin biosynt 46.1 19 0.00047 15.2 3.0 21 267-287 34-54 (355) 121 TIGR03277 methan_mark_9 putati 45.9 19 0.00048 15.2 3.1 25 4-28 27-51 (109) 122 COG1010 CobJ Precorrin-3B meth 45.7 19 0.00048 15.2 4.8 112 261-373 77-230 (249) 123 cd05565 PTS_IIB_lactose PTS_II 45.4 19 0.00049 15.2 3.7 60 272-341 16-78 (99) 124 PRK00024 radC DNA repair prote 45.1 6.8 0.00017 18.2 0.3 54 462-526 151-204 (224) 125 pfam11212 DUF2999 Protein of u 45.1 19 0.00049 15.1 3.0 27 6-32 3-29 (82) 126 cd00757 ThiF_MoeB_HesA_family 44.0 20 0.00051 15.0 5.1 22 267-288 27-48 (228) 127 PRK08443 consensus 43.4 10 0.00026 17.0 1.0 33 497-529 169-201 (388) 128 PRK08328 hypothetical protein; 43.0 14 0.00037 16.0 1.7 44 267-311 33-76 (230) 129 COG4915 XpaC 5-bromo-4-chloroi 42.9 20 0.00051 15.0 2.4 14 718-731 170-183 (204) 130 PRK07324 transaminase; Validat 42.6 6 0.00015 18.6 -0.3 95 429-530 99-193 (373) 131 PRK11823 DNA repair protein Ra 41.4 16 0.00041 15.7 1.7 35 309-350 188-222 (454) 132 TIGR02171 Fb_sc_TIGR02171 Fibr 41.0 14 0.00037 16.0 1.4 180 30-229 162-402 (982) 133 PRK09276 aspartate aminotransf 40.4 15 0.00038 15.9 1.4 32 498-529 174-205 (385) 134 PRK10481 hypothetical protein; 39.9 20 0.00051 15.0 2.0 59 457-530 154-212 (224) 135 PRK08363 alanine aminotransfer 39.9 12 0.0003 16.6 0.8 32 498-529 174-205 (398) 136 COG2003 RadC DNA repair protei 39.8 10 0.00027 17.0 0.5 53 463-526 152-204 (224) 137 PRK13805 bifunctional acetalde 39.2 24 0.0006 14.5 2.5 20 314-333 349-368 (862) 138 cd01492 Aos1_SUMO Ubiquitin ac 38.7 23 0.00058 14.6 2.1 44 267-311 27-70 (197) 139 cd02430 PTH2 Peptidyl-tRNA hyd 38.7 24 0.00061 14.5 3.0 38 304-341 41-79 (115) 140 PRK04250 dihydroorotase; Provi 38.5 24 0.00062 14.5 4.2 13 76-88 19-31 (408) 141 PTZ00079 NADP-specific glutama 38.4 24 0.00062 14.5 6.4 83 12-105 20-114 (469) 142 PRK09517 multifunctional thiam 38.2 24 0.00062 14.4 2.8 38 314-351 278-316 (738) 143 pfam10112 Halogen_Hydrol 5-bro 38.2 23 0.0006 14.6 2.1 20 715-734 167-186 (199) 144 PRK07777 aminotransferase; Val 37.3 9.8 0.00025 17.2 0.1 47 98-152 5-51 (386) 145 TIGR00313 cobQ cobyric acid sy 37.2 25 0.00064 14.3 3.6 113 181-345 59-177 (502) 146 PRK07683 aminotransferase A; V 36.4 12 0.00032 16.5 0.5 35 495-529 166-200 (387) 147 PRK05690 molybdopterin biosynt 36.4 19 0.00048 15.2 1.4 60 267-335 38-99 (245) 148 TIGR02719 repress_PhaQ poly-be 36.1 19 0.00048 15.2 1.4 31 609-639 59-89 (138) 149 KOG1327 consensus 36.1 16 0.00041 15.7 1.0 44 461-504 313-368 (529) 150 cd01122 GP4d_helicase GP4d_hel 35.9 26 0.00067 14.2 3.0 25 505-529 163-187 (271) 151 TIGR02326 transamin_PhnW 2-ami 35.3 27 0.00069 14.1 4.0 111 604-725 246-357 (366) 152 COG4721 ABC-type cobalt transp 35.0 16 0.00041 15.7 0.9 57 93-163 37-95 (192) 153 pfam02244 Propep_M14 Carboxype 34.5 28 0.00071 14.1 2.0 25 674-698 38-62 (75) 154 PRK12475 thiamine/molybdopteri 34.4 22 0.00056 14.7 1.5 22 267-288 30-51 (337) 155 TIGR03538 DapC_gpp succinyldia 34.3 14 0.00037 16.0 0.5 32 498-529 174-205 (394) 156 PRK07682 hypothetical protein; 33.6 13 0.00034 16.3 0.3 38 113-151 9-46 (378) 157 PRK08762 molybdopterin biosynt 33.4 21 0.00054 14.9 1.3 45 267-312 144-188 (379) 158 PRK06507 consensus 33.1 25 0.00065 14.3 1.6 32 498-529 172-204 (400) 159 PRK10664 transcriptional regul 32.9 29 0.00075 13.9 3.5 24 1-24 1-24 (90) 160 PRK07681 aspartate aminotransf 32.7 16 0.00041 15.7 0.6 25 672-696 365-389 (399) 161 PRK07688 thiamine/molybdopteri 32.6 25 0.00064 14.3 1.6 21 267-287 30-50 (339) 162 PRK05764 aspartate aminotransf 32.4 15 0.00039 15.8 0.4 32 498-529 172-203 (389) 163 TIGR00366 TIGR00366 conserved 32.1 15 0.00039 15.9 0.4 27 394-422 267-293 (463) 164 PRK06207 aspartate aminotransf 32.1 21 0.00053 14.9 1.1 32 498-529 187-218 (406) 165 PRK05463 hypothetical protein; 31.8 31 0.00078 13.8 2.3 53 316-375 39-91 (264) 166 TIGR02662 dinitro_DRAG ADP-rib 31.6 15 0.00037 16.0 0.2 20 413-433 98-117 (295) 167 pfam06032 DUF917 Protein of un 31.5 22 0.00056 14.7 1.1 55 240-299 22-76 (352) 168 TIGR01466 cobJ_cbiH precorrin- 31.4 31 0.00079 13.7 5.4 179 183-374 3-236 (254) 169 PRK06107 aspartate aminotransf 31.4 22 0.00055 14.8 1.1 18 512-529 188-206 (402) 170 PRK06348 aspartate aminotransf 31.2 15 0.00038 15.9 0.2 33 497-529 169-201 (383) 171 PRK04914 ATP-dependent helicas 31.2 16 0.00041 15.7 0.4 43 300-342 186-229 (955) 172 pfam01988 VIT1 VIT family. Thi 31.1 32 0.0008 13.7 3.4 21 323-343 81-101 (210) 173 PRK07337 aminotransferase; Val 31.1 13 0.00034 16.2 -0.1 17 512-528 185-201 (388) 174 TIGR02355 moeB molybdopterin s 31.0 26 0.00066 14.2 1.4 43 327-369 131-174 (240) 175 TIGR00987 himA integration hos 30.7 32 0.00082 13.6 3.4 15 9-23 10-24 (96) 176 PRK08362 consensus 30.6 15 0.00038 15.9 0.1 16 321-336 179-194 (389) 177 cd01487 E1_ThiF_like E1_ThiF_l 29.8 33 0.00084 13.5 2.0 59 267-335 5-65 (174) 178 TIGR01304 IMP_DH_rel_2 IMP deh 29.6 22 0.00056 14.7 0.9 19 514-532 181-199 (376) 179 PRK04322 peptidyl-tRNA hydrola 29.4 34 0.00086 13.5 3.2 46 304-352 42-88 (116) 180 PRK10753 transcriptional regul 28.9 34 0.00087 13.4 3.4 20 3-22 3-22 (90) 181 PRK11706 TDP-4-oxo-6-deoxy-D-g 28.9 13 0.00033 16.3 -0.5 37 15-51 8-47 (375) 182 cd04914 ACT_AKi-DapG-BS_1 ACT 28.9 34 0.00087 13.4 3.4 32 667-698 34-65 (67) 183 PRK08361 aspartate aminotransf 28.9 16 0.00041 15.7 0.1 20 510-529 185-204 (390) 184 PRK08912 hypothetical protein; 28.8 15 0.00038 15.9 -0.1 32 498-529 167-198 (387) 185 pfam01726 LexA_DNA_bind LexA D 28.6 35 0.00088 13.4 3.8 37 21-64 12-54 (65) 186 PRK05794 consensus 28.5 20 0.00052 15.0 0.5 32 498-529 171-202 (397) 187 TIGR02393 RpoD_Cterm RNA polym 27.7 36 0.00091 13.3 2.2 21 505-526 59-80 (240) 188 PRK07366 succinyldiaminopimela 27.7 16 0.00041 15.7 -0.2 32 498-529 173-204 (388) 189 cd02962 TMX2 TMX2 family; comp 27.6 4.1 0.00011 19.7 -3.2 40 322-363 86-125 (152) 190 PRK05942 aspartate aminotransf 27.4 19 0.00049 15.2 0.2 32 498-529 178-209 (394) 191 PRK07411 hypothetical protein; 27.4 36 0.00093 13.3 2.7 20 267-286 44-63 (390) 192 cd00616 AHBA_syn 3-amino-5-hyd 27.3 12 0.00031 16.5 -0.8 26 672-697 269-297 (352) 193 cd01316 CAD_DHOase The eukaryo 27.3 36 0.00093 13.2 2.9 16 510-525 265-281 (344) 194 pfam01791 DeoC DeoC/LacD famil 27.2 37 0.00093 13.2 2.8 50 477-529 80-129 (231) 195 COG4840 Uncharacterized protei 27.1 37 0.00094 13.2 2.9 18 13-30 51-68 (71) 196 COG0436 Aspartate/tyrosine/aro 26.9 14 0.00036 16.1 -0.5 93 430-529 109-202 (393) 197 PRK08960 hypothetical protein; 26.8 25 0.00063 14.4 0.7 32 498-529 173-204 (387) 198 COG2759 MIS1 Formyltetrahydrof 26.7 37 0.00095 13.2 2.2 46 2-68 9-56 (554) 199 cd01899 Ygr210 Ygr210 subfamil 26.7 14 0.00036 16.1 -0.6 16 89-104 22-38 (318) 200 PRK10578 hypothetical protein; 26.2 4.3 0.00011 19.6 -3.3 20 105-125 37-57 (207) 201 cd05383 SCP_CRISP SCP_CRISP: S 26.1 35 0.0009 13.3 1.4 22 34-55 30-51 (138) 202 pfam07002 Copine Copine. This 25.9 16 0.0004 15.8 -0.5 15 464-478 11-25 (145) 203 PRK07543 consensus 25.8 33 0.00084 13.6 1.2 19 130-148 36-54 (400) 204 PRK11032 hypothetical protein; 25.8 39 0.00099 13.1 1.7 17 16-32 43-59 (160) 205 PRK08644 thiamine biosynthesis 25.3 39 0.001 13.0 1.5 22 267-288 33-54 (209) 206 COG1066 Sms Predicted ATP-depe 24.8 40 0.001 12.9 1.8 29 322-350 197-225 (456) 207 PRK08068 transaminase; Reviewe 24.3 21 0.00053 14.9 -0.1 18 511-528 188-205 (389) 208 PRK12319 acetyl-CoA carboxylas 24.2 41 0.0011 12.9 2.0 29 495-528 75-103 (256) 209 smart00198 SCP SCP / Tpx-1 / A 24.2 38 0.00097 13.1 1.2 17 34-50 36-52 (144) 210 TIGR03600 phage_DnaB phage rep 24.1 42 0.0011 12.9 2.0 57 473-529 292-350 (421) 211 PRK08069 consensus 24.0 27 0.00068 14.2 0.4 15 321-335 182-196 (390) 212 pfam01041 DegT_DnrJ_EryC1 DegT 24.0 31 0.00078 13.8 0.7 36 16-51 3-41 (363) 213 cd02975 PfPDO_like_N Pyrococcu 24.0 16 0.0004 15.8 -0.7 50 16-67 3-54 (113) 214 cd02434 Nodulin-21_like_3 Nodu 23.9 42 0.0011 12.8 3.6 18 323-340 82-99 (225) 215 TIGR02980 SigBFG RNA polymeras 23.9 42 0.0011 12.8 2.3 26 617-642 130-158 (229) 216 pfam11198 DUF2857 Protein of u 23.9 42 0.0011 12.8 3.9 26 596-621 117-142 (181) 217 pfam03796 DnaB_C DnaB-like hel 23.9 42 0.0011 12.8 2.8 24 506-529 154-177 (186) 218 PRK07309 aromatic amino acid a 23.8 21 0.00053 14.9 -0.1 31 498-529 174-205 (390) 219 COG2344 AT-rich DNA-binding pr 23.6 42 0.0011 12.8 6.8 85 611-695 72-172 (211) 220 TIGR02374 nitri_red_nirB nitri 23.5 43 0.0011 12.8 1.5 14 567-580 711-724 (813) 221 PRK06749 replicative DNA helic 23.4 43 0.0011 12.8 3.5 57 473-529 285-346 (428) 222 PRK12414 putative aminotransfe 23.1 18 0.00046 15.4 -0.6 23 129-151 33-55 (384) 223 PRK06290 aspartate aminotransf 23.0 16 0.00041 15.7 -0.8 32 498-529 187-218 (411) 224 TIGR01709 typeII_sec_gspL gene 23.0 44 0.0011 12.7 2.6 33 309-341 102-134 (433) 225 TIGR00028 Mtu_PIN_fam Mycobact 22.8 44 0.0011 12.7 2.4 44 40-83 18-61 (155) 226 PRK06225 aspartate aminotransf 22.8 22 0.00056 14.7 -0.2 33 497-529 159-191 (375) 227 PRK06375 consensus 22.7 24 0.00061 14.5 -0.0 32 498-529 167-198 (381) 228 KOG2584 consensus 22.7 44 0.0011 12.7 3.5 25 323-347 234-258 (522) 229 PRK00199 ihfB integration host 22.5 44 0.0011 12.7 3.5 17 5-21 5-22 (91) 230 COG1574 Predicted metal-depend 22.5 45 0.0011 12.7 2.1 26 73-98 49-74 (535) 231 pfam01981 PTH2 Peptidyl-tRNA h 22.4 45 0.0011 12.6 3.8 53 304-359 42-95 (116) 232 cd06218 DHOD_e_trans FAD/NAD b 22.4 45 0.0011 12.6 3.9 38 174-211 98-135 (246) 233 pfam03458 UPF0126 UPF0126 doma 22.4 12 0.0003 16.6 -1.6 24 102-126 32-56 (81) 234 TIGR03540 DapC_direct LL-diami 22.1 17 0.00043 15.6 -0.9 32 498-529 172-203 (383) 235 TIGR02624 rhamnu_1P_ald rhamnu 21.8 46 0.0012 12.6 2.0 15 430-444 83-97 (273) 236 PRK10241 hydroxyacylglutathion 21.8 46 0.0012 12.6 1.8 45 292-340 4-48 (251) 237 pfam08731 AFT Transcription fa 21.7 46 0.0012 12.6 1.8 14 83-96 76-89 (90) 238 PRK04161 tagatose 1,6-diphosph 21.6 46 0.0012 12.5 3.0 80 446-529 83-163 (326) 239 TIGR02823 oxido_YhdH putative 21.1 47 0.0012 12.5 11.3 188 530-726 43-287 (330) 240 PRK12427 flagellar biosynthesi 21.0 48 0.0012 12.4 3.0 13 511-523 50-62 (229) 241 cd00168 SCP SCP: SCP-like extr 20.8 46 0.0012 12.6 1.1 21 33-53 23-43 (122) 242 KOG1004 consensus 20.8 42 0.0011 12.9 0.9 50 197-246 62-118 (230) 243 pfam02864 STAT_bind STAT prote 20.7 33 0.00084 13.6 0.4 11 500-510 236-246 (254) 244 TIGR02997 Sig70-cyanoRpoD RNA 20.5 49 0.0012 12.4 2.3 27 469-495 167-197 (336) 245 pfam08652 RAI1 RAI1 like. RAI1 20.1 50 0.0013 12.3 1.6 29 51-79 39-67 (68) No 1 >TIGR01736 FGAM_synth_II phosphoribosylformylglycinamidine synthase II; InterPro: IPR010074 Phosphoribosylformylglycinamidine synthase is a single, long polypeptide in most bacteria. Three proteins are required in Bacillus subtilis and many other species. This is the longest of the three and is designated PurL, phosphoribosylformylglycinamidine synthase II, or FGAM synthase II.; GO: 0004642 phosphoribosylformylglycinamidine synthase activity, 0006189 'de novo' IMP biosynthetic process, 0005737 cytoplasm. Probab=100.00 E-value=0 Score=1965.03 Aligned_cols=717 Identities=48% Similarity=0.778 Sum_probs=688.6 Q ss_pred CCHHHHHHHH---HHHCCCCCHHHHHHHHHHHCCCCCCCCCHHHHHHHHHCC----CCEEECCC-CCEEEEEECCCEEEE Q ss_conf 9989999999---995789718899998533001107445889997512069----74899334-773689836983999 Q gi|254780970|r 16 LTKDECDQII---HILKRQPTLTEIGIISAMWNEHCSYKSSKKWLRTLPTTG----KHVIQGPG-ENAGVVDIGGGDCVV 87 (737) Q Consensus 16 L~~~e~~~i~---~~l~R~Pt~~El~~~~~~wSEHC~~k~~~~~l~~~~~~~----~~vi~~~~-~~a~vi~~~~~~~~~ 87 (737) |+++||+.|+ +.|||+||++|+.||++||||||||||||+|||+||+++ ||||+||| ||||||||+|+|+++ T Consensus 1 L~~~E~~~I~~YF~~LGRePn~~E~~~f~~mWSEHCSYKSSk~~L~~fp~~~n~k~~~Vi~GPGWeDAGVV~i~D~~avV 80 (763) T TIGR01736 1 LSDEELELIRDYFKELGREPNDTELAMFEAMWSEHCSYKSSKKLLRQFPTKGNKKAPNVIQGPGWEDAGVVDIDDGYAVV 80 (763) T ss_pred CCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCCCHHHHHCCCCCCCCCCCCEEECCCCCCCCEEEECCCEEEE T ss_conf 97658999999999738883489999998530001131002477615786434668725864553686068875977999 Q ss_pred EEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCC-------CHHHHHHHHHHHHHHHHHHCCC Q ss_conf 995178877765888872105581022576778743687750457875686-------3888998887458866543087 Q gi|254780970|r 88 FKMESHNHPSYIEPYQGAATGVGGILRDIFTMGARPVAAMNSLRFGAIHHP-------KTKHLLSGVVAGIAGYSNSFGV 160 (737) Q Consensus 88 ~k~EtHNhPs~i~P~~GAaTg~gG~iRDi~~~Ga~p~a~~~~l~~g~~~~p-------~~~~~~~~~~~G~~~ygN~~G~ 160 (737) |||||||||||||||+|||||||||+|||+||||||||+||+||||++++| |.|||+.+||+||+|||||+|| T Consensus 81 ~k~ESHNHPSaiePY~GAATGVGGI~RDvlsMGArPIALlD~LrFG~l~~~k~ekL~~k~ryL~~gVV~GI~dYGNriGV 160 (763) T TIGR01736 81 FKIESHNHPSAIEPYNGAATGVGGIVRDVLSMGARPIALLDSLRFGPLDSPKTEKLLRKNRYLFEGVVKGISDYGNRIGV 160 (763) T ss_pred EEEEECCCCCCCCCCCCCCCCCCEEEEEEEEECCEEEEECCCCCCCCCCCCCCCCCCCCCCEEECCCEEEECCCCCEEEE T ss_conf 99965588974258887000216068767763663010016534786887544774010400110223004577664210 Q ss_pred CCCCCEEEEECCCCCCCCCCEEEECCCCHHHHHHCCCCCCCCEEEEECCCCCCCCCHHHHHHHHHHCCC-CCCCCCCCCC Q ss_conf 211320355215676642211221121135543112256897799965765644410466667772865-3334443326 Q gi|254780970|r 161 PTVGGEVEFLPCYNNNIIVNTFAAGIAKTNAIFSSKARGIGLPLVYLGAKTGRDGIGGASMASEEFGEN-IAKKRPTVQV 239 (737) Q Consensus 161 P~v~G~~~~~~~y~~~~~v~~~~~Gi~~~~~~~~~~~~~~Gd~ii~~G~~tg~dGi~Ga~~sS~~~~~~-~~~~~~~VQ~ 239 (737) |||||+|.||++|++|||||++|+||++.|+++++++.+|||.||++|++|||||||||||||++|+|+ .+++|||||+ T Consensus 161 PTVgGE~~FD~~Y~~NpLVNv~CvGlv~~d~I~~g~A~~~G~~lvl~G~~TGRDGigGA~FAS~eL~E~~eee~RPAVQv 240 (763) T TIGR01736 161 PTVGGEVEFDESYNGNPLVNVMCVGLVRKDDIVTGKAKGPGNKLVLVGSKTGRDGIGGATFASEELSEEAEEEDRPAVQV 240 (763) T ss_pred CCCCCEEEECCCCCCCCCEEEEEEEEECCCCEEEECCCCCCCEEEEEECCCCCCCCCCCEECCCCCCCCHHHHCCCCEEE T ss_conf 13672777766778898255589887456760330025898449994137787765751222421272200332884110 Q ss_pred CCHHHHHHHHHHHHHHCCCCCCHHCCCCCCCCHHHHHHHHHCCCCCCEEEEEHHHCCCCCCCCHHHHHHCCCCCEEEEEE Q ss_conf 79899988899987401138301103477520111223320147752899620220466789998986178885089999 Q gi|254780970|r 240 GDPFTGKCLLEACLELMNTDAVIAIQDMGAAGLTCSAIEMGNQGNLGITLDLDKVPTCEEGMTAYEMMLSESQERMLMIL 319 (737) Q Consensus 240 gdp~~ek~~~~~~~~~~~~~~i~~~~D~gaGGl~~a~~Ema~~~~~G~~i~l~~vp~~~~~l~~~ei~~sEsQeR~~~~v 319 (737) |||||||+|||||||++++++|++|||||||||+||++|||.++|+|++|+|||||+||++|+|||||+|||||||+++| T Consensus 241 GDPF~EK~LieA~LEa~~~g~V~g~kDLGaaGLt~a~~EMa~~gg~G~~~~LDkVPlRE~gM~P~EimlSESQERML~vv 320 (763) T TIGR01736 241 GDPFTEKLLIEATLEAVDTGLVKGIKDLGAAGLTSASSEMAAKGGLGAEIDLDKVPLREEGMTPYEIMLSESQERMLLVV 320 (763) T ss_pred CCCHHHHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCHHHHHHHHCCCCEEEEE T ss_conf 77112357999999887249268873118713789999997158872699973335469887488888752100117850 Q ss_pred ECCCHHHHHHHHHHHCCEEECCCCEEECCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCCCCCC---CCCCCC Q ss_conf 54675889998887097131122121044310002586443443210146650114555315655510123---468725 Q gi|254780970|r 320 NPEKQHKAQEILNKWGLHFSIIGITTNDKLFRVIHRGEEVANLPIKALSDEAPEYDRAWCEPITIPPSNPQ---LLCSDE 396 (737) Q Consensus 320 ~~~~~~~~~~i~~~~~~~~~~iG~vt~~~~~~v~~~g~~v~dlp~~~L~~~~P~~~r~~~~~~~~~~~~~~---~~~~~~ 396 (737) .|++++++.+||+||+++|+|||+||++++++++|+|++|+|||+++|.+++|.|+||++++.+.+..... ..+++. T Consensus 321 ~P~~v~~~~~i~~KyeL~~sVvG~vT~~~~~~~~~~Ge~v~dlP~~~L~~~AP~~~rP~~~~~~~~~~~~~~~~~~~p~~ 400 (763) T TIGR01736 321 APEDVEEVLEIFEKYELDASVVGEVTDEGRIRVKYKGEVVADLPIELLADEAPEYERPSEEPKYPEEEKEEDLSPEPPPA 400 (763) T ss_pred CCCCHHHHHHHHHHHCCCEEEEEEEECCCEEEEEECCEEEEEECHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 47878999999997289849998993486699999383999712453088777414887133354431010466657875 Q ss_pred CHHHHHHHHHHCCCCCCHHHHHHHCCCCCCCCCCCCCCC---------CCCEEEECCCCC--CEEEEECCCCCCCHHCCC Q ss_conf 588999998507321010356531121000110000135---------763142037776--227973045321000010 Q gi|254780970|r 397 DCTKSLLKLLSSPNLSSRRWVYEQYDTMIQSNSIQLPGG---------DAGVIRVEGHET--KALAFSSDVTPRYVKADP 465 (737) Q Consensus 397 d~~~~l~~vL~~pnV~SK~~i~~qyD~~V~g~tv~~Pg~---------DaaVv~~~~~~~--~gia~s~g~~p~~~~~dP 465 (737) |+.++|++||+|||+|||+|||+||||+||+|||++||. ||||+|+.+... +|+|+|+||||+|+++|| T Consensus 401 ~l~~~~~~vL~~pN~asK~wvY~QYDheVq~rTVvkPG~G~~N~ETH~DAAVlr~~~~~~~~~glA~t~D~NPr~~~~dP 480 (763) T TIGR01736 401 DLEDALLKVLSSPNIASKEWVYRQYDHEVQTRTVVKPGEGKMNKETHADAAVLRIKETGKLDKGLALTADCNPRYVYLDP 480 (763) T ss_pred CHHHHHHHHHHCCCHHHCCCEEEEECEEECCEEEEECCCCCCCCCCCCCEEEEEECCCCCCCCEEEEEECCCCCCEEECH T ss_conf 67899999862333021541376511067325776078887774111541346640568775278997078766033160 Q ss_pred HHHHHHHHHHHHHHHHCCCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCC----CC Q ss_conf 01135678977764102785102232210245666862134699999999852210288665443200112368----66 Q gi|254780970|r 466 FEGTKQAVAECWRNIIATGAKPLAITDNLNFGNPEKEEIMGQFVHSVKGIREACQILDFPIVSGNVSFYNETNG----QS 541 (737) Q Consensus 466 ~~ga~~AV~Eal~Nl~a~Ga~Plait~~lnf~~P~~~e~~~ql~~av~gl~d~c~~lgipivgGkvSl~n~t~~----~~ 541 (737) |+||+.||+||+|||+|+||+||+++||||||||++||+||||.++|+||+|+|++|++|||||||||||||++ ++ T Consensus 481 y~Ga~~~vaEa~RNLa~vGA~PLa~vDcLNFGNPErPEvywQf~~~v~Gl~daa~~l~~PVVgGNVSLYNE~~~~~~~~~ 560 (763) T TIGR01736 481 YAGAAAAVAEAYRNLAAVGAEPLAIVDCLNFGNPERPEVYWQFVEAVKGLGDAARALGTPVVGGNVSLYNETNGEDAKVP 560 (763) T ss_pred HHHHHHHHHHHHHCEEEECCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEECCEECCCCCCCCCCCCC T ss_conf 35588977743112564087548888405689888703678999999999999986388988211000231037872435 Q ss_pred CCCCCCCCCCCCCCCCEEEEE-CCCCCCCCCEEECCCCC-CCCCHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 676311122223335202331-02143898075137766-5543267888861167899964798999999999999999 Q gi|254780970|r 542 IFPTPTIAGVGILPDYSLMTR-IDSAHEGDLILMIGNDG-CHLDCSMYSLECASSNIGPPPKVDCHLEKNHGFFVLSMIN 619 (737) Q Consensus 542 I~pTpvi~~vG~v~d~~~~it-~~~k~~Gd~I~liG~~~-~~LggS~~~~~~~~~~~g~~p~~d~~~~k~~~~~v~~li~ 619 (737) |.|||+|+++|+++|+++..+ ..+++.||.|||+|.|+ ++||||+|++.+++...|.+|.+|++.+++..+.++++|+ T Consensus 561 i~PTP~i~~VG~~~Dv~~~~~~~~~~~~Gd~~~l~Get~~~ElGGS~Y~~~~~g~~~G~~P~vDl~~e~~~~d~v~~~i~ 640 (763) T TIGR01736 561 IAPTPTIGMVGLVEDVEKLLTSLNFKKEGDAIYLIGETKKDELGGSEYLKVVHGIVSGQVPAVDLEEEKELADAVREAIR 640 (763) T ss_pred CCCCCEEEEEEEECCCCCCCCCCHHHHCCCEEEEECCCCCCCCCHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHH T ss_conf 37851589998842742125642054479769993143236664767877778844786888786888999999999974 Q ss_pred CCCEEEEEECCCCHHHHHHHHHHHCCCCCEEEEECCCCC--CCHHHHHHCCCCCCEEEEECHHHH-HHHHHHHHHCCCCE Q ss_conf 598069946486579999999984589708998056668--987888729883218999968809-98999998579849 Q gi|254780970|r 620 AKKITACHDISTGGLIITLAEMTISSAKGMDIILPIEIE--KDPKPFLFGEDQGRYVVCISPENQ-DLVMSEANNKNIPL 696 (737) Q Consensus 620 ~g~I~S~HDiSdGGL~~aL~EMa~as~~G~~I~L~~~~~--~d~~~~LFsEs~Gr~ii~V~~~~~-~~~~~~~~~~gi~~ 696 (737) +++|+++||||.|||++||+||++++++|++|+++.... .++...||||++||+|++|+.++. ......|...++++ T Consensus 641 ~g~v~~ahDvS~GGLAvaLAem~~~~G~Ga~~~~~~~~~~~~r~~~~LFSEs~gR~iv~~~~~~~v~~~~~ff~~~~~~a 720 (763) T TIGR01736 641 EGLVSAAHDVSRGGLAVALAEMAIASGIGAEVDIDEIASETARADELLFSESNGRAIVAVREEKAVEEAVKFFESKGVPA 720 (763) T ss_pred CCCCEEEEEECCCHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHCCCCCCCEEEEECCHHHHHHHHHHHHCCCCEE T ss_conf 06820554433523899999999744862599725788500002688706778866999579799999998731236506 Q ss_pred EEEE-EECCCEEEEE------CCEECCHHHHHHHHHHHHHHHH Q ss_conf 9989-9879868980------7412149999999999999872 Q gi|254780970|r 697 RYLG-KVSGSTLSIH------NILNIPVATLQTKYESWFPEFI 732 (737) Q Consensus 697 ~~IG-~Vt~~~l~i~------~~~~~~i~~l~~~w~~~l~~~~ 732 (737) .+|| ++.++.++|+ +.++.++++|++.|+++|+++| T Consensus 721 ~~~G~~~~~~~~~~~~~~~~~~~~~~~~~el~~~~~~~~~~~~ 763 (763) T TIGR01736 721 KVIGEKTGGEKLTIKTGDFKLDEISVSVKELKEAWEEALPELM 763 (763) T ss_pred EEEEEEECCCEEEEEECCCEEEEEEECHHHHHHHHHHHHHHCC T ss_conf 7774786486379851673034432017889999988765309 No 2 >PRK01213 phosphoribosylformylglycinamidine synthase II; Provisional Probab=100.00 E-value=0 Score=1773.12 Aligned_cols=728 Identities=54% Similarity=0.889 Sum_probs=701.3 Q ss_pred HHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCCCCCCCHHHHHHHHHCCCCEEECCCCCEEEEEECCCE Q ss_conf 14788998619998999999999578971889999853300110744588999751206974899334773689836983 Q gi|254780970|r 5 SPILEILTAHGLTKDECDQIIHILKRQPTLTEIGIISAMWNEHCSYKSSKKWLRTLPTTGKHVIQGPGENAGVVDIGGGD 84 (737) Q Consensus 5 ~~~~e~~~~~gL~~~e~~~i~~~l~R~Pt~~El~~~~~~wSEHC~~k~~~~~l~~~~~~~~~vi~~~~~~a~vi~~~~~~ 84 (737) ...++++++|||+.+||++|++.|||+|||+||+||+|+||||||||+||+|||+|++++++||++|+||||||+++++| T Consensus 5 ~~~~~~~~~~gL~~~E~~~i~~~LgR~Ptd~El~~~~q~wSEHC~yK~sk~~l~~~~~~~~~vl~~~~dnAgvv~~~~~~ 84 (736) T PRK01213 5 PITPELYAEHGLTDDEYERILEILGREPNFTELGIFSVMWSEHCSYKSSKPHLRKFPTKGPRVLQGPGENAGVVDIGDGQ 84 (736) T ss_pred CCCHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCHHHHHHCCCCCCCEEEECCCCEEEEEECCCE T ss_conf 66688998839999999999998688987999999988875540677638999747888995678548860599957986 Q ss_pred EEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 99999517887776588887210558102257677874368775045787568638889988874588665430872113 Q gi|254780970|r 85 CVVFKMESHNHPSYIEPYQGAATGVGGILRDIFTMGARPVAAMNSLRFGAIHHPKTKHLLSGVVAGIAGYSNSFGVPTVG 164 (737) Q Consensus 85 ~~~~k~EtHNhPs~i~P~~GAaTg~gG~iRDi~~~Ga~p~a~~~~l~~g~~~~p~~~~~~~~~~~G~~~ygN~~G~P~v~ 164 (737) +++||+|||||||||+||+|||||+||+|||+++|||||+|++|.||||+|+.|+++|++.+++.|+++||||||+|||+ T Consensus 85 ~~~fK~ETHNHPsaIePf~GAaTGvGG~iRD~~~~GA~Pia~~d~l~fG~~~~~~~~~i~~gvv~G~s~YGN~~G~Ptv~ 164 (736) T PRK01213 85 AVVFKIESHNHPSAVEPYQGAATGVGGILRDIFTMGARPIALLNSLRFGELDHPKTRYLVEGVVAGIGGYGNCVGVPTVG 164 (736) T ss_pred EEEEEECCCCCCCCCCCCCCCCCCCCCEECCCCCCCCCEEECCCCCCCCCCCCCCHHHHHHHCCCCHHHCCCCCCCCCCC T ss_conf 99999788899987788796101668700178766874155245555689898635344321244321057444657533 Q ss_pred CEEEEECCCCCCCCCCEEEECCCCHHHHHHCCCCCCCCEEEEECCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCHHH Q ss_conf 20355215676642211221121135543112256897799965765644410466667772865333444332679899 Q gi|254780970|r 165 GEVEFLPCYNNNIIVNTFAAGIAKTNAIFSSKARGIGLPLVYLGAKTGRDGIGGASMASEEFGENIAKKRPTVQVGDPFT 244 (737) Q Consensus 165 G~~~~~~~y~~~~~v~~~~~Gi~~~~~~~~~~~~~~Gd~ii~~G~~tg~dGi~Ga~~sS~~~~~~~~~~~~~VQ~gdp~~ 244 (737) |+++||++|.+||+|+++|+|+++.++++++++.++||+||++||+|||||||||||||.+++++.+.++++|||||||| T Consensus 165 G~~~F~~~y~~nplv~a~~vG~v~~~~i~~~~~~~~G~~vv~lGg~tgrdGigGas~sS~~~~~~s~~~~~aVQ~GdP~~ 244 (736) T PRK01213 165 GEVYFDPSYNGNPLVNAMCVGLVEHDDIVKGKASGVGNPVVYVGAKTGRDGIGGATFASAELSEESEEKRPAVQVGDPFM 244 (736) T ss_pred EEEEECCCCCCCEEEEEEEEEEEEHHHCEECCCCCCCCEEEEECCCCCCCCCCCHHHHHHHCCCCCHHHCCCEECCCHHH T ss_conf 38897156477623675578874154430256899998899934755545677501212332455544356601288678 Q ss_pred HHHHHHHHHHHCCCCCCHHCCCCCCCCHHHHHHHHHCCCCCCEEEEEHHHCCCCCCCCHHHHHHCCCCCEEEEEEECCCH Q ss_conf 98889998740113830110347752011122332014775289962022046678999898617888508999954675 Q gi|254780970|r 245 GKCLLEACLELMNTDAVIAIQDMGAAGLTCSAIEMGNQGNLGITLDLDKVPTCEEGMTAYEMMLSESQERMLMILNPEKQ 324 (737) Q Consensus 245 ek~~~~~~~~~~~~~~i~~~~D~gaGGl~~a~~Ema~~~~~G~~i~l~~vp~~~~~l~~~ei~~sEsQeR~~~~v~~~~~ 324 (737) ||+|+|+|+|++++++|.+|||+|||||+||++||+.++++|++||||+||+||++|+|||||||||||||+++|+|+++ T Consensus 245 ek~li~ac~el~~~~~I~~i~D~GAGGls~a~~Ela~~~g~G~~i~Ld~Vpl~~~gmsp~EI~~SESQERm~l~v~p~~~ 324 (736) T PRK01213 245 EKLLIEACLELFATGAVVGIQDMGAAGLTCSSSEMAAKGGLGIELDLDKVPLREEGMTPYEIMLSESQERMLLVVKPGKE 324 (736) T ss_pred HHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHEEEEECCHHH T ss_conf 88999999987406987899856996177628998742797899991247289999888997776134315999883468 Q ss_pred HHHHHHHHHHCCEEECCCCEEECCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCCCC---CCCCCCCCCHHHH Q ss_conf 8899988870971311221210443100025864434432101466501145553156555101---2346872558899 Q gi|254780970|r 325 HKAQEILNKWGLHFSIIGITTNDKLFRVIHRGEEVANLPIKALSDEAPEYDRAWCEPITIPPSN---PQLLCSDEDCTKS 401 (737) Q Consensus 325 ~~~~~i~~~~~~~~~~iG~vt~~~~~~v~~~g~~v~dlp~~~L~~~~P~~~r~~~~~~~~~~~~---~~~~~~~~d~~~~ 401 (737) ++|.+||+||+|+|++||+||++++++++|++++++|||+++|++++|+|+|++++|....... ........+++++ T Consensus 325 ~~~~~i~~re~~~~~vVG~vt~~~~l~v~~~g~~vvdlp~~~L~~~~P~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 404 (736) T PRK01213 325 EEVLAIFEKWDLDAAVIGEVTDTGRLRVYHKGEVVADIPIKALADEAPVYDRPVEEPAYLDELQEDAAELPRPEDDLKEA 404 (736) T ss_pred HHHHHHHHHCCCCEEEEEEEECCCEEEEEECCCEEEEEEHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHH T ss_conf 99999998859988999999338829999899779994789966899755676667554321223433367875319999 Q ss_pred HHHHHHCCCCCCHHHHHHHCCCCCCCCCCCCCCCCCCEEEECCCCCCEEEEECCCCCCCHHCCCHHHHHHHHHHHHHHHH Q ss_conf 99985073210103565311210001100001357631420377762279730453210000100113567897776410 Q gi|254780970|r 402 LLKLLSSPNLSSRRWVYEQYDTMIQSNSIQLPGGDAGVIRVEGHETKALAFSSDVTPRYVKADPFEGTKQAVAECWRNII 481 (737) Q Consensus 402 l~~vL~~pnV~SK~~i~~qyD~~V~g~tv~~Pg~DaaVv~~~~~~~~gia~s~g~~p~~~~~dP~~ga~~AV~Eal~Nl~ 481 (737) |+++|+|||||||+|||+||||+||++|||+|++||||++++ ++++|+|||+|++|+|+.+|||+||++||+||+|||+ T Consensus 405 l~~vLs~pnVaSK~~l~~qyD~~V~g~TV~~P~~DaaVi~i~-~s~~g~a~s~g~~p~~~~~dP~~ga~~aV~Ea~rNlv 483 (736) T PRK01213 405 LLKLLSSPNIASKRWVYEQYDHQVQGNTVVKPGGDAAVVRIE-GTGKGLAMTTDCNPRYVYLDPYEGAKLAVAEAYRNVA 483 (736) T ss_pred HHHHHCCCCCCCCHHHHHHCCCHHCCCCCCCCCCCEEEEEEC-CCCEEEEEECCCCCHHHCCCHHHHHHHHHHHHHHHHH T ss_conf 999861876335112677418101477134689871799855-8975899955878515206999999999999997788 Q ss_pred CCCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEE Q ss_conf 27851022322102456668621346999999998522102886654432001123686667631112222333520233 Q gi|254780970|r 482 ATGAKPLAITDNLNFGNPEKEEIMGQFVHSVKGIREACQILDFPIVSGNVSFYNETNGQSIFPTPTIAGVGILPDYSLMT 561 (737) Q Consensus 482 a~Ga~Plait~~lnf~~P~~~e~~~ql~~av~gl~d~c~~lgipivgGkvSl~n~t~~~~I~pTpvi~~vG~v~d~~~~i 561 (737) |+||+|++++|||||+||++|+.||||+++++||.|+|++|++|++||||||||++.+.+|+|||+|+++|.++|+++.+ T Consensus 484 a~Ga~plai~d~lnfgnPe~pe~~~~l~~av~g~~dac~~l~ipvv~GkdSlyne~~~~~I~PTpvI~~vg~v~Dv~~~v 563 (736) T PRK01213 484 AVGAEPLAITDCLNFGNPEKPEVMWQFVEAVRGIAEACRALGTPVVGGNVSLYNETNGTAIYPTPVIGMVGLIDDVTKAI 563 (736) T ss_pred HHCCCCHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCEECCCCCCCCCCCCCCCEEEEEEEEECCHHCC T ss_conf 61786132212532389778778788999999999999981999675753773666993578751599877750600023 Q ss_pred ECCCCCCCCCEEECCCCCCCCCHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHH Q ss_conf 10214389807513776655432678888611678999647989999999999999995980699464865799999999 Q gi|254780970|r 562 RIDSAHEGDLILMIGNDGCHLDCSMYSLECASSNIGPPPKVDCHLEKNHGFFVLSMINAKKITACHDISTGGLIITLAEM 641 (737) Q Consensus 562 t~~~k~~Gd~I~liG~~~~~LggS~~~~~~~~~~~g~~p~~d~~~~k~~~~~v~~li~~g~I~S~HDiSdGGL~~aL~EM 641 (737) |+++|++||.||++|.++.+||||+|++.+++...+.+|.+|++.+++.++++++++++++|.|+||+|||||++||+|| T Consensus 564 t~~~k~~g~~i~lig~~~~~LGGS~~~~~~~~~~~~~~P~vd~~~~~~~~~~i~~li~~~~I~s~HDiSdGGL~~aLaEM 643 (736) T PRK01213 564 TSGFKEEGDLIYLLGETKGELGGSEYLKVVHGRVGGRPPAVDLAAEKRLGELLREAIRAGLVSSAHDVSDGGLAVALAEM 643 (736) T ss_pred CCCCCCCCCEEEEECCCCCCCCCCHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHH T ss_conf 73335789749997178776562199999727558989987999999999999999977986999888615899999999 Q ss_pred HHCCCCCEEEEECCCCCCCHHHHHHCCCCCCEEEEECHHHHHHHHHHHHHCCCCEEEEEEECCC-EEEEECCEECCHHHH Q ss_conf 8458970899805666898788872988321899996880998999998579849998998798-689807412149999 Q gi|254780970|r 642 TISSAKGMDIILPIEIEKDPKPFLFGEDQGRYVVCISPENQDLVMSEANNKNIPLRYLGKVSGS-TLSIHNILNIPVATL 720 (737) Q Consensus 642 a~as~~G~~I~L~~~~~~d~~~~LFsEs~Gr~ii~V~~~~~~~~~~~~~~~gi~~~~IG~Vt~~-~l~i~~~~~~~i~~l 720 (737) ||++++|++|+|+.. ..++..+||||++||+|++|++++.++|.++++++|+++.+||+|+++ .+.+++.+++++++| T Consensus 644 af~~~~G~~i~l~~~-~~~~~~~LFsE~~gr~vveV~~~~~~~~~~~~~~~gi~~~~IG~v~~~~~i~~~~~~~~~~~eL 722 (736) T PRK01213 644 AIASGLGATVDLSSD-GGRPDALLFSESQGRYVVSVPPENEEAFEELAEALGVPATVIGVVGGDDAVLGQGLISLSVEEL 722 (736) T ss_pred HHCCCCCEEEECCCC-CCCHHHHHCCCCCCEEEEEECHHHHHHHHHHHHHCCCCEEEEEEECCCCEEECCCCEEEEHHHH T ss_conf 856997379962888-8876776128888659999858899999999987799889988998855395177224159999 Q ss_pred HHHHHHHHHHHHCC Q ss_conf 99999999987245 Q gi|254780970|r 721 QTKYESWFPEFISE 734 (737) Q Consensus 721 ~~~w~~~l~~~~~~ 734 (737) ++.|++||+++|+. T Consensus 723 ~~~w~~~L~~~m~~ 736 (736) T PRK01213 723 REAWEGALPRLLGG 736 (736) T ss_pred HHHHHHHHHHHHCC T ss_conf 99999998897487 No 3 >COG0046 PurL Phosphoribosylformylglycinamidine (FGAM) synthase, synthetase domain [Nucleotide transport and metabolism] Probab=100.00 E-value=0 Score=1685.43 Aligned_cols=715 Identities=47% Similarity=0.745 Sum_probs=678.7 Q ss_pred HHHHHCCCCHHHHHHHHH-HHCCCCCHHHHHHHHHHHCCCCCCCCCHHHHHHHHHC--CCCEEECCCCCEEEEEECCCEE Q ss_conf 899861999899999999-9578971889999853300110744588999751206--9748993347736898369839 Q gi|254780970|r 9 EILTAHGLTKDECDQIIH-ILKRQPTLTEIGIISAMWNEHCSYKSSKKWLRTLPTT--GKHVIQGPGENAGVVDIGGGDC 85 (737) Q Consensus 9 e~~~~~gL~~~e~~~i~~-~l~R~Pt~~El~~~~~~wSEHC~~k~~~~~l~~~~~~--~~~vi~~~~~~a~vi~~~~~~~ 85 (737) +.+..++|+++||++|++ .+||+||++||+||+|||||||||||||+|||+|+++ +++|++||+||||+|+++|+|+ T Consensus 17 ~~~~gl~L~~dE~~~i~~~~~GR~P~~~El~mf~~~WSEHC~yKssk~~lk~~~~t~~~~~vi~g~gdnAgvvdi~d~~~ 96 (743) T COG0046 17 NTELGLALSDDEYDYIREFLLGREPTDTELGMFAQMWSEHCSYKSSKSLLKMFPTTHTGEYVLSGPGDNAGVVDIGDGWA 96 (743) T ss_pred HHHHHCCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCHHHHHHHCCCCCCCCEEEECCCCCCCEEEECCCEE T ss_conf 06755389989999999974688986999999999876344765269999619987888717875267763588679679 Q ss_pred EEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 99995178877765888872105581022576778743687750457875686388899888745886654308721132 Q gi|254780970|r 86 VVFKMESHNHPSYIEPYQGAATGVGGILRDIFTMGARPVAAMNSLRFGAIHHPKTKHLLSGVVAGIAGYSNSFGVPTVGG 165 (737) Q Consensus 86 ~~~k~EtHNhPs~i~P~~GAaTg~gG~iRDi~~~Ga~p~a~~~~l~~g~~~~p~~~~~~~~~~~G~~~ygN~~G~P~v~G 165 (737) ++||+||||||||||||+|||||+||||||+++|||||+|++|+|+||.|+.|+++||+.+++.||++||||||+|||+| T Consensus 97 v~fKvESHNHPSaIePy~GAATGvGGiIRDv~smGArPiA~l~~l~fG~~~~~~~~~i~~gvv~Gia~YGN~iGvPtv~G 176 (743) T COG0046 97 VVFKVESHNHPSAIEPYQGAATGVGGIIRDVLSMGARPIALLDSLRFGIPDIEKTRYIMKGVVAGIAAYGNEIGVPTVGG 176 (743) T ss_pred EEEEECCCCCCCCCCCCCCCCCCCCCEEECCCCCCCEEEEEECCCCCCCCCCCHHHHHHHHCCCCHHHCCCCCCCCCCCE T ss_conf 99985467898866887973046576562243467600010102124799973056787400640332376346554424 Q ss_pred EEEEECCCCCCCCCCEEEECCCCHHHHHHCCCCCCCCEEEEECCCCCCCCCHHHHHHHHHHC-CCCCCCCCCCCCCCHHH Q ss_conf 03552156766422112211211355431122568977999657656444104666677728-65333444332679899 Q gi|254780970|r 166 EVEFLPCYNNNIIVNTFAAGIAKTNAIFSSKARGIGLPLVYLGAKTGRDGIGGASMASEEFG-ENIAKKRPTVQVGDPFT 244 (737) Q Consensus 166 ~~~~~~~y~~~~~v~~~~~Gi~~~~~~~~~~~~~~Gd~ii~~G~~tg~dGi~Ga~~sS~~~~-~~~~~~~~~VQ~gdp~~ 244 (737) +++||++|.+|||||++|+|+++.++++++++.++||+||++|++||||||||++|||++++ ++.+.+||+||+||||| T Consensus 177 e~~fd~~y~~npLvna~~vG~i~~~~i~~~~a~~~G~~li~~Gg~TgrDGigGat~aS~~~~~~~~e~dr~aVQvGdPf~ 256 (743) T COG0046 177 EFRFDESYVGNPLVNAGCVGLIRKEHIVKGEAIGPGDKLILLGGKTGRDGIGGATFASMELGEESEEEDRPSVQVGDPFM 256 (743) T ss_pred EEEECCCCCCCCEEEEEEEECCCHHHEEECCCCCCCCEEEEECCCCCCCCCCCHHHHHHHHCCCHHHHCCCCCCCCCHHH T ss_conf 89974443789607878862011444110467899987999747756566573323104427451553477501278588 Q ss_pred HHHHHHHHHHHCCCCCCHHCCCCCCCCHHHHHHHHHCCCCCCEEEEEHHHCCCCCCCCHHHHHHCCCCCEEEEEEECCCH Q ss_conf 98889998740113830110347752011122332014775289962022046678999898617888508999954675 Q gi|254780970|r 245 GKCLLEACLELMNTDAVIAIQDMGAAGLTCSAIEMGNQGNLGITLDLDKVPTCEEGMTAYEMMLSESQERMLMILNPEKQ 324 (737) Q Consensus 245 ek~~~~~~~~~~~~~~i~~~~D~gaGGl~~a~~Ema~~~~~G~~i~l~~vp~~~~~l~~~ei~~sEsQeR~~~~v~~~~~ 324 (737) ||++||+|+|+.++++|++|||+|||||+||++||+.++++|++|+||+||+||++|+|||||||||||||+++|.|+++ T Consensus 257 Ek~l~ea~le~~~~~~I~~i~DlGAgGLs~A~~Ela~~gg~G~~i~Ld~VP~rE~gMsp~Ei~~SESQERMllvv~p~~~ 336 (743) T COG0046 257 EKRLQEAILECVQTGLIKGIQDLGAGGLSCAISELAAKGGLGAEIDLDKVPLREPGMSPYEIWLSESQERMLLVVAPEDV 336 (743) T ss_pred HHHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHHCCCCEEEEEHHCCCCCCCCCCHHHHHHHCCCHHEEEEECCCCH T ss_conf 89999999998731976887416873132238999733697069980008346799988998873110105899851349 Q ss_pred HHHHHHHHHHCCEEECCCCEEECCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCCCCCCCCCCCCCHHHHHHH Q ss_conf 88999888709713112212104431000258644344321014665011455531565551012346872558899999 Q gi|254780970|r 325 HKAQEILNKWGLHFSIIGITTNDKLFRVIHRGEEVANLPIKALSDEAPEYDRAWCEPITIPPSNPQLLCSDEDCTKSLLK 404 (737) Q Consensus 325 ~~~~~i~~~~~~~~~~iG~vt~~~~~~v~~~g~~v~dlp~~~L~~~~P~~~r~~~~~~~~~~~~~~~~~~~~d~~~~l~~ 404 (737) ++|.+||+||+++|+|||+||++++++++|+|+.++|+|+++|.+++|+|+|++.+|.+..+ .....+...++++++++ T Consensus 337 e~~~~i~~k~~l~~aVVG~vT~~~~~~~~~~ge~v~dlP~~~L~~~aP~~~r~~~~~~~~~~-~~~~~~~~~~l~~a~~~ 415 (743) T COG0046 337 EEFLEICEKERLPAAVVGEVTDEPRLVVDWKGEPVVDLPIDVLAGKAPKYDRPVKEPKKEEA-GDVPEPEEPDLEEALLK 415 (743) T ss_pred HHHHHHHHHCCCCEEEEEEEECCCEEEEEECCCEEEECCHHHHCCCCCEEECCCCCCCCCCC-CCCCCCCCCCHHHHHHH T ss_conf 99999999718980799999437659999789678866579976899744035534565445-55567776309999999 Q ss_pred HHHCCCCCCHHHHHHHCCCCCCCCCCCCCC--CCCCEEEECCCC--CCEEEEECCCCCCCHHCCCHHHHHHHHHHHHHHH Q ss_conf 850732101035653112100011000013--576314203777--6227973045321000010011356789777641 Q gi|254780970|r 405 LLSSPNLSSRRWVYEQYDTMIQSNSIQLPG--GDAGVIRVEGHE--TKALAFSSDVTPRYVKADPFEGTKQAVAECWRNI 480 (737) Q Consensus 405 vL~~pnV~SK~~i~~qyD~~V~g~tv~~Pg--~DaaVv~~~~~~--~~gia~s~g~~p~~~~~dP~~ga~~AV~Eal~Nl 480 (737) +|++||||||+|||+||||+||++|++.|| +||||++++..+ ++|+|||+|+||+|+++|||+||++||+||+||| T Consensus 416 vL~~p~vask~~l~~qyD~~V~~~Tv~~pG~~~DaaV~r~~~~~~~~~g~A~t~~~np~~~~ldpy~Ga~~aVaEa~rNl 495 (743) T COG0046 416 VLSSPNVASKEWLYEQYDRSVGGNTVVDPGPVADAAVLRISEDSGTGKGLAMTTGENPRYALLDPYAGAKLAVAEALRNL 495 (743) T ss_pred HHCCCCHHHHCCEEEECCEEECCCEEECCCCCCCEEEEEECCCCCCCCEEEEECCCCCCCEEECHHHHHHHHHHHHHHHH T ss_conf 86385411112435642423034245058887664799850467876248997488986067776677999999999778 Q ss_pred HCCCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEE Q ss_conf 02785102232210245666862134699999999852210288665443200112368666763111222233352023 Q gi|254780970|r 481 IATGAKPLAITDNLNFGNPEKEEIMGQFVHSVKGIREACQILDFPIVSGNVSFYNETNGQSIFPTPTIAGVGILPDYSLM 560 (737) Q Consensus 481 ~a~Ga~Plait~~lnf~~P~~~e~~~ql~~av~gl~d~c~~lgipivgGkvSl~n~t~~~~I~pTpvi~~vG~v~d~~~~ 560 (737) +|+||+|+++++||||++|++||.||||.++++|+.|+|++|++||+||||||||+|+|.+|+|||+|+++|+++|+++. T Consensus 496 ~a~Ga~pla~~d~lnfg~pe~pE~~~q~~~a~~g~~eac~~l~~pvv~GnvSlynet~~~~i~PTpvi~~vg~v~dv~k~ 575 (743) T COG0046 496 AATGAKPLALTDNLNFGNPEKPEVMGQFVAAVAGLAEACRALGIPVVGGNVSLYNETNGQPIPPTPVIGAVGLVEDVRKV 575 (743) T ss_pred HHHCCCCHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEECCCCCCCCCCCEEEEEEEEECHHHH T ss_conf 76087004120032057999801566799999999999998099800342300200179646996569999986232342 Q ss_pred EECCCCCCCCCEEECCCCCCCCCHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHH Q ss_conf 31021438980751377665543267888861167899964798999999999999999598069946486579999999 Q gi|254780970|r 561 TRIDSAHEGDLILMIGNDGCHLDCSMYSLECASSNIGPPPKVDCHLEKNHGFFVLSMINAKKITACHDISTGGLIITLAE 640 (737) Q Consensus 561 it~~~k~~Gd~I~liG~~~~~LggS~~~~~~~~~~~g~~p~~d~~~~k~~~~~v~~li~~g~I~S~HDiSdGGL~~aL~E 640 (737) +++.+++.|+.||++|.++++||||+|++.+++...+.+|.+|++.+|+.++++++++++++|+|+||+|+|||+++|+| T Consensus 576 ~~~~~~~~g~~i~llG~~~~~lggS~~~~~~~~~~~g~~p~~dl~~ek~~~~~i~~~i~~~~v~aahD~s~GGL~~aLae 655 (743) T COG0046 576 LTPELKKAGLLLYLLGETKDELGGSELAQVYHGLGDGPPPVVDLAEEKKFFDAIRALIADGKVLAAHDVSDGGLAVALAE 655 (743) T ss_pred CCCCCCCCCCEEEEECCCCCCCCHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHH T ss_conf 37554668868999089756665549999985113799997798999999999999975696579987365609999999 Q ss_pred HHHCCCCCEEEEECCCCCCCHHHHHHCCCCCCEEEEECHHHHHHHHHHHHHCCCCEEEEEEECCCEEEEE----CCEECC Q ss_conf 9845897089980566689878887298832189999688099899999857984999899879868980----741214 Q gi|254780970|r 641 MTISSAKGMDIILPIEIEKDPKPFLFGEDQGRYVVCISPENQDLVMSEANNKNIPLRYLGKVSGSTLSIH----NILNIP 716 (737) Q Consensus 641 Ma~as~~G~~I~L~~~~~~d~~~~LFsEs~Gr~ii~V~~~~~~~~~~~~~~~gi~~~~IG~Vt~~~l~i~----~~~~~~ 716 (737) |||++++|++|+++.....++...||||++||+|+++++++.. .......++++.+||.++++..... ..+..+ T Consensus 656 maf~~~~G~~i~~~~~~~~~~~~~LFsE~~gr~i~~~~~~~~~--~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~ 733 (743) T COG0046 656 MAFAGGIGLEIDLDELGDDRLDALLFSESLGRVIVVVAEEEAE--VEGAEAVGVPLKVIGLVGGDHEIGAAVNEFLVPVS 733 (743) T ss_pred HHHCCCCCEEEECCCCCCCCHHHHHHCCCCCCEEEEECCHHHH--HHHHHHCCCEEEEEEECCCCCEEEECCCCCCCCCC T ss_conf 9851587227875545653146675276787389995631444--43201027524563201465067511466544556 Q ss_pred HHHHHHHHHH Q ss_conf 9999999999 Q gi|254780970|r 717 VATLQTKYES 726 (737) Q Consensus 717 i~~l~~~w~~ 726 (737) ++++++.|+. T Consensus 734 ~~el~~~~~~ 743 (743) T COG0046 734 VEELREAWEG 743 (743) T ss_pred HHHHHHHHCC T ss_conf 9999876449 No 4 >PRK05297 phosphoribosylformylglycinamidine synthase; Provisional Probab=100.00 E-value=0 Score=1436.00 Aligned_cols=718 Identities=24% Similarity=0.341 Sum_probs=627.7 Q ss_pred HHHHHHHHHCCC--CHHHHHHHHH---HHCCCCCHHHHHHHHHHHCCCCCCCCCH------------HHHHHHHH----C Q ss_conf 147889986199--9899999999---9578971889999853300110744588------------99975120----6 Q gi|254780970|r 5 SPILEILTAHGL--TKDECDQIIH---ILKRQPTLTEIGIISAMWNEHCSYKSSK------------KWLRTLPT----T 63 (737) Q Consensus 5 ~~~~e~~~~~gL--~~~e~~~i~~---~l~R~Pt~~El~~~~~~wSEHC~~k~~~------------~~l~~~~~----~ 63 (737) +.+.++++++|| +.+|++|+.+ .|||+|||+||+||+|.||||||||+|+ +++++++. . T Consensus 168 ~al~~~n~~~glal~~~ei~yl~~~f~~l~RnptdvEl~~~aQ~nSEHcrHkiF~~~~~idg~~~~~slf~mi~~t~~~~ 247 (1294) T PRK05297 168 AALEAANVELGLALAEDEIDYLVEAFTKLGRNPTDVELMMFAQANSEHCRHKIFNADWTIDGEEQEKSLFKMIKNTHETT 247 (1294) T ss_pred HHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHCCHHHHCCCEECCCEEECCCCCCHHHHHHHHHHHHHC T ss_conf 99998788726467988999999999971899707888776431123412510258523258417345999999999759 Q ss_pred CCCEEECCCCCEEEEE------------------ECCCEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCC--CCCEE Q ss_conf 9748993347736898------------------369839999951788777658888721055810225767--78743 Q gi|254780970|r 64 GKHVIQGPGENAGVVD------------------IGGGDCVVFKMESHNHPSYIEPYQGAATGVGGILRDIFT--MGARP 123 (737) Q Consensus 64 ~~~vi~~~~~~a~vi~------------------~~~~~~~~~k~EtHNhPs~i~P~~GAaTg~gG~iRDi~~--~Ga~p 123 (737) ++++|++|.|||+||+ ..+.++++||+|||||||+|+||+|||||+||+|||+++ |||+| T Consensus 248 ~~~~~sa~~DNa~vi~g~~~~~~~p~~~~~~~~~~~~~~~~~~KvETHNHPTaIePf~GAATG~GG~IRD~~atG~Ga~P 327 (1294) T PRK05297 248 PDGVLSAYKDNAAVMEGSKVGRFFPDPETGRYRYHQEPTHILMKVETHNHPTAISPFPGAATGSGGEIRDEGATGRGSKP 327 (1294) T ss_pred CCCCEEEECCCCCEEECCCCCEEEECCCCCCCEEEEEEEEEEEEECCCCCCCCCCCCCCCCCCCCCEECCCCCCCCCCEE T ss_conf 88707886067524642422124326999865554210058998514789876688897414657612356667876442 Q ss_pred EEEEEECCCCCCCCCC-----------------HHHHHHHHHHHHHHHHHHCCCCCCCCEEE-EEC-----------CCC Q ss_conf 6877504578756863-----------------88899888745886654308721132035-521-----------567 Q gi|254780970|r 124 VAAMNSLRFGAIHHPK-----------------TKHLLSGVVAGIAGYSNSFGVPTVGGEVE-FLP-----------CYN 174 (737) Q Consensus 124 ~a~~~~l~~g~~~~p~-----------------~~~~~~~~~~G~~~ygN~~G~P~v~G~~~-~~~-----------~y~ 174 (737) +|++++|||+++..|. +..|+.+++.|+++||||||+|+++|+++ |.. +|+ T Consensus 328 ~Ag~~gy~~~~lripg~~~Pwe~~~~~P~~i~s~l~I~~~a~~G~s~yGN~fG~P~i~Gy~rtfe~~~~~~~~~e~~gy~ 407 (1294) T PRK05297 328 KAGLTGFSVSNLRIPGFEQPWEEDYGKPERIASALDIMIEGPLGGAAFNNEFGRPNLLGYFRTFEQKVNSHNGGEVRGYH 407 (1294) T ss_pred ECCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCHHHEEEEEEECCCCCCCCCCCCEECCEEEEEEEECCCCCCCEEECCC T ss_conf 03531254046668887887533468864346830046410001410285137733053456632202556775262035 Q ss_pred CCCCCCEEEECCCCHHHHHHCCCCCCCCEEEEECCCCCCCCCHHHHHHHHHHCCCCC-CCCCCCCCCCHHHHHH---HHH Q ss_conf 664221122112113554311225689779996576564441046666777286533-3444332679899988---899 Q gi|254780970|r 175 NNIIVNTFAAGIAKTNAIFSSKARGIGLPLVYLGAKTGRDGIGGASMASEEFGENIA-KKRPTVQVGDPFTGKC---LLE 250 (737) Q Consensus 175 ~~~~v~~~~~Gi~~~~~~~~~~~~~~Gd~ii~~G~~tg~dGi~Ga~~sS~~~~~~~~-~~~~~VQ~gdp~~ek~---~~~ 250 (737) +|+++++++|.++.+++.+..+ .+||+||++|||||||||||+++||.+.+++.+ .|+++||||||+|||| +++ T Consensus 408 -kpim~aGgvG~i~~~~v~k~~~-~~G~~vi~lGG~tgrdGiGGat~SS~~~~~~s~~ld~~sVQrGnp~mErr~qevi~ 485 (1294) T PRK05297 408 -KPIMLAGGIGNIRADHVQKGEI-PVGAKLIVLGGPAMRIGLGGGAASSMASGQSSEDLDFASVQRGNPEMERRCQEVID 485 (1294) T ss_pred -CCEEEEEEEEECCHHHCCCCCC-CCCCEEEEECCCCCCCCCCCHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHH T ss_conf -7769986677046665414589-99999999789655457785012123216764234541102488789999999999 Q ss_pred HHHHHCCCCCCHHCCCCCCCCHHHHHHHHHCCCCCCEEEEEHHHCCCCCCCCHHHHHHCCCCCEEEEEEECCCHHHHHHH Q ss_conf 98740113830110347752011122332014775289962022046678999898617888508999954675889998 Q gi|254780970|r 251 ACLELMNTDAVIAIQDMGAAGLTCSAIEMGNQGNLGITLDLDKVPTCEEGMTAYEMMLSESQERMLMILNPEKQHKAQEI 330 (737) Q Consensus 251 ~~~~~~~~~~i~~~~D~gaGGl~~a~~Ema~~~~~G~~i~l~~vp~~~~~l~~~ei~~sEsQeR~~~~v~~~~~~~~~~i 330 (737) +|+++.++|+|++|||+|||||+||++||++++++|++|+||+||++|++|+|||||||||||||+++|+|+++++|.+| T Consensus 486 ~c~el~~~npI~~IhD~GAGGlsnai~Ela~~~~~G~~i~Ld~Vp~~~~glsp~EIw~sESQERm~l~v~~~~~~~f~~i 565 (1294) T PRK05297 486 RCWQLGDNNPILSIHDVGAGGLSNAFPELVNDGGRGGRFDLRKVPSDEPGMSPLEIWCNESQERYVLAIAPEDLELFEAI 565 (1294) T ss_pred HHHHCCCCCCEEEEEECCCCCCCCCHHHHHHCCCCEEEEEHHHCCCCCCCCCHHHHHHHCCCHHEEEEECHHHHHHHHHH T ss_conf 99971689877999855887311012877613687189992107167899888998664330007999887569999999 Q ss_pred HHHHCCEEECCCCEEECCCCCCC--CCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHC Q ss_conf 88709713112212104431000--2586443443210146650114555315655510123468725588999998507 Q gi|254780970|r 331 LNKWGLHFSIIGITTNDKLFRVI--HRGEEVANLPIKALSDEAPEYDRAWCEPITIPPSNPQLLCSDEDCTKSLLKLLSS 408 (737) Q Consensus 331 ~~~~~~~~~~iG~vt~~~~~~v~--~~g~~v~dlp~~~L~~~~P~~~r~~~~~~~~~~~~~~~~~~~~d~~~~l~~vL~~ 408 (737) |+||+||+++||+||++++++++ ++++.++|||+++|++++|+++|....+.. ..+.......++++++++||+| T Consensus 566 ~~rE~~~~~vVG~vT~~~~l~~~d~~~~~~~vdl~~~~L~g~~P~~~~~~~~~~~---~~~~~~~~~~~l~~al~~vL~~ 642 (1294) T PRK05297 566 CERERCPFAVVGEATEERHLTLNDSHFDNKPVDLPLDVLLGKPPKMHRDVKTVKA---KGPALDYTGIDLAEAVERVLRL 642 (1294) T ss_pred HHHCCCCEEEEEEECCCCEEEEEECCCCCEEEEEEHHHHCCCCCCEEEEECCCCC---CCCCCCCCCCCHHHHHHHHHCC T ss_conf 9980998368998725865999843679704870889965899734776103575---6787677877899999997428 Q ss_pred CCCCCHHHHHHHCCCCCCCCC--------CCCCCCCCCEEEECCCCCCEEEEECCCCCCCHHCCCHHHHHHHHHHHHHHH Q ss_conf 321010356531121000110--------000135763142037776227973045321000010011356789777641 Q gi|254780970|r 409 PNLSSRRWVYEQYDTMIQSNS--------IQLPGGDAGVIRVEGHETKALAFSSDVTPRYVKADPFEGTKQAVAECWRNI 480 (737) Q Consensus 409 pnV~SK~~i~~qyD~~V~g~t--------v~~Pg~DaaVv~~~~~~~~gia~s~g~~p~~~~~dP~~ga~~AV~Eal~Nl 480 (737) ||||||+||++||||+|+|.+ +|.|++||||++.++.+.+|+|||+|++|+++.+|||+|+++||+||+||| T Consensus 643 p~VaSK~fLi~~~DrsVgG~vardq~VGP~Q~PvaD~aV~~~~~~~~~G~Ama~Ge~P~~a~~dP~agar~AVaEAltNL 722 (1294) T PRK05297 643 PTVASKSFLITIGDRSVTGLVARDQMVGPWQVPVADCAVTAASYDGYAGEAMSMGERTPVALLDAAASARMAVGEALTNI 722 (1294) T ss_pred CCCCCCCCEEEECCCCCCCEEEECCCCCCCCCCCCCCEEEEECCCCCCEEEEEECCCCCEEECCHHHHHHHHHHHHHHHH T ss_conf 87345323056414214735730445566545767722785137897269998347882050398899999999999986 Q ss_pred HCCCCCHHH-HHHHCCCCCC-CCCCHHHHHHHHHHHHH-HHHHHCCCCCCCCCCCCCCCCCC--------CCCCCCCCCC Q ss_conf 027851022-3221024566-68621346999999998-52210288665443200112368--------6667631112 Q gi|254780970|r 481 IATGAKPLA-ITDNLNFGNP-EKEEIMGQFVHSVKGIR-EACQILDFPIVSGNVSFYNETNG--------QSIFPTPTIA 549 (737) Q Consensus 481 ~a~Ga~Pla-it~~lnf~~P-~~~e~~~ql~~av~gl~-d~c~~lgipivgGkvSl~n~t~~--------~~I~pTpvi~ 549 (737) +++|+.++. +.++.||++| .+|..-++|+++|+++. ++|.+||+|+++|||||||.+.+ ...|||++|+ T Consensus 723 vaa~i~~l~~i~LSaNWM~a~g~pgE~a~Ly~av~Ai~~e~c~~Lgi~ip~GKDSlSm~~~~~e~~~~~~v~~P~TLVis 802 (1294) T PRK05297 723 AAAPIGDLKRIKLSANWMAAAGHPGEDAGLYDAVKAVGMELCPALGITIPVGKDSLSMKTKWQEGGEDKEVTSPLSLIIS 802 (1294) T ss_pred HHCCCCCHHHEEEEHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEECCCCCCEECCCCCEEEE T ss_conf 63456667663744100445799853689999999999988998398634576666755265218975243378877999 Q ss_pred CCCCCCCCEEEEECCCCCCCCCEEEC--CCCCCCCCHHHHHHHHHCCCCCCCCCC-CHHHHHHHHHHHHHHHHCCCEEEE Q ss_conf 22233352023310214389807513--776655432678888611678999647-989999999999999995980699 Q gi|254780970|r 550 GVGILPDYSLMTRIDSAHEGDLILMI--GNDGCHLDCSMYSLECASSNIGPPPKV-DCHLEKNHGFFVLSMINAKKITAC 626 (737) Q Consensus 550 ~vG~v~d~~~~it~~~k~~Gd~I~li--G~~~~~LggS~~~~~~~~~~~g~~p~~-d~~~~k~~~~~v~~li~~g~I~S~ 626 (737) ++|.++|+++.+||++|+.|+.||++ +.++.+||||.|++.+.. ..+.+|++ |.+.+|+.|+++++++++++|.|+ T Consensus 803 a~a~v~Dv~~~vTPelK~~gs~L~lIdl~~~~~~LGGS~laqv~~~-~g~~~Pdvd~~~~lk~~f~~iq~Li~~glI~S~ 881 (1294) T PRK05297 803 AFAPVEDVRKTLTPQLRTDDTVLLLIDLGRGKNRLGGSALAQVYNQ-LGDEAPDVDDPEDLKGFFNAIQALVAEGLLLAY 881 (1294) T ss_pred EEEEECCCCCCCCCCEECCCCEEEEEECCCCCCCCHHHHHHHHHCC-CCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEE T ss_conf 9997147110326443159987999976899675214499998565-679896977899999999999999977987898 Q ss_pred EECCCCHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHCCCCCCEEEEECHHHHHHHHHHHHHCCCC--EEEEEEECC Q ss_conf 464865799999999845897089980566689878887298832189999688099899999857984--999899879 Q gi|254780970|r 627 HDISTGGLIITLAEMTISSAKGMDIILPIEIEKDPKPFLFGEDQGRYVVCISPENQDLVMSEANNKNIP--LRYLGKVSG 704 (737) Q Consensus 627 HDiSdGGL~~aL~EMa~as~~G~~I~L~~~~~~d~~~~LFsEs~Gr~ii~V~~~~~~~~~~~~~~~gi~--~~~IG~Vt~ 704 (737) ||+|||||++||+||||++++|++|+++.. ..+++..||+|++| +|++|++++.+.+.+.+++.++. +.+||+++. T Consensus 882 HDvSDGGL~~alaEMaFag~~G~~I~l~~~-~~d~~~~LFsEe~G-~Viqv~~~~~~~V~~~l~~~gl~~~~~~IG~~~~ 959 (1294) T PRK05297 882 HDRSDGGLFATLAEMAFAGHCGLDIDLDAL-GDDLLAALFNEELG-AVIQVRAADRDAVEAILAEAGLSDCSHVIGKPNA 959 (1294) T ss_pred EECCCCHHHHHHHHHHHCCCCEEEEECCCC-CHHHHHHHHCCCCE-EEEEECCCCHHHHHHHHHHCCCCCCEEEEEEECC T ss_conf 317778099999999846785148966765-23578887451753-8999713428999999987598520489753767 Q ss_pred C-EEEEEC----CEECCHHHHHHHHHHHHHH Q ss_conf 8-689807----4121499999999999998 Q gi|254780970|r 705 S-TLSIHN----ILNIPVATLQTKYESWFPE 730 (737) Q Consensus 705 ~-~l~i~~----~~~~~i~~l~~~w~~~l~~ 730 (737) + .+.|.. +++.++.+|++.|.++-.+ T Consensus 960 ~~~i~I~~~~~~i~~~~~~~L~~~W~etS~~ 990 (1294) T PRK05297 960 GDRIEITRNGKTVFSESRTELRRWWAETSYQ 990 (1294) T ss_pred CCCEEEEECCEEEEHHHHHHHHHHHHHHHHH T ss_conf 8728999899987362399999999886666 No 5 >TIGR01735 FGAM_synt phosphoribosylformylglycinamidine synthase; InterPro: IPR010073 This entry represents a single-molecule form of phosphoribosylformylglycinamidine synthase, also called FGAM synthase, an enzyme of purine de novo biosynthesis. This form is found mostly in eukaryotes and proteobacteria. In Bacillus subtilis PurL (FGAM synthase II) and PurQ (FGAM synthase I) perform the equivalent function; the unrelated small protein PurS is also required and may be a third subunit .; GO: 0004642 phosphoribosylformylglycinamidine synthase activity, 0006189 'de novo' IMP biosynthetic process. Probab=100.00 E-value=0 Score=1378.45 Aligned_cols=719 Identities=24% Similarity=0.355 Sum_probs=629.2 Q ss_pred HHHHHHHHHCCC--CHHHHHHH---H-HHHCCCCCHHHHHHHHHHHCCCCCCCCC------------HHHHHHHHHC--- Q ss_conf 147889986199--98999999---9-9957897188999985330011074458------------8999751206--- Q gi|254780970|r 5 SPILEILTAHGL--TKDECDQI---I-HILKRQPTLTEIGIISAMWNEHCSYKSS------------KKWLRTLPTT--- 63 (737) Q Consensus 5 ~~~~e~~~~~gL--~~~e~~~i---~-~~l~R~Pt~~El~~~~~~wSEHC~~k~~------------~~~l~~~~~~--- 63 (737) ..+.+++.++|| +.+|+||+ + ..|+|+|+||||+||||.||||||||+| |++++++++| T Consensus 183 ~ALE~AN~elGLAL~~de~DYl~~~fr~~L~RnP~dVEL~mfAQaNSEHcRHkiFna~~~idG~~q~kSLF~mIk~T~e~ 262 (1401) T TIGR01735 183 KALEKANQELGLALDEDEIDYLTKRFRQELKRNPSDVELMMFAQANSEHCRHKIFNADWIIDGKKQDKSLFQMIKSTHEA 262 (1401) T ss_pred HHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCHHHHHHHHCCCEEECCCHHCCHHHHHHHHHHHC T ss_conf 99999989853045756889999999998378987246887753031010003112533778812110378988988733 Q ss_pred -CCCEEECCCCCEEEEEEC--------------------CCEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCC--CCCC Q ss_conf -974899334773689836--------------------98399999517887776588887210558102257--6778 Q gi|254780970|r 64 -GKHVIQGPGENAGVVDIG--------------------GGDCVVFKMESHNHPSYIEPYQGAATGVGGILRDI--FTMG 120 (737) Q Consensus 64 -~~~vi~~~~~~a~vi~~~--------------------~~~~~~~k~EtHNhPs~i~P~~GAaTg~gG~iRDi--~~~G 120 (737) .+++|++|.|||+||+.. +..+|+||+|||||||||.||+||+||.||+|||. ++.| T Consensus 263 ~~~~~v~aykDNssvI~G~~v~r~rP~~~t~~~y~~~~~~~~~i~~kvETHNhPTAiaPFpGA~TGaGGeIRDegATGRG 342 (1401) T TIGR01735 263 NPENTVSAYKDNSSVIEGHKVGRLRPDPPTRVEYRQHQEDDVHILMKVETHNHPTAIAPFPGASTGAGGEIRDEGATGRG 342 (1401) T ss_pred CCCCEEEEEECCHHHHCCCCEECCCCCCCCCCHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCEECCCCCCCC T ss_conf 99960898831124654742000078887660244431267218998513788864677678887878736436644887 Q ss_pred CEEEEEEEECCCCCCCCC------------------CHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEECCC-CC------ Q ss_conf 743687750457875686------------------38889988874588665430872113203552156-76------ Q gi|254780970|r 121 ARPVAAMNSLRFGAIHHP------------------KTKHLLSGVVAGIAGYSNSFGVPTVGGEVEFLPCY-NN------ 175 (737) Q Consensus 121 a~p~a~~~~l~~g~~~~p------------------~~~~~~~~~~~G~~~ygN~~G~P~v~G~~~~~~~y-~~------ 175 (737) |+|+|+|.+|||||+..| .+..|+.+...|.|+|||+||+|++.|++|.++.. +. T Consensus 343 a~pkAGl~Gf~VsNL~IPg~~~PWE~~S~~~P~~~aspLdImieap~G~aafnNeFGrP~L~GYfRtfe~~~~~P~~~~R 422 (1401) T TIGR01735 343 AKPKAGLTGFCVSNLNIPGLEQPWEDPSFQKPERIASPLDIMIEAPLGAAAFNNEFGRPALLGYFRTFELKVSLPGGEVR 422 (1401) T ss_pred CCCCCCCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHCCCCHHHHCCCCCCCHHHHHHHHHHHHHCCCCCCCC T ss_conf 71433345610202517788878875421365224576787640676524327765773022233203332116775544 Q ss_pred -CCCCCEEEECCCCHHHHHHCCCC-CCCCEEEEECCCCCCCCCHHHHHHH-HHHCCCCCC-CCCCCCCCCHHHHHH---H Q ss_conf -64221122112113554311225-6897799965765644410466667-772865333-444332679899988---8 Q gi|254780970|r 176 -NIIVNTFAAGIAKTNAIFSSKAR-GIGLPLVYLGAKTGRDGIGGASMAS-EEFGENIAK-KRPTVQVGDPFTGKC---L 248 (737) Q Consensus 176 -~~~v~~~~~Gi~~~~~~~~~~~~-~~Gd~ii~~G~~tg~dGi~Ga~~sS-~~~~~~~~~-~~~~VQ~gdp~~ek~---~ 248 (737) ...|+|+++||+.++..+-.|.. .+|+.||+||||.||||+||+++|| ...+.+.++ ||+|||||||+|||| | T Consensus 423 ~y~KPIMlaGGiG~I~~~hi~K~e~~~G~~~i~lGGPam~IGlGGGAASSvm~~G~n~~~LDF~sVQR~nPEMerr~qeV 502 (1401) T TIGR01735 423 GYHKPIMLAGGIGSIDAEHIQKGEIEVGALLIVLGGPAMLIGLGGGAASSVMVSGTNDADLDFASVQRGNPEMERRCQEV 502 (1401) T ss_pred CCCCCHHHHHHHCCHHHHCCCCCCCCCCCEEEEECCCEEECCCCCCCHHCEEECCCCCHHHCCCCCCCCCHHHHHHHHHH T ss_conf 43351244310100012121057878886899857862254565410000010578815524333577886887688899 Q ss_pred HHHHHHHCCCCCCHHCCCCCCCCHHHHHHHHHCCCCCCEEEEEHHHCCCCCCCCHHHHHHCCCCCEEEEEEECCCHHHHH Q ss_conf 99987401138301103477520111223320147752899620220466789998986178885089999546758899 Q gi|254780970|r 249 LEACLELMNTDAVIAIQDMGAAGLTCSAIEMGNQGNLGITLDLDKVPTCEEGMTAYEMMLSESQERMLMILNPEKQHKAQ 328 (737) Q Consensus 249 ~~~~~~~~~~~~i~~~~D~gaGGl~~a~~Ema~~~~~G~~i~l~~vp~~~~~l~~~ei~~sEsQeR~~~~v~~~~~~~~~ 328 (737) ||+|+++.++|||.+|||+|||||+||++|+..+++.|+.|+|.+||+.|++|||+||||+||||||+|.|.+++++.|. T Consensus 503 Id~C~~lG~~NPI~siHDvGAGGlsNa~pEL~~d~~~Ga~i~lr~v~~~d~~lspLEIWcnEsQEryvL~v~~~~l~~f~ 582 (1401) T TIGR01735 503 IDRCVQLGEKNPIISIHDVGAGGLSNALPELINDGGRGAVIDLRAVPLDDPGLSPLEIWCNESQERYVLLVKAEDLEIFT 582 (1401) T ss_pred HHHHHHCCCCCCEEEECCCCCCCCCCCHHHHCCCCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCCHHHHHH T ss_conf 99998587986846771257572001001000614478558884652588887767663004555577632720079999 Q ss_pred HHHHHHCCEEECCCCEEECCCCCCCCCC-----------------------------------CCCCCCCCCCCCCCCCE Q ss_conf 9888709713112212104431000258-----------------------------------64434432101466501 Q gi|254780970|r 329 EILNKWGLHFSIIGITTNDKLFRVIHRG-----------------------------------EEVANLPIKALSDEAPE 373 (737) Q Consensus 329 ~i~~~~~~~~~~iG~vt~~~~~~v~~~g-----------------------------------~~v~dlp~~~L~~~~P~ 373 (737) .||+||+||++|||++|++.|++|.+.- ..+||||++.|+|++|+ T Consensus 583 ~iC~RERcPfAvvG~at~~~~l~l~D~~~lPaalhvteqqtPqranht~~~~~~~~~~P~~~~n~pvDLpLevlLGK~Pk 662 (1401) T TIGR01735 583 AICERERCPFAVVGTATGDRRLILVDDRELPAALHVTEQQTPQRANHTEVAKTSKEDAPSHFPNNPVDLPLEVLLGKAPK 662 (1401) T ss_pred HHHHHHCCCEEEEEEEECCCEEEEEECCCCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHCCCCCC T ss_conf 88564069837898786682489973761440100000257430113556668788631232138642740534488786 Q ss_pred EEEEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCCCHHHHHHHCCCCCCC--------CCCCCCCCCCCEEEECCC Q ss_conf 145553156555101234687255889999985073210103565311210001--------100001357631420377 Q gi|254780970|r 374 YDRAWCEPITIPPSNPQLLCSDEDCTKSLLKLLSSPNLSSRRWVYEQYDTMIQS--------NSIQLPGGDAGVIRVEGH 445 (737) Q Consensus 374 ~~r~~~~~~~~~~~~~~~~~~~~d~~~~l~~vL~~pnV~SK~~i~~qyD~~V~g--------~tv~~Pg~DaaVv~~~~~ 445 (737) ++|...++.+ ...+.......++.+++.|||+||+||||+||++++||+|+| |++|.|++||||+...++ T Consensus 663 ~~r~~~~~a~--~~~pl~~~~~l~l~~A~~RVLRlP~VA~K~FL~TigDRsvgGLvAr~QmVGP~Q~PlADvAVt~~sf~ 740 (1401) T TIGR01735 663 MTRKVKRKAP--AKQPLDIAPGLDLHEALERVLRLPAVASKRFLITIGDRSVGGLVARDQMVGPWQIPLADVAVTAASFD 740 (1401) T ss_pred CCCCCCCCCC--CCCCCCCCCCCCHHHHHHHHHCCCCHHHCCCCEEECCEEECCCCCCCCCCCCCCCCHHHHHHHHHCCC T ss_conf 5423267787--67774678786489998876306200203443330220125620122000661565677898872102 Q ss_pred CCCEEEEECCCCCCCHHCCCHHHHHHHHHHHHHHHHCCCCCHH---HHHHHCCCCCCCCCCHHHHHHHHHHHH-HHHHHH Q ss_conf 7622797304532100001001135678977764102785102---232210245666862134699999999-852210 Q gi|254780970|r 446 ETKALAFSSDVTPRYVKADPFEGTKQAVAECWRNIIATGAKPL---AITDNLNFGNPEKEEIMGQFVHSVKGI-REACQI 521 (737) Q Consensus 446 ~~~gia~s~g~~p~~~~~dP~~ga~~AV~Eal~Nl~a~Ga~Pl---ait~~lnf~~P~~~e~~~ql~~av~gl-~d~c~~ 521 (737) ++.|-||++||+|..+.+||.++||+||+||+|||+++-+..| .++.||||.--..-|. ..||++|+++ .++|.+ T Consensus 741 ~y~GeA~A~GE~p~~aLLD~~A~ARlAVgEAiTNl~~A~v~~L~~vKlSaNWM~AA~~~GEd-a~LYdav~A~G~e~c~a 819 (1401) T TIGR01735 741 TYTGEAMAIGERPPLALLDPKASARLAVGEAITNLAAALVGDLSDVKLSANWMAAAGHPGED-AALYDAVKAVGSELCPA 819 (1401) T ss_pred CCCCCCEECCCCCCHHHCCHHHHHHHHHHHHHHHHHHCCCCHHHCCCCCCHHHHHCCCCCCC-HHHHHHHHHHHHHHHHH T ss_conf 12210111144110233385688889999999887530400110324450279746899961-78999999999999886 Q ss_pred CCCCCCCCCCCCCCCCCCCC-----C--CCCCCCCCCCCCCCCEEEEECCCCCC-----CCCEEE-CCCCCCCCCHHHHH Q ss_conf 28866544320011236866-----6--76311122223335202331021438-----980751-37766554326788 Q gi|254780970|r 522 LDFPIVSGNVSFYNETNGQS-----I--FPTPTIAGVGILPDYSLMTRIDSAHE-----GDLILM-IGNDGCHLDCSMYS 588 (737) Q Consensus 522 lgipivgGkvSl~n~t~~~~-----I--~pTpvi~~vG~v~d~~~~it~~~k~~-----Gd~I~l-iG~~~~~LggS~~~ 588 (737) |||.|++|||||||+|.|.. + |-+++||++..|+|+++.+||++|.. +++||+ ++..+.+||||+++ T Consensus 820 Lgi~IpvGKDSLSM~~~w~~ge~~~V~aP~SLvISAfa~v~Dvr~tvTP~Lk~~~~v~~s~Ll~vdL~~Gk~RLGGsaLA 899 (1401) T TIGR01735 820 LGIAIPVGKDSLSMKTRWQDGEKKSVTAPVSLVISAFAPVPDVRKTVTPDLKLDKDVGDSELLLVDLGKGKNRLGGSALA 899 (1401) T ss_pred CCCCCCCCCCCCCHHHHHCCCCCEEEEECCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCEEE T ss_conf 48614777776211321048874088806731787556336643357854457887664314688743476541443433 Q ss_pred HHHH-----CCCCCCCCC-CCHHHHHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHCCCCCEEEEECCCCCCC-- Q ss_conf 8861-----167899964-79899999999999999959806994648657999999998458970899805666898-- Q gi|254780970|r 589 LECA-----SSNIGPPPK-VDCHLEKNHGFFVLSMINAKKITACHDISTGGLIITLAEMTISSAKGMDIILPIEIEKD-- 660 (737) Q Consensus 589 ~~~~-----~~~~g~~p~-~d~~~~k~~~~~v~~li~~g~I~S~HDiSdGGL~~aL~EMa~as~~G~~I~L~~~~~~d-- 660 (737) |.+. -...+.+|+ .+.+.+|+.|..+|.|+++++++|+||+|||||+++|+||||||+||++||++...... T Consensus 900 QV~~dsaalP~lG~~~PDlD~~~~Lk~fF~~~Q~L~a~~~llAyHDrSDGGL~~~l~EMAFAG~cGl~~d~~~~~~~~~~ 979 (1401) T TIGR01735 900 QVFGDSAALPQLGGDCPDLDDVERLKAFFAVIQGLVAEDLLLAYHDRSDGGLVTTLLEMAFAGHCGLDVDLDALGDSLEI 979 (1401) T ss_pred EEECHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEECCCCCHHHHHHHHHHHHHHCCCCEEEECCCCHHHHHH T ss_conf 21000243344067888988867899999999999766870401511425899999998751368657613442013667 Q ss_pred ------------------------HHHHHHCCCCCCEEEEECHHHHHHHHHHHHHCCCCEEEEEEEC---CCEEEEEC-- Q ss_conf ------------------------7888729883218999968809989999985798499989987---98689807-- Q gi|254780970|r 661 ------------------------PKPFLFGEDQGRYVVCISPENQDLVMSEANNKNIPLRYLGKVS---GSTLSIHN-- 711 (737) Q Consensus 661 ------------------------~~~~LFsEs~Gr~ii~V~~~~~~~~~~~~~~~gi~~~~IG~Vt---~~~l~i~~-- 711 (737) ....||||++| +||+|.+.+.+.|.+.++.+++.+.+++.++ ...+.|+. T Consensus 980 ~nhtalsqslrteevkalaewqeti~~~LF~EELG-aV~qV~~~~~~~v~~~~~~~~L~~~~~~~~~~~~~~~i~i~~~~ 1058 (1401) T TIGR01735 980 LNHTALSQSLRTEEVKALAEWQETILAVLFNEELG-AVIQVAEEDLAAVLELLRAAGLTALILGIGTPTDEPSIEISVNG 1058 (1401) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC-CEEEECCCHHHHHHHHHHHCCCCEEEEECCCCCCCCEEEEEECC T ss_conf 76677776654678899988899999863022545-36883241178999999864871333131401567778986088 Q ss_pred --CEECCHHHHHHHHHHH Q ss_conf --4121499999999999 Q gi|254780970|r 712 --ILNIPVATLQTKYESW 727 (737) Q Consensus 712 --~~~~~i~~l~~~w~~~ 727 (737) .++....+|+++|+.+ T Consensus 1059 ~~~~s~~~~~L~~~W~et 1076 (1401) T TIGR01735 1059 ATLLSEKRSELRDIWEET 1076 (1401) T ss_pred EEEEHHHHHHHHHHHHHH T ss_conf 465025689999988889 No 6 >KOG1907 consensus Probab=100.00 E-value=0 Score=1118.26 Aligned_cols=722 Identities=23% Similarity=0.294 Sum_probs=623.1 Q ss_pred HHHHHHHHHCCCC--HHHHHHHH----HHHCCCCCHHHHHHHHHHHCCCCCCCCCH------------HHHHHHHH---- Q ss_conf 1478899861999--89999999----99578971889999853300110744588------------99975120---- Q gi|254780970|r 5 SPILEILTAHGLT--KDECDQII----HILKRQPTLTEIGIISAMWNEHCSYKSSK------------KWLRTLPT---- 62 (737) Q Consensus 5 ~~~~e~~~~~gL~--~~e~~~i~----~~l~R~Pt~~El~~~~~~wSEHC~~k~~~------------~~l~~~~~---- 62 (737) +.+.++++++||+ +.|++|+. +.++|+|||+||+||+|.||||||||+|+ ++++++.+ T Consensus 184 ~~L~~aNqelGLAld~~dldyy~~~f~~~~kR~PTdVELFd~aQsnSEHsRHwfF~g~~~IDg~k~d~SL~kmIr~T~~~ 263 (1320) T KOG1907 184 QALEKANQELGLALDQLDLDYYYDLFVEEVKRNPTDVELFDFAQSNSEHSRHWFFRGDWTIDGRKQDKSLFKMIRNTHDS 263 (1320) T ss_pred HHHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCCCCCEEEEEECCCCCHHHHHHEECCCEEEECCCCCHHHHHHHHHHHHC T ss_conf 99998778745345734179999999998537988425554002561653421441526870620763799999999861 Q ss_pred CCCCEEECCCCCEEEEEEC-------------------CCEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCC--CCCCC Q ss_conf 6974899334773689836-------------------98399999517887776588887210558102257--67787 Q gi|254780970|r 63 TGKHVIQGPGENAGVVDIG-------------------GGDCVVFKMESHNHPSYIEPYQGAATGVGGILRDI--FTMGA 121 (737) Q Consensus 63 ~~~~vi~~~~~~a~vi~~~-------------------~~~~~~~k~EtHNhPs~i~P~~GAaTg~gG~iRDi--~~~Ga 121 (737) ..++.+++|.|||++++.- +...+.||+|||||||||.||+||+||.||+|||+ ++.|+ T Consensus 264 n~~ntiiafsDNssa~~gf~~~~~~~P~s~v~~~~~~~~~~~ll~~aETHN~PtAVsPfpGA~TGtGGrIRD~~atGrGs 343 (1320) T KOG1907 264 NNDNTIIAFSDNSSAIRGFNSVTRFAPNSTVSPWIVIFEPSDLLFKAETHNHPTAVSPFPGATTGTGGRIRDEGATGRGS 343 (1320) T ss_pred CCCCCEEEECCCHHHHHCCCCCEEECCCCCCCCCEEECCCCCEEEEEECCCCCCCCCCCCCCCCCCCCEEECCCCCCCCC T ss_conf 89974478636567751545442653566545524413776558886305787535788887468786352255446664 Q ss_pred EEEEEEEECCCCCCCCCC------------------HHHHHHHHHHHHHHHHHHCCCCCCCCEEEEECCC--------CC Q ss_conf 436877504578756863------------------8889988874588665430872113203552156--------76 Q gi|254780970|r 122 RPVAAMNSLRFGAIHHPK------------------TKHLLSGVVAGIAGYSNSFGVPTVGGEVEFLPCY--------NN 175 (737) Q Consensus 122 ~p~a~~~~l~~g~~~~p~------------------~~~~~~~~~~G~~~ygN~~G~P~v~G~~~~~~~y--------~~ 175 (737) .++|.|.++|+|+++.|- +..|+.++..|.+||||+||.|+|.|+++.+... .+ T Consensus 344 ~~~agtaGysvg~LnipG~~qPwE~l~f~yP~~iA~pl~ImIEAsnGaSdygNkFGeP~I~Gy~rtfg~~v~~~~gerre 423 (1320) T KOG1907 344 YEIAGTAGYSVGDLNIPGYKQPWEDLDFGYPYHIASPLDIMIEASNGASDYGNKFGEPVISGYARTFGMRVLLENGERRE 423 (1320) T ss_pred EEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHCCCCHHHHCCCCCCHHHHHHHHHHCCCCCCCCCCCCC T ss_conf 23303444220135678887887423558864447877877227577044435236624555666541123566743001 Q ss_pred CCCCCEEEECCCCHHHHHHCC-CCCCCCEEEEECCCCCCCCCHHHHHHHHHHCCCC-CCCCCCCCCCCHHHHHH---HHH Q ss_conf 642211221121135543112-2568977999657656444104666677728653-33444332679899988---899 Q gi|254780970|r 176 NIIVNTFAAGIAKTNAIFSSK-ARGIGLPLVYLGAKTGRDGIGGASMASEEFGENI-AKKRPTVQVGDPFTGKC---LLE 250 (737) Q Consensus 176 ~~~v~~~~~Gi~~~~~~~~~~-~~~~Gd~ii~~G~~tg~dGi~Ga~~sS~~~~~~~-~~~~~~VQ~gdp~~ek~---~~~ 250 (737) ...++++++||+.+++.+..| +..||..||++|||.+|+|+||+++||...++.. +.||++||||||+||+| +++ T Consensus 424 y~KPIMfsgGiG~i~~~~~~K~~~apg~~likiGGp~yrIG~GGGAASSv~~G~g~aeLDFaaVQRGdaEM~~kl~rVvr 503 (1320) T KOG1907 424 YHKPIMFSGGIGTIRKQHARKEPIAPGQLLIKIGGPVYRIGVGGGAASSVVQGEGSAELDFAAVQRGDAEMERKLQRVVR 503 (1320) T ss_pred CCCCEEEECCCCCCCHHHHHCCCCCCCCEEEEECCCEEEEECCCCHHHHHHCCCCCCCCCHHHHHCCCHHHHHHHHHHHH T ss_conf 12453760565766877752379999977999668268983266244455437984234568763489889889999999 Q ss_pred HHHHHCCCCCCHHCCCCCCCCHHHHHHHHHCCCCCCEEEEEHHHCCCCCCCCHHHHHHCCCCCEEEEEEECCCHHHHHHH Q ss_conf 98740113830110347752011122332014775289962022046678999898617888508999954675889998 Q gi|254780970|r 251 ACLELMNTDAVIAIQDMGAAGLTCSAIEMGNQGNLGITLDLDKVPTCEEGMTAYEMMLSESQERMLMILNPEKQHKAQEI 330 (737) Q Consensus 251 ~~~~~~~~~~i~~~~D~gaGGl~~a~~Ema~~~~~G~~i~l~~vp~~~~~l~~~ei~~sEsQeR~~~~v~~~~~~~~~~i 330 (737) ||.++.++|+|.+|||+||||.++++.|+.+.++.|++++++++++.|++|||+||||+|||||+++.|++++++.|++| T Consensus 504 AC~~lge~NpI~sIHDqGAGGn~NvlkELV~~~~~Ga~~~~r~~~~gdpsmS~~EiW~aEyQE~~allv~a~~l~~le~I 583 (1320) T KOG1907 504 ACAELGENNPIQSIHDQGAGGNGNVLKELVEDNDLGATFDSRTFQLGDPSMSPMEIWCAEYQENYALLVKAEDLDILESI 583 (1320) T ss_pred HHHHHCCCCCEEEEECCCCCCCCCCCHHHCCCCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHCCEEEECHHHHHHHHHH T ss_conf 99973479962676225778643120866166677617974135407987779998645442152344379899999999 Q ss_pred HHHHCCEEECCCCEEECCCCCCCCC--CCCCCCCCCCCCCCCCCEEEEEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHC Q ss_conf 8870971311221210443100025--86443443210146650114555315655510123468725588999998507 Q gi|254780970|r 331 LNKWGLHFSIIGITTNDKLFRVIHR--GEEVANLPIKALSDEAPEYDRAWCEPITIPPSNPQLLCSDEDCTKSLLKLLSS 408 (737) Q Consensus 331 ~~~~~~~~~~iG~vt~~~~~~v~~~--g~~v~dlp~~~L~~~~P~~~r~~~~~~~~~~~~~~~~~~~~d~~~~l~~vL~~ 408 (737) |+|++||++|||++|.++|+++..+ .++.+|+++..|++++|++.|+..++.+ ..++...++...+.++|.+||+. T Consensus 584 ckRERcp~svVG~vt~eqR~~l~d~~~~~~aidl~~~~llG~~pK~s~~l~~~~~--~lk~l~lp~~~~l~dAl~RVL~L 661 (1320) T KOG1907 584 CKRERCPVSVVGEVTGEQRVILLDKLAKETAIDLEMRQLLGKPPKMSRKLKRAPR--VLKKLELPSGLTLRDALERVLRL 661 (1320) T ss_pred HHHCCCCEEEEEEECCCCEEEEECCCCCCCCCCCCHHHHCCCCCCCCCCCCCCCC--CCCCCCCCCCCHHHHHHHHHHCC T ss_conf 8742687268998726715998257789886377348662898201301355423--44100489987099999998605 Q ss_pred CCCCCHHHHHHHCCCCCCC--------CCCCCCCCCCCEEEECCCCCCEEEEECCCCCCCHHCCCHHHHHHHHHHHHHHH Q ss_conf 3210103565311210001--------10000135763142037776227973045321000010011356789777641 Q gi|254780970|r 409 PNLSSRRWVYEQYDTMIQS--------NSIQLPGGDAGVIRVEGHETKALAFSSDVTPRYVKADPFEGTKQAVAECWRNI 480 (737) Q Consensus 409 pnV~SK~~i~~qyD~~V~g--------~tv~~Pg~DaaVv~~~~~~~~gia~s~g~~p~~~~~dP~~ga~~AV~Eal~Nl 480 (737) |+|+||+|++++.||+|+| +++|.|.+|+||+..++.+.+|.|+++|++|....+||.++|||+|+|+++|| T Consensus 662 psV~SKrfLttkvDRsVtGLiaqqQ~VGPlqvplADvavt~~S~~~~tG~A~amGeqPik~Lida~a~ARm~VaE~l~NL 741 (1320) T KOG1907 662 PSVASKRFLTTKVDRSVTGLIAQQQCVGPLQVPLADVAVTATSHFDTTGEAVAMGEQPIKALIDAAASARMCVAEALMNL 741 (1320) T ss_pred CCCCCCCEEEEECCCHHHHEEEHHHCCCCCCCCCEEEEEEEEEEECCCCCCCCCCCCCHHHHHCHHHHHHHHHHHHHHHH T ss_conf 33243311665225101102211110476324202136788887615561000366464766087777678999999876 Q ss_pred HCCCCCHHH-HHHHCCCCCCCCCCH-HHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCC--CCCCC--CCCCCCCCCCC Q ss_conf 027851022-322102456668621-3469999999985221028866544320011236--86667--63111222233 Q gi|254780970|r 481 IATGAKPLA-ITDNLNFGNPEKEEI-MGQFVHSVKGIREACQILDFPIVSGNVSFYNETN--GQSIF--PTPTIAGVGIL 554 (737) Q Consensus 481 ~a~Ga~Pla-it~~lnf~~P~~~e~-~~ql~~av~gl~d~c~~lgipivgGkvSl~n~t~--~~~I~--pTpvi~~vG~v 554 (737) +++.++-+. +..+-||+++.+.+. =..|++|++++.....+|||.|.||||||+|..+ ++.+. -|++|+++..+ T Consensus 742 v~a~i~sL~dvK~SgNWM~aak~~GEGarlydAv~aL~~~L~eLgiAIdgGKDSlSMa~k~~ge~VkaPgtLvIsayapc 821 (1320) T KOG1907 742 VAAKITSLKDVKLSGNWMWAAKHPGEGARLYDAVQALCLGLCELGIAIDGGKDSLSMAMKWDGEVVKAPGTLVISAYAPC 821 (1320) T ss_pred HHHCCCCHHHEEECCCEEECCCCCCCCHHHHHHHHHHHHHHHHHCEEECCCCCCHHHHEEECCEEEECCCEEEEEEECCC T ss_conf 74136504320202535210569984047999999999999981531037852011000158808715753899861357 Q ss_pred CCCEEEEECCCCC--CCCC--EEE--CCCCCCCCCHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEE Q ss_conf 3520233102143--8980--751--377665543267888861167899964798999999999999999598069946 Q gi|254780970|r 555 PDYSLMTRIDSAH--EGDL--ILM--IGNDGCHLDCSMYSLECASSNIGPPPKVDCHLEKNHGFFVLSMINAKKITACHD 628 (737) Q Consensus 555 ~d~~~~it~~~k~--~Gd~--I~l--iG~~~~~LggS~~~~~~~~~~~g~~p~~d~~~~k~~~~~v~~li~~g~I~S~HD 628 (737) +|+.+.+||++|. .|+. |++ +++.+.+||||.+++.+.+.....|...|...+++.|..+++|++.++|.|+|| T Consensus 822 ~dv~k~vtP~Lk~~~~gs~~~Ll~i~l~~~k~rLGgSaLaQvy~QiG~d~Pdl~~~~~lk~~f~~vqqL~~~~ii~AgHD 901 (1320) T KOG1907 822 PDVTKTVTPDLKANVDGSKTSLLWIDLANSKMRLGGSALAQVYSQIGDDCPDLDNFDELKKFFSVVQQLLNEGIILAGHD 901 (1320) T ss_pred CCCCEEECCCCCCCCCCCCEEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEECCC T ss_conf 87310425122477899721799998455322354589999999737889985334889889999999975673342553 Q ss_pred CCCCHHHHHHHHHHHCCCCCEEEEECCCCC-CCHHHHHHCCCCCCEEEEECHHHHHHHHHHHHHCCCCEEEEEEEC---C Q ss_conf 486579999999984589708998056668-987888729883218999968809989999985798499989987---9 Q gi|254780970|r 629 ISTGGLIITLAEMTISSAKGMDIILPIEIE-KDPKPFLFGEDQGRYVVCISPENQDLVMSEANNKNIPLRYLGKVS---G 704 (737) Q Consensus 629 iSdGGL~~aL~EMa~as~~G~~I~L~~~~~-~d~~~~LFsEs~Gr~ii~V~~~~~~~~~~~~~~~gi~~~~IG~Vt---~ 704 (737) +|||||+++++||||++++|++|||+.... ..+...||+|+.| +|++|...+.+.+++++.++|+++++||++. + T Consensus 902 ~SDGGLlvt~lEMAfag~~gi~idl~~~~~~~~~~~~LF~EElG-~v~evs~~dl~~v~~~~~~~gv~~~~ig~~~~~~g 980 (1320) T KOG1907 902 ISDGGLLVTLLEMAFAGNVGIEIDLDSPNQNIKLFDILFAEELG-AVLEVSDTDLEKVLEIFSEAGVKCEYIGKASAVFG 980 (1320) T ss_pred CCCCCHHHHHHHHHHHCCCCEEEECCCCCCCCCHHHHHHHHHHC-EEEEECCCCHHHHHHHHHHCCCCCEEEEEECCCCC T ss_conf 56772008899987604662278547743456188899898617-17786265399999999864986125411003447 Q ss_pred -C-EEEEEC----CEECCHHHHHHHHHHHHH Q ss_conf -8-689807----412149999999999999 Q gi|254780970|r 705 -S-TLSIHN----ILNIPVATLQTKYESWFP 729 (737) Q Consensus 705 -~-~l~i~~----~~~~~i~~l~~~w~~~l~ 729 (737) + .++++. +++..+.+|++.|+.+=- T Consensus 981 ~~~~i~~~~~g~~~~~~~~s~L~~~We~tsy 1011 (1320) T KOG1907 981 QDAHIKISVNGHLILNEKLSDLREEWELTSY 1011 (1320) T ss_pred CCCEEEEECCCEEEECCHHHHHHHHHHHHHH T ss_conf 7745999518857844547899999988578 No 7 >TIGR01857 FGAM-synthase phosphoribosylformylglycinamidine synthase; InterPro: IPR010141 This entry represents a single-molecule form of phosphoribosylformylglycinamidine synthase, also called FGAM synthase, an enzyme of purine de novo biosynthesis, which represent a second clade of the enzymes found in Clostridia, Bifidobacteria and Streptococcus species. This enzyme performs the fourth step in IMP biosynthesis (the precursor of all purines) from PRPP.. Probab=100.00 E-value=0 Score=1094.44 Aligned_cols=697 Identities=22% Similarity=0.321 Sum_probs=607.9 Q ss_pred CHHHHHHHHHCCC--CHHHHHHHHHHH---CCCCCHHHHHHHHHHHCCCCCCCCCHHHHHHHH-----------H----- Q ss_conf 2147889986199--989999999995---789718899998533001107445889997512-----------0----- Q gi|254780970|r 4 PSPILEILTAHGL--TKDECDQIIHIL---KRQPTLTEIGIISAMWNEHCSYKSSKKWLRTLP-----------T----- 62 (737) Q Consensus 4 ~~~~~e~~~~~gL--~~~e~~~i~~~l---~R~Pt~~El~~~~~~wSEHC~~k~~~~~l~~~~-----------~----- 62 (737) .+.++.+..+.|| +.++++.|++++ +|+||.||+.++++|||+||||.+|-+.|+.+. + T Consensus 178 ~e~~~~f~~e~GLAM~l~DL~fiq~YFk~~~R~PT~TEiKVlDTYWSDHCRHTTFET~l~~v~~~~skf~~~~~~~y~~Y 257 (1279) T TIGR01857 178 AEDLKKFKAEKGLAMSLEDLKFIQDYFKEIGRNPTITEIKVLDTYWSDHCRHTTFETELKEVTIEDSKFSKQLKKAYEDY 257 (1279) T ss_pred HHHHHHHHHHCCCCCCHHHHHHHHHHHCCCCCCCCEEEEEEECCCCCCCCCCCCCCCEEEEEECCCCCCCHHHHHHHHHH T ss_conf 78888777543764227899988766123789886156644326657885157544344047648642213789989899 Q ss_pred ---------C----CCCEEECC---------------------CCCEEEEEE-------C-----CCEEEEEEECCCCCC Q ss_conf ---------6----97489933---------------------477368983-------6-----983999995178877 Q gi|254780970|r 63 ---------T----GKHVIQGP---------------------GENAGVVDI-------G-----GGDCVVFKMESHNHP 96 (737) Q Consensus 63 ---------~----~~~vi~~~---------------------~~~a~vi~~-------~-----~~~~~~~k~EtHNhP 96 (737) + -|-.|+-. --||+.|.+ . ++|+++||+|||||| T Consensus 258 ~~~r~~l~~~nln~k~v~LMDmATI~~kylkk~G~Ld~LevSeEiNACSv~v~vdv~~~~GkkaiE~wLLmFKNETHNHP 337 (1279) T TIGR01857 258 LAMREELGRENLNEKPVSLMDMATIVAKYLKKNGKLDDLEVSEEINACSVEVEVDVDGVEGKKAIEEWLLMFKNETHNHP 337 (1279) T ss_pred HHHHHHHCHHHHCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEEEEEEECCCCCCCHHHHHHHHCCCCCCCCC T ss_conf 99887622211104786200357788787652685588751103142379999985287674024336542145778658 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCC---------HHHHHHHHHHHHHHHHHHCCCCCCCCEE Q ss_conf 7658888721055810225767787436877504578756863---------8889988874588665430872113203 Q gi|254780970|r 97 SYIEPYQGAATGVGGILRDIFTMGARPVAAMNSLRFGAIHHPK---------TKHLLSGVVAGIAGYSNSFGVPTVGGEV 167 (737) Q Consensus 97 s~i~P~~GAaTg~gG~iRDi~~~Ga~p~a~~~~l~~g~~~~p~---------~~~~~~~~~~G~~~ygN~~G~P~v~G~~ 167 (737) |.||||+|||||+||.|||+++..++.++.|..-.-|||.-|- .+.|..++-.|.|+|||++|+.|-.-.- T Consensus 338 TEIEPFGGAATClGGAIRDPLSGRSYVYQAmRvtGaadpt~~v~ETl~GKLPQ~ki~~tAA~GYSSYGNQIGLAT~~V~E 417 (1279) T TIGR01857 338 TEIEPFGGAATCLGGAIRDPLSGRSYVYQAMRVTGAADPTVPVSETLKGKLPQRKITTTAAHGYSSYGNQIGLATGQVSE 417 (1279) T ss_pred CCCCCCCCHHHHCCCCCCCCCCCHHHEEEEEEEECCCCCCCCHHHHCCCCCCCHHHHHHHCCCCCHHHHHHHHHHCCEEE T ss_conf 23378874445357541377763003011001105778855401011588741133220113210122246553161347 Q ss_pred EEECCCCCCCCCCEEEECCCCHHHHHHCCCCCCCCEEEEECCCCCCCCCHHHHHHHHHHCC-CCCCCCCCCCCCCHHHHH Q ss_conf 5521567664221122112113554311225689779996576564441046666777286-533344433267989998 Q gi|254780970|r 168 EFLPCYNNNIIVNTFAAGIAKTNAIFSSKARGIGLPLVYLGAKTGRDGIGGASMASEEFGE-NIAKKRPTVQVGDPFTGK 246 (737) Q Consensus 168 ~~~~~y~~~~~v~~~~~Gi~~~~~~~~~~~~~~Gd~ii~~G~~tg~dGi~Ga~~sS~~~~~-~~~~~~~~VQ~gdp~~ek 246 (737) .|||+|..+.|-.++.|+..|+++++|.+|. +||.||+|||+||||||||||.||+++++ +.+.+.+.||+||+..|| T Consensus 418 iYHpGY~AKRMEvGAVvAAtP~~nV~R~~P~-~GD~iILLGGkTGRDGiGGATGSSK~h~~eS~~~~GAEVQKGNAp~ER 496 (1279) T TIGR01857 418 IYHPGYVAKRMEVGAVVAATPKENVVREKPE-PGDVIILLGGKTGRDGIGGATGSSKEHTVESLELCGAEVQKGNAPEER 496 (1279) T ss_pred EECCCCEEEEEEECCEEEECCCCCCEECCCC-CCCEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHCCCEEECCCCCCCC T ss_conf 7657850124530014450588873526789-887899817852777556776885666368686468603038874311 Q ss_pred HHHHHHHHHCCCCCCHHCCCCCCCCHHHHHHHHHCCCCCCEEEEEHHHCCCCCCCCHHHHHHCCCCCEEEEEEECCCHHH Q ss_conf 88999874011383011034775201112233201477528996202204667899989861788850899995467588 Q gi|254780970|r 247 CLLEACLELMNTDAVIAIQDMGAAGLTCSAIEMGNQGNLGITLDLDKVPTCEEGMTAYEMMLSESQERMLMILNPEKQHK 326 (737) Q Consensus 247 ~~~~~~~~~~~~~~i~~~~D~gaGGl~~a~~Ema~~~~~G~~i~l~~vp~~~~~l~~~ei~~sEsQeR~~~~v~~~~~~~ 326 (737) |+||++|.-.-..||+.|.|.||||+++|++|||. |++|||++||+||+||+..||.+|||||||.++|.+||.|+ T Consensus 497 KiQRLFRn~~v~~LIKkcNDFGAGGVSVAiGELAd----G~~vdLn~VP~KYeGLdGTELAISESQERMAVvv~~Ed~d~ 572 (1279) T TIGR01857 497 KIQRLFRNGNVTRLIKKCNDFGAGGVSVAIGELAD----GLEVDLNKVPKKYEGLDGTELAISESQERMAVVVSKEDVDK 572 (1279) T ss_pred CCCCCCCCHHHHHHHHHCCCCCCCCEEEEEEECCC----CEEEECCCCCCCEECCCCCEEEEECCCCCCEEECCCCCHHH T ss_conf 11000467414556120376588732686522027----40751575874210788513312014551146728656578 Q ss_pred HHHHHHHHCCEEECCCCEEECCCCCCCCCCCCCCCCCCCCCCCCCCEEEEE--EECCCCCCCCCCCCCCCCCCHHHHHHH Q ss_conf 999888709713112212104431000258644344321014665011455--531565551012346872558899999 Q gi|254780970|r 327 AQEILNKWGLHFSIIGITTNDKLFRVIHRGEEVANLPIKALSDEAPEYDRA--WCEPITIPPSNPQLLCSDEDCTKSLLK 404 (737) Q Consensus 327 ~~~i~~~~~~~~~~iG~vt~~~~~~v~~~g~~v~dlp~~~L~~~~P~~~r~--~~~~~~~~~~~~~~~~~~~d~~~~l~~ 404 (737) |.+.|++|+++++||++||+++||++.|+|++||||..+||..++-+.+-. ...+.. +.......++..++.+.+++ T Consensus 573 F~~~~~eENL~AtvvA~VTd~~rL~m~W~G~~IVdl~R~FL~tNGv~~~~~~~V~~~~~-~~~~~~~~~~~~~l~~~~l~ 651 (1279) T TIGR01857 573 FLKYAEEENLEATVVATVTDKPRLVMNWKGKTIVDLSREFLDTNGVKQEIDVKVKDKDV-KLSVEKFKVSEETLEEKWLK 651 (1279) T ss_pred HHHHHHHCCCCEEEEEEEECCCCEEEEECCCEEEEEEHHHHHHCCCEEEEEEEEEECCC-CCCCCCEECCCHHHHHHHHH T ss_conf 99875323852289999956997067117865780132113008841789899962125-56423211362047888876 Q ss_pred HHHCCCCCCHHHHHHHCCCCCCCCCCCCCC--------CCCCEEEEC---CCCCCEEEEECCCCCCCHHCCCHHHHHHHH Q ss_conf 850732101035653112100011000013--------576314203---777622797304532100001001135678 Q gi|254780970|r 405 LLSSPNLSSRRWVYEQYDTMIQSNSIQLPG--------GDAGVIRVE---GHETKALAFSSDVTPRYVKADPFEGTKQAV 473 (737) Q Consensus 405 vL~~pnV~SK~~i~~qyD~~V~g~tv~~Pg--------~DaaVv~~~---~~~~~gia~s~g~~p~~~~~dP~~ga~~AV 473 (737) +|+..||||.+.|.++||++||++||+.|. .+++|.+++ +.++++.+||.|+||+.+.++||+||.||| T Consensus 652 ~~~~LNv~SqKGL~e~FDSsvG~gTvl~PlGGkyQLTP~~a~v~KlPVl~g~t~tAS~~a~GFNPYi~~ws~yhGA~yaV 731 (1279) T TIGR01857 652 VLKDLNVASQKGLVERFDSSVGAGTVLMPLGGKYQLTPTEASVAKLPVLGGETKTASAIAWGFNPYIAEWSPYHGAIYAV 731 (1279) T ss_pred HHHHHCCCCCCCCEEEEECCCCCHHHCCCCCCCCCCCHHHHHHHCCCCCCCCCCCEEEEEECCCCCCCCCCCCHHHHHHH T ss_conf 55430121247752120034171021377888645770257873277578865340244416675013577502367888 Q ss_pred HHHHHHHHCCCCCH--HHHHHHCCCCCCC-CCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 97776410278510--2232210245666-86213469999999985221028866544320011236866676311122 Q gi|254780970|r 474 AECWRNIIATGAKP--LAITDNLNFGNPE-KEEIMGQFVHSVKGIREACQILDFPIVSGNVSFYNETNGQSIFPTPTIAG 550 (737) Q Consensus 474 ~Eal~Nl~a~Ga~P--lait~~lnf~~P~-~~e~~~ql~~av~gl~d~c~~lgipivgGkvSl~n~t~~~~I~pTpvi~~ 550 (737) .||++.|+|||++- ..+|..-+|-... ++|+|++.+.|+.|..++..+||+|-||||||||..++..++|||++..+ T Consensus 732 vEs~AkLvAaG~~y~karLSFQEYFEkL~~~~ekwgkP~aAlLGA~~aQ~dLG~~AIGGKDSMSGtF~dL~VPPTLisFA 811 (1279) T TIGR01857 732 VESLAKLVAAGVDYKKARLSFQEYFEKLDKDEEKWGKPFAALLGALKAQVDLGLPAIGGKDSMSGTFEDLDVPPTLISFA 811 (1279) T ss_pred HHHHHHHHHCCCCEEEEEEEEEHHHHHCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCHHHHHH T ss_conf 99999999728970378862002233237775100670899988999988708876686347750158898870255344 Q ss_pred CCCCCCCEEEEECCCCCCCCCEEECCCCCCCCCHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECC Q ss_conf 22333520233102143898075137766554326788886116789996479899999999999999959806994648 Q gi|254780970|r 551 VGILPDYSLMTRIDSAHEGDLILMIGNDGCHLDCSMYSLECASSNIGPPPKVDCHLEKNHGFFVLSMINAKKITACHDIS 630 (737) Q Consensus 551 vG~v~d~~~~it~~~k~~Gd~I~liG~~~~~LggS~~~~~~~~~~~g~~p~~d~~~~k~~~~~v~~li~~g~I~S~HDiS 630 (737) |+.. +.+++++|+||..++.||++-....+-+ -.+|.+.+|..|..+.++++...|+|+.-+- T Consensus 812 V~~~-~~~~v~SpEFK~~~~~i~~i~~~~l~~~----------------~~~d~~~lk~nf~~~~~~~~~~kv~Sa~~~k 874 (1279) T TIGR01857 812 VATA-NVKRVISPEFKKAGEKIYLIEGKALEDD----------------LTLDSDELKENFEKIEELIKDHKVVSASAVK 874 (1279) T ss_pred HHHH-HCCCCCCCHHHCCCCEEEEECCCCCCCC----------------CCCCHHHHHHHHHHHHHHHCCCCEEEEEECC T ss_conf 3243-1110236023302875887066202246----------------7506788999999999840577479997446 Q ss_pred CCHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHCCCCCCEEEEECHH-HHHHHHHHHHHCCCCEEEEEEECCC-EEE Q ss_conf 65799999999845897089980566689878887298832189999688-0998999998579849998998798-689 Q gi|254780970|r 631 TGGLIITLAEMTISSAKGMDIILPIEIEKDPKPFLFGEDQGRYVVCISPE-NQDLVMSEANNKNIPLRYLGKVSGS-TLS 708 (737) Q Consensus 631 dGGL~~aL~EMa~as~~G~~I~L~~~~~~d~~~~LFsEs~Gr~ii~V~~~-~~~~~~~~~~~~gi~~~~IG~Vt~~-~l~ 708 (737) =||++-+|.+|.|++++|+.|+.+.. .+-||+-.+|.||++++.+ +++ ..+ ....+..||+|+.+ .+. T Consensus 875 yGG~~E~L~kmslGNriG~~~n~~~~-----ve~lf~~~~GS~~~e~~e~~~L~---~e~--~~~~~~~~G~t~~~f~~k 944 (1279) T TIGR01857 875 YGGVAEALAKMSLGNRIGAELNNKEV-----VEDLFTLKYGSFILEVKEDVKLE---DEL--ELANVEKIGETTKDFVLK 944 (1279) T ss_pred CCHHHHHHHHHHCCCEEEEEECCHHH-----HHHHHCCCCCCEEEECCCCCCCH---HHH--CCCCEEECCCEEEEEEEE T ss_conf 01688989874014523378807678-----87652457464789726777444---453--022403414175202799 Q ss_pred EEC-CEECCHHHHHHHHHHHHHHHHC Q ss_conf 807-4121499999999999998724 Q gi|254780970|r 709 IHN-ILNIPVATLQTKYESWFPEFIS 733 (737) Q Consensus 709 i~~-~~~~~i~~l~~~w~~~l~~~~~ 733 (737) .++ ..++++++|.++|++.|+.+|- T Consensus 945 ~~gyd~~~~L~~l~~A~~~~Le~vFp 970 (1279) T TIGR01857 945 VNGYDEKLDLEELEEAWEEKLEEVFP 970 (1279) T ss_pred EEEEEEEECHHHHHHHHHCCCCEECC T ss_conf 85235530679999998637830156 No 8 >cd02203 PurL_repeat1 PurL subunit of the formylglycinamide ribonucleotide amidotransferase (FGAR-AT), first repeat. FGAR-AT catalyzes the ATP-dependent conversion of formylglycinamide ribonucleotide (FGAR) and glutamine to formylglycinamidine ribonucleotide (FGAM), ADP, phosphate, and glutamate in the fourth step of the purine biosynthetic pathway. In eukaryotes and Gram-negative bacteria, FGAR-AT is encoded by the purL gene as a multidomain protein with a molecular mass of about 140 kDa. In Gram-positive bacteria and archaea FGAR-AT is a complex of three proteins: PurS, PurL, and PurQ. PurL itself contains two tandem N- and C-terminal domains (four domains altogether). The N-terminal domains bind ATP and are related to the ATP-binding domains of HypE, ThiL, SelD and PurM. Probab=100.00 E-value=0 Score=851.56 Aligned_cols=304 Identities=51% Similarity=0.834 Sum_probs=297.4 Q ss_pred HHHHHHHHHCCCCCCCCCHHHHHHHHHCCCCEEECCCCCEEEEEECCCEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 99998533001107445889997512069748993347736898369839999951788777658888721055810225 Q gi|254780970|r 36 EIGIISAMWNEHCSYKSSKKWLRTLPTTGKHVIQGPGENAGVVDIGGGDCVVFKMESHNHPSYIEPYQGAATGVGGILRD 115 (737) Q Consensus 36 El~~~~~~wSEHC~~k~~~~~l~~~~~~~~~vi~~~~~~a~vi~~~~~~~~~~k~EtHNhPs~i~P~~GAaTg~gG~iRD 115 (737) ||+||+|+||||||||+||++|++| |+++||+|||||||||+||+|||||+||+||| T Consensus 1 El~~~~~~wSEHC~~k~~~~~l~~~-----------------------~~~~~k~EsHNhPSai~Py~GAaTGvGGiiRD 57 (313) T cd02203 1 ELGMFAQMWSEHCRHKSFKSLLKMI-----------------------WAVVFKVETHNHPSAIEPFGGAATGVGGIIRD 57 (313) T ss_pred CHHHHHHHHHHHCCCHHHHHHHHHH-----------------------HEEEEEECCCCCCCCCCCCCCCCCCCCCEECC T ss_conf 9367776766233653269999987-----------------------70288834678988678878501045630025 Q ss_pred CCCCCCEEEEEEEECCCCCCCCC--------CHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEECCCCCCCCCCEEEECCC Q ss_conf 76778743687750457875686--------3888998887458866543087211320355215676642211221121 Q gi|254780970|r 116 IFTMGARPVAAMNSLRFGAIHHP--------KTKHLLSGVVAGIAGYSNSFGVPTVGGEVEFLPCYNNNIIVNTFAAGIA 187 (737) Q Consensus 116 i~~~Ga~p~a~~~~l~~g~~~~p--------~~~~~~~~~~~G~~~ygN~~G~P~v~G~~~~~~~y~~~~~v~~~~~Gi~ 187 (737) |++|||||+|+||+||||+|+.| +++||+.++++||+|||||||+|+|+|+++||++|..||+|+++|+|++ T Consensus 58 i~~~GA~pia~~d~l~fG~~~~~~~~~~~~~~~~~i~~~vv~GisdyGN~~GvP~v~G~~~f~~~y~~nPlv~~~~vG~v 137 (313) T cd02203 58 ILSMGARPIALLDGLRFGDLDIPGYEPKGKLSPRRILDGVVAGISDYGNCIGIPTVGGEVRFDPSYYGNPLVNVGCVGIV 137 (313) T ss_pred CCCCCCEEEEEEEEEEECCCCCCCCCCCCCCCHHHHHHHHCCHHHHCCCCCCCCEEEEEEEECCCCCCCCEEEEEEEEEE T ss_conf 45668667776546520699887655545668989987751356653865576235579998665467843787547753 Q ss_pred CHHHHHHCCCCCCCCEEEEECCCCCCCCCHHHHHHHHHHCCCC-CCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCHHCCC Q ss_conf 1355431122568977999657656444104666677728653-334443326798999888999874011383011034 Q gi|254780970|r 188 KTNAIFSSKARGIGLPLVYLGAKTGRDGIGGASMASEEFGENI-AKKRPTVQVGDPFTGKCLLEACLELMNTDAVIAIQD 266 (737) Q Consensus 188 ~~~~~~~~~~~~~Gd~ii~~G~~tg~dGi~Ga~~sS~~~~~~~-~~~~~~VQ~gdp~~ek~~~~~~~~~~~~~~i~~~~D 266 (737) +.+++.++++.++||+||++|++|||||||||+|||.+++++. +.++++|||||||+||+|+++|++++++|+|.+||| T Consensus 138 ~~~~~~~~~~~~~G~~iv~vGg~tgrdGigGasfsS~~~~~~~~~~~~~aVQ~gdP~~ek~l~e~~le~~~~~li~~ihD 217 (313) T cd02203 138 PKDHIVKSKAPGPGDLVVLVGGRTGRDGIGGATFSSKELSENSSELDRPAVQVGDPFMEKKLQEAILEARETGLIVGIQD 217 (313) T ss_pred EHHHHCCCCCCCCCCEEEEEECCCCCCCCCCEEEEEHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEE T ss_conf 37770256689999889999078666652210354032046643346554322866888899999998885498688886 Q ss_pred CCCCCHHHHHHHHHCCCCCCEEEEEHHHCCCCCCCCHHHHHHCCCCCEEEEEEECCCHHHHHHHHHHHCCEEECCCCEEE Q ss_conf 77520111223320147752899620220466789998986178885089999546758899988870971311221210 Q gi|254780970|r 267 MGAAGLTCSAIEMGNQGNLGITLDLDKVPTCEEGMTAYEMMLSESQERMLMILNPEKQHKAQEILNKWGLHFSIIGITTN 346 (737) Q Consensus 267 ~gaGGl~~a~~Ema~~~~~G~~i~l~~vp~~~~~l~~~ei~~sEsQeR~~~~v~~~~~~~~~~i~~~~~~~~~~iG~vt~ 346 (737) +|||||+||++||+.++++|++||||+||+++++|+|||||||||||||+++|+|+++++|++||+||++++++||+||+ T Consensus 218 ~GaGGl~~a~~Ema~~~~~G~~i~Ld~vp~~~~~m~p~Eil~SESQERm~~~v~~~~~~~~~~i~~k~~~~~~vIG~vt~ 297 (313) T cd02203 218 LGAGGLSSAVSEMAAKGGLGAEIDLDKVPLREPGMSPWEIWISESQERMLLVVPPEDLEEFLAICKKEDLEAAVIGEVTD 297 (313) T ss_pred CCCCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCHHHEEEECCHHCEEEEECHHHHHHHHHHHHHCCCCEEEEEEEEC T ss_conf 58850654237776148973999903470577999864713305611089998864699999999987999799999936 Q ss_pred CCCCCCCCCCCCCCCC Q ss_conf 4431000258644344 Q gi|254780970|r 347 DKLFRVIHRGEEVANL 362 (737) Q Consensus 347 ~~~~~v~~~g~~v~dl 362 (737) +++|+|+|++++++|| T Consensus 298 ~~~~~v~~~~e~v~dl 313 (313) T cd02203 298 DGRLRLYYKGEVVADL 313 (313) T ss_pred CCEEEEEECCEEEEEC T ss_conf 9869999999898739 No 9 >cd02204 PurL_repeat2 PurL subunit of the formylglycinamide ribonucleotide amidotransferase (FGAR-AT), second repeat. FGAR-AT catalyzes the ATP-dependent conversion of formylglycinamide ribonucleotide (FGAR) and glutamine to formylglycinamidine ribonucleotide (FGAM), ADP, phosphate, and glutamate in the fourth step of the purine biosynthetic pathway. In eukaryotes and Gram-negative bacteria, FGAR-AT is encoded by the purL gene as a multidomain protein with a molecular mass of about 140 kDa. In Gram-positive bacteria and archaea FGAR-AT is a complex of three proteins: PurS, PurL, and PurQ. PurL itself contains two tandem N- and C-terminal domains (four domains altogether). The N-terminal domains bind ATP and are related to the ATP-binding domains of HypE, ThiL, SelD and PurM. Probab=100.00 E-value=0 Score=487.89 Aligned_cols=263 Identities=41% Similarity=0.655 Sum_probs=253.7 Q ss_pred CCCEEEECCCCCCEEEEECCCCCCCHHCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHCCCCCCCCCC-HHHHHHHHHHH Q ss_conf 76314203777622797304532100001001135678977764102785102232210245666862-13469999999 Q gi|254780970|r 436 DAGVIRVEGHETKALAFSSDVTPRYVKADPFEGTKQAVAECWRNIIATGAKPLAITDNLNFGNPEKEE-IMGQFVHSVKG 514 (737) Q Consensus 436 DaaVv~~~~~~~~gia~s~g~~p~~~~~dP~~ga~~AV~Eal~Nl~a~Ga~Plait~~lnf~~P~~~e-~~~ql~~av~g 514 (737) ||+|+|+++.+++|+||++|++|+++.+|||+||++||+|++|||+++|++|+++++|+||++|++++ .||||+++++| T Consensus 1 da~V~~~~~~~~~g~a~s~g~~p~~~~~dP~~ga~~aV~Ea~rni~~~Ga~p~ai~~~lnf~~p~~~~~~~~~l~~av~g 80 (264) T cd02204 1 DAAVLRIPGETDKGLAMSTGENPRYSLLDPYAGAALAVAEAVRNLVAVGADPLAITDCLNFGNPEKPEGEMGQLVEAVLG 80 (264) T ss_pred CCEEEEECCCCCEEEEEECCCCCCCEEECHHHHHHHHHHHHHHHHHCCCCCHHHHEECHHCCCCCCCCCHHHHHHHHHHH T ss_conf 95599863899859999747999614348999999999999988752698389951001102877785078889999999 Q ss_pred HHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCCEEECCCCCCCCCHHHHHHHHHCC Q ss_conf 98522102886654432001123686667631112222333520233102143898075137766554326788886116 Q gi|254780970|r 515 IREACQILDFPIVSGNVSFYNETNGQSIFPTPTIAGVGILPDYSLMTRIDSAHEGDLILMIGNDGCHLDCSMYSLECASS 594 (737) Q Consensus 515 l~d~c~~lgipivgGkvSl~n~t~~~~I~pTpvi~~vG~v~d~~~~it~~~k~~Gd~I~liG~~~~~LggS~~~~~~~~~ 594 (737) |+|+|++||+|++||||||||++.+.+|+|||+|+++|.+++.++.+++.++++||.||++|.+..+++||.|++.+.+. T Consensus 81 ~~~~~~~lgiP~v~G~~Sl~n~~~~~~ippTlvi~~vg~v~~~~~~~t~~~k~~g~~i~lig~~~~~~ggs~~~~~~~~~ 160 (264) T cd02204 81 LGDACRALGTPVIGGKDSLYNETEGVAIPPTLVIGAVGVVDDVRKIVTLDFKKEGDLLYLIGETKDELGGSEYALAYHGL 160 (264) T ss_pred HHHHHHHCCCCEECCEEEEECCCCCCCCCCCEEEEEEEECCCCCEEEHHHHCCCCCEEEEECCCCCCCCCCHHHHHHCCC T ss_conf 99999970996203446651466786678852899997325730024154347998799957884557877999997087 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHCCCCCCEE Q ss_conf 78999647989999999999999995980699464865799999999845897089980566689878887298832189 Q gi|254780970|r 595 NIGPPPKVDCHLEKNHGFFVLSMINAKKITACHDISTGGLIITLAEMTISSAKGMDIILPIEIEKDPKPFLFGEDQGRYV 674 (737) Q Consensus 595 ~~g~~p~~d~~~~k~~~~~v~~li~~g~I~S~HDiSdGGL~~aL~EMa~as~~G~~I~L~~~~~~d~~~~LFsEs~Gr~i 674 (737) ..+.+|.+|++.++++++++++++++++|.|+||+|+|||+++|+|||+++++|++|+++... ++..+||||++||+| T Consensus 161 ~~~~~~~~d~~~~k~~~~~i~~l~~~~li~s~hDvS~GGL~~al~EMa~~~~~G~~i~l~~~~--~~~~~LFsEs~gr~v 238 (264) T cd02204 161 GGGAPPLVDLEREKALFDAVQELIKEGLVLSAHDVSDGGLAVALAEMAFAGGLGAEVDLSKDD--AEDELLFSESLGRVL 238 (264) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHCCCCEEEEECCCCC--CHHHHEECCCCCEEE T ss_conf 889856589999999999998998659612787678781999999998428965999838999--825750378888699 Q ss_pred EEECHHHHHHHHHHHHHCCCCEEEEEEE Q ss_conf 9996880998999998579849998998 Q gi|254780970|r 675 VCISPENQDLVMSEANNKNIPLRYLGKV 702 (737) Q Consensus 675 i~V~~~~~~~~~~~~~~~gi~~~~IG~V 702 (737) ++|++++.+.++ ++..++++.+||+| T Consensus 239 v~v~~~~~~~~~--~~~~~v~~~~IG~v 264 (264) T cd02204 239 VEVKPENEEVFE--AEEAGVPATVIGTV 264 (264) T ss_pred EEECCCHHHHHH--HHHCCCCEEEEEEC T ss_conf 998724299999--98889996998769 No 10 >TIGR01739 tegu_FGAM_synt herpesvirus tegument protein/v-FGAM-synthase; InterPro: IPR010077 This entry describes a family of large proteins of herpes virus. The protein is described variably as tegument protein or phosphoribosylformylglycinamidine synthase (FGAM-synthase). The protein shows homology to eukaryotic FGAM-synthase. ; GO: 0004642 phosphoribosylformylglycinamidine synthase activity, 0005198 structural molecule activity, 0019033 viral tegument. Probab=100.00 E-value=0 Score=457.19 Aligned_cols=593 Identities=17% Similarity=0.217 Sum_probs=420.6 Q ss_pred CCCEEEEEEEECCCCCC------CC----CCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEECCC-CC-----CCCCCEE Q ss_conf 78743687750457875------68----638889988874588665430872113203552156-76-----6422112 Q gi|254780970|r 119 MGARPVAAMNSLRFGAI------HH----PKTKHLLSGVVAGIAGYSNSFGVPTVGGEVEFLPCY-NN-----NIIVNTF 182 (737) Q Consensus 119 ~Ga~p~a~~~~l~~g~~------~~----p~~~~~~~~~~~G~~~ygN~~G~P~v~G~~~~~~~y-~~-----~~~v~~~ 182 (737) +|..|+|..-++..-.| +. +..+.+..-.++--++.-.+.|+|+++|++++-... .. +++.+.. T Consensus 216 p~~~p~as~~G~y~~~pPqfq~~~~~~~~~~~~~~~~~~~~~~a~~~~~~GvP~~~GF~R~i~~~~~~~~~~~~~~~~~s 295 (1313) T TIGR01739 216 PGVSPVASALGLYYLTPPQFQALEGEQALPLARKISTALLKAQAELLRAAGVPVVGGFIRLISKNKLELGTGVTPTLFTS 295 (1313) T ss_pred CCCCHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEECCCCEEEEEEEEEEEEE T ss_conf 41246788752787538400157854014678889999999999999866788411564621057830210023589865 Q ss_pred EECCCCHHHHHHCCCCCCCC------EEEEECCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHH---HH Q ss_conf 21121135543112256897------7999657656444104666677728653334443326798999888999---87 Q gi|254780970|r 183 AAGIAKTNAIFSSKARGIGL------PLVYLGAKTGRDGIGGASMASEEFGENIAKKRPTVQVGDPFTGKCLLEA---CL 253 (737) Q Consensus 183 ~~Gi~~~~~~~~~~~~~~Gd------~ii~~G~~tg~dGi~Ga~~sS~~~~~~~~~~~~~VQ~gdp~~ek~~~~~---~~ 253 (737) ..+-.....+... ..++|+ .||+||.=....+-.+..+ --...++ |-.-.++.++ +. T Consensus 296 ~~~~~~~~~~~~~-~~~~~~ksGdsl~~v~LG~F~p~~~~~~~~p--P~~y~~S-----------~~~~~~i~~~L~~~~ 361 (1313) T TIGR01739 296 ILTSLCHESLSTA-DERPGQKSGDSLDIVALGSFEPTLLPDTPGP--PLLYADS-----------PLEVNKILQALELLT 361 (1313) T ss_pred EECCCCCCCCCCC-CCCCCCCCCCEEEEEEEECCCCCCCCCCCCC--CCCCCCC-----------HHHHHHHHHHHHHHH T ss_conf 4054210001356-6666656686589998742567766567853--8974114-----------799999999999999 Q ss_pred -HHCC--CCCCHHCCCCCCCCHHHHHHHHHC-CCCCCEEEEEHHHCCC---------CCCCC--HHHHHH-----CCCCC Q ss_conf -4011--383011034775201112233201-4775289962022046---------67899--989861-----78885 Q gi|254780970|r 254 -ELMN--TDAVIAIQDMGAAGLTCSAIEMGN-QGNLGITLDLDKVPTC---------EEGMT--AYEMML-----SESQE 313 (737) Q Consensus 254 -~~~~--~~~i~~~~D~gaGGl~~a~~Ema~-~~~~G~~i~l~~vp~~---------~~~l~--~~ei~~-----sEsQe 313 (737) ++-. .-.+..+.+.+-.=+.-.|.|++. + .|++|++.+.|.. +.++. -+|-++ |=.=. T Consensus 362 ~~~~~y~~~i~~~~r~~~~~s~~~~L~~L~~~~--~Ga~l~~S~LP~~v~~~L~~~~~~~~~~P~~e~~~~~~FlN~~c~ 439 (1313) T TIGR01739 362 RDLKTYAPCIVGSIRPLGPASVKEHLTELLPDP--CGAELDLSNLPDEVVAALKASSPANAEGPELEKLVKSYFLNVVCS 439 (1313) T ss_pred HHHCCCCCEEEECCCCCCCCHHHHHHHHHHCCC--CCEEEECCCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHCC T ss_conf 740777754885157777716899999971577--523784104988899961278701155745788886520011023 Q ss_pred EEEEEEECC--------------CHHHHHHHHHHHCCEEECCCCEEECCCCCCCCC--CCCC-CCCCCCCCCCCCCEEEE Q ss_conf 089999546--------------758899988870971311221210443100025--8644-34432101466501145 Q gi|254780970|r 314 RMLMILNPE--------------KQHKAQEILNKWGLHFSIIGITTNDKLFRVIHR--GEEV-ANLPIKALSDEAPEYDR 376 (737) Q Consensus 314 R~~~~v~~~--------------~~~~~~~i~~~~~~~~~~iG~vt~~~~~~v~~~--g~~v-~dlp~~~L~~~~P~~~r 376 (737) -+.++|+.+ -++.+...|+..+|++.+||+.+++.-++|-++ +... -...+++- ...|...+ T Consensus 440 ~v~l~v~n~~g~~~~~~~~~~~~p~~~L~~~c~~~~c~~~~lG~~~~~~g~~~v~dl~~~~~~~~~~~~~~-~~~~~~~~ 518 (1313) T TIGR01739 440 VVFLTVKNSSGLDEVTELDEGVTPLEALKRACRLAGCQVLVLGKLVELPGLHIVSDLFNPVPVTYATFDFT-SATPTVPE 518 (1313) T ss_pred EEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEEEEECCCCCEEEECCCCCCCCCEEEEECC-CCCCCCCC T ss_conf 05688705767321124675335899999999961994799987304888678606788976411333013-22456766 Q ss_pred EEE---------CCCCCC-----CCCCCCCCCCCC-HHHHHHHHHHCCCCCCHHHHHHHCCCCCCCCCCCC--------C Q ss_conf 553---------156555-----101234687255-88999998507321010356531121000110000--------1 Q gi|254780970|r 377 AWC---------EPITIP-----PSNPQLLCSDED-CTKSLLKLLSSPNLSSRRWVYEQYDTMIQSNSIQL--------P 433 (737) Q Consensus 377 ~~~---------~~~~~~-----~~~~~~~~~~~d-~~~~l~~vL~~pnV~SK~~i~~qyD~~V~g~tv~~--------P 433 (737) ... .+.... ...+...+...+ +++.+.++|+||+|+||+||++|-||+-+|+.+|. | T Consensus 519 ~~~~~~dyt~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~iL~HPtVgSKe~iV~H~DR~g~GrVaQQ~GvGPlDlP 598 (1313) T TIGR01739 519 LPLQEEDYTSASASVPVSREEPMKLDESLDWATLNVLESTIEKILSHPTVGSKEFIVRHIDRCGNGRVAQQQGVGPLDLP 598 (1313) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCEEEEECCCCCCHHH T ss_conf 78886544357888436776651257655420147799999998148874420578756643218705872686852003 Q ss_pred CCCCCEEEE------------------------------------------C--------CCCC--------CEEEEECC Q ss_conf 357631420------------------------------------------3--------7776--------22797304 Q gi|254780970|r 434 GGDAGVIRV------------------------------------------E--------GHET--------KALAFSSD 455 (737) Q Consensus 434 g~DaaVv~~------------------------------------------~--------~~~~--------~gia~s~g 455 (737) ++||+++.. . .+++ .|+|+++| T Consensus 599 vsDYs~v~~~~~~~~~q~~~~~~~~~~~~~~~~~~e~a~~~~~d~~~~~~~~~~GltiqvPddn~~~~~~~~~~~~~A~G 678 (1313) T TIGR01739 599 VSDYSLVLHNAVKSVEQSLEAEASRVTDTTEAVSSELAMRLILDPDAWLKSKEKGLTIQVPDDNEVLENPALSGVCSALG 678 (1313) T ss_pred HHHHHHHHHCCCCCCCCCHHCCCCCCCCCCCCCCCCCHHHHHCCHHHHHCCCCCCCEEECCCCCCCCCCCCCEEEEEECC T ss_conf 66699987314353223201034533566212488650222226134516666752687367776577765238999613 Q ss_pred CCCCCHHCCCHHHHHHHHHHHHHHHHCCC----CCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 53210000100113567897776410278----51022322102456668621346999999998522102886654432 Q gi|254780970|r 456 VTPRYVKADPFEGTKQAVAECWRNIIATG----AKPLAITDNLNFGNPEKEEIMGQFVHSVKGIREACQILDFPIVSGNV 531 (737) Q Consensus 456 ~~p~~~~~dP~~ga~~AV~Eal~Nl~a~G----a~Plait~~lnf~~P~~~e~~~ql~~av~gl~d~c~~lgipivgGkv 531 (737) |+++++.+||..|+.+|++||++||.+++ .+.+.||.++-|+. +......|++++..++|+|++|||++.-+.- T Consensus 679 EQ~~kv~~D~~~G~~yai~EALlNL~~sP~~~Gl~~~~iT~svtw~~--~~~~~~~L~~~l~~ck~FC~~LGV~~~~~~A 756 (1313) T TIGR01739 679 EQAYKVQLDPKRGATYAITEALLNLSLSPVLTGLEDVIITLSVTWSP--EDHVYSLLKEALRACKDFCEELGVSFTVTSA 756 (1313) T ss_pred CCCEEEEECHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECCC--CCHHHHHHHHHHHHHHHHHHHCCCEEEEECC T ss_conf 22300242135668999999999987453003624247988885484--1004899999999999999749916987025 Q ss_pred CCCCCCCCCCCC------------CCCCCCCCCCCCC-CEEEEECCCCCCCCCEEECCC-CCCCCCHHHHHHHHHCCCC- Q ss_conf 001123686667------------6311122223335-202331021438980751377-6655432678888611678- Q gi|254780970|r 532 SFYNETNGQSIF------------PTPTIAGVGILPD-YSLMTRIDSAHEGDLILMIGN-DGCHLDCSMYSLECASSNI- 596 (737) Q Consensus 532 Sl~n~t~~~~I~------------pTpvi~~vG~v~d-~~~~it~~~k~~Gd~I~liG~-~~~~LggS~~~~~~~~~~~- 596 (737) |-.......+.+ .+.+.++...+.. ..+.+||++|.+|+.|+++.- .+..++||+|.+.+..... T Consensus 757 ~~~~~~~~~~~~~~~~~~~~vl~~~~IVfsa~~~~~~~~~~~iTPdlk~~Gs~L~~l~~~~~~~laGSif~~i~~~~~~~ 836 (1313) T TIGR01739 757 ASSPTQSSRSAPTTIQRESKVLTFNSIVFSASAPVKLGSAKKITPDLKKHGSHLIWLSIHQKLTLAGSIFEQILGLSATI 836 (1313) T ss_pred CCCCCCCCCCCCCCCCCCCEEEECCEEEEEEEEEECCCCCCCCCCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHCCCCC T ss_conf 78888887888857886532430120899998741366666457554478888999942897201767999862103577 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHCCCCCCEEEE Q ss_conf 99964798999999999999999598069946486579999999984589708998056668987888729883218999 Q gi|254780970|r 597 GPPPKVDCHLEKNHGFFVLSMINAKKITACHDISTGGLIITLAEMTISSAKGMDIILPIEIEKDPKPFLFGEDQGRYVVC 676 (737) Q Consensus 597 g~~p~~d~~~~k~~~~~v~~li~~g~I~S~HDiSdGGL~~aL~EMa~as~~G~~I~L~~~~~~d~~~~LFsEs~Gr~ii~ 676 (737) ..++.+++...++++.++++|++++.|.|+||||||||+++++|||++|+.|++|.+|.. ..++.++||||+|| +||| T Consensus 837 ~~~~~~~~~~l~~ll~~~~~L~~~~~I~SgHDvSDGGL~ac~~EMAlaG~kgv~i~~P~~-~~~~~~~l~SETPG-~viE 914 (1313) T TIGR01739 837 HRLPELSPESLKKLLKALQELVKEGVIVSGHDVSDGGLVACVAEMALAGGKGVRITLPAH-ITDPLEFLLSETPG-VVIE 914 (1313) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEECCHHHHHHHHHHHHCCCCEEEEECCCC-CCCHHHHHHCCCCC-EEEE T ss_conf 656556889999999999877656735887543613689999999963896489971764-35568875305895-6999 Q ss_pred ECHHHHHHHHHHHHHCCCCEEEEEEECCC----EEEEE----CCEECCHHHHHHHHHHHHHHHH Q ss_conf 96880998999998579849998998798----68980----7412149999999999999872 Q gi|254780970|r 677 ISPENQDLVMSEANNKNIPLRYLGKVSGS----TLSIH----NILNIPVATLQTKYESWFPEFI 732 (737) Q Consensus 677 V~~~~~~~~~~~~~~~gi~~~~IG~Vt~~----~l~i~----~~~~~~i~~l~~~w~~~l~~~~ 732 (737) |.+++.+++++.++..++.+..||+|+++ .|.|. .+++.++..++..|+....+.. T Consensus 915 V~~~~~~~v~~~l~~~~~~~~~iG~V~~~g~~~~~~v~h~~~~l~~~~L~~l~~~W~~~~~~~~ 978 (1313) T TIGR01739 915 VEPESLKEVLQFLRSEGLVAAVIGRVGEEGESSTFSVVHNSTVLFQESLSSLQGTWRSFSSEEL 978 (1313) T ss_pred ECCCCHHHHHHHHHHCCCCEEECCEEECCCCCCEEEEEECCCEEEEEEHHHHHHHHHHHHHHHH T ss_conf 7774088999999867971444017442579972799978826886100233411343232336 No 11 >cd02193 PurL Formylglycinamide ribonucleotide amidotransferase (FGAR-AT) catalyzes the ATP-dependent conversion of formylglycinamide ribonucleotide (FGAR) and glutamine to formylglycinamidine ribonucleotide (FGAM), ADP, phosphate, and glutamate in the fourth step of the purine biosynthetic pathway. In eukaryotes and Gram-negative bacteria, FGAR-AT is encoded by the purL gene as a multidomain protein with a molecular mass of about 140 kDa. In Gram-positive bacteria and archaea FGAR-AT is a complex of three proteins: PurS, PurL, and PurQ. PurL itself contains two tandem N- and C-terminal domains (four domains altogether). The N-terminal domains bind ATP and are related to the ATP-binding domains of HypE, ThiL, SelD and PurM. Probab=100.00 E-value=0 Score=468.80 Aligned_cols=259 Identities=29% Similarity=0.402 Sum_probs=237.0 Q ss_pred EEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCC--CEEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 3999995178877765888872105581022576778--74368775045787568638889988874588665430872 Q gi|254780970|r 84 DCVVFKMESHNHPSYIEPYQGAATGVGGILRDIFTMG--ARPVAAMNSLRFGAIHHPKTKHLLSGVVAGIAGYSNSFGVP 161 (737) Q Consensus 84 ~~~~~k~EtHNhPs~i~P~~GAaTg~gG~iRDi~~~G--a~p~a~~~~l~~g~~~~p~~~~~~~~~~~G~~~ygN~~G~P 161 (737) |.++||+|||||||+|+||.||+|+|++.+||++|+| |+|+|++|+|||+++ +|...|++.++++|+++|||+||+| T Consensus 1 ~g~a~~~~~h~~~s~idP~~Ga~~av~Ea~RNi~~~G~~a~pia~~dn~~~~~~-~p~~~~~l~~av~Gi~d~c~~l~iP 79 (272) T cd02193 1 YGEAMKIEEHNHPAAIDPAAGAATGVGGAIRDIAATGIDAKPIALSANWMASAG-HPGEDAILYDAVKGVAELCNQLGLP 79 (272) T ss_pred CCEEEECCCCCCCCHHCHHHHHHHHHHHHHHCCHHCCCCCCEEEEEECCCCCCC-CCHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 942897168899041089999999999999760510888722589850366897-9318899999999999998862988 Q ss_pred CCCCEEEEECCCCC-----------CCCCCEEEECCCCHHHHHHCCCCCCCCEEEEECCCCCCCCCHHHHHHHHHHCCCC Q ss_conf 11320355215676-----------6422112211211355431122568977999657656444104666677728653 Q gi|254780970|r 162 TVGGEVEFLPCYNN-----------NIIVNTFAAGIAKTNAIFSSKARGIGLPLVYLGAKTGRDGIGGASMASEEFGENI 230 (737) Q Consensus 162 ~v~G~~~~~~~y~~-----------~~~v~~~~~Gi~~~~~~~~~~~~~~Gd~ii~~G~~tg~dGi~Ga~~sS~~~~~~~ 230 (737) +++|+++|+..|.. +|.+.++++|+++..+.......++||.|+++|++++++|+||++++|..+.... T Consensus 80 vvsGnvSlyn~~~~~~~~~~~~~~~~Pt~~i~~vG~i~d~~~~~~~~~~~g~~i~liG~~~~~~~lgGs~~~~~~~~~~~ 159 (272) T cd02193 80 IPVGKDRMSMKTRWQEGNEQREMTHPPSLVISAFGRVRDDRHTLPQLSTEGNALLLIGGGKGHNGLGGTALASVALSYRQ 159 (272) T ss_pred EECCCCCEECEEECCCCCCCEEEEEEEEEEEEEEECCCCCCEECCCCCCCCCEEEEEECCCCCCCCCHHHHHHHHHCCCC T ss_conf 50562313000442578830168871247766752035640444777789998999915788787532689998740243 Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCHHCCCCCCCCHHHHHHHHHCCCCCCEEEEEHHHCCCCCCCCHHHHHHCC Q ss_conf 33444332679899988899987401138301103477520111223320147752899620220466789998986178 Q gi|254780970|r 231 AKKRPTVQVGDPFTGKCLLEACLELMNTDAVIAIQDMGAAGLTCSAIEMGNQGNLGITLDLDKVPTCEEGMTAYEMMLSE 310 (737) Q Consensus 231 ~~~~~~VQ~gdp~~ek~~~~~~~~~~~~~~i~~~~D~gaGGl~~a~~Ema~~~~~G~~i~l~~vp~~~~~l~~~ei~~sE 310 (737) ...++||+.||+.+|++++++++++++++|.++||+|+|||++|+.||+..+++|++|||++||.++++|+|+|+|||| T Consensus 160 -~g~~~~~~~d~~~ek~~~~~~~~~i~~glI~s~HDvS~GGL~vaLaEma~~g~~G~~idL~~v~~~~~~~~~~~~lfsE 238 (272) T cd02193 160 -LGDKSAQVRDPAQEKGFYEAMQALVAAGKLLAWHDRGAGGLLVALAELVFAGHCGVQVDLAALGDDEPDMEPLEIALFE 238 (272) T ss_pred -CCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCHHHHHHHC T ss_conf -7999998689999999999999999839944886658860888899998547964999805675889998789998716 Q ss_pred CCCEEEEEEECCCHHHHHHHHHHHCCEEECCCCE Q ss_conf 8850899995467588999888709713112212 Q gi|254780970|r 311 SQERMLMILNPEKQHKAQEILNKWGLHFSIIGIT 344 (737) Q Consensus 311 sQeR~~~~v~~~~~~~~~~i~~~~~~~~~~iG~v 344 (737) ||+||+++|+|+|+++|+++++++++.+.|||+. T Consensus 239 sq~R~vv~V~~e~~~~~~~i~~~~~~~~~viG~~ 272 (272) T cd02193 239 SQERGVIQVRAEDRDAVEEAQYGLADCVHVLGQA 272 (272) T ss_pred HHHCEEEEECHHHHHHHHHHHCCCCCCEEEECCC T ss_conf 7358299986765999999961689834882579 No 12 >cd02193 PurL Formylglycinamide ribonucleotide amidotransferase (FGAR-AT) catalyzes the ATP-dependent conversion of formylglycinamide ribonucleotide (FGAR) and glutamine to formylglycinamidine ribonucleotide (FGAM), ADP, phosphate, and glutamate in the fourth step of the purine biosynthetic pathway. In eukaryotes and Gram-negative bacteria, FGAR-AT is encoded by the purL gene as a multidomain protein with a molecular mass of about 140 kDa. In Gram-positive bacteria and archaea FGAR-AT is a complex of three proteins: PurS, PurL, and PurQ. PurL itself contains two tandem N- and C-terminal domains (four domains altogether). The N-terminal domains bind ATP and are related to the ATP-binding domains of HypE, ThiL, SelD and PurM. Probab=100.00 E-value=0 Score=432.27 Aligned_cols=251 Identities=25% Similarity=0.295 Sum_probs=225.9 Q ss_pred EEEEECCCCCCCHHCCCHHHHHHHHHHHHHHHHCCC--CCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 279730453210000100113567897776410278--510223221024566686213469999999985221028866 Q gi|254780970|r 449 ALAFSSDVTPRYVKADPFEGTKQAVAECWRNIIATG--AKPLAITDNLNFGNPEKEEIMGQFVHSVKGIREACQILDFPI 526 (737) Q Consensus 449 gia~s~g~~p~~~~~dP~~ga~~AV~Eal~Nl~a~G--a~Plait~~lnf~~P~~~e~~~ql~~av~gl~d~c~~lgipi 526 (737) |+||+++++|+.+++|||+||++||+||+||++|+| ++|++++|||||++| +|+.+|+|+++++||+|+|++|++|+ T Consensus 2 g~a~~~~~h~~~s~idP~~Ga~~av~Ea~RNi~~~G~~a~pia~~dn~~~~~~-~p~~~~~l~~av~Gi~d~c~~l~iPv 80 (272) T cd02193 2 GEAMKIEEHNHPAAIDPAAGAATGVGGAIRDIAATGIDAKPIALSANWMASAG-HPGEDAILYDAVKGVAELCNQLGLPI 80 (272) T ss_pred CEEEECCCCCCCCHHCHHHHHHHHHHHHHHCCHHCCCCCCEEEEEECCCCCCC-CCHHHHHHHHHHHHHHHHHHHHCCCE T ss_conf 42897168899041089999999999999760510888722589850366897-93188999999999999988629885 Q ss_pred CCCCCCCCCCCCCC--------CCCCCCCCCCCCCCCCCEEEEECCCCCCCCCEEECCCC--CCCCCHHHHHHHHHC-C- Q ss_conf 54432001123686--------66763111222233352023310214389807513776--655432678888611-6- Q gi|254780970|r 527 VSGNVSFYNETNGQ--------SIFPTPTIAGVGILPDYSLMTRIDSAHEGDLILMIGND--GCHLDCSMYSLECAS-S- 594 (737) Q Consensus 527 vgGkvSl~n~t~~~--------~I~pTpvi~~vG~v~d~~~~it~~~k~~Gd~I~liG~~--~~~LggS~~~~~~~~-~- 594 (737) +||||||||+|.+. .++|||+|+++|.++|.++. ++.++++||.|+++|.+ ..+||||.|++.... . T Consensus 81 vsGnvSlyn~~~~~~~~~~~~~~~~Pt~~i~~vG~i~d~~~~-~~~~~~~g~~i~liG~~~~~~~lgGs~~~~~~~~~~~ 159 (272) T cd02193 81 PVGKDRMSMKTRWQEGNEQREMTHPPSLVISAFGRVRDDRHT-LPQLSTEGNALLLIGGGKGHNGLGGTALASVALSYRQ 159 (272) T ss_pred ECCCCCEECEEECCCCCCCEEEEEEEEEEEEEEECCCCCCEE-CCCCCCCCCEEEEEECCCCCCCCCHHHHHHHHHCCCC T ss_conf 056231300044257883016887124776675203564044-4777789998999915788787532689998740243 Q ss_pred -CCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHCCCCCEEEEECCCC----CCCHHHHHHCCC Q ss_conf -789996479899999999999999959806994648657999999998458970899805666----898788872988 Q gi|254780970|r 595 -NIGPPPKVDCHLEKNHGFFVLSMINAKKITACHDISTGGLIITLAEMTISSAKGMDIILPIEI----EKDPKPFLFGED 669 (737) Q Consensus 595 -~~g~~p~~d~~~~k~~~~~v~~li~~g~I~S~HDiSdGGL~~aL~EMa~as~~G~~I~L~~~~----~~d~~~~LFsEs 669 (737) ....++..|++.++++++++++++++++|.|+||+|+|||++||+||||++++|++|||+..+ ..+++..||||+ T Consensus 160 ~g~~~~~~~d~~~ek~~~~~~~~~i~~glI~s~HDvS~GGL~vaLaEma~~g~~G~~idL~~v~~~~~~~~~~~~lfsEs 239 (272) T cd02193 160 LGDKSAQVRDPAQEKGFYEAMQALVAAGKLLAWHDRGAGGLLVALAELVFAGHCGVQVDLAALGDDEPDMEPLEIALFES 239 (272) T ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCHHHHHHHCH T ss_conf 79999986899999999999999998399448866588608888999985479649998056758899987899987167 Q ss_pred CCCEEEEECHHHHHHHHHHHHHCCCCEEEEEE Q ss_conf 32189999688099899999857984999899 Q gi|254780970|r 670 QGRYVVCISPENQDLVMSEANNKNIPLRYLGK 701 (737) Q Consensus 670 ~Gr~ii~V~~~~~~~~~~~~~~~gi~~~~IG~ 701 (737) ++|+|++|+|++.++|++++..+++.+++||+ T Consensus 240 q~R~vv~V~~e~~~~~~~i~~~~~~~~~viG~ 271 (272) T cd02193 240 QERGVIQVRAEDRDAVEEAQYGLADCVHVLGQ 271 (272) T ss_pred HHCEEEEECHHHHHHHHHHHCCCCCCEEEECC T ss_conf 35829998676599999996168983488257 No 13 >TIGR03267 methan_mark_2 putative methanogenesis marker protein 2. A single member of this protein family is found in each of the first ten complete genome sequences of archaeal methanogens, and nowhere else. Sequence similarity to various bacterial proteins is reflected in Pfam models pfam00586 and pfam02769, AIR synthase related protein N-terminal and C-terminal domains, respectively. The functions of proteins in this family are unknown, but their role is likely one essential to methanogenesis. Probab=100.00 E-value=0 Score=425.76 Aligned_cols=305 Identities=23% Similarity=0.302 Sum_probs=263.0 Q ss_pred HHCCCCCCCCCH---HH-HHHH-HHCCCCEEECCCCCEEEEEECCCEEEEEEECC-CCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 300110744588---99-9751-20697489933477368983698399999517-887776588887210558102257 Q gi|254780970|r 43 MWNEHCSYKSSK---KW-LRTL-PTTGKHVIQGPGENAGVVDIGGGDCVVFKMES-HNHPSYIEPYQGAATGVGGILRDI 116 (737) Q Consensus 43 ~wSEHC~~k~~~---~~-l~~~-~~~~~~vi~~~~~~a~vi~~~~~~~~~~k~Et-HNhPs~i~P~~GAaTg~gG~iRDi 116 (737) ...+-|+||.-- .| +++. ++-..+|+.+||||||||+++++..+++++|+ ||||++++||+|+.+++.+++||| T Consensus 10 ~~~~g~~~k~~~~~~~~~~~~~~~~~~~~~vvgpGdDAavi~i~~~~~~~~~~d~~~~~~~~~dPy~gG~~aVa~~v~DI 89 (323) T TIGR03267 10 RSFEGVTRKHPIKDVVEILEPLDVTYEGNVIVDFGDDAAAIKIGGDDILLLAADGIWGKLLDADPWWAGYCAVLVNVNDI 89 (323) T ss_pred HHCCCCHHCCCHHHHHHHHHHCCCCCCCCEEECCCCCEEEEEECCCCEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHH T ss_conf 74878310059899999986433455887542479860699878985899997677887514697898999999998889 Q ss_pred CCCCCEEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEECCCCCCCCCCEEEECCCCHHHHHHCC Q ss_conf 67787436877504578756863888998887458866543087211320355215676642211221121135543112 Q gi|254780970|r 117 FTMGARPVAAMNSLRFGAIHHPKTKHLLSGVVAGIAGYSNSFGVPTVGGEVEFLPCYNNNIIVNTFAAGIAKTNAIFSSK 196 (737) Q Consensus 117 ~~~Ga~p~a~~~~l~~g~~~~p~~~~~~~~~~~G~~~ygN~~G~P~v~G~~~~~~~y~~~~~v~~~~~Gi~~~~~~~~~~ 196 (737) ++|||+|+|++|+|+|++|+ .+.++++|++++++++|+|+|||+|.++..|.. ++++++|+++.+++++.. T Consensus 90 aamGA~Pla~ld~L~~~~pe------~~~~i~~Gi~~a~~~~gvpvVGG~t~~~~~~~~---v~v~vvGiv~~~~~i~~~ 160 (323) T TIGR03267 90 AAMGGKPVGMVNVLSINDVD------VCREVLEGMREGAWKFGVPVVGGHTHPDTPYNA---LDVAIVGIAKEDCIIRSD 160 (323) T ss_pred HHHCCCHHHHHHHHCCCCHH------HHHHHHHHHHHHHHHCCCCEECCCCCCCCCCCE---EEEEEEEEECCCCEEECC T ss_conf 98087618863022679878------999999999999987499776673456888770---589999987411145447 Q ss_pred CCCCCCEEEEECCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCHHCCCCCCCCHHHHH Q ss_conf 25689779996576564441046666777286533344433267989998889998740113830110347752011122 Q gi|254780970|r 197 ARGIGLPLVYLGAKTGRDGIGGASMASEEFGENIAKKRPTVQVGDPFTGKCLLEACLELMNTDAVIAIQDMGAAGLTCSA 276 (737) Q Consensus 197 ~~~~Gd~ii~~G~~tg~dGi~Ga~~sS~~~~~~~~~~~~~VQ~gdp~~ek~~~~~~~~~~~~~~i~~~~D~gaGGl~~a~ 276 (737) .+++||.|++.|+.+|+.+.. ..+..+ ..+..+|+..|..+++++++.+++++++|||+++|||++++ T Consensus 161 ~a~~GD~ii~~~~l~g~~~~~------~~~~~~------~~~~~~~~~~~~~~~~~~~l~~~~lv~a~~DiS~gGL~~~l 228 (323) T TIGR03267 161 TAKPGDLIIFAIDLDGRPYPS------FPLNWD------TTTMKSPDYLRAQMDAVVEIAERKLVKAGKDISNPGLIGTL 228 (323) T ss_pred CCCCCCEEEEEECCCCCCCCC------CCCCCC------CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHH T ss_conf 899999999996467611645------534431------01346747778889989999986678766412647167799 Q ss_pred HHHHCCCCCCEEEEEHHHCCCCCCCCHHHHHHCCCCCEEEEEEECCCHHHHHHHHHHHCCEEECCCCEEECCCCCCCCCC Q ss_conf 33201477528996202204667899989861788850899995467588999888709713112212104431000258 Q gi|254780970|r 277 IEMGNQGNLGITLDLDKVPTCEEGMTAYEMMLSESQERMLMILNPEKQHKAQEILNKWGLHFSIIGITTNDKLFRVIHRG 356 (737) Q Consensus 277 ~Ema~~~~~G~~i~l~~vp~~~~~l~~~ei~~sEsQeR~~~~v~~~~~~~~~~i~~~~~~~~~~iG~vt~~~~~~v~~~g 356 (737) .||+.+|++|++|++++||. .+++++.+.+.+|+|+||+++|+|++.++++++++++++++++||+||+++++++++++ T Consensus 229 ~em~~~s~~Ga~I~l~~iP~-~~~~~~~~~l~~~~~~~~l~tv~~~~~~~i~~~~~~~gi~~~vIG~Vt~~~~l~l~~~~ 307 (323) T TIGR03267 229 GMLLEASRVGAEVDLESIPK-PEDVDMVTWLKMYPGSGFVLTADPENVREIVRVLEDAGLTASVIGEVIEDGKLYVSDGD 307 (323) T ss_pred HHHHHHCCCCEEEEHHHCCC-CCCCCHHHHHHHCCCCCEEEEECHHHHHHHHHHHHHCCCCEEEEEEEECCCEEEEEECC T ss_conf 99986448837997531888-88999999998489983899986503999999999879988999999759829999899 Q ss_pred CC--CCCCCCCCCCC Q ss_conf 64--43443210146 Q gi|254780970|r 357 EE--VANLPIKALSD 369 (737) Q Consensus 357 ~~--v~dlp~~~L~~ 369 (737) +. +.|++.+++.+ T Consensus 308 e~~v~~d~~~~~~~g 322 (323) T TIGR03267 308 ESAVVFDFRKDRITG 322 (323) T ss_pred CCEEEEECCCCCCCC T ss_conf 018998877556568 No 14 >TIGR03267 methan_mark_2 putative methanogenesis marker protein 2. A single member of this protein family is found in each of the first ten complete genome sequences of archaeal methanogens, and nowhere else. Sequence similarity to various bacterial proteins is reflected in Pfam models pfam00586 and pfam02769, AIR synthase related protein N-terminal and C-terminal domains, respectively. The functions of proteins in this family are unknown, but their role is likely one essential to methanogenesis. Probab=100.00 E-value=0 Score=382.03 Aligned_cols=295 Identities=22% Similarity=0.253 Sum_probs=239.1 Q ss_pred CHHHHHHHHHHCCCC------CCHHHHHHHCCCCCCCCCCCCCCCCCCEEEECCCCCCEEEEECCCCCCCHHCCCHHHHH Q ss_conf 588999998507321------01035653112100011000013576314203777622797304532100001001135 Q gi|254780970|r 397 DCTKSLLKLLSSPNL------SSRRWVYEQYDTMIQSNSIQLPGGDAGVIRVEGHETKALAFSSDVTPRYVKADPFEGTK 470 (737) Q Consensus 397 d~~~~l~~vL~~pnV------~SK~~i~~qyD~~V~g~tv~~Pg~DaaVv~~~~~~~~gia~s~g~~p~~~~~dP~~ga~ 470 (737) |+.+...++.+++-+ +.++|+|+|||++|+++|+++||+||||+++++ +...++++.+++++++++|||.+|+ T Consensus 1 ~~~~~~~~~~~~~g~~~k~~~~~~~~~~~~~~~~~~~~~vvgpGdDAavi~i~~-~~~~~~~~d~~~~~~~~~dPy~gG~ 79 (323) T TIGR03267 1 NLRELARELRSFEGVTRKHPIKDVVEILEPLDVTYEGNVIVDFGDDAAAIKIGG-DDILLLAADGIWGKLLDADPWWAGY 79 (323) T ss_pred CHHHHHHHHHHCCCCHHCCCHHHHHHHHHHCCCCCCCCEEECCCCCEEEEEECC-CCEEEEEECCCCCCCCCCCHHHHHH T ss_conf 978999999748783100598999999864334558875424798606998789-8589999767788751469789899 Q ss_pred HHHHHHHHHHHCCCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 67897776410278510223221024566686213469999999985221028866544320011236866676311122 Q gi|254780970|r 471 QAVAECWRNIIATGAKPLAITDNLNFGNPEKEEIMGQFVHSVKGIREACQILDFPIVSGNVSFYNETNGQSIFPTPTIAG 550 (737) Q Consensus 471 ~AV~Eal~Nl~a~Ga~Plait~~lnf~~P~~~e~~~ql~~av~gl~d~c~~lgipivgGkvSl~n~t~~~~I~pTpvi~~ 550 (737) .||++++|||+||||+|+++++||+|++|+ . +.+.++||.++|++||+|++||++++... .+++.+++ T Consensus 80 ~aVa~~v~DIaamGA~Pla~ld~L~~~~pe---~---~~~i~~Gi~~a~~~~gvpvVGG~t~~~~~------~~~v~v~v 147 (323) T TIGR03267 80 CAVLVNVNDIAAMGGKPVGMVNVLSINDVD---V---CREVLEGMREGAWKFGVPVVGGHTHPDTP------YNALDVAI 147 (323) T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHCCCCHH---H---HHHHHHHHHHHHHHCCCCEECCCCCCCCC------CCEEEEEE T ss_conf 999999888998087618863022679878---9---99999999999987499776673456888------77058999 Q ss_pred CCCCCCCEEEEECCCCCCCCCEEECCCCCCCCCHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECC Q ss_conf 22333520233102143898075137766554326788886116789996479899999999999999959806994648 Q gi|254780970|r 551 VGILPDYSLMTRIDSAHEGDLILMIGNDGCHLDCSMYSLECASSNIGPPPKVDCHLEKNHGFFVLSMINAKKITACHDIS 630 (737) Q Consensus 551 vG~v~d~~~~it~~~k~~Gd~I~liG~~~~~LggS~~~~~~~~~~~g~~p~~d~~~~k~~~~~v~~li~~g~I~S~HDiS 630 (737) +|.++. .+.++...+++||.|++.+....+.+.+... . .......+.+.++...+.+.++++++++.||||+| T Consensus 148 vGiv~~-~~~i~~~~a~~GD~ii~~~~l~g~~~~~~~~----~--~~~~~~~~~~~~~~~~~~~~~l~~~~lv~a~~DiS 220 (323) T TIGR03267 148 VGIAKE-DCIIRSDTAKPGDLIIFAIDLDGRPYPSFPL----N--WDTTTMKSPDYLRAQMDAVVEIAERKLVKAGKDIS 220 (323) T ss_pred EEEECC-CCEEECCCCCCCCEEEEEECCCCCCCCCCCC----C--CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 998741-1145447899999999996467611645534----4--31013467477788899899999866787664126 Q ss_pred CCHHHHHHHHHHHCCCCCEEEEECCCC---CCCHHHHHHCCCCCCEEEEECHHHHHHHHHHHHHCCCCEEEEEEECCC-E Q ss_conf 657999999998458970899805666---898788872988321899996880998999998579849998998798-6 Q gi|254780970|r 631 TGGLIITLAEMTISSAKGMDIILPIEI---EKDPKPFLFGEDQGRYVVCISPENQDLVMSEANNKNIPLRYLGKVSGS-T 706 (737) Q Consensus 631 dGGL~~aL~EMa~as~~G~~I~L~~~~---~~d~~~~LFsEs~Gr~ii~V~~~~~~~~~~~~~~~gi~~~~IG~Vt~~-~ 706 (737) +|||+.+|.|||.+|++|++|+++..+ ..++.++||+|+++||+++|++++.+++++++++.++++++||+||++ . T Consensus 221 ~gGL~~~l~em~~~s~~Ga~I~l~~iP~~~~~~~~~~l~~~~~~~~l~tv~~~~~~~i~~~~~~~gi~~~vIG~Vt~~~~ 300 (323) T TIGR03267 221 NPGLIGTLGMLLEASRVGAEVDLESIPKPEDVDMVTWLKMYPGSGFVLTADPENVREIVRVLEDAGLTASVIGEVIEDGK 300 (323) T ss_pred CCCHHHHHHHHHHHCCCCEEEEHHHCCCCCCCCHHHHHHHCCCCCEEEEECHHHHHHHHHHHHHCCCCEEEEEEEECCCE T ss_conf 47167799999864488379975318888899999999848998389998650399999999987998899999975982 Q ss_pred EEEEC Q ss_conf 89807 Q gi|254780970|r 707 LSIHN 711 (737) Q Consensus 707 l~i~~ 711 (737) +.+.+ T Consensus 301 l~l~~ 305 (323) T TIGR03267 301 LYVSD 305 (323) T ss_pred EEEEE T ss_conf 99998 No 15 >TIGR01736 FGAM_synth_II phosphoribosylformylglycinamidine synthase II; InterPro: IPR010074 Phosphoribosylformylglycinamidine synthase is a single, long polypeptide in most bacteria. Three proteins are required in Bacillus subtilis and many other species. This is the longest of the three and is designated PurL, phosphoribosylformylglycinamidine synthase II, or FGAM synthase II.; GO: 0004642 phosphoribosylformylglycinamidine synthase activity, 0006189 'de novo' IMP biosynthetic process, 0005737 cytoplasm. Probab=100.00 E-value=0 Score=375.17 Aligned_cols=305 Identities=24% Similarity=0.409 Sum_probs=258.2 Q ss_pred CHHHH-HHHHHC-CCCEEECCCC---------CEEEEEECCCE----EEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 88999-751206-9748993347---------73689836983----999995178877765888872105581022576 Q gi|254780970|r 53 SKKWL-RTLPTT-GKHVIQGPGE---------NAGVVDIGGGD----CVVFKMESHNHPSYIEPYQGAATGVGGILRDIF 117 (737) Q Consensus 53 ~~~~l-~~~~~~-~~~vi~~~~~---------~a~vi~~~~~~----~~~~k~EtHNhPs~i~P~~GAaTg~gG~iRDi~ 117 (737) ||.|+ +|+..+ .=++|+.||+ ||+||++.+.. .|++|.+.-=-=++++||.||+.-|.=.+|+++ T Consensus 417 sK~wvY~QYDheVq~rTVvkPG~G~~N~ETH~DAAVlr~~~~~~~~~glA~t~D~NPr~~~~dPy~Ga~~~vaEa~RNLa 496 (763) T TIGR01736 417 SKEWVYRQYDHEVQTRTVVKPGEGKMNKETHADAAVLRIKETGKLDKGLALTADCNPRYVYLDPYAGAAAAVAEAYRNLA 496 (763) T ss_pred HCCCEEEEECEEECCEEEEECCCCCCCCCCCCCEEEEEECCCCCCCCEEEEEECCCCCCEEECHHHHHHHHHHHHHHCEE T ss_conf 15413765110673257760788877741115413466405687752789970787660331603558897774311256 Q ss_pred CCCCEEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEECCCCC-------CCCCCEEEECCCC-H Q ss_conf 7787436877504578756863888998887458866543087211320355215676-------6422112211211-3 Q gi|254780970|r 118 TMGARPVAAMNSLRFGAIHHPKTKHLLSGVVAGIAGYSNSFGVPTVGGEVEFLPCYNN-------NIIVNTFAAGIAK-T 189 (737) Q Consensus 118 ~~Ga~p~a~~~~l~~g~~~~p~~~~~~~~~~~G~~~ygN~~G~P~v~G~~~~~~~y~~-------~~~v~~~~~Gi~~-~ 189 (737) |+||+|+|++||||||||++|+.+|+|.++++|+++.|..|++|+|||||||+..|++ +|.|.++.+|+++ . T Consensus 497 ~vGA~PLa~vDcLNFGNPErPEvywQf~~~v~Gl~daa~~l~~PVVgGNVSLYNE~~~~~~~~~i~PTP~i~~VG~~~Dv 576 (763) T TIGR01736 497 AVGAEPLAIVDCLNFGNPERPEVYWQFVEAVKGLGDAARALGTPVVGGNVSLYNETNGEDAKVPIAPTPTIGMVGLVEDV 576 (763) T ss_pred EECCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEECCEECCCCCCCCCCCCCCCCCCEEEEEEEECCC T ss_conf 40875488884056898887036789999999999999863889882110002310378724353785158999884274 Q ss_pred HHHHH-CCCCCCCCEEEEECCCCCC-CCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCHHCCCC Q ss_conf 55431-1225689779996576564-441046666777286533344433267989998889998740113830110347 Q gi|254780970|r 190 NAIFS-SKARGIGLPLVYLGAKTGR-DGIGGASMASEEFGENIAKKRPTVQVGDPFTGKCLLEACLELMNTDAVIAIQDM 267 (737) Q Consensus 190 ~~~~~-~~~~~~Gd~ii~~G~~tg~-dGi~Ga~~sS~~~~~~~~~~~~~VQ~gdp~~ek~~~~~~~~~~~~~~i~~~~D~ 267 (737) +++.+ ...++.||.|+++|.+ + +.|||+-.....++.... ..|+| |...||++.|++++++++++|.++||+ T Consensus 577 ~~~~~~~~~~~~Gd~~~l~Get--~~~ElGGS~Y~~~~~g~~~G-~~P~v---Dl~~e~~~~d~v~~~i~~g~v~~ahDv 650 (763) T TIGR01736 577 EKLLTSLNFKKEGDAIYLIGET--KKDELGGSEYLKVVHGIVSG-QVPAV---DLEEEKELADAVREAIREGLVSAAHDV 650 (763) T ss_pred CCCCCCCHHHHCCCEEEEECCC--CCCCCCHHHHHHHHCCCCCC-CCCCC---CHHHHHHHHHHHHHHHHCCCCEEEEEE T ss_conf 2125642054479769993143--23666476787777884478-68887---868889999999999740682055443 Q ss_pred CCCCHHHHHHHHHCCCCCCEEEEEHHHCCCCCCCCHHHHHHCCCCCEEEEEEECCCH-HHHHHHHHHHCCEEECCCCEEE Q ss_conf 752011122332014775289962022046678999898617888508999954675-8899988870971311221210 Q gi|254780970|r 268 GAAGLTCSAIEMGNQGNLGITLDLDKVPTCEEGMTAYEMMLSESQERMLMILNPEKQ-HKAQEILNKWGLHFSIIGITTN 346 (737) Q Consensus 268 gaGGl~~a~~Ema~~~~~G~~i~l~~vp~~~~~l~~~ei~~sEsQeR~~~~v~~~~~-~~~~~i~~~~~~~~~~iG~vt~ 346 (737) |.|||+.||+||+..+|+|++|++++++-. ...+.+.||||||.|++++|+.++. .+..+.|+..++++.+||+.|. T Consensus 651 S~GGLAvaLAem~~~~G~Ga~~~~~~~~~~--~~r~~~~LFSEs~gR~iv~~~~~~~v~~~~~ff~~~~~~a~~~G~~~~ 728 (763) T TIGR01736 651 SRGGLAVALAEMAIASGIGAEVDIDEIASE--TARADELLFSESNGRAIVAVREEKAVEEAVKFFESKGVPAKVIGEKTG 728 (763) T ss_pred CCCHHHHHHHHHHHHCCCCEEEEECCCCCC--HHHHHHHHCCCCCCCEEEEECCHHHHHHHHHHHHCCCCEEEEEEEEEC T ss_conf 352389999999974486259972578850--000268870677886699957979999999873123650677747864 Q ss_pred CCCCCCCCCCC--CCCCCCCC Q ss_conf 44310002586--44344321 Q gi|254780970|r 347 DKLFRVIHRGE--EVANLPIK 365 (737) Q Consensus 347 ~~~~~v~~~g~--~v~dlp~~ 365 (737) ..++++...+. ..+++.++ T Consensus 729 ~~~~~~~~~~~~~~~~~~~~~ 749 (763) T TIGR01736 729 GEKLTIKTGDFKLDEISVSVK 749 (763) T ss_pred CCEEEEEECCCEEEEEEECHH T ss_conf 863798516730344320178 No 16 >cd02204 PurL_repeat2 PurL subunit of the formylglycinamide ribonucleotide amidotransferase (FGAR-AT), second repeat. FGAR-AT catalyzes the ATP-dependent conversion of formylglycinamide ribonucleotide (FGAR) and glutamine to formylglycinamidine ribonucleotide (FGAM), ADP, phosphate, and glutamate in the fourth step of the purine biosynthetic pathway. In eukaryotes and Gram-negative bacteria, FGAR-AT is encoded by the purL gene as a multidomain protein with a molecular mass of about 140 kDa. In Gram-positive bacteria and archaea FGAR-AT is a complex of three proteins: PurS, PurL, and PurQ. PurL itself contains two tandem N- and C-terminal domains (four domains altogether). The N-terminal domains bind ATP and are related to the ATP-binding domains of HypE, ThiL, SelD and PurM. Probab=100.00 E-value=0 Score=346.70 Aligned_cols=257 Identities=25% Similarity=0.389 Sum_probs=223.4 Q ss_pred CEEEEEEC--CCEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCC-HHHHHHHHHHH Q ss_conf 73689836--9839999951788777658888721055810225767787436877504578756863-88899888745 Q gi|254780970|r 74 NAGVVDIG--GGDCVVFKMESHNHPSYIEPYQGAATGVGGILRDIFTMGARPVAAMNSLRFGAIHHPK-TKHLLSGVVAG 150 (737) Q Consensus 74 ~a~vi~~~--~~~~~~~k~EtHNhPs~i~P~~GAaTg~gG~iRDi~~~Ga~p~a~~~~l~~g~~~~p~-~~~~~~~~~~G 150 (737) ||+|+++. .+..+++++..+.+.+.++||.||+++|++.+||++++||+|+++++++||++|++|+ .+|++.++++| T Consensus 1 da~V~~~~~~~~~g~a~s~g~~p~~~~~dP~~ga~~aV~Ea~rni~~~Ga~p~ai~~~lnf~~p~~~~~~~~~l~~av~g 80 (264) T cd02204 1 DAAVLRIPGETDKGLAMSTGENPRYSLLDPYAGAALAVAEAVRNLVAVGADPLAITDCLNFGNPEKPEGEMGQLVEAVLG 80 (264) T ss_pred CCEEEEECCCCCEEEEEECCCCCCCEEECHHHHHHHHHHHHHHHHHCCCCCHHHHEECHHCCCCCCCCCHHHHHHHHHHH T ss_conf 95599863899859999747999614348999999999999988752698389951001102877785078889999999 Q ss_pred HHHHHHHCCCCCCCCEEEEECCCCC---CCCCCEEEECCCCH-HHHHHCCCCCCCCEEEEECCCCCCCCCHHHHHHHHHH Q ss_conf 8866543087211320355215676---64221122112113-5543112256897799965765644410466667772 Q gi|254780970|r 151 IAGYSNSFGVPTVGGEVEFLPCYNN---NIIVNTFAAGIAKT-NAIFSSKARGIGLPLVYLGAKTGRDGIGGASMASEEF 226 (737) Q Consensus 151 ~~~ygN~~G~P~v~G~~~~~~~y~~---~~~v~~~~~Gi~~~-~~~~~~~~~~~Gd~ii~~G~~tg~dGi~Ga~~sS~~~ 226 (737) +++|++++|+|+|+|+++|+..+.. +|.+.++++|+++. +..++...+++||.|+++|.++++. +|+.++...+ T Consensus 81 ~~~~~~~lgiP~v~G~~Sl~n~~~~~~ippTlvi~~vg~v~~~~~~~t~~~k~~g~~i~lig~~~~~~--ggs~~~~~~~ 158 (264) T cd02204 81 LGDACRALGTPVIGGKDSLYNETEGVAIPPTLVIGAVGVVDDVRKIVTLDFKKEGDLLYLIGETKDEL--GGSEYALAYH 158 (264) T ss_pred HHHHHHHCCCCEECCEEEEECCCCCCCCCCCEEEEEEEECCCCCEEEHHHHCCCCCEEEEECCCCCCC--CCCHHHHHHC T ss_conf 99999970996203446651466786678852899997325730024154347998799957884557--8779999970 Q ss_pred CCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCHHCCCCCCCCHHHHHHHHHCCCCCCEEEEEHHHCCCCCCCCHHHH Q ss_conf 86533344433267989998889998740113830110347752011122332014775289962022046678999898 Q gi|254780970|r 227 GENIAKKRPTVQVGDPFTGKCLLEACLELMNTDAVIAIQDMGAAGLTCSAIEMGNQGNLGITLDLDKVPTCEEGMTAYEM 306 (737) Q Consensus 227 ~~~~~~~~~~VQ~gdp~~ek~~~~~~~~~~~~~~i~~~~D~gaGGl~~a~~Ema~~~~~G~~i~l~~vp~~~~~l~~~ei 306 (737) +.. ...++..||+.+|++++++++++++++|.++||+|+|||++++.||+..+++|++|+|++++ +++|+ T Consensus 159 ~~~----~~~~~~~d~~~~k~~~~~i~~l~~~~li~s~hDvS~GGL~~al~EMa~~~~~G~~i~l~~~~------~~~~~ 228 (264) T cd02204 159 GLG----GGAPPLVDLEREKALFDAVQELIKEGLVLSAHDVSDGGLAVALAEMAFAGGLGAEVDLSKDD------AEDEL 228 (264) T ss_pred CCC----CCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHCCCCEEEEECCCCC------CHHHH T ss_conf 878----89856589999999999998998659612787678781999999998428965999838999------82575 Q ss_pred HHCCCCCEEEEEEECCCHHHHHHHHHHHCCEEECCCCE Q ss_conf 61788850899995467588999888709713112212 Q gi|254780970|r 307 MLSESQERMLMILNPEKQHKAQEILNKWGLHFSIIGIT 344 (737) Q Consensus 307 ~~sEsQeR~~~~v~~~~~~~~~~i~~~~~~~~~~iG~v 344 (737) |+||+|+||+++|++++.+.|+ ++.+++++.+||+| T Consensus 229 LFsEs~gr~vv~v~~~~~~~~~--~~~~~v~~~~IG~v 264 (264) T cd02204 229 LFSESLGRVLVEVKPENEEVFE--AEEAGVPATVIGTV 264 (264) T ss_pred EECCCCCEEEEEECCCHHHHHH--HHHCCCCEEEEEEC T ss_conf 0378888699998724299999--98889996998769 No 17 >cd02192 PurM-like3 AIR synthase (PurM) related protein, subgroup 3 of unknown function. The family of PurM related proteins includes Hydrogen expression/formation protein HypE, AIR synthases, FGAM synthase and Selenophosphate synthetase (SelD). They all contain two conserved domains and seem to dimerize. The N-terminal domain forms the dimer interface and is a putative ATP binding domain. Probab=100.00 E-value=2.8e-45 Score=329.78 Aligned_cols=265 Identities=20% Similarity=0.267 Sum_probs=217.3 Q ss_pred HHHHHHHHCCCCEEECCCCCEEEEEECCCEEEEEEECCC-CCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEECCCC Q ss_conf 999751206974899334773689836983999995178-8777658888721055810225767787436877504578 Q gi|254780970|r 55 KWLRTLPTTGKHVIQGPGENAGVVDIGGGDCVVFKMESH-NHPSYIEPYQGAATGVGGILRDIFTMGARPVAAMNSLRFG 133 (737) Q Consensus 55 ~~l~~~~~~~~~vi~~~~~~a~vi~~~~~~~~~~k~EtH-NhPs~i~P~~GAaTg~gG~iRDi~~~Ga~p~a~~~~l~~g 133 (737) .+|+.+|+.++.|+.|+||||+|+++++++ ++++.+++ .++...+||.++.+++.+++|||++|||+|++++++|.++ T Consensus 18 ~~l~~~~~~~~~V~~giGDDaAVi~~~~~~-~v~~td~~~~~~~~~dP~~~G~~ava~nlsDIaAMGa~P~~~l~~l~~p 96 (283) T cd02192 18 AILPDAPFDSLGVAADLGDDAAAIPDGDGY-LLLAADGIWPSLVEADPWWAGYCSVLVNVSDIAAMGGRPLAMVDALWSP 96 (283) T ss_pred HHCCCCCCCCCCCCCCCCCCEEEEEECCCE-EEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHCCC T ss_conf 863338998888575799765899708974-9999785468876679799999999988888987077679886442589 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEECCCCCCCCCCEEEECCCCHHHHHHCCCCCCCCEEEEECCCCCC Q ss_conf 75686388899888745886654308721132035521567664221122112113554311225689779996576564 Q gi|254780970|r 134 AIHHPKTKHLLSGVVAGIAGYSNSFGVPTVGGEVEFLPCYNNNIIVNTFAAGIAKTNAIFSSKARGIGLPLVYLGAKTGR 213 (737) Q Consensus 134 ~~~~p~~~~~~~~~~~G~~~ygN~~G~P~v~G~~~~~~~y~~~~~v~~~~~Gi~~~~~~~~~~~~~~Gd~ii~~G~~tg~ 213 (737) +. ..++++.+|++++++++|+|+|||+|.++..| +.+.+.++|+++.+.+.+. ..++||.|++.|..+|+ T Consensus 97 ~~------e~~~~i~~Gi~~~~~~~gv~lVGGdT~~~~~~---~~i~vt~~G~~~~~~~~~~-gAk~GD~i~vt~~l~g~ 166 (283) T cd02192 97 SA------EAAAQVLEGMRDAAEKFGVPIVGGHTHPDSPY---NALSVAILGRARKDLLISF-GAKPGDRLILAIDLDGR 166 (283) T ss_pred CH------HHHHHHHHHHHHHHHHCCCCEECCCCCCCCCC---CEEEEEEEEECCCCCEECC-CCCCCCEEEEEEECCCC T ss_conf 88------99999999999999875985765544567888---7368999974077743227-89999989999866753 Q ss_pred CCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCHHCCCCCCCCHHHHHHHHHCCCCCCEEEEEHH Q ss_conf 44104666677728653334443326798999888999874011383011034775201112233201477528996202 Q gi|254780970|r 214 DGIGGASMASEEFGENIAKKRPTVQVGDPFTGKCLLEACLELMNTDAVIAIQDMGAAGLTCSAIEMGNQGNLGITLDLDK 293 (737) Q Consensus 214 dGi~Ga~~sS~~~~~~~~~~~~~VQ~gdp~~ek~~~~~~~~~~~~~~i~~~~D~gaGGl~~a~~Ema~~~~~G~~i~l~~ 293 (737) . ++.. ....+ +.+..++..-|..++.+.++.+++++++|||+++|||.+++.||+++|++|++|++++ T Consensus 167 ~--~~~~----~~~~~------~~~~~~~~~~~~~~~~~~~l~~~~~~~A~~DiSdgGL~~~l~eia~aSgvGa~I~~~~ 234 (283) T cd02192 167 V--HPSP----PPNWD------ATTMKSPALLRRQIALLPELAERGLVHAAKDISNPGIIGTLGMLLEASGVGAEIDLDA 234 (283) T ss_pred C--CCCC----CCCCC------CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCEEEECCC T ss_conf 0--5344----23423------1012467777768899999998647888862131127889999999769809997023 Q ss_pred HCCCCCCCCHHHHHHCCCCCEEEEEEECCCHHHHHHHHHHHCCEEECCCC Q ss_conf 20466789998986178885089999546758899988870971311221 Q gi|254780970|r 294 VPTCEEGMTAYEMMLSESQERMLMILNPEKQHKAQEILNKWGLHFSIIGI 343 (737) Q Consensus 294 vp~~~~~l~~~ei~~sEsQeR~~~~v~~~~~~~~~~i~~~~~~~~~~iG~ 343 (737) ||+ .+++.+.+.|.+|+|+||+++|+|++.++++++|+++++++++||| T Consensus 235 iP~-~~~~~~~~~L~~~~~~~ll~tv~pe~~~~v~~~~~~~gv~~~~IGE 283 (283) T cd02192 235 IPR-PEGVDLERWLKCFPGFGFLLTARPENADEVVAVFAAVGITAAVIGE 283 (283) T ss_pred CCC-CCCCCHHHHHHCCCCCCEEEEECHHHHHHHHHHHHHCCCCEEEECC T ss_conf 878-8999989998719986159998851399999999986998898395 No 18 >PRK01213 phosphoribosylformylglycinamidine synthase II; Provisional Probab=100.00 E-value=4.8e-44 Score=320.93 Aligned_cols=289 Identities=26% Similarity=0.423 Sum_probs=242.7 Q ss_pred CCCEEECCCCCEEEEEECC-CEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCHHH Q ss_conf 9748993347736898369-839999951788777658888721055810225767787436877504578756863888 Q gi|254780970|r 64 GKHVIQGPGENAGVVDIGG-GDCVVFKMESHNHPSYIEPYQGAATGVGGILRDIFTMGARPVAAMNSLRFGAIHHPKTKH 142 (737) Q Consensus 64 ~~~vi~~~~~~a~vi~~~~-~~~~~~k~EtHNhPs~i~P~~GAaTg~gG~iRDi~~~Ga~p~a~~~~l~~g~~~~p~~~~ 142 (737) +-.+++.|+-||+|+++.+ ...+++.+.-.-.-+.++||.||.--|.-.+|++.++||+|+|++|+||||+|++|+.+| T Consensus 429 ~g~TV~~P~~DaaVi~i~~s~~g~a~s~g~~p~~~~~dP~~ga~~aV~Ea~rNlva~Ga~plai~d~lnfgnPe~pe~~~ 508 (736) T PRK01213 429 QGNTVVKPGGDAAVVRIEGTGKGLAMTTDCNPRYVYLDPYEGAKLAVAEAYRNVAAVGAEPLAITDCLNFGNPEKPEVMW 508 (736) T ss_pred CCCCCCCCCCCEEEEEECCCCEEEEEECCCCCHHHCCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHCCCCCCHHHHH T ss_conf 47713468987179985589758999558785152069999999999999977886178613221253238977877878 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCEEEEECCCCC---CCCCCEEEECCC-CHHHHHHCCCCCCCCEEEEECCCCCCCCCHH Q ss_conf 998887458866543087211320355215676---642211221121-1355431122568977999657656444104 Q gi|254780970|r 143 LLSGVVAGIAGYSNSFGVPTVGGEVEFLPCYNN---NIIVNTFAAGIA-KTNAIFSSKARGIGLPLVYLGAKTGRDGIGG 218 (737) Q Consensus 143 ~~~~~~~G~~~ygN~~G~P~v~G~~~~~~~y~~---~~~v~~~~~Gi~-~~~~~~~~~~~~~Gd~ii~~G~~tg~dGi~G 218 (737) .|.++++|+++++-++|+|+++|++++...+.+ +|.+.+.++|.+ +.++.++...+++||.|+++|. .++.+|| T Consensus 509 ~l~~av~g~~dac~~l~ipvv~GkdSlyne~~~~~I~PTpvI~~vg~v~Dv~~~vt~~~k~~g~~i~lig~--~~~~LGG 586 (736) T PRK01213 509 QFVEAVRGIAEACRALGTPVVGGNVSLYNETNGTAIYPTPVIGMVGLIDDVTKAITSGFKEEGDLIYLLGE--TKGELGG 586 (736) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCEECCCCCCCCCCCCCCCEEEEEEEEECCHHCCCCCCCCCCCEEEEECC--CCCCCCC T ss_conf 89999999999999819996757537736669935787515998777506000237333578974999717--8776562 Q ss_pred HHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCHHCCCCCCCCHHHHHHHHHCCCCCCEEEEEHHHCCCC Q ss_conf 66667772865333444332679899988899987401138301103477520111223320147752899620220466 Q gi|254780970|r 219 ASMASEEFGENIAKKRPTVQVGDPFTGKCLLEACLELMNTDAVIAIQDMGAAGLTCSAIEMGNQGNLGITLDLDKVPTCE 298 (737) Q Consensus 219 a~~sS~~~~~~~~~~~~~VQ~gdp~~ek~~~~~~~~~~~~~~i~~~~D~gaGGl~~a~~Ema~~~~~G~~i~l~~vp~~~ 298 (737) +.++...++... ...|.| |+..+|++.+++.+++++++|.++||++.|||+.|+.||+..+++|++|+|+. T Consensus 587 S~~~~~~~~~~~-~~~P~v---d~~~~~~~~~~i~~li~~~~I~s~HDiSdGGL~~aLaEMaf~~~~G~~i~l~~----- 657 (736) T PRK01213 587 SEYLKVVHGRVG-GRPPAV---DLAAEKRLGELLREAIRAGLVSSAHDVSDGGLAVALAEMAIASGLGATVDLSS----- 657 (736) T ss_pred CHHHHHHHCCCC-CCCCCC---CHHHHHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHCCCCCEEEECCC----- T ss_conf 199999727558-989987---99999999999999997798699988861589999999985699737996288----- Q ss_pred CCCCHHHHHHCCCCCEEEEEEECCCHHHHHHHHHHHCCEEECCCCEEECCCCCCCCCCCCCCCCCCCCC Q ss_conf 789998986178885089999546758899988870971311221210443100025864434432101 Q gi|254780970|r 299 EGMTAYEMMLSESQERMLMILNPEKQHKAQEILNKWGLHFSIIGITTNDKLFRVIHRGEEVANLPIKAL 367 (737) Q Consensus 299 ~~l~~~ei~~sEsQeR~~~~v~~~~~~~~~~i~~~~~~~~~~iG~vt~~~~~~v~~~g~~v~dlp~~~L 367 (737) ...++.++||||+|.|+++.|++++.++|++++++.++++.+||+|+++..+. ++.+++++++-| T Consensus 658 ~~~~~~~~LFsE~~gr~vveV~~~~~~~~~~~~~~~gi~~~~IG~v~~~~~i~----~~~~~~~~~~eL 722 (736) T PRK01213 658 DGGRPDALLFSESQGRYVVSVPPENEEAFEELAEALGVPATVIGVVGGDDAVL----GQGLISLSVEEL 722 (736) T ss_pred CCCCHHHHHCCCCCCEEEEEECHHHHHHHHHHHHHCCCCEEEEEEECCCCEEE----CCCCEEEEHHHH T ss_conf 88876776128888659999858899999999987799889988998855395----177224159999 No 19 >cd02203 PurL_repeat1 PurL subunit of the formylglycinamide ribonucleotide amidotransferase (FGAR-AT), first repeat. FGAR-AT catalyzes the ATP-dependent conversion of formylglycinamide ribonucleotide (FGAR) and glutamine to formylglycinamidine ribonucleotide (FGAM), ADP, phosphate, and glutamate in the fourth step of the purine biosynthetic pathway. In eukaryotes and Gram-negative bacteria, FGAR-AT is encoded by the purL gene as a multidomain protein with a molecular mass of about 140 kDa. In Gram-positive bacteria and archaea FGAR-AT is a complex of three proteins: PurS, PurL, and PurQ. PurL itself contains two tandem N- and C-terminal domains (four domains altogether). The N-terminal domains bind ATP and are related to the ATP-binding domains of HypE, ThiL, SelD and PurM. Probab=100.00 E-value=1.4e-42 Score=310.84 Aligned_cols=260 Identities=23% Similarity=0.364 Sum_probs=226.5 Q ss_pred CEEEEECCCCCCCHHCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHCCCCCCCCCC--------HHHHHHHHHHHHHHHH Q ss_conf 22797304532100001001135678977764102785102232210245666862--------1346999999998522 Q gi|254780970|r 448 KALAFSSDVTPRYVKADPFEGTKQAVAECWRNIIATGAKPLAITDNLNFGNPEKEE--------IMGQFVHSVKGIREAC 519 (737) Q Consensus 448 ~gia~s~g~~p~~~~~dP~~ga~~AV~Eal~Nl~a~Ga~Plait~~lnf~~P~~~e--------~~~ql~~av~gl~d~c 519 (737) .++++++....+.+.++||.||+..|+..+||+.+|||+|++++++|||++|+.++ .+|.+.++++|+++++ T Consensus 26 ~~~~~k~EsHNhPSai~Py~GAaTGvGGiiRDi~~~GA~pia~~d~l~fG~~~~~~~~~~~~~~~~~i~~~vv~GisdyG 105 (313) T cd02203 26 WAVVFKVETHNHPSAIEPFGGAATGVGGIIRDILSMGARPIALLDGLRFGDLDIPGYEPKGKLSPRRILDGVVAGISDYG 105 (313) T ss_pred HEEEEEECCCCCCCCCCCCCCCCCCCCCEECCCCCCCCEEEEEEEEEEECCCCCCCCCCCCCCCHHHHHHHHCCHHHHCC T ss_conf 70288834678988678878501045630025456686677765465206998876555456689899877513566538 Q ss_pred HHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCCEEECCCC--CCCCCHHHHHHHHHCC--- Q ss_conf 10288665443200112368666763111222233352023310214389807513776--6554326788886116--- Q gi|254780970|r 520 QILDFPIVSGNVSFYNETNGQSIFPTPTIAGVGILPDYSLMTRIDSAHEGDLILMIGND--GCHLDCSMYSLECASS--- 594 (737) Q Consensus 520 ~~lgipivgGkvSl~n~t~~~~I~pTpvi~~vG~v~d~~~~it~~~k~~Gd~I~liG~~--~~~LggS~~~~~~~~~--- 594 (737) +.+|+|+++|.++||..+.. +|.+.++++|.++. +++++...+++||+|+++|.. +..++|+.++...... T Consensus 106 N~~GvP~v~G~~~f~~~y~~---nPlv~~~~vG~v~~-~~~~~~~~~~~G~~iv~vGg~tgrdGigGasfsS~~~~~~~~ 181 (313) T cd02203 106 NCIGIPTVGGEVRFDPSYYG---NPLVNVGCVGIVPK-DHIVKSKAPGPGDLVVLVGGRTGRDGIGGATFSSKELSENSS 181 (313) T ss_pred CCCCCCEEEEEEEECCCCCC---CCEEEEEEEEEEEH-HHHCCCCCCCCCCEEEEEECCCCCCCCCCEEEEEHHHCCCCC T ss_conf 65576235579998665467---84378754775337-770256689999889999078666652210354032046643 Q ss_pred --CCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHCCCCCEEEEECCCC----CCCHHHHHHCC Q ss_conf --789996479899999999999999959806994648657999999998458970899805666----89878887298 Q gi|254780970|r 595 --NIGPPPKVDCHLEKNHGFFVLSMINAKKITACHDISTGGLIITLAEMTISSAKGMDIILPIEI----EKDPKPFLFGE 668 (737) Q Consensus 595 --~~g~~p~~d~~~~k~~~~~v~~li~~g~I~S~HDiSdGGL~~aL~EMa~as~~G~~I~L~~~~----~~d~~~~LFsE 668 (737) ....++.-|+..+|++++++++++++++|.|+||++.|||+.++.|||+.+++|++|+|+..+ ..+|++.|||| T Consensus 182 ~~~~~aVQ~gdP~~ek~l~e~~le~~~~~li~~ihD~GaGGl~~a~~Ema~~~~~G~~i~Ld~vp~~~~~m~p~Eil~SE 261 (313) T cd02203 182 ELDRPAVQVGDPFMEKKLQEAILEARETGLIVGIQDLGAGGLSSAVSEMAAKGGLGAEIDLDKVPLREPGMSPWEIWISE 261 (313) T ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCHHHEEEEC T ss_conf 34655432286688889999999888549868888658850654237776148973999903470577999864713305 Q ss_pred CCCCEEEEECHHHHHHHHHHHHHCCCCEEEEEEECCC-EEEEEC Q ss_conf 8321899996880998999998579849998998798-689807 Q gi|254780970|r 669 DQGRYVVCISPENQDLVMSEANNKNIPLRYLGKVSGS-TLSIHN 711 (737) Q Consensus 669 s~Gr~ii~V~~~~~~~~~~~~~~~gi~~~~IG~Vt~~-~l~i~~ 711 (737) +|.|++++|++++.++|++++++.++++.+||+||++ .+.+.. T Consensus 262 SQERm~~~v~~~~~~~~~~i~~k~~~~~~vIG~vt~~~~~~v~~ 305 (313) T cd02203 262 SQERMLLVVPPEDLEEFLAICKKEDLEAAVIGEVTDDGRLRLYY 305 (313) T ss_pred CHHCEEEEECHHHHHHHHHHHHHCCCCEEEEEEEECCCEEEEEE T ss_conf 61108999886469999999998799979999993698699999 No 20 >cd06061 PurM-like1 AIR synthase (PurM) related protein, subgroup 1 of unknown function. The family of PurM related proteins includes Hydrogen expression/formation protein HypE, AIR synthases, FGAM synthase and Selenophosphate synthetase (SelD). They all contain two conserved domains and seem to dimerize. The N-terminal domain forms the dimer interface and is a putative ATP binding domain. Probab=100.00 E-value=2.8e-40 Score=295.05 Aligned_cols=274 Identities=24% Similarity=0.385 Sum_probs=217.7 Q ss_pred HHHHHHHHHCCCCEEECC--CCCEEEEEECCCEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEECC Q ss_conf 899975120697489933--477368983698399999517887776588887210558102257677874368775045 Q gi|254780970|r 54 KKWLRTLPTTGKHVIQGP--GENAGVVDIGGGDCVVFKMESHNHPSYIEPYQGAATGVGGILRDIFTMGARPVAAMNSLR 131 (737) Q Consensus 54 ~~~l~~~~~~~~~vi~~~--~~~a~vi~~~~~~~~~~k~EtHNhPs~i~P~~GAaTg~gG~iRDi~~~Ga~p~a~~~~l~ 131 (737) |..++.++++.++|+.|| ||||+|+++++++.+ + |++|...+||..+...+...++||.+|||+|+++++++. T Consensus 12 ~~~~~~l~~~~~~v~vg~~~gdDaavi~~~~~~lv-v----ttDp~~~~~~~~G~~ava~~~sDI~amGa~P~~~l~~l~ 86 (298) T cd06061 12 RLILKNLGADRDEVLVGPGGGEDAAVVDFGGKVLV-V----STDPITGAGKDAGWLAVHIAANDIATSGARPRWLLVTLL 86 (298) T ss_pred HHHHHHCCCCCCCEEECCCCCCEEEEEEECCCEEE-E----ECCCCCCCCHHHHHHHHHHHHHHHHHCCCEEHHHHHHEE T ss_conf 99861279999888778999852599986898799-9----848830581369999999879999985998288761211 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEECCCCCCCCCCEEEECCCCHHHHHHCCCCCCCCEEEEECCCC Q ss_conf 78756863888998887458866543087211320355215676642211221121135543112256897799965765 Q gi|254780970|r 132 FGAIHHPKTKHLLSGVVAGIAGYSNSFGVPTVGGEVEFLPCYNNNIIVNTFAAGIAKTNAIFSSKARGIGLPLVYLGAKT 211 (737) Q Consensus 132 ~g~~~~p~~~~~~~~~~~G~~~ygN~~G~P~v~G~~~~~~~y~~~~~v~~~~~Gi~~~~~~~~~~~~~~Gd~ii~~G~~t 211 (737) |+.... ...+.++++|++++++++|+|+|||+|.+.+.... +.+.+.++|+++.+.+.+....++||.|++.| T Consensus 87 ~p~~~~---~~~l~~i~~Gi~~~~~~~g~~lvGGdT~~~~~~~~-~~i~~t~~G~~~~~~~~~~~~ak~GD~Iiltg--- 159 (298) T cd06061 87 LPPGTD---EEELKAIMREINEAAKELGVSIVGGHTEVTPGVTR-PIISVTAIGKGEKDKLVTPSGAKPGDDIVMTK--- 159 (298) T ss_pred CCCCCC---HHHHHHHHHHHHHHHHHHCCCEECCEEEECCCCCC-CEEEEEEEEEECCCCEECCCCCCCCCEEEEEC--- T ss_conf 889999---99999999999999998399564431586478777-67999999995688663067799999999989--- Q ss_pred CCCCCHHHHHHHHHHCCCCCCCCCC--CCCCCH-HHHHHHHHHHHHHCCCCCCHHCCCCCCCCHHHHHHHHHCCCCCCEE Q ss_conf 6444104666677728653334443--326798-9998889998740113830110347752011122332014775289 Q gi|254780970|r 212 GRDGIGGASMASEEFGENIAKKRPT--VQVGDP-FTGKCLLEACLELMNTDAVIAIQDMGAAGLTCSAIEMGNQGNLGIT 288 (737) Q Consensus 212 g~dGi~Ga~~sS~~~~~~~~~~~~~--VQ~gdp-~~ek~~~~~~~~~~~~~~i~~~~D~gaGGl~~a~~Ema~~~~~G~~ 288 (737) ..|++|+.+.+..+.+........ .+.... +....+++..+ +..+..+++|||+++|||++++.||+.++++|++ T Consensus 160 -~~G~~~~~l~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~v~a~~Dis~gGL~~~L~e~~~~s~vg~~ 237 (298) T cd06061 160 -GAGIEGTAILANDFEEELKKRLSEEELREAAKLFYKISVVKEAL-IAAEAGVTAMHDATEGGILGALWEVAEASGVGLR 237 (298) T ss_pred -CCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCHHHHHH-HHHHCCCEEEEECCCCHHHHHHHHHHHHCCCEEE T ss_conf -84076899999870665523467588998787540222399999-7510575598716755277789999974697499 Q ss_pred EEEHHHCCCCC--------CCCHHHHHHCCCCCEEEEEEECCCHHHHHHHHHHHCCEEECCCCE Q ss_conf 96202204667--------899989861788850899995467588999888709713112212 Q gi|254780970|r 289 LDLDKVPTCEE--------GMTAYEMMLSESQERMLMILNPEKQHKAQEILNKWGLHFSIIGIT 344 (737) Q Consensus 289 i~l~~vp~~~~--------~l~~~ei~~sEsQeR~~~~v~~~~~~~~~~i~~~~~~~~~~iG~v 344 (737) |+++++|+.++ +++||++.. +++|+++|+|++.+++.++++++++++++||+| T Consensus 238 I~~~~iPi~~~~~~~~~~~~~dp~~~~~---~G~ll~~v~~~~~~~v~~~l~~~g~~a~vIGeV 298 (298) T cd06061 238 IEKDKIPIRQETKEICEALGIDPLRLIS---SGTLLITVPPEKGDELVDALEEAGIPASVIGKI 298 (298) T ss_pred EEECCCCCCHHHHHHHHHCCCCHHHHHH---CCEEEEEECHHHHHHHHHHHHHCCCCEEEEEEC T ss_conf 9775468887999999974978888970---887999998899999999999649796999869 No 21 >cd02192 PurM-like3 AIR synthase (PurM) related protein, subgroup 3 of unknown function. The family of PurM related proteins includes Hydrogen expression/formation protein HypE, AIR synthases, FGAM synthase and Selenophosphate synthetase (SelD). They all contain two conserved domains and seem to dimerize. The N-terminal domain forms the dimer interface and is a putative ATP binding domain. Probab=100.00 E-value=2.9e-37 Score=274.40 Aligned_cols=252 Identities=23% Similarity=0.237 Sum_probs=197.7 Q ss_pred CCCCCCCCCCCCEEEECCCCCCEEEEECC-CCCCCHHCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHCCCCCCCCCCHH Q ss_conf 11000013576314203777622797304-53210000100113567897776410278510223221024566686213 Q gi|254780970|r 427 SNSIQLPGGDAGVIRVEGHETKALAFSSD-VTPRYVKADPFEGTKQAVAECWRNIIATGAKPLAITDNLNFGNPEKEEIM 505 (737) Q Consensus 427 g~tv~~Pg~DaaVv~~~~~~~~gia~s~g-~~p~~~~~dP~~ga~~AV~Eal~Nl~a~Ga~Plait~~lnf~~P~~~e~~ 505 (737) -+..+.||+||||+++.. +. +++++| ..|++...|||.+|+.+|+.+++||+||||+|+++++++.++++ +. T Consensus 28 ~~V~~giGDDaAVi~~~~--~~-~v~~td~~~~~~~~~dP~~~G~~ava~nlsDIaAMGa~P~~~l~~l~~p~~---e~- 100 (283) T cd02192 28 LGVAADLGDDAAAIPDGD--GY-LLLAADGIWPSLVEADPWWAGYCSVLVNVSDIAAMGGRPLAMVDALWSPSA---EA- 100 (283) T ss_pred CCCCCCCCCCEEEEEECC--CE-EEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHCCCCH---HH- T ss_conf 885757997658997089--74-999978546887667979999999998888898707767988644258988---99- Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCCEEECCCCCCCCCHH Q ss_conf 46999999998522102886654432001123686667631112222333520233102143898075137766554326 Q gi|254780970|r 506 GQFVHSVKGIREACQILDFPIVSGNVSFYNETNGQSIFPTPTIAGVGILPDYSLMTRIDSAHEGDLILMIGNDGCHLDCS 585 (737) Q Consensus 506 ~ql~~av~gl~d~c~~lgipivgGkvSl~n~t~~~~I~pTpvi~~vG~v~d~~~~it~~~k~~Gd~I~liG~~~~~LggS 585 (737) +.+.++|+.++|+++|+|++||+.++ .+.+++..++++|.++. ..++...+++||.|++++.......++ T Consensus 101 --~~~i~~Gi~~~~~~~gv~lVGGdT~~------~~~~~~i~vt~~G~~~~--~~~~~~gAk~GD~i~vt~~l~g~~~~~ 170 (283) T cd02192 101 --AAQVLEGMRDAAEKFGVPIVGGHTHP------DSPYNALSVAILGRARK--DLLISFGAKPGDRLILAIDLDGRVHPS 170 (283) T ss_pred --HHHHHHHHHHHHHHCCCCEECCCCCC------CCCCCEEEEEEEEECCC--CCEECCCCCCCCEEEEEEECCCCCCCC T ss_conf --99999999999987598576554456------78887368999974077--743227899999899998667530534 Q ss_pred HHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHCCCCCEEEEECCCC---CCCHH Q ss_conf 788886116789996479899999999999999959806994648657999999998458970899805666---89878 Q gi|254780970|r 586 MYSLECASSNIGPPPKVDCHLEKNHGFFVLSMINAKKITACHDISTGGLIITLAEMTISSAKGMDIILPIEI---EKDPK 662 (737) Q Consensus 586 ~~~~~~~~~~~g~~p~~d~~~~k~~~~~v~~li~~g~I~S~HDiSdGGL~~aL~EMa~as~~G~~I~L~~~~---~~d~~ 662 (737) . ..........+....+.....+.++..+++++||||+|+|||+.+|.|||.+|++|++|+++..+ ..++. T Consensus 171 ~------~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~A~~DiSdgGL~~~l~eia~aSgvGa~I~~~~iP~~~~~~~~ 244 (283) T cd02192 171 P------PPNWDATTMKSPALLRRQIALLPELAERGLVHAAKDISNPGIIGTLGMLLEASGVGAEIDLDAIPRPEGVDLE 244 (283) T ss_pred C------CCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCHH T ss_conf 4------2342310124677777688999999986478888621311278899999997698099970238788999989 Q ss_pred HHHHCCCCCCEEEEECHHHHHHHHHHHHHCCCCEEEEEE Q ss_conf 887298832189999688099899999857984999899 Q gi|254780970|r 663 PFLFGEDQGRYVVCISPENQDLVMSEANNKNIPLRYLGK 701 (737) Q Consensus 663 ~~LFsEs~Gr~ii~V~~~~~~~~~~~~~~~gi~~~~IG~ 701 (737) .+|+++++++++++|+|++.+++++++++.|+++++||| T Consensus 245 ~~L~~~~~~~ll~tv~pe~~~~v~~~~~~~gv~~~~IGE 283 (283) T cd02192 245 RWLKCFPGFGFLLTARPENADEVVAVFAAVGITAAVIGE 283 (283) T ss_pred HHHHCCCCCCEEEEECHHHHHHHHHHHHHCCCCEEEECC T ss_conf 998719986159998851399999999986998898395 No 22 >COG0046 PurL Phosphoribosylformylglycinamidine (FGAM) synthase, synthetase domain [Nucleotide transport and metabolism] Probab=100.00 E-value=8.9e-37 Score=271.09 Aligned_cols=302 Identities=20% Similarity=0.371 Sum_probs=244.0 Q ss_pred HHHHHHHCCCCC-CCCCCCCCCCCCCEEEECCCCCCEEEEECCCCCCCHHCCCHHHHHHHHHHHHHHHHCCCCCHHHHHH Q ss_conf 035653112100-0110000135763142037776227973045321000010011356789777641027851022322 Q gi|254780970|r 414 RRWVYEQYDTMI-QSNSIQLPGGDAGVIRVEGHETKALAFSSDVTPRYVKADPFEGTKQAVAECWRNIIATGAKPLAITD 492 (737) Q Consensus 414 K~~i~~qyD~~V-~g~tv~~Pg~DaaVv~~~~~~~~gia~s~g~~p~~~~~dP~~ga~~AV~Eal~Nl~a~Ga~Plait~ 492 (737) |.| .++|..+. +-.+++.|+.||+|++++. ...+++.+....+-+.++||.||+..|...+||+.+|||+|+++.+ T Consensus 63 k~~-lk~~~~t~~~~~vi~g~gdnAgvvdi~d--~~~v~fKvESHNHPSaIePy~GAATGvGGiIRDv~smGArPiA~l~ 139 (743) T COG0046 63 KSL-LKMFPTTHTGEYVLSGPGDNAGVVDIGD--GWAVVFKVESHNHPSAIEPYQGAATGVGGIIRDVLSMGARPIALLD 139 (743) T ss_pred HHH-HHHCCCCCCCCEEEECCCCCCCEEEECC--CEEEEEEECCCCCCCCCCCCCCCCCCCCCEEECCCCCCCEEEEEEC T ss_conf 999-9619987888717875267763588679--6799998546789886688797304657656224346760001010 Q ss_pred HCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCCE Q ss_conf 10245666862134699999999852210288665443200112368666763111222233352023310214389807 Q gi|254780970|r 493 NLNFGNPEKEEIMGQFVHSVKGIREACQILDFPIVSGNVSFYNETNGQSIFPTPTIAGVGILPDYSLMTRIDSAHEGDLI 572 (737) Q Consensus 493 ~lnf~~P~~~e~~~ql~~av~gl~d~c~~lgipivgGkvSl~n~t~~~~I~pTpvi~~vG~v~d~~~~it~~~k~~Gd~I 572 (737) .|-|+.|+.++..+-+.+.+.|++++-+.+|+|.++|-.-|+..+.+.++ ....++|.++. .++.+...+.+|+.| T Consensus 140 ~l~fG~~~~~~~~~i~~gvv~Gia~YGN~iGvPtv~Ge~~fd~~y~~npL---vna~~vG~i~~-~~i~~~~a~~~G~~l 215 (743) T COG0046 140 SLRFGIPDIEKTRYIMKGVVAGIAAYGNEIGVPTVGGEFRFDESYVGNPL---VNAGCVGLIRK-EHIVKGEAIGPGDKL 215 (743) T ss_pred CCCCCCCCCCHHHHHHHHCCCCHHHCCCCCCCCCCCEEEEECCCCCCCCE---EEEEEEECCCH-HHEEECCCCCCCCEE T ss_conf 21247999730567874006403323763465544248997444378960---78788620114-441104678999879 Q ss_pred EECCCC--CCCCCHHHHHHHHHC-----CCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHCC Q ss_conf 513776--655432678888611-----6789996479899999999999999959806994648657999999998458 Q gi|254780970|r 573 LMIGND--GCHLDCSMYSLECAS-----SNIGPPPKVDCHLEKNHGFFVLSMINAKKITACHDISTGGLIITLAEMTISS 645 (737) Q Consensus 573 ~liG~~--~~~LggS~~~~~~~~-----~~~g~~p~~d~~~~k~~~~~v~~li~~g~I~S~HDiSdGGL~~aL~EMa~as 645 (737) +++|.. .+.+||..++-.-.. .....+..-|+..+|.++++++++++.++|.++||+..|||..++.||+..+ T Consensus 216 i~~Gg~TgrDGigGat~aS~~~~~~~~e~dr~aVQvGdPf~Ek~l~ea~le~~~~~~I~~i~DlGAgGLs~A~~Ela~~g 295 (743) T COG0046 216 ILLGGKTGRDGIGGATFASMELGEESEEEDRPSVQVGDPFMEKRLQEAILECVQTGLIKGIQDLGAGGLSCAISELAAKG 295 (743) T ss_pred EEECCCCCCCCCCCHHHHHHHHCCCHHHHCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHHCC T ss_conf 99747756566573323104427451553477501278588899999999987319768874168731322389997336 Q ss_pred CCCEEEEECCC----CCCCHHHHHHCCCCCCEEEEECHHHHHHHHHHHHHCCCCEEEEEEECCCE-EEE--EC--CEECC Q ss_conf 97089980566----68987888729883218999968809989999985798499989987986-898--07--41214 Q gi|254780970|r 646 AKGMDIILPIE----IEKDPKPFLFGEDQGRYVVCISPENQDLVMSEANNKNIPLRYLGKVSGST-LSI--HN--ILNIP 716 (737) Q Consensus 646 ~~G~~I~L~~~----~~~d~~~~LFsEs~Gr~ii~V~~~~~~~~~~~~~~~gi~~~~IG~Vt~~~-l~i--~~--~~~~~ 716 (737) +.|++++|+.. .+..|++.|.||||.|+++.|.|+++++|.++|++.++++.+||+||.+. +.+ ++ +++++ T Consensus 296 g~G~~i~Ld~VP~rE~gMsp~Ei~~SESQERMllvv~p~~~e~~~~i~~k~~l~~aVVG~vT~~~~~~~~~~ge~v~dlP 375 (743) T COG0046 296 GLGAEIDLDKVPLREPGMSPYEIWLSESQERMLLVVAPEDVEEFLEICEKERLPAAVVGEVTDEPRLVVDWKGEPVVDLP 375 (743) T ss_pred CCEEEEEHHCCCCCCCCCCHHHHHHHCCCHHEEEEECCCCHHHHHHHHHHCCCCEEEEEEEECCCEEEEEECCCEEEECC T ss_conf 97069980008346799988998873110105899851349999999997189807999994376599997896788665 Q ss_pred HHHHHH Q ss_conf 999999 Q gi|254780970|r 717 VATLQT 722 (737) Q Consensus 717 i~~l~~ 722 (737) ++-|.+ T Consensus 376 ~~~L~~ 381 (743) T COG0046 376 IDVLAG 381 (743) T ss_pred HHHHCC T ss_conf 799768 No 23 >cd00396 PurM-like AIR (aminoimidazole ribonucleotide) synthase related protein. This family includes Hydrogen expression/formation protein HypE, AIR synthases, FGAM (formylglycinamidine ribonucleotide) synthase and Selenophosphate synthetase (SelD). The N-terminal domain of AIR synthase forms the dimer interface of the protein, and is suggested as a putative ATP binding domain. Probab=100.00 E-value=8.5e-39 Score=284.94 Aligned_cols=215 Identities=29% Similarity=0.474 Sum_probs=131.1 Q ss_pred EEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 99995178877765888872105581022576778743687750457875686388899888745886654308721132 Q gi|254780970|r 86 VVFKMESHNHPSYIEPYQGAATGVGGILRDIFTMGARPVAAMNSLRFGAIHHPKTKHLLSGVVAGIAGYSNSFGVPTVGG 165 (737) Q Consensus 86 ~~~k~EtHNhPs~i~P~~GAaTg~gG~iRDi~~~Ga~p~a~~~~l~~g~~~~p~~~~~~~~~~~G~~~ygN~~G~P~v~G 165 (737) +++|+|+||||+.++||.++.+++.+.+||+.+|||+|++++++|.|++...+ ..+.++++|++++++++|+|++|| T Consensus 2 l~~~tDg~~~~~~~dP~~~G~~av~~~~sDi~a~Ga~P~~~l~~l~~~~~~~~---~~l~~i~~Gi~~~~~~~gv~ivGG 78 (222) T cd00396 2 LAMSTDGINPPLAINPWAGGRLAVGGAVNDIAAMGARPIALLASLSLSNGLEV---DILEDVVDGVAEACNQLGVPIVGG 78 (222) T ss_pred EEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCEEHHHHHHHHHCCCCCH---HHHHHHHHHHHHHHHHHCCCEEEE T ss_conf 59996688994001888899999999988898759985999998883489987---899999999999999969826813 Q ss_pred EEEEECCCCC-CCCCCEEEECCCCHHHHHHCCCCCCCCEEEEECCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCHHH Q ss_conf 0355215676-642211221121135543112256897799965765644410466667772865333444332679899 Q gi|254780970|r 166 EVEFLPCYNN-NIIVNTFAAGIAKTNAIFSSKARGIGLPLVYLGAKTGRDGIGGASMASEEFGENIAKKRPTVQVGDPFT 244 (737) Q Consensus 166 ~~~~~~~y~~-~~~v~~~~~Gi~~~~~~~~~~~~~~Gd~ii~~G~~tg~dGi~Ga~~sS~~~~~~~~~~~~~VQ~gdp~~ 244 (737) +|.+.+.+.. ++.+.++++|+++.+++.+....++||.|++.| T Consensus 79 ~t~~~~~~~~~~~~~~~~~~G~~~~~~~~~~~~a~~GD~ii~~g------------------------------------ 122 (222) T cd00396 79 HTSVSPGTMGHKLSLAVFAIGVVEKDRVIDSSGARPGDVLILTG------------------------------------ 122 (222) T ss_pred EEEECCCCCCCCEEEEEEEEEEECCCCEECCCCCCCCCEEEEEC------------------------------------ T ss_conf 58975888754617999999875565400656799999999989------------------------------------ Q ss_pred HHHHHHHHHHHCCCCCCHHCCCCCCCCHHHHHHHHHCCCCCCEEEEEHHHCCCCC-----CCCHHHHHHCCCCCEEEEEE Q ss_conf 9888999874011383011034775201112233201477528996202204667-----89998986178885089999 Q gi|254780970|r 245 GKCLLEACLELMNTDAVIAIQDMGAAGLTCSAIEMGNQGNLGITLDLDKVPTCEE-----GMTAYEMMLSESQERMLMIL 319 (737) Q Consensus 245 ek~~~~~~~~~~~~~~i~~~~D~gaGGl~~a~~Ema~~~~~G~~i~l~~vp~~~~-----~l~~~ei~~sEsQeR~~~~v 319 (737) .+.++++.+.+.|++|||+++|||+.++.||+.++++|++|+++++|+.+. ...+++.+.+++|.||+++| T Consensus 123 ----~~~~~~l~~~~~v~a~~DiT~GGL~~~l~e~a~~s~~g~~i~~~~ip~~~~~~~~~~~~~~~~~~~~~~g~lli~v 198 (222) T cd00396 123 ----VDAVLELVAAGDVHAMHDITDGGLLGTLPELAQASGVGAEIDLEAIPLDEVVRWLCVEHIEEALLFNSSGGLLIAV 198 (222) T ss_pred ----HHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHHCCCEEEEEHHHCCCCHHHHHHHHHCHHHHHHHCCCCEEEEEE T ss_conf ----4999987645885699867987499999999987697699977657988789988550359997522548699999 Q ss_pred ECCCHHHHHHHHHHHCCEEECCCC Q ss_conf 546758899988870971311221 Q gi|254780970|r 320 NPEKQHKAQEILNKWGLHFSIIGI 343 (737) Q Consensus 320 ~~~~~~~~~~i~~~~~~~~~~iG~ 343 (737) +|++.++++++++++++++++||+ T Consensus 199 ~~e~~~~v~~~l~~~gi~a~vIGr 222 (222) T cd00396 199 PAEEADAVLLLLNGNGIDAAVIGR 222 (222) T ss_pred CHHHHHHHHHHHHHCCCCEEECCC T ss_conf 789999999999966999797192 No 24 >PRK05731 thiamine monophosphate kinase; Provisional Probab=100.00 E-value=6.4e-37 Score=272.07 Aligned_cols=277 Identities=21% Similarity=0.295 Sum_probs=198.0 Q ss_pred CCCEEECCCCCEEEEEECCCEEEEEEEC-----CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCC Q ss_conf 9748993347736898369839999951-----78877765888872105581022576778743687750457875686 Q gi|254780970|r 64 GKHVIQGPGENAGVVDIGGGDCVVFKME-----SHNHPSYIEPYQGAATGVGGILRDIFTMGARPVAAMNSLRFGAIHHP 138 (737) Q Consensus 64 ~~~vi~~~~~~a~vi~~~~~~~~~~k~E-----tHNhPs~i~P~~GAaTg~gG~iRDi~~~Ga~p~a~~~~l~~g~~~~p 138 (737) .++++.++||||+|+++.++..++++.. +|.+|...+||..+-..+...++||.+|||+|++++.+|.++ ... T Consensus 18 ~~~~~~giGDDaAvi~~~~~~~lv~ttD~~ve~~HF~~~~~~p~~iG~~av~~n~SDiaamGa~P~~~~~sl~lP--~~~ 95 (320) T PRK05731 18 RADVELGIGDDAALLDPPPGQRLVVSTDTLVEGVHFRPDWSSPEDLGYKALAVNLSDLAAMGARPAAFLLALALP--KDV 95 (320) T ss_pred CCCCCCCCCCCEEEEEECCCCEEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCC--CCC T ss_conf 988646898777999708998799995565657677878899999999999987888997189722788850268--864 Q ss_pred CHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEECCCCCCCCCCEEEECCCCHHHHHHCCCCCCCCEEEEECCCCCCCCCHH Q ss_conf 38889988874588665430872113203552156766422112211211355431122568977999657656444104 Q gi|254780970|r 139 KTKHLLSGVVAGIAGYSNSFGVPTVGGEVEFLPCYNNNIIVNTFAAGIAKTNAIFSSKARGIGLPLVYLGAKTGRDGIGG 218 (737) Q Consensus 139 ~~~~~~~~~~~G~~~ygN~~G~P~v~G~~~~~~~y~~~~~v~~~~~Gi~~~~~~~~~~~~~~Gd~ii~~G~~tg~dGi~G 218 (737) ...| +.++.+|++++++++|+|+|||+|... ..+.+.+.++|.++.+++++....+|||.|++. |..|..+ T Consensus 96 ~~~~-l~~i~~Gi~~~~~~~gv~lvGGdt~~~----~~~~i~vt~iG~~~~~~~l~r~ga~~GD~I~vT----G~lG~s~ 166 (320) T PRK05731 96 DEAW-LEALADGLFALADRYGAELIGGDTTRG----PDLSISVTAIGEVPGGRALRRSGAKPGDLVAVT----GTLGDSA 166 (320) T ss_pred HHHH-HHHHHHHHHHHHHHHCCEEEECCCCCC----CCCEEEEEEEEECCCCCEEECCCCCCCCEEEEC----CCCCHHH T ss_conf 3999-999999999999983956960431258----871799999961179965623678989878972----7742778 Q ss_pred HHHHHHHHCCCCCCCCCCCCCCCHHHHHHH-----HHHHHHHCCCCCCHHCCCCCCCCHHHHHHHHHCCCCCCEEEEEHH Q ss_conf 666677728653334443326798999888-----999874011383011034775201112233201477528996202 Q gi|254780970|r 219 ASMASEEFGENIAKKRPTVQVGDPFTGKCL-----LEACLELMNTDAVIAIQDMGAAGLTCSAIEMGNQGNLGITLDLDK 293 (737) Q Consensus 219 a~~sS~~~~~~~~~~~~~VQ~gdp~~ek~~-----~~~~~~~~~~~~i~~~~D~gaGGl~~a~~Ema~~~~~G~~i~l~~ 293 (737) +.++....+..... .+.-++..++.+ .+....+ .+.+++|||+++ ||..++.||+.+|++|++|++++ T Consensus 167 ~gl~~l~~~~~~~~----~~~~~~~~~~~~~p~~~~~~~~~l--~~~~~a~~DiSd-GL~~~L~eia~~Sgvg~~I~~~~ 239 (320) T PRK05731 167 AGLALLLNGGQASA----NEDAAALISRHLRPQPRVGLGQAL--RGLASAAMDISD-GLAADLGHIAEASGVGADIDADA 239 (320) T ss_pred HHHHHHHCCCCCCC----CHHHHHHHHHHHCCCCCHHHHHHH--HHHHHHHCCCCH-HHHHHHHHHHHHCCCCEEEEHHH T ss_conf 89999847877784----055799999962898438999997--765555320416-79999999999869939985887 Q ss_pred HCCCCC----CCCHHHHHHC--CCCCEEEEEEECCCHHHHHHHHHHHCCEEECCCCEEECCCCCCCCCCCCC Q ss_conf 204667----8999898617--88850899995467588999888709713112212104431000258644 Q gi|254780970|r 294 VPTCEE----GMTAYEMMLS--ESQERMLMILNPEKQHKAQEILNKWGLHFSIIGITTNDKLFRVIHRGEEV 359 (737) Q Consensus 294 vp~~~~----~l~~~ei~~s--EsQeR~~~~v~~~~~~~~~~i~~~~~~~~~~iG~vt~~~~~~v~~~g~~v 359 (737) +|+++. +.+|+++.++ |.+| |+++|+|++.++++++++++++++++||+||+++.+.+..+++.+ T Consensus 240 iP~~~~~~~~~~~~~~~~l~gGedye-Ll~tv~~~~~~~~~~~~~~~gi~~~~IG~v~~~~gv~l~~~g~~~ 310 (320) T PRK05731 240 LPVSAALAALGEDALRWALSGGEDYE-LLFTVPPENRGALLAAAKKLGVGVTIIGRVAGGEGVLLDRDGKPV 310 (320) T ss_pred CCCCHHHHHHHHHHHHHHHCCCCCEE-EEEEECHHHHHHHHHHHHHCCCCEEEEEEEECCCCEEEEECCCEE T ss_conf 89875665535549998316787336-999988999999999999759896999999848847999399886 No 25 >cd00396 PurM-like AIR (aminoimidazole ribonucleotide) synthase related protein. This family includes Hydrogen expression/formation protein HypE, AIR synthases, FGAM (formylglycinamidine ribonucleotide) synthase and Selenophosphate synthetase (SelD). The N-terminal domain of AIR synthase forms the dimer interface of the protein, and is suggested as a putative ATP binding domain. Probab=100.00 E-value=1.5e-35 Score=262.72 Aligned_cols=213 Identities=28% Similarity=0.433 Sum_probs=182.6 Q ss_pred EEEEECCCCCCCHHCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 27973045321000010011356789777641027851022322102456668621346999999998522102886654 Q gi|254780970|r 449 ALAFSSDVTPRYVKADPFEGTKQAVAECWRNIIATGAKPLAITDNLNFGNPEKEEIMGQFVHSVKGIREACQILDFPIVS 528 (737) Q Consensus 449 gia~s~g~~p~~~~~dP~~ga~~AV~Eal~Nl~a~Ga~Plait~~lnf~~P~~~e~~~ql~~av~gl~d~c~~lgipivg 528 (737) ++++|+|...+++.+|||.+++.||+++++|++||||+|++++++++|+++.+++.+ .+.++|+.++|+++|+|++| T Consensus 1 ~l~~~tDg~~~~~~~dP~~~G~~av~~~~sDi~a~Ga~P~~~l~~l~~~~~~~~~~l---~~i~~Gi~~~~~~~gv~ivG 77 (222) T cd00396 1 SLAMSTDGINPPLAINPWAGGRLAVGGAVNDIAAMGARPIALLASLSLSNGLEVDIL---EDVVDGVAEACNQLGVPIVG 77 (222) T ss_pred CEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCEEHHHHHHHHHCCCCCHHHH---HHHHHHHHHHHHHHCCCEEE T ss_conf 959996688994001888899999999988898759985999998883489987899---99999999999996982681 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCCEEECCCCCCCCCHHHHHHHHHCCCCCCCCCCCHHHHH Q ss_conf 43200112368666763111222233352023310214389807513776655432678888611678999647989999 Q gi|254780970|r 529 GNVSFYNETNGQSIFPTPTIAGVGILPDYSLMTRIDSAHEGDLILMIGNDGCHLDCSMYSLECASSNIGPPPKVDCHLEK 608 (737) Q Consensus 529 GkvSl~n~t~~~~I~pTpvi~~vG~v~d~~~~it~~~k~~Gd~I~liG~~~~~LggS~~~~~~~~~~~g~~p~~d~~~~k 608 (737) |+.+.+..... .++++.++++|.++. .+.++...+++||.|+++| T Consensus 78 G~t~~~~~~~~--~~~~~~~~~~G~~~~-~~~~~~~~a~~GD~ii~~g-------------------------------- 122 (222) T cd00396 78 GHTSVSPGTMG--HKLSLAVFAIGVVEK-DRVIDSSGARPGDVLILTG-------------------------------- 122 (222) T ss_pred EEEEECCCCCC--CCEEEEEEEEEEECC-CCEECCCCCCCCCEEEEEC-------------------------------- T ss_conf 35897588875--461799999987556-5400656799999999989-------------------------------- Q ss_pred HHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHCCCCCEEEEECCCCCC---------CHHHHHHCCCCCCEEEEECH Q ss_conf 999999999995980699464865799999999845897089980566689---------87888729883218999968 Q gi|254780970|r 609 NHGFFVLSMINAKKITACHDISTGGLIITLAEMTISSAKGMDIILPIEIEK---------DPKPFLFGEDQGRYVVCISP 679 (737) Q Consensus 609 ~~~~~v~~li~~g~I~S~HDiSdGGL~~aL~EMa~as~~G~~I~L~~~~~~---------d~~~~LFsEs~Gr~ii~V~~ 679 (737) ...+.++++.+.|+|+||+|+|||+.+|.||+.+|++|++|+++..+-. .....++.++.|+++++|++ T Consensus 123 --~~~~~~l~~~~~v~a~~DiT~GGL~~~l~e~a~~s~~g~~i~~~~ip~~~~~~~~~~~~~~~~~~~~~~g~lli~v~~ 200 (222) T cd00396 123 --VDAVLELVAAGDVHAMHDITDGGLLGTLPELAQASGVGAEIDLEAIPLDEVVRWLCVEHIEEALLFNSSGGLLIAVPA 200 (222) T ss_pred --HHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHHCCCEEEEEHHHCCCCHHHHHHHHHCHHHHHHHCCCCEEEEEECH T ss_conf --499998764588569986798749999999998769769997765798878998855035999752254869999978 Q ss_pred HHHHHHHHHHHHCCCCEEEEEE Q ss_conf 8099899999857984999899 Q gi|254780970|r 680 ENQDLVMSEANNKNIPLRYLGK 701 (737) Q Consensus 680 ~~~~~~~~~~~~~gi~~~~IG~ 701 (737) ++.+++++.++++|+++++||+ T Consensus 201 e~~~~v~~~l~~~gi~a~vIGr 222 (222) T cd00396 201 EEADAVLLLLNGNGIDAAVIGR 222 (222) T ss_pred HHHHHHHHHHHHCCCCEEECCC T ss_conf 9999999999966999797192 No 26 >cd02197 HypE HypE (Hydrogenase expression/formation protein). HypE is involved in Ni-Fe hydrogenase biosynthesis. HypE dehydrates its own carbamoyl moiety in an ATP-dependent process to yield the enzyme thiocyanate. The N-terminal domain of HypE is related to the ATP-binding domains of the AIR synthases, selenophosphate synthetase (SelD), and FGAM synthase and is thought to bind ATP. Probab=100.00 E-value=4.5e-35 Score=259.42 Aligned_cols=271 Identities=24% Similarity=0.359 Sum_probs=200.1 Q ss_pred CHHHHHH-HHHCCCCEEECCCCCEEEEEECCCEEEEEEECCCCCCCCCCC--CCCCCCC---CCCCCCCCCCCCCEEEEE Q ss_conf 8899975-120697489933477368983698399999517887776588--8872105---581022576778743687 Q gi|254780970|r 53 SKKWLRT-LPTTGKHVIQGPGENAGVVDIGGGDCVVFKMESHNHPSYIEP--YQGAATG---VGGILRDIFTMGARPVAA 126 (737) Q Consensus 53 ~~~~l~~-~~~~~~~vi~~~~~~a~vi~~~~~~~~~~k~EtHNhPs~i~P--~~GAaTg---~gG~iRDi~~~Ga~p~a~ 126 (737) ++.+++. |...-+..+.+.||||+++++++++ ++++..+. -++| |++..-| +.+.++||.+|||+|+++ T Consensus 6 ~~~~i~~~~~~~~~~~~lgigdDaAvi~~~~~~-lvvttD~~----v~~p~~f~~~diG~~Av~~~~sDIaamGa~P~~~ 80 (293) T cd02197 6 MQELIEELFLKAFDNPILEVLEDAAALLVGGGR-LAFTTDSF----VVSPLFFPGGDIGKLAVCGTVNDLAMMGAKPLYL 80 (293) T ss_pred HHHHHHHHHHHHCCCCCCCCCCCEEEECCCCCE-EEEEECCC----CCCCCCCCCCCHHHHHHHCCHHHHHHCCCCCHHH T ss_conf 999999998863698667888876986169984-99993654----2277528984473787521287898659975676 Q ss_pred EEECCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEECC-CCCCCCCCEEEECCCCHHHHHHCCCCCCCCEEE Q ss_conf 7504578756863888998887458866543087211320355215-676642211221121135543112256897799 Q gi|254780970|r 127 MNSLRFGAIHHPKTKHLLSGVVAGIAGYSNSFGVPTVGGEVEFLPC-YNNNIIVNTFAAGIAKTNAIFSSKARGIGLPLV 205 (737) Q Consensus 127 ~~~l~~g~~~~p~~~~~~~~~~~G~~~ygN~~G~P~v~G~~~~~~~-y~~~~~v~~~~~Gi~~~~~~~~~~~~~~Gd~ii 205 (737) +.+|.++. .. +...+.++.+|+.++++++|+|+|||+|.+.+. ....+.+++.++|.++.+.+++....+|||.|+ T Consensus 81 l~sl~lp~-~~--~~~~l~~i~~Gi~~~~~~~gv~iVGGdT~v~~~~~~~~l~is~t~iG~v~~~~~~~~~gA~pGD~ii 157 (293) T cd02197 81 SLGFILEE-GF--PLEDLERIVKSMAEAAREAGVKIVTGDTKVVPKGKADGIFINTTGIGVIPRGVIISPSNIRPGDKII 157 (293) T ss_pred EEEEECCC-CC--CHHHHHHHHHHHHHHHHHHCCEEEECCEEECCCCCCCCEEEEEEEEEEECCCCCCCCCCCCCCCEEE T ss_conf 37565589-99--9999999999999999971976861662751478778559986788985688550357899999999 Q ss_pred EECCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCHHCCCCCCCCHHHHHHHHHCCCCC Q ss_conf 96576564441046666777286533344433267989998889998740113830110347752011122332014775 Q gi|254780970|r 206 YLGAKTGRDGIGGASMASEEFGENIAKKRPTVQVGDPFTGKCLLEACLELMNTDAVIAIQDMGAAGLTCSAIEMGNQGNL 285 (737) Q Consensus 206 ~~G~~tg~dGi~Ga~~sS~~~~~~~~~~~~~VQ~gdp~~ek~~~~~~~~~~~~~~i~~~~D~gaGGl~~a~~Ema~~~~~ 285 (737) +.| ..|.+|++..+.......+.+. ..|....+++.+..++. ...+++|||++.|||+.++.|||.+||+ T Consensus 158 vTg----~lG~~g~~il~~r~~~~~~~~~----~~d~~~~~~~~~~~l~~--~~~v~Am~DiT~gGL~~~L~eia~aSgv 227 (293) T cd02197 158 VSG----TIGDHGAAILAAREGLGFETDI----ESDCAPLNGLVEALLEA--GPGIHAMRDPTRGGLAAVLNEIARASGV 227 (293) T ss_pred EEC----CHHHHHHHHHHHHCCCCCCCHH----HHHHHHHHHHHHHHHHC--CCCCEEECCCCCCHHHHHHHHHHHHCCC T ss_conf 968----6078899999852466644014----66666668999999970--7874474167975599999999998599 Q ss_pred CEEEEEHHHCCCCC--------CCCHHHHHHCCCCCEEEEEEECCCHHHHHHHHHHH--CCEEECCCCE Q ss_conf 28996202204667--------89998986178885089999546758899988870--9713112212 Q gi|254780970|r 286 GITLDLDKVPTCEE--------GMTAYEMMLSESQERMLMILNPEKQHKAQEILNKW--GLHFSIIGIT 344 (737) Q Consensus 286 G~~i~l~~vp~~~~--------~l~~~ei~~sEsQeR~~~~v~~~~~~~~~~i~~~~--~~~~~~iG~v 344 (737) |++|+.+++|++++ +++||++. |+.+|+++|+|++.+++++.++++ ++++++|||| T Consensus 228 g~~I~~~~iPv~~~v~~~~e~lg~DPl~~a---s~G~ll~~v~~e~a~~vl~~l~~~~~G~~A~iIGeV 293 (293) T cd02197 228 GIEIEEEAIPVREEVRGACEMLGLDPLYLA---NEGKFVAIVPPEDAEEVLEALRSHPLGKEAAIIGEV 293 (293) T ss_pred EEEEEECCCCCCHHHHHHHHHHCCCHHHHH---CCCEEEEEECHHHHHHHHHHHHHCCCCCCEEEEEEC T ss_conf 399940368888899999998593987883---485399998789999999999728889981898669 No 27 >cd02194 ThiL ThiL (Thiamine-monophosphate kinase) plays a dual role in de novo biosynthesis and in salvage of exogenous thiamine. Thiamine salvage occurs in two steps, with thiamine kinase catalyzing the formation of thiamine phosphate, and ThiL catalyzing the conversion of this intermediate to thiamine pyrophosphate. The N-terminal domain of ThiL binds ATP and is related to the ATP-binding domains of hydrogen expression/formation protein HypE, the AIR synthases, FGAM synthase and selenophosphate synthetase (SelD). Probab=100.00 E-value=3.1e-35 Score=260.50 Aligned_cols=261 Identities=26% Similarity=0.365 Sum_probs=193.3 Q ss_pred CCCCEEECCCCCEEEEEECCCEEEEEE---ECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCC Q ss_conf 697489933477368983698399999---51788777658888721055810225767787436877504578756863 Q gi|254780970|r 63 TGKHVIQGPGENAGVVDIGGGDCVVFK---MESHNHPSYIEPYQGAATGVGGILRDIFTMGARPVAAMNSLRFGAIHHPK 139 (737) Q Consensus 63 ~~~~vi~~~~~~a~vi~~~~~~~~~~k---~EtHNhPs~i~P~~GAaTg~gG~iRDi~~~Ga~p~a~~~~l~~g~~~~p~ 139 (737) ..++++.++||||++++.++++.++-+ +|+++||...+||..+...+.+.++||.+|||+|++++.+|.++. ... T Consensus 15 ~~~~~~~g~GDDaavi~~~~~~lv~ttD~~~e~~hf~~~~~p~~iG~~av~~nlSDIaamGa~P~~~~~sl~lp~--~~~ 92 (291) T cd02194 15 AGPGVLLGIGDDAAVLKPPGGRLVVTTDTLVEGVHFPPDTTPEDIGWKALAVNLSDLAAMGARPLGFLLSLGLPP--DTD 92 (291) T ss_pred CCCCCCCCCCCCEEEEEECCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEECCC--CCC T ss_conf 999853589888799960997699996665466668999998999999999888889873896123577650689--998 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCEEEEECCCCCCCCCCEEEECCCCHHHHHHCCCCCCCCEEEEECCCCCCCCCHHH Q ss_conf 88899888745886654308721132035521567664221122112113554311225689779996576564441046 Q gi|254780970|r 140 TKHLLSGVVAGIAGYSNSFGVPTVGGEVEFLPCYNNNIIVNTFAAGIAKTNAIFSSKARGIGLPLVYLGAKTGRDGIGGA 219 (737) Q Consensus 140 ~~~~~~~~~~G~~~ygN~~G~P~v~G~~~~~~~y~~~~~v~~~~~Gi~~~~~~~~~~~~~~Gd~ii~~G~~tg~dGi~Ga 219 (737) ..| +.++++|++++++++|+|+|||+|... ..+.+.+.++|.++.+.+++....+|||.|++.| ..|.+++ T Consensus 93 ~~~-l~~i~~Gi~~~~~~~gv~ivGGdt~~~----~~~~i~vt~iG~~~~~~~i~r~~ak~GD~i~vtg----~~G~~~~ 163 (291) T cd02194 93 EEW-LEEFYRGLAEAADRYGVPLVGGDTTSG----SELVISVTALGEVEKGKPLRRSGAKPGDLLYVTG----TLGDAAA 163 (291) T ss_pred HHH-HHHHHHHHHHHHHHHCCCEEECCCCCC----CCEEEEEEEEEEECCCCEEECCCCCCCCEEEEEC----CCCHHHH T ss_conf 999-999999999999986980661341235----6548999999997688434115799899799946----7427799 Q ss_pred HHHHHHHCCCCCCCCCCCCCCCHHHHHH-----HHHHHHHHCCCCCCHHCCCCCCCCHHHHHHHHHCCCCCCEEEEEHHH Q ss_conf 6667772865333444332679899988-----89998740113830110347752011122332014775289962022 Q gi|254780970|r 220 SMASEEFGENIAKKRPTVQVGDPFTGKC-----LLEACLELMNTDAVIAIQDMGAAGLTCSAIEMGNQGNLGITLDLDKV 294 (737) Q Consensus 220 ~~sS~~~~~~~~~~~~~VQ~gdp~~ek~-----~~~~~~~~~~~~~i~~~~D~gaGGl~~a~~Ema~~~~~G~~i~l~~v 294 (737) .+...........+. -..+.++. .++....+ .++.+++|||+++ ||..++.||+.+||+|++|+++++ T Consensus 164 gl~~~~~~~~~~~~~-----~~~~~~~~~~p~~~i~~~~~l-~~~~v~a~~DiSd-GL~~~l~eia~~S~vg~~I~~~~i 236 (291) T cd02194 164 GLALLLGGLKLPEEL-----YEELIERHLRPEPRLELGRAL-AEGLATAMIDISD-GLLADLGHIAEASGVGAVIDLDKL 236 (291) T ss_pred HHHHHHCCCCCCCHH-----HHHHHHHHHCCCCHHHHHHHH-HHHHHHHHHCCCH-HHHHHHHHHHHHCCCEEEEEHHHC T ss_conf 999996688777225-----799999864888529999999-8500044521005-799999999998498099858868 Q ss_pred CCCC------CCCCHHHHHHC--CCCCEEEEEEECCCHHHHHHHHHHHCCEEECCCCEE Q ss_conf 0466------78999898617--888508999954675889998887097131122121 Q gi|254780970|r 295 PTCE------EGMTAYEMMLS--ESQERMLMILNPEKQHKAQEILNKWGLHFSIIGITT 345 (737) Q Consensus 295 p~~~------~~l~~~ei~~s--EsQeR~~~~v~~~~~~~~~~i~~~~~~~~~~iG~vt 345 (737) |+.. .+++|+|+.++ |.+| ++++|+|++. .+++++.++++++||+|| T Consensus 237 p~~~~~~~~~~~~~~~~~~l~gGedy~-ll~tv~~~~~---~~~~~~~~~~~~~IG~vt 291 (291) T cd02194 237 PLSPALRAAELGEDALELALSGGEDYE-LLFTVPPENA---EAAAAKLGVPVTVIGRVT 291 (291) T ss_pred CCCHHHHHHHCCCCHHHHHHHCCCCEE-EEEEECHHHH---HHHHHHCCCCEEEEEEEC T ss_conf 999899987629999999970997407-9999778999---999987398958998869 No 28 >cd06061 PurM-like1 AIR synthase (PurM) related protein, subgroup 1 of unknown function. The family of PurM related proteins includes Hydrogen expression/formation protein HypE, AIR synthases, FGAM synthase and Selenophosphate synthetase (SelD). They all contain two conserved domains and seem to dimerize. The N-terminal domain forms the dimer interface and is a putative ATP binding domain. Probab=100.00 E-value=3.2e-34 Score=253.55 Aligned_cols=256 Identities=23% Similarity=0.296 Sum_probs=187.1 Q ss_pred CCCCCC--CCCCCEEEECCCCCCEEEEECCCCCCCHHCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHCCCCCCCCCCHH Q ss_conf 100001--357631420377762279730453210000100113567897776410278510223221024566686213 Q gi|254780970|r 428 NSIQLP--GGDAGVIRVEGHETKALAFSSDVTPRYVKADPFEGTKQAVAECWRNIIATGAKPLAITDNLNFGNPEKEEIM 505 (737) Q Consensus 428 ~tv~~P--g~DaaVv~~~~~~~~gia~s~g~~p~~~~~dP~~ga~~AV~Eal~Nl~a~Ga~Plait~~lnf~~P~~~e~~ 505 (737) +.++.| |+||||++++. +-+++|+| |. ..+||..++.||+.+++||+||||+|++++++++|+...+.+ T Consensus 24 ~v~vg~~~gdDaavi~~~~---~~lvvttD--p~--~~~~~~~G~~ava~~~sDI~amGa~P~~~l~~l~~p~~~~~~-- 94 (298) T cd06061 24 EVLVGPGGGEDAAVVDFGG---KVLVVSTD--PI--TGAGKDAGWLAVHIAANDIATSGARPRWLLVTLLLPPGTDEE-- 94 (298) T ss_pred CEEECCCCCCEEEEEEECC---CEEEEECC--CC--CCCCHHHHHHHHHHHHHHHHHCCCEEHHHHHHEECCCCCCHH-- T ss_conf 8877899985259998689---87999848--83--058136999999987999998599828876121188999999-- Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCCEEECCCCCCCCCHH Q ss_conf 46999999998522102886654432001123686667631112222333520233102143898075137766554326 Q gi|254780970|r 506 GQFVHSVKGIREACQILDFPIVSGNVSFYNETNGQSIFPTPTIAGVGILPDYSLMTRIDSAHEGDLILMIGNDGCHLDCS 585 (737) Q Consensus 506 ~ql~~av~gl~d~c~~lgipivgGkvSl~n~t~~~~I~pTpvi~~vG~v~d~~~~it~~~k~~Gd~I~liG~~~~~LggS 585 (737) .+.+.++|+.++|+++|++++||+.+++.... .|.+.++++|.++. .+.++...+++||.|+++|..+.+ +.+ T Consensus 95 -~l~~i~~Gi~~~~~~~g~~lvGGdT~~~~~~~----~~~i~~t~~G~~~~-~~~~~~~~ak~GD~Iiltg~~G~~-~~~ 167 (298) T cd06061 95 -ELKAIMREINEAAKELGVSIVGGHTEVTPGVT----RPIISVTAIGKGEK-DKLVTPSGAKPGDDIVMTKGAGIE-GTA 167 (298) T ss_pred -HHHHHHHHHHHHHHHHCCCEECCEEEECCCCC----CCEEEEEEEEEECC-CCEECCCCCCCCCEEEEECCCCHH-HHH T ss_conf -99999999999999839956443158647877----76799999999568-866306779999999998984076-899 Q ss_pred HHHHHHHCCCCCCCCCCCHHHHHHHHHH---HH--HHHHCCCEEEEEECCCCHHHHHHHHHHHCCCCCEEEEECCCCC-- Q ss_conf 7888861167899964798999999999---99--9999598069946486579999999984589708998056668-- Q gi|254780970|r 586 MYSLECASSNIGPPPKVDCHLEKNHGFF---VL--SMINAKKITACHDISTGGLIITLAEMTISSAKGMDIILPIEIE-- 658 (737) Q Consensus 586 ~~~~~~~~~~~g~~p~~d~~~~k~~~~~---v~--~li~~g~I~S~HDiSdGGL~~aL~EMa~as~~G~~I~L~~~~~-- 658 (737) +++..+................+..+.. +. .+..+..++|+||+|+|||+.+|.|||.+|++|++|+++..+- T Consensus 168 l~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~a~~Dis~gGL~~~L~e~~~~s~vg~~I~~~~iPi~~ 247 (298) T cd06061 168 ILANDFEEELKKRLSEEELREAAKLFYKISVVKEALIAAEAGVTAMHDATEGGILGALWEVAEASGVGLRIEKDKIPIRQ 247 (298) T ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHCCCCHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHHHCCCEEEEEECCCCCCH T ss_conf 99987066552346758899878754022239999975105755987167552777899999746974999775468887 Q ss_pred ----------CCHHHHHHCCCCCCEEEEECHHHHHHHHHHHHHCCCCEEEEEEE Q ss_conf ----------98788872988321899996880998999998579849998998 Q gi|254780970|r 659 ----------KDPKPFLFGEDQGRYVVCISPENQDLVMSEANNKNIPLRYLGKV 702 (737) Q Consensus 659 ----------~d~~~~LFsEs~Gr~ii~V~~~~~~~~~~~~~~~gi~~~~IG~V 702 (737) .||+. +| +.|+++++|++++.+++++.++++|+++++||+| T Consensus 248 ~~~~~~~~~~~dp~~-~~--~~G~ll~~v~~~~~~~v~~~l~~~g~~a~vIGeV 298 (298) T cd06061 248 ETKEICEALGIDPLR-LI--SSGTLLITVPPEKGDELVDALEEAGIPASVIGKI 298 (298) T ss_pred HHHHHHHHCCCCHHH-HH--HCCEEEEEECHHHHHHHHHHHHHCCCCEEEEEEC T ss_conf 999999974978888-97--0887999998899999999999649796999869 No 29 >cd02195 SelD Selenophosphate synthetase (SelD) catalyzes the conversion of selenium to selenophosphate which is required by a number of bacterial, archaeal and eukaryotic organisms for synthesis of Secys-tRNA, the precursor of selenocysteine in selenoenzymes. The N-terminal domain of SelD is related to the ATP-binding domains of hydrogen expression/formation protein HypE, the AIR synthases, and FGAM synthase and is thought to bind ATP. Probab=100.00 E-value=3.3e-33 Score=246.60 Aligned_cols=246 Identities=21% Similarity=0.250 Sum_probs=186.4 Q ss_pred CCCC--CCCCCEEEECCCCCCEEEEECCCCCCCHHCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHCCCCCCCCCCHHHH Q ss_conf 0001--35763142037776227973045321000010011356789777641027851022322102456668621346 Q gi|254780970|r 430 IQLP--GGDAGVIRVEGHETKALAFSSDVTPRYVKADPFEGTKQAVAECWRNIIATGAKPLAITDNLNFGNPEKEEIMGQ 507 (737) Q Consensus 430 v~~P--g~DaaVv~~~~~~~~gia~s~g~~p~~~~~dP~~ga~~AV~Eal~Nl~a~Ga~Plait~~lnf~~P~~~e~~~q 507 (737) ++.| |.||||+++. .++.++.|+|..+... .|||..++.|+..+++||++||++|++++.++.+|...+...... T Consensus 35 lvg~~~gdDaAvi~~~--~~~~lv~t~D~f~~~v-~dp~~~G~~Av~~alsDiaamGa~P~~~l~~l~lP~~~~~~~~~~ 111 (287) T cd02195 35 LVGLGTGDDAAVYRLP--GGLALVQTTDFFPPIV-DDPYLFGRIAAANALSDIYAMGAKPLSALAIVTLPRKLPALQEEV 111 (287) T ss_pred CCCCCCCCCEEEEEEC--CCEEEEEECCCCCCCC-CCHHHHHHHHHHHHHHHHHHHCCCHHHHHHHEECCCCCCCCHHHH T ss_conf 2389987754899718--9818999757877766-598999999999866469884587699998802777775112999 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCCEEECCCCCCCCCHHH- Q ss_conf 9999999985221028866544320011236866676311122223335202331021438980751377665543267- Q gi|254780970|r 508 FVHSVKGIREACQILDFPIVSGNVSFYNETNGQSIFPTPTIAGVGILPDYSLMTRIDSAHEGDLILMIGNDGCHLDCSM- 586 (737) Q Consensus 508 l~~av~gl~d~c~~lgipivgGkvSl~n~t~~~~I~pTpvi~~vG~v~d~~~~it~~~k~~Gd~I~liG~~~~~LggS~- 586 (737) |.+.++|+.++|+++|++++||+.+ ..+ -++..++++|.++. ++.++...+++||.|++++. +|.+. T Consensus 112 l~~i~~Gi~~~~~~~gv~ivGGhT~------~~~-~~~i~~tv~G~~~~-~~~i~~~~ak~GD~IivTk~----lG~g~~ 179 (287) T cd02195 112 LREILAGGKDKLREAGAVLVGGHTI------EGP-EPKYGLSVTGLVHP-NKILRNSGAKPGDVLILTKP----LGTGIL 179 (287) T ss_pred HHHHHHHHHHHHHHCCCCEECCCEE------CCC-CCEEEEEEEEEECC-CCCEECCCCCCCCEEEEECC----CCHHHH T ss_conf 9999999999999829957755332------088-86688889988757-76254268999999999578----758999 Q ss_pred HHHHHHCCCCCCCCCCCHHHHH----HHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHCCCCCEEEEECCCCCCCHH Q ss_conf 8888611678999647989999----999999999995980699464865799999999845897089980566689878 Q gi|254780970|r 587 YSLECASSNIGPPPKVDCHLEK----NHGFFVLSMINAKKITACHDISTGGLIITLAEMTISSAKGMDIILPIEIEKDPK 662 (737) Q Consensus 587 ~~~~~~~~~~g~~p~~d~~~~k----~~~~~v~~li~~g~I~S~HDiSdGGL~~aL~EMa~as~~G~~I~L~~~~~~d~~ 662 (737) ++....... +..+.+... .......++..+..++||||+|+|||+.+|.|||.+|++|++|+++..+- T Consensus 180 ~~a~~~~~~----~~~~~~~~~~~~~~~~~~~~~l~~~~~v~Am~DiT~gGL~g~L~ema~aSgvg~~i~~~~IPv---- 251 (287) T cd02195 180 FAAEMAGLA----RGEDIDAALESMARLNRAAAELLRKYGAHACTDVTGFGLLGHLLEMARASGVSAEIDLDKLPL---- 251 (287) T ss_pred HHHHHCCCC----CHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCHHHHHHHHHHHHCCCEEEEEECCCCE---- T ss_conf 999872667----868899999998531399999986558733323655478999999999779909997044877---- Q ss_pred HHHHCCCCCCEEEEECHHHHHHHHHHHHHCCCCEEEEEEE Q ss_conf 8872988321899996880998999998579849998998 Q gi|254780970|r 663 PFLFGEDQGRYVVCISPENQDLVMSEANNKNIPLRYLGKV 702 (737) Q Consensus 663 ~~LFsEs~Gr~ii~V~~~~~~~~~~~~~~~gi~~~~IG~V 702 (737) -|+.|.++++|++++.+++++.+++.|+++++||+| T Consensus 252 ----~qtsG~ll~~v~~~~a~~~~~~l~~~g~~a~iIG~V 287 (287) T cd02195 252 ----LQTSGGLLAAVPPEDAAALLALLKAGGPPAAIIGEV 287 (287) T ss_pred ----EEECCCEEEEECHHHHHHHHHHHHHCCCCEEEEEEC T ss_conf ----971582599986899999999999669984999779 No 30 >cd02195 SelD Selenophosphate synthetase (SelD) catalyzes the conversion of selenium to selenophosphate which is required by a number of bacterial, archaeal and eukaryotic organisms for synthesis of Secys-tRNA, the precursor of selenocysteine in selenoenzymes. The N-terminal domain of SelD is related to the ATP-binding domains of hydrogen expression/formation protein HypE, the AIR synthases, and FGAM synthase and is thought to bind ATP. Probab=100.00 E-value=1.5e-34 Score=255.89 Aligned_cols=275 Identities=23% Similarity=0.279 Sum_probs=207.8 Q ss_pred HCCCCCCCCCHHHHHHH-----HHCCCCEEECC--CCCEEEEEECCCEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 00110744588999751-----20697489933--477368983698399999517887776588887210558102257 Q gi|254780970|r 44 WNEHCSYKSSKKWLRTL-----PTTGKHVIQGP--GENAGVVDIGGGDCVVFKMESHNHPSYIEPYQGAATGVGGILRDI 116 (737) Q Consensus 44 wSEHC~~k~~~~~l~~~-----~~~~~~vi~~~--~~~a~vi~~~~~~~~~~k~EtHNhPs~i~P~~GAaTg~gG~iRDi 116 (737) --.=|.-|-....|+.+ ....+.++.+| ||||+|+++.++..++++++ |.+|..-+||.-.---+-..++|| T Consensus 6 ~c~gc~~Kv~~~~L~~~l~~~~~~~~~~vlvg~~~gdDaAvi~~~~~~~lv~t~D-~f~~~v~dp~~~G~~Av~~alsDi 84 (287) T cd02195 6 KCAGCGAKVGPGVLSQLLAGLPLPTDPNLLVGLGTGDDAAVYRLPGGLALVQTTD-FFPPIVDDPYLFGRIAAANALSDI 84 (287) T ss_pred CCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCEEEEEECCCEEEEEECC-CCCCCCCCHHHHHHHHHHHHHHHH T ss_conf 6872677489899999997377878955023899877548997189818999757-877766598999999999866469 Q ss_pred CCCCCEEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEECCCCCCCCCCEEEECCCCHHHHHHCC Q ss_conf 67787436877504578756863888998887458866543087211320355215676642211221121135543112 Q gi|254780970|r 117 FTMGARPVAAMNSLRFGAIHHPKTKHLLSGVVAGIAGYSNSFGVPTVGGEVEFLPCYNNNIIVNTFAAGIAKTNAIFSSK 196 (737) Q Consensus 117 ~~~Ga~p~a~~~~l~~g~~~~p~~~~~~~~~~~G~~~ygN~~G~P~v~G~~~~~~~y~~~~~v~~~~~Gi~~~~~~~~~~ 196 (737) .+|||+|++.+..+.++........+.+.++.+|+.++++++|+|+|||+|...+ .+.++..+.|.++.+++++.. T Consensus 85 aamGa~P~~~l~~l~lP~~~~~~~~~~l~~i~~Gi~~~~~~~gv~ivGGhT~~~~----~~~i~~tv~G~~~~~~~i~~~ 160 (287) T cd02195 85 YAMGAKPLSALAIVTLPRKLPALQEEVLREILAGGKDKLREAGAVLVGGHTIEGP----EPKYGLSVTGLVHPNKILRNS 160 (287) T ss_pred HHHCCCHHHHHHHEECCCCCCCCHHHHHHHHHHHHHHHHHHCCCCEECCCEECCC----CCEEEEEEEEEECCCCCEECC T ss_conf 8845876999988027777751129999999999999999829957755332088----866888899887577625426 Q ss_pred CCCCCCEEEEECCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCHHCCCCCCCCHHHHH Q ss_conf 25689779996576564441046666777286533344433267989998889998740113830110347752011122 Q gi|254780970|r 197 ARGIGLPLVYLGAKTGRDGIGGASMASEEFGENIAKKRPTVQVGDPFTGKCLLEACLELMNTDAVIAIQDMGAAGLTCSA 276 (737) Q Consensus 197 ~~~~Gd~ii~~G~~tg~dGi~Ga~~sS~~~~~~~~~~~~~VQ~gdp~~ek~~~~~~~~~~~~~~i~~~~D~gaGGl~~a~ 276 (737) ..+|||.|++.| ..|.+. .++............... -..+ .++.+...++..+..+++|||++.|||+.++ T Consensus 161 ~ak~GD~IivTk----~lG~g~-~~~a~~~~~~~~~~~~~~---~~~~-~~~~~~~~~l~~~~~v~Am~DiT~gGL~g~L 231 (287) T cd02195 161 GAKPGDVLILTK----PLGTGI-LFAAEMAGLARGEDIDAA---LESM-ARLNRAAAELLRKYGAHACTDVTGFGLLGHL 231 (287) T ss_pred CCCCCCEEEEEC----CCCHHH-HHHHHHCCCCCHHHHHHH---HHHH-HHHHHHHHHHHHHCCCCEEECCCCCHHHHHH T ss_conf 899999999957----875899-999987266786889999---9998-5313999999865587333236554789999 Q ss_pred HHHHCCCCCCEEEEEHHHCCCCCCCCHHHHHHCCCCCEEEEEEECCCHHHHHHHHHHHCCEEECCCCE Q ss_conf 33201477528996202204667899989861788850899995467588999888709713112212 Q gi|254780970|r 277 IEMGNQGNLGITLDLDKVPTCEEGMTAYEMMLSESQERMLMILNPEKQHKAQEILNKWGLHFSIIGIT 344 (737) Q Consensus 277 ~Ema~~~~~G~~i~l~~vp~~~~~l~~~ei~~sEsQeR~~~~v~~~~~~~~~~i~~~~~~~~~~iG~v 344 (737) .||+..|++|++|++++||+. |++..|+++|+|++.+++.+.++++++++++||+| T Consensus 232 ~ema~aSgvg~~i~~~~IPv~------------qtsG~ll~~v~~~~a~~~~~~l~~~g~~a~iIG~V 287 (287) T cd02195 232 LEMARASGVSAEIDLDKLPLL------------QTSGGLLAAVPPEDAAALLALLKAGGPPAAIIGEV 287 (287) T ss_pred HHHHHHCCCEEEEEECCCCEE------------EECCCEEEEECHHHHHHHHHHHHHCCCCEEEEEEC T ss_conf 999997799099970448779------------71582599986899999999999669984999779 No 31 >PRK05731 thiamine monophosphate kinase; Provisional Probab=100.00 E-value=1.2e-32 Score=242.83 Aligned_cols=264 Identities=19% Similarity=0.249 Sum_probs=187.9 Q ss_pred CCCCCCCCCCCCCCEEEECCCCCCEEEEECCC---CCCC--HHCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHCCCCCC Q ss_conf 00110000135763142037776227973045---3210--000100113567897776410278510223221024566 Q gi|254780970|r 425 IQSNSIQLPGGDAGVIRVEGHETKALAFSSDV---TPRY--VKADPFEGTKQAVAECWRNIIATGAKPLAITDNLNFGNP 499 (737) Q Consensus 425 V~g~tv~~Pg~DaaVv~~~~~~~~gia~s~g~---~p~~--~~~dP~~ga~~AV~Eal~Nl~a~Ga~Plait~~lnf~~P 499 (737) .+....+.||+||||+++. .++.+++|+|. +.++ ...+||..|+.||+.+++||+||||+|+++++++.+ | T Consensus 17 ~~~~~~~giGDDaAvi~~~--~~~~lv~ttD~~ve~~HF~~~~~~p~~iG~~av~~n~SDiaamGa~P~~~~~sl~l--P 92 (320) T PRK05731 17 QRADVELGIGDDAALLDPP--PGQRLVVSTDTLVEGVHFRPDWSSPEDLGYKALAVNLSDLAAMGARPAAFLLALAL--P 92 (320) T ss_pred CCCCCCCCCCCCEEEEEEC--CCCEEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECC--C T ss_conf 7988646898777999708--99879999556565767787889999999999998788899718972278885026--8 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCCEEECCCCC Q ss_conf 68621346999999998522102886654432001123686667631112222333520233102143898075137766 Q gi|254780970|r 500 EKEEIMGQFVHSVKGIREACQILDFPIVSGNVSFYNETNGQSIFPTPTIAGVGILPDYSLMTRIDSAHEGDLILMIGNDG 579 (737) Q Consensus 500 ~~~e~~~ql~~av~gl~d~c~~lgipivgGkvSl~n~t~~~~I~pTpvi~~vG~v~d~~~~it~~~k~~Gd~I~liG~~~ 579 (737) .+.+.. .+.+.++|+.++|+++|++++||+.+ ..+ ...+.++++|.++. .+.++...+++||.|+++|..+ T Consensus 93 ~~~~~~-~l~~i~~Gi~~~~~~~gv~lvGGdt~------~~~-~~~i~vt~iG~~~~-~~~l~r~ga~~GD~I~vTG~lG 163 (320) T PRK05731 93 KDVDEA-WLEALADGLFALADRYGAELIGGDTT------RGP-DLSISVTAIGEVPG-GRALRRSGAKPGDLVAVTGTLG 163 (320) T ss_pred CCCHHH-HHHHHHHHHHHHHHHHCCEEEECCCC------CCC-CCEEEEEEEEECCC-CCEEECCCCCCCCEEEECCCCC T ss_conf 864399-99999999999999839569604312------588-71799999961179-9656236789898789727742 Q ss_pred CCCCHHHHHHHHHCCCCCCCCCCCHHHHHHHH-------HHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHCCCCCEEEE Q ss_conf 55432678888611678999647989999999-------99999999598069946486579999999984589708998 Q gi|254780970|r 580 CHLDCSMYSLECASSNIGPPPKVDCHLEKNHG-------FFVLSMINAKKITACHDISTGGLIITLAEMTISSAKGMDII 652 (737) Q Consensus 580 ~~LggS~~~~~~~~~~~g~~p~~d~~~~k~~~-------~~v~~li~~g~I~S~HDiSdGGL~~aL~EMa~as~~G~~I~ 652 (737) ....| +.............. .......+ .....+ .+.++||||+|| ||+.+|.|||.+|++|++|+ T Consensus 164 ~s~~g--l~~l~~~~~~~~~~~--~~~~~~~~~~p~~~~~~~~~l--~~~~~a~~DiSd-GL~~~L~eia~~Sgvg~~I~ 236 (320) T PRK05731 164 DSAAG--LALLLNGGQASANED--AAALISRHLRPQPRVGLGQAL--RGLASAAMDISD-GLAADLGHIAEASGVGADID 236 (320) T ss_pred HHHHH--HHHHHCCCCCCCCHH--HHHHHHHHHCCCCCHHHHHHH--HHHHHHHCCCCH-HHHHHHHHHHHHCCCCEEEE T ss_conf 77889--999847877784055--799999962898438999997--765555320416-79999999999869939985 Q ss_pred ECCCC--------CCCHHHHHH--CCCCCCEEEEECHHHHHHHHHHHHHCCCCEEEEEEECCC-EEEE Q ss_conf 05666--------898788872--988321899996880998999998579849998998798-6898 Q gi|254780970|r 653 LPIEI--------EKDPKPFLF--GEDQGRYVVCISPENQDLVMSEANNKNIPLRYLGKVSGS-TLSI 709 (737) Q Consensus 653 L~~~~--------~~d~~~~LF--sEs~Gr~ii~V~~~~~~~~~~~~~~~gi~~~~IG~Vt~~-~l~i 709 (737) ++..+ ..+++++.+ +|++. ++++|++++.+++++.+++.++++++||+|++. .+.+ T Consensus 237 ~~~iP~~~~~~~~~~~~~~~~l~gGedye-Ll~tv~~~~~~~~~~~~~~~gi~~~~IG~v~~~~gv~l 303 (320) T PRK05731 237 ADALPVSAALAALGEDALRWALSGGEDYE-LLFTVPPENRGALLAAAKKLGVGVTIIGRVAGGEGVLL 303 (320) T ss_pred HHHCCCCHHHHHHHHHHHHHHHCCCCCEE-EEEEECHHHHHHHHHHHHHCCCCEEEEEEEECCCCEEE T ss_conf 88789875665535549998316787336-99998899999999999975989699999984884799 No 32 >cd02691 PurM-like2 AIR synthase (PurM) related protein, archaeal subgroup 2 of unknown function. The family of PurM related proteins includes Hydrogen expression/formation protein HypE, AIR synthases, FGAM synthase and Selenophosphate synthetase (SelD). They all contain two conserved domains and seem to dimerize. The N-terminal domain forms the dimer interface and is a putative ATP binding domain. Probab=100.00 E-value=3.8e-33 Score=246.21 Aligned_cols=287 Identities=20% Similarity=0.269 Sum_probs=206.3 Q ss_pred HHHHHHHCCCCEEECCCCCEEEEEECCCEEEEEEECCCC-CCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEECCCCC Q ss_conf 997512069748993347736898369839999951788-7776588887210558102257677874368775045787 Q gi|254780970|r 56 WLRTLPTTGKHVIQGPGENAGVVDIGGGDCVVFKMESHN-HPSYIEPYQGAATGVGGILRDIFTMGARPVAAMNSLRFGA 134 (737) Q Consensus 56 ~l~~~~~~~~~vi~~~~~~a~vi~~~~~~~~~~k~EtHN-hPs~i~P~~GAaTg~gG~iRDi~~~Ga~p~a~~~~l~~g~ 134 (737) .-+.|..+...++..-+|||+++..++++ +++.+++|. +|++. ||--.---+-|.+||+..|||||++++.++.+.+ T Consensus 20 i~~~~~~~~~~~~l~~~dDaa~~~~~~~~-iv~ttDg~~~~~~~f-P~~~G~~av~~tvnDla~mGA~P~~l~~~~~l~e 97 (346) T cd02691 20 LAELIGKTGEVSIVAQDDDAGVDAADVEY-IVVAIDGIHSRLSDF-PFLAGFHATRAALRDVMVMGARPVALLSDIHLAD 97 (346) T ss_pred HHHHHHHCCCCCCCCCCCCEEEECCCCCE-EEEEECCEEECCCCC-CCCCHHHEEEEEHHHHHHCCCCCCEEEEEEEECC T ss_conf 99998653686444567864675379957-999947778556668-5112133476556667764884330389999669 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEECCC---CCCCCCCEEEECCCCHHHHHHCCCCCCCCEEEEECCCC Q ss_conf 568638889988874588665430872113203552156---76642211221121135543112256897799965765 Q gi|254780970|r 135 IHHPKTKHLLSGVVAGIAGYSNSFGVPTVGGEVEFLPCY---NNNIIVNTFAAGIAKTNAIFSSKARGIGLPLVYLGAKT 211 (737) Q Consensus 135 ~~~p~~~~~~~~~~~G~~~ygN~~G~P~v~G~~~~~~~y---~~~~~v~~~~~Gi~~~~~~~~~~~~~~Gd~ii~~G~~t 211 (737) -- ....+...++|++.++++.|+|+|+|+|...+.. ..++..+++++|+.+...+.+ +..+|||.|++.| T Consensus 98 g~---~~~~L~~i~~~m~~aa~~~gV~iV~GdT~vv~gd~v~g~~i~~~v~~iGv~~~~~~~~-~~ArpGD~Ilvsg--- 170 (346) T cd02691 98 DG---DVGKLFDFTAGVTAVSEATGVPLVAGSTLRIGGDMVLGDRLVGGVGAVGRSKSDPSRR-KNAEPGDLILMTE--- 170 (346) T ss_pred CC---CHHHHHHHHHHHHHHHHHCCCCEEECCCEEECCCCEEEEEEEEEEEEEEECCCCCCCC-CCCCCCCEEEEEC--- T ss_conf 99---9999999999999999985998971473683699466436899888877404776411-4799999999978--- Q ss_pred CCCCCHHHHHHHHHHCCCCC-CCCCCCCCCCHHHHHHHHHHHHHHCCCCCCHHCCCCCCCCHHHHHHHHHCCCCCCEEEE Q ss_conf 64441046666777286533-34443326798999888999874011383011034775201112233201477528996 Q gi|254780970|r 212 GRDGIGGASMASEEFGENIA-KKRPTVQVGDPFTGKCLLEACLELMNTDAVIAIQDMGAAGLTCSAIEMGNQGNLGITLD 290 (737) Q Consensus 212 g~dGi~Ga~~sS~~~~~~~~-~~~~~VQ~gdp~~ek~~~~~~~~~~~~~~i~~~~D~gaGGl~~a~~Ema~~~~~G~~i~ 290 (737) |+||.+++...+.+..+ ....+.+..+...-+.+++ ......|++|+|++.|||++++.|||.+||+|++|| T Consensus 171 ---giGg~~ia~~a~~~g~~~~~~et~~~~~~~~~~~l~~----~~l~~~IhaMrD~TrGGLa~~LnEiA~~SgVgi~Id 243 (346) T cd02691 171 ---GAGGGTITTTAIYHGMPDVVEETLNVDFIKACEALRD----SGLVSKVHSMTDVTNGGIRGDALEISKTAGVSLVFD 243 (346) T ss_pred ---CCCCHHHHHHHHHCCCCCCCHHCCCHHHHHHHHHHHH----HHHHCCCEEEECCCCHHHHHHHHHHHHHCCCEEEEE T ss_conf ---8784178899987146432100012046788999998----531047338776862458889999998669759997 Q ss_pred EHHH--CCCCCCCCHHHHH----HCCCCCEEEEEEECCCHHHHHHHHHHHCCEEECCCCEEECCCCCCCCCCCC Q ss_conf 2022--0466789998986----178885089999546758899988870971311221210443100025864 Q gi|254780970|r 291 LDKV--PTCEEGMTAYEMM----LSESQERMLMILNPEKQHKAQEILNKWGLHFSIIGITTNDKLFRVIHRGEE 358 (737) Q Consensus 291 l~~v--p~~~~~l~~~ei~----~sEsQeR~~~~v~~~~~~~~~~i~~~~~~~~~~iG~vt~~~~~~v~~~g~~ 358 (737) .++| |++.+-....|+| ++.|+|+++++++|++.+++.+.++++++++++||+||+.....+..+++. T Consensus 244 ee~I~~pv~p~V~~~cE~LgiDPL~vaneg~Lii~p~e~~~~i~~~l~~~Gv~a~~IG~V~~g~~~~l~~~g~~ 317 (346) T cd02691 244 EEKVRSLINPKVLKMLEELGIDPLGVSLDSLMIIAPEEDAVDIIRTLREAGVRADEVGRVEEGRGVPLVVTGEG 317 (346) T ss_pred HHHCCCCCCHHHHHHHHHHCCCHHHCCCCCEEEEECHHHHHHHHHHHHHCCCCEEEEEEEEECCCCEEEECCCC T ss_conf 67704656889999999859870102556269996677889999999876997389999984896178638961 No 33 >cd02194 ThiL ThiL (Thiamine-monophosphate kinase) plays a dual role in de novo biosynthesis and in salvage of exogenous thiamine. Thiamine salvage occurs in two steps, with thiamine kinase catalyzing the formation of thiamine phosphate, and ThiL catalyzing the conversion of this intermediate to thiamine pyrophosphate. The N-terminal domain of ThiL binds ATP and is related to the ATP-binding domains of hydrogen expression/formation protein HypE, the AIR synthases, FGAM synthase and selenophosphate synthetase (SelD). Probab=100.00 E-value=2.1e-31 Score=234.24 Aligned_cols=252 Identities=22% Similarity=0.275 Sum_probs=178.1 Q ss_pred CCCCCCCCCCCEEEECCCCCCEEEEECCCC---C-CCHHCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHCCCCCCCCCC Q ss_conf 100001357631420377762279730453---2-100001001135678977764102785102232210245666862 Q gi|254780970|r 428 NSIQLPGGDAGVIRVEGHETKALAFSSDVT---P-RYVKADPFEGTKQAVAECWRNIIATGAKPLAITDNLNFGNPEKEE 503 (737) Q Consensus 428 ~tv~~Pg~DaaVv~~~~~~~~gia~s~g~~---p-~~~~~dP~~ga~~AV~Eal~Nl~a~Ga~Plait~~lnf~~P~~~e 503 (737) +.++.||+||||++.. ++.+++|+|.. . .....+||..|+.+|+.+++||+||||+|++++.++.++. +.+ T Consensus 18 ~~~~g~GDDaavi~~~---~~~lv~ttD~~~e~~hf~~~~~p~~iG~~av~~nlSDIaamGa~P~~~~~sl~lp~--~~~ 92 (291) T cd02194 18 GVLLGIGDDAAVLKPP---GGRLVVTTDTLVEGVHFPPDTTPEDIGWKALAVNLSDLAAMGARPLGFLLSLGLPP--DTD 92 (291) T ss_pred CCCCCCCCCEEEEEEC---CCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEECCC--CCC T ss_conf 8535898887999609---97699996665466668999998999999999888889873896123577650689--998 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCCEEECCCCCCCCC Q ss_conf 13469999999985221028866544320011236866676311122223335202331021438980751377665543 Q gi|254780970|r 504 IMGQFVHSVKGIREACQILDFPIVSGNVSFYNETNGQSIFPTPTIAGVGILPDYSLMTRIDSAHEGDLILMIGNDGCHLD 583 (737) Q Consensus 504 ~~~ql~~av~gl~d~c~~lgipivgGkvSl~n~t~~~~I~pTpvi~~vG~v~d~~~~it~~~k~~Gd~I~liG~~~~~Lg 583 (737) ..| +.+.++|+.++|+++|++++||+.+. .+ ..+..++++|.++. .+.++...+++||.|+++|..+ T Consensus 93 ~~~-l~~i~~Gi~~~~~~~gv~ivGGdt~~------~~-~~~i~vt~iG~~~~-~~~i~r~~ak~GD~i~vtg~~G---- 159 (291) T cd02194 93 EEW-LEEFYRGLAEAADRYGVPLVGGDTTS------GS-ELVISVTALGEVEK-GKPLRRSGAKPGDLLYVTGTLG---- 159 (291) T ss_pred HHH-HHHHHHHHHHHHHHHCCCEEECCCCC------CC-CEEEEEEEEEEECC-CCEEECCCCCCCCEEEEECCCC---- T ss_conf 999-99999999999998698066134123------56-54899999999768-8434115799899799946742---- Q ss_pred HHHHHHHHHCCCCCCCCCCCHHHHHHH------HHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHCCCCCEEEEECCC- Q ss_conf 267888861167899964798999999------9999999995980699464865799999999845897089980566- Q gi|254780970|r 584 CSMYSLECASSNIGPPPKVDCHLEKNH------GFFVLSMINAKKITACHDISTGGLIITLAEMTISSAKGMDIILPIE- 656 (737) Q Consensus 584 gS~~~~~~~~~~~g~~p~~d~~~~k~~------~~~v~~li~~g~I~S~HDiSdGGL~~aL~EMa~as~~G~~I~L~~~- 656 (737) .+.......................++ .... ..+.++.++||||+|| ||+.+|.|||.+|++|++|+++.. T Consensus 160 ~~~~gl~~~~~~~~~~~~~~~~~~~~~~~p~~~i~~~-~~l~~~~v~a~~DiSd-GL~~~l~eia~~S~vg~~I~~~~ip 237 (291) T cd02194 160 DAAAGLALLLGGLKLPEELYEELIERHLRPEPRLELG-RALAEGLATAMIDISD-GLLADLGHIAEASGVGAVIDLDKLP 237 (291) T ss_pred HHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCCHHHHH-HHHHHHHHHHHHCCCH-HHHHHHHHHHHHCCCEEEEEHHHCC T ss_conf 7799999996688777225799999864888529999-9998500044521005-7999999999984980998588689 Q ss_pred ---------CCCCHHHHHHC--CCCCCEEEEECHHHHHHHHHHHHHCCCCEEEEEEEC Q ss_conf ---------68987888729--883218999968809989999985798499989987 Q gi|254780970|r 657 ---------IEKDPKPFLFG--EDQGRYVVCISPENQDLVMSEANNKNIPLRYLGKVS 703 (737) Q Consensus 657 ---------~~~d~~~~LFs--Es~Gr~ii~V~~~~~~~~~~~~~~~gi~~~~IG~Vt 703 (737) ...++++++++ |++. +|++|++++.+ +.+++.++++++||+|| T Consensus 238 ~~~~~~~~~~~~~~~~~~l~gGedy~-ll~tv~~~~~~---~~~~~~~~~~~~IG~vt 291 (291) T cd02194 238 LSPALRAAELGEDALELALSGGEDYE-LLFTVPPENAE---AAAAKLGVPVTVIGRVT 291 (291) T ss_pred CCHHHHHHHCCCCHHHHHHHCCCCEE-EEEEECHHHHH---HHHHHCCCCEEEEEEEC T ss_conf 99899987629999999970997407-99997789999---99987398958998869 No 34 >COG0309 HypE Hydrogenase maturation factor [Posttranslational modification, protein turnover, chaperones] Probab=100.00 E-value=2.7e-31 Score=233.46 Aligned_cols=280 Identities=26% Similarity=0.431 Sum_probs=211.9 Q ss_pred CEEECCCCCEEEEEECCCEEEEEEECCCCCCCCCCC--CCCCCC---CCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCH Q ss_conf 489933477368983698399999517887776588--887210---558102257677874368775045787568638 Q gi|254780970|r 66 HVIQGPGENAGVVDIGGGDCVVFKMESHNHPSYIEP--YQGAAT---GVGGILRDIFTMGARPVAAMNSLRFGAIHHPKT 140 (737) Q Consensus 66 ~vi~~~~~~a~vi~~~~~~~~~~k~EtHNhPs~i~P--~~GAaT---g~gG~iRDi~~~Ga~p~a~~~~l~~g~~~~p~~ 140 (737) .+....+||++.++++++ .++|+-.+| .++| |+|--- .+-|..+|+..|||+|.+++..|..+.-- + T Consensus 32 ~v~~~~g~D~~~i~l~~~-~la~tTD~~----~i~P~ff~~~diG~lAV~gt~NDlav~GA~P~~l~~~lil~eg~---~ 103 (339) T COG0309 32 NVGLANGEDAAIIDLGDG-VLAFTTDPF----VIDPLFFPGGDIGKLAVHGTANDVAVSGAKPRYLSVGLILPEGL---P 103 (339) T ss_pred CCCCCCCCCEEEEECCCC-EEEEEECCE----EECCCCCCCCCEEEEEEEEEHHHHHHCCCCCEEEEEEEECCCCC---C T ss_conf 548655664113403784-699981880----75466457886289999986225440589860026777358999---8 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCEEEEECCCCCCCCCCEEEECCCCHHHHHHCCCCCCCCEEEEECCCCCCCCCHHHH Q ss_conf 88998887458866543087211320355215676642211221121135543112256897799965765644410466 Q gi|254780970|r 141 KHLLSGVVAGIAGYSNSFGVPTVGGEVEFLPCYNNNIIVNTFAAGIAKTNAIFSSKARGIGLPLVYLGAKTGRDGIGGAS 220 (737) Q Consensus 141 ~~~~~~~~~G~~~ygN~~G~P~v~G~~~~~~~y~~~~~v~~~~~Gi~~~~~~~~~~~~~~Gd~ii~~G~~tg~dGi~Ga~ 220 (737) ..-+..+.+.+..-.+++|+++|+|+|...+....++.++..++|++..+..++....+|||.||+.| -.|+||++ T Consensus 104 ~e~l~~i~~si~e~a~~~Gv~IvtGdTkV~~~~~~~~vi~tT~iG~~~~~~~v~~~~~~~GD~vI~tg----~~g~hga~ 179 (339) T COG0309 104 IEDLERILKSIDEEAEEAGVSIVTGDTKVVPGGKDPIVINTTGIGIIDKEILVSPSGARPGDAVIVTG----TIGIHGAS 179 (339) T ss_pred HHHHHHHHHHHHHHHHHCCCEEECCCCEEECCCCCCCEEEEEEEEEECCCCCCCCCCCCCCCEEEECC----CHHHHHHH T ss_conf 78999999999999998397498267666159987728986467751377601457899888899818----81688999 Q ss_pred HHHHHHCCC----CCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCHHCCCCCCCCHHHHHHHHHCCCCCCEEEEEHHHCC Q ss_conf 667772865----3334443326798999888999874011383011034775201112233201477528996202204 Q gi|254780970|r 221 MASEEFGEN----IAKKRPTVQVGDPFTGKCLLEACLELMNTDAVIAIQDMGAAGLTCSAIEMGNQGNLGITLDLDKVPT 296 (737) Q Consensus 221 ~sS~~~~~~----~~~~~~~VQ~gdp~~ek~~~~~~~~~~~~~~i~~~~D~gaGGl~~a~~Ema~~~~~G~~i~l~~vp~ 296 (737) .-...+++. .++|..++ .... +..+..+.+....- |++|||.+.|||++++.|||.+||+|+.|+.++||+ T Consensus 180 ila~~~~~~l~~~l~sD~~~~---~~~~-~~~l~~~~~~~~~~-vtAMhDaTrGGla~aLnEmA~aSgvgi~I~ee~Ipv 254 (339) T COG0309 180 ILAHRFGEELETELGSDCAPL---AKLV-KALLSVVGEALAAA-VTAMHDATRGGLAGALNEMAEASGVGISIEEEKIPV 254 (339) T ss_pred HHHHHCCHHHCCCHHHHHHHH---HHHH-HHHHHCCHHHHHHH-HHHCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCC T ss_conf 998752022033235667888---9999-98862005776656-765038753678899999999749849996246666 Q ss_pred CCC--------CCCHHHHHHCCCCCEEEEEEECCCHHHHHHHHHHHC-CEEECCCCEEECCC-CCC--CCCCCCCCCCCC Q ss_conf 667--------899989861788850899995467588999888709-71311221210443-100--025864434432 Q gi|254780970|r 297 CEE--------GMTAYEMMLSESQERMLMILNPEKQHKAQEILNKWG-LHFSIIGITTNDKL-FRV--IHRGEEVANLPI 364 (737) Q Consensus 297 ~~~--------~l~~~ei~~sEsQeR~~~~v~~~~~~~~~~i~~~~~-~~~~~iG~vt~~~~-~~v--~~~g~~v~dlp~ 364 (737) +++ |++||++. ||. .++++|+|++.+++++.+++++ .++++||+|++++. ..+ .++++.+++.|. T Consensus 255 ~~eVr~vce~lGiDPl~~a-nEG--~lv~~V~~~~a~~~l~~L~~~~~~~A~iIGeV~~~~~~v~l~~~~G~~r~l~~P~ 331 (339) T COG0309 255 REEVRGVCELLGLDPLELA-NEG--KLVIAVPPEHAEEVLEALRSHGLKDAAIIGEVVEEKGGVGLETAGGGKRILEPPE 331 (339) T ss_pred CHHHHHHHHHHCCCHHHHH-CCC--EEEEEECHHHHHHHHHHHHHCCCCCCEEEEEEECCCCCEEEEECCCCEEECCCCC T ss_conf 4899999998398878872-586--0999978899999999998649954226899832688379995477257668887 Q ss_pred C Q ss_conf 1 Q gi|254780970|r 365 K 365 (737) Q Consensus 365 ~ 365 (737) . T Consensus 332 ~ 332 (339) T COG0309 332 G 332 (339) T ss_pred C T ss_conf 6 No 35 >cd02197 HypE HypE (Hydrogenase expression/formation protein). HypE is involved in Ni-Fe hydrogenase biosynthesis. HypE dehydrates its own carbamoyl moiety in an ATP-dependent process to yield the enzyme thiocyanate. The N-terminal domain of HypE is related to the ATP-binding domains of the AIR synthases, selenophosphate synthetase (SelD), and FGAM synthase and is thought to bind ATP. Probab=99.98 E-value=1e-30 Score=229.61 Aligned_cols=257 Identities=23% Similarity=0.335 Sum_probs=180.9 Q ss_pred CCCCCCCCCCCCEEEECCCCCCEEEEECCC---CCCCHHCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHCCCCCCCCCC Q ss_conf 110000135763142037776227973045---32100001001135678977764102785102232210245666862 Q gi|254780970|r 427 SNSIQLPGGDAGVIRVEGHETKALAFSSDV---TPRYVKADPFEGTKQAVAECWRNIIATGAKPLAITDNLNFGNPEKEE 503 (737) Q Consensus 427 g~tv~~Pg~DaaVv~~~~~~~~gia~s~g~---~p~~~~~dP~~ga~~AV~Eal~Nl~a~Ga~Plait~~lnf~~P~~~e 503 (737) ++.++..|+|||++++.. ++ +++|+|. +|++. .+..-+++||+-+++||++||++|+++++++.++.- .+. T Consensus 19 ~~~~lgigdDaAvi~~~~--~~-lvvttD~~v~~p~~f--~~~diG~~Av~~~~sDIaamGa~P~~~l~sl~lp~~-~~~ 92 (293) T cd02197 19 DNPILEVLEDAAALLVGG--GR-LAFTTDSFVVSPLFF--PGGDIGKLAVCGTVNDLAMMGAKPLYLSLGFILEEG-FPL 92 (293) T ss_pred CCCCCCCCCCEEEECCCC--CE-EEEEECCCCCCCCCC--CCCCHHHHHHHCCHHHHHHCCCCCHHHEEEEECCCC-CCH T ss_conf 986678888769861699--84-999936542277528--984473787521287898659975676375655899-999 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCCEEECCCCCCCCC Q ss_conf 13469999999985221028866544320011236866676311122223335202331021438980751377665543 Q gi|254780970|r 504 IMGQFVHSVKGIREACQILDFPIVSGNVSFYNETNGQSIFPTPTIAGVGILPDYSLMTRIDSAHEGDLILMIGNDGCHLD 583 (737) Q Consensus 504 ~~~ql~~av~gl~d~c~~lgipivgGkvSl~n~t~~~~I~pTpvi~~vG~v~d~~~~it~~~k~~Gd~I~liG~~~~~Lg 583 (737) ..|.+.++|+.++|+++|++++||+.+.+.++.... ....++++|.++. .+.+++..+++||.|+++|..+.+ | T Consensus 93 --~~l~~i~~Gi~~~~~~~gv~iVGGdT~v~~~~~~~~--l~is~t~iG~v~~-~~~~~~~gA~pGD~iivTg~lG~~-g 166 (293) T cd02197 93 --EDLERIVKSMAEAAREAGVKIVTGDTKVVPKGKADG--IFINTTGIGVIPR-GVIISPSNIRPGDKIIVSGTIGDH-G 166 (293) T ss_pred --HHHHHHHHHHHHHHHHHCCEEEECCEEECCCCCCCC--EEEEEEEEEEECC-CCCCCCCCCCCCCEEEEECCHHHH-H T ss_conf --999999999999999719768616627514787785--5998678898568-855035789999999996860788-9 Q ss_pred HHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHH-CCCEEEEEECCCCHHHHHHHHHHHCCCCCEEEEECCCC----- Q ss_conf 267888861167899964798999999999999999-59806994648657999999998458970899805666----- Q gi|254780970|r 584 CSMYSLECASSNIGPPPKVDCHLEKNHGFFVLSMIN-AKKITACHDISTGGLIITLAEMTISSAKGMDIILPIEI----- 657 (737) Q Consensus 584 gS~~~~~~~~~~~g~~p~~d~~~~k~~~~~v~~li~-~g~I~S~HDiSdGGL~~aL~EMa~as~~G~~I~L~~~~----- 657 (737) .++++.. ...........|.. .+...+..+.. ...++||||+|+|||+.+|.|||.+|++|++|+.+..+ T Consensus 167 ~~il~~r-~~~~~~~~~~~d~~---~~~~~~~~~l~~~~~v~Am~DiT~gGL~~~L~eia~aSgvg~~I~~~~iPv~~~v 242 (293) T cd02197 167 AAILAAR-EGLGFETDIESDCA---PLNGLVEALLEAGPGIHAMRDPTRGGLAAVLNEIARASGVGIEIEEEAIPVREEV 242 (293) T ss_pred HHHHHHH-CCCCCCCHHHHHHH---HHHHHHHHHHHCCCCCEEECCCCCCHHHHHHHHHHHHCCCEEEEEECCCCCCHHH T ss_conf 9999852-46664401466666---6689999999707874474167975599999999998599399940368888899 Q ss_pred -------CCCHHHHHHCCCCCCEEEEECHHHHHHHHHHHHHC--CCCEEEEEEE Q ss_conf -------89878887298832189999688099899999857--9849998998 Q gi|254780970|r 658 -------EKDPKPFLFGEDQGRYVVCISPENQDLVMSEANNK--NIPLRYLGKV 702 (737) Q Consensus 658 -------~~d~~~~LFsEs~Gr~ii~V~~~~~~~~~~~~~~~--gi~~~~IG~V 702 (737) ..||+.. . +.|++|++|++++.+++++.++++ |+++++||+| T Consensus 243 ~~~~e~lg~DPl~~-a--s~G~ll~~v~~e~a~~vl~~l~~~~~G~~A~iIGeV 293 (293) T cd02197 243 RGACEMLGLDPLYL-A--NEGKFVAIVPPEDAEEVLEALRSHPLGKEAAIIGEV 293 (293) T ss_pred HHHHHHHCCCHHHH-H--CCCEEEEEECHHHHHHHHHHHHHCCCCCCEEEEEEC T ss_conf 99999859398788-3--485399998789999999999728889981898669 No 36 >TIGR01379 thiL thiamine-monophosphate kinase; InterPro: IPR006283 This family represents thiamine-monophosphate kinase, an enzyme that converts thiamine monophosphate into thiamine pyrophosphate (TPP, coenzyme B1), an enzyme cofactor. Thiamine monophosphate may be derived from de novo synthesis or from unphosphorylated thiamine, known as vitamin B1. Eukaryotes lack this enzyme, and add pyrophosphate from ATP to unphosphorylated thiamine in a single step. ; GO: 0009030 thiamin phosphate kinase activity, 0009228 thiamin biosynthetic process. Probab=99.97 E-value=1.4e-29 Score=221.73 Aligned_cols=261 Identities=23% Similarity=0.276 Sum_probs=191.9 Q ss_pred CCCCCCC-CCCCCCCCEEEECCCCCCEEEEECC---CCCCC-HHC-CCHHHHHHHHHHHHHHHHCCCCCHHHHHHHCCCC Q ss_conf 0001100-0013576314203777622797304---53210-000-1001135678977764102785102232210245 Q gi|254780970|r 424 MIQSNSI-QLPGGDAGVIRVEGHETKALAFSSD---VTPRY-VKA-DPFEGTKQAVAECWRNIIATGAKPLAITDNLNFG 497 (737) Q Consensus 424 ~V~g~tv-~~Pg~DaaVv~~~~~~~~gia~s~g---~~p~~-~~~-dP~~ga~~AV~Eal~Nl~a~Ga~Plait~~lnf~ 497 (737) .++..++ ...|+|||++....+ .-+++|+| +.-+| ... .|+..|+.+|+-+++||+||||+|.+++.++.+ T Consensus 15 ~~~~~~~~~~~GDDAA~~~~~~~--~~lv~t~D~Lve~~HF~~~~~~Pe~~G~K~~avNlSDlAAmGA~P~~~~~s~~~- 91 (336) T TIGR01379 15 LVKDDSVALGIGDDAALVSPPEG--EDLVLTTDTLVEGVHFPPDTTPPEDLGWKAVAVNLSDLAAMGATPKWFLLSLGL- 91 (336) T ss_pred CCCCCCCCCCCCCCEEEECCCCC--CEEEEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCHHHHHHEEEC- T ss_conf 14787633247641231157999--638998702324757886778989999999997499997305702322310016- Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCCEEECCC Q ss_conf 66686213469999999985221028866544320011236866676311122223335202331021438980751377 Q gi|254780970|r 498 NPEKEEIMGQFVHSVKGIREACQILDFPIVSGNVSFYNETNGQSIFPTPTIAGVGILPDYSLMTRIDSAHEGDLILMIGN 577 (737) Q Consensus 498 ~P~~~e~~~ql~~av~gl~d~c~~lgipivgGkvSl~n~t~~~~I~pTpvi~~vG~v~d~~~~it~~~k~~Gd~I~liG~ 577 (737) |++.+.. .+...++|+.+.|..|++++||| +|+.. -+..+|++|.++.- ..++...+++||.|+++|. T Consensus 92 -P~~~~~~-~~~~f~~G~~~~~~~Y~~~LiGG------DT~~~---~~~~~T~iG~~~~~-~~~~RsgAk~GD~v~VTG~ 159 (336) T TIGR01379 92 -PSDLDEA-WLEAFYDGLFELAKQYGVDLIGG------DTVKS---LVVTVTAIGEAPKG-RALLRSGAKPGDLVFVTGT 159 (336) T ss_pred -CCCCCHH-HHHHHHHHHHHHHHHCCCEEEEC------CCCCE---EEEEEEEEEEECCC-CEEECCCCCCCCEEEEECC T ss_conf -8898889-99999999999875539878724------40021---31434578975689-7332367877767899588 Q ss_pred CCCCCCHHHHHHHHHCC-----CCCCCCC-CCHHHHHHHHH------HHHHHHHCCCEE--EEEECCCCHHHHHHHHHHH Q ss_conf 66554326788886116-----7899964-79899999999------999999959806--9946486579999999984 Q gi|254780970|r 578 DGCHLDCSMYSLECASS-----NIGPPPK-VDCHLEKNHGF------FVLSMINAKKIT--ACHDISTGGLIITLAEMTI 643 (737) Q Consensus 578 ~~~~LggS~~~~~~~~~-----~~g~~p~-~d~~~~k~~~~------~v~~li~~g~I~--S~HDiSdGGL~~aL~EMa~ 643 (737) +|-|.....+... ....++. .+....+++.. ..+.|.+....+ |+.|||| ||+..|.++|- T Consensus 160 ----lG~saaGL~ll~~G~~~~~~~~~~~~~~~~~~~r~l~P~PR~~~G~~l~~~~~a~W~Aa~D~SD-GL~~dL~hIa~ 234 (336) T TIGR01379 160 ----LGDSAAGLELLLKGKEEGKKKEVDEEDDEALLQRHLRPEPRVEEGLALARASLANWVAAIDVSD-GLASDLGHIAE 234 (336) T ss_pred ----CCHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHCCCCCCHHHHHHHHHCCHHHEHHHHCCCH-HHHHHHHHHHH T ss_conf ----3268999999973466402567785147999998638998289999997454332312122047-79999999997 Q ss_pred CCCCCEEEEECCCC--------------CCCHHHH--HHCCCCCCEEEEECHHHHHHHHHHHHHCCCCEEEEEEECCC Q ss_conf 58970899805666--------------8987888--72988321899996880998999998579849998998798 Q gi|254780970|r 644 SSAKGMDIILPIEI--------------EKDPKPF--LFGEDQGRYVVCISPENQDLVMSEANNKNIPLRYLGKVSGS 705 (737) Q Consensus 644 as~~G~~I~L~~~~--------------~~d~~~~--LFsEs~Gr~ii~V~~~~~~~~~~~~~~~gi~~~~IG~Vt~~ 705 (737) +|+||++|+.+..+ ..++.++ -++|++. +|+|+++++.+.+..++...++++++||+|+.. T Consensus 235 AS~vg~~i~~~~LP~~~~~~~~~~~~~~~~~~~~~aL~gGEDyE-LvfT~~~~~~~~l~~~~~~~~v~~~~IG~v~~g 311 (336) T TIGR01379 235 ASGVGIRIDLDKLPLSSELKEIADTEELGKQPLEWALSGGEDYE-LVFTVPPERREALLDALKALGVPLTRIGRVTEG 311 (336) T ss_pred HCCCEEEEECCCCCCCHHHHHHHHCCCCCCCHHHHHHHCCCCEE-EEEECCHHHHHHHHHHHHHCCCCEEEEEEEEEC T ss_conf 28962998526478708999886306310027999987288332-455158667889999997558766899999864 No 37 >TIGR02124 hypE hydrogenase expression/formation protein HypE; InterPro: IPR011854 This family contains HypE (or HupE), a protein required for expression of catalytically active hydrogenase in many systems. It appears to be an accessory protein involved in maturation rather than a regulatory protein involved in expression. HypE shows considerable homology to the thiamine-monophosphate kinase ThiL (IPR006283 from INTERPRO) and other enzymes.. Probab=99.97 E-value=9.7e-31 Score=229.69 Aligned_cols=290 Identities=25% Similarity=0.390 Sum_probs=236.4 Q ss_pred CHHHHHH-HHH-------CCCCEEECCCCCEEEEEECCCEEEEEEECCCCCCCCCCC--CCCCCCC---CCCCCCCCCCC Q ss_conf 8899975-120-------697489933477368983698399999517887776588--8872105---58102257677 Q gi|254780970|r 53 SKKWLRT-LPT-------TGKHVIQGPGENAGVVDIGGGDCVVFKMESHNHPSYIEP--YQGAATG---VGGILRDIFTM 119 (737) Q Consensus 53 ~~~~l~~-~~~-------~~~~vi~~~~~~a~vi~~~~~~~~~~k~EtHNhPs~i~P--~~GAaTg---~gG~iRDi~~~ 119 (737) +..+++. |-. .-.+++.+.-|||+++...++ .++|.-.|| -|+| |+|=-=| |=|-++|+.-| T Consensus 13 ~~~Li~~lF~~R~l~~~R~f~n~~L~~~eDaA~l~~~g~-~la~sTDSf----Vv~PlFFpGGDIGkLAVcGTvNDvav~ 87 (345) T TIGR02124 13 MQQLIEELFLKRPLELKRAFGNEILAAMEDAAVLELSGG-RLAFSTDSF----VVDPLFFPGGDIGKLAVCGTVNDVAVS 87 (345) T ss_pred HHHHHHHHCCCCCCCCEEECCCCCCCCCCCCCEECCCCC-EEEEECCCE----EECCCCCCCCCCCCEEEECCHHHHHHC T ss_conf 999988703468611201248750056677413217896-089980763----754633698671117773332467533 Q ss_pred CCEEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEECC-CCCCCCCCEEEECCCCHH--HHHHCC Q ss_conf 87436877504578756863888998887458866543087211320355215-676642211221121135--543112 Q gi|254780970|r 120 GARPVAAMNSLRFGAIHHPKTKHLLSGVVAGIAGYSNSFGVPTVGGEVEFLPC-YNNNIIVNTFAAGIAKTN--AIFSSK 196 (737) Q Consensus 120 Ga~p~a~~~~l~~g~~~~p~~~~~~~~~~~G~~~ygN~~G~P~v~G~~~~~~~-y~~~~~v~~~~~Gi~~~~--~~~~~~ 196 (737) ||||++++.+|=.-. -=+..-|+.+++-++.-..+-|+++|+|+|..-|. -.+++-+|..++|+++.. ..++.+ T Consensus 88 GA~P~YLs~gfIlEE---Gfp~e~L~~iv~Sm~~~A~~aGV~iV~GDTKVV~kG~~D~iFINTtGiG~~~~~~v~~~~~~ 164 (345) T TIGR02124 88 GAKPLYLSCGFILEE---GFPIEDLERIVKSMAEAARKAGVKIVAGDTKVVEKGKVDGIFINTTGIGVVPSGGVIDISAH 164 (345) T ss_pred CCHHHHHCCCEEEEE---CCCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCEEEEEEEEEEEECCCCCCCCCCC T ss_conf 844578507201340---78767899999999999997397689757353345775605881388888747843376535 Q ss_pred CCCCCCEEEEECCCCCCCCCHHHHHHH-HH---HCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCC-CCCCHHCCCCCCCC Q ss_conf 256897799965765644410466667-77---28653334443326798999888999874011-38301103477520 Q gi|254780970|r 197 ARGIGLPLVYLGAKTGRDGIGGASMAS-EE---FGENIAKKRPTVQVGDPFTGKCLLEACLELMN-TDAVIAIQDMGAAG 271 (737) Q Consensus 197 ~~~~Gd~ii~~G~~tg~dGi~Ga~~sS-~~---~~~~~~~~~~~VQ~gdp~~ek~~~~~~~~~~~-~~~i~~~~D~gaGG 271 (737) ..+|||.||+ +|.+|-||++.-| ++ |+.+.++|.+++ ..|++.+++..- +--|++|.|.+.|| T Consensus 165 ~~~~GD~Ii~----sG~iG~HG~~Il~~Regl~f~~~i~SDCapL--------~~lv~~ll~~~g~~~~v~~~RD~TRGG 232 (345) T TIGR02124 165 NIQPGDVIIV----SGSIGDHGAAILAVREGLGFETNIESDCAPL--------NGLVEALLAVGGYEPAVHAMRDATRGG 232 (345) T ss_pred CCCCCCEEEE----CCCCHHHHHHHHHHHHCCCCCCCCCCHHHHH--------HHHHHHHHHCCCCCCCCEEEECCCCCC T ss_conf 4145766898----1684067899887650255256733327765--------899999984379871423761678550 Q ss_pred HHHHHHHHHCCCCCCEEEEEHHHCCCCC--------CCCHHHHHHCCCCCEEEEEEECCCHHHHHHHHHHH--CCEEE-C Q ss_conf 1112233201477528996202204667--------89998986178885089999546758899988870--97131-1 Q gi|254780970|r 272 LTCSAIEMGNQGNLGITLDLDKVPTCEE--------GMTAYEMMLSESQERMLMILNPEKQHKAQEILNKW--GLHFS-I 340 (737) Q Consensus 272 l~~a~~Ema~~~~~G~~i~l~~vp~~~~--------~l~~~ei~~sEsQeR~~~~v~~~~~~~~~~i~~~~--~~~~~-~ 340 (737) |+.++.|+|..+|.|++|+.++||++++ |++|+++. ||- +.+++|+||+.++++++++.+ +.+|+ + T Consensus 233 LA~vLNE~A~~sg~~i~l~E~~iPV~eeV~gaCE~LGldPl~lA-NEG--~~v~~V~~E~A~~vLe~lk~hp~G~~A~Yi 309 (345) T TIGR02124 233 LAAVLNEIAESSGVGIELEEEKIPVKEEVKGACELLGLDPLYLA-NEG--KLVLAVPPEAAEKVLEILKSHPLGKDAAYI 309 (345) T ss_pred HHHHHHHHHHHHCCCEEEEECCCCCCHHHHHHHHHHCCCHHHHH-CCC--EEEEEECHHHHHHHHHHHHHCCCCCCCCEE T ss_conf 47899999996199279985247883789999986170325420-476--289982837799999998607764332156 Q ss_pred CCCEEECCCCCCC----CCCCCCCCCCCC Q ss_conf 2212104431000----258644344321 Q gi|254780970|r 341 IGITTNDKLFRVI----HRGEEVANLPIK 365 (737) Q Consensus 341 iG~vt~~~~~~v~----~~g~~v~dlp~~ 365 (737) ||+|++++.-+|+ |+++.++|+|.- T Consensus 310 IG~V~e~~~~~V~l~t~~G~~R~ld~p~G 338 (345) T TIGR02124 310 IGEVVEKKEGLVVLKTAYGGKRILDMPSG 338 (345) T ss_pred EEEEEECCCCEEEEEECCCCEEEEECCCC T ss_conf 30147379877999706884056553567 No 38 >PRK00943 selenophosphate synthetase; Provisional Probab=99.97 E-value=1.6e-29 Score=221.30 Aligned_cols=252 Identities=19% Similarity=0.211 Sum_probs=180.1 Q ss_pred CCCCCEEEECCCCCCEEEEECCCCCCCHHCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHCCCCCCCCCCHHHHHHHHHH Q ss_conf 35763142037776227973045321000010011356789777641027851022322102456668621346999999 Q gi|254780970|r 434 GGDAGVIRVEGHETKALAFSSDVTPRYVKADPFEGTKQAVAECWRNIIATGAKPLAITDNLNFGNPEKEEIMGQFVHSVK 513 (737) Q Consensus 434 g~DaaVv~~~~~~~~gia~s~g~~p~~~~~dP~~ga~~AV~Eal~Nl~a~Ga~Plait~~lnf~~P~~~e~~~ql~~av~ 513 (737) +.||||+++. +++.++.|+|+.+... -|||..+++|++.+++||++||++|+.....+.|+...-+. ..+.+.++ T Consensus 49 ~dDaavi~~~--~~~~lv~T~D~f~piv-~Dp~~~G~IAa~~alsDi~AmGa~P~~aL~~~~~p~~~l~~--e~l~~il~ 123 (347) T PRK00943 49 RDDAAVYDLN--NGQAIVSTTDFFMPIV-DDPFDFGRIAATNAISDVYAMGGKPIMALAILGWPINVLPP--EVAREVLE 123 (347) T ss_pred CCCEEEEEEC--CCEEEEEEECCCCCCC-CCHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHCCCCCCCCH--HHHHHHHH T ss_conf 7755899728--9808999804666775-68788899999985768988389488999884302021679--99999999 Q ss_pred HHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCCEEECCCCCCCCCHHHHHHHHHC Q ss_conf 99852210288665443200112368666763111222233352023310214389807513776655432678888611 Q gi|254780970|r 514 GIREACQILDFPIVSGNVSFYNETNGQSIFPTPTIAGVGILPDYSLMTRIDSAHEGDLILMIGNDGCHLDCSMYSLECAS 593 (737) Q Consensus 514 gl~d~c~~lgipivgGkvSl~n~t~~~~I~pTpvi~~vG~v~d~~~~it~~~k~~Gd~I~liG~~~~~LggS~~~~~~~~ 593 (737) |+.+.|++.|++++||+ |...+ -|....+++|.++ .++.++...+++||.|++++.- |......... T Consensus 124 g~~~~~~e~gv~ivGGH------T~~~~-e~~ig~~v~G~~~-~~~~~~~~~a~~GD~liLTKpl----Gtgi~~~a~~- 190 (347) T PRK00943 124 GGRDVCREAGIPLAGGH------SIDAP-EPIFGLAVTGVVH-PERVKRNAGAKAGDVLILTKPL----GIGVLTAAEK- 190 (347) T ss_pred HHHHHHHHCCCEEECCC------EEECC-CCEEEEEEEEEEC-CCCCCCCCCCCCCCEEEEECCC----CCHHHHHHHH- T ss_conf 99999998599053465------67546-6525568986774-6323026888888889982575----0149999987- Q ss_pred CCCCCCCCCCHHHHHH----HHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHCCCCCEEEEECCCCCCCHH------- Q ss_conf 6789996479899999----99999999995980699464865799999999845897089980566689878------- Q gi|254780970|r 594 SNIGPPPKVDCHLEKN----HGFFVLSMINAKKITACHDISTGGLIITLAEMTISSAKGMDIILPIEIEKDPK------- 662 (737) Q Consensus 594 ~~~g~~p~~d~~~~k~----~~~~v~~li~~g~I~S~HDiSdGGL~~aL~EMa~as~~G~~I~L~~~~~~d~~------- 662 (737) .+..+....+.... +.....++.....++||||+|+|||+.+|.|||.+|++|++|+.+..+-.+.. T Consensus 191 --~~~~~~~~~~~a~~~m~~ln~~aa~l~~~~~v~A~tDvTgfGL~GhL~Ema~~S~v~~~i~~~~iPv~~~v~~l~~~g 268 (347) T PRK00943 191 --KSLLDPEHQGLAIEVMCELNTPGARFALLPGVHAMTDVTGFGLLGHLLEMARGSGVQARIDYAAIPRLPGVEELIALG 268 (347) T ss_pred --CCCCCHHHHHHHHHHHHHCCHHHHHHHHHCCCCEEECCCCCHHHHHHHHHHHCCCCEEEEEECCCCCCHHHHHHHHCC T ss_conf --368999999999999985358999997534996222145335899999987258965999748876654599999868 Q ss_pred -------------------------HHHHC-CCCCCEEEEECHHHHHHHHHHHHHCCCCEEEEEEECCC Q ss_conf -------------------------88729-88321899996880998999998579849998998798 Q gi|254780970|r 663 -------------------------PFLFG-EDQGRYVVCISPENQDLVMSEANNKNIPLRYLGKVSGS 705 (737) Q Consensus 663 -------------------------~~LFs-Es~Gr~ii~V~~~~~~~~~~~~~~~gi~~~~IG~Vt~~ 705 (737) ..||. ++.|.++++|++++.+++++.+++.|+++.+||+|++. T Consensus 269 ~~~~~~~~N~~~~~~~~~~~~~~~~~ll~DPQTSGGLLiaV~~e~a~~~l~~L~~~G~~aa~IG~V~e~ 337 (347) T PRK00943 269 CVPGGTGRNFASYGHLMGELPREQRALLCDPQTSGGLLVAVAPEAEDEVLATFAEFGIELAAIGELVEA 337 (347) T ss_pred CCCCCHHHHHHHHHHCCCCCCHHHHHHCCCCCCCCCEEEEECHHHHHHHHHHHHHCCCCCEEEEEEEEC T ss_conf 987521778987632144566466753438557776799985999999999999679996799999967 No 39 >COG2144 Selenophosphate synthetase-related proteins [General function prediction only] Probab=99.97 E-value=6.8e-29 Score=217.04 Aligned_cols=270 Identities=21% Similarity=0.277 Sum_probs=190.6 Q ss_pred HHCCCCCCC-CCCCCCCCCCCEEEECCCCCCEEEEECCCCCCCHHCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHCCCC Q ss_conf 311210001-1000013576314203777622797304532100001001135678977764102785102232210245 Q gi|254780970|r 419 EQYDTMIQS-NSIQLPGGDAGVIRVEGHETKALAFSSDVTPRYVKADPFEGTKQAVAECWRNIIATGAKPLAITDNLNFG 497 (737) Q Consensus 419 ~qyD~~V~g-~tv~~Pg~DaaVv~~~~~~~~gia~s~g~~p~~~~~dP~~ga~~AV~Eal~Nl~a~Ga~Plait~~lnf~ 497 (737) ++++....+ ...+.-|+||||+++. +++.+..+-|..+++...||++.++++|.-.++|+++||++|++++|.+.-. T Consensus 27 ~~~~~~~~~~~~~l~~GDDAavI~v~--~~~lliaadGi~g~l~~~dPw~aG~csvLVn~~DV~amGG~Pv~~vd~isa~ 104 (324) T COG2144 27 REFLDEFYGAEVILDFGDDAAVIRVG--DGKLLIAADGIWGKLIDADPWWAGYCSVLVNVNDVAAMGGEPVGAVDAISAK 104 (324) T ss_pred HHCCHHHCCCCCCCCCCCCEEEEEEC--CCEEEEECCCCCCCCCCCCCHHHHCEEEEEEEHHHHHHCCCCEEEEEEEECC T ss_conf 62123433555311147834799608--9679996577643234557233302069986013355179614888864348 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCCEEECCC Q ss_conf 66686213469999999985221028866544320011236866676311122223335202331021438980751377 Q gi|254780970|r 498 NPEKEEIMGQFVHSVKGIREACQILDFPIVSGNVSFYNETNGQSIFPTPTIAGVGILPDYSLMTRIDSAHEGDLILMIGN 577 (737) Q Consensus 498 ~P~~~e~~~ql~~av~gl~d~c~~lgipivgGkvSl~n~t~~~~I~pTpvi~~vG~v~d~~~~it~~~k~~Gd~I~liG~ 577 (737) + .+ |-.++++||++.++.||+|++|| +|+...-++.+.+...|++ +.+.++++..+++||+|+++++ T Consensus 105 s---~d---~~~ei~eglr~~a~kfgvpivGG------hthpd~~y~vl~v~i~gl~-~~e~Ii~s~~Ak~GD~lI~~~d 171 (324) T COG2144 105 S---ED---QAREILEGLRKGARKFGVPIVGG------HTHPDTPYCVLDVVIGGLI-AEEPIITSGTAKPGDLLIFVGD 171 (324) T ss_pred C---HH---HHHHHHHHHHHHHHHCCCCEECC------CCCCCCCCCEEEEEEECCC-CCCCCCCCCCCCCCCEEEEEEC T ss_conf 7---89---99999999998888549853467------1588877764446874023-6542303689886877999961 Q ss_pred CCCCCCHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHCCCCCEEEEECCCC Q ss_conf 66554326788886116789996479899999999999999959806994648657999999998458970899805666 Q gi|254780970|r 578 DGCHLDCSMYSLECASSNIGPPPKVDCHLEKNHGFFVLSMINAKKITACHDISTGGLIITLAEMTISSAKGMDIILPIEI 657 (737) Q Consensus 578 ~~~~LggS~~~~~~~~~~~g~~p~~d~~~~k~~~~~v~~li~~g~I~S~HDiSdGGL~~aL~EMa~as~~G~~I~L~~~~ 657 (737) -... .|-. .-........-....++++++++.++++ ++.|+.|+|.||++.+|++||.+|++|+.++|++.+ T Consensus 172 ~~g~----~~p~--~P~~wDttt~ka~~~~~~~~e~l~e~a~--l~~AgKDvS~gG~iGtl~mlle~S~~ga~vdl~siP 243 (324) T COG2144 172 LDGK----PYPN--FPLNWDTTTMKAKEKFRAQLELLREGAK--LVKAGKDVSNGGLLGTLLMLLEKSRVGAGVDLDSIP 243 (324) T ss_pred CCCC----CCCC--CCCCCCCEEECCHHHHHHHHHHHHHHHH--HHHHCCCCCCCCHHHHHHHHHHHHCCCCEEEECCCC T ss_conf 4787----4889--8855531231068889989999999999--875404566750789999999862368436523468 Q ss_pred C---CCHHHHHHCCCCC-CEEEEECHHHHHHHHHHHHHCCCCEEEEEEECCC-EEEEECC Q ss_conf 8---9878887298832-1899996880998999998579849998998798-6898074 Q gi|254780970|r 658 E---KDPKPFLFGEDQG-RYVVCISPENQDLVMSEANNKNIPLRYLGKVSGS-TLSIHNI 712 (737) Q Consensus 658 ~---~d~~~~LFsEs~G-r~ii~V~~~~~~~~~~~~~~~gi~~~~IG~Vt~~-~l~i~~~ 712 (737) . .|..+||-. .+| .|+++|+|++++++...|.+.+++++.||+|..+ .+.|.+. T Consensus 244 ~p~~vd~~~wlk~-ypg~gfv~~v~pe~veev~~v~~~~g~~a~~~GeVie~~~L~i~~~ 302 (324) T COG2144 244 YPADVDFRQWLKR-YPGSGFVLTVDPEDVEEVVDVFEEEGCPATVIGEVIEEPVLRIGDG 302 (324) T ss_pred CCCCCCHHHHHHH-CCCCCEEEEECHHHHHHHHHHHHHCCCCEEEEEEECCCCEEEEECC T ss_conf 8655547789983-8997479996889999999999875996587888603766897257 No 40 >COG2144 Selenophosphate synthetase-related proteins [General function prediction only] Probab=99.97 E-value=3.5e-29 Score=219.02 Aligned_cols=296 Identities=20% Similarity=0.325 Sum_probs=210.4 Q ss_pred CCCCCCCCC-HHHHHHH--HHCCCCEEECCCCCEEEEEECCCEEEEEEECCCCCCCCC--CCCC-CCCCCCCCCCCCCCC Q ss_conf 011074458-8999751--206974899334773689836983999995178877765--8888-721055810225767 Q gi|254780970|r 45 NEHCSYKSS-KKWLRTL--PTTGKHVIQGPGENAGVVDIGGGDCVVFKMESHNHPSYI--EPYQ-GAATGVGGILRDIFT 118 (737) Q Consensus 45 SEHC~~k~~-~~~l~~~--~~~~~~vi~~~~~~a~vi~~~~~~~~~~k~EtHNhPs~i--~P~~-GAaTg~gG~iRDi~~ 118 (737) =||--+|.. +...+.+ ...+-.++..+||||+||+++++. +++-+|-- -|+-+ +||. |.-|=..- ..||.+ T Consensus 13 ~egIlrk~~i~~~v~~~~~~~~~~~~~l~~GDDAavI~v~~~~-lliaadGi-~g~l~~~dPw~aG~csvLVn-~~DV~a 89 (324) T COG2144 13 FEGILRKNPIKDAVREFLDEFYGAEVILDFGDDAAVIRVGDGK-LLIAADGI-WGKLIDADPWWAGYCSVLVN-VNDVAA 89 (324) T ss_pred HHCHHHCCCHHHHHHHCCHHHCCCCCCCCCCCCEEEEEECCCE-EEEECCCC-CCCCCCCCCHHHHCEEEEEE-EHHHHH T ss_conf 6331312526888862123433555311147834799608967-99965776-43234557233302069986-013355 Q ss_pred CCCEEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEECCCCCCCCCCEEEECCCCHHHHHHCCCC Q ss_conf 78743687750457875686388899888745886654308721132035521567664221122112113554311225 Q gi|254780970|r 119 MGARPVAAMNSLRFGAIHHPKTKHLLSGVVAGIAGYSNSFGVPTVGGEVEFLPCYNNNIIVNTFAAGIAKTNAIFSSKAR 198 (737) Q Consensus 119 ~Ga~p~a~~~~l~~g~~~~p~~~~~~~~~~~G~~~ygN~~G~P~v~G~~~~~~~y~~~~~v~~~~~Gi~~~~~~~~~~~~ 198 (737) ||++|++++|.+.-.+ ..+..++.+|++...-+||+|++||+++.|..|+ . ..+..+|++..+.++++..+ T Consensus 90 mGG~Pv~~vd~isa~s------~d~~~ei~eglr~~a~kfgvpivGGhthpd~~y~--v-l~v~i~gl~~~e~Ii~s~~A 160 (324) T COG2144 90 MGGEPVGAVDAISAKS------EDQAREILEGLRKGARKFGVPIVGGHTHPDTPYC--V-LDVVIGGLIAEEPIITSGTA 160 (324) T ss_pred HCCCCEEEEEEEECCC------HHHHHHHHHHHHHHHHHCCCCEECCCCCCCCCCC--E-EEEEEECCCCCCCCCCCCCC T ss_conf 1796148888643487------8999999999998888549853467158887776--4-44687402365423036898 Q ss_pred CCCCEEEEECCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCHHCCCCCCCCHHHHHHH Q ss_conf 68977999657656444104666677728653334443326798999888999874011383011034775201112233 Q gi|254780970|r 199 GIGLPLVYLGAKTGRDGIGGASMASEEFGENIAKKRPTVQVGDPFTGKCLLEACLELMNTDAVIAIQDMGAAGLTCSAIE 278 (737) Q Consensus 199 ~~Gd~ii~~G~~tg~dGi~Ga~~sS~~~~~~~~~~~~~VQ~gdp~~ek~~~~~~~~~~~~~~i~~~~D~gaGGl~~a~~E 278 (737) ++||.||++|--.||..= + .-++-+...-. .|..-+..+++..+..+ |+++++|++.||+...+.+ T Consensus 161 k~GD~lI~~~d~~g~~~p---~---~P~~wDttt~k------a~~~~~~~~e~l~e~a~--l~~AgKDvS~gG~iGtl~m 226 (324) T COG2144 161 KPGDLLIFVGDLDGKPYP---N---FPLNWDTTTMK------AKEKFRAQLELLREGAK--LVKAGKDVSNGGLLGTLLM 226 (324) T ss_pred CCCCEEEEEECCCCCCCC---C---CCCCCCCEEEC------CHHHHHHHHHHHHHHHH--HHHHCCCCCCCCHHHHHHH T ss_conf 868779999614787488---9---88555312310------68889989999999999--8754045667507899999 Q ss_pred HHCCCCCCEEEEEHHHCCCCCCCCHHHHHHCCCCC-EEEEEEECCCHHHHHHHHHHHCCEEECCCCEEECCCCCCCCCCC Q ss_conf 20147752899620220466789998986178885-08999954675889998887097131122121044310002586 Q gi|254780970|r 279 MGNQGNLGITLDLDKVPTCEEGMTAYEMMLSESQE-RMLMILNPEKQHKAQEILNKWGLHFSIIGITTNDKLFRVIHRGE 357 (737) Q Consensus 279 ma~~~~~G~~i~l~~vp~~~~~l~~~ei~~sEsQe-R~~~~v~~~~~~~~~~i~~~~~~~~~~iG~vt~~~~~~v~~~g~ 357 (737) |.++|.+|+.+||++||-- .+++ |+-|+--+.. -|+++|+|++++++.++|++.+|+++++|+|++++++++..+.. T Consensus 227 lle~S~~ga~vdl~siP~p-~~vd-~~~wlk~ypg~gfv~~v~pe~veev~~v~~~~g~~a~~~GeVie~~~L~i~~~~~ 304 (324) T COG2144 227 LLEKSRVGAGVDLDSIPYP-ADVD-FRQWLKRYPGSGFVLTVDPEDVEEVVDVFEEEGCPATVIGEVIEEPVLRIGDGEL 304 (324) T ss_pred HHHHHCCCCEEEECCCCCC-CCCC-HHHHHHHCCCCCEEEEECHHHHHHHHHHHHHCCCCEEEEEEECCCCEEEEECCCH T ss_conf 9986236843652346886-5554-7789983899747999688999999999987599658788860376689725740 Q ss_pred C--CCCCCCCCCC Q ss_conf 4--4344321014 Q gi|254780970|r 358 E--VANLPIKALS 368 (737) Q Consensus 358 ~--v~dlp~~~L~ 368 (737) . +-|+..+.++ T Consensus 305 ~~~~~d~e~~~i~ 317 (324) T COG2144 305 VSEVFDLEKERIT 317 (324) T ss_pred HHHHHHHHHCCCC T ss_conf 4322123221013 No 41 >TIGR02124 hypE hydrogenase expression/formation protein HypE; InterPro: IPR011854 This family contains HypE (or HupE), a protein required for expression of catalytically active hydrogenase in many systems. It appears to be an accessory protein involved in maturation rather than a regulatory protein involved in expression. HypE shows considerable homology to the thiamine-monophosphate kinase ThiL (IPR006283 from INTERPRO) and other enzymes.. Probab=99.95 E-value=5.3e-27 Score=204.05 Aligned_cols=260 Identities=25% Similarity=0.378 Sum_probs=196.0 Q ss_pred CCCCCCCCCCCCCCEEEECCCCCCEEEEECCCCCCCHHCCCHH-----HHHHHHHHHHHHHHCCCCCHHHHHHHCCCCCC Q ss_conf 0011000013576314203777622797304532100001001-----13567897776410278510223221024566 Q gi|254780970|r 425 IQSNSIQLPGGDAGVIRVEGHETKALAFSSDVTPRYVKADPFE-----GTKQAVAECWRNIIATGAKPLAITDNLNFGNP 499 (737) Q Consensus 425 V~g~tv~~Pg~DaaVv~~~~~~~~gia~s~g~~p~~~~~dP~~-----ga~~AV~Eal~Nl~a~Ga~Plait~~lnf~~P 499 (737) +-++.++.-..|+|++...+ +-+|||||... +||.. -+++||+-.+.|++.|||||+++|+.|.. T Consensus 31 ~f~n~~L~~~eDaA~l~~~g---~~la~sTDSfV----v~PlFFpGGDIGkLAVcGTvNDvav~GA~P~YLs~gfIl--- 100 (345) T TIGR02124 31 AFGNEILAAMEDAAVLELSG---GRLAFSTDSFV----VDPLFFPGGDIGKLAVCGTVNDVAVSGAKPLYLSCGFIL--- 100 (345) T ss_pred ECCCCCCCCCCCCCEECCCC---CEEEEECCCEE----ECCCCCCCCCCCCEEEECCHHHHHHCCCHHHHHCCCEEE--- T ss_conf 24875005667741321789---60899807637----546336986711177733324675338445785072013--- Q ss_pred CCCCHHHHHHHHHHHHHHHHHHCCCCCCCC--CCCCCCCCCCCCCCCCCCCCCCCCCCCCE-EEEECCCCCCCCCEEECC Q ss_conf 686213469999999985221028866544--32001123686667631112222333520-233102143898075137 Q gi|254780970|r 500 EKEEIMGQFVHSVKGIREACQILDFPIVSG--NVSFYNETNGQSIFPTPTIAGVGILPDYS-LMTRIDSAHEGDLILMIG 576 (737) Q Consensus 500 ~~~e~~~ql~~av~gl~d~c~~lgipivgG--kvSl~n~t~~~~I~pTpvi~~vG~v~d~~-~~it~~~k~~Gd~I~liG 576 (737) |+---+-.|.+.++.|+++|++.|++||+| ||-=..+.++.-|.. |++|.++... ..++.+..++||.|++.| T Consensus 101 EEGfp~e~L~~iv~Sm~~~A~~aGV~iV~GDTKVV~kG~~D~iFINT----tGiG~~~~~~v~~~~~~~~~~GD~Ii~sG 176 (345) T TIGR02124 101 EEGFPIEDLERIVKSMAEAARKAGVKIVAGDTKVVEKGKVDGIFINT----TGIGVVPSGGVIDISAHNIQPGDVIIVSG 176 (345) T ss_pred EECCCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCEEEEEE----EEEEEECCCCCCCCCCCCCCCCCEEEECC T ss_conf 40787678999999999999973976897573533457756058813----88888747843376535414576689816 Q ss_pred CCCCCCCHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHCC----CEEEEEECCCCHHHHHHHHHHHCCCCCEEEE Q ss_conf 766554326788886116789996479899999999999999959----8069946486579999999984589708998 Q gi|254780970|r 577 NDGCHLDCSMYSLECASSNIGPPPKVDCHLEKNHGFFVLSMINAK----KITACHDISTGGLIITLAEMTISSAKGMDII 652 (737) Q Consensus 577 ~~~~~LggS~~~~~~~~~~~g~~p~~d~~~~k~~~~~v~~li~~g----~I~S~HDiSdGGL~~aL~EMa~as~~G~~I~ 652 (737) .-++| |.++++.. .+......-.-|++.+..+.. .+.+.+ .|++++|.|+|||+++|.|+|.+|++|+.|+ T Consensus 177 ~iG~H-G~~Il~~R-egl~f~~~i~SDCapL~~lv~---~ll~~~g~~~~v~~~RD~TRGGLA~vLNE~A~~sg~~i~l~ 251 (345) T TIGR02124 177 SIGDH-GAAILAVR-EGLGFETNIESDCAPLNGLVE---ALLAVGGYEPAVHAMRDATRGGLAAVLNEIAESSGVGIELE 251 (345) T ss_pred CCHHH-HHHHHHHH-HCCCCCCCCCCHHHHHHHHHH---HHHHCCCCCCCCEEEECCCCCCHHHHHHHHHHHHCCCEEEE T ss_conf 84067-89988765-025525673332776589999---99843798714237616785504789999999619927998 Q ss_pred ---ECCC---------CCCCHHHHHHCCCCCCEEEEECHHHHHHHHHHHHHC--CCCEE-EEEEECCCE Q ss_conf ---0566---------689878887298832189999688099899999857--98499-989987986 Q gi|254780970|r 653 ---LPIE---------IEKDPKPFLFGEDQGRYVVCISPENQDLVMSEANNK--NIPLR-YLGKVSGST 706 (737) Q Consensus 653 ---L~~~---------~~~d~~~~LFsEs~Gr~ii~V~~~~~~~~~~~~~~~--gi~~~-~IG~Vt~~~ 706 (737) ||.. .+.||+. |=|| |.+|+.|++|+.++++++++++ |-.+. .||+|+++. T Consensus 252 E~~iPV~eeV~gaCE~LGldPl~-lANE--G~~v~~V~~E~A~~vLe~lk~hp~G~~A~YiIG~V~e~~ 317 (345) T TIGR02124 252 EEKIPVKEEVKGACELLGLDPLY-LANE--GKLVLAVPPEAAEKVLEILKSHPLGKDAAYIIGEVVEKK 317 (345) T ss_pred ECCCCCCHHHHHHHHHHCCCHHH-HHCC--CEEEEEECHHHHHHHHHHHHHCCCCCCCCEEEEEEEECC T ss_conf 52478837899999861703254-2047--628998283779999999860776433215630147379 No 42 >cd02691 PurM-like2 AIR synthase (PurM) related protein, archaeal subgroup 2 of unknown function. The family of PurM related proteins includes Hydrogen expression/formation protein HypE, AIR synthases, FGAM synthase and Selenophosphate synthetase (SelD). They all contain two conserved domains and seem to dimerize. The N-terminal domain forms the dimer interface and is a putative ATP binding domain. Probab=99.95 E-value=2e-26 Score=200.15 Aligned_cols=291 Identities=15% Similarity=0.188 Sum_probs=190.2 Q ss_pred CCCCCCCCCCCCEEEECCCCCCEEEEECCCCCCCHHCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHCCCCCCCCCCHHH Q ss_conf 11000013576314203777622797304532100001001135678977764102785102232210245666862134 Q gi|254780970|r 427 SNSIQLPGGDAGVIRVEGHETKALAFSSDVTPRYVKADPFEGTKQAVAECWRNIIATGAKPLAITDNLNFGNPEKEEIMG 506 (737) Q Consensus 427 g~tv~~Pg~DaaVv~~~~~~~~gia~s~g~~p~~~~~dP~~ga~~AV~Eal~Nl~a~Ga~Plait~~lnf~~P~~~e~~~ 506 (737) .+++..-++|++++... ++-+++|+|........-|+..++.||.-+++||++|||+|++++.++.+... -... T Consensus 29 ~~~~l~~~dDaa~~~~~---~~~iv~ttDg~~~~~~~fP~~~G~~av~~tvnDla~mGA~P~~l~~~~~l~eg---~~~~ 102 (346) T cd02691 29 EVSIVAQDDDAGVDAAD---VEYIVVAIDGIHSRLSDFPFLAGFHATRAALRDVMVMGARPVALLSDIHLADD---GDVG 102 (346) T ss_pred CCCCCCCCCCEEEECCC---CCEEEEEECCEEECCCCCCCCCHHHEEEEEHHHHHHCCCCCCEEEEEEEECCC---CCHH T ss_conf 86444567864675379---95799994777855666851121334765566677648843303899996699---9999 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCC---CCCC---CCCCCCCCCCCCCCCEEEEECCCCCCCCCEEECCCCCC Q ss_conf 69999999985221028866544320011236---8666---76311122223335202331021438980751377665 Q gi|254780970|r 507 QFVHSVKGIREACQILDFPIVSGNVSFYNETN---GQSI---FPTPTIAGVGILPDYSLMTRIDSAHEGDLILMIGNDGC 580 (737) Q Consensus 507 ql~~av~gl~d~c~~lgipivgGkvSl~n~t~---~~~I---~pTpvi~~vG~v~d~~~~it~~~k~~Gd~I~liG~~~~ 580 (737) .|.+.+.+|..+|+++|+|+|+| +|. +..+ ..|-.++++|.... ..+....+++||.|++.+..+. T Consensus 103 ~L~~i~~~m~~aa~~~gV~iV~G------dT~vv~gd~v~g~~i~~~v~~iGv~~~--~~~~~~~ArpGD~IlvsggiGg 174 (346) T cd02691 103 KLFDFTAGVTAVSEATGVPLVAG------STLRIGGDMVLGDRLVGGVGAVGRSKS--DPSRRKNAEPGDLILMTEGAGG 174 (346) T ss_pred HHHHHHHHHHHHHHHCCCCEEEC------CCEEECCCCEEEEEEEEEEEEEEECCC--CCCCCCCCCCCCEEEEECCCCC T ss_conf 99999999999999859989714------736836994664368998888774047--7641147999999999788784 Q ss_pred CCCHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHCCCCCEEEEECCCCC-C Q ss_conf 543267888861167899964798999999999999999598069946486579999999984589708998056668-9 Q gi|254780970|r 581 HLDCSMYSLECASSNIGPPPKVDCHLEKNHGFFVLSMINAKKITACHDISTGGLIITLAEMTISSAKGMDIILPIEIE-K 659 (737) Q Consensus 581 ~LggS~~~~~~~~~~~g~~p~~d~~~~k~~~~~v~~li~~g~I~S~HDiSdGGL~~aL~EMa~as~~G~~I~L~~~~~-~ 659 (737) + +...++. +.+...-.....+.... .....+.+....+.|++++|+|+|||+++|.|||.+|++|+.|+.+..+. . T Consensus 175 ~-~ia~~a~-~~g~~~~~~et~~~~~~-~~~~~l~~~~l~~~IhaMrD~TrGGLa~~LnEiA~~SgVgi~Idee~I~~pv 251 (346) T cd02691 175 G-TITTTAI-YHGMPDVVEETLNVDFI-KACEALRDSGLVSKVHSMTDVTNGGIRGDALEISKTAGVSLVFDEEKVRSLI 251 (346) T ss_pred H-HHHHHHH-HCCCCCCCHHCCCHHHH-HHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHHHHHCCCEEEEEHHHCCCCC T ss_conf 1-7889998-71464321000120467-8899999853104733877686245888999999866975999767704656 Q ss_pred CH--H---HHH----HCCCCCCEEEEECHHHHHHHHHHHHHCCCCEEEEEEECC-CE-EEEECCEECCH-HHHHHHHHHH Q ss_conf 87--8---887----298832189999688099899999857984999899879-86-89807412149-9999999999 Q gi|254780970|r 660 DP--K---PFL----FGEDQGRYVVCISPENQDLVMSEANNKNIPLRYLGKVSG-ST-LSIHNILNIPV-ATLQTKYESW 727 (737) Q Consensus 660 d~--~---~~L----FsEs~Gr~ii~V~~~~~~~~~~~~~~~gi~~~~IG~Vt~-~~-l~i~~~~~~~i-~~l~~~w~~~ 727 (737) +| . +.| ++.+..++++.+++++.+++++.++++|+++.+||+|+. +. ..+.+.-..++ -..|+.-+.+ T Consensus 252 ~p~V~~~cE~LgiDPL~vaneg~Lii~p~e~~~~i~~~l~~~Gv~a~~IG~V~~g~~~~l~~~g~~~~~~p~fre~~yt~ 331 (346) T cd02691 252 NPKVLKMLEELGIDPLGVSLDSLMIIAPEEDAVDIIRTLREAGVRADEVGRVEEGRGVPLVVTGEGRELKPAFRESAYTP 331 (346) T ss_pred CHHHHHHHHHHCCCHHHCCCCCEEEEECHHHHHHHHHHHHHCCCCEEEEEEEEECCCCEEEECCCCCCCCCCCCCCCCCC T ss_conf 88999999985987010255626999667788999999987699738999998489617863896123672333568853 Q ss_pred HHHHHCC Q ss_conf 9987245 Q gi|254780970|r 728 FPEFISE 734 (737) Q Consensus 728 l~~~~~~ 734 (737) .+|++.+ T Consensus 332 ~k~~~~~ 338 (346) T cd02691 332 IKKVVGE 338 (346) T ss_pred HHHHCCC T ss_conf 5444177 No 43 >PRK00943 selenophosphate synthetase; Provisional Probab=99.94 E-value=7.4e-27 Score=203.07 Aligned_cols=282 Identities=19% Similarity=0.306 Sum_probs=199.3 Q ss_pred CCCCCCHHH----HHHHHH--CCCCEEECC--CCCEEEEEECCCEEEEEEECCCCCCCCCCCCC-C--CCCCCCCCCCCC Q ss_conf 074458899----975120--697489933--47736898369839999951788777658888-7--210558102257 Q gi|254780970|r 48 CSYKSSKKW----LRTLPT--TGKHVIQGP--GENAGVVDIGGGDCVVFKMESHNHPSYIEPYQ-G--AATGVGGILRDI 116 (737) Q Consensus 48 C~~k~~~~~----l~~~~~--~~~~vi~~~--~~~a~vi~~~~~~~~~~k~EtHNhPs~i~P~~-G--AaTg~gG~iRDi 116 (737) |.-|-.... |+.++. ..++++.|+ +|||+|+++.++..++.++... .|--=+||. | |+. +.+.|+ T Consensus 17 Cg~Kv~~~~L~~~L~~~~~~~~~~~~lvg~~~~dDaavi~~~~~~~lv~T~D~f-~piv~Dp~~~G~IAa~---~alsDi 92 (347) T PRK00943 17 CGCKISPKVLETILKGEQAKFLDPDLLVGNETRDDAAVYDLNNGQAIVSTTDFF-MPIVDDPFDFGRIAAT---NAISDV 92 (347) T ss_pred CCCCCCHHHHHHHHHHCCCCCCCCCEECCCCCCCCEEEEEECCCEEEEEEECCC-CCCCCCHHHHHHHHHH---HHHHHH T ss_conf 227569899999997364557894301179987755899728980899980466-6775687888999999---857689 Q ss_pred CCCCCEEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEECCCCCCCCCCEEEECCCCHHHHHHCC Q ss_conf 67787436877504578756863888998887458866543087211320355215676642211221121135543112 Q gi|254780970|r 117 FTMGARPVAAMNSLRFGAIHHPKTKHLLSGVVAGIAGYSNSFGVPTVGGEVEFLPCYNNNIIVNTFAAGIAKTNAIFSSK 196 (737) Q Consensus 117 ~~~Ga~p~a~~~~l~~g~~~~p~~~~~~~~~~~G~~~ygN~~G~P~v~G~~~~~~~y~~~~~v~~~~~Gi~~~~~~~~~~ 196 (737) .+||++|+..+..+.|+.... ..+.+..+.+|+.+-.++.|++++|||+..++ .|.......|+++.+++.+.. T Consensus 93 ~AmGa~P~~aL~~~~~p~~~l--~~e~l~~il~g~~~~~~e~gv~ivGGHT~~~~----e~~ig~~v~G~~~~~~~~~~~ 166 (347) T PRK00943 93 YAMGGKPIMALAILGWPINVL--PPEVAREVLEGGRDVCREAGIPLAGGHSIDAP----EPIFGLAVTGVVHPERVKRNA 166 (347) T ss_pred HHCCCCHHHHHHHHCCCCCCC--CHHHHHHHHHHHHHHHHHCCCEEECCCEEECC----CCEEEEEEEEEECCCCCCCCC T ss_conf 883894889998843020216--79999999999999999859905346567546----652556898677463230268 Q ss_pred CCCCCCEEEEECCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCHHCCCCCCCCHHHHH Q ss_conf 25689779996576564441046666777286533344433267989998889998740113830110347752011122 Q gi|254780970|r 197 ARGIGLPLVYLGAKTGRDGIGGASMASEEFGENIAKKRPTVQVGDPFTGKCLLEACLELMNTDAVIAIQDMGAAGLTCSA 276 (737) Q Consensus 197 ~~~~Gd~ii~~G~~tg~dGi~Ga~~sS~~~~~~~~~~~~~VQ~gdp~~ek~~~~~~~~~~~~~~i~~~~D~gaGGl~~a~ 276 (737) ..+|||.||+.+ -.|.|....+.+.-.-..+....++ -.|.+ +.....++..+..+++|||++.+||+..+ T Consensus 167 ~a~~GD~liLTK----plGtgi~~~a~~~~~~~~~~~~~a~----~~m~~-ln~~aa~l~~~~~v~A~tDvTgfGL~GhL 237 (347) T PRK00943 167 GAKAGDVLILTK----PLGIGVLTAAEKKSLLDPEHQGLAI----EVMCE-LNTPGARFALLPGVHAMTDVTGFGLLGHL 237 (347) T ss_pred CCCCCCEEEEEC----CCCCHHHHHHHHCCCCCHHHHHHHH----HHHHH-CCHHHHHHHHHCCCCEEECCCCCHHHHHH T ss_conf 888888899825----7501499999873689999999999----99985-35899999753499622214533589999 Q ss_pred HHHHCCCCCCEEEEEHHHCCCCCCC--------------------------CHH--HHHH-CCCCCEEEEEEECCCHHHH Q ss_conf 3320147752899620220466789--------------------------998--9861-7888508999954675889 Q gi|254780970|r 277 IEMGNQGNLGITLDLDKVPTCEEGM--------------------------TAY--EMML-SESQERMLMILNPEKQHKA 327 (737) Q Consensus 277 ~Ema~~~~~G~~i~l~~vp~~~~~l--------------------------~~~--ei~~-sEsQeR~~~~v~~~~~~~~ 327 (737) .||+..+++|++|++++||+.+.-. +.+ .+++ -++-.-++++|+|++.+++ T Consensus 238 ~Ema~~S~v~~~i~~~~iPv~~~v~~l~~~g~~~~~~~~N~~~~~~~~~~~~~~~~~ll~DPQTSGGLLiaV~~e~a~~~ 317 (347) T PRK00943 238 LEMARGSGVQARIDYAAIPRLPGVEELIALGCVPGGTGRNFASYGHLMGELPREQRALLCDPQTSGGLLVAVAPEAEDEV 317 (347) T ss_pred HHHHHCCCCEEEEEECCCCCCHHHHHHHHCCCCCCCHHHHHHHHHHCCCCCCHHHHHHCCCCCCCCCEEEEECHHHHHHH T ss_conf 99872589659997488766545999998689875217789876321445664667534385577767999859999999 Q ss_pred HHHHHHHCCEEECCCCEEECC Q ss_conf 998887097131122121044 Q gi|254780970|r 328 QEILNKWGLHFSIIGITTNDK 348 (737) Q Consensus 328 ~~i~~~~~~~~~~iG~vt~~~ 348 (737) ++.+++.+.++++||+||+.. T Consensus 318 l~~L~~~G~~aa~IG~V~e~~ 338 (347) T PRK00943 318 LATFAEFGIELAAIGELVEAR 338 (347) T ss_pred HHHHHHCCCCCEEEEEEEECC T ss_conf 999996799967999999679 No 44 >COG0611 ThiL Thiamine monophosphate kinase [Coenzyme metabolism] Probab=99.94 E-value=3.7e-25 Score=191.47 Aligned_cols=262 Identities=22% Similarity=0.269 Sum_probs=184.8 Q ss_pred HHHHHCCCCCCCCCCCCCCCCCCEEEECCCCCCEEEEECCCC---C-CCHHCCCHHHHHHHHHHHHHHHHCCCCCHHHHH Q ss_conf 565311210001100001357631420377762279730453---2-100001001135678977764102785102232 Q gi|254780970|r 416 WVYEQYDTMIQSNSIQLPGGDAGVIRVEGHETKALAFSSDVT---P-RYVKADPFEGTKQAVAECWRNIIATGAKPLAIT 491 (737) Q Consensus 416 ~i~~qyD~~V~g~tv~~Pg~DaaVv~~~~~~~~gia~s~g~~---p-~~~~~dP~~ga~~AV~Eal~Nl~a~Ga~Plait 491 (737) .|-+.+.+..+.+.....|+|||++.+... ..+++|+|.. . ......|+..|+.+++-+++||+||||+|.+++ T Consensus 8 lI~~~~~~~~~~~~~~~~GDDaA~v~~~~~--~~lvvttD~lv~~~hF~~~~~p~d~G~Ka~a~NlSDlAAMGa~P~~~~ 85 (317) T COG0611 8 LIKRYFKRRQREDVVLGIGDDAALVDAPEG--QRLVVTTDMLVEGTHFPPDMTPEDLGWKALAVNLSDLAAMGARPKAFL 85 (317) T ss_pred HHHHHHHHCCCCCCCCCCCCCEEEEECCCC--CEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCHHHE T ss_conf 999998403765533257776479815888--648999612125355699999999878999988889997488765342 Q ss_pred HHCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCC Q ss_conf 21024566686213469999999985221028866544320011236866676311122223335202331021438980 Q gi|254780970|r 492 DNLNFGNPEKEEIMGQFVHSVKGIREACQILDFPIVSGNVSFYNETNGQSIFPTPTIAGVGILPDYSLMTRIDSAHEGDL 571 (737) Q Consensus 492 ~~lnf~~P~~~e~~~ql~~av~gl~d~c~~lgipivgGkvSl~n~t~~~~I~pTpvi~~vG~v~d~~~~it~~~k~~Gd~ 571 (737) .++.+|. +-++. .+....+|+.++|+.|++++||| +|+.. +-+..++++|.++... .+....+++||. T Consensus 86 lsl~lP~-~~d~~--~~~~~~~Gi~e~~~~y~~~lIGG------Dt~~~--~l~is~t~iG~~~~~~-~l~R~gAkpGD~ 153 (317) T COG0611 86 LSLGLPP-DLDEE--WLEALADGIFEAAKKYGVKLIGG------DTNRG--PLSISVTAIGVLPKGR-ALLRSGAKPGDL 153 (317) T ss_pred EEEECCC-CCCHH--HHHHHHHHHHHHHHHCCCEEECC------CCCCC--CEEEEEEEEEECCCCC-CEECCCCCCCCE T ss_conf 6552899-99999--99999999999999829808643------36787--3589999999617874-100469999999 Q ss_pred EEECCCCCCCCCHHHHHHHHHCCCCCCCCCCCHHHHHHHHH------HHHHHHHCCCEEEEEECCCCHHHHHHHHHHHCC Q ss_conf 75137766554326788886116789996479899999999------999999959806994648657999999998458 Q gi|254780970|r 572 ILMIGNDGCHLDCSMYSLECASSNIGPPPKVDCHLEKNHGF------FVLSMINAKKITACHDISTGGLIITLAEMTISS 645 (737) Q Consensus 572 I~liG~~~~~LggS~~~~~~~~~~~g~~p~~d~~~~k~~~~------~v~~li~~g~I~S~HDiSdGGL~~aL~EMa~as 645 (737) |+++|..+. |.....+... ...+.......+++.. .-+.+.. ..+|+.|+|| ||+..|.+++.+| T Consensus 154 v~vtG~lG~----saagl~ll~~--~~~~~~~~~l~~r~~~P~Prv~~g~~l~~--~a~aa~DiSD-GL~~dL~~i~~aS 224 (317) T COG0611 154 VAVTGTLGR----SAAGLELLLN--VLGPEDEEELIERHLRPTPRVELGLALAK--LASAAMDISD-GLAADLGHIARAS 224 (317) T ss_pred EEECCCCCH----HHHHHHHHHC--CCCCCHHHHHHHHHCCCCCCHHHHHHHHH--HHHHHEECCH-HHHHHHHHHHHHC T ss_conf 999188766----4899999961--46753038999984499997567899998--8777512306-5999999999982 Q ss_pred CCCEEEEECCCC-----------CCCHHHH--HHCCCCCCEEEEECHHHHHHHHHHHHHCCCCEEEEEEEC Q ss_conf 970899805666-----------8987888--729883218999968809989999985798499989987 Q gi|254780970|r 646 AKGMDIILPIEI-----------EKDPKPF--LFGEDQGRYVVCISPENQDLVMSEANNKNIPLRYLGKVS 703 (737) Q Consensus 646 ~~G~~I~L~~~~-----------~~d~~~~--LFsEs~Gr~ii~V~~~~~~~~~~~~~~~gi~~~~IG~Vt 703 (737) +||++|+....+ ..+++++ ..+|++. +++++++++.+.++.+++..+ +++||+++ T Consensus 225 ~vg~~I~~~~lp~~~~~~~~~~~~~~~~~~aL~gGEDyE-L~ft~p~~~~~~~~~~~~~~~--~~~IG~v~ 292 (317) T COG0611 225 GVGIVIDEDLLPLSDAVLEALDELGDPLEWALSGGEDYE-LVFTVPEENREALLDALRSLG--VTIIGRVT 292 (317) T ss_pred CCEEEEEECCCCCCHHHHHHHHCCCCHHHHHHCCCCCEE-EEEEECCHHHHHHHHHHHHCC--CEEEEEEE T ss_conf 981999843378867789877544778999863687448-999837024799999987549--66999996 No 45 >PRK05297 phosphoribosylformylglycinamidine synthase; Provisional Probab=99.94 E-value=9.3e-26 Score=195.54 Aligned_cols=285 Identities=18% Similarity=0.272 Sum_probs=214.2 Q ss_pred CCCCCEEEEEE--CCCEEEEEEECCCCCC-CCCCCCCCCCCCCCCCCCCCCCCCCEEEE-EEEECCC----CCCCCCCHH Q ss_conf 33477368983--6983999995178877-76588887210558102257677874368-7750457----875686388 Q gi|254780970|r 70 GPGENAGVVDI--GGGDCVVFKMESHNHP-SYIEPYQGAATGVGGILRDIFTMGARPVA-AMNSLRF----GAIHHPKTK 141 (737) Q Consensus 70 ~~~~~a~vi~~--~~~~~~~~k~EtHNhP-s~i~P~~GAaTg~gG~iRDi~~~Ga~p~a-~~~~l~~----g~~~~p~~~ 141 (737) -|.-|++|+.. +.....++.+- +|-| +.++||.||.--|+-.+|++.+.|...+. +--+.+| |+|.. . T Consensus 674 ~PvaD~aV~~~~~~~~~G~Ama~G-e~P~~a~~dP~agar~AVaEAltNLvaa~i~~l~~i~LSaNWM~a~g~pgE---~ 749 (1294) T PRK05297 674 VPVADCAVTAASYDGYAGEAMSMG-ERTPVALLDAAASARMAVGEALTNIAAAPIGDLKRIKLSANWMAAAGHPGE---D 749 (1294) T ss_pred CCCCCCEEEEECCCCCCEEEEEEC-CCCCEEECCHHHHHHHHHHHHHHHHHHCCCCCHHHEEEEHHHCCCCCCCCH---H T ss_conf 576772278513789726999834-788205039889999999999998663456667663744100445799853---6 Q ss_pred HHHHHHHHHHH-HHHHHCCCCCCCCEEEEE--CCCC---------CCCCCCEEEECCC-CHHHHHHCCCCCCCCEEEEEC Q ss_conf 89988874588-665430872113203552--1567---------6642211221121-135543112256897799965 Q gi|254780970|r 142 HLLSGVVAGIA-GYSNSFGVPTVGGEVEFL--PCYN---------NNIIVNTFAAGIA-KTNAIFSSKARGIGLPLVYLG 208 (737) Q Consensus 142 ~~~~~~~~G~~-~ygN~~G~P~v~G~~~~~--~~y~---------~~~~v~~~~~Gi~-~~~~~~~~~~~~~Gd~ii~~G 208 (737) .-+-++++.++ ++|-++|+|+++|..+.. ..|+ -++.+.+.+.|.+ +.++.++..-+++|+.|+++. T Consensus 750 a~Ly~av~Ai~~e~c~~Lgi~ip~GKDSlSm~~~~~e~~~~~~v~~P~TLVisa~a~v~Dv~~~vTPelK~~gs~L~lId 829 (1294) T PRK05297 750 AGLYDAVKAVGMELCPALGITIPVGKDSLSMKTKWQEGGEDKEVTSPLSLIISAFAPVEDVRKTLTPQLRTDDTVLLLID 829 (1294) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEECCCCCCEECCCCCEEEEEEEEECCCCCCCCCCEECCCCEEEEEE T ss_conf 89999999999988998398634576666755265218975243378877999999714711032644315998799997 Q ss_pred CCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCHHCCCCCCCCHHHHHHHHHCCCCCCEE Q ss_conf 76564441046666777286533344433267989998889998740113830110347752011122332014775289 Q gi|254780970|r 209 AKTGRDGIGGASMASEEFGENIAKKRPTVQVGDPFTGKCLLEACLELMNTDAVIAIQDMGAAGLTCSAIEMGNQGNLGIT 288 (737) Q Consensus 209 ~~tg~dGi~Ga~~sS~~~~~~~~~~~~~VQ~gdp~~ek~~~~~~~~~~~~~~i~~~~D~gaGGl~~a~~Ema~~~~~G~~ 288 (737) -..++..|||++++. .++. .....|. +.||..-|++.+++.+++.+++|.++||++.|||..++.|||..+++|++ T Consensus 830 l~~~~~~LGGS~laq-v~~~-~g~~~Pd--vd~~~~lk~~f~~iq~Li~~glI~S~HDvSDGGL~~alaEMaFag~~G~~ 905 (1294) T PRK05297 830 LGRGKNRLGGSALAQ-VYNQ-LGDEAPD--VDDPEDLKGFFNAIQALVAEGLLLAYHDRSDGGLFATLAEMAFAGHCGLD 905 (1294) T ss_pred CCCCCCCCHHHHHHH-HHCC-CCCCCCC--CCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHCCCCEEE T ss_conf 689967521449999-8565-6798969--77899999999999999977987898317778099999999846785148 Q ss_pred EEEHHHCCCCCCCCHHHHHHCCCCCEEEEEEECCCHHHHHHHHHHHCCE--EECCCCEEECCCCCCCCCCCCCCCCCCCC Q ss_conf 9620220466789998986178885089999546758899988870971--31122121044310002586443443210 Q gi|254780970|r 289 LDLDKVPTCEEGMTAYEMMLSESQERMLMILNPEKQHKAQEILNKWGLH--FSIIGITTNDKLFRVIHRGEEVANLPIKA 366 (737) Q Consensus 289 i~l~~vp~~~~~l~~~ei~~sEsQeR~~~~v~~~~~~~~~~i~~~~~~~--~~~iG~vt~~~~~~v~~~g~~v~dlp~~~ 366 (737) |+|+... -++.+.++||.-. .++.|++++++.+.+++++.++. +.+||+++++.++.+.++++.+.+.++.. T Consensus 906 I~l~~~~-----~d~~~~LFsEe~G-~Viqv~~~~~~~V~~~l~~~gl~~~~~~IG~~~~~~~i~I~~~~~~i~~~~~~~ 979 (1294) T PRK05297 906 IDLDALG-----DDLLAALFNEELG-AVIQVRAADRDAVEAILAEAGLSDCSHVIGKPNAGDRIEITRNGKTVFSESRTE 979 (1294) T ss_pred EECCCCC-----HHHHHHHHCCCCE-EEEEECCCCHHHHHHHHHHCCCCCCEEEEEEECCCCCEEEEECCEEEEHHHHHH T ss_conf 9667652-----3578887451753-899971342899999998759852048975376787289998999873623999 Q ss_pred CC Q ss_conf 14 Q gi|254780970|r 367 LS 368 (737) Q Consensus 367 L~ 368 (737) |. T Consensus 980 L~ 981 (1294) T PRK05297 980 LR 981 (1294) T ss_pred HH T ss_conf 99 No 46 >pfam02769 AIRS_C AIR synthase related protein, C-terminal domain. This family includes Hydrogen expression/formation protein HypE, AIR synthases EC:6.3.3.1, FGAM synthase EC:6.3.5.3 and selenide, water dikinase EC:2.7.9.3. The function of the C-terminal domain of AIR synthase is unclear, but the cleft formed between N and C domains is postulated as a sulphate binding site. Probab=99.93 E-value=2.5e-25 Score=192.62 Aligned_cols=149 Identities=31% Similarity=0.456 Sum_probs=136.1 Q ss_pred CCCCEEEEECCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCHHCCCCCCCCHHHHHHH Q ss_conf 68977999657656444104666677728653334443326798999888999874011383011034775201112233 Q gi|254780970|r 199 GIGLPLVYLGAKTGRDGIGGASMASEEFGENIAKKRPTVQVGDPFTGKCLLEACLELMNTDAVIAIQDMGAAGLTCSAIE 278 (737) Q Consensus 199 ~~Gd~ii~~G~~tg~dGi~Ga~~sS~~~~~~~~~~~~~VQ~gdp~~ek~~~~~~~~~~~~~~i~~~~D~gaGGl~~a~~E 278 (737) ++||.||++|. +|+||+++++....... ...+|++|+++++++++.++++...++|++|||+++|||++++.| T Consensus 1 k~GD~ii~~g~----~g~~g~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~l~~~~~i~a~~Dis~GGL~~~l~e 73 (150) T pfam02769 1 KPGDVLILLGS----SGLGGSGLSLVRKGLEE---DGAVPLGDPLLEPTLIYVVLLLAALGLVKAAHDITGGGLAGALAE 73 (150) T ss_pred CCCCEEEEECC----CCHHHHHHHHHHHHHCC---CCCCCCCCHHHCCHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHH T ss_conf 99989999899----77289999999731100---599842483206268999999886699679985599739999987 Q ss_pred HHCCCCCCEEEEEHHHCCCCCCCCHHHHHHCCCCCEEEEEEECCCHHHHHHHHHHHCCEEECCCCEEECCCCCCCC Q ss_conf 2014775289962022046678999898617888508999954675889998887097131122121044310002 Q gi|254780970|r 279 MGNQGNLGITLDLDKVPTCEEGMTAYEMMLSESQERMLMILNPEKQHKAQEILNKWGLHFSIIGITTNDKLFRVIH 354 (737) Q Consensus 279 ma~~~~~G~~i~l~~vp~~~~~l~~~ei~~sEsQeR~~~~v~~~~~~~~~~i~~~~~~~~~~iG~vt~~~~~~v~~ 354 (737) |+.++++|++|+++++|..+..++++|.|++|+|+|++++|+|++.+++.++++++++++.+||+||+++.+.+.. T Consensus 74 ~~~~~~~g~~i~~~~~~~~~~~~~~~e~l~~e~~G~~~v~v~~~~~~~v~~~~~~~~~~~~~IG~v~~~~~~~~~~ 149 (150) T pfam02769 74 MAPASGVGAEIDLDKVPIFDELLLPLEMLFSENQGRGLVVVVPEEAEAVLAILEEEGLPAAVIGEVTAGGGLELVV 149 (150) T ss_pred HHHCCCEEEEEECCCCCCCCCCCCHHHHCCCCCCCEEEEEECHHHHHHHHHHHHHCCCCEEEEEEEEECCCEEEEE T ss_conf 6332561689756778603555654551232478638999868899999999998798949999999089799997 No 47 >COG0309 HypE Hydrogenase maturation factor [Posttranslational modification, protein turnover, chaperones] Probab=99.93 E-value=9.2e-25 Score=188.72 Aligned_cols=264 Identities=23% Similarity=0.301 Sum_probs=184.7 Q ss_pred CCCCCCCCCCCCEEEECCCCCCEEEEECCCC---CCCHHCCCH-HHHHHHHHHHHHHHHCCCCCHHHHHHHCCCCCCCCC Q ss_conf 1100001357631420377762279730453---210000100-113567897776410278510223221024566686 Q gi|254780970|r 427 SNSIQLPGGDAGVIRVEGHETKALAFSSDVT---PRYVKADPF-EGTKQAVAECWRNIIATGAKPLAITDNLNFGNPEKE 502 (737) Q Consensus 427 g~tv~~Pg~DaaVv~~~~~~~~gia~s~g~~---p~~~~~dP~-~ga~~AV~Eal~Nl~a~Ga~Plait~~lnf~~P~~~ 502 (737) +..-..-+.|++.+.+... -+|+|+|.. |.+. |. ..+++||+-+++|++.+||+|.+++..++.+---.- T Consensus 31 ~~v~~~~g~D~~~i~l~~~---~la~tTD~~~i~P~ff---~~~diG~lAV~gt~NDlav~GA~P~~l~~~lil~eg~~~ 104 (339) T COG0309 31 VNVGLANGEDAAIIDLGDG---VLAFTTDPFVIDPLFF---PGGDIGKLAVHGTANDVAVSGAKPRYLSVGLILPEGLPI 104 (339) T ss_pred CCCCCCCCCCEEEEECCCC---EEEEEECCEEECCCCC---CCCCEEEEEEEEEHHHHHHCCCCCEEEEEEEECCCCCCH T ss_conf 6548655664113403784---6999818807546645---788628999998622544058986002677735899987 Q ss_pred CHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCCEEECCCCCCCC Q ss_conf 21346999999998522102886654432001123686667631112222333520233102143898075137766554 Q gi|254780970|r 503 EIMGQFVHSVKGIREACQILDFPIVSGNVSFYNETNGQSIFPTPTIAGVGILPDYSLMTRIDSAHEGDLILMIGNDGCHL 582 (737) Q Consensus 503 e~~~ql~~av~gl~d~c~~lgipivgGkvSl~n~t~~~~I~pTpvi~~vG~v~d~~~~it~~~k~~Gd~I~liG~~~~~L 582 (737) -.|.+.++.|.+.|+++|++|++|+.--|..+ +.+.-...+++|.+.. ...+++...++||.|++.|..+.| T Consensus 105 ---e~l~~i~~si~e~a~~~Gv~IvtGdTkV~~~~---~~~~vi~tT~iG~~~~-~~~v~~~~~~~GD~vI~tg~~g~h- 176 (339) T COG0309 105 ---EDLERILKSIDEEAEEAGVSIVTGDTKVVPGG---KDPIVINTTGIGIIDK-EILVSPSGARPGDAVIVTGTIGIH- 176 (339) T ss_pred ---HHHHHHHHHHHHHHHHCCCEEECCCCEEECCC---CCCCEEEEEEEEEECC-CCCCCCCCCCCCCEEEECCCHHHH- T ss_conf ---89999999999999983974982676661599---8772898646775137-760145789988889981881688- Q ss_pred CHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHC---CCEEEEEECCCCHHHHHHHHHHHCCCCCEEEEECCCCCC Q ss_conf 32678888611678999647989999999999999995---980699464865799999999845897089980566689 Q gi|254780970|r 583 DCSMYSLECASSNIGPPPKVDCHLEKNHGFFVLSMINA---KKITACHDISTGGLIITLAEMTISSAKGMDIILPIEIEK 659 (737) Q Consensus 583 ggS~~~~~~~~~~~g~~p~~d~~~~k~~~~~v~~li~~---g~I~S~HDiSdGGL~~aL~EMa~as~~G~~I~L~~~~~~ 659 (737) |.|+++..+........ ..|.....+.+......... -.|+|+||.++|||+.+|.|||.+|++|+.|+-+..+-. T Consensus 177 ga~ila~~~~~~l~~~l-~sD~~~~~~~~~~~l~~~~~~~~~~vtAMhDaTrGGla~aLnEmA~aSgvgi~I~ee~Ipv~ 255 (339) T COG0309 177 GASILAHRFGEELETEL-GSDCAPLAKLVKALLSVVGEALAAAVTAMHDATRGGLAGALNEMAEASGVGISIEEEKIPVR 255 (339) T ss_pred HHHHHHHHCCHHHCCCH-HHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCCC T ss_conf 99999875202203323-56678889999988620057766567650387536788999999997498499962466664 Q ss_pred CH---HHHHHCC------CCCCEEEEECHHHHHHHHHHHHHCC-CCEEEEEEECCC Q ss_conf 87---8887298------8321899996880998999998579-849998998798 Q gi|254780970|r 660 DP---KPFLFGE------DQGRYVVCISPENQDLVMSEANNKN-IPLRYLGKVSGS 705 (737) Q Consensus 660 d~---~~~LFsE------s~Gr~ii~V~~~~~~~~~~~~~~~g-i~~~~IG~Vt~~ 705 (737) +. .+.+|+- +.|.+|+.|++++.+++++++++++ +++..||+|+.+ T Consensus 256 ~eVr~vce~lGiDPl~~anEG~lv~~V~~~~a~~~l~~L~~~~~~~A~iIGeV~~~ 311 (339) T COG0309 256 EEVRGVCELLGLDPLELANEGKLVIAVPPEHAEEVLEALRSHGLKDAAIIGEVVEE 311 (339) T ss_pred HHHHHHHHHHCCCHHHHHCCCEEEEEECHHHHHHHHHHHHHCCCCCCEEEEEEECC T ss_conf 89999999839887887258609999788999999999986499542268998326 No 48 >TIGR01379 thiL thiamine-monophosphate kinase; InterPro: IPR006283 This family represents thiamine-monophosphate kinase, an enzyme that converts thiamine monophosphate into thiamine pyrophosphate (TPP, coenzyme B1), an enzyme cofactor. Thiamine monophosphate may be derived from de novo synthesis or from unphosphorylated thiamine, known as vitamin B1. Eukaryotes lack this enzyme, and add pyrophosphate from ATP to unphosphorylated thiamine in a single step. ; GO: 0009030 thiamin phosphate kinase activity, 0009228 thiamin biosynthetic process. Probab=99.92 E-value=3.3e-24 Score=184.91 Aligned_cols=282 Identities=24% Similarity=0.303 Sum_probs=201.1 Q ss_pred HHCCCCEEECCCCCEEEEEECCCEEEEEE----ECCCCCCCCC-CCCCCCCCCCCCCCCCCCCCCCEEEEEEEECCCCCC Q ss_conf 20697489933477368983698399999----5178877765-888872105581022576778743687750457875 Q gi|254780970|r 61 PTTGKHVIQGPGENAGVVDIGGGDCVVFK----MESHNHPSYI-EPYQGAATGVGGILRDIFTMGARPVAAMNSLRFGAI 135 (737) Q Consensus 61 ~~~~~~vi~~~~~~a~vi~~~~~~~~~~k----~EtHNhPs~i-~P~~GAaTg~gG~iRDi~~~Ga~p~a~~~~l~~g~~ 135 (737) ....+.|+.+.|||||+|+..++.-++++ +|.=-||..+ .|+.=+=-=+.=.+=|+.+|||+|.+.+-++..+. T Consensus 15 ~~~~~~~~~~~GDDAA~~~~~~~~~lv~t~D~Lve~~HF~~~~~~Pe~~G~K~~avNlSDlAAmGA~P~~~~~s~~~P~- 93 (336) T TIGR01379 15 LVKDDSVALGIGDDAALVSPPEGEDLVLTTDTLVEGVHFPPDTTPPEDLGWKAVAVNLSDLAAMGATPKWFLLSLGLPS- 93 (336) T ss_pred CCCCCCCCCCCCCCEEEECCCCCCEEEEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCHHHHHHEEECCC- T ss_conf 1478763324764123115799963899870232475788677898999999999749999730570232231001688- Q ss_pred CCCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEECCCCCCCCCCEEEECCCCHHHHHHCCCCCCCCEEEEECCCCCCCC Q ss_conf 68638889988874588665430872113203552156766422112211211355431122568977999657656444 Q gi|254780970|r 136 HHPKTKHLLSGVVAGIAGYSNSFGVPTVGGEVEFLPCYNNNIIVNTFAAGIAKTNAIFSSKARGIGLPLVYLGAKTGRDG 215 (737) Q Consensus 136 ~~p~~~~~~~~~~~G~~~ygN~~G~P~v~G~~~~~~~y~~~~~v~~~~~Gi~~~~~~~~~~~~~~Gd~ii~~G~~tg~dG 215 (737) +.+..| ++....|+..-++++|++.|||+|..- ..+.+.++|.++.+..+....++|||.|.+.| ..| T Consensus 94 -~~~~~~-~~~f~~G~~~~~~~Y~~~LiGGDT~~~------~~~~~T~iG~~~~~~~~~RsgAk~GD~v~VTG----~lG 161 (336) T TIGR01379 94 -DLDEAW-LEAFYDGLFELAKQYGVDLIGGDTVKS------LVVTVTAIGEAPKGRALLRSGAKPGDLVFVTG----TLG 161 (336) T ss_pred -CCCHHH-HHHHHHHHHHHHHHCCCEEEECCCCCE------EEEEEEEEEEECCCCEEECCCCCCCCEEEEEC----CCC T ss_conf -988899-999999999987553987872440021------31434578975689733236787776789958----832 Q ss_pred CHHHHHHHHHHCCCCCCCCCCCCC-CCHHH-HHHH-----HHHHHHHCCCCCCH--HCCCCCCCCHHHHHHHHHCCCCCC Q ss_conf 104666677728653334443326-79899-9888-----99987401138301--103477520111223320147752 Q gi|254780970|r 216 IGGASMASEEFGENIAKKRPTVQV-GDPFT-GKCL-----LEACLELMNTDAVI--AIQDMGAAGLTCSAIEMGNQGNLG 286 (737) Q Consensus 216 i~Ga~~sS~~~~~~~~~~~~~VQ~-gdp~~-ek~~-----~~~~~~~~~~~~i~--~~~D~gaGGl~~a~~Ema~~~~~G 286 (737) -..+...--..++....+ ..+.. .+.++ ||.+ ++.-..+......+ ++.|+|+| |+..+.+++.+|++| T Consensus 162 ~saaGL~ll~~G~~~~~~-~~~~~~~~~~~~~r~l~P~PR~~~G~~l~~~~~a~W~Aa~D~SDG-L~~dL~hIa~AS~vg 239 (336) T TIGR01379 162 DSAAGLELLLKGKEEGKK-KEVDEEDDEALLQRHLRPEPRVEEGLALARASLANWVAAIDVSDG-LASDLGHIAEASGVG 239 (336) T ss_pred HHHHHHHHHHHHHHHHCC-CCCCCHHHHHHHHHHCCCCCCHHHHHHHHHCCHHHEHHHHCCCHH-HHHHHHHHHHHCCCE T ss_conf 689999999734664025-677851479999986389982899999974543323121220477-999999999728962 Q ss_pred EEEEEHHHCCCC----------CCCCHHHHHHC--C-CCCEEEEEEECCCHHHHHHHHHHHCCEEECCCCEEECCCC--C Q ss_conf 899620220466----------78999898617--8-8850899995467588999888709713112212104431--0 Q gi|254780970|r 287 ITLDLDKVPTCE----------EGMTAYEMMLS--E-SQERMLMILNPEKQHKAQEILNKWGLHFSIIGITTNDKLF--R 351 (737) Q Consensus 287 ~~i~l~~vp~~~----------~~l~~~ei~~s--E-sQeR~~~~v~~~~~~~~~~i~~~~~~~~~~iG~vt~~~~~--~ 351 (737) ++|+++++|+.+ ...+|+|..++ | +| .+++++|++.+.++.++++.++++++||+||+.... . T Consensus 240 ~~i~~~~LP~~~~~~~~~~~~~~~~~~~~~aL~gGEDyE--LvfT~~~~~~~~l~~~~~~~~v~~~~IG~v~~g~~~~~~ 317 (336) T TIGR01379 240 IRIDLDKLPLSSELKEIADTEELGKQPLEWALSGGEDYE--LVFTVPPERREALLDALKALGVPLTRIGRVTEGEGGSVV 317 (336) T ss_pred EEEECCCCCCCHHHHHHHHCCCCCCCHHHHHHHCCCCEE--EEEECCHHHHHHHHHHHHHCCCCEEEEEEEEECCCEEEE T ss_conf 998526478708999886306310027999987288332--455158667889999997558766899999864870599 Q ss_pred CCCCCCCC Q ss_conf 00258644 Q gi|254780970|r 352 VIHRGEEV 359 (737) Q Consensus 352 v~~~g~~v 359 (737) +..+|+++ T Consensus 318 ~~~~g~~~ 325 (336) T TIGR01379 318 LLADGETV 325 (336) T ss_pred EEECCCEE T ss_conf 98668557 No 49 >COG0611 ThiL Thiamine monophosphate kinase [Coenzyme metabolism] Probab=99.86 E-value=2e-20 Score=159.01 Aligned_cols=271 Identities=22% Similarity=0.292 Sum_probs=187.0 Q ss_pred CCCEEECCCCCEEEEEECCCEEEEEE----ECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEECCCC-CCCCC Q ss_conf 97489933477368983698399999----51788777658888721055810225767787436877504578-75686 Q gi|254780970|r 64 GKHVIQGPGENAGVVDIGGGDCVVFK----MESHNHPSYIEPYQGAATGVGGILRDIFTMGARPVAAMNSLRFG-AIHHP 138 (737) Q Consensus 64 ~~~vi~~~~~~a~vi~~~~~~~~~~k----~EtHNhPs~i~P~~GAaTg~gG~iRDi~~~Ga~p~a~~~~l~~g-~~~~p 138 (737) ...+..+.||||++++..++..++++ +|.--+|....|+.=+=--+.-.+-|+.+|||+|.+.+-+|..+ +.+ T Consensus 18 ~~~~~~~~GDDaA~v~~~~~~~lvvttD~lv~~~hF~~~~~p~d~G~Ka~a~NlSDlAAMGa~P~~~~lsl~lP~~~d-- 95 (317) T COG0611 18 REDVVLGIGDDAALVDAPEGQRLVVTTDMLVEGTHFPPDMTPEDLGWKALAVNLSDLAAMGARPKAFLLSLGLPPDLD-- 95 (317) T ss_pred CCCCCCCCCCCEEEEECCCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCHHHEEEEECCCCCC-- T ss_conf 655332577764798158886489996121253556999999998789999888899974887653426552899999-- Q ss_pred CHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEECCCCCCCCCCEEEECCCCHHHHHHCCCCCCCCEEEEECCCCCCCCCHH Q ss_conf 38889988874588665430872113203552156766422112211211355431122568977999657656444104 Q gi|254780970|r 139 KTKHLLSGVVAGIAGYSNSFGVPTVGGEVEFLPCYNNNIIVNTFAAGIAKTNAIFSSKARGIGLPLVYLGAKTGRDGIGG 218 (737) Q Consensus 139 ~~~~~~~~~~~G~~~ygN~~G~P~v~G~~~~~~~y~~~~~v~~~~~Gi~~~~~~~~~~~~~~Gd~ii~~G~~tg~dGi~G 218 (737) ...++....|+.+..+.+|++.|||++-. ++..+.+.++|.++++.......++|||.|++.| ..|-.. T Consensus 96 --~~~~~~~~~Gi~e~~~~y~~~lIGGDt~~-----~~l~is~t~iG~~~~~~~l~R~gAkpGD~v~vtG----~lG~sa 164 (317) T COG0611 96 --EEWLEALADGIFEAAKKYGVKLIGGDTNR-----GPLSISVTAIGVLPKGRALLRSGAKPGDLVAVTG----TLGRSA 164 (317) T ss_pred --HHHHHHHHHHHHHHHHHCCCEEECCCCCC-----CCEEEEEEEEEECCCCCCEECCCCCCCCEEEECC----CCCHHH T ss_conf --99999999999999998298086433678-----7358999999961787410046999999999918----876648 Q ss_pred HHHHHHHHCCCCCCC---CCCCCCCCHHHHHHHHHHHHHHCCCCCCHHCCCCCCCCHHHHHHHHHCCCCCCEEEEEHHHC Q ss_conf 666677728653334---44332679899988899987401138301103477520111223320147752899620220 Q gi|254780970|r 219 ASMASEEFGENIAKK---RPTVQVGDPFTGKCLLEACLELMNTDAVIAIQDMGAAGLTCSAIEMGNQGNLGITLDLDKVP 295 (737) Q Consensus 219 a~~sS~~~~~~~~~~---~~~VQ~gdp~~ek~~~~~~~~~~~~~~i~~~~D~gaGGl~~a~~Ema~~~~~G~~i~l~~vp 295 (737) +.+.........+.. ....+++-|. ++.-+.+.. ...++.|+++| |+..+.+++.+|++|++|+.+++| T Consensus 165 agl~ll~~~~~~~~~~~l~~r~~~P~Pr-----v~~g~~l~~--~a~aa~DiSDG-L~~dL~~i~~aS~vg~~I~~~~lp 236 (317) T COG0611 165 AGLELLLNVLGPEDEEELIERHLRPTPR-----VELGLALAK--LASAAMDISDG-LAADLGHIARASGVGIVIDEDLLP 236 (317) T ss_pred HHHHHHHCCCCCCHHHHHHHHHCCCCCC-----HHHHHHHHH--HHHHHEECCHH-HHHHHHHHHHHCCCEEEEEECCCC T ss_conf 9999996146753038999984499997-----567899998--87775123065-999999999982981999843378 Q ss_pred CCCC-------CCCHHHHHHCCCCC--EEEEEEECCCHHHHHHHHHHHCCEEECCCCEEECCCCCCCCCCCC Q ss_conf 4667-------89998986178885--089999546758899988870971311221210443100025864 Q gi|254780970|r 296 TCEE-------GMTAYEMMLSESQE--RMLMILNPEKQHKAQEILNKWGLHFSIIGITTNDKLFRVIHRGEE 358 (737) Q Consensus 296 ~~~~-------~l~~~ei~~sEsQe--R~~~~v~~~~~~~~~~i~~~~~~~~~~iG~vt~~~~~~v~~~g~~ 358 (737) +.+. ..+|+++.++ +-| -.++++++++.+.++..|++.+ +++||++++.....+..+|+. T Consensus 237 ~~~~~~~~~~~~~~~~~~aL~-gGEDyEL~ft~p~~~~~~~~~~~~~~~--~~~IG~v~~~~g~~~~~dgk~ 305 (317) T COG0611 237 LSDAVLEALDELGDPLEWALS-GGEDYELVFTVPEENREALLDALRSLG--VTIIGRVTEGEGVVVLVDGKE 305 (317) T ss_pred CCHHHHHHHHCCCCHHHHHHC-CCCCEEEEEEECCHHHHHHHHHHHHCC--CEEEEEEEECCCEEEEECCCE T ss_conf 867789877544778999863-687448999837024799999987549--669999962587389841856 No 50 >cd02196 PurM PurM (Aminoimidazole Ribonucleotide [AIR] synthetase), one of eleven enzymes required for purine biosynthesis, catalyzes the conversion of formylglycinamide ribonucleotide (FGAM) and ATP to AIR, ADP, and Pi, the fifth step in de novo purine biosynthesis. The N-terminal domain of PurM is related to the ATP-binding domains of hydrogen expression/formation protein HypE, the AIR synthases, selenophosphate synthetase (SelD), and FGAM synthase and is thought to bind ATP. Probab=99.86 E-value=4.3e-20 Score=156.68 Aligned_cols=222 Identities=22% Similarity=0.316 Sum_probs=104.6 Q ss_pred CCCCCCCCCEEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEECC-CCCC-CCCCEEEECCCCHH Q ss_conf 225767787436877504578756863888998887458866543087211320355215-6766-42211221121135 Q gi|254780970|r 113 LRDIFTMGARPVAAMNSLRFGAIHHPKTKHLLSGVVAGIAGYSNSFGVPTVGGEVEFLPC-YNNN-IIVNTFAAGIAKTN 190 (737) Q Consensus 113 iRDi~~~Ga~p~a~~~~l~~g~~~~p~~~~~~~~~~~G~~~ygN~~G~P~v~G~~~~~~~-y~~~-~~v~~~~~Gi~~~~ 190 (737) ++|++++||+|++.+|.+-.+.++. ..+.++++|++..|.+.|+|.+||+|...|+ |..+ .-+.++|+|+++++ T Consensus 54 VNDi~~~GA~Pl~flDY~a~~~l~~----~~~~~iv~Gi~~~c~~~~~~liGGETAemPg~y~~~~~DlaG~~vGiv~k~ 129 (297) T cd02196 54 VNDILCQGAEPLFFLDYIATGKLDP----EVAAEIVKGIAEGCRQAGCALLGGETAEMPGVYAEGEYDLAGFAVGVVEKD 129 (297) T ss_pred CCCHHCCCCEEEEEEEEEECCCCCH----HHHHHHHHHHHHHHHHCCCCEECCCCCCCCCCCCCCCEEEEEEEEEEEECH T ss_conf 4201115866135667740278899----999999999999999839967754520144656899776756899999710 Q ss_pred HHHHCCCCCCCCEEEEECCCCCCCCCHHHHHH--HHHHCCCCC-CCCCC----CCCCCHHHH--HHHHHHHHHHCCCCCC Q ss_conf 54311225689779996576564441046666--777286533-34443----326798999--8889998740113830 Q gi|254780970|r 191 AIFSSKARGIGLPLVYLGAKTGRDGIGGASMA--SEEFGENIA-KKRPT----VQVGDPFTG--KCLLEACLELMNTDAV 261 (737) Q Consensus 191 ~~~~~~~~~~Gd~ii~~G~~tg~dGi~Ga~~s--S~~~~~~~~-~~~~~----VQ~gdp~~e--k~~~~~~~~~~~~~~i 261 (737) +++.....++||.||-+. ..|+|..-|| .+.+.+... .+.+. -..++..++ |.-.+.++++.++..| T Consensus 130 ~ii~~~~i~~GDviIgl~----SsG~HsNGySLiRkil~~~~~~~~~~~~~~~~tl~~~LL~PtriY~~~v~~l~~~~~i 205 (297) T cd02196 130 KIIDGSKIKPGDVLIGLP----SSGLHSNGYSLVRKILFEEGLDYDDPEPGLGKTLGEELLTPTRIYVKPILPLLEKVLV 205 (297) T ss_pred HCCCCCCCCCCCEEEEEC----CCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 135602377898899956----8876543288999998744665446685455609999710478888999999740771 Q ss_pred HHCCCCCCCCHHHHHHHHHCCCCCCEEEEEHHHCCCC--------CCCCHHHHHH--CCCCCEEEEEEECCCHHHHHHHH Q ss_conf 1103477520111223320147752899620220466--------7899989861--78885089999546758899988 Q gi|254780970|r 262 IAIQDMGAAGLTCSAIEMGNQGNLGITLDLDKVPTCE--------EGMTAYEMML--SESQERMLMILNPEKQHKAQEIL 331 (737) Q Consensus 262 ~~~~D~gaGGl~~a~~Ema~~~~~G~~i~l~~vp~~~--------~~l~~~ei~~--sEsQeR~~~~v~~~~~~~~~~i~ 331 (737) +++.+++.|||...+..+.. .+++++||+++.|..+ -++++.|.+= |-. =.|+++|+|++.+++.+++ T Consensus 206 ~g~aHITGGGl~~Nl~Rvlp-~~l~a~Id~~~~~~p~iF~~i~~~g~i~~~Em~~tFNmG-iGmvliv~~~~~~~v~~~l 283 (297) T cd02196 206 KGMAHITGGGLPENLPRVLP-EGLGAVIDLGSWEIPPIFKWIQKAGNVSEEEMYRTFNMG-IGMVLIVSEEDADEVLEIL 283 (297) T ss_pred CEEEECCCCCHHHCHHHHCC-CCCEEEEECCCCCCCHHHHHHHHHCCCCHHHHHHHCCCC-CEEEEEECHHHHHHHHHHH T ss_conf 13762378514313133256-880599855667998899999985898999999754787-2299998389999999999 Q ss_pred HHHCCEEECCCCE Q ss_conf 8709713112212 Q gi|254780970|r 332 NKWGLHFSIIGIT 344 (737) Q Consensus 332 ~~~~~~~~~iG~v 344 (737) ++++.++.+||+| T Consensus 284 ~~~g~~a~vIG~V 296 (297) T cd02196 284 EKLGEKAYVIGEV 296 (297) T ss_pred HHCCCCCEEEEEE T ss_conf 9769981898787 No 51 >pfam02769 AIRS_C AIR synthase related protein, C-terminal domain. This family includes Hydrogen expression/formation protein HypE, AIR synthases EC:6.3.3.1, FGAM synthase EC:6.3.5.3 and selenide, water dikinase EC:2.7.9.3. The function of the C-terminal domain of AIR synthase is unclear, but the cleft formed between N and C domains is postulated as a sulphate binding site. Probab=99.85 E-value=2.7e-20 Score=158.15 Aligned_cols=138 Identities=31% Similarity=0.420 Sum_probs=119.8 Q ss_pred CCCCCEEECCCCCCCCCHHHHHHHHHCCC-CCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHCC Q ss_conf 38980751377665543267888861167-89996479899999999999999959806994648657999999998458 Q gi|254780970|r 567 HEGDLILMIGNDGCHLDCSMYSLECASSN-IGPPPKVDCHLEKNHGFFVLSMINAKKITACHDISTGGLIITLAEMTISS 645 (737) Q Consensus 567 ~~Gd~I~liG~~~~~LggS~~~~~~~~~~-~g~~p~~d~~~~k~~~~~v~~li~~g~I~S~HDiSdGGL~~aL~EMa~as 645 (737) ++||.|+++|. ..++|+.|+....... .+.++..|+..++++++.++.++..++|+|+||||+|||+.+|.|||+++ T Consensus 1 k~GD~ii~~g~--~g~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~i~a~~Dis~GGL~~~l~e~~~~~ 78 (150) T pfam02769 1 KPGDVLILLGS--SGLGGSGLSLVRKGLEEDGAVPLGDPLLEPTLIYVVLLLAALGLVKAAHDITGGGLAGALAEMAPAS 78 (150) T ss_pred CCCCEEEEECC--CCHHHHHHHHHHHHHCCCCCCCCCCHHHCCHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHHHHHCC T ss_conf 99989999899--7728999999973110059984248320626899999988669967998559973999998763325 Q ss_pred CCCEEEEECCCCC----CCHHHHHHCCCCCCEEEEECHHHHHHHHHHHHHCCCCEEEEEEECCCE Q ss_conf 9708998056668----987888729883218999968809989999985798499989987986 Q gi|254780970|r 646 AKGMDIILPIEIE----KDPKPFLFGEDQGRYVVCISPENQDLVMSEANNKNIPLRYLGKVSGST 706 (737) Q Consensus 646 ~~G~~I~L~~~~~----~d~~~~LFsEs~Gr~ii~V~~~~~~~~~~~~~~~gi~~~~IG~Vt~~~ 706 (737) ++|++|+++..+. .++.+.||+|++|++++.|++++.+++++.+++.++++++||+|+++. T Consensus 79 ~~g~~i~~~~~~~~~~~~~~~e~l~~e~~G~~~v~v~~~~~~~v~~~~~~~~~~~~~IG~v~~~~ 143 (150) T pfam02769 79 GVGAEIDLDKVPIFDELLLPLEMLFSENQGRGLVVVVPEEAEAVLAILEEEGLPAAVIGEVTAGG 143 (150) T ss_pred CEEEEEECCCCCCCCCCCCHHHHCCCCCCCEEEEEECHHHHHHHHHHHHHCCCCEEEEEEEEECC T ss_conf 61689756778603555654551232478638999868899999999998798949999999089 No 52 >cd02196 PurM PurM (Aminoimidazole Ribonucleotide [AIR] synthetase), one of eleven enzymes required for purine biosynthesis, catalyzes the conversion of formylglycinamide ribonucleotide (FGAM) and ATP to AIR, ADP, and Pi, the fifth step in de novo purine biosynthesis. The N-terminal domain of PurM is related to the ATP-binding domains of hydrogen expression/formation protein HypE, the AIR synthases, selenophosphate synthetase (SelD), and FGAM synthase and is thought to bind ATP. Probab=99.83 E-value=1e-18 Score=147.26 Aligned_cols=257 Identities=20% Similarity=0.226 Sum_probs=177.9 Q ss_pred CCCCCEEEECC--CCCCEEEEECCCCCCCHHCC----C-HHHHHHHHHHHHHHHHCCCCCHHHHHHHCCCCCCCCCCHHH Q ss_conf 35763142037--77622797304532100001----0-01135678977764102785102232210245666862134 Q gi|254780970|r 434 GGDAGVIRVEG--HETKALAFSSDVTPRYVKAD----P-FEGTKQAVAECWRNIIATGAKPLAITDNLNFGNPEKEEIMG 506 (737) Q Consensus 434 g~DaaVv~~~~--~~~~gia~s~g~~p~~~~~d----P-~~ga~~AV~Eal~Nl~a~Ga~Plait~~lnf~~P~~~e~~~ 506 (737) |+=+++..+.. ..+..+++++|.-.....+- - ..-++-.|+-++.||+|+||+|++++|.+-.+.. +++ T Consensus 4 G~F~g~~~~~~~~~~~p~lv~~tDGVGTK~~la~~~~~~~~iG~DlVam~VNDi~~~GA~Pl~flDY~a~~~l-~~~--- 79 (297) T cd02196 4 GGFAGLFDLGLGGYKDPVLVSGTDGVGTKLKLAQEMGKHDTIGIDLVAMCVNDILCQGAEPLFFLDYIATGKL-DPE--- 79 (297) T ss_pred CCCCEEEECCCCCCCCCEEEEECCCCCHHHHHHHHHCCCCCHHHHHHHHCCCCHHCCCCEEEEEEEEEECCCC-CHH--- T ss_conf 6544645347568999679981788638999999957754530334551142011158661356677402788-999--- Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCC-CCCC----CCCCCCCCCCCCEEEEECCCCCCCCCEEECCCCCCC Q ss_conf 69999999985221028866544320011236866-6763----111222233352023310214389807513776655 Q gi|254780970|r 507 QFVHSVKGIREACQILDFPIVSGNVSFYNETNGQS-IFPT----PTIAGVGILPDYSLMTRIDSAHEGDLILMIGNDGCH 581 (737) Q Consensus 507 ql~~av~gl~d~c~~lgipivgGkvSl~n~t~~~~-I~pT----pvi~~vG~v~d~~~~it~~~k~~Gd~I~liG~~~~~ 581 (737) .+.+.++|+.++|++.+++++|| ||...+ +.+. +.-+++|.++ .++.++....++||.|+-+.+++.| T Consensus 80 ~~~~iv~Gi~~~c~~~~~~liGG------ETAemPg~y~~~~~DlaG~~vGiv~-k~~ii~~~~i~~GDviIgl~SsG~H 152 (297) T cd02196 80 VAAEIVKGIAEGCRQAGCALLGG------ETAEMPGVYAEGEYDLAGFAVGVVE-KDKIIDGSKIKPGDVLIGLPSSGLH 152 (297) T ss_pred HHHHHHHHHHHHHHHCCCCEECC------CCCCCCCCCCCCCEEEEEEEEEEEE-CHHCCCCCCCCCCCEEEEECCCCCC T ss_conf 99999999999999839967754------5201446568997767568999997-1013560237789889995688765 Q ss_pred CCHHHHHH-HHHCCC--CCC-CCCCCH-------HHHHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHCCCCCEE Q ss_conf 43267888-861167--899-964798-------9999999999999995980699464865799999999845897089 Q gi|254780970|r 582 LDCSMYSL-ECASSN--IGP-PPKVDC-------HLEKNHGFFVLSMINAKKITACHDISTGGLIITLAEMTISSAKGMD 650 (737) Q Consensus 582 LggS~~~~-~~~~~~--~g~-~p~~d~-------~~~k~~~~~v~~li~~g~I~S~HDiSdGGL~~aL~EMa~as~~G~~ 650 (737) -.|-.+.+ .+.... ... .|..+. +.-+-....++++++...|+++.+||.|||...|.++ +.++++++ T Consensus 153 sNGySLiRkil~~~~~~~~~~~~~~~~tl~~~LL~PtriY~~~v~~l~~~~~i~g~aHITGGGl~~Nl~Rv-lp~~l~a~ 231 (297) T cd02196 153 SNGYSLVRKILFEEGLDYDDPEPGLGKTLGEELLTPTRIYVKPILPLLEKVLVKGMAHITGGGLPENLPRV-LPEGLGAV 231 (297) T ss_pred CCHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEECCCCCHHHCHHHH-CCCCCEEE T ss_conf 43288999998744665446685455609999710478888999999740771137623785143131332-56880599 Q ss_pred EEECCCCCCCHHHH--------------HHCCCCCCEEEEECHHHHHHHHHHHHHCCCCEEEEEEEC Q ss_conf 98056668987888--------------729883218999968809989999985798499989987 Q gi|254780970|r 651 IILPIEIEKDPKPF--------------LFGEDQGRYVVCISPENQDLVMSEANNKNIPLRYLGKVS 703 (737) Q Consensus 651 I~L~~~~~~d~~~~--------------LFsEs~Gr~ii~V~~~~~~~~~~~~~~~gi~~~~IG~Vt 703 (737) |+.+..+......+ -|+...| |++.|++++.+++++.+++++.++++||+|. T Consensus 232 Id~~~~~~p~iF~~i~~~g~i~~~Em~~tFNmGiG-mvliv~~~~~~~v~~~l~~~g~~a~vIG~Vv 297 (297) T cd02196 232 IDLGSWEIPPIFKWIQKAGNVSEEEMYRTFNMGIG-MVLIVSEEDADEVLEILEKLGEKAYVIGEVV 297 (297) T ss_pred EECCCCCCCHHHHHHHHHCCCCHHHHHHHCCCCCE-EEEEECHHHHHHHHHHHHHCCCCCEEEEEEC T ss_conf 85566799889999998589899999975478722-9999838999999999997699818987879 No 53 >TIGR00476 selD selenide, water dikinase; InterPro: IPR004536 In prokaryotes, the incorporation of selenocysteine as the 21st amino acid, encoded by TGA, requires several elements: SelC is the tRNA itself, SelD acts as a donor of reduced selenium, SelA modifies a serine residue on SelC into selenocysteine, and SelB is a selenocysteine-specific translation elongation factor. 3' or 5' non-coding elements of mRNA have been found as probable structures for directing selenocysteine incorporation. This family describes SelD, known as selenophosphate synthetase, selenium donor protein, and selenide, water dikinase. SelD provides reduced selenium for the selenium transferase SelA. This protein itself contains selenocysteine in many species; any sequence scoring above the trusted cut-off but not aligning to the beginning of the HMM is likely have selenocysteine residue incorrectly interpreted as a stop codon upstream of the given sequence.; GO: 0004756 selenide water dikinase activity, 0005524 ATP binding. Probab=99.82 E-value=4.8e-19 Score=149.53 Aligned_cols=252 Identities=21% Similarity=0.270 Sum_probs=173.7 Q ss_pred CCCCCEEEECCCCCCEEEEECCCCCCCHHCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHCCCCCCCCC--CHHHHHHHH Q ss_conf 357631420377762279730453210000100113567897776410278510223221024566686--213469999 Q gi|254780970|r 434 GGDAGVIRVEGHETKALAFSSDVTPRYVKADPFEGTKQAVAECWRNIIATGAKPLAITDNLNFGNPEKE--EIMGQFVHS 511 (737) Q Consensus 434 g~DaaVv~~~~~~~~gia~s~g~~p~~~~~dP~~ga~~AV~Eal~Nl~a~Ga~Plait~~lnf~~P~~~--e~~~ql~~a 511 (737) .+||+|.... ++.-++-|+|...+.+. |||..||.|-+.|++||+|||..|+....-++|.+-.=+ |+--=+.+. T Consensus 57 mDdA~v~~~~--nG~~~v~T~D~~~PiVd-dPY~~G~IAaaN~~SDiYAMG~~~i~~~l~~~~~~~kL~drer~~v~Rev 133 (372) T TIGR00476 57 MDDAAVYERH--NGLSLVKTVDVITPIVD-DPYLFGRIAAANALSDIYAMGGLPIVALLILVGVSNKLSDRERIEVVREV 133 (372) T ss_pred CCCCEEEECC--CCEEEEEEEEEEEEEEC-CHHHHHHHHHHHHHHHHHHCCCCCHHHEEEEECCCCCCCCCCCCCHHHHH T ss_conf 3431135317--98899996338983256-83554489987567777743773210143330356668754442010687 Q ss_pred HHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCC--CCCCCCCCEEEEECCCCCCCCCEEECCCCCCCCCHHHHHH Q ss_conf 99998522102886654432001123686667631112--2223335202331021438980751377665543267888 Q gi|254780970|r 512 VKGIREACQILDFPIVSGNVSFYNETNGQSIFPTPTIA--GVGILPDYSLMTRIDSAHEGDLILMIGNDGCHLDCSMYSL 589 (737) Q Consensus 512 v~gl~d~c~~lgipivgGkvSl~n~t~~~~I~pTpvi~--~vG~v~d~~~~it~~~k~~Gd~I~liG~~~~~LggS~~~~ 589 (737) ++|..|+||+.++.++||+ |-. .|-|+++ +.|+++. .+.++++.+++||+|+| |++ ||-...-. T Consensus 134 ~~G~~d~~reakt~~~GGh------til---~pwp~~Gg~vtgv~~~-~~~~~~~~a~~G~~L~L---TKP-LGtqv~~~ 199 (372) T TIGR00476 134 IKGFKDACREAKTALVGGH------TIL---NPWPVFGGAVTGVCPE-EEVITKSGAQVGDVLIL---TKP-LGTQVLVA 199 (372) T ss_pred HHHHHHHHHHCCEEEECCE------EEC---CCCCCCCCEEEEECCC-CCEECCCCCCCCCEEEE---ECC-CHHHHHHH T ss_conf 7658888876890797362------112---7620005406310278-84752588742416785---175-16899998 Q ss_pred HHHC----CCC-----CC-CCCCCHHHHHH------HH-------HHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHCCC Q ss_conf 8611----678-----99-96479899999------99-------99999999598069946486579999999984589 Q gi|254780970|r 590 ECAS----SNI-----GP-PPKVDCHLEKN------HG-------FFVLSMINAKKITACHDISTGGLIITLAEMTISSA 646 (737) Q Consensus 590 ~~~~----~~~-----g~-~p~~d~~~~k~------~~-------~~v~~li~~g~I~S~HDiSdGGL~~aL~EMa~as~ 646 (737) .... ... +- -++.|+...++ +. ..-.+++....+.|+.||+..||++|+.|||-.++ T Consensus 200 ~~~~~~~~~~~nkik~~i~~ee~~~~~~~A~e~M~~~Nryalkal~~A~~~~~~~~~~A~TDvTGFGlLGH~~~~ak~~~ 279 (372) T TIGR00476 200 VEKKLDIFKDLNKIKLSILKEERELIINEAIENMARLNRYALKALRKAAELMNSKDAKALTDVTGFGLLGHAQEMAKNSN 279 (372) T ss_pred HHHCCCCHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCHHHHHHHHHHHCC T ss_conf 76311200110414678607527889989999999888999999999887664418715311456545688999874078 Q ss_pred CCEEEEE---CCCCCC------------------------------CHHHHHHCC--CCCCEEEEECHHHHHHHHH-HHH Q ss_conf 7089980---566689------------------------------878887298--8321899996880998999-998 Q gi|254780970|r 647 KGMDIIL---PIEIEK------------------------------DPKPFLFGE--DQGRYVVCISPENQDLVMS-EAN 690 (737) Q Consensus 647 ~G~~I~L---~~~~~~------------------------------d~~~~LFsE--s~Gr~ii~V~~~~~~~~~~-~~~ 690 (737) +.++|+. |..... +....||++ ..+.|+++|.++..+.+.+ ... T Consensus 280 ~~~~~~~e~lP~l~~~~~~~k~~~~~~G~~~g~~~~~~G~L~~~~~~~~~~ll~~~~~S~~y~~~~~~~~~~~v~~G~~~ 359 (372) T TIGR00476 280 VEARIVYEKLPVLKKVEELSKLKGVPLGAIEGLETAGSGLLLSELPREKAALLCDPVKSGGYLFEVQKEVKEKVEKGTAK 359 (372) T ss_pred CEEEEEEECCCCCCCCHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCHHHHHHCCCHHCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 66999953088735457889863243312135421267422135707898642610105135776668885111010134 Q ss_pred HCCCCEEEEEEE Q ss_conf 579849998998 Q gi|254780970|r 691 NKNIPLRYLGKV 702 (737) Q Consensus 691 ~~gi~~~~IG~V 702 (737) +.+|....|||+ T Consensus 360 ~~~v~~~~IgE~ 371 (372) T TIGR00476 360 EKEVELIEIGEV 371 (372) T ss_pred HHHCCEEEEEEE T ss_conf 330450254310 No 54 >PRK05385 phosphoribosylaminoimidazole synthetase; Provisional Probab=99.79 E-value=1.7e-17 Score=138.83 Aligned_cols=265 Identities=20% Similarity=0.334 Sum_probs=153.8 Q ss_pred CCCCEEECCCCCEEEEEECCCEEEEEEECCCCCCCCCCCCCCCCCCCC---------------------CCCCCCCCCCC Q ss_conf 697489933477368983698399999517887776588887210558---------------------10225767787 Q gi|254780970|r 63 TGKHVIQGPGENAGVVDIGGGDCVVFKMESHNHPSYIEPYQGAATGVG---------------------GILRDIFTMGA 121 (737) Q Consensus 63 ~~~~vi~~~~~~a~vi~~~~~~~~~~k~EtHNhPs~i~P~~GAaTg~g---------------------G~iRDi~~~Ga 121 (737) ..+.|+.+.|.=||.+++..++ .+|--+.- .-||| =.++|++|+|| T Consensus 30 ~~~~v~~~iGgF~gl~~~~~~~---------~~p~Lv~~----tDGVGTK~~iA~~~~~~~tiG~DlVam~vNDil~~GA 96 (338) T PRK05385 30 FRPEVLGGLGGFGGLFDLPAGY---------KEPVLVSG----TDGVGTKLKLAIDLGKHDTIGIDLVAMCVNDLLVQGA 96 (338) T ss_pred CCCCCCCCCCCCCEEEECCCCC---------CCCEEEEC----CCCCCHHHHHHHHHCCCCCHHHHHHHHCCCCEEECCC T ss_conf 7977546876620445554468---------99679971----8887189999999576143306789752777666286 Q ss_pred EEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEECC-CCCC-CCCCEEEECCCCHHHHHHCCCCC Q ss_conf 436877504578756863888998887458866543087211320355215-6766-42211221121135543112256 Q gi|254780970|r 122 RPVAAMNSLRFGAIHHPKTKHLLSGVVAGIAGYSNSFGVPTVGGEVEFLPC-YNNN-IIVNTFAAGIAKTNAIFSSKARG 199 (737) Q Consensus 122 ~p~a~~~~l~~g~~~~p~~~~~~~~~~~G~~~ygN~~G~P~v~G~~~~~~~-y~~~-~~v~~~~~Gi~~~~~~~~~~~~~ 199 (737) +|++.+|.+-.|.++.. ...++++|++..|.+.|.+.+||+|.-.|+ |..+ .-.-.+|+|++++++++..+..+ T Consensus 97 ~Pl~flDY~a~~~l~~~----~~~~iv~Gi~~~c~~~~~~LiGGETAemPg~y~~~~~DLaG~~VGivek~~ii~~~~i~ 172 (338) T PRK05385 97 EPLFFLDYIATGKLDPE----VAAEVVKGIAEGCEQAGCALIGGETAEMPGMYHEGDYDLAGFAVGVVEKDEIIDGSKVK 172 (338) T ss_pred EEEEEEEEECCCCCCHH----HHHHHHHHHHHHHHHCCCEEECCCCCCCCCCCCCCCEEEEEEEEEEEECCCCCCCCCCC T ss_conf 73234555120788989----99999999999999809804365210133435689773633799998142234566689 Q ss_pred CCCEEEEECCCCCCCCCHHHHHH--HHHHCCC---CCCCCCCCC--CCCHHHH--HHHHHHHHHHCCCCCCHHCCCCCCC Q ss_conf 89779996576564441046666--7772865---333444332--6798999--8889998740113830110347752 Q gi|254780970|r 200 IGLPLVYLGAKTGRDGIGGASMA--SEEFGEN---IAKKRPTVQ--VGDPFTG--KCLLEACLELMNTDAVIAIQDMGAA 270 (737) Q Consensus 200 ~Gd~ii~~G~~tg~dGi~Ga~~s--S~~~~~~---~~~~~~~VQ--~gdp~~e--k~~~~~~~~~~~~~~i~~~~D~gaG 270 (737) +||.||-+-+ .|+|-.-|| -+.+.+. ..+..+.-+ .++..++ |.-.+.++++.+++.|+++-.++.| T Consensus 173 ~GDviiGl~S----sG~HsNGySLiRki~~~~~~~~~~~~~~~~~tlge~LL~PTriY~~~i~~l~~~~~Ikg~aHITGG 248 (338) T PRK05385 173 EGDVLIGLAS----SGLHSNGYSLVRKILEVAGLDLDDTLPELGKTLGEELLEPTRIYVKPVLALLKEGDVKGMAHITGG 248 (338) T ss_pred CCCEEEEEEC----CCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHCCHHHHHHHHHHHHHHHCCEEEEEEECCC T ss_conf 9988999607----986655078999999874998232565346869998635067767999999862781399982588 Q ss_pred CHHHHHHHHHCCCCCCEEEEEHHHCCC-------CC-CCCHHHHHHCC-CCCEEEEEEECCCHHHHHHHHHHHCCEEECC Q ss_conf 011122332014775289962022046-------67-89998986178-8850899995467588999888709713112 Q gi|254780970|r 271 GLTCSAIEMGNQGNLGITLDLDKVPTC-------EE-GMTAYEMMLSE-SQERMLMILNPEKQHKAQEILNKWGLHFSII 341 (737) Q Consensus 271 Gl~~a~~Ema~~~~~G~~i~l~~vp~~-------~~-~l~~~ei~~sE-sQeR~~~~v~~~~~~~~~~i~~~~~~~~~~i 341 (737) ||.-.++-+.. .+++++|+++..|.- +. +++..|.+=-= --=.|+++|+|++.+++.+++++.+.++.+| T Consensus 249 Gl~~Nl~Rilp-~~l~a~Id~~~w~~P~iF~~i~~~G~i~~~EM~rtFNmGIGmvliV~~~~~~~v~~~l~~~g~~a~~I 327 (338) T PRK05385 249 GFIENLPRVLP-EGLGAEIDKGSWPVPPIFKWLQKAGNVEEEEMYRTFNMGIGMVLIVPEEDADAALALLEARGEDAWVI 327 (338) T ss_pred CCCCCCCCCCC-CCCEEEEECCCCCCCHHHHHHHHHCCCCHHHHHHHCCCCCCEEEEECHHHHHHHHHHHHHCCCCCEEE T ss_conf 50036541157-88069996677999889999998489899999874358222699983899999999999769984898 Q ss_pred CCEEECCC Q ss_conf 21210443 Q gi|254780970|r 342 GITTNDKL 349 (737) Q Consensus 342 G~vt~~~~ 349 (737) |+|++..+ T Consensus 328 G~V~~g~~ 335 (338) T PRK05385 328 GEVKEGEE 335 (338) T ss_pred EEEEECCC T ss_conf 99997986 No 55 >TIGR01735 FGAM_synt phosphoribosylformylglycinamidine synthase; InterPro: IPR010073 This entry represents a single-molecule form of phosphoribosylformylglycinamidine synthase, also called FGAM synthase, an enzyme of purine de novo biosynthesis. This form is found mostly in eukaryotes and proteobacteria. In Bacillus subtilis PurL (FGAM synthase II) and PurQ (FGAM synthase I) perform the equivalent function; the unrelated small protein PurS is also required and may be a third subunit .; GO: 0004642 phosphoribosylformylglycinamidine synthase activity, 0006189 'de novo' IMP biosynthetic process. Probab=99.76 E-value=5.7e-18 Score=142.14 Aligned_cols=385 Identities=17% Similarity=0.222 Sum_probs=257.1 Q ss_pred EEHHHCCCCCCCCHHHHHHCCCCCEEEEEEECCCHHHHHHHH-HH-----HCCEEECCCCEEECCCCCCCCCCCCCCCCC Q ss_conf 620220466789998986178885089999546758899988-87-----097131122121044310002586443443 Q gi|254780970|r 290 DLDKVPTCEEGMTAYEMMLSESQERMLMILNPEKQHKAQEIL-NK-----WGLHFSIIGITTNDKLFRVIHRGEEVANLP 363 (737) Q Consensus 290 ~l~~vp~~~~~l~~~ei~~sEsQeR~~~~v~~~~~~~~~~i~-~~-----~~~~~~~iG~vt~~~~~~v~~~g~~v~dlp 363 (737) .|..|++=+.|=.++|-.= -+|-|+...+++|-+.+.| .. .+||..-+.+-..+.--+=.|+++.++|=. T Consensus 171 ~l~~v~vl~~GR~ALE~AN----~elGLAL~~de~DYl~~~fr~~L~RnP~dVEL~mfAQaNSEHcRHkiFna~~~idG~ 246 (1401) T TIGR01735 171 KLTTVDVLGGGRKALEKAN----QELGLALDEDEIDYLTKRFRQELKRNPSDVELMMFAQANSEHCRHKIFNADWIIDGK 246 (1401) T ss_pred CCCCCCHHHHHHHHHHHHH----HHHCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCHHHHHHHHCCCEEECCC T ss_conf 7750011244489999998----985304575688999999999837898724688775303101000311253377881 Q ss_pred -C-CCCCCCCCEEEEEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCCCHHHHHHHCCCCCCCCCC--CCCCCCCCE Q ss_conf -2-101466501145553156555101234687255889999985073210103565311210001100--001357631 Q gi|254780970|r 364 -I-KALSDEAPEYDRAWCEPITIPPSNPQLLCSDEDCTKSLLKLLSSPNLSSRRWVYEQYDTMIQSNSI--QLPGGDAGV 439 (737) Q Consensus 364 -~-~~L~~~~P~~~r~~~~~~~~~~~~~~~~~~~~d~~~~l~~vL~~pnV~SK~~i~~qyD~~V~g~tv--~~Pg~DaaV 439 (737) + +-||. +- +.. .+...+..+..-.|=..+ +.|..| ..|-.+-.+ T Consensus 247 ~q~kSLF~----mI----k~T-~e~~~~~~v~aykDNssv-----------------------I~G~~v~r~rP~~~t~~ 294 (1401) T TIGR01735 247 KQDKSLFQ----MI----KST-HEANPENTVSAYKDNSSV-----------------------IEGHKVGRLRPDPPTRV 294 (1401) T ss_pred HHCCHHHH----HH----HHH-HHCCCCCEEEEEECCHHH-----------------------HCCCCEECCCCCCCCCC T ss_conf 21103789----88----988-733999608988311246-----------------------54742000078887660 Q ss_pred E-EECCCCCCEEEEECCCCCCCHHCCCHHHHHHHHHHHHHHHHCC--CCCHHHHHHHCCCCC------------C--CCC Q ss_conf 4-2037776227973045321000010011356789777641027--851022322102456------------6--686 Q gi|254780970|r 440 I-RVEGHETKALAFSSDVTPRYVKADPFEGTKQAVAECWRNIIAT--GAKPLAITDNLNFGN------------P--EKE 502 (737) Q Consensus 440 v-~~~~~~~~gia~s~g~~p~~~~~dP~~ga~~AV~Eal~Nl~a~--Ga~Plait~~lnf~~------------P--~~~ 502 (737) - +.......-|.|+..-..+-+.+.||-||...-+.=+||-=|. |++|.|-+--++++| | ..| T Consensus 295 ~y~~~~~~~~~i~~kvETHNhPTAiaPFpGA~TGaGGeIRDegATGRGa~pkAGl~Gf~VsNL~IPg~~~PWE~~S~~~P 374 (1401) T TIGR01735 295 EYRQHQEDDVHILMKVETHNHPTAIAPFPGASTGAGGEIRDEGATGRGAKPKAGLTGFCVSNLNIPGLEQPWEDPSFQKP 374 (1401) T ss_pred HHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCCCCC T ss_conf 24443126721899851378886467767888787873643664488771433345610202517788878875421365 Q ss_pred CHHHH----HHHHHHHHHHHHHHCCCCCCCCC-------CCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCC Q ss_conf 21346----99999999852210288665443-------20011236866676311122223335202331021438980 Q gi|254780970|r 503 EIMGQ----FVHSVKGIREACQILDFPIVSGN-------VSFYNETNGQSIFPTPTIAGVGILPDYSLMTRIDSAHEGDL 571 (737) Q Consensus 503 e~~~q----l~~av~gl~d~c~~lgipivgGk-------vSl~n~t~~~~I~pTpvi~~vG~v~d~~~~it~~~k~~Gd~ 571 (737) +.++. +.++=-|-+++-++||=|.+-|= ++...+-.-.-.+|=...+++|.++. ..++-+--.+|.. T Consensus 375 ~~~aspLdImieap~G~aafnNeFGrP~L~GYfRtfe~~~~~P~~~~R~y~KPIMlaGGiG~I~~--~hi~K~e~~~G~~ 452 (1401) T TIGR01735 375 ERIASPLDIMIEAPLGAAAFNNEFGRPALLGYFRTFELKVSLPGGEVRGYHKPIMLAGGIGSIDA--EHIQKGEIEVGAL 452 (1401) T ss_pred CCCCCHHHHHHHCCCCHHHHCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHCCHHH--HCCCCCCCCCCCE T ss_conf 22457678764067652432776577302223320333211677554443351244310100012--1210578788868 Q ss_pred EEECCCCCCC--CCHHHHHH-HHHC-----CCCCCCCCCCHHHHHHHHHHHHHHHHCCC---EEEEEECCCCHHHHHHHH Q ss_conf 7513776655--43267888-8611-----67899964798999999999999999598---069946486579999999 Q gi|254780970|r 572 ILMIGNDGCH--LDCSMYSL-ECAS-----SNIGPPPKVDCHLEKNHGFFVLSMINAKK---ITACHDISTGGLIITLAE 640 (737) Q Consensus 572 I~liG~~~~~--LggS~~~~-~~~~-----~~~g~~p~~d~~~~k~~~~~v~~li~~g~---I~S~HDiSdGGL~~aL~E 640 (737) |+.+|.+... |||..-+- +..+ ...+.+.+-|++.+++.++.|..-++-|. |.|.||+.-|||--++-| T Consensus 453 ~i~lGGPam~IGlGGGAASSvm~~G~n~~~LDF~sVQR~nPEMerr~qeVId~C~~lG~~NPI~siHDvGAGGlsNa~pE 532 (1401) T TIGR01735 453 LIVLGGPAMLIGLGGGAASSVMVSGTNDADLDFASVQRGNPEMERRCQEVIDRCVQLGEKNPIISIHDVGAGGLSNALPE 532 (1401) T ss_pred EEEECCCEEECCCCCCCHHCEEECCCCCHHHCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEECCCCCCCCCCCHHH T ss_conf 99857862254565410000010578815524333577886887688899999985879868467712575720010010 Q ss_pred HHHCCCCCEEEEECC----CCCCCHHHHHHCCCCCCEEEEECHHHHHHHHHHHHHCCCCEEEEEEECCC-EEEEECC Q ss_conf 984589708998056----66898788872988321899996880998999998579849998998798-6898074 Q gi|254780970|r 641 MTISSAKGMDIILPI----EIEKDPKPFLFGEDQGRYVVCISPENQDLVMSEANNKNIPLRYLGKVSGS-TLSIHNI 712 (737) Q Consensus 641 Ma~as~~G~~I~L~~----~~~~d~~~~LFsEs~Gr~ii~V~~~~~~~~~~~~~~~gi~~~~IG~Vt~~-~l~i~~~ 712 (737) +.--++.|+.|+|.. .....|++++.+|+|.|+++.|.+++.+.|..+|+..-+|+.+||++|++ .+++.+. T Consensus 533 L~~d~~~Ga~i~lr~v~~~d~~lspLEIWcnEsQEryvL~v~~~~l~~f~~iC~RERcPfAvvG~at~~~~l~l~D~ 609 (1401) T TIGR01735 533 LINDGGRGAVIDLRAVPLDDPGLSPLEIWCNESQERYVLLVKAEDLEIFTAICERERCPFAVVGTATGDRRLILVDD 609 (1401) T ss_pred HCCCCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCEEEEEEEECCCEEEEEEC T ss_conf 00614478558884652588887767663004555577632720079999885640698378987866824899737 No 56 >TIGR00878 purM phosphoribosylformylglycinamidine cyclo-ligase; InterPro: IPR004733 The purine biosynthetic pathway in procaryotes enlists eleven enzymes, six of which use ATP. Enzymes 5 and 6 of this pathway, formylglycinamide ribonucleotide (FGAR) amidotransferase (PurL) and aminoimidazole ribonucleotide (AIR) synthetase (PurM) utilise ATP to activate the oxygen of an amide within their substrate toward nucleophilic attack by a nitrogen. AIR synthetase uses the product of PurL, formylglycinamidine ribonucleotide (FGAM) and ATP to make AIR, ADP and P(i).; GO: 0004641 phosphoribosylformylglycinamidine cyclo-ligase activity, 0006189 'de novo' IMP biosynthetic process, 0005737 cytoplasm. Probab=99.76 E-value=4.5e-18 Score=142.84 Aligned_cols=247 Identities=23% Similarity=0.264 Sum_probs=134.2 Q ss_pred CCCCEEEECC-CCCCEEEEECCCCCCCHHCCCHHHHHHHHHHHH---------------HHHHCCCCCHHHHHHHCCCCC Q ss_conf 5763142037-776227973045321000010011356789777---------------641027851022322102456 Q gi|254780970|r 435 GDAGVIRVEG-HETKALAFSSDVTPRYVKADPFEGTKQAVAECW---------------RNIIATGAKPLAITDNLNFGN 498 (737) Q Consensus 435 ~DaaVv~~~~-~~~~gia~s~g~~p~~~~~dP~~ga~~AV~Eal---------------~Nl~a~Ga~Plait~~lnf~~ 498 (737) +=||++++.. .....|++|+|. -|+++.||+++ .||+|.||+|+++.|-+.-+. T Consensus 43 gFagl~~l~~~Y~~~~L~~~TDG----------VGTKl~vA~~~~khDTiGIDlVAMnVNDl~~~GAEPl~flDYlAvGk 112 (338) T TIGR00878 43 GFAGLFDLGDKYKEPVLVSGTDG----------VGTKLKVAEAMDKHDTIGIDLVAMNVNDLLVQGAEPLFFLDYLAVGK 112 (338) T ss_pred CCEEEECCCCCCCCEEEEEECCC----------HHHHHHHHHHCCCCCCEEEEEEEEECCCEEEECCCHHHHHHHHHHCC T ss_conf 52135115224752389984286----------66899999972889833012488532868884777578764666068 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCC--CC---CCCCCCCCCCCCCEEEEECCCCCCCCCEE Q ss_conf 66862134699999999852210288665443200112368666--76---31112222333520233102143898075 Q gi|254780970|r 499 PEKEEIMGQFVHSVKGIREACQILDFPIVSGNVSFYNETNGQSI--FP---TPTIAGVGILPDYSLMTRIDSAHEGDLIL 573 (737) Q Consensus 499 P~~~e~~~ql~~av~gl~d~c~~lgipivgGkvSl~n~t~~~~I--~p---Tpvi~~vG~v~d~~~~it~~~k~~Gd~I~ 573 (737) |+ ++ .+.+.++|+.+-|+.-|+.++|| ||-.-+- ++ =+--+++|.|+ .+.++|...-++||+|+ T Consensus 113 ~d-~~---~~~~i~~G~~eGC~~ag~aLvGG------ETAemPG~y~~g~YDlAGt~VG~Ve-kd~iitG~~v~~GDv~i 181 (338) T TIGR00878 113 LD-PE---VASQIVKGIAEGCKQAGCALVGG------ETAEMPGLYRGGEYDLAGTAVGVVE-KDEIITGEKVKPGDVLI 181 (338) T ss_pred CC-HH---HHHHHHHHHHHHHHHHCCEEECC------CCCCCCCCCCCCCEEECCEEEEEEE-ECCCCCCCCCCCCCEEE T ss_conf 88-89---99999999998888717325447------5225778888865000131899996-21278823206787899 Q ss_pred ECCCCCCCCCH-HHHHHHHHCCCCCCCCCCCHHHHH-----------HHHHHHHHHHHCCC-EEEEEECCCCHHHHHHHH Q ss_conf 13776655432-678888611678999647989999-----------99999999999598-069946486579999999 Q gi|254780970|r 574 MIGNDGCHLDC-SMYSLECASSNIGPPPKVDCHLEK-----------NHGFFVLSMINAKK-ITACHDISTGGLIITLAE 640 (737) Q Consensus 574 liG~~~~~Lgg-S~~~~~~~~~~~g~~p~~d~~~~k-----------~~~~~v~~li~~g~-I~S~HDiSdGGL~~aL~E 640 (737) =+++++.|=.| |+.-+.+.....-..+.-....-| =..+-+++++++.. |+..=-|+.|||.-.|.+ T Consensus 182 gl~SSG~HSNG~SLvRKvl~~~~~~~~~~~~~~~gk~lgE~LL~PTRIYVKpiL~li~~~~ev~GlAHiTGGGl~eNl~r 261 (338) T TIGR00878 182 GLGSSGIHSNGLSLVRKVLEDNAALDYEDTVEEEGKTLGEELLEPTRIYVKPILALIKSEKEVHGLAHITGGGLLENLLR 261 (338) T ss_pred EECCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHCCCCCCCHHHHHHHHHCCCCEEEEEEECCCHHHHHHHH T ss_conf 83178731122677656664303588777457567231442238962044789999737786047775516403456998 Q ss_pred HHHCCCCCEEEEECCCCCCCHHHHH--------------HCCCCCCEEEEECHHHHHHHHHHHHHCCCCEEEEEEECC Q ss_conf 9845897089980566689878887--------------298832189999688099899999857984999899879 Q gi|254780970|r 641 MTISSAKGMDIILPIEIEKDPKPFL--------------FGEDQGRYVVCISPENQDLVMSEANNKNIPLRYLGKVSG 704 (737) Q Consensus 641 Ma~as~~G~~I~L~~~~~~d~~~~L--------------FsEs~Gr~ii~V~~~~~~~~~~~~~~~gi~~~~IG~Vt~ 704 (737) + +..++.+.||-+..+......+| ||.--| |++.|++|+.+++++.+.+++..+++||+|.. T Consensus 262 ~-L~~~~~a~~D~~~~~~~piF~~i~~~G~V~~~EM~RTFNMGVG-f~viv~~e~~~~al~~l~~~g~~a~~iG~V~~ 337 (338) T TIGR00878 262 R-LPDGLKAVIDKGSWPQPPIFKWIQELGNVEEEEMFRTFNMGVG-FVVIVPEEEVDKALALLEEKGEKAWVIGEVIK 337 (338) T ss_pred H-CCCCCEEEEECCCCCHHHHHHHHHHCCCCCHHHHHHHCCCCCC-EEEEECCHHHHHHHHHHHHCCCEEEEECEEEE T ss_conf 6-5899789985689883168899986089875555431277510-69981816589999986306960344134750 No 57 >TIGR00878 purM phosphoribosylformylglycinamidine cyclo-ligase; InterPro: IPR004733 The purine biosynthetic pathway in procaryotes enlists eleven enzymes, six of which use ATP. Enzymes 5 and 6 of this pathway, formylglycinamide ribonucleotide (FGAR) amidotransferase (PurL) and aminoimidazole ribonucleotide (AIR) synthetase (PurM) utilise ATP to activate the oxygen of an amide within their substrate toward nucleophilic attack by a nitrogen. AIR synthetase uses the product of PurL, formylglycinamidine ribonucleotide (FGAM) and ATP to make AIR, ADP and P(i).; GO: 0004641 phosphoribosylformylglycinamidine cyclo-ligase activity, 0006189 'de novo' IMP biosynthetic process, 0005737 cytoplasm. Probab=99.76 E-value=1.6e-16 Score=132.33 Aligned_cols=261 Identities=26% Similarity=0.377 Sum_probs=190.1 Q ss_pred HHCCCCEEECCCCCEEEEEECCCE---EEEEEECCCCCCCCCCCCCCCCCCCC---------------CC------CCCC Q ss_conf 206974899334773689836983---99999517887776588887210558---------------10------2257 Q gi|254780970|r 61 PTTGKHVIQGPGENAGVVDIGGGD---CVVFKMESHNHPSYIEPYQGAATGVG---------------GI------LRDI 116 (737) Q Consensus 61 ~~~~~~vi~~~~~~a~vi~~~~~~---~~~~k~EtHNhPs~i~P~~GAaTg~g---------------G~------iRDi 116 (737) .+..+.+..+.|-=||.+++.+.| .++. +.-||| |+ ++|+ T Consensus 31 ~~~~~~~~~~~GgFagl~~l~~~Y~~~~L~~----------------~TDGVGTKl~vA~~~~khDTiGIDlVAMnVNDl 94 (338) T TIGR00878 31 KTRRPGVMGGLGGFAGLFDLGDKYKEPVLVS----------------GTDGVGTKLKVAEAMDKHDTIGIDLVAMNVNDL 94 (338) T ss_pred HHCCCCCCCCCCCCEEEECCCCCCCCEEEEE----------------ECCCHHHHHHHHHHCCCCCCEEEEEEEEECCCE T ss_conf 3124666545775213511522475238998----------------428666899999972889833012488532868 Q ss_pred CCCCCEEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEECCCCCC--CCCCEEEECCCCHHHHHH Q ss_conf 677874368775045787568638889988874588665430872113203552156766--422112211211355431 Q gi|254780970|r 117 FTMGARPVAAMNSLRFGAIHHPKTKHLLSGVVAGIAGYSNSFGVPTVGGEVEFLPCYNNN--IIVNTFAAGIAKTNAIFS 194 (737) Q Consensus 117 ~~~Ga~p~a~~~~l~~g~~~~p~~~~~~~~~~~G~~~ygN~~G~P~v~G~~~~~~~y~~~--~~v~~~~~Gi~~~~~~~~ 194 (737) +|.||+|++++|.|-.|.|+ .+++.++++|++..|.+-|+..|||+|...|+.-.+ .-..++|||+++++++++ T Consensus 95 ~~~GAEPl~flDYlAvGk~d----~~~~~~i~~G~~eGC~~ag~aLvGGETAemPG~y~~g~YDlAGt~VG~Vekd~iit 170 (338) T TIGR00878 95 LVQGAEPLFFLDYLAVGKLD----PEVASQIVKGIAEGCKQAGCALVGGETAEMPGLYRGGEYDLAGTAVGVVEKDEIIT 170 (338) T ss_pred EEECCCHHHHHHHHHHCCCC----HHHHHHHHHHHHHHHHHHCCEEECCCCCCCCCCCCCCCEEECCEEEEEEEECCCCC T ss_conf 88477757876466606888----89999999999988887173254475225778888865000131899996212788 Q ss_pred CCCCCCCCEEEEECCCCCCCCCHH--HHHHHHHHCCCCCCCCC-CCC-CCCHHHHHHH------HHHHHHHCCCCC-CHH Q ss_conf 122568977999657656444104--66667772865333444-332-6798999888------999874011383-011 Q gi|254780970|r 195 SKARGIGLPLVYLGAKTGRDGIGG--ASMASEEFGENIAKKRP-TVQ-VGDPFTGKCL------LEACLELMNTDA-VIA 263 (737) Q Consensus 195 ~~~~~~Gd~ii~~G~~tg~dGi~G--a~~sS~~~~~~~~~~~~-~VQ-~gdp~~ek~~------~~~~~~~~~~~~-i~~ 263 (737) .+.-++||.||=+++ .|+|- =|.+-+.+.++...+.+ .+. -|-+..|..| .+-|++++++.. |++ T Consensus 171 G~~v~~GDv~igl~S----SG~HSNG~SLvRKvl~~~~~~~~~~~~~~~gk~lgE~LL~PTRIYVKpiL~li~~~~ev~G 246 (338) T TIGR00878 171 GEKVKPGDVLIGLGS----SGIHSNGLSLVRKVLEDNAALDYEDTVEEEGKTLGEELLEPTRIYVKPILALIKSEKEVHG 246 (338) T ss_pred CCCCCCCCEEEEECC----CCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHCCCCCCCHHHHHHHHHCCCCEEE T ss_conf 232067878998317----8731122677656664303588777457567231442238962044789999737786047 Q ss_pred CCCCCCCCHHHHHHHHHCCCCCCEEEEEHHHCCC-------CCC-CCHHHHH--HCCCCCEEEEEEECCCHHHHHHHHHH Q ss_conf 0347752011122332014775289962022046-------678-9998986--17888508999954675889998887 Q gi|254780970|r 264 IQDMGAAGLTCSAIEMGNQGNLGITLDLDKVPTC-------EEG-MTAYEMM--LSESQERMLMILNPEKQHKAQEILNK 333 (737) Q Consensus 264 ~~D~gaGGl~~a~~Ema~~~~~G~~i~l~~vp~~-------~~~-l~~~ei~--~sEsQeR~~~~v~~~~~~~~~~i~~~ 333 (737) +=.++.|||...+.-+..+ ++.++||-...|.. +.| ..-.|++ ||=- ==|+++|++|+.++.++++++ T Consensus 247 lAHiTGGGl~eNl~r~L~~-~~~a~~D~~~~~~~piF~~i~~~G~V~~~EM~RTFNMG-VGf~viv~~e~~~~al~~l~~ 324 (338) T TIGR00878 247 LAHITGGGLLENLLRRLPD-GLKAVIDKGSWPQPPIFKWIQELGNVEEEEMFRTFNMG-VGFVVIVPEEEVDKALALLEE 324 (338) T ss_pred EEEECCCHHHHHHHHHCCC-CCEEEEECCCCCHHHHHHHHHHCCCCCHHHHHHHCCCC-CCEEEEECCHHHHHHHHHHHH T ss_conf 7755164034569986589-97899856898831688999860898755554312775-106998181658999998630 Q ss_pred HCCEEECCCCEEEC Q ss_conf 09713112212104 Q gi|254780970|r 334 WGLHFSIIGITTND 347 (737) Q Consensus 334 ~~~~~~~iG~vt~~ 347 (737) ++..+-+||+|.+. T Consensus 325 ~g~~a~~iG~V~~~ 338 (338) T TIGR00878 325 KGEKAWVIGEVIKG 338 (338) T ss_pred CCCEEEEECEEEEC T ss_conf 69603441347509 No 58 >COG1973 HypE Hydrogenase maturation factor [Posttranslational modification, protein turnover, chaperones] Probab=99.74 E-value=2.6e-17 Score=137.69 Aligned_cols=265 Identities=22% Similarity=0.353 Sum_probs=183.4 Q ss_pred ECCCCCEEEEEECCCEEEEEEECCCCCCCCCCCCCCC--CCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCHHHHHHH Q ss_conf 9334773689836983999995178877765888872--10558102257677874368775045787568638889988 Q gi|254780970|r 69 QGPGENAGVVDIGGGDCVVFKMESHNHPSYIEPYQGA--ATGVGGILRDIFTMGARPVAAMNSLRFGAIHHPKTKHLLSG 146 (737) Q Consensus 69 ~~~~~~a~vi~~~~~~~~~~k~EtHNhPs~i~P~~GA--aTg~gG~iRDi~~~Ga~p~a~~~~l~~g~~~~p~~~~~~~~ 146 (737) ..--||||+++++++|.++.---.|-.-|. =||=+- +|- -.+||+..|||+|+|+..-+... ++.+.-.+|+ T Consensus 108 ~ee~DDaGvvr~~~~yivvaiDGiHSRLSe-fPFLaGFhvtR--AalRDv~vmGa~p~alisDiHla--DDgDVgklfD- 181 (449) T COG1973 108 PEELDDAGVVRIENDYIVVAIDGIHSRLSE-FPFLAGFHVTR--AALRDVYVMGARPVALISDIHLA--DDGDVGKLFD- 181 (449) T ss_pred HHHCCCCCEEEECCCEEEEEECCHHHHHHC-CCCHHHHHHHH--HHHHHHHHCCCCCCEEEEEEEEC--CCCCHHHHHH- T ss_conf 878477772773586699994133433112-75122468779--99988987276430335655633--6754656301- Q ss_pred HHHHHHHHHHHCCCCCCCCEEEEEC--CCCCCCCC-CEEEECCCCHHHHHHCCCCCCCCEEEEECCCCCCCCCHHHHHHH Q ss_conf 8745886654308721132035521--56766422-11221121135543112256897799965765644410466667 Q gi|254780970|r 147 VVAGIAGYSNSFGVPTVGGEVEFLP--CYNNNIIV-NTFAAGIAKTNAIFSSKARGIGLPLVYLGAKTGRDGIGGASMAS 223 (737) Q Consensus 147 ~~~G~~~ygN~~G~P~v~G~~~~~~--~y~~~~~v-~~~~~Gi~~~~~~~~~~~~~~Gd~ii~~G~~tg~dGi~Ga~~sS 223 (737) -.+|+..-+...|+|.++|.|.--- ..-+..++ .++|+|+.+ ..+...+..+|||+|++. .|-||++.+. T Consensus 182 f~AGvtaVsea~~vPl~aGSTLRiGGDmVlGdRl~~~Vg~iGi~k-~~~t~Rr~~~~Gd~IlmT------EGaGGGTi~T 254 (449) T COG1973 182 FTAGVTAVSEAVGVPLLAGSTLRIGGDMVLGDRLVGAVGAIGISK-SELTPRRRAEPGDVILMT------EGAGGGTIAT 254 (449) T ss_pred HHHHHHHHHHHHCCCEECCCCEEECCEEEEECHHHCCCCEEEEEC-CCCCHHCCCCCCCEEEEE------CCCCCCEEEH T ss_conf 221265788874885433652577472676040103210256505-888701038888779994------2788855331 Q ss_pred HHHCCCCCCCCCCCCCCCHHHHH---HHHHHHHHHCCCCC---CHHCCCCCCCCHHHHHHHHHCCCCCCEEEEEHHHCCC Q ss_conf 77286533344433267989998---88999874011383---0110347752011122332014775289962022046 Q gi|254780970|r 224 EEFGENIAKKRPTVQVGDPFTGK---CLLEACLELMNTDA---VIAIQDMGAAGLTCSAIEMGNQGNLGITLDLDKVPTC 297 (737) Q Consensus 224 ~~~~~~~~~~~~~VQ~gdp~~ek---~~~~~~~~~~~~~~---i~~~~D~gaGGl~~a~~Ema~~~~~G~~i~l~~vp~~ 297 (737) ....-...+ | ..|- ..+++|..+.+.+| |++|+|++.||+-.-.-|.+..+++++.+|.+|+-.- T Consensus 255 TAiY~Gm~D----V-----v~ETlnv~fi~a~eai~~s~L~~~vhsMTDVTNGGiRgDA~EISkta~v~lvfDeekv~sl 325 (449) T COG1973 255 TAIYHGMHD----V-----VEETLNVDFIRACEAIVRSGLLSDVHSMTDVTNGGIRGDALEISKTAGVSLVFDEEKVRSL 325 (449) T ss_pred HHHHCCCHH----H-----HHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHCCEEEEECHHHHHHH T ss_conf 444516278----8-----8876274899999999985056653454305477614437876553272699718999864 Q ss_pred ----------CCCCCHHHHHHCCCCCEEEEEEECCCHHHHHHHHHHHCCEEECCCCEEECCCCCC-CCCCCCC Q ss_conf ----------6789998986178885089999546758899988870971311221210443100-0258644 Q gi|254780970|r 298 ----------EEGMTAYEMMLSESQERMLMILNPEKQHKAQEILNKWGLHFSIIGITTNDKLFRV-IHRGEEV 359 (737) Q Consensus 298 ----------~~~l~~~ei~~sEsQeR~~~~v~~~~~~~~~~i~~~~~~~~~~iG~vt~~~~~~v-~~~g~~v 359 (737) +-.++|+- =|-.-.++.++++..+.+.+..+..++.+.+||.|++.+...+ .++|++. T Consensus 326 iNpkVl~MLeel~ID~lG----vSiDslmii~ped~a~~v~k~~~~agvr~~~vG~Vee~~~~~~l~~~gee~ 394 (449) T COG1973 326 INPKVLKMLEELNIDPLG----VSIDSLMIIAPEDVADLVKKALRGAGVRAEEVGRVEEGGQGVILVDGGEER 394 (449) T ss_pred CCHHHHHHHHHCCCCCCC----CCCCEEEEECCHHHHHHHHHHHHHCCCCEEEEEEEECCCCCEEEEECCEEE T ss_conf 098999999971888430----202017896661316999999874586236886663278421998799750 No 59 >PRK05385 phosphoribosylaminoimidazole synthetase; Provisional Probab=99.73 E-value=2.4e-16 Score=131.02 Aligned_cols=260 Identities=19% Similarity=0.268 Sum_probs=181.3 Q ss_pred CCCCCCEEEECC-CCCCEEEEECCCCCCCHHCC----CH-HHHHHHHHHHHHHHHCCCCCHHHHHHHCCCCCCCCCCHHH Q ss_conf 135763142037-77622797304532100001----00-1135678977764102785102232210245666862134 Q gi|254780970|r 433 PGGDAGVIRVEG-HETKALAFSSDVTPRYVKAD----PF-EGTKQAVAECWRNIIATGAKPLAITDNLNFGNPEKEEIMG 506 (737) Q Consensus 433 Pg~DaaVv~~~~-~~~~gia~s~g~~p~~~~~d----P~-~ga~~AV~Eal~Nl~a~Ga~Plait~~lnf~~P~~~e~~~ 506 (737) .|+=+++.++.. ..+..++.++|.-.....+. -| .-++=.|+-++.|++|.||+|++++|.+..+..+ ++. T Consensus 38 iGgF~gl~~~~~~~~~p~Lv~~tDGVGTK~~iA~~~~~~~tiG~DlVam~vNDil~~GA~Pl~flDY~a~~~l~-~~~-- 114 (338) T PRK05385 38 LGGFGGLFDLPAGYKEPVLVSGTDGVGTKLKLAIDLGKHDTIGIDLVAMCVNDLLVQGAEPLFFLDYIATGKLD-PEV-- 114 (338) T ss_pred CCCCCEEEECCCCCCCCEEEECCCCCCHHHHHHHHHCCCCCHHHHHHHHCCCCEEECCCEEEEEEEEECCCCCC-HHH-- T ss_conf 76620445554468996799718887189999999576143306789752777666286732345551207889-899-- Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCC-CCC----CCCCCCCCCCCCCEEEEECCCCCCCCCEEECCCCCCC Q ss_conf 69999999985221028866544320011236866-676----3111222233352023310214389807513776655 Q gi|254780970|r 507 QFVHSVKGIREACQILDFPIVSGNVSFYNETNGQS-IFP----TPTIAGVGILPDYSLMTRIDSAHEGDLILMIGNDGCH 581 (737) Q Consensus 507 ql~~av~gl~d~c~~lgipivgGkvSl~n~t~~~~-I~p----Tpvi~~vG~v~d~~~~it~~~k~~Gd~I~liG~~~~~ 581 (737) ..+.++|++++|++.+++++|| ||..-+ +.+ -+.-+++|.++ .++.++....++||.|+-+.+++.| T Consensus 115 -~~~iv~Gi~~~c~~~~~~LiGG------ETAemPg~y~~~~~DLaG~~VGive-k~~ii~~~~i~~GDviiGl~SsG~H 186 (338) T PRK05385 115 -AAEVVKGIAEGCEQAGCALIGG------ETAEMPGMYHEGDYDLAGFAVGVVE-KDEIIDGSKVKEGDVLIGLASSGLH 186 (338) T ss_pred -HHHHHHHHHHHHHHCCCEEECC------CCCCCCCCCCCCCEEEEEEEEEEEE-CCCCCCCCCCCCCCEEEEEECCCCC T ss_conf -9999999999999809804365------2101334356897736337999981-4223456668999889996079866 Q ss_pred CCHHHHHHHHH-CCC---CCCCCCCC-------HHHHHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHCCCCCEE Q ss_conf 43267888861-167---89996479-------89999999999999995980699464865799999999845897089 Q gi|254780970|r 582 LDCSMYSLECA-SSN---IGPPPKVD-------CHLEKNHGFFVLSMINAKKITACHDISTGGLIITLAEMTISSAKGMD 650 (737) Q Consensus 582 LggS~~~~~~~-~~~---~g~~p~~d-------~~~~k~~~~~v~~li~~g~I~S~HDiSdGGL~~aL~EMa~as~~G~~ 650 (737) -.|-.+.+.+. ... ....|..+ +..-+-....+++++++..|+++-.|+.|||...|..+. ..+++++ T Consensus 187 sNGySLiRki~~~~~~~~~~~~~~~~~tlge~LL~PTriY~~~i~~l~~~~~Ikg~aHITGGGl~~Nl~Ril-p~~l~a~ 265 (338) T PRK05385 187 SNGYSLVRKILEVAGLDLDDTLPELGKTLGEELLEPTRIYVKPVLALLKEGDVKGMAHITGGGFIENLPRVL-PEGLGAE 265 (338) T ss_pred CCHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCCCCC-CCCCEEE T ss_conf 550789999998749982325653468699986350677679999998627813999825885003654115-7880699 Q ss_pred EEECCCCCCCHHHHH--------------HCCCCCCEEEEECHHHHHHHHHHHHHCCCCEEEEEEECCC Q ss_conf 980566689878887--------------2988321899996880998999998579849998998798 Q gi|254780970|r 651 IILPIEIEKDPKPFL--------------FGEDQGRYVVCISPENQDLVMSEANNKNIPLRYLGKVSGS 705 (737) Q Consensus 651 I~L~~~~~~d~~~~L--------------FsEs~Gr~ii~V~~~~~~~~~~~~~~~gi~~~~IG~Vt~~ 705 (737) |+.+..+....+.+| |+-..| |++.|++++.+++++.+++++.++++||+|... T Consensus 266 Id~~~w~~P~iF~~i~~~G~i~~~EM~rtFNmGIG-mvliV~~~~~~~v~~~l~~~g~~a~~IG~V~~g 333 (338) T PRK05385 266 IDKGSWPVPPIFKWLQKAGNVEEEEMYRTFNMGIG-MVLIVPEEDADAALALLEARGEDAWVIGEVKEG 333 (338) T ss_pred EECCCCCCCHHHHHHHHHCCCCHHHHHHHCCCCCC-EEEEECHHHHHHHHHHHHHCCCCCEEEEEEEEC T ss_conf 96677999889999998489899999874358222-699983899999999999769984898999979 No 60 >COG0709 SelD Selenophosphate synthase [Amino acid transport and metabolism] Probab=99.72 E-value=1e-16 Score=133.57 Aligned_cols=249 Identities=20% Similarity=0.271 Sum_probs=173.2 Q ss_pred CCCCCCEEEECCCCCCEEEEECCCCCCCHHCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHCCCCCCC-CCCHHHHHHHH Q ss_conf 13576314203777622797304532100001001135678977764102785102232210245666-86213469999 Q gi|254780970|r 433 PGGDAGVIRVEGHETKALAFSSDVTPRYVKADPFEGTKQAVAECWRNIIATGAKPLAITDNLNFGNPE-KEEIMGQFVHS 511 (737) Q Consensus 433 Pg~DaaVv~~~~~~~~gia~s~g~~p~~~~~dP~~ga~~AV~Eal~Nl~a~Ga~Plait~~lnf~~P~-~~e~~~ql~~a 511 (737) -++|++|..+. .+.+++.|+|+.-.... |||.-++.|...+++|+.+||++|+....-++|++-. .++. +.+. T Consensus 48 ~~dda~vy~ig--~g~~iIsT~df~mpivd-dp~dfG~IaAtNaIsDv~amgg~pi~AiAI~g~~~~~l~~~i---~~ei 121 (346) T COG0709 48 TGDDAAVYDLG--NGHSIISTTDFFMPIVD-DPFDFGRIAATNAISDVFAMGGKPIMAIAILGFPNDLLDKEI---AQEV 121 (346) T ss_pred CCCCCEEEEEC--CCEEEEEECCEEEECCC-CCCHHHHHHHCCCCCCHHHCCCCCHHHHHHCCCCCCCCCHHH---HHHH T ss_conf 46762168846--98089983440224215-823220211112103466508961222211178422258899---9999 Q ss_pred HHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCCEEECCCCCCCCCHHHHHHHH Q ss_conf 99998522102886654432001123686667631112222333520233102143898075137766554326788886 Q gi|254780970|r 512 VKGIREACQILDFPIVSGNVSFYNETNGQSIFPTPTIAGVGILPDYSLMTRIDSAHEGDLILMIGNDGCHLDCSMYSLEC 591 (737) Q Consensus 512 v~gl~d~c~~lgipivgGkvSl~n~t~~~~I~pTpvi~~vG~v~d~~~~it~~~k~~Gd~I~liG~~~~~LggS~~~~~~ 591 (737) ++|-.+.|++-++++.||+.- + ...++. -..+.|.++ ..+..+.+..++|+.|+++. + +|-.+|.... T Consensus 122 ~~gg~~~~rea~i~lagGhsI---d-apEP~f---Glav~G~~~-~~k~~~n~~a~~G~~lilTK---P-lG~gil~ta~ 189 (346) T COG0709 122 IEGGRDACREAGIALAGGHSI---D-APEPIF---GLAVTGIVP-TGKVKRNSTAKAGCKLILTK---P-LGIGILTTAE 189 (346) T ss_pred HHHHHHHHHHHCCCCCCCCCC---C-CCCCCC---CEEEECCCC-HHHHCCCCCCCCCCEEEEEC---C-CCCCEEHHHH T ss_conf 975478998718422488323---7-888430---415640435-66612256667786899914---7-5552321454 Q ss_pred HCCCCCCCCCCCHHHHHHHHHHHHHHHHC-------CCEEEEEECCCCHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHH Q ss_conf 11678999647989999999999999995-------98069946486579999999984589708998056668987888 Q gi|254780970|r 592 ASSNIGPPPKVDCHLEKNHGFFVLSMINA-------KKITACHDISTGGLIITLAEMTISSAKGMDIILPIEIEKDPKPF 664 (737) Q Consensus 592 ~~~~~g~~p~~d~~~~k~~~~~v~~li~~-------g~I~S~HDiSdGGL~~aL~EMa~as~~G~~I~L~~~~~~d~~~~ 664 (737) ... ....+......+.+.++++. --|.|+.|||..||+.++.|||..+++.+++.....+..+..++ T Consensus 190 k~~------~L~~e~~~~ale~M~~lN~~~~~~~~~~gv~A~TDVTGfGLlghl~E~ae~s~v~~~lf~~~ip~l~~~e~ 263 (346) T COG0709 190 KKG------LLKEEHKGLALENMCTLNKIGAQFAPDFGVAAMTDVTGFGLLGHLKEMAEGAGVDARLFDSPIPFLDGVEF 263 (346) T ss_pred HHH------CCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEECCCCCHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHH T ss_conf 441------15888999999998633477897376678502432657408999999984359857997266787177999 Q ss_pred H--------------------------------HC--CCCCCEEEEECHHHHHHHHHHHHHCCCCEEEEEEECCC Q ss_conf 7--------------------------------29--88321899996880998999998579849998998798 Q gi|254780970|r 665 L--------------------------------FG--EDQGRYVVCISPENQDLVMSEANNKNIPLRYLGKVSGS 705 (737) Q Consensus 665 L--------------------------------Fs--Es~Gr~ii~V~~~~~~~~~~~~~~~gi~~~~IG~Vt~~ 705 (737) + .+ .++|.+++.|.++...++..+..+.++++..||++... T Consensus 264 ~~e~g~iP~gt~~n~~s~~~~~~~~l~e~~~~lL~dpqtsgglliav~~~~~~~~~~i~~e~~~~~~~i~~l~~k 338 (346) T COG0709 264 YIENGIIPGGTARNFGSYGIFAGKDLPEEQKLLLCDPQTSGGLLIAVVPEGKGEGLEIAPELGIQLIIIGELVAK 338 (346) T ss_pred HHHCCCCCCCCCCCHHHHHHHCCCCCCHHHHHHHCCCCCCCCEEEEEECCCCCCHHEECCHHCCCEEHHHHHHHH T ss_conf 997588768544332678886034551899987048877886699980466533202341312641045668874 No 61 >TIGR01739 tegu_FGAM_synt herpesvirus tegument protein/v-FGAM-synthase; InterPro: IPR010077 This entry describes a family of large proteins of herpes virus. The protein is described variably as tegument protein or phosphoribosylformylglycinamidine synthase (FGAM-synthase). The protein shows homology to eukaryotic FGAM-synthase. ; GO: 0004642 phosphoribosylformylglycinamidine synthase activity, 0005198 structural molecule activity, 0019033 viral tegument. Probab=99.71 E-value=1.7e-15 Score=125.21 Aligned_cols=280 Identities=20% Similarity=0.299 Sum_probs=198.3 Q ss_pred EEECCCCCEE---------EEEECCCEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCC----CCCCEEEEEEEECCCC Q ss_conf 8993347736---------89836983999995178877765888872105581022576----7787436877504578 Q gi|254780970|r 67 VIQGPGENAG---------VVDIGGGDCVVFKMESHNHPSYIEPYQGAATGVGGILRDIF----TMGARPVAAMNSLRFG 133 (737) Q Consensus 67 vi~~~~~~a~---------vi~~~~~~~~~~k~EtHNhPs~i~P~~GAaTg~gG~iRDi~----~~Ga~p~a~~~~l~~g 133 (737) .|+.|.||.. +.-+||....+ ++||-.||.=++--.+=+.+ ..|-+=+.+|-.+.++ T Consensus 655 tiqvPddn~~~~~~~~~~~~~A~GEQ~~kv----------~~D~~~G~~yai~EALlNL~~sP~~~Gl~~~~iT~svtw~ 724 (1313) T TIGR01739 655 TIQVPDDNEVLENPALSGVCSALGEQAYKV----------QLDPKRGATYAITEALLNLSLSPVLTGLEDVIITLSVTWS 724 (1313) T ss_pred EEECCCCCCCCCCCCCEEEEEECCCCCEEE----------EECHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEECC T ss_conf 687367776577765238999613223002----------4213566899999999998745300362424798888548 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEECCCC---CCC----------CC-CEE---EECCCCHHHHHHCC Q ss_conf 75686388899888745886654308721132035521567---664----------22-112---21121135543112 Q gi|254780970|r 134 AIHHPKTKHLLSGVVAGIAGYSNSFGVPTVGGEVEFLPCYN---NNI----------IV-NTF---AAGIAKTNAIFSSK 196 (737) Q Consensus 134 ~~~~p~~~~~~~~~~~G~~~ygN~~G~P~v~G~~~~~~~y~---~~~----------~v-~~~---~~Gi~~~~~~~~~~ 196 (737) .-+ .....+..+..-++|||.++||+..-..-.-.+... .++ .- +|+ |-|.......++-. T Consensus 725 ~~~--~~~~~L~~~l~~ck~FC~~LGV~~~~~~A~~~~~~~~~~~~~~~~~~~~vl~~~~IVfsa~~~~~~~~~~~iTPd 802 (1313) T TIGR01739 725 PED--HVYSLLKEALRACKDFCEELGVSFTVTSAASSPTQSSRSAPTTIQRESKVLTFNSIVFSASAPVKLGSAKKITPD 802 (1313) T ss_pred CCC--HHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCCCCCEEEECCEEEEEEEEEECCCCCCCCCCC T ss_conf 410--048999999999999997499169870257888888788885788653243012089999874136666645755 Q ss_pred CCCCCCEEEEECCCCCCCCCHHHHHHHHHHCCCCC-CCCCCCCCCCHHHHHHHHHHHHHHCCCCCCHHCCCCCCCCHHHH Q ss_conf 25689779996576564441046666777286533-34443326798999888999874011383011034775201112 Q gi|254780970|r 197 ARGIGLPLVYLGAKTGRDGIGGASMASEEFGENIA-KKRPTVQVGDPFTGKCLLEACLELMNTDAVIAIQDMGAAGLTCS 275 (737) Q Consensus 197 ~~~~Gd~ii~~G~~tg~dGi~Ga~~sS~~~~~~~~-~~~~~VQ~gdp~~ek~~~~~~~~~~~~~~i~~~~D~gaGGl~~a 275 (737) -+..|+.||.|-- .-...+.|+.|.. .+..... ...+.. +|.+=+++..++.++.+++.|.|+||+|+|||.-+ T Consensus 803 lk~~Gs~L~~l~~-~~~~~laGSif~~-i~~~~~~~~~~~~~---~~~~l~~ll~~~~~L~~~~~I~SgHDvSDGGL~ac 877 (1313) T TIGR01739 803 LKKHGSHLIWLSI-HQKLTLAGSIFEQ-ILGLSATIHRLPEL---SPESLKKLLKALQELVKEGVIVSGHDVSDGGLVAC 877 (1313) T ss_pred CCCCCCEEEEEEC-CCCCCHHHHHHHH-HHHCCCCCCCCCCC---CHHHHHHHHHHHHHHHHCCCEEEEEEECCHHHHHH T ss_conf 4478888999942-8972017679998-62103577656556---88999999999987765673588754361368999 Q ss_pred HHHHHCCCCCCEEEEEHHHCCCCCCCCHHHHHHCCCCCEEEEEEECCCHHHHHHHHHHHCCEEECCCCEEE---CCCCCC Q ss_conf 23320147752899620220466789998986178885089999546758899988870971311221210---443100 Q gi|254780970|r 276 AIEMGNQGNLGITLDLDKVPTCEEGMTAYEMMLSESQERMLMILNPEKQHKAQEILNKWGLHFSIIGITTN---DKLFRV 352 (737) Q Consensus 276 ~~Ema~~~~~G~~i~l~~vp~~~~~l~~~ei~~sEsQeR~~~~v~~~~~~~~~~i~~~~~~~~~~iG~vt~---~~~~~v 352 (737) +.|||.+|+.|++|.|-. ....|++.||||+.. +|+.|++++.+++...++..++-|.+||.|++ +..+.| T Consensus 878 ~~EMAlaG~kgv~i~~P~-----~~~~~~~~l~SETPG-~viEV~~~~~~~v~~~l~~~~~~~~~iG~V~~~g~~~~~~v 951 (1313) T TIGR01739 878 VAEMALAGGKGVRITLPA-----HITDPLEFLLSETPG-VVIEVEPESLKEVLQFLRSEGLVAAVIGRVGEEGESSTFSV 951 (1313) T ss_pred HHHHHHCCCCEEEEECCC-----CCCCHHHHHHCCCCC-EEEEECCCCHHHHHHHHHHCCCCEEECCEEECCCCCCEEEE T ss_conf 999996389648997176-----435568875305895-69997774088999999867971444017442579972799 Q ss_pred CCCCCCCCCCCCCCCCC Q ss_conf 02586443443210146 Q gi|254780970|r 353 IHRGEEVANLPIKALSD 369 (737) Q Consensus 353 ~~~g~~v~dlp~~~L~~ 369 (737) .++++.+..-|+..|.. T Consensus 952 ~h~~~~l~~~~L~~l~~ 968 (1313) T TIGR01739 952 VHNSTVLFQESLSSLQG 968 (1313) T ss_pred EECCCEEEEEEHHHHHH T ss_conf 97882688610023341 No 62 >COG0150 PurM Phosphoribosylaminoimidazole (AIR) synthetase [Nucleotide transport and metabolism] Probab=99.64 E-value=4.5e-14 Score=115.46 Aligned_cols=260 Identities=21% Similarity=0.262 Sum_probs=166.2 Q ss_pred CCCCCCEEEECC-CCCCEEEEECCCCCCCHHCCCH-----HHHHHHHHHHHHHHHCCCCCHHHHHHHCCCCCCCCCCHHH Q ss_conf 135763142037-7762279730453210000100-----1135678977764102785102232210245666862134 Q gi|254780970|r 433 PGGDAGVIRVEG-HETKALAFSSDVTPRYVKADPF-----EGTKQAVAECWRNIIATGAKPLAITDNLNFGNPEKEEIMG 506 (737) Q Consensus 433 Pg~DaaVv~~~~-~~~~gia~s~g~~p~~~~~dP~-----~ga~~AV~Eal~Nl~a~Ga~Plait~~lnf~~P~~~e~~~ 506 (737) .|+=+++..+.. .....+++++|.-.....+..- .-+.=.|+-++.|++|.||+|+++.|-+.-+.|+ ++. T Consensus 39 lGgf~~lf~l~~~~~~p~Lv~~tDGVGTKl~~A~~~~k~dTiGID~VAM~VNDlv~~GAePl~flDY~A~gk~~-~~~-- 115 (345) T COG0150 39 LGGFAGLFDLGQAYYKPVLVSGTDGVGTKLLLAEDAGKHDTIGIDLVAMCVNDLVVQGAEPLFFLDYLATGKLD-PEV-- 115 (345) T ss_pred CCCEEEEEECCCCCCCCEEEECCCCCHHHHHHHHHHCCCCCCCCCHHEEECCHHHHCCCCHHHHHHHHHCCCCC-HHH-- T ss_conf 76415788726777542699628873389999998078543240010120311321585207877667308898-899-- Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCC--C---CCCCCCCCCCCCCEEEEECCCCCCCCCEEECCCCCCC Q ss_conf 6999999998522102886654432001123686667--6---3111222233352023310214389807513776655 Q gi|254780970|r 507 QFVHSVKGIREACQILDFPIVSGNVSFYNETNGQSIF--P---TPTIAGVGILPDYSLMTRIDSAHEGDLILMIGNDGCH 581 (737) Q Consensus 507 ql~~av~gl~d~c~~lgipivgGkvSl~n~t~~~~I~--p---Tpvi~~vG~v~d~~~~it~~~k~~Gd~I~liG~~~~~ 581 (737) ..+.++|+.+.|+..|+.++|| ||..-+-. + -+.-+++|.++. +.+++....++||.|+-+.+++.| T Consensus 116 -~~~iv~GiaeGc~~ag~aLvGG------ETAeMPg~y~~g~yDlaG~~vGvvek-~~ii~g~~i~~GDviigl~SSG~H 187 (345) T COG0150 116 -AAQIVKGIAEGCKQAGCALVGG------ETAEMPGMYRGGDYDLAGFAVGVVEK-DEIIDGSKVKEGDVIIGLASSGLH 187 (345) T ss_pred -HHHHHHHHHHHHHHHCCEEECC------CCCCCCCCCCCCCEEEEEEEEEEEEC-CCCCCCCCCCCCCEEEEECCCCCC T ss_conf -9999999999999849778532------45568885268843031169999875-643360116788789983378767 Q ss_pred CCH-HHHHHHHHCCCCCC---CCCCC--------HHHHHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHCCCCCE Q ss_conf 432-67888861167899---96479--------8999999999999999598069946486579999999984589708 Q gi|254780970|r 582 LDC-SMYSLECASSNIGP---PPKVD--------CHLEKNHGFFVLSMINAKKITACHDISTGGLIITLAEMTISSAKGM 649 (737) Q Consensus 582 Lgg-S~~~~~~~~~~~g~---~p~~d--------~~~~k~~~~~v~~li~~g~I~S~HDiSdGGL~~aL~EMa~as~~G~ 649 (737) -.| |...+.+....... .|... ++.-|-..+.+++++++..|+++-.|+.||+.-.|-.| +..++++ T Consensus 188 SNGySLvRKi~~~~~~~~~~~~~~~~g~~l~e~LL~PTrIYvk~vL~l~~~~~vkg~AHITGGG~~eNl~Rv-~p~~l~a 266 (345) T COG0150 188 SNGYSLVRKIIEESGLDYDDELPEELGKTLGEELLEPTRIYVKPVLALIKEGDVKGMAHITGGGFVENLPRV-LPEGLGA 266 (345) T ss_pred CCCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHCCCCEEEHHHHHHHHHCCCCCEEEEECCCCHHHHCHHH-CCCCCEE T ss_conf 784089999997547654445752235578998527726555879999752785367775377714427333-6766169 Q ss_pred EEEECCCCCCCHHHHH--------------HCCCCCCEEEEECHHHHHHHHHHHHHCCCCEEEEEEECCC Q ss_conf 9980566689878887--------------2988321899996880998999998579849998998798 Q gi|254780970|r 650 DIILPIEIEKDPKPFL--------------FGEDQGRYVVCISPENQDLVMSEANNKNIPLRYLGKVSGS 705 (737) Q Consensus 650 ~I~L~~~~~~d~~~~L--------------FsEs~Gr~ii~V~~~~~~~~~~~~~~~gi~~~~IG~Vt~~ 705 (737) .|+.+...-.....+| |+-.-| |++.|++++.+++++.++..|.++++||+|... T Consensus 267 ~id~~~~~~p~iF~~i~~~G~v~~~EM~rtFNmGvG-~v~iv~~e~~~~~~~~l~~~g~~a~~iG~V~~~ 335 (345) T COG0150 267 VIDKPSWPPPPIFKWLQKAGNVEREEMYRTFNMGVG-MVLIVPEEDAEKALALLKEQGETAYVIGRVEAG 335 (345) T ss_pred EECCCCCCCCHHHHHHHHHCCCCHHHHHHHHCCCCC-EEEEECHHHHHHHHHHHHHCCCCEEEEEEEEEC T ss_conf 982798887589999998469899999987267664-699976788999999998569742898899837 No 63 >COG0150 PurM Phosphoribosylaminoimidazole (AIR) synthetase [Nucleotide transport and metabolism] Probab=99.59 E-value=1.9e-14 Score=118.09 Aligned_cols=269 Identities=22% Similarity=0.325 Sum_probs=182.7 Q ss_pred HCCCCEEECCCCCEEEEEECCCE---EEE-------------EEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEE Q ss_conf 06974899334773689836983---999-------------99517887776588887210558102257677874368 Q gi|254780970|r 62 TTGKHVIQGPGENAGVVDIGGGD---CVV-------------FKMESHNHPSYIEPYQGAATGVGGILRDIFTMGARPVA 125 (737) Q Consensus 62 ~~~~~vi~~~~~~a~vi~~~~~~---~~~-------------~k~EtHNhPs~i~P~~GAaTg~gG~iRDi~~~Ga~p~a 125 (737) |..++++.+.|-=++.++++..+ .++ +++-.|+ .| -..+|.=.++|+++.||+|++ T Consensus 30 T~r~~v~gglGgf~~lf~l~~~~~~p~Lv~~tDGVGTKl~~A~~~~k~d---Ti-----GID~VAM~VNDlv~~GAePl~ 101 (345) T COG0150 30 TRRPEVMGGLGGFAGLFDLGQAYYKPVLVSGTDGVGTKLLLAEDAGKHD---TI-----GIDLVAMCVNDLVVQGAEPLF 101 (345) T ss_pred HCCCCCCCCCCCEEEEEECCCCCCCCEEEECCCCCHHHHHHHHHHCCCC---CC-----CCCHHEEECCHHHHCCCCHHH T ss_conf 2576534577641578872677754269962887338999999807854---32-----400101203113215852078 Q ss_pred EEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEECC-CCCC-CCCCEEEECCCCHHHHHHCCCCCCCCE Q ss_conf 77504578756863888998887458866543087211320355215-6766-422112211211355431122568977 Q gi|254780970|r 126 AMNSLRFGAIHHPKTKHLLSGVVAGIAGYSNSFGVPTVGGEVEFLPC-YNNN-IIVNTFAAGIAKTNAIFSSKARGIGLP 203 (737) Q Consensus 126 ~~~~l~~g~~~~p~~~~~~~~~~~G~~~ygN~~G~P~v~G~~~~~~~-y~~~-~~v~~~~~Gi~~~~~~~~~~~~~~Gd~ 203 (737) .+|.|-.|.++.. ...++++|++..|-+.|+..|||+|.-.|. |.+. .-+.++|+|++++++++.....++||. T Consensus 102 flDY~A~gk~~~~----~~~~iv~GiaeGc~~ag~aLvGGETAeMPg~y~~g~yDlaG~~vGvvek~~ii~g~~i~~GDv 177 (345) T COG0150 102 FLDYLATGKLDPE----VAAQIVKGIAEGCKQAGCALVGGETAEMPGMYRGGDYDLAGFAVGVVEKDEIIDGSKVKEGDV 177 (345) T ss_pred HHHHHHCCCCCHH----HHHHHHHHHHHHHHHHCCEEECCCCCCCCCCCCCCCEEEEEEEEEEEECCCCCCCCCCCCCCE T ss_conf 7766730889889----999999999999998497785324556888526884303116999987564336011678878 Q ss_pred EEEECCCCCCCCCHHHHH--HHHHHCCC---CCCCCCCCCCCCHHHHHH------HHHHHHHHCCCCCCHHCCCCCCCCH Q ss_conf 999657656444104666--67772865---333444332679899988------8999874011383011034775201 Q gi|254780970|r 204 LVYLGAKTGRDGIGGASM--ASEEFGEN---IAKKRPTVQVGDPFTGKC------LLEACLELMNTDAVIAIQDMGAAGL 272 (737) Q Consensus 204 ii~~G~~tg~dGi~Ga~~--sS~~~~~~---~~~~~~~VQ~gdp~~ek~------~~~~~~~~~~~~~i~~~~D~gaGGl 272 (737) ||-+++ .|+|-.-| .-+.+.++ .....+.- .|.+..+-. -.+.+++++++..|++|-.++.||+ T Consensus 178 iigl~S----SG~HSNGySLvRKi~~~~~~~~~~~~~~~-~g~~l~e~LL~PTrIYvk~vL~l~~~~~vkg~AHITGGG~ 252 (345) T COG0150 178 IIGLAS----SGLHSNGYSLVRKIIEESGLDYDDELPEE-LGKTLGEELLEPTRIYVKPVLALIKEGDVKGMAHITGGGF 252 (345) T ss_pred EEEECC----CCCCCCCHHHHHHHHHHCCCCCCCCCCCC-CCCCHHHHHCCCCEEEHHHHHHHHHCCCCCEEEEECCCCH T ss_conf 998337----87677840899999975476544457522-3557899852772655587999975278536777537771 Q ss_pred HHHHHHHHCCCCCCEEEEEHHHCCC-------CCC-CCHHHHHH--CCCCCEEEEEEECCCHHHHHHHHHHHCCEEECCC Q ss_conf 1122332014775289962022046-------678-99989861--7888508999954675889998887097131122 Q gi|254780970|r 273 TCSAIEMGNQGNLGITLDLDKVPTC-------EEG-MTAYEMML--SESQERMLMILNPEKQHKAQEILNKWGLHFSIIG 342 (737) Q Consensus 273 ~~a~~Ema~~~~~G~~i~l~~vp~~-------~~~-l~~~ei~~--sEsQeR~~~~v~~~~~~~~~~i~~~~~~~~~~iG 342 (737) .-.+.-|..+ +++++|+....+.. +.+ .+..|.+- | --=-|+++|++++.+++.++++..+.++.+|| T Consensus 253 ~eNl~Rv~p~-~l~a~id~~~~~~p~iF~~i~~~G~v~~~EM~rtFN-mGvG~v~iv~~e~~~~~~~~l~~~g~~a~~iG 330 (345) T COG0150 253 VENLPRVLPE-GLGAVIDKPSWPPPPIFKWLQKAGNVEREEMYRTFN-MGVGMVLIVPEEDAEKALALLKEQGETAYVIG 330 (345) T ss_pred HHHCHHHCCC-CCEEEECCCCCCCCHHHHHHHHHCCCCHHHHHHHHC-CCCCEEEEECHHHHHHHHHHHHHCCCCEEEEE T ss_conf 4427333676-616998279888758999999846989999998726-76646999767889999999985697428988 Q ss_pred CEEECCC Q ss_conf 1210443 Q gi|254780970|r 343 ITTNDKL 349 (737) Q Consensus 343 ~vt~~~~ 349 (737) +|.+... T Consensus 331 ~V~~~~~ 337 (345) T COG0150 331 RVEAGEG 337 (345) T ss_pred EEEECCC T ss_conf 9983798 No 64 >COG1973 HypE Hydrogenase maturation factor [Posttranslational modification, protein turnover, chaperones] Probab=99.57 E-value=6e-14 Score=114.57 Aligned_cols=256 Identities=17% Similarity=0.273 Sum_probs=166.0 Q ss_pred CCCC--CCCCCEEEECCCCCCEEEEECCC-CCCCHHCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHCCCCCCCCCCHHH Q ss_conf 0001--35763142037776227973045-32100001001135678977764102785102232210245666862134 Q gi|254780970|r 430 IQLP--GGDAGVIRVEGHETKALAFSSDV-TPRYVKADPFEGTKQAVAECWRNIIATGAKPLAITDNLNFGNPEKEEIMG 506 (737) Q Consensus 430 v~~P--g~DaaVv~~~~~~~~gia~s~g~-~p~~~~~dP~~ga~~AV~Eal~Nl~a~Ga~Plait~~lnf~~P~~~e~~~ 506 (737) +.-| .+|++|++++. ..++.++|. ..|++. =|+.++.++---|+|+++.||++|+++.--+.... ...-+ T Consensus 105 ~vs~ee~DDaGvvr~~~---~yivvaiDGiHSRLSe-fPFLaGFhvtRAalRDv~vmGa~p~alisDiHlaD---DgDVg 177 (449) T COG1973 105 VVSPEELDDAGVVRIEN---DYIVVAIDGIHSRLSE-FPFLAGFHVTRAALRDVYVMGARPVALISDIHLAD---DGDVG 177 (449) T ss_pred CCCHHHCCCCCEEEECC---CEEEEEECCHHHHHHC-CCCHHHHHHHHHHHHHHHHCCCCCCEEEEEEEECC---CCCHH T ss_conf 06887847777277358---6699994133433112-75122468779999889872764303356556336---75465 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCC-CC---CCCCCCCCCCCCCCEEEEECCCCCCCCCEEECCCCCCCC Q ss_conf 69999999985221028866544320011236866-67---631112222333520233102143898075137766554 Q gi|254780970|r 507 QFVHSVKGIREACQILDFPIVSGNVSFYNETNGQS-IF---PTPTIAGVGILPDYSLMTRIDSAHEGDLILMIGNDGCHL 582 (737) Q Consensus 507 ql~~av~gl~d~c~~lgipivgGkvSl~n~t~~~~-I~---pTpvi~~vG~v~d~~~~it~~~k~~Gd~I~liG~~~~~L 582 (737) .++.-..|+.-.|.+.++|+++|.. ---+.+ +. -+--++++|..+ .+.......++||+|+++- ..- T Consensus 178 klfDf~AGvtaVsea~~vPl~aGST----LRiGGDmVlGdRl~~~Vg~iGi~k--~~~t~Rr~~~~Gd~IlmTE---GaG 248 (449) T COG1973 178 KLFDFTAGVTAVSEAVGVPLLAGST----LRIGGDMVLGDRLVGAVGAIGISK--SELTPRRRAEPGDVILMTE---GAG 248 (449) T ss_pred HHHHHHHHHHHHHHHHCCCEECCCC----EEECCEEEEECHHHCCCCEEEEEC--CCCCHHCCCCCCCEEEEEC---CCC T ss_conf 6301221265788874885433652----577472676040103210256505--8887010388887799942---788 Q ss_pred CHHHHHHHHH-CCCCCCCCCCCHHHHHHHHHHHHHHHHCCC---EEEEEECCCCHHHHHHHHHHHCCCCCEEEEECCCC- Q ss_conf 3267888861-167899964798999999999999999598---06994648657999999998458970899805666- Q gi|254780970|r 583 DCSMYSLECA-SSNIGPPPKVDCHLEKNHGFFVLSMINAKK---ITACHDISTGGLIITLAEMTISSAKGMDIILPIEI- 657 (737) Q Consensus 583 ggS~~~~~~~-~~~~g~~p~~d~~~~k~~~~~v~~li~~g~---I~S~HDiSdGGL~~aL~EMa~as~~G~~I~L~~~~- 657 (737) ||.+....++ +-..-.-..+|... ..+++.++++++ |+|+.|++.||+-....|.+-..++++.+|-+... T Consensus 249 GGTi~TTAiY~Gm~DVv~ETlnv~f----i~a~eai~~s~L~~~vhsMTDVTNGGiRgDA~EISkta~v~lvfDeekv~s 324 (449) T COG1973 249 GGTIATTAIYHGMHDVVEETLNVDF----IRACEAIVRSGLLSDVHSMTDVTNGGIRGDALEISKTAGVSLVFDEEKVRS 324 (449) T ss_pred CCEEEHHHHHCCCHHHHHHHCCHHH----HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHCCEEEEECHHHHHH T ss_conf 8553314445162788887627489----999999998505665345430547761443787655327269971899986 Q ss_pred CCCH--HHHH-------HCCCCCCEEEEECHHHHHHHHHHHHHCCCCEEEEEEECCC Q ss_conf 8987--8887-------2988321899996880998999998579849998998798 Q gi|254780970|r 658 EKDP--KPFL-------FGEDQGRYVVCISPENQDLVMSEANNKNIPLRYLGKVSGS 705 (737) Q Consensus 658 ~~d~--~~~L-------FsEs~Gr~ii~V~~~~~~~~~~~~~~~gi~~~~IG~Vt~~ 705 (737) ..+| +..| |+-|.-.+++-.+++..+.+.+.++..|+.+..||.|.+. T Consensus 325 liNpkVl~MLeel~ID~lGvSiDslmii~ped~a~~v~k~~~~agvr~~~vG~Vee~ 381 (449) T COG1973 325 LINPKVLKMLEELNIDPLGVSIDSLMIIAPEDVADLVKKALRGAGVRAEEVGRVEEG 381 (449) T ss_pred HCCHHHHHHHHHCCCCCCCCCCCEEEEECCHHHHHHHHHHHHHCCCCEEEEEEEECC T ss_conf 409899999997188843020201789666131699999987458623688666327 No 65 >COG0709 SelD Selenophosphate synthase [Amino acid transport and metabolism] Probab=99.55 E-value=3.4e-13 Score=109.46 Aligned_cols=276 Identities=23% Similarity=0.336 Sum_probs=195.8 Q ss_pred CCCCCCCHHHHHHHHHC------CCCEEEC--CCCCEEEEEECCCEEEEEEECCCCCCCCCCCCC-C--CCCCCCCCCCC Q ss_conf 10744588999751206------9748993--347736898369839999951788777658888-7--21055810225 Q gi|254780970|r 47 HCSYKSSKKWLRTLPTT------GKHVIQG--PGENAGVVDIGGGDCVVFKMESHNHPSYIEPYQ-G--AATGVGGILRD 115 (737) Q Consensus 47 HC~~k~~~~~l~~~~~~------~~~vi~~--~~~~a~vi~~~~~~~~~~k~EtHNhPs~i~P~~-G--AaTg~gG~iRD 115 (737) -|--|.|.+.|.++-.. -++.+.| -+|||+|.+++++..++-+.. -=-|=-=+||. | |||- .|-| T Consensus 16 g~gck~sp~vL~qil~~~~a~~~~~~~lvgn~~~dda~vy~ig~g~~iIsT~d-f~mpivddp~dfG~IaAtN---aIsD 91 (346) T COG0709 16 GCGCKISPKVLEQILHGTPAKPDFAKLLVGNETGDDAAVYDLGNGHSIISTTD-FFMPIVDDPFDFGRIAATN---AISD 91 (346) T ss_pred CCCCCCCHHHHHHHHHCCCCCCCCHHHHHCCCCCCCCEEEEECCCEEEEEECC-EEEECCCCCCHHHHHHHCC---CCCC T ss_conf 65222579999999723867887467763355467621688469808998344-0224215823220211112---1034 Q ss_pred CCCCCCEEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEECCCCCCCCCCEEEECCCCHHHHHHC Q ss_conf 76778743687750457875686388899888745886654308721132035521567664221122112113554311 Q gi|254780970|r 116 IFTMGARPVAAMNSLRFGAIHHPKTKHLLSGVVAGIAGYSNSFGVPTVGGEVEFLPCYNNNIIVNTFAAGIAKTNAIFSS 195 (737) Q Consensus 116 i~~~Ga~p~a~~~~l~~g~~~~p~~~~~~~~~~~G~~~ygN~~G~P~v~G~~~~~~~y~~~~~v~~~~~Gi~~~~~~~~~ 195 (737) +++||++|+..+..++|.+-.. ...++.+++.|..+-+.+-|+++.+|+.--++ -|.-.-...|++...+..+. T Consensus 92 v~amgg~pi~AiAI~g~~~~~l--~~~i~~ei~~gg~~~~rea~i~lagGhsIdap----EP~fGlav~G~~~~~k~~~n 165 (346) T COG0709 92 VFAMGGKPIMAIAILGFPNDLL--DKEIAQEVIEGGRDACREAGIALAGGHSIDAP----EPIFGLAVTGIVPTGKVKRN 165 (346) T ss_pred HHHCCCCCHHHHHHCCCCCCCC--CHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCC----CCCCCEEEECCCCHHHHCCC T ss_conf 6650896122221117842225--88999999975478998718422488323788----84304156404356661225 Q ss_pred CCCCCCCEEEEECCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCC-------CCCCHHCCCCC Q ss_conf 22568977999657656444104666677728653334443326798999888999874011-------38301103477 Q gi|254780970|r 196 KARGIGLPLVYLGAKTGRDGIGGASMASEEFGENIAKKRPTVQVGDPFTGKCLLEACLELMN-------TDAVIAIQDMG 268 (737) Q Consensus 196 ~~~~~Gd~ii~~G~~tg~dGi~Ga~~sS~~~~~~~~~~~~~VQ~gdp~~ek~~~~~~~~~~~-------~~~i~~~~D~g 268 (737) ...++|+.|++.- | .|+|= . +..+-++. . .|+-.++.++..+.+.. .--|.+|.|++ T Consensus 166 ~~a~~G~~lilTK-P---lG~gi--l-~ta~k~~~---L------~~e~~~~ale~M~~lN~~~~~~~~~~gv~A~TDVT 229 (346) T COG0709 166 STAKAGCKLILTK-P---LGIGI--L-TTAEKKGL---L------KEEHKGLALENMCTLNKIGAQFAPDFGVAAMTDVT 229 (346) T ss_pred CCCCCCCEEEEEC-C---CCCCE--E-HHHHHHHC---C------CHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEECCC T ss_conf 6667786899914-7---55523--2-14544411---5------88899999999863347789737667850243265 Q ss_pred CCCHHHHHHHHHCCCCCCEEEEEHHHCCCCC---------------------------CCCHHHHH-HC--CCCCEEEEE Q ss_conf 5201112233201477528996202204667---------------------------89998986-17--888508999 Q gi|254780970|r 269 AAGLTCSAIEMGNQGNLGITLDLDKVPTCEE---------------------------GMTAYEMM-LS--ESQERMLMI 318 (737) Q Consensus 269 aGGl~~a~~Ema~~~~~G~~i~l~~vp~~~~---------------------------~l~~~ei~-~s--EsQeR~~~~ 318 (737) .-||...+.|||+.+++.++++.++||.-+- .|.-|+.. ++ -+|+-++++ T Consensus 230 GfGLlghl~E~ae~s~v~~~lf~~~ip~l~~~e~~~e~g~iP~gt~~n~~s~~~~~~~~l~e~~~~lL~dpqtsggllia 309 (346) T COG0709 230 GFGLLGHLKEMAEGAGVDARLFDSPIPFLDGVEFYIENGIIPGGTARNFGSYGIFAGKDLPEEQKLLLCDPQTSGGLLIA 309 (346) T ss_pred CCHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHCCCCCCCCCCCHHHHHHHCCCCCCHHHHHHHCCCCCCCCEEEE T ss_conf 74089999999843598579972667871779999975887685443326788860345518999870488778866999 Q ss_pred EECCCHHHHHHHHHHHCCEEECCCCEEECC Q ss_conf 954675889998887097131122121044 Q gi|254780970|r 319 LNPEKQHKAQEILNKWGLHFSIIGITTNDK 348 (737) Q Consensus 319 v~~~~~~~~~~i~~~~~~~~~~iG~vt~~~ 348 (737) |.+|...++.++.+.|++++..||++.... T Consensus 310 v~~~~~~~~~~i~~e~~~~~~~i~~l~~k~ 339 (346) T COG0709 310 VVPEGKGEGLEIAPELGIQLIIIGELVAKQ 339 (346) T ss_pred EECCCCCCHHEECCHHCCCEEHHHHHHHHC T ss_conf 804665332023413126410456688742 No 66 >KOG1907 consensus Probab=99.55 E-value=1.8e-13 Score=111.28 Aligned_cols=163 Identities=20% Similarity=0.361 Sum_probs=106.7 Q ss_pred CCCCEE--EEECCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCHHCCCCCCCCHHHHH Q ss_conf 689779--996576564441046666777286533344433267989998889998740113830110347752011122 Q gi|254780970|r 199 GIGLPL--VYLGAKTGRDGIGGASMASEEFGENIAKKRPTVQVGDPFTGKCLLEACLELMNTDAVIAIQDMGAAGLTCSA 276 (737) Q Consensus 199 ~~Gd~i--i~~G~~tg~dGi~Ga~~sS~~~~~~~~~~~~~VQ~gdp~~ek~~~~~~~~~~~~~~i~~~~D~gaGGl~~a~ 276 (737) .+|+.. +-+--..++--+||++.+.... ....|.|.+- |--.-|++.+.+..+...|+|.+-||.++|||.+++ T Consensus 836 ~~gs~~~Ll~i~l~~~k~rLGgSaLaQvy~--QiG~d~Pdl~--~~~~lk~~f~~vqqL~~~~ii~AgHD~SDGGLlvt~ 911 (1320) T KOG1907 836 VDGSKTSLLWIDLANSKMRLGGSALAQVYS--QIGDDCPDLD--NFDELKKFFSVVQQLLNEGIILAGHDISDGGLLVTL 911 (1320) T ss_pred CCCCCEEEEEEECCCCCCCCCHHHHHHHHH--HHCCCCCCCC--CHHHHHHHHHHHHHHHHCCCEEECCCCCCCCHHHHH T ss_conf 899721799998455322354589999999--7378899853--348898899999999756733425535677200889 Q ss_pred HHHHCCCCCCEEEEEHHHCCCCCCCCHHHHHHCCCCCEEEEEEECCCHHHHHHHHHHHCCEEECCCCE----EECCCCCC Q ss_conf 33201477528996202204667899989861788850899995467588999888709713112212----10443100 Q gi|254780970|r 277 IEMGNQGNLGITLDLDKVPTCEEGMTAYEMMLSESQERMLMILNPEKQHKAQEILNKWGLHFSIIGIT----TNDKLFRV 352 (737) Q Consensus 277 ~Ema~~~~~G~~i~l~~vp~~~~~l~~~ei~~sEsQeR~~~~v~~~~~~~~~~i~~~~~~~~~~iG~v----t~~~~~~v 352 (737) .|||+++++|++|||+.- ..++.+.+++++|-- -.++.|...|++++++++++.++.+..||++ +.+.++.+ T Consensus 912 lEMAfag~~gi~idl~~~---~~~~~~~~~LF~EEl-G~v~evs~~dl~~v~~~~~~~gv~~~~ig~~~~~~g~~~~i~~ 987 (1320) T KOG1907 912 LEMAFAGNVGIEIDLDSP---NQNIKLFDILFAEEL-GAVLEVSDTDLEKVLEIFSEAGVKCEYIGKASAVFGQDAHIKI 987 (1320) T ss_pred HHHHHHCCCCEEEECCCC---CCCCCHHHHHHHHHH-CEEEEECCCCHHHHHHHHHHCCCCCEEEEEECCCCCCCCEEEE T ss_conf 998760466227854774---345618889989861-7177862653999999998649861254110034477745999 Q ss_pred CCCCCCCCCCCCCCCCC Q ss_conf 02586443443210146 Q gi|254780970|r 353 IHRGEEVANLPIKALSD 369 (737) Q Consensus 353 ~~~g~~v~dlp~~~L~~ 369 (737) ..++..+.+-++.+|.+ T Consensus 988 ~~~g~~~~~~~~s~L~~ 1004 (1320) T KOG1907 988 SVNGHLILNEKLSDLRE 1004 (1320) T ss_pred ECCCEEEECCHHHHHHH T ss_conf 51885784454789999 No 67 >TIGR00476 selD selenide, water dikinase; InterPro: IPR004536 In prokaryotes, the incorporation of selenocysteine as the 21st amino acid, encoded by TGA, requires several elements: SelC is the tRNA itself, SelD acts as a donor of reduced selenium, SelA modifies a serine residue on SelC into selenocysteine, and SelB is a selenocysteine-specific translation elongation factor. 3' or 5' non-coding elements of mRNA have been found as probable structures for directing selenocysteine incorporation. This family describes SelD, known as selenophosphate synthetase, selenium donor protein, and selenide, water dikinase. SelD provides reduced selenium for the selenium transferase SelA. This protein itself contains selenocysteine in many species; any sequence scoring above the trusted cut-off but not aligning to the beginning of the HMM is likely have selenocysteine residue incorrectly interpreted as a stop codon upstream of the given sequence.; GO: 0004756 selenide water dikinase activity, 0005524 ATP binding. Probab=99.49 E-value=3.1e-13 Score=109.74 Aligned_cols=269 Identities=19% Similarity=0.251 Sum_probs=184.1 Q ss_pred CCCEEEC--CC------CCEEEEEECCCEEEEEEECCCCCCCCCCCC-CC--CCCCCCCCCCCCCCCCCEEEEEEEECCC Q ss_conf 9748993--34------773689836983999995178877765888-87--2105581022576778743687750457 Q gi|254780970|r 64 GKHVIQG--PG------ENAGVVDIGGGDCVVFKMESHNHPSYIEPY-QG--AATGVGGILRDIFTMGARPVAAMNSLRF 132 (737) Q Consensus 64 ~~~vi~~--~~------~~a~vi~~~~~~~~~~k~EtHNhPs~i~P~-~G--AaTg~gG~iRDi~~~Ga~p~a~~~~l~~ 132 (737) .++.+.| .+ |||+|++-+++..++-++.- =.|=-=||| .| |+|-+ +-||.+||..|+..+-.++| T Consensus 41 ~~n~lvG~~~~rlGiGmDdA~v~~~~nG~~~v~T~D~-~~PiVddPY~~G~IAaaN~---~SDiYAMG~~~i~~~l~~~~ 116 (372) T TIGR00476 41 DDNLLVGAVLRRLGIGMDDAAVYERHNGLSLVKTVDV-ITPIVDDPYLFGRIAAANA---LSDIYAMGGLPIVALLILVG 116 (372) T ss_pred CCCCEECCCCCCCCCCCCCCEEEECCCCEEEEEEEEE-EEEEECCHHHHHHHHHHHH---HHHHHHCCCCCHHHEEEEEC T ss_conf 7210101000104323343113531798899996338-9832568355448998756---77777437732101433303 Q ss_pred CCCCCC--CHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEECCCCCCCCCCEEEECCCCHHHHHHCCCCCCCCEEEEECCC Q ss_conf 875686--388899888745886654308721132035521567664221122112113554311225689779996576 Q gi|254780970|r 133 GAIHHP--KTKHLLSGVVAGIAGYSNSFGVPTVGGEVEFLPCYNNNIIVNTFAAGIAKTNAIFSSKARGIGLPLVYLGAK 210 (737) Q Consensus 133 g~~~~p--~~~~~~~~~~~G~~~ygN~~G~P~v~G~~~~~~~y~~~~~v~~~~~Gi~~~~~~~~~~~~~~Gd~ii~~G~~ 210 (737) -+-+.+ +-..+++++++|..|++.+-|+-++||++--+ ..|+....+-||++.++.+.....++||++|+. .| T Consensus 117 ~~~kL~drer~~v~Rev~~G~~d~~reakt~~~GGhtil~----pwp~~Gg~vtgv~~~~~~~~~~~a~~G~~L~LT-KP 191 (372) T TIGR00476 117 VSNKLSDRERIEVVREVIKGFKDACREAKTALVGGHTILN----PWPVFGGAVTGVCPEEEVITKSGAQVGDVLILT-KP 191 (372) T ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHHHCCEEEECCEEECC----CCCCCCCEEEEECCCCCEECCCCCCCCCEEEEE-CC T ss_conf 5666875444201068776588888768907973621127----620005406310278847525887424167851-75 Q ss_pred CCCCCCHHHHHHHHHHCCCCC---CCCCCCCCCCHHHHHHH-----------HHHH---HHHCCCCCCHHCCCCCCCCHH Q ss_conf 564441046666777286533---34443326798999888-----------9998---740113830110347752011 Q gi|254780970|r 211 TGRDGIGGASMASEEFGENIA---KKRPTVQVGDPFTGKCL-----------LEAC---LELMNTDAVIAIQDMGAAGLT 273 (737) Q Consensus 211 tg~dGi~Ga~~sS~~~~~~~~---~~~~~VQ~gdp~~ek~~-----------~~~~---~~~~~~~~i~~~~D~gaGGl~ 273 (737) .|.+=+...++.+++-.. .+.+=.|-.+-+..++- +.++ .++....-.+|++|++.-||. T Consensus 192 ---LGtqv~~~~~~~~~~~~~~nkik~~i~~ee~~~~~~~A~e~M~~~Nryalkal~~A~~~~~~~~~~A~TDvTGFGlL 268 (372) T TIGR00476 192 ---LGTQVLVAVEKKLDIFKDLNKIKLSILKEERELIINEAIENMARLNRYALKALRKAAELMNSKDAKALTDVTGFGLL 268 (372) T ss_pred ---CHHHHHHHHHHCCCCHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCHH T ss_conf ---16899998763112001104146786075278899899999998889999999998876644187153114565456 Q ss_pred HHHHHHHCCCCCCEEEEEHHHCCCCC----------------C------------CCHH-HHHHCCCC--CEEEEEEECC Q ss_conf 12233201477528996202204667----------------8------------9998-98617888--5089999546 Q gi|254780970|r 274 CSAIEMGNQGNLGITLDLDKVPTCEE----------------G------------MTAY-EMMLSESQ--ERMLMILNPE 322 (737) Q Consensus 274 ~a~~Ema~~~~~G~~i~l~~vp~~~~----------------~------------l~~~-ei~~sEsQ--eR~~~~v~~~ 322 (737) .-+.|||..+++.++|+.+++|.-.. + |.-| ..+++|-| ..|++.|.++ T Consensus 269 GH~~~~ak~~~~~~~~~~e~lP~l~~~~~~~k~~~~~~G~~~g~~~~~~G~L~~~~~~~~~~ll~~~~~S~~y~~~~~~~ 348 (372) T TIGR00476 269 GHAQEMAKNSNVEARIVYEKLPVLKKVEELSKLKGVPLGAIEGLETAGSGLLLSELPREKAALLCDPVKSGGYLFEVQKE 348 (372) T ss_pred HHHHHHHHHCCCEEEEEEECCCCCCCCHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCHHHHHHCCCHHCCCHHHHHHHHH T ss_conf 88999874078669999530887354578898632433121354212674221357078986426101051357766688 Q ss_pred CHHHHHH-HHHHHCCEEECCCCE Q ss_conf 7588999-888709713112212 Q gi|254780970|r 323 KQHKAQE-ILNKWGLHFSIIGIT 344 (737) Q Consensus 323 ~~~~~~~-i~~~~~~~~~~iG~v 344 (737) ..+.+.+ -....+++...|||+ T Consensus 349 ~~~~v~~G~~~~~~v~~~~IgE~ 371 (372) T TIGR00476 349 VKEKVEKGTAKEKEVELIEIGEV 371 (372) T ss_pred HHHHHHHHHHHHHHCCEEEEEEE T ss_conf 85111010134330450254310 No 68 >KOG0237 consensus Probab=99.45 E-value=2.5e-12 Score=103.52 Aligned_cols=263 Identities=23% Similarity=0.276 Sum_probs=168.1 Q ss_pred CCCCCCCC------EEEEC--CCCCCEEEEECCCCCCCHHCCCH-----HHHHHHHHHHHHHHHCCCCCHHHHHHHCCCC Q ss_conf 00135763------14203--77762279730453210000100-----1135678977764102785102232210245 Q gi|254780970|r 431 QLPGGDAG------VIRVE--GHETKALAFSSDVTPRYVKADPF-----EGTKQAVAECWRNIIATGAKPLAITDNLNFG 497 (737) Q Consensus 431 ~~Pg~Daa------Vv~~~--~~~~~gia~s~g~~p~~~~~dP~-----~ga~~AV~Eal~Nl~a~Ga~Plait~~lnf~ 497 (737) +.||.|+. +.++. +..+..++..+|.-.....+.-. .-+.--|+-++.|+++.||+|++..|-+..+ T Consensus 464 ~rpG~~~diGGFgglfdLk~ag~~d~~Lv~~tdGVGtKl~iA~~~~~hdtvG~DlvAm~vNDii~~gAePl~FLDYfa~g 543 (788) T KOG0237 464 RRPGADADIGGFGGLFDLKQAGFKDPLLVSGTDGVGTKLKIAQETNIHDTVGIDLVAMNVNDIIVQGAEPLFFLDYFATG 543 (788) T ss_pred CCCCCCCCCCCCCEEEEHHHCCCCCCEEEEECCCCCCEEEHHHHHCCCCCCCEEEEEEEHHHHHHCCCCCEEEHHHHHHC T ss_conf 58874246666210000543588885688732676510201245076545250001200114755177530022235306 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCC----CCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCCEE Q ss_conf 66686213469999999985221028866544320----01123686667631112222333520233102143898075 Q gi|254780970|r 498 NPEKEEIMGQFVHSVKGIREACQILDFPIVSGNVS----FYNETNGQSIFPTPTIAGVGILPDYSLMTRIDSAHEGDLIL 573 (737) Q Consensus 498 ~P~~~e~~~ql~~av~gl~d~c~~lgipivgGkvS----l~n~t~~~~I~pTpvi~~vG~v~d~~~~it~~~k~~Gd~I~ 573 (737) .-+-+- -.+.++|+.+.|++.++.++||-.+ ||....+. +.=+++|.++.-..+..++--.+||.|+ T Consensus 544 kLd~~v----a~~~v~gia~gC~qa~CaLvGGETaEMP~~Y~~g~yD-----laG~Avga~e~~~iLp~~~ei~~GDVll 614 (788) T KOG0237 544 KLDVSV----AEQVVKGIAEGCRQAGCALVGGETAEMPGMYAPGEYD-----LAGFAVGAVERTDILPKLNEIVAGDVLL 614 (788) T ss_pred CCCHHH----HHHHHHHHHHHHHHCCCEEECCCCCCCCCCCCCCCCC-----CCCCEEEEEECCCCCCCCCCCCCCCEEE T ss_conf 504178----9999999998776506512156323477888988315-----4551676662267577723255686799 Q ss_pred ECCCCCCCC-CHHHHHHHHHCC--CCCC-CCCCCH--------HHHHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHH Q ss_conf 137766554-326788886116--7899-964798--------9999999999999995980699464865799999999 Q gi|254780970|r 574 MIGNDGCHL-DCSMYSLECASS--NIGP-PPKVDC--------HLEKNHGFFVLSMINAKKITACHDISTGGLIITLAEM 641 (737) Q Consensus 574 liG~~~~~L-ggS~~~~~~~~~--~~g~-~p~~d~--------~~~k~~~~~v~~li~~g~I~S~HDiSdGGL~~aL~EM 641 (737) =+.+.+-+- |-|+..+.+... .... .|-.+- ...|-..+.++.+++++.|.++-.|+.|||.-.+-.+ T Consensus 615 GL~SsGvHSNGfSLvrkil~~~~l~~~~~~P~~~~~tlGd~LL~pTkiYvk~ll~~i~~g~vkg~AHITGGGl~ENipRv 694 (788) T KOG0237 615 GLPSSGVHSNGFSLVRKILARSGLSYKDPLPWDSSKTLGDELLAPTKIYVKQLLPLIRKGGVKGLAHITGGGLTENIPRV 694 (788) T ss_pred ECCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHCCCCEEEHHHHHHHHHHCCCCEEEEECCCCCCCCCHHH T ss_conf 54666645572789999998608765788999765641144426507751142788873551225664178721265011 Q ss_pred HHCCCCCEEEEECCCCCCCHHHHH--------------HCCCCCCEEEEECHHHHHHHHHHHHHCCCCEEEEEEECC Q ss_conf 845897089980566689878887--------------298832189999688099899999857984999899879 Q gi|254780970|r 642 TISSAKGMDIILPIEIEKDPKPFL--------------FGEDQGRYVVCISPENQDLVMSEANNKNIPLRYLGKVSG 704 (737) Q Consensus 642 a~as~~G~~I~L~~~~~~d~~~~L--------------FsEs~Gr~ii~V~~~~~~~~~~~~~~~gi~~~~IG~Vt~ 704 (737) +.+++++.||.+...-.....|| |+-..| +|+.|.+++++.+++.+++.+-.+++||+|.. T Consensus 695 -lp~~l~~~id~~twe~p~vF~Wl~~aG~v~~~em~RTfN~GiG-mVlvVs~e~ve~v~kel~~~~e~~~~iG~vv~ 769 (788) T KOG0237 695 -LPDHLGAVIDADTWELPPVFKWLQQAGNVPDSEMARTFNCGIG-MVLVVSPENVERVLKELTNHGEEAYRIGEVVN 769 (788) T ss_pred -CCCCCCEEEECCCCCCCHHHHHHHHCCCCCHHHHHHHHCCCCC-EEEEECHHHHHHHHHHHHHCCEEEEEEEEEEE T ss_conf -6522453784443447378899987389987998877355665-69998688999999998527817999876763 No 69 >pfam00586 AIRS AIR synthase related protein, N-terminal domain. This family includes Hydrogen expression/formation protein HypE, AIR synthases EC:6.3.3.1, FGAM synthase EC:6.3.5.3 and selenide, water dikinase EC:2.7.9.3. The N-terminal domain of AIR synthase forms the dimer interface of the protein, and is suggested as a putative ATP binding domain. Probab=99.39 E-value=1.5e-13 Score=111.86 Aligned_cols=92 Identities=39% Similarity=0.577 Sum_probs=73.7 Q ss_pred CCCCCEEEECCCCCCEEEEECCCCCCCHH--CCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHCCCCCCCCCCHHHHHHHH Q ss_conf 35763142037776227973045321000--0100113567897776410278510223221024566686213469999 Q gi|254780970|r 434 GGDAGVIRVEGHETKALAFSSDVTPRYVK--ADPFEGTKQAVAECWRNIIATGAKPLAITDNLNFGNPEKEEIMGQFVHS 511 (737) Q Consensus 434 g~DaaVv~~~~~~~~gia~s~g~~p~~~~--~dP~~ga~~AV~Eal~Nl~a~Ga~Plait~~lnf~~P~~~e~~~ql~~a 511 (737) |+||||+++.. ++.+++|+|....... .+||.+++.+++.+++||++||++|++++++++++.+.+++ ++.+. T Consensus 1 GdDaavi~~~~--~~~lv~ttD~~~~~~~f~~~p~~~g~~av~~~~sDi~a~Ga~P~~~~~~l~~~~~~~~~---~l~~~ 75 (95) T pfam00586 1 GDDAAVIRLGE--GPALVVTTDGVVTHVHFAADPYDGGATAVAGNLSDLAAMGAKPLAFLDGLALGEGLDPE---WLEEI 75 (95) T ss_pred CCEEEEEEECC--CCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCHHHEEEEEECCCCCCHH---HHHHH T ss_conf 97059997099--98899996777888867889899999999999861997499916507989769975236---89999 Q ss_pred HHHHHHHHHHCCCCCCCCC Q ss_conf 9999852210288665443 Q gi|254780970|r 512 VKGIREACQILDFPIVSGN 530 (737) Q Consensus 512 v~gl~d~c~~lgipivgGk 530 (737) ++|+.++|+++|++++||+ T Consensus 76 ~~g~~~~~~~~g~~ivGGd 94 (95) T pfam00586 76 VRGIAEAANEFGVPLVGGD 94 (95) T ss_pred HHHHHHHHHHHCCEEECCC T ss_conf 9999999999299883707 No 70 >KOG0237 consensus Probab=99.38 E-value=1.9e-12 Score=104.34 Aligned_cols=228 Identities=22% Similarity=0.321 Sum_probs=159.1 Q ss_pred CCCCCCCCCEEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEECC-CCC-CCCCCEEEECCCCHH Q ss_conf 225767787436877504578756863888998887458866543087211320355215-676-642211221121135 Q gi|254780970|r 113 LRDIFTMGARPVAAMNSLRFGAIHHPKTKHLLSGVVAGIAGYSNSFGVPTVGGEVEFLPC-YNN-NIIVNTFAAGIAKTN 190 (737) Q Consensus 113 iRDi~~~Ga~p~a~~~~l~~g~~~~p~~~~~~~~~~~G~~~ygN~~G~P~v~G~~~~~~~-y~~-~~~v~~~~~Gi~~~~ 190 (737) ++|+++.||+|++.+|.+..|.++.+ +...+++||+..|-+-|...+||+|.-.|+ |.. .--.+.+++|+++.+ T Consensus 523 vNDii~~gAePl~FLDYfa~gkLd~~----va~~~v~gia~gC~qa~CaLvGGETaEMP~~Y~~g~yDlaG~Avga~e~~ 598 (788) T KOG0237 523 VNDIIVQGAEPLFFLDYFATGKLDVS----VAEQVVKGIAEGCRQAGCALVGGETAEMPGMYAPGEYDLAGFAVGAVERT 598 (788) T ss_pred HHHHHHCCCCCEEEHHHHHHCCCCHH----HHHHHHHHHHHHHHHCCCEEECCCCCCCCCCCCCCCCCCCCCEEEEEECC T ss_conf 11475517753002223530650417----89999999998776506512156323477888988315455167666226 Q ss_pred HHHHCCC-CCCCCEEEEECCCCCCCCCHHHHHHH--HHHCC-CCC--CCC---CCCCCCCHHHH--HHHHHHHHHHCCCC Q ss_conf 5431122-56897799965765644410466667--77286-533--344---43326798999--88899987401138 Q gi|254780970|r 191 AIFSSKA-RGIGLPLVYLGAKTGRDGIGGASMAS--EEFGE-NIA--KKR---PTVQVGDPFTG--KCLLEACLELMNTD 259 (737) Q Consensus 191 ~~~~~~~-~~~Gd~ii~~G~~tg~dGi~Ga~~sS--~~~~~-~~~--~~~---~~VQ~gdp~~e--k~~~~~~~~~~~~~ 259 (737) .+..... ..+||.++ |-+ .+|+|-.-||- +.+.. ... +.. .++--||..+. |.-...++-++.++ T Consensus 599 ~iLp~~~ei~~GDVll--GL~--SsGvHSNGfSLvrkil~~~~l~~~~~~P~~~~~tlGd~LL~pTkiYvk~ll~~i~~g 674 (788) T KOG0237 599 DILPKLNEIVAGDVLL--GLP--SSGVHSNGFSLVRKILARSGLSYKDPLPWDSSKTLGDELLAPTKIYVKQLLPLIRKG 674 (788) T ss_pred CCCCCCCCCCCCCEEE--ECC--CCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHCCCCEEEHHHHHHHHHHC T ss_conf 7577723255686799--546--666455727899999986087657889997656411444265077511427888735 Q ss_pred CCHHCCCCCCCCHHHHHHHHHCCCCCCEEEEEHHHCCCC-------CC-CCHHHHHHC-CCCCEEEEEEECCCHHHHHHH Q ss_conf 301103477520111223320147752899620220466-------78-999898617-888508999954675889998 Q gi|254780970|r 260 AVIAIQDMGAAGLTCSAIEMGNQGNLGITLDLDKVPTCE-------EG-MTAYEMMLS-ESQERMLMILNPEKQHKAQEI 330 (737) Q Consensus 260 ~i~~~~D~gaGGl~~a~~Ema~~~~~G~~i~l~~vp~~~-------~~-l~~~ei~~s-EsQeR~~~~v~~~~~~~~~~i 330 (737) +++++-.++.|||.-.++.+.. ..+|+.||.+.--+.. .| ++-.|+.-- -.-=-|+++|.||+.+++++- T Consensus 675 ~vkg~AHITGGGl~ENipRvlp-~~l~~~id~~twe~p~vF~Wl~~aG~v~~~em~RTfN~GiGmVlvVs~e~ve~v~ke 753 (788) T KOG0237 675 GVKGLAHITGGGLTENIPRVLP-DHLGAVIDADTWELPPVFKWLQQAGNVPDSEMARTFNCGIGMVLVVSPENVERVLKE 753 (788) T ss_pred CCCEEEEECCCCCCCCCHHHCC-CCCCEEEECCCCCCCHHHHHHHHCCCCCHHHHHHHHCCCCCEEEEECHHHHHHHHHH T ss_conf 5122566417872126501165-224537844434473788999873899879988773556656999868899999999 Q ss_pred HHHHCCEEECCCCEEECCC Q ss_conf 8870971311221210443 Q gi|254780970|r 331 LNKWGLHFSIIGITTNDKL 349 (737) Q Consensus 331 ~~~~~~~~~~iG~vt~~~~ 349 (737) .++++-.+.+||+|.+... T Consensus 754 l~~~~e~~~~iG~vv~r~~ 772 (788) T KOG0237 754 LTNHGEEAYRIGEVVNREG 772 (788) T ss_pred HHHCCEEEEEEEEEEECCC T ss_conf 8527817999876763588 No 71 >pfam00586 AIRS AIR synthase related protein, N-terminal domain. This family includes Hydrogen expression/formation protein HypE, AIR synthases EC:6.3.3.1, FGAM synthase EC:6.3.5.3 and selenide, water dikinase EC:2.7.9.3. The N-terminal domain of AIR synthase forms the dimer interface of the protein, and is suggested as a putative ATP binding domain. Probab=99.37 E-value=1.1e-12 Score=106.05 Aligned_cols=92 Identities=40% Similarity=0.716 Sum_probs=81.3 Q ss_pred CCCEEEEEECCCEEEEE---EECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCHHHHHHHHH Q ss_conf 47736898369839999---951788777658888721055810225767787436877504578756863888998887 Q gi|254780970|r 72 GENAGVVDIGGGDCVVF---KMESHNHPSYIEPYQGAATGVGGILRDIFTMGARPVAAMNSLRFGAIHHPKTKHLLSGVV 148 (737) Q Consensus 72 ~~~a~vi~~~~~~~~~~---k~EtHNhPs~i~P~~GAaTg~gG~iRDi~~~Ga~p~a~~~~l~~g~~~~p~~~~~~~~~~ 148 (737) ||||+||++.++..+++ .+.+|+|++. +||.++.+.+...+||+.+|||+|++++++|++++..+ ...+.+.. T Consensus 1 GdDaavi~~~~~~~lv~ttD~~~~~~~f~~-~p~~~g~~av~~~~sDi~a~Ga~P~~~~~~l~~~~~~~---~~~l~~~~ 76 (95) T pfam00586 1 GDDAAVIRLGEGPALVVTTDGVVTHVHFAA-DPYDGGATAVAGNLSDLAAMGAKPLAFLDGLALGEGLD---PEWLEEIV 76 (95) T ss_pred CCEEEEEEECCCCEEEEEECCCCCCCCCCC-CHHHHHHHHHHHHHHHHHHCCCCHHHEEEEEECCCCCC---HHHHHHHH T ss_conf 970599970999889999677788886788-98999999999998619974999165079897699752---36899999 Q ss_pred HHHHHHHHHCCCCCCCCEE Q ss_conf 4588665430872113203 Q gi|254780970|r 149 AGIAGYSNSFGVPTVGGEV 167 (737) Q Consensus 149 ~G~~~ygN~~G~P~v~G~~ 167 (737) +|++++++++|+|++||+| T Consensus 77 ~g~~~~~~~~g~~ivGGdT 95 (95) T pfam00586 77 RGIAEAANEFGVPLVGGDT 95 (95) T ss_pred HHHHHHHHHHCCEEECCCC T ss_conf 9999999992998837079 No 72 >KOG3939 consensus Probab=99.31 E-value=2.8e-10 Score=89.43 Aligned_cols=252 Identities=19% Similarity=0.243 Sum_probs=154.7 Q ss_pred CCCEEEECCCCCCEEEEECCCCCCCHHCCCHHHHHHHHHHHHHHHHCCCCCH-------HHHHHHCCCCCCCCCCHHHHH Q ss_conf 7631420377762279730453210000100113567897776410278510-------223221024566686213469 Q gi|254780970|r 436 DAGVIRVEGHETKALAFSSDVTPRYVKADPFEGTKQAVAECWRNIIATGAKP-------LAITDNLNFGNPEKEEIMGQF 508 (737) Q Consensus 436 DaaVv~~~~~~~~gia~s~g~~p~~~~~dP~~ga~~AV~Eal~Nl~a~Ga~P-------lait~~lnf~~P~~~e~~~ql 508 (737) |++|+.+.+ .+..++-|+|+.-..+ .|||.++|.|.+..++|++|||... ++++..+ ..-+.... . T Consensus 2 d~~viplrh-~gl~lvqttdffyp~v-ddpy~mgriacanvlsdlyamgvtecdnmlmll~vs~~~--~ekerd~v---~ 74 (312) T KOG3939 2 DCCVIPLRH-GGLLLVQTTDFFYPLV-DDPYMMGRIACANVLSDLYAMGVTECDNMLMLLSVSTSM--SEKERDVV---I 74 (312) T ss_pred CCEEEEECC-CCEEEEEEECEEEECC-CCHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHH--HHHHHCHH---H T ss_conf 716974046-9768998503044525-786776578998888888872532105230222441110--25653320---6 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC-CCCCEEEEECCCCCCCCCEEECCCCCCCCCHHHH Q ss_conf 999999985221028866544320011236866676311122223-3352023310214389807513776655432678 Q gi|254780970|r 509 VHSVKGIREACQILDFPIVSGNVSFYNETNGQSIFPTPTIAGVGI-LPDYSLMTRIDSAHEGDLILMIGNDGCHLDCSMY 587 (737) Q Consensus 509 ~~av~gl~d~c~~lgipivgGkvSl~n~t~~~~I~pTpvi~~vG~-v~d~~~~it~~~k~~Gd~I~liG~~~~~LggS~~ 587 (737) --.++|..|++.+.|+|+.||. + .+.|-.++++++. +--.+..+-++.+.+||.+++.+..+....--.+ T Consensus 75 pl~~~gfkdaa~e~gt~v~ggq------t---v~npw~~igGVatsVcq~ne~i~pdnAvpGdvlvLTkplg~qvAv~~h 145 (312) T KOG3939 75 PLIIQGFKDAAEEAGTPVTGGQ------T---VINPWCIIGGVATSVCQPNEIIMPDNAVPGDVLVLTKPLGGQVAVNAH 145 (312) T ss_pred HHHHHHHHHHHHHCCCCEECCE------E---ECCCCEEECCCCCCCCCCCCEECCCCCCCCCEEEEECCCCCEEEHHHH T ss_conf 8888876778973598446670------5---417403445541233374310055677776369993267752112798 Q ss_pred HHHHHC-----CCCCCCCCCCHHHHHHHHHHHH---H-------HHHCCCEEEEEECCCCHHHHHHHHHHHCC--CCCEE Q ss_conf 888611-----6789996479899999999999---9-------99959806994648657999999998458--97089 Q gi|254780970|r 588 SLECAS-----SNIGPPPKVDCHLEKNHGFFVL---S-------MINAKKITACHDISTGGLIITLAEMTISS--AKGMD 650 (737) Q Consensus 588 ~~~~~~-----~~~g~~p~~d~~~~k~~~~~v~---~-------li~~g~I~S~HDiSdGGL~~aL~EMa~as--~~G~~ 650 (737) .....+ ...-.+++ .+.++....++. . |+..-.++|+.||+..||+.+---++-.- .+-+. T Consensus 146 ~wi~~~~ek~~~~~l~vs~--~die~ay~~a~~~M~~Lnr~aA~lmhkynaHaaTDItGFgllGHaqnLa~qqk~~v~f~ 223 (312) T KOG3939 146 QWIDNQPEKWNKLKLEVSD--EDIEKAYEEAMKSMARLNRNAAGLMHKYNAHAATDITGFGLLGHAQNLAKQQKNEVSFQ 223 (312) T ss_pred HHHHCCCCEEEEEEEEECH--HHHHHHHHHHHHHHHHHCHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHCCCEEE T ss_conf 8876175101124555258--88999999997456430446654321102232224233111345778999862463021 Q ss_pred EE-ECCCCCCCHHHHH----H-------CCCCCCEEEEECHHHHHHHHHHHHHCCC---CEEEEEEECCC Q ss_conf 98-0566689878887----2-------9883218999968809989999985798---49998998798 Q gi|254780970|r 651 II-LPIEIEKDPKPFL----F-------GEDQGRYVVCISPENQDLVMSEANNKNI---PLRYLGKVSGS 705 (737) Q Consensus 651 I~-L~~~~~~d~~~~L----F-------sEs~Gr~ii~V~~~~~~~~~~~~~~~gi---~~~~IG~Vt~~ 705 (737) |. ||.....+....+ | .|+.|.+++..+.++...|-+.+++... .+..||.|+.. T Consensus 224 i~~LPiI~km~~vska~G~~f~l~qGts~ETsGGlLIclP~eqaakfcaei~s~k~gegqAWIIGiVekg 293 (312) T KOG3939 224 IHKLPIIAKMAEVSKACGNMFKLRQGTSAETSGGLLICLPREQAAKFCAEIKSPKNGEGQAWIIGIVEKG 293 (312) T ss_pred ECCCHHHHHHHHHHHHCCCCCEECCCCCCCCCCCEEEECCHHHHHHHHHHHCCCCCCCCCEEEEEEEECC T ss_conf 0257056658899875264200014777667785599735889999998745876688734799998548 No 73 >TIGR01857 FGAM-synthase phosphoribosylformylglycinamidine synthase; InterPro: IPR010141 This entry represents a single-molecule form of phosphoribosylformylglycinamidine synthase, also called FGAM synthase, an enzyme of purine de novo biosynthesis, which represent a second clade of the enzymes found in Clostridia, Bifidobacteria and Streptococcus species. This enzyme performs the fourth step in IMP biosynthesis (the precursor of all purines) from PRPP.. Probab=99.09 E-value=3.9e-11 Score=95.33 Aligned_cols=347 Identities=18% Similarity=0.246 Sum_probs=189.0 Q ss_pred ECCCHHHHHHHHHHHCCEEECCCCEEECCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCCCCCCCCCCCCCHH Q ss_conf 54675889998887097131122121044310002586443443210146650114555315655510123468725588 Q gi|254780970|r 320 NPEKQHKAQEILNKWGLHFSIIGITTNDKLFRVIHRGEEVANLPIKALSDEAPEYDRAWCEPITIPPSNPQLLCSDEDCT 399 (737) Q Consensus 320 ~~~~~~~~~~i~~~~~~~~~~iG~vt~~~~~~v~~~g~~v~dlp~~~L~~~~P~~~r~~~~~~~~~~~~~~~~~~~~d~~ 399 (737) +=+|+.-+++.++..+=.- +.|+=+.+--||- +.|- |-++. ..++...-+...-...++ T Consensus 193 ~l~DL~fiq~YFk~~~R~P----T~TEiKVlDTYWS-------------DHCR--HTTFE--T~l~~v~~~~skf~~~~~ 251 (1279) T TIGR01857 193 SLEDLKFIQDYFKEIGRNP----TITEIKVLDTYWS-------------DHCR--HTTFE--TELKEVTIEDSKFSKQLK 251 (1279) T ss_pred CHHHHHHHHHHHCCCCCCC----CEEEEEEECCCCC-------------CCCC--CCCCC--CEEEEEECCCCCCCHHHH T ss_conf 2789998876612378988----6156644326657-------------8851--57544--344047648642213789 Q ss_pred HHHHHHHHCCCCCCHHHHHHH--CCCCCCC---CCC-----CCCC--CC---------CCEE---EECCCC-CC-----E Q ss_conf 999998507321010356531--1210001---100-----0013--57---------6314---203777-62-----2 Q gi|254780970|r 400 KSLLKLLSSPNLSSRRWVYEQ--YDTMIQS---NSI-----QLPG--GD---------AGVI---RVEGHE-TK-----A 449 (737) Q Consensus 400 ~~l~~vL~~pnV~SK~~i~~q--yD~~V~g---~tv-----~~Pg--~D---------aaVv---~~~~~~-~~-----g 449 (737) ++..+-|. +|+.|++. -.+.|.- .|+ .+-| +| |+|- .+++.. .+ . T Consensus 252 ~~y~~Y~~-----~r~~l~~~nln~k~v~LMDmATI~~kylkk~G~Ld~LevSeEiNACSv~v~vdv~~~~GkkaiE~wL 326 (1279) T TIGR01857 252 KAYEDYLA-----MREELGRENLNEKPVSLMDMATIVAKYLKKNGKLDDLEVSEEINACSVEVEVDVDGVEGKKAIEEWL 326 (1279) T ss_pred HHHHHHHH-----HHHHHCHHHHCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEEEEEEECCCCCCCHHHHHH T ss_conf 98989999-----8876222111047862003577887876526855887511031423799999852876740243365 Q ss_pred EEEECCCCCCCHHCCCHHHHHHHHHHHHHHHHC-----------CCC-CHHH-HHHHCCCCCCCCCCHHHHH--H-HHHH Q ss_conf 797304532100001001135678977764102-----------785-1022-3221024566686213469--9-9999 Q gi|254780970|r 450 LAFSSDVTPRYVKADPFEGTKQAVAECWRNIIA-----------TGA-KPLA-ITDNLNFGNPEKEEIMGQF--V-HSVK 513 (737) Q Consensus 450 ia~s~g~~p~~~~~dP~~ga~~AV~Eal~Nl~a-----------~Ga-~Pla-it~~lnf~~P~~~e~~~ql--~-~av~ 513 (737) +-+.-.-+.+...|-|+=||+.+++.|+||=.+ +|+ +|+. |.=.+ .-++-|= . .|.. T Consensus 327 LmFKNETHNHPTEIEPFGGAATClGGAIRDPLSGRSYVYQAmRvtGaadpt~~v~ETl-------~GKLPQ~ki~~tAA~ 399 (1279) T TIGR01857 327 LMFKNETHNHPTEIEPFGGAATCLGGAIRDPLSGRSYVYQAMRVTGAADPTVPVSETL-------KGKLPQRKITTTAAH 399 (1279) T ss_pred HHCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCHHHEEEEEEEECCCCCCCCHHHHC-------CCCCCCHHHHHHHCC T ss_conf 4214577865823378874445357541377763003011001105778855401011-------588741133220113 Q ss_pred HHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCC-CCCCCCCEEEEECCCCCCCCCEEECCCCC--CCCCHHH-HHH Q ss_conf 9985221028866544320011236866676311122-22333520233102143898075137766--5543267-888 Q gi|254780970|r 514 GIREACQILDFPIVSGNVSFYNETNGQSIFPTPTIAG-VGILPDYSLMTRIDSAHEGDLILMIGNDG--CHLDCSM-YSL 589 (737) Q Consensus 514 gl~d~c~~lgipivgGkvSl~n~t~~~~I~pTpvi~~-vG~v~d~~~~it~~~k~~Gd~I~liG~~~--~~LggS~-~~~ 589 (737) |=+-+-+ .|-+-.|-||=+=+ ..-.-.-.=+++ +|-.| ..+.++.. ..+||.|+|+|... +..||.- -++ T Consensus 400 GYSSYGN--QIGLAT~~V~EiYH--pGY~AKRMEvGAVvAAtP-~~nV~R~~-P~~GD~iILLGGkTGRDGiGGATGSSK 473 (1279) T TIGR01857 400 GYSSYGN--QIGLATGQVSEIYH--PGYVAKRMEVGAVVAATP-KENVVREK-PEPGDVIILLGGKTGRDGIGGATGSSK 473 (1279) T ss_pred CCCHHHH--HHHHHHCCEEEEEC--CCCEEEEEEECCEEEECC-CCCCEECC-CCCCCEEEEECCCCCCCCCCCCCCCCH T ss_conf 2101222--46553161347765--785012453001445058-88735267-898878998178527775567768856 Q ss_pred H----HHCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHCCCCCEEEEECCCC----CCCH Q ss_conf 8----6116789996479899999999999999959806994648657999999998458970899805666----8987 Q gi|254780970|r 590 E----CASSNIGPPPKVDCHLEKNHGFFVLSMINAKKITACHDISTGGLIITLAEMTISSAKGMDIILPIEI----EKDP 661 (737) Q Consensus 590 ~----~~~~~~g~~p~~d~~~~k~~~~~v~~li~~g~I~S~HDiSdGGL~~aL~EMa~as~~G~~I~L~~~~----~~d~ 661 (737) . -.......+..=|.-.||+.++..+.-....+|.=|.|=..||+-+|+-|+|- |++|||+.-| +.|- T Consensus 474 ~h~~eS~~~~GAEVQKGNAp~ERKiQRLFRn~~v~~LIKkcNDFGAGGVSVAiGELAd----G~~vdLn~VP~KYeGLdG 549 (1279) T TIGR01857 474 EHTVESLELCGAEVQKGNAPEERKIQRLFRNGNVTRLIKKCNDFGAGGVSVAIGELAD----GLEVDLNKVPKKYEGLDG 549 (1279) T ss_pred HHHHHHHHHCCCEEECCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCEEEEEEECCC----CEEEECCCCCCCEECCCC T ss_conf 6636868646860303887431111000467414556120376588732686522027----407515758742107885 Q ss_pred HHHHHCCCCCCEEEEECHHHHHHHHHHHHHCCCCEEEEEEECCC-EEEE Q ss_conf 88872988321899996880998999998579849998998798-6898 Q gi|254780970|r 662 KPFLFGEDQGRYVVCISPENQDLVMSEANNKNIPLRYLGKVSGS-TLSI 709 (737) Q Consensus 662 ~~~LFsEs~Gr~ii~V~~~~~~~~~~~~~~~gi~~~~IG~Vt~~-~l~i 709 (737) -++--||||.|+-|.|.+|++++|++.+++.++.+++|++||++ +|.. T Consensus 550 TELAISESQERMAVvv~~Ed~d~F~~~~~eENL~AtvvA~VTd~~rL~m 598 (1279) T TIGR01857 550 TELAISESQERMAVVVSKEDVDKFLKYAEEENLEATVVATVTDKPRLVM 598 (1279) T ss_pred CEEEEECCCCCCEEECCCCCHHHHHHHHHHCCCCEEEEEEEECCCCEEE T ss_conf 1331201455114672865657899875323852289999956997067 No 74 >KOG3939 consensus Probab=96.88 E-value=0.019 Score=35.80 Aligned_cols=258 Identities=20% Similarity=0.253 Sum_probs=141.1 Q ss_pred CEEEEEECC-CEEEEEEECCCCCCCCCCCCC-CCCCCCCCCCCCCCCCCCEEEE-EEEECCCCCCCCCCHHHHHHH-HHH Q ss_conf 736898369-839999951788777658888-7210558102257677874368-775045787568638889988-874 Q gi|254780970|r 74 NAGVVDIGG-GDCVVFKMESHNHPSYIEPYQ-GAATGVGGILRDIFTMGARPVA-AMNSLRFGAIHHPKTKHLLSG-VVA 149 (737) Q Consensus 74 ~a~vi~~~~-~~~~~~k~EtHNhPs~i~P~~-GAaTg~gG~iRDi~~~Ga~p~a-~~~~l~~g~~~~p~~~~~~~~-~~~ 149 (737) |.+||...- ++.++-+. --=.|--=|||= |-.++. .++-|..+||-.--- ++.-|..+.--..+.|..... +++ T Consensus 2 d~~viplrh~gl~lvqtt-dffyp~vddpy~mgriaca-nvlsdlyamgvtecdnmlmll~vs~~~~ekerd~v~pl~~~ 79 (312) T KOG3939 2 DCCVIPLRHGGLLLVQTT-DFFYPLVDDPYMMGRIACA-NVLSDLYAMGVTECDNMLMLLSVSTSMSEKERDVVIPLIIQ 79 (312) T ss_pred CCEEEEECCCCEEEEEEE-CEEEECCCCHHHHHHHHHH-HHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCHHHHHHHH T ss_conf 716974046976899850-3044525786776578998-88888887253210523022244111025653320688888 Q ss_pred HHHHHHHHCCCCCCCCEEEEECCCCCCCCCCEEEECC----CCHHHHHHCCCCCCCCEEEEECCCCCCCCCHHHHHHHHH Q ss_conf 5886654308721132035521567664221122112----113554311225689779996576564441046666777 Q gi|254780970|r 150 GIAGYSNSFGVPTVGGEVEFLPCYNNNIIVNTFAAGI----AKTNAIFSSKARGIGLPLVYLGAKTGRDGIGGASMASEE 225 (737) Q Consensus 150 G~~~ygN~~G~P~v~G~~~~~~~y~~~~~v~~~~~Gi----~~~~~~~~~~~~~~Gd~ii~~G~~tg~dGi~Ga~~sS~~ 225 (737) |-.|...+-|.|+.||.+..+ |... ..|+ .+.++++....+-|||.+++.+.. |---| .++.+ T Consensus 80 gfkdaa~e~gt~v~ggqtv~n------pw~~--igGVatsVcq~ne~i~pdnAvpGdvlvLTkpl----g~qvA-v~~h~ 146 (312) T KOG3939 80 GFKDAAEEAGTPVTGGQTVIN------PWCI--IGGVATSVCQPNEIIMPDNAVPGDVLVLTKPL----GGQVA-VNAHQ 146 (312) T ss_pred HHHHHHHHCCCCEECCEEECC------CCEE--ECCCCCCCCCCCCEECCCCCCCCCEEEEECCC----CCEEE-HHHHH T ss_conf 767789735984466705417------4034--45541233374310055677776369993267----75211-27988 Q ss_pred --H--CCCCCCCCCCCCCCCHHHHHHHHHHHHHH-----CCCC-----CCHHCCCCCCCCHHHHHHHHHCCCCCCEEEEE Q ss_conf --2--86533344433267989998889998740-----1138-----30110347752011122332014775289962 Q gi|254780970|r 226 --F--GENIAKKRPTVQVGDPFTGKCLLEACLEL-----MNTD-----AVIAIQDMGAAGLTCSAIEMGNQGNLGITLDL 291 (737) Q Consensus 226 --~--~~~~~~~~~~VQ~gdp~~ek~~~~~~~~~-----~~~~-----~i~~~~D~gaGGl~~a~~Ema~~~~~G~~i~l 291 (737) . .|....-. +-+.+.-.|+--.+++..+ ...+ -.++..|+..-||..-..-++..-.-.+.+-. T Consensus 147 wi~~~~ek~~~~~--l~vs~~die~ay~~a~~~M~~Lnr~aA~lmhkynaHaaTDItGFgllGHaqnLa~qqk~~v~f~i 224 (312) T KOG3939 147 WIDNQPEKWNKLK--LEVSDEDIEKAYEEAMKSMARLNRNAAGLMHKYNAHAATDITGFGLLGHAQNLAKQQKNEVSFQI 224 (312) T ss_pred HHHCCCCEEEEEE--EEECHHHHHHHHHHHHHHHHHHCHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHCCCEEEE T ss_conf 8761751011245--55258889999999974564304466543211022322242331113457789998624630210 Q ss_pred HHHCCCCCCCCHHHHHH-----------CCCCCEEEEEEECCCHHHHHHHHHH----HCCEEECCCCEEECCCC Q ss_conf 02204667899989861-----------7888508999954675889998887----09713112212104431 Q gi|254780970|r 292 DKVPTCEEGMTAYEMML-----------SESQERMLMILNPEKQHKAQEILNK----WGLHFSIIGITTNDKLF 350 (737) Q Consensus 292 ~~vp~~~~~l~~~ei~~-----------sEsQeR~~~~v~~~~~~~~~~i~~~----~~~~~~~iG~vt~~~~~ 350 (737) .++|+-.. |...--.| .|+-.-.+++.+.|+..+|.+-.+. ++ .+-+||.|+.+++- T Consensus 225 ~~LPiI~k-m~~vska~G~~f~l~qGts~ETsGGlLIclP~eqaakfcaei~s~k~geg-qAWIIGiVekgn~t 296 (312) T KOG3939 225 HKLPIIAK-MAEVSKACGNMFKLRQGTSAETSGGLLICLPREQAAKFCAEIKSPKNGEG-QAWIIGIVEKGNST 296 (312) T ss_pred CCCHHHHH-HHHHHHHCCCCCEECCCCCCCCCCCEEEECCHHHHHHHHHHHCCCCCCCC-CEEEEEEEECCCCC T ss_conf 25705665-88998752642000147776677855997358899999987458766887-34799998548741 No 75 >PRK07226 fructose-bisphosphate aldolase; Provisional Probab=84.58 E-value=0.78 Score=24.69 Aligned_cols=49 Identities=18% Similarity=0.451 Sum_probs=33.2 Q ss_pred HHHHHHHHHHHHCCCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 56789777641027851022322102456668621346999999998522102886654 Q gi|254780970|r 470 KQAVAECWRNIIATGAKPLAITDNLNFGNPEKEEIMGQFVHSVKGIREACQILDFPIVS 528 (737) Q Consensus 470 ~~AV~Eal~Nl~a~Ga~Plait~~lnf~~P~~~e~~~ql~~av~gl~d~c~~lgipivg 528 (737) ..+|-||+| .||.-++ ...|||++.+++.+-+ +..+.+.|..+|+|++. T Consensus 96 ~~sVeeAvr----lGAdaV~--~~v~~Gs~~E~~~l~~----l~~l~~ea~~~GlPll~ 144 (266) T PRK07226 96 VGTVEDAIK----LGADAVS--VHVNVGSETEAEMLED----LGRIARECEEWGMPLLA 144 (266) T ss_pred ECCHHHHHH----CCCCEEE--EEEECCCCCHHHHHHH----HHHHHHHHHHHCCCEEE T ss_conf 214999985----5877899--9854799837999999----99999999985995699 No 76 >PRK08227 aldolase; Validated Probab=83.65 E-value=0.95 Score=24.11 Aligned_cols=70 Identities=11% Similarity=0.166 Sum_probs=40.7 Q ss_pred CEEEEECCCCCCCHHCCCH-HHHHHHHHHHHHHHHCCCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 2279730453210000100-113567897776410278510223221024566686213469999999985221028866 Q gi|254780970|r 448 KALAFSSDVTPRYVKADPF-EGTKQAVAECWRNIIATGAKPLAITDNLNFGNPEKEEIMGQFVHSVKGIREACQILDFPI 526 (737) Q Consensus 448 ~gia~s~g~~p~~~~~dP~-~ga~~AV~Eal~Nl~a~Ga~Plait~~lnf~~P~~~e~~~ql~~av~gl~d~c~~lgipi 526 (737) +++++-....+.+.. +|. ..-..+|-||+| +||.-+++. +|+|++.+++.+ +-+..+.+.|..+|+|+ T Consensus 102 ~~lil~~s~~t~~~~-~~~~k~lv~sVeeAvr----lGAdAVsv~--v~iGs~~E~~~l----~~lg~v~~e~~~~GmPl 170 (291) T PRK08227 102 KPVVLRASGGNSILK-ELSNEAVAVDMEDAVR----LNVCAVAAQ--VFIGSEYETQSI----KNIIQLVDAGLRYGMPT 170 (291) T ss_pred CEEEEEEECCCCCCC-CCCCCEEEECHHHHHH----CCCCEEEEE--EECCCCHHHHHH----HHHHHHHHHHHHCCCCE T ss_conf 407999806876667-8788755534999986----799789998--635993289999----99999999999829987 Q ss_pred CC Q ss_conf 54 Q gi|254780970|r 527 VS 528 (737) Q Consensus 527 vg 528 (737) +. T Consensus 171 la 172 (291) T PRK08227 171 MA 172 (291) T ss_pred EE T ss_conf 99 No 77 >TIGR02033 D-hydantoinase dihydropyrimidinase; InterPro: IPR011778 This entry represents the D-hydantoinase (dihydropyrimidinase) which primarily converts 5,6-dihydrouracil to 3-ureidopropanoate but also acts on dihydrothymine and hydantoin. The enzyme is a metalloenzyme .. Probab=82.31 E-value=0.94 Score=24.13 Aligned_cols=35 Identities=23% Similarity=0.400 Sum_probs=17.0 Q ss_pred CCCCCCCEEEECCCCCCEEEEECCCCCCCHHCCCHHH Q ss_conf 0135763142037776227973045321000010011 Q gi|254780970|r 432 LPGGDAGVIRVEGHETKALAFSSDVTPRYVKADPFEG 468 (737) Q Consensus 432 ~Pg~DaaVv~~~~~~~~gia~s~g~~p~~~~~dP~~g 468 (737) .|||||=++-.+.+.. -+++....-..+..+||+| T Consensus 397 ~~GSDADivi~DP~~~--~~I~~~~~h~~~dY~pfEG 431 (466) T TIGR02033 397 AVGSDADIVIWDPRRS--TVISAETHHDNADYTPFEG 431 (466) T ss_pred CCCCCCCEEEECCCCC--EEHHHHHHHHHCCCCCCCC T ss_conf 0246735689889964--5302366653148860257 No 78 >TIGR00608 radc DNA repair protein RadC; InterPro: IPR001405 RadC is a DNA repair protein found in many bacteria. They share a region of similarity at the C-terminus that contains two perfectly conserved histidine residues.; GO: 0006281 DNA repair. Probab=77.74 E-value=0.83 Score=24.50 Aligned_cols=25 Identities=8% Similarity=0.013 Sum_probs=10.8 Q ss_pred HHHHHHHHCCCCCCEEEEEHHHCCCCC Q ss_conf 112233201477528996202204667 Q gi|254780970|r 273 TCSAIEMGNQGNLGITLDLDKVPTCEE 299 (737) Q Consensus 273 ~~a~~Ema~~~~~G~~i~l~~vp~~~~ 299 (737) .-|+.||+.+-- +.+ .+.+.+++.| T Consensus 79 lka~~El~~R~~-~~~-~~~~~~i~sP 103 (223) T TIGR00608 79 LKAIVELAKRYA-KSR-MLERPVIRSP 103 (223) T ss_pred HHHHHHHHHHHH-CCC-CCCCCCCCCH T ss_conf 999999986775-244-2146664898 No 79 >TIGR01950 SoxR redox-sensitive transcriptional activator SoxR; InterPro: IPR010211 SoxR is a MerR-family homodimeric transcription factor with a 2Fe-2S cluster in each monomer. The motif CIGCGCxxxxxC is conserved. Oxidation of the iron-sulphur cluster activates SoxR. The physiological role in Escherichia coli is response to oxidative stress. It is activated by superoxide, singlet oxygen, nitric oxide (NO), and hydrogen peroxide. In Escherichia coli, SoxR increases expression of transcription factor SoxS; different downstream targets may exist in other species.; GO: 0003677 DNA binding, 0003700 transcription factor activity, 0005506 iron ion binding, 0006118 electron transport, 0006355 regulation of transcription DNA-dependent, 0006979 response to oxidative stress. Probab=74.43 E-value=5.1 Score=19.09 Aligned_cols=44 Identities=20% Similarity=0.356 Sum_probs=25.6 Q ss_pred HHHHHCCCCHHHHHHHHHHH--CCCCCHHHHHHHHHHHCCCCCCCC Q ss_conf 89986199989999999995--789718899998533001107445 Q gi|254780970|r 9 EILTAHGLTKDECDQIIHIL--KRQPTLTEIGIISAMWNEHCSYKS 52 (737) Q Consensus 9 e~~~~~gL~~~e~~~i~~~l--~R~Pt~~El~~~~~~wSEHC~~k~ 52 (737) ++....||+.+|+-.=.-.| +|-||.-++.=+|+.|-|-=-|++ T Consensus 51 k~AQ~vGipL~~I~~ALa~LP~grtPt~~DW~rlS~~W~~~LdeRI 96 (142) T TIGR01950 51 KVAQRVGIPLAEIAEALAELPEGRTPTADDWARLSSQWKEELDERI 96 (142) T ss_pred HHHHHCCCCHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 3233258988899999973468899888889999885018889999 No 80 >cd00958 DhnA Class I fructose-1,6-bisphosphate (FBP) aldolases of the archaeal type (DhnA homologs) found in bacteria and archaea. Catalysis of the enzymes proceeds via a Schiff-base mechanism like other class I aldolases, although this subfamily is clearly divergent based on sequence similarity to other class I and class II (metal dependent) aldolase subfamilies. Probab=71.17 E-value=4.5 Score=19.47 Aligned_cols=56 Identities=16% Similarity=0.291 Sum_probs=36.7 Q ss_pred CCHHHHHHHHHHHHHHHHCCCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 100113567897776410278510223221024566686213469999999985221028866544 Q gi|254780970|r 464 DPFEGTKQAVAECWRNIIATGAKPLAITDNLNFGNPEKEEIMGQFVHSVKGIREACQILDFPIVSG 529 (737) Q Consensus 464 dP~~ga~~AV~Eal~Nl~a~Ga~Plait~~lnf~~P~~~e~~~ql~~av~gl~d~c~~lgipivgG 529 (737) +++..-...|.||+| .||.-++ ..+|||++.+.+. .+-+..+.+.|..+|+|++.= T Consensus 73 ~~~~~l~~sVeeAvr----lGAdaV~--~~v~~Gs~~E~~~----l~~l~~v~~ea~~~G~Pll~~ 128 (235) T cd00958 73 DNDKVLVASVEDAVR----LGADAVG--VTVYVGSEEEREM----LEELARVAAEAHKYGLPLIAW 128 (235) T ss_pred CCHHHHHCCHHHHHH----CCCCEEE--EEEECCCCHHHHH----HHHHHHHHHHHHHHCCCEEEE T ss_conf 517766516999983----5556799--9986898168999----999999999999839978999 No 81 >TIGR02479 FliA_WhiG RNA polymerase sigma factor, FliA/WhiG family; InterPro: IPR012845 The bacterial core RNA polymerase complex, which consists of five subunits, is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme . RNA polymerase recruits alternative sigma factors as a means of switching on specific regulons. Most bacteria express a multiplicity of sigma factors. Two of these factors, sigma-70 (gene rpoD), generally known as the major or primary sigma factor, and sigma-54 (gene rpoN or ntrA) direct the transcription of a wide variety of genes. The other sigma factors, known as alternative sigma factors, are required for the transcription of specific subsets of genes. With regard to sequence similarity, sigma factors can be grouped into two classes, the sigma-54 and sigma-70 families. Sequence alignments of the sigma70 family members reveal four conserved regions that can be further divided into subregions eg. sub-region 2.2, which may be involved in the binding of the sigma factor to the core RNA polymerase; and sub-region 4.2, which seems to harbor a DNA-binding 'helix-turn-helix' motif involved in binding the conserved -35 region of promoters recognized by the major sigma factors , . Most members of this family are the flagellar operon sigma factor FliA, controlling transcription of bacterial flagellar genes by RNA polymerase. An exception is the sigma factor WhiG in the genus Streptomyces, involved in the production of sporulating aerial mycelium.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0016987 sigma factor activity, 0006352 transcription initiation, 0045449 regulation of transcription. Probab=70.95 E-value=4.4 Score=19.55 Aligned_cols=23 Identities=30% Similarity=0.475 Sum_probs=14.7 Q ss_pred HHHHHHHHHHHHCCCCEEEEEEE Q ss_conf 80998999998579849998998 Q gi|254780970|r 680 ENQDLVMSEANNKNIPLRYLGKV 702 (737) Q Consensus 680 ~~~~~~~~~~~~~gi~~~~IG~V 702 (737) ++...+++..=..++....||+| T Consensus 181 eRE~LVl~LYY~eELnLKEIg~V 203 (227) T TIGR02479 181 EREKLVLSLYYYEELNLKEIGEV 203 (227) T ss_pred HHHHHHHHHHHHCCCCHHHHHHH T ss_conf 55788888864127668899988 No 82 >TIGR02634 xylF D-xylose ABC transporter, substrate-binding protein; InterPro: IPR013456 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This entry represents the D-xylose ABC transporter substrate-binding protein which is a periplasmic (when in Gram-negative bacteria) binding protein for D-xylose import by a high-affinity ATP-binding cassette (ABC) transporter.. Probab=70.66 E-value=7.2 Score=18.08 Aligned_cols=129 Identities=17% Similarity=0.219 Sum_probs=67.2 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCEEEEECC-CCCCCCCCEEEECCCCHHHHHHCCCCCCCCEEEEECCCCCCCC--CH Q ss_conf 88998887458866543087211320355215-6766422112211211355431122568977999657656444--10 Q gi|254780970|r 141 KHLLSGVVAGIAGYSNSFGVPTVGGEVEFLPC-YNNNIIVNTFAAGIAKTNAIFSSKARGIGLPLVYLGAKTGRDG--IG 217 (737) Q Consensus 141 ~~~~~~~~~G~~~ygN~~G~P~v~G~~~~~~~-y~~~~~v~~~~~Gi~~~~~~~~~~~~~~Gd~ii~~G~~tg~dG--i~ 217 (737) -..+..++.-+. +-|+|+++-+--..+. ..-...-.=.-||-+....+....+ .|+.+++=|+||+.-- +- T Consensus 65 ~~vl~~~~~~A~----~~GI~V~aYDRLI~~aD~~fY~sFDN~~VG~~QA~a~l~~~P--~GnY~l~~GsP~D~NA~~lR 138 (307) T TIGR02634 65 GEVLSNAVQEAK----KEGIKVLAYDRLIEDADIDFYLSFDNEKVGEMQAKAVLAAAP--KGNYFLLGGSPTDNNAKLLR 138 (307) T ss_pred CHHHHHHHHHHH----HCCCEEEEEECCCCCCCCCEEEEECHHHHHHHHHHHHHHCCC--CCCEEEEECCCCCHHHHHHH T ss_conf 603788999998----779808873011133552002230435788898888863167--87578841778972279988 Q ss_pred HHHHHHHHHCCCCCCCCCCCC-CCC-------HHHHHHHHHHHHHHCCCCC--CHHCCCCCCCCHHHHHHHH Q ss_conf 466667772865333444332-679-------8999888999874011383--0110347752011122332 Q gi|254780970|r 218 GASMASEEFGENIAKKRPTVQ-VGD-------PFTGKCLLEACLELMNTDA--VIAIQDMGAAGLTCSAIEM 279 (737) Q Consensus 218 Ga~~sS~~~~~~~~~~~~~VQ-~gd-------p~~ek~~~~~~~~~~~~~~--i~~~~D~gaGGl~~a~~Em 279 (737) .+-+ +.|.-... ..-|. +|+ |....+.+|-.|.+.+++. |++-.|-+|||...|+.== T Consensus 139 ~GQ~--~VL~~~id--~G~ikivG~~wvd~Wlp~~A~~~~E~~LTan~N~~D~vVASNDaTAGGai~AL~AQ 206 (307) T TIGR02634 139 EGQM--KVLQPAID--SGDIKIVGDQWVDGWLPENALKIMENALTANDNKVDAVVASNDATAGGAIQALSAQ 206 (307) T ss_pred HHHH--HHHHHHHC--CCCEEEECCHHHCCCCHHHHHHHHHHHHHCCCCCCCEEEECCCCCHHHHHHHHHHC T ss_conf 6568--88656431--68748961322032256889999998851026860169863877634799998616 No 83 >TIGR01034 metK S-adenosylmethionine synthetase; InterPro: IPR002133 S-adenosylmethionine synthetase (MAT, 2.5.1.6 from EC) is the enzyme that catalyzes the formation of S-adenosylmethionine (AdoMet) from methionine and ATP . AdoMet is an important methyl donor for transmethylation and is also the propylamino donor in polyamine biosynthesis. In bacteria there is a single isoform of AdoMet synthetase (gene metK), there are two in budding yeast (genes SAM1 and SAM2) and in mammals while in plants there is generally a multigene family. The sequence of AdoMet synthetase is highly conserved throughout isozymes and species. The active sites of both the Escherichia coli and rat liver MAT reside between two subunits, with contributions from side chains of residues from both subunits, resulting in a dimer as the minimal catalytic entity. The side chains that contribute to the ligand binding sites are conserved between the two proteins. In the structures of complexes with the E. coli enzyme, the phosphate groups have the same positions in the (PPi plus Pi) complex and the (ADP plus Pi) complex, and are located at the bottom of a deep cavity with the adenosyl group nearer the entrance .; GO: 0004478 methionine adenosyltransferase activity, 0005524 ATP binding, 0006730 one-carbon compound metabolic process. Probab=68.89 E-value=7.2 Score=18.08 Aligned_cols=101 Identities=20% Similarity=0.283 Sum_probs=61.4 Q ss_pred CCCCCCHHHHHHHHHHCCCCCCHHHHHHHCCCCC--CCCCCC-CCCCCCCEEE--ECCCCCCE------EEEECCCCCCC Q ss_conf 6872558899999850732101035653112100--011000-0135763142--03777622------79730453210 Q gi|254780970|r 392 LCSDEDCTKSLLKLLSSPNLSSRRWVYEQYDTMI--QSNSIQ-LPGGDAGVIR--VEGHETKA------LAFSSDVTPRY 460 (737) Q Consensus 392 ~~~~~d~~~~l~~vL~~pnV~SK~~i~~qyD~~V--~g~tv~-~Pg~DaaVv~--~~~~~~~g------ia~s~g~~p~~ 460 (737) ..+..+|++++.+=.=.|.|..++|+-+.--..| +|+=|+ +|.+|++++= +--+++=| -|+| |=.|-+ T Consensus 201 ~iS~~~Lr~~i~E~iI~p~lp~~~~ld~~t~~~iNPtGrFViGGP~gDtGLTGRKIIVDTYGG~~~HGGGAFS-GKDpsK 279 (393) T TIGR01034 201 DISQKELREAIIEEIIKPVLPDAEYLDEKTKFFINPTGRFVIGGPMGDTGLTGRKIIVDTYGGWARHGGGAFS-GKDPSK 279 (393) T ss_pred CCCHHHHHHHHHHHCCCCCCCCHHHCCCCCEEEECCCCEEEECCCCCCCCCCCCEEEEECCCCCCCCCCCCCC-CCCCCC T ss_conf 5477999999875112224385200566886888798315746866577888775888677886467886455-788874 Q ss_pred HHCCCHHHHHHHHHHHHHHHHCCCC-CHHHHHHHCCCC Q ss_conf 0001001135678977764102785-102232210245 Q gi|254780970|r 461 VKADPFEGTKQAVAECWRNIIATGA-KPLAITDNLNFG 497 (737) Q Consensus 461 ~~~dP~~ga~~AV~Eal~Nl~a~Ga-~Plait~~lnf~ 497 (737) ++.+..-+||| || -||+|+|. +..-+..++.+| T Consensus 280 VDRSAAY~ARy-iA---Kn~VAaGlA~r~~vQlsYAIG 313 (393) T TIGR01034 280 VDRSAAYAARY-IA---KNIVAAGLADRCEVQLSYAIG 313 (393) T ss_pred CCHHHHHHHHH-HH---HHHHHHHHHCEEEEEEEEEEC T ss_conf 55038899999-99---875252000231234543312 No 84 >TIGR02103 pullul_strch alpha-1,6-glucosidases, pullulanase-type; InterPro: IPR011839 Members of this protein family include secreted (or membrane-anchored) pullulanases of Gram-negative bacteria and pullulanase-type starch debranching enzymes of plants. Both enzymes hydrolyze alpha-1,6 glycosidic linkages. Pullulan is an unusual, industrially important polysaccharide in which short alpha-1,4 chains (maltotriose) are connected in alpha-1,6 linkages. Enzymes that cleave alpha-1,6 linkages in pullulan and release maltotriose are called pullulanases although pullulan itself may not be the natural substrate. This family is closely homologous to, but architecturally different from the pullulanases of Gram-positive bacteria (IPR011838 from INTERPRO).; GO: 0051060 pullulanase activity, 0005975 carbohydrate metabolic process. Probab=67.99 E-value=6.3 Score=18.46 Aligned_cols=48 Identities=25% Similarity=0.436 Sum_probs=30.3 Q ss_pred CHHHHHHHCCCC---------------CCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCC Q ss_conf 102232210245---------------66686213469999999985221028866544320011236 Q gi|254780970|r 486 KPLAITDNLNFG---------------NPEKEEIMGQFVHSVKGIREACQILDFPIVSGNVSFYNETN 538 (737) Q Consensus 486 ~Plait~~lnf~---------------~P~~~e~~~ql~~av~gl~d~c~~lgipivgGkvSl~n~t~ 538 (737) +++.-+|++||| +|+-+++--+|+++|+++..- +|+-||= |--||+|+ T Consensus 390 ~~v~~~DsyNWGYDPfHy~VPEGSYatdP~G~~Ri~EFR~MVqa~~~~---~GL~Vvm--DVVYNHTn 452 (945) T TIGR02103 390 NEVRDLDSYNWGYDPFHYTVPEGSYATDPEGPKRIKEFREMVQALNKD---TGLNVVM--DVVYNHTN 452 (945) T ss_pred HHHHHHCCCCCCCCCEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHH---CCCEEEE--EEEECHHH T ss_conf 998852677757787001466755575968550267777899998662---5883888--52000011 No 85 >COG4008 Predicted metal-binding transcription factor [Transcription] Probab=65.32 E-value=7.4 Score=18.01 Aligned_cols=10 Identities=20% Similarity=0.258 Sum_probs=3.5 Q ss_pred CCCCCHHHHH Q ss_conf 2785102232 Q gi|254780970|r 482 ATGAKPLAIT 491 (737) Q Consensus 482 a~Ga~Plait 491 (737) |.|+.+-++. T Consensus 11 CrgGd~raLa 20 (153) T COG4008 11 CRGGDFRALA 20 (153) T ss_pred HCCCCCCCEE T ss_conf 2589810232 No 86 >PRK06108 aspartate aminotransferase; Provisional Probab=65.27 E-value=6.5 Score=18.37 Aligned_cols=32 Identities=16% Similarity=0.156 Sum_probs=19.0 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 66686213469999999985221028866544 Q gi|254780970|r 498 NPEKEEIMGQFVHSVKGIREACQILDFPIVSG 529 (737) Q Consensus 498 ~P~~~e~~~ql~~av~gl~d~c~~lgipivgG 529 (737) ||.+|-..---.+-++.+.+.|++.++.|++- T Consensus 166 ~P~NPtG~v~s~e~l~~l~~la~~~~v~iisD 197 (382) T PRK06108 166 SPNNPTGWTASRDDQQAILAHCRRHGLWIVAD 197 (382) T ss_pred CCCCCCCHHCCHHHHHHHHHHHHCCCCEEECH T ss_conf 99699662044788999999876269753533 No 87 >TIGR02356 adenyl_thiF thiazole biosynthesis adenylyltransferase ThiF; InterPro: IPR012731 Members of the HesA/MoeB/ThiF family of proteins (IPR000594 from INTERPRO) include a number of members encoded in the midst of thiamine biosynthetic operons. This mix of known and putative ThiF proteins shows a deep split in phylogenetic trees. The Escherichia coli ThiF and MoeB proteins are seemingly more closely related than the E. coli ThiF and Campylobacter (for example) ThiF. This entry represents the more widely distributed clade of ThiF proteins as found in E. coli.. Probab=62.06 E-value=4.7 Score=19.34 Aligned_cols=43 Identities=23% Similarity=0.384 Sum_probs=26.2 Q ss_pred CCCCCHHHHHHHHHCCCCCCE--EEEEHHHCCCCCCCCHHHHHHCCCC Q ss_conf 775201112233201477528--9962022046678999898617888 Q gi|254780970|r 267 MGAAGLTCSAIEMGNQGNLGI--TLDLDKVPTCEEGMTAYEMMLSESQ 312 (737) Q Consensus 267 ~gaGGl~~a~~Ema~~~~~G~--~i~l~~vp~~~~~l~~~ei~~sEsQ 312 (737) +|+|||.++..++...+|+|= -||-|.|.+ .+|. ..+++.|++ T Consensus 27 iGaGgLGs~~~~~LA~AGVG~i~i~D~D~V~~--SNLq-RQ~Lf~e~D 71 (210) T TIGR02356 27 IGAGGLGSPAALYLAAAGVGTITIVDDDHVDL--SNLQ-RQILFAEED 71 (210) T ss_pred EEECHHHHHHHHHHHHCCCCEEEEEECCEECH--HHCH-HHHHHCHHH T ss_conf 97261456899999828883789985167701--0120-554303244 No 88 >PRK08175 aminotransferase; Validated Probab=61.92 E-value=6.3 Score=18.47 Aligned_cols=31 Identities=10% Similarity=0.105 Sum_probs=17.9 Q ss_pred CCCCCCH-HHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 6668621-3469999999985221028866544 Q gi|254780970|r 498 NPEKEEI-MGQFVHSVKGIREACQILDFPIVSG 529 (737) Q Consensus 498 ~P~~~e~-~~ql~~av~gl~d~c~~lgipivgG 529 (737) +|.+|-. .|. .+-++.+.+.|++.++.+++- T Consensus 172 ~P~NPTG~v~s-~e~l~~i~~~a~~~~i~visD 203 (395) T PRK08175 172 FPSNPTAQCVE-LEFFEKVVALAKRYDVLVVHD 203 (395) T ss_pred CCCCCCCCCCC-HHHHHHHHHHHHHCCEEEEEC T ss_conf 99998887579-999999999888669489823 No 89 >PRK07878 molybdopterin biosynthesis-like protein MoeZ; Validated Probab=61.56 E-value=8.1 Score=17.74 Aligned_cols=44 Identities=16% Similarity=0.191 Sum_probs=22.5 Q ss_pred CCCCCHHHHHHHHHCCCCCCEEEEEHHHCCCCCCCCHHHHHHCCC Q ss_conf 775201112233201477528996202204667899989861788 Q gi|254780970|r 267 MGAAGLTCSAIEMGNQGNLGITLDLDKVPTCEEGMTAYEMMLSES 311 (737) Q Consensus 267 ~gaGGl~~a~~Ema~~~~~G~~i~l~~vp~~~~~l~~~ei~~sEs 311 (737) +|+|||.|.+.+...++|+|--.=.|.=.+...+|. ..++.+++ T Consensus 48 vG~GGLG~~~~~yLaaaGvG~i~ivD~D~v~~sNL~-RQil~~~~ 91 (392) T PRK07878 48 IGAGGLGSPTLLYLAAAGVGTIGIVEFDVVDESNLQ-RQIIHGQS 91 (392) T ss_pred ECCCHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCC-CCCCCCHH T ss_conf 878757899999999828975999878996745577-20056843 No 90 >PRK07367 consensus Probab=61.35 E-value=3.5 Score=20.20 Aligned_cols=32 Identities=19% Similarity=0.154 Sum_probs=15.8 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 66686213469999999985221028866544 Q gi|254780970|r 498 NPEKEEIMGQFVHSVKGIREACQILDFPIVSG 529 (737) Q Consensus 498 ~P~~~e~~~ql~~av~gl~d~c~~lgipivgG 529 (737) +|.+|-..---.+-++.+.+.|++.++.|++- T Consensus 169 ~P~NPTG~v~s~~~l~~l~~~a~~~~~~iIsD 200 (385) T PRK07367 169 SPSNPTGMVYTPEEIAALAEVIVEHDLYVVSD 200 (385) T ss_pred CCCCCCCCCCCHHHHHHHHHHHHHCCEEEEEC T ss_conf 99798785665788889988887558699711 No 91 >TIGR01949 AroFGH_arch predicted phospho-2-dehydro-3-deoxyheptonate aldolase; InterPro: IPR010210 This entry represents a clade of primarily archaeal sequences related to fructose-bisphosphate aldolase. The members appear to be phospho-2-dehydro-3-deoxyheptonate aldolases. This enzyme is the first step of the chorismate biosynthesis pathway. Evidence for this assignment is based on gene clustering and phylogenetic profiling. A group of species lack members of the three other types of phospho-2-dehydro-3-deoxyheptonate aldolase (represented by IPR006219 from INTERPRO, IPR002480 from INTERPRO and IPR006268 from INTERPRO), and also apparently lack the well-known forms of step 2 (3-dehydroquinate synthase), but contain all other steps of the pathway: Aquifex, Archaeoglobus, Halobacterium, Methanococcus, Methanopyrus and Methanobacterium. In Aquifex, Archaeoglobus, Halobacterium, Methanopyrus and Methanosarcina the genes are clustered with other genes from the chorismate, phenylalanine, tyrosine and tryptophan biosynthesis pathways. In addition, these genes in Archaeoglobus, Halobacterium, and Methanosarcina are adjacent to 3-dehydroquinate synthase (IPR002812 from INTERPRO) which also has the property of having members only in those species which lack steps 1 and 2. Together these two genes appear to perform the synthesis of 3-dehydroquinate. It is presumed that the substrates and the chemical transformations involved are identical, but this has not yet been proven experimentally.. Probab=58.52 E-value=4.9 Score=19.23 Aligned_cols=51 Identities=16% Similarity=0.341 Sum_probs=34.9 Q ss_pred HHHCCCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCC Q ss_conf 410278510223221024566686213469999999985221028866544320011236 Q gi|254780970|r 479 NIIATGAKPLAITDNLNFGNPEKEEIMGQFVHSVKGIREACQILDFPIVSGNVSFYNETN 538 (737) Q Consensus 479 Nl~a~Ga~Plait~~lnf~~P~~~e~~~ql~~av~gl~d~c~~lgipivgGkvSl~n~t~ 538 (737) +-+.+||.-++ ...|.|+-+++|.+ +.+--+++.|..||+|++.- ||-... T Consensus 98 ~Ai~~GADAVS--~HvNvGs~~e~d~~----~~lg~vA~~ad~~GvPlLAM---mYaRG~ 148 (259) T TIGR01949 98 DAIRLGADAVS--IHVNVGSDTEADQI----EDLGDVAEIADDWGVPLLAM---MYARGP 148 (259) T ss_pred HHHHCCCCEEE--EEEECCCCCCHHHH----HHHHHHHHHHHHCCCCHHEC---CCCCCC T ss_conf 99732898679--98864898738999----99989999886548842011---278868 No 92 >pfam07141 Phage_term_sma Putative bacteriophage terminase small subunit. This family consists of several putative Lactococcus bacteriophage terminase small subunit proteins. The exact function of this family is unknown. Probab=58.39 E-value=6.4 Score=18.43 Aligned_cols=30 Identities=37% Similarity=0.451 Sum_probs=15.2 Q ss_pred HHHHHHCCCCCHHHHHHHHHHHCCCCCCCCCHHHH Q ss_conf 99999578971889999853300110744588999 Q gi|254780970|r 23 QIIHILKRQPTLTEIGIISAMWNEHCSYKSSKKWL 57 (737) Q Consensus 23 ~i~~~l~R~Pt~~El~~~~~~wSEHC~~k~~~~~l 57 (737) .|.+.+.--.||-|+. -|=||+-|+||+|- T Consensus 23 qiidkiesgcndreiy-----tslhcsaktfrkwr 52 (174) T pfam07141 23 QIIDKIESGCNDREIY-----TSLHCSAKTFRKWR 52 (174) T ss_pred HHHHHHHCCCCHHHHH-----HHHCCCHHHHHHHH T ss_conf 9999997488514545-----54201478888763 No 93 >PRK07550 hypothetical protein; Provisional Probab=56.63 E-value=3.5 Score=20.22 Aligned_cols=37 Identities=5% Similarity=0.098 Sum_probs=20.2 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCC Q ss_conf 666862134699999999852210288665443200112 Q gi|254780970|r 498 NPEKEEIMGQFVHSVKGIREACQILDFPIVSGNVSFYNE 536 (737) Q Consensus 498 ~P~~~e~~~ql~~av~gl~d~c~~lgipivgGkvSl~n~ 536 (737) +|.+|-..---.+-++.+.+.|++.++.|++- -.|.+ T Consensus 172 ~P~NPtG~v~s~e~l~~l~~la~~~~~~iI~D--EiY~~ 208 (387) T PRK07550 172 TPNNPTGVVYPPELLHELYDLARRRGLALILD--ETYRD 208 (387) T ss_pred CCCCCCCCCCCHHHHHHHHHHCCCEEEEEEEC--CCHHH T ss_conf 99699885107798534654303315899941--75787 No 94 >PRK07568 aspartate aminotransferase; Provisional Probab=56.55 E-value=4.1 Score=19.78 Aligned_cols=94 Identities=19% Similarity=0.116 Sum_probs=39.0 Q ss_pred CCCCCCCCCEEEECCCCCCEEEEECCCCCCCHHCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHCCCCCCCCCCHHHHHH Q ss_conf 00013576314203777622797304532100001001135678977764102785102232210245666862134699 Q gi|254780970|r 430 IQLPGGDAGVIRVEGHETKALAFSSDVTPRYVKADPFEGTKQAVAECWRNIIATGAKPLAITDNLNFGNPEKEEIMGQFV 509 (737) Q Consensus 430 v~~Pg~DaaVv~~~~~~~~gia~s~g~~p~~~~~dP~~ga~~AV~Eal~Nl~a~Ga~Plait~~lnf~~P~~~e~~~ql~ 509 (737) +..||+...|..+.+......+-..|.++.....+|..+-..--.+.+.+.+.-.. -++. +.+|.+|...---. T Consensus 108 l~~pGD~Vlv~~P~y~~y~~~~~~~g~~~v~v~~~~e~g~~lp~~~~l~~~~~~~t--k~ii----~~~P~NPTG~v~s~ 181 (396) T PRK07568 108 ICDPGDEIIVPEPFYANYNGFATSAGVKIVPVTTKIEEGFHLPKKEEIEKLITDKT--KAIL----ISNPGNPTGVVYTK 181 (396) T ss_pred HCCCCCEEEEECCCCCCHHHHHHHCCCEEEECCCCHHHCCCCCCHHHHHHHCCCCC--EEEE----ECCCCCCCCCCCCH T ss_conf 27998989993897532599999849878516767211887896999997477657--4998----58992988754015 Q ss_pred HHHHHHHHHHHHCCCCCCCC Q ss_conf 99999985221028866544 Q gi|254780970|r 510 HSVKGIREACQILDFPIVSG 529 (737) Q Consensus 510 ~av~gl~d~c~~lgipivgG 529 (737) +-.+.+.+.|++.++.|+.- T Consensus 182 ~~~~~l~~la~~~~~~ii~D 201 (396) T PRK07568 182 EELRQLADIAKEHDLFLISD 201 (396) T ss_pred HHHHHHHHHHHHHCEEEECC T ss_conf 78999999998710667424 No 95 >pfam04539 Sigma70_r3 Sigma-70 region 3. Region 3 forms a discrete compact three helical domain within the sigma-factor. Region is not normally involved in the recognition of promoter DNA, but as some specific bacterial promoters containing an extended -10 promoter element, residues within region 3 play an important role. Region 3 primarily is involved in binding the core RNA polymerase in the holoenzyme. Probab=56.09 E-value=10 Score=17.07 Aligned_cols=21 Identities=29% Similarity=0.289 Sum_probs=15.5 Q ss_pred HHHHHHHCCCCCHHHHHHHHH Q ss_conf 999999578971889999853 Q gi|254780970|r 22 DQIIHILKRQPTLTEIGIISA 42 (737) Q Consensus 22 ~~i~~~l~R~Pt~~El~~~~~ 42 (737) ..+...|||+||+.|+.-.-. T Consensus 11 ~~L~q~lgR~Pt~eEiA~~l~ 31 (78) T pfam04539 11 RELEQELGREPTPEEIAEELG 31 (78) T ss_pred HHHHHHHCCCCCHHHHHHHHC T ss_conf 999999888999999999969 No 96 >PRK07212 consensus Probab=55.46 E-value=6.5 Score=18.40 Aligned_cols=92 Identities=13% Similarity=0.066 Sum_probs=41.1 Q ss_pred CCCCCCCCCEEEECCCCCCEEEEECCCCCCCHHCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHCCCCCCCCCCHHHHHH Q ss_conf 00013576314203777622797304532100001001135678977764102785102232210245666862134699 Q gi|254780970|r 430 IQLPGGDAGVIRVEGHETKALAFSSDVTPRYVKADPFEGTKQAVAECWRNIIATGAKPLAITDNLNFGNPEKEEIMGQFV 509 (737) Q Consensus 430 v~~Pg~DaaVv~~~~~~~~gia~s~g~~p~~~~~dP~~ga~~AV~Eal~Nl~a~Ga~Plait~~lnf~~P~~~e~~~ql~ 509 (737) +..||+-..|..+.+......+-..|..+..+..++..+-.- +.+...+.-. .-.+. +.+|.+|-..---. T Consensus 111 l~~pGD~Vlv~~P~Y~~y~~~~~~~g~~~v~vpl~~~~~~~~---~~l~~~i~~~--~k~i~----l~~P~NPtG~v~s~ 181 (378) T PRK07212 111 VLDDGDEVILLDPAWVSYEACIKFAGGKPVWVPLNPEDFQLP---DDLAEAITDK--TKLIV----VNSPSNPTGAVFGK 181 (378) T ss_pred HCCCCCEEEEECCCCCCHHHHHHHCCCEEEEEECCCCCCCCH---HHHHHHCCCC--CEEEE----ECCCCCCCCCCCCC T ss_conf 479989899817963425689997199699983555678528---9999856657--82999----88997998988471 Q ss_pred HHHHHHHHHHHHCCCCCCCCC Q ss_conf 999999852210288665443 Q gi|254780970|r 510 HSVKGIREACQILDFPIVSGN 530 (737) Q Consensus 510 ~av~gl~d~c~~lgipivgGk 530 (737) +-++.+.+.|++.++.|++-. T Consensus 182 ~~l~~l~~la~~~~~~ii~DE 202 (378) T PRK07212 182 ESLQGVADLAVDHDLYVLSDE 202 (378) T ss_pred HHHHHHHHHHHHHCCEECCCC T ss_conf 768899997774245013300 No 97 >TIGR02885 spore_sigF RNA polymerase sigma-F factor; InterPro: IPR014236 The bacterial core RNA polymerase complex, which consists of five subunits, is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme . RNA polymerase recruits alternative sigma factors as a means of switching on specific regulons. Most bacteria express a multiplicity of sigma factors. Two of these factors, sigma-70 (gene rpoD), generally known as the major or primary sigma factor, and sigma-54 (gene rpoN or ntrA) direct the transcription of a wide variety of genes. The other sigma factors, known as alternative sigma factors, are required for the transcription of specific subsets of genes. With regard to sequence similarity, sigma factors can be grouped into two classes, the sigma-54 and sigma-70 families. Sequence alignments of the sigma70 family members reveal four conserved regions that can be further divided into subregions eg. sub-region 2.2, which may be involved in the binding of the sigma factor to the core RNA polymerase; and sub-region 4.2, which seems to harbor a DNA-binding 'helix-turn-helix' motif involved in binding the conserved -35 region of promoters recognized by the major sigma factors , . Members of this entry represent RNA polymerase sigma factor F. Within the Firmicutes, Sigma-F is specifically, and universally, a component of the endospore formation program, and is expressed in the forespore to turn on the expression of dozens of genes. It is closely related to sigma-G, which is also expressed in the forespore.; GO: 0003677 DNA binding, 0003700 transcription factor activity, 0016987 sigma factor activity, 0006352 transcription initiation, 0006355 regulation of transcription DNA-dependent. Probab=55.13 E-value=10 Score=17.00 Aligned_cols=22 Identities=23% Similarity=0.163 Sum_probs=9.9 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHCC Q ss_conf 686213469999999985221028 Q gi|254780970|r 500 EKEEIMGQFVHSVKGIREACQILD 523 (737) Q Consensus 500 ~~~e~~~ql~~av~gl~d~c~~lg 523 (737) .++|.++|. .|-|+-.|-..|+ T Consensus 36 Ye~eDlFQI--GciGLvKAidkFD 57 (231) T TIGR02885 36 YEYEDLFQI--GCIGLVKAIDKFD 57 (231) T ss_pred CCCCHHHHH--HHHHHHHHHHHCC T ss_conf 870000222--1155666554217 No 98 >TIGR01833 HMG-CoA-S_euk hydroxymethylglutaryl-CoA synthase; InterPro: IPR010122 Synonym(s): 3-hydroxy-3-methylglutaryl-coenzyme A synthase, HMG-CoA synthase. Hydroxymethylglutaryl-CoA synthase (2.3.3.10 from EC) catalyses the condensation of acetyl-CoA with acetoacetyl-CoA to produce HMG-CoA and CoA, the second reaction in the mevalonate-dependent isoprenoid biosynthesis pathway. HMG-CoA synthase contains an important catalytic cysteine residue that acts as a nucleophile in the first step of the reaction: the acetylation of the enzyme by acetyl-CoA (its first substrate) to produce an acetyl-enzyme thioester, releasing the reduced coenzyme A. The subsequent nucleophilic attack on acetoacetyl-CoA (its second substrate) leads to the formation of HMG-CoA . HMG-CoA synthase occurs in eukaryotes, archaea and certain bacteria . In vertebrates, there are two isozymes located in different subcellular compartments: a cytosolic form that is the starting point of the mevalonate pathway (leads to cholesterol and other sterolic and isoprenoid compounds), and a mitochondrial form responsible for ketone body biosynthesis. HMG-CoA is also found in other eukaryotes such as insects, plants and fungi . In bacteria, isoprenoid precursors are generally synthesised via an alternative, non-mevalonate pathway, however a number of Gram-positive pathogens utilise a mevalonate pathway involving HMG-CoA synthase that is parallel to that found in eukaryotes , . This entry is specific for eukaryotic HMG-CoA synthase enzymes.; GO: 0004421 hydroxymethylglutaryl-CoA synthase activity, 0008299 isoprenoid biosynthetic process. Probab=54.10 E-value=11 Score=16.77 Aligned_cols=69 Identities=12% Similarity=0.248 Sum_probs=43.9 Q ss_pred CCEEEEEHHHCCCCCCCCHHH------HHHCCCCCEEEEEEECCCHH-----HHHHHHHHHCCEEECCCCEEECCCCCCC Q ss_conf 528996202204667899989------86178885089999546758-----8999888709713112212104431000 Q gi|254780970|r 285 LGITLDLDKVPTCEEGMTAYE------MMLSESQERMLMILNPEKQH-----KAQEILNKWGLHFSIIGITTNDKLFRVI 353 (737) Q Consensus 285 ~G~~i~l~~vp~~~~~l~~~e------i~~sEsQeR~~~~v~~~~~~-----~~~~i~~~~~~~~~~iG~vt~~~~~~v~ 353 (737) +|++|....--+.+..|..+. .-+.=.|.||.++-+.||+. .+.++++|+.+++..||+. . T Consensus 8 ~al~~yfP~~yV~Q~~LE~yDgVs~GKYTiGLGq~~mgFc~drEDI~Sl~LTvv~~L~er~~id~~~IGRL--------E 79 (461) T TIGR01833 8 LALEIYFPSQYVDQAELEKYDGVSAGKYTIGLGQTKMGFCTDREDINSLCLTVVSKLMERYQIDYNKIGRL--------E 79 (461) T ss_pred EEEEEECCCCCCCHHHHHHHCCCCCCCEEECCCCCCCEEECCCCHHHHHHHHHHHHHHHHHCCCCCCCCEE--------E T ss_conf 88886368875265778632688889503423566200002210156665899999998607871543110--------1 Q ss_pred CCCCCCCC Q ss_conf 25864434 Q gi|254780970|r 354 HRGEEVAN 361 (737) Q Consensus 354 ~~g~~v~d 361 (737) =+-|+|+| T Consensus 80 VGTETiiD 87 (461) T TIGR01833 80 VGTETIID 87 (461) T ss_pred CCCEEEEE T ss_conf 04253410 No 99 >cd01459 vWA_copine_like VWA Copine: Copines are phospholipid-binding proteins originally identified in paramecium. They are found in human and orthologues have been found in C. elegans and Arabidopsis Thaliana. None have been found in D. Melanogaster or S. Cereviciae. Phylogenetic distribution suggests that copines have been lost in some eukaryotes. No functional properties have been assigned to the VWA domains present in copines. The members of this subgroup contain a functional MIDAS motif based on their preferential binding to magnesium and manganese. However, the MIDAS motif is not totally conserved, in most cases the MIDAS consists of the sequence DxTxS instead of the motif DxSxS that is found in most cases. The C2 domains present in copines mediate phospholipid binding. Probab=53.97 E-value=11 Score=16.87 Aligned_cols=18 Identities=17% Similarity=0.239 Sum_probs=8.6 Q ss_pred CCHHHHHHHHHHHHHHHH Q ss_conf 100113567897776410 Q gi|254780970|r 464 DPFEGTKQAVAECWRNII 481 (737) Q Consensus 464 dP~~ga~~AV~Eal~Nl~ 481 (737) +|+.-+.-.|.+++++.+ T Consensus 109 ~p~~~G~~gvl~aY~~~l 126 (254) T cd01459 109 SPECQGFEGVLRAYREAL 126 (254) T ss_pred CCCCCCHHHHHHHHHHHC T ss_conf 996559999999999860 No 100 >PRK06852 aldolase; Validated Probab=53.88 E-value=7.2 Score=18.09 Aligned_cols=60 Identities=15% Similarity=0.155 Sum_probs=42.1 Q ss_pred CCCHHHHHHHHHHHHHHHHC--CCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 01001135678977764102--7851022322102456668621346999999998522102886654 Q gi|254780970|r 463 ADPFEGTKQAVAECWRNIIA--TGAKPLAITDNLNFGNPEKEEIMGQFVHSVKGIREACQILDFPIVS 528 (737) Q Consensus 463 ~dP~~ga~~AV~Eal~Nl~a--~Ga~Plait~~lnf~~P~~~e~~~ql~~av~gl~d~c~~lgipivg 528 (737) .||+.....+|..+++=-.+ +|+ .++....+|||+...+.+-+ +..+.+-|+++|+|+|- T Consensus 111 ~dp~s~~~~sv~~a~~~~~~~~LGa--~aVG~TIY~GSe~e~~ml~e----~~~i~~eA~~~Gl~~Vl 172 (303) T PRK06852 111 KDPFSNQWLDVDQVVEFKKNSGLNI--LGVGYTIYLGSEYEAEMLSE----AAQIIYEAHKHGLVAVL 172 (303) T ss_pred CCCCHHCCCCHHHHHHHHHHCCCCC--EEEEEEEECCCHHHHHHHHH----HHHHHHHHHHCCCEEEE T ss_conf 7840101067999986523115886--59999985797406999999----99999999976990899 No 101 >PRK09148 aminotransferase; Validated Probab=53.73 E-value=6.2 Score=18.51 Aligned_cols=19 Identities=21% Similarity=0.085 Sum_probs=9.8 Q ss_pred HHHHHHHHHHHCCCCCCCC Q ss_conf 9999985221028866544 Q gi|254780970|r 511 SVKGIREACQILDFPIVSG 529 (737) Q Consensus 511 av~gl~d~c~~lgipivgG 529 (737) -++.+.+.|++.++.|++- T Consensus 187 ~l~~l~~~a~~~~i~visD 205 (406) T PRK09148 187 FYKDLVAFAKKHDIWILSD 205 (406) T ss_pred HHHHHHHHHHCCEEEEECC T ss_conf 9999998754054899634 No 102 >PRK05957 aspartate aminotransferase; Provisional Probab=53.45 E-value=11 Score=16.70 Aligned_cols=32 Identities=9% Similarity=0.109 Sum_probs=21.5 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 66686213469999999985221028866544 Q gi|254780970|r 498 NPEKEEIMGQFVHSVKGIREACQILDFPIVSG 529 (737) Q Consensus 498 ~P~~~e~~~ql~~av~gl~d~c~~lgipivgG 529 (737) +|.+|-..---.+-++.+.+.|++.++.|++- T Consensus 168 ~P~NPTG~v~s~~~l~~l~~la~~~~~~ii~D 199 (389) T PRK05957 168 SPNNPTGVVYPEALLRAVNQICAEHGIYHISD 199 (389) T ss_pred CCCCCCCCCCCHHHHHHHHHHHCCCCCEEEEC T ss_conf 99899898876778999998760678189989 No 103 >TIGR02187 GlrX_arch Glutaredoxin-like domain protein; InterPro: IPR011903 Glutaredoxins , , , also known as thioltransferases (disulphide reductases, are small proteins of approximately one hundred amino-acid residues which utilise glutathione and NADPH as cofactors. Oxidized glutathione is regenerated by glutathione reductase. Together these components compose the glutathione system . Glutaredoxin functions as an electron carrier in the glutathione-dependent synthesis of deoxyribonucleotides by the enzyme ribonucleotide reductase. Like thioredoxin, which functions in a similar way, glutaredoxin possesses an active centre disulphide bond . It exists in either a reduced or an oxidized form where the two cysteine residues are linked in an intramolecular disulphide bond. Glutaredoxin has been sequenced in a variety of species. On the basis of extensive sequence similarity, it has been proposed that Vaccinia virus protein O2L is most probably a glutaredoxin. Finally, it must be noted that Bacteriophage T4 thioredoxin seems also to be evolutionary related. In position 5 of the pattern T4 thioredoxin has Val instead of Pro. This entry of archaeal proteins contains a C-terminal domain with homology to bacterial and eukaryotic glutaredoxins, including a CPYC motif. There is an N-terminal domain which has even more distant homology to glutaredoxins. The name "glutaredoxin" may be inappropriate in the sense of working in tandem with glutathione and glutathione reductase which may not be present in the archaea. The overall domain structure appears to be related to bacterial alkylhydroperoxide reductases, but the homology may be distant enough that the function of this family is wholly different.. Probab=53.17 E-value=4.1 Score=19.78 Aligned_cols=65 Identities=20% Similarity=0.318 Sum_probs=35.9 Q ss_pred CCHHHHHHHHHH-HCCCCCHHHHHHHHHHH------CCCCCC-CCCHHHHHHHHHCCCCEEECCCCCEEEEEE Q ss_conf 998999999999-57897188999985330------011074-458899975120697489933477368983 Q gi|254780970|r 16 LTKDECDQIIHI-LKRQPTLTEIGIISAMW------NEHCSY-KSSKKWLRTLPTTGKHVIQGPGENAGVVDI 80 (737) Q Consensus 16 L~~~e~~~i~~~-l~R~Pt~~El~~~~~~w------SEHC~~-k~~~~~l~~~~~~~~~vi~~~~~~a~vi~~ 80 (737) |++++-++|++. ++--=++|++.+|.+.- -+||.| +....+|+.+-+-.+.-.-++.=.=-|+++ T Consensus 1 ls~~~~~~lke~f~k~lknpV~~~vF~~~~~liaPGk~~C~yC~~t~qLl~e~~e~~p~~~gg~KL~L~iyd~ 73 (237) T TIGR02187 1 LSEEDREILKEEFLKELKNPVELVVFTDNDTLIAPGKERCQYCKETKQLLEELSEVSPEASGGDKLKLEIYDF 73 (237) T ss_pred CCHHHHHHHHHHHHHHCCCCEEEEEEECCCEEECCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCEEEEEEECC T ss_conf 9868999999998863589817999704751244798888655899999998875166446896225888627 No 104 >pfam04227 Indigoidine_A Indigoidine synthase A like protein. Indigoidine is a blue pigment synthesized by Erwinia chrysanthemi implicated in pathogenicity and protection from oxidative stress. IdgA is involved in indigoidine biosynthesis, but its specific function is unknown. Probab=51.89 E-value=8.7 Score=17.51 Aligned_cols=15 Identities=27% Similarity=0.273 Sum_probs=7.0 Q ss_pred CHHHHHHHHHHHCCE Q ss_conf 758899988870971 Q gi|254780970|r 323 KQHKAQEILNKWGLH 337 (737) Q Consensus 323 ~~~~~~~i~~~~~~~ 337 (737) |+.+-.+.++-.+++ T Consensus 156 Di~~TlE~LET~GV~ 170 (293) T pfam04227 156 DIPKTLEYLETQGVP 170 (293) T ss_pred CCHHHHHHHHHCCCE T ss_conf 544689999975943 No 105 >COG5616 Predicted integral membrane protein [Function unknown] Probab=51.63 E-value=14 Score=16.12 Aligned_cols=121 Identities=12% Similarity=0.199 Sum_probs=63.4 Q ss_pred CCCCCCCCCCCCCCCCCCEE-EEEEEECCCCCCCCCCHHHHHHHHHHHH-HHHHHHCCCCCCCCEEEEECCCCCCCCCCE Q ss_conf 72105581022576778743-6877504578756863888998887458-866543087211320355215676642211 Q gi|254780970|r 104 GAATGVGGILRDIFTMGARP-VAAMNSLRFGAIHHPKTKHLLSGVVAGI-AGYSNSFGVPTVGGEVEFLPCYNNNIIVNT 181 (737) Q Consensus 104 GAaTg~gG~iRDi~~~Ga~p-~a~~~~l~~g~~~~p~~~~~~~~~~~G~-~~ygN~~G~P~v~G~~~~~~~y~~~~~v~~ 181 (737) .+++++.+.........++| ||.|---|.+ .+|+..|+-+|+.+.| .+....-+..+|..++.| .|.+++.- + T Consensus 6 ~~~~~~~~~~~~~~~~p~~psiAVlPF~nls--~dp~q~~faDGlTEdiIt~Lsr~~~l~VIArnssf--t~kgka~d-v 80 (152) T COG5616 6 FVESTTESLTAPGTSIPAKPSIAVLPFVNLS--GDPEQEYFADGLTEDIITDLSRFRELFVIARNSSF--TYKGKAVD-V 80 (152) T ss_pred CCCCCHHHCCCCCCCCCCCCCEEEECCCCCC--CCHHHHHHCCCCHHHHHHHHHHCCCCEEEECCCEE--ECCCCCCC-H T ss_conf 4655502203799888998847996403578--88767565055308899876651685699756504--31688878-7 Q ss_pred EEECC-CCHHHHHHCCCCCCCCEEEEECCCCCCCCCHHHHHHHHHHCCCCC Q ss_conf 22112-113554311225689779996576564441046666777286533 Q gi|254780970|r 182 FAAGI-AKTNAIFSSKARGIGLPLVYLGAKTGRDGIGGASMASEEFGENIA 231 (737) Q Consensus 182 ~~~Gi-~~~~~~~~~~~~~~Gd~ii~~G~~tg~dGi~Ga~~sS~~~~~~~~ 231 (737) --+|- ....-+......+.|+.|.+..-- -|.-.|+..=++.++.... T Consensus 81 ~~v~~~Lgv~YvleGsvRr~g~RvRvtaqL--vda~~G~~LWAeryDr~l~ 129 (152) T COG5616 81 REVGEELGVRYVLEGSVRRAGGRVRVTAQL--VDASSGGHLWAERYDRDLD 129 (152) T ss_pred HHHHHHHCCCEEEEEEEEECCCEEEEEEEE--EECCCCCCCCHHHCCCCHH T ss_conf 898887388689960576328879999999--7346884615777084578 No 106 >TIGR02978 phageshock_pspC phage shock protein C; InterPro: IPR014320 All members of this protein are the phage shock protein PspC. The phage shock regulon is restricted to the Proteobacteria and somewhat sparsely distributed there. It is expressed, under positive control of a sigma-54-dependent transcription factor; PspF, which binds and is modulated by PspA. Stresses that induce the psp regulon include phage secretin over expression, ethanol, heat shock and protein export defects.. Probab=51.40 E-value=4.7 Score=19.34 Aligned_cols=19 Identities=47% Similarity=0.746 Sum_probs=15.2 Q ss_pred HHHHHHHHHHHHHHHCCCCCCC Q ss_conf 9988874588665430872113 Q gi|254780970|r 143 LLSGVVAGIAGYSNSFGVPTVG 164 (737) Q Consensus 143 ~~~~~~~G~~~ygN~~G~P~v~ 164 (737) -+.||.+|+|+| ||+|+.- T Consensus 12 Ki~GVCAGlA~Y---fg~ev~L 30 (128) T TIGR02978 12 KIAGVCAGLAEY---FGVEVWL 30 (128) T ss_pred EEECHHHHHHHH---HCCCHHH T ss_conf 051224577766---2786888 No 107 >PRK06056 consensus Probab=51.21 E-value=8.1 Score=17.74 Aligned_cols=32 Identities=13% Similarity=0.017 Sum_probs=16.2 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 66686213469999999985221028866544 Q gi|254780970|r 498 NPEKEEIMGQFVHSVKGIREACQILDFPIVSG 529 (737) Q Consensus 498 ~P~~~e~~~ql~~av~gl~d~c~~lgipivgG 529 (737) +|.+|-..---.+-++.+.+.|++.++.|++- T Consensus 176 ~P~NPtG~v~s~e~l~~l~~la~~~~i~ii~D 207 (402) T PRK06056 176 SPSNPTGAVYSPEQVRAIGRWAAEHGIWVISD 207 (402) T ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHCCCEECC T ss_conf 89798888742788999999999714615510 No 108 >PRK12399 tagatose 1,6-diphosphate aldolase; Reviewed Probab=50.70 E-value=16 Score=15.72 Aligned_cols=82 Identities=13% Similarity=0.135 Sum_probs=47.9 Q ss_pred CCCEEEEECCCCCCCHHCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHCCCCCCCCCCHHHH-HHHHHHHHHHHHHHCCC Q ss_conf 76227973045321000010011356789777641027851022322102456668621346-99999999852210288 Q gi|254780970|r 446 ETKALAFSSDVTPRYVKADPFEGTKQAVAECWRNIIATGAKPLAITDNLNFGNPEKEEIMGQ-FVHSVKGIREACQILDF 524 (737) Q Consensus 446 ~~~gia~s~g~~p~~~~~dP~~ga~~AV~Eal~Nl~a~Ga~Plait~~lnf~~P~~~e~~~q-l~~av~gl~d~c~~lgi 524 (737) .+.|+.++.+-. -|-..+|.+-..+-..-+++.+-.+|++-+.+.. |-+|+.++...+ -..-|+.+.+.|++.+| T Consensus 81 ~~~GLlla~E~t-gyd~~~~gRl~~~i~~wSv~rik~~GadAvK~L~---yy~pD~~~~in~~k~~~Verig~eC~~~di 156 (324) T PRK12399 81 EEAGLLLAYEKT-GYDATTTGRLPDCLDVWSAKRIKEEGADAVKFLL---YYDVDEDDEINEQKKAYIERIGSECVAEDI 156 (324) T ss_pred CCCCEEEEECCC-CCCCCCCCCCCCCCCCCCHHHHHHCCCCEEEEEE---EECCCCCHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 888638996044-5667888767443344589999970657388999---977999778889999999999999997699 Q ss_pred CCCCCCC Q ss_conf 6654432 Q gi|254780970|r 525 PIVSGNV 531 (737) Q Consensus 525 pivgGkv 531 (737) |++==-+ T Consensus 157 pf~lE~l 163 (324) T PRK12399 157 PFFLEIL 163 (324) T ss_pred CCEEEEE T ss_conf 7157651 No 109 >COG2860 Predicted membrane protein [Function unknown] Probab=50.60 E-value=3.3 Score=20.39 Aligned_cols=23 Identities=35% Similarity=0.723 Sum_probs=12.6 Q ss_pred CCCCCCC-CCCCCCCCCCCEEEEEE Q ss_conf 7210558-10225767787436877 Q gi|254780970|r 104 GAATGVG-GILRDIFTMGARPVAAM 127 (737) Q Consensus 104 GAaTg~g-G~iRDi~~~Ga~p~a~~ 127 (737) |..|.+| |.|||.+ .|..|..-+ T Consensus 39 a~vTAiGGGtiRDlL-LG~~Pv~wv 62 (209) T COG2860 39 AVVTAIGGGTIRDLL-LGHYPVFWV 62 (209) T ss_pred HHHHHHCCHHHHHHH-CCCCCCEEE T ss_conf 777872561999998-167881015 No 110 >PRK05973 replicative DNA helicase; Provisional Probab=50.36 E-value=16 Score=15.69 Aligned_cols=22 Identities=18% Similarity=0.154 Sum_probs=12.2 Q ss_pred CCCCCCEEEEECCCCCCCCCHHHHHH Q ss_conf 25689779996576564441046666 Q gi|254780970|r 197 ARGIGLPLVYLGAKTGRDGIGGASMA 222 (737) Q Consensus 197 ~~~~Gd~ii~~G~~tg~dGi~Ga~~s 222 (737) .-+|||.||+-| |=|+|=.+|+ T Consensus 60 gLqPGDLIIlAA----RPsMGKTafa 81 (237) T PRK05973 60 QLRPGDLVLLGA----RPGQGKTLLG 81 (237) T ss_pred CCCCCCEEEEEC----CCCCCHHHHH T ss_conf 899867799942----8988789999 No 111 >pfam07302 AroM AroM protein. This family consists of several bacterial and archaeal AroM proteins. In Escherichia coli the aroM gene is cotranscribed with aroL. The function of this family is unknown. Probab=50.31 E-value=11 Score=16.84 Aligned_cols=61 Identities=20% Similarity=0.369 Sum_probs=37.3 Q ss_pred CCCCCCHHCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 4532100001001135678977764102785102232210245666862134699999999852210288665443 Q gi|254780970|r 455 DVTPRYVKADPFEGTKQAVAECWRNIIATGAKPLAITDNLNFGNPEKEEIMGQFVHSVKGIREACQILDFPIVSGN 530 (737) Q Consensus 455 g~~p~~~~~dP~~ga~~AV~Eal~Nl~a~Ga~Plait~~lnf~~P~~~e~~~ql~~av~gl~d~c~~lgipivgGk 530 (737) +..+.+...+||.+..-.+.+|.+.+...|++-+ +.||+-| .+.|.+ .+.+.+|+|++-.| T Consensus 149 ~~~~~~~~asPy~~~~~~l~~Aa~~L~~~gadli-vLDCiGY-----t~~~r~---------~~~~~~g~PViLsr 209 (221) T pfam07302 149 VKEVVVVAASPYHGSEDRLEEAARELLDQGADLI-VLDCIGY-----TQAMKD---------MLQEALGKPVLLSR 209 (221) T ss_pred CCCCEEEEECCCCCCHHHHHHHHHHHHHCCCCEE-EEECCCC-----CHHHHH---------HHHHHHCCCEEEHH T ss_conf 8871799718988988999999999997598999-9988777-----499999---------99998699999508 No 112 >PRK07778 consensus Probab=48.96 E-value=6.3 Score=18.46 Aligned_cols=32 Identities=13% Similarity=0.154 Sum_probs=19.5 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 66686213469999999985221028866544 Q gi|254780970|r 498 NPEKEEIMGQFVHSVKGIREACQILDFPIVSG 529 (737) Q Consensus 498 ~P~~~e~~~ql~~av~gl~d~c~~lgipivgG 529 (737) ||.+|-..---.+-++.+.+.|++.++.|++- T Consensus 170 ~P~NPTG~v~s~~~l~~l~~la~~~~i~iisD 201 (386) T PRK07778 170 SPANPTGTVIAPEELAAIASWCEASGVRLISD 201 (386) T ss_pred CCCCCCCCCCHHHHHHHHHHHHHHCCEEEEEC T ss_conf 99898877850799999999985378799943 No 113 >COG1191 FliA DNA-directed RNA polymerase specialized sigma subunit [Transcription] Probab=48.95 E-value=17 Score=15.54 Aligned_cols=13 Identities=23% Similarity=0.345 Sum_probs=6.0 Q ss_pred HHHHHHHHHHHCC Q ss_conf 9999985221028 Q gi|254780970|r 511 SVKGIREACQILD 523 (737) Q Consensus 511 av~gl~d~c~~lg 523 (737) .+-|+-++...|+ T Consensus 58 G~iGLi~Aieryd 70 (247) T COG1191 58 GMIGLIKAIERYD 70 (247) T ss_pred HHHHHHHHHHHCC T ss_conf 8999999999749 No 114 >pfam04002 DUF2466 Protein of unknown function (DUF2466). This family was named initially with reference to the E. coli radC102 mutation which suggested that RadC was involved in repair of DNA lesions. However the relevant mutation has subsequently been shown to be in recG, where radC is in fact an allele of recG. In addition, a personal communication from Claverys, J-P, et al, indicates a total failure of all attempts to characterize a radiation-related function for RadC in Streptococcus pneumoniae, suggesting that it is not involved in repair of DNA lesions, in recombination during transformation, in gene conversion, nor in mismatch repair. Probab=48.34 E-value=6 Score=18.60 Aligned_cols=56 Identities=21% Similarity=0.287 Sum_probs=27.9 Q ss_pred CHHCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 0000100113567897776410278510223221024566686213469999999985221028866 Q gi|254780970|r 460 YVKADPFEGTKQAVAECWRNIIATGAKPLAITDNLNFGNPEKEEIMGQFVHSVKGIREACQILDFPI 526 (737) Q Consensus 460 ~~~~dP~~ga~~AV~Eal~Nl~a~Ga~Plait~~lnf~~P~~~e~~~ql~~av~gl~d~c~~lgipi 526 (737) .+..+|..--+.|+....+.++.+-=-| -|+|.....- .+.-+.+.++|+.+||++ T Consensus 48 ~~~v~pReI~~~Al~~~A~~iIl~HNHP--------SG~~~PS~~D---~~~T~~l~~a~~~l~I~l 103 (123) T pfam04002 48 SASVHPREIFKRALRLNAAAVILAHNHP--------SGDPTPSQAD---IELTKRLKEAGELLGIRL 103 (123) T ss_pred EEEECHHHHHHHHHHCCCCEEEEEECCC--------CCCCCCCHHH---HHHHHHHHHHHHHCCCEE T ss_conf 6788799999999983898699983189--------9989958879---999999999997689879 No 115 >COG1830 FbaB DhnA-type fructose-1,6-bisphosphate aldolase and related enzymes [Carbohydrate transport and metabolism] Probab=48.06 E-value=17 Score=15.45 Aligned_cols=42 Identities=24% Similarity=0.483 Sum_probs=27.5 Q ss_pred HCCCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 027851022322102456668621346999999998522102886654 Q gi|254780970|r 481 IATGAKPLAITDNLNFGNPEKEEIMGQFVHSVKGIREACQILDFPIVS 528 (737) Q Consensus 481 ~a~Ga~Plait~~lnf~~P~~~e~~~ql~~av~gl~d~c~~lgipivg 528 (737) +.+||.-++++ +|||+..+.+ ..+-+.-+.+.|.++|+|++- T Consensus 107 i~lgadAV~~~--Vy~Gse~e~~----~i~~~~~v~~~a~~~Gmp~v~ 148 (265) T COG1830 107 IRLGADAVGAT--VYVGSETERE----MIENISQVVEDAHELGMPLVA 148 (265) T ss_pred HHCCCCEEEEE--EECCCCCHHH----HHHHHHHHHHHHHHCCCCEEE T ss_conf 74787479999--9668762589----999999999999870996499 No 116 >cd01121 Sms Sms (bacterial radA) DNA repair protein. This protein is not related to archael radA any more than is to other RecA-like NTPases. Sms has a role in recombination and recombinational repair and is responsible for the stabilization or processing of branched DNA molecules. Probab=48.05 E-value=12 Score=16.68 Aligned_cols=29 Identities=14% Similarity=0.177 Sum_probs=25.8 Q ss_pred CCHHHHHHHHHHHCCEEECCCCEEECCCC Q ss_conf 67588999888709713112212104431 Q gi|254780970|r 322 EKQHKAQEILNKWGLHFSIIGITTNDKLF 350 (737) Q Consensus 322 ~~~~~~~~i~~~~~~~~~~iG~vt~~~~~ 350 (737) +-..++.+++++.+++.-.||+||+++.+ T Consensus 187 eca~~L~~~AK~~~i~~~lVGHVTK~G~i 215 (372) T cd01121 187 ECTAELMRFAKERNIPIFIVGHVTKEGSI 215 (372) T ss_pred HHHHHHHHHHHHCCCEEEEEEEECCCCCC T ss_conf 99999999998619739999876268863 No 117 >TIGR00416 sms DNA repair protein RadA; InterPro: IPR004504 RadA/Sms is a highly conserved eubacterial protein that shares sequence similarity with both RecA strand transferase and lon protease. The RadA/Sms family are probable ATP-dependent proteases involved in both DNA repair and degradation of proteins, peptides, glycopeptides. They are classified in as non-peptidase homologues and unassigned peptidases in MEROPS peptidase family S16 (lon protease family, clan SJ). RadA/Sms is involved in recombination and recombinational repair, most likely involving the stabilisation or processing of branched DNA molecules or blocked replication forks because of its genetic redundancy with RecG and RuvABC .; GO: 0003684 damaged DNA binding, 0005524 ATP binding, 0006281 DNA repair. Probab=47.85 E-value=8 Score=17.76 Aligned_cols=47 Identities=15% Similarity=0.177 Sum_probs=32.6 Q ss_pred CCHHHHHHHHHHHCCEEECCCCEEECCCCCCCCCCCCCC--CCCCCCCCCCCC Q ss_conf 675889998887097131122121044310002586443--443210146650 Q gi|254780970|r 322 EKQHKAQEILNKWGLHFSIIGITTNDKLFRVIHRGEEVA--NLPIKALSDEAP 372 (737) Q Consensus 322 ~~~~~~~~i~~~~~~~~~~iG~vt~~~~~~v~~~g~~v~--dlp~~~L~~~~P 372 (737) |-.-.+.+++|+.+++--+||+||+||-+ =|..+. =++..+++++=+ T Consensus 225 E~t~~Lmr~AKt~~iaifiVGHVTKeGsi----AGPkvLEH~vD~vLyfeGd~ 273 (481) T TIGR00416 225 ECTAELMRLAKTRGIAIFIVGHVTKEGSI----AGPKVLEHMVDTVLYFEGDR 273 (481) T ss_pred HHHHHHHHHHHHCCCCEEEEEEECCCCCC----CCCHHHHHHHHHHEECCCCC T ss_conf 99999987652168657997004356754----34046663433110115887 No 118 >TIGR01924 rsbW_low_gc anti-sigma B factor; InterPro: IPR010193 This entry describes the anti-sigma B factor also known as serine-protein kinase RsbW. Sigma B controls the general stress regulon in B subtilis and is activated by cell stresses such as stationary phase and heat shock. RsbW binds to sigma B and prevents formation of the transcription complex at the promoter . RsbV (anti-anti-sigma factor) binds to RsbW to inhibit association with sigma B, however RsbW can phosphorylate RsbV, causing disassociation of the RsbV/RsbW complex . Low ATP level or environmental stress causes the dephosphorylation of RsbV .; GO: 0004674 protein serine/threonine kinase activity, 0005524 ATP binding, 0016989 sigma factor antagonist activity, 0006468 protein amino acid phosphorylation, 0045892 negative regulation of transcription DNA-dependent. Probab=47.68 E-value=6.5 Score=18.37 Aligned_cols=38 Identities=29% Similarity=0.333 Sum_probs=25.6 Q ss_pred CCEEEECCCCCCEEEEECCCCCCCHHCCCHHHHHHHHHHHHHHHHC Q ss_conf 6314203777622797304532100001001135678977764102 Q gi|254780970|r 437 AGVIRVEGHETKALAFSSDVTPRYVKADPFEGTKQAVAECWRNIIA 482 (737) Q Consensus 437 aaVv~~~~~~~~gia~s~g~~p~~~~~dP~~ga~~AV~Eal~Nl~a 482 (737) .+++|+. -.|+|.=.|.. .|--.=.++||.||+||.+- T Consensus 20 Vs~~RLt---lSGvasR~Gyt-----Yd~IED~KIAVSEA~TN~V~ 57 (161) T TIGR01924 20 VSLIRLT---LSGVASRAGYT-----YDDIEDLKIAVSEACTNAVK 57 (161) T ss_pred EEEEEEE---EHHHHHHCCCC-----HHHHHHHHHHHHHHHCCCEE T ss_conf 7876441---02565524878-----65752067766555303311 No 119 >TIGR00853 pts-lac PTS system, lactose/cellobiose family IIB component; InterPro: IPR014350 The phosphoenolpyruvate-dependent sugar phosphotransferase system (PTS) , is a major carbohydrate transport system in bacteria. The PTS catalyses the phosphorylation of incoming sugar substrates and coupled with translocation across the cell membrane, makes the PTS a link between the uptake and metabolism of sugars. The general mechanism of the PTS is the following: a phosphoryl group from phosphoenolpyruvate (PEP) is transferred via a signal transduction pathway, to enzyme I (EI) which in turn transfers it to a phosphoryl carrier, the histidine protein (HPr). Phospho-HPr then transfers the phosphoryl group to a sugar-specific permease, a membrane-bound complex known as enzyme 2 (EII), which transports the sugar to the cell. EII consists of at least three structurally distinct domains IIA, IIB and IIC . These can either be fused together in a single polypeptide chain or exist as two or three interactive chains, formerly called enzymes II (EII) and III (EIII). The first domain (IIA or EIIA) carries the first permease-specific phosphorylation site, a histidine which is phosphorylated by phospho-HPr. The second domain (IIB or EIIB) is phosphorylated by phospho-IIA on a cysteinyl or histidyl residue, depending on the sugar transported. Finally, the phosphoryl group is transferred from the IIB domain to the sugar substrate concomitantly with the sugar uptake processed by the IIC domain. This third domain (IIC or EIIC) forms the translocation channel and the specific substrate-binding site. An additional transmembrane domain IID, homologous to IIC, can be found in some PTSs, e.g. for mannose , , , . According to structural and sequence analyses, the PTS EIIB domain (2.7.1.69 from EC) can be divided in five groups , , : The PTS EIIB type 1 domain, which is found in the Glucose class of PTS, has an average length of about 80 amino acids. It forms a split alpha/beta sandwich composed of an antiparallel sheet (beta 1 to beta 4) and three alpha helices superimposed onto one side of the sheet. The phosphorylation site (Cys) is located at the end of the first beta strand on a protrusion formed by the edge of beta 1 and the reverse turn between beta 1 and beta 2 . The PTS EIIB type 2 domain, which is found in the Mannitol class of PTS, has an average length of about 100 amino acids. It consists of a four stranded parallel beta sheet flanked by two alpha helices (alpha 1 and 3) on one face and helix alpha 2 on the opposite face, with a characteristic Rossmann fold comprising two right-handed beta-alpha-beta motifs. The phosphorylation site (Cys) is located at the N-terminus of the domain, in the first beta strand. The PTS EIIB type 3 domain, which is found in the Lactose class of PTS, has an average length of about 100 amino acids. It is composed of a central four-stranded parallel open twisted beta sheet, which is flanked by three alpha helices on the concave side and two on the convex side of the beta sheet. The phosphorylation site (Cys) is located in the C-terminal end of the first beta strand . The PTS EIIB type 4 domain, which is found in the Mannose class of PTS, has an average length of about 160 amino acids. It has a central core of seven parallel beta strands surrounded by a total of six alpha-helices. Three helices cover the front face, one the back face with the remaining two capping the central beta sheet at the top and bottom. The phosphorylation site (His) is located at the suface exposed loop between strand 1 and helix 1 . The PTS EIIB type 5 domain, which is found in the Sorbitol class of PTS, has an average length of about 190 amino acids. The phosphorylation site (Cys) is located in the N-terminus of the domain. The EIIB type 3 protein is often found downstream of EIIC type 3. This entry hits fewer proteins than IPR013012 from INTERPRO. . Probab=46.53 E-value=16 Score=15.79 Aligned_cols=73 Identities=14% Similarity=0.248 Sum_probs=42.1 Q ss_pred CCCCCHHCCCCCCCC----HHHHHHHHHCCCCCCEEEEEHHHCCCCCCCCHHHHHHCCCCCEEEEEEE--CCCHHHHHHH Q ss_conf 138301103477520----1112233201477528996202204667899989861788850899995--4675889998 Q gi|254780970|r 257 NTDAVIAIQDMGAAG----LTCSAIEMGNQGNLGITLDLDKVPTCEEGMTAYEMMLSESQERMLMILN--PEKQHKAQEI 330 (737) Q Consensus 257 ~~~~i~~~~D~gaGG----l~~a~~Ema~~~~~G~~i~l~~vp~~~~~l~~~ei~~sEsQeR~~~~v~--~~~~~~~~~i 330 (737) +.+...-| +||| |.+++.+.|.. .|+.+....++-... .|+ +-+.| +||.-| .-+.+.++++ T Consensus 46 ~~~iLl~C---aaGmstsLLv~~l~k~A~~--~~~~~~i~A~~~~~~----~e~-~~~~d--~VlLaPQ~~~~~~~lk~~ 113 (142) T TIGR00853 46 ETNILLLC---AAGMSTSLLVNKLNKAAKE--YGVPVKIAAVSYGAA----MEI-LDDAD--VVLLAPQVAYMLPDLKKE 113 (142) T ss_pred CCEEEEEE---CCCCCHHHHHHHHHHHHHH--CCCCEEEEEECCCHH----HHH-HHHHC--EEEEHHHHHHHHHHHHHH T ss_conf 21168887---6973547899999999984--599758884057634----433-53504--311203267557999985 Q ss_pred HHHHCCEEECC Q ss_conf 88709713112 Q gi|254780970|r 331 LNKWGLHFSII 341 (737) Q Consensus 331 ~~~~~~~~~~i 341 (737) -++.|+|..+| T Consensus 114 t~~~Gip~~~i 124 (142) T TIGR00853 114 TDKKGIPVEVI 124 (142) T ss_pred CCCCCCCEEEE T ss_conf 13589753663 No 120 >PRK05597 molybdopterin biosynthesis protein MoeB; Validated Probab=46.08 E-value=19 Score=15.25 Aligned_cols=21 Identities=29% Similarity=0.408 Sum_probs=12.4 Q ss_pred CCCCCHHHHHHHHHCCCCCCE Q ss_conf 775201112233201477528 Q gi|254780970|r 267 MGAAGLTCSAIEMGNQGNLGI 287 (737) Q Consensus 267 ~gaGGl~~a~~Ema~~~~~G~ 287 (737) +|+|||.|.+..-..++|+|- T Consensus 34 vG~GGLG~~~~~yLa~aGvG~ 54 (355) T PRK05597 34 IGAGGLGSPALLYLAGAGVGH 54 (355) T ss_pred ECCCHHHHHHHHHHHHCCCCE T ss_conf 877766899999999849975 No 121 >TIGR03277 methan_mark_9 putative methanogenesis marker domain 9. A gene for a protein that contains a copy of this domain, to date, is found in a completed prokaryotic genome if and only if the species is one of the archaeal methanogens. The exact function is unknown, but likely is linked to methanogenesis or a process closely connected to it. A 69-amino acid core region of this 110-amino acid domain contains eight invariant Cys residues, including two copies of a motif [WFY]CCxxKPC. These motifs could be consistent with predicted metal-binding transcription factor as was suggested for the COG4008 family. Some members of this family have an additional N-terminal domain of about 250 amino acids from the nifR3 family of predicted TIM-barrel proteins. Probab=45.91 E-value=19 Score=15.23 Aligned_cols=25 Identities=32% Similarity=0.440 Sum_probs=11.5 Q ss_pred CHHHHHHHHHCCCCHHHHHHHHHHH Q ss_conf 2147889986199989999999995 Q gi|254780970|r 4 PSPILEILTAHGLTKDECDQIIHIL 28 (737) Q Consensus 4 ~~~~~e~~~~~gL~~~e~~~i~~~l 28 (737) |-|+....+++|+|+|||-.|+..+ T Consensus 27 ~CP~~~al~~~g~speef~~~K~ef 51 (109) T TIGR03277 27 PCPIHPKLAELGLSPQEFVRIKEEF 51 (109) T ss_pred CCCCHHHHHHCCCCHHHHHHHHHHH T ss_conf 7754799999099999999999998 No 122 >COG1010 CobJ Precorrin-3B methylase [Coenzyme metabolism] Probab=45.68 E-value=19 Score=15.21 Aligned_cols=112 Identities=19% Similarity=0.306 Sum_probs=70.1 Q ss_pred CHHCCCCCCCCHHHHHHHHHCCCCCCEEEEEHHHC---------------CCC--------CCCCHHHHHH----CCCCC Q ss_conf 01103477520111223320147752899620220---------------466--------7899989861----78885 Q gi|254780970|r 261 VIAIQDMGAAGLTCSAIEMGNQGNLGITLDLDKVP---------------TCE--------EGMTAYEMML----SESQE 313 (737) Q Consensus 261 i~~~~D~gaGGl~~a~~Ema~~~~~G~~i~l~~vp---------------~~~--------~~l~~~ei~~----sEsQe 313 (737) +.+--|.|-=|++.++.|+++..+ .-++++.-|| +.- +-|.|||... .-.|. T Consensus 77 lVSsGDpgVYgMA~lv~E~~~~~~-~~~v~veVvPGvTA~~aaAa~lGAPL~hDF~~ISLSDlLtPwe~IekRl~aAA~a 155 (249) T COG1010 77 LVSSGDPGVYGMAGLVLEAAEEEG-WYDVDVEVVPGVTAALAAAARLGAPLGHDFCVISLSDLLTPWEVIEKRLRAAAEA 155 (249) T ss_pred EEECCCCCHHHHHHHHHHHHHHCC-CCCCCEEEECCHHHHHHHHHHHCCCCCCCEEEEEHHHCCCCHHHHHHHHHHHHHC T ss_conf 982799538886999999998618-7776379807757999999973897555549999676288689999999988647 Q ss_pred EEEEEE-------ECCCHHHHHHHHHHHCCEEECCCCEEECCC----CCCCC-C--C-CCCCCCCCCCCCCCCCE Q ss_conf 089999-------546758899988870971311221210443----10002-5--8-64434432101466501 Q gi|254780970|r 314 RMLMIL-------NPEKQHKAQEILNKWGLHFSIIGITTNDKL----FRVIH-R--G-EEVANLPIKALSDEAPE 373 (737) Q Consensus 314 R~~~~v-------~~~~~~~~~~i~~~~~~~~~~iG~vt~~~~----~~v~~-~--g-~~v~dlp~~~L~~~~P~ 373 (737) -+++++ .|+++.+..+|+.++.-+-+-+|-|.+-+| .++.- . . ...+|+-.-.+.++.-. T Consensus 156 dfVi~~YNP~s~~R~~~~~~a~eil~~~r~~~tpVgivrnagR~~e~v~ittL~~l~~~~~iDM~T~vIIGns~T 230 (249) T COG1010 156 DFVIALYNPISKRRPEQLGRAFEILREHRSPDTPVGIVRNAGREGEEVRITTLGDLDEAEEIDMRTIVIIGNSET 230 (249) T ss_pred CEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCEEEEECCCCCCCEEEEEEHHHHCCCCCCCCEEEEEECCCCE T ss_conf 979999777431352789999999998558998489983378888649999968816565556547999858740 No 123 >cd05565 PTS_IIB_lactose PTS_IIB_lactose: subunit IIB of enzyme II (EII) of the lactose-specific phosphoenolpyruvate:carbohydrate phosphotransferase system (PTS) found in Firmicutes as well as Actinobacteria. In this system, EII is a lactose-specific permease with two cytoplasmic domains (IIA and IIB) and a transmembrane channel IIC domain. The IIC and IIB domains are expressed as a single protein from the lac operon. The IIB domain fold includes a central four-stranded parallel open twisted beta-sheet flanked by alpha-helices on both sides. The seven major PTS systems with this IIB fold include lactose, chitobiose/lichenan, ascorbate, galactitol, mannitol, fructose, and a sensory system with similarity to the bacterial bgl system. Probab=45.39 E-value=19 Score=15.18 Aligned_cols=60 Identities=17% Similarity=0.298 Sum_probs=36.6 Q ss_pred HHHHHHHHHCCCCCCEEEEEHHHCCCCCCCCHHHHHHCCCCCEEEEEEECC---CHHHHHHHHHHHCCEEECC Q ss_conf 111223320147752899620220466789998986178885089999546---7588999888709713112 Q gi|254780970|r 272 LTCSAIEMGNQGNLGITLDLDKVPTCEEGMTAYEMMLSESQERMLMILNPE---KQHKAQEILNKWGLHFSII 341 (737) Q Consensus 272 l~~a~~Ema~~~~~G~~i~l~~vp~~~~~l~~~ei~~sEsQeR~~~~v~~~---~~~~~~~i~~~~~~~~~~i 341 (737) |++++.|-|..- |.+++.+.+...+ -.|+ +.+. -++.+.|. ..+++.+.+++.|++..+| T Consensus 16 la~~m~k~A~~~--gi~~~i~A~~~~~----~~d~-~~~y---DvilLaPQv~~~~~~lk~~ad~~Gi~v~~i 78 (99) T cd05565 16 LANALNKGAKER--GVPLEAAAGAYGS----HYDM-IPDY---DLVILAPQMASYYDELKKDTDRLGIKLVTT 78 (99) T ss_pred HHHHHHHHHHHC--CCCEEEEEEECHH----HHHH-HHCC---CEEEECHHHHHHHHHHHHHHHHCCCEEEEE T ss_conf 999999999981--9976999510344----8988-7079---999998508888999999999839918870 No 124 >PRK00024 radC DNA repair protein RadC; Reviewed Probab=45.08 E-value=6.8 Score=18.24 Aligned_cols=54 Identities=24% Similarity=0.299 Sum_probs=24.3 Q ss_pred HCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 00100113567897776410278510223221024566686213469999999985221028866 Q gi|254780970|r 462 KADPFEGTKQAVAECWRNIIATGAKPLAITDNLNFGNPEKEEIMGQFVHSVKGIREACQILDFPI 526 (737) Q Consensus 462 ~~dP~~ga~~AV~Eal~Nl~a~Ga~Plait~~lnf~~P~~~e~~~ql~~av~gl~d~c~~lgipi 526 (737) ..+|..-.+.|+....+.++.+ =|=--|+|+..+. .. ..-+.+.++|..+||++ T Consensus 151 ~v~pReI~~~Al~~~A~~iIl~--------HNHPSG~~~PS~~--Di-~~T~~l~~a~~~lgI~l 204 (224) T PRK00024 151 IVHPREIVKEALKLNAAALILA--------HNHPSGDPEPSQA--DI-LITKRLKEAGELLGIRL 204 (224) T ss_pred EECHHHHHHHHHHHCCCEEEEE--------EECCCCCCCCCHH--HH-HHHHHHHHHHHHCCCEE T ss_conf 7628999999998278879999--------5469999998988--99-99999999997689868 No 125 >pfam11212 DUF2999 Protein of unknown function (DUF2999). This family of proteins with unknown function appears to be restricted to Gammaproteobacteria. Probab=45.06 E-value=19 Score=15.15 Aligned_cols=27 Identities=26% Similarity=0.682 Sum_probs=16.4 Q ss_pred HHHHHHHHCCCCHHHHHHHHHHHCCCC Q ss_conf 478899861999899999999957897 Q gi|254780970|r 6 PILEILTAHGLTKDECDQIIHILKRQP 32 (737) Q Consensus 6 ~~~e~~~~~gL~~~e~~~i~~~l~R~P 32 (737) |+..+.|+|..|++-+..++..|-.+| T Consensus 3 PIia~LKehNvSdeqi~elFqalteNP 29 (82) T pfam11212 3 PIIAILKEHNVSDEQIKELFQALTENP 29 (82) T ss_pred HHHHHHHHCCCCHHHHHHHHHHHHHCH T ss_conf 399999982887999999999985297 No 126 >cd00757 ThiF_MoeB_HesA_family ThiF_MoeB_HesA. Family of E1-like enzymes involved in molybdopterin and thiamine biosynthesis family. The common reaction mechanism catalyzed by MoeB and ThiF, like other E1 enzymes, begins with a nucleophilic attack of the C-terminal carboxylate of MoaD and ThiS, respectively, on the alpha-phosphate of an ATP molecule bound at the active site of the activating enzymes, leading to the formation of a high-energy acyladenylate intermediate and subsequently to the formation of a thiocarboxylate at the C termini of MoaD and ThiS. MoeB, as the MPT synthase (MoaE/MoaD complex) sulfurase, is involved in the biosynthesis of the molybdenum cofactor, a derivative of the tricyclic pterin, molybdopterin (MPT). ThiF catalyzes the adenylation of ThiS, as part of the biosynthesis pathway of thiamin pyrophosphate (vitamin B1). Probab=44.02 E-value=20 Score=15.04 Aligned_cols=22 Identities=32% Similarity=0.412 Sum_probs=14.2 Q ss_pred CCCCCHHHHHHHHHCCCCCCEE Q ss_conf 7752011122332014775289 Q gi|254780970|r 267 MGAAGLTCSAIEMGNQGNLGIT 288 (737) Q Consensus 267 ~gaGGl~~a~~Ema~~~~~G~~ 288 (737) +|+|||.+.+.+...++|+|-- T Consensus 27 vG~GGlGs~~~~~La~~Gvg~i 48 (228) T cd00757 27 VGAGGLGSPAAEYLAAAGVGKL 48 (228) T ss_pred ECCCHHHHHHHHHHHHCCCCEE T ss_conf 8877889999999998399758 No 127 >PRK08443 consensus Probab=43.36 E-value=10 Score=16.99 Aligned_cols=33 Identities=12% Similarity=0.047 Sum_probs=20.4 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 566686213469999999985221028866544 Q gi|254780970|r 497 GNPEKEEIMGQFVHSVKGIREACQILDFPIVSG 529 (737) Q Consensus 497 ~~P~~~e~~~ql~~av~gl~d~c~~lgipivgG 529 (737) .+|.+|-..---.+-++.+.+.|++.++.|++- T Consensus 169 ~~P~NPTG~v~s~~~l~~l~~~a~~~~~~ii~D 201 (388) T PRK08443 169 NTPSNPTGSVYSKEELEAIAKVLKGTDIWVLSD 201 (388) T ss_pred CCCCCCCCCCCCHHHHHHHHHHHHCCCCEEECC T ss_conf 689899884764478999999862357567422 No 128 >PRK08328 hypothetical protein; Provisional Probab=43.00 E-value=14 Score=16.03 Aligned_cols=44 Identities=16% Similarity=0.216 Sum_probs=27.9 Q ss_pred CCCCCHHHHHHHHHCCCCCCEEEEEHHHCCCCCCCCHHHHHHCCC Q ss_conf 775201112233201477528996202204667899989861788 Q gi|254780970|r 267 MGAAGLTCSAIEMGNQGNLGITLDLDKVPTCEEGMTAYEMMLSES 311 (737) Q Consensus 267 ~gaGGl~~a~~Ema~~~~~G~~i~l~~vp~~~~~l~~~ei~~sEs 311 (737) +|+|||.+.+.++..++|+|--.=.|.=-+...+|. ..+++.|. T Consensus 33 vG~GGlGs~~~~~La~~GvG~i~lvD~D~ve~SNLn-RQil~~~~ 76 (230) T PRK08328 33 VGVGGLGSPVAYYLAAAGVGTVLLIDEQTPELSNLN-RQILHWEE 76 (230) T ss_pred ECCCHHHHHHHHHHHHCCCCEEEEECCCEEECCCCH-HHHHCCHH T ss_conf 887878999999999848986898748787515632-55403087 No 129 >COG4915 XpaC 5-bromo-4-chloroindolyl phosphate hydrolysis protein [General function prediction only] Probab=42.86 E-value=20 Score=15.03 Aligned_cols=14 Identities=21% Similarity=0.190 Sum_probs=4.5 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 99999999999987 Q gi|254780970|r 718 ATLQTKYESWFPEF 731 (737) Q Consensus 718 ~~l~~~w~~~l~~~ 731 (737) .+|...-++-|.++ T Consensus 170 ~elsr~i~kdl~~l 183 (204) T COG4915 170 KELSRSIEKDLHEL 183 (204) T ss_pred HHHHHHHHHHHHHH T ss_conf 99999999999999 No 130 >PRK07324 transaminase; Validated Probab=42.62 E-value=6 Score=18.64 Aligned_cols=95 Identities=15% Similarity=0.090 Sum_probs=45.1 Q ss_pred CCCCCCCCCCEEEECCCCCCEEEEECCCCCCCHHCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHCCCCCCCCCCHHHHH Q ss_conf 00001357631420377762279730453210000100113567897776410278510223221024566686213469 Q gi|254780970|r 429 SIQLPGGDAGVIRVEGHETKALAFSSDVTPRYVKADPFEGTKQAVAECWRNIIATGAKPLAITDNLNFGNPEKEEIMGQF 508 (737) Q Consensus 429 tv~~Pg~DaaVv~~~~~~~~gia~s~g~~p~~~~~dP~~ga~~AV~Eal~Nl~a~Ga~Plait~~lnf~~P~~~e~~~ql 508 (737) ++..||+-+-|..+.+.....+.-+.|..+.....++..+-..- .+.+..++.-..+ + + .+ .||.+|-..--- T Consensus 99 ~l~~pGD~Viv~~P~Y~~~~~~~~~~G~~v~~~~l~~~~~~~~D-l~~l~~~i~~~tk-l-i--il--~nP~NPTG~v~s 171 (373) T PRK07324 99 ALIEPGDHVISVYPTYQQLYDIPESLGAEVDYWKLREENGWLPD-LDELKRLVRPNTK-L-I--CI--NNANNPTGALMD 171 (373) T ss_pred HHCCCCCEEEECCCCCHHHHHHHHHCCCEEEECCCCCCCCCCCC-HHHHHHHCCCCCE-E-E--EE--CCCCCCCCCCCC T ss_conf 97599998998899857799999983987654076722598879-9999961887876-9-9--97--999798897788 Q ss_pred HHHHHHHHHHHHHCCCCCCCCC Q ss_conf 9999999852210288665443 Q gi|254780970|r 509 VHSVKGIREACQILDFPIVSGN 530 (737) Q Consensus 509 ~~av~gl~d~c~~lgipivgGk 530 (737) .+-++.+.+.|++.++.|++-. T Consensus 172 ~e~l~~l~~la~~~~i~iisDE 193 (373) T PRK07324 172 RAFLEEIVEIAKSVDAYVLSDE 193 (373) T ss_pred HHHHHHHHHHHHCCCEEEEECC T ss_conf 9999999987541785998125 No 131 >PRK11823 DNA repair protein RadA; Provisional Probab=41.37 E-value=16 Score=15.70 Aligned_cols=35 Identities=23% Similarity=0.284 Sum_probs=29.0 Q ss_pred CCCCCEEEEEEECCCHHHHHHHHHHHCCEEECCCCEEECCCC Q ss_conf 788850899995467588999888709713112212104431 Q gi|254780970|r 309 SESQERMLMILNPEKQHKAQEILNKWGLHFSIIGITTNDKLF 350 (737) Q Consensus 309 sEsQeR~~~~v~~~~~~~~~~i~~~~~~~~~~iG~vt~~~~~ 350 (737) |=+|=| +-..++.++|++.+++.-.||+||+++.+ T Consensus 188 svsQVr-------e~a~~L~~~AK~~~i~~~lVGHVTKdG~i 222 (454) T PRK11823 188 SVSQVR-------ECAAELTRLAKQSGIAVFLVGHVTKEGAI 222 (454) T ss_pred CHHHHH-------HHHHHHHHHHHHCCCCEEEEEEECCCCCC T ss_conf 789999-------99999999997449828999977267764 No 132 >TIGR02171 Fb_sc_TIGR02171 Fibrobacter succinogenes paralogous family TIGR02171. Probab=40.97 E-value=14 Score=16.03 Aligned_cols=180 Identities=23% Similarity=0.387 Sum_probs=94.3 Q ss_pred CCCCHHHHHHHH-HHHCCCC-CCCCC--HHHHHHHHHC----CCCEEECCCCCEEEE--------EECCCEEEEEEECCC Q ss_conf 897188999985-3300110-74458--8999751206----974899334773689--------836983999995178 Q gi|254780970|r 30 RQPTLTEIGIIS-AMWNEHC-SYKSS--KKWLRTLPTT----GKHVIQGPGENAGVV--------DIGGGDCVVFKMESH 93 (737) Q Consensus 30 R~Pt~~El~~~~-~~wSEHC-~~k~~--~~~l~~~~~~----~~~vi~~~~~~a~vi--------~~~~~~~~~~k~EtH 93 (737) |-||++|+-+.| |-|--|- +-.+. ...+...-+. |.+| .|-+|.+.| ++-++|+--|+=+| T Consensus 162 RLPTEAEW~~VAsq~W~~E~~sWNs~NS~~~~HeVC~~PGldGsnv--kfe~n~~~vCDmaGN~lEwVNDW~~~F~dtT- 238 (982) T TIGR02171 162 RLPTEAEWVYVASQSWDPEKYSWNSENSEYELHEVCTSPGLDGSNV--KFEDNEGNVCDMAGNVLEWVNDWYASFKDTT- 238 (982) T ss_pred CCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCE--EEECCCCCCCCHHCCHHHHHCCCCCCCCCCC- T ss_conf 1850888788875167646774156177862444315788776410--2003777400002012220056000312562- Q ss_pred CCCCCCCCCCCCCCC--------CCCCCCCC------CCCC-CEEEEEE---EE----CCCCCCCCCCHHHH-------- Q ss_conf 877765888872105--------58102257------6778-7436877---50----45787568638889-------- Q gi|254780970|r 94 NHPSYIEPYQGAATG--------VGGILRDI------FTMG-ARPVAAM---NS----LRFGAIHHPKTKHL-------- 143 (737) Q Consensus 94 NhPs~i~P~~GAaTg--------~gG~iRDi------~~~G-a~p~a~~---~~----l~~g~~~~p~~~~~-------- 143 (737) |.=|=|++-+ =||-+|.- -++| -+||+.- |. |-||.+.++ -|+ T Consensus 239 -----ltNyvGs~d~~~lg~~VVKGGsY~~~ps~~~ly~RGDVYPv~~St~~DY~GFRLv~G~IPdA--~wLd~~G~v~t 311 (982) T TIGR02171 239 -----LTNYVGSSDEGSLGERVVKGGSYRNSPSAINLYTRGDVYPVTSSTKGDYVGFRLVLGAIPDA--TWLDESGKVAT 311 (982) T ss_pred -----EEEEECCCCCCCCCCCEEECCCCCCCCCCEECCCCCCCCCEECCCCCCCCEEEEECCCCCCC--CCCCCCCCEEE T ss_conf -----01030567754436634535754567232001478863120037877620344411488875--54188896768 Q ss_pred -------H-HHHHHHHHH--HHHHCC-CCCCCCEEEEECCCCCCCCCCEEEECCCCHHH---HHHCCCCCCCCEEEEECC Q ss_conf -------9-888745886--654308-72113203552156766422112211211355---431122568977999657 Q gi|254780970|r 144 -------L-SGVVAGIAG--YSNSFG-VPTVGGEVEFLPCYNNNIIVNTFAAGIAKTNA---IFSSKARGIGLPLVYLGA 209 (737) Q Consensus 144 -------~-~~~~~G~~~--ygN~~G-~P~v~G~~~~~~~y~~~~~v~~~~~Gi~~~~~---~~~~~~~~~Gd~ii~~G~ 209 (737) . ..+-+-.+. |.-|+= +=-|.|+..|-+ |.++ +..++++++ +.+..-...|..|-.+ T Consensus 312 ~~~~~~~~~~~~r~~~gtkky~~KLaFRND~TgnL~yID-y~~~------~~~~~EI~D~~~vyHPDISPDG~~VAFC-- 382 (982) T TIGR02171 312 SKVKSLISSSEIRELTGTKKYKAKLAFRNDVTGNLAYID-YAKG------ASKVVEIEDTIKVYHPDISPDGKKVAFC-- 382 (982) T ss_pred EEEEEEEEHHHHHHHHCCEEEEEEEEEECCCCCCEEEEE-ECCC------CCEEEEEECCCCEECCCCCCCCCEEEEE-- T ss_conf 887875205888853077036777787047643326887-0588------3369983037412027407986874576-- Q ss_pred CCCCCCCHHH-HHHHHHHCCC Q ss_conf 6564441046-6667772865 Q gi|254780970|r 210 KTGRDGIGGA-SMASEEFGEN 229 (737) Q Consensus 210 ~tg~dGi~Ga-~~sS~~~~~~ 229 (737) |+--|+-|- ...=+.|++. T Consensus 383 -T~~EG~~GkS~vYVR~Ln~~ 402 (982) T TIGR02171 383 -TGIEGVDGKSEVYVRDLNAS 402 (982) T ss_pred -CCCCCCCCCCEEEEECCCCC T ss_conf -34114478743898445557 No 133 >PRK09276 aspartate aminotransferase; Provisional Probab=40.39 E-value=15 Score=15.94 Aligned_cols=32 Identities=6% Similarity=0.015 Sum_probs=19.5 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 66686213469999999985221028866544 Q gi|254780970|r 498 NPEKEEIMGQFVHSVKGIREACQILDFPIVSG 529 (737) Q Consensus 498 ~P~~~e~~~ql~~av~gl~d~c~~lgipivgG 529 (737) +|.+|-..---.+-++.+.+.|++.++.|++- T Consensus 174 ~P~NPTG~v~s~~~l~~l~~la~~~~i~ii~D 205 (385) T PRK09276 174 YPNNPTGAVADLEFFEKVVDFAKKYDIIVCHD 205 (385) T ss_pred CCCCCCCCHHHHHHHHHHHHHHCCCCCEEEEH T ss_conf 99898551111889999987632557569754 No 134 >PRK10481 hypothetical protein; Provisional Probab=39.95 E-value=20 Score=15.04 Aligned_cols=59 Identities=24% Similarity=0.459 Sum_probs=33.1 Q ss_pred CCCCHHCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 32100001001135678977764102785102232210245666862134699999999852210288665443 Q gi|254780970|r 457 TPRYVKADPFEGTKQAVAECWRNIIATGAKPLAITDNLNFGNPEKEEIMGQFVHSVKGIREACQILDFPIVSGN 530 (737) Q Consensus 457 ~p~~~~~dP~~ga~~AV~Eal~Nl~a~Ga~Plait~~lnf~~P~~~e~~~ql~~av~gl~d~c~~lgipivgGk 530 (737) .|.+...+||.|..-.+.+|.+.+..-|++-+ +.||.-| .+.|.+ .+.+.+|+|++-.| T Consensus 154 ~~~~~~asPY~g~~~~l~~Aa~~L~~~gadli-vLDCiGY-----t~~~r~---------~~~~~~g~PViLsr 212 (224) T PRK10481 154 PPVVSLASPYHGSEQKLIDAGKELLAQGADLI-VLDCLGY-----HQRHRD---------LLQKALDVPVLLSN 212 (224) T ss_pred CEEEEEECCCCCCHHHHHHHHHHHHHCCCCEE-EEECCCC-----CHHHHH---------HHHHHHCCCEEEEH T ss_conf 61699837988988999999999986599999-9987777-----599999---------99998599999518 No 135 >PRK08363 alanine aminotransferase; Validated Probab=39.88 E-value=12 Score=16.62 Aligned_cols=32 Identities=22% Similarity=0.238 Sum_probs=20.8 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 66686213469999999985221028866544 Q gi|254780970|r 498 NPEKEEIMGQFVHSVKGIREACQILDFPIVSG 529 (737) Q Consensus 498 ~P~~~e~~~ql~~av~gl~d~c~~lgipivgG 529 (737) ||.+|-..---.+-++.+.+.|++.++.+++- T Consensus 174 nP~NPTG~v~s~~~l~~i~~la~~~~~~vIsD 205 (398) T PRK08363 174 NPNNPTGALYDKKTLKEILDIAGEHDLPVISD 205 (398) T ss_pred CCCCCCCCCCCHHHHHHHHHHHHHCCCEEECH T ss_conf 99998888654889999999987559647613 No 136 >COG2003 RadC DNA repair proteins [DNA replication, recombination, and repair] Probab=39.76 E-value=10 Score=16.98 Aligned_cols=53 Identities=21% Similarity=0.311 Sum_probs=25.8 Q ss_pred CCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 0100113567897776410278510223221024566686213469999999985221028866 Q gi|254780970|r 463 ADPFEGTKQAVAECWRNIIATGAKPLAITDNLNFGNPEKEEIMGQFVHSVKGIREACQILDFPI 526 (737) Q Consensus 463 ~dP~~ga~~AV~Eal~Nl~a~Ga~Plait~~lnf~~P~~~e~~~ql~~av~gl~d~c~~lgipi 526 (737) ..|....+.|+-.+..-++.+---|- |+|+.... + ...-+-+.++|.-+||.+ T Consensus 152 V~PREI~k~Al~~nAaavIlaHNHPS--------Gd~~PS~a--D-~~iT~rl~~a~~ll~I~v 204 (224) T COG2003 152 VHPREIFKEALKYNAAAVILAHNHPS--------GDPTPSRA--D-ILITERLKEAGKLLGIRL 204 (224) T ss_pred ECHHHHHHHHHHHCCHHHHEECCCCC--------CCCCCCHH--H-HHHHHHHHHHHHHCCCEE T ss_conf 33799999999822135530015899--------99985778--8-999999999987429331 No 137 >PRK13805 bifunctional acetaldehyde-CoA/alcohol dehydrogenase; Provisional Probab=39.17 E-value=24 Score=14.54 Aligned_cols=20 Identities=20% Similarity=0.210 Sum_probs=12.7 Q ss_pred EEEEEEECCCHHHHHHHHHH Q ss_conf 08999954675889998887 Q gi|254780970|r 314 RMLMILNPEKQHKAQEILNK 333 (737) Q Consensus 314 R~~~~v~~~~~~~~~~i~~~ 333 (737) ..+-....++.++-.++|++ T Consensus 349 PVLa~y~~~d~~~A~~~~~~ 368 (862) T PRK13805 349 PVLAMYKAKSFEDAVEKAEK 368 (862) T ss_pred CEEEEEEECCHHHHHHHHHH T ss_conf 00599895899999999999 No 138 >cd01492 Aos1_SUMO Ubiquitin activating enzyme (E1) subunit Aos1. Aos1 is part of the heterodimeric activating enzyme (E1), specific for the SUMO family of ubiquitin-like proteins (Ubls). E1 enzymes are part of a conjugation cascade to attach Ub or Ubls, covalently to substrate proteins consisting of activating (E1), conjugating (E2), and/or ligating (E3) enzymes. E1 activates ubiquitin by C-terminal adenylation, and subsequently forms a highly reactive thioester bond between its catalytic cysteine and Ubls C-terminus. The E1 also associates with E2 and promotes ubiquitin transfer to the E2's catalytic cysteine. Post-translational modification by SUMO family of ubiquitin-like proteins (Ublps) is involved in cell division, nuclear transport, the stress response and signal transduction. Aos1 contains part of the adenylation domain. Probab=38.70 E-value=23 Score=14.64 Aligned_cols=44 Identities=18% Similarity=0.244 Sum_probs=25.4 Q ss_pred CCCCCHHHHHHHHHCCCCCCEEEEEHHHCCCCCCCCHHHHHHCCC Q ss_conf 775201112233201477528996202204667899989861788 Q gi|254780970|r 267 MGAAGLTCSAIEMGNQGNLGITLDLDKVPTCEEGMTAYEMMLSES 311 (737) Q Consensus 267 ~gaGGl~~a~~Ema~~~~~G~~i~l~~vp~~~~~l~~~ei~~sEs 311 (737) +|+|||.+.+.+....+|+|--.=.|.=.+...+|. ..++++|+ T Consensus 27 vG~GGLG~~v~~~La~aGvg~i~ivD~d~v~~snL~-RQ~l~~~~ 70 (197) T cd01492 27 IGLKGLGAEIAKNLVLSGIGSLTILDDRTVTEEDLG-AQFLIPAE 70 (197) T ss_pred ECCCHHHHHHHHHHHHHCCCEEEEEECCCCCHHHCC-CCEECCCC T ss_conf 887889999999999837986999989918777739-78603233 No 139 >cd02430 PTH2 Peptidyl-tRNA hydrolase, type 2 (PTH2). Peptidyl-tRNA hydrolase (PTH) activity releases tRNA from the premature translation termination product peptidyl-tRNA, therefore allowing the tRNA and peptide to be reused in protein synthesis. PTH2 is present in archaea and eukaryotes. Probab=38.67 E-value=24 Score=14.49 Aligned_cols=38 Identities=21% Similarity=0.304 Sum_probs=24.0 Q ss_pred HHHHHCCCCCEEEEEEEC-CCHHHHHHHHHHHCCEEECC Q ss_conf 898617888508999954-67588999888709713112 Q gi|254780970|r 304 YEMMLSESQERMLMILNP-EKQHKAQEILNKWGLHFSII 341 (737) Q Consensus 304 ~ei~~sEsQeR~~~~v~~-~~~~~~~~i~~~~~~~~~~i 341 (737) ++-|..+.|-+.++-++. +.+.++.+.+++.++++.+| T Consensus 41 ~~~W~~~G~~KIvlkv~~e~~l~~l~~~a~~~~l~~~~V 79 (115) T cd02430 41 LRAWEREGQKKIVLKVNSEEELLELKKKAKSLGLPTSLI 79 (115) T ss_pred HHHHHHCCCCEEEEECCCHHHHHHHHHHHHHCCCCEEEE T ss_conf 999996799569997799999999999999879978999 No 140 >PRK04250 dihydroorotase; Provisional Probab=38.51 E-value=24 Score=14.48 Aligned_cols=13 Identities=23% Similarity=0.422 Sum_probs=6.3 Q ss_pred EEEEECCCEEEEE Q ss_conf 6898369839999 Q gi|254780970|r 76 GVVDIGGGDCVVF 88 (737) Q Consensus 76 ~vi~~~~~~~~~~ 88 (737) +-|-|.|+.-..+ T Consensus 19 adI~I~dGkI~~I 31 (408) T PRK04250 19 GSIGIEDGKISKI 31 (408) T ss_pred EEEEEECCEEEEE T ss_conf 6699989999893 No 141 >PTZ00079 NADP-specific glutamate dehydrogenase; Provisional Probab=38.36 E-value=24 Score=14.46 Aligned_cols=83 Identities=25% Similarity=0.388 Sum_probs=54.7 Q ss_pred HHCCCCHHHHHHHHH-HHCCCCCHHHHH--------HHHHHHCCCCCCCCCHHHHHHHHHCCCCEEEC--C-CCCEEEEE Q ss_conf 861999899999999-957897188999--------98533001107445889997512069748993--3-47736898 Q gi|254780970|r 12 TAHGLTKDECDQIIH-ILKRQPTLTEIG--------IISAMWNEHCSYKSSKKWLRTLPTTGKHVIQG--P-GENAGVVD 79 (737) Q Consensus 12 ~~~gL~~~e~~~i~~-~l~R~Pt~~El~--------~~~~~wSEHC~~k~~~~~l~~~~~~~~~vi~~--~-~~~a~vi~ 79 (737) ...|++.++++.+.+ ...|+|...|.- .+.-..-.|-.| ...|..+ .+++++|+- | -+|.|-++ T Consensus 20 ~~~~~~~~~~~~~~~~~~~r~p~e~ef~qav~ev~~sl~p~~~~~p~y---~~ilerl-~ePeR~i~frVpw~~D~G~v~ 95 (469) T PTZ00079 20 NYEGLPDQEMNELYERVKQRDGNQTEFLQAFEEVLYSLKPLFMREPKY---LPIIERL-SEPERVIQFRVPWMDDAGRQR 95 (469) T ss_pred CCCCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCHHH---HHHHHHH-CCCCEEEEEEEEEEECCCCEE T ss_conf 645650888999999998408997799999999999889999739489---9999986-488559999988997799688 Q ss_pred ECCCEEEEEEECCCCCCCCCCCCCCC Q ss_conf 36983999995178877765888872 Q gi|254780970|r 80 IGGGDCVVFKMESHNHPSYIEPYQGA 105 (737) Q Consensus 80 ~~~~~~~~~k~EtHNhPs~i~P~~GA 105 (737) +..+|-+.+ | |++-||-|- T Consensus 96 v~~GyRVQ~-----n--~alGPyKGG 114 (469) T PTZ00079 96 KNRGFRVQY-----N--SALGPYKGG 114 (469) T ss_pred EEEEEEEEE-----C--CCCCCCCCC T ss_conf 862457775-----3--877888787 No 142 >PRK09517 multifunctional thiamine-phosphate pyrophosphorylase/synthase/phosphomethylpyrimidine kinase; Provisional Probab=38.19 E-value=24 Score=14.44 Aligned_cols=38 Identities=3% Similarity=0.037 Sum_probs=27.5 Q ss_pred EEEEEEECCCHH-HHHHHHHHHCCEEECCCCEEECCCCC Q ss_conf 089999546758-89998887097131122121044310 Q gi|254780970|r 314 RMLMILNPEKQH-KAQEILNKWGLHFSIIGITTNDKLFR 351 (737) Q Consensus 314 R~~~~v~~~~~~-~~~~i~~~~~~~~~~iG~vt~~~~~~ 351 (737) |-+..++++-+. ++..+++-.++.+..||=..+..... T Consensus 278 ~~i~~~p~~~v~~Ql~av~~Di~vdavK~GML~~~~~i~ 316 (738) T PRK09517 278 NTIHTPPVDFLRAQLDAVFSDVTVDAVKLGMLGSADTVD 316 (738) T ss_pred CEEECCCHHHHHHHHHHHHCCCCCCEEEECCCCCHHHHH T ss_conf 146309999999999998706887858775758899999 No 143 >pfam10112 Halogen_Hydrol 5-bromo-4-chloroindolyl phosphate hydrolysis protein. Members of this family of prokaryotic proteins mediate the hydrolysis of 5-bromo-4-chloroindolyl phosphate bonds. Probab=38.15 E-value=23 Score=14.58 Aligned_cols=20 Identities=25% Similarity=0.398 Sum_probs=9.2 Q ss_pred CCHHHHHHHHHHHHHHHHCC Q ss_conf 14999999999999987245 Q gi|254780970|r 715 IPVATLQTKYESWFPEFISE 734 (737) Q Consensus 715 ~~i~~l~~~w~~~l~~~~~~ 734 (737) .-+++|....++-+.+++++ T Consensus 167 ~~ld~L~~~~~~dl~~l~~~ 186 (199) T pfam10112 167 ITIDELSRTIEKDLQELLSD 186 (199) T ss_pred HHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999999996 No 144 >PRK07777 aminotransferase; Validated Probab=37.30 E-value=9.8 Score=17.17 Aligned_cols=47 Identities=15% Similarity=0.236 Sum_probs=28.2 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 6588887210558102257677874368775045787568638889988874588 Q gi|254780970|r 98 YIEPYQGAATGVGGILRDIFTMGARPVAAMNSLRFGAIHHPKTKHLLSGVVAGIA 152 (737) Q Consensus 98 ~i~P~~GAaTg~gG~iRDi~~~Ga~p~a~~~~l~~g~~~~p~~~~~~~~~~~G~~ 152 (737) -+.||+..+ +|+.+.+=+++- .=+|..|+|+.|-+..+.+.+.+-+. T Consensus 5 rl~~~~~~~------~~~~~~~a~~~~--~I~l~~G~Pd~~~p~~i~~a~~~a~~ 51 (386) T PRK07777 5 RLRPFGTTI------FAEMSALAVRTG--AVNLGQGFPDEDGPPEMLEAAQEAIA 51 (386) T ss_pred CCCCCCCCH------HHHHHHHHCCCC--CEEECCCCCCCCCCHHHHHHHHHHHH T ss_conf 138998749------999999857999--88949978998568999999999986 No 145 >TIGR00313 cobQ cobyric acid synthase CobQ; InterPro: IPR004459 Cobyric acid synthase (CobQ) catalyses amidations at positions B, D, E, and G on adenosylcobyrinic A,C-diamide in the biosynthesis of cobalamin. NH(2) groups are provided by glutamine, and one molecule of ATP is hydrogenolysed for each amidation .; GO: 0003824 catalytic activity, 0009236 cobalamin biosynthetic process. Probab=37.20 E-value=25 Score=14.34 Aligned_cols=113 Identities=24% Similarity=0.333 Sum_probs=57.3 Q ss_pred EEEECCCCHHH--HHHCCCCCCCCE---EEEECCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 12211211355--431122568977---9996576564441046666777286533344433267989998889998740 Q gi|254780970|r 181 TFAAGIAKTNA--IFSSKARGIGLP---LVYLGAKTGRDGIGGASMASEEFGENIAKKRPTVQVGDPFTGKCLLEACLEL 255 (737) Q Consensus 181 ~~~~Gi~~~~~--~~~~~~~~~Gd~---ii~~G~~tg~dGi~Ga~~sS~~~~~~~~~~~~~VQ~gdp~~ek~~~~~~~~~ 255 (737) +.|.|+.|..+ ++..||+ ||- ||+.|-+.|. +-+.++ ....+ | +..|.+-|++..| T Consensus 59 A~AAGieP~~~mNPvLLKPk--gdf~SQviv~G~a~g~-------~~~~~Y---~~~~~------d-~~~~~i~eSle~L 119 (502) T TIGR00313 59 ALAAGIEPSVHMNPVLLKPK--GDFTSQVIVHGRAVGD-------MNAQEY---YKNKV------D-FLLKAIKESLEIL 119 (502) T ss_pred HHHCCCCCCCCCCCCCCCCC--CCCEEEEEEEEECCCC-------CCHHHH---HHHHH------H-HHHHHHHHHHHHH T ss_conf 98638788623487101678--8846678984112466-------761567---77999------9-9999999999875 Q ss_pred CCC-CCCHHCCCCCCCCHHHHHHHHHCCCCCCEEEEEHHHCCCCCCCCHHHHHHCCCCCEEEEEEECCCHHHHHHHHHHH Q ss_conf 113-8301103477520111223320147752899620220466789998986178885089999546758899988870 Q gi|254780970|r 256 MNT-DAVIAIQDMGAAGLTCSAIEMGNQGNLGITLDLDKVPTCEEGMTAYEMMLSESQERMLMILNPEKQHKAQEILNKW 334 (737) Q Consensus 256 ~~~-~~i~~~~D~gaGGl~~a~~Ema~~~~~G~~i~l~~vp~~~~~l~~~ei~~sEsQeR~~~~v~~~~~~~~~~i~~~~ 334 (737) -+. .+|. | =|||+ -|||+| ++..|.=++|+- =.|-..+++. |. .|= T Consensus 120 ~~~Yd~vv-~--EGAGS--------------~AEINL-----~~rDLaN~~iA~-~~~A~~iLvA---DI-------DRG 166 (502) T TIGR00313 120 AEEYDLVV-I--EGAGS--------------PAEINL-----KERDLANMRIAE-LADADVILVA---DI-------DRG 166 (502) T ss_pred HCCCCEEE-E--ECCCC--------------CCCCCC-----CCCCCHHHHHHH-HCCCCEEEEE---EC-------CCC T ss_conf 20288899-8--26887--------------100053-----315722478986-4397679997---50-------777 Q ss_pred CCEEECCCCEE Q ss_conf 97131122121 Q gi|254780970|r 335 GLHFSIIGITT 345 (737) Q Consensus 335 ~~~~~~iG~vt 345 (737) ||=|++.|+.. T Consensus 167 GVFAsi~GTl~ 177 (502) T TIGR00313 167 GVFASIYGTLK 177 (502) T ss_pred CCHHHHHHHHH T ss_conf 43243374666 No 146 >PRK07683 aminotransferase A; Validated Probab=36.44 E-value=12 Score=16.46 Aligned_cols=35 Identities=11% Similarity=0.055 Sum_probs=22.5 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 24566686213469999999985221028866544 Q gi|254780970|r 495 NFGNPEKEEIMGQFVHSVKGIREACQILDFPIVSG 529 (737) Q Consensus 495 nf~~P~~~e~~~ql~~av~gl~d~c~~lgipivgG 529 (737) .+.||.+|-..---.+-++.+.+.|++.++.|++- T Consensus 166 il~~P~NPtG~v~s~e~l~~i~~la~~~~i~ii~D 200 (387) T PRK07683 166 VLPYPSNPTGVTLSKEELKDIANVLKDKNIFVLSD 200 (387) T ss_pred EECCCCCCCCCCCCHHHHHHHHHHHHCCCCEEECC T ss_conf 64799598978878999999999761369447546 No 147 >PRK05690 molybdopterin biosynthesis protein MoeB; Provisional Probab=36.38 E-value=19 Score=15.23 Aligned_cols=60 Identities=20% Similarity=0.356 Sum_probs=33.7 Q ss_pred CCCCCHHHHHHHHHCCCCCCEEE--EEHHHCCCCCCCCHHHHHHCCCCCEEEEEEECCCHHHHHHHHHHHC Q ss_conf 77520111223320147752899--6202204667899989861788850899995467588999888709 Q gi|254780970|r 267 MGAAGLTCSAIEMGNQGNLGITL--DLDKVPTCEEGMTAYEMMLSESQERMLMILNPEKQHKAQEILNKWG 335 (737) Q Consensus 267 ~gaGGl~~a~~Ema~~~~~G~~i--~l~~vp~~~~~l~~~ei~~sEsQeR~~~~v~~~~~~~~~~i~~~~~ 335 (737) +|+|||.+.+.+....+|+|--. |-|.|- ..+|. ..++++|+.- -..+.+...+..++-+ T Consensus 38 vG~GGlG~~~~~~La~aGvg~i~lvD~D~ve--~sNLn-RQ~l~~~~di------G~~Kv~~a~~~l~~in 99 (245) T PRK05690 38 VGLGGLGCAAAQYLAAAGVGTLTLVDFDTVS--LSNLQ-RQVLHDDATI------GQPKVESAKAALARIN 99 (245) T ss_pred ECCCHHHHHHHHHHHHCCCCEEEEEECCCCC--CCCHH-HHHCCCHHHC------CCCHHHHHHHHHHHHC T ss_conf 8777778999999998599659999688678--88678-8864598778------9887999999999758 No 148 >TIGR02719 repress_PhaQ poly-beta-hydroxybutyrate-responsive repressor; InterPro: IPR014091 Members of this family are transcriptional regulatory proteins found in the vicinity of poly-beta-hydroxybutyrate (PHB) operons in several species of Bacillus. This protein appears to have repressor activity modulated by PHB itself. This protein belongs to the larger PadR family.. Probab=36.09 E-value=19 Score=15.21 Aligned_cols=31 Identities=16% Similarity=0.045 Sum_probs=15.5 Q ss_pred HHHHHHHHHHHCCCEEEEEECCCCHHHHHHH Q ss_conf 9999999999959806994648657999999 Q gi|254780970|r 609 NHGFFVLSMINAKKITACHDISTGGLIITLA 639 (737) Q Consensus 609 ~~~~~v~~li~~g~I~S~HDiSdGGL~~aL~ 639 (737) ++++++++|=++.+|.|--|-|+||=|--|. T Consensus 59 NVYRTLR~LEK~NLI~S~WDTS~~GPA~RIY 89 (138) T TIGR02719 59 NVYRTLRKLEKDNLISSTWDTSDEGPAKRIY 89 (138) T ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCCCCCHH T ss_conf 3778988876651355666468878630000 No 149 >KOG1327 consensus Probab=36.08 E-value=16 Score=15.71 Aligned_cols=44 Identities=27% Similarity=0.483 Sum_probs=27.2 Q ss_pred HHCCCHHHHHHHHHHHHHHHHC--------CCCC--H-HHHHHHCCCC-CCCCCCH Q ss_conf 0001001135678977764102--------7851--0-2232210245-6668621 Q gi|254780970|r 461 VKADPFEGTKQAVAECWRNIIA--------TGAK--P-LAITDNLNFG-NPEKEEI 504 (737) Q Consensus 461 ~~~dP~~ga~~AV~Eal~Nl~a--------~Ga~--P-lait~~lnf~-~P~~~e~ 504 (737) ...|||..|..+|++.+.+--. =||+ | -.+++++++. +|++++. T Consensus 313 ~~~N~Y~~Ai~~vG~~lq~ydsdk~fpa~GFGakip~~~~vs~~f~ln~~~~~~~c 368 (529) T KOG1327 313 HQPNPYEQAIRSVGETLQDYDSDKLFPAFGFGAKIPPDGQVSHEFVLNFNPEDPEC 368 (529) T ss_pred CCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCC T ss_conf 89988999999971341055877764234546657998652420220378899854 No 150 >cd01122 GP4d_helicase GP4d_helicase is a homohexameric 5'-3' helicases. Helicases couple NTP hydrolysis to the unwinding of nucleic acid duplexes into their component strands. Probab=35.91 E-value=26 Score=14.20 Aligned_cols=25 Identities=8% Similarity=-0.074 Sum_probs=18.3 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 3469999999985221028866544 Q gi|254780970|r 505 MGQFVHSVKGIREACQILDFPIVSG 529 (737) Q Consensus 505 ~~ql~~av~gl~d~c~~lgipivgG 529 (737) ..++.+.++.++.+|+++++||+.- T Consensus 163 ~~~i~~i~~~Lk~lAke~~v~Vi~l 187 (271) T cd01122 163 RKALDEIMTKLRGFATEHGIHITLV 187 (271) T ss_pred HHHHHHHHHHHHHHHHHHCCCEEEE T ss_conf 9999999999999999979977999 No 151 >TIGR02326 transamin_PhnW 2-aminoethylphosphonate--pyruvate transaminase; InterPro: IPR012703 Phosphonates are a class of organophosphorus compounds, characterised by a stable C-P bond, which are found in a variety of biologically produced molecules including antiobiotics, lipids, proteins and polysaccharides . The functions of these molecules include phosphorus storage, cell communication, host recognition and chemical warfare. 2-Aminoethylphosphonate (AEP), the most common naturally occurring phosphonate, is an important precursor used in the biosynthesis of phosphonolipids, phosphonoproteins, and phosphonoglycans. This entry represents 2-aminoethylphosphonate-pyruvate transaminase (AEPT)(2.6.1.37 from EC) which catalyses the interconversion of AEP and phosphonoacetaldehyde (P-Ald), coupled with the interconversion of pryuvate and L-alanine. In some bacterial species this is the first step in an AEP degradation pathway which allows them to utilise this compound as a source of carbon, nitrogen and phosphorus. Phosphonoacetaldeyhde hydrolase, often encoded by an adjacent gene, then converts P-Ald to acetaldehyde and phosphate.Species with this pathway generally have an identified phosphonate ABC transporter but do not also have the multisubunit C-P lyase complex as found in Escherichia coli. The crystal structure of AEPT from Salmonella typhimurium has been studied to 2.2 A resolution . The protein is a homodimer where each subunit is composed of two domains, large and small. The large domain forms an alpha-beta complex characteristic of the aspartate transaminase family, while the small domain is comosed of highly twisted beta strands. The active site contains a pyridoxal 5'-phosphate cofactor and is found at the domain interface, being composed of residues from both subunits.. Probab=35.34 E-value=27 Score=14.15 Aligned_cols=111 Identities=17% Similarity=0.193 Sum_probs=72.6 Q ss_pred HHHHHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHCCCCCCEEEEECHHHHH Q ss_conf 89999999999999995980699464865799999999845897089980566689878887298832189999688099 Q gi|254780970|r 604 CHLEKNHGFFVLSMINAKKITACHDISTGGLIITLAEMTISSAKGMDIILPIEIEKDPKPFLFGEDQGRYVVCISPENQD 683 (737) Q Consensus 604 ~~~~k~~~~~v~~li~~g~I~S~HDiSdGGL~~aL~EMa~as~~G~~I~L~~~~~~d~~~~LFsEs~Gr~ii~V~~~~~~ 683 (737) ......++.++.++-++|-|.|=|-.-.--=-.=+.+| ..+|++.-|+.....--+....+.+..+| +-. T Consensus 246 THvV~Af~qAL~el~~EGGv~aR~~RY~~nq~~L~~~m---~~LGF~~ll~~~~~sPIItsF~~P~~~~y-------~F~ 315 (366) T TIGR02326 246 THVVHAFAQALLELEKEGGVEARYQRYQSNQKLLVAGM---RALGFEPLLDDELQSPIITSFYSPEDPDY-------RFA 315 (366) T ss_pred HHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHH---HHCCCCCCCCHHHCCCEEEEEECCCCCCC-------HHH T ss_conf 68999999999999845897457899999999999999---86155114754345886898507788971-------389 Q ss_pred HHHHHHHHCCCCEEEEEEECC-CEEEEECCEECCHHHHHHHHH Q ss_conf 899999857984999899879-868980741214999999999 Q gi|254780970|r 684 LVMSEANNKNIPLRYLGKVSG-STLSIHNILNIPVATLQTKYE 725 (737) Q Consensus 684 ~~~~~~~~~gi~~~~IG~Vt~-~~l~i~~~~~~~i~~l~~~w~ 725 (737) .|-+.+++.|.-.+. |+|+. +.|.|-++-+++-.|++.+-. T Consensus 316 ~fY~~lk~~GFviYP-GKvs~~~~FRiG~IG~V~~~di~~l~~ 357 (366) T TIGR02326 316 DFYQRLKEQGFVIYP-GKVSQVDTFRIGNIGDVDAADIERLVT 357 (366) T ss_pred HHHHHHHHCCCEECC-CCCCCCCCEECCCCCCCCHHHHHHHHH T ss_conf 999999767817627-610456621116767638899999999 No 152 >COG4721 ABC-type cobalt transport system, predicted permease component [Inorganic ion transport and metabolism] Probab=35.00 E-value=16 Score=15.72 Aligned_cols=57 Identities=32% Similarity=0.515 Sum_probs=31.4 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCHHHHHHHHHHHHH--HHHHHCCCCCC Q ss_conf 887776588887210558102257677874368775045787568638889988874588--66543087211 Q gi|254780970|r 93 HNHPSYIEPYQGAATGVGGILRDIFTMGARPVAAMNSLRFGAIHHPKTKHLLSGVVAGIA--GYSNSFGVPTV 163 (737) Q Consensus 93 HNhPs~i~P~~GAaTg~gG~iRDi~~~Ga~p~a~~~~l~~g~~~~p~~~~~~~~~~~G~~--~ygN~~G~P~v 163 (737) -=||-+.+||-+- ++--+-+|+| |++.+- . .+|. .-++.+++++.. -++|+||+|++ T Consensus 37 ~lhp~ale~~~~~------i~~GiW~maa-vi~~l~---I---pkpG-aAl~~Ev~Aa~ve~ll~sqfgi~ti 95 (192) T COG4721 37 LLHPLALEPFANE------ILFGIWFMAA-VIAALF---I---PKPG-AALIGEVLAALVEVLLGSQFGIGTI 95 (192) T ss_pred HCCHHHCCCCCCC------HHHHHHHHHH-HHEEEE---E---CCCC-HHHHHHHHHHHHHHHHCCCCCCHHH T ss_conf 4162421755430------3787999987-770156---2---6872-8899999999999997587772699 No 153 >pfam02244 Propep_M14 Carboxypeptidase activation peptide. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase, and is responsible for modulation of folding and activity of the pro-enzyme. Probab=34.51 E-value=28 Score=14.06 Aligned_cols=25 Identities=8% Similarity=0.101 Sum_probs=14.5 Q ss_pred EEEECHHHHHHHHHHHHHCCCCEEE Q ss_conf 9999688099899999857984999 Q gi|254780970|r 674 VVCISPENQDLVMSEANNKNIPLRY 698 (737) Q Consensus 674 ii~V~~~~~~~~~~~~~~~gi~~~~ 698 (737) =+-|+|.+...|...++.+++++.+ T Consensus 38 dv~V~p~~~~~f~~~L~~~~I~~~v 62 (75) T pfam02244 38 DIMVPPHKLADFEALLEQNGISYEV 62 (75) T ss_pred EEEECHHHHHHHHHHHHHCCCEEEE T ss_conf 9996999999999999986984899 No 154 >PRK12475 thiamine/molybdopterin biosynthesis MoeB-like protein; Provisional Probab=34.36 E-value=22 Score=14.73 Aligned_cols=22 Identities=23% Similarity=0.314 Sum_probs=14.7 Q ss_pred CCCCCHHHHHHHHHCCCCCCEE Q ss_conf 7752011122332014775289 Q gi|254780970|r 267 MGAAGLTCSAIEMGNQGNLGIT 288 (737) Q Consensus 267 ~gaGGl~~a~~Ema~~~~~G~~ 288 (737) +|+|||.+.+.+...++|+|-- T Consensus 30 VGaGGLGs~~a~~La~aGVG~l 51 (337) T PRK12475 30 IGAGALGAANAEALVRAGIGKL 51 (337) T ss_pred ECCCHHHHHHHHHHHHCCCCEE T ss_conf 9777778999999998289869 No 155 >TIGR03538 DapC_gpp succinyldiaminopimelate transaminase. This family of succinyldiaminopimelate transaminases (DapC) includes the experimentally characterized enzyme from Bordatella pertussis. The majority of genes in this family are proximal to genes encoding components of the lysine biosynthesis via diaminopimelate pathway (GenProp0125). Probab=34.25 E-value=14 Score=16.01 Aligned_cols=32 Identities=19% Similarity=0.167 Sum_probs=16.2 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 66686213469999999985221028866544 Q gi|254780970|r 498 NPEKEEIMGQFVHSVKGIREACQILDFPIVSG 529 (737) Q Consensus 498 ~P~~~e~~~ql~~av~gl~d~c~~lgipivgG 529 (737) +|.+|-..---.+-++.+.+.|++.++.|++- T Consensus 174 sP~NPTG~v~s~~~l~~l~~~a~~~~i~iisD 205 (394) T TIGR03538 174 SPGNPTGAVLSLETLKKLIELADQYGFIIASD 205 (394) T ss_pred CCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEE T ss_conf 99999897778799999999973267699980 No 156 >PRK07682 hypothetical protein; Validated Probab=33.62 E-value=13 Score=16.25 Aligned_cols=38 Identities=16% Similarity=0.278 Sum_probs=22.3 Q ss_pred CCCCCCCCCEEEEEEEECCCCCCCCCCHHHHHHHHHHHH Q ss_conf 225767787436877504578756863888998887458 Q gi|254780970|r 113 LRDIFTMGARPVAAMNSLRFGAIHHPKTKHLLSGVVAGI 151 (737) Q Consensus 113 iRDi~~~Ga~p~a~~~~l~~g~~~~p~~~~~~~~~~~G~ 151 (737) ||.++...+++.-+. .|..|+|+-|-+..+.+.+.+-+ T Consensus 9 ir~~~~~~~~~~dvi-~l~vgdpdf~~p~~i~ea~~~~~ 46 (378) T PRK07682 9 IRKFFDLAANMEGVI-SLGVGEPDFVTPWNVREASIRSL 46 (378) T ss_pred HHHHHHHHHCCCCCE-ECCCCCCCCCCCHHHHHHHHHHH T ss_conf 999999872899828-89999899987899999999998 No 157 >PRK08762 molybdopterin biosynthesis protein MoeB; Validated Probab=33.36 E-value=21 Score=14.87 Aligned_cols=45 Identities=18% Similarity=0.176 Sum_probs=29.4 Q ss_pred CCCCCHHHHHHHHHCCCCCCEEEEEHHHCCCCCCCCHHHHHHCCCC Q ss_conf 7752011122332014775289962022046678999898617888 Q gi|254780970|r 267 MGAAGLTCSAIEMGNQGNLGITLDLDKVPTCEEGMTAYEMMLSESQ 312 (737) Q Consensus 267 ~gaGGl~~a~~Ema~~~~~G~~i~l~~vp~~~~~l~~~ei~~sEsQ 312 (737) +|+|||.+.+......+|+|--.=.|.=.+...+|. ..++++|+. T Consensus 144 vG~GGLGs~~a~yLA~aGVG~i~lvD~D~Ve~SNL~-RQil~~~~d 188 (379) T PRK08762 144 IGAGGLGSPAAFYLAAAGVGHLRIADHDVVDRSNLQ-RQILHTEDS 188 (379) T ss_pred ECCCHHHHHHHHHHHHHCCCEEEEEECCCCCCCCHH-HCCCCCHHH T ss_conf 888755799999999837975897628861334501-125746433 No 158 >PRK06507 consensus Probab=33.09 E-value=25 Score=14.33 Aligned_cols=32 Identities=13% Similarity=-0.014 Sum_probs=15.8 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHC-CCCCCCC Q ss_conf 6668621346999999998522102-8866544 Q gi|254780970|r 498 NPEKEEIMGQFVHSVKGIREACQIL-DFPIVSG 529 (737) Q Consensus 498 ~P~~~e~~~ql~~av~gl~d~c~~l-gipivgG 529 (737) +|.+|-..---.+-++.+.+.|++. ++-|++- T Consensus 172 ~P~NPTG~v~s~~~l~~i~~~a~~~~~v~visD 204 (400) T PRK06507 172 FPNNPTGAACTRAEMAAIAEVMLRHPHVWIMTD 204 (400) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHCCCCEECCC T ss_conf 992931288647799999986653168214056 No 159 >PRK10664 transcriptional regulator HU subunit beta; Provisional Probab=32.89 E-value=29 Score=13.88 Aligned_cols=24 Identities=21% Similarity=0.274 Sum_probs=10.3 Q ss_pred CCCCHHHHHHHHHCCCCHHHHHHH Q ss_conf 966214788998619998999999 Q gi|254780970|r 1 MSKPSPILEILTAHGLTKDECDQI 24 (737) Q Consensus 1 ~~~~~~~~e~~~~~gL~~~e~~~i 24 (737) |.+.+-+.++..+.||+..|.+.+ T Consensus 1 M~K~eLi~~ia~~~~ltk~~a~~~ 24 (90) T PRK10664 1 MNKSQLIDKIAAGADISKAAAGRA 24 (90) T ss_pred CCHHHHHHHHHHHCCCCHHHHHHH T ss_conf 978999999998709999999999 No 160 >PRK07681 aspartate aminotransferase; Provisional Probab=32.73 E-value=16 Score=15.68 Aligned_cols=25 Identities=20% Similarity=0.161 Sum_probs=16.9 Q ss_pred CEEEEECHHHHHHHHHHHHHCCCCE Q ss_conf 1899996880998999998579849 Q gi|254780970|r 672 RYVVCISPENQDLVMSEANNKNIPL 696 (737) Q Consensus 672 r~ii~V~~~~~~~~~~~~~~~gi~~ 696 (737) |+-++.+.+..++.++.++++||.. T Consensus 365 Rl~f~~~~e~l~eal~RLk~a~~~~ 389 (399) T PRK07681 365 RIALVQDKEVLQQVVENIRNSGIFA 389 (399) T ss_pred EEEEECCHHHHHHHHHHHHHCCCCH T ss_conf 9997188999999999998888656 No 161 >PRK07688 thiamine/molybdopterin biosynthesis ThiF/MoeB-like protein; Validated Probab=32.55 E-value=25 Score=14.34 Aligned_cols=21 Identities=29% Similarity=0.396 Sum_probs=13.8 Q ss_pred CCCCCHHHHHHHHHCCCCCCE Q ss_conf 775201112233201477528 Q gi|254780970|r 267 MGAAGLTCSAIEMGNQGNLGI 287 (737) Q Consensus 267 ~gaGGl~~a~~Ema~~~~~G~ 287 (737) +|+|||.+.+.+...++|+|- T Consensus 30 vGaGGLGs~~a~~La~aGVG~ 50 (339) T PRK07688 30 IGAGALGTANAEMLVRAGVGK 50 (339) T ss_pred ECCCHHHHHHHHHHHHCCCCE T ss_conf 877777799999999848982 No 162 >PRK05764 aspartate aminotransferase; Provisional Probab=32.35 E-value=15 Score=15.83 Aligned_cols=32 Identities=13% Similarity=0.073 Sum_probs=18.1 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 66686213469999999985221028866544 Q gi|254780970|r 498 NPEKEEIMGQFVHSVKGIREACQILDFPIVSG 529 (737) Q Consensus 498 ~P~~~e~~~ql~~av~gl~d~c~~lgipivgG 529 (737) +|.+|-..---.+-++.+.+.|++.++.|+.- T Consensus 172 ~P~NPtG~v~s~~~l~~l~~~a~~~~i~ii~D 203 (389) T PRK05764 172 SPSNPTGAVYSKEELEAIADVAVEHDIWVLSD 203 (389) T ss_pred CCCCCCCCCCCHHHHHHHHHHHHHCCCEEEEE T ss_conf 99899886757799999999988635327510 No 163 >TIGR00366 TIGR00366 conserved hypothetical protein TIGR00366; InterPro: IPR006161 This is a family of conserved hypothetical proteins of no clearly defined function, although they may act as short chain fatty acid transporters.; GO: 0015635 short-chain fatty acid transporter activity, 0015912 short-chain fatty acid transport, 0016020 membrane. Probab=32.13 E-value=15 Score=15.87 Aligned_cols=27 Identities=19% Similarity=0.146 Sum_probs=11.7 Q ss_pred CCCCHHHHHHHHHHCCCCCCHHHHHHHCC Q ss_conf 72558899999850732101035653112 Q gi|254780970|r 394 SDEDCTKSLLKLLSSPNLSSRRWVYEQYD 422 (737) Q Consensus 394 ~~~d~~~~l~~vL~~pnV~SK~~i~~qyD 422 (737) ++..+-..+.-.|.--..+- |-|+++= T Consensus 267 E~S~~L~~li~~LG~~ylg~--yFy~~Gf 293 (463) T TIGR00366 267 ENSRLLALLIGALGIIYLGM--YFYEKGF 293 (463) T ss_pred HHHHHHHHHHHHHHHHHHHH--HHHCCCC T ss_conf 25689999999998999888--7741663 No 164 >PRK06207 aspartate aminotransferase; Provisional Probab=32.05 E-value=21 Score=14.91 Aligned_cols=32 Identities=9% Similarity=0.101 Sum_probs=18.5 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 66686213469999999985221028866544 Q gi|254780970|r 498 NPEKEEIMGQFVHSVKGIREACQILDFPIVSG 529 (737) Q Consensus 498 ~P~~~e~~~ql~~av~gl~d~c~~lgipivgG 529 (737) ||.+|-..---.+-++.+.+.|++.++.|++- T Consensus 187 nP~NPTG~v~s~e~l~~l~~la~~~~i~visD 218 (406) T PRK06207 187 NPNNPAGVVYSPEEIAQIAALARRYGATVIAD 218 (406) T ss_pred CCCCCCCHHCCHHHHHHHHHHHHHCCEEEEEE T ss_conf 99798572133999999999875579599996 No 165 >PRK05463 hypothetical protein; Provisional Probab=31.81 E-value=31 Score=13.77 Aligned_cols=53 Identities=17% Similarity=0.218 Sum_probs=36.5 Q ss_pred EEEEECCCHHHHHHHHHHHCCEEECCCCEEECCCCCCCCCCCCCCCCCCCCCCCCCCEEE Q ss_conf 999954675889998887097131122121044310002586443443210146650114 Q gi|254780970|r 316 LMILNPEKQHKAQEILNKWGLHFSIIGITTNDKLFRVIHRGEEVANLPIKALSDEAPEYD 375 (737) Q Consensus 316 ~~~v~~~~~~~~~~i~~~~~~~~~~iG~vt~~~~~~v~~~g~~v~dlp~~~L~~~~P~~~ 375 (737) +++++++..+.|+..|++.-.+|-+|+.. +-+......-+ .+.++. .+.|+|. T Consensus 39 lvILP~~~A~DF~~FC~rNpkpCPLl~~s-e~G~~~~~~la---~~~DiR---TD~P~Y~ 91 (264) T PRK05463 39 LVILPKDWAYDFLLFCQRNPKPCPLLDVS-EPGSPRLPLLG---ADADIR---TDVPRYR 91 (264) T ss_pred EEEEEHHHHHHHHHHHHHCCCCCCEEEEC-CCCCCCCCCCC---CCCCCC---CCCCCEE T ss_conf 99930999999999995599887656513-79997761226---775400---3787428 No 166 >TIGR02662 dinitro_DRAG ADP-ribosyl-[dinitrogen reductase] hydrolase; InterPro: IPR013479 This entry represents ADP-ribosyl-dinitrogen reductase hydrolase (3.2.2.24 from EC), also known as dinitrogenase reductase activating glycohydrolase (DraG). It is involved in the regulation of nitrogen fixation activity by the reversible ADP-ribosylation of the dinitrogenase reductase component of the nitrogenase enzyme complex . The ADP-ribosyltransferase protein DraT acts to inactivate dinitrogen reductase by adding an ADP-ribose group to this enzyme. DraG, in contrast, activates this enzyme by removing the attached ADP-ribose group. . Probab=31.61 E-value=15 Score=15.96 Aligned_cols=20 Identities=25% Similarity=0.423 Sum_probs=8.1 Q ss_pred CHHHHHHHCCCCCCCCCCCCC Q ss_conf 103565311210001100001 Q gi|254780970|r 413 SRRWVYEQYDTMIQSNSIQLP 433 (737) Q Consensus 413 SK~~i~~qyD~~V~g~tv~~P 433 (737) +++.| ++|=+.-|.+++..| T Consensus 98 ~RRGI-r~f~~~rqsG~~~~P 117 (295) T TIGR02662 98 CRRGI-RRFMLHRQSGTLEAP 117 (295) T ss_pred CCHHH-HHHHHCCCCCCEECC T ss_conf 50445-875540467743166 No 167 >pfam06032 DUF917 Protein of unknown function (DUF917). This family consists of hypothetical bacterial and archaeal proteins of unknown function. Probab=31.48 E-value=22 Score=14.74 Aligned_cols=55 Identities=29% Similarity=0.518 Sum_probs=31.4 Q ss_pred CCHHHHHHHHHHHHHHCCCCCCHHCCCCCCCCHHHHHHHHHCCCCCCEEEEEHHHCCCCC Q ss_conf 798999888999874011383011034775201112233201477528996202204667 Q gi|254780970|r 240 GDPFTGKCLLEACLELMNTDAVIAIQDMGAAGLTCSAIEMGNQGNLGITLDLDKVPTCEE 299 (737) Q Consensus 240 gdp~~ek~~~~~~~~~~~~~~i~~~~D~gaGGl~~a~~Ema~~~~~G~~i~l~~vp~~~~ 299 (737) |||+..+.+.+..++....=.+....++....+.+.++-|... .+..+|+|..++ T Consensus 22 Gdp~~~~l~~~~~l~~~~~v~lv~~del~dd~~vv~v~~~GaP-----~v~~Eklp~g~e 76 (352) T pfam06032 22 GDPYIGKLLLRAALKEGGPVELVSPDELPDDDLVVTVGMMGAP-----TVSIEKLPSGDE 76 (352) T ss_pred CCHHHHHHHHHHHHHCCCCEEEEEHHHCCCCCEEEEEEECCCC-----CCHHCCCCCCHH T ss_conf 7479999999999975997699747565988889997453677-----101124778358 No 168 >TIGR01466 cobJ_cbiH precorrin-3B C17-methyltransferase; InterPro: IPR006363 Cobalamin (vitamin B12) is a structurally complex cofactor, consisting of a modified tetrapyrrole with a centrally chelated cobalt. Cobalamin is usually found in one of two biologically active forms: methylcobalamin and adocobalamin. Most prokaryotes, as well as animals, have cobalamin-dependent enzymes, whereas plants and fungi do not appear to use it. In bacteria and archaea, these include methionine synthase, ribonucleotide reductase, glutamate and methylmalonyl-CoA mutases, ethanolamine ammonia lyase, and diol dehydratase . In mammals, cobalamin is obtained through the diet, and is required for methionine synthase and methylmalonyl-CoA mutase . There are at least two distinct cobalamin biosynthetic pathways in bacteria : Aerobic pathway that requires oxygen and in which cobalt is inserted late in the pathway ; found in Pseudomonas denitrificans and Rhodobacter capsulatus. Anaerobic pathway in which cobalt insertion is the first committed step towards cobalamin synthesis ; found in Salmonella typhimurium, Bacillus megaterium, and Propionibacterium freudenreichii shermanii. Either pathway can be divided into two parts: (1) corrin ring synthesis (differs in aerobic and anaerobic pathways) and (2) adenosylation of corrin ring, attachment of aminopropanol arm, and assembly of the nucleotide loop (common to both pathways) . There are about 30 enzymes involved in either pathway, where those involved in the aerobic pathway are prefixed Cob and those of the anaerobic pathway Cbi. Several of these enzymes are pathway-specific: CbiD, CbiG, and CbiK are specific to the anaerobic route of S. typhimurium, whereas CobE, CobF, CobG, CobN, CobS, CobT, and CobW are unique to the aerobic pathway of P. denitrificans. This entry represents CobJ and CbiH precorrin-3B C(17)-methyltransferase (2.1.1.131 from EC), both as stand-alone enzymes and when CobJ forms part of a bifunctional enzyme. In the aerobic pathway, once CobG has generated precorrin-3b, CobJ catalyses the methylation of precorrin-3b at C-17 to form precorrin-4 (the extruded methylated C-20 fragment is left attached as an acyl group at C-1) . In the corresponding anaerobic pathway, CbiH carries out this ring contraction, using cobalt-precorrin-3b as a substrate to generate a tetramethylated delta-lactone. These proteins belong to the superfamily of tetrapyrrole (corrin/porphyrin) methylases (IPR000878 from INTERPRO), which includes methylases that use S-adenosylmethionine (S-AdoMet) in the methylation of diverse substrates. A number of other methylases in the cobalamin biosynthesis pathway also belong to this domain superfamily (precorrin-3 methylase, IPR012203 from INTERPRO, IPR012059 from INTERPRO, IPR012382 from INTERPRO, IPR012797 from INTERPRO and others), and a fusion of precorrin-3B C17-methyltransferases with precorrin isomerase is represented by IPR014422 from INTERPRO. Nomenclature note: precorrin-3B C17-methyltransferase is one of the two methyltransferases often referred to as precorrin-3 methylase (the other is precorrin-4 C11-methyltransferase, 2.1.1.133 from EC). ; GO: 0008168 methyltransferase activity, 0009236 cobalamin biosynthetic process. Probab=31.42 E-value=31 Score=13.72 Aligned_cols=179 Identities=17% Similarity=0.239 Sum_probs=98.7 Q ss_pred EECCCC-HHHHHHCCCCC-CCCEEEEECCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCC Q ss_conf 211211-35543112256-8977999657656444104666677728653334443326798999888999874011383 Q gi|254780970|r 183 AAGIAK-TNAIFSSKARG-IGLPLVYLGAKTGRDGIGGASMASEEFGENIAKKRPTVQVGDPFTGKCLLEACLELMNTDA 260 (737) Q Consensus 183 ~~Gi~~-~~~~~~~~~~~-~Gd~ii~~G~~tg~dGi~Ga~~sS~~~~~~~~~~~~~VQ~gdp~~ek~~~~~~~~~~~~~~ 260 (737) .+|||| .+..++.++++ -.+.=+++|=.||.+=|. .+-++.+ ++ +.-|+-..=.|.-+|+-.+.+-+- T Consensus 3 vvGiGPG~~~~~T~~A~~AL~~ad~ivGY~~Y~~~~~-------~~~~~~~-~~--~~~~M~~E~~Ra~~Ai~~A~~G~~ 72 (254) T TIGR01466 3 VVGIGPGAEELMTPEAKEALAEADVIVGYKTYLRLIE-------DLLDGKK-EV--ITSGMREEIERAELAIELAKEGKT 72 (254) T ss_pred EEECCCCCCCCCCHHHHHHHHHCCEEEEECCHHHHHH-------HHCCCCC-EE--EECCCCHHHHHHHHHHHHHHCCCE T ss_conf 8971888910168899999874888883001267787-------6468962-48--767796278999999999860994 Q ss_pred --CHHCCCCCCCCHHHHHHHHHCCCCC-CEEEEEHHHC------------------------CCCCCCCHHHHHH----- Q ss_conf --0110347752011122332014775-2899620220------------------------4667899989861----- Q gi|254780970|r 261 --VIAIQDMGAAGLTCSAIEMGNQGNL-GITLDLDKVP------------------------TCEEGMTAYEMML----- 308 (737) Q Consensus 261 --i~~~~D~gaGGl~~a~~Ema~~~~~-G~~i~l~~vp------------------------~~~~~l~~~ei~~----- 308 (737) +.|==|.|==|++..+-|+.++.+. =..+|.+=|| +.| -|.||++.. T Consensus 73 VAlvSSGDpGiYgMA~l~~E~~~~~~~~~~~~dieV~PGiTA~~aAAs~LGaPL~HDFc~ISLSD-lLtPw~~Ie~R~~a 151 (254) T TIGR01466 73 VALVSSGDPGIYGMAALVFEILEKKGLEDVAIDIEVIPGITAASAAASLLGAPLGHDFCVISLSD-LLTPWPVIEKRLRA 151 (254) T ss_pred EEEEECCCHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCHHHHHHHHHHHCCCCCCCEEEEECCC-CCCCHHHHHHHHHH T ss_conf 89993687527888999999986238997723479858833789999973363113788887455-56787899999999 Q ss_pred -CCCCCEEEEEE-------ECCCHHHHHHHHHHHCCEEECCCCEEEC----CCCCCC---CCCCCC------CCCCCCCC Q ss_conf -78885089999-------5467588999888709713112212104----431000---258644------34432101 Q gi|254780970|r 309 -SESQERMLMIL-------NPEKQHKAQEILNKWGLHFSIIGITTND----KLFRVI---HRGEEV------ANLPIKAL 367 (737) Q Consensus 309 -sEsQeR~~~~v-------~~~~~~~~~~i~~~~~~~~~~iG~vt~~----~~~~v~---~~g~~v------~dlp~~~L 367 (737) .|+ =|++++ .++++-++++|+.++.-+-+.||-|-+= ..+.++ .=.+.. +|+-.-.| T Consensus 152 AA~a--DFVia~YNP~S~~R~~~~~~a~eIL~~~r~~~TpVGiVr~AgR~~e~v~ittL~~l~~~~~~~~~l~DM~T~vi 229 (254) T TIGR01466 152 AAEA--DFVIAIYNPRSKRRKEQLRRAVEILLEHRKPDTPVGIVRNAGREDEEVEITTLSELDEELEKELDLIDMLTTVI 229 (254) T ss_pred HHCC--CEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCEEEEEEHHHCCCCCCCCEEEECCEEEEE T ss_conf 8449--92899974898765446999999999856899728988745899854899764330001110133443458999 Q ss_pred CCCCCEE Q ss_conf 4665011 Q gi|254780970|r 368 SDEAPEY 374 (737) Q Consensus 368 ~~~~P~~ 374 (737) .||.--| T Consensus 230 iGN~~T~ 236 (254) T TIGR01466 230 IGNSETY 236 (254) T ss_pred ECCCCCE T ss_conf 7178525 No 169 >PRK06107 aspartate aminotransferase; Provisional Probab=31.36 E-value=22 Score=14.79 Aligned_cols=18 Identities=0% Similarity=0.137 Sum_probs=8.3 Q ss_pred HHHHHHHHHHC-CCCCCCC Q ss_conf 99998522102-8866544 Q gi|254780970|r 512 VKGIREACQIL-DFPIVSG 529 (737) Q Consensus 512 v~gl~d~c~~l-gipivgG 529 (737) ++.+.+.|.+. ++.|++- T Consensus 188 l~~i~~i~~~~~~i~iisD 206 (402) T PRK06107 188 LRALADVLLRHPHVLVLTD 206 (402) T ss_pred HHHHHHHHHHCCCCEEEEE T ss_conf 9999887752489328622 No 170 >PRK06348 aspartate aminotransferase; Provisional Probab=31.20 E-value=15 Score=15.90 Aligned_cols=33 Identities=15% Similarity=0.107 Sum_probs=19.6 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 566686213469999999985221028866544 Q gi|254780970|r 497 GNPEKEEIMGQFVHSVKGIREACQILDFPIVSG 529 (737) Q Consensus 497 ~~P~~~e~~~ql~~av~gl~d~c~~lgipivgG 529 (737) .+|.+|-..---.+-++.+.+.|++.++.+++- T Consensus 169 ~~P~NPTG~v~~~e~l~~l~~la~~~~i~visD 201 (383) T PRK06348 169 NSPNNPTGAVFSKETLEEIAKVAIENDLVIISD 201 (383) T ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHCCEEEEEC T ss_conf 999798888777789999997653068699851 No 171 >PRK04914 ATP-dependent helicase HepA; Validated Probab=31.16 E-value=16 Score=15.66 Aligned_cols=43 Identities=30% Similarity=0.559 Sum_probs=27.6 Q ss_pred CCCHHHHHHCCCCCEEEEEEECCCHHHH-HHHHHHHCCEEECCC Q ss_conf 8999898617888508999954675889-998887097131122 Q gi|254780970|r 300 GMTAYEMMLSESQERMLMILNPEKQHKA-QEILNKWGLHFSIIG 342 (737) Q Consensus 300 ~l~~~ei~~sEsQeR~~~~v~~~~~~~~-~~i~~~~~~~~~~iG 342 (737) ||--.|.+..+--+|++++||+.-..++ .++.+|++++++++. T Consensus 186 GLIl~ell~rgra~RvLIvvP~~L~~QW~~EL~~KF~L~f~i~D 229 (955) T PRK04914 186 GLIIHQQLLTGRAERVLIIVPETLQHQWLVEMLRRFNLRFSLFD 229 (955) T ss_pred HHHHHHHHHCCCCCEEEEECCHHHHHHHHHHHHHHCCCCCEEEC T ss_conf 99999999748777799992779989999999998399809955 No 172 >pfam01988 VIT1 VIT family. This family includes the vacuolar Fe2+/Mn2+ uptake transporter, Ccc1 and the vacuolar iron transporter VIT1. Probab=31.07 E-value=32 Score=13.68 Aligned_cols=21 Identities=14% Similarity=0.204 Sum_probs=9.3 Q ss_pred CHHHHHHHHHHHCCEEECCCC Q ss_conf 758899988870971311221 Q gi|254780970|r 323 KQHKAQEILNKWGLHFSIIGI 343 (737) Q Consensus 323 ~~~~~~~i~~~~~~~~~~iG~ 343 (737) ..+++.+++++.|++-...-+ T Consensus 81 e~~el~~i~~~~Gl~~~~a~~ 101 (210) T pfam01988 81 EREELAQIYRARGLSEELARQ 101 (210) T ss_pred HHHHHHHHHHHCCCCHHHHHH T ss_conf 999999999874999899999 No 173 >PRK07337 aminotransferase; Validated Probab=31.07 E-value=13 Score=16.23 Aligned_cols=17 Identities=18% Similarity=0.053 Sum_probs=8.7 Q ss_pred HHHHHHHHHHCCCCCCC Q ss_conf 99998522102886654 Q gi|254780970|r 512 VKGIREACQILDFPIVS 528 (737) Q Consensus 512 v~gl~d~c~~lgipivg 528 (737) ++.+.+.|++.++.+++ T Consensus 185 l~~i~~~a~~~~~~vIs 201 (388) T PRK07337 185 LRRIVEAVRARGGFTIV 201 (388) T ss_pred HHHHHHHHHCCCEEEEC T ss_conf 43355443145808845 No 174 >TIGR02355 moeB molybdopterin synthase sulfurylase MoeB; InterPro: IPR012730 This entry describes the molybdopterin biosynthesis protein MoeB in Escherichia coli and related species. MoeB and MoaD are involved in molybdenum cofactor biosynthesis, an evolutionarily conserved pathway. The MoeB enzyme covalently modifies the molybdopterin synthase MoaD by sulphurylation. The crystal structure of the complex between the E. coli MoeB and MoaD proteins reveals a MoeB(2)-MoaD(2) heterotetramer in which the MoeB subunits form a mainly hydrophobic dimer . MoeB is closely related to ThiF, a thiamine biosynthesis enzyme that modifies ThiS by an analogous adenylation. Both MoeB and ThiF belong to the HesA/MoeB/ThiF family (IPR000594 from INTERPRO). ; GO: 0006777 Mo-molybdopterin cofactor biosynthetic process. Probab=30.96 E-value=26 Score=14.25 Aligned_cols=43 Identities=7% Similarity=-0.104 Sum_probs=17.2 Q ss_pred HHHHHHHHCCEEECCCCEEECCCCCCC-CCCCCCCCCCCCCCCC Q ss_conf 999888709713112212104431000-2586443443210146 Q gi|254780970|r 327 AQEILNKWGLHFSIIGITTNDKLFRVI-HRGEEVANLPIKALSD 369 (737) Q Consensus 327 ~~~i~~~~~~~~~~iG~vt~~~~~~v~-~~g~~v~dlp~~~L~~ 369 (737) +..-|-+..+|-.-=.-+.=+|.+.|+ |..++|++-.++.||+ T Consensus 131 Ln~~Cf~~K~PLvSgaAIRmEG~v~VFtY~~~~PcY~clS~LFg 174 (240) T TIGR02355 131 LNRQCFAAKVPLVSGAAIRMEGQVSVFTYQDDEPCYRCLSKLFG 174 (240) T ss_pred HHHHHHHCCCCCEEHHHHHCCCEEEEEEECCCCCHHHHHHHHHC T ss_conf 75777623898000122100451789852597871889999840 No 175 >TIGR00987 himA integration host factor, alpha subunit; InterPro: IPR005684 Integration host factor (IHF) is a heterodimeric protein consisting of alpha and beta (IPR005685 from INTERPRO) subunits. IHF binds the minor groove of DNA in a sequence-specific manner to induce a large bend. This bending stabilises distinct DNA conformations that are required during several bacterial processes, such as recombination, transposition, replication and transcription .; GO: 0003677 DNA binding, 0005694 chromosome. Probab=30.66 E-value=32 Score=13.64 Aligned_cols=15 Identities=27% Similarity=0.534 Sum_probs=7.0 Q ss_pred HHHHHCCCCHHHHHH Q ss_conf 899861999899999 Q gi|254780970|r 9 EILTAHGLTKDECDQ 23 (737) Q Consensus 9 e~~~~~gL~~~e~~~ 23 (737) .++.++||+..|-.. T Consensus 10 ~~f~~~GLs~~ea~~ 24 (96) T TIGR00987 10 ALFRELGLSKREAKE 24 (96) T ss_pred HHHHHHCCCHHHHHH T ss_conf 999982888557789 No 176 >PRK08362 consensus Probab=30.62 E-value=15 Score=15.93 Aligned_cols=16 Identities=6% Similarity=0.104 Sum_probs=8.6 Q ss_pred CCCHHHHHHHHHHHCC Q ss_conf 4675889998887097 Q gi|254780970|r 321 PEKQHKAQEILNKWGL 336 (737) Q Consensus 321 ~~~~~~~~~i~~~~~~ 336 (737) ++.++++.++|+++++ T Consensus 179 ~~~l~~i~~~a~~~~~ 194 (389) T PRK08362 179 KKDLEEIADFAVEHDL 194 (389) T ss_pred HHHHHHHHHHHHHCCE T ss_conf 9999999999875495 No 177 >cd01487 E1_ThiF_like E1_ThiF_like. Member of superfamily of activating enzymes (E1) of the ubiquitin-like proteins. The common reaction mechanism catalyzed by E1-like enzymes begins with a nucleophilic attack of the C-terminal carboxylate of the ubiquitin-like substrate, on the alpha-phosphate of an ATP molecule bound at the active site of the activating enzymes, leading to the formation of a high-energy acyladenylate intermediate and subsequently to the formation of a thiocarboxylate at the C termini of the substrate. The exact function of this family is unknown. Probab=29.84 E-value=33 Score=13.55 Aligned_cols=59 Identities=19% Similarity=0.256 Sum_probs=32.3 Q ss_pred CCCCCHHHHHHHHHCCCCCCEE--EEEHHHCCCCCCCCHHHHHHCCCCCEEEEEEECCCHHHHHHHHHHHC Q ss_conf 7752011122332014775289--96202204667899989861788850899995467588999888709 Q gi|254780970|r 267 MGAAGLTCSAIEMGNQGNLGIT--LDLDKVPTCEEGMTAYEMMLSESQERMLMILNPEKQHKAQEILNKWG 335 (737) Q Consensus 267 ~gaGGl~~a~~Ema~~~~~G~~--i~l~~vp~~~~~l~~~ei~~sEsQeR~~~~v~~~~~~~~~~i~~~~~ 335 (737) +|+|||.|.+.++..++|+|-- +|-|+| ...+|.. .++..|. +-+.+.+...+.+++-+ T Consensus 5 vG~GGLG~~~a~~La~aGvg~i~lvD~D~V--e~SNL~R-Q~~~~~d-------iG~~Ka~~a~~~l~~iN 65 (174) T cd01487 5 AGAGGLGSNIAVLLARSGVGNLKLVDFDVV--EPSNLNR-QQYFLSQ-------IGEPKVEALKENLREIN 65 (174) T ss_pred ECCCHHHHHHHHHHHHHCCCEEEEEECCCC--CCCCHHC-CHHHHHH-------CCCHHHHHHHHHHHHHC T ss_conf 876788999999999818973999989946--6644301-3247774-------79749999999999879 No 178 >TIGR01304 IMP_DH_rel_2 IMP dehydrogenase family protein; InterPro: IPR005992 This family of proteins, often annotated as a putative IMP dehydrogenase, are related to IMP dehydrogenase and GMP reductase. Most species with a member of this family belong to the high GC Gram-positive bacteria.. Probab=29.64 E-value=22 Score=14.74 Aligned_cols=19 Identities=37% Similarity=0.651 Sum_probs=12.5 Q ss_pred HHHHHHHHCCCCCCCCCCC Q ss_conf 9985221028866544320 Q gi|254780970|r 514 GIREACQILDFPIVSGNVS 532 (737) Q Consensus 514 gl~d~c~~lgipivgGkvS 532 (737) -|++||.+|++|||-|++- T Consensus 181 nLk~fi~eLDvPVv~Ggv~ 199 (376) T TIGR01304 181 NLKKFIQELDVPVVAGGVV 199 (376) T ss_pred HHHHHHHHCCCCEEECCCC T ss_conf 4888975489887883853 No 179 >PRK04322 peptidyl-tRNA hydrolase; Provisional Probab=29.37 E-value=34 Score=13.49 Aligned_cols=46 Identities=22% Similarity=0.415 Sum_probs=27.0 Q ss_pred HHHHHCCCCCEEEEEEEC-CCHHHHHHHHHHHCCEEECCCCEEECCCCCC Q ss_conf 898617888508999954-6758899988870971311221210443100 Q gi|254780970|r 304 YEMMLSESQERMLMILNP-EKQHKAQEILNKWGLHFSIIGITTNDKLFRV 352 (737) Q Consensus 304 ~ei~~sEsQeR~~~~v~~-~~~~~~~~i~~~~~~~~~~iG~vt~~~~~~v 352 (737) ++-|.++.|-..++-++. +.+..+.+.+++.++++.+| .+.++-.+ T Consensus 42 ~~~W~~~G~~KIvlkv~~e~~L~~l~~~a~~~gl~~~~V---~DAG~Tei 88 (116) T PRK04322 42 FEEWLNEGQKKVVLKVNSEEELLELKKKAERLGLPTALI---RDAGLTQI 88 (116) T ss_pred HHHHHHCCCCEEEEECCCHHHHHHHHHHHHHCCCCEEEE---ECCCCCCC T ss_conf 999995799479998599999999999999879968999---76887353 No 180 >PRK10753 transcriptional regulator HU subunit alpha; Provisional Probab=28.93 E-value=34 Score=13.44 Aligned_cols=20 Identities=25% Similarity=0.194 Sum_probs=7.1 Q ss_pred CCHHHHHHHHHCCCCHHHHH Q ss_conf 62147889986199989999 Q gi|254780970|r 3 KPSPILEILTAHGLTKDECD 22 (737) Q Consensus 3 ~~~~~~e~~~~~gL~~~e~~ 22 (737) +.+-+.++..+.||+..|.+ T Consensus 3 K~eLi~~ia~~~~~sk~~~~ 22 (90) T PRK10753 3 KTQLIDVIADKAELSKTQAK 22 (90) T ss_pred HHHHHHHHHHHHCCCHHHHH T ss_conf 89999999987398999999 No 181 >PRK11706 TDP-4-oxo-6-deoxy-D-glucose transaminase; Provisional Probab=28.87 E-value=13 Score=16.34 Aligned_cols=37 Identities=8% Similarity=-0.078 Sum_probs=23.8 Q ss_pred CCCHHHHHHHHHHHCCC-C--CHHHHHHHHHHHCCCCCCC Q ss_conf 99989999999995789-7--1889999853300110744 Q gi|254780970|r 15 GLTKDECDQIIHILKRQ-P--TLTEIGIISAMWNEHCSYK 51 (737) Q Consensus 15 gL~~~e~~~i~~~l~R~-P--t~~El~~~~~~wSEHC~~k 51 (737) -++.+|++++.+.|... - .--...-|.+.+++.+.-| T Consensus 8 ~~~~~e~~~v~~~l~sg~~~~~G~~v~~fE~~~a~~~g~~ 47 (375) T PRK11706 8 PVVGTELDYIQEAMSSGKLSGDGGFTRRCQQWLEQRFGCA 47 (375) T ss_pred CCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHCCC T ss_conf 9998999999999970987689979999999999997849 No 182 >cd04914 ACT_AKi-DapG-BS_1 ACT domains of the diaminopimelate-sensitive aspartokinase (AK) isoenzyme AKI. This CD includes the N-terminal of the two ACT domains of the diaminopimelate-sensitive aspartokinase (AK) isoenzyme AKI, a monofunctional class enzyme found in Bacilli (Bacillus subtilis (BS) strain 168), Clostridia, and Actinobacteria, bacterial species. In B. subtilis, the regulation of the diaminopimelate-lysine biosynthetic pathway involves dual control by diaminopimelate and lysine, effected through separate diaminopimelate- and lysine-sensitive aspartokinase isoenzymes. AKI activity is invariant during the exponential and stationary phases of growth and is not altered by addition of amino acids to the growth medium. The role of this isoenzyme is most likely to provide a constant level of aspartyl-beta-phosphate for the biosynthesis of diaminopimelate for peptidoglycan synthesis and dipicolinate during sporulation. The B. subtilis AKI is tetrameric consisting of two alpha and Probab=28.87 E-value=34 Score=13.44 Aligned_cols=32 Identities=3% Similarity=0.016 Sum_probs=22.0 Q ss_pred CCCCCCEEEEECHHHHHHHHHHHHHCCCCEEE Q ss_conf 98832189999688099899999857984999 Q gi|254780970|r 667 GEDQGRYVVCISPENQDLVMSEANNKNIPLRY 698 (737) Q Consensus 667 sEs~Gr~ii~V~~~~~~~~~~~~~~~gi~~~~ 698 (737) +-+|...+++|+.+..+.+.+++++.|+...+ T Consensus 34 NI~P~~~vfTV~~~~~~ka~~iL~~lg~~p~v 65 (67) T cd04914 34 NVSPEEVIFTVDGEVAEKAVDILEKMGLDPSV 65 (67) T ss_pred EECCCCEEEEECHHHHHHHHHHHHHCCCCCEE T ss_conf 86677479980778899999999986998375 No 183 >PRK08361 aspartate aminotransferase; Provisional Probab=28.85 E-value=16 Score=15.66 Aligned_cols=20 Identities=20% Similarity=0.295 Sum_probs=12.0 Q ss_pred HHHHHHHHHHHHCCCCCCCC Q ss_conf 99999985221028866544 Q gi|254780970|r 510 HSVKGIREACQILDFPIVSG 529 (737) Q Consensus 510 ~av~gl~d~c~~lgipivgG 529 (737) +-++.+.+.|++.++.|++- T Consensus 185 e~l~~l~~la~~~~i~ii~D 204 (390) T PRK08361 185 ETAKAIADIAEDYNIYILSD 204 (390) T ss_pred HHHHHHHHHHHHCCCEECCH T ss_conf 88889999998639553470 No 184 >PRK08912 hypothetical protein; Provisional Probab=28.78 E-value=15 Score=15.89 Aligned_cols=32 Identities=19% Similarity=0.062 Sum_probs=18.5 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 66686213469999999985221028866544 Q gi|254780970|r 498 NPEKEEIMGQFVHSVKGIREACQILDFPIVSG 529 (737) Q Consensus 498 ~P~~~e~~~ql~~av~gl~d~c~~lgipivgG 529 (737) +|.+|-..---.+-++.+.+.|++.++.+++- T Consensus 167 ~P~NPTG~v~s~e~l~~l~~~a~~~~i~vi~D 198 (387) T PRK08912 167 NPLNPAGKVFPREDLALLAEFCQRHDAVAICD 198 (387) T ss_pred CCCCCCCCCCCHHHHHHHHHHHHHCCEEEEEC T ss_conf 99898885686999999999886579799802 No 185 >pfam01726 LexA_DNA_bind LexA DNA binding domain. This is the DNA binding domain of the LexA SOS regulon repressor which prevents expression of DNA repair proteins. The aligned region contains a variant form of the helix-turn-helix DNA binding motif. This domain is found associated with pfam00717 the auto-proteolytic domain of LexA EC:3.4.21.88. Probab=28.59 E-value=35 Score=13.40 Aligned_cols=37 Identities=24% Similarity=0.344 Sum_probs=26.8 Q ss_pred HHHHHHHH---CCCCCHHHHHHHHHHHCCCCCCCCCH---HHHHHHHHCC Q ss_conf 99999995---78971889999853300110744588---9997512069 Q gi|254780970|r 21 CDQIIHIL---KRQPTLTEIGIISAMWNEHCSYKSSK---KWLRTLPTTG 64 (737) Q Consensus 21 ~~~i~~~l---~R~Pt~~El~~~~~~wSEHC~~k~~~---~~l~~~~~~~ 64 (737) +++|+++. |..||--|+. +||.+||.. .+|+.+...| T Consensus 12 l~~I~~~~~~~G~~Pt~rEI~-------~~~g~~S~s~v~~~l~~Le~kG 54 (65) T pfam01726 12 LDFIKASIEETGYPPSRREIA-------QALGLRSPNAAEEHLKALERKG 54 (65) T ss_pred HHHHHHHHHHHCCCCCHHHHH-------HHHCCCCHHHHHHHHHHHHHCC T ss_conf 999999999828898799999-------9938998099999999999983 No 186 >PRK05794 consensus Probab=28.54 E-value=20 Score=14.99 Aligned_cols=32 Identities=16% Similarity=0.104 Sum_probs=16.1 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 66686213469999999985221028866544 Q gi|254780970|r 498 NPEKEEIMGQFVHSVKGIREACQILDFPIVSG 529 (737) Q Consensus 498 ~P~~~e~~~ql~~av~gl~d~c~~lgipivgG 529 (737) +|.+|-..---.+-++.+.+.|++.++.+++- T Consensus 171 ~P~NPtG~v~s~~~l~~l~~la~~~~i~visD 202 (397) T PRK05794 171 SPNNPTGTVYSKEELEEIAKFAKEHDLFIISD 202 (397) T ss_pred CCCCCCCCCCCCHHHHHHHHHHHHCEEEEECH T ss_conf 99898782435067999997654103688433 No 187 >TIGR02393 RpoD_Cterm RNA polymerase sigma factor RpoD; InterPro: IPR012760 The bacterial core RNA polymerase complex, which consists of five subunits, is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme . RNA polymerase recruits alternative sigma factors as a means of switching on specific regulons. Most bacteria express a multiplicity of sigma factors. Two of these factors, sigma-70 (gene rpoD), generally known as the major or primary sigma factor, and sigma-54 (gene rpoN or ntrA) direct the transcription of a wide variety of genes. The other sigma factors, known as alternative sigma factors, are required for the transcription of specific subsets of genes. With regard to sequence similarity, sigma factors can be grouped into two classes, the sigma-54 and sigma-70 families. Sequence alignments of the sigma70 family members reveal four conserved regions that can be further divided into subregions eg. sub-region 2.2, which may be involved in the binding of the sigma factor to the core RNA polymerase; and sub-region 4.2, which seems to harbor a DNA-binding 'helix-turn-helix' motif involved in binding the conserved -35 region of promoters recognized by the major sigma factors , . This entry represents the C-terminal region of sigma 70 (RpoD) which contains the well-conserved regions 2, 3 and 4. Region 2 of sigma-70 is the most conserved region of the entire protein. The high conservation is due to region 2 containing both the -10 promoter recognition helix and the primary core RNA polymerase binding determinant. The core-binding helix, interacts with the clamp domain of the largest polymerase subunit, beta prime , . The aromatic residues of the recognition helix, found at the C terminus of this domain are thought to mediate strand separation, thereby allowing transcription initiation , . Region 3 forms a discrete compact three helical domain within the sigma-factor. Region is not normally involved in the recognition of promoter DNA, but in some specific bacterial promoters containing an extended -10 promoter element, residues within region 3 play an important role. Region 3 primarily is involved in binding the core RNA polymerase in the holoenzyme .Region 4 is involved in binding to the -35 promoter element via a helix-turn-helix motif .; GO: 0003677 DNA binding, 0016987 sigma factor activity, 0006352 transcription initiation, 0006355 regulation of transcription DNA-dependent. Probab=27.69 E-value=36 Score=13.30 Aligned_cols=21 Identities=10% Similarity=0.384 Sum_probs=10.4 Q ss_pred HHHHHHHH-HHHHHHHHHCCCCC Q ss_conf 34699999-99985221028866 Q gi|254780970|r 505 MGQFVHSV-KGIREACQILDFPI 526 (737) Q Consensus 505 ~~ql~~av-~gl~d~c~~lgipi 526 (737) || ..+|+ ++++|=+|..-||| T Consensus 59 WW-IRQAITRAIADQARtIRiPV 80 (240) T TIGR02393 59 WW-IRQAITRAIADQARTIRIPV 80 (240) T ss_pred HH-HHHHHHHHHHHCCCCCCCCH T ss_conf 89-99999998875137201103 No 188 >PRK07366 succinyldiaminopimelate transaminase; Validated Probab=27.68 E-value=16 Score=15.72 Aligned_cols=32 Identities=16% Similarity=0.136 Sum_probs=19.0 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 66686213469999999985221028866544 Q gi|254780970|r 498 NPEKEEIMGQFVHSVKGIREACQILDFPIVSG 529 (737) Q Consensus 498 ~P~~~e~~~ql~~av~gl~d~c~~lgipivgG 529 (737) ||.+|-..---.+-++.+.+.|++.++.+++- T Consensus 173 ~P~NPTG~v~s~~~l~~i~~~a~~~~i~lisD 204 (388) T PRK07366 173 YPHNPTTAIAPLSFFQEAVAFCQQHDLVLVHD 204 (388) T ss_pred CCCCCCCEEEEHHHHHHHHHHHHHCCEEEECC T ss_conf 89898466803888888999874055078622 No 189 >cd02962 TMX2 TMX2 family; composed of proteins similar to human TMX2, a 372-amino acid TRX-related transmembrane protein, identified and characterized through the cloning of its cDNA from a human fetal library. It contains a TRX domain but the redox active CXXC motif is replaced with SXXC. Sequence analysis predicts that TMX2 may be a Type I membrane protein, with its C-terminal half protruding on the luminal side of the endoplasmic reticulum (ER). In addition to the TRX domain, transmembrane region and ER-retention signal, TMX2 also contains a Myb DNA-binding domain repeat signature and a dileucine motif in the tail. Probab=27.63 E-value=4.1 Score=19.73 Aligned_cols=40 Identities=18% Similarity=0.199 Sum_probs=20.5 Q ss_pred CCHHHHHHHHHHHCCEEECCCCEEECCCCCCCCCCCCCCCCC Q ss_conf 675889998887097131122121044310002586443443 Q gi|254780970|r 322 EKQHKAQEILNKWGLHFSIIGITTNDKLFRVIHRGEEVANLP 363 (737) Q Consensus 322 ~~~~~~~~i~~~~~~~~~~iG~vt~~~~~~v~~~g~~v~dlp 363 (737) =|+.++.++++++++..+...+ .=+.+.++.+|+++...| T Consensus 86 vDv~r~p~vA~ky~I~~s~~s~--qlPT~iLF~~G~Ev~R~P 125 (152) T cd02962 86 IDIGRFPNVAEKFRVSTSPLSK--QLPTIILFQGGKEVARRP 125 (152) T ss_pred EECCCCHHHHHHCCCCCCCCCC--CCCEEEEEECCEEEEECC T ss_conf 8766374069882867532336--587599974889945637 No 190 >PRK05942 aspartate aminotransferase; Provisional Probab=27.42 E-value=19 Score=15.16 Aligned_cols=32 Identities=9% Similarity=0.038 Sum_probs=15.4 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 66686213469999999985221028866544 Q gi|254780970|r 498 NPEKEEIMGQFVHSVKGIREACQILDFPIVSG 529 (737) Q Consensus 498 ~P~~~e~~~ql~~av~gl~d~c~~lgipivgG 529 (737) ||.+|-..---.+.++.+.+.|++.++.|++- T Consensus 178 ~P~NPTG~v~~~e~l~~i~~~a~~~~i~vIsD 209 (394) T PRK05942 178 YPSNPTTATAPREFFEEIVAFARKYEILLVHD 209 (394) T ss_pred CCCCCCCHHCCHHHHHHHHHHHHCCCCEEEEC T ss_conf 89881012201999999999885178679703 No 191 >PRK07411 hypothetical protein; Validated Probab=27.36 E-value=36 Score=13.26 Aligned_cols=20 Identities=20% Similarity=0.405 Sum_probs=13.7 Q ss_pred CCCCCHHHHHHHHHCCCCCC Q ss_conf 77520111223320147752 Q gi|254780970|r 267 MGAAGLTCSAIEMGNQGNLG 286 (737) Q Consensus 267 ~gaGGl~~a~~Ema~~~~~G 286 (737) +|+|||.|.+..-..++|+| T Consensus 44 vG~GGLG~p~~~yLaaaGvG 63 (390) T PRK07411 44 IGTGGLGSPLLLYLAAAGIG 63 (390) T ss_pred ECCCHHHHHHHHHHHHCCCC T ss_conf 88872379999999983897 No 192 >cd00616 AHBA_syn 3-amino-5-hydroxybenzoic acid synthase family (AHBA_syn). AHBA_syn family belongs to pyridoxal phosphate (PLP)-dependent aspartate aminotransferase superfamily (fold I). The members of this CD are involved in various biosynthetic pathways for secondary metabolites. Some well studied proteins in this CD are AHBA_synthase, protein product of pleiotropic regulatory gene degT, Arnb aminotransferase and pilin glycosylation protein. The prototype of this family, the AHBA_synthase, is a dimeric PLP dependent enzyme. AHBA_syn is the terminal enzyme of 3-amino-5-hydroxybenzoic acid (AHBA) formation which is involved in the biosynthesis of ansamycin antibiotics, including rifamycin B. Some members of this CD are involved in 4-amino-6-deoxy-monosaccharide D-perosamine synthesis. Perosamine is an important element in the glycosylation of several cell products, such as antibiotics and lipopolysaccharides of gram-positive and gram-negative bacteria. The pilin glycosylation protein Probab=27.35 E-value=12 Score=16.52 Aligned_cols=26 Identities=27% Similarity=0.521 Sum_probs=15.5 Q ss_pred CEEEEECH---HHHHHHHHHHHHCCCCEE Q ss_conf 18999968---809989999985798499 Q gi|254780970|r 672 RYVVCISP---ENQDLVMSEANNKNIPLR 697 (737) Q Consensus 672 r~ii~V~~---~~~~~~~~~~~~~gi~~~ 697 (737) .+.+.+++ .+.+.+.+.++++||.+. T Consensus 269 ~~~i~l~~~~~~~R~~l~~~L~~~gI~~~ 297 (352) T cd00616 269 LYVIRLDPEAGESRDELIEALKEAGIETR 297 (352) T ss_pred EEEEEECCCCCCCHHHHHHHHHHCCCCEE T ss_conf 99999865021059999999998799858 No 193 >cd01316 CAD_DHOase The eukaryotic CAD protein is a trifunctional enzyme of carbamoylphosphate synthetase-aspartate transcarbamoylase-dihydroorotase, which catalyzes the first three steps of de novo pyrimidine nucleotide biosynthesis. Dihydroorotase (DHOase) catalyzes the third step, the reversible interconversion of carbamoyl aspartate to dihydroorotate. Probab=27.26 E-value=36 Score=13.25 Aligned_cols=16 Identities=19% Similarity=0.455 Sum_probs=7.1 Q ss_pred HHHHHH-HHHHHHCCCC Q ss_conf 999999-8522102886 Q gi|254780970|r 510 HSVKGI-REACQILDFP 525 (737) Q Consensus 510 ~av~gl-~d~c~~lgip 525 (737) +.++-+ ..-++-||++ T Consensus 265 ~lv~~~s~nPAki~gl~ 281 (344) T cd01316 265 DIVDRLHTNPKRIFNLP 281 (344) T ss_pred HHHHHHHHHHHHHHCCC T ss_conf 99999949899972889 No 194 >pfam01791 DeoC DeoC/LacD family aldolase. This family includes diverse aldolase enzymes. This family includes the enzyme deoxyribose-phosphate aldolase EC:4.1.2.4, which is involved in nucleotide metabolism. The family also includes a group of related bacterial proteins of unknown function. The family also includes tagatose 1,6-diphosphate aldolase (EC:4.1.2.40) is part of the tagatose-6-phosphate pathway of galactose-6-phosphate degradation. Probab=27.21 E-value=37 Score=13.24 Aligned_cols=50 Identities=20% Similarity=0.293 Sum_probs=34.3 Q ss_pred HHHHHCCCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 76410278510223221024566686213469999999985221028866544 Q gi|254780970|r 477 WRNIIATGAKPLAITDNLNFGNPEKEEIMGQFVHSVKGIREACQILDFPIVSG 529 (737) Q Consensus 477 l~Nl~a~Ga~Plait~~lnf~~P~~~e~~~ql~~av~gl~d~c~~lgipivgG 529 (737) +...+..||+-+.++.++.+.. ++.+.+..+-+..+.+.|..+++|++-= T Consensus 80 v~~Ai~~GAdeid~vi~~~~~~---~~~~~~~~~~i~~v~~~~~~~~lkvIiE 129 (231) T pfam01791 80 AEEAIALGADEVDMVVNIGADG---SEDEEQVLAEIAAVKEACHGEGLKVILE 129 (231) T ss_pred HHHHHHCCCCEEEEEEECCCCC---CCCHHHHHHHHHHHHHHHHHCCCEEEEE T ss_conf 9999987998899997246578---9559999999999998631048708999 No 195 >COG4840 Uncharacterized protein conserved in bacteria [Function unknown] Probab=27.08 E-value=37 Score=13.23 Aligned_cols=18 Identities=22% Similarity=0.292 Sum_probs=8.0 Q ss_pred HCCCCHHHHHHHHHHHCC Q ss_conf 619998999999999578 Q gi|254780970|r 13 AHGLTKDECDQIIHILKR 30 (737) Q Consensus 13 ~~gL~~~e~~~i~~~l~R 30 (737) +..+|+.|++.|.+.||| T Consensus 51 kenfSpsEmqaiA~eL~r 68 (71) T COG4840 51 KENFSPSEMQAIADELGR 68 (71) T ss_pred HCCCCHHHHHHHHHHHHH T ss_conf 625998999999999987 No 196 >COG0436 Aspartate/tyrosine/aromatic aminotransferase [Amino acid transport and metabolism] Probab=26.92 E-value=14 Score=16.05 Aligned_cols=93 Identities=25% Similarity=0.152 Sum_probs=45.8 Q ss_pred CCCCCCCCCEEEECCCCCCEEEEECCCCCCCHHCCCHH-HHHHHHHHHHHHHHCCCCCHHHHHHHCCCCCCCCCCHHHHH Q ss_conf 00013576314203777622797304532100001001-13567897776410278510223221024566686213469 Q gi|254780970|r 430 IQLPGGDAGVIRVEGHETKALAFSSDVTPRYVKADPFE-GTKQAVAECWRNIIATGAKPLAITDNLNFGNPEKEEIMGQF 508 (737) Q Consensus 430 v~~Pg~DaaVv~~~~~~~~gia~s~g~~p~~~~~dP~~-ga~~AV~Eal~Nl~a~Ga~Plait~~lnf~~P~~~e~~~ql 508 (737) +..||+.+-|..+.+.....++...|..|..+.+++.. +-+. =.+.+...+.- +.-++. +| +|.+|...--- T Consensus 109 l~~pGDeVlip~P~Y~~y~~~~~~~gg~~v~v~l~~~~~~f~~-d~~~l~~~i~~--ktk~i~--ln--~PnNPTGav~~ 181 (393) T COG0436 109 LLNPGDEVLIPDPGYPSYEAAVKLAGGKPVPVPLDEEENGFKP-DLEDLEAAITP--KTKAII--LN--SPNNPTGAVYS 181 (393) T ss_pred HCCCCCEEEECCCCCCCHHHHHHHCCCEEEEECCCCCCCCCCC-CHHHHHHHCCC--CCEEEE--EE--CCCCCCCCCCC T ss_conf 7189988998389982559999877997987216666558747-99999864774--645999--97--99996677889 Q ss_pred HHHHHHHHHHHHHCCCCCCCC Q ss_conf 999999985221028866544 Q gi|254780970|r 509 VHSVKGIREACQILDFPIVSG 529 (737) Q Consensus 509 ~~av~gl~d~c~~lgipivgG 529 (737) .+.++.+.+.|++.++-+++- T Consensus 182 ~~~l~~i~~~a~~~~i~ii~D 202 (393) T COG0436 182 KEELKAIVELAREHDIIIISD 202 (393) T ss_pred HHHHHHHHHHHHHCCEEEEEE T ss_conf 999999999999879299996 No 197 >PRK08960 hypothetical protein; Provisional Probab=26.79 E-value=25 Score=14.40 Aligned_cols=32 Identities=9% Similarity=-0.017 Sum_probs=16.5 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 66686213469999999985221028866544 Q gi|254780970|r 498 NPEKEEIMGQFVHSVKGIREACQILDFPIVSG 529 (737) Q Consensus 498 ~P~~~e~~~ql~~av~gl~d~c~~lgipivgG 529 (737) ||.+|-..---.+-++.+.++|++.++.|++- T Consensus 173 nP~NPTG~v~s~~~l~~l~~~a~~~~~~vI~D 204 (387) T PRK08960 173 SPANPTGTLLSRDELAALSQALKARGGHLVVD 204 (387) T ss_pred CCCCCCCCCCCHHHHHHHHHHHHHCCCEEEEE T ss_conf 89898863146899999999876239779984 No 198 >COG2759 MIS1 Formyltetrahydrofolate synthetase [Nucleotide transport and metabolism] Probab=26.70 E-value=37 Score=13.18 Aligned_cols=46 Identities=28% Similarity=0.377 Sum_probs=31.7 Q ss_pred CCCHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCCCCCCCHHHHHHHHHC--CCCEE Q ss_conf 66214788998619998999999999578971889999853300110744588999751206--97489 Q gi|254780970|r 2 SKPSPILEILTAHGLTKDECDQIIHILKRQPTLTEIGIISAMWNEHCSYKSSKKWLRTLPTT--GKHVI 68 (737) Q Consensus 2 ~~~~~~~e~~~~~gL~~~e~~~i~~~l~R~Pt~~El~~~~~~wSEHC~~k~~~~~l~~~~~~--~~~vi 68 (737) +++.|+.|+.+++||+.+|++..= |-.-|.+-..++.++.. ++.++ T Consensus 9 ~~~k~I~~ia~k~Gl~~d~lelYG---------------------~ykAKi~~~~~~~l~~k~~gKlIL 56 (554) T COG2759 9 ATMKPIEEIAEKLGLSADDLELYG---------------------HYKAKISLEVIKRLKNKPDGKLIL 56 (554) T ss_pred CCCCCHHHHHHHCCCCHHHHHHCC---------------------CHHHHCCHHHHHHHCCCCCCEEEE T ss_conf 553569999988099877765424---------------------034315899997531488860899 No 199 >cd01899 Ygr210 Ygr210 subfamily. Ygr210 is a member of Obg-like family and present in archaea and fungi. They are characterized by a distinct glycine-rich motif immediately following the Walker B motif. The Ygr210 and YyaF/YchF subfamilies appear to form one major branch of the Obg-like family. Among eukaryotes, the Ygr210 subfamily is represented only in fungi. These fungal proteins form a tight cluster with their archaeal orthologs, which suggests the possibility of horizontal transfer from archaea to fungi. Probab=26.67 E-value=14 Score=16.08 Aligned_cols=16 Identities=38% Similarity=0.623 Sum_probs=11.6 Q ss_pred EECCCCCC-CCCCCCCC Q ss_conf 95178877-76588887 Q gi|254780970|r 89 KMESHNHP-SYIEPYQG 104 (737) Q Consensus 89 k~EtHNhP-s~i~P~~G 104 (737) .+|+.|+| |.|+|--| T Consensus 22 ~~~~anyPF~TIePN~G 38 (318) T cd01899 22 DVEIANYPFTTIDPNVG 38 (318) T ss_pred CCCCCCCCCCCCCCCCC T ss_conf 98512799667677416 No 200 >PRK10578 hypothetical protein; Provisional Probab=26.18 E-value=4.3 Score=19.61 Aligned_cols=20 Identities=40% Similarity=0.630 Sum_probs=10.7 Q ss_pred CCCCCC-CCCCCCCCCCCEEEE Q ss_conf 210558-102257677874368 Q gi|254780970|r 105 AATGVG-GILRDIFTMGARPVA 125 (737) Q Consensus 105 AaTg~g-G~iRDi~~~Ga~p~a 125 (737) ..|++| |.+||++- |..|.+ T Consensus 37 ~vTa~GGGtlRDlLL-~~~P~~ 57 (207) T PRK10578 37 VVTAVGGGTIRDMAL-DNGPVF 57 (207) T ss_pred HHHHCCCHHHHHHHC-CCCCCE T ss_conf 999524337765523-788985 No 201 >cd05383 SCP_CRISP SCP_CRISP: SCP-like extracellular protein domain, CRISP-like sub-family. The wider family of SCP containing proteins includes plant pathogenesis-related protein 1 (PR-1), CRISPs, mammalian cysteine-rich secretory proteins, which combine SCP with a C-terminal cysteine rich domain, and allergen 5 from vespid venom. Involvement of CRISP in response to pathogens, fertilization, and sperm maturation have been proposed. One member, Tex31 from the venom duct of Conus textile, has been shown to possess proteolytic activity sensitive to serine protease inhibitors. SCP has also been proposed to be a Ca++ chelating serine protease. The Ca++-chelating function would fit with various signaling processes that members of this family, such as the CRISPs, are involved in, and is supported by sequence and structural evidence of a conserved pocket containing two histidines and a glutamate. It also may explain how helothermine, a toxic peptide secreted by the beaded lizard, blocks Ca++ t Probab=26.06 E-value=35 Score=13.34 Aligned_cols=22 Identities=18% Similarity=0.344 Sum_probs=17.4 Q ss_pred HHHHHHHHHHHCCCCCCCCCHH Q ss_conf 8899998533001107445889 Q gi|254780970|r 34 LTEIGIISAMWNEHCSYKSSKK 55 (737) Q Consensus 34 ~~El~~~~~~wSEHC~~k~~~~ 55 (737) |.||+-.||.|..+|..+.+.. T Consensus 30 d~~La~~A~~~A~~C~~~~~~~ 51 (138) T cd05383 30 NEEAAQNAKKWANTCNLTHSPP 51 (138) T ss_pred CHHHHHHHHHHHHHCCCCCCCC T ss_conf 8999999999997475557970 No 202 >pfam07002 Copine Copine. This family represents a conserved region approximately 180 residues long within eukaryotic copines. Copines are Ca(2+)-dependent phospholipid-binding proteins that are thought to be involved in membrane-trafficking, and may also be involved in cell division and growth. Probab=25.90 E-value=16 Score=15.77 Aligned_cols=15 Identities=27% Similarity=0.532 Sum_probs=6.2 Q ss_pred CCHHHHHHHHHHHHH Q ss_conf 100113567897776 Q gi|254780970|r 464 DPFEGTKQAVAECWR 478 (737) Q Consensus 464 dP~~ga~~AV~Eal~ 478 (737) |||.-|..+|.+.+. T Consensus 11 N~Y~~AI~~vg~il~ 25 (145) T pfam07002 11 NPYEQAIRIVGEILQ 25 (145) T ss_pred CHHHHHHHHHHHHHH T ss_conf 899999999999987 No 203 >PRK07543 consensus Probab=25.85 E-value=33 Score=13.55 Aligned_cols=19 Identities=11% Similarity=0.191 Sum_probs=7.7 Q ss_pred CCCCCCCCCCHHHHHHHHH Q ss_conf 4578756863888998887 Q gi|254780970|r 130 LRFGAIHHPKTKHLLSGVV 148 (737) Q Consensus 130 l~~g~~~~p~~~~~~~~~~ 148 (737) |.+|+|+-|-+..+.+.+. T Consensus 36 l~~G~pdf~~p~~i~~a~~ 54 (400) T PRK07543 36 LGAGEPDFDTPDNIKQAAI 54 (400) T ss_pred CCCCCCCCCCCHHHHHHHH T ss_conf 9997999997999999999 No 204 >PRK11032 hypothetical protein; Provisional Probab=25.78 E-value=39 Score=13.07 Aligned_cols=17 Identities=41% Similarity=0.620 Sum_probs=11.2 Q ss_pred CCHHHHHHHHHHHCCCC Q ss_conf 99899999999957897 Q gi|254780970|r 16 LTKDECDQIIHILKRQP 32 (737) Q Consensus 16 L~~~e~~~i~~~l~R~P 32 (737) ||.+|++.|..+|+|+- T Consensus 43 LT~eE~~li~~ylkRDl 59 (160) T PRK11032 43 LTRDEVDLITRAVRRDL 59 (160) T ss_pred HCHHHHHHHHHHHHHHH T ss_conf 07999999999999999 No 205 >PRK08644 thiamine biosynthesis protein ThiF; Provisional Probab=25.32 E-value=39 Score=13.02 Aligned_cols=22 Identities=23% Similarity=0.259 Sum_probs=19.0 Q ss_pred CCCCCHHHHHHHHHCCCCCCEE Q ss_conf 7752011122332014775289 Q gi|254780970|r 267 MGAAGLTCSAIEMGNQGNLGIT 288 (737) Q Consensus 267 ~gaGGl~~a~~Ema~~~~~G~~ 288 (737) +|+|||.+.+.+...++|+|-- T Consensus 33 vG~GGLG~~~a~~La~aGVG~i 54 (209) T PRK08644 33 AGAGGLGSNIAVALARSGVGNL 54 (209) T ss_pred ECCCHHHHHHHHHHHHHCCCEE T ss_conf 8887889999999999389818 No 206 >COG1066 Sms Predicted ATP-dependent serine protease [Posttranslational modification, protein turnover, chaperones] Probab=24.79 E-value=40 Score=12.95 Aligned_cols=29 Identities=14% Similarity=0.187 Sum_probs=25.6 Q ss_pred CCHHHHHHHHHHHCCEEECCCCEEECCCC Q ss_conf 67588999888709713112212104431 Q gi|254780970|r 322 EKQHKAQEILNKWGLHFSIIGITTNDKLF 350 (737) Q Consensus 322 ~~~~~~~~i~~~~~~~~~~iG~vt~~~~~ 350 (737) +-..++.++++..++..-.+|+||+++.+ T Consensus 197 e~t~~L~~~AK~~~i~~fiVGHVTKeG~I 225 (456) T COG1066 197 EVAAELMRLAKTKNIAIFIVGHVTKEGAI 225 (456) T ss_pred HHHHHHHHHHHHCCCEEEEEEEECCCCCC T ss_conf 99999999998759739999888246632 No 207 >PRK08068 transaminase; Reviewed Probab=24.33 E-value=21 Score=14.94 Aligned_cols=18 Identities=6% Similarity=-0.021 Sum_probs=8.4 Q ss_pred HHHHHHHHHHHCCCCCCC Q ss_conf 999998522102886654 Q gi|254780970|r 511 SVKGIREACQILDFPIVS 528 (737) Q Consensus 511 av~gl~d~c~~lgipivg 528 (737) -++.+.+.|++.++.|++ T Consensus 188 ~l~~l~~la~~~~i~vis 205 (389) T PRK08068 188 FFEETVAFAKKHNIGVVH 205 (389) T ss_pred HHHHHHHHHHCCCEEEEE T ss_conf 999999987107779996 No 208 >PRK12319 acetyl-CoA carboxylase subunit alpha; Provisional Probab=24.20 E-value=41 Score=12.88 Aligned_cols=29 Identities=24% Similarity=0.494 Sum_probs=18.3 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 2456668621346999999998522102886654 Q gi|254780970|r 495 NFGNPEKEEIMGQFVHSVKGIREACQILDFPIVS 528 (737) Q Consensus 495 nf~~P~~~e~~~ql~~av~gl~d~c~~lgipivg 528 (737) |||.| .||.|. .++ .+.+.|.+|++||++ T Consensus 75 nfGm~-~PeGYR---KA~-Rlm~lAekf~~Pvit 103 (256) T PRK12319 75 NFGQP-HPEGYR---KAL-RLMKQAEKFGRPVVT 103 (256) T ss_pred CCCCC-CHHHHH---HHH-HHHHHHHHHCCCEEE T ss_conf 69999-817999---999-999999980998799 No 209 >smart00198 SCP SCP / Tpx-1 / Ag5 / PR-1 / Sc7 family of extracellular domains. Human glioma pathogenesis-related protein GliPR and the plant pathogenesis-related protein represent functional links between plant defense systems and human immune system. This family has no known function. Probab=24.18 E-value=38 Score=13.13 Aligned_cols=17 Identities=18% Similarity=0.599 Sum_probs=7.4 Q ss_pred HHHHHHHHHHHCCCCCC Q ss_conf 88999985330011074 Q gi|254780970|r 34 LTEIGIISAMWNEHCSY 50 (737) Q Consensus 34 ~~El~~~~~~wSEHC~~ 50 (737) |.||+-.++.|.+.|.. T Consensus 36 d~~La~~A~~~a~~C~~ 52 (144) T smart00198 36 DCELASSAQNWANQCPF 52 (144) T ss_pred CHHHHHHHHHHHHHCCC T ss_conf 89999999999974865 No 210 >TIGR03600 phage_DnaB phage replicative helicase, DnaB family, HK022 subfamily. Members of this family are phage (or prophage-region) homologs of the bacterial homohexameric replicative helicase DnaB. Some phage may rely on host DnaB, while others encode their own verions. This model describes the largest phage-specific clade among the close homologs of DnaB, but there are, or course, other DnaB homologs from phage that fall outside the scope of this model. Probab=24.06 E-value=42 Score=12.86 Aligned_cols=57 Identities=18% Similarity=0.262 Sum_probs=37.0 Q ss_pred HHHHHHHHHCCC-CCHHHHHHHCCCCCCCCC-CHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 897776410278-510223221024566686-213469999999985221028866544 Q gi|254780970|r 473 VAECWRNIIATG-AKPLAITDNLNFGNPEKE-EIMGQFVHSVKGIREACQILDFPIVSG 529 (737) Q Consensus 473 V~Eal~Nl~a~G-a~Plait~~lnf~~P~~~-e~~~ql~~av~gl~d~c~~lgipivgG 529 (737) +-..+|.+.... .-.+-+.|.+....|.+. ...-++.+..++++..|++|+|||+.- T Consensus 292 ir~~~r~~~~~~~~l~livIDYLqLi~~~~~~~r~~ei~~Isr~LK~lAkel~ipVi~l 350 (421) T TIGR03600 292 IRSIARRIKRKKGGLDLIVVDYIQLMAPTRGRDRNEELGGISRGLKALAKELDVPVVLL 350 (421) T ss_pred HHHHHHHHHHHCCCCCEEEECHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEE T ss_conf 99999999986289869997578653788888889999999999999999979978997 No 211 >PRK08069 consensus Probab=24.04 E-value=27 Score=14.19 Aligned_cols=15 Identities=13% Similarity=0.045 Sum_probs=7.8 Q ss_pred CCCHHHHHHHHHHHC Q ss_conf 467588999888709 Q gi|254780970|r 321 PEKQHKAQEILNKWG 335 (737) Q Consensus 321 ~~~~~~~~~i~~~~~ 335 (737) ++.++++.++|++++ T Consensus 182 ~e~l~~i~~la~~~~ 196 (390) T PRK08069 182 KEELAALGEVCLEHD 196 (390) T ss_pred CHHHHHHHHHHHHCC T ss_conf 068999999987563 No 212 >pfam01041 DegT_DnrJ_EryC1 DegT/DnrJ/EryC1/StrS aminotransferase family. The members of this family are probably all pyridoxal-phosphate-dependent aminotransferase enzymes with a variety of molecular functions. The family includes StsA, StsC, and StsS. The aminotransferase activity was demonstrated for purified StsC protein as the L-glutamine:scyllo-inosose aminotransferase EC:2.6.1.50, which catalyses the first amino transfer in the biosynthesis of the streptidine subunit of streptomycin. Probab=23.99 E-value=31 Score=13.76 Aligned_cols=36 Identities=14% Similarity=0.158 Sum_probs=23.2 Q ss_pred CCHHHHHHHHHHHCCCC--C-HHHHHHHHHHHCCCCCCC Q ss_conf 99899999999957897--1-889999853300110744 Q gi|254780970|r 16 LTKDECDQIIHILKRQP--T-LTEIGIISAMWNEHCSYK 51 (737) Q Consensus 16 L~~~e~~~i~~~l~R~P--t-~~El~~~~~~wSEHC~~k 51 (737) ++.+|++++.+.|.-.. | --+..-|.+.+|+.+..| T Consensus 3 ~~~~~~~~v~~~l~s~~~~~~G~~v~~fE~~~~~~~g~k 41 (363) T pfam01041 3 IDEEELAAVTEVLRSGWLTTGGPEVREFEKEFAAYLGVK 41 (363) T ss_pred CCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHCCC T ss_conf 998999999999973981479989999999999997849 No 213 >cd02975 PfPDO_like_N Pyrococcus furiosus protein disulfide oxidoreductase (PfPDO)-like family, N-terminal TRX-fold subdomain; composed of proteins with similarity to PfPDO, a redox active thermostable protein believed to be the archaeal counterpart of bacterial DsbA and eukaryotic protein disulfide isomerase (PDI), which are both involved in oxidative protein folding. PfPDO contains two redox active CXXC motifs in two contiguous TRX-fold subdomains. The active site in the N-terminal TRX-fold subdomain is required for isomerase but not for reductase activity of PfPDO. The exclusive presence of PfPDO-like proteins in extremophiles may suggest that they have a special role in adaptation to extreme conditions. Probab=23.96 E-value=16 Score=15.75 Aligned_cols=50 Identities=18% Similarity=0.397 Sum_probs=30.8 Q ss_pred CCHHHHHHHHH-HHCCCCCHHHHHHHHHHHCCCCCCCC-CHHHHHHHHHCCCCE Q ss_conf 99899999999-95789718899998533001107445-889997512069748 Q gi|254780970|r 16 LTKDECDQIIH-ILKRQPTLTEIGIISAMWNEHCSYKS-SKKWLRTLPTTGKHV 67 (737) Q Consensus 16 L~~~e~~~i~~-~l~R~Pt~~El~~~~~~wSEHC~~k~-~~~~l~~~~~~~~~v 67 (737) |++++-++|++ +|..--+.+|+-.|.+ .+||.|-- .+.+++.+..-.+.+ T Consensus 3 l~~~~r~~l~e~fl~~l~~pV~ivvF~~--~~~c~yc~~t~~ll~ela~lsdKi 54 (113) T cd02975 3 LSDEDRKALKEEFFKEMKNPVDLVVFSS--KEGCQYCEVTKQLLEELSELSDKL 54 (113) T ss_pred CCHHHHHHHHHHHHHHCCCCEEEEEEEC--CCCCCCHHHHHHHHHHHHHHCCCE T ss_conf 8888999999999985799869999957--988711699999999999748853 No 214 >cd02434 Nodulin-21_like_3 Nodulin-21 and CCC1-related protein family. Nodulin-21_like_3: This is a family of proteins closely related to nodulin-21, a plant nodule-specific protein that may be involved in symbiotic nitrogen fixation. This family is also related to CCC1, a yeast vacuole transmembrane protein that functions as an iron and manganese transporter. Probab=23.92 E-value=42 Score=12.84 Aligned_cols=18 Identities=22% Similarity=0.265 Sum_probs=9.0 Q ss_pred CHHHHHHHHHHHCCEEEC Q ss_conf 758899988870971311 Q gi|254780970|r 323 KQHKAQEILNKWGLHFSI 340 (737) Q Consensus 323 ~~~~~~~i~~~~~~~~~~ 340 (737) ..+++.+++++.|++-.. T Consensus 82 E~~El~~iy~~~G~~~~~ 99 (225) T cd02434 82 EKSEMVEIYSLKGLSEEV 99 (225) T ss_pred HHHHHHHHHHHCCCCHHH T ss_conf 999999999877999899 No 215 >TIGR02980 SigBFG RNA polymerase sigma-70 factor, sigma-B/F/G subfamily; InterPro: IPR014322 The bacterial core RNA polymerase complex, which consists of five subunits, is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme . RNA polymerase recruits alternative sigma factors as a means of switching on specific regulons. Most bacteria express a multiplicity of sigma factors. Two of these factors, sigma-70 (gene rpoD), generally known as the major or primary sigma factor, and sigma-54 (gene rpoN or ntrA) direct the transcription of a wide variety of genes. The other sigma factors, known as alternative sigma factors, are required for the transcription of specific subsets of genes. With regard to sequence similarity, sigma factors can be grouped into two classes, the sigma-54 and sigma-70 families. Sequence alignments of the sigma70 family members reveal four conserved regions that can be further divided into subregions eg. sub-region 2.2, which may be involved in the binding of the sigma factor to the core RNA polymerase; and sub-region 4.2, which seems to harbor a DNA-binding 'helix-turn-helix' motif involved in binding the conserved -35 region of promoters recognized by the major sigma factors , . This group of similar sigma-70 factors includes clades found in Bacilli (including the sporulation factors SigF:IPR014236 from INTERPRO and SigG:IPR014212 from INTERPRO as well as SigB:IPR014288 from INTERPRO), and the high GC Gram-positive bacteria (Actinobacteria) where a variable number of them are found depending on the lineage.; GO: 0003677 DNA binding, 0003700 transcription factor activity, 0016987 sigma factor activity, 0006352 transcription initiation, 0006355 regulation of transcription DNA-dependent. Probab=23.91 E-value=42 Score=12.84 Aligned_cols=26 Identities=15% Similarity=0.018 Sum_probs=13.2 Q ss_pred HHHCCCEEEEEECC---CCHHHHHHHHHH Q ss_conf 99959806994648---657999999998 Q gi|254780970|r 617 MINAKKITACHDIS---TGGLIITLAEMT 642 (737) Q Consensus 617 li~~g~I~S~HDiS---dGGL~~aL~EMa 642 (737) +.++-..+|.|+.- ++|=-.+|+.+. T Consensus 130 A~~~Y~~~Sld~~~~a~~dg~~~~L~D~l 158 (229) T TIGR02980 130 AGNSYSALSLDATIEADDDGDPIALLDTL 158 (229) T ss_pred HHHCCCCCCCCCCCCCCCCCCHHHHHHHH T ss_conf 65417861120004558888424566444 No 216 >pfam11198 DUF2857 Protein of unknown function (DUF2857). This is a bacterial family of uncharacterized proteins. Probab=23.90 E-value=42 Score=12.84 Aligned_cols=26 Identities=19% Similarity=0.111 Sum_probs=11.4 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHCC Q ss_conf 89996479899999999999999959 Q gi|254780970|r 596 IGPPPKVDCHLEKNHGFFVLSMINAK 621 (737) Q Consensus 596 ~g~~p~~d~~~~k~~~~~v~~li~~g 621 (737) .|.+|..+.+....++..-+++.++. T Consensus 117 ~GR~~~~~ee~~~~lW~~W~~~~~~~ 142 (181) T pfam11198 117 KGRPPALDEEQEAALWRAWQQLRSKP 142 (181) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHC T ss_conf 99999998688999999999989864 No 217 >pfam03796 DnaB_C DnaB-like helicase C terminal domain. The hexameric helicase DnaB unwinds the DNA duplex at the Escherichia coli chromosome replication fork. Although the mechanism by which DnaB both couples ATP hydrolysis to translocation along DNA and denatures the duplex is unknown, a change in the quaternary structure of the protein involving dimerization of the N-terminal domain has been observed and may occur during the enzymatic cycle. This C-terminal domain contains an ATP-binding site and is therefore probably the site of ATP hydrolysis. Probab=23.88 E-value=42 Score=12.84 Aligned_cols=24 Identities=8% Similarity=0.317 Sum_probs=15.0 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 469999999985221028866544 Q gi|254780970|r 506 GQFVHSVKGIREACQILDFPIVSG 529 (737) Q Consensus 506 ~ql~~av~gl~d~c~~lgipivgG 529 (737) .++.+..+.++..|+++++||+.. T Consensus 154 ~~v~~i~~~Lk~lA~e~~i~ii~~ 177 (186) T pfam03796 154 QEISEISRSLKALAKELNIPVIAL 177 (186) T ss_pred HHHHHHHHHHHHHHHHHCCCEEEE T ss_conf 999999999999999979918997 No 218 >PRK07309 aromatic amino acid aminotransferase; Validated Probab=23.85 E-value=21 Score=14.91 Aligned_cols=31 Identities=16% Similarity=0.154 Sum_probs=19.0 Q ss_pred CCCCCCH-HHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 6668621-3469999999985221028866544 Q gi|254780970|r 498 NPEKEEI-MGQFVHSVKGIREACQILDFPIVSG 529 (737) Q Consensus 498 ~P~~~e~-~~ql~~av~gl~d~c~~lgipivgG 529 (737) +|.+|-. .| -.+-++.+.+.|++.++.|++- T Consensus 174 ~P~NPtG~v~-s~e~l~~i~~la~~~~v~iisD 205 (390) T PRK07309 174 YPANPTGVTY-SREQIKALAEVLKKYDIFVISD 205 (390) T ss_pred CCCCCCCCCC-CHHHHHHHHHHHHHCCEEEEEE T ss_conf 9989988878-9999999999997689899950 No 219 >COG2344 AT-rich DNA-binding protein [General function prediction only] Probab=23.64 E-value=42 Score=12.81 Aligned_cols=85 Identities=18% Similarity=0.196 Sum_probs=51.2 Q ss_pred HHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHCCCCCEEEEECC----------------CCCCCHHHHHHCCCCCCEE Q ss_conf 999999999598069946486579999999984589708998056----------------6689878887298832189 Q gi|254780970|r 611 GFFVLSMINAKKITACHDISTGGLIITLAEMTISSAKGMDIILPI----------------EIEKDPKPFLFGEDQGRYV 674 (737) Q Consensus 611 ~~~v~~li~~g~I~S~HDiSdGGL~~aL~EMa~as~~G~~I~L~~----------------~~~~d~~~~LFsEs~Gr~i 674 (737) ..++.++.......-..=|.-|-|..||+.--|..+.|+.|-.-- ..-.+...++=...-...+ T Consensus 72 ~~ff~~~Lg~~~~tnviiVG~GnlG~All~Y~f~~~~~~~iv~~FDv~~~~VG~~~~~v~V~~~d~le~~v~~~dv~iai 151 (211) T COG2344 72 RDFFDDLLGQDKTTNVIIVGVGNLGRALLNYNFSKKNGMKIVAAFDVDPDKVGTKIGDVPVYDLDDLEKFVKKNDVEIAI 151 (211) T ss_pred HHHHHHHHCCCCCEEEEEECCCHHHHHHHCCCCHHHCCCEEEEEECCCHHHHCCCCCCEEEECHHHHHHHHHHCCCCEEE T ss_conf 99999983877403489990570888986074233269359999617878837524881455268888898861761899 Q ss_pred EEECHHHHHHHHHHHHHCCCC Q ss_conf 999688099899999857984 Q gi|254780970|r 675 VCISPENQDLVMSEANNKNIP 695 (737) Q Consensus 675 i~V~~~~~~~~~~~~~~~gi~ 695 (737) ++|+.++..++.+.+-++|+. T Consensus 152 LtVPa~~AQ~vad~Lv~aGVk 172 (211) T COG2344 152 LTVPAEHAQEVADRLVKAGVK 172 (211) T ss_pred EECCHHHHHHHHHHHHHCCCC T ss_conf 973489899999999983873 No 220 >TIGR02374 nitri_red_nirB nitrite reductase [NAD(P)H], large subunit; InterPro: IPR012744 This entry describes NirB, the large subunit of nitrite reductase [NAD(P)H] (the assimilatory nitrite reductase), which associates with NirD, the small subunit (IPR012748 from INTERPRO). In a few bacteria such as Klebsiella pneumoniae and in fungi, the two regions are fused.; GO: 0008942 nitrite reductase [NAD(P)H] activity, 0050660 FAD binding, 0050661 NADP binding, 0042128 nitrate assimilation. Probab=23.47 E-value=43 Score=12.78 Aligned_cols=14 Identities=29% Similarity=0.394 Sum_probs=7.4 Q ss_pred CCCCCEEECCCCCC Q ss_conf 38980751377665 Q gi|254780970|r 567 HEGDLILMIGNDGC 580 (737) Q Consensus 567 ~~Gd~I~liG~~~~ 580 (737) +.|.-||+-|+.+. T Consensus 711 ~~GW~LYvgGnGG~ 724 (813) T TIGR02374 711 EKGWNLYVGGNGGT 724 (813) T ss_pred CCCCEEEECCCCCC T ss_conf 28746888167887 No 221 >PRK06749 replicative DNA helicase; Provisional Probab=23.43 E-value=43 Score=12.78 Aligned_cols=57 Identities=16% Similarity=0.166 Sum_probs=37.0 Q ss_pred HHHHHHHHHCC-CCCH-HHHHHHCCCCCC--CCC-CHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 89777641027-8510-223221024566--686-213469999999985221028866544 Q gi|254780970|r 473 VAECWRNIIAT-GAKP-LAITDNLNFGNP--EKE-EIMGQFVHSVKGIREACQILDFPIVSG 529 (737) Q Consensus 473 V~Eal~Nl~a~-Ga~P-lait~~lnf~~P--~~~-e~~~ql~~av~gl~d~c~~lgipivgG 529 (737) +.-.+|.+..- |.+. +-+.|.+....+ ... ...-++.+..++++..|++|+|||+.= T Consensus 285 i~~~~r~~~~~~g~~~~livIDYlqLi~~~~~~~~~r~~ev~~isr~lK~lAkel~vpvi~l 346 (428) T PRK06749 285 IWMQTRKLKRKHGDKKVLIIVDYLQLITGDPKHKGNRFQEISEISRKLKLLARELNVCVVAL 346 (428) T ss_pred HHHHHHHHHHHCCCCCEEEEEEHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEE T ss_conf 99999999997499876999767765057877777899999999999999999969989997 No 222 >PRK12414 putative aminotransferase; Provisional Probab=23.07 E-value=18 Score=15.37 Aligned_cols=23 Identities=26% Similarity=0.384 Sum_probs=10.2 Q ss_pred ECCCCCCCCCCHHHHHHHHHHHH Q ss_conf 04578756863888998887458 Q gi|254780970|r 129 SLRFGAIHHPKTKHLLSGVVAGI 151 (737) Q Consensus 129 ~l~~g~~~~p~~~~~~~~~~~G~ 151 (737) .|..|+|+-|-+..+.+.+.+-+ T Consensus 33 ~l~~G~Pd~~~p~~i~~a~~~a~ 55 (384) T PRK12414 33 NLSQGAPNFAPDPALVEGVARAM 55 (384) T ss_pred ECCCCCCCCCCCHHHHHHHHHHH T ss_conf 81898999999899999999999 No 223 >PRK06290 aspartate aminotransferase; Provisional Probab=23.00 E-value=16 Score=15.66 Aligned_cols=32 Identities=13% Similarity=-0.027 Sum_probs=15.6 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 66686213469999999985221028866544 Q gi|254780970|r 498 NPEKEEIMGQFVHSVKGIREACQILDFPIVSG 529 (737) Q Consensus 498 ~P~~~e~~~ql~~av~gl~d~c~~lgipivgG 529 (737) +|.+|...---.+-++.+.+.|++.++.|++- T Consensus 187 sP~NPTG~v~s~e~l~~i~~~a~~~~i~vIsD 218 (411) T PRK06290 187 YPNNPTGAVATKEFYKEVVDFAFENDIIVVQD 218 (411) T ss_pred CCCCCCCEEECHHHHHHHHHHHHHCCCEEECH T ss_conf 99999861424899999999986428776423 No 224 >TIGR01709 typeII_sec_gspL general secretion pathway protein L; InterPro: IPR007812 This family consists of general secretion pathway protein L sequences from several Gram-negative bacteria. The general secretion pathway of Gram-negative bacteria is responsible for extracellular secretion of a number of different proteins, including proteases and toxins. This pathway supports secretion of proteins across the cell envelope in two distinct steps, in which the second step, involving translocation through the outer membrane, is assisted by at least 13 different gene products. GspL is predicted to contain a large cytoplasmic domain and has been shown to interact with the autophosphorylating cytoplasmic membrane protein GspE. It is thought that the tri-molecular complex of GspL, GspE and GspM might be involved in regulating the opening and closing of the secretion pore and/or transducing energy to the site of outer membrane translocation .; GO: 0008565 protein transporter activity, 0015031 protein transport, 0009276 1-2nm peptidoglycan-based cell wall, 0016021 integral to membrane. Probab=22.99 E-value=44 Score=12.72 Aligned_cols=33 Identities=18% Similarity=0.274 Sum_probs=28.5 Q ss_pred CCCCCEEEEEEECCCHHHHHHHHHHHCCEEECC Q ss_conf 788850899995467588999888709713112 Q gi|254780970|r 309 SESQERMLMILNPEKQHKAQEILNKWGLHFSII 341 (737) Q Consensus 309 sEsQeR~~~~v~~~~~~~~~~i~~~~~~~~~~i 341 (737) .|..+|.|++|..+.++.+...|+..|+....| T Consensus 102 ~~~~~r~v~~v~r~~l~~wl~~l~~~G~~~~~v 134 (433) T TIGR01709 102 AEGATRVVAVVDREWLQAWLDLLTEAGLAPERV 134 (433) T ss_pred CCCCEEEEEEEHHHHHHHHHHHHHHCCCCCEEE T ss_conf 246507887522789999999999758980105 No 225 >TIGR00028 Mtu_PIN_fam Mycobacterium tuberculosis PIN domain family; InterPro: IPR006226 This is a family of hypothetical proteins with no known function that is mainly restricted to Mycobacterium tuberculosis. No similar sequences have been found outside of Mycobacteria.. Probab=22.82 E-value=44 Score=12.70 Aligned_cols=44 Identities=20% Similarity=0.259 Sum_probs=22.4 Q ss_pred HHHHHCCCCCCCCCHHHHHHHHHCCCCEEECCCCCEEEEEECCC Q ss_conf 85330011074458899975120697489933477368983698 Q gi|254780970|r 40 ISAMWNEHCSYKSSKKWLRTLPTTGKHVIQGPGENAGVVDIGGG 83 (737) Q Consensus 40 ~~~~wSEHC~~k~~~~~l~~~~~~~~~vi~~~~~~a~vi~~~~~ 83 (737) ++..|++|--|+..|.||..+-+.+.++...+-=-+|.|++--+ T Consensus 18 laa~~~~h~~H~~a~~Wl~~~~~~~~~~~~~~~~~~gf~R~~tn 61 (155) T TIGR00028 18 LAAVNRDHVHHDAARAWLDRLLAGGDWATCPWLTLAGFVRLLTN 61 (155) T ss_pred HHHHCCCCCCHHHHHHHHHHHHCCCCEEEECCHHHHHEEEEEEC T ss_conf 98752887174889999999836994898454223000633551 No 226 >PRK06225 aspartate aminotransferase; Provisional Probab=22.81 E-value=22 Score=14.73 Aligned_cols=33 Identities=15% Similarity=-0.013 Sum_probs=22.7 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 566686213469999999985221028866544 Q gi|254780970|r 497 GNPEKEEIMGQFVHSVKGIREACQILDFPIVSG 529 (737) Q Consensus 497 ~~P~~~e~~~ql~~av~gl~d~c~~lgipivgG 529 (737) .||.+|-..---.+-++.+.+.|++.++.++.- T Consensus 159 ~nPnNPTG~v~s~~~l~~l~~~a~~~~i~ii~D 191 (375) T PRK06225 159 IDPLNPLGSSYTREEIKEFAEIARDNDAYLVHD 191 (375) T ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCH T ss_conf 299899888788999999999998627700002 No 227 >PRK06375 consensus Probab=22.69 E-value=24 Score=14.50 Aligned_cols=32 Identities=16% Similarity=0.056 Sum_probs=17.7 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 66686213469999999985221028866544 Q gi|254780970|r 498 NPEKEEIMGQFVHSVKGIREACQILDFPIVSG 529 (737) Q Consensus 498 ~P~~~e~~~ql~~av~gl~d~c~~lgipivgG 529 (737) ||.+|-..---.+-++.+.+.|++.++.|++- T Consensus 167 nP~NPTG~v~s~e~l~~i~~la~~~~i~iIsD 198 (381) T PRK06375 167 NPVNPTGKVLSEKEIRELVDFALENNLYIVSD 198 (381) T ss_pred CCCCCCCCCCCHHHHHHHHHHHHHCEEEEEEC T ss_conf 99798899999999999999887565899813 No 228 >KOG2584 consensus Probab=22.67 E-value=44 Score=12.68 Aligned_cols=25 Identities=8% Similarity=0.174 Sum_probs=14.3 Q ss_pred CHHHHHHHHHHHCCEEECCCCEEEC Q ss_conf 7588999888709713112212104 Q gi|254780970|r 323 KQHKAQEILNKWGLHFSIIGITTND 347 (737) Q Consensus 323 ~~~~~~~i~~~~~~~~~~iG~vt~~ 347 (737) -......+++..+||..++..+..+ T Consensus 234 A~~rai~ia~~~ncPlyvvhVmsks 258 (522) T KOG2584 234 ATNRAITIARQANCPLYVVHVMSKS 258 (522) T ss_pred HHHHHHHHHHHCCCCCCEEEEEEHH T ss_conf 9999999988628982058885263 No 229 >PRK00199 ihfB integration host factor subunit beta; Reviewed Probab=22.50 E-value=44 Score=12.66 Aligned_cols=17 Identities=18% Similarity=0.188 Sum_probs=5.5 Q ss_pred HHHHHHHHHCC-CCHHHH Q ss_conf 14788998619-998999 Q gi|254780970|r 5 SPILEILTAHG-LTKDEC 21 (737) Q Consensus 5 ~~~~e~~~~~g-L~~~e~ 21 (737) +-+..+..++| |+..+. T Consensus 5 eli~~ia~~~~~l~~~~~ 22 (91) T PRK00199 5 ELIERLATRQPHLPAKDV 22 (91) T ss_pred HHHHHHHHHCCCCCHHHH T ss_conf 999999987699899999 No 230 >COG1574 Predicted metal-dependent hydrolase with the TIM-barrel fold [General function prediction only] Probab=22.46 E-value=45 Score=12.65 Aligned_cols=26 Identities=19% Similarity=0.166 Sum_probs=16.9 Q ss_pred CCEEEEEECCCEEEEEEECCCCCCCC Q ss_conf 77368983698399999517887776 Q gi|254780970|r 73 ENAGVVDIGGGDCVVFKMESHNHPSY 98 (737) Q Consensus 73 ~~a~vi~~~~~~~~~~k~EtHNhPs~ 98 (737) .+.-+||....+.+=-=+++|+|... T Consensus 49 ~~t~viDL~Gk~v~PGfvDaH~Hl~~ 74 (535) T COG1574 49 PATEVIDLKGKFVLPGFVDAHLHLIS 74 (535) T ss_pred CCCEEEECCCCEECCCCCHHHHHHHH T ss_conf 88639968998754661004677987 No 231 >pfam01981 PTH2 Peptidyl-tRNA hydrolase PTH2. Peptidyl-tRNA hydrolases are enzymes that release tRNAs from peptidyl-tRNA during translation. Probab=22.43 E-value=45 Score=12.65 Aligned_cols=53 Identities=17% Similarity=0.200 Sum_probs=29.5 Q ss_pred HHHHHCCCCCEEEEEEEC-CCHHHHHHHHHHHCCEEECCCCEEECCCCCCCCCCCCC Q ss_conf 898617888508999954-67588999888709713112212104431000258644 Q gi|254780970|r 304 YEMMLSESQERMLMILNP-EKQHKAQEILNKWGLHFSIIGITTNDKLFRVIHRGEEV 359 (737) Q Consensus 304 ~ei~~sEsQeR~~~~v~~-~~~~~~~~i~~~~~~~~~~iG~vt~~~~~~v~~~g~~v 359 (737) ++-|..+.|-..++-++. +.+.++.+.+++.++++.+ |.+.|+-.+--+..++ T Consensus 42 ~~~W~~~G~~KIvlkv~de~~L~~l~~~a~~~gl~~~~---V~DAG~Tei~pgt~Tv 95 (116) T pfam01981 42 LKAWEREGQKKIVLKVNSEEELLELKEKAKSLGLPTSL---IQDAGRTQIAPGTVTV 95 (116) T ss_pred HHHHHHCCCCEEEEECCCHHHHHHHHHHHHHCCCCEEE---EECCCCEECCCCCEEE T ss_conf 99999779967999729999999999999984996699---9868731459998589 No 232 >cd06218 DHOD_e_trans FAD/NAD binding domain in the electron transfer subunit of dihydroorotate dehydrogenase. Dihydroorotate dehydrogenases (DHODs) catalyze the only redox reaction in pyrimidine de novo biosynthesis. They catalyze the oxidation of (S)-dihydroorotate to orotate coupled with the reduction of NAD+. In L. lactis, DHOD B (encoded by pyrDa) is co-expressed with pyrK and both gene products are required for full activity, as well as 3 cofactors: FMN, FAD, and an [2Fe-2S] cluster. Probab=22.43 E-value=45 Score=12.65 Aligned_cols=38 Identities=18% Similarity=0.093 Sum_probs=20.5 Q ss_pred CCCCCCCEEEECCCCHHHHHHCCCCCCCCEEEEECCCC Q ss_conf 76642211221121135543112256897799965765 Q gi|254780970|r 174 NNNIIVNTFAAGIAKTNAIFSSKARGIGLPLVYLGAKT 211 (737) Q Consensus 174 ~~~~~v~~~~~Gi~~~~~~~~~~~~~~Gd~ii~~G~~t 211 (737) .++.+..++.+|+.+.-.+.+.-.....+..+++|.++ T Consensus 98 ~~~~llvaGG~GiaPl~~l~~~l~~~~~~v~~i~G~r~ 135 (246) T cd06218 98 DGKVLLVGGGIGIAPLLFLAKQLAERGIKVTVLLGFRS 135 (246) T ss_pred CCCEEEEECCCCHHHHHHHHHHHHHCCCCEEEEEECCC T ss_conf 97299996573154399999999974992899997477 No 233 >pfam03458 UPF0126 UPF0126 domain. Domain always found as pair in bacterial membrane proteins of unknown function. This domain contains three transmembrane helices. The conserved glycines are suggestive of an ion channel (C. Yeats unpublished obs.). Probab=22.40 E-value=12 Score=16.62 Aligned_cols=24 Identities=46% Similarity=0.803 Sum_probs=14.3 Q ss_pred CCCCCCCCC-CCCCCCCCCCCEEEEE Q ss_conf 887210558-1022576778743687 Q gi|254780970|r 102 YQGAATGVG-GILRDIFTMGARPVAA 126 (737) Q Consensus 102 ~~GAaTg~g-G~iRDi~~~Ga~p~a~ 126 (737) .=|..|++| |.+||+++ |..|... T Consensus 32 ~lg~iTa~gGG~iRDil~-~~~P~~~ 56 (81) T pfam03458 32 VLGVVTAVGGGILRDVLL-GEVPLVL 56 (81) T ss_pred HHHHHHHHCCHHHHHHHC-CCCCHHH T ss_conf 889988611356999991-7898255 No 234 >TIGR03540 DapC_direct LL-diaminopimelate aminotransferase. This clade of the pfam00155 superfamily of aminotransferases includes several which are adjacent to elements of the lysine biosynthesis via diaminopimelate pathway (GenProp0125). Every member of this clade is from a genome which possesses most of the lysine biosynthesis pathway but lacks any of the known aminotransferases, succinylases, desuccinylases, acetylases or deacetylases typical of the acylated versions of this pathway nor do they have the direct, NADPH-dependent enzyme (ddh). Although there is no experimental characterization of any of the sequences in this clade, a direct pathway is known in plants and Chlamydia, so it seems quite reasonable that these enzymes catalyze the same transformation. Probab=22.11 E-value=17 Score=15.57 Aligned_cols=32 Identities=9% Similarity=0.022 Sum_probs=16.2 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 66686213469999999985221028866544 Q gi|254780970|r 498 NPEKEEIMGQFVHSVKGIREACQILDFPIVSG 529 (737) Q Consensus 498 ~P~~~e~~~ql~~av~gl~d~c~~lgipivgG 529 (737) +|.+|-..---.+-++.+.+.|++.++.|++- T Consensus 172 ~P~NPTG~v~s~~~l~~i~~la~~~~i~iIsD 203 (383) T TIGR03540 172 YPNNPTGAVAPLKFFKELVEFAKEYNIIVCHD 203 (383) T ss_pred CCCCCCCCCCCCHHHHCCCCCCCCCCCEEECC T ss_conf 99897070121022220100124554057611 No 235 >TIGR02624 rhamnu_1P_ald rhamnulose-1-phosphate aldolase; InterPro: IPR013447 Proteins in this entry match the enzyme RhaD, rhamnulose-1-phosphate aldolase (4.1.2.19 from EC).; GO: 0008994 rhamnulose-1-phosphate aldolase activity. Probab=21.83 E-value=46 Score=12.57 Aligned_cols=15 Identities=33% Similarity=0.547 Sum_probs=6.9 Q ss_pred CCCCCCCCCEEEECC Q ss_conf 000135763142037 Q gi|254780970|r 430 IQLPGGDAGVIRVEG 444 (737) Q Consensus 430 v~~Pg~DaaVv~~~~ 444 (737) ...|-.|-+|+|++. T Consensus 83 ~~~P~~nLg~~rVs~ 97 (273) T TIGR02624 83 EENPAENLGVLRVSE 97 (273) T ss_pred HCCCCCCEEEEEECC T ss_conf 117014646789815 No 236 >PRK10241 hydroxyacylglutathione hydrolase; Provisional Probab=21.79 E-value=46 Score=12.56 Aligned_cols=45 Identities=11% Similarity=0.318 Sum_probs=26.5 Q ss_pred HHHCCCCCCCCHHHHHHCCCCCEEEEEEECCCHHHHHHHHHHHCCEEEC Q ss_conf 0220466789998986178885089999546758899988870971311 Q gi|254780970|r 292 DKVPTCEEGMTAYEMMLSESQERMLMILNPEKQHKAQEILNKWGLHFSI 340 (737) Q Consensus 292 ~~vp~~~~~l~~~ei~~sEsQeR~~~~v~~~~~~~~~~i~~~~~~~~~~ 340 (737) -.||.-.++.. | |+..|+. + +++|+|.+.+.+.+.+++.++.-.- T Consensus 4 ~~ip~~~DNy~-y-li~d~~~-~-a~vIDPgd~~~i~~~l~~~~l~l~~ 48 (251) T PRK10241 4 NSIPAFDDNYI-W-VLNDEAG-R-CLIVDPGEAEPVLNAIAANNWQPEA 48 (251) T ss_pred EEEEEECCEEE-E-EEECCCC-C-EEEECCCCHHHHHHHHHHCCCEEEE T ss_conf 99612556689-9-9985998-3-8999499709999999977991789 No 237 >pfam08731 AFT Transcription factor AFT. AFT (activator of iron transcription) is an iron regulated transcriptional activator that regulates the expression of genes involved in iron homeostasis. This family includes the paralogous pair of transcription factors AFT1 and AFT2. Probab=21.72 E-value=46 Score=12.55 Aligned_cols=14 Identities=29% Similarity=0.442 Sum_probs=8.6 Q ss_pred CEEEEEEECCCCCC Q ss_conf 83999995178877 Q gi|254780970|r 83 GDCVVFKMESHNHP 96 (737) Q Consensus 83 ~~~~~~k~EtHNhP 96 (737) .|.+++=.++|+|| T Consensus 76 ~W~ivv~n~~H~Hp 89 (90) T pfam08731 76 KWTIVVMNNNHSHP 89 (90) T ss_pred EEEEEEECCCCCCC T ss_conf 48899823677898 No 238 >PRK04161 tagatose 1,6-diphosphate aldolase; Reviewed Probab=21.61 E-value=46 Score=12.54 Aligned_cols=80 Identities=15% Similarity=0.155 Sum_probs=44.7 Q ss_pred CCCEEEEECCCCCCCHHCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHCCCCCCCCCC-HHHHHHHHHHHHHHHHHHCCC Q ss_conf 7622797304532100001001135678977764102785102232210245666862-134699999999852210288 Q gi|254780970|r 446 ETKALAFSSDVTPRYVKADPFEGTKQAVAECWRNIIATGAKPLAITDNLNFGNPEKEE-IMGQFVHSVKGIREACQILDF 524 (737) Q Consensus 446 ~~~gia~s~g~~p~~~~~dP~~ga~~AV~Eal~Nl~a~Ga~Plait~~lnf~~P~~~e-~~~ql~~av~gl~d~c~~lgi 524 (737) .+.|+.++.+- .-|-..+|.+-..+-..-+++.+--+|++-+.+.. |-+|+.++ .--+=..-|+.+.+.|++.+| T Consensus 83 ~~~GLlla~E~-tgyd~t~~gRl~~li~~wsv~rik~~GadAvK~L~---yy~pD~~~ein~~k~a~ve~ig~eC~~~di 158 (326) T PRK04161 83 NQTGLLLAYEK-TGYDATTTSRLPDCLVEWSVKRLKEAGADAVKFLL---YYDVDGDEEINLQKQAYIERIGSECTAEDI 158 (326) T ss_pred CCCCEEEEEEC-CCCCCCCCCCCCCCCCCCCHHHHHHHCCCCEEEEE---EECCCCCHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 88765999613-46777787657666766689999982846137999---978999878889999999999999987799 Q ss_pred CCCCC Q ss_conf 66544 Q gi|254780970|r 525 PIVSG 529 (737) Q Consensus 525 pivgG 529 (737) |++== T Consensus 159 pf~lE 163 (326) T PRK04161 159 PFFLE 163 (326) T ss_pred CEEEE T ss_conf 83898 No 239 >TIGR02823 oxido_YhdH putative quinone oxidoreductase, YhdH/YhfP family; InterPro: IPR014188 This entry represents a subfamily of the alcohol dehydrogenase, a superfamily in which some members are zinc-binding medium-chain alcohol dehydrogenases while others are quinone oxidoreductases with no bound zinc. This entry includes YhdH from Escherichia coli and YhfP from Bacillus subtilis both of which bind NADPH or NAD, but not zinc. Both proteins have been studied crystallographically for insight into function. . Probab=21.07 E-value=47 Score=12.46 Aligned_cols=188 Identities=14% Similarity=0.162 Sum_probs=102.0 Q ss_pred CCCCCCCCCCCC---CCCCCCCCCCCCCCCCEEEEECCCCCCCCCEEECCCCC--CCCCH-HHHHHHHHCCCCCCCCCCC Q ss_conf 320011236866---67631112222333520233102143898075137766--55432-6788886116789996479 Q gi|254780970|r 530 NVSFYNETNGQS---IFPTPTIAGVGILPDYSLMTRIDSAHEGDLILMIGNDG--CHLDC-SMYSLECASSNIGPPPKVD 603 (737) Q Consensus 530 kvSl~n~t~~~~---I~pTpvi~~vG~v~d~~~~it~~~k~~Gd~I~liG~~~--~~Lgg-S~~~~~~~~~~~g~~p~~d 603 (737) ||.|.-+-+++- -|-+|-|=..|.|.. .-.+.|+ +||.++++|..- .|.|| |.|++.-..=.---|...+ T Consensus 43 KDALA~tg~g~v~R~yP~~PGID~aG~Vv~---S~dp~F~-~GD~VivTGyglG~~H~GGysqyaRVp~dWlVpLP~gls 118 (330) T TIGR02823 43 KDALAITGKGGVVRSYPMIPGIDLAGTVVS---SEDPRFR-PGDEVIVTGYGLGVSHDGGYSQYARVPADWLVPLPEGLS 118 (330) T ss_pred HHHHEECCCCCEEEECCCCCCCCCEEEEEE---CCCCCCC-CCCEEEEEEECCCCCCCCCCEEEEEECHHHEECCCCCCC T ss_conf 033101387976887787688175468984---4887757-887189974024565157622356733231015889889 Q ss_pred HHHHHHH-------HHHHHHHHHCC-CE-----EEEEECCCCHHHHHHHHHHHCCCCCEEEEECCCCCCCHH-HHH---- Q ss_conf 8999999-------99999999959-80-----699464865799999999845897089980566689878-887---- Q gi|254780970|r 604 CHLEKNH-------GFFVLSMINAK-KI-----TACHDISTGGLIITLAEMTISSAKGMDIILPIEIEKDPK-PFL---- 665 (737) Q Consensus 604 ~~~~k~~-------~~~v~~li~~g-~I-----~S~HDiSdGGL~~aL~EMa~as~~G~~I~L~~~~~~d~~-~~L---- 665 (737) +.....+ ...++.|.+.| +- .-..=-|. |.-.--..| -+++|.++--.. .+.+.. ++| T Consensus 119 l~eAM~~GTAGfTAaL~V~aLe~~Gl~~P~~GpVlVTGAtG-GVGS~Av~~--L~~lGY~V~A~t-Gk~~~~~~yL~~LG 194 (330) T TIGR02823 119 LREAMALGTAGFTAALSVMALERNGLLTPEDGPVLVTGATG-GVGSLAVAI--LSKLGYEVVAST-GKAEEEVEYLKELG 194 (330) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEECCCC-CHHHHHHHH--HHHCCCEEEEEE-CCHHHHHHHHHHCC T ss_conf 88999860077899999999997268888888788706778-778999999--983797699973-78377889998658 Q ss_pred ---------HCCC-CCC---------EEEEECHHHHHHHHHHHHHCCCCEEEEEEECCCE-------EEEECC------- Q ss_conf ---------2988-321---------8999968809989999985798499989987986-------898074------- Q gi|254780970|r 666 ---------FGED-QGR---------YVVCISPENQDLVMSEANNKNIPLRYLGKVSGST-------LSIHNI------- 712 (737) Q Consensus 666 ---------FsEs-~Gr---------~ii~V~~~~~~~~~~~~~~~gi~~~~IG~Vt~~~-------l~i~~~------- 712 (737) |+++ .+| .|=+|=-+-...+++.++-.|+ .+.+|-+.+.. |++.++ T Consensus 195 A~evi~R~~l~~~a~~kPL~k~~WAGAvDtVGG~~La~~l~~~~ygG~-VA~cGlagG~~L~tTV~PFILRGV~LlGIDS 273 (330) T TIGR02823 195 ASEVIDREELSEDAPGKPLEKERWAGAVDTVGGKTLANVLAQLKYGGA-VAACGLAGGADLPTTVLPFILRGVSLLGIDS 273 (330) T ss_pred CCCCCCHHHCCCCCCCCCCCCCCCCCEEECCCHHHHHHHHHHHCCCCE-EEEEECCCCCCCCCEECCHHHCCCEEEEECH T ss_conf 110577112276878886312222215746875799999984068978-9999426798635012550432853888530 Q ss_pred EECCHHHHHHHHHH Q ss_conf 12149999999999 Q gi|254780970|r 713 LNIPVATLQTKYES 726 (737) Q Consensus 713 ~~~~i~~l~~~w~~ 726 (737) ...+++.=+.+|++ T Consensus 274 V~~p~~~R~~~W~R 287 (330) T TIGR02823 274 VYCPMALREAAWQR 287 (330) T ss_pred HCCCHHHHHHHHHH T ss_conf 01887899999998 No 240 >PRK12427 flagellar biosynthesis sigma factor; Provisional Probab=20.96 E-value=48 Score=12.45 Aligned_cols=13 Identities=23% Similarity=0.319 Sum_probs=5.3 Q ss_pred HHHHHHHHHHHCC Q ss_conf 9999985221028 Q gi|254780970|r 511 SVKGIREACQILD 523 (737) Q Consensus 511 av~gl~d~c~~lg 523 (737) .+-|+.++...|+ T Consensus 50 G~iGLi~Ai~rFD 62 (229) T PRK12427 50 ALMGLLEALRRYG 62 (229) T ss_pred HHHHHHHHHHHCC T ss_conf 9999999998449 No 241 >cd00168 SCP SCP: SCP-like extracellular protein domain, found in eukaryotes and prokaryotes. This family includes plant pathogenesis-related protein 1 (PR-1), which accumulates after infections with pathogens, and may act as an anti-fungal agent or be involved in cell wall loosening. This family also includes CRISPs, mammalian cysteine-rich secretory proteins, which combine SCP with a C-terminal cysteine rich domain, and allergen 5 from vespid venom. Roles for CRISP, in response to pathogens, fertilization, and sperm maturation have been proposed. One member, Tex31 from the venom duct of Conus textile, has been shown to possess proteolytic activity sensitive to serine protease inhibitors. The human GAPR-1 protein has been reported to dimerize, and such a dimer may form an active site containing a catalytic triad. SCP has also been proposed to be a Ca++ chelating serine protease. The Ca++-chelating function would fit with various signaling processes that members of this family, such as Probab=20.83 E-value=46 Score=12.55 Aligned_cols=21 Identities=19% Similarity=0.340 Sum_probs=17.1 Q ss_pred CHHHHHHHHHHHCCCCCCCCC Q ss_conf 188999985330011074458 Q gi|254780970|r 33 TLTEIGIISAMWNEHCSYKSS 53 (737) Q Consensus 33 t~~El~~~~~~wSEHC~~k~~ 53 (737) =|.||.-.+|.|.++|..+.+ T Consensus 23 Wd~~La~~Aq~wA~~c~~~h~ 43 (122) T cd00168 23 WDAELAKTAQNYANRCIFKHS 43 (122) T ss_pred ECHHHHHHHHHHHHHCCCCCC T ss_conf 499999999999986041368 No 242 >KOG1004 consensus Probab=20.78 E-value=42 Score=12.86 Aligned_cols=50 Identities=28% Similarity=0.353 Sum_probs=30.6 Q ss_pred CCCCCCEEEEE----CCCCCCCCCHHHHHHH---HHHCCCCCCCCCCCCCCCHHHHH Q ss_conf 25689779996----5765644410466667---77286533344433267989998 Q gi|254780970|r 197 ARGIGLPLVYL----GAKTGRDGIGGASMAS---EEFGENIAKKRPTVQVGDPFTGK 246 (737) Q Consensus 197 ~~~~Gd~ii~~----G~~tg~dGi~Ga~~sS---~~~~~~~~~~~~~VQ~gdp~~ek 246 (737) -+.+||.||=+ -+..+|.-|||+..+| ..|.--....+|.+|+||-...| T Consensus 62 iP~~~D~VIGiV~~~~gd~ykVDigg~~~a~L~~laFe~AtkrNrPnl~vGdliyak 118 (230) T KOG1004 62 IPVKGDHVIGIVTSKSGDIYKVDIGGSEPASLSYLAFEGATKRNRPNLQVGDLIYAK 118 (230) T ss_pred CCCCCCEEEEEEEECCCCEEEEECCCCCEEEEEECCCCCCCCCCCCCCCCCCEEEEE T ss_conf 378899899999850465179963898665331002368543578864316578999 No 243 >pfam02864 STAT_bind STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. This family represents the DNA binding domain of STAT, which has an ig-like fold. STAT proteins also include an SH2 domain pfam00017. Probab=20.72 E-value=33 Score=13.55 Aligned_cols=11 Identities=18% Similarity=0.078 Sum_probs=3.9 Q ss_pred CCCCHHHHHHH Q ss_conf 68621346999 Q gi|254780970|r 500 EKEEIMGQFVH 510 (737) Q Consensus 500 ~~~e~~~ql~~ 510 (737) ++.-.+|+-.. T Consensus 236 ~r~FtFW~Wf~ 246 (254) T pfam02864 236 GKSFTFWQWFD 246 (254) T ss_pred CCCCCHHHHHH T ss_conf 99976898899 No 244 >TIGR02997 Sig70-cyanoRpoD RNA polymerase sigma factor, cyanobacterial RpoD-like family. Probab=20.52 E-value=49 Score=12.39 Aligned_cols=27 Identities=26% Similarity=0.359 Sum_probs=13.8 Q ss_pred HHHHHHHHHHHHHCCCCC----HHHHHHHCC Q ss_conf 356789777641027851----022322102 Q gi|254780970|r 469 TKQAVAECWRNIIATGAK----PLAITDNLN 495 (737) Q Consensus 469 a~~AV~Eal~Nl~a~Ga~----Plait~~ln 495 (737) |++=|==++|.-+|.=++ |++||.=+| T Consensus 167 AYWWIRQ~ITRAIA~qSRTIRLPiH~~EKLn 197 (336) T TIGR02997 167 AYWWIRQGITRAIANQSRTIRLPIHITEKLN 197 (336) T ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCHHHHHH T ss_conf 2468756889998436883247410766668 No 245 >pfam08652 RAI1 RAI1 like. RAI1 is homologous to Caenorhabditis elegans DOM-3 and human DOM3Z and binds to a nuclear exoribonuclease. It is required for 5.8S rRNA processing. Probab=20.08 E-value=50 Score=12.33 Aligned_cols=29 Identities=17% Similarity=0.343 Sum_probs=20.9 Q ss_pred CCCHHHHHHHHHCCCCEEECCCCCEEEEE Q ss_conf 45889997512069748993347736898 Q gi|254780970|r 51 KSSKKWLRTLPTTGKHVIQGPGENAGVVD 79 (737) Q Consensus 51 k~~~~~l~~~~~~~~~vi~~~~~~a~vi~ 79 (737) |.++-|+..|----++++.|+-|+.|++. T Consensus 39 Kllk~W~QsfLvGv~~IvvGfRd~~g~l~ 67 (68) T pfam08652 39 KLLKWWAQSFLVGVPRIVVGFRDDDGILV 67 (68) T ss_pred HHHHHHHHHHHCCCCEEEEEEECCCCEEC T ss_conf 99999999986189859999977997483 Done!