Query gi|254780234|ref|YP_003064647.1| argininosuccinate synthase [Candidatus Liberibacter asiaticus str. psy62] Match_columns 404 No_of_seqs 140 out of 1321 Neff 5.0 Searched_HMMs 39220 Date Mon May 23 23:03:56 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780234.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK00509 argininosuccinate syn 100.0 0 0 1200.9 42.4 396 5-403 2-397 (398) 2 TIGR00032 argG argininosuccina 100.0 0 0 1188.5 35.2 395 7-403 1-420 (420) 3 pfam00764 Arginosuc_synth Argi 100.0 0 0 1173.2 40.2 388 9-401 1-389 (389) 4 COG0137 ArgG Argininosuccinate 100.0 0 0 1175.3 38.3 396 4-403 3-400 (403) 5 cd01999 Argininosuccinate_Synt 100.0 0 0 1167.6 40.8 384 8-394 1-385 (385) 6 PRK13820 argininosuccinate syn 100.0 0 0 1162.4 42.6 392 1-403 1-394 (395) 7 PRK04527 argininosuccinate syn 100.0 0 0 1115.9 39.9 392 1-403 1-397 (397) 8 PRK05370 argininosuccinate syn 100.0 0 0 1080.6 37.8 388 5-396 11-414 (447) 9 KOG1706 consensus 100.0 0 0 1013.9 18.8 398 1-402 1-407 (412) 10 PRK00143 trmU tRNA (5-methylam 99.8 1.9E-17 4.8E-22 145.6 13.6 158 4-171 3-189 (355) 11 COG0482 TrmU Predicted tRNA(5- 99.7 1.9E-16 5E-21 138.3 12.8 238 5-282 3-270 (356) 12 cd01998 tRNA_Me_trans tRNA met 99.7 2.9E-16 7.3E-21 137.0 13.1 162 7-183 1-192 (349) 13 pfam03054 tRNA_Me_trans tRNA m 99.7 3.7E-16 9.4E-21 136.2 12.7 155 7-171 2-187 (354) 14 TIGR00420 trmU tRNA (5-methyla 99.4 4.7E-12 1.2E-16 106.8 11.2 186 6-203 1-234 (394) 15 cd01990 Alpha_ANH_like_I This 99.4 4.4E-12 1.1E-16 107.0 10.9 147 8-170 1-149 (202) 16 COG1606 ATP-utilizing enzymes 99.4 1.1E-11 2.7E-16 104.3 10.9 154 3-173 15-170 (269) 17 PRK11106 queuosine biosynthesi 99.3 5.6E-10 1.4E-14 91.9 15.7 157 5-172 1-185 (231) 18 cd01712 ThiI ThiI is required 99.2 2.4E-09 6E-14 87.4 16.2 150 7-169 1-159 (177) 19 COG0037 MesJ tRNA(Ile)-lysidin 99.2 1.3E-09 3.2E-14 89.4 13.7 149 6-172 22-191 (298) 20 cd01993 Alpha_ANH_like_II This 99.2 1.1E-09 2.7E-14 89.9 12.9 159 7-182 1-184 (185) 21 TIGR00268 TIGR00268 conserved 99.1 2.3E-09 5.8E-14 87.5 11.0 153 5-174 15-172 (263) 22 pfam06508 ExsB ExsB. This fami 99.1 1.9E-09 4.8E-14 88.1 9.8 111 7-121 1-121 (137) 23 COG0603 Predicted PP-loop supe 99.0 2.3E-08 5.9E-13 80.3 14.9 157 5-173 2-184 (222) 24 cd01995 ExsB ExsB is a transcr 99.0 2.5E-08 6.4E-13 80.0 13.8 131 7-173 1-144 (169) 25 KOG2805 consensus 99.0 1.1E-08 2.8E-13 82.7 11.6 164 1-169 1-193 (377) 26 pfam02568 ThiI Thiamine biosyn 98.9 2.1E-07 5.4E-12 73.4 14.8 148 6-167 4-161 (197) 27 PRK10660 tilS tRNA(Ile)-lysidi 98.9 1.2E-07 3E-12 75.2 13.3 162 3-188 11-193 (433) 28 pfam01171 ATP_bind_3 PP-loop f 98.8 4.1E-07 1E-11 71.4 13.9 147 7-175 1-169 (182) 29 PRK10696 C32 tRNA thiolase; Pr 98.7 8E-07 2E-11 69.3 14.3 168 1-189 35-223 (311) 30 cd01992 PP-ATPase N-terminal d 98.7 3.7E-07 9.4E-12 71.7 11.6 147 7-175 1-169 (185) 31 TIGR00364 TIGR00364 exsB prote 98.7 4.2E-08 1.1E-12 78.5 6.1 159 9-173 2-201 (227) 32 PRK08349 hypothetical protein; 98.6 4.3E-06 1.1E-10 64.0 15.2 151 7-167 2-158 (198) 33 cd01994 Alpha_ANH_like_IV This 98.6 3.7E-06 9.5E-11 64.5 14.8 149 7-192 1-161 (194) 34 PRK00919 GMP synthase subunit 98.6 7.5E-07 1.9E-11 69.5 10.4 100 5-119 20-125 (306) 35 cd01997 GMP_synthase_C The C-t 98.6 1.1E-06 2.7E-11 68.3 10.8 145 7-169 1-166 (295) 36 cd01986 Alpha_ANH_like Adenine 98.5 1.7E-07 4.4E-12 74.1 6.0 80 8-131 1-80 (103) 37 COG0519 GuaA GMP synthase, PP- 98.5 3.2E-06 8E-11 65.0 10.7 146 5-168 21-184 (315) 38 pfam01507 PAPS_reduct Phosphoa 98.4 3.7E-06 9.4E-11 64.5 10.7 151 7-170 1-156 (174) 39 TIGR00884 guaA_Cterm GMP synth 98.4 1.1E-06 2.8E-11 68.3 7.7 182 6-253 17-224 (319) 40 cd01713 PAPS_reductase This do 98.4 1E-05 2.5E-10 61.4 12.5 150 7-170 1-162 (173) 41 PRK00074 guaA GMP synthase; Re 98.4 1.6E-06 4E-11 67.2 8.2 148 5-169 216-383 (513) 42 TIGR02432 lysidine_TilS_N tRNA 98.4 6.8E-06 1.7E-10 62.6 11.1 150 7-175 1-188 (204) 43 pfam01902 ATP_bind_4 ATP-bindi 98.4 2.6E-05 6.6E-10 58.4 13.6 154 7-197 2-163 (219) 44 PRK13795 hypothetical protein; 98.4 1.8E-05 4.7E-10 59.5 12.8 145 5-169 245-402 (630) 45 TIGR03573 WbuX N-acetyl sugar 98.4 1.5E-05 3.7E-10 60.2 11.8 106 8-121 62-169 (343) 46 pfam00733 Asn_synthase Asparag 98.3 9.9E-06 2.5E-10 61.4 10.5 143 6-170 18-166 (195) 47 PRK13794 hypothetical protein; 98.3 3.6E-05 9.2E-10 57.4 13.3 146 5-170 245-403 (473) 48 cd01991 Asn_Synthase_B_C The C 98.3 1.3E-05 3.2E-10 60.6 10.5 112 6-128 16-129 (269) 49 PRK01269 thiamine biosynthesis 98.3 9.9E-05 2.5E-09 54.3 14.9 148 6-167 178-333 (483) 50 COG2117 Predicted subunit of t 98.3 5.8E-06 1.5E-10 63.1 8.0 129 7-148 2-134 (198) 51 COG2102 Predicted ATPases of P 98.2 8E-05 2E-09 54.9 13.7 153 7-195 2-162 (223) 52 COG0175 CysH 3'-phosphoadenosi 98.2 1.3E-05 3.2E-10 60.7 9.4 150 6-171 40-198 (261) 53 cd01996 Alpha_ANH_like_III Thi 98.2 4.4E-05 1.1E-09 56.8 11.7 106 7-121 3-111 (154) 54 COG0301 ThiI Thiamine biosynth 98.2 0.00026 6.7E-09 51.2 15.4 149 6-168 176-333 (383) 55 PRK08557 hypothetical protein; 98.2 8.7E-05 2.2E-09 54.7 12.6 143 7-170 185-341 (420) 56 TIGR01536 asn_synth_AEB aspara 98.1 5.9E-05 1.5E-09 55.9 10.9 107 6-119 312-439 (646) 57 PRK08576 hypothetical protein; 98.1 0.00017 4.4E-09 52.5 13.2 139 7-170 237-389 (439) 58 PRK01565 thiamine biosynthesis 98.1 0.00036 9.3E-09 50.2 14.8 148 6-167 177-332 (399) 59 PRK09431 asnB asparagine synth 98.0 7.3E-05 1.9E-09 55.2 10.1 113 6-128 228-354 (555) 60 COG0367 AsnB Asparagine syntha 98.0 0.00014 3.6E-09 53.1 11.5 106 6-119 231-339 (542) 61 PRK13980 NAD synthetase; Provi 98.0 0.00042 1.1E-08 49.8 12.9 146 4-171 27-181 (264) 62 COG1365 Predicted ATPase (PP-l 98.0 0.0001 2.6E-09 54.2 9.6 135 4-170 59-205 (255) 63 cd00553 NAD_synthase NAD+ synt 98.0 0.00064 1.6E-08 48.4 13.6 151 4-171 22-181 (248) 64 PRK13981 NAD synthetase; Provi 98.0 0.0016 4E-08 45.7 15.5 149 4-167 280-438 (543) 65 PTZ00077 asparagine synthetase 97.9 0.00032 8.3E-09 50.6 11.3 112 7-128 239-392 (610) 66 pfam02540 NAD_synthase NAD syn 97.9 0.00029 7.5E-09 50.9 11.0 152 4-171 17-172 (243) 67 TIGR00289 TIGR00289 conserved 97.8 0.00017 4.3E-09 52.6 9.1 110 7-147 2-122 (227) 68 cd01984 AANH_like Adenine nucl 97.7 7.3E-05 1.9E-09 55.2 5.7 66 8-132 1-68 (86) 69 TIGR03108 eps_aminotran_1 exos 97.7 0.00092 2.3E-08 47.3 11.3 108 6-127 259-369 (628) 70 TIGR03104 trio_amidotrans aspa 97.6 0.00065 1.6E-08 48.4 9.5 108 7-128 262-376 (589) 71 KOG1622 consensus 97.6 0.00044 1.1E-08 49.6 7.5 66 6-72 231-299 (552) 72 TIGR03183 DNA_S_dndC putative 97.5 0.0021 5.4E-08 44.7 10.3 145 7-165 15-200 (447) 73 PRK06850 hypothetical protein; 97.4 0.0048 1.2E-07 42.2 11.5 146 7-166 23-209 (488) 74 PRK00768 nadE NAD synthetase; 97.2 0.011 2.9E-07 39.5 11.4 157 4-172 39-206 (274) 75 TIGR00342 TIGR00342 thiamine b 97.2 0.021 5.4E-07 37.5 12.7 148 6-167 187-345 (391) 76 PTZ00323 NAD+ synthase; Provis 97.2 0.0085 2.2E-07 40.4 10.6 158 4-170 45-213 (294) 77 PRK00876 nadE NAD synthetase; 97.2 0.0027 7E-08 43.9 8.0 159 4-172 32-241 (325) 78 PRK08384 thiamine biosynthesis 97.2 0.03 7.6E-07 36.5 13.1 122 6-166 180-307 (310) 79 COG0171 NadE NAD synthase [Coe 97.1 0.0027 6.8E-08 44.0 7.3 155 4-170 24-187 (268) 80 PRK02628 nadE NAD synthetase; 97.0 0.013 3.2E-07 39.2 10.3 201 5-259 362-584 (678) 81 KOG0571 consensus 96.5 0.05 1.3E-06 34.9 10.0 113 7-128 227-346 (543) 82 PRK12563 sulfate adenylyltrans 96.5 0.048 1.2E-06 35.0 9.6 212 4-250 36-283 (312) 83 PRK02090 phosphoadenosine phos 96.4 0.068 1.7E-06 33.9 10.0 147 5-170 41-196 (243) 84 PRK05253 sulfate adenylyltrans 96.3 0.093 2.4E-06 32.9 10.2 212 5-251 26-272 (300) 85 PRK09590 celB cellobiose phosp 95.0 0.1 2.6E-06 32.7 6.4 62 5-67 1-84 (104) 86 TIGR02039 CysD sulfate adenyly 94.9 0.21 5.4E-06 30.3 7.9 241 5-277 19-294 (295) 87 cd01987 USP_OKCHK USP domain i 94.1 0.58 1.5E-05 27.2 9.9 87 7-125 1-98 (124) 88 cd05565 PTS_IIB_lactose PTS_II 92.9 0.64 1.6E-05 26.9 7.3 59 7-66 2-80 (99) 89 COG3969 Predicted phosphoadeno 92.5 0.21 5.3E-06 30.4 4.3 72 5-84 27-104 (407) 90 PRK05406 LamB/YcsF family prot 91.5 0.43 1.1E-05 28.1 4.9 146 102-258 93-246 (246) 91 cd05564 PTS_IIB_chitobiose_lic 90.3 0.94 2.4E-05 25.7 5.7 60 7-67 1-80 (96) 92 PRK10499 N,N'-diacetylchitobio 89.4 1.3 3.2E-05 24.8 5.9 35 1-37 1-40 (106) 93 TIGR00552 nadE NAD+ synthetase 88.3 2 5.1E-05 23.4 6.2 157 4-173 25-199 (286) 94 KOG0573 consensus 87.7 0.44 1.1E-05 28.1 2.6 21 6-26 251-271 (520) 95 TIGR01949 AroFGH_arch predicte 86.4 1.1 2.8E-05 25.2 4.0 90 28-138 101-193 (259) 96 TIGR02823 oxido_YhdH putative 86.2 1.7 4.2E-05 23.9 4.8 91 7-118 152-247 (330) 97 cd06313 PBP1_ABC_sugar_binding 85.2 3.1 7.9E-05 22.0 5.8 67 101-170 18-84 (272) 98 cd06319 PBP1_ABC_sugar_binding 84.7 3.4 8.7E-05 21.7 6.3 67 101-170 18-84 (277) 99 pfam03746 LamB_YcsF LamB/YcsF 84.3 1.9 4.9E-05 23.5 4.4 138 102-251 91-237 (242) 100 cd06320 PBP1_allose_binding Pe 82.8 4 0.0001 21.2 5.8 68 100-170 17-86 (275) 101 COG1440 CelA Phosphotransferas 82.4 4.2 0.00011 21.1 6.6 62 5-67 1-82 (102) 102 cd01538 PBP1_ABC_xylose_bindin 82.3 4.2 0.00011 21.0 5.9 69 99-170 16-84 (288) 103 cd06318 PBP1_ABC_sugar_binding 82.2 4.2 0.00011 21.0 6.4 65 102-169 19-83 (282) 104 cd06323 PBP1_ribose_binding Pe 82.1 4.3 0.00011 21.0 6.1 68 100-170 17-84 (268) 105 COG0439 AccC Biotin carboxylas 81.7 4.4 0.00011 20.9 8.8 148 5-172 2-158 (449) 106 TIGR01283 nifE nitrogenase MoF 81.3 4.6 0.00012 20.8 5.5 113 6-169 337-456 (470) 107 PRK12767 carbamoyl phosphate s 81.2 3.8 9.8E-05 21.3 5.0 49 5-65 1-50 (325) 108 PRK09958 DNA-binding transcrip 81.0 0.95 2.4E-05 25.7 1.8 142 15-167 57-203 (204) 109 TIGR02298 HpaD_Fe 3,4-dihydrox 80.1 3.6 9.2E-05 21.5 4.6 88 44-151 93-194 (343) 110 cd01985 ETF The electron trans 79.4 5.2 0.00013 20.4 8.1 78 22-126 28-105 (181) 111 cd01714 ETF_beta The electron 78.0 5.7 0.00015 20.1 10.2 95 22-142 44-140 (202) 112 PRK10840 transcriptional regul 77.0 1.5 3.7E-05 24.3 1.8 135 17-171 66-213 (216) 113 pfam05368 NmrA NmrA-like famil 76.8 6.2 0.00016 19.9 7.1 211 8-256 2-227 (232) 114 cd06305 PBP1_methylthioribose_ 75.3 6.7 0.00017 19.6 6.3 68 100-170 17-84 (273) 115 KOG2594 consensus 75.2 6.8 0.00017 19.6 11.1 225 4-237 61-352 (396) 116 PRK12921 2-dehydropantoate 2-r 75.0 6 0.00015 20.0 4.5 84 243-337 218-302 (306) 117 cd06317 PBP1_ABC_sugar_binding 74.1 7.2 0.00018 19.4 6.3 65 102-169 20-84 (275) 118 PRK08462 biotin carboxylase; V 72.5 7.8 0.0002 19.1 7.6 35 3-39 2-36 (446) 119 cd06312 PBP1_ABC_sugar_binding 72.3 7.9 0.0002 19.1 6.0 68 100-170 18-86 (271) 120 PRK09955 rihB ribonucleoside h 72.0 8 0.0002 19.0 12.4 51 1-54 1-51 (313) 121 cd06310 PBP1_ABC_sugar_binding 71.4 8.2 0.00021 19.0 7.1 67 100-170 17-86 (273) 122 PRK09628 oorB 2-oxoglutarate-a 71.3 6.6 0.00017 19.6 4.0 68 103-170 156-230 (281) 123 cd01534 4RHOD_Repeat_3 Member 70.7 7.9 0.0002 19.1 4.3 12 57-68 13-24 (95) 124 cd03376 TPP_PFOR_porB_like Thi 70.3 8.7 0.00022 18.8 4.6 134 21-163 71-217 (235) 125 COG1830 FbaB DhnA-type fructos 69.9 8.8 0.00023 18.7 6.5 29 32-60 23-59 (265) 126 PRK12569 hypothetical protein; 69.7 8.9 0.00023 18.7 4.4 89 102-199 96-192 (245) 127 PRK09935 transcriptional regul 69.7 1 2.6E-05 25.5 -0.5 141 16-167 62-209 (210) 128 cd00958 DhnA Class I fructose- 69.5 9 0.00023 18.7 5.9 191 44-292 21-233 (235) 129 PRK08305 spoVFB dipicolinate s 69.3 9.1 0.00023 18.6 5.0 41 1-42 1-45 (195) 130 PRK05920 aromatic acid decarbo 67.9 9.7 0.00025 18.4 5.2 40 1-41 1-42 (205) 131 PRK07226 fructose-bisphosphate 67.8 9.7 0.00025 18.4 6.6 208 32-298 20-256 (266) 132 pfam04800 ETC_C1_NDUFA4 ETC co 67.5 2.7 6.9E-05 22.4 1.3 22 151-172 54-75 (101) 133 cd06309 PBP1_YtfQ_like Peripla 65.7 11 0.00027 18.1 6.5 50 117-166 130-183 (273) 134 COG0056 AtpA F0F1-type ATP syn 65.3 11 0.00028 18.1 4.8 80 17-117 175-258 (504) 135 cd06289 PBP1_MalI_like Ligand- 65.2 11 0.00028 18.1 6.2 63 103-169 20-82 (268) 136 TIGR00853 pts-lac PTS system, 64.8 11 0.00028 18.0 6.0 74 88-173 49-125 (142) 137 COG2403 Predicted GTPase [Gene 64.3 11 0.00029 18.0 8.8 100 1-101 2-130 (449) 138 TIGR00290 MJ0570_dom MJ0570-re 64.2 11 0.00029 18.0 5.5 145 7-164 2-156 (265) 139 cd06302 PBP1_LsrB_Quorum_Sensi 63.7 12 0.00029 17.9 5.7 66 101-169 18-84 (298) 140 PRK08654 pyruvate carboxylase 63.3 12 0.0003 17.8 8.3 31 6-38 3-33 (497) 141 PRK08339 short chain dehydroge 63.0 12 0.0003 17.8 7.0 57 8-69 12-70 (263) 142 PRK07165 F0F1 ATP synthase sub 62.5 12 0.00031 17.8 4.6 54 10-63 149-203 (507) 143 COG2086 FixA Electron transfer 62.4 12 0.00031 17.7 9.3 101 22-148 46-149 (260) 144 cd06390 PBP1_iGluR_AMPA_GluR1 61.9 12 0.00032 17.7 8.4 42 97-139 101-142 (364) 145 COG0075 Serine-pyruvate aminot 61.3 13 0.00032 17.6 7.8 98 9-131 83-180 (383) 146 PRK10768 ribonucleoside hydrol 60.8 13 0.00033 17.5 10.6 48 6-54 3-50 (304) 147 PRK13209 L-xylulose 5-phosphat 60.4 13 0.00033 17.5 4.8 122 13-172 18-157 (283) 148 PRK11864 2-ketoisovalerate fer 60.2 13 0.00034 17.5 3.7 60 94-153 153-215 (300) 149 COG1341 Predicted GTPase or GT 59.8 13 0.00034 17.4 3.9 118 6-142 72-204 (398) 150 pfam03480 SBP_bac_7 Bacterial 59.4 13 0.00034 17.4 3.6 18 104-121 98-115 (285) 151 TIGR02875 spore_0_A sporulatio 59.0 6.9 0.00018 19.5 2.1 134 40-193 87-246 (270) 152 TIGR00962 atpA ATP synthase F1 58.5 14 0.00036 17.3 4.8 44 18-61 175-227 (520) 153 PRK10100 DNA-binding transcrip 58.4 4.3 0.00011 21.0 1.0 125 31-166 82-214 (216) 154 cd06301 PBP1_rhizopine_binding 56.2 15 0.00039 17.0 6.4 64 103-169 20-84 (272) 155 PRK03359 putative electron tra 55.4 16 0.0004 16.9 9.2 99 22-145 45-147 (256) 156 cd06322 PBP1_ABC_sugar_binding 54.9 16 0.00041 16.9 6.0 67 101-170 18-84 (267) 157 COG2264 PrmA Ribosomal protein 54.8 16 0.00041 16.9 8.8 68 10-82 139-208 (300) 158 cd01976 Nitrogenase_MoFe_alpha 54.3 16 0.00042 16.8 5.4 67 103-172 120-204 (421) 159 PRK05586 biotin carboxylase; V 54.2 16 0.00042 16.8 3.7 32 6-39 3-34 (447) 160 TIGR00787 dctP TRAP transporte 53.6 15 0.00039 17.0 3.2 51 82-145 28-79 (267) 161 cd06315 PBP1_ABC_sugar_binding 53.2 17 0.00043 16.7 5.7 66 101-170 19-85 (280) 162 TIGR03324 alt_F1F0_F1_al alter 52.4 18 0.00045 16.6 4.9 103 10-138 168-274 (497) 163 COG1540 Uncharacterized protei 52.4 18 0.00045 16.6 5.5 144 104-259 95-247 (252) 164 PRK07792 fabG 3-ketoacyl-(acyl 51.8 18 0.00046 16.5 7.2 58 8-69 13-71 (303) 165 pfam08901 DUF1847 Protein of u 51.8 18 0.00046 16.5 4.3 35 6-41 56-92 (157) 166 COG4594 FecB ABC-type Fe3+-cit 51.1 12 0.00031 17.7 2.3 78 107-190 212-289 (310) 167 pfam07338 DUF1471 Protein of u 51.0 18 0.00047 16.4 3.9 39 30-68 13-52 (66) 168 cd03017 PRX_BCP Peroxiredoxin 50.5 19 0.00048 16.4 5.3 16 6-21 24-39 (140) 169 TIGR01472 gmd GDP-mannose 4,6- 50.5 15 0.00039 17.1 2.7 70 15-86 10-86 (365) 170 COG4126 Hydantoin racemase [Am 50.4 19 0.00048 16.4 3.5 82 50-134 102-196 (230) 171 cd05802 GlmM GlmM is a bacteri 50.4 19 0.00048 16.4 6.5 17 104-120 185-201 (434) 172 PRK12367 short chain dehydroge 50.1 19 0.00048 16.3 5.6 128 6-195 18-146 (250) 173 PRK11865 pyruvate ferredoxin o 49.7 19 0.00049 16.3 4.7 52 102-153 163-217 (298) 174 PRK11411 fecB iron-dicitrate t 49.1 19 0.00049 16.3 3.1 16 5-20 39-54 (297) 175 PRK06249 2-dehydropantoate 2-r 49.0 20 0.0005 16.2 3.4 35 1-37 1-35 (313) 176 pfam02677 DUF208 Uncharacteriz 48.9 20 0.0005 16.2 13.1 143 14-168 7-162 (176) 177 KOG2316 consensus 48.4 20 0.00051 16.2 4.3 210 7-235 2-237 (277) 178 PRK07178 acetyl-CoA carboxylas 47.0 21 0.00054 16.0 3.8 73 98-171 59-156 (471) 179 KOG2862 consensus 46.5 21 0.00055 16.0 6.8 48 7-55 93-140 (385) 180 PRK12608 transcription termina 46.2 22 0.00055 15.9 4.6 55 237-303 285-339 (379) 181 cd01539 PBP1_GGBP Periplasmic 46.0 22 0.00056 15.9 5.8 144 11-166 36-199 (303) 182 COG0174 GlnA Glutamine synthet 45.9 3.3 8.3E-05 21.8 -1.3 149 16-181 75-237 (443) 183 PRK12778 putative bifunctional 45.8 22 0.00056 15.9 8.0 101 9-117 101-208 (760) 184 PRK09281 F0F1 ATP synthase sub 45.6 22 0.00056 15.9 5.2 98 13-136 171-272 (502) 185 PRK09195 gatY tagatose-bisphos 45.4 21 0.00053 16.1 2.8 121 21-173 9-135 (284) 186 KOG1865 consensus 45.4 22 0.00057 15.9 5.6 65 128-196 155-237 (545) 187 cd06324 PBP1_ABC_sugar_binding 45.2 22 0.00057 15.8 5.7 69 98-166 123-201 (305) 188 PRK11579 putative oxidoreducta 45.2 22 0.00057 15.8 9.5 119 1-137 1-133 (346) 189 KOG0780 consensus 45.0 22 0.00057 15.8 8.8 145 13-196 110-261 (483) 190 pfam02441 Flavoprotein Flavopr 44.8 23 0.00058 15.8 4.6 37 6-42 1-39 (118) 191 PRK10355 xylF D-xylose transpo 44.4 23 0.00058 15.8 6.0 81 33-119 27-113 (330) 192 PRK13237 tyrosine phenol-lyase 44.4 23 0.00058 15.8 8.5 148 10-171 155-341 (459) 193 COG0541 Ffh Signal recognition 44.3 23 0.00059 15.7 7.5 124 15-172 112-242 (451) 194 pfam11138 DUF2911 Protein of u 44.0 16 0.0004 16.9 2.0 42 180-230 84-125 (145) 195 COG1832 Predicted CoA-binding 43.9 23 0.00059 15.7 5.2 102 7-122 19-130 (140) 196 PRK13303 L-aspartate dehydroge 43.7 23 0.0006 15.7 9.5 118 5-130 1-131 (265) 197 cd01523 RHOD_Lact_B Member of 43.5 24 0.0006 15.7 4.4 21 54-75 10-32 (100) 198 PRK09437 bcp thioredoxin-depen 43.4 24 0.0006 15.7 4.6 64 6-74 31-103 (156) 199 PRK09390 fixJ response regulat 43.1 24 0.00061 15.6 4.9 17 151-167 185-201 (202) 200 PRK00724 formate dehydrogenase 43.0 24 0.00061 15.6 7.9 21 8-28 41-61 (262) 201 cd01524 RHOD_Pyr_redox Member 43.0 24 0.00061 15.6 4.2 15 59-75 12-26 (90) 202 PRK06801 hypothetical protein; 42.9 24 0.00061 15.6 2.8 120 22-173 10-135 (286) 203 cd07202 cPLA2_Grp-IVC Group IV 42.8 24 0.00062 15.6 4.6 48 234-281 312-363 (430) 204 PRK10365 transcriptional regul 42.6 24 0.00062 15.6 9.1 107 21-135 67-185 (441) 205 COG3033 TnaA Tryptophanase [Am 42.5 24 0.00062 15.6 8.0 113 11-136 164-292 (471) 206 PRK08591 acetyl-CoA carboxylas 42.3 25 0.00063 15.5 6.1 22 98-119 60-81 (449) 207 PRK06029 3-octaprenyl-4-hydrox 42.1 25 0.00063 15.5 4.8 148 5-184 1-162 (187) 208 PRK04148 hypothetical protein; 42.0 25 0.00063 15.5 5.0 82 15-119 26-109 (135) 209 cd01977 Nitrogenase_VFe_alpha 41.9 25 0.00064 15.5 7.5 22 149-170 170-192 (415) 210 PRK12857 putative aldolase; Re 41.9 25 0.00064 15.5 2.8 115 28-173 15-135 (284) 211 PRK06111 acetyl-CoA carboxylas 41.7 25 0.00064 15.5 3.9 71 98-172 60-158 (449) 212 PRK13238 tnaA tryptophanase; P 41.5 25 0.00065 15.4 8.8 87 11-110 156-253 (461) 213 TIGR01328 met_gam_lyase methio 41.5 25 0.00065 15.4 3.3 87 44-166 161-279 (392) 214 cd06283 PBP1_RegR_EndR_KdgR_li 41.3 25 0.00065 15.4 6.0 109 27-137 26-143 (267) 215 cd01537 PBP1_Repressors_Sugar_ 41.3 25 0.00065 15.4 6.2 42 96-137 101-144 (264) 216 cd00293 USP_Like Usp: Universa 41.3 25 0.00065 15.4 10.5 104 7-126 1-107 (130) 217 PRK10653 D-ribose transporter 41.2 26 0.00065 15.4 6.0 65 102-169 46-110 (295) 218 pfam00609 DAGK_acc Diacylglyce 41.2 20 0.00051 16.2 2.2 20 220-239 138-157 (157) 219 cd01320 ADA Adenosine deaminas 41.1 26 0.00065 15.4 7.7 21 11-32 10-30 (325) 220 TIGR01074 rep ATP-dependent DN 41.1 26 0.00065 15.4 3.7 20 244-263 419-438 (677) 221 smart00829 PKS_ER Enoylreducta 41.0 26 0.00066 15.4 7.0 54 104-168 96-151 (288) 222 pfam01156 IU_nuc_hydro Inosine 40.8 26 0.00066 15.4 11.8 48 6-54 1-48 (303) 223 COG1564 THI80 Thiamine pyropho 40.1 11 0.00029 17.9 0.8 198 7-235 2-206 (212) 224 PRK10310 galactitol-specific P 40.1 27 0.00068 15.3 4.7 36 5-41 2-42 (94) 225 TIGR02064 dsrA sulfite reducta 39.7 22 0.00057 15.8 2.2 26 138-164 227-253 (430) 226 PRK09701 D-allose transporter 39.2 27 0.0007 15.2 6.4 66 102-170 44-111 (311) 227 COG1064 AdhP Zn-dependent alco 39.1 27 0.0007 15.2 6.0 139 6-171 168-308 (339) 228 cd06308 PBP1_sensor_kinase_lik 38.7 28 0.00071 15.2 6.0 141 10-165 34-180 (270) 229 cd00640 Trp-synth-beta_II Tryp 38.2 28 0.00072 15.1 11.6 52 8-65 52-103 (244) 230 TIGR02079 THD1 threonine dehyd 38.0 29 0.00073 15.1 6.6 108 8-117 277-403 (415) 231 TIGR00506 ribB 3,4-dihydroxy-2 37.9 14 0.00036 17.3 1.0 102 32-169 98-220 (230) 232 PRK07998 gatY putative fructos 37.6 29 0.00074 15.0 2.8 121 21-173 9-135 (283) 233 pfam10281 Ish1 Putative stress 37.6 26 0.00065 15.4 2.2 20 154-173 5-24 (38) 234 cd03014 PRX_Atyp2cys Peroxired 37.5 26 0.00065 15.4 2.2 34 6-39 27-67 (143) 235 cd00947 TBP_aldolase_IIB Tagat 37.4 29 0.00074 15.0 3.3 115 28-173 10-130 (276) 236 PRK08063 enoyl-(acyl carrier p 37.4 29 0.00074 15.0 6.3 65 1-69 1-66 (250) 237 TIGR01235 pyruv_carbox pyruvat 37.2 29 0.00075 15.0 4.1 92 80-171 41-161 (1169) 238 PRK08610 fructose-bisphosphate 37.0 29 0.00075 15.0 2.7 33 28-63 15-47 (286) 239 COG0108 RibB 3,4-dihydroxy-2-b 36.7 30 0.00076 14.9 4.7 130 10-170 33-189 (203) 240 cd01536 PBP1_ABC_sugar_binding 36.6 30 0.00076 14.9 6.4 17 102-118 19-35 (267) 241 TIGR00789 flhB_rel FlhB domain 36.4 20 0.00052 16.1 1.6 42 155-210 29-70 (84) 242 PRK10217 dTDP-glucose 4,6-dehy 36.3 30 0.00077 14.9 7.2 73 5-79 1-73 (355) 243 cd06267 PBP1_LacI_sugar_bindin 36.3 30 0.00077 14.9 5.9 38 100-137 103-142 (264) 244 pfam01070 FMN_dh FMN-dependent 36.1 27 0.0007 15.2 2.2 174 48-261 78-255 (301) 245 PRK06849 hypothetical protein; 36.0 30 0.00078 14.9 8.9 130 6-175 4-137 (387) 246 PRK09483 response regulator; P 35.9 31 0.00078 14.9 6.7 113 49-167 89-208 (216) 247 PRK13304 L-aspartate dehydroge 35.8 31 0.00078 14.8 10.5 115 5-129 1-130 (265) 248 PRK13124 consensus 35.7 31 0.00079 14.8 4.2 127 3-170 5-142 (257) 249 cd01541 PBP1_AraR Ligand-bindi 35.6 31 0.00079 14.8 6.6 11 28-38 27-37 (273) 250 TIGR02025 BchH magnesium chela 35.4 17 0.00044 16.7 1.1 21 236-256 809-829 (1384) 251 cd00617 Tnase_like Tryptophana 35.2 31 0.0008 14.8 8.6 147 11-171 131-316 (431) 252 PRK10443 rihA ribonucleoside h 35.2 31 0.0008 14.8 6.8 39 1-42 1-39 (311) 253 cd02971 PRX_family Peroxiredox 35.1 31 0.0008 14.8 5.0 41 29-74 55-96 (140) 254 pfam01012 ETF Electron transfe 34.8 32 0.00081 14.7 7.6 79 21-126 23-101 (161) 255 TIGR02635 RhaI_grampos L-rhamn 34.7 32 0.0008 14.8 2.3 145 77-258 86-272 (382) 256 COG3459 Cellobiose phosphoryla 34.7 30 0.00077 14.9 2.2 19 113-131 227-245 (1056) 257 cd02969 PRX_like1 Peroxiredoxi 34.1 33 0.00083 14.7 6.1 68 6-74 26-105 (171) 258 PRK07094 biotin synthase; Prov 34.0 33 0.00083 14.6 6.4 23 351-373 279-301 (323) 259 cd06285 PBP1_LacI_like_7 Ligan 33.8 33 0.00084 14.6 5.9 17 102-118 19-35 (265) 260 cd06388 PBP1_iGluR_AMPA_GluR4 33.8 33 0.00084 14.6 9.5 51 97-148 108-158 (371) 261 PRK12337 2-phosphoglycerate ki 33.7 33 0.00084 14.6 2.6 15 184-198 312-326 (492) 262 pfam00289 CPSase_L_chain Carba 33.6 33 0.00085 14.6 5.2 49 95-146 56-104 (109) 263 cd06316 PBP1_ABC_sugar_binding 33.5 33 0.00085 14.6 6.0 141 12-165 36-184 (294) 264 cd01540 PBP1_arabinose_binding 33.3 34 0.00086 14.6 6.2 67 99-165 113-190 (289) 265 PRK10116 universal stress prot 33.2 34 0.00086 14.6 7.7 109 1-123 1-113 (142) 266 PRK06824 translation initiatio 33.1 34 0.00086 14.6 6.0 80 177-270 9-93 (118) 267 cd01575 PBP1_GntR Ligand-bindi 33.1 34 0.00086 14.6 6.2 111 27-138 26-143 (268) 268 TIGR02198 rfaE_dom_I rfaE bifu 33.0 27 0.00069 15.2 1.8 37 155-200 164-200 (321) 269 cd03466 Nitrogenase_NifN_2 Nit 32.9 34 0.00087 14.5 9.7 11 160-170 175-185 (429) 270 PRK13394 3-hydroxybutyrate deh 32.9 34 0.00087 14.5 7.2 32 155-191 169-200 (262) 271 PRK09358 adenosine deaminase; 32.8 34 0.00087 14.5 8.8 20 12-32 18-37 (333) 272 PRK10565 hypothetical protein; 32.7 34 0.00087 14.5 3.5 116 5-135 60-192 (515) 273 PRK05301 pyrroloquinoline quin 32.7 34 0.00087 14.5 6.4 120 45-168 51-185 (375) 274 KOG2282 consensus 32.5 23 0.00059 15.7 1.3 100 33-148 127-230 (708) 275 cd03375 TPP_OGFOR Thiamine pyr 32.3 26 0.00065 15.4 1.5 37 134-171 63-101 (193) 276 pfam08546 ApbA_C Ketopantoate 32.2 21 0.00054 16.0 1.1 99 65-173 1-115 (125) 277 TIGR02090 LEU1_arch isopropylm 31.8 23 0.0006 15.7 1.3 16 159-174 120-135 (371) 278 PRK06940 short chain dehydroge 31.7 36 0.00091 14.4 7.2 58 6-69 6-64 (277) 279 PRK13566 anthranilate synthase 31.7 36 0.00091 14.4 7.8 60 102-169 537-598 (724) 280 pfam02634 FdhD-NarQ FdhD/NarQ 31.5 36 0.00091 14.4 7.6 21 8-28 16-36 (235) 281 COG1794 RacX Aspartate racemas 31.5 36 0.00091 14.4 6.3 17 105-121 181-197 (230) 282 PRK11908 NAD-dependent epimera 31.4 36 0.00091 14.4 8.9 151 5-172 1-174 (347) 283 pfam03853 YjeF_N YjeF-related 31.3 36 0.00092 14.3 4.7 53 6-59 29-86 (170) 284 COG4943 Predicted signal trans 31.2 36 0.00092 14.3 7.4 29 1-29 1-29 (524) 285 PRK10867 signal recognition pa 31.0 36 0.00093 14.3 8.2 94 23-144 122-219 (453) 286 PRK12738 kbaY tagatose-bisphos 31.0 36 0.00093 14.3 2.7 122 20-173 8-135 (286) 287 PRK10936 periplasmic sensory p 30.9 37 0.00093 14.3 4.7 46 102-147 65-112 (340) 288 cd06321 PBP1_ABC_sugar_binding 30.7 37 0.00094 14.3 6.2 65 102-169 19-85 (271) 289 cd07200 cPLA2_Grp-IVA Group IV 30.7 37 0.00094 14.3 4.1 12 4-15 42-53 (505) 290 COG3867 Arabinogalactan endo-1 30.5 27 0.00068 15.3 1.4 199 46-261 106-338 (403) 291 cd06292 PBP1_LacI_like_10 Liga 30.3 37 0.00095 14.2 6.6 110 27-138 26-149 (273) 292 PRK07105 pyridoxamine kinase; 30.3 37 0.00096 14.2 8.8 126 3-148 2-145 (284) 293 cd06273 PBP1_GntR_like_1 This 29.9 38 0.00097 14.2 6.0 113 23-138 23-144 (268) 294 PRK12737 gatY tagatose-bisphos 29.9 38 0.00097 14.2 2.8 120 22-173 10-135 (284) 295 PRK12361 hypothetical protein; 29.8 38 0.00097 14.2 3.1 88 6-99 242-338 (546) 296 COG3155 ElbB Uncharacterized p 29.8 38 0.00097 14.2 4.0 45 5-49 1-52 (217) 297 cd03018 PRX_AhpE_like Peroxire 29.8 38 0.00097 14.2 3.9 27 136-163 119-145 (149) 298 cd05566 PTS_IIB_galactitol PTS 29.4 39 0.00099 14.1 3.7 35 6-41 1-40 (89) 299 TIGR03569 NeuB_NnaB N-acetylne 29.3 39 0.00099 14.1 4.7 216 36-300 3-229 (329) 300 TIGR01463 mtaA_cmuA methyltran 29.0 39 0.001 14.1 2.4 20 102-121 228-247 (350) 301 PRK07945 hypothetical protein; 28.8 39 0.001 14.1 6.1 102 11-123 105-231 (335) 302 PRK06361 hypothetical protein; 28.7 40 0.001 14.1 4.7 118 104-265 82-203 (216) 303 PRK07313 phosphopantothenoylcy 28.6 40 0.001 14.0 4.7 133 6-169 3-147 (180) 304 PRK02621 consensus 28.5 40 0.001 14.0 7.2 23 45-67 31-53 (254) 305 pfam11116 DUF2624 Protein of u 28.1 41 0.001 14.0 2.0 21 151-172 14-34 (84) 306 cd03332 LMO_FMN L-Lactate 2-mo 27.9 41 0.001 14.0 6.2 33 229-261 307-339 (383) 307 COG1363 FrvX Cellulase M and r 27.8 41 0.001 13.9 2.3 38 154-193 271-308 (355) 308 TIGR03106 trio_M42_hydro hydro 27.6 41 0.0011 13.9 2.6 19 243-261 266-284 (343) 309 PTZ00110 helicase; Provisional 27.6 28 0.0007 15.2 1.1 149 10-170 113-287 (602) 310 COG0604 Qor NADPH:quinone redu 27.5 42 0.0011 13.9 5.8 13 30-42 62-74 (326) 311 pfam01116 F_bP_aldolase Fructo 27.4 42 0.0011 13.9 6.6 121 21-173 7-133 (283) 312 KOG0625 consensus 27.4 42 0.0011 13.9 3.5 22 5-26 83-104 (558) 313 KOG1198 consensus 27.2 42 0.0011 13.9 8.0 25 324-348 321-345 (347) 314 PRK13982 bifunctional SbtC-lik 27.0 42 0.0011 13.8 5.0 37 2-39 68-107 (476) 315 TIGR00406 prmA ribosomal prote 27.0 42 0.0011 13.8 3.7 59 9-71 170-231 (330) 316 PTZ00098 phosphoethanolamine N 26.9 43 0.0011 13.8 6.2 149 23-197 68-217 (263) 317 PTZ00308 ethanolamine-phosphat 26.8 43 0.0011 13.8 2.5 175 105-332 88-277 (353) 318 cd02922 FCB2_FMN Flavocytochro 26.8 43 0.0011 13.8 3.6 33 228-260 269-301 (344) 319 PRK00522 tpx thiol peroxidase; 26.7 43 0.0011 13.8 5.1 35 6-40 46-87 (168) 320 pfam01177 Asp_Glu_race Asp/Glu 26.5 43 0.0011 13.8 8.3 15 51-66 58-72 (212) 321 PRK06701 short chain dehydroge 26.5 43 0.0011 13.8 7.4 56 9-68 50-106 (289) 322 TIGR00434 cysH phosophoadenyly 26.5 43 0.0011 13.8 3.6 168 7-202 15-205 (226) 323 TIGR02812 fadR_gamma fatty aci 26.5 29 0.00074 15.0 1.0 54 68-126 8-72 (275) 324 TIGR01830 3oxo_ACP_reduc 3-oxo 26.4 43 0.0011 13.8 7.4 58 7-66 24-83 (238) 325 PTZ00124 adenosine deaminase; 26.3 44 0.0011 13.8 6.3 38 227-264 298-337 (362) 326 PRK10490 sensor protein KdpD; 26.2 44 0.0011 13.7 9.7 112 5-118 250-374 (895) 327 pfam00582 Usp Universal stress 25.8 44 0.0011 13.7 6.3 106 6-127 3-117 (140) 328 PRK08227 aldolase; Validated 25.8 45 0.0011 13.7 5.3 212 32-298 50-278 (291) 329 PRK09856 fructoselysine 3-epim 25.7 45 0.0011 13.7 8.0 117 20-173 17-149 (276) 330 PRK07985 oxidoreductase; Provi 25.7 45 0.0011 13.7 6.8 58 8-68 53-111 (294) 331 COG3962 Acetolactate synthase 25.5 45 0.0011 13.7 2.7 42 154-195 245-288 (617) 332 PRK10754 quinone oxidoreductas 25.4 45 0.0011 13.6 6.2 46 113-169 142-188 (327) 333 cd02649 nuc_hydro_CeIAG nuc_hy 25.4 45 0.0012 13.6 6.8 37 6-42 1-37 (306) 334 pfam04309 G3P_antiterm Glycero 25.4 45 0.0012 13.6 5.3 98 75-191 11-110 (174) 335 cd00439 Transaldolase Transald 25.1 32 0.00083 14.7 1.0 132 138-338 115-248 (252) 336 PRK11548 hypothetical protein; 25.1 46 0.0012 13.6 2.2 30 211-240 79-109 (113) 337 cd01467 vWA_BatA_type VWA BatA 24.9 46 0.0012 13.6 7.7 15 155-169 121-135 (180) 338 cd06299 PBP1_LacI_like_13 Liga 24.9 46 0.0012 13.6 6.3 110 27-137 26-142 (265) 339 PRK08185 hypothetical protein; 24.8 46 0.0012 13.6 2.9 115 27-173 9-129 (283) 340 pfam02353 CMAS Cyclopropane-fa 24.7 46 0.0012 13.6 5.0 55 9-68 66-123 (273) 341 CHL00059 atpA ATP synthase CF1 24.6 47 0.0012 13.5 6.0 106 9-137 168-274 (501) 342 COG2065 PyrR Pyrimidine operon 24.5 21 0.00053 16.1 -0.1 24 267-293 34-57 (179) 343 PRK00910 ribB 3,4-dihydroxy-2- 24.4 47 0.0012 13.5 4.5 29 141-169 170-200 (218) 344 cd06282 PBP1_GntR_like_2 Ligan 24.3 47 0.0012 13.5 6.4 40 98-137 101-143 (266) 345 smart00532 LIGANc Ligase N fam 24.3 47 0.0012 13.5 3.4 60 201-264 299-369 (441) 346 COG1089 Gmd GDP-D-mannose dehy 24.2 47 0.0012 13.5 4.3 106 7-113 4-121 (345) 347 PRK12815 carB carbamoyl phosph 24.2 47 0.0012 13.5 10.9 149 4-173 6-170 (1068) 348 PRK11634 ATP-dependent RNA hel 24.0 48 0.0012 13.5 3.8 57 9-71 106-162 (629) 349 TIGR02631 xylA_Arthro xylose i 24.0 48 0.0012 13.5 1.8 153 13-215 29-198 (390) 350 PRK11361 acetoacetate metaboli 23.9 48 0.0012 13.4 8.5 113 17-137 62-191 (457) 351 TIGR03461 pabC_Proteo aminodeo 23.8 46 0.0012 13.6 1.6 32 140-171 182-216 (261) 352 PRK06932 glycerate dehydrogena 23.8 48 0.0012 13.4 1.8 10 161-170 83-92 (314) 353 TIGR01862 N2-ase-Ialpha nitrog 23.8 48 0.0012 13.4 3.6 147 45-210 287-452 (510) 354 PRK07709 fructose-bisphosphate 23.6 49 0.0012 13.4 2.7 33 28-63 15-47 (285) 355 TIGR02080 O_succ_thio_ly O-suc 23.6 49 0.0012 13.4 2.5 122 95-243 51-180 (383) 356 KOG1023 consensus 23.5 32 0.00082 14.7 0.8 20 145-164 156-176 (484) 357 LOAD_USPA consensus 23.5 49 0.0012 13.4 9.1 104 6-125 1-110 (135) 358 PRK06522 2-dehydropantoate 2-r 23.4 49 0.0012 13.4 4.2 30 7-38 2-31 (307) 359 KOG4318 consensus 23.2 14 0.00035 17.4 -1.2 96 155-253 608-708 (1088) 360 PRK08594 enoyl-(acyl carrier p 23.0 50 0.0013 13.3 6.3 64 1-69 1-70 (256) 361 KOG1502 consensus 23.0 50 0.0013 13.3 6.9 73 4-81 5-78 (327) 362 PTZ00185 ATPase alpha subunit; 22.8 50 0.0013 13.3 5.6 48 17-64 202-258 (574) 363 PRK05778 2-oxoglutarate ferred 22.8 50 0.0013 13.3 2.8 68 102-169 157-231 (306) 364 cd03080 GST_N_Metaxin_like GST 22.6 51 0.0013 13.3 1.7 22 233-254 53-75 (75) 365 cd01981 Pchlide_reductase_B Pc 22.4 51 0.0013 13.3 8.7 29 225-253 296-325 (430) 366 PRK07315 fructose-bisphosphate 22.4 51 0.0013 13.3 3.1 34 27-63 14-47 (293) 367 pfam09255 Antig_Caf1 Caf1 Caps 22.3 51 0.0013 13.2 2.2 15 220-234 3-17 (136) 368 PRK03352 DNA polymerase IV; Va 22.3 24 0.00062 15.6 -0.0 122 62-212 8-137 (345) 369 cd01477 vWA_F09G8-8_type VWA F 22.2 52 0.0013 13.2 6.6 24 104-127 153-176 (193) 370 TIGR02824 quinone_pig3 putativ 22.2 52 0.0013 13.2 5.0 49 13-71 154-203 (334) 371 pfam00926 DHBP_synthase 3,4-di 22.0 52 0.0013 13.2 4.4 32 139-170 151-184 (193) 372 PRK07454 short chain dehydroge 22.0 52 0.0013 13.2 8.4 187 1-209 1-216 (241) 373 pfam06342 DUF1057 Alpha/beta h 21.9 34 0.00086 14.5 0.6 33 79-128 19-52 (297) 374 PRK11705 cyclopropane fatty ac 21.7 53 0.0013 13.2 3.9 48 301-350 321-372 (383) 375 TIGR02855 spore_yabG sporulati 21.7 28 0.00072 15.1 0.2 43 102-145 127-169 (292) 376 PRK09376 rho transcription ter 21.7 27 0.00069 15.2 0.1 31 43-74 9-39 (416) 377 PRK07369 dihydroorotase; Provi 21.6 53 0.0014 13.1 8.5 53 308-369 335-387 (419) 378 PRK12937 short chain dehydroge 21.5 53 0.0014 13.1 7.3 65 1-69 1-67 (245) 379 PRK06128 oxidoreductase; Provi 21.4 53 0.0014 13.1 7.0 65 8-76 59-126 (300) 380 TIGR02146 LysS_fung_arch homoc 21.4 46 0.0012 13.6 1.2 19 155-173 116-136 (355) 381 TIGR00329 gcp metalloendopepti 21.4 33 0.00084 14.6 0.5 48 6-57 133-180 (337) 382 COG4789 EscV Type III secretor 21.4 54 0.0014 13.1 5.9 72 270-341 471-542 (689) 383 TIGR00584 mug mismatch-specifi 21.4 32 0.00083 14.7 0.4 78 207-287 134-233 (347) 384 PRK05764 aspartate aminotransf 21.3 54 0.0014 13.1 6.1 143 27-170 27-201 (389) 385 TIGR01367 pyrE_Therm orotate p 21.2 52 0.0013 13.2 1.5 27 80-106 8-36 (205) 386 smart00045 DAGKa Diacylglycero 21.2 54 0.0014 13.1 3.1 21 219-239 138-160 (160) 387 TIGR01520 FruBisAldo_II_A fruc 21.1 47 0.0012 13.5 1.2 23 100-122 42-64 (365) 388 TIGR00609 recB exodeoxyribonuc 21.1 54 0.0014 13.1 3.9 67 282-351 1070-1142(1324) 389 PRK08936 glucose-1-dehydrogena 21.0 55 0.0014 13.1 7.4 58 8-69 11-69 (261) 390 cd06306 PBP1_TorT-like TorT-li 21.0 55 0.0014 13.1 6.1 46 102-147 19-66 (268) 391 cd01518 RHOD_YceA Member of th 21.0 55 0.0014 13.1 4.3 16 58-75 15-30 (101) 392 pfam04373 DUF511 Protein of un 21.0 55 0.0014 13.1 2.9 129 98-266 117-247 (310) 393 COG3414 SgaB Phosphotransferas 20.9 55 0.0014 13.0 2.5 35 5-40 1-42 (93) 394 COG0615 TagD Cytidylyltransfer 20.8 55 0.0014 13.0 4.6 58 50-119 21-93 (140) 395 TIGR01963 PHB_DH 3-hydroxybuty 20.8 55 0.0014 13.0 3.4 128 26-172 47-209 (258) 396 COG0794 GutQ Predicted sugar p 20.8 55 0.0014 13.0 4.1 65 98-181 51-126 (202) 397 COG3473 Maleate cis-trans isom 20.7 55 0.0014 13.0 3.1 84 140-255 120-205 (238) 398 TIGR01935 NOT-MenG RraA family 20.5 30 0.00076 14.9 0.1 72 43-140 43-116 (155) 399 PRK12384 sorbitol-6-phosphate 20.5 56 0.0014 13.0 4.6 58 7-69 5-65 (259) 400 COG2205 KdpD Osmosensitive K+ 20.4 56 0.0014 13.0 9.5 60 6-66 249-319 (890) 401 TIGR02627 rhamnulo_kin rhamnul 20.3 56 0.0014 13.0 1.9 19 155-173 346-364 (460) 402 pfam00148 Oxidored_nitro Nitro 20.3 56 0.0014 13.0 9.4 12 158-169 164-175 (398) 403 PRK08195 4-hydroxy-2-ketovaler 20.2 56 0.0014 12.9 6.6 16 201-216 170-185 (337) 404 COG1013 PorB Pyruvate:ferredox 20.2 56 0.0014 12.9 3.1 47 103-149 159-207 (294) 405 COG0452 Dfp Phosphopantothenoy 20.2 57 0.0014 12.9 4.6 34 5-38 4-39 (392) 406 PRK07424 bifunctional sterol d 20.0 57 0.0015 12.9 4.3 29 8-38 184-212 (410) No 1 >PRK00509 argininosuccinate synthase; Provisional Probab=100.00 E-value=0 Score=1200.93 Aligned_cols=396 Identities=62% Similarity=1.025 Sum_probs=390.6 Q ss_pred CCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCC Q ss_conf 77899991477259999999987189869999945787110689999999739807998200899999879999736865 Q gi|254780234|r 5 VKKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANAL 84 (404) Q Consensus 5 ~kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~ 84 (404) |||||||||||||||||++||+++||+||||||+|+||++|++.+++||+++||++++++|+|+||+++||||+|+|||+ T Consensus 2 ~kKVvLAySGGLDTSv~l~wL~e~yg~eVia~~~d~Gq~~d~~~i~~kA~~~GA~~~~v~D~r~ef~~~~i~p~I~ana~ 81 (398) T PRK00509 2 VKKVVLAYSGGLDTSVIIKWLKEEYGCEVVAFTADVGQGEELEPIREKALASGASEIYVEDLREEFVRDYVFPAIRANAL 81 (398) T ss_pred CCEEEEEECCCCHHHHHHHHHHHHCCCEEEEEEEECCCHHHHHHHHHHHHHHCCCEEEEECCHHHHHHHHHHHHHHCCCC T ss_conf 85499990898459999999987539889999997988778899999999829978999630999999999999960854 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHH Q ss_conf 47874201301243359999999885689287500111682156799999985879707744100687637999999997 Q gi|254780234|r 85 YEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDLIDFAEK 164 (404) Q Consensus 85 Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~i~ya~~ 164 (404) |||+|||+|||+|||||+++|++|+++||++||||||||||||||||+++++|+|+++||||||||+|.||+++++||++ T Consensus 82 Yeg~YpL~tslaRplia~~~ve~A~~~ga~aiaHG~TGkGNDQvRFe~~~~aL~P~l~iiaP~Rd~~~~sR~~~i~ya~~ 161 (398) T PRK00509 82 YEGKYPLGTALARPLIAKKLVEIARKEGADAVAHGCTGKGNDQVRFELAIAALAPDLKVIAPWREWDLMSREELIAYAEE 161 (398) T ss_pred CCCCEEEECCCHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCHHHHHHHHHHHCCCCEEEEEEECCCCCCHHHHHHHHHH T ss_conf 55854552203188999999999997198599854556786089999999985999768611321341689999999998 Q ss_pred HCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCCHHHCCCCCCHHHCCCCCEEEEEEEECCEEEEECCEECCHHH Q ss_conf 08967566567886415724311545763245845768201001465812279996389999950247875162318999 Q gi|254780234|r 165 HAIPIDKNKRGEAPFSIDTNLLHSSSEGRVLEDPSQPAPEYVYKMIVSPEEAPDTPTTIRIDFQRGDPIAINGQVMSPEV 244 (404) Q Consensus 165 ~gIpv~~~~~~~~~yS~D~Nlwg~S~Egg~Ledp~~~~pe~~~~~t~~p~~ap~~pe~v~I~Fe~G~PVainG~~~~~~~ 244 (404) |||||+++++ +|||+|+||||+|+|||+|||||++||+++|.||++|+++||+|++|+|+|++|+||+|||++|+|++ T Consensus 162 ~gIpv~~~~~--~~yS~D~NLwg~S~Egg~Ledp~~~ppe~~~~~t~~p~~ap~~pe~i~I~Fe~G~Pv~vnG~~~~~~~ 239 (398) T PRK00509 162 HGIPIPVTKK--SPYSIDANLWHRSIEGGVLEDPWNEPPEDVYEWTVSPEDAPDEPEYVEIEFEKGVPVAINGEKLSPAE 239 (398) T ss_pred CCCCCCCCCC--CCCCCCCCEEEECCCCCCCCCCCCCCCHHHHHCCCCHHHCCCCCEEEEEEEECCEEEECCCCCCCHHH T ss_conf 6998887778--87302566132101567546865345065753569977889997599999854500450795379899 Q ss_pred HHHHHHHHHHHCCCCCEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHCCCCCCH Q ss_conf 99999888642163741000773202100011137378999999999986508999999999999999998753514698 Q gi|254780234|r 245 LLEQLNQYGRCNGIGRIDIVENRFVGIKSRGVYETPGGTILLHAHRAIESISLDSGSAHLKDDLMSRYASIVYQGFWFSP 324 (404) Q Consensus 245 li~~LN~igg~~GvGr~d~vEnr~vG~KsR~vYEaPga~iL~~Ahr~LE~~~l~~~~~~~K~~~~~~~a~lvy~G~wf~p 324 (404) ||++||+|||+|||||+||||||+|||||||||||||++||++|||+||++||||+++++|++++++|++|||+|+|||| T Consensus 240 li~~LN~i~g~~GIGR~d~vEnR~vGiKsRgvYEaPaa~iL~~AHr~LE~~~Ldr~~~~~K~~l~~~~a~lvY~G~wfsP 319 (398) T PRK00509 240 LIEKLNELAGKHGIGRIDIVENRLVGIKSREVYETPGGTILIKAHRALESLTLDREVAHFKDELEPKYAELVYNGLWFSP 319 (398) T ss_pred HHHHHHHHHHHCCCCEEEEECCEEEEEEEEEEEECCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHCCCCCCH T ss_conf 99999988765462705542432232321004516548899999999999858799999999999999999863475788 Q ss_pred HHHHHHHHHHHHHCCCCEEEEEEEECCEEEEEEEECCCCCCCHHHCCCCCCCCCCCHHHHHHHHHHCCHHHHHHHHCCC Q ss_conf 9999999999972336739999990615999877278310883222301577878887854788870334898884178 Q gi|254780234|r 325 EREMLQALIDKSQEYVEGSITLKLYKGNVMVVGRESNKSLYSDKLVTFEDDLDAYNQSDSEGFIKLHALRLRTLARRNK 403 (404) Q Consensus 325 ~~~~l~a~i~~~q~~VtG~V~l~L~kG~~~v~gr~S~~sLy~~~~as~~~~~~~~~~~da~GFi~i~~l~~~~~~~~~k 403 (404) +|++|++||+++|++|||+|+|+|||||++++||+||+||||+++|||+++ +.|||+||+|||+|+||+++++++++| T Consensus 320 ~~~~l~afi~~sQ~~VtG~V~lkLykG~~~v~gr~S~~sLY~~~lasy~~~-~~~~~~da~GFI~i~~l~~k~~~~~~~ 397 (398) T PRK00509 320 LREALQAFIDETQEHVTGEVRLKLYKGNAIVVGRKSPNSLYDEDLATYEED-DVYDQKDAEGFIKLWGLPSRIAALRNK 397 (398) T ss_pred HHHHHHHHHHHHHCCCEEEEEEEEECCEEEEEEEECCHHHCCCCCCCCCCC-CCCCHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 899999999985191169999999787589998638724337635624478-987968826889980389999998628 No 2 >TIGR00032 argG argininosuccinate synthase; InterPro: IPR001518 Argininosuccinate synthase (6.3.4.5 from EC) (AS) is a urea cycle enzyme that catalyzes the penultimate step in arginine biosynthesis: the ATP-dependent ligation of citrulline to aspartate to form argininosuccinate, AMP and pyrophosphate , . In humans, a defect in the AS gene causes citrullinemia, a genetic disease characterised by severe vomiting spells and mental retardation. AS is a homotetrameric enzyme of chains of about 400 amino-acid residues. An arginine seems to be important for the enzyme's catalytic mechanism. The sequences of AS from various prokaryotes, archaebacteria and eukaryotes show significant similarity.; GO: 0004055 argininosuccinate synthase activity, 0005524 ATP binding, 0006526 arginine biosynthetic process. Probab=100.00 E-value=0 Score=1188.52 Aligned_cols=395 Identities=46% Similarity=0.844 Sum_probs=383.4 Q ss_pred EEEEEECCCHHHHHHHHHHHHHCCC-EEEEEEEECCCC--CHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCC Q ss_conf 8999914772599999999871898-699999457871--1068999999973980799820089999987999973686 Q gi|254780234|r 7 KVVLAYSGGLDTSIILKWLQVEKGL-EVIVFIADLGQG--EELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANA 83 (404) Q Consensus 7 kVvlaySGGLDTSv~i~~L~~e~g~-eVi~~~~d~Gq~--~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana 83 (404) ||||||||||||||||+||+++||+ +|||+||||||| +|++.+++||+++||.+|+++|+|+|||+||+|++||||| T Consensus 1 KVvLAySGGLDTSv~l~wL~~kyG~~~Via~~~dvGQPDE~d~~~~~~kA~~~GA~~~~~iDak~eFv~dy~f~aiqanA 80 (420) T TIGR00032 1 KVVLAYSGGLDTSVCLKWLEEKYGYEEVIAVTADVGQPDEEDIDAIEEKALKYGAEKHYTIDAKEEFVKDYLFAAIQANA 80 (420) T ss_pred CEEEEEECCHHHHHHHHHHHHHHCCCCEEEEEEECCCCCHHHHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHCCC T ss_conf 96888607025999999988652898607999756799988888899998873671025772368899999888765066 Q ss_pred CCCCC----CCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCC----CH Q ss_conf 54787----42013012433599999998856892875001116821567999999858797077441006876----37 Q gi|254780234|r 84 LYEGY----YLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFK----GR 155 (404) Q Consensus 84 ~Yeg~----Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~----sR 155 (404) +|||+ |||+|||||||||+++|++|++.||++||||||||||||+|||.++++++|+|+||||||||+|+ || T Consensus 81 ~Ye~~GG~~Y~L~TaLaRPlIA~~lVe~Ak~~Ga~AvaHGCTGKGNDQ~RFe~~~~~~~p~LkviaP~R~~~~~e~lgGR 160 (420) T TIGR00032 81 VYEGTGGLVYPLSTALARPLIAKKLVEVAKKEGAEAVAHGCTGKGNDQVRFERSIRALNPDLKVIAPWRDLNLTEELGGR 160 (420) T ss_pred EECCCCCEECCCCCCCHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCHHHHHHHHHHHCCCCEEECCCCCHHHHHHHCCC T ss_conf 03178741034766114579999999998466954987076877741357658887526898587473464677640594 Q ss_pred HHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCC-CCCCCHHHCCCCCCHHHCC-CCCEEEEEEEECCEEE Q ss_conf 9999999970896756656788641572431154576324584-5768201001465812279-9963899999502478 Q gi|254780234|r 156 QDLIDFAEKHAIPIDKNKRGEAPFSIDTNLLHSSSEGRVLEDP-SQPAPEYVYKMIVSPEEAP-DTPTTIRIDFQRGDPI 233 (404) Q Consensus 156 e~~i~ya~~~gIpv~~~~~~~~~yS~D~Nlwg~S~Egg~Ledp-~~~~pe~~~~~t~~p~~ap-~~pe~v~I~Fe~G~PV 233 (404) +|+|+|++++|||+|.++ +||||||+||||||+|||+|||| +..||+++|.||+.|.+.| |+||.|+|+||+|+|| T Consensus 161 ~e~~eYa~~~Gip~p~~~--~K~YSiD~Nl~grs~Ea~~LEdP~~~~Ppe~~y~~t~~p~~~~~d~PE~v~I~Fe~GvPV 238 (420) T TIGR00032 161 EEEIEYAAQKGIPVPMTK--EKPYSIDENLWGRSIEAGILEDPSDTEPPEDIYELTKFPIEATPDEPEVVEIDFEKGVPV 238 (420) T ss_pred HHHHHHHHHCCCCCCCCC--CCCCCHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHCCHHHCCCCCCCEEEEEECCCCCE T ss_conf 889999996688876546--777762366775555055357887888814578863073346799971688862378532 Q ss_pred EEC--------CEECC-HHHHHHHHHHHHHHCCCCCEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHH Q ss_conf 751--------62318-999999998886421637410007732021000111373789999999999865089999999 Q gi|254780234|r 234 AIN--------GQVMS-PEVLLEQLNQYGRCNGIGRIDIVENRFVGIKSRGVYETPGGTILLHAHRAIESISLDSGSAHL 304 (404) Q Consensus 234 ain--------G~~~~-~~~li~~LN~igg~~GvGr~d~vEnr~vG~KsR~vYEaPga~iL~~Ahr~LE~~~l~~~~~~~ 304 (404) ||| |+.++ |++||.+||+|||+|||||+||||||+||+|||||||+|||++|++|||+||++||++++++| T Consensus 239 aLnytdnktskg~~~~~~veLi~~aNeiaG~HGvGR~D~iEdR~ig~KSRe~YE~Pg~~~L~~AH~~Le~L~l~~~~~~f 318 (420) T TIGR00032 239 ALNYTDNKTSKGVSLEKPVELILKANEIAGKHGVGRIDIIEDRIIGLKSREIYEAPGATLLIKAHRDLEQLTLTRDVLRF 318 (420) T ss_pred EECCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCEEEEEECEECCCCCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHH T ss_conf 30155464437733063789999999851007886431533034313245401015799999999998611111899999 Q ss_pred HH-HHHHHHHHHHHCCCCCCHHHHHH-HHHHHHHHCCCCEEEEEEEECCEEEEEEEE-CCCCCCCHHHCCCCCCCCCCCH Q ss_conf 99-99999999875351469899999-999999723367399999906159998772-7831088322230157787888 Q gi|254780234|r 305 KD-DLMSRYASIVYQGFWFSPEREML-QALIDKSQEYVEGSITLKLYKGNVMVVGRE-SNKSLYSDKLVTFEDDLDAYNQ 381 (404) Q Consensus 305 K~-~~~~~~a~lvy~G~wf~p~~~~l-~a~i~~~q~~VtG~V~l~L~kG~~~v~gr~-S~~sLy~~~~as~~~~~~~~~~ 381 (404) |+ .++++||+|||+|+||||++++| ++||+++|++|||+|||||||||+.++||+ ||++||+++++||+.+.+.||| T Consensus 319 ~~s~~~~~y~~LiY~GlwfdPl~e~LR~a~i~~~Q~~VTG~VrVkL~kG~~~v~gR~~S~y~lY~~e~~s~~ke~~~fd~ 398 (420) T TIGR00032 319 KDSIVEEKYSELIYNGLWFDPLKEALRDAFIRKTQERVTGTVRVKLFKGNAIVIGRKTSPYSLYDEELVSFDKEDKEFDP 398 (420) T ss_pred HHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCEEEEEEECCCEEEEECCCCCCCCCCHHHCCCCCCCCCCCH T ss_conf 88641157899986201478078999999988877400755999998281799732578610031022123557555697 Q ss_pred HHHHHHHHHCCHHHHHHHHCCC Q ss_conf 7854788870334898884178 Q gi|254780234|r 382 SDSEGFIKLHALRLRTLARRNK 403 (404) Q Consensus 382 ~da~GFi~i~~l~~~~~~~~~k 403 (404) .|+.|||++|+|+.+.|.++.| T Consensus 399 ~~~~Gf~~~~gL~~~~~~~~~~ 420 (420) T TIGR00032 399 KDAIGFIAIRGLQIKVYREKKK 420 (420) T ss_pred HHHHHHHHHHCCCHHHHHHHCC T ss_conf 8999999982465234644319 No 3 >pfam00764 Arginosuc_synth Arginosuccinate synthase. This family contains a PP-loop motif. Probab=100.00 E-value=0 Score=1173.24 Aligned_cols=388 Identities=50% Similarity=0.869 Sum_probs=381.0 Q ss_pred EEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCC-CHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 99914772599999999871898699999457871-10689999999739807998200899999879999736865478 Q gi|254780234|r 9 VLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQG-EELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYEG 87 (404) Q Consensus 9 vlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~-~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Yeg 87 (404) |||||||||||||++||+++ ++|||||++|+||+ +|++.+++||.++||++++++|+|+||+++||||+|++||+||| T Consensus 1 VLAySGGLDTSv~l~wL~e~-~~eVia~~~d~Gq~~ed~~~i~~kA~~~GA~~~~v~D~r~ef~~~~i~p~i~ana~Yeg 79 (389) T pfam00764 1 VLAYSGGLDTSVCIPWLKEK-YYEVIAVAVDVGQGEEDLDEAREKALKLGAVKHYVIDAKEEFVEDYIFPAIKANALYEG 79 (389) T ss_pred CEEECCCCHHHHHHHHHHHC-CCEEEEEEEECCCCHHHHHHHHHHHHHHCCCEEEEECCHHHHHHHHHHHHHHCCCCCCC T ss_conf 96036762899999999970-99289999979997787899999999829978999737999999989999966867446 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHCC Q ss_conf 74201301243359999999885689287500111682156799999985879707744100687637999999997089 Q gi|254780234|r 88 YYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDLIDFAEKHAI 167 (404) Q Consensus 88 ~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~i~ya~~~gI 167 (404) +|||+|||+|||||+++|++|+++||++||||||||||||||||+++++|+|+++||||||||+| +|+++++||++||| T Consensus 80 ~Y~L~tslaRplia~~~v~~A~~~ga~aiaHG~TGkGNDQvRFe~~~~aL~P~l~iiaP~R~~~~-sR~~~i~ya~~~gI 158 (389) T pfam00764 80 RYPLGTALARPLIAKKLVEAAKKEGADAVAHGCTGKGNDQVRFELSIRSLAPDLKVIAPVRDLNL-TREEEIEYAKEKGI 158 (389) T ss_pred CEEECCCCHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCHHHHHHHHHHHCCCCEEECCCCCCCC-CHHHHHHHHHHCCC T ss_conf 47852432166999999999876397699436667997453547889974999758545245320-57999999998599 Q ss_pred CCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCCHHHCCCCCCHHHCCCCCEEEEEEEECCEEEEECCEECCHHHHHH Q ss_conf 67566567886415724311545763245845768201001465812279996389999950247875162318999999 Q gi|254780234|r 168 PIDKNKRGEAPFSIDTNLLHSSSEGRVLEDPSQPAPEYVYKMIVSPEEAPDTPTTIRIDFQRGDPIAINGQVMSPEVLLE 247 (404) Q Consensus 168 pv~~~~~~~~~yS~D~Nlwg~S~Egg~Ledp~~~~pe~~~~~t~~p~~ap~~pe~v~I~Fe~G~PVainG~~~~~~~li~ 247 (404) |++++++ +|||+|+||||+|+|||+|||||++||+++|.||++|+++|++|++|+|+|++|+||+|||++|+|++||+ T Consensus 159 ~v~~~~~--~pyS~D~NLwg~S~Egg~Ledp~~~ppe~~~~~t~~p~~ap~~pe~v~I~Fe~G~PVainG~~~~~~eli~ 236 (389) T pfam00764 159 PIPVTKK--KPYSIDENLWGRSIEGGILEDPWNAPPEDIYEWTKDPAKAPDEPEVVEIGFEKGVPVSLDGEPVSVLELIK 236 (389) T ss_pred CCCCCCC--CCCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCHHHCCCCCEEEEEEEECCEEEEECCCCCCHHHHHH T ss_conf 8875688--87333545000122467557844345633000568956777787379999863268996797589999999 Q ss_pred HHHHHHHHCCCCCEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHH Q ss_conf 99888642163741000773202100011137378999999999986508999999999999999998753514698999 Q gi|254780234|r 248 QLNQYGRCNGIGRIDIVENRFVGIKSRGVYETPGGTILLHAHRAIESISLDSGSAHLKDDLMSRYASIVYQGFWFSPERE 327 (404) Q Consensus 248 ~LN~igg~~GvGr~d~vEnr~vG~KsR~vYEaPga~iL~~Ahr~LE~~~l~~~~~~~K~~~~~~~a~lvy~G~wf~p~~~ 327 (404) +||+|||+|||||+||||||+|||||||||||||++||++|||+||++||||+++++|++++++|++|||+|+||||+|+ T Consensus 237 ~LN~i~g~~GIGR~d~vEnR~vGiKsRgvYEaPgatiL~~AHr~LE~l~Ldr~~~~~K~~l~~~~a~lvY~G~wfsP~r~ 316 (389) T pfam00764 237 ELNEIAGAHGVGRIDIVEDRLVGLKSREIYEAPAATVLITAHRDLENLTLTREVLRFKRIVDQKYAELVYDGLWFSPLKE 316 (389) T ss_pred HHHHHHHHCEECCEEEECCCEEEEEECCEECCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHH T ss_conf 99998764645740155341000230303118169999999999999847698999999999999998755463788999 Q ss_pred HHHHHHHHHHCCCCEEEEEEEECCEEEEEEEECCCCCCCHHHCCCCCCCCCCCHHHHHHHHHHCCHHHHHHHHC Q ss_conf 99999999723367399999906159998772783108832223015778788878547888703348988841 Q gi|254780234|r 328 MLQALIDKSQEYVEGSITLKLYKGNVMVVGRESNKSLYSDKLVTFEDDLDAYNQSDSEGFIKLHALRLRTLARR 401 (404) Q Consensus 328 ~l~a~i~~~q~~VtG~V~l~L~kG~~~v~gr~S~~sLy~~~~as~~~~~~~~~~~da~GFi~i~~l~~~~~~~~ 401 (404) +|++||+++|++|||+|+|+|||||++++||+||+||||+++|||+++ +.|+|.||+|||+|+||++|+|+++ T Consensus 317 ~l~afi~~~q~~VtG~V~lkLykGn~~v~gr~S~~sLY~~~~as~~~~-~~~~~~da~GFi~i~~l~~k~~~~~ 389 (389) T pfam00764 317 ALDAFIDKTQERVTGTVRVKLYKGSARVLGRESPYSLYDEELVSYDEG-DTFDQTDATGFIKIHGLQAKIYARR 389 (389) T ss_pred HHHHHHHHHHCCCEEEEEEEEECCEEEEEEEECCCHHCCCCCCCCCCC-CCCCHHHHHHHHHHCCHHHHHHHCC T ss_conf 999999996092289999999798589987788600048613515589-9989588037799756888986249 No 4 >COG0137 ArgG Argininosuccinate synthase [Amino acid transport and metabolism] Probab=100.00 E-value=0 Score=1175.30 Aligned_cols=396 Identities=58% Similarity=0.965 Sum_probs=388.2 Q ss_pred CCCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCC-CHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCC Q ss_conf 7778999914772599999999871898699999457871-106899999997398079982008999998799997368 Q gi|254780234|r 4 DVKKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQG-EELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRAN 82 (404) Q Consensus 4 ~~kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~-~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~an 82 (404) .+|||||||||||||||||+||++++|+||||||+|+||+ +|++.+++||+++||++++++|+|+||+++||||+|++| T Consensus 3 ~~kkvvLAYSGGLDTSv~i~wL~e~~~~eVit~tadvGQ~eed~~~i~eKA~~~Ga~~~~viD~reeFv~~yi~~~i~an 82 (403) T COG0137 3 KVKKVVLAYSGGLDTSVAIKWLKEKGGAEVIAVTADVGQPEEDLDAIREKALELGAEEAYVIDAREEFVEDYIFPAIKAN 82 (403) T ss_pred CCCEEEEEECCCCCHHHHHHHHHHHCCCEEEEEEEECCCCHHHHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHH T ss_conf 77679999548823999999999765946999997589975775799999998188528996438999999999999730 Q ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHH Q ss_conf 65478742013012433599999998856892875001116821567999999858797077441006876379999999 Q gi|254780234|r 83 ALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDLIDFA 162 (404) Q Consensus 83 a~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~i~ya 162 (404) ++|||+|||+|||+|||||+++|++|++.||++||||||||||||||||+++++|+|+++||||||+|+|+ |+++|+|| T Consensus 83 a~Yeg~YpL~TalaRPLIak~lVe~A~k~Ga~avaHGcTGKGNDQvRFE~~~~al~pdlkiiAP~Rew~~~-R~~~i~Ya 161 (403) T COG0137 83 ALYEGVYPLGTALARPLIAKKLVEAAKKEGADAVAHGCTGKGNDQVRFELAILALNPDLKIIAPWREWNLT-REEEIEYA 161 (403) T ss_pred CEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCEEEEEEEHHHHCCCCEEEEEHHHHCCC-HHHHHHHH T ss_conf 51215641454346779999999999971996997467888875354320045418996798563341337-69999999 Q ss_pred HHHCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCCHHHCCCCCCHHHCCCCCEEEEEEEECCEEEEECCEECCH Q ss_conf 97089675665678864157243115457632458457682010014658122799963899999502478751623189 Q gi|254780234|r 163 EKHAIPIDKNKRGEAPFSIDTNLLHSSSEGRVLEDPSQPAPEYVYKMIVSPEEAPDTPTTIRIDFQRGDPIAINGQVMSP 242 (404) Q Consensus 163 ~~~gIpv~~~~~~~~~yS~D~Nlwg~S~Egg~Ledp~~~~pe~~~~~t~~p~~ap~~pe~v~I~Fe~G~PVainG~~~~~ 242 (404) ++|||||+.++ ++|||||+||||+|+|||+|||||++||+|+|.||++|+++|++|++|+|+|++|+||+|||++++| T Consensus 162 ~~~gipv~~~~--~kpySiD~Nlwg~S~Egg~LEdp~~~pped~~~~tv~p~~ap~~~e~v~I~Fe~G~PValnG~~~~~ 239 (403) T COG0137 162 EEHGIPVKATK--EKPYSIDENLWGRSIEGGDLEDPWNEPPEDAYEWTVSPEDAPDEPEEVEIGFEKGVPVALNGEKLSP 239 (403) T ss_pred HHCCCCCCCCC--CCCCCCCHHHHCCCCCCCCCCCCCCCCCCHHHHHCCCHHHCCCCCEEEEEEEECCEEEEECCEECCH T ss_conf 98499766567--8886444333234033664568776897158761378535899972899997577378875974889 Q ss_pred HHHHHHHHHHHHHCCCCCEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHH-HHHHHHHHHHCCCC Q ss_conf 99999998886421637410007732021000111373789999999999865089999999999-99999998753514 Q gi|254780234|r 243 EVLLEQLNQYGRCNGIGRIDIVENRFVGIKSRGVYETPGGTILLHAHRAIESISLDSGSAHLKDD-LMSRYASIVYQGFW 321 (404) Q Consensus 243 ~~li~~LN~igg~~GvGr~d~vEnr~vG~KsR~vYEaPga~iL~~Ahr~LE~~~l~~~~~~~K~~-~~~~~a~lvy~G~w 321 (404) ++||.+||+|||+|||||+||||||+||||||||||+|||++|++|||+||++||||++++||+. ++++|++|||+|+| T Consensus 240 ~~li~~lN~iag~hGvGRiD~vEnR~vG~KSRevYE~Pg~tiL~~AHr~LE~ltl~r~~~~~k~~~~~~~~a~lvY~GlW 319 (403) T COG0137 240 VELILELNEIAGRHGVGRIDIVENRLVGIKSREVYEAPGATILLTAHRALESLTLDREEAHFKDIELGPKYAELVYNGLW 319 (403) T ss_pred HHHHHHHHHHHHHCCCCCHHEEEHHEEEEECCHHCCCCHHHHHHHHHHHHHHHEECHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 99999999987445876031132200112021002482199999999999763406877764432217899999867873 Q ss_pred CCHHHHHHHHHHHHHHCCCCEEEEEEEECCEEEEEEEECCCCCCCHHHCCCCCCCCCCCHHHHHHHHHHCCHHHHHHHHC Q ss_conf 69899999999999723367399999906159998772783108832223015778788878547888703348988841 Q gi|254780234|r 322 FSPEREMLQALIDKSQEYVEGSITLKLYKGNVMVVGRESNKSLYSDKLVTFEDDLDAYNQSDSEGFIKLHALRLRTLARR 401 (404) Q Consensus 322 f~p~~~~l~a~i~~~q~~VtG~V~l~L~kG~~~v~gr~S~~sLy~~~~as~~~~~~~~~~~da~GFi~i~~l~~~~~~~~ 401 (404) |||+|++|++|++++|++|||+|+|||||||++++||+||+||||++++||+++ +.|+|.+|+|||++|+|++++++++ T Consensus 320 f~Pl~~~L~a~~~~~q~~VtG~V~lkL~kG~~~v~gr~S~~slY~~~l~t~~~~-~~~dq~~a~GFi~~~~l~~k~~~~~ 398 (403) T COG0137 320 FSPLREALRAFIDETQEHVTGEVRLKLYKGNVYVIGRKSPNSLYSEELATYEEG-DAFDQRDAQGFIKLHGLRSKLAARV 398 (403) T ss_pred CCHHHHHHHHHHHHHHHHCCEEEEEEEECCEEEEEECCCCCCCCCHHHCCCCCC-CCCCHHHHHHHHHHHCCHHHHHHHH T ss_conf 487999999999998855114899999778389984148754367422653556-6745667657899804389999987 Q ss_pred CC Q ss_conf 78 Q gi|254780234|r 402 NK 403 (404) Q Consensus 402 ~k 403 (404) ++ T Consensus 399 ~~ 400 (403) T COG0137 399 KK 400 (403) T ss_pred HH T ss_conf 53 No 5 >cd01999 Argininosuccinate_Synthase Argininosuccinate synthase. The Argininosuccinate synthase is a urea cycle enzyme that catalyzes the penultimate step in arginine biosynthesis: the ATP-dependent ligation of citrulline to aspartate to form argininosuccinate, AMP and pyrophosphate . In humans, a defect in the AS gene causes citrullinemia, a genetic disease characterized by severe vomiting spells and mental retardation. AS is a homotetrameric enzyme of chains of about 400 amino-acid residues. An arginine seems to be important for the enzyme's catalytic mechanism. The sequences of AS from various prokaryotes, archaebacteria and eukaryotes show significant similarity Probab=100.00 E-value=0 Score=1167.59 Aligned_cols=384 Identities=58% Similarity=0.994 Sum_probs=378.0 Q ss_pred EEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCCCH-HHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCC Q ss_conf 99991477259999999987189869999945787110-68999999973980799820089999987999973686547 Q gi|254780234|r 8 VVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQGEE-LKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYE 86 (404) Q Consensus 8 VvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~d-~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Ye 86 (404) ||||||||||||||++||++++++||||||+|+||++| ++.+++||+++||++++++|+|+||+++||||+|+|||+|| T Consensus 1 VvLAySGGLDTSv~l~wL~e~~~~eVia~~~d~Gq~~e~~~~i~~kA~~~GA~~~~v~D~r~ef~~~~i~pai~ana~Ye 80 (385) T cd01999 1 VVLAYSGGLDTSVILKWLKEKGGYEVIAVTADVGQPEEEIEAIEEKALKLGAKKHVVVDLREEFVEDYIFPAIQANALYE 80 (385) T ss_pred CEEEECCCCHHHHHHHHHHHHCCCEEEEEEEECCCCHHHHHHHHHHHHHHCCCEEEEECHHHHHHHHHHHHHHHHCCCCC T ss_conf 97997788179999999997459849999998999667778999999981997899961299999999999996086433 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHC Q ss_conf 87420130124335999999988568928750011168215679999998587970774410068763799999999708 Q gi|254780234|r 87 GYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDLIDFAEKHA 166 (404) Q Consensus 87 g~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~i~ya~~~g 166 (404) |+|||+|||+|||||+++|++|+++||++||||||||||||||||+++++|+|+++||||||||++.||+++|+||++|| T Consensus 81 g~Y~L~tslaRplIak~~ve~A~~~ga~aiaHG~TGkGNDQvRFe~~~~aL~P~l~iiAP~Rd~~~~sR~~~i~ya~~~g 160 (385) T cd01999 81 GTYPLGTALARPLIAKALVEVAKEEGADAVAHGCTGKGNDQVRFELAFYALNPDLKIIAPWRDWEFLSREEEIEYAEEHG 160 (385) T ss_pred CCEECCCCHHHHHHHHHHHHHHHHHCCEEEEECCCCCCCCHHHHHHHHHHHCCCCEEEEEEECCCCCCHHHHHHHHHHCC T ss_conf 75321350157899999999999809849974566688840689999998599975851473144488899999999859 Q ss_pred CCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCCHHHCCCCCCHHHCCCCCEEEEEEEECCEEEEECCEECCHHHHH Q ss_conf 96756656788641572431154576324584576820100146581227999638999995024787516231899999 Q gi|254780234|r 167 IPIDKNKRGEAPFSIDTNLLHSSSEGRVLEDPSQPAPEYVYKMIVSPEEAPDTPTTIRIDFQRGDPIAINGQVMSPEVLL 246 (404) Q Consensus 167 Ipv~~~~~~~~~yS~D~Nlwg~S~Egg~Ledp~~~~pe~~~~~t~~p~~ap~~pe~v~I~Fe~G~PVainG~~~~~~~li 246 (404) |||+++++ +|||+|+||||+|+|||+|||||++||+++|.||++|+++|++|++|+|+|++|+||+|||++|+|++|| T Consensus 161 i~v~~~~~--~pyS~D~NLwg~S~Egg~Ledp~~~~p~~~~~~t~~p~~aP~~pe~v~I~Fe~G~PvainG~~~~~~~li 238 (385) T cd01999 161 IPVPVTKK--KPYSIDENLWGRSIEGGILEDPDNEPPEDAYEWTVSPEDAPDEPEYVEIEFEKGVPVALNGEKLDPVELI 238 (385) T ss_pred CCCCCCCC--CCCCCCCCEEEEECCCCCCCCCCCCCCHHHHHCCCCHHHCCCCCEEEEEEEECCEEEEECCCCCCHHHHH T ss_conf 99886678--8851463600110046552584546986774257997788999869999986445789658406899999 Q ss_pred HHHHHHHHHCCCCCEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHCCCCCCHHH Q ss_conf 99988864216374100077320210001113737899999999998650899999999999999999875351469899 Q gi|254780234|r 247 EQLNQYGRCNGIGRIDIVENRFVGIKSRGVYETPGGTILLHAHRAIESISLDSGSAHLKDDLMSRYASIVYQGFWFSPER 326 (404) Q Consensus 247 ~~LN~igg~~GvGr~d~vEnr~vG~KsR~vYEaPga~iL~~Ahr~LE~~~l~~~~~~~K~~~~~~~a~lvy~G~wf~p~~ 326 (404) ++||+|||+|||||+||||||+|||||||||||||++||++|||+||++||||+++++|++++++|++|||+|+||||+| T Consensus 239 ~~LN~igg~~GIGr~d~vEnr~vG~KsRgvYEaPg~tiL~~AHr~LE~~~Ldr~~~~~K~~l~~~~a~lvY~G~wfsPlr 318 (385) T cd01999 239 LELNEIAGKHGVGRIDIVENRVIGIKSREVYEAPGATILIKAHRDLESLTLDREVLHFKDIVDPKYAELVYNGLWFSPLR 318 (385) T ss_pred HHHHHHHHHCCCCCEEECCCEEECEEEEEEEECCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHCCCCCCHHH T ss_conf 99998875341051320164000112200563553889999999999985779999999999999999985435578889 Q ss_pred HHHHHHHHHHHCCCCEEEEEEEECCEEEEEEEECCCCCCCHHHCCCCCCCCCCCHHHHHHHHHHCCHH Q ss_conf 99999999972336739999990615999877278310883222301577878887854788870334 Q gi|254780234|r 327 EMLQALIDKSQEYVEGSITLKLYKGNVMVVGRESNKSLYSDKLVTFEDDLDAYNQSDSEGFIKLHALR 394 (404) Q Consensus 327 ~~l~a~i~~~q~~VtG~V~l~L~kG~~~v~gr~S~~sLy~~~~as~~~~~~~~~~~da~GFi~i~~l~ 394 (404) ++|++||+++|++|||+|+|+|||||++++||+||+||||+++|||+++ +.|+|+||+|||+|+||+ T Consensus 319 ~~l~afi~~tq~~VtG~V~l~LykG~~~v~gr~S~~sLY~~~~as~~~~-~~~~~~da~GFi~i~~l~ 385 (385) T cd01999 319 EALEAFIDKTQERVNGTVRLKLYKGNVIVVGRESPNSLYSEELASYDED-GGFDQKDAEGFIKIHGLR 385 (385) T ss_pred HHHHHHHHHHHCEEEEEEEEEEECCEEEEEEEECCCHHCCCCCCCCCCC-CCCCHHHHHHHHHHHCCC T ss_conf 9999999986384279999999788389998648701146524513479-998968834557873379 No 6 >PRK13820 argininosuccinate synthase; Provisional Probab=100.00 E-value=0 Score=1162.37 Aligned_cols=392 Identities=47% Similarity=0.791 Sum_probs=381.6 Q ss_pred CCCCCCEEEEEECCCHHHHHHHHHHHHHCCC-EEEEEEEECCCC-CHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHH Q ss_conf 9877778999914772599999999871898-699999457871-10689999999739807998200899999879999 Q gi|254780234|r 1 MSRDVKKVVLAYSGGLDTSIILKWLQVEKGL-EVIVFIADLGQG-EELKIASDKARLLGAKEVYVKDLRREFVRDFVFPM 78 (404) Q Consensus 1 M~~~~kkVvlaySGGLDTSv~i~~L~~e~g~-eVi~~~~d~Gq~-~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~ 78 (404) ||+ ||||||||||||||||++||+++||| |||||++|+||+ +|++.+++||.++|+ +++++|+|+||+++||||+ T Consensus 1 M~~--kKVvLAySGGLDTSv~l~wL~e~yg~~eVi~~~~d~Gq~~e~~~~~~~kA~~~G~-~~~viD~r~ef~~~~i~pa 77 (395) T PRK13820 1 MMK--KKVVLAYSGGLDTSVCIPLLKEKYGYDEVITVAVDVGQPEEEIKEAEEKAKKLGV-KHYTIDAKEEFAKDYIFPA 77 (395) T ss_pred CCC--CEEEEEECCCCHHHHHHHHHHHHCCCCEEEEEEEECCCCHHHHHHHHHHHHHCCC-CEEEEECHHHHHHHHHHHH T ss_conf 998--7299996898489999999998519986999999899976776999999997099-4999816999999999999 Q ss_pred HHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHH Q ss_conf 73686547874201301243359999999885689287500111682156799999985879707744100687637999 Q gi|254780234|r 79 FRANALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDL 158 (404) Q Consensus 79 I~ana~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~ 158 (404) |+|||+||| |||+|||+|||||+++|++|+++||++||||||||||||||||++++ +|+++||||||||++ +|+++ T Consensus 78 I~ana~Yeg-YpL~tslaRplia~~~v~~A~~~ga~aiaHG~TGkGNDQvRFe~~~~--ap~~~iiaP~R~~~~-~R~~~ 153 (395) T PRK13820 78 IKANALYEG-YPLGTALARPLIAKKIVEVAKKEGASAIAHGCTGKGNDQLRFEAVFR--ATDLDVIAPIRELNL-TREWE 153 (395) T ss_pred HHCCCCCCC-CCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHHH--CCCCEEEEEECCHHH-HHHHH T ss_conf 970873267-30324104689999999999975982983077789870999999986--789868721045334-58999 Q ss_pred HHHHHHHCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCCHHHCCCCCCHHHCCCCCEEEEEEEECCEEEEECCE Q ss_conf 99999708967566567886415724311545763245845768201001465812279996389999950247875162 Q gi|254780234|r 159 IDFAEKHAIPIDKNKRGEAPFSIDTNLLHSSSEGRVLEDPSQPAPEYVYKMIVSPEEAPDTPTTIRIDFQRGDPIAINGQ 238 (404) Q Consensus 159 i~ya~~~gIpv~~~~~~~~~yS~D~Nlwg~S~Egg~Ledp~~~~pe~~~~~t~~p~~ap~~pe~v~I~Fe~G~PVainG~ 238 (404) ++||++|||||+++++ +|||+|+||||+|+|||+|||||++||+|+|.||++|+++||+|++|+|+||+|+||+|||+ T Consensus 154 i~ya~~~gI~v~~~~~--~~yS~D~NLwg~S~Egg~LEdp~~~pped~~~~t~~p~~aPd~pe~v~I~Fe~G~PvainG~ 231 (395) T PRK13820 154 IEYAKEHGIPVPVGKE--KPWSIDENLWSRSIEGGKLEDPMFEPPEEIYAWTVSPEDAPDKPEIVEIEFEAGVPVAINGE 231 (395) T ss_pred HHHHHHCCCCCCCCCC--CCCCCCCCCCCEEECCCCCCCCCCCCCCCCEEECCCHHHCCCCCEEEEEEEECCEEEEECCE T ss_conf 9999985998888888--88524555023110578768965466523010047966789887699999966678998797 Q ss_pred ECCHHHHHHHHHHHHHHCCCCCEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHC Q ss_conf 31899999999888642163741000773202100011137378999999999986508999999999999999998753 Q gi|254780234|r 239 VMSPEVLLEQLNQYGRCNGIGRIDIVENRFVGIKSRGVYETPGGTILLHAHRAIESISLDSGSAHLKDDLMSRYASIVYQ 318 (404) Q Consensus 239 ~~~~~~li~~LN~igg~~GvGr~d~vEnr~vG~KsR~vYEaPga~iL~~Ahr~LE~~~l~~~~~~~K~~~~~~~a~lvy~ 318 (404) +|+|++||++||+|||+|||||+||||||+|||||||||||||++||++|||+||++||||+++++|++++++|++|||+ T Consensus 232 ~~~~~eli~~LN~i~g~~GIGr~d~vEnr~vGiKsR~vYEaPaatiL~~AHr~LE~~tLdr~~~~~K~~l~~~~a~lvY~ 311 (395) T PRK13820 232 KMEGVELIRKLNEIAGKHGVGRTDMIEDRVLGLKARENYEHPAATVLLTAHKALEQLVLTREELKFKEIVDSKWAELAYK 311 (395) T ss_pred ECCHHHHHHHHHHHHHHCCCCCEEEECCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHC T ss_conf 88899999999999861611754666042011322630058189999999999998627788999999999999999856 Q ss_pred CCCCCHHHHHHHHHHHHHHCCCCEEEEEEEECCEEEEEEEECCCCCCCHHHCCCCCCCCCCCHHHHHHHHHHCCHHHHHH Q ss_conf 51469899999999999723367399999906159998772783108832223015778788878547888703348988 Q gi|254780234|r 319 GFWFSPEREMLQALIDKSQEYVEGSITLKLYKGNVMVVGRESNKSLYSDKLVTFEDDLDAYNQSDSEGFIKLHALRLRTL 398 (404) Q Consensus 319 G~wf~p~~~~l~a~i~~~q~~VtG~V~l~L~kG~~~v~gr~S~~sLy~~~~as~~~~~~~~~~~da~GFi~i~~l~~~~~ 398 (404) |+||||+|++|++||+++|++|||+|+|+|||||++++||+||+||||++++||++ +.|||.||+|||+|+||++++| T Consensus 312 G~wfsPlr~~l~afi~~sq~~VtG~V~l~LykG~~~v~gr~S~~sLy~~~~as~~~--~~~~~~da~GFI~i~gl~~~~~ 389 (395) T PRK13820 312 GLVHEPLRDDLNAFIDKTQERVTGTVTVKLYKGSARVVGRESPYALYSEELVSFDS--KTIDQRDAEGMAKYHGLQARLY 389 (395) T ss_pred CCCCCHHHHHHHHHHHHHHCCCEEEEEEEEECCEEEEEEEECCCHHCCCCCCEEEC--CCCCHHHCCHHHHHHCHHHHHH T ss_conf 76378899999999998509257899999978879998877862012851253338--9879588157689716999999 Q ss_pred HHCCC Q ss_conf 84178 Q gi|254780234|r 399 ARRNK 403 (404) Q Consensus 399 ~~~~k 403 (404) ++.+| T Consensus 390 ~~~~~ 394 (395) T PRK13820 390 RKLKR 394 (395) T ss_pred HHHHC T ss_conf 99747 No 7 >PRK04527 argininosuccinate synthase; Provisional Probab=100.00 E-value=0 Score=1115.92 Aligned_cols=392 Identities=32% Similarity=0.494 Sum_probs=377.5 Q ss_pred CCCCCCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCC--CCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHH Q ss_conf 987777899991477259999999987189869999945787--110689999999739807998200899999879999 Q gi|254780234|r 1 MSRDVKKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQ--GEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPM 78 (404) Q Consensus 1 M~~~~kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq--~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~ 78 (404) |++ ||||||||||||||||++||+ |+||+||||++|+|| .+|++.+++||.++||++++++|+|+||+++||+|+ T Consensus 1 M~~--kkVvLAySGGLDTSv~l~wL~-e~g~~Vi~~~ad~G~~~~e~~~~i~~kA~~~GA~~~~v~D~~~ef~~~~v~~~ 77 (397) T PRK04527 1 MST--KDIVLAFSGGLDTSFCIPYLQ-ERGYAVHTVFADTGGVDAEERDFIEKRAAELGAASHVTVDGGPAIWEGFVKPF 77 (397) T ss_pred CCC--CEEEEEECCCCHHHHHHHHHH-HCCCCEEEEEEECCCCCCHHHHHHHHHHHHHCCCEEEEECCHHHHHHHHHHHH T ss_conf 998--759999089728999999998-75994799999789986151578999999819977999728999999999999 Q ss_pred HHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCCC---CCCH Q ss_conf 7368654787420130124335999999988568928750011168215679999998587970774410068---7637 Q gi|254780234|r 79 FRANALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEIIAPWRHWS---FKGR 155 (404) Q Consensus 79 I~ana~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~---~~sR 155 (404) |+|||+|||+|||+|| +|||||+++|++|+++||++||||||||||||||||+++++|+ +++||||||+|+ +.+| T Consensus 78 i~ana~Yeg~YpL~ts-aRplIak~~ve~A~~~ga~~iaHG~TGkGNDQvRFe~~~~al~-~~~viAP~R~~~~~~~~~R 155 (397) T PRK04527 78 VWAGEGYQGQYPLLVS-DRYLIVDAALKRAEELGTRIIAHGCTGMGNDQVRFDLAVKALG-DYQIVAPIREIQKEHTQTR 155 (397) T ss_pred HHHCHHHCCCCCCCCC-HHHHHHHHHHHHHHHCCCEEEECCCCCCCCCHHHHHHHHHHCC-CCCCCCCHHHHHHCCHHHH T ss_conf 8606454185556321-0799999999999972983840476667870256776888636-7654365565310222029 Q ss_pred HHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCCHHHCCCCCCHHHCCCCCEEEEEEEECCEEEEE Q ss_conf 99999999708967566567886415724311545763245845768201001465812279996389999950247875 Q gi|254780234|r 156 QDLIDFAEKHAIPIDKNKRGEAPFSIDTNLLHSSSEGRVLEDPSQPAPEYVYKMIVSPEEAPDTPTTIRIDFQRGDPIAI 235 (404) Q Consensus 156 e~~i~ya~~~gIpv~~~~~~~~~yS~D~Nlwg~S~Egg~Ledp~~~~pe~~~~~t~~p~~ap~~pe~v~I~Fe~G~PVai 235 (404) +++++||++|||||+++ .+|||+|+||||+|+|||+| |+|.+||+++|.||.+|.++||+|++|+|+||+|+||+| T Consensus 156 ~~~~~ya~~~gipv~~~---~k~yS~D~NLwg~S~E~g~L-d~~~~P~e~~~~~t~p~~~~Pd~p~~v~I~Fe~G~PVal 231 (397) T PRK04527 156 AYEQKYLEERGFGVRAK---QKAYTINENLLGLTMSGGEI-DRWEAPGEGARGWCAPRSAWPTEALTVTLKFVEGEAVAL 231 (397) T ss_pred HHHHHHHHHCCCCCCCC---CCCCCCCCCEEEEECCCCCC-CCCCCCHHHHHHCCCCHHHCCCCCEEEEEEEECCEEEEE T ss_conf 99997678649998877---78766756601231136667-886698177764248852599997698888878785574 Q ss_pred CCEECCHHHHHHHHHHHHHHCCCCCEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHH Q ss_conf 16231899999999888642163741000773202100011137378999999999986508999999999999999998 Q gi|254780234|r 236 NGQVMSPEVLLEQLNQYGRCNGIGRIDIVENRFVGIKSRGVYETPGGTILLHAHRAIESISLDSGSAHLKDDLMSRYASI 315 (404) Q Consensus 236 nG~~~~~~~li~~LN~igg~~GvGr~d~vEnr~vG~KsR~vYEaPga~iL~~Ahr~LE~~~l~~~~~~~K~~~~~~~a~l 315 (404) ||++|++++||++||+|||+|||||+||||||+|||||||||||||++||+.|||+||++||||+++++|++++++|++| T Consensus 232 nG~~~~~~eli~~LN~igg~hGiGRid~VEnR~vG~KsRgvYEaPa~~iL~~AHr~LE~~~ld~~~~~~K~~l~~~~a~l 311 (397) T PRK04527 232 DGKPLPGAQILAKLNTLFAQYGVGRGVYTGDTVIGLKGRIVFEAPGLVSLLTAHRALEDAVLTKQQNRFKPDVARKWVEL 311 (397) T ss_pred CCCCCCHHHHHHHHHHHHHHCCCCCEEEECCCEEEEECCEEEECCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHH T ss_conf 68447869999999998763562854665253232341514338189999999999998717899999999988999886 Q ss_pred HHCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEEECCEEEEEEEECCCCCCCHHHCCCCCCCCCCCHHHHHHHHHHCCHHH Q ss_conf 75351469899999999999723367399999906159998772783108832223015778788878547888703348 Q gi|254780234|r 316 VYQGFWFSPEREMLQALIDKSQEYVEGSITLKLYKGNVMVVGRESNKSLYSDKLVTFEDDLDAYNQSDSEGFIKLHALRL 395 (404) Q Consensus 316 vy~G~wf~p~~~~l~a~i~~~q~~VtG~V~l~L~kG~~~v~gr~S~~sLy~~~~as~~~~~~~~~~~da~GFi~i~~l~~ 395 (404) ||+|+||||+|++|++||+++|++|||+|+|+|||||++++||+||+ +|++++|||+++ +.|+|+||+|||+||||++ T Consensus 312 vY~G~WfsPlr~~l~afi~~sq~~VtG~V~lkLykG~~~v~gr~Sp~-~~~~~~as~~~~-~~~~~~da~GFI~i~gl~~ 389 (397) T PRK04527 312 VYEGFYHDPLKTDIEAFLKSSQAKVNGEVTLETRGGRVDAVAVRSPH-LLNTKGATYAQS-ADWGVEEAEGFIKLFGMSS 389 (397) T ss_pred HCCCCCCCHHHHHHHHHHHHHHCCEEEEEEEEEECCEEEEEEECCCH-HCCCCCCEECCC-CCCCHHHHHHHHHHHCHHH T ss_conf 50775078899999999999739607999999989869998863811-107555641589-9859799637699725799 Q ss_pred HHHHHCCC Q ss_conf 98884178 Q gi|254780234|r 396 RTLARRNK 403 (404) Q Consensus 396 ~~~~~~~k 403 (404) |+|++++| T Consensus 390 ~~~~~~~k 397 (397) T PRK04527 390 TLYAQVNR 397 (397) T ss_pred HHHHHHCC T ss_conf 99998339 No 8 >PRK05370 argininosuccinate synthase; Validated Probab=100.00 E-value=0 Score=1080.60 Aligned_cols=388 Identities=27% Similarity=0.446 Sum_probs=368.4 Q ss_pred CCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCCC--HHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCC Q ss_conf 7789999147725999999998718986999994578711--06899999997398079982008999998799997368 Q gi|254780234|r 5 VKKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQGE--ELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRAN 82 (404) Q Consensus 5 ~kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~--d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~an 82 (404) -||||||||||||||||++||++ +||||||||||+||++ |++.+++||.++||++++++|+|+||+++|+ |+|+|| T Consensus 11 gkKVvLAYSGGLDTSv~l~wL~e-~g~eVia~taDvGQ~de~d~~~i~~kA~~~GA~k~~v~D~reefv~~~i-~ai~a~ 88 (447) T PRK05370 11 GQRVGIAFSGGLDTSAALLWMRQ-KGAVPYAYTANLGQPDEDDYDAIPRRAMEYGAENARLIDCRAQLVAEGI-AAIQCG 88 (447) T ss_pred CCEEEEEECCCCHHHHHHHHHHH-CCCEEEEEEEECCCCCCHHHHHHHHHHHHHCCCEEEEECCHHHHHHHHH-HHHHHC T ss_conf 88899995898279999999987-5987999999799986133899999999829978999717999999999-999736 Q ss_pred CCCCC----CCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCCCC----CC Q ss_conf 65478----74201301243359999999885689287500111682156799999985879707744100687----63 Q gi|254780234|r 83 ALYEG----YYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEIIAPWRHWSF----KG 154 (404) Q Consensus 83 a~Yeg----~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~----~s 154 (404) |+|++ .|||+|||+||+|++.+|++|+++||++||||||||||||||||+++.+++|+++||||||||+| +| T Consensus 89 A~y~~~~g~~Y~l~t~laRplia~~lv~~a~~~ga~~iaHGaTGKGNDQVRFe~~~~~l~P~lkIiaPwRd~~f~~el~s 168 (447) T PRK05370 89 AFHISTGGVTYFNTTPLGRAVTGTMLVAAMKEDGVNIWGDGSTYKGNDIERFYRYGLLTNPELKIYKPWLDQDFIDELGG 168 (447) T ss_pred CHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCHHHHHHHHHHHCCCCEEECCCCCHHHHHHCCC T ss_conf 23330257644587730788999999999998399899557767874388999999852998769714334666654157 Q ss_pred HHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCCC--CCCHHHCCCCCCHHHCCCCCEEEEEEEECCEE Q ss_conf 7999999997089675665678864157243115457632458457--68201001465812279996389999950247 Q gi|254780234|r 155 RQDLIDFAEKHAIPIDKNKRGEAPFSIDTNLLHSSSEGRVLEDPSQ--PAPEYVYKMIVSPEEAPDTPTTIRIDFQRGDP 232 (404) Q Consensus 155 Re~~i~ya~~~gIpv~~~~~~~~~yS~D~Nlwg~S~Egg~Ledp~~--~~pe~~~~~t~~p~~ap~~pe~v~I~Fe~G~P 232 (404) |+++++||++|||||+.+++ +|||+|+||||+|+|||+||||+. +++++++..+.+|+++|++|++|+|+||+|+| T Consensus 169 R~e~i~Ya~~~gIpv~~s~e--k~YSiD~NLwg~S~Egg~LEd~~~~~~~~~p~~~~~~~~~~a~~~pe~v~I~Fe~G~P 246 (447) T PRK05370 169 RAEMSEFMIACGFDYKMSVE--KAYSTDSNMLGATHEAKDLEHLNSGIKIVNPIMGVAFWDEDVKIKAEEVTVRFEQGQP 246 (447) T ss_pred HHHHHHHHHHCCCCCCCCCC--CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEECEEE T ss_conf 49999999984998766788--8853363301323444445676645677887334667865479998499999673077 Q ss_pred EEECCEEC-CHHHHHHHHHHHHHHCCCCCEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHH Q ss_conf 87516231-89999999988864216374100077320210001113737899999999998650899999999999999 Q gi|254780234|r 233 IAINGQVM-SPEVLLEQLNQYGRCNGIGRIDIVENRFVGIKSRGVYETPGGTILLHAHRAIESISLDSGSAHLKDDLMSR 311 (404) Q Consensus 233 VainG~~~-~~~~li~~LN~igg~~GvGr~d~vEnr~vG~KsR~vYEaPga~iL~~Ahr~LE~~~l~~~~~~~K~~~~~~ 311 (404) |+|||++| +|++||++||+|||+|||||+||||||||||||||||||||++||+.|||+||++||||+++++|+.++++ T Consensus 247 ValnG~~~~spveli~~LN~igG~hGIGR~D~VEnR~VG~KSRgvYEtPg~tiL~~AHr~LE~~~ldr~~~~~k~~~~~~ 326 (447) T PRK05370 247 VALNGKTFSDPVELMLEANRIGGRHGLGMSDQIENRIIEAKSRGIYEAPGMALLHIAYERLVTGIHNEDTIEQYRINGRR 326 (447) T ss_pred EEECCEECCCHHHHHHHHHHHHHHCCCCCEEEECCEEEEEEECCEECCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHH T ss_conf 89899867899999999999998753175255602024453041332814999999999999985489999999888999 Q ss_pred HHHHHHCCCCCCHHHHHHHHHHH-HHHCCCCEEEEEEEECCE-EEEEEEECCCCCCCHHHCCCCCCC-CCCCHHHHHHHH Q ss_conf 99987535146989999999999-972336739999990615-999877278310883222301577-878887854788 Q gi|254780234|r 312 YASIVYQGFWFSPEREMLQALID-KSQEYVEGSITLKLYKGN-VMVVGRESNKSLYSDKLVTFEDDL-DAYNQSDSEGFI 388 (404) Q Consensus 312 ~a~lvy~G~wf~p~~~~l~a~i~-~~q~~VtG~V~l~L~kG~-~~v~gr~S~~sLy~~~~as~~~~~-~~~~~~da~GFi 388 (404) |++|||+|+||||+|.+|+.+++ ++|+.|||+|+||||||| ++++||+||+|||+++++|||... ..|+|.||+||| T Consensus 327 ~a~LvYnG~WFsPe~~~Lr~~l~~~~q~~VtG~V~lkLyKGn~~~v~~r~sp~sLY~~~~~s~e~~e~~~y~~~Da~Gfi 406 (447) T PRK05370 327 LGRLLYQGRWFDPQALMLRETLQRWVASAITGEVTLELRRGNDYSILNTVSPNLTYKPERLSMEKVESAAFSPDDRIGQL 406 (447) T ss_pred HHHHHHCCEECCCHHHHHHHHHHHHHHCCCEEEEEEEEECCCEEEEEECCCCCCCCCHHHCCEECCCCCCCCCCCCCHHE T ss_conf 99998558824805899999999986575317999999759758983042851016724374323777775953443011 Q ss_pred HHCCHHHH Q ss_conf 87033489 Q gi|254780234|r 389 KLHALRLR 396 (404) Q Consensus 389 ~i~~l~~~ 396 (404) ++++|+++ T Consensus 407 ~l~~L~~~ 414 (447) T PRK05370 407 TMRNLDIT 414 (447) T ss_pred EECCCCCC T ss_conf 22277575 No 9 >KOG1706 consensus Probab=100.00 E-value=0 Score=1013.88 Aligned_cols=398 Identities=51% Similarity=0.890 Sum_probs=385.9 Q ss_pred CCCCCCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHH Q ss_conf 98777789999147725999999998718986999994578711068999999973980799820089999987999973 Q gi|254780234|r 1 MSRDVKKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFR 80 (404) Q Consensus 1 M~~~~kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ 80 (404) |++++++||||||||||||++++||+ |||||||||.+|+||.+|++.+++||++.||+++++.|+++||++|++||+++ T Consensus 1 ~~~~~~~vVLAySGgLDTscil~WLk-eqGyeViay~AnvGQ~edfe~ar~kAlk~Gakk~~~ed~~~eFvedfi~Pa~q 79 (412) T KOG1706 1 MMSSKKSVVLAYSGGLDTSCILAWLK-EQGYEVIAYLANVGQKEDFEEARKKALKSGAKKVVVEDVREEFVEDFIWPALQ 79 (412) T ss_pred CCCCCCEEEEEECCCCCCHHHHHHHH-HCCCEEEEEECCCCCHHHHHHHHHHHHHCCCEEEEEHHHHHHHHHHCCHHHHH T ss_conf 98777459999538867234358888-62965898400156266689999753312760798613158887602016443 Q ss_pred CCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCC----CCCCHH Q ss_conf 6865478742013012433599999998856892875001116821567999999858797077441006----876379 Q gi|254780234|r 81 ANALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEIIAPWRHW----SFKGRQ 156 (404) Q Consensus 81 ana~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~----~~~sRe 156 (404) ++|+||++|+|+|||+||+||+++|++|.++||.+|+||||||||||||||+++.+|+|++++|||||.+ .|.||+ T Consensus 80 s~a~YEd~YLLGTSlaRp~ia~~qv~va~~eg~~aVsHGcTGKGNDQvrFELt~ysl~P~~kviapwrmp~f~~rf~Gr~ 159 (412) T KOG1706 80 SSALYEDRYLLGTSLARPVIAKAQVDVAQREGAKAVSHGCTGKGNDQVRFELTFYSLKPDVKVIAPWRMPEFYERFKGRK 159 (412) T ss_pred HCCHHHCEEEECCCCCCCHHHHHHHHHHHHCCCEEEECCCCCCCCCCEEEEEEEECCCCCCEEECCCCCHHHHHHHCCCH T ss_conf 11210101342244443134333535776517503303556677740246653311477630333545467887642714 Q ss_pred HHHHHHHHHCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCCHHHCCCCCCHHHCCCCCEEEEEEEECCEEEEEC Q ss_conf 99999997089675665678864157243115457632458457682010014658122799963899999502478751 Q gi|254780234|r 157 DLIDFAEKHAIPIDKNKRGEAPFSIDTNLLHSSSEGRVLEDPSQPAPEYVYKMIVSPEEAPDTPTTIRIDFQRGDPIAIN 236 (404) Q Consensus 157 ~~i~ya~~~gIpv~~~~~~~~~yS~D~Nlwg~S~Egg~Ledp~~~~pe~~~~~t~~p~~ap~~pe~v~I~Fe~G~PVain 236 (404) ++++||++||||||.+++ +|||+|+||||+|+|+|+||||.++||.++|.+|++|++|||+|++++|+||+|+||+++ T Consensus 160 Dl~eYakq~giPvpvT~k--~pwsmDeNl~HiSyEagiLEdP~nqpp~~m~~~~~DP~~APd~pe~LeI~fekG~Pvklt 237 (412) T KOG1706 160 DLLEYAKQHGIPVPVTPK--NPWSMDENLMHISYEAGILEDPKNQPPKDMYKLTVDPEDAPDTPEILEIEFEKGVPVKLT 237 (412) T ss_pred HHHHHHHHCCCCCCCCCC--CCCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCHHHCCCCCCEEEEEEECCCCEEEE T ss_conf 789999864997665688--876544110010000353348677998100136678312899984589985238844888 Q ss_pred ---CEEC--CHHHHHHHHHHHHHHCCCCCEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHH Q ss_conf ---6231--89999999988864216374100077320210001113737899999999998650899999999999999 Q gi|254780234|r 237 ---GQVM--SPEVLLEQLNQYGRCNGIGRIDIVENRFVGIKSRGVYETPGGTILLHAHRAIESISLDSGSAHLKDDLMSR 311 (404) Q Consensus 237 ---G~~~--~~~~li~~LN~igg~~GvGr~d~vEnr~vG~KsR~vYEaPga~iL~~Ahr~LE~~~l~~~~~~~K~~~~~~ 311 (404) |... +.++|+.+||+++|+|||||+||||||+||||||||||+||+|||.+||.+||.+||||++.++|+.+..+ T Consensus 238 ~vk~~tt~qtSlelf~~ln~vagkhgVGrIDIVENRfigmKSRGiYetp~gTIL~~AhlDle~~TmdrEvrkik~~~~lk 317 (412) T KOG1706 238 NVKDGTTHQTSLELFMYLNEVAGKHGVGRIDIVENRFIGMKSRGIYETPAGTILYHAHLDLEGFTMDREVRKIKDGLGLK 317 (412) T ss_pred ECCCCCCCCCHHHHHHHHHHHHCCCCCCEEEEHHHHHHCCCCCCEEECCCCEEEEEECCCHHHHHHHHHHHHHHHCCCCH T ss_conf 63788566025999999998752167550101043520554452340788346531004265500278899887501623 Q ss_pred HHHHHHCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEEECCEEEEEEEECCCCCCCHHHCCCCCCCCCCCHHHHHHHHHHC Q ss_conf 99987535146989999999999972336739999990615999877278310883222301577878887854788870 Q gi|254780234|r 312 YASIVYQGFWFSPEREMLQALIDKSQEYVEGSITLKLYKGNVMVVGRESNKSLYSDKLVTFEDDLDAYNQSDSEGFIKLH 391 (404) Q Consensus 312 ~a~lvy~G~wf~p~~~~l~a~i~~~q~~VtG~V~l~L~kG~~~v~gr~S~~sLy~~~~as~~~~~~~~~~~da~GFi~i~ 391 (404) |++|||+|+||||+|++++++|..+|+.|+|+|+|+|||||++++||+||.|||+++++||+.. ..|+|.||+|||+|+ T Consensus 318 ~aelvy~G~w~sp~~e~v~~~~~~sqe~v~GkVqvsvyKGqV~IlGresp~sLY~ee~~Smn~~-g~y~p~datGFI~I~ 396 (412) T KOG1706 318 YAELVYTGFWFSPECESVRACIAKSQERVEGKVQVSVYKGQVYILGRESPLSLYNEELVSMNVQ-GDYEPIDATGFININ 396 (412) T ss_pred HHHHHHCCEECCHHHHHHHHHHHCCCCCCCCEEEEEEEECEEEEEECCCCHHHHCCCCCCCCCC-CCCCCCCCCCEEEEE T ss_conf 5553210332380678899875325231032599999504699971468444416654320133-785523443457750 Q ss_pred CHHHHHHHHCC Q ss_conf 33489888417 Q gi|254780234|r 392 ALRLRTLARRN 402 (404) Q Consensus 392 ~l~~~~~~~~~ 402 (404) +|++|.|++.+ T Consensus 397 slrlKey~~~~ 407 (412) T KOG1706 397 SLRLKEYHRLQ 407 (412) T ss_pred EEEHHHHHHHH T ss_conf 33223444533 No 10 >PRK00143 trmU tRNA (5-methylaminomethyl-2-thiouridylate)-methyltransferase; Reviewed Probab=99.76 E-value=1.9e-17 Score=145.56 Aligned_cols=158 Identities=28% Similarity=0.385 Sum_probs=114.7 Q ss_pred CCCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCC----------CCHHHHHHHHHHHCCCEEEEEECHHHHHHHH Q ss_conf 777899991477259999999987189869999945787----------1106899999997398079982008999998 Q gi|254780234|r 4 DVKKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQ----------GEELKIASDKARLLGAKEVYVKDLRREFVRD 73 (404) Q Consensus 4 ~~kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq----------~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~ 73 (404) ++|||++|+|||+||||++.+|+ ++||||+++++-++. .+|+.++++.|..+|+ +|+++|++++|.++ T Consensus 3 sk~rV~VamSGGVDSsVaA~LL~-~~Gy~V~Gv~m~~w~~~~~~~~C~~~~d~~dA~~va~~LgI-p~~v~d~~~~f~~~ 80 (355) T PRK00143 3 SKKRVVVGMSGGVDSSVAAALLK-EQGYDVIGLFMKLWDDDDGSGGCCAEEDIADARAVADKLGI-PLYVVDFAKEFWDN 80 (355) T ss_pred CCCEEEEEECCCHHHHHHHHHHH-HCCCCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHCCC-CEEEECHHHHHHHH T ss_conf 88889999167899999999999-77995899999887688777899857889999999998698-57996869987678 Q ss_pred HHHHHHHCCCCCCCCCCCCCC-HHHHHHHHHHHHHHHHCCCCEEECHHHC----------------CCCHHHHHH--HHH Q ss_conf 799997368654787420130-1243359999999885689287500111----------------682156799--999 Q gi|254780234|r 74 FVFPMFRANALYEGYYLLGTA-IARPLIAKYLVDIANETGADAIAHGSTG----------------KGNDQVRFE--LSA 134 (404) Q Consensus 74 ~i~~~I~ana~Yeg~Ypl~ts-laRplia~~lv~~a~~~ga~~iaHG~Tg----------------kGNDQvRFe--~~~ 134 (404) .+-|.+.. .-.|+.|+-+- --|-+--..+.+.|+++||++||.|-=. ...||--|= +.- T Consensus 81 Vi~~f~~~--Y~~G~TPNPcv~CN~~IKFg~l~~~A~~lgad~iATGHYAri~~~~~~~~L~r~~D~~KDQSYfL~~l~~ 158 (355) T PRK00143 81 VFDYFLDE--YKAGRTPNPCVLCNKEIKFKAFLDYALELGADYIATGHYARIRDEDGRYELLRAVDPNKDQSYFLYQLTQ 158 (355) T ss_pred HHHHHHHH--HHCCCCCCCCHHCCCCCCHHHHHHHHHHCCCCEECCCCEEEEEECCCEEEEEECCCCCCCCEEEEECCCH T ss_conf 89999999--9769999973340872339999999987399842335259999159818998678898872057610579 Q ss_pred HHHCCCCEEEEEECCCCCCCHHHHHHHHHHHCCCCCC Q ss_conf 9858797077441006876379999999970896756 Q gi|254780234|r 135 YSLNSDIEIIAPWRHWSFKGRQDLIDFAEKHAIPIDK 171 (404) Q Consensus 135 ~~l~P~l~viaP~Rd~~~~sRe~~i~ya~~~gIpv~~ 171 (404) ..| -+++=|.- ++ ..+|--+.|+++|+|+.. T Consensus 159 ~~L---~~~~FPLG--~~-~K~eVR~iA~~~gL~~a~ 189 (355) T PRK00143 159 EQL---AKLLFPLG--EL-TKPEVREIAAEIGLPTAK 189 (355) T ss_pred HHH---HHEECCCC--CC-CHHHHHHHHHHCCCCCCC T ss_conf 887---42075876--77-358999999975898899 No 11 >COG0482 TrmU Predicted tRNA(5-methylaminomethyl-2-thiouridylate) methyltransferase, contains the PP-loop ATPase domain [Translation, ribosomal structure and biogenesis] Probab=99.72 E-value=1.9e-16 Score=138.25 Aligned_cols=238 Identities=24% Similarity=0.304 Sum_probs=150.4 Q ss_pred CCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCC---------CCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHH Q ss_conf 77899991477259999999987189869999945787---------110689999999739807998200899999879 Q gi|254780234|r 5 VKKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQ---------GEELKIASDKARLLGAKEVYVKDLRREFVRDFV 75 (404) Q Consensus 5 ~kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq---------~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i 75 (404) ++||++|||||+||||++.+|+ +|||||+++++.++. .+|+.++++.|..+|+ +++++|++++|++..+ T Consensus 3 ~~kV~v~mSGGVDSSVaA~lLk-~QGyeViGl~m~~~~~~~~~~C~s~~d~~da~~va~~LGI-p~~~vdf~~~y~~~V~ 80 (356) T COG0482 3 KKKVLVGMSGGVDSSVAAYLLK-EQGYEVIGLFMKNWDEDGGGGCCSEEDLRDAERVADQLGI-PLYVVDFEKEFWNKVF 80 (356) T ss_pred CCEEEEECCCCHHHHHHHHHHH-HCCCEEEEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHCC-CEEEECHHHHHHHHHH T ss_conf 7679999057778999999999-7697499999996414788867725678999999998099-5599846899888888 Q ss_pred HHHHHCCCCCCCCCCCCCCHH-HHHHHHHHHHHHHHCCCCEEECH--HHCCCCHHHHHHHHHHHHCCC-----------C Q ss_conf 999736865478742013012-43359999999885689287500--111682156799999985879-----------7 Q gi|254780234|r 76 FPMFRANALYEGYYLLGTAIA-RPLIAKYLVDIANETGADAIAHG--STGKGNDQVRFELSAYSLNSD-----------I 141 (404) Q Consensus 76 ~~~I~ana~Yeg~Ypl~tsla-Rplia~~lv~~a~~~ga~~iaHG--~TgkGNDQvRFe~~~~~l~P~-----------l 141 (404) -+.+. ..-.|++|+-..+. |.+.-+.+++.|+++|||+||.| +-.+.+++ ++.+ .+..+++ - T Consensus 81 ~~f~~--~Y~~G~TPNPci~CN~~iKF~~~l~~a~~lgad~iATGHYar~~~~~~-~~~l-~r~~D~~KDQsYfL~~~~~ 156 (356) T COG0482 81 EYFLA--EYKAGKTPNPCILCNKEIKFKALLDYAKELGADYIATGHYARQREDEG-IELL-LRGVDLNKDQSYFLYALSQ 156 (356) T ss_pred HHHHH--HHHCCCCCCCCHHCCHHHHHHHHHHHHHHCCCCEEEEEEEEEEECCCC-CCCC-CCCCCCCCCHHHEECCCCH T ss_conf 87789--985799999651048788999999999975998578744676543776-0001-4678742030002012688 Q ss_pred EEEEEECCC--CCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCCHHHCCCCCCHHHCCCC Q ss_conf 077441006--876379999999970896756656788641572431154576324584576820100146581227999 Q gi|254780234|r 142 EIIAPWRHW--SFKGRQDLIDFAEKHAIPIDKNKRGEAPFSIDTNLLHSSSEGRVLEDPSQPAPEYVYKMIVSPEEAPDT 219 (404) Q Consensus 142 ~viaP~Rd~--~~~sRe~~i~ya~~~gIpv~~~~~~~~~yS~D~Nlwg~S~Egg~Ledp~~~~pe~~~~~t~~p~~ap~~ 219 (404) ..+.+.+.+ ++ .-.+.-.-|+++|+|+...+ =|.+- |.--|+.-. +.... .-|. T Consensus 157 ~ql~~~lFPlG~l-~K~evR~iA~~~gL~~a~Kk-----dS~~I---CFi~~~~~~---------~fl~~-----~~p~- 212 (356) T COG0482 157 EQLERLLFPLGDL-EKLEVRPIAAEKGLPTAKKK-----DSQGI---CFIGERKFK---------DFLGR-----YLPA- 212 (356) T ss_pred HHHHHCCCCCCCC-CHHHHHHHHHHCCCCCCCCC-----CCCCC---EECCCCCHH---------HHHHH-----HCCC- T ss_conf 7775322567777-77899999997599766766-----67762---213787789---------99873-----1877- Q ss_pred CEEEEEEEECCEEEEECCEECCHHH-HHHHHHHHHHHCCCCCEECC-CCCC---CCCCCCCCCCCHHH Q ss_conf 6389999950247875162318999-99999888642163741000-7732---02100011137378 Q gi|254780234|r 220 PTTIRIDFQRGDPIAINGQVMSPEV-LLEQLNQYGRCNGIGRIDIV-ENRF---VGIKSRGVYETPGG 282 (404) Q Consensus 220 pe~v~I~Fe~G~PVainG~~~~~~~-li~~LN~igg~~GvGr~d~v-Enr~---vG~KsR~vYEaPga 282 (404) +.|.-+.++|+.+.-.+ ++-+ -||-|.|+|+-..- |-.+ +-+|...+|=.|+. T Consensus 213 --------~~G~ii~~~G~v~g~H~G~~~y--TiGQR~Gl~I~~~~~~p~yVv~kd~~~n~~~V~~~~ 270 (356) T COG0482 213 --------KPGEIIDKDGKVLGEHDGLMYY--TIGQRKGLGIGGLKGEPWYVVGKDLKKNRLYVGQGE 270 (356) T ss_pred --------CCCEEEECCCCEEEECCCEEEE--ECCCCCCCCCCCCCCCCEEEEEEECCCCEEEEECCH T ss_conf --------8852792488180012877888--426633567688888877999871666759996404 No 12 >cd01998 tRNA_Me_trans tRNA methyl transferase. This family represents tRNA(5-methylaminomethyl-2-thiouridine)-methyltransferase which is involved in the biosynthesis of the modified nucleoside 5-methylaminomethyl-2-thiouridine present in the wobble position of some tRNAs. This family of enzyme only presents in bacteria and eukaryote. The archaeal counterpart of this enzyme performs same function, but is completely unrelated in sequence. Probab=99.71 E-value=2.9e-16 Score=137.04 Aligned_cols=162 Identities=28% Similarity=0.387 Sum_probs=115.2 Q ss_pred EEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCC----------CCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHH Q ss_conf 899991477259999999987189869999945787----------1106899999997398079982008999998799 Q gi|254780234|r 7 KVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQ----------GEELKIASDKARLLGAKEVYVKDLRREFVRDFVF 76 (404) Q Consensus 7 kVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq----------~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~ 76 (404) ||++|+|||+||||++.+|+ ++||||+++++-+.. .+|++++++.|..+|+ +|+++|++++|.++.+- T Consensus 1 kV~v~mSGGVDSsvaA~LL~-~~G~~V~gv~m~~w~~~~~~~~C~~~~d~~dA~~va~~LgI-p~~v~d~~~ef~~~V~~ 78 (349) T cd01998 1 KVVVAMSGGVDSSVAAALLK-EQGYEVIGVFMKNWDEDDGKGGCCSEEDLKDARRVADQLGI-PHYVVNFEKEYWEKVFE 78 (349) T ss_pred CEEEECCCCHHHHHHHHHHH-HCCCCEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHCCC-CEEEECHHHHHHHHHHH T ss_conf 98999077899999999998-77995799999967887667898867789999999998699-67996809988688899 Q ss_pred HHHHCCCCCCCCCCCCCC-HHHHHHHHHHHHHHHHCCCCEEECHHHC-------------CC----CHHHHHHHHHHHHC Q ss_conf 997368654787420130-1243359999999885689287500111-------------68----21567999999858 Q gi|254780234|r 77 PMFRANALYEGYYLLGTA-IARPLIAKYLVDIANETGADAIAHGSTG-------------KG----NDQVRFELSAYSLN 138 (404) Q Consensus 77 ~~I~ana~Yeg~Ypl~ts-laRplia~~lv~~a~~~ga~~iaHG~Tg-------------kG----NDQvRFe~~~~~l~ 138 (404) |++..- -.|+.|+-.- --|-+--..+++.|++.||++||.|-=. +| .||--|= ..|. T Consensus 79 ~f~~~Y--~~G~TPNPcv~CN~~IKF~~l~~~A~~~g~d~iATGHYAri~~~~~~~~~L~r~~D~~KDQSYfL---~~l~ 153 (349) T cd01998 79 PFLEEY--KKGRTPNPDILCNKEIKFGALLDYAKKLGADYIATGHYARIEEDNNGRYRLLRGVDPNKDQSYFL---SQLS 153 (349) T ss_pred HHHHHH--HCCCCCCCCHHCCCEEEHHHHHHHHHHCCCCEECCCCEEEEEECCCCCEEEEECCCCCCCCCEEE---ECCC T ss_conf 999999--74899987621187351999999998759986413514788975898489987689987820564---0124 Q ss_pred CC--CEEEEEECCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCH Q ss_conf 79--7077441006876379999999970896756656788641572 Q gi|254780234|r 139 SD--IEIIAPWRHWSFKGRQDLIDFAEKHAIPIDKNKRGEAPFSIDT 183 (404) Q Consensus 139 P~--l~viaP~Rd~~~~sRe~~i~ya~~~gIpv~~~~~~~~~yS~D~ 183 (404) ++ -+++=|.- ++ ..+|--+.|+++|+|+. . +|=|.|= T Consensus 154 ~~~L~~~~FPLG--~~-~K~eVR~iA~~~gl~~a-~----K~dSq~I 192 (349) T cd01998 154 QEQLSRLIFPLG--DL-TKPEVREIAKELGLPVA-K----KKDSQGI 192 (349) T ss_pred HHHHHHEECCCC--CC-CHHHHHHHHHHCCCCCC-C----CCCCCCE T ss_conf 876610387887--86-53899999998599989-9----7777760 No 13 >pfam03054 tRNA_Me_trans tRNA methyl transferase. This family represents tRNA(5-methylaminomethyl-2-thiouridine)-methyltransferase which is involved in the biosynthesis of the modified nucleoside 5-methylaminomethyl-2-thiouridine present in the wobble position of some tRNAs. Probab=99.70 E-value=3.7e-16 Score=136.24 Aligned_cols=155 Identities=28% Similarity=0.407 Sum_probs=112.2 Q ss_pred EEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCC-----------CCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHH Q ss_conf 899991477259999999987189869999945787-----------110689999999739807998200899999879 Q gi|254780234|r 7 KVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQ-----------GEELKIASDKARLLGAKEVYVKDLRREFVRDFV 75 (404) Q Consensus 7 kVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq-----------~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i 75 (404) ||++|+|||+||||++.+|+ ++||||+++++-+.. .+|+.++++.|..+|+ +|+++|++++|.++.+ T Consensus 2 kV~VamSGGVDSsVaA~LL~-~~Gy~V~Gv~m~~w~~~~~~~~~C~~~~d~~dA~~va~~LgI-p~~v~d~~~~f~~~Vi 79 (354) T pfam03054 2 KVVVAMSGGVDSSVAAYLLK-EQGYEVIGVFMKNWDEEDEFGHGCCSEEDLADAQRVCEQLGI-PLYVVNFEKEYWEKVF 79 (354) T ss_pred EEEEEECCCHHHHHHHHHHH-HCCCCEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHCCC-CEEEECHHHHHHHHHH T ss_conf 69999047899999999999-779963999999566887667889988999999999998699-7899787998758999 Q ss_pred HHHHHCCCCCCCCCCCCCC-HHHHHHHHHHHHHHHH-CCCCEEECHH---HC-------------CCCHHHHHHHHHHHH Q ss_conf 9997368654787420130-1243359999999885-6892875001---11-------------682156799999985 Q gi|254780234|r 76 FPMFRANALYEGYYLLGTA-IARPLIAKYLVDIANE-TGADAIAHGS---TG-------------KGNDQVRFELSAYSL 137 (404) Q Consensus 76 ~~~I~ana~Yeg~Ypl~ts-laRplia~~lv~~a~~-~ga~~iaHG~---Tg-------------kGNDQvRFe~~~~~l 137 (404) -|.+.. .-.|+.|+-.- --|-+--..+.+.|++ +||++||.|- +. ...||--| ...| T Consensus 80 ~~fv~~--Y~~G~TPNPcv~CN~~IKFg~l~~~A~~~lGad~iATGHYAri~~~~~~~~L~r~~D~~KDQSYf---L~~l 154 (354) T pfam03054 80 EPFLDE--YKNGRTPNPDILCNREIKFGALLDYAKQVLGADYLATGHYARVSLNENESELLRGKDKNKDQSYF---LSGL 154 (354) T ss_pred HHHHHH--HHCCCCCCCCHHHCCCCCHHHHHHHHHHHCCCCCCCCCCEEEEEECCCEEEEEECCCCCCCCCEE---EECC T ss_conf 999999--97699998414418964389999999986699810447428999049938999899999980178---6107 Q ss_pred CCC--CEEEEEECCCCCCCHHHHHHHHHHHCCCCCC Q ss_conf 879--7077441006876379999999970896756 Q gi|254780234|r 138 NSD--IEIIAPWRHWSFKGRQDLIDFAEKHAIPIDK 171 (404) Q Consensus 138 ~P~--l~viaP~Rd~~~~sRe~~i~ya~~~gIpv~~ 171 (404) .++ -+++=|.-+ + ..++--+.|++.|+|+.. T Consensus 155 ~q~~L~~~lFPLG~--~-~K~eVR~iA~~~gL~~a~ 187 (354) T pfam03054 155 SQEQLEKLLFPLGD--L-TKEEVRKIAKEAGLPTAK 187 (354) T ss_pred CHHHHHHEECCCCC--C-CHHHHHHHHHHCCCCCCC T ss_conf 89876431828768--8-528999999976998899 No 14 >TIGR00420 trmU tRNA (5-methylaminomethyl-2-thiouridylate)-methyltransferase; InterPro: IPR004506 tRNA (5-methylaminomethyl-2-thiouridylate)-methyltransferase (2.1.1.61 from EC) catalyses the addition of 5-methylaminomethyl-2-thiouridylate to tRNAs using S-adenosyl-L-methionine as a substrate and releasing S-adenosyl-L-homocysteine. The enzyme is cytoplasmic and is involved in tRNA processing.; GO: 0004808 tRNA (5-methylaminomethyl-2-thiouridylate)-methyltransferase activity, 0008033 tRNA processing, 0005737 cytoplasm. Probab=99.40 E-value=4.7e-12 Score=106.83 Aligned_cols=186 Identities=22% Similarity=0.324 Sum_probs=129.4 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECC------C----------------CCHHHHHHHHHHHCCCEEEEE Q ss_conf 789999147725999999998718986999994578------7----------------110689999999739807998 Q gi|254780234|r 6 KKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLG------Q----------------GEELKIASDKARLLGAKEVYV 63 (404) Q Consensus 6 kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~G------q----------------~~d~~~~~~~A~~~Ga~~~~v 63 (404) +||++++|||+||||++..|+ ++|++|+++++..+ + .+|+.++.+-|..+|+ ++.+ T Consensus 1 ~kv~VglSGGVDSSVsA~lL~-~qg~~v~G~ylk~W~~~~~~d~~GH~~~N~~K~~C~~~~dl~~a~~ic~~LGi-~l~~ 78 (394) T TIGR00420 1 KKVIVGLSGGVDSSVSAYLLK-QQGYEVVGVYLKNWDEDEKNDILGHKKINENKSGCSSAEDLRDAQAICEKLGI-PLEK 78 (394) T ss_pred CEEEEECCCCHHHHHHHHHHH-HCCCEEEEEEEEEEECCCCCCCCCHHHHHCCCCCCCHHHHHHHHHHHHHHCCC-CEEE T ss_conf 948998068547899999997-14965899987765124666644316562467788368879999999985598-1898 Q ss_pred ECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHH-HHHHHHHHHHC---CCCEEECH--H-HC-CCCHHHHHHHHHH Q ss_conf 2008999998799997368654787420130124335-99999998856---89287500--1-11-6821567999999 Q gi|254780234|r 64 KDLRREFVRDFVFPMFRANALYEGYYLLGTAIARPLI-AKYLVDIANET---GADAIAHG--S-TG-KGNDQVRFELSAY 135 (404) Q Consensus 64 ~D~r~ef~~~~i~~~I~ana~Yeg~Ypl~tslaRpli-a~~lv~~a~~~---ga~~iaHG--~-Tg-kGNDQvRFe~~~~ 135 (404) +|++++|+.+..-|+|+ +.-+|..|+=..+-=+.| -..+.+.|.+. |++.||.| + -- .-|.+-||++. + T Consensus 79 ~nf~~~Y~~~Vf~~~i~--~y~~G~TPnPDi~CN~~iKFG~~~e~~~~~~Gtg~~~~aTGHYAr~~q~~~~~~~~~L~-~ 155 (394) T TIGR00420 79 VNFQKEYWNKVFEPFIQ--EYKEGLTPNPDILCNKLIKFGLLLEYAKQLLGTGNDKIATGHYARIAQEIENKSLFELL-R 155 (394) T ss_pred ECHHHHHHHHHHHHHHH--HHHCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCEEECCHHHHHHHHHCCCCHHHHH-C T ss_conf 55078878888899999--87367777862004712003789999998528995434424476544342131014443-0 Q ss_pred HHCCC-----------C----EEEEEECCCCCCCHHHHHHHHHHHC--CCCCCCCCCCCCCCCCHHHHCCC-CCCCCCCC Q ss_conf 85879-----------7----0774410068763799999999708--96756656788641572431154-57632458 Q gi|254780234|r 136 SLNSD-----------I----EIIAPWRHWSFKGRQDLIDFAEKHA--IPIDKNKRGEAPFSIDTNLLHSS-SEGRVLED 197 (404) Q Consensus 136 ~l~P~-----------l----~viaP~Rd~~~~sRe~~i~ya~~~g--Ipv~~~~~~~~~yS~D~Nlwg~S-~Egg~Led 197 (404) +++.. - +++=|.= ++. -.+--.-|++++ +++..++ |=|.+==..+-+ +.|.--|- T Consensus 156 ~~D~~KDQSYFL~~l~~~~~~~~~FPlG--~L~-K~~vR~iA~~~~PfL~ta~tk----KDSqgICFig~~qhGGkf~dF 228 (394) T TIGR00420 156 ALDKNKDQSYFLYHLSQEQLAKLLFPLG--ELL-KPEVRQIAKNAGPFLSTAETK----KDSQGICFIGERQHGGKFKDF 228 (394) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHCCCCC--CCC-CHHHHHHHHHCCCCCCCCCCC----CCCCCCEEECCCCCCCCHHHH T ss_conf 4678886117376401898887505301--036-778999998648632344568----864450210351568846888 Q ss_pred CCCCCC Q ss_conf 457682 Q gi|254780234|r 198 PSQPAP 203 (404) Q Consensus 198 p~~~~p 203 (404) ..+-.| T Consensus 229 Lk~Ylp 234 (394) T TIGR00420 229 LKKYLP 234 (394) T ss_pred HHHHCC T ss_conf 764257 No 15 >cd01990 Alpha_ANH_like_I This is a subfamily of Adenine nucleotide alpha hydrolases superfamily. Adenine nucleotide alpha hydrolases superfamily includes N type ATP PPases and ATP sulphurylases. It forms a apha/beta/apha fold which binds to Adenosine group. This subfamily of proteins probably binds ATP. This domain is about 200 amino acids long with a strongly conserved motif SGGKD at the N terminus. Probab=99.40 E-value=4.4e-12 Score=106.99 Aligned_cols=147 Identities=22% Similarity=0.333 Sum_probs=114.2 Q ss_pred EEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCC--CCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCC Q ss_conf 99991477259999999987189869999945787--1106899999997398079982008999998799997368654 Q gi|254780234|r 8 VVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQ--GEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALY 85 (404) Q Consensus 8 VvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq--~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Y 85 (404) |++|||||+|||+++....+--|-.|+|+++|-|- ..|.+.+++.|..+|+ ++.++|..+-+. +.+..|.. T Consensus 1 V~vA~SGGVDSs~la~la~~alG~~v~aV~~~s~~~~~~E~e~a~~~a~~~gi-~~~~i~~~~l~~-----~~~~~N~~- 73 (202) T cd01990 1 VAVAFSGGVDSTLLLKAAVDALGDRVLAVTATSPLFPRRELEEAKRLAKEIGI-RHEVIETDELDD-----PEFAKNPP- 73 (202) T ss_pred CEEECCCCHHHHHHHHHHHHHHCCCEEEEEEECCCCCHHHHHHHHHHHHHCCC-CEEEEECCHHHH-----HHHCCCCH- T ss_conf 98982385999999999999768787999972798997899999999986398-579973411101-----65545975- Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHH Q ss_conf 78742013012433599999998856892875001116821567999999858797077441006876379999999970 Q gi|254780234|r 86 EGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDLIDFAEKH 165 (404) Q Consensus 86 eg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~i~ya~~~ 165 (404) +.-| ..+-.+...+.+.|++.|.++|+.|.+.- |..-+.-..+++. ++.|.+|+++..|+ .++--+.+++. T Consensus 74 ~RCy-----~CK~~l~~~l~~~a~~~g~~~v~dGtn~d--Dl~d~RPGl~A~~-e~~v~sPL~e~gl~-K~eVR~la~~l 144 (202) T cd01990 74 DRCY-----LCKKALYEALKEIAEELGLDVVLDGTNAD--DLGDYRPGLKALR-ELGVRSPLAEAGLG-KAEIRELAREL 144 (202) T ss_pred HHHH-----HHHHHHHHHHHHHHHHCCCCEEEECCCCC--CCCCCCCCHHHHH-HCCCCCCHHHCCCC-HHHHHHHHHHC T ss_conf 5622-----99899999999999967997894157576--3235578688998-76998852651978-89999999983 Q ss_pred CCCCC Q ss_conf 89675 Q gi|254780234|r 166 AIPID 170 (404) Q Consensus 166 gIpv~ 170 (404) |+|+. T Consensus 145 gLp~~ 149 (202) T cd01990 145 GLPTW 149 (202) T ss_pred CCCCC T ss_conf 99856 No 16 >COG1606 ATP-utilizing enzymes of the PP-loop superfamily [General function prediction only] Probab=99.36 E-value=1.1e-11 Score=104.29 Aligned_cols=154 Identities=23% Similarity=0.400 Sum_probs=118.2 Q ss_pred CCCCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCC-C-CHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHH Q ss_conf 7777899991477259999999987189869999945787-1-1068999999973980799820089999987999973 Q gi|254780234|r 3 RDVKKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQ-G-EELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFR 80 (404) Q Consensus 3 ~~~kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq-~-~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ 80 (404) ++++||++|||||.|||+.+...++.-|-+|+|+|+|..- + .+++.++.-|..+|. +|.+++.- ++. |-.. T Consensus 15 k~~~kv~vAfSGGvDSslLa~la~~~lG~~v~AvTv~sP~~p~~e~e~A~~~A~~iGi-~H~~i~~~-~~~-----~~~~ 87 (269) T COG1606 15 KEKKKVVVAFSGGVDSSLLAKLAKEALGDNVVAVTVDSPYIPRREIEEAKNIAKEIGI-RHEFIKMN-RMD-----PEFK 87 (269) T ss_pred HHCCEEEEEECCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCHHHHHHHHHHHHHHCC-CCEEEEHH-HCC-----HHHC T ss_conf 6437399996588427999999999735646999971687776660678999999487-62465500-025-----2330 Q ss_pred CCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHH Q ss_conf 68654787420130124335999999988568928750011168215679999998587970774410068763799999 Q gi|254780234|r 81 ANALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDLID 160 (404) Q Consensus 81 ana~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~i~ 160 (404) .|. +++-. +.+-.+-..+++.|.+.|-|+|+.|.+.. |.-+.--.++++ -++.|-+||-+..++ +++-.+ T Consensus 88 ~n~--~~rCY----~CK~~v~~~l~~~a~~~Gyd~V~dGtNas--Dl~~~RPG~rA~-kE~gi~sPl~e~git-k~eIre 157 (269) T COG1606 88 ENP--ENRCY----LCKRAVYSTLVEEAEKRGYDVVADGTNAS--DLFDYRPGLRAL-KELGIRSPLAEFGIT-KKEIRE 157 (269) T ss_pred CCC--CCCCH----HHHHHHHHHHHHHHHHCCCCEEEECCCHH--HHCCCCCCHHHH-HHCCCCCHHHHHCCC-HHHHHH T ss_conf 499--87355----77899999999999973998897477578--733788302357-760777718883975-999999 Q ss_pred HHHHHCCCCCCCC Q ss_conf 9997089675665 Q gi|254780234|r 161 FAEKHAIPIDKNK 173 (404) Q Consensus 161 ya~~~gIpv~~~~ 173 (404) +++..|+|....+ T Consensus 158 ~a~~lgl~~~~kp 170 (269) T COG1606 158 IAKSLGLPTWDKP 170 (269) T ss_pred HHHHCCCCCCCCC T ss_conf 9997599845586 No 17 >PRK11106 queuosine biosynthesis protein QueC; Provisional Probab=99.27 E-value=5.6e-10 Score=91.92 Aligned_cols=157 Identities=20% Similarity=0.241 Sum_probs=105.7 Q ss_pred CCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCC--CHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCC Q ss_conf 778999914772599999999871898699999457871--106899999997398079982008999998799997368 Q gi|254780234|r 5 VKKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQG--EELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRAN 82 (404) Q Consensus 5 ~kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~--~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~an 82 (404) |||+|+.+||||||++|+.|... .+++|++++.|-||. .|++.+++-|..+|+.+|.|+|+. |..+. ..+ T Consensus 1 MkkavVLlSGGlDStt~L~~a~~-~~~~v~alsfdYGQrh~~El~~A~~ia~~~gv~~h~vidl~--~l~~~-----~~S 72 (231) T PRK11106 1 MKRAVVVFSGGQDSTTCLIQALQ-QYDEVHCITFDYGQRHRAEIDVARELALKLGARAHKVLDVT--LLNEL-----AVS 72 (231) T ss_pred CCEEEEEECCCHHHHHHHHHHHH-CCCEEEEEEEECCCCCHHHHHHHHHHHHHCCCCCEEEECCH--HHHHH-----CCC T ss_conf 98089990787899999999998-19949999877786719999999999998598400675327--77775-----235 Q ss_pred CCCCC------------CCCCCCCHHHHHHH-HHHHHHHHHCCCCEEECHHHCC---CC---HH---HHHHHHHHH-HCC Q ss_conf 65478------------74201301243359-9999998856892875001116---82---15---679999998-587 Q gi|254780234|r 83 ALYEG------------YYLLGTAIARPLIA-KYLVDIANETGADAIAHGSTGK---GN---DQ---VRFELSAYS-LNS 139 (404) Q Consensus 83 a~Yeg------------~Ypl~tslaRplia-~~lv~~a~~~ga~~iaHG~Tgk---GN---DQ---vRFe~~~~~-l~P 139 (404) ++-.+ .-|.+---.|-.|- ....-+|...|++.|..|...- |= .+ -.|+.++.. ... T Consensus 73 aLt~~~i~vp~~~~~~~~~p~T~VP~RN~ifLsiAaa~Ae~~ga~~I~~G~~~~D~sgYPDCr~eFi~a~~~al~~g~~~ 152 (231) T PRK11106 73 SLTRDSIPVPDYEPEADGLPNTFVPGRNILFLTLAAIYAYQVKAEAVITGVCETDFSGYPDCRDEFVKALNHAVVLGMAK 152 (231) T ss_pred CCCCCCCCCCCCCCCCCCCCCCEEECCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHCCCC T ss_conf 66777877777654445787646716729999999999998599959980465566889989899999999999851789 Q ss_pred CCEEEEEECCCCCCCHHHHHHHHHH---HCCCCCCC Q ss_conf 9707744100687637999999997---08967566 Q gi|254780234|r 140 DIEIIAPWRHWSFKGRQDLIDFAEK---HAIPIDKN 172 (404) Q Consensus 140 ~l~viaP~Rd~~~~sRe~~i~ya~~---~gIpv~~~ 172 (404) .++|.+|..+ + +..|-+.-+++ -|+|...| T Consensus 153 ~i~i~aPl~~--l-~K~eiv~l~~~~~~L~~p~~~T 185 (231) T PRK11106 153 DIRFETPLMW--L-DKAETWALADYYGQLDLVRHET 185 (231) T ss_pred CEEEEECCCC--C-CHHHHHHHHHHCCCCCCCCCCC T ss_conf 8199837777--8-8889999998616668650235 No 18 >cd01712 ThiI ThiI is required for thiazole synthesis in the thiamine biosynthesis pathway. It belongs to the Adenosine Nucleotide Hydrolysis suoerfamily and predicted to bind to Adenosine nucleotide. Probab=99.21 E-value=2.4e-09 Score=87.42 Aligned_cols=150 Identities=22% Similarity=0.272 Sum_probs=98.9 Q ss_pred EEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCCC---HHH---HHHHHHHHCCC-EEEEEECHHHHHHHHHHHHHH Q ss_conf 89999147725999999998718986999994578711---068---99999997398-079982008999998799997 Q gi|254780234|r 7 KVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQGE---ELK---IASDKARLLGA-KEVYVKDLRREFVRDFVFPMF 79 (404) Q Consensus 7 kVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~---d~~---~~~~~A~~~Ga-~~~~v~D~r~ef~~~~i~~~I 79 (404) ||++++|||+||+|+..+| .++|++|++++.+++-.. ..+ ++.+.....+. ...++++.... ++..+ T Consensus 1 kvl~L~SGGiDS~VAa~ll-~krG~~V~~l~f~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~-----~~~~~ 74 (177) T cd01712 1 KALALLSGGIDSPVAAWLL-MKRGIEVDALHFNSGPFTSEKAREKVEDLARKLARYSPGHKLVVIIFTFF-----VQKEI 74 (177) T ss_pred CEEEEECCCCHHHHHHHHH-HHCCCEEEEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEECCHH-----HHHHH T ss_conf 9899957781699999999-98799899999989998888999999999999999489962899945389-----99999 Q ss_pred HCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHH-HCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHH Q ss_conf 36865478742013012433599999998856892875001-11682156799999985879707744100687637999 Q gi|254780234|r 80 RANALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGS-TGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDL 158 (404) Q Consensus 80 ~ana~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~-TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~ 158 (404) .. .|..|+-+-+-|.++-+.+.++|+++||++|+.|- -|.--.|-.-++.....+-++.|+-|.- .+. -+|- T Consensus 75 ~~----~g~~~npcv~ckr~m~r~a~~~A~~~ga~~IvTGe~lGQvasqt~~nl~~i~~~~~~~ilRPL~--~~d-K~EI 147 (177) T cd01712 75 YG----YGKEKYRCILCKRMMYRIAEKLAEELGADAIVTGESLGQVASQTLENLLVISSGTDLPILRPLI--GFD-KEEI 147 (177) T ss_pred HH----HCCCCCCCEEHHHHHHHHHHHHHHHCCCCEEEECCCHHHHHHHHHHHHHHHHHHHCCCCCCCCC--CCC-HHHH T ss_conf 97----0888898636299999999999998699899866522204476899999998742564214888--999-8999 Q ss_pred HHHHHHHCC-CC Q ss_conf 999997089-67 Q gi|254780234|r 159 IDFAEKHAI-PI 169 (404) Q Consensus 159 i~ya~~~gI-pv 169 (404) ++.|++-|. ++ T Consensus 148 ~~~A~~igt~~~ 159 (177) T cd01712 148 IGIARRIGTYDI 159 (177) T ss_pred HHHHHHHCCHHH T ss_conf 999998096744 No 19 >COG0037 MesJ tRNA(Ile)-lysidine synthase MesJ [Cell cycle control, cell division, chromosome partitioning] Probab=99.18 E-value=1.3e-09 Score=89.39 Aligned_cols=149 Identities=26% Similarity=0.302 Sum_probs=106.4 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHCC-CEEEEEEEECCCC----CHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHH Q ss_conf 7899991477259999999987189-8699999457871----1068999999973980799820089999987999973 Q gi|254780234|r 6 KKVVLAYSGGLDTSIILKWLQVEKG-LEVIVFIADLGQG----EELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFR 80 (404) Q Consensus 6 kkVvlaySGGLDTSv~i~~L~~e~g-~eVi~~~~d~Gq~----~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ 80 (404) .||++|+|||.||++++..|++-++ .++.++++|.|+. .+.+.+++-+...|. ++++.++...+..+..- T Consensus 22 ~~ilVavSGGkDS~~ll~~L~~l~~~~~~~a~~Vd~~~~~~~~~~~~~~~~~~~~~~~-~~~v~~~~~~~~~~~~~---- 96 (298) T COG0037 22 YKILVAVSGGKDSLALLHLLKELGRRIEVEAVHVDHGLRGYSDQEAELVEKLCEKLGI-PLIVERVTDDLGRETLD---- 96 (298) T ss_pred CCEEEEECCCHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHCCC-CEEEEEHHHHHHHCCCC---- T ss_conf 8589993787899999999998422573899997089886432899999999996499-84886204543100246---- Q ss_pred CCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHH------HHH----HHHHCC-----CC-EEE Q ss_conf 68654787420130124335999999988568928750011168215679------999----998587-----97-077 Q gi|254780234|r 81 ANALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQVRF------ELS----AYSLNS-----DI-EII 144 (404) Q Consensus 81 ana~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvRF------e~~----~~~l~P-----~l-~vi 144 (404) + .-..+.+|-+.-..+-+.|++.|++.|+.|=| .+||+-. ... .+.+.| +. .++ T Consensus 97 ------~--~~~c~~c~~~R~~~l~~~a~~~g~~~i~tgH~--~dD~~et~lm~l~~g~~~~~l~~~~~~~~~~~~~~~i 166 (298) T COG0037 97 ------G--KSICAACRRLRRGLLYKIAKELGADKIATGHH--LDDQAETFLMNLLRGSGLRGLRGMPPKRPFEGGLLII 166 (298) T ss_pred ------C--CCHHHHHHHHHHHHHHHHHHHCCCCEEEECCC--HHHHHHHHHHHHHCCCCHHHHHCCCCCCCCCCCCEEE T ss_conf ------7--67879999999999999999859998985678--4689999999986175224564088524457886454 Q ss_pred EEECCCCCCCHHHHHHHHHHHCCCCCCC Q ss_conf 4410068763799999999708967566 Q gi|254780234|r 145 APWRHWSFKGRQDLIDFAEKHAIPIDKN 172 (404) Q Consensus 145 aP~Rd~~~~sRe~~i~ya~~~gIpv~~~ 172 (404) -|... + .+.+...||..+|||.-.. T Consensus 167 RPL~~--~-~~~ei~~~~~~~~l~~~~d 191 (298) T COG0037 167 RPLLY--V-REKEIELYAKEKGLPYIED 191 (298) T ss_pred CCCCC--C-CHHHHHHHHHHCCCCEECC T ss_conf 75746--8-8999999999759987517 No 20 >cd01993 Alpha_ANH_like_II This is a subfamily of Adenine nucleotide alpha hydrolases superfamily.Adeninosine nucleotide alpha hydrolases superfamily includes N type ATP PPases and ATP sulphurylases. It forms a apha/beta/apha fold which binds to Adenosine group. This subfamily of proteins is predicted to bind ATP. This domainhas a strongly conserved motif SGGKD at the N terminus. Probab=99.17 E-value=1.1e-09 Score=89.93 Aligned_cols=159 Identities=23% Similarity=0.387 Sum_probs=100.8 Q ss_pred EEEEEECCCHHHHHHHHHHHHH-----CCCEEEEEEEECCC-C---CHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHH Q ss_conf 8999914772599999999871-----89869999945787-1---1068999999973980799820089999987999 Q gi|254780234|r 7 KVVLAYSGGLDTSIILKWLQVE-----KGLEVIVFIADLGQ-G---EELKIASDKARLLGAKEVYVKDLRREFVRDFVFP 77 (404) Q Consensus 7 kVvlaySGGLDTSv~i~~L~~e-----~g~eVi~~~~d~Gq-~---~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~ 77 (404) ||++|+|||.||.+++..|..- .+.+++++++|-|- + ++.+.+++-+..+|. ++++...++......... T Consensus 1 kilva~SGG~DS~~Ll~ll~~~~~~~~~~~~l~~~hvnh~~r~~s~~~~~~v~~~~~~~~i-~~~~~~~~~~~~~~~~~~ 79 (185) T cd01993 1 RILVALSGGKDSLVLLHVLKKLQRRYPYGFELEALTVDEGIPGYRDESLEVVERLAEELGI-ELEIVSFKEEYTDDIEVK 79 (185) T ss_pred CEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHCCC-CEEEEEEEECCCCCHHHH T ss_conf 9899965849999999999998886488956999995699888889999999998996299-359987640367638999 Q ss_pred HHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHH-----HHHHH--HHHHCCC---------C Q ss_conf 973686547874201301243359999999885689287500111682156-----79999--9985879---------7 Q gi|254780234|r 78 MFRANALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQV-----RFELS--AYSLNSD---------I 141 (404) Q Consensus 78 ~I~ana~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQv-----RFe~~--~~~l~P~---------l 141 (404) ...... .++++|-+--..+.+.|++.|++.|+-|-+ .|||+ |+=.. ...+.|- + T Consensus 80 ~~~~~~--------~c~~c~r~Rr~~l~~~~~~~~~~~i~~gHh--~dD~~ET~l~~l~rg~~~~~~~~~~~~~~~~~~i 149 (185) T cd01993 80 KRGGKS--------PCSLCGVLRRGLLNKIAKELGADKLATGHN--LDDEAETLLMNLLRGGILRLMRPGPILYLDEGDV 149 (185) T ss_pred HHHHCC--------HHHHHHHHHHHHHHHHHHHCCCCEEEEECC--HHHHHHHHHHHHHCCCCHHHCCCCCCCCCCCCCE T ss_conf 986406--------346899999999999999819988975330--7689999999998489700056766333568984 Q ss_pred EEEEEECCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCC Q ss_conf 07744100687637999999997089675665678864157 Q gi|254780234|r 142 EIIAPWRHWSFKGRQDLIDFAEKHAIPIDKNKRGEAPFSID 182 (404) Q Consensus 142 ~viaP~Rd~~~~sRe~~i~ya~~~gIpv~~~~~~~~~yS~D 182 (404) .++-|.- .+ +|++..+||++++||.-..+ .|||-+ T Consensus 150 ~iiRPLL--~~-~k~ei~~y~~~~~l~~~eD~---~~~~~~ 184 (185) T cd01993 150 TRIRPLV--YV-REKEIVLYAELNGLPFVEEE---CPYAGN 184 (185) T ss_pred EEEECCC--CC-CHHHHHHHHHHCCCCEEECC---CCCCCC T ss_conf 7995498--89-89999999998799878899---988989 No 21 >TIGR00268 TIGR00268 conserved hypothetical protein TIGR00268; InterPro: IPR005232 This family of conserved hypothetical proteins has no known function but is predicted to be part of the wider PP-loop superfamily .. Probab=99.08 E-value=2.3e-09 Score=87.54 Aligned_cols=153 Identities=23% Similarity=0.408 Sum_probs=116.5 Q ss_pred CCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCC--C-CHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHC Q ss_conf 77899991477259999999987189869999945787--1-10689999999739807998200899999879999736 Q gi|254780234|r 5 VKKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQ--G-EELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRA 81 (404) Q Consensus 5 ~kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq--~-~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~a 81 (404) .|||++|||||+|||.+...+.+--|.||+|+|+-.-+ + .+++++...|..+|. .|-++.. +.+. -|-.+. T Consensus 15 nk~~~IAySGGvDS~lla~v~~~v~g~~~lait~~sP~~sp~~el~~A~~~A~~~g~-~he~~~~-d~~~----n~~f~~ 88 (263) T TIGR00268 15 NKKVLIAYSGGVDSSLLAAVCSDVAGTEVLAITVVSPSISPRRELEDAKAIAKEIGV-KHELVKI-DKMA----NPNFRA 88 (263) T ss_pred CCCEEEEEECCHHHHHHHHHHHHHHHHCEEEEEEECCCCCCHHHHHHHHHHHHHHCC-CEEEECC-CCCC----CCCCCC T ss_conf 081699951746589999999875310113578762775735448999999988083-0121100-1236----850016 Q ss_pred CCCCCCC-CCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEE-EEECCCCCCCHHHHH Q ss_conf 8654787-42013012433599999998856892875001116821567999999858797077-441006876379999 Q gi|254780234|r 82 NALYEGY-YLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEII-APWRHWSFKGRQDLI 159 (404) Q Consensus 82 na~Yeg~-Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~vi-aP~Rd~~~~sRe~~i 159 (404) |. +++ |+ ..--.-..|++.|.++|-++|-.|..+. |..-.-=.++|. -++... +||-|.+. |..|-. T Consensus 89 N~--~~RCY~-----CK~~~~~~L~~~a~~~gy~~V~dGtN~d--DL~~~RPG~~A~-~E~~g~~SP~aef~I-~K~eir 157 (263) T TIGR00268 89 NV--EERCYF-----CKKKVLSILVKLAEKRGYDVVVDGTNAD--DLEDHRPGLRAV-KELNGVYSPWAEFGI-TKKEIR 157 (263) T ss_pred CC--CCCCHH-----HHHHHHHHHHHHHHHCCCCEEEECCCCC--CCCCCCCHHHHH-HHCCCCCCCCCCCCC-CHHHHH T ss_conf 88--544415-----4888999989999863995798234620--002367513567-660788687200256-879999 Q ss_pred HHHHHHCCCCCCCCC Q ss_conf 999970896756656 Q gi|254780234|r 160 DFAEKHAIPIDKNKR 174 (404) Q Consensus 160 ~ya~~~gIpv~~~~~ 174 (404) +-|+.-|++....++ T Consensus 158 ~ia~~lg~~~~DKP~ 172 (263) T TIGR00268 158 EIAKSLGLSFYDKPS 172 (263) T ss_pred HHHHHCCCCCCCCCC T ss_conf 999974889888866 No 22 >pfam06508 ExsB ExsB. This family includes putative transcriptional regulators from Bacteria and Archaea. Probab=99.05 E-value=1.9e-09 Score=88.12 Aligned_cols=111 Identities=31% Similarity=0.405 Sum_probs=75.3 Q ss_pred EEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCC--CHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCC- Q ss_conf 8999914772599999999871898699999457871--1068999999973980799820089999987999973686- Q gi|254780234|r 7 KVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQG--EELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANA- 83 (404) Q Consensus 7 kVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~--~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana- 83 (404) |+|+.+|||+||++++.|.+ ++|++|+|++.|-||. .|++.+++-|..+|+ +|.++|+. +..+.---++..+. T Consensus 1 Kavvl~SGG~DSt~~l~~a~-~~~~~v~ait~dYGQ~~~~Ei~~A~~ia~~l~i-~h~vidl~--~l~~~~~saL~~~~~ 76 (137) T pfam06508 1 KAVVLLSGGLDSTTCLAWAK-KEGYEVYALTFDYGQRHSKELECAKKIAKALGV-EHKIVDLD--FLKQIGGSALTDDSI 76 (137) T ss_pred CEEEEECCCHHHHHHHHHHH-HCCCEEEEEEECCCCCHHHHHHHHHHHHHHCCC-CCEEECCC--HHHHHCCCCCCCCCC T ss_conf 98999178789999999999-869968999814788739999999999998299-75030330--244416552236885 Q ss_pred -----CCC-CCCCCCCCHHHHHHH-HHHHHHHHHCCCCEEECHHH Q ss_conf -----547-874201301243359-99999988568928750011 Q gi|254780234|r 84 -----LYE-GYYLLGTAIARPLIA-KYLVDIANETGADAIAHGST 121 (404) Q Consensus 84 -----~Ye-g~Ypl~tslaRplia-~~lv~~a~~~ga~~iaHG~T 121 (404) -|+ +.-|..-=-.|-.|- ....-.|...|++.|+.|.. T Consensus 77 ~vp~~~~~~~~~~~t~VP~RN~iflsiA~a~A~~~g~~~I~~G~~ 121 (137) T pfam06508 77 EVPDSELESEEIPNTYVPGRNLIFLSIAASYAEAIGANDIFIGVN 121 (137) T ss_pred CCCCCCCCCCCCCCCEEECCHHHHHHHHHHHHHHCCCCEEEEEEC T ss_conf 556654345678776563654999999999999869997999565 No 23 >COG0603 Predicted PP-loop superfamily ATPase [General function prediction only] Probab=99.04 E-value=2.3e-08 Score=80.33 Aligned_cols=157 Identities=27% Similarity=0.404 Sum_probs=109.7 Q ss_pred CCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCC--CHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCC Q ss_conf 778999914772599999999871898699999457871--106899999997398079982008999998799997368 Q gi|254780234|r 5 VKKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQG--EELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRAN 82 (404) Q Consensus 5 ~kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~--~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~an 82 (404) ++|.|+-+|||+||++|+.|++++ +++|++++-|-||. .|++.+++-|..+|+. +.++|+. |..+ +..+ T Consensus 2 ~~kavvl~SGG~DStt~l~~a~~~-~~ev~alsfdYGQrh~~Ele~A~~iak~lgv~-~~iid~~--~~~~-----~~~s 72 (222) T COG0603 2 MKKAVVLLSGGLDSTTCLAWAKKE-GYEVHALTFDYGQRHRKELEAAKELAKKLGVP-HHIIDVD--LLGE-----IGGS 72 (222) T ss_pred CCEEEEECCCCHHHHHHHHHHHHC-CCEEEEEEEECCCCCHHHHHHHHHHHHHCCCC-EEEECHH--HHHH-----CCCC T ss_conf 834999916881389999999960-98899997507898689999999999981998-2895416--8753-----5888 Q ss_pred CCCCC-------CC-----CCCCCHHHHHHHHH-HHHHHHHCCCCEEECHH---HCCCCHHH------HHHHHHH-HHCC Q ss_conf 65478-------74-----20130124335999-99998856892875001---11682156------7999999-8587 Q gi|254780234|r 83 ALYEG-------YY-----LLGTAIARPLIAKY-LVDIANETGADAIAHGS---TGKGNDQV------RFELSAY-SLNS 139 (404) Q Consensus 83 a~Yeg-------~Y-----pl~tslaRplia~~-lv~~a~~~ga~~iaHG~---TgkGNDQv------RFe~~~~-~l~P 139 (404) |+-.+ .+ |.+-.-+|-+|--. ..-+|...|++.|.-|. -+.|=--. +|+.++. ++.. T Consensus 73 aLtd~~~~vp~~~~~~~~~p~t~VP~RN~iflsiA~~~Ae~~g~~~I~~Gv~~~D~sgYPDcrpefi~a~~~~~~l~~~~ 152 (222) T COG0603 73 ALTDDSIDVPKYEFAEEEIPATFVPARNLIFLSIAAAYAEALGADAIIIGVNEEDFSGYPDCRPEFIEALNEALNLGTEK 152 (222) T ss_pred CCCCCCCCCCCCCCCCCCCCCEEECCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHCCC T ss_conf 67588863666666666686447606609999999999987599859997532104789988889999999999862557 Q ss_pred CCE-EEEEECCCCCCCHHHHHHHHHHHCCCCCCCC Q ss_conf 970-7744100687637999999997089675665 Q gi|254780234|r 140 DIE-IIAPWRHWSFKGRQDLIDFAEKHAIPIDKNK 173 (404) Q Consensus 140 ~l~-viaP~Rd~~~~sRe~~i~ya~~~gIpv~~~~ 173 (404) ... |.||.-. + +-.+.+.-+.+.|.|...|- T Consensus 153 ~~~~i~aPl~~--l-~Ka~iv~l~~elg~~~~~T~ 184 (222) T COG0603 153 GVRIIHAPLME--L-TKAEIVKLADELGVPLELTW 184 (222) T ss_pred CCCEEECCEEE--C-CHHHHHHHHHHHCCCCHHCE T ss_conf 86177678331--3-49999998888688510026 No 24 >cd01995 ExsB ExsB is a transcription regulator related protein. It is a subfamily of a Adenosine nucleotide binding superfamily of proteins. This protein family is represented by a single member in nearly every completed large ( 1000 genes) prokaryotic genome. In Rhizobium meliloti, a species in which the exo genes make succinoglycan, a symbiotically important exopolysaccharide, exsB is located nearby and affects succinoglycan levels, probably through polar effects on exsA expression or the same polycistronic mRNA. In Arthrobacter viscosus, the homologous gene is designated ALU1 and is associated with an aluminum tolerance phenotype. The function is unknown Probab=99.00 E-value=2.5e-08 Score=80.04 Aligned_cols=131 Identities=26% Similarity=0.368 Sum_probs=91.9 Q ss_pred EEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCC--CHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCC Q ss_conf 8999914772599999999871898699999457871--10689999999739807998200899999879999736865 Q gi|254780234|r 7 KVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQG--EELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANAL 84 (404) Q Consensus 7 kVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~--~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~ 84 (404) |+|+.+||||||++++.|++ +.|++|++++.|-||. .|++.+++-|..+|.. ..+..| T Consensus 1 kavvllSGGlDSt~~l~~~~-~~g~~v~~l~~dYGQr~~~E~~~a~~i~~~l~~~--~~VP~R----------------- 60 (169) T cd01995 1 KAVVLLSGGLDSTTCLAWAK-KEGYEVHALSFDYGQRHAKEEEAAKLIAEKLGPS--TYVPAR----------------- 60 (169) T ss_pred CEEEEECCCHHHHHHHHHHH-HCCCEEEEEEEECCCCHHHHHHHHHHHHHHHCCC--CEECCC----------------- T ss_conf 98999078788999999999-8499699999756985499999999999996899--465283----------------- Q ss_pred CCCCCCCCCCHHHHHHH-HHHHHHHHHCCCCEEECHHHCC------CCHHH---HHHHHHHH-HCCCCEEEEEECCCCCC Q ss_conf 47874201301243359-9999998856892875001116------82156---79999998-58797077441006876 Q gi|254780234|r 85 YEGYYLLGTAIARPLIA-KYLVDIANETGADAIAHGSTGK------GNDQV---RFELSAYS-LNSDIEIIAPWRHWSFK 153 (404) Q Consensus 85 Yeg~Ypl~tslaRplia-~~lv~~a~~~ga~~iaHG~Tgk------GNDQv---RFe~~~~~-l~P~l~viaP~Rd~~~~ 153 (404) -.|- ....-.|...|++.|+.|...- -|.+. +|+..+.. ....++|.+|..+ + T Consensus 61 -------------N~ifls~A~~~A~~~g~~~v~~G~~~~d~~~ypDc~~~F~~a~~~~~~~~~~~~v~i~aP~~~--~- 124 (169) T cd01995 61 -------------NLIFLSIAAAYAEALGAEAIIIGVNAEDYSGYPDCRPEFIEAMNKALNLGTENGIKIHAPLID--L- 124 (169) T ss_pred -------------HHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHCCCCCCEEEECCCC--C- T ss_conf -------------899999999999982999689998247756899986999999999998308798589843335--9- Q ss_pred CHHHHHHHHHHHCCCCCCCC Q ss_conf 37999999997089675665 Q gi|254780234|r 154 GRQDLIDFAEKHAIPIDKNK 173 (404) Q Consensus 154 sRe~~i~ya~~~gIpv~~~~ 173 (404) +..+.+..+.+.|+|...|- T Consensus 125 ~K~eiv~~g~~lgv~~~~T~ 144 (169) T cd01995 125 SKAEIVRLGGELGVPLELTW 144 (169) T ss_pred CHHHHHHHHHHCCCCHHHCC T ss_conf 89999999988499799821 No 25 >KOG2805 consensus Probab=98.99 E-value=1.1e-08 Score=82.67 Aligned_cols=164 Identities=16% Similarity=0.238 Sum_probs=107.8 Q ss_pred CCCCCCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCC-----------CCHHHHHHHHHHHCCCEEEEEECHHHH Q ss_conf 987777899991477259999999987189869999945787-----------110689999999739807998200899 Q gi|254780234|r 1 MSRDVKKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQ-----------GEELKIASDKARLLGAKEVYVKDLRRE 69 (404) Q Consensus 1 M~~~~kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq-----------~~d~~~~~~~A~~~Ga~~~~v~D~r~e 69 (404) |.+.+.+||+|+|||+||||++.+|+ .+|+.|++|.+-.++ .+|+.+++..+..+++ ++..+++.+| T Consensus 1 ~p~~~~~VvvamSgGVDSsVaa~Ll~-~~g~~v~gv~M~nWd~~de~~s~cp~e~D~~da~~Vc~~LnI-~~~~Vnf~kE 78 (377) T KOG2805 1 MPEKPDRVVVAMSGGVDSSVAARLLA-ARGYNVTGVFMKNWDSLDEFGSQCPAERDWKDAKRVCKQLNI-PLHQVNFVKE 78 (377) T ss_pred CCCCCCEEEEEECCCCHHHHHHHHHH-HCCCCEEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHCC-EEEEEEEHHH T ss_conf 98666548999537711899999997-418871699662201222356689811208999999987097-0278743799 Q ss_pred HHHHHHHHHHHCCCCCCCCCCCCCC-HHHHHHHHHHHHH-HHHCCCCEEECHHHCC------CCHHHHHHHHHHHHCCCC Q ss_conf 9998799997368654787420130-1243359999999-8856892875001116------821567999999858797 Q gi|254780234|r 70 FVRDFVFPMFRANALYEGYYLLGTA-IARPLIAKYLVDI-ANETGADAIAHGSTGK------GNDQVRFELSAYSLNSDI 141 (404) Q Consensus 70 f~~~~i~~~I~ana~Yeg~Ypl~ts-laRplia~~lv~~-a~~~ga~~iaHG~Tgk------GNDQvRFe~~~~~l~P~l 141 (404) ++++...|.|. ..-+|+.|+--- --|.+---++-+. -...|+|+||.|--.| +|+..|........-++- T Consensus 79 YW~~Vfs~~L~--~Y~~G~TPNPDI~CN~~IKFg~~~~~a~en~~~d~latGHYAr~~~~~~~~~~~~l~~~~d~~KDQt 156 (377) T KOG2805 79 YWNDVFSPFLE--EYENGRTPNPDILCNKHIKFGKFFKHAIENLGYDWLATGHYARVVLEDEDNAESHLLISKDMVKDQT 156 (377) T ss_pred HHHHHHHHHHH--HHHCCCCCCCCCCCCCEEECCHHHHHHHHHCCCCEEEEEEEEEEECCCCCCCCEEEEECCCCCCCCE T ss_conf 99999999988--8765899999713445142117899998755887377421214403755676235761256667732 Q ss_pred EEEEEECCC----------CCCCHHHHHHHHHHHCCCC Q ss_conf 077441006----------8763799999999708967 Q gi|254780234|r 142 EIIAPWRHW----------SFKGRQDLIDFAEKHAIPI 169 (404) Q Consensus 142 ~viaP~Rd~----------~~~sRe~~i~ya~~~gIpv 169 (404) -.++-++-. .++ .++--..|++.|+|+ T Consensus 157 ~FL~~in~~~L~r~lfPlg~~~-K~eVk~lA~~~gf~~ 193 (377) T KOG2805 157 YFLSTINQTQLKRLLFPLGCLT-KSEVKKLAKQAGFPN 193 (377) T ss_pred EEEECCCHHHHHHHHCCCCCCC-HHHHHHHHHHCCCCC T ss_conf 6763036999986206676567-799999998659864 No 26 >pfam02568 ThiI Thiamine biosynthesis protein (ThiI). ThiI is required for thiazole synthesis, required for thiamine biosynthesis. Probab=98.88 E-value=2.1e-07 Score=73.39 Aligned_cols=148 Identities=22% Similarity=0.363 Sum_probs=102.3 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCC--C-CHHHHHHHHHHHC---C---CEEEEEECHHHHHHHHHHH Q ss_conf 7899991477259999999987189869999945787--1-1068999999973---9---8079982008999998799 Q gi|254780234|r 6 KKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQ--G-EELKIASDKARLL---G---AKEVYVKDLRREFVRDFVF 76 (404) Q Consensus 6 kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq--~-~d~~~~~~~A~~~---G---a~~~~v~D~r~ef~~~~i~ 76 (404) .|++..+|||+||+|+ .||..++|++|++++.+.+- . +..+.+++-+..+ + -.+++++|+.+.+. + T Consensus 4 gk~l~LlSGGiDSpVA-a~lmmkRG~~V~~vhf~~~p~~~~~~~~k~~~l~~~l~~y~~~~~~~l~vv~~~~~~~-~--- 78 (197) T pfam02568 4 GKVLALLSGGIDSPVA-AYLMMRRGCRVVALHFINEPGTSEEAIEKVRKLAELLAEYGTSCEGKLVVVDFTKVQK-E--- 78 (197) T ss_pred CCEEEEECCCCHHHHH-HHHHHHCCCEEEEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCCCEEEEECCHHHHH-H--- T ss_conf 7189886687129999-9999987997999998799999899999999999999973788754499957499999-9--- Q ss_pred HHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHH-HCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCH Q ss_conf 99736865478742013012433599999998856892875001-11682156799999985879707744100687637 Q gi|254780234|r 77 PMFRANALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGS-TGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGR 155 (404) Q Consensus 77 ~~I~ana~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~-TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sR 155 (404) |..+. ...|+ +-+-|-.+-+..-++|++.||++|..|- .|-=-+|-=.++..-.-+-++.|+-|.- .|. . T Consensus 79 --i~~~~--~~~~~--cv~cKr~M~r~A~~iA~~~ga~~IVTGEsLGQVaSQTl~nl~~i~~~~~~pilRPLi--g~D-K 149 (197) T pfam02568 79 --IIEKA--PEKYR--CVLCKRCMYRAAEKVAEEEGADALVTGESLGQVASQTLDNLRVISAATNLPILRPLI--GLD-K 149 (197) T ss_pred --HHHCC--CCCCE--EHHHHHHHHHHHHHHHHHCCCCEEEECCCHHHHHHCCCCHHHHHHHHHCCCCCCCCC--CCC-H T ss_conf --99628--98765--456999999999999998499899847303121004531069998762485313434--699-9 Q ss_pred HHHHHHHHHHCC Q ss_conf 999999997089 Q gi|254780234|r 156 QDLIDFAEKHAI 167 (404) Q Consensus 156 e~~i~ya~~~gI 167 (404) +|-++.|++-|. T Consensus 150 ~EIi~~Ar~IGt 161 (197) T pfam02568 150 EEIINLAKEIGT 161 (197) T ss_pred HHHHHHHHHHCC T ss_conf 999999999486 No 27 >PRK10660 tilS tRNA(Ile)-lysidine synthetase; Provisional Probab=98.87 E-value=1.2e-07 Score=75.24 Aligned_cols=162 Identities=17% Similarity=0.202 Sum_probs=95.9 Q ss_pred CCCCEEEEEECCCHHHHHHHHHH---HHHC-CCEEEEEEEECCC-C---CHHHHHHHHHHHCCCEEEEEECHHHHHHHHH Q ss_conf 77778999914772599999999---8718-9869999945787-1---1068999999973980799820089999987 Q gi|254780234|r 3 RDVKKVVLAYSGGLDTSIILKWL---QVEK-GLEVIVFIADLGQ-G---EELKIASDKARLLGAKEVYVKDLRREFVRDF 74 (404) Q Consensus 3 ~~~kkVvlaySGGLDTSv~i~~L---~~e~-g~eVi~~~~d~Gq-~---~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~ 74 (404) .+.+||++|+|||.||.+++..| +++. +..++++++|=|- + +|...+++-|.++|+ ++++.++.- T Consensus 11 ~~~~~ilvavSGG~DS~~LL~~L~~l~~~~~~~~l~~~HvnHgl~~~ad~~~~fv~~~c~~~~i-p~~~~~~~v------ 83 (433) T PRK10660 11 LESRQILVAFSGGLDSTVLLHQLVQWRTQNPGVTLRAIHIHHGLSPNADSWVKHCEQVCQQWQV-PLVVERVQL------ 83 (433) T ss_pred CCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHCCC-CEEEEEEEE------ T ss_conf 9999899997280999999999999998668982899997189792669999999999997599-789999872------ Q ss_pred HHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHH--HHHHHHHHCCC-CEEEEEECC-- Q ss_conf 9999736865478742013012433599999998856892875001116821567--99999985879-707744100-- Q gi|254780234|r 75 VFPMFRANALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQVR--FELSAYSLNSD-IEIIAPWRH-- 149 (404) Q Consensus 75 i~~~I~ana~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvR--Fe~~~~~l~P~-l~viaP~Rd-- 149 (404) .. .+. -.-.-||-+=-+..-+.+.+.++=+.|| -.|||+= |=.-++--+|. |.=+.|.|. T Consensus 84 -----~~----~~~--~~E~aAR~~RY~~f~~~~~~~~~l~tAH----h~dDQaETvLlrL~RGsG~~GL~gm~~~r~~~ 148 (433) T PRK10660 84 -----AQ----EGL--GIEAAARQARYQAFARTLLPGEVLVTAQ----HLDDQCETFLLALKRGSGPAGLSAMAEVSPFA 148 (433) T ss_pred -----CC----CCC--CHHHHHHHHHHHHHHHHHHHCCEEEECC----CCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCC T ss_conf -----79----997--6999999999999999874388799624----56519999999986589964456775224478 Q ss_pred -------C-CCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHCC Q ss_conf -------6-87637999999997089675665678864157243115 Q gi|254780234|r 150 -------W-SFKGRQDLIDFAEKHAIPIDKNKRGEAPFSIDTNLLHS 188 (404) Q Consensus 150 -------~-~~~sRe~~i~ya~~~gIpv~~~~~~~~~yS~D~Nlwg~ 188 (404) + ++ +|++..+||+++|||.-...+|. -=..+.|-.-. T Consensus 149 ~~~liRPLL~~-~r~eI~~Y~~~~~l~~~eD~SN~-d~~y~RN~iR~ 193 (433) T PRK10660 149 GTQLLRPLLAR-TRGELEQWAQAHGLRWIEDESNQ-DDRYDRNFLRL 193 (433) T ss_pred CCEEECCCHHH-HHHHHHHHHHHCCCCCEECCCCC-CCCCCHHHHHH T ss_conf 97077374466-59999999998499803899999-76112599999 No 28 >pfam01171 ATP_bind_3 PP-loop family. This family of proteins belongs to the PP-loop superfamily. Probab=98.79 E-value=4.1e-07 Score=71.35 Aligned_cols=147 Identities=20% Similarity=0.314 Sum_probs=94.9 Q ss_pred EEEEEECCCHHHHHHHHHHH---HHCCCEEEEEEEECCC-C---CHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHH Q ss_conf 89999147725999999998---7189869999945787-1---106899999997398079982008999998799997 Q gi|254780234|r 7 KVVLAYSGGLDTSIILKWLQ---VEKGLEVIVFIADLGQ-G---EELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMF 79 (404) Q Consensus 7 kVvlaySGGLDTSv~i~~L~---~e~g~eVi~~~~d~Gq-~---~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I 79 (404) ||++|+|||.||.+++..|. ...+.+++++++|-|. + ++.+.+++-|.++|. ++++......- T Consensus 1 ki~vavSGG~DS~~Ll~~l~~~~~~~~~~~~~~hvnh~lr~~s~~~~~~v~~~~~~~~i-~~~i~~~~~~~--------- 70 (182) T pfam01171 1 KILVAVSGGPDSMALLYLLKKLKPKFGIDLTAAHVDHGLREESDREAQFVKELCRQLNI-PLEVLRVDVAK--------- 70 (182) T ss_pred CEEEEECCCHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHCCC-CEEEEEEECCC--------- T ss_conf 99999678599999999999999975997899998799865405899999999998599-75999973476--------- Q ss_pred HCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHH-H-HHHHHH-----H---C-----CCCEEE Q ss_conf 36865478742013012433599999998856892875001116821567-9-999998-----5---8-----797077 Q gi|254780234|r 80 RANALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQVR-F-ELSAYS-----L---N-----SDIEII 144 (404) Q Consensus 80 ~ana~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvR-F-e~~~~~-----l---~-----P~l~vi 144 (404) ..+. .....+|-.=-..+.++|++.|++.|+-|-+ -+||+- | -.-++- | . .++.++ T Consensus 71 ~~~~-------~~e~~aR~~Ry~~l~~~a~~~~~~~i~lgHh--~DD~~ET~lm~l~rG~~~~gl~gm~~~~~~~~~~ii 141 (182) T pfam01171 71 KSGL-------NLEEAAREARYDFFEEIAKKNGAEVLLTAHH--ADDQAETFLMRLLRGSGLAGLAGIAPVRPLAGGRIV 141 (182) T ss_pred CCCC-------CHHHHHHHHHHHHHHHHHHHHCCCEEEEECC--CCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCEEE T ss_conf 6787-------7757899999999999898617664887434--232999999999728981103388751346894387 Q ss_pred EEECCCCCCCHHHHHHHHHHHCCCCCCCCCC Q ss_conf 4410068763799999999708967566567 Q gi|254780234|r 145 APWRHWSFKGRQDLIDFAEKHAIPIDKNKRG 175 (404) Q Consensus 145 aP~Rd~~~~sRe~~i~ya~~~gIpv~~~~~~ 175 (404) -|.. .+ +|++.++||++++||.-..+.+ T Consensus 142 RPLl--~~-~k~ei~~~a~~~~l~~~~D~sN 169 (182) T pfam01171 142 RPLL--KV-TKSEIEEYLKEHGIPWVEDESN 169 (182) T ss_pred CCCC--CC-CHHHHHHHHHHCCCCEEECCCC T ss_conf 0211--48-8999999999869936899088 No 29 >PRK10696 C32 tRNA thiolase; Provisional Probab=98.75 E-value=8e-07 Score=69.26 Aligned_cols=168 Identities=23% Similarity=0.291 Sum_probs=105.3 Q ss_pred CCCCCCEEEEEECCCHHHHHHHHHHHHH-----CCCEEEEEEEECCCCC-HHHHHHHHHHHCCCEEEEEECHHHHHHHHH Q ss_conf 9877778999914772599999999871-----8986999994578711-068999999973980799820089999987 Q gi|254780234|r 1 MSRDVKKVVLAYSGGLDTSIILKWLQVE-----KGLEVIVFIADLGQGE-ELKIASDKARLLGAKEVYVKDLRREFVRDF 74 (404) Q Consensus 1 M~~~~kkVvlaySGGLDTSv~i~~L~~e-----~g~eVi~~~~d~Gq~~-d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~ 74 (404) |-++-.||++|.|||-||.+++..|+.- ..++++++++|-||+. +.+.+.+-...+|. +++++.- +.+ .. T Consensus 35 MIedGDRVlVglSGGKDS~~LL~iL~~Lq~~api~FeLvAv~lD~~~pGf~~~~L~~yl~~lGv-p~~i~~~-d~~--~i 110 (311) T PRK10696 35 MIEEGDRIMVCLSGGKDSYTMLEILRNLQQSAPINFSLVAVNLDQKQPGFPEHILPEYLEKLGV-EYKIVEE-NTY--GI 110 (311) T ss_pred CCCCCCEEEEECCCCHHHHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCCCHHHHHHHHHHCCC-CCEEEEE-EHH--HH T ss_conf 7789999999826788899999999999985899855999983789999880688999997499-7079984-076--79 Q ss_pred HHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHH-HHHH------HHHHCCC------- Q ss_conf 9999736865478742013012433599999998856892875001116821567-9999------9985879------- Q gi|254780234|r 75 VFPMFRANALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQVR-FELS------AYSLNSD------- 140 (404) Q Consensus 75 i~~~I~ana~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvR-Fe~~------~~~l~P~------- 140 (404) +... ..+|..+ +++.+-+=-..+-++|++.||+.||-|- --+|++- |=+. ++++-|. T Consensus 111 v~~~-----~~egks~--CslCsRlRRg~Ly~~A~e~G~nKIALGH--H~DDi~ETfLMNlf~gG~LktM~Pkl~~d~g~ 181 (311) T PRK10696 111 VKEK-----IPEGKTT--CSLCSRLRRGILYRTATELGATKIALGH--HRDDILQTLFLNMFYGGKMKGMPPKLMSDDGK 181 (311) T ss_pred HHHH-----CCCCCCH--HHHHHHHHHHHHHHHHHHCCCCEEEECC--CHHHHHHHHHHHHHHCCCCCCCCCEEECCCCC T ss_conf 9875-----4368863--7899999999999999986998798606--50558999999999668763679766748997 Q ss_pred CEEEEEECCCCCCCHHHHHHHHHHHCCC-CCCCCCCCCCCCCCHHHHCCC Q ss_conf 7077441006876379999999970896-756656788641572431154 Q gi|254780234|r 141 IEIIAPWRHWSFKGRQDLIDFAEKHAIP-IDKNKRGEAPFSIDTNLLHSS 189 (404) Q Consensus 141 l~viaP~Rd~~~~sRe~~i~ya~~~gIp-v~~~~~~~~~yS~D~Nlwg~S 189 (404) +.||-|. -.-...+.++||+..++| +|-..- ..++||---. T Consensus 182 ~~VIRPL---~y~~E~di~~~a~~~~fPiIpcnlc-----gsq~~lqR~~ 223 (311) T PRK10696 182 HIVIRPL---AYCREKDIERFADAKAFPIIPCNLC-----GSQPNLQRQV 223 (311) T ss_pred EEEEEEC---CCCCHHHHHHHHHHCCCCEECCCCC-----CCCHHHHHHH T ss_conf 5998304---2015999999999759987416677-----8836789999 No 30 >cd01992 PP-ATPase N-terminal domain of predicted ATPase of the PP-loop faimly implicated in cell cycle control [Cell division and chromosome partitioning]. This is a subfamily of Adenine nucleotide alpha hydrolases superfamily.Adeninosine nucleotide alpha hydrolases superfamily includes N type ATP PPases and ATP sulphurylases. It forms a apha/beta/apha fold which binds to Adenosine group. This domain has a strongly conserved motif SGGXD at the N terminus. Probab=98.71 E-value=3.7e-07 Score=71.68 Aligned_cols=147 Identities=23% Similarity=0.322 Sum_probs=94.7 Q ss_pred EEEEEECCCHHHHHHHHHHH---HHCCCEEEEEEEECC-CC---CHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHH Q ss_conf 89999147725999999998---718986999994578-71---106899999997398079982008999998799997 Q gi|254780234|r 7 KVVLAYSGGLDTSIILKWLQ---VEKGLEVIVFIADLG-QG---EELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMF 79 (404) Q Consensus 7 kVvlaySGGLDTSv~i~~L~---~e~g~eVi~~~~d~G-q~---~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I 79 (404) ||++|+|||.||.+++..|. ...+.+++++++|-| ++ ++.+.+++-|..+|. ++++.+....-. . T Consensus 1 ki~vavSGG~DS~~Ll~~l~~~~~~~~~~l~a~hvdh~lr~~s~~~~~~v~~~~~~~~i-~~~i~~~~~~~~-------~ 72 (185) T cd01992 1 KILVAVSGGPDSMALLHLLSELKPRLGLRLVAVHVDHGLRPESDEEAAFVADLCAKLGI-PLYILVVALAPK-------P 72 (185) T ss_pred CEEEEECCCHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCHHHHHHHHHHHHHCCC-CEEEEEEEECCC-------C T ss_conf 99999678499999999999999974994899998189888888999999999998599-889999775367-------8 Q ss_pred HCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHH--HHHHHHHHC-------------CCCEEE Q ss_conf 36865478742013012433599999998856892875001116821567--999999858-------------797077 Q gi|254780234|r 80 RANALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQVR--FELSAYSLN-------------SDIEII 144 (404) Q Consensus 80 ~ana~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvR--Fe~~~~~l~-------------P~l~vi 144 (404) +.+ ....+|-+=-+.+.+.|++.|++.|+-|-+ .|||+- |-.-++--. .++.++ T Consensus 73 ~~~---------~e~~aR~~Ry~~l~~~~~~~~~~~i~lgHh--~dD~~ET~lm~l~rg~~~~gl~gm~~~~~~~~~~ii 141 (185) T cd01992 73 GGN---------LEAAAREARYDFFAEIAKEHGADVLLTAHH--ADDQAETVLMRLLRGSGLRGLAGMPARIPFGGGRLI 141 (185) T ss_pred CCC---------HHHHHHHHHHHHHHHHHHHHCCCCEEECCC--HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCEEE T ss_conf 999---------999999999999999998735450420363--036899999998718996415277841357995287 Q ss_pred EEECCCCCCCHHHHHHHHHHHCCCCCCCCCC Q ss_conf 4410068763799999999708967566567 Q gi|254780234|r 145 APWRHWSFKGRQDLIDFAEKHAIPIDKNKRG 175 (404) Q Consensus 145 aP~Rd~~~~sRe~~i~ya~~~gIpv~~~~~~ 175 (404) =|.. ++ +|++..+||++++||.-..+++ T Consensus 142 RPLL--~~-~k~ei~~~~~~~~i~~~~D~sN 169 (185) T cd01992 142 RPLL--GI-TRAEIEAYLRENGLPWWEDPSN 169 (185) T ss_pred EHHH--HH-HHHHHHHHHHHCCCCEEECCCC T ss_conf 1577--85-3999999999849956799087 No 31 >TIGR00364 TIGR00364 exsB protein; InterPro: IPR004479 This protein family is represented by a single member in nearly every completed large (> 1000 genes) prokaryotic genome. In Sinorhizobium meliloti, a species in which the exo genes make succinoglycan, a symbiotically important exopolysaccharide, exsB is located nearby and affects succinoglycan levels, probably through polar effects on exsA expression or the same polycistronic mRNA , . In Arthrobacter viscosus, the homologous gene is designated alu1 and is associated with an aluminum tolerance phenotype. When expressed in Escherichia coli, it conferred aliminium tolerance . The entry also contains the gene queC, which is responsible for the conversion of GTP to 7-cyano-7-deazaguanine (preQ0). The biosynthesis of hypermodified tRNA nucleoside queuosine only occurs in eubacteria. It occupies the wobble position for all known tRNAs that are specific for Asp, Asn, His or Tyr .. Probab=98.68 E-value=4.2e-08 Score=78.46 Aligned_cols=159 Identities=23% Similarity=0.291 Sum_probs=105.0 Q ss_pred EEEECCCHHHHHHHHHHHHHCC--CEEEEEEEECCCC---CHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHH-HHHHCC Q ss_conf 9991477259999999987189--8699999457871---106899999997398079982008999998799-997368 Q gi|254780234|r 9 VLAYSGGLDTSIILKWLQVEKG--LEVIVFIADLGQG---EELKIASDKARLLGAKEVYVKDLRREFVRDFVF-PMFRAN 82 (404) Q Consensus 9 vlaySGGLDTSv~i~~L~~e~g--~eVi~~~~d~Gq~---~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~-~~I~an 82 (404) |+-||||+|||+|+.|++.+ | |||+|+|=|=||. .|++.+++-|..+|+..+.++|+ .|+++... -++..+ T Consensus 2 v~~lSGG~DStT~~~~a~~~-~GkyeV~a~TF~YGQR~H~~Ele~A~~ia~~Lgi~~~~~~Dl--~~l~~l~~YsaLt~~ 78 (227) T TIGR00364 2 VVVLSGGQDSTTVLLIALDE-GGKYEVHAITFDYGQRAHSRELESARKIAEALGIRHHFVIDL--SLLKQLGKYSALTDE 78 (227) T ss_pred EEEEECCHHHHHHHHHHHHC-CCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHCCCCEEEECH--HHHHHHHCCCCCCCC T ss_conf 68743734689999999961-795079985465013789999999999999808970786177--999855404523578 Q ss_pred CCC-CCC------------CCCCCCHHHHHHH-HHHHHHHHHCCCCEEECHH------HCCCCHHHH---HHHHHH-H-H Q ss_conf 654-787------------4201301243359-9999998856892875001------116821567---999999-8-5 Q gi|254780234|r 83 ALY-EGY------------YLLGTAIARPLIA-KYLVDIANETGADAIAHGS------TGKGNDQVR---FELSAY-S-L 137 (404) Q Consensus 83 a~Y-eg~------------Ypl~tslaRplia-~~lv~~a~~~ga~~iaHG~------TgkGNDQvR---Fe~~~~-~-l 137 (404) ..- +.. -|.+==-+|=+|- .-.+-+|.+.||..|.-|- +.==|...= |+.++. + . T Consensus 79 ~~~~~~~~~e~d~~~~~E~~p~tfVPgRN~~fl~~a~~yA~~~gA~~v~~G~~~~DfSgYPDCr~EFvka~n~~l~Lgn~ 158 (227) T TIGR00364 79 QEIPEQKSNEEDKEKALETLPNTFVPGRNLIFLSLAASYAEALGAEAVITGVCETDFSGYPDCRDEFVKALNHALNLGNM 158 (227) T ss_pred CCCCCCCCCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHCC T ss_conf 75676666532023565311333378047999999999888618425220010003657988868899999999997412 Q ss_pred CCCCEE-----EEEECCCCCCCHHHHHHHHHHHC-----CCCCCCC Q ss_conf 879707-----74410068763799999999708-----9675665 Q gi|254780234|r 138 NSDIEI-----IAPWRHWSFKGRQDLIDFAEKHA-----IPIDKNK 173 (404) Q Consensus 138 ~P~l~v-----iaP~Rd~~~~sRe~~i~ya~~~g-----Ipv~~~~ 173 (404) .-.++| -||.- .+ +-.|...-+.+-| +|...|- T Consensus 159 ~~~v~i~Piql~aPL~--~l-tKaeiv~la~elGkk~l~lv~~~T~ 201 (227) T TIGR00364 159 LTPVEIRPIQLEAPLM--DL-TKAEIVKLADELGKKLLDLVIKETY 201 (227) T ss_pred CCCCEEEEEEEECCEE--EC-CHHHHHHHHHHHCCCEEEEEECCCC T ss_conf 6881120246666701--26-9899999988735112311000111 No 32 >PRK08349 hypothetical protein; Validated Probab=98.64 E-value=4.3e-06 Score=64.01 Aligned_cols=151 Identities=21% Similarity=0.300 Sum_probs=100.4 Q ss_pred EEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCCCH---HHHHHHHHHHCCC--EEEEEECHHHHHHHHHHHHHHHC Q ss_conf 899991477259999999987189869999945787110---6899999997398--07998200899999879999736 Q gi|254780234|r 7 KVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQGEE---LKIASDKARLLGA--KEVYVKDLRREFVRDFVFPMFRA 81 (404) Q Consensus 7 kVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~d---~~~~~~~A~~~Ga--~~~~v~D~r~ef~~~~i~~~I~a 81 (404) ||+..+|||.||.|+ .|+....|++|++++.+.|+..+ .+.++.-+.-.|. ..++++|..+.+.. ++..|.. T Consensus 2 Kvl~LlSGGiDSPVA-a~~mmKRG~~V~~lhf~~~~~~~~kv~~~~~~L~~~~~~~~~~~~iv~~~~~~~~--i~~~i~~ 78 (198) T PRK08349 2 KVVALLSSGIDSPVA-IYLMLSRGVEIYPLHFRQDEKKEHKARELVEILQEIHGGKVKDPVIVDAYEVQGP--VFEKLRE 78 (198) T ss_pred EEEEEECCCCCHHHH-HHHHHHCCCEEEEEEECCCHHHHHHHHHHHHHHHHHCCCCCCEEEEECCHHHHHH--HHHHHHH T ss_conf 499996588438999-9999977997999986387788999999999999970888752899772253289--9999986 Q ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHH-HCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHH Q ss_conf 865478742013012433599999998856892875001-1168215679999998587970774410068763799999 Q gi|254780234|r 82 NALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGS-TGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDLID 160 (404) Q Consensus 82 na~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~-TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~i~ 160 (404) +. ..+| .+-+-|-..-+..-++|++.|++++..|- -|-=-.|-==++..-.-+-++.|+-|.- .+. .+|-|+ T Consensus 79 ~~--~~~~--~~vl~rr~M~riA~~iA~~~g~~aivTGEsLGQVASQTl~NL~~i~~~~~~pVlRPLi--g~D-K~EII~ 151 (198) T PRK08349 79 IG--KEKW--TCLFCKYTMYRVAERYAHEIGAKAIVTGDSLGQVASQTLDNLMVISTATDLPILRPLI--GLD-KEEIVR 151 (198) T ss_pred CC--CCCC--EEHHHHHHHHHHHHHHHHHCCCCEEEECCHHHHHHHHHHHHHHHHHHHHCCCCCCCCC--CCC-HHHHHH T ss_conf 07--7651--3099999999999999998599889845216788899998899998750676647766--799-899999 Q ss_pred HHHHHCC Q ss_conf 9997089 Q gi|254780234|r 161 FAEKHAI 167 (404) Q Consensus 161 ya~~~gI 167 (404) +|++-|- T Consensus 152 ~Ar~IGT 158 (198) T PRK08349 152 IAKEIGT 158 (198) T ss_pred HHHHHCC T ss_conf 9998195 No 33 >cd01994 Alpha_ANH_like_IV This is a subfamily of Adenine nucleotide alpha hydrolases superfamily.Adeninosine nucleotide alpha hydrolases superfamily includes N type ATP PPases and ATP sulphurylases. It forms a apha/beta/apha fold which binds to Adenosine group. This subfamily of proteins is predicted to bind ATP. This domainhas a strongly conserved motif SGGKD at the N terminus. Probab=98.63 E-value=3.7e-06 Score=64.47 Aligned_cols=149 Identities=24% Similarity=0.291 Sum_probs=108.0 Q ss_pred EEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECC--------CCCHHHHHHHHHHHCCCEEEEEECH---HHHHHHHHH Q ss_conf 89999147725999999998718986999994578--------7110689999999739807998200---899999879 Q gi|254780234|r 7 KVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLG--------QGEELKIASDKARLLGAKEVYVKDL---RREFVRDFV 75 (404) Q Consensus 7 kVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~G--------q~~d~~~~~~~A~~~Ga~~~~v~D~---r~ef~~~~i 75 (404) ||++.||||-||++++.++. +.|++|+|+..=+- |.-..+-++..|..+|. +.+.+.. .+++.+++ T Consensus 1 Kv~~l~SGGKDS~lAl~~a~-~~g~~v~~L~~~~~~~~~~~~~H~~~~~l~~~qAealgi-Pl~~~~~~~~~~~~~~~l- 77 (194) T cd01994 1 KVVALISGGKDSCYALYRAL-EEGHEVVALLNLTPEEGSSMMYHTVNHELLELQAEAMGI-PLIRIEISGEEEDEVEDL- 77 (194) T ss_pred CEEEEECCCHHHHHHHHHHH-HCCCEEEEEEEEECCCCCEEECCCCCHHHHHHHHHHCCC-CEEEEECCCCCCHHHHHH- T ss_conf 98999778599999999999-869925999999639988052505578999999998599-669996689871799999- Q ss_pred HHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHH-HHHHHHHHHHCCCCEEEEEECCCCCCC Q ss_conf 9997368654787420130124335999999988568928750011168215-679999998587970774410068763 Q gi|254780234|r 76 FPMFRANALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQ-VRFELSAYSLNSDIEIIAPWRHWSFKG 154 (404) Q Consensus 76 ~~~I~ana~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQ-vRFe~~~~~l~P~l~viaP~Rd~~~~s 154 (404) +.++..+++.|++.|.+|.--. ++| -|.|..+..+ .++.++|. |+. . T Consensus 78 --------------------------~~~L~~~k~~gi~~vv~GdI~s-~~qr~~~e~~c~~l--gl~~~~PL--W~~-~ 125 (194) T cd01994 78 --------------------------KELLRKLKEEGVDAVVFGAILS-EYQRTRVERVCERL--GLEPLAPL--WGR-D 125 (194) T ss_pred --------------------------HHHHHHHHHCCCCEEEECCCCC-HHHHHHHHHHHHHC--CCEEECHH--CCC-C T ss_conf --------------------------9999999975995999996332-88999999999973--98887001--079-9 Q ss_pred HHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHCCCCCC Q ss_conf 79999999970896756656788641572431154576 Q gi|254780234|r 155 RQDLIDFAEKHAIPIDKNKRGEAPFSIDTNLLHSSSEG 192 (404) Q Consensus 155 Re~~i~ya~~~gIpv~~~~~~~~~yS~D~Nlwg~S~Eg 192 (404) +++.+.-+-+.|+.+-.++. +.+-.|+...|+.+.- T Consensus 126 ~~~ll~e~i~~Gf~aiiv~V--~a~~L~~~~lGr~id~ 161 (194) T cd01994 126 QEELLREMIEAGFKAIIIKV--AAEGLDESWLGREIDE 161 (194) T ss_pred HHHHHHHHHHCCCEEEEEEE--ECCCCCHHHCCCCCCH T ss_conf 99999999987990999995--0268996895988048 No 34 >PRK00919 GMP synthase subunit B; Validated Probab=98.60 E-value=7.5e-07 Score=69.48 Aligned_cols=100 Identities=22% Similarity=0.330 Sum_probs=75.3 Q ss_pred CCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCC--CCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCC Q ss_conf 77899991477259999999987189869999945787--1106899999997398079982008999998799997368 Q gi|254780234|r 5 VKKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQ--GEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRAN 82 (404) Q Consensus 5 ~kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq--~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~an 82 (404) .+||++|.|||+|||||..+|..--|-.++|+++|.|- ..|.+.+.+-... |. ...++|+++.|.+. +++- T Consensus 20 ~~kvi~~lSGGVDStV~A~Ll~kAig~~l~~v~VD~GllR~~E~~~V~~~~~~-~~-~l~~vda~~~Fl~~-----L~gv 92 (306) T PRK00919 20 DGKAIIALSGGVDSSVAAVLAHRAIGDRLLAVYVDTGLMRKGETERIREIFKD-GL-NLRIVDASDRFLEA-----LKGV 92 (306) T ss_pred CCCEEEEECCCCHHHHHHHHHHHHHHCCEEEEEECCCCCCCCCHHHHHHHHHH-CC-CCEEEEHHHHHHHH-----HCCC T ss_conf 98599991688479999999998864265999986898879869999999983-58-92898349999997-----3799 Q ss_pred CCCCCCCCCCCCHHHHHHHHHH----HHHHHHCCCCEEECH Q ss_conf 6547874201301243359999----999885689287500 Q gi|254780234|r 83 ALYEGYYLLGTAIARPLIAKYL----VDIANETGADAIAHG 119 (404) Q Consensus 83 a~Yeg~Ypl~tslaRplia~~l----v~~a~~~ga~~iaHG 119 (404) .-=| --|-.|.+.- -+.|++.|++.+|+| T Consensus 93 ~DPE--------~KRKiIG~~Fi~vfe~~~~~~~~~~LaQG 125 (306) T PRK00919 93 TDPE--------EKRKIIGETFIRVFEEVAKEIGAEYLVQG 125 (306) T ss_pred CCHH--------HCCCHHHHHHHHHHHHHHHHHCCCEEEEC T ss_conf 8956--------74602579999999999998298768511 No 35 >cd01997 GMP_synthase_C The C-terminal domain of GMP synthetase. It contains two subdomains; the ATP pyrophosphatase domain which closes to the N-termial and the dimerization domain at C-terminal end. The ATP-PPase is a twisted, five-stranded parallel beta-sheet sandwiched between helical layers. It has a signature nucleotide-binding motif, or P-loop, at the end of the first-beta strand.The dimerization domain formed by the C-terminal 115 amino acid for prokaryotic proteins. It is adjacent to teh ATP-binding site of the ATP-PPase subdomain. The largest difference between the primary sequence of prokaryotic and eukaryotic GMP synthetase map to the dimerization domain.Eukaryotic GMP synthetase has several large insertions relative to prokaryotes. Probab=98.58 E-value=1.1e-06 Score=68.34 Aligned_cols=145 Identities=22% Similarity=0.373 Sum_probs=93.8 Q ss_pred EEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCC--CCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCC Q ss_conf 899991477259999999987189869999945787--110689999999739807998200899999879999736865 Q gi|254780234|r 7 KVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQ--GEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANAL 84 (404) Q Consensus 7 kVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq--~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~ 84 (404) ||++|.|||+|||||..+|..--|-.++|+++|.|- ..|.+.+++-..+.+-....++|++++|.+. ++.-.- T Consensus 1 kVi~~lSGGVDStV~A~Ll~kAig~~l~~vfVDnGlmRk~E~e~V~~~~~~~~~~~~~~vdas~~Fl~~-----L~gv~D 75 (295) T cd01997 1 KVILALSGGVDSTVAAVLLHKAIGDRLTCVFVDNGLLRKNEAERVEELFSKLLGINLIVVDASERFLSA-----LKGVTD 75 (295) T ss_pred CEEEEECCCHHHHHHHHHHHHHHHCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCEEEECHHHHHHHH-----HCCCCC T ss_conf 989990578289999999998865056999985897888829999999987069977995669999998-----658779 Q ss_pred CCCCCCCCCCHHHHHHHHHH----HHHHHHCC-CCEEECHH--------HC-CCC-HHHHHH--HH--HHHHCCCCEEEE Q ss_conf 47874201301243359999----99988568-92875001--------11-682-156799--99--998587970774 Q gi|254780234|r 85 YEGYYLLGTAIARPLIAKYL----VDIANETG-ADAIAHGS--------TG-KGN-DQVRFE--LS--AYSLNSDIEIIA 145 (404) Q Consensus 85 Yeg~Ypl~tslaRplia~~l----v~~a~~~g-a~~iaHG~--------Tg-kGN-DQvRFe--~~--~~~l~P~l~via 145 (404) =|- -|-.|.+.- -+.|++.+ +.++++|- .+ +|. +-+.-. .. .+.+ +++++- T Consensus 76 PE~--------KRKiIG~~Fi~vf~~~~~~~~~~~~L~QGTlyPDvIES~~~~~~a~~IKsHHNvggLp~~~--~~kliE 145 (295) T cd01997 76 PEE--------KRKIIGETFIEVFEEEAKKLGLAEYLAQGTLYPDVIESGSGKGSADTIKSHHNVGGLPEDM--KLKLIE 145 (295) T ss_pred HHH--------HHHHHHHHHHHHHHHHHHHCCCCCEEEECCCCCCEEECCCCCCCCCEEEEECCCCCCCHHC--CCCCCH T ss_conf 256--------3656538999999999975588618972430262575078888875145224567772322--654001 Q ss_pred EECCCCCCCHHHHHHHHHHHCCCC Q ss_conf 410068763799999999708967 Q gi|254780234|r 146 PWRHWSFKGRQDLIDFAEKHAIPI 169 (404) Q Consensus 146 P~Rd~~~~sRe~~i~ya~~~gIpv 169 (404) |.|+ | -.+|--+..++-|+|- T Consensus 146 PLr~--L-fKDEVR~lG~~LGlp~ 166 (295) T cd01997 146 PLRD--L-FKDEVRELGRELGLPE 166 (295) T ss_pred HHHH--H-HHHHHHHHHHHHCCCH T ss_conf 6788--7-4599999999868978 No 36 >cd01986 Alpha_ANH_like Adenine nucleotide alpha hydrolases superfamily including N type ATP PPases and ATP sulphurylases. The domain forms a apha/beta/apha fold which binds to Adenosine group.. Probab=98.55 E-value=1.7e-07 Score=74.07 Aligned_cols=80 Identities=30% Similarity=0.404 Sum_probs=62.7 Q ss_pred EEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 99991477259999999987189869999945787110689999999739807998200899999879999736865478 Q gi|254780234|r 8 VVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYEG 87 (404) Q Consensus 8 VvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Yeg 87 (404) |+|+||||+|||+.+.+|+ +.|++|+++++|.|-.++.+..++.|.+ T Consensus 1 ~~v~~sgG~ds~~~~~~l~-~~~~~~~~~~~~~~~~~~~~~~~~~a~~-------------------------------- 47 (103) T cd01986 1 VLVAFSGGKDSSVAAALLK-KLGYQVIAVTVDHGISPRLEDAKEIAKE-------------------------------- 47 (103) T ss_pred CEEEECCCCHHHHHHHHHH-HCCCCEEEEEEECCCCCHHHHHHHHHHH-------------------------------- T ss_conf 9897428822199999999-8698628999853766278899999999-------------------------------- Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHH Q ss_conf 74201301243359999999885689287500111682156799 Q gi|254780234|r 88 YYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQVRFE 131 (404) Q Consensus 88 ~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe 131 (404) .....+.+.+.+.|++.|+.|-....+...+.. T Consensus 48 -----------~~~~~~~~~a~~~~~~~i~~G~~~~~~~~~~~~ 80 (103) T cd01986 48 -----------AREEAAKRIAKEKGAETIATGTRRDDVANRALG 80 (103) T ss_pred -----------HHHHHHHHHHHHCCCHHHCCCCCHHHHHHHHHH T ss_conf -----------999999999886172143047531677887761 No 37 >COG0519 GuaA GMP synthase, PP-ATPase domain/subunit [Nucleotide transport and metabolism] Probab=98.46 E-value=3.2e-06 Score=64.99 Aligned_cols=146 Identities=23% Similarity=0.379 Sum_probs=96.4 Q ss_pred CCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCC--CCHHHHHHHHHHH-CCCEEEEEECHHHHHHHHHHHHHHHC Q ss_conf 77899991477259999999987189869999945787--1106899999997-39807998200899999879999736 Q gi|254780234|r 5 VKKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQ--GEELKIASDKARL-LGAKEVYVKDLRREFVRDFVFPMFRA 81 (404) Q Consensus 5 ~kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq--~~d~~~~~~~A~~-~Ga~~~~v~D~r~ef~~~~i~~~I~a 81 (404) -+||.+|.|||+||||+..++..-.|-..+|+.+|.|- ..|-+.+.+-... +|. ....+|++++|.... ++ T Consensus 21 ~~kvi~alSGGVDSsv~a~L~~~AiGd~l~cvfVD~GLlR~~E~e~V~~~f~~~~~~-nl~~VdA~~~Fl~~L-----~G 94 (315) T COG0519 21 DGKVILALSGGVDSSVAAVLAHRAIGDQLTCVFVDHGLLRKGEAEQVVEMFREHLGL-NLIVVDAKDRFLSAL-----KG 94 (315) T ss_pred CCEEEEEECCCCCHHHHHHHHHHHHHCCEEEEEECCCCCCCCCHHHHHHHHHHHCCC-CEEEECHHHHHHHHH-----CC T ss_conf 853999833878089999999997405248999637743478569999998753298-459975498899884-----39 Q ss_pred CCCCCCCCCCCCCHHHHHHHHHH----HHHHHHCCCCEEECHH---------HCCCCHHHHHHHHHHHHCC--CCEEEEE Q ss_conf 86547874201301243359999----9998856892875001---------1168215679999998587--9707744 Q gi|254780234|r 82 NALYEGYYLLGTAIARPLIAKYL----VDIANETGADAIAHGS---------TGKGNDQVRFELSAYSLNS--DIEIIAP 146 (404) Q Consensus 82 na~Yeg~Ypl~tslaRplia~~l----v~~a~~~ga~~iaHG~---------TgkGNDQvRFe~~~~~l~P--~l~viaP 146 (404) =..-| --|-.|.+.- =+.|++.+++++|+|- +|++ +.+.-.-.+--|-. +++.+-| T Consensus 95 vtDPE--------~KRKiIG~~Fi~VFe~ea~k~~~~~LaQGTiYpDvIES~~g~~-~~IKSHHNVGGLP~~m~lkLvEP 165 (315) T COG0519 95 VTDPE--------EKRKIIGREFIEVFEEEAKKLGAEFLAQGTIYPDVIESGTGKA-GTIKSHHNVGGLPEDMKLKLVEP 165 (315) T ss_pred CCCHH--------HHHHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCEEEECCCCC-CCCCCCCCCCCCCCCCCEEEEHH T ss_conf 99889--------9899987999999999998578606870443450465068888-86213562688950142223077 Q ss_pred ECCCCCCCHHHHHHHHHHHCCC Q ss_conf 1006876379999999970896 Q gi|254780234|r 147 WRHWSFKGRQDLIDFAEKHAIP 168 (404) Q Consensus 147 ~Rd~~~~sRe~~i~ya~~~gIp 168 (404) .|+ + -.+|--..+++-|+| T Consensus 166 Lr~--L-fKDEVR~lg~~LGlp 184 (315) T COG0519 166 LRE--L-FKDEVRELGRELGLP 184 (315) T ss_pred HHH--H-HHHHHHHHHHHHCCC T ss_conf 898--8-689999999985998 No 38 >pfam01507 PAPS_reduct Phosphoadenosine phosphosulfate reductase family. This domain is found in phosphoadenosine phosphosulfate (PAPS) reductase enzymes or PAPS sulfotransferase. PAPS reductase is part of the adenine nucleotide alpha hydrolases superfamily also including N type ATP PPases and ATP sulphurylases. The enzyme uses thioredoxin as an electron donor for the reduction of PAPS to phospho-adenosine-phosphate (PAP). It is also found in NodP nodulation protein P from Rizobium which has ATP sulfurylase activity (sulfate adenylate transferase). Probab=98.45 E-value=3.7e-06 Score=64.51 Aligned_cols=151 Identities=17% Similarity=0.208 Sum_probs=92.7 Q ss_pred EEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCC--CCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCC Q ss_conf 899991477259999999987189869999945787--110689999999739807998200899999879999736865 Q gi|254780234|r 7 KVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQ--GEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANAL 84 (404) Q Consensus 7 kVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq--~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~ 84 (404) +|+++||||.||+|++.++. +.+.++-.+.+|.|- ++-++-+++-+...|. +..+......+.+...... . T Consensus 1 ~v~vsfSGGKDS~vlL~L~~-~~~~~~~vvf~Dtg~efpet~~~v~~~~~~~~~-~~~~~~~~~~~~~~~~~~~-----~ 73 (174) T pfam01507 1 ELVVSFSGGKDSLVLLHLAS-KAFPPGPVIFIDTGYEFPETYEFVDELEEKYGL-NLKVYRPEDSFAEGINPEG-----I 73 (174) T ss_pred CEEEEECCHHHHHHHHHHHH-HHCCCCCEEEEECCCCCHHHHHHHHHHHHHHCC-EEEEEECCHHHHHHHHHCC-----C T ss_conf 99999576099999999999-848997579997899868999999999998498-1899808677987763127-----9 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHH---CCCCEEEEEECCCCCCCHHHHHHH Q ss_conf 47874201301243359999999885689287500111682156799999985---879707744100687637999999 Q gi|254780234|r 85 YEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSL---NSDIEIIAPWRHWSFKGRQDLIDF 161 (404) Q Consensus 85 Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l---~P~l~viaP~Rd~~~~sRe~~i~y 161 (404) ....+...+ .+-....-+-+..++.+.+++.-|- -+.-.--|....+... .|+.--+.|+-+| +.++-.+| T Consensus 74 ~~~~~~~~~--~~~~K~~p~~~~l~~~~~~~~i~Gi-R~~Es~~R~~~~~~~~~~~~~~~~~~~PI~~W---t~~DVw~y 147 (174) T pfam01507 74 PSKLWEDCP--CRLRKVEPLKRALKKLDFDAWFTGL-RRDESPSRAKLPIVSIDGDFPKVIKVFPLLNW---TETDVWQY 147 (174) T ss_pred CHHHCCCCC--HHHHHHHHHHHHHHHCCCCEEEEEE-ECCCHHHHHHCCEEEEECCCCCEEEEECHHHC---CHHHHHHH T ss_conf 501145452--1466517999999865996899951-00244566407167430578885999500529---99999999 Q ss_pred HHHHCCCCC Q ss_conf 997089675 Q gi|254780234|r 162 AEKHAIPID 170 (404) Q Consensus 162 a~~~gIpv~ 170 (404) +++||||+. T Consensus 148 i~~~~lp~n 156 (174) T pfam01507 148 ILANNVPYN 156 (174) T ss_pred HHHCCCCCC T ss_conf 998399999 No 39 >TIGR00884 guaA_Cterm GMP synthase, C-terminal domain; InterPro: IPR001674 The amidotransferase family of enzymes utilises the ammonia derived from the hydrolysis of glutamine for a subsequent chemical reaction catalyzed by the same enzyme. The ammonia intermediate does not dissociate into solution during the chemical transformations . GMP synthetase is a glutamine amidotransferase from the de novo purine biosynthetic pathway. The C-terminal domain is specific to the GMP synthases 6.3.5.2 from EC. In prokaryotes this domain mediates dimerisation. Eukaryotic GMP synthases are monomers. This domain in eukaryotes includes several large insertions that may form globular domains .; GO: 0003922 GMP synthase (glutamine-hydrolyzing) activity, 0005524 ATP binding, 0006164 purine nucleotide biosynthetic process, 0006177 GMP biosynthetic process. Probab=98.43 E-value=1.1e-06 Score=68.27 Aligned_cols=182 Identities=22% Similarity=0.374 Sum_probs=121.1 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCC--CCHHHHHHHHHH-HCCCEEEEEECHHHHHHHHHHHHHHHCC Q ss_conf 7899991477259999999987189869999945787--110689999999-7398079982008999998799997368 Q gi|254780234|r 6 KKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQ--GEELKIASDKAR-LLGAKEVYVKDLRREFVRDFVFPMFRAN 82 (404) Q Consensus 6 kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq--~~d~~~~~~~A~-~~Ga~~~~v~D~r~ef~~~~i~~~I~an 82 (404) .||++|-|||+||||+..++..==|=..+|+.+|-|= ..|-+.+.+-.. .+|. ...++|++++|-+. +++= T Consensus 17 ~~vi~ALSGGVDSsV~A~L~hrAIGD~L~~vFVD~GLlR~gE~E~V~~~F~~~lg~-nl~~VDA~e~FL~~-----L~GV 90 (319) T TIGR00884 17 AKVIIALSGGVDSSVAAVLLHRAIGDRLTCVFVDHGLLRKGEAERVVKTFSDKLGL-NLVVVDAKERFLSA-----LKGV 90 (319) T ss_pred CEEEEEECCCHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHHHHCC-CCEEECCCHHHHHH-----CCCC T ss_conf 46899810881589999999864227604898227888766378999998753089-82787610798875-----2889 Q ss_pred CCCCCCCCCCCCHHHHHHHHHH----HHHHHHCC----CCEEECH--------HH---CC-CCHHHHHHHHHHHH--CCC Q ss_conf 6547874201301243359999----99988568----9287500--------11---16-82156799999985--879 Q gi|254780234|r 83 ALYEGYYLLGTAIARPLIAKYL----VDIANETG----ADAIAHG--------ST---GK-GNDQVRFELSAYSL--NSD 140 (404) Q Consensus 83 a~Yeg~Ypl~tslaRplia~~l----v~~a~~~g----a~~iaHG--------~T---gk-GNDQvRFe~~~~~l--~P~ 140 (404) .-=|. =|=+|.+.- =+.|++.+ +.++++| |. |+ +-+.+.-.=.+-=| .=. T Consensus 91 ~DPE~--------KRKIIG~~FI~VFE~~A~~~~~~~~a~yL~QGTlYPDvIES~~~kG~~~a~~IKsHHNVGGLP~~m~ 162 (319) T TIGR00884 91 TDPEE--------KRKIIGRVFIEVFEREAEKIGDKKKAEYLVQGTLYPDVIESASSKGTSGAATIKSHHNVGGLPEDMK 162 (319) T ss_pred CCCHH--------HCCHHHHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCE T ss_conf 87034--------1311225667778889985079977999973434474646023047764764563106688854260 Q ss_pred CEEEEEECCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCCHHHCCCCCCHHHCCCCC Q ss_conf 70774410068763799999999708967566567886415724311545763245845768201001465812279996 Q gi|254780234|r 141 IEIIAPWRHWSFKGRQDLIDFAEKHAIPIDKNKRGEAPFSIDTNLLHSSSEGRVLEDPSQPAPEYVYKMIVSPEEAPDTP 220 (404) Q Consensus 141 l~viaP~Rd~~~~sRe~~i~ya~~~gIpv~~~~~~~~~yS~D~Nlwg~S~Egg~Ledp~~~~pe~~~~~t~~p~~ap~~p 220 (404) ++++=|.|+ | --+|--.-+++-|+|-..-.+ +|.| -| T Consensus 163 l~LvEPLR~--L-fKDEVR~lG~~LGlP~ei~~R-------------------------~PFP------------GP--- 199 (319) T TIGR00884 163 LKLVEPLRE--L-FKDEVRKLGKELGLPEEIVWR-------------------------HPFP------------GP--- 199 (319) T ss_pred EEEECCCHH--C-CHHHHHHHHHHCCCCHHHHCC-------------------------CCCC------------CC--- T ss_conf 258754023--0-217899999871886799647-------------------------8368------------69--- Q ss_pred EEEEEEEECCEEEEECCEEC-CHHHHHHHHHHHH Q ss_conf 38999995024787516231-8999999998886 Q gi|254780234|r 221 TTIRIDFQRGDPIAINGQVM-SPEVLLEQLNQYG 253 (404) Q Consensus 221 e~v~I~Fe~G~PVainG~~~-~~~~li~~LN~ig 253 (404) |.-|-|=|+-. .-++++.++|.|. T Consensus 200 ---------GLAvRv~GEVt~e~l~i~R~AD~Iv 224 (319) T TIGR00884 200 ---------GLAVRVLGEVTKEKLEILREADAIV 224 (319) T ss_pred ---------CCEEEEECCCCHHHHHHHHHCCHHH T ss_conf ---------6227886166779999976306048 No 40 >cd01713 PAPS_reductase This domain is found in phosphoadenosine phosphosulphate (PAPS) reductase enzymes or PAPS sulphotransferase. PAPS reductase is part of the adenine nucleotide alpha hydrolases superfamily also including N type ATP PPases and ATP sulphurylases. A highly modified version of the P loop, the fingerprint peptide of mononucleotide-binding proteins, is present in the active site of the protein, which appears to be a positively charged cleft containing a number of conserved arginine and lysine residues. Although PAPS reductase has no ATPase activity, it shows a striking similarity to the structure of the ATP pyrophosphatase (ATP PPase) domain of GMP synthetase, indicating that both enzyme families have evolved from a common ancestral nucleotide-binding fold. The enzyme uses thioredoxin as an electron donor for the reduction of PAPS to phospho-adenosine-phosphate (PAP) . It is also found in NodP nodulation protein P from Rhizobium meliloti which has ATP sulphurylase acti Probab=98.43 E-value=1e-05 Score=61.41 Aligned_cols=150 Identities=20% Similarity=0.206 Sum_probs=92.5 Q ss_pred EEEEEECCCHHHHHHHHHHHHHCCC---EEEEEEEECCC--CCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHC Q ss_conf 8999914772599999999871898---69999945787--110689999999739807998200899999879999736 Q gi|254780234|r 7 KVVLAYSGGLDTSIILKWLQVEKGL---EVIVFIADLGQ--GEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRA 81 (404) Q Consensus 7 kVvlaySGGLDTSv~i~~L~~e~g~---eVi~~~~d~Gq--~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~a 81 (404) +|+++||||-||+|++.++.+- .. .+..+++|+|. ++-.+.+++-+..+|.. ..+.-....+... ..-. T Consensus 1 ~~~vsfSGGKDS~vll~L~~~~-~~~~~~~~vvf~DTg~e~pet~~~~~~~~~~~~~~-~~~~~~~~~~~~~-~~~~--- 74 (173) T cd01713 1 NVVVSFSGGKDSTVLLHLALKA-LPELKPVPVIFLDTGYEFPETYEFVDRVAERYGLP-LVVVRPPDSPAEG-LALG--- 74 (173) T ss_pred CEEEEECCHHHHHHHHHHHHHH-CCCCCCEEEEEECCCCCCHHHHHHHHHHHHHCCCE-EEEEECCCCHHHH-HHHH--- T ss_conf 9599964649999999999996-65558837999688898989999999999873990-7999689728999-9861--- Q ss_pred CCCCCCCCCCCCCHHH-HHHHH---HHHHHHHHCCCCEEECHHHCCCCHHHHHHHHH---HHHCCCCEEEEEECCCCCCC Q ss_conf 8654787420130124-33599---99999885689287500111682156799999---98587970774410068763 Q gi|254780234|r 82 NALYEGYYLLGTAIAR-PLIAK---YLVDIANETGADAIAHGSTGKGNDQVRFELSA---YSLNSDIEIIAPWRHWSFKG 154 (404) Q Consensus 82 na~Yeg~Ypl~tslaR-plia~---~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~---~~l~P~l~viaP~Rd~~~~s 154 (404) ...++....-.| ++... -+-++.++.+.+++.-|- -+.-...|..... ....|+.-.+.|+.+| + T Consensus 75 ----~~~~~~~~~~~~~c~~~~K~~P~~~~~~~~~~~~~~~Gi-R~~Es~~R~~~~~~~~~~~~~~~~~~~Pi~~W---t 146 (173) T cd01713 75 ----LKGFPLPSPDRRWCCRILKVEPLRRALKELGVVAWITGI-RRDESARRALLPVVWTDDGKGGILKVNPLLDW---T 146 (173) T ss_pred ----CCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEEEC-CCCCCHHHCCCCCCCCCCCCCCEEEEECHHHC---C T ss_conf ----357799832268898888078999999832982899941-21462444437623432688998999322859---9 Q ss_pred HHHHHHHHHHHCCCCC Q ss_conf 7999999997089675 Q gi|254780234|r 155 RQDLIDFAEKHAIPID 170 (404) Q Consensus 155 Re~~i~ya~~~gIpv~ 170 (404) .++-.+|+.+|+||+. T Consensus 147 ~~dVw~yi~~~~l~~~ 162 (173) T cd01713 147 YEDVWAYLARHGLPYN 162 (173) T ss_pred HHHHHHHHHHCCCCCC T ss_conf 9999999998399999 No 41 >PRK00074 guaA GMP synthase; Reviewed Probab=98.42 E-value=1.6e-06 Score=67.18 Aligned_cols=148 Identities=20% Similarity=0.348 Sum_probs=94.0 Q ss_pred CCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCC--CCHHHHHHHHH-HHCCCEEEEEECHHHHHHHHHHHHHHHC Q ss_conf 77899991477259999999987189869999945787--11068999999-9739807998200899999879999736 Q gi|254780234|r 5 VKKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQ--GEELKIASDKA-RLLGAKEVYVKDLRREFVRDFVFPMFRA 81 (404) Q Consensus 5 ~kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq--~~d~~~~~~~A-~~~Ga~~~~v~D~r~ef~~~~i~~~I~a 81 (404) .+||++|.|||+||||+..+|..--|-.++|+.+|.|= ..|.+.+.+.. ..+|. +..++|+++.|.+. ++. T Consensus 216 ~~kVi~~lSGGVDStV~A~Ll~kAIGd~l~cvfVD~GllRknE~~~V~~~~~~~lgl-~~~~vdA~~~Fl~~-----L~g 289 (513) T PRK00074 216 DKKVILGLSGGVDSSVAAVLLHKAIGDQLTCVFVDHGLLRKNEAEQVMEMFREHFGL-NLIHVDASDRFLSA-----LAG 289 (513) T ss_pred CCEEEEEECCCCCHHHHHHHHHHHHHHCEEEEEECCCCCCCCCHHHHHHHHHHHCCC-CEEEEEHHHHHHHH-----HCC T ss_conf 762899723883089999999998511248999537711167299999999886199-67992248999997-----379 Q ss_pred CCCCCCCCCCCCCHHHHHHHHHHHH----HHHHC-CCCEEECHH---------HCCCCHH-HHHHHHHHHH--CCCCEEE Q ss_conf 8654787420130124335999999----98856-892875001---------1168215-6799999985--8797077 Q gi|254780234|r 82 NALYEGYYLLGTAIARPLIAKYLVD----IANET-GADAIAHGS---------TGKGNDQ-VRFELSAYSL--NSDIEII 144 (404) Q Consensus 82 na~Yeg~Ypl~tslaRplia~~lv~----~a~~~-ga~~iaHG~---------TgkGNDQ-vRFe~~~~~l--~P~l~vi 144 (404) =.-=| --|-+|.+.-++ .|++. +++.+|+|- .+.|+.. +.-.-.+--| .-.++++ T Consensus 290 v~DPE--------~KRKIIG~~FI~vfe~~a~~~~~~~~L~QGTlYPDvIES~~~~~~a~~IKsHHNVgglp~~~~~~lv 361 (513) T PRK00074 290 VTDPE--------EKRKIIGREFIEVFEEEAKKLGGVKFLAQGTLYPDVIESASGTGKAATIKSHHNVGGLPEDMKLKLV 361 (513) T ss_pred CCCHH--------HHHHHHHHHHHHHHHHHHHHCCCCCEEECCCCCCCEEEECCCCCCCCCCCCCCCCCCCHHHCCCEEE T ss_conf 98979--------9678877899999999997579983996467655068847989998860487758788566487545 Q ss_pred EEECCCCCCCHHHHHHHHHHHCCCC Q ss_conf 4410068763799999999708967 Q gi|254780234|r 145 APWRHWSFKGRQDLIDFAEKHAIPI 169 (404) Q Consensus 145 aP~Rd~~~~sRe~~i~ya~~~gIpv 169 (404) -|.|+ | -.+|--+..++-|||- T Consensus 362 EPlr~--l-fKDEVR~lg~~Lglp~ 383 (513) T PRK00074 362 EPLRE--L-FKDEVRKLGLELGLPE 383 (513) T ss_pred EEHHH--H-CCHHHHHHHHHHCCCH T ss_conf 03175--6-1589999999968977 No 42 >TIGR02432 lysidine_TilS_N tRNA(Ile)-lysidine synthetase; InterPro: IPR012795 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . The only examples in which the wobble position of a tRNA must discriminate between G and A of mRNA are AUA (Ile) vs. AUG (Met) and UGA (stop) vs. UGG (Trp). In all bacteria, the wobble position of the tRNA(Ile) recognizing AUA is lysidine, a lysine derivative of cytidine. This domain is found, apparently, in all bacteria in a single copy. Eukaryotic sequences appear to be organellar. The domain architecture of this protein is variable; some, including characterised proteins of Escherichia coli and Bacillus subtilis known to be tRNA(Ile)-lysidine synthetase, include a conserved 50-residue domain that many other members lack. This protein belongs to the ATP-binding PP-loop family. It appears in the literature and protein databases as TilS, YacA, and putative cell cycle protein MesJ (a misnomer).; GO: 0000166 nucleotide binding, 0005524 ATP binding, 0016879 ligase activity forming carbon-nitrogen bonds, 0008033 tRNA processing, 0005737 cytoplasm. Probab=98.40 E-value=6.8e-06 Score=62.58 Aligned_cols=150 Identities=18% Similarity=0.287 Sum_probs=94.5 Q ss_pred EEEEEECCCHHHHHHHHHHHH---HCCCE-------EEEEEEECCC----CC-HHHHHHHHHHHCCCEEEEEEC-H--HH Q ss_conf 899991477259999999987---18986-------9999945787----11-068999999973980799820-0--89 Q gi|254780234|r 7 KVVLAYSGGLDTSIILKWLQV---EKGLE-------VIVFIADLGQ----GE-ELKIASDKARLLGAKEVYVKD-L--RR 68 (404) Q Consensus 7 kVvlaySGGLDTSv~i~~L~~---e~g~e-------Vi~~~~d~Gq----~~-d~~~~~~~A~~~Ga~~~~v~D-~--r~ 68 (404) |+++|+|||.||.+++..|.+ ..+-. ++++++|=|- .+ |.+.+++-|.+.|+ ++++.. . +. T Consensus 1 ~~lvAvSGG~DS~aLL~~L~~~~~~~~~~~~~~~~~~~a~h~nH~lR~~s~~~~~~~v~~~c~~~~~-~~~~~~~~~~~~ 79 (204) T TIGR02432 1 KILVAVSGGVDSMALLHLLLKLQPKLKISWQIKDFKLIAAHVNHGLRPESAQEEAEFVQQFCEKLNI-PLEIKKLVDVKA 79 (204) T ss_pred CEEEEEECCHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEEECCCCHHHHHHHHHHHHHHHHHCCC-CEEEEEEECCCC T ss_conf 9788862864279999999997663278778875068999961778841589999999999996189-569984212411 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCC-CCEEECHHHCCCCHHHH--HHHHHHHHCCC----C Q ss_conf 99998799997368654787420130124335999999988568-92875001116821567--99999985879----7 Q gi|254780234|r 69 EFVRDFVFPMFRANALYEGYYLLGTAIARPLIAKYLVDIANETG-ADAIAHGSTGKGNDQVR--FELSAYSLNSD----I 141 (404) Q Consensus 69 ef~~~~i~~~I~ana~Yeg~Ypl~tslaRplia~~lv~~a~~~g-a~~iaHG~TgkGNDQvR--Fe~~~~~l~P~----l 141 (404) . .... .--+. +-||-+==+.+-++|++.| +++|+-|= -.|||+= +-.-++--++. | T Consensus 80 ~-~~~~-~~~~E-------------~~AR~~RY~~f~~~~~~~~~~~~i~tAH--h~dDq~ET~L~rL~RG~~~~Gl~g~ 142 (204) T TIGR02432 80 L-AKGK-KKNLE-------------EAAREARYAFFEEIAKKHGKADYILTAH--HADDQAETILMRLLRGSGLRGLSGM 142 (204) T ss_pred C-CCCC-CCCHH-------------HHHHHHHHHHHHHHHHHCCCCEEEEECC--CCCCHHHHHHHHHHHCCCCCHHHHH T ss_conf 0-1224-76778-------------9999999999999999729940899724--8322799999987516870001110 Q ss_pred EEEEEE------------CC-CCCCCHHHHHHHHHHHCCCCCCCCCC Q ss_conf 077441------------00-68763799999999708967566567 Q gi|254780234|r 142 EIIAPW------------RH-WSFKGRQDLIDFAEKHAIPIDKNKRG 175 (404) Q Consensus 142 ~viaP~------------Rd-~~~~sRe~~i~ya~~~gIpv~~~~~~ 175 (404) ..+.|. |- ..+ +|+|..+||++++||.-...+| T Consensus 143 ~~~~~~~~~g~~~~~~~~RPLL~~-~k~ei~~y~~~~~l~~~eD~tN 188 (204) T TIGR02432 143 PEIRPLGSLGWYKGGQIIRPLLNI-SKSEIEEYLKENGLPYFEDETN 188 (204) T ss_pred CCCCCCCCCCCCCCEEEEECCCCC-CHHHHHHHHHHCCCCEEECCCC T ss_conf 133663324422563277077567-7899999999668965657888 No 43 >pfam01902 ATP_bind_4 ATP-binding region. This family of proteins probably binds ATP. This domain is about 200 amino acids long with a strongly conserved motif SGGKD at the N terminus.In some members of this family, this domain is associated with pfam01042. Probab=98.38 E-value=2.6e-05 Score=58.44 Aligned_cols=154 Identities=19% Similarity=0.272 Sum_probs=109.9 Q ss_pred EEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCC--------CCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHH Q ss_conf 899991477259999999987189869999945787--------110689999999739807998200899999879999 Q gi|254780234|r 7 KVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQ--------GEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPM 78 (404) Q Consensus 7 kVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq--------~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~ 78 (404) |++..||||-||++++.+.. ++ ++|+|+..=+-+ .-..+-++..|..+|. +.+....+.+ .++++ T Consensus 2 K~~~l~SGGKDS~~Al~~a~-~~-~~v~~L~t~~~~~~ds~~~H~~~~~l~~~qA~algi-Pl~~~~~~~~-~e~~~--- 74 (219) T pfam01902 2 KVAALYSGGKDSNYALYWAL-KE-IEVPYLVSMKSENKESYMFHEPNLHLTKLLAEALGI-PIIKLYTKGE-EEKEV--- 74 (219) T ss_pred CEEEEECCCHHHHHHHHHHH-HC-CCCEEEEEEEECCCCCCCCCCCCHHHHHHHHHHCCC-CEEEEECCCC-HHHHH--- T ss_conf 39999728699999999998-71-983799999637998102615788999999997599-6899967986-17999--- Q ss_pred HHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHH Q ss_conf 73686547874201301243359999999885689287500111682156799999985879707744100687637999 Q gi|254780234|r 79 FRANALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDL 158 (404) Q Consensus 79 I~ana~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~ 158 (404) +.+.+..++.|++.|.+|.--.-.-.-|.|..+..+ .++.++|. |+.. +++. T Consensus 75 -----------------------~~L~~~l~~~~i~~vv~GdI~s~~qr~~~e~~c~~l--gl~~~~PL--W~~d-~~~l 126 (219) T pfam01902 75 -----------------------EDLAGFLESLDVDALVAGAIYSEYQKSRIESVCREL--GLKPFAPL--WGRD-PRKL 126 (219) T ss_pred -----------------------HHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHHHHC--CCEEEECC--CCCC-HHHH T ss_conf -----------------------999999987598699998603688999999999972--98897103--4899-9999 Q ss_pred HHHHHHHCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCC Q ss_conf 999997089675665678864157243115457632458 Q gi|254780234|r 159 IDFAEKHAIPIDKNKRGEAPFSIDTNLLHSSSEGRVLED 197 (404) Q Consensus 159 i~ya~~~gIpv~~~~~~~~~yS~D~Nlwg~S~Egg~Led 197 (404) +.-.-+.|+.+-..+. ...-.|+...|+.+-...+++ T Consensus 127 l~e~i~~Gf~aiIv~v--~~~~L~~~~LGr~id~~~i~~ 163 (219) T pfam01902 127 AEEIVREGFEVAIVAV--SAEGLGESWLGRRFDRKNIDE 163 (219) T ss_pred HHHHHHCCCEEEEEEE--ECCCCCHHHCCCCCCHHHHHH T ss_conf 9999987994999998--548899789698837999999 No 44 >PRK13795 hypothetical protein; Provisional Probab=98.37 E-value=1.8e-05 Score=59.50 Aligned_cols=145 Identities=18% Similarity=0.198 Sum_probs=87.8 Q ss_pred CCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCC--CCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCC Q ss_conf 77899991477259999999987189869999945787--1106899999997398079982008999998799997368 Q gi|254780234|r 5 VKKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQ--GEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRAN 82 (404) Q Consensus 5 ~kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq--~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~an 82 (404) .+.|.++||||-||++++.+-++- +-++.++.+|+|= ++-++.+++-+.++|. ++++.+..+.|++..-.... T Consensus 245 ~~pv~VsfSGGKDS~v~L~La~ka-~~~~~~~f~dTglEfPeT~e~v~~~~~~~gi-~~~~~~a~~~fw~~~~~~Gp--- 319 (630) T PRK13795 245 NLPVVVSFSGGKDSLVVLDLAAEA-LRKFKAFFNNTGLEFPETVENVKEVAEEYGV-ELIVADAGDAFWRAIEKFGP--- 319 (630) T ss_pred CCCEEEECCCCHHHHHHHHHHHHH-CCCEEEEEECCCCCCHHHHHHHHHHHHHHCC-CEEEECCCHHHHHHHHHCCC--- T ss_conf 897599526868999999999985-4982899950776764699999999998698-18996563678888887499--- Q ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCC-EE-ECHHHCCCCHHHHHHHHHHHHCC---------CCEEEEEECCCC Q ss_conf 65478742013012433599999998856892-87-50011168215679999998587---------970774410068 Q gi|254780234|r 83 ALYEGYYLLGTAIARPLIAKYLVDIANETGAD-AI-AHGSTGKGNDQVRFELSAYSLNS---------DIEIIAPWRHWS 151 (404) Q Consensus 83 a~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~-~i-aHG~TgkGNDQvRFe~~~~~l~P---------~l~viaP~Rd~~ 151 (404) -... |=..|.. +--.-+-++-++..-. ++ --| |.++|..-++-.| .....+|+++|. T Consensus 320 P~rd--~RWCckv---~Kl~pl~~~i~~~~~~~~l~~~G-------~R~~ES~~R~~~~rv~~n~~i~~q~~a~PI~~Ws 387 (630) T PRK13795 320 PARD--YRWCCKV---CKLGPITRAIKSNFPQGCLSFVG-------QRKYESFARAKSPRVWRNPWVPNQIGAAPIQDWT 387 (630) T ss_pred CCCC--CCCCCCC---CCCHHHHHHHHHHCCCCEEEEEE-------CCHHHHHHHHCCCCCCCCCCCCCEEEEECHHCCC T ss_conf 9655--6435543---10057999999758996299985-------5246637660488614477877726872033173 Q ss_pred CCCHHHHHHHHHHHCCCC Q ss_conf 763799999999708967 Q gi|254780234|r 152 FKGRQDLIDFAEKHAIPI 169 (404) Q Consensus 152 ~~sRe~~i~ya~~~gIpv 169 (404) - -+..-|.-.++||. T Consensus 388 ~---~~VwLYi~~~~l~~ 402 (630) T PRK13795 388 A---LEVWLYIFSRKLPY 402 (630) T ss_pred H---HHHHHHHHHCCCCC T ss_conf 8---69999998848998 No 45 >TIGR03573 WbuX N-acetyl sugar amidotransferase. This enzyme has been implicated in the formation of the acetamido moiety (sugar-NC(=NH)CH3) which is found on some exopolysaccharides and is positively charged at neutral pH. The reaction involves ligation of ammonia with a sugar N-acetyl group, displacing water. In E. coli (O145 strain) and Pseudomonas aeruginosa (O12 strain) this gene is known as wbuX and ifnA respectively and likely acts on sialic acid. In Campylobacter jejuni, the gene is known as pseA and acts on pseudaminic acid in the process of flagellin glycosylation. In other Pseudomonas strains and various organisms it is unclear what the identity of the sugar substrate is, and in fact, the phylogenetic tree of this family sports a considerably deep branching suggestive of possible major differences in substrate structure. Nevertheless, the family is characterized by a conserved tetracysteine motif (CxxC.....[GN]xCxxC) possibly indicative of a metal binding site, as well as an Probab=98.35 E-value=1.5e-05 Score=60.22 Aligned_cols=106 Identities=19% Similarity=0.218 Sum_probs=75.4 Q ss_pred EEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCCCH--HHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCC Q ss_conf 99991477259999999987189869999945787110--6899999997398079982008999998799997368654 Q gi|254780234|r 8 VVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQGEE--LKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALY 85 (404) Q Consensus 8 VvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~d--~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Y 85 (404) +++.+|||.|||+.+.+|+.++|..+.|||.|-|...| ...++.-..++|.. ++.+-.--...++....++.. + T Consensus 62 Civ~vSGGkDS~y~~~~l~~~~gl~pL~vt~d~~~~t~~g~~Ni~~l~~~lgvD-~i~~~~n~~~~k~l~k~~~~~---~ 137 (343) T TIGR03573 62 CIIGVSGGKDSTYQAHVLKKKLGLNPLLVTVDPGWNTELGVKNLNNLIKKLGFD-LHTITINPETFRKLQRAYFKK---V 137 (343) T ss_pred EEEECCCCHHHHHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHHHHHHHHCCCC-EEEECCCHHHHHHHHHHHHHH---C T ss_conf 898688772899999999998299259998359877989999999999836998-587469999999999999986---6 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHH Q ss_conf 787420130124335999999988568928750011 Q gi|254780234|r 86 EGYYLLGTAIARPLIAKYLVDIANETGADAIAHGST 121 (404) Q Consensus 86 eg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~T 121 (404) .+.. +.+ -..|....+++|.+.++..|-.|-+ T Consensus 138 gd~~-~~~---~~~i~~~~~~iA~k~~IplIi~Gen 169 (343) T TIGR03573 138 GDPE-WPQ---DHAIFASVYQVALKFNIPLIIWGEN 169 (343) T ss_pred CCHH-HHH---HHHHHHHHHHHHHHCCCCEEEECCC T ss_conf 8946-999---9999999999999819998997567 No 46 >pfam00733 Asn_synthase Asparagine synthase. This family is always found associated with pfam00310. Members of this family catalyse the conversion of aspartate to asparagine. Probab=98.33 E-value=9.9e-06 Score=61.44 Aligned_cols=143 Identities=23% Similarity=0.306 Sum_probs=91.1 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCC--CHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCC Q ss_conf 78999914772599999999871898699999457871--1068999999973980799820089999987999973686 Q gi|254780234|r 6 KKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQG--EELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANA 83 (404) Q Consensus 6 kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~--~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana 83 (404) ..|.+++|||||||.++..+......++.||+++.... +|...+++.|..+|.. |..+++..+-..+.+...|.... T Consensus 18 vpig~~LSGGlDSs~ia~l~~~~~~~~i~~~s~~~~~~~~~E~~~a~~~a~~~~~~-~~~v~~~~~~~~~~~~~~i~~~~ 96 (195) T pfam00733 18 VPVGVLLSGGLDSSLIAALAARQSSPPLKTFSVGFEGSDYDEAPYAELVADHLGTD-HHEIIVTEEELLDALPEVIYHLE 96 (195) T ss_pred CCEEEECCCCHHHHHHHHHHHHHCCCCCEEEECCCCCCCCCHHHHHHHHHHHHCCC-CEEEEECHHHHHHHHHHHHHHHC T ss_conf 83675306746899999999985389953895578889975899999999651335-17877255999999888899862 Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHH----HHHHHHHHHCCCCEEEEEECCCCCCCHHHHH Q ss_conf 547874201301243359999999885689287500111682156----7999999858797077441006876379999 Q gi|254780234|r 84 LYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQV----RFELSAYSLNSDIEIIAPWRHWSFKGRQDLI 159 (404) Q Consensus 84 ~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQv----RFe~~~~~l~P~l~viaP~Rd~~~~sRe~~i 159 (404) .|.+.+..-|. -.+.+.+++ |+.++-- |-|-|.+ |-++.. ++-.+++-.|.-|.+ .+ T Consensus 97 -----~P~~~~~~~~~--~~l~k~~~~-~~kV~ls---G~GaDElf~GYr~Dr~~--m~~gvE~R~Pfld~~------lv 157 (195) T pfam00733 97 -----EPFGDSSAIPL--YLLSRLARK-GVKVVLS---GEGADELFGGYRDDRMS--MAHGLEVRVPFLDHR------LV 157 (195) T ss_pred -----CCCCCCCHHHH--HHHHHHHHC-CCEEEEE---EECHHHHHCCCCCCCHH--HHCCCEEECCCCCHH------HH T ss_conf -----98777517899--999999868-9849997---15688884798687012--225730556622379------99 Q ss_pred HHHHHHCCCCC Q ss_conf 99997089675 Q gi|254780234|r 160 DFAEKHAIPID 170 (404) Q Consensus 160 ~ya~~~gIpv~ 170 (404) +|+. +||.. T Consensus 158 ~~~~--~ip~~ 166 (195) T pfam00733 158 EFAL--SLPPE 166 (195) T ss_pred HHHH--HCCHH T ss_conf 9999--49999 No 47 >PRK13794 hypothetical protein; Provisional Probab=98.32 E-value=3.6e-05 Score=57.40 Aligned_cols=146 Identities=22% Similarity=0.198 Sum_probs=90.5 Q ss_pred CCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCC--CCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCC Q ss_conf 77899991477259999999987189869999945787--1106899999997398079982008999998799997368 Q gi|254780234|r 5 VKKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQ--GEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRAN 82 (404) Q Consensus 5 ~kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq--~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~an 82 (404) .+.|.++||||-||++++.+.++..+-++.++.+|+|= ++-++.+++-+.++|. +.++.+..+ |++..-.. +- T Consensus 245 ~~pv~VSfSGGKDS~v~L~La~ka~~~~~~vvF~DTglEfPeT~e~ve~v~~~~gv-~ii~~~~~~-fw~~~~~~---Gp 319 (473) T PRK13794 245 GKPPSVAYSGGKDSLATLLLALKAFGNEFPVLFADTGLEFPETLENVEDVEELYGL-EIIRTSSGE-FWEKLEEY---GP 319 (473) T ss_pred CCCEEEEECCCHHHHHHHHHHHHHCCCCEEEEEEECCCCCHHHHHHHHHHHHHHCC-CEEEECCCC-HHHHHHHC---CC T ss_conf 99839970686899999999998608971699960888767699999999998598-789915644-77888861---99 Q ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHHHHC--CCCEEECHHHCCCCHHHHHHHHHHHHCCC---------CEEEEEECCCC Q ss_conf 65478742013012433599999998856--89287500111682156799999985879---------70774410068 Q gi|254780234|r 83 ALYEGYYLLGTAIARPLIAKYLVDIANET--GADAIAHGSTGKGNDQVRFELSAYSLNSD---------IEIIAPWRHWS 151 (404) Q Consensus 83 a~Yeg~Ypl~tslaRplia~~lv~~a~~~--ga~~iaHG~TgkGNDQvRFe~~~~~l~P~---------l~viaP~Rd~~ 151 (404) .. ..|=.+|.. +--..+-++-++. +-...-.| |.++|..-++-.|. .-..+|+++|. T Consensus 320 P~--rd~RWCc~v---~Kl~pl~~~l~~~~~~~~l~~vG-------~R~~ES~~Rak~prv~~n~~i~~qi~~~PI~~Wt 387 (473) T PRK13794 320 PA--RDNRWCSEV---CKLSPLGKLIDEKYEGECLSFVG-------QRKYESFNRSKKPRIWRNPTIKKQMLAAPILNWT 387 (473) T ss_pred CC--CCCCCCCHH---HHHHHHHHHHHHHCCCCCEEEEE-------CCHHHHHHHHCCCCCCCCCCCCCCEEEECHHCCH T ss_conf 97--666021322---63789999999748997189985-------2168768750487412378876715761431051 Q ss_pred CCCHHHHHHHHHHHCCCCC Q ss_conf 7637999999997089675 Q gi|254780234|r 152 FKGRQDLIDFAEKHAIPID 170 (404) Q Consensus 152 ~~sRe~~i~ya~~~gIpv~ 170 (404) - -+..-|.-.+++|.. T Consensus 388 a---l~VwLYI~~~~lp~N 403 (473) T PRK13794 388 A---MHVWIYLFREKAPYN 403 (473) T ss_pred H---HHHHHHHHHCCCCCC T ss_conf 8---799999987589988 No 48 >cd01991 Asn_Synthase_B_C The C-terminal domain of Asparagine Synthase B. This domain is always found associated n-terminal amidotransferase domain. Family members that contain this domain catalyse the conversion of aspartate to asparagine. Asparagine synthetase B catalyzes the assembly of asparagine from aspartate, Mg(2+)ATP, and glutamine. The three-dimensional architecture of the N-terminal domain of asparagine synthetase B is similar to that observed for glutamine phosphoribosylpyrophosphate amidotransferase while the molecular motif of the C-domain is reminiscent to that observed for GMP synthetase . Probab=98.30 E-value=1.3e-05 Score=60.64 Aligned_cols=112 Identities=21% Similarity=0.218 Sum_probs=75.4 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCC--CHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCC Q ss_conf 78999914772599999999871898699999457871--1068999999973980799820089999987999973686 Q gi|254780234|r 6 KKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQG--EELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANA 83 (404) Q Consensus 6 kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~--~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana 83 (404) .+|.+..|||||||.++..+.....-.+.+++++.+.. +|...+++.|..+|.. +.++.....-..+.+...+... T Consensus 16 ~~vg~~LSGGlDSs~ia~~~~~~~~~~~~~~s~~~~~~~~de~~~a~~va~~~~~~-~~~i~~~~~~~~~~~~~~~~~~- 93 (269) T cd01991 16 VPVGVLLSGGLDSSLVAALAARLLPEPVKTFSIGFGFEGSDEREYARRVAEHLGTE-HHEVEFTPADLLAALPDVIWEL- 93 (269) T ss_pred CCEEEEECCHHHHHHHHHHHHHHCCCCCCEEEEECCCCCCCHHHHHHHHHHHCCCC-CEEEECCHHHHHHHHHHHHHHH- T ss_conf 85686513569999999999983689984799862899973789999999980988-2453143488999999999986- Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHH Q ss_conf 547874201301243359999999885689287500111682156 Q gi|254780234|r 84 LYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQV 128 (404) Q Consensus 84 ~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQv 128 (404) + -|...+..-+ -..+.+.|++.|+.++-.| -|-|++ T Consensus 94 --~--~p~~~~~~~~--~~~l~~~a~~~g~~VllsG---~GgDEl 129 (269) T cd01991 94 --D--EPFADSSAIP--LYLLSRLARKHGIKVVLSG---EGADEL 129 (269) T ss_pred --C--CCCHHHHHHH--HHHHHHHHHHCCCEEEEEC---CCHHHH T ss_conf --3--9601254789--9999999985397799967---761464 No 49 >PRK01269 thiamine biosynthesis protein ThiI; Provisional Probab=98.29 E-value=9.9e-05 Score=54.26 Aligned_cols=148 Identities=18% Similarity=0.310 Sum_probs=99.3 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHH----CCC---EEEEEECHHHHHHHHHHHHH Q ss_conf 78999914772599999999871898699999457871106899999997----398---07998200899999879999 Q gi|254780234|r 6 KKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQGEELKIASDKARL----LGA---KEVYVKDLRREFVRDFVFPM 78 (404) Q Consensus 6 kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~----~Ga---~~~~v~D~r~ef~~~~i~~~ 78 (404) .||++..|||.||.|+ .||....|++|+.++.++|.+.....+++.|.. .|. .+++++|+.+ ...+.. . T Consensus 178 GkvL~LlSGGiDSPVA-a~~mmKRG~~v~~l~F~lg~~~~e~~V~~va~~L~~~~~~~~~vr~~~V~f~~-v~~eI~-~- 253 (483) T PRK01269 178 EDVLSLISGGFDSGVA-SYMLMRRGSRVHYCFFNLGGAAHEIGVKQVAHYLWNRYGSSHRVRFISVDFEP-VVGEIL-E- 253 (483) T ss_pred CCEEEEECCCCCCHHH-HHHHHHCCCEEEEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCHH-HHHHHH-H- T ss_conf 8689995278883888-99998459879999957998688999999999999973888870899976799-999998-7- Q ss_pred HHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECH-HHCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHH Q ss_conf 73686547874201301243359999999885689287500-11168215679999998587970774410068763799 Q gi|254780234|r 79 FRANALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHG-STGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQD 157 (404) Q Consensus 79 I~ana~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG-~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~ 157 (404) ++ ...|.. + +-|-.+-+..-++|++.||+++..| |-|-=-.|-==++..-.-+-++.|+-|.- .+. .+| T Consensus 254 -~v----~~~~~~-v-v~KR~M~R~A~~iA~~~g~~ALVTGESLGQVASQTL~NL~~i~~~~~~pVlRPLI--g~D-K~e 323 (483) T PRK01269 254 -KV----DNGQMG-V-VLKRMMLRAASKVAERYGIQALVTGEALGQVSSQTLTNLRLIDNVTDTLILRPLI--AMD-KED 323 (483) T ss_pred -HC----CHHHEE-H-HHHHHHHHHHHHHHHHCCCCEEEECCCHHHHHHHHHHHHHHHHHHCCCCEECCCC--CCC-HHH T ss_conf -38----763443-9-9999999999999998599889855503456776787889988740774004766--798-899 Q ss_pred HHHHHHHHCC Q ss_conf 9999997089 Q gi|254780234|r 158 LIDFAEKHAI 167 (404) Q Consensus 158 ~i~ya~~~gI 167 (404) -|+.|++=|- T Consensus 324 II~~Ar~IGT 333 (483) T PRK01269 324 IIDLARQIGT 333 (483) T ss_pred HHHHHHHHCH T ss_conf 9999998392 No 50 >COG2117 Predicted subunit of tRNA(5-methylaminomethyl-2-thiouridylate) methyltransferase, contains the PP-loop ATPase domain [Translation, ribosomal structure and biogenesis] Probab=98.26 E-value=5.8e-06 Score=63.10 Aligned_cols=129 Identities=28% Similarity=0.352 Sum_probs=93.3 Q ss_pred EEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCC Q ss_conf 89999147725999999998718986999994578711068999999973980799820089999987999973686547 Q gi|254780234|r 7 KVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYE 86 (404) Q Consensus 7 kVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Ye 86 (404) +|.+.||||-|||.+ +|+.+.-||+|.-+|++.|--+++.-+++-|..+|- +|.|+-+-.+..++-+--.| | T Consensus 2 ~v~vLfSGGKDSSLa-A~iL~klgyev~LVTvnFGv~d~~k~A~~tA~~lgF-~h~vl~Ldr~ile~A~em~i------e 73 (198) T COG2117 2 DVYVLFSGGKDSSLA-ALILDKLGYEVELVTVNFGVLDSWKYARETAAILGF-PHEVLQLDREILEDAVEMII------E 73 (198) T ss_pred CEEEEECCCCCHHHH-HHHHHHHCCCCEEEEEEECCCCCHHHHHHHHHHHCC-CCCEECCCHHHHHHHHHHHH------H T ss_conf 568984489710589-999987178717999873346640668999998299-70000168899999999998------5 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHH--HHHHHC--CCCEEEEEEC Q ss_conf 8742013012433599999998856892875001116821567999--999858--7970774410 Q gi|254780234|r 87 GYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQVRFEL--SAYSLN--SDIEIIAPWR 148 (404) Q Consensus 87 g~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~--~~~~l~--P~l~viaP~R 148 (404) ..||-..- .-|-+..+|+.-....+.||.| | +-.|.|-|=. -+++|- -++..+.|.- T Consensus 74 dg~P~~aI---q~iH~~alE~~A~r~~~~iaDG-T-RRDDrvP~ls~~~~qSLEdR~nv~Yi~PL~ 134 (198) T COG2117 74 DGYPRNAI---QYIHEMALEALASREVDRIADG-T-RRDDRVPKLSRSEAQSLEDRLNVQYIRPLL 134 (198) T ss_pred CCCCCHHH---HHHHHHHHHHHHHHHHHHHCCC-C-CCCCCCCCCCHHHHHHHHHHCCCEEECCCC T ss_conf 38985088---9999999999987778887578-7-445667642498875188751864553033 No 51 >COG2102 Predicted ATPases of PP-loop superfamily [General function prediction only] Probab=98.25 E-value=8e-05 Score=54.90 Aligned_cols=153 Identities=22% Similarity=0.264 Sum_probs=104.0 Q ss_pred EEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECC--------CCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHH Q ss_conf 89999147725999999998718986999994578--------7110689999999739807998200899999879999 Q gi|254780234|r 7 KVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLG--------QGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPM 78 (404) Q Consensus 7 kVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~G--------q~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~ 78 (404) |++..||||-||++++.|.. +.|.||+++..=.- |..+.+.++..|...|. +.+..+.+.++ ++.+ T Consensus 2 k~~aL~SGGKDS~~Al~~a~-~~G~eV~~Ll~~~p~~~dS~m~H~~n~~~~~~~Ae~~gi-~l~~~~~~g~~-e~ev--- 75 (223) T COG2102 2 KVIALYSGGKDSFYALYLAL-EEGHEVVYLLTVKPENGDSYMFHTPNLELAELQAEAMGI-PLVTFDTSGEE-EREV--- 75 (223) T ss_pred CEEEEEECCCHHHHHHHHHH-HCCCEEEEEEEEECCCCCEEEEECCCHHHHHHHHHHCCC-CEEEEECCCCC-HHHH--- T ss_conf 68999816718899999999-759836899998159987044222256799999874488-35998347630-5569--- Q ss_pred HHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHH Q ss_conf 73686547874201301243359999999885689287500111682156799999985879707744100687637999 Q gi|254780234|r 79 FRANALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDL 158 (404) Q Consensus 79 I~ana~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~ 158 (404) +.+.++-+.+++++|.+|+-..---.-|.|+-+.-++ +++++|. |... -++. T Consensus 76 -----------------------e~L~~~l~~l~~d~iv~GaI~s~yqk~rve~lc~~lG--l~~~~PL--Wg~d-~~el 127 (223) T COG2102 76 -----------------------EELKEALRRLKVDGIVAGAIASEYQKERVERLCEELG--LKVYAPL--WGRD-PEEL 127 (223) T ss_pred -----------------------HHHHHHHHHCCCCEEEECHHHHHHHHHHHHHHHHHHC--CEEEECC--CCCC-HHHH T ss_conf -----------------------9999999737633799734330999999999999729--7885125--1789-8999 Q ss_pred HHHHHHHCCCCCCCCCCCCCCCCCHHHHCCCCCCCCC Q ss_conf 9999970896756656788641572431154576324 Q gi|254780234|r 159 IDFAEKHAIPIDKNKRGEAPFSIDTNLLHSSSEGRVL 195 (404) Q Consensus 159 i~ya~~~gIpv~~~~~~~~~yS~D~Nlwg~S~Egg~L 195 (404) +.-.-+.|+.+-..+. +.+--|+.+.|+++--..+ T Consensus 128 l~e~~~~Gf~~~Iv~V--sa~gL~~~~lGr~i~~~~~ 162 (223) T COG2102 128 LEEMVEAGFEAIIVAV--SAEGLDESWLGRRIDREFL 162 (223) T ss_pred HHHHHHCCCEEEEEEE--ECCCCCHHHHCCCCCHHHH T ss_conf 9999973985999998--4147996795880589999 No 52 >COG0175 CysH 3'-phosphoadenosine 5'-phosphosulfate sulfotransferase (PAPS reductase)/FAD synthetase and related enzymes [Amino acid transport and metabolism / Coenzyme metabolism] Probab=98.24 E-value=1.3e-05 Score=60.69 Aligned_cols=150 Identities=19% Similarity=0.230 Sum_probs=102.8 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCC--CCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCC Q ss_conf 7899991477259999999987189869999945787--11068999999973980799820089999987999973686 Q gi|254780234|r 6 KKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQ--GEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANA 83 (404) Q Consensus 6 kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq--~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana 83 (404) ..++++||||.||+|++.... +-..++-.+++|+|. ++-++-+.+-+.+.|. ...+......+.+. + T Consensus 40 ~~~~~~~S~Gkds~V~l~L~~-k~~~~~~vif~DTg~~f~Et~~~~d~~~~~~~~-~l~~~~~~~~~~~~---------~ 108 (261) T COG0175 40 NPVVVSFSGGKDSTVLLHLAA-KAFPDFPVIFLDTGYHFPETYEFRDRLAEEYGL-DLKVYRPDDEVAEG---------E 108 (261) T ss_pred CCEEEEECCCHHHHHHHHHHH-HHCCCCCEEEEECCCCCHHHHHHHHHHHHHCCC-EEEEECCCHHHHHH---------H T ss_conf 975998127467999999998-602587189970777578899999999987297-17981673234434---------4 Q ss_pred CCCCCCCCCCCHHH-HHHHHHHH---HHHHHCCCCEEECHHH---CCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHH Q ss_conf 54787420130124-33599999---9988568928750011---16821567999999858797077441006876379 Q gi|254780234|r 84 LYEGYYLLGTAIAR-PLIAKYLV---DIANETGADAIAHGST---GKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQ 156 (404) Q Consensus 84 ~Yeg~Ypl~tslaR-plia~~lv---~~a~~~ga~~iaHG~T---gkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe 156 (404) .|-+.++..+ -.| ++...++. ++-++.+.+++--|.- ++.--+.||.. .....++..-+.|+.+|. .. T Consensus 109 ~~~~~~~~~~-~~r~c~~i~K~~pl~~al~~~~~~a~~~G~Rrdes~~Rak~~~~~-~~~~~~~~~rv~Pl~~Wt---~~ 183 (261) T COG0175 109 KYGGKLWEPS-VERWCCDIRKVEPLKRALDEYGFDAWFTGLRRDESPTRAKLPVVS-FDSEFGESIRVNPLADWT---EL 183 (261) T ss_pred HCCCCCCCCC-CCHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCEEC-CCCCCCCCEEECCHHCCC---HH T ss_conf 2136679997-402332568551399999635862499861204563202486453-467876836871532099---99 Q ss_pred HHHHHHHHHCCCCCC Q ss_conf 999999970896756 Q gi|254780234|r 157 DLIDFAEKHAIPIDK 171 (404) Q Consensus 157 ~~i~ya~~~gIpv~~ 171 (404) +-..|...++||+.- T Consensus 184 dVw~Yi~~~~lp~np 198 (261) T COG0175 184 DVWLYILANNLPYNP 198 (261) T ss_pred HHHHHHHHHCCCCCC T ss_conf 999999995899884 No 53 >cd01996 Alpha_ANH_like_III This is a subfamily of Adenine nucleotide alpha hydrolases superfamily.Adeninosine nucleotide alpha hydrolases superfamily includes N type ATP PPases and ATP sulphurylases. It forms a apha/beta/apha fold which binds to Adenosine group. This subfamily of proteins is predicted to bind ATP. This domain has a strongly conserved motif SGGKD at the N terminus. Probab=98.21 E-value=4.4e-05 Score=56.77 Aligned_cols=106 Identities=22% Similarity=0.276 Sum_probs=69.8 Q ss_pred EEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCCC--HHHHHHHHHHHCCCE-EEEEECHHHHHHHHHHHHHHHCCC Q ss_conf 89999147725999999998718986999994578711--068999999973980-799820089999987999973686 Q gi|254780234|r 7 KVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQGE--ELKIASDKARLLGAK-EVYVKDLRREFVRDFVFPMFRANA 83 (404) Q Consensus 7 kVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~--d~~~~~~~A~~~Ga~-~~~v~D~r~ef~~~~i~~~I~ana 83 (404) -+++++|||.|||+++..|++.+|..+.|||.|-|... ..+.+++-..+ |.. .+++.|-... ...-......+ T Consensus 3 DcIVpvSGGKDS~y~~~~lk~kyglnpL~Vt~~~~~~t~~g~~Nl~nl~~~-g~D~~~~~~~~~~~--~~l~~~~~~~~- 78 (154) T cd01996 3 DCIIGVSGGKDSSYALYLLKEKYGLNPLAVTVDNGFNSEEAVKNIKNLIKK-GLDLDHLVINPEEM--KDLQLARFKAK- 78 (154) T ss_pred CEEEECCCCHHHHHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHHHHHHHC-CCCEEEEECCHHHH--HHHHHHHHHHC- T ss_conf 789968888189999999999949965999967987546899999999974-99816993288899--99999999844- Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHH Q ss_conf 54787420130124335999999988568928750011 Q gi|254780234|r 84 LYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGST 121 (404) Q Consensus 84 ~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~T 121 (404) +.+.++. .-..|....+++|.+.+...|.-|.. T Consensus 79 -~gd~~~~----~~~~i~~~~~~~A~k~~IplIi~Gen 111 (154) T cd01996 79 -VGDPCWP----CDTAIFTSLYKVALKFGIPLIITGEN 111 (154) T ss_pred -CCCCHHH----HHHHHHHHHHHHHHHCCCCEEEECCC T ss_conf -7993488----99879989999999939998997278 No 54 >COG0301 ThiI Thiamine biosynthesis ATP pyrophosphatase [Coenzyme metabolism] Probab=98.19 E-value=0.00026 Score=51.20 Aligned_cols=149 Identities=23% Similarity=0.350 Sum_probs=98.6 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCC-CCH--HHHHHHHH-HHCC----CEEEEEECHHHHHHHHHHHH Q ss_conf 7899991477259999999987189869999945787-110--68999999-9739----80799820089999987999 Q gi|254780234|r 6 KKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQ-GEE--LKIASDKA-RLLG----AKEVYVKDLRREFVRDFVFP 77 (404) Q Consensus 6 kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq-~~d--~~~~~~~A-~~~G----a~~~~v~D~r~ef~~~~i~~ 77 (404) .|+++..|||.||.|+ .||....|.+|+.++.+.+- ..+ .+.....+ ..+. -...+++|+.+-. ++.. - T Consensus 176 Gk~l~LlSGGIDSPVA-~~l~mkRG~~v~~v~f~~~p~~~~~a~~k~~~l~~~~~~~~~~~~~~~~v~f~~v~-~~i~-~ 252 (383) T COG0301 176 GKVLLLLSGGIDSPVA-AWLMMKRGVEVIPVHFGNPPYTSEKAREKVVALALLRLTSYGGKVRLYVVPFTEVQ-EEIL-E 252 (383) T ss_pred CCEEEEEECCCCHHHH-HHHHHHCCCEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECHHHHH-HHHH-H T ss_conf 7489997078774999-99998569877999972799851889999999986430246872289997629999-9998-6 Q ss_pred HHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECH-HHCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHH Q ss_conf 973686547874201301243359999999885689287500-1116821567999999858797077441006876379 Q gi|254780234|r 78 MFRANALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHG-STGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQ 156 (404) Q Consensus 78 ~I~ana~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG-~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe 156 (404) .. . ++ |. +-+-|-.+-+..-++|++.||.+|..| +-|-=..|-==.+..-....+..|+-|.- .|. -+ T Consensus 253 ~~--~---~~-y~--~v~~rR~M~riA~~iae~~g~~aIvtGEsLGQVASQTl~nL~~i~~~t~~pIlRPLI--~~D-K~ 321 (383) T COG0301 253 KV--P---ES-YR--CVLLKRMMYRIAEKLAEEFGAKAIVTGESLGQVASQTLENLRVIDSVTNTPVLRPLI--GLD-KE 321 (383) T ss_pred HC--C---CC-CE--EHHHHHHHHHHHHHHHHHHCCEEEEECCCCHHHHHHHHHHHHHHHHCCCCCEECCCC--CCC-HH T ss_conf 36--7---53-02--047899999999999998398399836861046676788899998155874231445--799-89 Q ss_pred HHHHHHHHHCCC Q ss_conf 999999970896 Q gi|254780234|r 157 DLIDFAEKHAIP 168 (404) Q Consensus 157 ~~i~ya~~~gIp 168 (404) |-++.|++-|-. T Consensus 322 eIi~~Ar~IgT~ 333 (383) T COG0301 322 EIIEIARRIGTY 333 (383) T ss_pred HHHHHHHHHCCH T ss_conf 999999982974 No 55 >PRK08557 hypothetical protein; Provisional Probab=98.17 E-value=8.7e-05 Score=54.66 Aligned_cols=143 Identities=15% Similarity=0.123 Sum_probs=83.3 Q ss_pred EEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCC--CCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCC Q ss_conf 899991477259999999987189869999945787--110689999999739807998200899999879999736865 Q gi|254780234|r 7 KVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQ--GEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANAL 84 (404) Q Consensus 7 kVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq--~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~ 84 (404) -|.++||||-||++++.+.+ +-+-++.++.+|+|- ++-.+.+++-+..+|. +.++++. +.|++..-.. +=.. T Consensus 185 pV~VSfSGGKDS~a~L~La~-~a~~~~~vvF~DTGlEfPeT~eyve~~~~~~~~-~l~~~~~-~~fw~~~~~~---GpP~ 258 (420) T PRK08557 185 AINASFSGGKDSAVSTLLSK-EVIPDLDVVFIDTGLEYPETLNYVKDFAKKYDI-NLDTVDG-DNFWDNLEKE---GIPT 258 (420) T ss_pred CEEEEECCCHHHHHHHHHHH-HHCCCCEEEEECCCCCCHHHHHHHHHHHHHCCC-CEEEECC-HHHHHHHHHC---CCCC T ss_conf 46996068789999999999-854881699962887772389999999998099-6799654-6689887754---9997 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHC--CCC-EEECHHHCCCCHHHHHHHHHHHH------C---CCCEEEEEECCCCC Q ss_conf 478742013012433599999998856--892-87500111682156799999985------8---79707744100687 Q gi|254780234|r 85 YEGYYLLGTAIARPLIAKYLVDIANET--GAD-AIAHGSTGKGNDQVRFELSAYSL------N---SDIEIIAPWRHWSF 152 (404) Q Consensus 85 Yeg~Ypl~tslaRplia~~lv~~a~~~--ga~-~iaHG~TgkGNDQvRFe~~~~~l------~---P~l~viaP~Rd~~~ 152 (404) .. |=.+|.. +--.-+-+.-++. +-. ..-+| |.++|..-++. + |.-...+|+++|.- T Consensus 259 rd--~RWCc~v---~Kl~Pl~~~i~~~~~~~~vl~~~G-------~R~~ES~~R~~~~~~~~~~~v~~q~~~~PI~~Wsa 326 (420) T PRK08557 259 KD--NRWCNSA---CKLMPLKRYLKKKYGNKKVLTIDG-------SRKYESFTRANLDYERKSGFIDFQTNVFPILDWNS 326 (420) T ss_pred CC--CCCCCCC---HHHHHHHHHHHHHCCCCCEEEEEE-------ECHHHHHHHHCCCCCCCCCCCCCCEEEEHHHCCCH T ss_conf 66--7250743---101789999998578987599997-------01454076605874134777666503531532607 Q ss_pred CCHHHHHHHHHHHCCCCC Q ss_conf 637999999997089675 Q gi|254780234|r 153 KGRQDLIDFAEKHAIPID 170 (404) Q Consensus 153 ~sRe~~i~ya~~~gIpv~ 170 (404) -+-.-|.-.+|+|.. T Consensus 327 ---~~VwLYI~~~~l~~N 341 (420) T PRK08557 327 ---LDIWSYIFSNDILYN 341 (420) T ss_pred ---HHHHHHHHHCCCCCC T ss_conf ---699999997599998 No 56 >TIGR01536 asn_synth_AEB asparagine synthase (glutamine-hydrolyzing); InterPro: IPR006426 These sequences represent glutamine-hydrolysing asparagine synthase. The group have a poorly conserved C-terminal extension while bacterial members of the family tend to have a long, poorly conserved insert lacking from archaeal and eukaryotic sequences. Multiple isozymes have been demonstrated, such as in Bacillus subtilis .; GO: 0004066 asparagine synthase (glutamine-hydrolyzing) activity, 0006529 asparagine biosynthetic process. Probab=98.13 E-value=5.9e-05 Score=55.86 Aligned_cols=107 Identities=21% Similarity=0.252 Sum_probs=77.0 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHCC-------CEEEEEEEECC-CCCHHH-----------HHHHHHHHCCCEEEEEECH Q ss_conf 7899991477259999999987189-------86999994578-711068-----------9999999739807998200 Q gi|254780234|r 6 KKVVLAYSGGLDTSIILKWLQVEKG-------LEVIVFIADLG-QGEELK-----------IASDKARLLGAKEVYVKDL 66 (404) Q Consensus 6 kkVvlaySGGLDTSv~i~~L~~e~g-------~eVi~~~~d~G-q~~d~~-----------~~~~~A~~~Ga~~~~v~D~ 66 (404) ..|+...|||||||.++..++..-. ..|.+|++... +..+++ .+++.|..+|.. |+.+.+ T Consensus 312 vpvG~~LSGGlDSS~vaa~a~~~~~~~~~~W~~~~~tFsigf~~~~~~~~Es~l~P~~D~P~A~~vA~~~G~~-h~~~~~ 390 (646) T TIGR01536 312 VPVGVLLSGGLDSSLVAAIARREAPSEGVDWKGPVKTFSIGFEGDSKDFDESKLRPDDDQPYARKVADELGTE-HHEVLI 390 (646) T ss_pred CCEEEEEECHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCEECCCCCCCCHHHHHHHHHHHCCC-CEEEEE T ss_conf 6068887171679999999999767643311774347886517787653002578998618999999984983-248984 Q ss_pred H-HHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHC-CCCEEECH Q ss_conf 8-99999879999736865478742013012433599999998856-89287500 Q gi|254780234|r 67 R-REFVRDFVFPMFRANALYEGYYLLGTAIARPLIAKYLVDIANET-GADAIAHG 119 (404) Q Consensus 67 r-~ef~~~~i~~~I~ana~Yeg~Ypl~tslaRplia~~lv~~a~~~-ga~~iaHG 119 (404) - +|++ +.+-..|..--.|+ |.+.+.+=|+ =.+.+.|++. |..++--| T Consensus 391 s~~e~~-~~l~~v~~~~e~y~---p~~~~~~~P~--Yl~s~~ar~~Tg~kV~LSG 439 (646) T TIGR01536 391 SEEEVL-KALPEVIYHLETYD---PTAIRASIPL--YLLSKLAREDTGVKVVLSG 439 (646) T ss_pred CHHHHH-HHHHHHHHHHCCCC---CCCCCHHHHH--HHHHHHHHHCCCCEEEECC T ss_conf 589999-99888765424885---4303112358--9999999962883899748 No 57 >PRK08576 hypothetical protein; Provisional Probab=98.12 E-value=0.00017 Score=52.51 Aligned_cols=139 Identities=18% Similarity=0.229 Sum_probs=91.6 Q ss_pred EEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCC--CCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCC Q ss_conf 899991477259999999987189869999945787--110689999999739807998200899999879999736865 Q gi|254780234|r 7 KVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQ--GEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANAL 84 (404) Q Consensus 7 kVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq--~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~ 84 (404) .|++.||||-||++++..-++-.+ +|-++.+|+|- ++-.+.+++-+.++|. +.+..|.-..| + |....+ T Consensus 237 ~iiVp~SGGKDStA~LlLA~ea~~-dv~aVfvDTGlEfPeT~eyVe~va~klGv-~lv~a~~d~~~------~-i~~~G~ 307 (439) T PRK08576 237 TIIVPWSGGKDSTAALLLALKAFD-EVTAVYVDTGYEMPLTDEYVEKVAEKLGV-DLIRAGVDVPM------P-IEGYGL 307 (439) T ss_pred CEEEECCCCHHHHHHHHHHHHHHC-CEEEEEEECCCCCHHHHHHHHHHHHHHCC-EEEEECCCCCC------H-HCCCCC T ss_conf 889956887799999999998618-83699970798657589999999998597-69994788642------2-104799 Q ss_pred --CCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCC----------CCEEEEEECCCCC Q ss_conf --4787420130124335999999988568928750011168215679999998587----------9707744100687 Q gi|254780234|r 85 --YEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNS----------DIEIIAPWRHWSF 152 (404) Q Consensus 85 --Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P----------~l~viaP~Rd~~~ 152 (404) .+++ .+|.+ --+.+-++.++..-..+.- .|.++|..-+.+.| +.-.++|+.+|+ T Consensus 308 Pt~d~R--WCtk~----KL~PL~~~i~~~~~~l~~v-------G~R~~ES~~R~~~~~v~~rn~~vg~~~~~~PIk~Ws- 373 (439) T PRK08576 308 PTHENR--WCTKM----KVEALYEAIRELSNPLLVV-------GDRDGESARRRLRPPVRERETDFGKILVVMPLKFWS- 373 (439) T ss_pred CCCCCC--CCCCC----CCHHHHHHHHHHCCCEEEE-------CCCHHHHHHHHCCCCEEECCCCCCCCEEEECHHHCC- T ss_conf 976663--31020----0255899999648976995-------272565477611896355047777726672343076- Q ss_pred CCHHHHHHHHHHHCCCCC Q ss_conf 637999999997089675 Q gi|254780234|r 153 KGRQDLIDFAEKHAIPID 170 (404) Q Consensus 153 ~sRe~~i~ya~~~gIpv~ 170 (404) ..+-..|.-.+|||.. T Consensus 374 --a~~VwLYI~~~gip~N 389 (439) T PRK08576 374 --GAMVQLYILMNGIELN 389 (439) T ss_pred --HHHHHHHHHHCCCCCC T ss_conf --9999999997599998 No 58 >PRK01565 thiamine biosynthesis protein ThiI; Provisional Probab=98.12 E-value=0.00036 Score=50.20 Aligned_cols=148 Identities=20% Similarity=0.309 Sum_probs=98.9 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCC---CCHHHHHHHHHH---H-CCCEEEEEECHHHHHHHHHHHHH Q ss_conf 7899991477259999999987189869999945787---110689999999---7-39807998200899999879999 Q gi|254780234|r 6 KKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQ---GEELKIASDKAR---L-LGAKEVYVKDLRREFVRDFVFPM 78 (404) Q Consensus 6 kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq---~~d~~~~~~~A~---~-~Ga~~~~v~D~r~ef~~~~i~~~ 78 (404) .|+++..|||.||.|+ .|+....|++|+.++.+.+- .+-.+.+.+-|. . .+-.+.|++|+.+ +.. . T Consensus 177 Gk~l~LlSGGIDSPVA-a~~mmkRG~~v~~v~f~~~p~t~~~a~~kv~~l~~~L~~y~~~~kl~~v~f~~-~~~-----~ 249 (399) T PRK01565 177 GKALLLLSGGIDSPVA-GYLAMKRGVEIEAVHFHSPPYTSERAKEKVIDLARILAKYGGRIKLHVVPFTE-IQE-----E 249 (399) T ss_pred CCEEEEECCCCCHHHH-HHHHHHCCCEEEEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEECCHH-HHH-----H T ss_conf 8579986178753999-99998559879999986899877899999999999999728885599966399-999-----9 Q ss_pred HHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECH-HHCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHH Q ss_conf 73686547874201301243359999999885689287500-11168215679999998587970774410068763799 Q gi|254780234|r 79 FRANALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHG-STGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQD 157 (404) Q Consensus 79 I~ana~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG-~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~ 157 (404) |..+. ...| .+-+-|-++-+..-.+|++.||.++..| |-|-=-.|-==.+..-.-+-++.|+-|.- .|. -+| T Consensus 250 i~~~~--~~~~--~~vl~RR~M~riA~~iA~~~~~~alvTGESLGQVASQTl~Nl~~i~~~~~~pVlRPLI--g~D-K~E 322 (399) T PRK01565 250 IKKKV--PESY--LMTLMRRFMMRIADKIAEKRGALAIVTGESLGQVASQTLESMQAINAVTNLPVLRPLI--GMD-KEE 322 (399) T ss_pred HHHCC--CCCE--EEHHHHHHHHHHHHHHHHHCCCCEEEECCHHHHHHHHHHHHHHHHHHHCCCCCCCCCC--CCC-HHH T ss_conf 98608--8003--8599999999999999998599789824135567675567788887521775446655--789-899 Q ss_pred HHHHHHHHCC Q ss_conf 9999997089 Q gi|254780234|r 158 LIDFAEKHAI 167 (404) Q Consensus 158 ~i~ya~~~gI 167 (404) -|+.|++-|- T Consensus 323 II~~Ar~IGT 332 (399) T PRK01565 323 IIEIAKEIGT 332 (399) T ss_pred HHHHHHHCCC T ss_conf 9999987096 No 59 >PRK09431 asnB asparagine synthetase B; Provisional Probab=98.05 E-value=7.3e-05 Score=55.18 Aligned_cols=113 Identities=20% Similarity=0.229 Sum_probs=70.7 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHCC-------------CEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHH-HHHH Q ss_conf 7899991477259999999987189-------------8699999457871106899999997398079982008-9999 Q gi|254780234|r 6 KKVVLAYSGGLDTSIILKWLQVEKG-------------LEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLR-REFV 71 (404) Q Consensus 6 kkVvlaySGGLDTSv~i~~L~~e~g-------------~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r-~ef~ 71 (404) -.|.+..|||||||.++...+.... .++.+|++-.-..+|+..+++.|..+|.. |+.+.+. +++. T Consensus 228 vpvG~~LSGGLDSSlIaala~k~~~~~~~~~~~~~~~~~~l~tFsiG~~~s~D~~~Ar~vA~~lgt~-h~ei~~t~~d~~ 306 (555) T PRK09431 228 VPYGVLLSGGLDSSLISAIAKKYAARRIEDDERSEAWWPQLHSFAVGLEGSPDLKAAREVADYLGTV-HHEIHFTVQEGL 306 (555) T ss_pred CCEEEEECCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCEEEECCCCCCHHHHHHHHHHHHCCC-CEEEEECHHHHH T ss_conf 8624320577067999999998520244332222234678753660479981499999999982995-659995479999 Q ss_pred HHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHH Q ss_conf 987999973686547874201301243359999999885689287500111682156 Q gi|254780234|r 72 RDFVFPMFRANALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQV 128 (404) Q Consensus 72 ~~~i~~~I~ana~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQv 128 (404) +.+-..|..-..|+- .....++.-.+ +.+.+++.|+.+|-- |-|-|.+ T Consensus 307 -~~l~~vi~~lE~~d~-~~~~asip~yl----lsk~i~~~gvKVvLS---GeGADEL 354 (555) T PRK09431 307 -DALRDVIYHLETYDV-TTIRASTPMYL----MSRKIKAMGIKMVLS---GEGADEL 354 (555) T ss_pred -HHHHHHHHHHHCCCC-CCHHHHHHHHH----HHHHHHHCCCEEEEE---CCCHHHH T ss_conf -999999987414687-50256899999----999999639879992---7750441 No 60 >COG0367 AsnB Asparagine synthase (glutamine-hydrolyzing) [Amino acid transport and metabolism] Probab=98.04 E-value=0.00014 Score=53.12 Aligned_cols=106 Identities=25% Similarity=0.283 Sum_probs=73.0 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHCCCEEEE-EEEECCCCC--HHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCC Q ss_conf 789999147725999999998718986999-994578711--06899999997398079982008999998799997368 Q gi|254780234|r 6 KKVVLAYSGGLDTSIILKWLQVEKGLEVIV-FIADLGQGE--ELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRAN 82 (404) Q Consensus 6 kkVvlaySGGLDTSv~i~~L~~e~g~eVi~-~~~d~Gq~~--d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~an 82 (404) -.|++.+|||||||++...+...-.....+ |+++....+ |.+.+++.|..+|...+.++=..+|+. +.+...+++- T Consensus 231 vpvg~~lSGGlDSS~Iaa~a~~~~~~~~~~~fsvg~~~~~~~D~~~a~~~A~~lg~~h~~~~~~~~e~~-~~~~~vv~~~ 309 (542) T COG0367 231 VPVGVFLSGGLDSSLIAAIAAEELGKEGKTTFTVGFEDSDSPDAKYARAVAKFLGTPHHEIILTNEELL-NALPEVVKAL 309 (542) T ss_pred CCEEEEECCCHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCHHHHHHHHHHHHCCCCEEEEECHHHHH-HHHHHHHHHH T ss_conf 878999677622999999999862345521589854899970799999999985998579961789999-9999999870 Q ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECH Q ss_conf 6547874201301243359999999885689287500 Q gi|254780234|r 83 ALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHG 119 (404) Q Consensus 83 a~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG 119 (404) - +|...+.+=|+ -.+.+.|++.|..++--| T Consensus 310 ~-----~p~~~~~~~pl--y~~~~~a~~~g~kVvLSG 339 (542) T COG0367 310 D-----TPGGMAASIPL--YLLSRKARAEGEKVVLSG 339 (542) T ss_pred C-----CCCCCHHHHHH--HHHHHHHHHCCCEEEECC T ss_conf 7-----88620045799--999987653387898238 No 61 >PRK13980 NAD synthetase; Provisional Probab=97.98 E-value=0.00042 Score=49.78 Aligned_cols=146 Identities=23% Similarity=0.302 Sum_probs=95.5 Q ss_pred CCCEEEEEECCCHHHHHHHHHHHHHCCC-EEEEEEEE-CCC-CCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHH Q ss_conf 7778999914772599999999871898-69999945-787-11068999999973980799820089999987999973 Q gi|254780234|r 4 DVKKVVLAYSGGLDTSIILKWLQVEKGL-EVIVFIAD-LGQ-GEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFR 80 (404) Q Consensus 4 ~~kkVvlaySGGLDTSv~i~~L~~e~g~-eVi~~~~d-~Gq-~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ 80 (404) ..|++||..|||+||++++....+--|. .|+++++- -.+ +++.+++++-|..+|+ ++.++|...-| +.|. ..+. T Consensus 27 g~kg~VlGlSGGIDSavva~La~~Alg~~~v~~v~mP~~~ss~~s~~dA~~la~~lgi-~~~~i~I~~~~-~~~~-~~~~ 103 (264) T PRK13980 27 GFKGVVLGLSGGIDSAVVAYLAVKALGKENVLALLMPYRVSPPEDLEDALLVAERLGI-EYKVIEITPIV-DAFF-SAVP 103 (264) T ss_pred CCCEEEEECCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCC-CEEEEECHHHH-HHHH-HHCC T ss_conf 9980999798688899999999985496606899898999988789999999998699-84998279999-9999-8631 Q ss_pred CCCCCCCCCCCCCCHHH-HHHHH----HHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEE-EEEECCCCCCC Q ss_conf 68654787420130124-33599----99999885689287500111682156799999985879707-74410068763 Q gi|254780234|r 81 ANALYEGYYLLGTAIAR-PLIAK----YLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEI-IAPWRHWSFKG 154 (404) Q Consensus 81 ana~Yeg~Ypl~tslaR-plia~----~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~v-iaP~Rd~~~~s 154 (404) . ...+++ -+.|+ .+-.+|.+.|.=++. -||.-. +-+.+-++.-|..+ ++|+-| + . T Consensus 104 ~----------~~~~~~~NiqaR~Rm~~Ly~~An~~~~lVlg-----TgNksE-~~~Gy~TkyGD~~~d~~Pi~d--L-~ 164 (264) T PRK13980 104 D----------ADRLRRGNIMARTRMVLLYDYANRDNRLVLG-----TSNKSE-LLLGYFTKYGDGAVDINPIGD--L-Y 164 (264) T ss_pred C----------CCHHHHHHHHHHHHHHHHHHHHHHCCCEEEC-----CCCHHH-HHHHHCCCCCCCCCCHHHHCC--C-C T ss_conf 0----------0066888758989999999986433977655-----886747-987210016876656245258--7-3 Q ss_pred HHHHHHHHHHHCCCCCC Q ss_conf 79999999970896756 Q gi|254780234|r 155 RQDLIDFAEKHAIPIDK 171 (404) Q Consensus 155 Re~~i~ya~~~gIpv~~ 171 (404) ..+--+.|+.-|||-.. T Consensus 165 Kt~V~~La~~l~vP~~I 181 (264) T PRK13980 165 KTQVRELARHLGVPEDI 181 (264) T ss_pred HHHHHHHHHHHCCCHHH T ss_conf 99999999993996644 No 62 >COG1365 Predicted ATPase (PP-loop superfamily) [General function prediction only] Probab=97.96 E-value=0.0001 Score=54.20 Aligned_cols=135 Identities=27% Similarity=0.425 Sum_probs=83.4 Q ss_pred CCCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCC---CCHH-HHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHH Q ss_conf 777899991477259999999987189869999945787---1106-899999997398079982008999998799997 Q gi|254780234|r 4 DVKKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQ---GEEL-KIASDKARLLGAKEVYVKDLRREFVRDFVFPMF 79 (404) Q Consensus 4 ~~kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq---~~d~-~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I 79 (404) ++-|+++|||||.|||..+.-|+ .-| +++|-|. ++.+ ...++-+..+|.+ -+|+++-. .-| T Consensus 59 d~~kiaVA~SGG~DSsas~iilR-~~g-----~~v~p~t~~Lp~~ir~n~~~l~~~lg~~--------p~yveedl-~~i 123 (255) T COG1365 59 DKPKIAVAYSGGVDSSASAIILR-WAG-----FTVDPGTAILPDHIRRNKEELETLLGEV--------PEYVEEDL-EDI 123 (255) T ss_pred CCCEEEEEECCCCCHHHHHHHHH-HHC-----EEECCCCCCCCHHHHHHHHHHHHHHCCC--------HHHHHHHH-HHH T ss_conf 88438998427864278899988-621-----0414632138778867899998997257--------77899899-999 Q ss_pred HCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECH---HHCCCC-----HHHHHHHHHHHHCCCCEEEEEECCCC Q ss_conf 3686547874201301243359999999885689287500---111682-----15679999998587970774410068 Q gi|254780234|r 80 RANALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHG---STGKGN-----DQVRFELSAYSLNSDIEIIAPWRHWS 151 (404) Q Consensus 80 ~ana~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG---~TgkGN-----DQvRFe~~~~~l~P~l~viaP~Rd~~ 151 (404) .-+|+- |+++-+ .-.--+|-+.+.+.|++.++++||.| +||.|- +.+||++-. -.. T Consensus 124 ~kGaln-GRfhpC-GRCh~~I~~~V~~k~re~di~~vafGDlLs~G~~svy~eD~i~rlnlPA--------------flA 187 (255) T COG1365 124 EKGALN-GRFHPC-GRCHSMIENAVMDKARELDIDVVAFGDLLSTGYGSVYREDGIFRLNLPA--------------FLA 187 (255) T ss_pred HHHHCC-CCCCCC-CHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCEECCCCEEEECCHH--------------HHH T ss_conf 863205-877875-2378999999999998538708997463234663212058879972588--------------885 Q ss_pred CCCHHHHHHHHHHHCCCCC Q ss_conf 7637999999997089675 Q gi|254780234|r 152 FKGRQDLIDFAEKHAIPID 170 (404) Q Consensus 152 ~~sRe~~i~ya~~~gIpv~ 170 (404) + +..|+.+-++.+|+... T Consensus 188 l-tK~Elr~il~~~~~e~~ 205 (255) T COG1365 188 L-TKDELRSILKWNGYELE 205 (255) T ss_pred H-CCHHHHHHHHHCCCCCH T ss_conf 1-71999999871384300 No 63 >cd00553 NAD_synthase NAD+ synthase is a homodimer, which catalyzes the final step in de novo nicotinamide adenine dinucleotide (NAD+) biosynthesis, an amide transfer from either ammonia or glutamine to nicotinic acid adenine dinucleotide (NaAD). The conversion of NaAD to NAD+ occurs via an NAD-adenylate intermediate and requires ATP and Mg2+. The intemediate is subsequently cleaved into NAD+ and AMP. In many prokaryotes, such as E. coli , NAD synthetase consists of a single domain and is strictly ammonia dependent. In contrast, eukaryotes and other prokaryotes have an additional N-terminal amidohydrolase domain that prefer glutamine, Interestingly, NAD+ synthases in these prokaryotes, can also utilize ammonia as an amide source . Probab=97.96 E-value=0.00064 Score=48.44 Aligned_cols=151 Identities=21% Similarity=0.238 Sum_probs=95.9 Q ss_pred CCCEEEEEECCCHHHHHHHHHHHHHCC-CEEEEEEEECCC--CCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHH Q ss_conf 777899991477259999999987189-869999945787--11068999999973980799820089999987999973 Q gi|254780234|r 4 DVKKVVLAYSGGLDTSIILKWLQVEKG-LEVIVFIADLGQ--GEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFR 80 (404) Q Consensus 4 ~~kkVvlaySGGLDTSv~i~~L~~e~g-~eVi~~~~d~Gq--~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ 80 (404) ..+++||+-|||+||++++....+--| ..|+++.+-... +++.+++++-|..+|. ++.++|....+ +. ....+. T Consensus 22 g~~~~viGlSGGIDSav~a~La~~Alg~~~v~~v~mP~~~ss~~s~~~A~~la~~lgi-~~~~i~I~~~~-~~-~~~~~~ 98 (248) T cd00553 22 GFKGVVLGLSGGIDSALVAALAVRALGRENVLALFMPSRYSSEETREDAKELAEALGI-EHVNIDIDPAV-EA-FLALLG 98 (248) T ss_pred CCCCEEEECCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHCC-CEEEECHHHHH-HH-HHHHHH T ss_conf 9991999588889999999999997288759998899978987589999999999589-16871559999-99-999864 Q ss_pred CCCCCCCCCCCCCCHHHH-HHH----HHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCC-CEEEEEECCCCCCC Q ss_conf 686547874201301243-359----999999885689287500111682156799999985879-70774410068763 Q gi|254780234|r 81 ANALYEGYYLLGTAIARP-LIA----KYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSD-IEIIAPWRHWSFKG 154 (404) Q Consensus 81 ana~Yeg~Ypl~tslaRp-lia----~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~-l~viaP~Rd~~~~s 154 (404) ... + +....+++- +.+ ..+-.+|.+.|.=++. -||.-..+=. +-++.-| ..=++|+.+ + . T Consensus 99 ~~~---~--~~~~~~~~~NiqaRlRm~~Ly~~An~~~~lVlg-----TgNksE~~~G-y~TkyGD~~~d~~Pi~~--L-~ 164 (248) T cd00553 99 ESG---G--SELEDLALGNIQARLRMVILYALANKLGGLVLG-----TGNKSELLLG-YFTKYGDGAADINPIGD--L-Y 164 (248) T ss_pred HHC---C--CCHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEC-----CCCHHHHHHC-CCEECCCCCCCCCCCCC--C-C T ss_conf 404---8--705567874248788899999999744998962-----8746078757-21000787758221368--8-4 Q ss_pred HHHHHHHHHHHCCCCCC Q ss_conf 79999999970896756 Q gi|254780234|r 155 RQDLIDFAEKHAIPIDK 171 (404) Q Consensus 155 Re~~i~ya~~~gIpv~~ 171 (404) ..+-.+.|+..|||-.. T Consensus 165 Kt~V~~la~~l~vp~~I 181 (248) T cd00553 165 KTQVRELARYLGVPESI 181 (248) T ss_pred HHHHHHHHHHHCCHHHH T ss_conf 99999999996888987 No 64 >PRK13981 NAD synthetase; Provisional Probab=97.95 E-value=0.0016 Score=45.65 Aligned_cols=149 Identities=21% Similarity=0.217 Sum_probs=92.1 Q ss_pred CCCEEEEEECCCHHHHHHHHHHHHHCCC-EEEEEEEEC-CC-CCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHH---HH Q ss_conf 7778999914772599999999871898-699999457-87-110689999999739807998200899999879---99 Q gi|254780234|r 4 DVKKVVLAYSGGLDTSIILKWLQVEKGL-EVIVFIADL-GQ-GEELKIASDKARLLGAKEVYVKDLRREFVRDFV---FP 77 (404) Q Consensus 4 ~~kkVvlaySGGLDTSv~i~~L~~e~g~-eVi~~~~d~-Gq-~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i---~~ 77 (404) ..+||||.-|||+||++++..-.+--|. .|+++++-- +. ....+++++-|..+|+ ++..++..+.| +.+. -| T Consensus 280 gf~~vviGLSGGiDSaL~aaiA~dALG~~nV~~v~MPs~~tS~~s~~dA~~La~~LGi-~~~~i~I~~~~-~~~~~~l~~ 357 (543) T PRK13981 280 GFPGVVLGLSGGIDSALVAAIAVDALGAENVRAVMMPSRYTSDESLDDAAALARNLGV-RYDIIPIEPAF-EAFEAALAP 357 (543) T ss_pred CCCEEEEECCCCHHHHHHHHHHHHHHCHHHEEEEECCCCCCCHHHHHHHHHHHHHHCC-CEEEEEHHHHH-HHHHHHHHH T ss_conf 8974999767883599999999985383437886458777866569999999999599-74875149999-999997404 Q ss_pred HHHC---CCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCC-CCEEEEEECCCCCC Q ss_conf 9736---8654787420130124335999999988568928750011168215679999998587-97077441006876 Q gi|254780234|r 78 MFRA---NALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNS-DIEIIAPWRHWSFK 153 (404) Q Consensus 78 ~I~a---na~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P-~l~viaP~Rd~~~~ 153 (404) .+.. +..|| +.-|-.|-. -|...|.+.|+=++..| |.-. .-+.+-+++- .+.-++|.-|..-+ T Consensus 358 ~f~~~~~dvt~E----NiQAR~R~~---iLm~laN~~g~lvl~Tg-----nkSE-~avGy~TlyGD~~g~~avi~dv~KT 424 (543) T PRK13981 358 LFAGTEPDITEE----NLQSRIRGT---LLMALSNKFGSLVLTTG-----NKSE-MAVGYATLYGDMAGGFAPIKDVYKT 424 (543) T ss_pred HHCCCCCCCCHH----HHHHHHHHH---HHHHHHHCCCCEEECCC-----CHHH-HHHCHHHHCCCCCCCCCCCCCCCHH T ss_conf 416876665223----454888999---99998733896796478-----6727-8747465337655674303686599 Q ss_pred CHHHHHHHHHHHCC Q ss_conf 37999999997089 Q gi|254780234|r 154 GRQDLIDFAEKHAI 167 (404) Q Consensus 154 sRe~~i~ya~~~gI 167 (404) ---+...|..++.- T Consensus 425 ~V~~L~r~~n~~~~ 438 (543) T PRK13981 425 LVYRLCRWRNTNHP 438 (543) T ss_pred HHHHHHHHHHHHCC T ss_conf 99999999865364 No 65 >PTZ00077 asparagine synthetase; Provisional Probab=97.90 E-value=0.00032 Score=50.56 Aligned_cols=112 Identities=21% Similarity=0.271 Sum_probs=73.0 Q ss_pred EEEEEECCCHHHHHHHHHHHHHCC-----------------------------------------CEEEEEEEECCCCCH Q ss_conf 899991477259999999987189-----------------------------------------869999945787110 Q gi|254780234|r 7 KVVLAYSGGLDTSIILKWLQVEKG-----------------------------------------LEVIVFIADLGQGEE 45 (404) Q Consensus 7 kVvlaySGGLDTSv~i~~L~~e~g-----------------------------------------~eVi~~~~d~Gq~~d 45 (404) .|.+..|||||||.+...++..-. ..+.+|++-+-...| T Consensus 239 PvG~lLSGGLDSSlIaAia~k~l~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~l~tFsIGl~gspD 318 (610) T PTZ00077 239 PFGILLSGGLDSSIIAAILAKHLKIIDGEGGGGIQGGGAAAAHNAAHEANNNNGEDNCGNNNADPCHLKSFSIGLKGSPD 318 (610) T ss_pred CEEEEECCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCH T ss_conf 23666347705899999999865420111222222443210001111124444322123566674442268862589824 Q ss_pred HHHHHHHHHHCCCEEEEEECHH-HHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCC Q ss_conf 6899999997398079982008-999998799997368654787420130124335999999988568928750011168 Q gi|254780234|r 46 LKIASDKARLLGAKEVYVKDLR-REFVRDFVFPMFRANALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKG 124 (404) Q Consensus 46 ~~~~~~~A~~~Ga~~~~v~D~r-~ef~~~~i~~~I~ana~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkG 124 (404) +..+++.|..+|.. |+.+.+. +++. +.+...|..-..|.=. ....|+.-.++ -+.+++.|..+|. ||-| T Consensus 319 l~~Ar~vA~~lGt~-Hhe~~~t~ee~l-dal~dvIy~lEtyd~t-tirAs~pmYLl----sr~ik~~gvKVvL---SGEG 388 (610) T PTZ00077 319 LKAAKEVAEFLGIE-HTEFHFTVEEGI-DSLHDVIYHIETYDIT-TIRASTPMYIL----SRLIKSSCVKMVL---SGEG 388 (610) T ss_pred HHHHHHHHHHHCCC-CEEEEECHHHHH-HHHHHHHHHCCCCCCC-CCCHHHHHHHH----HHHHHHCCCEEEE---CCCC T ss_conf 89999999983883-459997779998-7757665212566755-42103589999----9999864988995---3887 Q ss_pred CHHH Q ss_conf 2156 Q gi|254780234|r 125 NDQV 128 (404) Q Consensus 125 NDQv 128 (404) -|.+ T Consensus 389 ADEl 392 (610) T PTZ00077 389 ADEI 392 (610) T ss_pred CHHH T ss_conf 4465 No 66 >pfam02540 NAD_synthase NAD synthase. NAD synthase (EC:6.3.5.1) is involved in the de novo synthesis of NAD and is induced by stress factors such as heat shock and glucose limitation. Probab=97.90 E-value=0.00029 Score=50.88 Aligned_cols=152 Identities=24% Similarity=0.305 Sum_probs=93.0 Q ss_pred CCCEEEEEECCCHHHHHHHHHHHHHCCC-EEEEEEEECC-C-CCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHH Q ss_conf 7778999914772599999999871898-6999994578-7-11068999999973980799820089999987999973 Q gi|254780234|r 4 DVKKVVLAYSGGLDTSIILKWLQVEKGL-EVIVFIADLG-Q-GEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFR 80 (404) Q Consensus 4 ~~kkVvlaySGGLDTSv~i~~L~~e~g~-eVi~~~~d~G-q-~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ 80 (404) ..++||++.|||+||+|++....+--|. .|+++..--- + +++.++++.-|..+|+ ++.++|..+-| +.+. ..+. T Consensus 17 g~~gvViGlSGGIDSav~a~La~~AlG~~~v~~v~mP~~~ss~~s~~~A~~la~~lgi-~~~~i~I~~~~-~~~~-~~~~ 93 (243) T pfam02540 17 GAKGVVLGLSGGIDSAVVAYLAVKALGKENVLALIMPSINSSEEDVQDALALAENLGI-NYKTIDIKPIV-RAFS-QLFQ 93 (243) T ss_pred CCCEEEEECCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHCC-CEEEEECHHHH-HHHH-HHHC T ss_conf 9991999788879999999999995597428999647778988899999999999689-26999888999-9999-8853 Q ss_pred CCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEE-EEEECCCCCCCHHHHH Q ss_conf 686547874201301243359999999885689287500111682156799999985879707-7441006876379999 Q gi|254780234|r 81 ANALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEI-IAPWRHWSFKGRQDLI 159 (404) Q Consensus 81 ana~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~v-iaP~Rd~~~~sRe~~i 159 (404) ... ...-.+---+|-=. -.+..+|.+.|. +.- |-||.-. +-+.+-++.-|... ++|+.+ + ...+-- T Consensus 94 ~~~---~~~a~~Ni~aR~Rm-~~ly~~A~~~~~--lVl---gT~NksE-~~~Gy~TkyGD~~~di~Pi~d--L-~KteV~ 160 (243) T pfam02540 94 PAK---DDLAKGNLKARIRM-IILYAHANKFNR--LVL---GTGNKSE-LALGYFTKYGDGACDIAPIGD--L-YKTQVY 160 (243) T ss_pred CCH---HHHHHHHHHHHHHH-HHHHHHHCCCCE--EEE---ECCCCCC-EEEEEEEECCCCCCCHHHHCC--C-CHHHHH T ss_conf 201---37787455788889-999986233791--898---1687042-005432312777655154168--8-189999 Q ss_pred HHHHHHCCCCCC Q ss_conf 999970896756 Q gi|254780234|r 160 DFAEKHAIPIDK 171 (404) Q Consensus 160 ~ya~~~gIpv~~ 171 (404) +.|+..|||-.. T Consensus 161 ~la~~l~vP~~i 172 (243) T pfam02540 161 ELAKRLNVPERI 172 (243) T ss_pred HHHHHHCCCHHH T ss_conf 999994988889 No 67 >TIGR00289 TIGR00289 conserved hypothetical protein TIGR00289; InterPro: IPR005237 This family of conserved hypothetical proteins has no known function. Homologous proteins related to MJ0570 of Methanococcus jannaschii include both the apparent orthologs in the family, the much longer protein YLR143W from Saccharomyces cerevisiae (Baker's yeast), and second homologous proteins from Archaeoglobus fulgidus and Pyrococcus horikoshii that appear to represent a second orthologous group.. Probab=97.85 E-value=0.00017 Score=52.57 Aligned_cols=110 Identities=25% Similarity=0.392 Sum_probs=80.4 Q ss_pred EEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCC-CCH--------HHHHHHHHHHCCC--EEEEEECHHHHHHHHHH Q ss_conf 899991477259999999987189869999945787-110--------6899999997398--07998200899999879 Q gi|254780234|r 7 KVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQ-GEE--------LKIASDKARLLGA--KEVYVKDLRREFVRDFV 75 (404) Q Consensus 7 kVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq-~~d--------~~~~~~~A~~~Ga--~~~~v~D~r~ef~~~~i 75 (404) ||.+.||||-||..+|.|-. +.|.||. +.|++=. .+| +.-..-.|..+|. .+.|+-=.+++=++|. T Consensus 2 kvAvLySGGKDS~~ALy~al-~eG~eV~-~LV~v~seN~eSyMfH~pN~Hl~~l~AEavGiPL~klyt~GeeEkEVedL- 78 (227) T TIGR00289 2 KVAVLYSGGKDSILALYKAL-EEGFEVK-YLVGVISENEESYMFHVPNVHLTDLVAEAVGIPLIKLYTSGEEEKEVEDL- 78 (227) T ss_pred EEEEEECCCHHHHHHHHHHH-HCCCCEE-EEEEEECCCCCCCEEECHHHHHHHHHHHHCCCCHHHHHCCCCCCCHHHHH- T ss_conf 06898518713689999998-6288547-85100028898620534257899999976176625630278652117666- Q ss_pred HHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEEEEE Q ss_conf 999736865478742013012433599999998856892875001116821567999999858797077441 Q gi|254780234|r 76 FPMFRANALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEIIAPW 147 (404) Q Consensus 76 ~~~I~ana~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~viaP~ 147 (404) ++.++.-+++|+++|.-|+-..-=-.-|||..++-|+ |+-+||. T Consensus 79 --------------------------ag~l~~lderGvea~~~Ga~~S~YQK~Rid~vCrelG--lks~APL 122 (227) T TIGR00289 79 --------------------------AGQLEELDERGVEAVVIGAIESEYQKSRIDKVCRELG--LKSIAPL 122 (227) T ss_pred --------------------------HHHHHHHHHCCCCEEEECCEECCHHHHHHHHHHHHHC--CHHCCCC T ss_conf --------------------------2155445530545478853410112335778877714--1002760 No 68 >cd01984 AANH_like Adenine nucleotide alpha hydrolases superfamily including N type ATP PPases, ATP sulphurylases Universal Stress Response protein and electron transfer flavoprotein (ETF). The domain forms a apha/beta/apha fold which binds to Adenosine nucleotide. Probab=97.74 E-value=7.3e-05 Score=55.22 Aligned_cols=66 Identities=30% Similarity=0.468 Sum_probs=56.9 Q ss_pred EEEEECCCHHHHHHHHHHHHH--CCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCC Q ss_conf 999914772599999999871--898699999457871106899999997398079982008999998799997368654 Q gi|254780234|r 8 VVLAYSGGLDTSIILKWLQVE--KGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALY 85 (404) Q Consensus 8 VvlaySGGLDTSv~i~~L~~e--~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Y 85 (404) |++++|||+||+++++|++.. .+.+++++..+ T Consensus 1 ilv~~Sgg~dS~~~~~la~~~~~~~~~~~~~~~~---------------------------------------------- 34 (86) T cd01984 1 ILVALSGGLDSSVLLHLAKRLKSGGPEVVALVVV---------------------------------------------- 34 (86) T ss_pred CEEEECCCCCHHHHHHHHHHHHCCCCEEEEEEHH---------------------------------------------- T ss_conf 9788617853899999999852599808976389---------------------------------------------- Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHH Q ss_conf 78742013012433599999998856892875001116821567999 Q gi|254780234|r 86 EGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQVRFEL 132 (404) Q Consensus 86 eg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~ 132 (404) .+.+.+-+.+.++|++.|.+|...+.++..|... T Consensus 35 -------------~~~~~~~~~~~~~~~~~i~~G~~~~~~~~~~~~~ 68 (86) T cd01984 35 -------------AFVRILKRLAAEEGADVIILGHNADDVAGRRLGA 68 (86) T ss_pred -------------HHHHHHHHHHHHCCCCEEEECCCCHHHHHHCCCC T ss_conf -------------9999999987760898899867732453310166 No 69 >TIGR03108 eps_aminotran_1 exosortase 1 system-associated amidotransferase 1. The predicted protein-sorting transpeptidase that we call exosortase (see TIGR02602) has distinct subclasses that associated with different types of exopolysaccharide production loci. This model represents a distinct clade among a set of amidotransferases largely annotated (not necessarily accurately) as glutatime-hydrolyzing asparagine synthases. Members of this clade are essentially restricted to the characteristic exopolysaccharide (EPS) regions that contain the exosortase 1 genome (xrtA), in genomes that also have numbers of PEP-CTERM domain (TIGR02595) proteins. Probab=97.73 E-value=0.00092 Score=47.30 Aligned_cols=108 Identities=16% Similarity=0.183 Sum_probs=70.0 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCC--CCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCC Q ss_conf 7899991477259999999987189869999945787--11068999999973980799820089999987999973686 Q gi|254780234|r 6 KKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQ--GEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANA 83 (404) Q Consensus 6 kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq--~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana 83 (404) -.|.+..|||||||.++..+....+..+.||++.... .+|...+++.|..+|.. |+++.+..+-. +.+.+.+.. T Consensus 259 VpvG~~LSGGlDSS~I~a~~~~~~~~~i~tfsi~f~~~~~DE~~~A~~vA~~~g~~-h~~~~~~~~~~-~~~~~~~~~-- 334 (628) T TIGR03108 259 VPLGAFLSGGVDSSAVVALMAGLSDTPVNTCSIAFDDPAFDESAYARQVAERYGTN-HRVETVDPDDF-SLVDRLAGL-- 334 (628) T ss_pred CCEEEEECCCCCHHHHHHHHHHHCCCCCCEEEEECCCCCCCHHHHHHHHHHHHCCC-CEEEECCHHHH-HHHHHHHHH-- T ss_conf 85687646886368999999984299864366534888754369999999860876-47997798999-879999998-- Q ss_pred CCCCCCCCCCCHHHHHHHH-HHHHHHHHCCCCEEECHHHCCCCHH Q ss_conf 5478742013012433599-9999988568928750011168215 Q gi|254780234|r 84 LYEGYYLLGTAIARPLIAK-YLVDIANETGADAIAHGSTGKGNDQ 127 (404) Q Consensus 84 ~Yeg~Ypl~tslaRplia~-~lv~~a~~~ga~~iaHG~TgkGNDQ 127 (404) ++ -|++.+. .++. .+.+.|++ ++.++-- |-|-|. T Consensus 335 -~d--eP~~d~s---~i~~~~l~~~a~~-~vkV~Ls---G~GgDE 369 (628) T TIGR03108 335 -YD--EPFADSS---ALPTYRVCELARK-RVTVALS---GDGGDE 369 (628) T ss_pred -HC--CCCCCHH---HHHHHHHHHHHHC-CCEEEEE---CCCCCC T ss_conf -46--8866678---9999999999864-9749996---677540 No 70 >TIGR03104 trio_amidotrans asparagine synthase family amidotransferase. Members of this protein family are closely related to several isoforms of asparagine synthetase (glutamine amidotransferase) and typically have been given this name in genome annotation to date. Each is part of a conserved three-gene cassette sparsely distributed across at least twenty different species known so far, including alpha, beta, and gamma Proteobacteria, Mycobacterium, and Prosthecochloris, which is a member of the Chlorobi. The other two members of the cassette are a probable protease and a member of the GNAT family of acetyltransferases. Probab=97.65 E-value=0.00065 Score=48.40 Aligned_cols=108 Identities=20% Similarity=0.246 Sum_probs=69.2 Q ss_pred EEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEEC---CC--CCHHHHHHHHHHHCCCEEEEEECHH-HHHHHHHHHHHHH Q ss_conf 8999914772599999999871898699999457---87--1106899999997398079982008-9999987999973 Q gi|254780234|r 7 KVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADL---GQ--GEELKIASDKARLLGAKEVYVKDLR-REFVRDFVFPMFR 80 (404) Q Consensus 7 kVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~---Gq--~~d~~~~~~~A~~~Ga~~~~v~D~r-~ef~~~~i~~~I~ 80 (404) .|.+..|||||||.++..+.+...-++.||++.. +. .+|...+++.|..+|.. |+.+.+. ++|. +.+..++. T Consensus 262 pvg~~LSGGlDSS~i~a~~~~~~~~~i~tfs~~f~~~~~~~~dE~~~a~~va~~~~~~-h~~~~~~~~~~~-~~l~~~~~ 339 (589) T TIGR03104 262 PVGVLLSGGLDSSLIVGLLAEAGVDGLRTFSIGFEDVGGEKGDEFEYSDIIAERFHTR-HHKIRIPNHRVL-PALPEAVA 339 (589) T ss_pred CCCEECCCCCCHHHHHHHHHHHCCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHHCCC-CEEEEECCHHHH-HHHHHHHH T ss_conf 6300017875438999999995699985278872468877666799999999973878-579995717899-98899885 Q ss_pred CCCCCCCCCCCCCCHHHHHHHHH-HHHHHHHCCCCEEECHHHCCCCHHH Q ss_conf 68654787420130124335999-9999885689287500111682156 Q gi|254780234|r 81 ANALYEGYYLLGTAIARPLIAKY-LVDIANETGADAIAHGSTGKGNDQV 128 (404) Q Consensus 81 ana~Yeg~Ypl~tslaRplia~~-lv~~a~~~ga~~iaHG~TgkGNDQv 128 (404) +. + -|++.. ..++.. +.+.|++ ++.++-- |-|-|.+ T Consensus 340 ~~---d--eP~~~~---~~~~~~~l~~~~~~-~~kV~Ls---G~GaDEl 376 (589) T TIGR03104 340 AM---S--EPMVSH---DCVAFYLLSEEVSK-HVKVVQS---GQGADEV 376 (589) T ss_pred CC---C--CCCCCC---HHHHHHHHHHHHCC-CCEEEEE---EECCCCC T ss_conf 26---8--987683---06889999987328-8369995---2033323 No 71 >KOG1622 consensus Probab=97.55 E-value=0.00044 Score=49.59 Aligned_cols=66 Identities=27% Similarity=0.354 Sum_probs=57.2 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHCCC-EEEEEEEECCC--CCHHHHHHHHHHHCCCEEEEEECHHHHHHH Q ss_conf 78999914772599999999871898-69999945787--110689999999739807998200899999 Q gi|254780234|r 6 KKVVLAYSGGLDTSIILKWLQVEKGL-EVIVFIADLGQ--GEELKIASDKARLLGAKEVYVKDLRREFVR 72 (404) Q Consensus 6 kkVvlaySGGLDTSv~i~~L~~e~g~-eVi~~~~d~Gq--~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~ 72 (404) .||+.+.|||.|||||.++|+.--|. .++|+.+|-|- ..|-+..++--..+|. +..++|.-++|.. T Consensus 231 ~~Vl~~vSGgvdStV~a~Ll~~alg~~R~~ai~vdNG~mrk~Ea~~V~~tl~~lgi-~i~v~~as~~f~s 299 (552) T KOG1622 231 YKVLVAVSGGVDSTVCAALLRRALGPDRVHAIHVDNGFMRKKEAEQVEKTLVYLGI-PITVVDASETFLS 299 (552) T ss_pred CCEEEEECCCCHHHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHHHHHHCCC-CEEEEECHHHHHH T ss_conf 45599834885089999999986378944899812640103677889999997388-4499621588887 No 72 >TIGR03183 DNA_S_dndC putative sulfurtransferase DndC. Members of this protein family are the DndC protein from the dnd (degradation during electrophoresis) operon. The dnd phenotype reflects a sulfur-containing modification to DNA. This operon is sparsely and sporadically distributed among bactera; among the first eight examples are members from the Actinobacteria, Firmicutes, Gammaproteobacteria, Cyanobacteria. DndC is suggested to be a sulfurtransferase. Probab=97.49 E-value=0.0021 Score=44.69 Aligned_cols=145 Identities=19% Similarity=0.255 Sum_probs=94.7 Q ss_pred EEEEEECCCHHHHHHHHHH--------HHHCCCEEEEEEEECCCC---------CHHHHHHHHHHHCCC--EEEEEE-CH Q ss_conf 8999914772599999999--------871898699999457871---------106899999997398--079982-00 Q gi|254780234|r 7 KVVLAYSGGLDTSIILKWL--------QVEKGLEVIVFIADLGQG---------EELKIASDKARLLGA--KEVYVK-DL 66 (404) Q Consensus 7 kVvlaySGGLDTSv~i~~L--------~~e~g~eVi~~~~d~Gq~---------~d~~~~~~~A~~~Ga--~~~~v~-D~ 66 (404) --|++||||-|||+++.++ .+...-.|+.++.|++-. .-++.+++.|.+.|. ..+.+. ++ T Consensus 15 PWvIGySGGKDSTavLqLvw~Al~~Lp~e~r~K~v~VIssDTlVEnP~V~~~v~~~L~~i~~~a~~~~LPi~~~~~~P~~ 94 (447) T TIGR03183 15 PWVIGYSGGKDSTAVLQLVWNALSELPPEQRTKKIHVISTDTLVENPIVAAWVNVSLERMQEAAQKQGLPIEPHRLTPAI 94 (447) T ss_pred CEEEECCCCHHHHHHHHHHHHHHHHCCHHHCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCC T ss_conf 87997587739999999999999847887576888999799875577999999999999999999749982368933785 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHH--------HHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHH---- Q ss_conf 89999987999973686547874201301243359--------999999885689287500111682156799999---- Q gi|254780234|r 67 RREFVRDFVFPMFRANALYEGYYLLGTAIARPLIA--------KYLVDIANETGADAIAHGSTGKGNDQVRFELSA---- 134 (404) Q Consensus 67 r~ef~~~~i~~~I~ana~Yeg~Ypl~tslaRplia--------~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~---- 134 (404) .+.|+-..| ...||-.+.--|=|.- +.+-+...+.|-.++.=| |.+.-.+.|=..-- T Consensus 95 ~dtFWvnlI----------GrGYPaP~~~FRWCTdrLKI~P~~~fi~~~v~~~ge~ilvLG-tR~~ES~~Ra~~i~k~e~ 163 (447) T TIGR03183 95 EDTFWVNLI----------GKGYPAPRQKFRWCTDRLKISPSNTFIRDKVAANGEVILVLG-TRKNESQARAAVMEKHEG 163 (447) T ss_pred CCCHHHHHC----------CCCCCCCCCCCCCCCHHHHCCHHHHHHHHHHHCCCCEEEEEE-CCHHHHHHHHHHHHHHHC T ss_conf 443000103----------463079998765255766368289999986521496899995-652777999999998644 Q ss_pred ----HHHC-----CCCEEEEEECCCCCCCHHHHHHHHHHH Q ss_conf ----9858-----797077441006876379999999970 Q gi|254780234|r 135 ----YSLN-----SDIEIIAPWRHWSFKGRQDLIDFAEKH 165 (404) Q Consensus 135 ----~~l~-----P~l~viaP~Rd~~~~sRe~~i~ya~~~ 165 (404) ..|. |+.-|++|+.+|.. ++-.+|+.++ T Consensus 164 ~~~~~~ls~h~~l~n~~vy~PI~dws~---ddVW~yL~~~ 200 (447) T TIGR03183 164 GSTRDRLSRNSSLPNSWVYTPIEDWSN---DDVWMYLLQN 200 (447) T ss_pred CCCHHHCCCCCCCCCCEEECCHHHCCH---HHHHHHHCCC T ss_conf 670421276888888379730674764---0899986338 No 73 >PRK06850 hypothetical protein; Provisional Probab=97.43 E-value=0.0048 Score=42.17 Aligned_cols=146 Identities=18% Similarity=0.246 Sum_probs=94.4 Q ss_pred EEEEEECCCHHHHHHHHHH--------HHHCCCEEEEEEEECCC--C-------CHHHHHHHHHHHCCC--EEEEE-ECH Q ss_conf 8999914772599999999--------87189869999945787--1-------106899999997398--07998-200 Q gi|254780234|r 7 KVVLAYSGGLDTSIILKWL--------QVEKGLEVIVFIADLGQ--G-------EELKIASDKARLLGA--KEVYV-KDL 66 (404) Q Consensus 7 kVvlaySGGLDTSv~i~~L--------~~e~g~eVi~~~~d~Gq--~-------~d~~~~~~~A~~~Ga--~~~~v-~D~ 66 (404) --|++||||-|||+++.+. .++..-.|+.++-|++= | .-++.+++.|...|. ..+.+ =++ T Consensus 23 PWvIGySGGKDSTavLqLvw~Al~~Lp~e~r~K~v~VIssDTlVEnPiV~~~v~~sL~~i~~~A~~~~LPI~~~~v~P~~ 102 (488) T PRK06850 23 PWVIGYSGGKDSTAVLQLVWNALAGLPPEKRHKPVYVISTDTLVENPVVVAWVNKSLERIKEAAKKQGLPITPHKLTPKV 102 (488) T ss_pred CEEEECCCCHHHHHHHHHHHHHHHHCCHHHCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCC T ss_conf 87997587719999999999999837887668988999589875677999999999999999998659982268814886 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHH--------HHHHHHHHHCCCCEEECHHHCCCCHHHHH---HHHH- Q ss_conf 89999987999973686547874201301243359--------99999988568928750011168215679---9999- Q gi|254780234|r 67 RREFVRDFVFPMFRANALYEGYYLLGTAIARPLIA--------KYLVDIANETGADAIAHGSTGKGNDQVRF---ELSA- 134 (404) Q Consensus 67 r~ef~~~~i~~~I~ana~Yeg~Ypl~tslaRplia--------~~lv~~a~~~ga~~iaHG~TgkGNDQvRF---e~~~- 134 (404) .+.|+-..| ...||-.+.--|=|.- +.+-+...+.|-.++.=| |.+.-.+.|= +..- T Consensus 103 ~dtFWvnlI----------GrGYPaP~~~FRWCTdRLKI~P~~rFI~~~v~~~ge~IlvLG-tR~~ES~~Ra~~m~k~e~ 171 (488) T PRK06850 103 KDTFWVNLI----------GKGYPAPRRKFRWCTERLKINPSNDFIRDKVSAFGEVIVVLG-TRKAESAARAQVMAKHKI 171 (488) T ss_pred CCCHHHHHC----------CCCCCCCCCCCCCCCHHHHCCHHHHHHHHHHHHCCCEEEEEE-CCHHHHHHHHHHHHHHHC T ss_conf 544335420----------476799987665475665148489999999875397899985-452777999999998752 Q ss_pred ----HHHCC-----CCEEEEEECCCCCCCHHHHHHHHHHHC Q ss_conf ----98587-----970774410068763799999999708 Q gi|254780234|r 135 ----YSLNS-----DIEIIAPWRHWSFKGRQDLIDFAEKHA 166 (404) Q Consensus 135 ----~~l~P-----~l~viaP~Rd~~~~sRe~~i~ya~~~g 166 (404) ..|.| +.-|++|+.+|... +-..|+.+.. T Consensus 172 ~~~r~~Ls~h~~l~n~~vy~PI~~ws~d---dVW~yL~~~~ 209 (488) T PRK06850 172 GGTRDRLSRHSTLPNSFVYTPIEDWSND---DVWKYLLQWE 209 (488) T ss_pred CCCCCCCCCCCCCCCCEEECCHHHCCCC---HHHHHHHHCC T ss_conf 4730125778888886897125537733---7999986189 No 74 >PRK00768 nadE NAD synthetase; Reviewed Probab=97.21 E-value=0.011 Score=39.52 Aligned_cols=157 Identities=17% Similarity=0.174 Sum_probs=84.5 Q ss_pred CCCEEEEEECCCHHHHHHHHHH-------HHH---CCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHHHH Q ss_conf 7778999914772599999999-------871---898699999457871106899999997398079982008999998 Q gi|254780234|r 4 DVKKVVLAYSGGLDTSIILKWL-------QVE---KGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFVRD 73 (404) Q Consensus 4 ~~kkVvlaySGGLDTSv~i~~L-------~~e---~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~ 73 (404) ..+.+||.-|||.||+++...- ..+ ..+.++++..-.++..+.++++..+..+++.....+|...-+ +. T Consensus 39 g~~g~VlGLSGGIDSAvta~L~~~Av~al~~~~~~~~~~~iav~mP~~~~~~~~da~~~~~~i~~~~~~~i~I~~~~-d~ 117 (274) T PRK00768 39 GLKTLVLGISGGQDSTLAGRLAQLAVEELRAETGDASYQFIAVRLPYGVQADEDDAQDALAFIQPDEVLTVNIKPAV-DA 117 (274) T ss_pred CCCEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCCCCCHHHHHHHHHHCCCCEEEEECCCHHH-HH T ss_conf 99839981885768899999999999986530476642268976898865799999999852276604886170999-99 Q ss_pred HHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEE-EEEECCCCC Q ss_conf 7999973686547874201301243359999999885689287500111682156799999985879707-744100687 Q gi|254780234|r 74 FVFPMFRANALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEI-IAPWRHWSF 152 (404) Q Consensus 74 ~i~~~I~ana~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~v-iaP~Rd~~~ 152 (404) +....-.+....++ .-.+-.-+|-= .-.+-.+|...|.=+ - |-||.-..+ +.+-++.-|..+ ++|+-| + T Consensus 118 ~~~~l~~~~~~~~d-~~~~NiqaRiR-M~~LY~~An~~g~LV--l---GTgNksE~~-vGYfTkYGDg~~Di~PI~d--L 187 (274) T PRK00768 118 SVAALRAAGIELSD-FVKGNIKARER-MIAQYAIAGARGGLV--V---GTDHAAEAI-TGFFTKFGDGGADLLPLFG--L 187 (274) T ss_pred HHHHHHHHCCCCHH-HHHHHHHHHHH-HHHHHHHHHCCCCEE--E---CCCCHHHHH-CCCEEEECCCCCCHHHHCC--C T ss_conf 99999872775116-78888579899-999999982789988--6---488640444-0850102677517376325--6 Q ss_pred CCHHHHHHHHHHHCCCCCCC Q ss_conf 63799999999708967566 Q gi|254780234|r 153 KGRQDLIDFAEKHAIPIDKN 172 (404) Q Consensus 153 ~sRe~~i~ya~~~gIpv~~~ 172 (404) ...+--+.|+..|+|-..- T Consensus 188 -~KteV~~lA~~LgvP~~Ii 206 (274) T PRK00768 188 -NKRQGRALLAALGAPEHLY 206 (274) T ss_pred -CHHHHHHHHHHHCCCHHHH T ss_conf -1999999999959799983 No 75 >TIGR00342 TIGR00342 thiamine biosynthesis/tRNA modification protein ThiI; InterPro: IPR003720 Thiamine pyrophosphate (TPP) is synthesized de novo in many bacteria and is a required cofactor for many enzymes in the cell. ThiI is required for thiazole synthesis in the thiamine biosynthesis pathway . Almost all the proteins in this group have an N-terminal THUMP domain (see IPR004114 from INTERPRO).; GO: 0009228 thiamin biosynthetic process. Probab=97.20 E-value=0.021 Score=37.54 Aligned_cols=148 Identities=20% Similarity=0.309 Sum_probs=95.0 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCCC---HHHHHHHHHH------HCCCEEEEEECHHHHHHHHHHH Q ss_conf 789999147725999999998718986999994578711---0689999999------7398079982008999998799 Q gi|254780234|r 6 KKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQGE---ELKIASDKAR------LLGAKEVYVKDLRREFVRDFVF 76 (404) Q Consensus 6 kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~---d~~~~~~~A~------~~Ga~~~~v~D~r~ef~~~~i~ 76 (404) .||+..-|||.||.|+. +|..+.|++|++++.+..... -.+.++.=|. ..+-.+.+++|+.+ ..++.. T Consensus 187 Gkvl~LlSGGiDSPVAa-f~~m~RGc~V~~vhf~~~~~~~~~~~~kv~~la~~~~~~~~~~~~~l~~~~F~~-~~~~~~- 263 (391) T TIGR00342 187 GKVLALLSGGIDSPVAA-FLAMKRGCRVVAVHFFNEPAASEKAREKVERLANLLSLNETGGSVKLYVVDFTD-VQEEII- 263 (391) T ss_pred HHHHHHHCCCCCHHHHH-HHHHHCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEEECHHH-HHHHHH- T ss_conf 36320411883516799-999663877999973288552468999999999988540003799999854389-999998- Q ss_pred HHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHC-CCCHHHHHHHHHHHHCCCCEEEE-EECCCCCCC Q ss_conf 9973686547874201301243359999999885689287500111-68215679999998587970774-410068763 Q gi|254780234|r 77 PMFRANALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTG-KGNDQVRFELSAYSLNSDIEIIA-PWRHWSFKG 154 (404) Q Consensus 77 ~~I~ana~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~Tg-kGNDQvRFe~~~~~l~P~l~via-P~Rd~~~~s 154 (404) . -.-|+ | ..-+-|=++-+..=.+|.++||+||-.|..- ==-.|-==++.+-.-+-+..|+. |.= .+ + T Consensus 264 ~-----~~~e~-~--~~v~~rR~M~~~A~~~ae~~g~~A~VTGe~LGQVASQTL~Nl~vI~~~~~~~iL~RPLI--g~-D 332 (391) T TIGR00342 264 E-----IIPEK-Y--TMVLCRRLMLKIASKVAEKEGCLAIVTGESLGQVASQTLENLRVIQAVVNTPILRRPLI--GM-D 332 (391) T ss_pred H-----CCCCC-C--EEEEHHHHHHHHHHHHHHHCCCCEEEECCCCCCHHHHHHHHHHHHHHCCCCCEEECCCC--CC-C T ss_conf 4-----18998-7--88740667999999888860994899766342037778889999972378742407755--47-9 Q ss_pred HHHHHHHHHHHCC Q ss_conf 7999999997089 Q gi|254780234|r 155 RQDLIDFAEKHAI 167 (404) Q Consensus 155 Re~~i~ya~~~gI 167 (404) -++-++.|++-|= T Consensus 333 K~~Ii~~Ak~IgT 345 (391) T TIGR00342 333 KEEIIELAKEIGT 345 (391) T ss_pred HHHHHHHHHHCCC T ss_conf 7899999741396 No 76 >PTZ00323 NAD+ synthase; Provisional Probab=97.19 E-value=0.0085 Score=40.38 Aligned_cols=158 Identities=16% Similarity=0.226 Sum_probs=87.0 Q ss_pred CCCEEEEEECCCHHHHHHHHHHHHHCCCE------EEEEEEECCC-CCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHH Q ss_conf 77789999147725999999998718986------9999945787-1106899999997398079982008999998799 Q gi|254780234|r 4 DVKKVVLAYSGGLDTSIILKWLQVEKGLE------VIVFIADLGQ-GEELKIASDKARLLGAKEVYVKDLRREFVRDFVF 76 (404) Q Consensus 4 ~~kkVvlaySGGLDTSv~i~~L~~e~g~e------Vi~~~~d~Gq-~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~ 76 (404) ..++|||.-|||.||++++.....--|.| ++++..-..+ ...+++.++.+..+|+. .+++|...-+ +.+. T Consensus 45 GfkgvVLGlSGGIDSAl~aaLA~~Alg~env~~~r~~gv~~P~~ss~~s~~~a~~~a~~~g~~-~~~~~~~~i~--~~~~ 121 (294) T PTZ00323 45 GLKACVTSVSGGIDSAVVLALCSRAMRMPNSPIVRNVGICQPICSSAWALARGRENIAACGAT-EVVVDQTELH--KQLS 121 (294) T ss_pred CCCEEEEECCCCHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHCCCC-EEECCCHHHH--HHHH T ss_conf 998599958636999999999999865545753035677678545766598799999976885-2323607779--9999 Q ss_pred HHHHCCCCCCC-CCCCCCCHHH--HHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEE-EEEECCCCC Q ss_conf 99736865478-7420130124--3359999999885689287500111682156799999985879707-744100687 Q gi|254780234|r 77 PMFRANALYEG-YYLLGTAIAR--PLIAKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEI-IAPWRHWSF 152 (404) Q Consensus 77 ~~I~ana~Yeg-~Ypl~tslaR--plia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~v-iaP~Rd~~~ 152 (404) .++....-.++ .+-.+.--+| -.+.-.+...+.+.|-..+-- |-||.-..+-+.+-++.-|+.. ++|+-| + T Consensus 122 ~~~~~~~g~~~~d~~~gNlqAR~Rm~~~~~la~l~n~~g~~~LVl---gTgNkSE~~~vGY~TkYGD~agd~apI~D--L 196 (294) T PTZ00323 122 TLVETAVGIDGGDFARGQLRSYMRTPVGYYVAQLLSQEGTPAIVM---GTGNMDEDGYLGYFCKAGDGVVDVQLISD--L 196 (294) T ss_pred HHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEE---CCCCCCCCHHCEEEEEECCCCCCCHHHCC--C T ss_conf 999986088610467889999998889999999860048870786---28975542120044203477547033046--8 Q ss_pred CCHHHHHHHHHHHCCCCC Q ss_conf 637999999997089675 Q gi|254780234|r 153 KGRQDLIDFAEKHAIPID 170 (404) Q Consensus 153 ~sRe~~i~ya~~~gIpv~ 170 (404) . ..+--+.|+..|||-. T Consensus 197 ~-Kt~V~~Lar~LgiPe~ 213 (294) T PTZ00323 197 H-KSEVFLVAEELGVPEN 213 (294) T ss_pred C-HHHHHHHHHHCCCCHH T ss_conf 3-9999999998099899 No 77 >PRK00876 nadE NAD synthetase; Reviewed Probab=97.19 E-value=0.0027 Score=43.91 Aligned_cols=159 Identities=21% Similarity=0.257 Sum_probs=91.3 Q ss_pred CCCEEEEEECCCHHHHHHHHHHHHHCCC-EEEEEEEEC--CCCCHHHHHHHHHHHCCCEEEEEECHHHHHHH-------- Q ss_conf 7778999914772599999999871898-699999457--87110689999999739807998200899999-------- Q gi|254780234|r 4 DVKKVVLAYSGGLDTSIILKWLQVEKGL-EVIVFIADL--GQGEELKIASDKARLLGAKEVYVKDLRREFVR-------- 72 (404) Q Consensus 4 ~~kkVvlaySGGLDTSv~i~~L~~e~g~-eVi~~~~d~--Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~-------- 72 (404) ..|-+||+-|||.||||+..+...--|- .|+++.+-- -.+++.+.++..|..+|+ ++.++|...-+-. T Consensus 32 ~~kG~VlGlSGGIDSAv~a~Lav~AlG~e~V~gl~MP~~~s~~~s~~~a~~la~~lGi-~~~~idI~~~l~~~~~y~~~d 110 (325) T PRK00876 32 KRRGVVLGLSGGIDSSVTLALCVRALGKDRVLGLLMPERDSSPDSLRLGRMLAESLGV-EYVVEDISPALEALGCYERRD 110 (325) T ss_pred CCCEEEEECCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHHHHHHHHCCC-CEEEEECHHHHHHHHHHHHHH T ss_conf 9867999687688899999999997485516999788656887789999999998199-359984489998754313568 Q ss_pred HH---HHHH---------HHCCCCCCCCCCC---------C------CCHHH--------HHHH--H--HHHHHHHHCCC Q ss_conf 87---9999---------7368654787420---------1------30124--------3359--9--99999885689 Q gi|254780234|r 73 DF---VFPM---------FRANALYEGYYLL---------G------TAIAR--------PLIA--K--YLVDIANETGA 113 (404) Q Consensus 73 ~~---i~~~---------I~ana~Yeg~Ypl---------~------tslaR--------plia--~--~lv~~a~~~ga 113 (404) +. ++|- ...|++-.+.+-. + .+.+. -+.+ + .+-.+|.+.|. T Consensus 111 ~~i~~~~p~~~~~~~~~i~~~~~l~~~~~~~~~l~~~~~~~~~~~~~l~~~~~l~~~a~~NiKaR~RM~~lY~~A~~~n~ 190 (325) T PRK00876 111 EAIRRVVPEYGPGWKSKLVLPNFLEGDLLNVPYLVVQDPGGEVRKKRLPPADYLQIVAATNFKQRTRKMVEYYHADRLNY 190 (325) T ss_pred HHHHHHCCCCCCCHHCCCCCCCHHCCCCCCCHHHHCCCCCCCCCCCCCCHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCC T ss_conf 88875033324200000002311102322310121046421000013550344445553233689999999999987398 Q ss_pred CEEECHHHCCCCHHHHHHHHHHHHCCCCEE-EEEECCCCCCCHHHHHHHHHHHCCCCCCC Q ss_conf 287500111682156799999985879707-74410068763799999999708967566 Q gi|254780234|r 114 DAIAHGSTGKGNDQVRFELSAYSLNSDIEI-IAPWRHWSFKGRQDLIDFAEKHAIPIDKN 172 (404) Q Consensus 114 ~~iaHG~TgkGNDQvRFe~~~~~l~P~l~v-iaP~Rd~~~~sRe~~i~ya~~~gIpv~~~ 172 (404) -++ |-+|.-. |...+-+..-|..+ ++|+-+ +. ..+--+.|+.-|+|-..- T Consensus 191 lVl-----GT~NksE-~~~GyfTKyGDGa~Di~PI~~--Ly-KtQV~~LA~~LgvPe~Ii 241 (325) T PRK00876 191 AVA-----GTPNRLE-YDQGFFVKYGDGAADLKPIAH--LY-KTQVYALAEYLGVPEEIR 241 (325) T ss_pred EEE-----CCCCHHH-HHHCCEEEECCCCCCCCCCCC--CC-HHHHHHHHHHCCCCHHHH T ss_conf 776-----5884655-663514754478878451357--75-899999999809999993 No 78 >PRK08384 thiamine biosynthesis protein ThiI; Provisional Probab=97.16 E-value=0.03 Score=36.48 Aligned_cols=122 Identities=18% Similarity=0.218 Sum_probs=57.2 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHC-----CCEEEEEECHHHHHHHHHHHHHHH Q ss_conf 789999147725999999998718986999994578711068999999973-----980799820089999987999973 Q gi|254780234|r 6 KKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQGEELKIASDKARLL-----GAKEVYVKDLRREFVRDFVFPMFR 80 (404) Q Consensus 6 kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~-----Ga~~~~v~D~r~ef~~~~i~~~I~ 80 (404) .|+++.+|||. |.|+ .||....|++|++++.++|. +..+.+.+.+..+ |. +...+.+ T Consensus 180 Gk~l~LlSGGi-SPVA-a~~mmKRG~~v~~vhf~~~~-~~~~kv~~l~~~L~~y~~~~-~~~~~~~-------------- 241 (310) T PRK08384 180 GKMVGLLSDEL-SAVA-IFLMMKRGVEVIPVHIGMGE-KNLEKVRKLWNQLKKYSYGS-KGRLVVV-------------- 241 (310) T ss_pred CCEEEEECCCC-CHHH-HHHHHHCCCEEEEEEECCCH-HHHHHHHHHHHHHHHHCCCC-CEEEEEE-------------- T ss_conf 84899953886-3999-99998569879999856887-89999999999999867998-4699996-------------- Q ss_pred CCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHH-HCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHH Q ss_conf 6865478742013012433599999998856892875001-116821567999999858797077441006876379999 Q gi|254780234|r 81 ANALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGS-TGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDLI 159 (404) Q Consensus 81 ana~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~-TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~i 159 (404) ++ -+.+-++|++.||.++..|- -|-=-.|-==.+..-.-+-++.|+-|.- .+. -+|-| T Consensus 242 ----------------~~--~~~a~~ia~~~~~~alvTGEsLGQVASQTl~nl~~i~~~~~~PVlRPLI--g~D-K~EII 300 (310) T PRK08384 242 ----------------KN--FERVNKIIRDFGAKGVVKGLRPEQLASETLENIYEDSRMFDVPVYYPLI--ALP-DEYIE 300 (310) T ss_pred ----------------CH--HHHHHHHHHHHCCCEEEECCCHHHHHHHHHHHHHHHHHCCCCCCCCCCC--CCC-HHHHH T ss_conf ----------------65--9999999998599899977412554545798899997304887303856--899-89999 Q ss_pred HHHHHHC Q ss_conf 9999708 Q gi|254780234|r 160 DFAEKHA 166 (404) Q Consensus 160 ~ya~~~g 166 (404) +.|++-| T Consensus 301 ~~Ar~IG 307 (310) T PRK08384 301 KVKEKIG 307 (310) T ss_pred HHHHHCC T ss_conf 9998638 No 79 >COG0171 NadE NAD synthase [Coenzyme metabolism] Probab=97.10 E-value=0.0027 Score=43.97 Aligned_cols=155 Identities=23% Similarity=0.270 Sum_probs=91.5 Q ss_pred CCCEEEEEECCCHHHHHHHHHHHHHCC-----CEEEEEEEECC--CCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHH Q ss_conf 777899991477259999999987189-----86999994578--71106899999997398079982008999998799 Q gi|254780234|r 4 DVKKVVLAYSGGLDTSIILKWLQVEKG-----LEVIVFIADLG--QGEELKIASDKARLLGAKEVYVKDLRREFVRDFVF 76 (404) Q Consensus 4 ~~kkVvlaySGGLDTSv~i~~L~~e~g-----~eVi~~~~d~G--q~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~ 76 (404) ..|+|||.-|||+||++++.....--| +.|++++...+ ...+.+++..-+..+|+. ...+|.++.+ ..|.. T Consensus 24 ~~k~~VlGiSGGiDSa~~~~La~~A~~~~~~~~~~~av~mP~~~~~~~~~~da~~~~~~lg~~-~~~i~I~~~v-~~~~~ 101 (268) T COG0171 24 GFKGVVLGLSGGIDSALVLALAVRALGKGDSKENVLAVRLPYGYTVQADEEDAQDLAEALGID-YKEINIKPAV-DAFLK 101 (268) T ss_pred CCCCEEEECCCCHHHHHHHHHHHHHHCCCCCHHHEEEEECCCCCCCCCCHHHHHHHHHHHCCC-EEEEECHHHH-HHHHH T ss_conf 999869976668199999999999856565124326686788776534799999999982996-6997528889-99887 Q ss_pred HHHHCCCC-CCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEE-EEEECCCCCCC Q ss_conf 99736865-47874201301243359999999885689287500111682156799999985879707-74410068763 Q gi|254780234|r 77 PMFRANAL-YEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEI-IAPWRHWSFKG 154 (404) Q Consensus 77 ~~I~ana~-Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~v-iaP~Rd~~~~s 154 (404) ++.....- +....-++---+|-= ...+-.+|.+.|.=++. -||--.-.-. +-+..-|..+ ++|+-+ + . T Consensus 102 ~~~~~~~~~~~~~~~~~NikaR~R-m~~lY~~An~~~~lVlG-----Tgn~sE~~~G-y~TkyGDg~~d~~Pi~~--L-~ 171 (268) T COG0171 102 KLLKLFLGIYLEDLALGNIKARLR-MVILYAIANKLGGLVLG-----TGNKSELALG-YFTKYGDGAVDINPIAD--L-Y 171 (268) T ss_pred HHHHHHCCCCHHHHHHHHHHHHHH-HHHHHHHHHHCCCEEEC-----CCCHHHHHCC-CEECCCCCCCCHHHHCC--C-C T ss_conf 666652344211277764509999-99999998555978975-----8848778607-44320676447054148--7-5 Q ss_pred HHHHHHHHHHHCCCCC Q ss_conf 7999999997089675 Q gi|254780234|r 155 RQDLIDFAEKHAIPID 170 (404) Q Consensus 155 Re~~i~ya~~~gIpv~ 170 (404) ..+--+.++.-|||-. T Consensus 172 KtqV~~La~~l~ipe~ 187 (268) T COG0171 172 KTQVYALARHLGIPEE 187 (268) T ss_pred HHHHHHHHHHCCCCHH T ss_conf 8999999987399999 No 80 >PRK02628 nadE NAD synthetase; Reviewed Probab=97.04 E-value=0.013 Score=39.17 Aligned_cols=201 Identities=21% Similarity=0.255 Sum_probs=113.0 Q ss_pred CCEEEEEECCCHHHHHHH----HHHHHHCC---CEEEEEEEE-CCC-CCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHH Q ss_conf 778999914772599999----99987189---869999945-787-110689999999739807998200899999879 Q gi|254780234|r 5 VKKVVLAYSGGLDTSIIL----KWLQVEKG---LEVIVFIAD-LGQ-GEELKIASDKARLLGAKEVYVKDLRREFVRDFV 75 (404) Q Consensus 5 ~kkVvlaySGGLDTSv~i----~~L~~e~g---~eVi~~~~d-~Gq-~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i 75 (404) .+||||.-|||+||+.++ .-+ +.-| -.|+++++= -+. ..-.+++.+-|..+|+ .+..++.++.+- .+. T Consensus 362 ~~~vViGLSGGiDSaLaLLVaa~A~-d~Lg~~r~~V~~vtMPs~~TS~~S~~dA~~La~~LGi-~~~~I~I~~a~~-~~~ 438 (678) T PRK02628 362 IKKVVIGISGGLDSTLALLVAAKAF-DRLGLPRKNILAYTMPGFGTTDRTKNNAVALMKALGV-TAREIDIRPAAL-QML 438 (678) T ss_pred CCEEEEECCCCHHHHHHHHHHHHHH-HHHCCCHHCEEEEECCCCCCCHHHHHHHHHHHHHCCC-CEEEEEHHHHHH-HHH T ss_conf 9818996777644799999999999-9848971224899778876578789999999997299-779976299999-999 Q ss_pred HHHHH-----C----CCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHH--HHHH--CCCCE Q ss_conf 99973-----6----8654787420130124335999999988568928750011168215679999--9985--87970 Q gi|254780234|r 76 FPMFR-----A----NALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQVRFELS--AYSL--NSDIE 142 (404) Q Consensus 76 ~~~I~-----a----na~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~--~~~l--~P~l~ 142 (404) ..|- . +..||| .-|-.| +..|..+|++.|.=++..|= +-|++ +.++ +-.+. T Consensus 439 -~di~~~~~~~~~~~Dvt~EN----iQAR~R---~~iLM~laNk~g~LVL~TGN--------kSElAvGy~Ty~yGD~mg 502 (678) T PRK02628 439 -KDIGHPFARGEKVYDVTFEN----VQAGER---TQILFRLANQRGGLVIGTGD--------LSELALGWCTYGVGDHMS 502 (678) T ss_pred -HHHCCHHHCCCCCCCCHHHH----HHHHHH---HHHHHHHHCCCCCEEEECCC--------CCHHHHCEECCCCCCCCC T ss_conf -98444222378776602665----203556---78999974577967973685--------146864831047754124 Q ss_pred EEEEECCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCCHHHCCCCCCHHHCCCCCEE Q ss_conf 77441006876379999999970896756656788641572431154576324584576820100146581227999638 Q gi|254780234|r 143 IIAPWRHWSFKGRQDLIDFAEKHAIPIDKNKRGEAPFSIDTNLLHSSSEGRVLEDPSQPAPEYVYKMIVSPEEAPDTPTT 222 (404) Q Consensus 143 viaP~Rd~~~~sRe~~i~ya~~~gIpv~~~~~~~~~yS~D~Nlwg~S~Egg~Ledp~~~~pe~~~~~t~~p~~ap~~pe~ 222 (404) -++|.-+-.-+--...+.|+.+++---+ .. .+.+.++...-.+|+--|.. T Consensus 503 ~yavn~~VpKTlV~~L~r~~a~~~~~~~------~~---------------------~~vl~~Il~tpiSpEL~P~~--- 552 (678) T PRK02628 503 HYNVNASVPKTLIQHLIRWVAESGQFDE------AV---------------------SEVLLDILDTPISPELVPPD--- 552 (678) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHCCCCC------CH---------------------HHHHHHHCCCCCCCCCCCCC--- T ss_conf 6244678837899999999986244453------11---------------------25689854699991427988--- Q ss_pred EEEEEECCEEEEECCEECCHHHHHHHHHHHHHHCCCC Q ss_conf 9999950247875162318999999998886421637 Q gi|254780234|r 223 IRIDFQRGDPIAINGQVMSPEVLLEQLNQYGRCNGIG 259 (404) Q Consensus 223 v~I~Fe~G~PVainG~~~~~~~li~~LN~igg~~GvG 259 (404) ++|.+..-+-..+.|.+|.-..---.=++|.+ T Consensus 553 -----~~~~i~QktEd~lgPYel~Df~Ly~~lr~g~~ 584 (678) T PRK02628 553 -----EEGEIVQKTEDIVGPYELQDFFLYYFLRYGFR 584 (678) T ss_pred -----CCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCC T ss_conf -----66778874635578757789999999972898 No 81 >KOG0571 consensus Probab=96.50 E-value=0.05 Score=34.86 Aligned_cols=113 Identities=19% Similarity=0.225 Sum_probs=67.5 Q ss_pred EEEEEECCCHHHHHHHHHH----HH---HCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHH Q ss_conf 8999914772599999999----87---1898699999457871106899999997398079982008999998799997 Q gi|254780234|r 7 KVVLAYSGGLDTSIILKWL----QV---EKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMF 79 (404) Q Consensus 7 kVvlaySGGLDTSv~i~~L----~~---e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I 79 (404) ...+.-|||||||.+..-. ++ .+|..+++|++-+-..-|+..+++.|.-+|...|...=--+|= -|-+-..| T Consensus 227 p~GvLLSGGLDSSLvAsia~R~lk~~~~~~~~~lhsFaIGle~SPDL~aarkVAd~igt~Hhe~~ft~qeg-idal~eVI 305 (543) T KOG0571 227 PFGVLLSGGLDSSLVASIAARELKKAQAARGSKLHSFAIGLEDSPDLLAARKVADFIGTIHHEHTFTIQEG-IDALDEVI 305 (543) T ss_pred CEEEEEECCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCCHHHHHHHHHHHHHCCCCEEEEECHHHH-HHHHHHHH T ss_conf 61589507731899999999999876641278754788528998537888999987488523899758887-77888774 Q ss_pred HCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHH Q ss_conf 3686547874201301243359999999885689287500111682156 Q gi|254780234|r 80 RANALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQV 128 (404) Q Consensus 80 ~ana~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQv 128 (404) ..-..|.=+. .-.|..-.|+ -+.-++.|...|.- |-|-|.+ T Consensus 306 ~hLETYDvtt-IRastpmyLl----sr~Ikk~gvkmvlS---GEGsDEi 346 (543) T KOG0571 306 YHLETYDVTT-IRASTPMYLL----SRKIKKLGVKMVLS---GEGSDEI 346 (543) T ss_pred EEEECCCCCE-EECCCCHHHH----HHHHHHCCEEEEEE---CCCCHHH T ss_conf 2410134415-8617746789----99997655289981---5771344 No 82 >PRK12563 sulfate adenylyltransferase subunit 2; Provisional Probab=96.45 E-value=0.048 Score=35.00 Aligned_cols=212 Identities=17% Similarity=0.245 Sum_probs=112.0 Q ss_pred CCCEEEEEECCCHHHHHHHHHHHHH-C-C-CEEEEEEEECCCC--CHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHH Q ss_conf 7778999914772599999999871-8-9-8699999457871--10689999999739807998200899999879999 Q gi|254780234|r 4 DVKKVVLAYSGGLDTSIILKWLQVE-K-G-LEVIVFIADLGQG--EELKIASDKARLLGAKEVYVKDLRREFVRDFVFPM 78 (404) Q Consensus 4 ~~kkVvlaySGGLDTSv~i~~L~~e-~-g-~eVi~~~~d~Gq~--~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~ 78 (404) +-++.|+.||||-||+|++.+-++- + + .-.-.+++|+|.. |=++--.+.|...|. +.+|-...+. T Consensus 36 ~f~~pvmL~S~GKDS~VmlhLA~KAF~p~~iPfPllHIDTg~kF~Emi~fRD~~a~~~~l-~LiV~~n~~~--------- 105 (312) T PRK12563 36 ECSKPVMLYSIGKDSSVMLHLAMKAFRPTRPPFPLLHVDTTWKFREMIDFRDRRAKELGL-DLVVHHNPDG--------- 105 (312) T ss_pred HHCCCEEEEECCCHHHHHHHHHHHHHCCCCCCCCEEEEECCCCHHHHHHHHHHHHHHHCC-EEEEECCHHH--------- T ss_conf 736967986167429999999999737789997358863688759999999999998198-6898328788--------- Q ss_pred HHCCCCCCCCCCC--CCCHH-HHHHHHHHHHHHHHCCCCEEECHHHCCCC---------------------HHHH--HHH Q ss_conf 7368654787420--13012-43359999999885689287500111682---------------------1567--999 Q gi|254780234|r 79 FRANALYEGYYLL--GTAIA-RPLIAKYLVDIANETGADAIAHGSTGKGN---------------------DQVR--FEL 132 (404) Q Consensus 79 I~ana~Yeg~Ypl--~tsla-Rplia~~lv~~a~~~ga~~iaHG~TgkGN---------------------DQvR--Fe~ 132 (404) |. .|..|. ++... ..+....|.+.-.+.+-|++--|+----. +|.- +++ T Consensus 106 i~-----~g~~p~~~~~~~~~~~lKT~~L~~ale~~~fDa~~gGaRRDEEksRAKERiFS~Rd~~~~WDPknQrPElWnl 180 (312) T PRK12563 106 IA-----RGIVPFRHGSALHTDVAKTQGLKQALDHHGFDAAIGGARRDEEKSRAKERIFSFRSAFHRWDPKAQRPELWSL 180 (312) T ss_pred HH-----CCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCEEECCCCCCCCCCCCCCCHHHHHH T ss_conf 86-----6999676783777668988999999997398878504532134234434030666778898855588358776 Q ss_pred HHHHHCCCCEE-EEEECCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCCHHHCCCCC Q ss_conf 99985879707-74410068763799999999708967566567886415724311545763245845768201001465 Q gi|254780234|r 133 SAYSLNSDIEI-IAPWRHWSFKGRQDLIDFAEKHAIPIDKNKRGEAPFSIDTNLLHSSSEGRVLEDPSQPAPEYVYKMIV 211 (404) Q Consensus 133 ~~~~l~P~l~v-iaP~Rd~~~~sRe~~i~ya~~~gIpv~~~~~~~~~yS~D~Nlwg~S~Egg~Ledp~~~~pe~~~~~t~ 211 (404) +-..+.|.-.+ +-|+- +++ --+-.+|-.+.+||++.- =|+-..-..-+ .|.| ....++-+. T Consensus 181 yN~~~~~Geh~RVfPls--NWT-ElDIW~YI~~E~Ipi~~L-----YFa~~R~vv~R---dG~l----i~vdd~~~~--- 242 (312) T PRK12563 181 YNARLRRGESLRVFPLS--NWT-ELDVWQYIAREKIPLVPL-----YFAKRRPVVER---DGLL----IMVDDERTP--- 242 (312) T ss_pred HCCCCCCCCEEEECCCC--CCC-HHHHHHHHHHHCCCCCCC-----EECCCCCEEEE---CCEE----EECCCCCCC--- T ss_conf 15567999714422355--661-789999999828898861-----21156642755---8948----835776667--- Q ss_pred CHHHCCCCCEEEEEEEECCEEEEECC----EECCHHHHHHHHH Q ss_conf 81227999638999995024787516----2318999999998 Q gi|254780234|r 212 SPEEAPDTPTTIRIDFQRGDPIAING----QVMSPEVLLEQLN 250 (404) Q Consensus 212 ~p~~ap~~pe~v~I~Fe~G~PVainG----~~~~~~~li~~LN 250 (404) -..-++|+.-.+.|..==|..+.| ++-+.-++|+.+- T Consensus 243 --l~~gE~~~~~~vRfRTLGc~p~TgaveS~A~tv~~II~El~ 283 (312) T PRK12563 243 --LRPGETPQQRKVRFRTLGCYPLTGAVESDADTVEKIIQEMA 283 (312) T ss_pred --CCCCCCCEECCEEEECCCCCCCCCCCCCCCCCHHHHHHHHH T ss_conf --88888430144212115663667820578889899999998 No 83 >PRK02090 phosphoadenosine phosphosulfate reductase; Provisional Probab=96.36 E-value=0.068 Score=33.88 Aligned_cols=147 Identities=17% Similarity=0.203 Sum_probs=87.5 Q ss_pred CCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCC--CCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCC Q ss_conf 77899991477259999999987189869999945787--1106899999997398079982008999998799997368 Q gi|254780234|r 5 VKKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQ--GEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRAN 82 (404) Q Consensus 5 ~kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq--~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~an 82 (404) -.+++++.|-|.++.|+++.+. +.+-++--|++|+|- +|-++-+.+-...+|. .+.++--....++... .-+ T Consensus 41 ~~~~~~tsSFG~es~Vllhli~-~~~~~ipV~flDTG~~f~ETy~~~d~l~~~~~l-~i~~~~P~~~~~~~~~----~~g 114 (243) T PRK02090 41 GGGLALASSFGAESAVLLHLVA-QVDPDIPVIFLDTGYLFPETYRFIDQLTERLLL-NLKVYRPDASAAEQEA----RYG 114 (243) T ss_pred CCCEEEEECCCHHHHHHHHHHH-HHCCCCCEEEEECCCCCHHHHHHHHHHHHHHCC-CEEEECCCCHHHHHHH----HHC T ss_conf 9976999458678999999999-419999679865798888999999999999799-7799778814999999----829 Q ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHHHHC-CCCEEECHHHCCCCHHHHHHHHHHH------HCCCCEEEEEECCCCCCCH Q ss_conf 65478742013012433599999998856-8928750011168215679999998------5879707744100687637 Q gi|254780234|r 83 ALYEGYYLLGTAIARPLIAKYLVDIANET-GADAIAHGSTGKGNDQVRFELSAYS------LNSDIEIIAPWRHWSFKGR 155 (404) Q Consensus 83 a~Yeg~Ypl~tslaRplia~~lv~~a~~~-ga~~iaHG~TgkGNDQvRFe~~~~~------l~P~l~viaP~Rd~~~~sR 155 (404) .++. .-.+-..+++-.+++....+.+ +.+++..| . .|++...++ ..+.+--++|.-+| +. T Consensus 115 ~l~~---~~~~d~~~CC~irKvePL~raL~~~daWitG---~----Rr~Qs~~Ra~l~~ve~d~g~~KvnPL~~W---s~ 181 (243) T PRK02090 115 GLWE---QSVEDPDECCRIRKVEPLNRALAGLDAWITG---L----RREQSGTRAELPVLEIDRGRFKINPLADW---TN 181 (243) T ss_pred CCCC---CCCCCHHHHHHHHCCHHHHHHHHHCCEEEEC---C----CHHHCHHHCCCCCEEECCCEEEECCCHHC---CH T ss_conf 9766---7632489876776502799986017758744---7----14426334258734523998886700119---99 Q ss_pred HHHHHHHHHHCCCCC Q ss_conf 999999997089675 Q gi|254780234|r 156 QDLIDFAEKHAIPID 170 (404) Q Consensus 156 e~~i~ya~~~gIpv~ 170 (404) ++..+|.++|++|.- T Consensus 182 ~dv~~Yi~~~~LP~n 196 (243) T PRK02090 182 EDVWAYLKEHDLPYH 196 (243) T ss_pred HHHHHHHHHCCCCCC T ss_conf 999999998599988 No 84 >PRK05253 sulfate adenylyltransferase subunit 2; Provisional Probab=96.25 E-value=0.093 Score=32.94 Aligned_cols=212 Identities=17% Similarity=0.194 Sum_probs=113.8 Q ss_pred CCEEEEEECCCHHHHHHHHHHHHH-CCCE--EEEEEEECCCC--CHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHH Q ss_conf 778999914772599999999871-8986--99999457871--106899999997398079982008999998799997 Q gi|254780234|r 5 VKKVVLAYSGGLDTSIILKWLQVE-KGLE--VIVFIADLGQG--EELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMF 79 (404) Q Consensus 5 ~kkVvlaySGGLDTSv~i~~L~~e-~g~e--Vi~~~~d~Gq~--~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I 79 (404) -++.|+.||||-||+|++.+-++- ++.. .-.+++|+|.+ |=++--.+-|...|. +.+|--..+ +| T Consensus 26 f~~pvmL~S~GKDS~VmLhLarKAF~p~~iPfPllHIDTg~kF~Emi~fRD~~a~e~gl-~LiV~~n~e---------~i 95 (300) T PRK05253 26 FENPVMLYSIGKDSSVMLHLARKAFYPGKLPFPLLHVDTGWKFPEMIEFRDRRAKELGL-ELIVHSNPE---------GI 95 (300) T ss_pred HCCCEEEEECCCHHHHHHHHHHHHHCCCCCCCCEEEEECCCCHHHHHHHHHHHHHHHCC-CEEEEECHH---------HH T ss_conf 36967987167259999999999857889996348830799818999999999998499-789975767---------77 Q ss_pred HCCCCCCCCCCCCCC-HHHHHHHHHHHHHHHHCCCCEEECHHHCCCC---------------------HHHH--HHHHHH Q ss_conf 368654787420130-1243359999999885689287500111682---------------------1567--999999 Q gi|254780234|r 80 RANALYEGYYLLGTA-IARPLIAKYLVDIANETGADAIAHGSTGKGN---------------------DQVR--FELSAY 135 (404) Q Consensus 80 ~ana~Yeg~Ypl~ts-laRplia~~lv~~a~~~ga~~iaHG~TgkGN---------------------DQvR--Fe~~~~ 135 (404) ..+..... .+.. -.+.+....|.++..+.+-|++--|+..--. +|.- ++++-. T Consensus 96 ~~g~~p~~---~g~~~~~~~lKT~~L~~al~~~~fda~~gGaRRDEe~sRAKERiFS~R~~~~~Wdpk~QrPElW~lyn~ 172 (300) T PRK05253 96 ARGINPFT---HGSAKHTNIMKTEGLKQALEKYGFDAAFGGARRDEEKSRAKERIFSFRDEFGQWDPKNQRPELWNLYNG 172 (300) T ss_pred HCCCCCCC---CCCHHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHCC T ss_conf 56999566---882223157878999999996487656510123443212125424775646888856698026676355 Q ss_pred HHCCC--CEEEEEECCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCCHHHCCCCCCH Q ss_conf 85879--7077441006876379999999970896756656788641572431154576324584576820100146581 Q gi|254780234|r 136 SLNSD--IEIIAPWRHWSFKGRQDLIDFAEKHAIPIDKNKRGEAPFSIDTNLLHSSSEGRVLEDPSQPAPEYVYKMIVSP 213 (404) Q Consensus 136 ~l~P~--l~viaP~Rd~~~~sRe~~i~ya~~~gIpv~~~~~~~~~yS~D~Nlwg~S~Egg~Ledp~~~~pe~~~~~t~~p 213 (404) .+.|. +.| -|+-. ++ --+-.+|-+..+||++.- =|+-..-..-+ .|.|- +.+|-+ | T Consensus 173 ~~~~geh~RV-fPlsn--WT-ElDIW~YI~~E~Ipi~~L-----YfA~~r~vv~r---~g~l~-----~~dd~~-----~ 230 (300) T PRK05253 173 RINKGEHIRV-FPLSN--WT-ELDIWQYIYRENIPIVPL-----YFAHERPVVER---DGMLI-----MVDDDM-----P 230 (300) T ss_pred CCCCCCEEEE-ECCCC--CH-HHHHHHHHHHHCCCCCCC-----CCCCCCCEEEE---CCEEE-----ECCCCC-----C T ss_conf 6799964775-11202--25-778999999848888871-----10378854878---99887-----438666-----7 Q ss_pred HHCCCCCEEEEEEEECCEEEEEC----CEECCHHHHHHHHHH Q ss_conf 22799963899999502478751----623189999999988 Q gi|254780234|r 214 EEAPDTPTTIRIDFQRGDPIAIN----GQVMSPEVLLEQLNQ 251 (404) Q Consensus 214 ~~ap~~pe~v~I~Fe~G~PVain----G~~~~~~~li~~LN~ 251 (404) .....+++...+.|..==|..+. -++-+.-++|+.+-. T Consensus 231 ~~~~e~~~~~~vRfRTLGc~p~TgaveS~A~t~~~II~El~~ 272 (300) T PRK05253 231 LRPGEVVEERMVRFRTLGCYPCTGAVESEAATLEEIIAEMLV 272 (300) T ss_pred CCCCCEEEECCCEECCCCCCCCCCCCCCCCCCHHHHHHHHHH T ss_conf 799983533450122467745678415788898999999984 No 85 >PRK09590 celB cellobiose phosphotransferase system IIB component; Reviewed Probab=94.98 E-value=0.1 Score=32.68 Aligned_cols=62 Identities=18% Similarity=0.205 Sum_probs=44.5 Q ss_pred CCEEEEEECCCHHHHHHHHHHHH---HCCCEEE--EEEEECCCC----C-------------HHHHHHHHHHHCCCEEEE Q ss_conf 77899991477259999999987---1898699--999457871----1-------------068999999973980799 Q gi|254780234|r 5 VKKVVLAYSGGLDTSIILKWLQV---EKGLEVI--VFIADLGQG----E-------------ELKIASDKARLLGAKEVY 62 (404) Q Consensus 5 ~kkVvlaySGGLDTSv~i~~L~~---e~g~eVi--~~~~d~Gq~----~-------------d~~~~~~~A~~~Ga~~~~ 62 (404) ||||.|..|||.-||.++.-+++ ++|.|+. ++.+.-++. + -++..++.|...|. ++- T Consensus 1 MkKILLvCaaGMSTSmlv~km~~~A~~~G~dveI~Av~~~e~~~~i~~~~yDv~LlgPQVr~~~~~~k~~a~~~gi-Pv~ 79 (104) T PRK09590 1 MAKALIICAAGMSSSLMAKKTTEFLKGKGKDIEVDAISATEGGKAIAAAAFDLYLVSPQTKMYFKQFEEAGSKAGK-PVV 79 (104) T ss_pred CCEEEEEECCCCCHHHHHHHHHHHHHHCCCCEEEEEEEHHHHHHHHCCCCCCEEEECHHHHHHHHHHHHHHHHCCC-CEE T ss_conf 9559999689987999999999999976983699984888988763226887899887688789999999987299-778 Q ss_pred EECHH Q ss_conf 82008 Q gi|254780234|r 63 VKDLR 67 (404) Q Consensus 63 v~D~r 67 (404) ++|.+ T Consensus 80 vI~~~ 84 (104) T PRK09590 80 QIPPQ 84 (104) T ss_pred EECCC T ss_conf 87874 No 86 >TIGR02039 CysD sulfate adenylyltransferase, small subunit; InterPro: IPR011784 Metabolic assimilation of sulphur from inorganic sulphate, requires sulphate activation by coupling to a nucleoside, for the production of high-energy nucleoside phosphosulphates. This pathway appears to be similar in all prokaryotic organisms. Activation is first achieved through reaction of sulphate with ATP by sulphate adenylyltransferase (ATP sulphurylase) to produce 5'-phosphosulphate (APS), coupled by GTP hydrolysis. Subsequently, APS is phosphorylated by an APS kinase to produce 3'-phosphoadenosine-5'-phosphosulphate (PAPS). In Escherichia coli, ATP sulphurylase is a heterodimer composed of two subunits encoded by cysD and cysN, with APS kinase encoded by cysC. These genes are located in a unidirectionally transcribed gene cluster, and have been shown to be required for the synthesis of sulphur-containing amino acids. Homologous to this E. coli activation pathway are nodPQH gene products found among members of the Rhizobiaceae family. These gene products have been shown to exhibit ATP sulphurase and APS kinase activity, yet are involved in Nod factor sulphation, and sulphation of other macromolecules.; GO: 0004781 sulfate adenylyltransferase (ATP) activity, 0019421 sulfate reduction APS pathway. Probab=94.88 E-value=0.21 Score=30.34 Aligned_cols=241 Identities=19% Similarity=0.223 Sum_probs=145.9 Q ss_pred CCEEEEEECCCHHHHHHHHHHHHHC--C-CEEEEEEEECCCC-CHHHHHHHHH-HHCCCEEEEEECHHHHHHHHHHHHHH Q ss_conf 7789999147725999999998718--9-8699999457871-1068999999-97398079982008999998799997 Q gi|254780234|r 5 VKKVVLAYSGGLDTSIILKWLQVEK--G-LEVIVFIADLGQG-EELKIASDKA-RLLGAKEVYVKDLRREFVRDFVFPMF 79 (404) Q Consensus 5 ~kkVvlaySGGLDTSv~i~~L~~e~--g-~eVi~~~~d~Gq~-~d~~~~~~~A-~~~Ga~~~~v~D~r~ef~~~~i~~~I 79 (404) -++=|+.||+|-||||+|++-.+-. | .----+++|+|-. .|.-..+.+. .+.+. +.+|-...+. ....+.|.. T Consensus 19 F~~PV~LyS~GKDS~VlLHLA~KaF~Pg~~PFplLHvDTgw~f~E~~~fRD~~~~~~~~-~L~v~~~~~~-~~~g~~P~~ 96 (295) T TIGR02039 19 FERPVLLYSIGKDSSVLLHLARKAFYPGRLPFPLLHVDTGWKFREMIAFRDELVAKYGL-ELIVHSNEEG-IAEGINPFT 96 (295) T ss_pred CCCCEEEEECCCHHHHHHHHHHHCCCCCCCCCCEEEEECCCCHHHHHHHHHHHHHHHCC-EEEEEECHHH-HHHCCCCCH T ss_conf 37976898657347999998764058888797357740663278999999999987097-6888505023-541566201 Q ss_pred HCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHH---------------------HHHHHHHHHC Q ss_conf 3686547874201301243359999999885689287500111682156---------------------7999999858 Q gi|254780234|r 80 RANALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQV---------------------RFELSAYSLN 138 (404) Q Consensus 80 ~ana~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQv---------------------RFe~~~~~l~ 138 (404) ...+.+- .++....|-++-.+.+=|+.--|+- +-...- |=|+ ....| T Consensus 97 ~~~~~~~----------~~~~T~~L~~Al~~~~FDAa~gGAR-RdEE~sRaKERifS~R~~~h~WDPk~QRPEL-W~~YN 164 (295) T TIGR02039 97 EGSALHT----------DIMKTEALRQALDKNQFDAAFGGAR-RDEEKSRAKERIFSFRDAFHQWDPKKQRPEL-WNLYN 164 (295) T ss_pred HHHHHHH----------HHHHHHHHHHHHHHCCCCEEECCCC-CHHHHCCCCCCEEHHCCCCCCCCCCCCCHHH-HHHCC T ss_conf 0124555----------5563189999987338865631663-1344301201000100345788851257377-87505 Q ss_pred CCCEEEEEECCC---CCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCCHHHCCCCCCHHH Q ss_conf 797077441006---87637999999997089675665678864157243115457632458457682010014658122 Q gi|254780234|r 139 SDIEIIAPWRHW---SFKGRQDLIDFAEKHAIPIDKNKRGEAPFSIDTNLLHSSSEGRVLEDPSQPAPEYVYKMIVSPEE 215 (404) Q Consensus 139 P~l~viaP~Rd~---~~~sRe~~i~ya~~~gIpv~~~~~~~~~yS~D~Nlwg~S~Egg~Ledp~~~~pe~~~~~t~~p~~ 215 (404) +.+.-.--+|-. ++| --+-..|.....|++..- =|.-......|.----..+|....+. T Consensus 165 g~~~~GE~~RvFPLSNwT-E~DiW~Y~~~~~i~~~~l-----YfA~~RpV~~R~g~l~~~~D~~~~~~------------ 226 (295) T TIGR02039 165 GRISKGESVRVFPLSNWT-ELDIWRYIAAENIEIVPL-----YFAAKRPVVERDGMLIMVDDERLELE------------ 226 (295) T ss_pred CCCCCCCEEEEECCCCCC-HHHHHHHHHHCCCCCCEE-----ECCCCCCEEECCCEEEEECCCCCCCC------------ T ss_conf 234688726872266650-445898874346542201-----10246862410744888606524667------------ Q ss_pred CCCCCEEEEEEEECCEEEEECC----EECCHHHHHHHHH-HHHHHCCCCC-EECCCCCCCCCCCCCCC Q ss_conf 7999638999995024787516----2318999999998-8864216374-10007732021000111 Q gi|254780234|r 216 APDTPTTIRIDFQRGDPIAING----QVMSPEVLLEQLN-QYGRCNGIGR-IDIVENRFVGIKSRGVY 277 (404) Q Consensus 216 ap~~pe~v~I~Fe~G~PVainG----~~~~~~~li~~LN-~igg~~GvGr-~d~vEnr~vG~KsR~vY 277 (404) .-.....-.|.|.-==|+.+.| ++-+.-++|..+. .....-|-|| +|.++.--+-.|-|+.| T Consensus 227 ~~E~~~~~~vrfRTlGc~~~tga~~S~A~~~~~ii~E~~~~~~sERg~~R~~D~~~~~~ME~kKR~GY 294 (295) T TIGR02039 227 PGEVVKERMVRFRTLGCYPLTGAIESDAATVEEIIAEVAVARTSERGQGRAIDRDQEASMEKKKREGY 294 (295) T ss_pred CCCCCCCEEEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHCCCCC T ss_conf 77510000455554057564543122341079999998875320254523324412000233003679 No 87 >cd01987 USP_OKCHK USP domain is located between the N-terminal sensor domain and C-terminal catalytic domain of this Osmosensitive K+ channel histidine kinase family. The family of KdpD sensor kinase proteins regulates the kdpFABC operon responsible for potassium transport. The USP domain is homologous to the universal stress protein Usp Usp is a small cytoplasmic bacterial protein whose expression is enhanced when the cell is exposed to stress agents. Usp enhances the rate of cell survival during prolonged exposure to such conditions, and may provide a general "stress endurance" activity. Probab=94.11 E-value=0.58 Score=27.24 Aligned_cols=87 Identities=21% Similarity=0.237 Sum_probs=64.8 Q ss_pred EEEEEECCCHHHHHHHHH---HHHHCCCEEEEEEEECCCC--------CHHHHHHHHHHHCCCEEEEEECHHHHHHHHHH Q ss_conf 899991477259999999---9871898699999457871--------10689999999739807998200899999879 Q gi|254780234|r 7 KVVLAYSGGLDTSIILKW---LQVEKGLEVIVFIADLGQG--------EELKIASDKARLLGAKEVYVKDLRREFVRDFV 75 (404) Q Consensus 7 kVvlaySGGLDTSv~i~~---L~~e~g~eVi~~~~d~Gq~--------~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i 75 (404) ||++|.+|.-.+..++.. |....+.+.++++++.... +.++...+-|..+|+. ..++ T Consensus 1 rILV~vd~s~~s~~lir~a~rlA~~~~a~l~vl~V~~~~~~~~~~~~~~~l~~~~~la~~lga~-~~~~----------- 68 (124) T cd01987 1 RILVCISGGPNAERLIRRAARLADRLKAPWYVVYVETPRLNRLSEAERRRLAEALRLAEELGAE-VVTL----------- 68 (124) T ss_pred CEEEEECCCHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCE-EEEE----------- T ss_conf 9999958985079999999999996499899999955975658979999999999999985998-9999----------- Q ss_pred HHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCC Q ss_conf 99973686547874201301243359999999885689287500111682 Q gi|254780234|r 76 FPMFRANALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGN 125 (404) Q Consensus 76 ~~~I~ana~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGN 125 (404) +. .=+++.++++|++.|++-|-=|.+++.- T Consensus 69 ---------~~-----------~d~~~~I~~~A~~~~~t~IVlG~~~~~~ 98 (124) T cd01987 69 ---------PG-----------DDVAEAIVEFAREHNVTQIVVGKSRRSR 98 (124) T ss_pred ---------EC-----------CCHHHHHHHHHHHCCCCEEEECCCCCCH T ss_conf ---------47-----------9989999999998499899976898854 No 88 >cd05565 PTS_IIB_lactose PTS_IIB_lactose: subunit IIB of enzyme II (EII) of the lactose-specific phosphoenolpyruvate:carbohydrate phosphotransferase system (PTS) found in Firmicutes as well as Actinobacteria. In this system, EII is a lactose-specific permease with two cytoplasmic domains (IIA and IIB) and a transmembrane channel IIC domain. The IIC and IIB domains are expressed as a single protein from the lac operon. The IIB domain fold includes a central four-stranded parallel open twisted beta-sheet flanked by alpha-helices on both sides. The seven major PTS systems with this IIB fold include lactose, chitobiose/lichenan, ascorbate, galactitol, mannitol, fructose, and a sensory system with similarity to the bacterial bgl system. Probab=92.94 E-value=0.64 Score=26.90 Aligned_cols=59 Identities=15% Similarity=0.237 Sum_probs=34.5 Q ss_pred EEEEEECCCHHHHHHHHHHHH---HCCCEEEEEEEECCCCCH-----------------HHHHHHHHHHCCCEEEEEECH Q ss_conf 899991477259999999987---189869999945787110-----------------689999999739807998200 Q gi|254780234|r 7 KVVLAYSGGLDTSIILKWLQV---EKGLEVIVFIADLGQGEE-----------------LKIASDKARLLGAKEVYVKDL 66 (404) Q Consensus 7 kVvlaySGGLDTSv~i~~L~~---e~g~eVi~~~~d~Gq~~d-----------------~~~~~~~A~~~Ga~~~~v~D~ 66 (404) ||.|..|||.-||..+.-|++ ++|.++..-.+-.|+-+| ++++++.|...|. +..++.. T Consensus 2 kVLl~Ca~GmSSs~la~~m~k~A~~~gi~~~i~A~~~~~~~d~~~~yDvilLaPQv~~~~~~lk~~ad~~Gi-~v~~i~~ 80 (99) T cd05565 2 NVLVLCAGGGTSGLLANALNKGAKERGVPLEAAAGAYGSHYDMIPDYDLVILAPQMASYYDELKKDTDRLGI-KLVTTTG 80 (99) T ss_pred EEEEEECCCCCHHHHHHHHHHHHHHCCCCEEEEEEECHHHHHHHHCCCEEEECHHHHHHHHHHHHHHHHCCC-EEEEECH T ss_conf 099990798858999999999999819976999510344898870799999985088889999999998399-1887083 No 89 >COG3969 Predicted phosphoadenosine phosphosulfate sulfotransferase [General function prediction only] Probab=92.51 E-value=0.21 Score=30.39 Aligned_cols=72 Identities=24% Similarity=0.414 Sum_probs=43.9 Q ss_pred CCEEEEEECCCHHHHHHHHHHH---HHCCC-EEEEEEEEC-CCCC-HHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHH Q ss_conf 7789999147725999999998---71898-699999457-8711-0689999999739807998200899999879999 Q gi|254780234|r 5 VKKVVLAYSGGLDTSIILKWLQ---VEKGL-EVIVFIADL-GQGE-ELKIASDKARLLGAKEVYVKDLRREFVRDFVFPM 78 (404) Q Consensus 5 ~kkVvlaySGGLDTSv~i~~L~---~e~g~-eVi~~~~d~-Gq~~-d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~ 78 (404) -.+|+++||||-||+++++++. .+.|- .|..+..|. ||-. -++.+++-... ..|+++.|. -.|.|. T Consensus 27 f~~VcVSFSGGKDS~lmLhL~~~~ar~~~~~~i~VlfiD~E~QYs~TidyV~em~~~-------~~dv~~~~y-WvcLPl 98 (407) T COG3969 27 FPRVCVSFSGGKDSGLMLHLVAEVARENGRDKISVLFIDWEAQYSCTIDYVQEMRES-------YHDVIETFY-WVCLPL 98 (407) T ss_pred CCEEEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHC-------CCCCCCCCE-EEEEEH T ss_conf 883899923787405899999999998198865799970045555689999998730-------457555405-999631 Q ss_pred HHCCCC Q ss_conf 736865 Q gi|254780234|r 79 FRANAL 84 (404) Q Consensus 79 I~ana~ 84 (404) -.-||+ T Consensus 99 ~t~na~ 104 (407) T COG3969 99 TTQNAL 104 (407) T ss_pred HCCCCH T ss_conf 101332 No 90 >PRK05406 LamB/YcsF family protein; Provisional Probab=91.47 E-value=0.43 Score=28.14 Aligned_cols=146 Identities=17% Similarity=0.259 Sum_probs=93.8 Q ss_pred HHHHHHHHHCCCCE---EECHHHC--CCCHH--HH-HHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHCCCCCCCC Q ss_conf 99999988568928---7500111--68215--67-99999985879707744100687637999999997089675665 Q gi|254780234|r 102 KYLVDIANETGADA---IAHGSTG--KGNDQ--VR-FELSAYSLNSDIEIIAPWRHWSFKGRQDLIDFAEKHAIPIDKNK 173 (404) Q Consensus 102 ~~lv~~a~~~ga~~---iaHG~Tg--kGNDQ--vR-Fe~~~~~l~P~l~viaP~Rd~~~~sRe~~i~ya~~~gIpv~~~~ 173 (404) ..+..+|+..|... =.||+-+ .-+|. .+ +=.+++.++|++.+++|- -.+....|+++|+++-..- T Consensus 93 ~~l~~~a~~~g~~l~hvKpHGALYn~~~~d~~~a~~i~~~i~~~~~~l~l~~~~-------~s~~~~~A~~~Gl~~~~E~ 165 (246) T PRK05406 93 GALQAIARAAGGRVRHVKPHGALYNMAAKDPELADAIAEAVAAVDPSLILVGLA-------GSELIEAAKAAGLRTASEV 165 (246) T ss_pred HHHHHHHHHCCCCCCEECCCHHHHHHHHCCHHHHHHHHHHHHHHCCCCEEEECC-------CCHHHHHHHHCCCCEEEEE T ss_conf 999999998499631324018999988459999999999999869995699518-------8199999998598189888 Q ss_pred CCCCCCCCCHHHHCCCCCCCCCCCCCCCCCHHHCCCCCCHHHCCCCCEEEEEEEECCEEEEECCEECCHHHHHHHHHHHH Q ss_conf 67886415724311545763245845768201001465812279996389999950247875162318999999998886 Q gi|254780234|r 174 RGEAPFSIDTNLLHSSSEGRVLEDPSQPAPEYVYKMIVSPEEAPDTPTTIRIDFQRGDPIAINGQVMSPEVLLEQLNQYG 253 (404) Q Consensus 174 ~~~~~yS~D~Nlwg~S~Egg~Ledp~~~~pe~~~~~t~~p~~ap~~pe~v~I~Fe~G~PVainG~~~~~~~li~~LN~ig 253 (404) =.++.|.-|-.|--|+.++..+.||..-. +-++.+...-.-....-+.+.+.++ -+=|-|..-..+++.+.+.+.- T Consensus 166 FADR~Y~~dG~Lv~R~~~gAvi~d~e~~~-~q~~~~~~~g~V~ti~G~~i~i~ad---TICvHgDtp~Av~iak~ir~~L 241 (246) T PRK05406 166 FADRAYTADGTLVPRSQPGAVIHDPEEAA-AQVLQMVQEGRVTAIDGEWIPVEAD---TICVHGDGPHAVAFARRIRAAL 241 (246) T ss_pred EECCCCCCCCCEEECCCCCCCCCCHHHHH-HHHHHHHHCCCEEECCCCEEECCCC---EEEECCCCHHHHHHHHHHHHHH T ss_conf 40233789998700568776158999999-9999999779689238988760489---8998999978999999999999 Q ss_pred HHCCC Q ss_conf 42163 Q gi|254780234|r 254 RCNGI 258 (404) Q Consensus 254 g~~Gv 258 (404) .++|| T Consensus 242 ~~~gI 246 (246) T PRK05406 242 EAAGI 246 (246) T ss_pred HHCCC T ss_conf 97769 No 91 >cd05564 PTS_IIB_chitobiose_lichenan PTS_IIB_chitobiose_lichenan: subunit IIB of enzyme II (EII) of the N,N-diacetylchitobiose-specific and lichenan-specific phosphoenolpyruvate:carbohydrate phosphotransferase system (PTS). In these systems, EII is either a lichenan- or an N,N-diacetylchitobiose-specific permease with two cytoplasmic domains (IIA and IIB) and a transmembrane channel IIC domain. In the chitobiose system, these subunits are expressed as separate proteins from chbA, chbB, and chbC of the chb operon (formerly the cel (cellulose) operon). In the lichenan system, these subunits are expressed from licA, licB, and licC of the lic operon. The lic operon of Bacillus subtilis is required for the transport and degradation of oligomeric beta-glucosides, which are produced by extracellular enzymes on substrates such as lichenan or barley glucan. The lic operon is transcribed from a gammaA-dependent promoter and is inducible by lichenan, lichenan hydrolysate, and cellobiose. The IIB d Probab=90.29 E-value=0.94 Score=25.70 Aligned_cols=60 Identities=32% Similarity=0.402 Sum_probs=36.1 Q ss_pred EEEEEECCCHHHHHHHHHHHH---HCCC--EEEEEEEEC-------------C-CC-CHHHHHHHHHHHCCCEEEEEECH Q ss_conf 899991477259999999987---1898--699999457-------------8-71-10689999999739807998200 Q gi|254780234|r 7 KVVLAYSGGLDTSIILKWLQV---EKGL--EVIVFIADL-------------G-QG-EELKIASDKARLLGAKEVYVKDL 66 (404) Q Consensus 7 kVvlaySGGLDTSv~i~~L~~---e~g~--eVi~~~~d~-------------G-q~-~d~~~~~~~A~~~Ga~~~~v~D~ 66 (404) ||+|+.+||.-||..+.-+++ ++|. +|.|+...- | |- -..+++++.+...|. ++.++|- T Consensus 1 kIlL~C~~GmSts~l~~km~~~a~~~~~~~~i~A~~~~~~~~~~~~~DviLl~PQv~y~~~~i~~~~~~~~i-pV~vI~~ 79 (96) T cd05564 1 KILLVCSAGMSTSILVKKMKKAAEKRGIDAEIEAVPESELEEYIDDADVVLLGPQVRYMLDEVKKKAAEYGI-PVAVIDM 79 (96) T ss_pred CEEEECCCCCCHHHHHHHHHHHHHHCCCCEEEEEEEHHHHHHHHHCCCEEEECCHHHHHHHHHHHHHHHCCC-CEEEECH T ss_conf 989995998659999999999999749977999964899997741599999772799999999998610499-2796073 Q ss_pred H Q ss_conf 8 Q gi|254780234|r 67 R 67 (404) Q Consensus 67 r 67 (404) + T Consensus 80 ~ 80 (96) T cd05564 80 M 80 (96) T ss_pred H T ss_conf 5 No 92 >PRK10499 N,N'-diacetylchitobiose-specific PTS system transporter subunit IIB; Provisional Probab=89.43 E-value=1.3 Score=24.78 Aligned_cols=35 Identities=23% Similarity=0.403 Sum_probs=26.0 Q ss_pred CCCCCCEEEEEECCCHHHHHHHHHHHH---HCCC--EEEEEE Q ss_conf 987777899991477259999999987---1898--699999 Q gi|254780234|r 1 MSRDVKKVVLAYSGGLDTSIILKWLQV---EKGL--EVIVFI 37 (404) Q Consensus 1 M~~~~kkVvlaySGGLDTSv~i~~L~~---e~g~--eVi~~~ 37 (404) |.+ |||+|+.|+|.-||..+.-+++ ++|. ++.++. T Consensus 1 M~~--kkIlL~C~aGMSTSlLv~kM~~~A~~~~i~~~I~A~~ 40 (106) T PRK10499 1 MEK--KHIYLFCSAGMSTSLLVSKMRAQAEKYEVPVIIEAFP 40 (106) T ss_pred CCC--CEEEEECCCCCCHHHHHHHHHHHHHHCCCCEEEEEEE T ss_conf 998--7799988999878999999999999769988999953 No 93 >TIGR00552 nadE NAD+ synthetase; InterPro: IPR003694 NAD+ synthase (6.3.5.1 from EC) catalyzes the last step in the biosynthesis of nicotinamide adenine dinucleotide and is induced by stress factors such as heat shock and glucose limitation. The three-dimensional structure of NH3-dependent NAD+ synthetase from Bacillus subtilis, in its free form and in complex with ATP shows that the enzyme consists of a tight homodimer with alpha/beta subunit topology .; GO: 0003952 NAD+ synthase (glutamine-hydrolyzing) activity, 0005524 ATP binding, 0009435 NAD biosynthetic process. Probab=88.29 E-value=2 Score=23.37 Aligned_cols=157 Identities=18% Similarity=0.163 Sum_probs=92.4 Q ss_pred CCCEEEEEECCCHHH--------HHHHHHHHHHCCC-EEEEEEEECC-C-CC-HHHHHHHHHHHCCCEEEEEECHHHHHH Q ss_conf 777899991477259--------9999999871898-6999994578-7-11-068999999973980799820089999 Q gi|254780234|r 4 DVKKVVLAYSGGLDT--------SIILKWLQVEKGL-EVIVFIADLG-Q-GE-ELKIASDKARLLGAKEVYVKDLRREFV 71 (404) Q Consensus 4 ~~kkVvlaySGGLDT--------Sv~i~~L~~e~g~-eVi~~~~d~G-q-~~-d~~~~~~~A~~~Ga~~~~v~D~r~ef~ 71 (404) ..++||+..|||+|+ +++...-.+-.|. +++++.+--+ + ++ |..++..-+..+|. ....++...-+. T Consensus 25 ~~~g~v~GlsGG~ds~l~g~l~~~~v~~~~~~~~~~~~~~~~~~p~~~~~~~~d~~~a~~~~~~~~~-~~~~~~~~~~~~ 103 (286) T TIGR00552 25 GAKGVVLGLSGGVDSTLAGKLCQAVVAALCVEALGKEQNHALLLPHSNTTPEDDVQDALALAEPLGI-NYKTIDIAPIAA 103 (286) T ss_pred CCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCHHHHEEEEECCCCCCCHHHHHHHHHHHHHCCC-CCEECCCHHHHH T ss_conf 7650467304762035677899999999998504611420332146678841217999998874054-421113046778 Q ss_pred HHHHHHHHHCCCCCCCCCCCCCC-HHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCC Q ss_conf 98799997368654787420130-12433599999998856892875001116821567999999858797077441006 Q gi|254780234|r 72 RDFVFPMFRANALYEGYYLLGTA-IARPLIAKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEIIAPWRHW 150 (404) Q Consensus 72 ~~~i~~~I~ana~Yeg~Ypl~ts-laRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~ 150 (404) -+-.....+. +..-.+..+ +---+-...+..+|.+ .+.+.- |.||.-.+.-.++...+.-.--++|+-+ T Consensus 104 -~~~~~~~~~~---~~~~~~~~gn~~~r~r~~~~y~~a~~--~~~lv~---gt~n~~e~~~Gy~t~~Gdg~~d~~p~~~- 173 (286) T TIGR00552 104 -SFQAQTETGD---PLADFLAEGNLKARLRMALLYAHANK--HNLLVL---GTGNKSELLLGYFTKYGDGGCDILPLGG- 173 (286) T ss_pred -HHHHHHCCCC---CCCCHHHHHHHHHHHHHHHHHHHHHC--CCCEEE---ECCCHHHHHHHHHHHCCCCCHHHHHHHH- T ss_conf -8776420344---33310233222478899999887310--163564---0461344543200101442000345532- Q ss_pred CCCCHHHHHHHHHHHCC-----CCCCCC Q ss_conf 87637999999997089-----675665 Q gi|254780234|r 151 SFKGRQDLIDFAEKHAI-----PIDKNK 173 (404) Q Consensus 151 ~~~sRe~~i~ya~~~gI-----pv~~~~ 173 (404) + -..+--+.++.-|+ |-..-. T Consensus 174 -l-~k~~~~~l~~~l~~~~~~~P~~~~~ 199 (286) T TIGR00552 174 -L-FKTEVYELAKRLGVEQTIIPEEIIE 199 (286) T ss_pred -H-HHHHHHHHHHHHCCCCCCCCHHHHC T ss_conf -2-3788999999817531014666733 No 94 >KOG0573 consensus Probab=87.67 E-value=0.44 Score=28.07 Aligned_cols=21 Identities=24% Similarity=0.596 Sum_probs=18.2 Q ss_pred CEEEEEECCCHHHHHHHHHHH Q ss_conf 789999147725999999998 Q gi|254780234|r 6 KKVVLAYSGGLDTSIILKWLQ 26 (404) Q Consensus 6 kkVvlaySGGLDTSv~i~~L~ 26 (404) -+|++.||||+|++|++..+- T Consensus 251 s~VcVlfSGGvDs~vvA~l~h 271 (520) T KOG0573 251 SNVCVLFSGGVDSTVVAVLAH 271 (520) T ss_pred CCEEEEECCCCHHHHHHHHHH T ss_conf 767999648840899999998 No 95 >TIGR01949 AroFGH_arch predicted phospho-2-dehydro-3-deoxyheptonate aldolase; InterPro: IPR010210 This entry represents a clade of primarily archaeal sequences related to fructose-bisphosphate aldolase. The members appear to be phospho-2-dehydro-3-deoxyheptonate aldolases. This enzyme is the first step of the chorismate biosynthesis pathway. Evidence for this assignment is based on gene clustering and phylogenetic profiling. A group of species lack members of the three other types of phospho-2-dehydro-3-deoxyheptonate aldolase (represented by IPR006219 from INTERPRO, IPR002480 from INTERPRO and IPR006268 from INTERPRO), and also apparently lack the well-known forms of step 2 (3-dehydroquinate synthase), but contain all other steps of the pathway: Aquifex, Archaeoglobus, Halobacterium, Methanococcus, Methanopyrus and Methanobacterium. In Aquifex, Archaeoglobus, Halobacterium, Methanopyrus and Methanosarcina the genes are clustered with other genes from the chorismate, phenylalanine, tyrosine and tryptophan biosynthesis pathways. In addition, these genes in Archaeoglobus, Halobacterium, and Methanosarcina are adjacent to 3-dehydroquinate synthase (IPR002812 from INTERPRO) which also has the property of having members only in those species which lack steps 1 and 2. Together these two genes appear to perform the synthesis of 3-dehydroquinate. It is presumed that the substrates and the chemical transformations involved are identical, but this has not yet been proven experimentally.. Probab=86.38 E-value=1.1 Score=25.21 Aligned_cols=90 Identities=22% Similarity=0.333 Sum_probs=66.6 Q ss_pred HCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHH---HHHHHHHH Q ss_conf 189869999945787110689999999739807998200899999879999736865478742013012---43359999 Q gi|254780234|r 28 EKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYEGYYLLGTAIA---RPLIAKYL 104 (404) Q Consensus 28 e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Yeg~Ypl~tsla---Rplia~~l 104 (404) ..|+|=+-+++|+|..+|.|.+++-+.- .+-+++|=.|+|.-- ||=|..+. -|-...+. T Consensus 101 ~~GADAVS~HvNvGs~~e~d~~~~lg~v------------A~~ad~~GvPlLAMm------YaRG~~i~~e~d~~~v~HA 162 (259) T TIGR01949 101 RLGADAVSIHVNVGSDTEADQIEDLGDV------------AEIADDWGVPLLAMM------YARGPRIDDEVDPEVVAHA 162 (259) T ss_pred HCCCCEEEEEEECCCCCCHHHHHHHHHH------------HHHHHHCCCCHHECC------CCCCCCCCCCCCHHHHHHH T ss_conf 3289867998864898738999999899------------998865488420112------7886887664476788888 Q ss_pred HHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHC Q ss_conf 9998856892875001116821567999999858 Q gi|254780234|r 105 VDIANETGADAIAHGSTGKGNDQVRFELSAYSLN 138 (404) Q Consensus 105 v~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~ 138 (404) ++++.|+|||.|=--=|| |---|+..++..+ T Consensus 163 aRlg~ElGADvvK~~Y~G---d~~SF~~VV~~c~ 193 (259) T TIGR01949 163 ARLGEELGADVVKVPYTG---DIDSFEEVVKACA 193 (259) T ss_pred HHHHHHHCCCEEEECCCC---CHHHHHHHHCCCC T ss_conf 765353457766514317---7467999962178 No 96 >TIGR02823 oxido_YhdH putative quinone oxidoreductase, YhdH/YhfP family; InterPro: IPR014188 This entry represents a subfamily of the alcohol dehydrogenase, a superfamily in which some members are zinc-binding medium-chain alcohol dehydrogenases while others are quinone oxidoreductases with no bound zinc. This entry includes YhdH from Escherichia coli and YhfP from Bacillus subtilis both of which bind NADPH or NAD, but not zinc. Both proteins have been studied crystallographically for insight into function. . Probab=86.16 E-value=1.7 Score=23.95 Aligned_cols=91 Identities=30% Similarity=0.310 Sum_probs=58.4 Q ss_pred EEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHHHH-HHHHHH---HCC Q ss_conf 8999914772599999999871898699999457871106899999997398079982008999998-799997---368 Q gi|254780234|r 7 KVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFVRD-FVFPMF---RAN 82 (404) Q Consensus 7 kVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~-~i~~~I---~an 82 (404) =+|..=|||+= |+++..|. +-||+|||.|. -.++- -+--+++||+++ ++ |++|-++ --.|+- +|+ T Consensus 152 VlVTGAtGGVG-S~Av~~L~-~lGY~V~A~tG---k~~~~---~~yL~~LGA~ev--i~-R~~l~~~a~~kPL~k~~WAG 220 (330) T TIGR02823 152 VLVTGATGGVG-SLAVAILS-KLGYEVVASTG---KAEEE---VEYLKELGASEV--ID-REELSEDAPGKPLEKERWAG 220 (330) T ss_pred EEEECCCCCHH-HHHHHHHH-HCCCEEEEEEC---CHHHH---HHHHHHCCCCCC--CC-HHHCCCCCCCCCCCCCCCCC T ss_conf 88706778778-99999998-37976999737---83778---899986581105--77-11227687888631222221 Q ss_pred CCCC-CCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEC Q ss_conf 6547-87420130124335999999988568928750 Q gi|254780234|r 83 ALYE-GYYLLGTAIARPLIAKYLVDIANETGADAIAH 118 (404) Q Consensus 83 a~Ye-g~Ypl~tslaRplia~~lv~~a~~~ga~~iaH 118 (404) |.-- |...| ..++.-++-.||-+.+= T Consensus 221 AvDtVGG~~L----------a~~l~~~~ygG~VA~cG 247 (330) T TIGR02823 221 AVDTVGGKTL----------ANVLAQLKYGGAVAACG 247 (330) T ss_pred EEECCCHHHH----------HHHHHHHCCCCEEEEEE T ss_conf 5746875799----------99998406897899994 No 97 >cd06313 PBP1_ABC_sugar_binding_like_5 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=85.18 E-value=3.1 Score=22.00 Aligned_cols=67 Identities=16% Similarity=0.081 Sum_probs=36.5 Q ss_pred HHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHCCCCC Q ss_conf 9999999885689287500111682156799999985879707744100687637999999997089675 Q gi|254780234|r 101 AKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDLIDFAEKHAIPID 170 (404) Q Consensus 101 a~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~i~ya~~~gIpv~ 170 (404) .+.+-+.|+++|+++....+.+--+.|++.=-.+-+-.++.-+++|+- -.+-...+.-|++.||||- T Consensus 18 ~~Gae~aA~e~Gv~v~~~~a~~D~~~Q~~~Ie~~I~~gvD~Iiv~p~d---~~a~~~~~~~A~~aGIpVV 84 (272) T cd06313 18 KQAADEAGKLLGVDVTWYGGALDAVKQVAAIENMASQGWDFIAVDPLG---IGTLTEAVQKAIARGIPVI 84 (272) T ss_pred HHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHHHHCCCCEEEEECCC---HHHHHHHHHHHHHCCCEEE T ss_conf 999999999819989998699999999999999998599999996888---7899999999998699899 No 98 >cd06319 PBP1_ABC_sugar_binding_like_10 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=84.72 E-value=3.4 Score=21.70 Aligned_cols=67 Identities=18% Similarity=0.209 Sum_probs=36.3 Q ss_pred HHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHCCCCC Q ss_conf 9999999885689287500111682156799999985879707744100687637999999997089675 Q gi|254780234|r 101 AKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDLIDFAEKHAIPID 170 (404) Q Consensus 101 a~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~i~ya~~~gIpv~ 170 (404) .+.+-+.|++.|.+.+...+.+--+.|++.=..+.+-.++.-|++|+.. ..-...++.|++.||||- T Consensus 18 ~~G~~~~a~~~G~~~~~~~~~~d~~~q~~~i~~~i~~~vDgIii~p~~~---~~~~~~~~~a~~~gIPvv 84 (277) T cd06319 18 GRGVKSKAKALGYDAVELSAENSAKKELENLRTAIDKGVSGIIISPTNS---SAAVTLLKLAAQAKIPVV 84 (277) T ss_pred HHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHHHHCCCCEEEECCCCC---CCHHHHHHHHHHCCCCEE T ss_conf 9999999997299899976999999999999999966998799647774---110999999997699789 No 99 >pfam03746 LamB_YcsF LamB/YcsF family. This family includes LamB. The lam locus of Aspergillus nidulans consists of two divergently transcribed genes, lamA and lamB, involved in the utilisation of lactams such as 2-pyrrolidinone. Both genes are under the control of the positive regulatory gene amdR and are subject to carbon and nitrogen metabolite repression. The exact molecular function of the proteins in this family is unknown. Probab=84.25 E-value=1.9 Score=23.47 Aligned_cols=138 Identities=16% Similarity=0.271 Sum_probs=85.8 Q ss_pred HHHHHHHHHCCCCE--E-ECHHHC--CCCHHHH---HHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHCCCCCCCC Q ss_conf 99999988568928--7-500111--6821567---99999985879707744100687637999999997089675665 Q gi|254780234|r 102 KYLVDIANETGADA--I-AHGSTG--KGNDQVR---FELSAYSLNSDIEIIAPWRHWSFKGRQDLIDFAEKHAIPIDKNK 173 (404) Q Consensus 102 ~~lv~~a~~~ga~~--i-aHG~Tg--kGNDQvR---Fe~~~~~l~P~l~viaP~Rd~~~~sRe~~i~ya~~~gIpv~~~~ 173 (404) ..+..+|+..|+.. | -||+-. .-+|..= +=.+++..+|++.++.|- -....+.++++|+++-... T Consensus 91 ~al~~~a~~~g~~l~hVKPHGALYn~~~~d~~la~~i~~ai~~~~~~l~l~~~~-------~s~~~~~A~~~Gl~~~~E~ 163 (242) T pfam03746 91 GALQAFARAQGLRLQHVKPHGALYNMAAKDRALARAVAEAVYDFDPALPLMGLA-------GSALLDAAKEAGLRLAFEV 163 (242) T ss_pred HHHHHHHHHCCCCCCCCCHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCEEEECC-------CHHHHHHHHHCCCCEEEEE T ss_conf 999999998299631135319999999659999999999999869985288448-------7599999987699715899 Q ss_pred CCCCCCCCCHHHHCCCCCCCCCCCCCCCCCHHHCCCCCC-HHHCCCCCEEEEEEEECCEEEEECCEECCHHHHHHHHHH Q ss_conf 678864157243115457632458457682010014658-122799963899999502478751623189999999988 Q gi|254780234|r 174 RGEAPFSIDTNLLHSSSEGRVLEDPSQPAPEYVYKMIVS-PEEAPDTPTTIRIDFQRGDPIAINGQVMSPEVLLEQLNQ 251 (404) Q Consensus 174 ~~~~~yS~D~Nlwg~S~Egg~Ledp~~~~pe~~~~~t~~-p~~ap~~pe~v~I~Fe~G~PVainG~~~~~~~li~~LN~ 251 (404) =.++.|--|-+|--|+..+..+.||..-. +-+..+... .-.+ ..-+.|.|.++ -+-|-|..-+.+++++.+.+ T Consensus 164 FADR~Y~~dG~Lv~R~~~gAvi~d~~~~~-~qv~~~~~~g~V~t-~~G~~i~i~ad---TiCvHgDtp~Ave~a~~ir~ 237 (242) T pfam03746 164 FADRAYQPDGSLVPRSQPGAVIHDPEEAI-AQVLQMVREGKVRA-VDGEWVALEAD---TICVHGDNPHALAFARRIRA 237 (242) T ss_pred EECCCCCCCCCEEECCCCCCCCCCHHHHH-HHHHHHHHCCCEEE-CCCCEEECCCC---EEEECCCCHHHHHHHHHHHH T ss_conf 85156888999874137887668999999-99999997798891-58988752689---89989999789999999999 No 100 >cd06320 PBP1_allose_binding Periplasmic allose-binding domain of bacterial transport systems that function as a primary receptor of active transport and chemotaxis. Periplasmic allose-binding domain of bacterial transport systems that function as a primary receptor of active transport and chemotaxis. The members of this group are belonging to a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily. Like other periplasmic receptors of the ABC-type transport systems, the allose-binding protein consists of two alpha/beta domains connected by a three-stranded hinge. This Venus flytrap-like domain undergoes transition from an open to a closed conformational state upon ligand binding. Probab=82.81 E-value=4 Score=21.16 Aligned_cols=68 Identities=13% Similarity=0.170 Sum_probs=46.6 Q ss_pred HHHHHHHHHHHCCCCEEECHHHCCCC--HHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHCCCCC Q ss_conf 59999999885689287500111682--156799999985879707744100687637999999997089675 Q gi|254780234|r 100 IAKYLVDIANETGADAIAHGSTGKGN--DQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDLIDFAEKHAIPID 170 (404) Q Consensus 100 ia~~lv~~a~~~ga~~iaHG~TgkGN--DQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~i~ya~~~gIpv~ 170 (404) +.+.+-+.|++.|.+..-..+.+.++ .|.++=..+..-.++.-+++|..+- .-...++-+.+.||||- T Consensus 17 ~~~g~e~~a~~~G~~v~v~~~~~~~d~~~q~~~i~~~i~~~vdgii~~p~~~~---~~~~~~~~a~~~gIpvv 86 (275) T cd06320 17 LKEGYENEAKKLGVSVDIQAAPSEGDQQGQLSIAENMINKGYKGLLFSPISDV---NLVPAVERAKKKGIPVV 86 (275) T ss_pred HHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHHHHHHHHCCCCEEEECCCCCH---HHHHHHHHHHHCCCEEE T ss_conf 99999999997499899996899758999999999999749987987678806---56999999986799099 No 101 >COG1440 CelA Phosphotransferase system cellobiose-specific component IIB [Carbohydrate transport and metabolism] Probab=82.36 E-value=4.2 Score=21.05 Aligned_cols=62 Identities=29% Similarity=0.426 Sum_probs=39.6 Q ss_pred CCEEEEEECCCHHHHHHHHHHHH---HCCCE--EEEEEEE-------------CC-CCC-HHHHHHHHHHHCCCEEEEEE Q ss_conf 77899991477259999999987---18986--9999945-------------78-711-06899999997398079982 Q gi|254780234|r 5 VKKVVLAYSGGLDTSIILKWLQV---EKGLE--VIVFIAD-------------LG-QGE-ELKIASDKARLLGAKEVYVK 64 (404) Q Consensus 5 ~kkVvlaySGGLDTSv~i~~L~~---e~g~e--Vi~~~~d-------------~G-q~~-d~~~~~~~A~~~Ga~~~~v~ 64 (404) |+||.|..|+|.-||..+.-+++ .+|-| +-++..+ +| |-. -+..+++.+...|. ++.++ T Consensus 1 Mk~IlLvC~aGmSTSlLV~Km~~aA~~kg~~~~I~A~s~~e~~~~~~~~DvvLlGPQv~y~~~~~~~~~~~~gi-PV~vI 79 (102) T COG1440 1 MKKILLVCAAGMSTSLLVTKMKKAAESKGKDVTIEAYSETELSEYIDNADVVLLGPQVRYMLKQLKEAAEEKGI-PVEVI 79 (102) T ss_pred CCEEEEEECCCCCHHHHHHHHHHHHHHCCCCEEEEEECHHHHHHHHHCCCEEEECHHHHHHHHHHHHHHCCCCC-CEEEE T ss_conf 94499984588748899999999999579965999810657887663089899872899889999998431599-76785 Q ss_pred CHH Q ss_conf 008 Q gi|254780234|r 65 DLR 67 (404) Q Consensus 65 D~r 67 (404) |-. T Consensus 80 ~~~ 82 (102) T COG1440 80 DML 82 (102) T ss_pred CHH T ss_conf 789 No 102 >cd01538 PBP1_ABC_xylose_binding Periplasmic xylose-binding component of the ABC-type transport systems that belong to a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein (PBP1) superfamily. Periplasmic xylose-binding component of the ABC-type transport systems that belong to a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein (PBP1) superfamily, which consists of two alpha/beta globular domains connected by a three-stranded hinge. This Venus flytrap-like domain undergoes a transition from an open to a closed conformational state upon ligand binding. Moreover, the periplasmic xylose-binding protein is homologous to the ligand-binding domain of eukaryotic receptors such as glutamate receptor (GluR) and DNA-binding transcriptional repressors such as LacI and GalR. Probab=82.33 E-value=4.2 Score=21.04 Aligned_cols=69 Identities=19% Similarity=0.204 Sum_probs=51.4 Q ss_pred HHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHCCCCC Q ss_conf 359999999885689287500111682156799999985879707744100687637999999997089675 Q gi|254780234|r 99 LIAKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDLIDFAEKHAIPID 170 (404) Q Consensus 99 lia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~i~ya~~~gIpv~ 170 (404) .+.+.+-+.|+++|.+.+.-.+.+.-+.|++.=..+.+-.++.-|++|. +...-...++-|++.||||- T Consensus 16 ~~~~g~e~~A~e~G~~v~~~~~~~d~~~Q~~~i~~lia~~vD~Iii~p~---d~~a~~~~l~~a~~agIPVV 84 (288) T cd01538 16 RDRPNFEAALKELGAEVIVQNANGDPAKQISQIENMIAKGVDVLVIAPV---DGEALASAVEKAADAGIPVI 84 (288) T ss_pred HHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHHHHCCCCEEEECCC---CHHHHHHHHHHHHHCCCEEE T ss_conf 9999999999976998999718999999999999999849989998677---53443999999997599999 No 103 >cd06318 PBP1_ABC_sugar_binding_like_9 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=82.22 E-value=4.2 Score=21.01 Aligned_cols=65 Identities=17% Similarity=0.185 Sum_probs=24.7 Q ss_pred HHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHCCCC Q ss_conf 99999988568928750011168215679999998587970774410068763799999999708967 Q gi|254780234|r 102 KYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDLIDFAEKHAIPI 169 (404) Q Consensus 102 ~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~i~ya~~~gIpv 169 (404) +.+-+.|+++|.+++.-.+.+.-+.|++.=..+.+-.++.-|++|.- -..-...++-|++.|||| T Consensus 19 ~g~~~~a~~~G~~v~~~~~~~d~~~Q~~~i~~~i~~~vd~iii~~~d---~~~~~~~~~~a~~aGIPV 83 (282) T cd06318 19 EAAKAHAKALGYELISTDAQGDLTKQIADVEDLLTRGVNVLIINPVD---PEGLVPAVAAAKAAGVPV 83 (282) T ss_pred HHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHHHHCCCCEEEEECCC---HHHHHHHHHHHHHCCCCE T ss_conf 99999999729999997699999999999999997699879981168---056699999999779988 No 104 >cd06323 PBP1_ribose_binding Periplasmic sugar-binding domain of the thermophilic Thermoanaerobacter tengcongensis ribose binding protein (ttRBP) and its mesophilic homologs. Periplasmic sugar-binding domain of the thermophilic Thermoanaerobacter tengcongensis ribose binding protein (ttRBP) and its mesophilic homologs. Members of this group are belonging to the type I periplasmic binding protein superfamily, whose members are involved in chemotaxis, ATP-binding cassette transport, and intercellular communication in central nervous system. The thermophilic and mesophilic ribose-binding proteins are structurally very similar, but differ substantially in thermal stability. Probab=82.11 E-value=4.3 Score=20.99 Aligned_cols=68 Identities=16% Similarity=0.096 Sum_probs=34.3 Q ss_pred HHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHCCCCC Q ss_conf 59999999885689287500111682156799999985879707744100687637999999997089675 Q gi|254780234|r 100 IAKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDLIDFAEKHAIPID 170 (404) Q Consensus 100 ia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~i~ya~~~gIpv~ 170 (404) +++.+-+.|++.|.+.+-..+.+..+.|.++=..+....++.-|++|+- -..-.+.++.+.+.|||+- T Consensus 17 ~~~gi~~~a~~~Gy~~~i~~s~~~~~~e~~~i~~l~~~~vdgiI~~p~~---~~~~~~~i~~~~~~~iPvV 84 (268) T cd06323 17 LKDGAQKEAKELGYELTVLDAQNDAAKQLNDIEDLITRGVDAIIINPTD---SDAVVPAVKAANEAGIPVF 84 (268) T ss_pred HHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHHHCCCCEEEECCCC---CCCCHHHHHHHHHCCCCEE T ss_conf 9999999999759989998199999999999999996499989975654---2124699999997699689 No 105 >COG0439 AccC Biotin carboxylase [Lipid metabolism] Probab=81.70 E-value=4.4 Score=20.89 Aligned_cols=148 Identities=16% Similarity=0.203 Sum_probs=68.1 Q ss_pred CCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHH------HHHHHHH Q ss_conf 7789999147725999999998718986999994578711068999999973980799820089999------9879999 Q gi|254780234|r 5 VKKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFV------RDFVFPM 78 (404) Q Consensus 5 ~kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~------~~~i~~~ 78 (404) ++||+.|..|...-.+ ...-.+.|.+.++++.+.-+.... ..-|.+.+.++- ...+ ...+-.+ T Consensus 2 ~~kiLIanrGeia~ri--~ra~~~lGi~tvav~s~~d~~~~~--------~~~adeav~i~~-~~~~~syl~i~~ii~~a 70 (449) T COG0439 2 FKKILIANRGEIAVRI--IRACRELGIETVAVYSEADADALH--------VALADEAVCIGP-APSADSYLNIDAIIAAA 70 (449) T ss_pred CCEEEEECCCHHHHHH--HHHHHHHCCEEEEEECCCCCCCHH--------HHHCCEEEECCC-CCCHHHHHHHHHHHHHH T ss_conf 7369995585368999--999998498589996610022525--------663756798388-65034565188899899 Q ss_pred --HHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEE-EEEECCCCCCCH Q ss_conf --73686547874201301243359999999885689287500111682156799999985879707-744100687637 Q gi|254780234|r 79 --FRANALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEI-IAPWRHWSFKGR 155 (404) Q Consensus 79 --I~ana~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~v-iaP~Rd~~~~sR 155 (404) ..+.|.+-|.=.|+.. ....+.+.+.|...+.-.++-. ...|.-...+.+.-.-.| ..|++|--..+- T Consensus 71 ~~~gadai~pGygflsen-------~~fae~~~~~gl~fiGP~~~~i--~~mgdK~~ar~~~~~aGVP~vpgs~~~~~~~ 141 (449) T COG0439 71 EETGADAIHPGYGFLSEN-------AAFAEACAEAGLTFIGPSAEAI--RRMGDKITARRLMAKAGVPVVPGSDGAVADN 141 (449) T ss_pred HHCCCCEECCCCHHHHCC-------HHHHHHHHHCCCEEECCCHHHH--HHHHHHHHHHHHHHHCCCCCCCCCCCCCCCH T ss_conf 860876672350342178-------8999999974975108498999--9744589999999974999589978776888 Q ss_pred HHHHHHHHHHCCCCCCC Q ss_conf 99999999708967566 Q gi|254780234|r 156 QDLIDFAEKHAIPIDKN 172 (404) Q Consensus 156 e~~i~ya~~~gIpv~~~ 172 (404) ++..+.++++|.||=.+ T Consensus 142 ee~~~~a~~iGyPVivK 158 (449) T COG0439 142 EEALAIAEEIGYPVIVK 158 (449) T ss_pred HHHHHHHHHCCCCEEEE T ss_conf 99999998719978999 No 106 >TIGR01283 nifE nitrogenase MoFe cofactor biosynthesis protein NifE; InterPro: IPR005973 The enzyme responsible for nitrogen fixation, the nitrogenase, shows a high degree of conservation of structure, function, and amino acid sequence across wide phylogenetic ranges. All known Mo-nitrogenases consist of two components, component I (also called dinitrogenase, or Fe-Mo protein), an alpha2beta2 tetramer encoded by the nifD and nifK genes, and component II (dinitrogenase reductase, or Fe protein) a homodimer encoded by the nifH gene. Two operons, nifDK and nifEN, encode a tetrameric (alpha2beta2 and N2E2) enzymatic complex. Nitrogenase contains two unusual rare metal clusters; one of them is the iron molybdenum cofactor (FeMo-co), which is considered to be the site of dinitrogen reduction and whose biosynthesis requires the products of nifNE and of some other nif genes. It has been proposed that NifNE might serve as a scaffold upon which FeMo-co is built and then inserted into component I. ; GO: 0005515 protein binding, 0006461 protein complex assembly, 0009399 nitrogen fixation. Probab=81.28 E-value=4.6 Score=20.78 Aligned_cols=113 Identities=24% Similarity=0.409 Sum_probs=75.9 Q ss_pred CEEEEEECCC-HHHHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCC Q ss_conf 7899991477-259999999987189869999945787110689999999739807998200899999879999736865 Q gi|254780234|r 6 KKVVLAYSGG-LDTSIILKWLQVEKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANAL 84 (404) Q Consensus 6 kkVvlaySGG-LDTSv~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~ 84 (404) -|=|+.|+|| .=|=-+|.=|+ +-|+|||+.-.-.|.+||...|++.. |-.-+.+.|. |+ T Consensus 337 GKkaaiY~GGa~KswSlv~Al~-dLGMeVV~~GTqkg~~EDy~~I~e~~---~~~~~m~Dd~---------------~p- 396 (470) T TIGR01283 337 GKKAAIYTGGAVKSWSLVSALQ-DLGMEVVATGTQKGTEEDYERIRELM---GEGTVMLDDA---------------NP- 396 (470) T ss_pred CCEEEEECCCHHHHHHHHHHHH-HCCCEEEEEEECCCCHHHHHHHHHHH---CCCCEEEECC---------------CH- T ss_conf 9889986586478999998884-57917999830079888999999970---7996786258---------------87- Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCC------CCCCHHHH Q ss_conf 478742013012433599999998856892875001116821567999999858797077441006------87637999 Q gi|254780234|r 85 YEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEIIAPWRHW------SFKGRQDL 158 (404) Q Consensus 85 Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~------~~~sRe~~ 158 (404) ..|++...+.+||.+-=|. | | +-++ +|---||-|. .|.|=+=+ T Consensus 397 -----------------~~L~~~~~~~~ADlliaGg--k-------~---~y~A--lK~gipFlDiNhER~~~~aGY~G~ 445 (470) T TIGR01283 397 -----------------RELLKLLLEYKADLLIAGG--K-------E---RYLA--LKLGIPFLDINHEREHPYAGYDGM 445 (470) T ss_pred -----------------HHHHHHHHHHCCCEEEECC--H-------H---HHHH--HHCCCCCCCCCCCCCCCCCCCHHH T ss_conf -----------------8999999872898898573--1-------2---3356--551775424554546664220678 Q ss_pred HHHHHHHCCCC Q ss_conf 99999708967 Q gi|254780234|r 159 IDFAEKHAIPI 169 (404) Q Consensus 159 i~ya~~~gIpv 169 (404) +++|++=...| T Consensus 446 ~~fAr~v~~~~ 456 (470) T TIGR01283 446 VEFAREVDLTV 456 (470) T ss_pred HHHHHHHHHHH T ss_conf 89999999985 No 107 >PRK12767 carbamoyl phosphate synthase-like protein; Provisional Probab=81.24 E-value=3.8 Score=21.33 Aligned_cols=49 Identities=20% Similarity=0.165 Sum_probs=24.2 Q ss_pred CCEEEEEECCCHHHHHHHHHHHHH-CCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEEC Q ss_conf 778999914772599999999871-8986999994578711068999999973980799820 Q gi|254780234|r 5 VKKVVLAYSGGLDTSIILKWLQVE-KGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKD 65 (404) Q Consensus 5 ~kkVvlaySGGLDTSv~i~~L~~e-~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D 65 (404) |.+|.+-=.|| -.-++.+|++. .++.||+.-.|--.+ +... |.++|++. T Consensus 1 m~nILvt~~G~--~~~ii~~lk~~~~~~~Vi~~D~~~~a~---------~~~~-aD~~y~~P 50 (325) T PRK12767 1 MMNILVTSAGR--RVQLVKALKKSLLGGKVIGADISPLAP---------ALYF-ADKFYVVP 50 (325) T ss_pred CCEEEEECCCC--HHHHHHHHHHCCCCCEEEEECCCCCCC---------CHHH-CCEEEECC T ss_conf 94899986786--899999999769985999968998995---------3445-48899878 No 108 >PRK09958 DNA-binding transcriptional activator EvgA; Provisional Probab=80.98 E-value=0.95 Score=25.66 Aligned_cols=142 Identities=15% Similarity=0.154 Sum_probs=70.0 Q ss_pred CHHHHHHHHHHHHH-CCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCC Q ss_conf 72599999999871-89869999945787110689999999739807998200899999879999736865478742013 Q gi|254780234|r 15 GLDTSIILKWLQVE-KGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYEGYYLLGT 93 (404) Q Consensus 15 GLDTSv~i~~L~~e-~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Yeg~Ypl~t 93 (404) |.|.--.+.+++.. ....|+.+++. +| .....+|.+.||.-++..+.-.+ + +..+|+ ..+.|..++.. T Consensus 57 ~~~G~~~~~~ir~~~~~~~viv~s~~----~~-~~~~~~a~~~Ga~g~l~K~~~~~---~-l~~aI~--~v~~G~~~~~~ 125 (204) T PRK09958 57 GVNGIQVLETLRKRQYSGIIIIVSAK----ND-HFYGKHCADAGANGFVSKKEGMN---N-IIAAIE--AAKNGYCYFPF 125 (204) T ss_pred CCCCHHHHHHHHHHCCCEEEEEEECC----CC-CHHHHHHHHCCCCEEEECCCCHH---H-HHHHHH--HHHCCCCCCCH T ss_conf 99841678889872798059999724----65-21144335268637997899999---9-999999--99779970889 Q ss_pred CHHHHHHHHHH----HHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHCC Q ss_conf 01243359999----999885689287500111682156799999985879707744100687637999999997089 Q gi|254780234|r 94 AIARPLIAKYL----VDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDLIDFAEKHAI 167 (404) Q Consensus 94 slaRplia~~l----v~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~i~ya~~~gI 167 (404) ++.|+...-.. +..-..+-..++-+=+.|+.|.|+-.++.+..-.=+-.+-.=.+-.+.++|.+.+.||.+||| T Consensus 126 ~~~~~~~~~~~~~~~~~~Lt~RE~eVL~ll~~G~snkeIA~~L~iS~~TV~~h~~~i~~KL~v~~r~el~~~A~~~~l 203 (204) T PRK09958 126 SLNRFVGSLTSDQQKLDSLSKQEISVMRYILDGKDNNDIAEKMFISNKTVSTYKSRLMEKLECKSLMDLYTFAQRNKI 203 (204) T ss_pred HHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCHHHHHHHHCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCC T ss_conf 999998631356311256899999999999869999999989788999999999999998489999999999998289 No 109 >TIGR02298 HpaD_Fe 3,4-dihydroxyphenylacetate 2,3-dioxygenase; InterPro: IPR011984 This enzyme catalyses the ring-opening step in the degradation of 4-hydroxyphenylacetate .. Probab=80.14 E-value=3.6 Score=21.52 Aligned_cols=88 Identities=25% Similarity=0.350 Sum_probs=68.4 Q ss_pred CHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCH--------------HHHHHHHHHHHHHH Q ss_conf 1068999999973980799820089999987999973686547874201301--------------24335999999988 Q gi|254780234|r 44 EELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYEGYYLLGTAI--------------ARPLIAKYLVDIAN 109 (404) Q Consensus 44 ~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Yeg~Ypl~tsl--------------aRplia~~lv~~a~ 109 (404) +-..+|-++|+..|..-++|.|. .=++ +. -+=|.||+-++|.| ||= +-|-++..+.+.|. T Consensus 93 dghkEi~~rakE~~VdTivVfDt-HWLv-N~-~YHIN~~~~f~G~y---TS~E~PHfI~~l~Y~y~GnpaLg~lIA~~A~ 166 (343) T TIGR02298 93 DGHKEISRRAKEMGVDTIVVFDT-HWLV-NS-GYHINCNDQFSGSY---TSHELPHFIQDLEYDYPGNPALGQLIADEAQ 166 (343) T ss_pred HHHHHHHHHHHHCCCCEEEEECC-HHHH-CC-CCCCCCCCCCCCCC---CCCCCCHHHHCCCCCCCCCHHHHHHHHHHHH T ss_conf 99888864310046866999632-0210-05-42326677745421---4345530232355578988778999999997 Q ss_pred HCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCCC Q ss_conf 568928750011168215679999998587970774410068 Q gi|254780234|r 110 ETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEIIAPWRHWS 151 (404) Q Consensus 110 ~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~ 151 (404) +.|..+.||- +.+|.=|...+.|-|-.+ T Consensus 167 ~~Gv~~~aH~--------------~~sL~LEYGTlVPMrYmN 194 (343) T TIGR02298 167 EKGVKTLAHQ--------------VPSLGLEYGTLVPMRYMN 194 (343) T ss_pred HCCCEEECCC--------------CCCCCCCCCCCCCCHHHC T ss_conf 3891455055--------------563454366404423107 No 110 >cd01985 ETF The electron transfer flavoprotein (ETF) serves as a specific electron acceptor for various mitochondrial dehydrogenases. ETF transfers electrons to the main respiratory chain via ETF-ubiquinone oxidoreductase. ETF is an heterodimer that consists of an alpha and a beta subunit which binds one molecule of FAD per dimer . A similar system also exists in some bacteria. The homologous pair of proteins (FixA/FixB) are essential for nitrogen fixation. The alpha subunit of ETF is structurally related to the bacterial nitrogen fixation protein fixB which could play a role in a redox process and feed electrons to ferredoxin. The beta subunit protein is distantly related to and forms a heterodimer with the alpha subunit. Probab=79.36 E-value=5.2 Score=20.36 Aligned_cols=78 Identities=24% Similarity=0.300 Sum_probs=52.2 Q ss_pred HHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHH Q ss_conf 99998718986999994578711068999999973980799820089999987999973686547874201301243359 Q gi|254780234|r 22 LKWLQVEKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYEGYYLLGTAIARPLIA 101 (404) Q Consensus 22 i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Yeg~Ypl~tslaRplia 101 (404) ...|+ |+|-+|+++++ |.+ ..+...+++...||.+.++++-.. |. +| .+ -.++ T Consensus 28 A~~lk-e~~~~v~~v~~--G~~-~~~~~l~~~~a~GaD~v~~v~~~~-~~-~~--------------~~-------~~~a 80 (181) T cd01985 28 ALRLK-EYGGEVTALVI--GPP-AAEVALREALAMGADKVLLVEDPA-LA-GY--------------DP-------EATA 80 (181) T ss_pred HHHHH-HCCCCEEEEEE--CCC-HHHHHHHHHHHCCCCEEEEECCCC-CC-CC--------------CH-------HHHH T ss_conf 99864-44996899997--881-689999999772897899981674-45-76--------------88-------9999 Q ss_pred HHHHHHHHHCCCCEEECHHHCCCCH Q ss_conf 9999998856892875001116821 Q gi|254780234|r 102 KYLVDIANETGADAIAHGSTGKGND 126 (404) Q Consensus 102 ~~lv~~a~~~ga~~iaHG~TgkGND 126 (404) +.+.++.++.+.+.|--|.|..|.| T Consensus 81 ~~la~~i~~~~~dlVl~g~~s~g~~ 105 (181) T cd01985 81 KALAALIKKEKPDLILAGATSIGKQ 105 (181) T ss_pred HHHHHHHHHCCCCEEEECCCHHHCC T ss_conf 9999999860998999604102225 No 111 >cd01714 ETF_beta The electron transfer flavoprotein (ETF) serves as a specific electron acceptor for various mitochondrial dehydrogenases. ETF transfers electrons to the main respiratory chain via ETF-ubiquinone oxidoreductase. ETF is an heterodimer that consists of an alpha and a beta subunit which binds one molecule of FAD per dimer . A similar system also exists in some bacteria. The homologous pair of proteins (FixA/FixB) are essential for nitrogen fixation. The beta subunit protein is distantly related to and forms a heterodimer with the alpha subunit. Probab=78.00 E-value=5.7 Score=20.08 Aligned_cols=95 Identities=25% Similarity=0.184 Sum_probs=64.1 Q ss_pred HHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHH Q ss_conf 99998718986999994578711068999999973980799820089999987999973686547874201301243359 Q gi|254780234|r 22 LKWLQVEKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYEGYYLLGTAIARPLIA 101 (404) Q Consensus 22 i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Yeg~Ypl~tslaRplia 101 (404) ...||+..|.+|+++++ | +.+.+..-++|+..||.+.+.+.-+ .| ++.=+ ..+| T Consensus 44 Alrlke~~g~~Vtvlsv--G-p~~a~~~Lr~alAmGaD~ai~i~d~-~~---------------~~~D~-------~~tA 97 (202) T cd01714 44 ALRLKEKYGGEVTVVSM--G-PPQAEEALREALAMGADRAILVSDR-AF---------------AGADT-------LATA 97 (202) T ss_pred HHHHHHCCCCEEEEEEE--C-CHHHHHHHHHHHHHCCCEEEEECCC-CC---------------CCCCH-------HHHH T ss_conf 99986603988999993--7-4788999999997089835998065-55---------------66698-------9999 Q ss_pred HHHHHHHHHCCCCEEECHH--HCCCCHHHHHHHHHHHHCCCCE Q ss_conf 9999998856892875001--1168215679999998587970 Q gi|254780234|r 102 KYLVDIANETGADAIAHGS--TGKGNDQVRFELSAYSLNSDIE 142 (404) Q Consensus 102 ~~lv~~a~~~ga~~iaHG~--TgkGNDQvRFe~~~~~l~P~l~ 142 (404) +.+....++.+.|.|--|. +--++.|+=.-++-..=-|.+. T Consensus 98 ~~La~~i~~~~~DLIl~G~~s~D~~~g~v~~~lA~~Lg~P~vt 140 (202) T cd01714 98 KALAAAIKKIGVDLILTGKQSIDGDTGQVGPLLAELLGWPQIT 140 (202) T ss_pred HHHHHHHHHCCCCEEEEEEECCCCCCCCHHHHHHHHHCCCCCC T ss_conf 9999999875998899954545799777999999994897224 No 112 >PRK10840 transcriptional regulator RcsB; Provisional Probab=77.04 E-value=1.5 Score=24.32 Aligned_cols=135 Identities=12% Similarity=0.145 Sum_probs=69.5 Q ss_pred HHHHHHHHHHHH-CCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCH Q ss_conf 599999999871-8986999994578711068999999973980799820089999987999973686547874201301 Q gi|254780234|r 17 DTSIILKWLQVE-KGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYEGYYLLGTAI 95 (404) Q Consensus 17 DTSv~i~~L~~e-~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Yeg~Ypl~tsl 95 (404) |.--++.+++.. ....|+.+|+.. + ...-.+|...||.-++..+...+- +..+|+ +.++|...+..++ T Consensus 66 dGl~~~~~i~~~~p~~~vivls~~~----~-~~~v~~al~~Ga~Gyl~K~~~~~~----L~~AI~--~v~~G~~~~~~~~ 134 (216) T PRK10840 66 DGITLIKYIKRHFPSLSIIVLTMNN----N-PAILSAVLDLDIEGIVLKQGAPTD----LPKALA--ALQKGKKFTPESV 134 (216) T ss_pred CHHHHHHHHHHHCCCCEEEEEECCC----C-HHHHHHHHHCCCCEEEECCCCHHH----HHHHHH--HHHCCCCCCCHHH T ss_conf 8999999999858998089984778----7-899999985897489987899999----999999--9987994396999 Q ss_pred HHHHHHHHHHHHHHHCC----------CCEEECHHHCCCCHHHHHHHHH--HHHCCCCEEEEEECCCCCCCHHHHHHHHH Q ss_conf 24335999999988568----------9287500111682156799999--98587970774410068763799999999 Q gi|254780234|r 96 ARPLIAKYLVDIANETG----------ADAIAHGSTGKGNDQVRFELSA--YSLNSDIEIIAPWRHWSFKGRQDLIDFAE 163 (404) Q Consensus 96 aRplia~~lv~~a~~~g----------a~~iaHG~TgkGNDQvRFe~~~--~~l~P~l~viaP~Rd~~~~sRe~~i~ya~ 163 (404) ++-+ +.....+ ..++..=|.|+.|.|+--++.+ ++-. -.+-+=.|-.+.++|-+++.|+. T Consensus 135 ~~~l------~~~~~~~~~~~~LT~RE~eVL~lla~G~snkeIA~~L~iS~~TV~--~h~~~i~~KLgv~n~~eLv~ya~ 206 (216) T PRK10840 135 SRLL------EKISAGGYGDKRLSPKESEVLRLFAEGFLVTEIAKKLNRSIKTIS--SQKKSAMMKLGVENDIALLNYLS 206 (216) T ss_pred HHHH------HHHCCCCCCCCCCCHHHHHHHHHHHCCCCHHHHHHHHCCCHHHHH--HHHHHHHHHCCCCCHHHHHHHHH T ss_conf 9999------972058887789998999999999869999999989698999999--99999999829998999999999 Q ss_pred HHCCCCCC Q ss_conf 70896756 Q gi|254780234|r 164 KHAIPIDK 171 (404) Q Consensus 164 ~~gIpv~~ 171 (404) ++|+ +|. T Consensus 207 ~~gl-~p~ 213 (216) T PRK10840 207 SVTL-SPA 213 (216) T ss_pred HCCC-CCC T ss_conf 8699-985 No 113 >pfam05368 NmrA NmrA-like family. NmrA is a negative transcriptional regulator involved in the post-translational modification of the transcription factor AreA. NmrA is part of a system controlling nitrogen metabolite repression in fungi. This family only contains a few sequences as iteration results in significant matches to other Rossmann fold families. Probab=76.81 E-value=6.2 Score=19.85 Aligned_cols=211 Identities=18% Similarity=0.198 Sum_probs=104.9 Q ss_pred EEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 99991477259999999987189869999945787110689999999739807998200899999879999736865478 Q gi|254780234|r 8 VVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYEG 87 (404) Q Consensus 8 VvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Yeg 87 (404) .|+.=+|-+-+.++-.++ +.|++|.+++=|.-. . ..+.....|+ +.+..|+.+. +-+..+++.- +. T Consensus 2 lV~GatG~iG~~vv~~L~--~~g~~Vr~l~R~~~~----~-~~~~l~~~gv-e~v~gD~~d~---~sl~~al~gv---d~ 67 (232) T pfam05368 2 LVFGATGYQGGSVVRASL--KAGHPVRALVRDPKS----E-LAKSLKAAGV-ELVEGDLDDH---ESLVEALKGV---DV 67 (232) T ss_pred EEECCCHHHHHHHHHHHH--HCCCCEEEEECCCCH----H-HHHHHHHCCC-EEEEECCCCH---HHHHHHHCCC---CE T ss_conf 998968289999999998--589938999718736----6-5666641798-8999068887---8999996799---88 Q ss_pred CCCCCCCHH--HHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHH Q ss_conf 742013012--433599999998856892875001116821567999999858797077441006876379999999970 Q gi|254780234|r 88 YYLLGTAIA--RPLIAKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDLIDFAEKH 165 (404) Q Consensus 88 ~Ypl~tsla--Rplia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~i~ya~~~ 165 (404) -|.+..... ..-..+.+++.|++-|..-+.+.+.+..++.. .|... ..|. +.+....-+|+++. T Consensus 68 v~~~~~~~~~~~~~~~~~~~~AA~~aGVk~~V~ss~~~~~~~~---------~~~~~-~~~~----~~~K~~~e~~l~~~ 133 (232) T pfam05368 68 VFSVTGFWLSKEIEDGKKLADAAKEAGVKHFIPSEFGNDVDRS---------NGVEP-AVPH----FDSKAEVERYIRAL 133 (232) T ss_pred EEEECCCCCCHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCC---------CCCCC-CCHH----HHHHHHHHHHHHHH T ss_conf 9991588741779999999999997399834555501255456---------76665-5278----89899999999981 Q ss_pred CCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCCC----CCC--HHHCCCCCCHHH-------CCCCCEEEEEEEECCEE Q ss_conf 89675665678864157243115457632458457----682--010014658122-------79996389999950247 Q gi|254780234|r 166 AIPIDKNKRGEAPFSIDTNLLHSSSEGRVLEDPSQ----PAP--EYVYKMIVSPEE-------APDTPTTIRIDFQRGDP 232 (404) Q Consensus 166 gIpv~~~~~~~~~yS~D~Nlwg~S~Egg~Ledp~~----~~p--e~~~~~t~~p~~-------ap~~pe~v~I~Fe~G~P 232 (404) |+|...- .|-..-+|+...-.....+..+.. ..| .....++ +.+| +-..|+. + +|.- T Consensus 134 g~~~til----rp~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~Dva~~~~~~l~~p~~----~-~~~~ 203 (232) T pfam05368 134 GIPYTFV----YAGFFMGNFLSNLAPPGDLAPPRDKVTLLGPGNPKAVPLD-DEEDIGTYVIKILDDPRK----L-KGKY 203 (232) T ss_pred CCCEEEE----ECCCHHHHHHHHHHHHHHCCCCCCEEEEECCCCCCEEEEE-EHHHHHHHHHHHHCCHHH----C-CCEE T ss_conf 9985999----6842543016565443202576536999448987611265-288999999999649121----1-9999 Q ss_pred EEECCEECCHHHHHHHHHHHHHHC Q ss_conf 875162318999999998886421 Q gi|254780234|r 233 IAINGQVMSPEVLLEQLNQYGRCN 256 (404) Q Consensus 233 VainG~~~~~~~li~~LN~igg~~ 256 (404) +.+-|+.++.-|+++.+.+..||- T Consensus 204 ~~~~~~~lT~~Eia~~~~~~~Gr~ 227 (232) T pfam05368 204 IRPPGNILSGNEIAELWSKKIGKT 227 (232) T ss_pred EEECCCCCCHHHHHHHHHHHHCCC T ss_conf 982898679999999999988998 No 114 >cd06305 PBP1_methylthioribose_binding_like Methylthioribose-binding protein-like of ABC-type transport systems that belong to a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein (PBP1) superfamily. Methylthioribose-binding protein-like of ABC-type transport systems that belong to a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein (PBP1) superfamily, which consists of two alpha/beta globular domains connected by a three-stranded hinge. This Venus flytrap-like domain undergoes transition from an open to a closed conformational state upon ligand binding. The sugar-binding domain of the periplasmic proteins in this group is also homologous to the ligand-binding domain of eukaryotic receptors such as metabotropic glutamate receptor (mGluR), DNA-binding transcriptional repressors such as LacI and GalR. Probab=75.32 E-value=6.7 Score=19.58 Aligned_cols=68 Identities=19% Similarity=0.068 Sum_probs=47.5 Q ss_pred HHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHCCCCC Q ss_conf 59999999885689287500111682156799999985879707744100687637999999997089675 Q gi|254780234|r 100 IAKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDLIDFAEKHAIPID 170 (404) Q Consensus 100 ia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~i~ya~~~gIpv~ 170 (404) +.+.+-+.|++.|.+++..-+.+--..|++.=..+-+..+|.-|++|+.. ..-...++-|.+.||||- T Consensus 17 ~~~G~~~~A~~~G~~~~v~d~~~d~~~Q~~~i~~~i~~~vDgIii~p~d~---~~~~~~l~~a~~aGIPVV 84 (273) T cd06305 17 YLAGTKAEAEALGGDLRVYDAGGDDAKQADQIDQAIAQKVDAIIIQHGRA---EVLKPWVKRALDAGIPVV 84 (273) T ss_pred HHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHHHHCCCCEEEEECCCH---HHHHHHHHHHHHCCCCEE T ss_conf 99999999997499899973999999999999999985999999946871---444899999998599789 No 115 >KOG2594 consensus Probab=75.17 E-value=6.8 Score=19.56 Aligned_cols=225 Identities=16% Similarity=0.189 Sum_probs=115.4 Q ss_pred CCCEEEEEECCCHHHHHHHHHHH----HH------CCCEEEEEEEECCCCCH---HHHHHHHHHHCCCEEEEEE---CHH Q ss_conf 77789999147725999999998----71------89869999945787110---6899999997398079982---008 Q gi|254780234|r 4 DVKKVVLAYSGGLDTSIILKWLQ----VE------KGLEVIVFIADLGQGEE---LKIASDKARLLGAKEVYVK---DLR 67 (404) Q Consensus 4 ~~kkVvlaySGGLDTSv~i~~L~----~e------~g~eVi~~~~d~Gq~~d---~~~~~~~A~~~Ga~~~~v~---D~r 67 (404) .-+.|.+++|||.-+-|.+.-+- .. -+..|.-+.+-..+.++ ++.+++-....--.-.|++ +. T Consensus 61 ~sk~vLlv~~G~ssSlvlLd~vh~~l~q~k~~~~~~~~tv~v~~~~~~~~~~~~v~e~lq~l~~~~~~~~~~~V~~la~- 139 (396) T KOG2594 61 PSKNVLLVFDGGSSSLVLLDFVHLALKQLKNKRLRRDFTVLVLVVFQEFTDSTAVFEALQELIIDNIEWVRYVVSCLAP- 139 (396) T ss_pred CCCCEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEEECCCHHHHHHHHHHHHHHHCCCCCEEEEECCC- T ss_conf 5534699951884047699999999998666532768854899998852511888999999998405554268873373- Q ss_pred HHHHHH-HHHHHHHCCCCCCCCC-----CCCCC------HHHHHHHHHHHHHHHHCCCCEEECH--------------HH Q ss_conf 999998-7999973686547874-----20130------1243359999999885689287500--------------11 Q gi|254780234|r 68 REFVRD-FVFPMFRANALYEGYY-----LLGTA------IARPLIAKYLVDIANETGADAIAHG--------------ST 121 (404) Q Consensus 68 ~ef~~~-~i~~~I~ana~Yeg~Y-----pl~ts------laRplia~~lv~~a~~~ga~~iaHG--------------~T 121 (404) .+|-++ -.++.+.+|---.-.| +-+.+ |-+.+--+-+..+|.+.|++.|.-| |- T Consensus 140 ~~~~~~~~~~~~~~~n~El~ak~~kl~~~~sv~~~srqDLl~~lk~kll~~vA~~~g~~~i~~g~~~t~la~~vlt~v~~ 219 (396) T KOG2594 140 PEKDNHVVPVESINGNDELIAKDRKLKLSDSVPDDSRQDLLLHLKMKLLQKVAAENGYNRIVLGDSTTDLASHVLTAVVK 219 (396) T ss_pred HHHHCCCCCCCCCCCCCEEECCHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCEEEECCCHHHHHHHHHHHHHH T ss_conf 57645756110037870011324330464456640178999999999999999985998798367352799999999983 Q ss_pred CCCCHHHHHHHHHHHHCCC--CEEEEEECCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCC--------H-------H Q ss_conf 1682156799999985879--707744100687637999999997089675665678864157--------2-------4 Q gi|254780234|r 122 GKGNDQVRFELSAYSLNSD--IEIIAPWRHWSFKGRQDLIDFAEKHAIPIDKNKRGEAPFSID--------T-------N 184 (404) Q Consensus 122 gkGNDQvRFe~~~~~l~P~--l~viaP~Rd~~~~sRe~~i~ya~~~gIpv~~~~~~~~~yS~D--------~-------N 184 (404) |+|-. +-.+...-...|. .+.+-|.|| + .+.|.-.|+.-.|++.+.-.. ..++.. . + T Consensus 220 GRG~s-is~~v~~~d~r~~~d~~llrPLrD--l-~~~Ei~~y~~l~~l~~~~c~~--~~k~~~~~~q~sI~~lT~afva~ 293 (396) T KOG2594 220 GRGGS-ISTDVQVVDKRPKGDVKLLRPLRD--L-LSLEITSYCLLDGLAYYFCQG--RRKTVELASQCSINDLTSAFVAL 293 (396) T ss_pred CCCCC-CEEHHHHHCCCCCCCCEEEHHHHH--H-HHHHHHHHHHHHCCCCHHHHH--HHHHCCCHHHCCHHHHHHHHHHH T ss_conf 66765-100544533465787200002778--8-899999999760687034567--87623303206487889999999 Q ss_pred H--HCCCCC------CCCCCCCCCCCCHHHCCCCCCHHHCCCCCEEEEEEEECCEEEEECC Q ss_conf 3--115457------6324584576820100146581227999638999995024787516 Q gi|254780234|r 185 L--LHSSSE------GRVLEDPSQPAPEYVYKMIVSPEEAPDTPTTIRIDFQRGDPIAING 237 (404) Q Consensus 185 l--wg~S~E------gg~Ledp~~~~pe~~~~~t~~p~~ap~~pe~v~I~Fe~G~PVainG 237 (404) | ---|.. +..|-.|+....+..-..+..|-+..+ +.--.+|+-|.|+++-| T Consensus 294 Lqn~f~S~vsTV~rTaaKl~~~s~s~~es~C~iCn~~l~~~~--s~~L~~ie~~~~~sv~s 352 (396) T KOG2594 294 LQNEFPSTVSTVVRTAAKLTVPSFSMTESFCPICNSPLNRSD--SSWLDTIEVGQPASVCS 352 (396) T ss_pred HHHHCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCC--HHHHHHEECCCCCCCCH T ss_conf 874140578899865654068888777552640377123770--45565342156764200 No 116 >PRK12921 2-dehydropantoate 2-reductase; Provisional Probab=74.96 E-value=6 Score=19.95 Aligned_cols=84 Identities=19% Similarity=0.086 Sum_probs=42.3 Q ss_pred HHHHHHHHHHHHHCCCCCEECCCCCCCCCCCCCCCC-CHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 999999988864216374100077320210001113-7378999999999986508999999999999999998753514 Q gi|254780234|r 243 EVLLEQLNQYGRCNGIGRIDIVENRFVGIKSRGVYE-TPGGTILLHAHRAIESISLDSGSAHLKDDLMSRYASIVYQGFW 321 (404) Q Consensus 243 ~~li~~LN~igg~~GvGr~d~vEnr~vG~KsR~vYE-aPga~iL~~Ahr~LE~~~l~~~~~~~K~~~~~~~a~lvy~G~w 321 (404) ..++.++-.++.++|+...+-...+++-.-.+...+ .|.|..=+.++|..|--.+.....+.-+..+- T Consensus 218 ~~l~~E~~~va~a~Gi~~~~~~~~~~~~~~~~~~~~~~sSM~qDi~~gr~tEid~i~G~iv~~a~~~Gv----------- 286 (306) T PRK12921 218 LALLDECLAVARAEGAPLRDQVVQRIVKIFAGAPGDMGTSMLRDLEKGRPLEIDHLNGVLLRRGRQHGI----------- 286 (306) T ss_pred HHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHCCCCCHHHHHHHHHHHHHHHHCC----------- T ss_conf 999999999999869999868999999999705799982899998879853799999999999999489----------- Q ss_pred CCHHHHHHHHHHHHHH Q ss_conf 6989999999999972 Q gi|254780234|r 322 FSPEREMLQALIDKSQ 337 (404) Q Consensus 322 f~p~~~~l~a~i~~~q 337 (404) -.|..+++-+.+...+ T Consensus 287 ~~P~~~~l~~lvk~~E 302 (306) T PRK12921 287 PTPILDTIYALLKAYE 302 (306) T ss_pred CCCHHHHHHHHHHHHH T ss_conf 9888999999999985 No 117 >cd06317 PBP1_ABC_sugar_binding_like_8 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Pperiplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=74.11 E-value=7.2 Score=19.38 Aligned_cols=65 Identities=20% Similarity=0.165 Sum_probs=23.1 Q ss_pred HHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHCCCC Q ss_conf 99999988568928750011168215679999998587970774410068763799999999708967 Q gi|254780234|r 102 KYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDLIDFAEKHAIPI 169 (404) Q Consensus 102 ~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~i~ya~~~gIpv 169 (404) +.+-+.|++.|.+++.--+.+.-+.|++.=..+.+..++.-|++|+.. ..-...+.-|.+.|||| T Consensus 20 ~g~e~~A~~~G~~~~v~~~~~d~~~q~~~i~~~i~~~vd~Iii~p~d~---~~~~~~i~~a~~agIpV 84 (275) T cd06317 20 KAFQAAAEEDGVEVIVLDANGDVARQAAQVEDLIAQKVDGIILWPTDG---QAYIPGLRKAKQAGIPV 84 (275) T ss_pred HHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHHHHCCCCEEEEECCCC---HHHHHHHHHHHHCCCEE T ss_conf 999999997699899977999999999999999975999999967871---24579999999869949 No 118 >PRK08462 biotin carboxylase; Validated Probab=72.49 E-value=7.8 Score=19.12 Aligned_cols=35 Identities=26% Similarity=0.498 Sum_probs=25.3 Q ss_pred CCCCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEE Q ss_conf 7777899991477259999999987189869999945 Q gi|254780234|r 3 RDVKKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIAD 39 (404) Q Consensus 3 ~~~kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d 39 (404) +++|||.+|=+|=.---++ .-++ +-|.+.++++.+ T Consensus 2 r~~kkvLIANRGEIA~Ri~-ra~~-elgi~tVavys~ 36 (446) T PRK08462 2 KELKRILIANRGEIALRAI-RTIQ-EMGKEAIAIYST 36 (446) T ss_pred CCCCEEEEECCHHHHHHHH-HHHH-HCCCCEEEEECH T ss_conf 9788899989609999999-9999-829969999376 No 119 >cd06312 PBP1_ABC_sugar_binding_like_4 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=72.26 E-value=7.9 Score=19.08 Aligned_cols=68 Identities=19% Similarity=0.110 Sum_probs=49.2 Q ss_pred HHHHHHHHHHHCCCCEEECHHH-CCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHCCCCC Q ss_conf 5999999988568928750011-1682156799999985879707744100687637999999997089675 Q gi|254780234|r 100 IAKYLVDIANETGADAIAHGST-GKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDLIDFAEKHAIPID 170 (404) Q Consensus 100 ia~~lv~~a~~~ga~~iaHG~T-gkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~i~ya~~~gIpv~ 170 (404) +.+.+-+.|+++|+++.--+.. +--+.|++.=..+-+..||.-+++|.- ...-...++-|.+.||||- T Consensus 18 v~~G~~~aA~~~Gv~v~~~~~~~~d~~~Q~~~i~~~i~~~vDaIii~p~~---~~~~~~~i~~a~~agIpVv 86 (271) T cd06312 18 VKNGAEDAAKDLGVDVEYRGPETFDVADMARLIEAAIAAKPDGIVVTIPD---PDALDPAIKRAVAAGIPVI 86 (271) T ss_pred HHHHHHHHHHHHCCEEEEEECCCCCHHHHHHHHHHHHHCCCCEEEEECCC---CCHHHHHHHHHHHCCCEEE T ss_conf 99999999999699899996898999999999999997599989993788---3002699999996598699 No 120 >PRK09955 rihB ribonucleoside hydrolase 2; Provisional Probab=71.98 E-value=8 Score=19.04 Aligned_cols=51 Identities=16% Similarity=0.262 Sum_probs=39.8 Q ss_pred CCCCCCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHH Q ss_conf 987777899991477259999999987189869999945787110689999999 Q gi|254780234|r 1 MSRDVKKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQGEELKIASDKAR 54 (404) Q Consensus 1 M~~~~kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~ 54 (404) |+| +||++=.-.|.|-.+++.++......||+++|+--| .-.++...+.|+ T Consensus 1 M~k--~kvIiD~D~G~DDa~Al~lal~~p~i~v~gITtv~G-N~~~~~~~~Nal 51 (313) T PRK09955 1 MEK--RKIILDCDPGHDDAIAIMMAAKHPAIDLLGITIVAG-NQTLDKTLINGL 51 (313) T ss_pred CCC--CEEEEECCCCHHHHHHHHHHHHCCCCCEEEEEEECC-CCCHHHHHHHHH T ss_conf 998--669998999879999999998689981899999359-988899999999 No 121 >cd06310 PBP1_ABC_sugar_binding_like_2 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=71.42 E-value=8.2 Score=18.95 Aligned_cols=67 Identities=22% Similarity=0.251 Sum_probs=42.5 Q ss_pred HHHHHHHHHHHCCCCEEECHHHCCCC--HHHH-HHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHCCCCC Q ss_conf 59999999885689287500111682--1567-99999985879707744100687637999999997089675 Q gi|254780234|r 100 IAKYLVDIANETGADAIAHGSTGKGN--DQVR-FELSAYSLNSDIEIIAPWRHWSFKGRQDLIDFAEKHAIPID 170 (404) Q Consensus 100 ia~~lv~~a~~~ga~~iaHG~TgkGN--DQvR-Fe~~~~~l~P~l~viaP~Rd~~~~sRe~~i~ya~~~gIpv~ 170 (404) +.+.+-+.|++.|.+.+-.++.+.++ .|.. ++. +..-.+|.-|++|... ..-.+.++-+.+.||||- T Consensus 17 v~~G~e~aA~~~G~~v~~~~~~~~~d~~~q~~~i~~-~i~~~vDgIii~p~~~---~~~~~~l~~a~~~gIPvV 86 (273) T cd06310 17 VKAGAEAAAKELGVKVTFQGPASETDVAGQVNLLEN-AIARGPDAILLAPTDA---KALVPPLKEAKDAGIPVV 86 (273) T ss_pred HHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHHH-HHHCCCCEEEEECCCC---HHHHHHHHHHHHCCCCEE T ss_conf 999999999980998999728997899999999999-9974999999916871---447999999998499858 No 122 >PRK09628 oorB 2-oxoglutarate-acceptor oxidoreductase subunit OorB; Reviewed Probab=71.30 E-value=6.6 Score=19.64 Aligned_cols=68 Identities=7% Similarity=0.102 Sum_probs=32.9 Q ss_pred HHHHHHHHCCCCEEECHHHCCCCHHHH-HHHHHHHHCCC-CEEEEEEC-----CCCCCCHHHHHHHHHHHCCCCC Q ss_conf 999998856892875001116821567-99999985879-70774410-----0687637999999997089675 Q gi|254780234|r 103 YLVDIANETGADAIAHGSTGKGNDQVR-FELSAYSLNSD-IEIIAPWR-----HWSFKGRQDLIDFAEKHAIPID 170 (404) Q Consensus 103 ~lv~~a~~~ga~~iaHG~TgkGNDQvR-Fe~~~~~l~P~-l~viaP~R-----d~~~~sRe~~i~ya~~~gIpv~ 170 (404) -.++.|...||++||.++++..++..+ +..++.--++. ++|++|.- ...+.+-.++++|.+++-+|+. T Consensus 156 d~~~la~aaGatfVAR~~~~~~~~l~~~ik~Ai~hkGfS~VevlspCpt~~gr~Nk~~~~~~~~~~~k~~~~~~~ 230 (281) T PRK09628 156 DACKLATAAGASFVARESVIDPKKLEKLLVKGFSHKGFSFFDVFSNCHINLGRKNKMGEAVQMLEWIKSRTVSKR 230 (281) T ss_pred CHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCCCCCCHHHHHHHHHHCCCCHH T ss_conf 999999876997899870489999999999996599967999956898875531686777999999987377778 No 123 >cd01534 4RHOD_Repeat_3 Member of the Rhodanese Homology Domain superfamily, repeat 3. This CD includes putative rhodanese-related sulfurtransferases which contain 4 copies of the Rhodanese Homology Domain. This CD aligns the 3rd repeat which does not contain the putative catalytic Cys residue. Probab=70.73 E-value=7.9 Score=19.07 Aligned_cols=12 Identities=42% Similarity=0.717 Sum_probs=5.9 Q ss_pred CCEEEEEECHHH Q ss_conf 980799820089 Q gi|254780234|r 57 GAKEVYVKDLRR 68 (404) Q Consensus 57 Ga~~~~v~D~r~ 68 (404) |-...+++|+|+ T Consensus 13 ~~r~~~llDVR~ 24 (95) T cd01534 13 GDRTVYRFDVRT 24 (95) T ss_pred CCCCEEEEECCC T ss_conf 997379997998 No 124 >cd03376 TPP_PFOR_porB_like Thiamine pyrophosphate (TPP family), PFOR porB-like subfamily, TPP-binding module; composed of proteins similar to the beta subunit (porB) of the Helicobacter pylori four-subunit pyruvate ferredoxin oxidoreductase (PFOR), which are also found in archaea and some hyperthermophilic bacteria. PFOR catalyzes the oxidative decarboxylation of pyruvate to form acetyl-CoA, a crucial step in many metabolic pathways. Archaea, anaerobic bacteria and eukaryotes that lack mitochondria (and therefore pyruvate dehydrogenase) use PFOR to oxidatively decarboxylate pyruvate, with ferredoxin or flavodoxin as the electron acceptor. The 36-kDa porB subunit contains the binding sites for the cofactors, TPP and a divalent metal cation, which are required for activity. Probab=70.29 E-value=8.7 Score=18.78 Aligned_cols=134 Identities=20% Similarity=0.199 Sum_probs=70.3 Q ss_pred HHHHHHHHCCCEEEEEEEECCCCC-HHHHHHHHHHHCCC-EEEEEECHHHHHHHHHHHHHHHCC---CCCCCCCCCCCCH Q ss_conf 999998718986999994578711-06899999997398-079982008999998799997368---6547874201301 Q gi|254780234|r 21 ILKWLQVEKGLEVIVFIADLGQGE-ELKIASDKARLLGA-KEVYVKDLRREFVRDFVFPMFRAN---ALYEGYYLLGTAI 95 (404) Q Consensus 21 ~i~~L~~e~g~eVi~~~~d~Gq~~-d~~~~~~~A~~~Ga-~~~~v~D~r~ef~~~~i~~~I~an---a~Yeg~Ypl~tsl 95 (404) ++++|+......|+++.-|-+--+ -+..+-. |..-|. ..++++|=+ ++-.-..| +...|..--+++- T Consensus 71 a~~~l~~~~~~~Vv~~~GDG~~~dIG~~~l~h-aa~Rn~~i~~iv~dN~-------~YgnTGGQ~S~tTp~G~~t~tsp~ 142 (235) T cd03376 71 ALKALGRGKDITVVAFAGDGGTADIGFQALSG-AAERGHDILYICYDNE-------AYMNTGIQRSGSTPYGAWTTTTPV 142 (235) T ss_pred HHHHHHCCCCCEEEEEECCCHHHHHHHHHHHH-HHHCCCCEEEEEECCH-------HHCCCCCCCCCCCCCCCCCCCCCC T ss_conf 99986138996399996474475420788999-9982998399998882-------100368966989989863112688 Q ss_pred HHH-----HHHHHHHHHHHHCCCCEEECHHHCCCCHHHH-HHHHHHHHCCC-CEEEEEE-CCCCCCCHHHHHHHHH Q ss_conf 243-----3599999998856892875001116821567-99999985879-7077441-0068763799999999 Q gi|254780234|r 96 ARP-----LIAKYLVDIANETGADAIAHGSTGKGNDQVR-FELSAYSLNSD-IEIIAPW-RHWSFKGRQDLIDFAE 163 (404) Q Consensus 96 aRp-----lia~~lv~~a~~~ga~~iaHG~TgkGNDQvR-Fe~~~~~l~P~-l~viaP~-Rd~~~~sRe~~i~ya~ 163 (404) +|- --.+-+.++|...|+.+||.++.+.-+|.++ |..+...-+|. +.+++|. -.|.+. -.+.++.++ T Consensus 143 G~~~~g~~~~~kd~~~i~~a~g~~yVA~~~~~~~~~~~k~i~~A~~~~G~s~i~v~spCpt~w~~~-~~~~~~~~~ 217 (235) T cd03376 143 GKVSFGKKQPKKDLPLIMAAHNIPYVATASVAYPEDLYKKVKKALSIEGPAYIHILSPCPTGWRFD-PSKTIEIAR 217 (235) T ss_pred CCCCCCCCCCCCCHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCC-HHHHHHHHH T ss_conf 855778777886999999870995899974589999999999997289988999964796667959-799999999 No 125 >COG1830 FbaB DhnA-type fructose-1,6-bisphosphate aldolase and related enzymes [Carbohydrate transport and metabolism] Probab=69.93 E-value=8.8 Score=18.73 Aligned_cols=29 Identities=24% Similarity=0.396 Sum_probs=18.4 Q ss_pred EEEEEEEECC--CC------CHHHHHHHHHHHCCCEE Q ss_conf 6999994578--71------10689999999739807 Q gi|254780234|r 32 EVIVFIADLG--QG------EELKIASDKARLLGAKE 60 (404) Q Consensus 32 eVi~~~~d~G--q~------~d~~~~~~~A~~~Ga~~ 60 (404) ..+-+.+|-| ++ +|++.+-+++.+.|+.- T Consensus 23 ~~~i~a~DhGv~~g~p~~gl~d~e~~v~~~~~~g~da 59 (265) T COG1830 23 RLLILAMDHGVEHGNPIEGLEDPENIVAKVAEAGADA 59 (265) T ss_pred CEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHCCCCE T ss_conf 8899953555115887634269999999998548998 No 126 >PRK12569 hypothetical protein; Provisional Probab=69.73 E-value=8.9 Score=18.70 Aligned_cols=89 Identities=13% Similarity=0.187 Sum_probs=62.7 Q ss_pred HHHHHHHHHCCCCE---EECHHHC--CCCHHHHH---HHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHCCCCCCCC Q ss_conf 99999988568928---7500111--68215679---9999985879707744100687637999999997089675665 Q gi|254780234|r 102 KYLVDIANETGADA---IAHGSTG--KGNDQVRF---ELSAYSLNSDIEIIAPWRHWSFKGRQDLIDFAEKHAIPIDKNK 173 (404) Q Consensus 102 ~~lv~~a~~~ga~~---iaHG~Tg--kGNDQvRF---e~~~~~l~P~l~viaP~Rd~~~~sRe~~i~ya~~~gIpv~~~~ 173 (404) ..+..+|+..|+.. =.||+-. .-+|..=. =.+++.++|++.++.|- -....+.|+++|+++-..- T Consensus 96 ~al~~~a~~~g~~l~hVKPHGALYn~~~~d~~la~~i~~ai~~~~~~l~l~~~~-------~s~~~~~A~~~Gl~~~~E~ 168 (245) T PRK12569 96 GALREFARAQGVRLQHVKPHGALYMHAARDEALARLLVEALQRLDPGLPLYCME-------ASVTYRIARELGQPVVREF 168 (245) T ss_pred HHHHHHHHHCCCCEEECCHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCEEEECC-------CHHHHHHHHHCCCCEEEEE T ss_conf 999999998399301114219999998569999999999999849884388528-------6399999998699746899 Q ss_pred CCCCCCCCCHHHHCCCCCCCCCCCCC Q ss_conf 67886415724311545763245845 Q gi|254780234|r 174 RGEAPFSIDTNLLHSSSEGRVLEDPS 199 (404) Q Consensus 174 ~~~~~yS~D~Nlwg~S~Egg~Ledp~ 199 (404) =.++.|.-|-.|.-|+..+.. ||. T Consensus 169 FADR~Y~~dG~Lv~R~~~gAv--d~~ 192 (245) T PRK12569 169 YADRDYDRSGSIVFTRRVGAL--DPQ 192 (245) T ss_pred EECCCCCCCCCEECCCCCCCC--CHH T ss_conf 765654899988418999999--999 No 127 >PRK09935 transcriptional regulator FimZ; Provisional Probab=69.72 E-value=1 Score=25.51 Aligned_cols=141 Identities=13% Similarity=0.124 Sum_probs=60.2 Q ss_pred HHHHHHHHHHHHH-CCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 2599999999871-898699999457871106899999997398079982008999998799997368654787420130 Q gi|254780234|r 16 LDTSIILKWLQVE-KGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYEGYYLLGTA 94 (404) Q Consensus 16 LDTSv~i~~L~~e-~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Yeg~Ypl~ts 94 (404) .|..-.+.+++.. .+..|+.++..- | .....+|...||.-++..+...+-..+-+.....++. | +... T Consensus 62 ~~g~~~i~~i~~~~p~~~ilvls~~~----~-~~~~~~a~~~Ga~g~l~K~~~~~~l~~ai~~v~~G~~-----~-~~~~ 130 (210) T PRK09935 62 TDGFTLLKRIKQIQETVKVLFLSSKS----E-CFYAGRAIRAGANGFVSKRKDLNDIYNAVQMILSGYS-----F-FPSE 130 (210) T ss_pred CCCCCHHHHHHHHCCCCCEEEEECCC----C-HHHHHHHHHCCCCEEEECCCCHHHHHHHHHHHHCCCC-----C-CCHH T ss_conf 88640567898738997089971767----2-9999999966877688678999999999999985995-----3-6989 Q ss_pred HHHHHHHHHHHHH------HHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHCC Q ss_conf 1243359999999------885689287500111682156799999985879707744100687637999999997089 Q gi|254780234|r 95 IARPLIAKYLVDI------ANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDLIDFAEKHAI 167 (404) Q Consensus 95 laRplia~~lv~~------a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~i~ya~~~gI 167 (404) ....+.....-.- -..+-..++..=|.|+.|.++--++.+..-.=+-.+-.=.|-.+.++|.+.+.||.+||+ T Consensus 131 ~~~~~~~~~~~~~~~~~~~Lt~RE~eVL~ll~~G~snkeIA~~L~iS~~TV~~h~~~I~~KL~v~nr~elv~~A~~~~L 209 (210) T PRK09935 131 TLNFIKSNKCSKGSSTDMPLSNREVTVLRYLANGLSNKEIAEQLLLSNKTISAHKSNIYGKLGLHSIVELIDYAKSHEL 209 (210) T ss_pred HHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHCCCCHHHHHHHHCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCC T ss_conf 9999873436666556789998999999999869999999989498899999999999998199999999999998599 No 128 >cd00958 DhnA Class I fructose-1,6-bisphosphate (FBP) aldolases of the archaeal type (DhnA homologs) found in bacteria and archaea. Catalysis of the enzymes proceeds via a Schiff-base mechanism like other class I aldolases, although this subfamily is clearly divergent based on sequence similarity to other class I and class II (metal dependent) aldolase subfamilies. Probab=69.46 E-value=9 Score=18.66 Aligned_cols=191 Identities=17% Similarity=0.161 Sum_probs=88.1 Q ss_pred CHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHH--HHH--HHHHHHHHHHHCCCCEEECH Q ss_conf 10689999999739807998200899999879999736865478742013012--433--59999999885689287500 Q gi|254780234|r 44 EELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYEGYYLLGTAIA--RPL--IAKYLVDIANETGADAIAHG 119 (404) Q Consensus 44 ~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Yeg~Ypl~tsla--Rpl--ia~~lv~~a~~~ga~~iaHG 119 (404) +|.+.+-+.+...|+. .++...- .++.|. +......-+-=+-=-+|++. ++- ..-.-|+-|-++||++|+-. T Consensus 21 ~d~~~~i~~~~~~g~d-ai~~~~G--~~~~~~-~~~~~~~~li~~ls~~t~~~~~~~~~~~l~~sVeeAvrlGAdaV~~~ 96 (235) T cd00958 21 EDPEETVKLAAEGGAD-AVALTKG--IARAYG-REYAGDIPLIVKLNGSTSLSPKDDNDKVLVASVEDAVRLGADAVGVT 96 (235) T ss_pred CCHHHHHHHHHHCCCC-EEEECHH--HHHHCC-CCCCCCCCEEEEECCCCCCCCCCCCHHHHHCCHHHHHHCCCCEEEEE T ss_conf 5989999999864999-9998888--997515-32157876699964676457888517766516999983555679999 Q ss_pred HHCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHH---------HCC-- Q ss_conf 111682156799999985879707744100687637999999997089675665678864157243---------115-- Q gi|254780234|r 120 STGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDLIDFAEKHAIPIDKNKRGEAPFSIDTNL---------LHS-- 188 (404) Q Consensus 120 ~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~i~ya~~~gIpv~~~~~~~~~yS~D~Nl---------wg~-- 188 (404) . .-|.++.+- .++. -.+..+.|.++|+|+=.- +|-...|+ -+. T Consensus 97 v-~~Gs~~E~~--~l~~------------------l~~v~~ea~~~G~Pll~~-----~yprG~~~~~~~d~~~ia~aaR 150 (235) T cd00958 97 V-YVGSEEERE--MLEE------------------LARVAAEAHKYGLPLIAW-----MYPRGPAVKNEKDPDLIAYAAR 150 (235) T ss_pred E-ECCCCHHHH--HHHH------------------HHHHHHHHHHHCCCEEEE-----EECCCCCCCCCCCHHHHHHHHH T ss_conf 8-689816899--9999------------------999999999839978999-----7416876556566899999999 Q ss_pred ---CCCCCCCCCCCCCCCHHHCCCCCCHHHCCCCCEEEEEEEECCEEEEECCE--ECCHHHHHHHHHHH--HHHCCCCCE Q ss_conf ---45763245845768201001465812279996389999950247875162--31899999999888--642163741 Q gi|254780234|r 189 ---SSEGRVLEDPSQPAPEYVYKMIVSPEEAPDTPTTIRIDFQRGDPIAINGQ--VMSPEVLLEQLNQY--GRCNGIGRI 261 (404) Q Consensus 189 ---S~Egg~Ledp~~~~pe~~~~~t~~p~~ap~~pe~v~I~Fe~G~PVainG~--~~~~~~li~~LN~i--gg~~GvGr~ 261 (404) ...+.+.-.++..-+ +.|.+.+. .+| +||=+-|- .-++.++++...+. +|..|+- T Consensus 151 ia~ELGADiVKv~y~g~~-e~f~~vv~--~~~-------------vPVviaGG~k~~~~~~~L~~v~~a~~aGa~Gv~-- 212 (235) T cd00958 151 IGAELGADIVKTKYTGDA-ESFKEVVE--GCP-------------VPVVIAGGPKKDSEEEFLKMVYDAMEAGAAGVA-- 212 (235) T ss_pred HHHHHCCCEEEECCCCCH-HHHHHHHH--HCC-------------CCEEECCCCCCCCHHHHHHHHHHHHHCCCCEEE-- T ss_conf 999978998982399998-99999997--089-------------989980899999999999999999976995676-- Q ss_pred ECCCCCCCCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 0007732021000111373789999999999 Q gi|254780234|r 262 DIVENRFVGIKSRGVYETPGGTILLHAHRAI 292 (404) Q Consensus 262 d~vEnr~vG~KsR~vYEaPga~iL~~Ahr~L 292 (404) =+|-||..|-..-+++|.+.+ T Consensus 213 ----------~GRNvfQ~~~p~~~~~al~~i 233 (235) T cd00958 213 ----------VGRNIFQRPDPVAMLRAISAV 233 (235) T ss_pred ----------ECCHHHCCCCHHHHHHHHHHH T ss_conf ----------364122588899999999974 No 129 >PRK08305 spoVFB dipicolinate synthase subunit B; Reviewed Probab=69.28 E-value=9.1 Score=18.63 Aligned_cols=41 Identities=24% Similarity=0.182 Sum_probs=30.2 Q ss_pred CCCCCCEEEEEECCCHHHH----HHHHHHHHHCCCEEEEEEEECCC Q ss_conf 9877778999914772599----99999987189869999945787 Q gi|254780234|r 1 MSRDVKKVVLAYSGGLDTS----IILKWLQVEKGLEVIVFIADLGQ 42 (404) Q Consensus 1 M~~~~kkVvlaySGGLDTS----v~i~~L~~e~g~eVi~~~~d~Gq 42 (404) |+-+-|||+|+.+||.=|= -.+..|+ +.|++|+++.-+--| T Consensus 1 M~L~gK~I~lgvTGSiaay~kv~~~ir~L~-~~GA~V~~ImS~~a~ 45 (195) T PRK08305 1 MSLKGKRIGFGLTGSHCTYEEVMPQIEKLV-AEGAEVRPIVSYTVQ 45 (195) T ss_pred CCCCCCEEEEEECCHHHHHHHHHHHHHHHH-HCCCEEEEEECHHHH T ss_conf 976898899996727998999999999999-879969999754477 No 130 >PRK05920 aromatic acid decarboxylase; Validated Probab=67.94 E-value=9.7 Score=18.45 Aligned_cols=40 Identities=28% Similarity=0.417 Sum_probs=30.0 Q ss_pred CCCCCCEEEEEECCCHHHHHHHHHHHH--HCCCEEEEEEEECC Q ss_conf 987777899991477259999999987--18986999994578 Q gi|254780234|r 1 MSRDVKKVVLAYSGGLDTSIILKWLQV--EKGLEVIVFIADLG 41 (404) Q Consensus 1 M~~~~kkVvlaySGGLDTSv~i~~L~~--e~g~eVi~~~~d~G 41 (404) |++ |||++++.||-=-.-+.+..|+. +.++||+.+.-+-+ T Consensus 1 m~~-mkrivvgITGASG~~ya~rll~~L~~~~~ev~lviS~~a 42 (205) T PRK05920 1 MSK-MKRIVLAITGASGAIYGVRLLECLLAADYEVHLVISKAA 42 (205) T ss_pred CCC-CCEEEEEEECHHHHHHHHHHHHHHHHCCCEEEEEECHHH T ss_conf 998-875999986542799999999999867998999986789 No 131 >PRK07226 fructose-bisphosphate aldolase; Provisional Probab=67.79 E-value=9.7 Score=18.43 Aligned_cols=208 Identities=15% Similarity=0.166 Sum_probs=99.7 Q ss_pred EEEEEEEECCC-------CCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCC------CCCHHHH Q ss_conf 69999945787-------1106899999997398079982008999998799997368654787420------1301243 Q gi|254780234|r 32 EVIVFIADLGQ-------GEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYEGYYLL------GTAIARP 98 (404) Q Consensus 32 eVi~~~~d~Gq-------~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Yeg~Ypl------~tslaRp 98 (404) ..+.+.+|-|. -+|.+.+-+.+...|+. .++...- +++. +.+. |.+.-|+ +|+++-. T Consensus 20 r~~ilp~DHG~~~Gp~~gl~d~~~~i~~~~~~g~d-ai~~~~G--~~~~-~~~~------~~~~~~lil~l~~~t~~~~~ 89 (266) T PRK07226 20 RTVIVPMDHGVSHGPIDGLVDMRETVNKVAEGGAD-AVLMHKG--LARH-GHRG------YGRDVGLIVHLSASTSLSPD 89 (266) T ss_pred CEEEEECCCCCCCCCCCCCCCHHHHHHHHHHCCCC-EEEECHH--HHHH-CCCC------CCCCCCEEEEECCCCCCCCC T ss_conf 88999777775448875535999999999863999-9996767--7864-3522------15788749994377314788 Q ss_pred HHH---HHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHCCCC-----C Q ss_conf 359---99999988568928750011168215679999998587970774410068763799999999708967-----5 Q gi|254780234|r 99 LIA---KYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDLIDFAEKHAIPI-----D 170 (404) Q Consensus 99 lia---~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~i~ya~~~gIpv-----~ 170 (404) .-. -.-|+-|-++||++|+-.. .-|.++.. ..+.--.+..+.|.++|+|+ | T Consensus 90 ~~~~~~~~sVeeAvrlGAdaV~~~v-~~Gs~~E~--------------------~~l~~l~~l~~ea~~~GlPll~~~yp 148 (266) T PRK07226 90 PNDKVLVGTVEDAIKLGADAVSVHV-NVGSETEA--------------------EMLEDLGRIARECEEWGMPLLAMMYP 148 (266) T ss_pred CCHHEEECCHHHHHHCCCCEEEEEE-ECCCCCHH--------------------HHHHHHHHHHHHHHHHCCCEEEEEEC T ss_conf 6401012149999855877899985-47998379--------------------99999999999999859956999734 Q ss_pred CCCCCCCCCCCCH-HHHCC-----CCCCCCCCCCCCCCCHHHCCCCCCHHHCCCCCEEEEEEEECCEEEEECCEE-C-CH Q ss_conf 6656788641572-43115-----457632458457682010014658122799963899999502478751623-1-89 Q gi|254780234|r 171 KNKRGEAPFSIDT-NLLHS-----SSEGRVLEDPSQPAPEYVYKMIVSPEEAPDTPTTIRIDFQRGDPIAINGQV-M-SP 242 (404) Q Consensus 171 ~~~~~~~~yS~D~-Nlwg~-----S~Egg~Ledp~~~~pe~~~~~t~~p~~ap~~pe~v~I~Fe~G~PVainG~~-~-~~ 242 (404) .++..+ ...|. .+-+. ...+.++-.++..-| +.|.+.+. .| .+||=+-|-+ . +. T Consensus 149 rG~~~~--~~~d~~~ia~aaRia~ELGADiVKv~y~gd~-e~f~~vv~--~~-------------~vPVliaGG~k~~~~ 210 (266) T PRK07226 149 RGPKIK--NEYDPEVVAHAARVGAELGADIIKTNYTGDP-ESFREVVE--GC-------------PVPVVIAGGPKTGSD 210 (266) T ss_pred CCCCCC--CCCCHHHHHHHHHHHHHHCCCEEEECCCCCH-HHHHHHHH--HC-------------CCCEEEECCCCCCCH T ss_conf 688777--7875999999999999967998983599998-99999998--47-------------987899578888988 Q ss_pred HHHHHHHHHHHHHCCCCCEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCC Q ss_conf 99999998886421637410007732021000111373789999999999865089 Q gi|254780234|r 243 EVLLEQLNQYGRCNGIGRIDIVENRFVGIKSRGVYETPGGTILLHAHRAIESISLD 298 (404) Q Consensus 243 ~~li~~LN~igg~~GvGr~d~vEnr~vG~KsR~vYEaPga~iL~~Ahr~LE~~~l~ 298 (404) .++++...+.-..-+-|.. =+|-||..|-..-+..|.+.+-.--.+ T Consensus 211 ~~~L~~v~~ai~aGa~Gv~----------~GRNvfQ~~~p~~~~~al~~iVH~~~s 256 (266) T PRK07226 211 LEFLQMVEDAMEAGAAGVA----------IGRNIFQHEDPAKITRAISAVVHEGAD 256 (266) T ss_pred HHHHHHHHHHHHCCCCEEE----------ECCHHHCCCCHHHHHHHHHHHHCCCCC T ss_conf 9999999999975994665----------261023579899999999998509999 No 132 >pfam04800 ETC_C1_NDUFA4 ETC complex I subunit conserved region. Family of pankaryotic NADH-ubiquinone oxidoreductase subunits (EC:1.6.5.3) (EC:1.6.99.3) from complex I of the electron transport chain initially identified in Neurospora crassa as a 21 kDa protein. Probab=67.47 E-value=2.7 Score=22.40 Aligned_cols=22 Identities=27% Similarity=0.481 Sum_probs=18.8 Q ss_pred CCCCHHHHHHHHHHHCCCCCCC Q ss_conf 8763799999999708967566 Q gi|254780234|r 151 SFKGRQDLIDFAEKHAIPIDKN 172 (404) Q Consensus 151 ~~~sRe~~i~ya~~~gIpv~~~ 172 (404) .|.++|+.|+||++|||+..+. T Consensus 54 ~F~skE~Ai~YA~k~gi~y~V~ 75 (101) T pfam04800 54 TFPTKEAAIAFAERQGWEYDVE 75 (101) T ss_pred EECCHHHHHHHHHHCCCEEEEE T ss_conf 8799999999999849828995 No 133 >cd06309 PBP1_YtfQ_like Periplasmic binding domain of ABC-type YtfQ-like transport systems. Periplasmic binding domain of ABC-type YtfQ-like transport systems. The YtfQ protein from Escherichia coli is up-regulated under glucose-limited conditions and shares homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily. Members of this group are predicted to be involved in the transport of sugar-containing molecules across cellular and organellar membranes; however their ligand specificity is not determined experimentally. Probab=65.68 E-value=11 Score=18.15 Aligned_cols=50 Identities=16% Similarity=0.232 Sum_probs=23.1 Q ss_pred ECHHHCCCCHHHHHHHHHHHH--CCCCEEEEEEC-CCCCC-CHHHHHHHHHHHC Q ss_conf 500111682156799999985--87970774410-06876-3799999999708 Q gi|254780234|r 117 AHGSTGKGNDQVRFELSAYSL--NSDIEIIAPWR-HWSFK-GRQDLIDFAEKHA 166 (404) Q Consensus 117 aHG~TgkGNDQvRFe~~~~~l--~P~l~viaP~R-d~~~~-sRe~~i~ya~~~g 166 (404) -.|..+..+.+-|.+....+| .|+++++.... +|+.. +++..-+++++++ T Consensus 130 i~g~~~~~~~~~R~~Gf~~~l~~~~~~~iv~~~~~~~~~~~~~~~~~~~L~~~p 183 (273) T cd06309 130 LQGTVGSSVAIDRKKGFAEVIKKYPNMKIVASQTGDFTRAKGKEVMEALLKAHG 183 (273) T ss_pred EECCCCCHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHHHHHHHHCC T ss_conf 828989859999999999999878998676541234688999999999997278 No 134 >COG0056 AtpA F0F1-type ATP synthase, alpha subunit [Energy production and conversion] Probab=65.31 E-value=11 Score=18.10 Aligned_cols=80 Identities=16% Similarity=0.197 Sum_probs=52.6 Q ss_pred HHHHHHHHHHHHCCCEEEEEEEECCCCC-HHHHHHHHHHHCCCEEEE-EECHH--HHHHHHHHHHHHHCCCCCCCCCCCC Q ss_conf 5999999998718986999994578711-068999999973980799-82008--9999987999973686547874201 Q gi|254780234|r 17 DTSIILKWLQVEKGLEVIVFIADLGQGE-ELKIASDKARLLGAKEVY-VKDLR--REFVRDFVFPMFRANALYEGYYLLG 92 (404) Q Consensus 17 DTSv~i~~L~~e~g~eVi~~~~d~Gq~~-d~~~~~~~A~~~Ga~~~~-v~D~r--~ef~~~~i~~~I~ana~Yeg~Ypl~ 92 (404) -|++++--+..+++.+|+|++|-+||+. .+-.+-++-+..||-++- |+.+- +--.-.|+. T Consensus 175 KTaIAidtIiNQk~~~v~CIYVAIGQK~stva~vv~tL~e~gAm~yTiVV~AsASd~a~lqYLa---------------- 238 (504) T COG0056 175 KTAIAIDTIINQKGSGVKCIYVAIGQKRSTVANVVRTLEEHGAMDYTIVVAASASDSAPLQYLA---------------- 238 (504) T ss_pred CCHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHHHHHHHHCCCCCCEEEEEECCCCCHHHHHHH---------------- T ss_conf 3110278898355689479999815437789999999997597513399981477623555530---------------- Q ss_pred CCHHHHHHHHHHHHHHHHCCCCEEE Q ss_conf 3012433599999998856892875 Q gi|254780234|r 93 TAIARPLIAKYLVDIANETGADAIA 117 (404) Q Consensus 93 tslaRplia~~lv~~a~~~ga~~ia 117 (404) |-.+-.+.|.-+..|-+++- T Consensus 239 -----Py~g~a~aE~f~~~G~dvLI 258 (504) T COG0056 239 -----PYAGCAMAEYFRDNGKDVLI 258 (504) T ss_pred -----HHHHHHHHHHHHHCCCEEEE T ss_conf -----22555889999865974899 No 135 >cd06289 PBP1_MalI_like Ligand-binding domain of MalI, a transcription regulator of the maltose system of Escherichia coli and its close homologs from other bacteria. This group includes the ligand-binding domain of MalI, a transcription regulator of the maltose system of Escherichia coli and its close homologs from other bacteria. They are members of the LacI-GalR family of repressor proteins which are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor. Probab=65.16 E-value=11 Score=18.08 Aligned_cols=63 Identities=17% Similarity=0.170 Sum_probs=22.0 Q ss_pred HHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHCCCC Q ss_conf 9999988568928750011168215679999998587970774410068763799999999708967 Q gi|254780234|r 103 YLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDLIDFAEKHAIPI 169 (404) Q Consensus 103 ~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~i~ya~~~gIpv 169 (404) .+-+.|++.|-+.+.+-+......|.++=..+....++.-|+.|.++ +..+.++.+.+.|||+ T Consensus 20 gi~~~a~~~gy~lll~~t~~~~~~e~~~i~~l~~~~vDGiIi~~~~~----~~~~~~~~l~~~~iPv 82 (268) T cd06289 20 GLEEVLEEAGYTVFLANSGEDVERQEQLLSTMLEHGVAGIILCPAAG----TSPDLLKRLAESGIPV 82 (268) T ss_pred HHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHHHCCCCEEEEECCCC----CCHHHHHHHHHCCCCE T ss_conf 99999998699899995899989999999999965999899946888----9999999999759989 No 136 >TIGR00853 pts-lac PTS system, lactose/cellobiose family IIB component; InterPro: IPR014350 The phosphoenolpyruvate-dependent sugar phosphotransferase system (PTS) , is a major carbohydrate transport system in bacteria. The PTS catalyses the phosphorylation of incoming sugar substrates and coupled with translocation across the cell membrane, makes the PTS a link between the uptake and metabolism of sugars. The general mechanism of the PTS is the following: a phosphoryl group from phosphoenolpyruvate (PEP) is transferred via a signal transduction pathway, to enzyme I (EI) which in turn transfers it to a phosphoryl carrier, the histidine protein (HPr). Phospho-HPr then transfers the phosphoryl group to a sugar-specific permease, a membrane-bound complex known as enzyme 2 (EII), which transports the sugar to the cell. EII consists of at least three structurally distinct domains IIA, IIB and IIC . These can either be fused together in a single polypeptide chain or exist as two or three interactive chains, formerly called enzymes II (EII) and III (EIII). The first domain (IIA or EIIA) carries the first permease-specific phosphorylation site, a histidine which is phosphorylated by phospho-HPr. The second domain (IIB or EIIB) is phosphorylated by phospho-IIA on a cysteinyl or histidyl residue, depending on the sugar transported. Finally, the phosphoryl group is transferred from the IIB domain to the sugar substrate concomitantly with the sugar uptake processed by the IIC domain. This third domain (IIC or EIIC) forms the translocation channel and the specific substrate-binding site. An additional transmembrane domain IID, homologous to IIC, can be found in some PTSs, e.g. for mannose , , , . According to structural and sequence analyses, the PTS EIIB domain (2.7.1.69 from EC) can be divided in five groups , , : The PTS EIIB type 1 domain, which is found in the Glucose class of PTS, has an average length of about 80 amino acids. It forms a split alpha/beta sandwich composed of an antiparallel sheet (beta 1 to beta 4) and three alpha helices superimposed onto one side of the sheet. The phosphorylation site (Cys) is located at the end of the first beta strand on a protrusion formed by the edge of beta 1 and the reverse turn between beta 1 and beta 2 . The PTS EIIB type 2 domain, which is found in the Mannitol class of PTS, has an average length of about 100 amino acids. It consists of a four stranded parallel beta sheet flanked by two alpha helices (alpha 1 and 3) on one face and helix alpha 2 on the opposite face, with a characteristic Rossmann fold comprising two right-handed beta-alpha-beta motifs. The phosphorylation site (Cys) is located at the N-terminus of the domain, in the first beta strand. The PTS EIIB type 3 domain, which is found in the Lactose class of PTS, has an average length of about 100 amino acids. It is composed of a central four-stranded parallel open twisted beta sheet, which is flanked by three alpha helices on the concave side and two on the convex side of the beta sheet. The phosphorylation site (Cys) is located in the C-terminal end of the first beta strand . The PTS EIIB type 4 domain, which is found in the Mannose class of PTS, has an average length of about 160 amino acids. It has a central core of seven parallel beta strands surrounded by a total of six alpha-helices. Three helices cover the front face, one the back face with the remaining two capping the central beta sheet at the top and bottom. The phosphorylation site (His) is located at the suface exposed loop between strand 1 and helix 1 . The PTS EIIB type 5 domain, which is found in the Sorbitol class of PTS, has an average length of about 190 amino acids. The phosphorylation site (Cys) is located in the N-terminus of the domain. The EIIB type 3 protein is often found downstream of EIIC type 3. This entry hits fewer proteins than IPR013012 from INTERPRO. . Probab=64.79 E-value=11 Score=18.03 Aligned_cols=74 Identities=19% Similarity=0.199 Sum_probs=35.1 Q ss_pred CCCCCCC-HHHHHHHHHHHHHHHHCCCCEEEC--HHHCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHH Q ss_conf 7420130-124335999999988568928750--0111682156799999985879707744100687637999999997 Q gi|254780234|r 88 YYLLGTA-IARPLIAKYLVDIANETGADAIAH--GSTGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDLIDFAEK 164 (404) Q Consensus 88 ~Ypl~ts-laRplia~~lv~~a~~~ga~~iaH--G~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~i~ya~~ 164 (404) .|.++.+ .+=-|.+.++=+.|++.|..+.+. +-|..++---+|++. .|+|++. + --+++-.-..+ T Consensus 49 iLl~CaaGmstsLLv~~l~k~A~~~~~~~~i~A~~~~~~~e~~~~~d~V--lLaPQ~~---------~-~~~~lk~~t~~ 116 (142) T TIGR00853 49 ILLLCAAGMSTSLLVNKLNKAAKEYGVPVKIAAVSYGAAMEILDDADVV--LLAPQVA---------Y-MLPDLKKETDK 116 (142) T ss_pred EEEEECCCCCHHHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHCEE--EEHHHHH---------H-HHHHHHHHCCC T ss_conf 6888769735478999999999845997588840576344335350431--1203267---------5-57999985135 Q ss_pred HCCCCCCCC Q ss_conf 089675665 Q gi|254780234|r 165 HAIPIDKNK 173 (404) Q Consensus 165 ~gIpv~~~~ 173 (404) .||||..+. T Consensus 117 ~Gip~~~i~ 125 (142) T TIGR00853 117 KGIPVEVIN 125 (142) T ss_pred CCCCEEEEC T ss_conf 897536638 No 137 >COG2403 Predicted GTPase [General function prediction only] Probab=64.34 E-value=11 Score=17.97 Aligned_cols=100 Identities=21% Similarity=0.237 Sum_probs=69.1 Q ss_pred CCCCCCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEEC--------------------C----CCCHHHHHHHHHHHC Q ss_conf 9877778999914772599999999871898699999457--------------------8----711068999999973 Q gi|254780234|r 1 MSRDVKKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADL--------------------G----QGEELKIASDKARLL 56 (404) Q Consensus 1 M~~~~kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~--------------------G----q~~d~~~~~~~A~~~ 56 (404) |++..+||+..--||-|--+.=.-++....|||++|++-. | -.+|.+.+++-.+.. T Consensus 2 ~m~a~kRviiLgaggrdfhv~n~a~r~~~~yevvaf~aaqiiG~~er~yppsleg~~~p~Gvpi~~~k~~~~lek~ire~ 81 (449) T COG2403 2 AMKARKRVIILGAGGRDFHVFNVALRDNPEYEVVAFTAAQIIGGTERIYPPSLEGVLYPLGVPILPEKDYDDLEKIIREK 81 (449) T ss_pred CCCCCEEEEEEECCCCCCCHHHHHHCCCCCCEEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHH T ss_conf 87775269999506756622467860687524788899883277665389970113233783304664689999999870 Q ss_pred CCEEEEEECHHH---HHHHHHHHHHHHCCCCCCCCCCCCCC--HHHHHHH Q ss_conf 980799820089---99998799997368654787420130--1243359 Q gi|254780234|r 57 GAKEVYVKDLRR---EFVRDFVFPMFRANALYEGYYLLGTA--IARPLIA 101 (404) Q Consensus 57 Ga~~~~v~D~r~---ef~~~~i~~~I~ana~Yeg~Ypl~ts--laRplia 101 (404) ++ +..|+|+-+ +-+=+.+...+.++|.|.--=|+.|- +.+|+|+ T Consensus 82 ~V-D~~VlaySDvs~e~v~~IaS~vLs~GA~f~~~gP~et~~~~ekPvia 130 (449) T COG2403 82 DV-DIVVLAYSDVSYEHVFRIASRVLSAGADFKELGPKETMLKLEKPVIA 130 (449) T ss_pred CC-CEEEEECCCCCHHHHHHHHHHHHHCCCEEEEECCCHHHHHHCCCEEE T ss_conf 76-61687701289899878999997478536872752776651485599 No 138 >TIGR00290 MJ0570_dom MJ0570-related uncharacterized domain; InterPro: IPR002761 This domain is about 200 amino acids long with a strongly conserved motif SGGKD at the N-terminal. The structure of Q8U2K6 from Pyrococcus furiosus has been resolved to 2.7A and is suggested to be a putative N-type pytophosphatase. In some members of the family e.g. Q12429 from SWISSPROT, this domain is associated with IPR006175 from INTERPRO, another domain of unknown function. Proteins with this uncharacterised domain include two apparent ortholog families in the archaea, one of which is universal among the first four completed archaeal genomes. The domain comprises the full length of the archaeal proteins and the first third of fungal proteins. . Probab=64.19 E-value=11 Score=17.96 Aligned_cols=145 Identities=23% Similarity=0.261 Sum_probs=82.1 Q ss_pred EEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECC-CCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHH----C Q ss_conf 89999147725999999998718986999994578-711068999999973980799820089999987999973----6 Q gi|254780234|r 7 KVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLG-QGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFR----A 81 (404) Q Consensus 7 kVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~G-q~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~----a 81 (404) |++..+|||-|+.+.+.|.. +.+.+|..+..-.- ..++ ..+. ....-+. .++++++..+ ..-.|.++ - T Consensus 2 ~~~~~~~gg~d~~~~~~~~~-~~~~~~~~l~~~~~~~~~~-~~~d-~~~~~~~-~~~~~~~~~~---~~g~p~~~~~~~~ 74 (265) T TIGR00290 2 KVAALYSGGKDSCLALYHAL-KEGHEVISLVNVFPKENEE-RELD-SYMFHGP-NLHLLDLLAE---SVGIPLIKRSILR 74 (265) T ss_pred CEEEEEECCCHHHHHHHHHH-HCCCCCHHHHHHCCCCCCH-HHHH-HHHHHCC-CHHHHHHHHH---HHCCHHHHHHHHH T ss_conf 24787507741478999987-4176100121101345502-4444-4444214-4257776676---5153045565541 Q ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHHHHCC-----CCEEECHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHH Q ss_conf 8654787420130124335999999988568-----92875001116821567999999858797077441006876379 Q gi|254780234|r 82 NALYEGYYLLGTAIARPLIAKYLVDIANETG-----ADAIAHGSTGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQ 156 (404) Q Consensus 82 na~Yeg~Ypl~tslaRplia~~lv~~a~~~g-----a~~iaHG~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe 156 (404) |.. +..-+-++-...--.+.+....+..+ ++.+.-|+....-..-|++..++.++ ++.++|.- .-...+ T Consensus 75 ~~~--~~~~~~~~~~~~~~~~~l~~~~~~~~~~~pd~~~~~~g~~~~~~~~~~~~~~c~~~g--~~~~~plw--~~~~~~ 148 (265) T TIGR00290 75 NTS--NNVELYTSGTEEDEVEELKGLLGTLDDKGPDVEAVVVGAIYSEYQKTRIERVCRELG--LKPFAPLW--HRDPLE 148 (265) T ss_pred CCC--CCCEECCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHC--CHHHHHHC--CCCHHH T ss_conf 234--441000011014568899988876431144212332323555556667888887723--11211210--558789 Q ss_pred HHHHHHHH Q ss_conf 99999997 Q gi|254780234|r 157 DLIDFAEK 164 (404) Q Consensus 157 ~~i~ya~~ 164 (404) .....+.. T Consensus 149 ~~~~~~~~ 156 (265) T TIGR00290 149 LLEEFVEE 156 (265) T ss_pred HHHHHHHH T ss_conf 99998864 No 139 >cd06302 PBP1_LsrB_Quorum_Sensing Periplasmic binding domain of autoinducer-2 (AI-2) receptor LsrB from Salmonella typhimurium and its close homologs. Periplasmic binding domain of autoinducer-2 (AI-2) receptor LsrB from Salmonella typhimurium and its close homologs from other bacteria. The members of this group are homologous to a family of periplasmic pentose/hexose sugar-binding proteins that function as the primary receptors for chemotaxis and transporters of many sugar based solutes in bacteria and archaea and that are a member of the type I periplasmic binding protein superfamily. LsrB binds a chemically distinct form of the AI-2 signal that lacks boron, in contrast to the Vibrio harveyi AI-2 signaling molecule that has an unusual furanosyl borate diester. Hence, many bacteria coordinate their gene expression according to the local density of their population by producing species specific AI-2. This process of quorum sensing allows LsrB to function as a periplasmic AI-2 binding p Probab=63.75 E-value=12 Score=17.90 Aligned_cols=66 Identities=21% Similarity=0.223 Sum_probs=29.9 Q ss_pred HHHHHHHHHHCCCCEEECH-HHCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHCCCC Q ss_conf 9999999885689287500-11168215679999998587970774410068763799999999708967 Q gi|254780234|r 101 AKYLVDIANETGADAIAHG-STGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDLIDFAEKHAIPI 169 (404) Q Consensus 101 a~~lv~~a~~~ga~~iaHG-~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~i~ya~~~gIpv 169 (404) .+.+-+.|+++|++.+--+ +.+--+.|+..=..+-+-.++.-+++|+ | ...-...++-|++.|||| T Consensus 18 ~~Gae~aA~e~G~~v~~~~~~~~D~~~Q~~~ie~~Ia~gvDaIii~p~-d--~~a~~~~i~~A~~aGIpV 84 (298) T cd06302 18 EEGAKEAAKELGVDAIYVGPTTADAAGQVQIIEDLIAQGVDAIAVVPN-D--PDALEPVLKKAREAGIKV 84 (298) T ss_pred HHHHHHHHHHHCCEEEEECCCCCCHHHHHHHHHHHHHCCCCEEEEECC-C--HHHHHHHHHHHHHCCCEE T ss_conf 999999999979999998899899999999999999739988999068-7--788899999998679869 No 140 >PRK08654 pyruvate carboxylase subunit A; Validated Probab=63.31 E-value=12 Score=17.85 Aligned_cols=31 Identities=19% Similarity=0.462 Sum_probs=17.9 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEE Q ss_conf 789999147725999999998718986999994 Q gi|254780234|r 6 KKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIA 38 (404) Q Consensus 6 kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~ 38 (404) |||++|=-|=.---++ .-++ +.|++.+++.. T Consensus 3 ~kvLIANRGEIA~Rii-Ra~r-elGi~tVaVyS 33 (497) T PRK08654 3 KKILIANRGEIAIRVM-RACR-ELGIKTVAVYS 33 (497) T ss_pred CEEEEECCCHHHHHHH-HHHH-HCCCEEEEECC T ss_conf 6089987879999999-9999-85993999888 No 141 >PRK08339 short chain dehydrogenase; Provisional Probab=63.02 E-value=12 Score=17.81 Aligned_cols=57 Identities=25% Similarity=0.281 Sum_probs=33.4 Q ss_pred EEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCC--CEEEEEECHHHH Q ss_conf 99991477259999999987189869999945787110689999999739--807998200899 Q gi|254780234|r 8 VVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQGEELKIASDKARLLG--AKEVYVKDLRRE 69 (404) Q Consensus 8 VvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~G--a~~~~v~D~r~e 69 (404) +|..=|+|+--.++..++ +.|+.|+.+. . ..+.++...++....+ ...++..|+.++ T Consensus 12 lITG~s~GIG~aiA~~la--~~Ga~V~i~~--r-~~~~l~~~~~~l~~~~~~~~~~~~~D~~~~ 70 (263) T PRK08339 12 FTTASSKGIGFGVARVLA--RAGADVIILS--R-NEENLKRAKEKIKSESDVEVHYIVADLTKR 70 (263) T ss_pred EEECCCCHHHHHHHHHHH--HCCCEEEEEE--C-CHHHHHHHHHHHHHHCCCCEEEEECCCCCH T ss_conf 991626099999999999--8699999997--9-889999999999850498579998489999 No 142 >PRK07165 F0F1 ATP synthase subunit alpha; Validated Probab=62.54 E-value=12 Score=17.75 Aligned_cols=54 Identities=19% Similarity=0.125 Sum_probs=41.2 Q ss_pred EEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCC-CHHHHHHHHHHHCCCEEEEE Q ss_conf 9914772599999999871898699999457871-10689999999739807998 Q gi|254780234|r 10 LAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQG-EELKIASDKARLLGAKEVYV 63 (404) Q Consensus 10 laySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~-~d~~~~~~~A~~~Ga~~~~v 63 (404) ++=++-=-|++++--+..+++.+|+|+.+-+||. .++..+-+.-...||-++-+ T Consensus 149 iGDrqTGKTaIAldtIiNQk~~~V~cIYVAIGQK~s~Va~iv~~L~~~gAm~yTi 203 (507) T PRK07165 149 IGDRQTGKTHIALNTIINQKNRNVKCIYVAIGQKRENLSRIYETLKEHDALKNTI 203 (507) T ss_pred ECCCCCCHHHHHHHHHHHCCCCCCEEEEEEECCCHHHHHHHHHHHHHCCCHHHEE T ss_conf 3578877889999999845468958999983101779999999998779756525 No 143 >COG2086 FixA Electron transfer flavoprotein, beta subunit [Energy production and conversion] Probab=62.39 E-value=12 Score=17.73 Aligned_cols=101 Identities=23% Similarity=0.202 Sum_probs=69.4 Q ss_pred HHHHHH-HCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHH Q ss_conf 999987-1898699999457871106899999997398079982008999998799997368654787420130124335 Q gi|254780234|r 22 LKWLQV-EKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYEGYYLLGTAIARPLI 100 (404) Q Consensus 22 i~~L~~-e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Yeg~Ypl~tslaRpli 100 (404) ...||+ .+|.+|+++++ |. ...++..++|+..|+.+.+.+.-+. | ++--|+. . T Consensus 46 AlrLke~~~~~eV~vlt~--Gp-~~a~~~lr~aLAmGaDraili~d~~-~---------------~~~d~~~-------t 99 (260) T COG2086 46 ALRLKEKGYGGEVTVLTM--GP-PQAEEALREALAMGADRAILITDRA-F---------------AGADPLA-------T 99 (260) T ss_pred HHHHHCCCCCCEEEEEEE--CC-HHHHHHHHHHHHCCCCEEEEEECCC-C---------------CCCCHHH-------H T ss_conf 998614688866999994--64-5339999999854887599970322-3---------------6755899-------9 Q ss_pred HHHHHHHHHHCCCCEEECHH--HCCCCHHHHHHHHHHHHCCCCEEEEEEC Q ss_conf 99999998856892875001--1168215679999998587970774410 Q gi|254780234|r 101 AKYLVDIANETGADAIAHGS--TGKGNDQVRFELSAYSLNSDIEIIAPWR 148 (404) Q Consensus 101 a~~lv~~a~~~ga~~iaHG~--TgkGNDQvRFe~~~~~l~P~l~viaP~R 148 (404) |+.+....++.+.+.|--|+ +--.+-|+=+.++-..=-|.+...+-+. T Consensus 100 a~~Laa~~~~~~~~LVl~G~qa~D~~t~qvg~~lAe~Lg~P~~t~v~~i~ 149 (260) T COG2086 100 AKALAAAVKKIGPDLVLTGKQAIDGDTGQVGPLLAELLGWPQVTYVSKIE 149 (260) T ss_pred HHHHHHHHHHCCCCEEEEECCCCCCCCCCHHHHHHHHHCCCEEEEEEEEE T ss_conf 99999999874998899813443576466589999986985053378899 No 144 >cd06390 PBP1_iGluR_AMPA_GluR1 N-terminal leucine/isoleucine/valine-binding protein (LIVBP)-like domain of the GluR1 subunit of the AMPA receptor. N-terminal leucine/isoleucine/valine-binding protein (LIVBP)-like domain of the GluR1 subunit of the AMPA (alpha-amino-3-hydroxy-5-methyl-4-isoxazolepropionic acid) receptor. The AMPA receptor is a member of the glutamate-receptor ion channels (iGluRs) which are the major mediators of excitatory synaptic transmission in the central nervous system. AMPA receptors are composed of four types of subunits (GluR1, GluR2, GluR3, and GluR4) which combine to form a tetramer and play an important role in mediating the rapid excitatory synaptic current. Furthermore, this N-terminal domain of the iGluRs has homology with LIVBP, a bacterial periplasmic binding protein, as well as with the structurally related glutamate-binding domain of the G-protein-coupled metabotropic receptors (mGluRs). Probab=61.87 E-value=12 Score=17.67 Aligned_cols=42 Identities=7% Similarity=0.006 Sum_probs=19.8 Q ss_pred HHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCC Q ss_conf 4335999999988568928750011168215679999998587 Q gi|254780234|r 97 RPLIAKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNS 139 (404) Q Consensus 97 Rplia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P 139 (404) ||-+.++++++.+..|=.-++==. -......|..--++..++ T Consensus 101 ~P~l~~A~~dlv~~~~W~~f~iiY-d~d~gL~rLq~ll~~~~~ 142 (364) T cd06390 101 RPELQDALISVIEHYKWQKFVYIY-DADRGLSVLQKVLDTAAE 142 (364) T ss_pred CCCHHHHHHHHHHHCCCCEEEEEE-ECCHHHHHHHHHHHHHHH T ss_conf 676799999999976996899999-382768999999986420 No 145 >COG0075 Serine-pyruvate aminotransferase/archaeal aspartate aminotransferase [Amino acid transport and metabolism] Probab=61.33 E-value=13 Score=17.60 Aligned_cols=98 Identities=18% Similarity=0.203 Sum_probs=54.3 Q ss_pred EEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 99914772599999999871898699999457871106899999997398079982008999998799997368654787 Q gi|254780234|r 9 VLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYEGY 88 (404) Q Consensus 9 vlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Yeg~ 88 (404) ||..+.|-=.--...+++ .+|.+|+.+.+.-|-.-+.+++++.-.+-+..+++.+-- +- T Consensus 83 VLv~~nG~FG~R~~~ia~-~~g~~v~~~~~~wg~~v~p~~v~~~L~~~~~~~~V~~vH--------------------~E 141 (383) T COG0075 83 VLVVVNGKFGERFAEIAE-RYGAEVVVLEVEWGEAVDPEEVEEALDKDPDIKAVAVVH--------------------NE 141 (383) T ss_pred EEEEECCHHHHHHHHHHH-HHCCCEEEEECCCCCCCCHHHHHHHHHCCCCCCEEEEEE--------------------CC T ss_conf 999827807779999999-829966998478888799999999985289950899981--------------------15 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHH Q ss_conf 4201301243359999999885689287500111682156799 Q gi|254780234|r 89 YLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQVRFE 131 (404) Q Consensus 89 Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe 131 (404) -||.+--|+ +.+.+++++.|+-.|..+-|+-|.+-++++ T Consensus 142 --TSTGvlnpl--~~I~~~~k~~g~l~iVDaVsS~Gg~~~~vd 180 (383) T COG0075 142 --TSTGVLNPL--KEIAKAAKEHGALLIVDAVSSLGGEPLKVD 180 (383) T ss_pred --CCCCCCCCH--HHHHHHHHHCCCEEEEEECCCCCCCCCCHH T ss_conf --752013769--999999997298899982345788125444 No 146 >PRK10768 ribonucleoside hydrolase RihC; Provisional Probab=60.78 E-value=13 Score=17.54 Aligned_cols=48 Identities=13% Similarity=0.217 Sum_probs=37.1 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHH Q ss_conf 7899991477259999999987189869999945787110689999999 Q gi|254780234|r 6 KKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQGEELKIASDKAR 54 (404) Q Consensus 6 kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~ 54 (404) +||++=--.|.|-.+++.++......||.++|+--|- -.++...+.+. T Consensus 3 ~kvIiDtD~G~DDa~Al~lal~~~~~~v~gIT~v~GN-~~~~~~~~N~~ 50 (304) T PRK10768 3 LPIILDTDPGIDDAVAIAAALFAPELDLQLITTVAGN-VSVEKTTRNAL 50 (304) T ss_pred CCEEEECCCCHHHHHHHHHHHHCCCCEEEEEEECCCC-CCHHHHHHHHH T ss_conf 8889989997689999999987899849999974798-77999999999 No 147 >PRK13209 L-xylulose 5-phosphate 3-epimerase; Reviewed Probab=60.43 E-value=13 Score=17.50 Aligned_cols=122 Identities=14% Similarity=0.163 Sum_probs=70.8 Q ss_pred CCCHHHHHHHHHHHHHCCCEEEEEEEECC---------CCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCC Q ss_conf 47725999999998718986999994578---------711068999999973980799820089999987999973686 Q gi|254780234|r 13 SGGLDTSIILKWLQVEKGLEVIVFIADLG---------QGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANA 83 (404) Q Consensus 13 SGGLDTSv~i~~L~~e~g~eVi~~~~d~G---------q~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana 83 (404) -.+++=-=.+...| +-||+-|=+.+|-- ..++...+++-+...|.. +..+ |.. T Consensus 18 ~~~~sw~e~~~~ak-~~Gfd~iElsiDe~d~~~~rL~w~~~~~~~ir~~~~~~gi~-i~s~----------------cls 79 (283) T PRK13209 18 PAGECWLEKLRIAK-TAGFDFVEMSVDETDERLARLDWSREQRLALVNALVETGFR-VNSM----------------CLS 79 (283) T ss_pred CCCCCHHHHHHHHH-HCCCCEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHCCCC-CEEE----------------ECC T ss_conf 89999999999999-85998799842685310035899999999999999981998-6033----------------054 Q ss_pred CCCCCCCCCCCHH-----HHHHHHHHHHHHHHCCCCEEECHH----HCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCC Q ss_conf 5478742013012-----433599999998856892875001----1168215679999998587970774410068763 Q gi|254780234|r 84 LYEGYYLLGTAIA-----RPLIAKYLVDIANETGADAIAHGS----TGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKG 154 (404) Q Consensus 84 ~Yeg~Ypl~tsla-----Rplia~~lv~~a~~~ga~~iaHG~----TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~s 154 (404) +--+||++++=. .--+.+++++.|.++|+..|.-.+ .+..+.+-| .+-..+ T Consensus 80 -~~~~~Pl~S~D~~~R~~~~e~~~kaI~lA~~LGi~~I~lag~dv~~~~~~~e~~-------------------~~f~e~ 139 (283) T PRK13209 80 -AHRRFPLGSEDDAVRAQGLEIMRKAIQLAQDLGIRVIQLAGYDVYYEEANNETR-------------------RRFIDG 139 (283) T ss_pred -CCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCHHHH-------------------HHHHHH T ss_conf -555799999799999999999999999999809998996887667887859999-------------------999999 Q ss_pred HHHHHHHHHHHCCCCCCC Q ss_conf 799999999708967566 Q gi|254780234|r 155 RQDLIDFAEKHAIPIDKN 172 (404) Q Consensus 155 Re~~i~ya~~~gIpv~~~ 172 (404) -.+..+||+++||.+... T Consensus 140 L~~~~~~A~~~gV~L~iE 157 (283) T PRK13209 140 LKESVEMASRYSVTLAFE 157 (283) T ss_pred HHHHHHHHHHHCCEEEEE T ss_conf 999999999859989994 No 148 >PRK11864 2-ketoisovalerate ferredoxin oxidoreductase subunit beta; Provisional Probab=60.16 E-value=13 Score=17.49 Aligned_cols=60 Identities=23% Similarity=0.305 Sum_probs=45.5 Q ss_pred CHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHH-HHHHHHHHCCC-CEEEEEE-CCCCCC Q ss_conf 012433599999998856892875001116821567-99999985879-7077441-006876 Q gi|254780234|r 94 AIARPLIAKYLVDIANETGADAIAHGSTGKGNDQVR-FELSAYSLNSD-IEIIAPW-RHWSFK 153 (404) Q Consensus 94 slaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvR-Fe~~~~~l~P~-l~viaP~-Rd~~~~ 153 (404) +-+.+--.+-+..+|...|+.+||.++.+.-+|.++ +..+...-+|. +.+++|. .-|.+. T Consensus 153 p~Gk~~~kkdl~~i~~a~g~~YVA~~~~a~~~~l~~~i~kA~~~~G~s~i~v~spCp~gw~~~ 215 (300) T PRK11864 153 PGGKRERKKPVADIMAAHGVPYVATASIAYPEDFIRKLKKAAEIPGFKFIHLLAPCPPGWRFD 215 (300) T ss_pred CCCCCCCCCCHHHHHHHCCCCEEEEEECCCHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCC T ss_conf 999878998999999975998799985569999999999997189987999978895658868 No 149 >COG1341 Predicted GTPase or GTP-binding protein [General function prediction only] Probab=59.75 E-value=13 Score=17.42 Aligned_cols=118 Identities=26% Similarity=0.281 Sum_probs=61.4 Q ss_pred CEEEEEECCCHHH--HHHHHHHHHH---CCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHH Q ss_conf 7899991477259--9999999871---8986999994578711068999999973980799820089999987999973 Q gi|254780234|r 6 KKVVLAYSGGLDT--SIILKWLQVE---KGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFR 80 (404) Q Consensus 6 kkVvlaySGGLDT--Sv~i~~L~~e---~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ 80 (404) +.-+++-=|+.|| |....||..+ +|+.|.-+-+|+||++=.--. ...++-....+.++. T Consensus 72 ~~~~vmvvG~vDSGKSTLt~~LaN~~l~rG~~v~iiDaDvGQ~ei~pPg---~ISL~~~~s~~~~L~------------- 135 (398) T COG1341 72 KVGVVMVVGPVDSGKSTLTTYLANKLLARGRKVAIIDADVGQSEIGPPG---FISLAFPESPVISLS------------- 135 (398) T ss_pred CCCEEEEECCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCC---EEEEECCCCCCCCHH------------- T ss_conf 8738999898676788999999988764474189996899976667974---677412567777777------------- Q ss_pred CCCCCCCCCCCCC-----CHHHHHHH-HHHHHHHHHCCCCEEECHHHCC--C--CHHHHHHHHHHHHCCCCE Q ss_conf 6865478742013-----01243359-9999998856892875001116--8--215679999998587970 Q gi|254780234|r 81 ANALYEGYYLLGT-----AIARPLIA-KYLVDIANETGADAIAHGSTGK--G--NDQVRFELSAYSLNSDIE 142 (404) Q Consensus 81 ana~Yeg~Ypl~t-----slaRplia-~~lv~~a~~~ga~~iaHG~Tgk--G--NDQvRFe~~~~~l~P~l~ 142 (404) ...+...|++++ ...|.+.. ..+++.|++. ++.+.=-++|- | -=|.-+.+ +.+..|++- T Consensus 136 -~l~~~~~~FvG~isP~~~~~~~i~~v~rL~~~a~~~-~~~ilIdT~GWi~G~~g~elk~~l-i~~ikP~~I 204 (398) T COG1341 136 -ELEPFTLYFVGSISPQGFPGRYIAGVARLVDLAKKE-ADFILIDTDGWIKGWGGLELKRAL-IDAIKPDLI 204 (398) T ss_pred -HCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHCC-CCEEEECCCCCEECCHHHHHHHHH-HHHCCCCEE T ss_conf -758652279851477777689999999999986516-877999699843074278999998-865097789 No 150 >pfam03480 SBP_bac_7 Bacterial extracellular solute-binding protein, family 7. This family of proteins are involved in binding extracellular solutes for transport across the bacterial cytoplasmic membrane. This family includes a C4-dicarboxylate-binding protein and the sialic acid-binding protein SiaP. The structure of the SiaP receptor has revealed an overall topology similar to ATP binding cassette ESR (extracytoplasmic solute receptors) proteins. Upon binding of sialic acid, SiaP undergoes domain closure about a hinge region and kinking of an alpha-helix hinge component. Probab=59.43 E-value=13 Score=17.44 Aligned_cols=18 Identities=17% Similarity=0.246 Sum_probs=8.1 Q ss_pred HHHHHHHCCCCEEECHHH Q ss_conf 999988568928750011 Q gi|254780234|r 104 LVDIANETGADAIAHGST 121 (404) Q Consensus 104 lv~~a~~~ga~~iaHG~T 121 (404) +-+...+.|...+..+.+ T Consensus 98 ~~~~~~~~gv~~L~~~~~ 115 (285) T pfam03480 98 LLKSLEAKGLKGLAFWNN 115 (285) T ss_pred HHHHHHHCCCEEEEEECC T ss_conf 999999769689986268 No 151 >TIGR02875 spore_0_A sporulation transcription factor Spo0A; InterPro: IPR012052 Members of this group are response regulators/transcription factors that contain CheY-like receiver (phosphoacceptor) domain and a unique DNA-binding domain. Spo0A controls the entry of Bacillus subtilis into the developmental process of sporulation . Activation of the Spo0A transcription factor by phosphorylation serves as a developmental checkpoint and to integrate several physiological signals that control entry into the sporulation pathway. The signals are generated by conditions of nutrient deprivation, high cell density, the Krebs cycle, DNA replication, DNA damage, and some aspect of the chromosome partitioning machinery . Activated Spo0A has multiple functions . It represses promoters (such as the abrB promoter) by binding to sites (0A boxes) downstream from the transcription start site. Spo0A also activates transcription from promoters used by two forms of RNA polymerase that differ by containing either the major sigma factor, sigma A (e.g. the spoIIE and spoIIG promoters) or the alternate sigma factor, sigma H (e.g. the spoIIA promoter). At promoters activated by Spo0A, the 0A boxes lie upstream of the transcription-initiation site.; GO: 0003677 DNA binding, 0005509 calcium ion binding, 0000160 two-component signal transduction system (phosphorelay), 0042173 regulation of sporulation, 0045449 regulation of transcription, 0050906 detection of stimulus during sensory perception. Probab=58.97 E-value=6.9 Score=19.48 Aligned_cols=134 Identities=21% Similarity=0.212 Sum_probs=60.8 Q ss_pred CCCCCHHHHHHHHHHHCCCEEEEEE-CHHHHHHHHHHHHHHHC----------CCCC-CCCCCCCCCHH-HHHHH-HHHH Q ss_conf 7871106899999997398079982-00899999879999736----------8654-78742013012-43359-9999 Q gi|254780234|r 40 LGQGEELKIASDKARLLGAKEVYVK-DLRREFVRDFVFPMFRA----------NALY-EGYYLLGTAIA-RPLIA-KYLV 105 (404) Q Consensus 40 ~Gq~~d~~~~~~~A~~~Ga~~~~v~-D~r~ef~~~~i~~~I~a----------na~Y-eg~Ypl~tsla-Rplia-~~lv 105 (404) +|| |.|.+||..+||+ +||+ .+-=|....-|..+.+. +..- .-+.+..|... .+.== ..+- T Consensus 87 fGQ----E~ITqrA~~LGAD-YYvlKPfDle~L~~RIRQl~~~t~~~esvri~~i~a~~~~~~~~~~~~Gk~~dLe~~iT 161 (270) T TIGR02875 87 FGQ----EKITQRAVALGAD-YYVLKPFDLEILAARIRQLAEGTVASESVRIKDIKADSPTVLSASKAEGKTKDLEAEIT 161 (270) T ss_pred CCH----HHHHHHHHHCCCC-CEEECHHHHHHHHHHHHHHHCCCCCCCEEEECCCCCCCCCCCEECCCCCCCCCHHHHHH T ss_conf 375----5899999972898-15665041788999999862464243402312454566722000015798855357899 Q ss_pred HHHHHCCCCEEECHHHCCCCHHHHH------------HHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHCCCCCCCC Q ss_conf 9988568928750011168215679------------9999985879707744100687637999999997089675665 Q gi|254780234|r 106 DIANETGADAIAHGSTGKGNDQVRF------------ELSAYSLNSDIEIIAPWRHWSFKGRQDLIDFAEKHAIPIDKNK 173 (404) Q Consensus 106 ~~a~~~ga~~iaHG~TgkGNDQvRF------------e~~~~~l~P~l~viaP~Rd~~~~sRe~~i~ya~~~gIpv~~~~ 173 (404) .+-++.|.-+ | =||=-=.|= --..+.|.|.+- ....+-.--.|=|-.|-|.+-|+. T Consensus 162 ~iiHE~GVPA--H---iKGY~YLRdAI~mV~~~~ELLgavTK~LYP~IA-------~kYnTTaSRVERAIRHAIEvAWsR 229 (270) T TIGR02875 162 NIIHEIGVPA--H---IKGYMYLRDAILMVLKDMELLGAVTKELYPAIA-------KKYNTTASRVERAIRHAIEVAWSR 229 (270) T ss_pred HHHHHCCCCC--C---CCCCHHHHHHHHHHHHCHHHHCHHHHCCCHHHH-------HHCCCCCCHHHHHHHHHHHHCCCC T ss_conf 9998628988--6---342105889999875040220102110032578-------543869644667666463420258 Q ss_pred CCCCCCCCCHHHHCCCCCCC Q ss_conf 67886415724311545763 Q gi|254780234|r 174 RGEAPFSIDTNLLHSSSEGR 193 (404) Q Consensus 174 ~~~~~yS~D~Nlwg~S~Egg 193 (404) . ..=++++ |.|.+++.. T Consensus 230 G--~~E~i~k-lFGYTi~~~ 246 (270) T TIGR02875 230 G--NIELINK-LFGYTIEGE 246 (270) T ss_pred C--CCCCCHH-HHCCCCCCC T ss_conf 8--6210155-515664477 No 152 >TIGR00962 atpA ATP synthase F1, alpha subunit; InterPro: IPR005294 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (F-, V- and A-ATPases contain rotary motors) and in the type of ions they transport , . F-ATPases (F1F0-ATPases) in mitochondria, chloroplasts and bacterial plasma membranes are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases) are primarily found in eukaryotic vacuoles, catalysing ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases) are found in Archaea and function like F-ATPases. P-ATPases (E1E2-ATPases) are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. F-ATPases (also known as F1F0-ATPase, or H(+)-transporting two-sector ATPase) (3.6.3.14 from EC) are composed of two linked complexes: the F1 ATPase complex is the catalytic core and is composed of 5 subunits (alpha, beta, gamma, delta, epsilon), while the F0 ATPase complex is the membrane-embedded proton channel that is composed of at least 3 subunits (A-C), nine in mitochondria (A-G, F6, F8). Both the F1 and F0 complexes are rotary motors that are coupled back-to-back. In the F1 complex, the central gamma subunit forms the rotor inside the cylinder made of the alpha(3)beta(3) subunits, while in the F0 complex, the ring-shaped C subunits forms the rotor. The two rotors rotate in opposite directions, but the F0 rotor is usually stronger, using the force from the proton gradient to push the F1 rotor in reverse in order to drive ATP synthesis . These ATPases can also work in reverse to hydrolyse ATP to create a proton gradient. This entry represents the alpha subunit found in the F1 complex of F-ATPases. In F-ATPases, there are three copies each of the alpha and beta subunits that form the catalytic core of the F1 complex, while the remaining F1 subunits (gamma, delta, epsilon) form part of the stalks. There is a substrate-binding site on each of the alpha and beta subunits, those on the beta subunits being catalytic, while those on the alpha subunits are regulatory. The alpha-subunit contains a highly conserved adenine-specific non-catalytic nucleotide-binding domain, with a conserved amino acid sequence of Gly-X-X-X-X-Gly-Lys. The alpha and beta subunits form a cylinder that is attached to the central stalk. The alpha/beta subunits undergo a sequence of conformational changes leading to the formation of ATP from ADP, which are induced by the rotation of the gamma subunit, itself is driven by the movement of protons through the F0 complex C subunit . More information about this protein can be found at Protein of the Month: ATP Synthases .; GO: 0016820 hydrolase activity acting on acid anhydrides catalyzing transmembrane movement of substances, 0015986 ATP synthesis coupled proton transport, 0016469 proton-transporting two-sector ATPase complex. Probab=58.47 E-value=14 Score=17.27 Aligned_cols=44 Identities=18% Similarity=0.214 Sum_probs=36.8 Q ss_pred HHHHHHHHHHHCC--------CEEEEEEEECCCCC-HHHHHHHHHHHCCCEEE Q ss_conf 9999999987189--------86999994578711-06899999997398079 Q gi|254780234|r 18 TSIILKWLQVEKG--------LEVIVFIADLGQGE-ELKIASDKARLLGAKEV 61 (404) Q Consensus 18 TSv~i~~L~~e~g--------~eVi~~~~d~Gq~~-d~~~~~~~A~~~Ga~~~ 61 (404) |++|+-=.-.+++ -+|+|++|=+||++ -+-.+-++-+..||=++ T Consensus 175 TavAIDtIiNQk~~~~~~~~~~dv~CvYVAIGQK~StvA~vv~~LE~~GAM~Y 227 (520) T TIGR00962 175 TAVAIDTIINQKDSGELGDEKKDVYCVYVAIGQKASTVAQVVRKLEEHGAMDY 227 (520) T ss_pred CHHHHHHHHHHCCCCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHHHCCCCCC T ss_conf 05655777730577656786556489996207535689999999753487111 No 153 >PRK10100 DNA-binding transcriptional regulator CsgD; Provisional Probab=58.36 E-value=4.3 Score=20.96 Aligned_cols=125 Identities=12% Similarity=0.131 Sum_probs=55.4 Q ss_pred CEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 86999994578711068999999973980799820089999987999973686547874201301243359999999885 Q gi|254780234|r 31 LEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYEGYYLLGTAIARPLIAKYLVDIANE 110 (404) Q Consensus 31 ~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Yeg~Ypl~tslaRplia~~lv~~a~~ 110 (404) ..|+-+|. .+| +.-...|...||.-++..|...+- +..+|+ +.+.|..+++..+++.++....-..... T Consensus 82 ~kVliLTt----~dd-~e~v~~al~aGa~GyllK~~~~~~----Li~aIr--~v~~G~~~l~~~~~~~l~~~~~~~~~~~ 150 (216) T PRK10100 82 IKILLLNT----PED-YPYREIENWPHINGVFYAMEDQER----VVNGLQ--GVLRGECYFTQKLASYLITHSGNYRYNS 150 (216) T ss_pred CEEEEEEC----CCC-HHHHHHHHHCCCCEEEECCCCHHH----HHHHHH--HHHCCCCCCCHHHHHHHHHHHCCCCCCC T ss_conf 77999968----874-899999976587667668999999----999999--9987997239999999997102256788 Q ss_pred --------CCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHC Q ss_conf --------68928750011168215679999998587970774410068763799999999708 Q gi|254780234|r 111 --------TGADAIAHGSTGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDLIDFAEKHA 166 (404) Q Consensus 111 --------~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~i~ya~~~g 166 (404) +-..++..=+.|+.|.++--++.+---.=.-.+-.=.+-.+.++|.+.+.||.+|. T Consensus 151 ~~~~~LT~RE~eIL~lla~G~sNkEIA~~L~iSe~TVk~H~~~I~~KL~v~~R~qav~~A~~nl 214 (216) T PRK10100 151 TESALLTHREKEILNKLRIGASNNEIARSLFISENTVKTHLYNLFKKIAVKNRTQAVSWANDNL 214 (216) T ss_pred CCCCCCCHHHHHHHHHHHCCCCHHHHHHHHCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHC T ss_conf 7778999899999999987999999998949889899999999999809999999999999866 No 154 >cd06301 PBP1_rhizopine_binding_like Periplasmic binding proteins specific to rhizopines. Periplasmic binding proteins specific to rhizopines, which are simple sugar-like compounds produced in the nodules induced by the symbiotic root nodule bacteria, such as Rhizobium and Sinorhizobium. Rhizopine-binding-like proteins from other bacteria are also included. Two inositol based rhizopine compounds are known to date: L-3-O-methly-scyllo-inosamine (3-O-MSI) and scyllo-inosamine. Bacterial strains that can metabolize rhizopine have a greater competitive advantage in nodulation and rhizopine synthesis is regulated by NifA/NtrA regulatory transcription activators which are maximally expressed at the onset of nitrogen fixation in bacteroids. The members of this group belong to the pentose/hexose sugar-binding protein family of the type I periplasmic binding protein superfamily. Probab=56.18 E-value=15 Score=17.01 Aligned_cols=64 Identities=14% Similarity=0.144 Sum_probs=25.2 Q ss_pred HHHHHHHHC-CCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHCCCC Q ss_conf 999998856-8928750011168215679999998587970774410068763799999999708967 Q gi|254780234|r 103 YLVDIANET-GADAIAHGSTGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDLIDFAEKHAIPI 169 (404) Q Consensus 103 ~lv~~a~~~-ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~i~ya~~~gIpv 169 (404) .+-+.|+++ |.+.+.-.+.+--..|.+.=..+.+..+|.-|++|.- ..+-...++-|.+.|||| T Consensus 20 g~e~~a~~~~g~~v~~~~~~~d~~~Q~~~i~~li~~~vDgIii~p~d---~~~~~~~~~~a~~agIPv 84 (272) T cd06301 20 AMKEHAKVLGGVELQFEDAKNDVATQLSQVENFIAQGVDAIIVVPVD---TAATAPIVKAANAAGIPL 84 (272) T ss_pred HHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHHHCCCCEEEEECCC---CHHHHHHHHHHHHCCCCE T ss_conf 99999997499789997799999999999999998599999995588---211189999999869988 No 155 >PRK03359 putative electron transfer flavoprotein FixA; Reviewed Probab=55.38 E-value=16 Score=16.92 Aligned_cols=99 Identities=19% Similarity=0.091 Sum_probs=61.3 Q ss_pred HHHHHHHC-CCEEEEEEEECCCC-CHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHH Q ss_conf 99998718-98699999457871-10689999999739807998200899999879999736865478742013012433 Q gi|254780234|r 22 LKWLQVEK-GLEVIVFIADLGQG-EELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYEGYYLLGTAIARPL 99 (404) Q Consensus 22 i~~L~~e~-g~eVi~~~~d~Gq~-~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Yeg~Ypl~tslaRpl 99 (404) ...||++. |.+|+.+++ |.+ .+-....++|+..||.+.+.+.-+ .| .|.-++ . T Consensus 45 AlrlkE~~gg~~Vtvvsv--G~~~~~~~~~lr~aLAmGaD~ai~i~d~-~~---------------~~~D~~-------~ 99 (256) T PRK03359 45 ACQLKQQAAEAQVTALSV--GGKALTNAKGRKDVLSRGPDELIVVIDD-QF---------------EQALPQ-------Q 99 (256) T ss_pred HHHHHHHCCCCEEEEEEE--CCCCCCCHHHHHHHHHCCCCEEEEECCC-CC---------------CCCCHH-------H T ss_conf 999888519957999993--7754455999999997589728997576-54---------------443779-------9 Q ss_pred HHHHHHHHHHHCCCCEEECH--HHCCCCHHHHHHHHHHHHCCCCEEEE Q ss_conf 59999999885689287500--11168215679999998587970774 Q gi|254780234|r 100 IAKYLVDIANETGADAIAHG--STGKGNDQVRFELSAYSLNSDIEIIA 145 (404) Q Consensus 100 ia~~lv~~a~~~ga~~iaHG--~TgkGNDQvRFe~~~~~l~P~l~via 145 (404) +|+.+....++.|.|.|--| ++--++-|+-..++-..=-|.+...+ T Consensus 100 tA~~La~~i~~~~~DLVl~G~~s~D~~~gqvg~~lAe~Lg~P~vt~v~ 147 (256) T PRK03359 100 TASALAAAAQKAGFDLILCGDGSSDLYAQQVGLLVGEILNIPAINGVS 147 (256) T ss_pred HHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCHHHHHHHCCCHHEEEE T ss_conf 999999999872999999925102599985228999982985220077 No 156 >cd06322 PBP1_ABC_sugar_binding_like_12 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. This group includes the periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consist of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=54.92 E-value=16 Score=16.87 Aligned_cols=67 Identities=18% Similarity=0.120 Sum_probs=33.6 Q ss_pred HHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHCCCCC Q ss_conf 9999999885689287500111682156799999985879707744100687637999999997089675 Q gi|254780234|r 101 AKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDLIDFAEKHAIPID 170 (404) Q Consensus 101 a~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~i~ya~~~gIpv~ 170 (404) .+.+-+.|++.|.+.+..-+....+.|.+.=..+..-.+|.-|++|.-.-. -...++-+.+.||||- T Consensus 18 ~~gie~~a~~~G~~~~i~~s~~d~~~q~~~i~~li~~~vDgiIi~p~~~~~---~~~~~~~~~~~gIPvV 84 (267) T cd06322 18 ANAMKEEAKKQKVNLIVSIANQDLNKQLSDVEDFITKKVDAIVLSPVDSKG---IRAAIAKAKKAGIPVI 84 (267) T ss_pred HHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHHHHCCCCEEEEECCCCCH---HHHHHHHHHHCCCCEE T ss_conf 999999999749999998399999999999999997499999990676101---0699999997599789 No 157 >COG2264 PrmA Ribosomal protein L11 methylase [Translation, ribosomal structure and biogenesis] Probab=54.81 E-value=16 Score=16.86 Aligned_cols=68 Identities=28% Similarity=0.246 Sum_probs=36.0 Q ss_pred EEECCCH--HHHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCC Q ss_conf 9914772--599999999871898699999457871106899999997398079982008999998799997368 Q gi|254780234|r 10 LAYSGGL--DTSIILKWLQVEKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRAN 82 (404) Q Consensus 10 laySGGL--DTSv~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~an 82 (404) +||--|. -|+.|+.||. ++.-.. --..|+|-+.-+-.| -|.++||++.+-+|. ++.+-+-.+.-++.| T Consensus 139 lAFGTG~HpTT~lcL~~Le-~~~~~g-~~vlDvGcGSGILaI--Aa~kLGA~~v~g~Di-Dp~AV~aa~eNa~~N 208 (300) T COG2264 139 LAFGTGTHPTTSLCLEALE-KLLKKG-KTVLDVGCGSGILAI--AAAKLGAKKVVGVDI-DPQAVEAARENARLN 208 (300) T ss_pred CCCCCCCCHHHHHHHHHHH-HHHCCC-CEEEEECCCHHHHHH--HHHHCCCCEEEEECC-CHHHHHHHHHHHHHC T ss_conf 3367887801799999999-860589-879982678159999--999819866899718-889999999999976 No 158 >cd01976 Nitrogenase_MoFe_alpha Nitrogenase_MoFe_alpha_II: Nitrogenase MoFe protein, beta subunit. A group of proteins similar to the alpha subunit of the MoFe protein of the molybdenum (Mo-) nitrogenase. The nitrogenase enzyme catalyzes the ATP-dependent reduction of dinitrogen to ammonia. The Mo-nitrogenase is the most widespread and best characterized of these systems. Mo-nitrogenase consists of the MoFe protein (component 1) and the Fe protein (component 2). MoFe is an alpha2beta2 tetramer. Each alphabeta pair of MoFe contains one P-cluster (at the alphabeta interface) and, one molecule of iron molybdenum cofactor (FeMoco) contained within the alpha subunit. The Fe protein contains a single [4Fe-4S] cluster. Electrons are transferred from the [4Fe-4S] cluster of the Fe protein to the P-cluster of the MoFe and in turn to FeMoCo, the site of substrate reduction. Probab=54.26 E-value=16 Score=16.80 Aligned_cols=67 Identities=15% Similarity=0.240 Sum_probs=32.9 Q ss_pred HHHH-HHHHCCCCEEECHHH-CCCCHHH-HHHHHHHHH--------CC------CCEEEEEECCCCCCCHHHHHH-HHHH Q ss_conf 9999-988568928750011-1682156-799999985--------87------970774410068763799999-9997 Q gi|254780234|r 103 YLVD-IANETGADAIAHGST-GKGNDQV-RFELSAYSL--------NS------DIEIIAPWRHWSFKGRQDLID-FAEK 164 (404) Q Consensus 103 ~lv~-~a~~~ga~~iaHG~T-gkGNDQv-RFe~~~~~l--------~P------~l~viaP~Rd~~~~sRe~~i~-ya~~ 164 (404) .+.+ ..++.|+.+|.=-|- .+|..|. =++.+..++ .| ++.+|. ++++.+.-+++. ++++ T Consensus 120 aV~~~~~~~~gipVIpv~~~GF~g~s~~~G~~~a~~al~~~vv~~~~~~~~~~~~iNiig---e~ni~gd~~~i~~lL~~ 196 (421) T cd01976 120 AVARKASKELGIPVVPVRCEGFRGVSQSLGHHIANDAIRDHILGKRNEFEPTPYDVNIIG---DYNIGGDAWASRILLEE 196 (421) T ss_pred HHHHHHHHHCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCEEEEC---CCCCCCCHHHHHHHHHH T ss_conf 999998775299889986898577523068899999999984166887788984179975---56676329999999998 Q ss_pred HCCCCCCC Q ss_conf 08967566 Q gi|254780234|r 165 HAIPIDKN 172 (404) Q Consensus 165 ~gIpv~~~ 172 (404) -||.|-.. T Consensus 197 mGi~V~~~ 204 (421) T cd01976 197 MGLRVVAQ 204 (421) T ss_pred CCCCEEEE T ss_conf 39837999 No 159 >PRK05586 biotin carboxylase; Validated Probab=54.24 E-value=16 Score=16.80 Aligned_cols=32 Identities=28% Similarity=0.539 Sum_probs=18.6 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEE Q ss_conf 7899991477259999999987189869999945 Q gi|254780234|r 6 KKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIAD 39 (404) Q Consensus 6 kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d 39 (404) |||.+|=-|=.---++ .-.+ +-|.+.++++.+ T Consensus 3 ~kvLIANRGEIA~Ri~-rt~~-~lgi~tVavys~ 34 (447) T PRK05586 3 KKILIANRGEIAVRII-RACR-EMGIETVAVYSE 34 (447) T ss_pred CEEEEECCCHHHHHHH-HHHH-HCCCCEEEECCC T ss_conf 5499989879999999-9999-839959999170 No 160 >TIGR00787 dctP TRAP transporter solute receptor, DctP family; InterPro: IPR004682 TRAP-T family proteins generally consist of three components, and these systems have so far been found in Gram-negative bacteria, Gram-positive bacteria and archaea. Only one member of the family has been both sequenced and functionally characterised. This system is the DctPQM system of Rhodobacter capsulatus. DctP is a periplasmic dicarboxylate (malate, fumarate, succinate) binding receptor that is biochemically well-characterised.; GO: 0006810 transport, 0030288 outer membrane-bounded periplasmic space. Probab=53.61 E-value=15 Score=17.01 Aligned_cols=51 Identities=10% Similarity=0.154 Sum_probs=26.4 Q ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHH-CCCCEEEE Q ss_conf 86547874201301243359999999885689287500111682156799999985-87970774 Q gi|254780234|r 82 NALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSL-NSDIEIIA 145 (404) Q Consensus 82 na~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l-~P~l~via 145 (404) +.+--.-||.++.. |.=++..++..|.-..|.++-+ +++ +..| .|++.|+. T Consensus 28 G~~~i~~~P~s~Lg--Pa~D~~~~~~l~~G~~d~~~~s-~~~----------f~~la~p~~~v~d 79 (267) T TIGR00787 28 GEIKISVFPSSQLG--PASDREMLEALRGGALDMTAPS-SSK----------FTPLASPELAVLD 79 (267) T ss_pred CEEEEEECCCCCCC--CCCHHHHHHHHHCCCEEEEECC-CCC----------CCCCCCCHHHHHC T ss_conf 85789975763478--8776999998408954897257-675----------4233300687716 No 161 >cd06315 PBP1_ABC_sugar_binding_like_6 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=53.20 E-value=17 Score=16.68 Aligned_cols=66 Identities=17% Similarity=0.191 Sum_probs=48.6 Q ss_pred HHHHHHHHHHCCCCEEECHHHCCCCHHHH-HHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHCCCCC Q ss_conf 99999998856892875001116821567-99999985879707744100687637999999997089675 Q gi|254780234|r 101 AKYLVDIANETGADAIAHGSTGKGNDQVR-FELSAYSLNSDIEIIAPWRHWSFKGRQDLIDFAEKHAIPID 170 (404) Q Consensus 101 a~~lv~~a~~~ga~~iaHG~TgkGNDQvR-Fe~~~~~l~P~l~viaP~Rd~~~~sRe~~i~ya~~~gIpv~ 170 (404) ++.+-+.|+++|.++.---+-|.-..|.. ++.++ ++.||.-|+.|+ +...-...++-+++.|||+- T Consensus 19 ~~G~~eaA~~lGw~v~v~D~~g~~~~~~~~i~~ai-~~k~D~Iii~~~---D~~~~~~~l~~A~~agIPvv 85 (280) T cd06315 19 GEGVREAAKAIGWNLRILDGRGSEAGQAAALNQAI-ALKPDGIVLGGV---DAAELQAELELAQKAGIPVV 85 (280) T ss_pred HHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHHH-HCCCCEEEECCC---CHHHHHHHHHHHHHCCCCEE T ss_conf 88999999975987999889999999999999999-639999999982---97887899999998799789 No 162 >TIGR03324 alt_F1F0_F1_al alternate F1F0 ATPase, F1 subunit alpha. A small number of taxonomically diverse prokaryotic species, including Methanosarcina barkeri, have what appears to be a second ATP synthase, in addition to the normal F1F0 ATPase in bacteria and A1A0 ATPase in archaea. These enzymes use ion gradients to synthesize ATP, and in principle may run in either direction. This model represents the F1 alpha subunit of this apparent second ATP synthase. Probab=52.39 E-value=18 Score=16.60 Aligned_cols=103 Identities=17% Similarity=0.172 Sum_probs=63.0 Q ss_pred EEECCCHHHHHHHHHHHHHCCCEEEEEEEECCC-CCHHHHHHHHHHHCCCEEEEEE---CHHHHHHHHHHHHHHHCCCCC Q ss_conf 991477259999999987189869999945787-1106899999997398079982---008999998799997368654 Q gi|254780234|r 10 LAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQ-GEELKIASDKARLLGAKEVYVK---DLRREFVRDFVFPMFRANALY 85 (404) Q Consensus 10 laySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq-~~d~~~~~~~A~~~Ga~~~~v~---D~r~ef~~~~i~~~I~ana~Y 85 (404) ++=||-=-|++++--+..+.+.+|+|+.+-+|| ..+...+-+.-...|+-++-++ ..-+--...|+. T Consensus 168 igdrgtGKTtla~dtIinqk~~~vicVyvaIGqr~seV~~~v~~l~~~gal~~TvVV~Ata~dpa~lr~~A--------- 238 (497) T TIGR03324 168 LGDRQTGKTAIAIDTILNQKGRNVLCIYCAIGQRASAVAKVVANLREHGAMDYTIVVVTEGNDPPGLQYIA--------- 238 (497) T ss_pred ECCCCCCHHHHHHHHHHHCCCCCEEEEEEEEEEEHHHHHHHHHHHHHCCCCCCEEEEEECCCCCHHHHHHH--------- T ss_conf 45888880088999997045667189999732507879999999874875234179995688998999887--------- Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHC Q ss_conf 78742013012433599999998856892875001116821567999999858 Q gi|254780234|r 86 EGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLN 138 (404) Q Consensus 86 eg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~ 138 (404) |..+-.+.|+-+..|-+++.-- -|--||-.+.+-+. T Consensus 239 ------------pyaa~aiAEyFrd~G~dVLlv~-----DdLTr~A~A~REis 274 (497) T TIGR03324 239 ------------PYAATSIGEHFMEQGRDVLIVY-----DDLTQHARAYRELS 274 (497) T ss_pred ------------HHHHHHHHHHHHHCCCEEEEEE-----ECHHHHHHHHHHHH T ss_conf ------------8999999999996897489999-----78069999999999 No 163 >COG1540 Uncharacterized proteins, homologs of lactam utilization protein B [General function prediction only] Probab=52.37 E-value=18 Score=16.59 Aligned_cols=144 Identities=15% Similarity=0.269 Sum_probs=90.6 Q ss_pred HHHHHHHCCCC---EEECHHHCC--CCHHH--H-HHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHCCCCCCCCCC Q ss_conf 99998856892---875001116--82156--7-9999998587970774410068763799999999708967566567 Q gi|254780234|r 104 LVDIANETGAD---AIAHGSTGK--GNDQV--R-FELSAYSLNSDIEIIAPWRHWSFKGRQDLIDFAEKHAIPIDKNKRG 175 (404) Q Consensus 104 lv~~a~~~ga~---~iaHG~Tgk--GNDQv--R-Fe~~~~~l~P~l~viaP~Rd~~~~sRe~~i~ya~~~gIpv~~~~~~ 175 (404) |-.+++..|.. +-.||+-+- -.|.. + +=.+++.++|++.+.. + +-.+.+.-++++|+++...--. T Consensus 95 L~a~~~a~G~~~~hVKpHGALYN~~a~D~~la~av~~av~~~dp~L~l~~------L-ags~~~~~a~~~GL~~~~EvFA 167 (252) T COG1540 95 LQAFARAQGGVVQHVKPHGALYNQAAKDRALADAVAEAVAAFDPSLILMG------L-AGSELLRAAKRAGLPVAEEVFA 167 (252) T ss_pred HHHHHHHCCCEEEEECCCHHHHHHHHCCHHHHHHHHHHHHHHCCCCEEEE------C-CCHHHHHHHHHCCCHHHHHHHC T ss_conf 99999964975777553178898861189999999999997498956996------1-8579999999819545888841 Q ss_pred CCCCCCCHHHHCCCCCCCCCCCCCCCCCHHHCCCCCC-HHHCCCCCEEEEEEEECCEEEEECCEECCHHHHHHHHHHHHH Q ss_conf 8864157243115457632458457682010014658-122799963899999502478751623189999999988864 Q gi|254780234|r 176 EAPFSIDTNLLHSSSEGRVLEDPSQPAPEYVYKMIVS-PEEAPDTPTTIRIDFQRGDPIAINGQVMSPEVLLEQLNQYGR 254 (404) Q Consensus 176 ~~~yS~D~Nlwg~S~Egg~Ledp~~~~pe~~~~~t~~-p~~ap~~pe~v~I~Fe~G~PVainG~~~~~~~li~~LN~igg 254 (404) ++.|--|--|--|+-.|..++||...+ +-+..+.+. .-.+ -.-+.|.|+.+. +-|-|..-..+++..++-+.-. T Consensus 168 DR~Y~~dG~Lv~R~~pgA~i~d~~~a~-~qvl~m~~~g~v~a-~dG~~v~v~ads---iCvHGD~p~Al~~~~riR~~l~ 242 (252) T COG1540 168 DRAYQPDGTLVPRSLPGAVIHDEEEAL-AQVLQMVREGKVTA-IDGEWVAVEADS---ICVHGDNPHALAFARRIRAALE 242 (252) T ss_pred CCCCCCCCCEECCCCCCCCCCCHHHHH-HHHHHHHHCCCEEE-ECCCEEEEECCE---EEECCCCHHHHHHHHHHHHHHH T ss_conf 335288976711789985125999999-99999986496483-279388650446---8982897789999999999998 Q ss_pred HCCCC Q ss_conf 21637 Q gi|254780234|r 255 CNGIG 259 (404) Q Consensus 255 ~~GvG 259 (404) +.||- T Consensus 243 ~~gi~ 247 (252) T COG1540 243 AEGIK 247 (252) T ss_pred HCCCE T ss_conf 76975 No 164 >PRK07792 fabG 3-ketoacyl-(acyl-carrier-protein) reductase; Provisional Probab=51.82 E-value=18 Score=16.54 Aligned_cols=58 Identities=26% Similarity=0.314 Sum_probs=34.8 Q ss_pred EEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCE-EEEEECHHHH Q ss_conf 9999147725999999998718986999994578711068999999973980-7998200899 Q gi|254780234|r 8 VVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQGEELKIASDKARLLGAK-EVYVKDLRRE 69 (404) Q Consensus 8 VvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~-~~~v~D~r~e 69 (404) +|..=|+|+--.++..+. +.|+.|+. .|+...++.++..+.....|.. ..+..|+.++ T Consensus 13 lVTGas~GIGraiA~~lA--~~GA~Vvi--~d~~~~~~~~~~~~ei~~~G~~~~~~~~Dvsd~ 71 (303) T PRK07792 13 VVTGAAAGLGRAEALGLA--RLGATVVV--NDIASALDASDVLDEIRAAGAKAVAVAGDISQR 71 (303) T ss_pred EEECCCCHHHHHHHHHHH--HCCCEEEE--EECCCCHHHHHHHHHHHHCCCCEEEEECCCCCH T ss_conf 992886689999999999--86999999--718972479999999984499389996676899 No 165 >pfam08901 DUF1847 Protein of unknown function (DUF1847). This family of proteins are functionally uncharacterized. THey contain 4 N-terminal cysteines that may form a zinc binding domain. Probab=51.81 E-value=18 Score=16.53 Aligned_cols=35 Identities=37% Similarity=0.544 Sum_probs=14.9 Q ss_pred CEEEEEECCCHH--HHHHHHHHHHHCCCEEEEEEEECC Q ss_conf 789999147725--999999998718986999994578 Q gi|254780234|r 6 KKVVLAYSGGLD--TSIILKWLQVEKGLEVIVFIADLG 41 (404) Q Consensus 6 kkVvlaySGGLD--TSv~i~~L~~e~g~eVi~~~~d~G 41 (404) ||+.+||--||- +.+....|+ .+|+||+.|++-+| T Consensus 56 kkiGiAfCiGL~~Ea~~~~~il~-~~gFev~sV~CKvg 92 (157) T pfam08901 56 KKIGIAFCIGLLNEARILAKILK-AKGFEVYSVCCKVG 92 (157) T ss_pred CEEEEHHHHHHHHHHHHHHHHHH-HCCCEEEEEEECCC T ss_conf 75530223738999999999999-78988988885068 No 166 >COG4594 FecB ABC-type Fe3+-citrate transport system, periplasmic component [Inorganic ion transport and metabolism] Probab=51.09 E-value=12 Score=17.69 Aligned_cols=78 Identities=23% Similarity=0.222 Sum_probs=37.1 Q ss_pred HHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHH Q ss_conf 98856892875001116821567999999858797077441006876379999999970896756656788641572431 Q gi|254780234|r 107 IANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDLIDFAEKHAIPIDKNKRGEAPFSIDTNLL 186 (404) Q Consensus 107 ~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~i~ya~~~gIpv~~~~~~~~~yS~D~Nlw 186 (404) +-..+|-..-+|-+.+--|-++-.|- ....+|+.-+++-.||-+.. |..+-+-+=+ .++ +-+ +..-|+.|.|.| T Consensus 212 ~l~~lG~~~~~~~s~~~~~~~~~lEq-La~~nPd~mil~t~~d~~~~-~k~~~~~lWK-~Lk--AVK-n~qV~~vDrn~W 285 (310) T COG4594 212 LLTQLGYQVPAASSDGGPYMSVGLEQ-LAAINPDVMILATYRDESIV-RKWEKNALWK-KLK--AVK-NGQVYDVDRNTW 285 (310) T ss_pred HHHHHCCCCCCCCCCCCCCCCCCHHH-HHCCCCCEEEEEECCCHHHH-HHHHCCHHHH-HHH--HHH-CCCEEEECHHHH T ss_conf 99996675541215799764015998-84179877999735861666-5530145988-877--863-595787131378 Q ss_pred CCCC Q ss_conf 1545 Q gi|254780234|r 187 HSSS 190 (404) Q Consensus 187 g~S~ 190 (404) -|+- T Consensus 286 ar~R 289 (310) T COG4594 286 ARSR 289 (310) T ss_pred HHHC T ss_conf 8860 No 167 >pfam07338 DUF1471 Protein of unknown function (DUF1471). This family consists of several hypothetical Enterobacterial proteins of around 90 residues in length. Some members of this family are annotated as ydgH precursors and contain two copies of this region, one at the N-terminus and the other at the C-terminus. The function of this family is unknown. Probab=51.02 E-value=18 Score=16.45 Aligned_cols=39 Identities=21% Similarity=0.193 Sum_probs=29.8 Q ss_pred CCEEEEEEEECCCCCH-HHHHHHHHHHCCCEEEEEECHHH Q ss_conf 9869999945787110-68999999973980799820089 Q gi|254780234|r 30 GLEVIVFIADLGQGEE-LKIASDKARLLGAKEVYVKDLRR 68 (404) Q Consensus 30 g~eVi~~~~d~Gq~~d-~~~~~~~A~~~Ga~~~~v~D~r~ 68 (404) -..+|.++...|.+.| .+.+.+||.+.||+-++|+-..+ T Consensus 13 ~~G~ISvsg~~~s~~d~~~~la~kAd~~GA~~y~Ii~~~~ 52 (66) T pfam07338 13 PIGTISVSGAFGSPSDAEAALAKKADAKGAKYYRIISARE 52 (66) T ss_pred ECEEEEEECCCCCHHHHHHHHHHHHHHCCCCEEEEEEECC T ss_conf 6308999056599899999999999986998899998235 No 168 >cd03017 PRX_BCP Peroxiredoxin (PRX) family, Bacterioferritin comigratory protein (BCP) subfamily; composed of thioredoxin-dependent thiol peroxidases, widely expressed in pathogenic bacteria, that protect cells against toxicity from reactive oxygen species by reducing and detoxifying hydroperoxides. The protein was named BCP based on its electrophoretic mobility before its function was known. BCP shows substrate selectivity toward fatty acid hydroperoxides rather than hydrogen peroxide or alkyl hydroperoxides. BCP contains the peroxidatic cysteine but appears not to possess a resolving cysteine (some sequences, not all, contain a second cysteine but its role is still unknown). Unlike other PRXs, BCP exists as a monomer. The plant homolog of BCP is PRX Q, which is expressed only in leaves and is cellularly localized in the chloroplasts and the guard cells of stomata. Also included in this subfamily is the fungal nuclear protein, Dot5p (for disrupter of telomere silencing protein 5), w Probab=50.52 E-value=19 Score=16.40 Aligned_cols=16 Identities=38% Similarity=0.204 Sum_probs=6.4 Q ss_pred CEEEEEECCCHHHHHH Q ss_conf 7899991477259999 Q gi|254780234|r 6 KKVVLAYSGGLDTSII 21 (404) Q Consensus 6 kkVvlaySGGLDTSv~ 21 (404) |+|||.|--+-.|.+| T Consensus 24 k~vvl~F~p~~~~p~C 39 (140) T cd03017 24 KPVVLYFYPKDDTPGC 39 (140) T ss_pred CEEEEEEECCCCCCCC T ss_conf 8499999668789978 No 169 >TIGR01472 gmd GDP-mannose 4,6-dehydratase; InterPro: IPR006368 This family represent GDP-mannose 4,6-dehydratase, also known as GDP-D-mannose dehydratase. This enzyme converts GDP-mannose to GDP-4-dehydro-6-deoxy-D-mannose, the first of three steps for the conversion of GDP-mannose to GDP-fucose in animals, plants, and bacteria. In bacteria, GDP-L-fucose acts as a precursor of surface antigens such as the extracellular polysaccharide colanic acid of Escherichia coli. Excluded from this family are members of the clade that are poorly related because of highly dervied (phylogenetically long-branch) sequences, e.g. Aneurinibacillus thermoaerophilus Gmd, described as a bifunctional GDP-mannose 4,6-dehydratase/GDP-6-deoxy-D-lyxo-4-hexulose reductase .; GO: 0008446 GDP-mannose 46-dehydratase activity, 0019673 GDP-mannose metabolic process, 0005622 intracellular. Probab=50.45 E-value=15 Score=17.05 Aligned_cols=70 Identities=20% Similarity=0.162 Sum_probs=36.8 Q ss_pred CHHHHHHHHHHHHHCCCEEEEEE-----EECCCCCHHHHHHHHHHHCCC--EEEEEECHHHHHHHHHHHHHHHCCCCCC Q ss_conf 72599999999871898699999-----457871106899999997398--0799820089999987999973686547 Q gi|254780234|r 15 GLDTSIILKWLQVEKGLEVIVFI-----ADLGQGEELKIASDKARLLGA--KEVYVKDLRREFVRDFVFPMFRANALYE 86 (404) Q Consensus 15 GLDTSv~i~~L~~e~g~eVi~~~-----~d~Gq~~d~~~~~~~A~~~Ga--~~~~v~D~r~ef~~~~i~~~I~ana~Ye 86 (404) |=|-|++.-+|. ++|||||++. -|++-=+++=.=+. ...-+- -+.+.=|+.|--.=--+...||=..+|+ T Consensus 10 GQDGSYLAE~LL-~~GYeVHG~~RRSSSfNT~Ri~hiY~~~h-~~~~r~A~~fLHYGDlTDs~~L~~~i~~~kP~EiYN 86 (365) T TIGR01472 10 GQDGSYLAEFLL-EKGYEVHGLIRRSSSFNTQRIEHIYEDPH-KEEKRRALMFLHYGDLTDSSNLVKLIDEIKPTEIYN 86 (365) T ss_pred CCCHHHHHHHHH-HCCCEEEEEEECCCCCCHHHHHHHHCCCC-CCCCCCEEEEECCCCHHHHHHHHHHHHHCCCCEEEC T ss_conf 576789999987-26968764586255425224567640535-410166135420442106899999974048863420 No 170 >COG4126 Hydantoin racemase [Amino acid transport and metabolism] Probab=50.41 E-value=19 Score=16.39 Aligned_cols=82 Identities=17% Similarity=0.156 Sum_probs=49.1 Q ss_pred HHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHH-------------HHHHHHHHHHHCCCCEE Q ss_conf 99999739807998200899999879999736865478742013012433-------------59999999885689287 Q gi|254780234|r 50 SDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYEGYYLLGTAIARPL-------------IAKYLVDIANETGADAI 116 (404) Q Consensus 50 ~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Yeg~Ypl~tslaRpl-------------ia~~lv~~a~~~ga~~i 116 (404) -.-|...|- ++=|+..-++. +..+...+..+.+-. .++-..|..+|+ .....-+..++.|+++| T Consensus 102 v~~A~~vgr-rfsViTtt~rs-~~il~~lv~~~g~s~-~~~~vrstdl~vL~l~~~~~~~~~~l~~~~~~a~~edgAeaI 178 (230) T COG4126 102 VLAALFVGR-RFSVITTTERS-RPILEELVRSYGLSR-HCRSVRSTDLPVLALEGPPEEAEALLVIEAAEALKEDGAEAI 178 (230) T ss_pred HHHHHHHCC-EEEEEECCCCC-HHHHHHHHHHCCCCC-CCCCEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCEE T ss_conf 999987456-67888548531-778999998637323-455323379870103697678999999999998653288889 Q ss_pred ECHHHCCCCHHHHHHHHH Q ss_conf 500111682156799999 Q gi|254780234|r 117 AHGSTGKGNDQVRFELSA 134 (404) Q Consensus 117 aHG~TgkGNDQvRFe~~~ 134 (404) .-||+|..+=+-+.+..+ T Consensus 179 iLGCAGms~la~~Lq~~~ 196 (230) T COG4126 179 ILGCAGMSDLADQLQKAF 196 (230) T ss_pred EECCCCHHHHHHHHHHHH T ss_conf 985821777999999971 No 171 >cd05802 GlmM GlmM is a bacterial phosphoglucosamine mutase (PNGM) that belongs to the alpha-D-phosphohexomutase superfamily. It is required for the interconversion of glucosamine-6-phosphate and glucosamine-1-phosphate in the biosynthetic pathway of UDP-N-acetylglucosamine, an essential precursor to components of the cell envelope. In order to be active, GlmM must be phosphorylated, which can occur via autophosphorylation or by the Ser/Thr kinase StkP. GlmM functions in a classical ping-pong bi-bi mechanism with glucosamine-1,6-diphosphate as an intermediate. Other members of the alpha-D-phosphohexomutase superfamily include phosphoglucosamine mutase (PNGM), phosphoacetylglucosamine mutase (PAGM), the bacterial phosphomannomutase ManB, and the bifunctional phosphomannomutase/phosphoglucomutase (PMM/PGM). Each of these enzymes has four domains with a centrally located active site formed by four loops, one from each domain. All four domains are included in this alignment model. Probab=50.40 E-value=19 Score=16.38 Aligned_cols=17 Identities=24% Similarity=0.356 Sum_probs=9.3 Q ss_pred HHHHHHHCCCCEEECHH Q ss_conf 99998856892875001 Q gi|254780234|r 104 LVDIANETGADAIAHGS 120 (404) Q Consensus 104 lv~~a~~~ga~~iaHG~ 120 (404) +-++.+++||+++.-.| T Consensus 185 ~~~il~~lG~~v~~~~~ 201 (434) T cd05802 185 APEVFRELGAEVIVINN 201 (434) T ss_pred HHHHHHHCCCCEEEECC T ss_conf 99999872984477425 No 172 >PRK12367 short chain dehydrogenase; Provisional Probab=50.08 E-value=19 Score=16.35 Aligned_cols=128 Identities=20% Similarity=0.233 Sum_probs=67.6 Q ss_pred CE-EEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCC Q ss_conf 78-99991477259999999987189869999945787110689999999739807998200899999879999736865 Q gi|254780234|r 6 KK-VVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANAL 84 (404) Q Consensus 6 kk-VvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~ 84 (404) |+ .+..-||+|--+.+ +.|+ .+|+.||++|-+-....+. ..-|..+-+...+.+|. T Consensus 18 KtIgITGAsGaLG~AL~-k~f~-~~GakVIalTh~~~~~~~~-------~~~~p~~wi~W~cG~E~-------------- 74 (250) T PRK12367 18 KRIGITGASGALGKALT-KLFR-AKGAKVIGLTHSKINNSEI-------SDESPNEWVSWECGKES-------------- 74 (250) T ss_pred CEEEEECCCCHHHHHHH-HHHH-HCCCEEEEEECCCCCCCCC-------CCCCCCEEEEEECCCHH-------------- T ss_conf 87999678738999999-9999-8899899983688887545-------56789528984349989-------------- Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHH Q ss_conf 47874201301243359999999885689287500111682156799999985879707744100687637999999997 Q gi|254780234|r 85 YEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDLIDFAEK 164 (404) Q Consensus 85 Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~i~ya~~ 164 (404) .+.+..++.+.=.+.||-.+.|. ..++..-.++ ++..++-||-.++ ..++++. T Consensus 75 ------------------~L~~~LkkiDILILNHGIn~~~~--~~~~~i~~s~--EINalS~~Rllel-----F~~~~~~ 127 (250) T PRK12367 75 ------------------SLDKTLAKIDVLILNHGINPGGR--QNSNDINKAL--EINALSSWRLLQL-----FEDIALN 127 (250) T ss_pred ------------------HHHHHHHHCCEEEECCCCCCCCC--CCHHHHHHHH--HHHHHHHHHHHHH-----HHHHHHC T ss_conf ------------------99999875889998387774556--5978999999--8777879999999-----9999736 Q ss_pred HCCCCCCCCCCCCCCCCCHHHHCCCCCCCCC Q ss_conf 0896756656788641572431154576324 Q gi|254780234|r 165 HAIPIDKNKRGEAPFSIDTNLLHSSSEGRVL 195 (404) Q Consensus 165 ~gIpv~~~~~~~~~yS~D~Nlwg~S~Egg~L 195 (404) .. -....-+|=-+-|+.++ T Consensus 128 ~~------------~~~~kEiWvNTSEAEi~ 146 (250) T PRK12367 128 NN------------SQIPKEIWVNTSEAEIQ 146 (250) T ss_pred CC------------CCCCCEEEEECCHHHCC T ss_conf 65------------55783588615166415 No 173 >PRK11865 pyruvate ferredoxin oxidoreductase subunit beta; Provisional Probab=49.75 E-value=19 Score=16.31 Aligned_cols=52 Identities=23% Similarity=0.344 Sum_probs=39.9 Q ss_pred HHHHHHHHHCCCCEEECHHHCCCCHHHH-HHHHHHHHCCC-CEEEEEE-CCCCCC Q ss_conf 9999998856892875001116821567-99999985879-7077441-006876 Q gi|254780234|r 102 KYLVDIANETGADAIAHGSTGKGNDQVR-FELSAYSLNSD-IEIIAPW-RHWSFK 153 (404) Q Consensus 102 ~~lv~~a~~~ga~~iaHG~TgkGNDQvR-Fe~~~~~l~P~-l~viaP~-Rd~~~~ 153 (404) |-+..+|...|+.+||.++.+--+|.++ +..+...-+|. +.+++|. .-|.+. T Consensus 163 kdl~~ia~a~g~~YVA~~s~~~~~~l~~~i~kA~~~~G~s~i~v~sPCp~gw~~~ 217 (298) T PRK11865 163 KDMPLIMAAHGIPYVATASIGYPEDFMKKVKKAAEVDGPAYIQVLQPCPTGWGFP 217 (298) T ss_pred CCHHHHHHHCCCCEEEEEECCCHHHHHHHHHHHHHCCCCEEEEEECCCCCCCCCC T ss_conf 6899999982998899982579999999999998089987999978895668878 No 174 >PRK11411 fecB iron-dicitrate transporter substrate-binding subunit; Provisional Probab=49.10 E-value=19 Score=16.33 Aligned_cols=16 Identities=19% Similarity=0.032 Sum_probs=6.9 Q ss_pred CCEEEEEECCCHHHHH Q ss_conf 7789999147725999 Q gi|254780234|r 5 VKKVVLAYSGGLDTSI 20 (404) Q Consensus 5 ~kkVvlaySGGLDTSv 20 (404) -||||....+.+|+-+ T Consensus 39 P~RIV~L~~~~~e~l~ 54 (297) T PRK11411 39 PQRVVVLELSFADALA 54 (297) T ss_pred CCEEEEECCCCHHHHH T ss_conf 9779997586279999 No 175 >PRK06249 2-dehydropantoate 2-reductase; Provisional Probab=49.00 E-value=20 Score=16.24 Aligned_cols=35 Identities=14% Similarity=0.314 Sum_probs=28.3 Q ss_pred CCCCCCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEE Q ss_conf 9877778999914772599999999871898699999 Q gi|254780234|r 1 MSRDVKKVVLAYSGGLDTSIILKWLQVEKGLEVIVFI 37 (404) Q Consensus 1 M~~~~kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~ 37 (404) |..++.||++.=.|++-+-+. .+|. +.|.+|..++ T Consensus 1 M~~~~~kI~IiGaGAiG~~~a-~~L~-~aG~~V~li~ 35 (313) T PRK06249 1 MDSETPRIAIIGTGAIGGFYG-AMLA-RAGFDVHFLL 35 (313) T ss_pred CCCCCCEEEEECCCHHHHHHH-HHHH-HCCCCEEEEE T ss_conf 999988899999149999999-9999-6699569996 No 176 >pfam02677 DUF208 Uncharacterized BCR, COG1636. Probab=48.89 E-value=20 Score=16.22 Aligned_cols=143 Identities=18% Similarity=0.160 Sum_probs=82.9 Q ss_pred CCHHHHHHHHHHHHHCCCEEEEEEEECC-CCC-H----HHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 7725999999998718986999994578-711-0----689999999739807998200899999879999736865478 Q gi|254780234|r 14 GGLDTSIILKWLQVEKGLEVIVFIADLG-QGE-E----LKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYEG 87 (404) Q Consensus 14 GGLDTSv~i~~L~~e~g~eVi~~~~d~G-q~~-d----~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Yeg 87 (404) =|-+|++.+..|+ ++|++|+.+.-|=- +|. | +++.++-|.+.|. +++.-|.. .+++. ..++.- |+ T Consensus 7 CaPCs~~~l~~L~-~~g~~vt~~FyNPNIhP~~Ey~~R~~~~~~~~~~~~i-~~i~~~Yd---~~~~~-~~v~g~---e~ 77 (176) T pfam02677 7 CAPCSSYPLEYLR-EYGFEVTGFFYNPNIHPYSEYLRRLEEVKRFAEKLGI-PLIEGDYD---PEEWL-ERVKGL---ED 77 (176) T ss_pred CCCCCHHHHHHHH-HCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHCCC-CEEECCCC---HHHHH-HHHHCC---CC T ss_conf 8786079999999-8498369987799999789999999999999998599-37956888---59999-998445---55 Q ss_pred CCCCCCCHHHHHHH---HHHHHHHHHCCCCEEECHH---HCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCC-CHHHHHH Q ss_conf 74201301243359---9999998856892875001---116821567999999858797077441006876-3799999 Q gi|254780234|r 88 YYLLGTAIARPLIA---KYLVDIANETGADAIAHGS---TGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFK-GRQDLID 160 (404) Q Consensus 88 ~Ypl~tslaRplia---~~lv~~a~~~ga~~iaHG~---TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~-sRe~~i~ 160 (404) -|=+..--+.+.. +....+|++.|.|+.+..= --|--+|++-- ...++-...|-==.+||.-. +=.+.++ T Consensus 78 -epE~g~RC~~Cy~~RL~~tA~~A~e~gfd~ftTTL~~Sp~k~~~~I~~i--G~~~~~~~gi~f~~~DfRk~~G~~~~~e 154 (176) T pfam02677 78 -EPEGGKRCTYCYDMRLEETAQYAKEHGFDAFTTTLLISPYKNHEQINEI--GERAAKEYGVEFLYSDFRKNGGYQRSIE 154 (176) T ss_pred -CCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCHHHHHHH--HHHHHHHHCCCEEECCCCCCCCHHHHHH T ss_conf -8888871599999999999999998599999764617776479999999--9999999799785000478865799999 Q ss_pred HHHHHCCC Q ss_conf 99970896 Q gi|254780234|r 161 FAEKHAIP 168 (404) Q Consensus 161 ya~~~gIp 168 (404) .+++.|+= T Consensus 155 lske~glY 162 (176) T pfam02677 155 LSKEYGLY 162 (176) T ss_pred HHHHHCCC T ss_conf 99984974 No 177 >KOG2316 consensus Probab=48.39 E-value=20 Score=16.17 Aligned_cols=210 Identities=17% Similarity=0.174 Sum_probs=96.1 Q ss_pred EEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCCCH-HHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHH----HC Q ss_conf 899991477259999999987189869999945787110-6899999997398079982008999998799997----36 Q gi|254780234|r 7 KVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQGEE-LKIASDKARLLGAKEVYVKDLRREFVRDFVFPMF----RA 81 (404) Q Consensus 7 kVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~d-~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I----~a 81 (404) ||+-..|||-||.+.+.-.. ..|-+++++. |+--++| .+++---+... +-|.++|+ +++-.-.|.. ++ T Consensus 2 rvvaLiSGGKDScynmm~cv-~~gHeiVaLa-nl~p~~d~~delDSyMyQt--VGh~~i~l---yaecm~lPlyrr~i~g 74 (277) T KOG2316 2 RVVALISGGKDSCYNMMCCV-RLGHEIVALA-NLHPKEDESDELDSYMYQT--VGHDVIDL---YAECMGLPLYRRRIRG 74 (277) T ss_pred CEEEEEECCHHHHHHHHHHH-HCCCEEEEEE-CCCCCCCCCHHHHHHHHHH--HHHHHHHH---HHHHHCCCEEEEECCC T ss_conf 18999717807899999999-8487435643-2468755520577889886--02778999---9998468346642148 Q ss_pred CCCCCC-CCCCCCCHHHHHHHHHHHHHHHHC-CCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHH Q ss_conf 865478-742013012433599999998856-892875001116821567999999858797077441006876379999 Q gi|254780234|r 82 NALYEG-YYLLGTAIARPLIAKYLVDIANET-GADAIAHGSTGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDLI 159 (404) Q Consensus 82 na~Yeg-~Ypl~tslaRplia~~lv~~a~~~-ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~i 159 (404) ...-++ .|--.-.=.--=.-..+-.+-++. ...+|+-|+--.---.+|-|..+..|+ +..+++. |+.. .++++ T Consensus 75 ~s~nq~l~Y~~t~~DEvEDLy~ll~~VK~~~p~~eaVS~GAIlS~YQr~RVEnVC~RL~--L~~Ls~L--W~rd-Q~~LL 149 (277) T KOG2316 75 RSINQKLQYTKTEGDEVEDLYELLKTVKEKIPDVEAVSVGAILSDYQRTRVENVCSRLG--LVSLSYL--WQRD-QEELL 149 (277) T ss_pred CCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEHHHHHHHHHHHHHHHHHHHHC--CEEEHHH--HHCC-HHHHH T ss_conf 64541354456888318999999999986288700443346676888999998986608--6332677--7406-99999 Q ss_pred HHHHHHCCCCCCCCCCCCCCC-------------------CCHHHHCCCCCCCCCCCCCCCCCHHHCCCCCCHHHCCCCC Q ss_conf 999970896756656788641-------------------5724311545763245845768201001465812279996 Q gi|254780234|r 160 DFAEKHAIPIDKNKRGEAPFS-------------------IDTNLLHSSSEGRVLEDPSQPAPEYVYKMIVSPEEAPDTP 220 (404) Q Consensus 160 ~ya~~~gIpv~~~~~~~~~yS-------------------~D~Nlwg~S~Egg~Ledp~~~~pe~~~~~t~~p~~ap~~p 220 (404) .-.-..|+..-.-|. ..+-= .+.-=.|.--|||+-|+--..+| .|.. +-.-|+- T Consensus 150 ~eMi~~g~~AiiiKV-AAigL~~khLgksL~em~p~L~~l~~ky~vh~CGEGGEyET~vlDcP--lF~k----rivld~~ 222 (277) T KOG2316 150 QEMILSGLDAIIIKV-AAIGLGRKHLGKSLDEMQPYLLKLNDKYGVHVCGEGGEYETFVLDCP--LFKK----RIVLDEY 222 (277) T ss_pred HHHHHCCCCEEEEEE-EECCCCHHHHCCCHHHHHHHHHHHHHHHCCEECCCCCCEEEEEECCC--CHHH----EEEEEEE T ss_conf 999974877499997-40466766647678888799997426408214578862567885160--2231----5665515 Q ss_pred EEEEEEEECCEEEEE Q ss_conf 389999950247875 Q gi|254780234|r 221 TTIRIDFQRGDPIAI 235 (404) Q Consensus 221 e~v~I~Fe~G~PVai 235 (404) +.+.-.++.+-||.+ T Consensus 223 evv~hs~~~~~~~~v 237 (277) T KOG2316 223 EVVIHSADEVCPVGV 237 (277) T ss_pred EEEECCCCCCCCEEE T ss_conf 776357667661147 No 178 >PRK07178 acetyl-CoA carboxylase; Validated Probab=46.96 E-value=21 Score=16.02 Aligned_cols=73 Identities=21% Similarity=0.228 Sum_probs=37.0 Q ss_pred HHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHH------------------------HHHCCCCEE-EEEECCCCC Q ss_conf 3359999999885689287500111682156799999------------------------985879707-744100687 Q gi|254780234|r 98 PLIAKYLVDIANETGADAIAHGSTGKGNDQVRFELSA------------------------YSLNSDIEI-IAPWRHWSF 152 (404) Q Consensus 98 plia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~------------------------~~l~P~l~v-iaP~Rd~~~ 152 (404) ++=...++++|++.|||+|-.|- |==---..|=..+ +.++-+..| +.|+-+-.. T Consensus 59 Yln~~~Ii~~A~~~g~dAiHPGY-GFLSEn~~FA~~~~~~gi~FIGPs~~~i~~~GdK~~ar~~a~~~gvPv~pgs~~~v 137 (471) T PRK07178 59 YLNPRKLVNLAVETGCDALHPGY-GFLSENAELAEICAERGIKFIGPSADVIRRMGDKTEARRSMIKAGVPVTPGSEGNL 137 (471) T ss_pred HCCHHHHHHHHHHHCCCEEECCC-CHHHCCHHHHHHHHHCCCEEECCCHHHHHHHHCHHHHHHHHHHCCCCCCCCCCCCC T ss_conf 44999999999996999997783-33115989999999789989995999999874839899999986998268968865 Q ss_pred CCHHHHHHHHHHHCCCCCC Q ss_conf 6379999999970896756 Q gi|254780234|r 153 KGRQDLIDFAEKHAIPIDK 171 (404) Q Consensus 153 ~sRe~~i~ya~~~gIpv~~ 171 (404) .+-++.++++++.|.||=. T Consensus 138 ~~~eea~~~A~~iGyPV~l 156 (471) T PRK07178 138 ADIDEALAEAERIGYPVML 156 (471) T ss_pred CCHHHHHHHHHHCCCCEEE T ss_conf 6699999999866981586 No 179 >KOG2862 consensus Probab=46.52 E-value=21 Score=15.98 Aligned_cols=48 Identities=29% Similarity=0.221 Sum_probs=28.2 Q ss_pred EEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHH Q ss_conf 8999914772599999999871898699999457871106899999997 Q gi|254780234|r 7 KVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQGEELKIASDKARL 55 (404) Q Consensus 7 kVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~ 55 (404) +.+|-+|-|+=+--.+.-.+ .+|++|.-+++|.||.-.++.|+++-.. T Consensus 93 d~vLv~~~G~wg~ra~D~~~-r~ga~V~~v~~~~G~~~~le~i~~~lsq 140 (385) T KOG2862 93 DNVLVVSTGTWGQRAADCAR-RYGAEVDVVEADIGQAVPLEEITEKLSQ 140 (385) T ss_pred CEEEEEEECHHHHHHHHHHH-HHCCEEEEEECCCCCCCCHHHHHHHHHH T ss_conf 74999972337778899998-6086555871585567549999999871 No 180 >PRK12608 transcription termination factor Rho; Provisional Probab=46.21 E-value=22 Score=15.94 Aligned_cols=55 Identities=29% Similarity=0.486 Sum_probs=25.0 Q ss_pred CEECCHHHHHHHHHHHHHHCCCCCEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHH Q ss_conf 6231899999999888642163741000773202100011137378999999999986508999999 Q gi|254780234|r 237 GQVMSPEVLLEQLNQYGRCNGIGRIDIVENRFVGIKSRGVYETPGGTILLHAHRAIESISLDSGSAH 303 (404) Q Consensus 237 G~~~~~~~li~~LN~igg~~GvGr~d~vEnr~vG~KsR~vYEaPga~iL~~Ahr~LE~~~l~~~~~~ 303 (404) |..|+-+ +++.. -|.|-..++=||-+ --|.+| ||--|.-..-|. |.+-|+.+++. T Consensus 285 gs~mD~~-i~eef------kgtgn~el~Ldr~l--a~~r~f--PAIdi~~SgTR~-EelLl~~~e~~ 339 (379) T PRK12608 285 GSRMDEV-IFEEF------KGTGNMEIVLDREL--ADKRVF--PAIDIAKSGTRR-EELLLDSKELE 339 (379) T ss_pred CCCCCHH-HHHHH------HCCCCEEEEECHHH--HHCCCC--CCCCCCCCCCCH-HHHCCCHHHHH T ss_conf 6643458-89996------32686089972557--767887--511554577622-66548999999 No 181 >cd01539 PBP1_GGBP Periplasmic glucose/galactose-binding protein (GGBP) involved in chemotaxis towards, and active transport of, glucose and galactose in various bacterial species. Periplasmic glucose/galactose-binding protein (GGBP) involved in chemotaxis towards, and active transport of, glucose and galactose in various bacterial species. GGBP is a member of the pentose/hexose sugar-binding protein family of the type I periplasmic binding protein superfamily which consists of two alpha/beta globular domains connected by a three-stranded hinge. This Venus flytrap-like domain undergoes transition from an open to a closed conformational state upon ligand binding. Moreover, the periplasmic GGBP is homologous to the ligand-binding domain of eukaryotic receptors such as glutamate receptor (GluR) and DNA-binding transcriptional repressors such as LacI and GalR. Probab=45.95 E-value=22 Score=15.92 Aligned_cols=144 Identities=17% Similarity=0.207 Sum_probs=61.0 Q ss_pred EECCCHHHHHHHHHHH--HHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 9147725999999998--71898699999457871106899999997398079982008999998799997368654787 Q gi|254780234|r 11 AYSGGLDTSIILKWLQ--VEKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYEGY 88 (404) Q Consensus 11 aySGGLDTSv~i~~L~--~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Yeg~ 88 (404) ..++.-|..-=+.-+. ..+|+++|.++. -..+.+..+-++|...|+ +++++|..-. +-. + ..|... T Consensus 36 ~~da~~D~~~Q~~~Ie~~I~qgvD~Iii~p--~d~~~~~~~i~~A~~agI-PVI~~d~~~~---~~~---~---~~~~~~ 103 (303) T cd01539 36 FYDAKNNQSTQNEQIDTALAKGVDLLAVNL--VDPTAAQTVINKAKQKNI-PVIFFNREPE---EED---I---KSYDKA 103 (303) T ss_pred EECCCCCHHHHHHHHHHHHHCCCCEEEEEC--CCHHHHHHHHHHHHHCCC-EEEEECCCCC---HHH---C---CCCCCE T ss_conf 707878999999999999975999999946--873348999999996698-4999878766---122---0---345632 Q ss_pred CCCCCCH--HHHHHHHHHHHHHHHCCC-------C----EEECHHHCCCCHHHHHHHHHHHH---CCCCEEEEEE-CCCC Q ss_conf 4201301--243359999999885689-------2----87500111682156799999985---8797077441-0068 Q gi|254780234|r 89 YLLGTAI--ARPLIAKYLVDIANETGA-------D----AIAHGSTGKGNDQVRFELSAYSL---NSDIEIIAPW-RHWS 151 (404) Q Consensus 89 Ypl~tsl--aRplia~~lv~~a~~~ga-------~----~iaHG~TgkGNDQvRFe~~~~~l---~P~l~viaP~-Rd~~ 151 (404) +..++.- +=-+.++.+++...+... . ++-.|..|..+-+.|.+....+| +|++++++.. -+|+ T Consensus 104 ~~Vg~Dn~~~G~~~ge~l~~~l~~~~~~~~~~g~gki~~vil~G~~g~~~~~~R~~Gf~~~l~~~~~~~~iv~~~~~~w~ 183 (303) T cd01539 104 YYVGTDAEQSGILQGKLIADYWNANKDALDKNGDGIIQYVMLKGEPGHPDAIARTKYSIETLNDAGIKTEELASDTANWD 183 (303) T ss_pred EEEECCHHHHHHHHHHHHHHHHHCCCCCHHHCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHCCCCEEEEEEEECCCC T ss_conf 79953779999999999999874066403316888457999968999978999998699999854980157778716878 Q ss_pred CC-CHHHHHHHHHHHC Q ss_conf 76-3799999999708 Q gi|254780234|r 152 FK-GRQDLIDFAEKHA 166 (404) Q Consensus 152 ~~-sRe~~i~ya~~~g 166 (404) .. +++.+-+++.+|+ T Consensus 184 ~~~a~~~~e~~L~~~~ 199 (303) T cd01539 184 RAQAKDKMDALLLKYG 199 (303) T ss_pred HHHHHHHHHHHHHHCC T ss_conf 8999999999998589 No 182 >COG0174 GlnA Glutamine synthetase [Amino acid transport and metabolism] Probab=45.94 E-value=3.3 Score=21.84 Aligned_cols=149 Identities=22% Similarity=0.245 Sum_probs=79.5 Q ss_pred HHHHHHHHHHHHHCCCEEEEEEEEC-CCCC--H----HHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHH------CC Q ss_conf 2599999999871898699999457-8711--0----68999999973980799820089999987999973------68 Q gi|254780234|r 16 LDTSIILKWLQVEKGLEVIVFIADL-GQGE--E----LKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFR------AN 82 (404) Q Consensus 16 LDTSv~i~~L~~e~g~eVi~~~~d~-Gq~~--d----~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~------an 82 (404) ++|-+.++|.. ....-|+|-..+- |++- | +..+.++....|..+ .++-..-|| |+|.... -- T Consensus 75 ~~T~~~~Pw~~-~~ta~v~cdv~~~~g~p~~~dPR~vlkr~~~~l~~~G~~~-~~~g~E~EF---fLfd~~~~~~~~~~~ 149 (443) T COG0174 75 LSTLVVDPWRE-GPTARVLCDVYDPDGTPYPRDPRSVLKRALARLKDEGLAP-AVVGPELEF---FLFDRDGRDPDGGRP 149 (443) T ss_pred CCCEEECCCCC-CCCEEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCCC-CCEECCEEE---EEEECCCCCCCCCCC T ss_conf 67527677778-8727999998879999589883999999999998668854-302024169---996036677666766 Q ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEE-CHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHH Q ss_conf 65478742013012433599999998856892875-00111682156799999985879707744100687637999999 Q gi|254780234|r 83 ALYEGYYLLGTAIARPLIAKYLVDIANETGADAIA-HGSTGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDLIDF 161 (404) Q Consensus 83 a~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~ia-HG~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~i~y 161 (404) ..+.+.|++.---.-.=+.+.++..+.+.|..+=+ |.=-|.| | ||..++ .++-++ .| |.-+.-|.-.-.- T Consensus 150 ~~~g~yf~~~~~~~~~d~~~~i~~~l~~~Gi~ie~~hhEva~g--Q--~EI~~~-~~~~l~-~A---D~~~~~K~vvk~v 220 (443) T COG0174 150 ADKGGYFDVAPLDEAEDFRRDIVEALEAAGIEIEAIHHEVAPG--Q--FEINLR-FDDALK-AA---DQIVIFKYVVKEV 220 (443) T ss_pred CCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEECCCCCCC--C--EEEECC-CCCHHH-HH---HHHHHHHHHHHHH T ss_conf 8877646765422278999999999998799826301450588--5--578337-878899-98---9999999999999 Q ss_pred HHHHCCCCCCCCCCCCCCCC Q ss_conf 99708967566567886415 Q gi|254780234|r 162 AEKHAIPIDKNKRGEAPFSI 181 (404) Q Consensus 162 a~~~gIpv~~~~~~~~~yS~ 181 (404) |++||+-+++-+ ||+-- T Consensus 221 A~~hG~~aTFMp---KP~~g 237 (443) T COG0174 221 AEKHGLTATFMP---KPFFG 237 (443) T ss_pred HHHHCCEEEEEC---CCCCC T ss_conf 999598599947---87789 No 183 >PRK12778 putative bifunctional 2-polyprenylphenol hydroxylase/glutamate synthase subunit beta; Provisional Probab=45.84 E-value=22 Score=15.90 Aligned_cols=101 Identities=22% Similarity=0.296 Sum_probs=48.4 Q ss_pred EEEECCCHHHHHHH---HHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEEC----HHHHHHHHHHHHHHHC Q ss_conf 99914772599999---9998718986999994578711068999999973980799820----0899999879999736 Q gi|254780234|r 9 VLAYSGGLDTSIIL---KWLQVEKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKD----LRREFVRDFVFPMFRA 81 (404) Q Consensus 9 vlaySGGLDTSv~i---~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D----~r~ef~~~~i~~~I~a 81 (404) |+..+||+-+--+. ++|+ +.|.+|+++.. +...|+--.++...+++..-++.-| ...-|+.+-+...|.. T Consensus 101 vv~VgGGvG~Ap~~piakalk-~~Gn~v~~I~G--ar~k~~vilede~~~~~d~~~itTDDGS~G~kG~VT~~L~~ll~~ 177 (760) T PRK12778 101 VVCAGGGVGVAPMLPIVQALK-AAGNRVITVLA--GRSKELIILEKEMRESSDEVIIMTDDGSYGRKGLVTEGVEEVIKR 177 (760) T ss_pred EEEEECCEEHHHHHHHHHHHH-HCCCEEEEEEE--CCCHHHEEHHHHHHHHCCEEEEECCCCCCCCCCEECHHHHHHHHC T ss_conf 999958696898799999999-86995999984--266334035999997438699985899997274566779999748 Q ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEE Q ss_conf 865478742013012433599999998856892875 Q gi|254780234|r 82 NALYEGYYLLGTAIARPLIAKYLVDIANETGADAIA 117 (404) Q Consensus 82 na~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~ia 117 (404) +.|=+.-+++-+.+-|.+.+.+++.|...++ T Consensus 178 -----~~~D~v~aiGP~~MMk~v~~~t~~~~i~t~v 208 (760) T PRK12778 178 -----EKVNKCFAIGPAIMMKFVCLLTKKYEIPTIV 208 (760) T ss_pred -----CCCCEEEEECCHHHHHHHHHHHCCCCCCEEE T ss_conf -----8987899979789999999872605897489 No 184 >PRK09281 F0F1 ATP synthase subunit alpha; Validated Probab=45.55 E-value=22 Score=15.87 Aligned_cols=98 Identities=17% Similarity=0.160 Sum_probs=56.5 Q ss_pred CCCHHHHHHHHHHHHHCCCEEEEEEEECCCC-CHHHHHHHHHHHCCCEEEEE-ECHH--HHHHHHHHHHHHHCCCCCCCC Q ss_conf 4772599999999871898699999457871-10689999999739807998-2008--999998799997368654787 Q gi|254780234|r 13 SGGLDTSIILKWLQVEKGLEVIVFIADLGQG-EELKIASDKARLLGAKEVYV-KDLR--REFVRDFVFPMFRANALYEGY 88 (404) Q Consensus 13 SGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~-~d~~~~~~~A~~~Ga~~~~v-~D~r--~ef~~~~i~~~I~ana~Yeg~ 88 (404) +|-=-|++++--+..+.+.+|+|+.+-+||. .+...+-+.-...||-++-+ +-.. +--...|+. T Consensus 171 ~gtGKT~iaidtIinqk~~~vicVyvaIGer~seV~~~v~~L~~~gal~~TvvV~ata~dpa~lr~~A------------ 238 (502) T PRK09281 171 RQTGKTAIAIDTIINQKGKDVICIYVAIGQKASTVANVVRKLEEHGAMEYTIVVAATASDPAPLQYLA------------ 238 (502) T ss_pred CCCCHHHHHHHHHHHHCCCCCEEEEEEECCCHHHHHHHHHHHHHCCCCCCEEEEEECCCCCHHHHHHH------------ T ss_conf 87338899999998532588369999851221789999998754486411379971799999999999------------ Q ss_pred CCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHH Q ss_conf 420130124335999999988568928750011168215679999998 Q gi|254780234|r 89 YLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYS 136 (404) Q Consensus 89 Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~ 136 (404) |..+-.+.|+-+..|-+++.-- -|.-||-.+.+- T Consensus 239 ---------pyaa~aIAEyFrd~G~dVLlv~-----DdLtr~A~A~RE 272 (502) T PRK09281 239 ---------PYAGCAMGEYFMDNGKDALIVY-----DDLSKQAVAYRQ 272 (502) T ss_pred ---------HHHHHHHHHHHHHCCCCEEEEE-----ECHHHHHHHHHH T ss_conf ---------9999999999996799489998-----556899999988 No 185 >PRK09195 gatY tagatose-bisphosphate aldolase; Reviewed Probab=45.41 E-value=21 Score=16.08 Aligned_cols=121 Identities=16% Similarity=0.263 Sum_probs=66.9 Q ss_pred HHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHH Q ss_conf 99999871898699999457871106899999997398079982008999998799997368654787420130124335 Q gi|254780234|r 21 ILKWLQVEKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYEGYYLLGTAIARPLI 100 (404) Q Consensus 21 ~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Yeg~Ypl~tslaRpli 100 (404) ++.--+ +.+|-|-+|.+. .-+....+-+-|...++ +.++.=....+ +| -| ...+ T Consensus 9 lL~~A~-~~~yAV~AfNv~--~~e~~~Avi~AAee~~s-PvIlq~~~~~~--~~-----------~g---------~~~~ 62 (284) T PRK09195 9 MLNNAQ-RGGYAVPAFNIH--NLETMQVVVETAAELHS-PVIIAGTPGTF--SY-----------AG---------TEYL 62 (284) T ss_pred HHHHHH-HCCEEEEEEEEC--CHHHHHHHHHHHHHHCC-CEEEECCHHHH--HH-----------CC---------HHHH T ss_conf 999999-879079987778--89999999999999599-98999885166--44-----------79---------8999 Q ss_pred HHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEE------EEECCCCCCCHHHHHHHHHHHCCCCCCCC Q ss_conf 99999998856892875001116821567999999858797077------44100687637999999997089675665 Q gi|254780234|r 101 AKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEII------APWRHWSFKGRQDLIDFAEKHAIPIDKNK 173 (404) Q Consensus 101 a~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~vi------aP~Rd~~~~sRe~~i~ya~~~gIpv~~~~ 173 (404) +..+...|++.+..+..|- --|.| ||...+++.-...-+ -|+ +.++.--.+..+||..+|++|.... T Consensus 63 ~~~~~~~a~~~~VPV~lHL--DH~~~---~e~i~~ai~~GftSVM~DgS~l~~-eeNi~~Tk~vv~~Ah~~gv~VEaEl 135 (284) T PRK09195 63 LAIVSAAAKQYHHPLALHL--DHHEK---FDDIAQKVRSGVRSVMIDGSHLPF-AQNISLVKEVVDFCHRFDVSVEAEL 135 (284) T ss_pred HHHHHHHHHHCCCCEEEEC--CCCCC---HHHHHHHHHCCCCEEEEECCCCCH-HHHHHHHHHHHHHHHHHCCEEEEEE T ss_conf 9999999987799889966--98799---999999997499868863898999-9999999999999987288189974 No 186 >KOG1865 consensus Probab=45.41 E-value=22 Score=15.86 Aligned_cols=65 Identities=28% Similarity=0.367 Sum_probs=34.3 Q ss_pred HHHHHHH-HHHCCCCEEEEEECCC--------CC-CCHHHH-HHHH-------HHHCCCCCCCCCCCCCCCCCHHHHCCC Q ss_conf 6799999-9858797077441006--------87-637999-9999-------970896756656788641572431154 Q gi|254780234|r 128 VRFELSA-YSLNSDIEIIAPWRHW--------SF-KGRQDL-IDFA-------EKHAIPIDKNKRGEAPFSIDTNLLHSS 189 (404) Q Consensus 128 vRFe~~~-~~l~P~l~viaP~Rd~--------~~-~sRe~~-i~ya-------~~~gIpv~~~~~~~~~yS~D~Nlwg~S 189 (404) ..|+..+ +++.+....|+|.+.. +| .+|+|- -+|+ ++-.+ +-.+. ..|-|-|.+|-|.. T Consensus 155 c~~q~hi~~A~~~~g~pisP~~i~s~L~~I~~~f~~grQEDAHEFLr~~vd~mqk~cL--~g~~~-~~~~sq~ttlv~~i 231 (545) T KOG1865 155 CTFQAHITRALHNPGHPISPSQILSNLRNISAHFGRGRQEDAHEFLRFTVDAMQKACL--PGHKQ-VDPRSQDTTLVHQI 231 (545) T ss_pred HHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHC--CCCCC-CCCCCCCCEEHHHH T ss_conf 3899999998527998768689998765421102778434199999999999998613--78766-88531011435664 Q ss_pred CCCCCCC Q ss_conf 5763245 Q gi|254780234|r 190 SEGRVLE 196 (404) Q Consensus 190 ~Egg~Le 196 (404) +. |-|. T Consensus 232 FG-G~Lr 237 (545) T KOG1865 232 FG-GYLR 237 (545) T ss_pred HC-CEEH T ss_conf 05-0101 No 187 >cd06324 PBP1_ABC_sugar_binding_like_13 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. This group includes the periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=45.24 E-value=22 Score=15.84 Aligned_cols=69 Identities=14% Similarity=0.155 Sum_probs=41.0 Q ss_pred HHHHHHHHHHHHHCC----CC--EEECHHHCCCCHHHHHHHHHHHH--CCCCEEEEE-ECCCCCC-CHHHHHHHHHHHC Q ss_conf 335999999988568----92--87500111682156799999985--879707744-1006876-3799999999708 Q gi|254780234|r 98 PLIAKYLVDIANETG----AD--AIAHGSTGKGNDQVRFELSAYSL--NSDIEIIAP-WRHWSFK-GRQDLIDFAEKHA 166 (404) Q Consensus 98 plia~~lv~~a~~~g----a~--~iaHG~TgkGNDQvRFe~~~~~l--~P~l~viaP-~Rd~~~~-sRe~~i~ya~~~g 166 (404) ..+++.+++.+...+ .- ++..|.-|..+.+.|.+....+| .|++++... .-+|+.. +++..-+.+++|. T Consensus 123 ~~~a~~li~~~~~~~~~~g~~~i~~i~G~~~~~~~~~R~~Gf~~~l~~~p~~~vv~~~~~~~~~~~a~~~~~~lL~~~P 201 (305) T cd06324 123 YLMAEALISQARSVQAPGGRIDLLAISGDPTTPAAILREAGLRRALAEHPDVRLRQVVYAGWSEDEAYEQAENLLKRYP 201 (305) T ss_pred HHHHHHHHHHHHHHCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHHHHHHHHCC T ss_conf 9999999974055315788468999978889868999998999999868995898886189878999999999997399 No 188 >PRK11579 putative oxidoreductase; Provisional Probab=45.19 E-value=22 Score=15.84 Aligned_cols=119 Identities=15% Similarity=0.064 Sum_probs=65.0 Q ss_pred CCCCCCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCE--------------EEEEECH Q ss_conf 98777789999147725999999998718986999994578711068999999973980--------------7998200 Q gi|254780234|r 1 MSRDVKKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQGEELKIASDKARLLGAK--------------EVYVKDL 66 (404) Q Consensus 1 M~~~~kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~--------------~~~v~D~ 66 (404) |++++ ||.+.=-|.--...-++-+....+.+|++++ |. |.+. .+|...|+. +.+++-. T Consensus 1 M~~~i-rvgiiG~G~~~~~~h~~~~~~~~~~~l~av~-d~----~~~~--~~a~~~~~~~~~~~~~ll~~~~id~V~i~t 72 (346) T PRK11579 1 MSDKI-RVGLIGYGYASKTFHAPLIAGTPGLELAAVS-SS----DETK--VKADWPTVTVVSEPKHLFNDPNIDLIVIPT 72 (346) T ss_pred CCCCC-EEEEECCCHHHHHHHHHHHHCCCCEEEEEEE-CC----CHHH--HHCCCCCCCEECCHHHHHCCCCCCEEEECC T ss_conf 99887-5999936299999999999629991999997-98----9999--950258995389999994599999999979 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHH Q ss_conf 89999987999973686547874201301243359999999885689287500111682156799999985 Q gi|254780234|r 67 RREFVRDFVFPMFRANALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSL 137 (404) Q Consensus 67 r~ef~~~~i~~~I~ana~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l 137 (404) -..+=.+.+..+++++----=.=|+.++++ -++.+++.|++.|..... +-|.||+-.++.+ T Consensus 73 p~~~H~~~~~~al~aGkhv~~EKP~a~~~~---~a~~l~~~a~~~g~~l~v-------~~~~R~~~~~~~~ 133 (346) T PRK11579 73 PNDTHFPLAKAALEAGKHVVVDKPFTVTLS---QARELDALAKSLGRVLSV-------FHNRRWDSDFLTL 133 (346) T ss_pred CCHHHHHHHHHHHHCCCCEEECCCCCCCHH---HHHHHHHHHHHCCCEEEE-------EEEECCCHHHHHH T ss_conf 967899999999987994895387678799---999999999872967999-------6553189899998 No 189 >KOG0780 consensus Probab=45.04 E-value=22 Score=15.82 Aligned_cols=145 Identities=21% Similarity=0.179 Sum_probs=90.2 Q ss_pred CCCHHHHHHHH---HHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 47725999999---998718986999994578711068999999973980799820089999987999973686547874 Q gi|254780234|r 13 SGGLDTSIILK---WLQVEKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYEGYY 89 (404) Q Consensus 13 SGGLDTSv~i~---~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Yeg~Y 89 (404) -|+=-|++|.+ |++ .+||.+.-+|+|+=-..-+|.+++.|.+.+. ++|. .|-. T Consensus 110 qG~GKTTtc~KlA~y~k-kkG~K~~LvcaDTFRagAfDQLkqnA~k~~i-P~yg--------------------syte-- 165 (483) T KOG0780 110 QGSGKTTTCTKLAYYYK-KKGYKVALVCADTFRAGAFDQLKQNATKARV-PFYG--------------------SYTE-- 165 (483) T ss_pred CCCCCCEEHHHHHHHHH-HCCCCEEEEEECCCCCCHHHHHHHHHHHHCC-EEEE--------------------CCCC-- T ss_conf 57886300899999998-4687245776022453068999987674077-0684--------------------0366-- Q ss_pred CCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCC-HHHHHHH---HHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHH Q ss_conf 201301243359999999885689287500111682-1567999---999858797077441006876379999999970 Q gi|254780234|r 90 LLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGN-DQVRFEL---SAYSLNSDIEIIAPWRHWSFKGRQDLIDFAEKH 165 (404) Q Consensus 90 pl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGN-DQvRFe~---~~~~l~P~l~viaP~Rd~~~~sRe~~i~ya~~~ 165 (404) ..-..||..-|+..+++++|.|----.|+-. ++-=||- ..+++.|+.-|+. -|-+. -+...+-+++. T Consensus 166 -----~dpv~ia~egv~~fKke~fdvIIvDTSGRh~qe~sLfeEM~~v~~ai~Pd~vi~V--mDasi--GQaae~Qa~aF 236 (483) T KOG0780 166 -----ADPVKIASEGVDRFKKENFDVIIVDTSGRHKQEASLFEEMKQVSKAIKPDEIIFV--MDASI--GQAAEAQARAF 236 (483) T ss_pred -----CCHHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHHHCCCCEEEEE--EECCC--CHHHHHHHHHH T ss_conf -----5558999999998886397289982787301248999999999851598738999--85620--07679999988 Q ss_pred CCCCCCCCCCCCCCCCCHHHHCCCCCCCCCC Q ss_conf 8967566567886415724311545763245 Q gi|254780234|r 166 AIPIDKNKRGEAPFSIDTNLLHSSSEGRVLE 196 (404) Q Consensus 166 gIpv~~~~~~~~~yS~D~Nlwg~S~Egg~Le 196 (404) .-.|..+. -|=.-|=|-+-.||-|- T Consensus 237 k~~vdvg~------vIlTKlDGhakGGgAlS 261 (483) T KOG0780 237 KETVDVGA------VILTKLDGHAKGGGALS 261 (483) T ss_pred HHHHCCCE------EEEEECCCCCCCCCEEE T ss_conf 77615403------79972256777773453 No 190 >pfam02441 Flavoprotein Flavoprotein. This family contains diverse flavoprotein enzymes. This family includes epidermin biosynthesis protein, EpiD, which has been shown to be a flavoprotein that binds FMN. This enzyme catalyses the removal of two reducing equivalents from the cysteine residue of the C-terminal meso-lanthionine of epidermin to form a --C==C-- double bond. This family also includes the B chain of dipicolinate synthase a small polar molecule that accumulates to high concentrations in bacterial endospores, and is thought to play a role in spore heat resistance, or the maintenance of heat resistance. dipicolinate synthase catalyses the formation of dipicolinic acid from dihydroxydipicolinic acid. This family also includes phenylacrylic acid decarboxylase (EC:4.1.1.-). Probab=44.75 E-value=23 Score=15.79 Aligned_cols=37 Identities=27% Similarity=0.442 Sum_probs=27.8 Q ss_pred CEEEEEECCCHHHHHHHHHHH--HHCCCEEEEEEEECCC Q ss_conf 789999147725999999998--7189869999945787 Q gi|254780234|r 6 KKVVLAYSGGLDTSIILKWLQ--VEKGLEVIVFIADLGQ 42 (404) Q Consensus 6 kkVvlaySGGLDTSv~i~~L~--~e~g~eVi~~~~d~Gq 42 (404) |||+++.+|+-..-.+..+++ .+.|+||..+.-+-++ T Consensus 1 kri~l~itGs~~a~~~~~ll~~L~~~~~~v~vv~S~~A~ 39 (118) T pfam02441 1 KKILLGITGSSAAIKALRLLRELKKEGAEVRVILSKAAK 39 (118) T ss_pred CEEEEEEECHHHHHHHHHHHHHHHHCCCEEEEEECHHHH T ss_conf 979999917899999999999998789979999762698 No 191 >PRK10355 xylF D-xylose transporter subunit XylF; Provisional Probab=44.43 E-value=23 Score=15.76 Aligned_cols=81 Identities=19% Similarity=0.143 Sum_probs=34.7 Q ss_pred EEEEEEECCC-C---CHHHHHHHHHHHCCCEEEEEECHHHHHHH--HHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 9999945787-1---10689999999739807998200899999--8799997368654787420130124335999999 Q gi|254780234|r 33 VIVFIADLGQ-G---EELKIASDKARLLGAKEVYVKDLRREFVR--DFVFPMFRANALYEGYYLLGTAIARPLIAKYLVD 106 (404) Q Consensus 33 Vi~~~~d~Gq-~---~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~--~~i~~~I~ana~Yeg~Ypl~tslaRplia~~lv~ 106 (404) .|++++..-+ + .+.+.++++|..+|+ +..+.|...+-.. .-|-.+|.-+. =-+..+..-+--....++ T Consensus 27 ~IGv~~~~~~~~~~~~~~~~~~~~Ake~G~-~v~v~dA~~D~~~Qi~qIe~~I~qgv-----daIiv~p~D~~al~~~v~ 100 (330) T PRK10355 27 KIGMAIDDLRLERWQKDRDIFVKKAESLGA-KVFVQSANGNEETQMSQIENMINRGV-----DVLVIIPYNGQVLSNVIK 100 (330) T ss_pred EEEEEECCCCCHHHHHHHHHHHHHHHHCCC-EEEEECCCCCHHHHHHHHHHHHHCCC-----CEEEEECCCHHHHHHHHH T ss_conf 799990688886899999999999997599-89997189998999999999998599-----999996998788899999 Q ss_pred HHHHCCCCEEECH Q ss_conf 9885689287500 Q gi|254780234|r 107 IANETGADAIAHG 119 (404) Q Consensus 107 ~a~~~ga~~iaHG 119 (404) .|++-|..+|+.- T Consensus 101 ~A~~aGIPVI~~D 113 (330) T PRK10355 101 EAKQEGIKVLAYD 113 (330) T ss_pred HHHHCCCEEEEEC T ss_conf 9998799499957 No 192 >PRK13237 tyrosine phenol-lyase; Provisional Probab=44.42 E-value=23 Score=15.76 Aligned_cols=148 Identities=17% Similarity=0.215 Sum_probs=83.7 Q ss_pred EEECCCHHHHHHHHHHHHHCCCE-----EEEEEEECC--CCC---HHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHH Q ss_conf 99147725999999998718986-----999994578--711---06899999997398079982008999998799997 Q gi|254780234|r 10 LAYSGGLDTSIILKWLQVEKGLE-----VIVFIADLG--QGE---ELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMF 79 (404) Q Consensus 10 laySGGLDTSv~i~~L~~e~g~e-----Vi~~~~d~G--q~~---d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I 79 (404) ..|.|-+|--.+-..|+ +.|.+ ++++|.|.+ |+- ++..+++-|.+.|+ + +++|. ..|+|+ + ..| T Consensus 155 ~pFKGN~Dl~kLe~~i~-~~g~~~I~~v~~tiTnN~~GGQPVSm~Nir~v~~la~~~gi-p-l~lDa-aRfaEN-A-yFI 228 (459) T PRK13237 155 HPFKGNVDLDKLQKLID-EVGAENIAYICLAVTVNLAGGQPVSMANMRAVRELCDKHGI-K-VFFDA-TRCVEN-A-YFI 228 (459) T ss_pred CCCCCCCCHHHHHHHHH-HHCCCCEEEEEEEEEECCCCCCCCCHHHHHHHHHHHHHCCC-C-EEEEH-HHHHHH-H-HHH T ss_conf 89688779999999998-74855600899998526878806468999999999998099-6-99876-678876-8-999 Q ss_pred HCC-CCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCC-----CHHHHHHHHHHHHCCCCE---EE--EEEC Q ss_conf 368-654787420130124335999999988568928750011168-----215679999998587970---77--4410 Q gi|254780234|r 80 RAN-ALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKG-----NDQVRFELSAYSLNSDIE---II--APWR 148 (404) Q Consensus 80 ~an-a~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkG-----NDQvRFe~~~~~l~P~l~---vi--aP~R 148 (404) +-. .-|.++-+ .=|++.+..+|--.-+++=-.|+...| ||+-=|+.. +.+.+-.+ .| ---| T Consensus 229 k~RE~gy~~~si-------~eI~rEmfs~aDg~tmSaKKD~~vniGG~ia~~d~~l~~~~-~~~~i~~EGf~TYGGlagR 300 (459) T PRK13237 229 KEREEGYQDKSI-------KEILKEMFSYADGATMSGKKDCLVNIGGFLAMNDDELFQEA-KELVVVYEGMPSYGGMAGR 300 (459) T ss_pred HHCCCCCCCCCH-------HHHHHHHHHHHHEEEEEECCCCCCCCCCEEEECCHHHHHHH-HHHEEECCCCCCCCCCCHH T ss_conf 834420148999-------99999997440279998320253465567985779999999-8551120587554562264 Q ss_pred CCC--------------CCCHHHHHHH----HHHHCCCCCC Q ss_conf 068--------------7637999999----9970896756 Q gi|254780234|r 149 HWS--------------FKGRQDLIDF----AEKHAIPIDK 171 (404) Q Consensus 149 d~~--------------~~sRe~~i~y----a~~~gIpv~~ 171 (404) |+. +..|-..++| +.+.||||.. T Consensus 301 DmeAlA~GL~E~~de~yl~~Ri~qv~yl~~~L~~~GVPvv~ 341 (459) T PRK13237 301 DMEAMAIGIEESVQDDYIRHRIGQVRYLGEKLLAAGVPIVE 341 (459) T ss_pred HHHHHHCCHHHHCCHHHHHHHHHHHHHHHHHHHHCCCCEEC T ss_conf 69999634375436799998899999999999987997036 No 193 >COG0541 Ffh Signal recognition particle GTPase [Intracellular trafficking and secretion] Probab=44.33 E-value=23 Score=15.75 Aligned_cols=124 Identities=21% Similarity=0.262 Sum_probs=76.0 Q ss_pred CHHHHH--HHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCC Q ss_conf 725999--999998718986999994578711068999999973980799820089999987999973686547874201 Q gi|254780234|r 15 GLDTSI--ILKWLQVEKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYEGYYLLG 92 (404) Q Consensus 15 GLDTSv--~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Yeg~Ypl~ 92 (404) |--||+ +..||+. +|+.|--+++|+--+--++.++.-|...|+ ++|-.+.. T Consensus 112 GKTTt~~KLA~~lkk-~~~kvllVaaD~~RpAA~eQL~~La~q~~v-~~f~~~~~------------------------- 164 (451) T COG0541 112 GKTTTAGKLAKYLKK-KGKKVLLVAADTYRPAAIEQLKQLAEQVGV-PFFGSGTE------------------------- 164 (451) T ss_pred CHHHHHHHHHHHHHH-CCCCEEEEECCCCCHHHHHHHHHHHHHCCC-CEECCCCC------------------------- T ss_conf 748689999999997-499458985056786899999999986098-53167788------------------------- Q ss_pred CCHHHHH-HHHHHHHHHHHCCCCEEECHHHC-CCCHHHHHHH---HHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHCC Q ss_conf 3012433-59999999885689287500111-6821567999---99985879707744100687637999999997089 Q gi|254780234|r 93 TAIARPL-IAKYLVDIANETGADAIAHGSTG-KGNDQVRFEL---SAYSLNSDIEIIAPWRHWSFKGRQDLIDFAEKHAI 167 (404) Q Consensus 93 tslaRpl-ia~~lv~~a~~~ga~~iaHG~Tg-kGNDQvRFe~---~~~~l~P~l~viaP~Rd~~~~sRe~~i~ya~~~gI 167 (404) .-|+ ||+..++.|++.+.|+|-----| -+.|.-=|+- .-..+.|+=.++. .|-..-++...-|+..+= T Consensus 165 ---~~Pv~Iak~al~~ak~~~~DvvIvDTAGRl~ide~Lm~El~~Ik~~~~P~E~llV----vDam~GQdA~~~A~aF~e 237 (451) T COG0541 165 ---KDPVEIAKAALEKAKEEGYDVVIVDTAGRLHIDEELMDELKEIKEVINPDETLLV----VDAMIGQDAVNTAKAFNE 237 (451) T ss_pred ---CCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEE----EECCCCHHHHHHHHHHHH T ss_conf ---9979999999999997499889996887330309999999999855398748998----764445678999999866 Q ss_pred CCCCC Q ss_conf 67566 Q gi|254780234|r 168 PIDKN 172 (404) Q Consensus 168 pv~~~ 172 (404) .++.+ T Consensus 238 ~l~it 242 (451) T COG0541 238 ALGIT 242 (451) T ss_pred HCCCC T ss_conf 26986 No 194 >pfam11138 DUF2911 Protein of unknown function (DUF2911). This bacterial family of proteins has no known function. Probab=43.97 E-value=16 Score=16.93 Aligned_cols=42 Identities=26% Similarity=0.392 Sum_probs=32.4 Q ss_pred CCCHHHHCCCCCCCCCCCCCCCCCHHHCCCCCCHHHCCCCCEEEEEEEECC Q ss_conf 157243115457632458457682010014658122799963899999502 Q gi|254780234|r 180 SIDTNLLHSSSEGRVLEDPSQPAPEYVYKMIVSPEEAPDTPTTIRIDFQRG 230 (404) Q Consensus 180 S~D~Nlwg~S~Egg~Ledp~~~~pe~~~~~t~~p~~ap~~pe~v~I~Fe~G 230 (404) .-+.+.||.. ...+-+|+...+++|...|..-|.++|.|+.- T Consensus 84 n~~~~~wG~~---------~yd~~~Dvlrv~v~~~~~~~~~E~fti~f~~~ 125 (145) T pfam11138 84 NKDTNQWGAY---------TYDESKDVLRVTVPPTKLPEPVETFTIDFDET 125 (145) T ss_pred ECCCCCCCCC---------CCCHHHEEEEEEEEEEECCCCCEEEEEEEEEC T ss_conf 6665647866---------78803438999966472887406899999958 No 195 >COG1832 Predicted CoA-binding protein [General function prediction only] Probab=43.94 E-value=23 Score=15.71 Aligned_cols=102 Identities=20% Similarity=0.160 Sum_probs=65.4 Q ss_pred EEEEEECCCH--HHHHHHHHHHHHCCCEEEEEEE--E----CCCC--CHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHH Q ss_conf 8999914772--5999999998718986999994--5----7871--106899999997398079982008999998799 Q gi|254780234|r 7 KVVLAYSGGL--DTSIILKWLQVEKGLEVIVFIA--D----LGQG--EELKIASDKARLLGAKEVYVKDLRREFVRDFVF 76 (404) Q Consensus 7 kVvlaySGGL--DTSv~i~~L~~e~g~eVi~~~~--d----~Gq~--~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~ 76 (404) =.|++.|.-= +|-...++|+ ++||+|+-+.- + +|++ ..+.+|. ...+.+.+=.+-|++-+.+- T Consensus 19 IAvVG~S~~P~r~sy~V~kyL~-~~GY~ViPVNP~~~~~eiLG~k~y~sL~dIp------e~IDiVdvFR~~e~~~~i~~ 91 (140) T COG1832 19 IAVVGASDKPDRPSYRVAKYLQ-QKGYRVIPVNPKLAGEEILGEKVYPSLADIP------EPIDIVDVFRRSEAAPEVAR 91 (140) T ss_pred EEEEECCCCCCCCHHHHHHHHH-HCCCEEEEECCCCCHHHHCCCHHHHCHHHCC------CCCCEEEEECCHHHHHHHHH T ss_conf 9999457999861889999999-7899899558552057762831120587588------77757998528656388999 Q ss_pred HHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHC Q ss_conf 9973686547874201301243359999999885689287500111 Q gi|254780234|r 77 PMFRANALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTG 122 (404) Q Consensus 77 ~~I~ana~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~Tg 122 (404) .+|..++ ...|+.-.+ ......+.+++.|.++|..-|.. T Consensus 92 eal~~~~---kv~W~QlGi----~n~ea~~~~~~aG~~vV~nrCi~ 130 (140) T COG1832 92 EALEKGA---KVVWLQLGI----RNEEAAEKARDAGLDVVMDRCIM 130 (140) T ss_pred HHHHHCC---CEEEEECCC----CCHHHHHHHHHHCCHHHHHHHHH T ss_conf 9996178---859974585----79899999998373789876278 No 196 >PRK13303 L-aspartate dehydrogenase; Provisional Probab=43.72 E-value=23 Score=15.68 Aligned_cols=118 Identities=17% Similarity=0.226 Sum_probs=79.1 Q ss_pred CCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCCCHHH-----------HHHHHHHHCCCEEEEEECHHHHHHHH Q ss_conf 7789999147725999999998718986999994578711068-----------99999997398079982008999998 Q gi|254780234|r 5 VKKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQGEELK-----------IASDKARLLGAKEVYVKDLRREFVRD 73 (404) Q Consensus 5 ~kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~d~~-----------~~~~~A~~~Ga~~~~v~D~r~ef~~~ 73 (404) |.||.+.=-|-+-+.++ .+|+....+++-.+.++....+..+ .+.+ +.-.. +.+|.-+-.+-+.+ T Consensus 1 MmrVgiiG~GaIG~~va-~~l~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~--L~~~~-DlVVE~A~~~av~~ 76 (265) T PRK13303 1 MMKVAMIGFGAIAAAVY-ELLEHDPRLRVDWVIVPEHSVDAVRRALGRAVQVVSSVDA--LAQRP-DLVVECAGHAALKE 76 (265) T ss_pred CCEEEEECCCHHHHHHH-HHHHCCCCCEEEEEEECCCCHHHHHHHHCCCCCCCCCHHH--HHHCC-CEEEECCCHHHHHH T ss_conf 92999985468999999-9984499727999994685267787530458864479889--82379-99998988899999 Q ss_pred HHHHHHHCCCCCCCCCCCCC-CHHHHHHHHHHHHHHHHCCCC-EEECHHHCCCCHHHHH Q ss_conf 79999736865478742013-012433599999998856892-8750011168215679 Q gi|254780234|r 74 FVFPMFRANALYEGYYLLGT-AIARPLIAKYLVDIANETGAD-AIAHGSTGKGNDQVRF 130 (404) Q Consensus 74 ~i~~~I~ana~Yeg~Ypl~t-slaRplia~~lv~~a~~~ga~-~iaHG~TgkGNDQvRF 130 (404) |.-+.+.++. +-..+|+ +|+-+-.-+.+.+.|++.|+. +|..|+. -|-|-++- T Consensus 77 ~~~~~L~~g~---dl~v~SvgALaD~~~~~~l~~~A~~~~~~i~ipsGAi-gGlD~l~a 131 (265) T PRK13303 77 HVVPILKAGI---DCAVASVGALADEALRERLEQAAEAGGARLHLLSGAI-GGIDALAA 131 (265) T ss_pred HHHHHHHCCC---CEEEECCHHHCCHHHHHHHHHHHHHCCCEEEECCCHH-HHHHHHHH T ss_conf 9999997299---8899881585798899999999986897599726234-40679999 No 197 >cd01523 RHOD_Lact_B Member of the Rhodanese Homology Domain superfamily. This CD includes predicted proteins with rhodanese-like domains found N-terminal of the metallo-beta-lactamase domain. Probab=43.55 E-value=24 Score=15.66 Aligned_cols=21 Identities=29% Similarity=0.509 Sum_probs=12.8 Q ss_pred HHCCCEEEEEECHHHH--HHHHHH Q ss_conf 9739807998200899--999879 Q gi|254780234|r 54 RLLGAKEVYVKDLRRE--FVRDFV 75 (404) Q Consensus 54 ~~~Ga~~~~v~D~r~e--f~~~~i 75 (404) +..|- +.+++|+|++ |.+..| T Consensus 10 l~~~e-~~~llDVR~~~E~~~ghI 32 (100) T cd01523 10 LLAGQ-PLFILDVRNESDYERWKI 32 (100) T ss_pred HHCCC-CCEEEECCCHHHHHCCCC T ss_conf 97799-968998997788842845 No 198 >PRK09437 bcp thioredoxin-dependent thiol peroxidase; Reviewed Probab=43.43 E-value=24 Score=15.65 Aligned_cols=64 Identities=17% Similarity=0.026 Sum_probs=32.1 Q ss_pred CEEEEEECCCHHHHHHHHHHH---------HHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHH Q ss_conf 789999147725999999998---------718986999994578711068999999973980799820089999987 Q gi|254780234|r 6 KKVVLAYSGGLDTSIILKWLQ---------VEKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFVRDF 74 (404) Q Consensus 6 kkVvlaySGGLDTSv~i~~L~---------~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~ 74 (404) |+|||.|--+-.|+.|..-+. .+.|++|+++..|. .+..++=+.+.+..--.+-|...+.++.| T Consensus 31 k~vvl~Fyp~~~tp~C~~e~~~f~d~~~~f~~~g~~VigIS~D~-----~~~~~~f~~~~~l~f~lLsD~~~~v~~~y 103 (156) T PRK09437 31 KKVLVYFYPKAMTPGCTVQACGLRDNMDELKKAGVVVLGISPDK-----PEKLSKFAEKELLNFTLLSDEDHQVAEQF 103 (156) T ss_pred CEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEECCCC-----HHHHHHHHHHHCCCCCEEECCCHHHHHHC T ss_conf 97999997687899874688999999987532581899876888-----99999999970999748988980699984 No 199 >PRK09390 fixJ response regulator FixJ; Provisional Probab=43.12 E-value=24 Score=15.62 Aligned_cols=17 Identities=12% Similarity=0.149 Sum_probs=9.4 Q ss_pred CCCCHHHHHHHHHHHCC Q ss_conf 87637999999997089 Q gi|254780234|r 151 SFKGRQDLIDFAEKHAI 167 (404) Q Consensus 151 ~~~sRe~~i~ya~~~gI 167 (404) +..+|.+.+.||.++|+ T Consensus 185 ~v~~~~elv~~A~~~Gl 201 (202) T PRK09390 185 QAGSLSELVRMALRAGA 201 (202) T ss_pred CCCCHHHHHHHHHHCCC T ss_conf 88988999999998599 No 200 >PRK00724 formate dehydrogenase accessory protein; Reviewed Probab=43.00 E-value=24 Score=15.61 Aligned_cols=21 Identities=19% Similarity=0.205 Sum_probs=14.6 Q ss_pred EEEEECCCHHHHHHHHHHHHH Q ss_conf 999914772599999999871 Q gi|254780234|r 8 VVLAYSGGLDTSIILKWLQVE 28 (404) Q Consensus 8 VvlaySGGLDTSv~i~~L~~e 28 (404) +++.-|-|-|--.++-||..| T Consensus 41 ~tlm~tP~~~~eLa~GfL~sE 61 (262) T PRK00724 41 AVMMATPGDLEDFAIGFLLSE 61 (262) T ss_pred EEEEECCCCHHHHHHHHHHHC T ss_conf 999937988699986677425 No 201 >cd01524 RHOD_Pyr_redox Member of the Rhodanese Homology Domain superfamily. Included in this CD are the Lactococcus lactis NADH oxidase, Bacillus cereus NADH dehydrogenase, and Bacteroides thetaiotaomicron pyridine nucleotide-disulphide oxidoreductase, and similar rhodanese-like domains found C-terminal of the pyridine nucleotide-disulphide oxidoreductase (Pyr-redox) domain and the Pyr-redox dimerization domain. Probab=42.96 E-value=24 Score=15.60 Aligned_cols=15 Identities=27% Similarity=0.370 Sum_probs=9.9 Q ss_pred EEEEEECHHHHHHHHHH Q ss_conf 07998200899999879 Q gi|254780234|r 59 KEVYVKDLRREFVRDFV 75 (404) Q Consensus 59 ~~~~v~D~r~ef~~~~i 75 (404) ..+.++|+|+.. +|- T Consensus 12 ~~~~liDVR~~~--E~~ 26 (90) T cd01524 12 DGVTLIDVRTPQ--EFE 26 (90) T ss_pred CCEEEEECCCHH--HHH T ss_conf 998999798789--995 No 202 >PRK06801 hypothetical protein; Provisional Probab=42.93 E-value=24 Score=15.60 Aligned_cols=120 Identities=8% Similarity=0.121 Sum_probs=68.4 Q ss_pred HHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHH Q ss_conf 99998718986999994578711068999999973980799820089999987999973686547874201301243359 Q gi|254780234|r 22 LKWLQVEKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYEGYYLLGTAIARPLIA 101 (404) Q Consensus 22 i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Yeg~Ypl~tslaRplia 101 (404) +.--+ +.+|-|-+|.+. .-+.+..+-+-|....+ +.++. +-+-.. +| - .-+.++ T Consensus 10 L~~A~-~~~yAV~AfNv~--n~e~~~avi~AAee~~s-PvIlq-~s~~~~-~~-----------~---------~~~~~~ 63 (286) T PRK06801 10 LAHAR-KHGYALGAFNVL--DSHFLRALFAAAKQERS-PFIIN-IAEVHF-KY-----------I---------SLESLV 63 (286) T ss_pred HHHHH-HCCCEEEEEEEC--CHHHHHHHHHHHHHHCC-CEEEE-CCHHHH-HH-----------C---------CHHHHH T ss_conf 99999-889479998889--99999999999999787-98998-067577-56-----------6---------999999 Q ss_pred HHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEEE------EECCCCCCCHHHHHHHHHHHCCCCCCCC Q ss_conf 99999988568928750011168215679999998587970774------4100687637999999997089675665 Q gi|254780234|r 102 KYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEIIA------PWRHWSFKGRQDLIDFAEKHAIPIDKNK 173 (404) Q Consensus 102 ~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~via------P~Rd~~~~sRe~~i~ya~~~gIpv~~~~ 173 (404) ..+...|++-...+..|= --|.| ||...+++.....-+- |+- -++.--.+..+||.++|++|.... T Consensus 64 ~~~~~~a~~~~VPV~lHL--DHg~~---~e~i~~ai~~Gf~SVM~DgS~l~~e-eNi~~Tk~vve~ah~~gv~VEaEl 135 (286) T PRK06801 64 EAVKFEAAHHDIPVVLNL--DHGLH---FEAVVRALRLGFSSVMFDGSTLEYE-ENVRQTREVVKMCHAVGVSVEAEL 135 (286) T ss_pred HHHHHHHHHCCCCEEEEC--CCCCC---HHHHHHHHHCCCCEEEEECCCCCHH-HHHHHHHHHHHHHHHCCCEEEEEE T ss_conf 999999987799899989--99999---9999999982998799749989999-999999999999988498599996 No 203 >cd07202 cPLA2_Grp-IVC Group IVC cytoplasmic phospholipase A2; catalytic domain; Ca-independent. Group IVC cPLA2, a small 61 kDa protein, is a single domain alpha/beta hydrolase. It lacks a C2 domain; therefore, it has no Ca-dependence. Group IVC cPLA2 is also referred to as cPLA2-gamma. The cPLA2-gamma enzyme is predominantly found in cardiac and skeletal muscles, and to a lesser extent in the brain. Human cPLA2-gamma is approximately 30% identical to cPLA2-alpha. The catalytic domain of cytosolic phospholipase A2 (PLA2; EC 3.1.1.4) hydrolyzes the sn-2-acyl ester bond of phospholipids to release arachidonic acid. At the active site, cPLA2 contains a serine nucleophile through which the catalytic mechanism is initiated. The active site is partially covered by a solvent-accessible flexible lid. cPLA2 displays interfacial activation as it exists in both "closed lid" and "open lid" forms. Movement of the cPLA2 lid possibly exposes a greater hydrophobic surface and the active site. cPLA2 be Probab=42.80 E-value=24 Score=15.59 Aligned_cols=48 Identities=15% Similarity=0.131 Sum_probs=25.1 Q ss_pred EECCEECCHHHHHHHHHHHHHHCCC--CCEECCC--CCCCCCCCCCCCCCHH Q ss_conf 7516231899999999888642163--7410007--7320210001113737 Q gi|254780234|r 234 AINGQVMSPEVLLEQLNQYGRCNGI--GRIDIVE--NRFVGIKSRGVYETPG 281 (404) Q Consensus 234 ainG~~~~~~~li~~LN~igg~~Gv--Gr~d~vE--nr~vG~KsR~vYEaPg 281 (404) ++|--.-+|.+-++..-+-..+||| =+++.-. .-.-+.|.==|||-|- T Consensus 312 sfD~S~~~pf~~l~~t~~yc~~~~ipFP~i~~~~~~~d~~~pkecyvf~~~~ 363 (430) T cd07202 312 SFDFSEGDPFETIKDTAEYCRKHNIPFPQVDEAKLDQDAEAPKDFYVFKGEN 363 (430) T ss_pred EECCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCHHHCCCCCEEEEECCCC T ss_conf 8326778835268999999997499999899874401235886058765999 No 204 >PRK10365 transcriptional regulatory protein ZraR; Provisional Probab=42.57 E-value=24 Score=15.56 Aligned_cols=107 Identities=17% Similarity=0.253 Sum_probs=56.5 Q ss_pred HHHHHHHH-CCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCC---------C Q ss_conf 99999871-8986999994578711068999999973980799820089999987999973686547874---------2 Q gi|254780234|r 21 ILKWLQVE-KGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYEGYY---------L 90 (404) Q Consensus 21 ~i~~L~~e-~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Yeg~Y---------p 90 (404) ++..++.. ....||-+|+ + .+.+. .-+|++.||.+++...+.-+-....+..++.-.-..+... . T Consensus 67 lL~~l~~~~p~~pvIviT~---~-~~~~~-av~A~k~GA~Dyl~KP~~~~~L~~~i~~al~~~~~~~~~~~~~~~~~~~l 141 (441) T PRK10365 67 TLKEIKALNPAIPVLIMTA---Y-SSVET-AVEALKTGALDYLIKPLDFDNLQATLEKALAHTHSIDAETPAVSASQFGM 141 (441) T ss_pred HHHHHHHHCCCCCEEEEEC---C-CCHHH-HHHHHHHCCCEEEECCCCHHHHHHHHHHHHHHHHHHHHCCHHCCCCCCCC T ss_conf 9999984298982899969---9-99899-99999828512340788899999999999999987652000012225786 Q ss_pred CCCCHHHHHHH--HHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHH Q ss_conf 01301243359--9999998856892875001116821567999999 Q gi|254780234|r 91 LGTAIARPLIA--KYLVDIANETGADAIAHGSTGKGNDQVRFELSAY 135 (404) Q Consensus 91 l~tslaRplia--~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~ 135 (404) ++.| |.+- ...++.+-.-.+.++-.|-+|.|.+.+---.+.. T Consensus 142 iG~S---~am~~v~~~i~~~A~s~~pVLI~GE~GTGK~~~Ar~IH~~ 185 (441) T PRK10365 142 VGKS---PAMQHLLSEIALVAPSEATVLIHGDSGTGKELVARAIHAS 185 (441) T ss_pred EECC---HHHHHHHHHHHHHHCCCCCEEEECCCCHHHHHHHHHHHHC T ss_conf 6689---9999999999998488994899899981099999999965 No 205 >COG3033 TnaA Tryptophanase [Amino acid transport and metabolism] Probab=42.53 E-value=24 Score=15.56 Aligned_cols=113 Identities=19% Similarity=0.243 Sum_probs=54.8 Q ss_pred EECCCHHHHHHHHHHHHHCCCE-----EEEEEEEC--CCCCH---HHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHH Q ss_conf 9147725999999998718986-----99999457--87110---68999999973980799820089999987999973 Q gi|254780234|r 11 AYSGGLDTSIILKWLQVEKGLE-----VIVFIADL--GQGEE---LKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFR 80 (404) Q Consensus 11 aySGGLDTSv~i~~L~~e~g~e-----Vi~~~~d~--Gq~~d---~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ 80 (404) -|-|-.|---+-..+ .+-|.| |.++|.|. ||+.. ...+.+-|.+.|+. +++|. ..|+|+. +.|| T Consensus 164 pFKGd~D~~kLe~li-devG~~nvp~I~~tiT~NsagGQpVSm~n~r~v~~ia~ky~ip--vv~Da-~RfaENa--YFIk 237 (471) T COG3033 164 PFKGNFDLEKLERLI-DEVGADNVPYIVLTITNNSAGGQPVSMANMKAVYEIAKKYDIP--VVMDA-ARFAENA--YFIK 237 (471) T ss_pred CCCCCCCHHHHHHHH-HHHCCCCCCEEEEEEECCCCCCCCCHHHHHHHHHHHHHHCCCC--EEEEH-HHHHHHH--HHHH T ss_conf 877865799999999-8737145868999996266678712077699999999871995--89644-5665420--2234 Q ss_pred CC-CCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHH-----CCCCHHHHHHHHHHH Q ss_conf 68-654787420130124335999999988568928750011-----168215679999998 Q gi|254780234|r 81 AN-ALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGST-----GKGNDQVRFELSAYS 136 (404) Q Consensus 81 an-a~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~T-----gkGNDQvRFe~~~~~ 136 (404) -+ .-|.+. | + --|++.+..+|-..-+++--.|-- --=||.-+||++... T Consensus 238 ~rE~gYrd~----s-I--~~IarEm~sYaD~~~mS~KKD~lvnmGGfl~~~D~~~fDvy~~~ 292 (471) T COG3033 238 QREPGYRDW----S-I--EEIAREMYSYADGCTMSAKKDGLVNMGGFLCFKDDSFFDVYEEC 292 (471) T ss_pred HCCCCCCCC----C-H--HHHHHHHHHHHHHHEEECCCCCEECCCCEEEECCCCHHHHHHHH T ss_conf 247655546----0-9--99999998651243020254524224657886475077899998 No 206 >PRK08591 acetyl-CoA carboxylase biotin carboxylase subunit; Validated Probab=42.26 E-value=25 Score=15.53 Aligned_cols=22 Identities=41% Similarity=0.425 Sum_probs=14.4 Q ss_pred HHHHHHHHHHHHHCCCCEEECH Q ss_conf 3359999999885689287500 Q gi|254780234|r 98 PLIAKYLVDIANETGADAIAHG 119 (404) Q Consensus 98 plia~~lv~~a~~~ga~~iaHG 119 (404) ++=...++++|++.|||+|-.| T Consensus 60 Yln~~~Ii~~A~~~g~dAihPG 81 (449) T PRK08591 60 YLNIPAIISAAEITGADAIHPG 81 (449) T ss_pred CCCHHHHHHHHHHHCCCEEECC T ss_conf 0489999999998299989727 No 207 >PRK06029 3-octaprenyl-4-hydroxybenzoate carboxy-lyase; Provisional Probab=42.09 E-value=25 Score=15.51 Aligned_cols=148 Identities=18% Similarity=0.186 Sum_probs=70.0 Q ss_pred CCEEEEEECCCHHHHHHHHHHHH---HCCCEEEEEEEECCCC-------CHHHHHHHHHHHCCCEEEEEECHHHHHHHHH Q ss_conf 77899991477259999999987---1898699999457871-------1068999999973980799820089999987 Q gi|254780234|r 5 VKKVVLAYSGGLDTSIILKWLQV---EKGLEVIVFIADLGQG-------EELKIASDKARLLGAKEVYVKDLRREFVRDF 74 (404) Q Consensus 5 ~kkVvlaySGGLDTSv~i~~L~~---e~g~eVi~~~~d~Gq~-------~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~ 74 (404) |||||++.||.--.-+.+..|+. ..++||+.+.-+-++. .++++++..+. . +-|-. | T Consensus 1 MkrIvvgITGASGa~ya~rll~~L~~~~~~ev~lv~S~~A~~v~~~E~~~~~~~~~~~~~-----~--~~~~~-----d- 67 (187) T PRK06029 1 MKRLIVGISGASGAIYGVRLLQVLRDVEEVETHLVISQAARQTLAHELDFSLREVQALAD-----V--VHDVR-----D- 67 (187) T ss_pred CCEEEEEEECHHHHHHHHHHHHHHHHCCCCEEEEEECHHHHHHHHHHHCCCHHHHHHHHH-----E--ECCHH-----C- T ss_conf 973999985288899999999998642895699998868999999993998888655554-----0--47730-----1- Q ss_pred HHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHH-HHHHHHHHCCCCEEEEEECCCCCC Q ss_conf 9999736865478742013012433599999998856892875001116821567-999999858797077441006876 Q gi|254780234|r 75 VFPMFRANALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQVR-FELSAYSLNSDIEIIAPWRHWSFK 153 (404) Q Consensus 75 i~~~I~ana~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvR-Fe~~~~~l~P~l~viaP~Rd~~~~ 153 (404) ++-.|..+ .++.-.=+==||-.+.+-.+|.-.. .|=-.| .+++.+-- -+.+---||-.|. T Consensus 68 ~~A~iASG-----S~~~d~MvI~PCSm~TLa~IA~G~s-----------dnLi~RaAdV~LKEr---R~LvlvpREtPls 128 (187) T PRK06029 68 IGASIASG-----SFGTDGMVIAPCSMKTLAKIAHGYS-----------DNLLTRAADVMLKER---RRLVLCVRETPLH 128 (187) T ss_pred CCCCCCCC-----CCHHCCEEEEECCHHHHHHHHHCCC-----------CCHHHHHHHHHHHHC---CEEEEEECCCCCC T ss_conf 07875545-----6300757994154889999970045-----------888999999998717---8189997688976 Q ss_pred --CHHHHHHHHHHHCCCCCCCCCC-CCCCCCCHH Q ss_conf --3799999999708967566567-886415724 Q gi|254780234|r 154 --GRQDLIDFAEKHAIPIDKNKRG-EAPFSIDTN 184 (404) Q Consensus 154 --sRe~~i~ya~~~gIpv~~~~~~-~~~yS~D~N 184 (404) .-+.+....+.-.|=+|..+.- .+|-|+|+- T Consensus 129 ~ihLeNMl~ls~~GaiI~P~~P~FY~~p~si~dl 162 (187) T PRK06029 129 LGHLRNMTKLAEMGAIIMPPVPAFYHRPQTLEDM 162 (187) T ss_pred HHHHHHHHHHHHCCCEEECCCCCCCCCCCCHHHH T ss_conf 8999999999988998947982002799989999 No 208 >PRK04148 hypothetical protein; Provisional Probab=42.02 E-value=25 Score=15.50 Aligned_cols=82 Identities=24% Similarity=0.380 Sum_probs=48.4 Q ss_pred CHHHHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 72599999999871898699999457871106899999997398079982008999998799997368654787420130 Q gi|254780234|r 15 GLDTSIILKWLQVEKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYEGYYLLGTA 94 (404) Q Consensus 15 GLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Yeg~Ypl~ts 94 (404) |--+.++ ..|+ +.|++|+|.-+|-. +-++|..+|. .++.-|+-+- -+ .+|+|--. .-| T Consensus 26 Gf~~~vA-~~L~-e~g~dv~~~Din~~-------aV~~a~~~Gl-~~v~DDif~P--------~l---~iY~~a~l-IYS 83 (135) T PRK04148 26 GFYFKVA-KKLK-ESGFDVIVIDINKK-------AVEKAKKLGL-NAFVDDIFNP--------NL---EIYKNAKL-IYS 83 (135) T ss_pred CCCHHHH-HHHH-HCCCCEEEEECCHH-------HHHHHHHCCC-CEEECCCCCC--------CH---HHHCCCCE-EEE T ss_conf 6667899-9998-74998899957655-------4322123287-6375158899--------88---88517887-998 Q ss_pred HHH--HHHHHHHHHHHHHCCCCEEECH Q ss_conf 124--3359999999885689287500 Q gi|254780234|r 95 IAR--PLIAKYLVDIANETGADAIAHG 119 (404) Q Consensus 95 laR--plia~~lv~~a~~~ga~~iaHG 119 (404) + | |=+...++++|++.||+.+-.- T Consensus 84 I-RPp~Elq~~il~lakkv~~dliI~P 109 (135) T PRK04148 84 I-RPPRDLQPFILELAKKLNCPLYIKP 109 (135) T ss_pred E-CCCHHHHHHHHHHHHHCCCCEEEEC T ss_conf 1-8987888999999997599899970 No 209 >cd01977 Nitrogenase_VFe_alpha Nitrogenase_VFe_alpha -like: Nitrogenase VFe protein, alpha subunit like. This group contains proteins similar to the alpha subunits of, the VFe protein of the vanadium-dependent (V-) nitrogenase and the FeFe protein of the iron only (Fe-) nitrogenase Nitrogenase catalyzes the ATP-dependent reduction of dinitrogen (N2) to ammonia. In addition to V- and Fe- nitrogenases there is a molybdenum (Mo)-dependent nitrogenase which is the most widespread and best characterized of these systems. These systems consist of component 1 (VFe protein, FeFe protein or, MoFe protein respectively) and, component 2 (Fe protein). MoFe is an alpha2beta2 tetramer, V-and Fe- nitrogenases are alpha2beta2delta2 hexamers. The alpha and beta subunits of VFe and FeFe are similar to the alpha and beta subunits of MoFe. For MoFe each alphabeta pair contains one P-cluster (at the alphabeta interface) and, one molecule of iron molybdenum cofactor (FeMoco) contained within the alpha sub Probab=41.94 E-value=25 Score=15.50 Aligned_cols=22 Identities=14% Similarity=0.504 Sum_probs=11.1 Q ss_pred CCCCCCHHHHH-HHHHHHCCCCC Q ss_conf 06876379999-99997089675 Q gi|254780234|r 149 HWSFKGRQDLI-DFAEKHAIPID 170 (404) Q Consensus 149 d~~~~sRe~~i-~ya~~~gIpv~ 170 (404) ++++.+--+++ .++++-||+|- T Consensus 170 ~~~~~gd~~eikrll~~~Gi~v~ 192 (415) T cd01977 170 DYNIQGDTEVLQKYFERMGIQVL 192 (415) T ss_pred CCCCCCCHHHHHHHHHHCCCCEE T ss_conf 98885528999999997599579 No 210 >PRK12857 putative aldolase; Reviewed Probab=41.92 E-value=25 Score=15.49 Aligned_cols=115 Identities=13% Similarity=0.203 Sum_probs=58.0 Q ss_pred HCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 18986999994578711068999999973980799820089999987999973686547874201301243359999999 Q gi|254780234|r 28 EKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYEGYYLLGTAIARPLIAKYLVDI 107 (404) Q Consensus 28 e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Yeg~Ypl~tslaRplia~~lv~~ 107 (404) +.+|-|-+|.+. .-+.+..+-+-|....+ +.++ .+-+-.. +|. ....++..+... T Consensus 15 ~~~yAV~AfNv~--n~e~~~avi~AAee~~s-PvIl-q~s~~~~-~~~--------------------g~~~~~~~~~~~ 69 (284) T PRK12857 15 KGGYAVGAFNCN--NMEIVQAIVAAAEAERS-PVII-QASQGAI-KYA--------------------GIEYISAMVRTA 69 (284) T ss_pred HCCCEEEEEEEC--CHHHHHHHHHHHHHHCC-CEEE-ECCCCHH-HHC--------------------CHHHHHHHHHHH T ss_conf 889179987889--89999999999999789-9899-9171477-657--------------------999999999999 Q ss_pred HHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEE------EEECCCCCCCHHHHHHHHHHHCCCCCCCC Q ss_conf 8856892875001116821567999999858797077------44100687637999999997089675665 Q gi|254780234|r 108 ANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEII------APWRHWSFKGRQDLIDFAEKHAIPIDKNK 173 (404) Q Consensus 108 a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~vi------aP~Rd~~~~sRe~~i~ya~~~gIpv~~~~ 173 (404) |++-...+..|= --|.| ||...+++.-...-+ -|+ +-++.--.+..+||.++|++|.... T Consensus 70 a~~~~VpV~lHL--DH~~~---~e~i~~ai~~Gf~SVM~DgS~l~~-eeNi~~Tk~vv~~ah~~gv~VEaEl 135 (284) T PRK12857 70 AEKASVPVALHL--DHGTD---FEQVMKCIRNGFTSVMIDGSKLPL-EENIALTKKVVEIAHAVGVSVEAEL 135 (284) T ss_pred HHHCCCCEEEEC--CCCCC---HHHHHHHHHCCCCEEEEECCCCCH-HHHHHHHHHHHHHHHHCCCEEEEEE T ss_conf 997699899967--98899---999999998099879972898999-9999999999999987089158853 No 211 >PRK06111 acetyl-CoA carboxylase biotin carboxylase subunit; Validated Probab=41.70 E-value=25 Score=15.47 Aligned_cols=71 Identities=20% Similarity=0.276 Sum_probs=38.0 Q ss_pred HHHHHHHHHHHHHCCCCEEECHH-------------H-----CCC---------CHHHHHHHHHHHHCCCCEE-EEEECC Q ss_conf 33599999998856892875001-------------1-----168---------2156799999985879707-744100 Q gi|254780234|r 98 PLIAKYLVDIANETGADAIAHGS-------------T-----GKG---------NDQVRFELSAYSLNSDIEI-IAPWRH 149 (404) Q Consensus 98 plia~~lv~~a~~~ga~~iaHG~-------------T-----gkG---------NDQvRFe~~~~~l~P~l~v-iaP~Rd 149 (404) ++=...++++|++.|||+|-.|= . .-| -|..++-.. +-+..| +.|+-+ T Consensus 60 Yln~~~Ii~~A~~~g~dAIHPGYGFLSEna~FA~~~~~~gi~fIGP~~~~i~~~GdK~~ar~~----a~~~gvPvvPG~~ 135 (449) T PRK06111 60 YLNLEKIIEIAKKTGAEAIHPGYGLLSENASFAERCKEEGIVFIGPSAEIIAKMGSKIEARRA----MQAAGVPVVPGIT 135 (449) T ss_pred HCCHHHHHHHHHHHCCCEEECCCCCCCCCHHHHHHHHHCCCEEECCCHHHHHHHHCHHHHHHH----HHHCCCCCCCCCC T ss_conf 339999999999839999977987311698999999988998999299999986486999999----9983998557867 Q ss_pred CCCCCHHHHHHHHHHHCCCCCCC Q ss_conf 68763799999999708967566 Q gi|254780234|r 150 WSFKGRQDLIDFAEKHAIPIDKN 172 (404) Q Consensus 150 ~~~~sRe~~i~ya~~~gIpv~~~ 172 (404) -..++-++..+.+++.|.||=.. T Consensus 136 ~~~~~~~ea~~~a~~iGyPvliK 158 (449) T PRK06111 136 TALEDAEEAIAIARQIGYPVMLK 158 (449) T ss_pred CCCCCHHHHHHHHHHCCCCEEEE T ss_conf 77899999999998659802662 No 212 >PRK13238 tnaA tryptophanase; Provisional Probab=41.50 E-value=25 Score=15.45 Aligned_cols=87 Identities=17% Similarity=0.280 Sum_probs=57.0 Q ss_pred EECCCHHHHHHHHHHHHHCCCE-----EEEEEEECC--CCC---HHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHH Q ss_conf 9147725999999998718986-----999994578--711---068999999973980799820089999987999973 Q gi|254780234|r 11 AYSGGLDTSIILKWLQVEKGLE-----VIVFIADLG--QGE---ELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFR 80 (404) Q Consensus 11 aySGGLDTSv~i~~L~~e~g~e-----Vi~~~~d~G--q~~---d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ 80 (404) -|.|-+|--.+-..|+ ++|.+ ++++|.|.+ |+- ++..+.+-|.+.|+ + +++|. ..|+|+ + ..|+ T Consensus 156 pFKGN~Dl~kLe~~i~-~~g~~~I~~v~~tiTnN~~GGQPVSm~Nir~v~~la~~~~i-p-~~lDa-aRfaEN-A-yFIk 229 (461) T PRK13238 156 PFKGNFDLEKLEALIE-EVGADNIPFIVMTITNNSAGGQPVSMANLRAVYEIAKKYGI-P-VVLDA-ARFAEN-A-YFIK 229 (461) T ss_pred CCCCCCCHHHHHHHHH-HHCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHHHHHCCC-C-EEEEH-HHHHHH-H-HHHH T ss_conf 9788779999999998-73844654899998626878816468999999999998299-5-99865-667664-5-7887 Q ss_pred CC-CCCCCCCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 68-6547874201301243359999999885 Q gi|254780234|r 81 AN-ALYEGYYLLGTAIARPLIAKYLVDIANE 110 (404) Q Consensus 81 an-a~Yeg~Ypl~tslaRplia~~lv~~a~~ 110 (404) -. .-|.++. - .=|++.+..+|-- T Consensus 230 ~RE~gY~d~s----i---~eI~rEmfs~aDg 253 (461) T PRK13238 230 QREPGYKDKS----I---KEITREMFSYADG 253 (461) T ss_pred HCCCCCCCCC----H---HHHHHHHHHHHHE T ss_conf 3253125799----9---9999999845237 No 213 >TIGR01328 met_gam_lyase methionine gamma-lyase; InterPro: IPR006237 This family of sequences is a methionine gamma-lyase subset of a family of PLP-dependent trans-sulphuration enzymes. The member from the parasite Trichomonas vaginalis is described as catalyzing alpha gamma- and alpha-beta eliminations and gamma-replacement reactions on methionine, cysteine, and some derivatives. Likewise, the enzyme from Pseudomonas degrades cysteine as well as methionine.. Probab=41.49 E-value=25 Score=15.45 Aligned_cols=87 Identities=23% Similarity=0.507 Sum_probs=54.8 Q ss_pred CHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHH-- Q ss_conf 106899999997398079982008999998799997368654787420130124335999999988568928750011-- Q gi|254780234|r 44 EELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGST-- 121 (404) Q Consensus 44 ~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~T-- 121 (404) -|++.+-+-|..-|-.++++ |= -|+.|+|+ ||+ .+|||.|-|-+| T Consensus 161 iD~e~v~r~A~~~gG~kViv-DN------TfatP~lt----------------~P~----------alG~D~VVHSATKY 207 (392) T TIGR01328 161 IDLELVVRVAHEKGGVKVIV-DN------TFATPYLT----------------RPV----------ALGVDVVVHSATKY 207 (392) T ss_pred HHHHHHHHHHHHCCCEEEEE-EC------CCCCHHHC----------------CHH----------HCCCCEEEECCCCC T ss_conf 75999999997529828998-56------62770013----------------803----------61877689424121 Q ss_pred -------------CCCCH--HHHHHHHHHHHCCCCEEEEEECCCC----C-----------CCHHHHHHHHHHHC Q ss_conf -------------16821--5679999998587970774410068----7-----------63799999999708 Q gi|254780234|r 122 -------------GKGND--QVRFELSAYSLNSDIEIIAPWRHWS----F-----------KGRQDLIDFAEKHA 166 (404) Q Consensus 122 -------------gkGND--QvRFe~~~~~l~P~l~viaP~Rd~~----~-----------~sRe~~i~ya~~~g 166 (404) |+-.+ |+|.= .++-+ -..||+|+--|= + .+=+...+|+++|= T Consensus 208 igGHgDvVaG~~~G~~e~~q~iR~~-g~KD~--tG~ViSPfdAwL~lRGLkTL~~RM~rhsenA~kvAe~Lk~HP 279 (392) T TIGR01328 208 IGGHGDVVAGVLIGDAELLQRIRLV-GVKDM--TGAVISPFDAWLVLRGLKTLAVRMKRHSENAEKVAEYLKEHP 279 (392) T ss_pred CCCCHHHHHHHHCCCHHHHHHHCCC-CCCCC--CCCEECHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 3761221122110775667440111-10005--665005578999974366777888773053899999852388 No 214 >cd06283 PBP1_RegR_EndR_KdgR_like Ligand-binding domain of DNA transcription repressor RegR and other putative regulators such as KdgR and EndR. Ligand-binding domain of DNA transcription repressor RegR and other putative regulators such as KdgR and EndR, all of which are members of the LacI-GalR family of bacterial transcription regulators. RegR regulates bacterial competence and the expression of virulence factors, including hyaluronidase. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA b Probab=41.34 E-value=25 Score=15.43 Aligned_cols=109 Identities=17% Similarity=0.097 Sum_probs=50.6 Q ss_pred HHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEE--CHHHHHHHHHH---HHHHHCCCCCCC-CCCCCCCHHHHHH Q ss_conf 71898699999457871106899999997398079982--00899999879---999736865478-7420130124335 Q gi|254780234|r 27 VEKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVK--DLRREFVRDFV---FPMFRANALYEG-YYLLGTAIARPLI 100 (404) Q Consensus 27 ~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~--D~r~ef~~~~i---~~~I~ana~Yeg-~Ypl~tslaRpli 100 (404) .++||.++.++.+-....+.+.+ +.....++.-+++. +..++..+... .|.+..+....+ .++. ....-.-. T Consensus 26 ~~~gy~l~i~~s~~~~~~e~~~i-~~l~~~~vDgiIl~~~~~~~~~~~~~~~~~iPvV~i~~~~~~~~~~~-V~~D~~~~ 103 (267) T cd06283 26 RAHGYQVLVCNSDNDPEKEKEYL-ESLLAYQVDGLIVNPTGNNKELYQRLAKNGKPVVLVDRKIPELGVDT-VTLDNYEA 103 (267) T ss_pred HHCCCEEEEEECCCCHHHHHHHH-HHHHHCCCCEEEECCCCCCHHHHHHHHHCCCCEEEECCCCCCCCCCE-EEECHHHH T ss_conf 98699899997899989999999-99996699999985877998999999976998999688578999988-99777999 Q ss_pred HHHHHHHHHHCCCCEEECHH---HCCCCHHHHHHHHHHHH Q ss_conf 99999998856892875001---11682156799999985 Q gi|254780234|r 101 AKYLVDIANETGADAIAHGS---TGKGNDQVRFELSAYSL 137 (404) Q Consensus 101 a~~lv~~a~~~ga~~iaHG~---TgkGNDQvRFe~~~~~l 137 (404) ++.+++...+.|..-++.=+ ++.-..+-|++....++ T Consensus 104 ~~~~~~~L~~~G~~~i~~i~~~~~~~~~~~~R~~g~~~a~ 143 (267) T cd06283 104 AKEAVDHLIEKGYERILFVTEPLDEISPRMERYEGFKEAL 143 (267) T ss_pred HHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHHH T ss_conf 9999999997599729999568778837999999999999 No 215 >cd01537 PBP1_Repressors_Sugar_Binding_like Ligand-binding domain of the LacI-GalR family of transcription regulators and the sugar-binding domain of ABC-type transport systems. Ligand-binding domain of the LacI-GalR family of transcription regulators and the sugar-binding domain of ABC-type transport systems, all of which contain the type I periplasmic binding protein-like fold. Their specific ligands include lactose, ribose, fructose, xylose, arabinose, galactose/glucose, and other sugars. The LacI family of proteins consists of transcriptional regulators related to the lac repressor; in general the sugar binding domain in this family binds a sugar, which in turn changes the DNA binding activity of the repressor domain. The core structure of the periplasmic binding proteins is classified into two types and they differ in number and order of beta strands in each domain: type I, which has six beta strands, and type II, which has five beta strands. These two distinct structural arrangem Probab=41.31 E-value=25 Score=15.43 Aligned_cols=42 Identities=19% Similarity=0.123 Sum_probs=18.1 Q ss_pred HHHHHHHHHHHHHHHCCCCEEEC--HHHCCCCHHHHHHHHHHHH Q ss_conf 24335999999988568928750--0111682156799999985 Q gi|254780234|r 96 ARPLIAKYLVDIANETGADAIAH--GSTGKGNDQVRFELSAYSL 137 (404) Q Consensus 96 aRplia~~lv~~a~~~ga~~iaH--G~TgkGNDQvRFe~~~~~l 137 (404) .-.-.++.+.+...+.|..-|+- |..+-.+.+.|.+....++ T Consensus 101 d~~~~g~~~~~~l~~~g~~~i~~i~~~~~~~~~~~R~~g~~~~~ 144 (264) T cd01537 101 DNEQAGYLAGEHLAEKGHRRIALLAGPLGSSTARERVAGFKDAL 144 (264) T ss_pred CHHHHHHHHHHHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHH T ss_conf 77999999999998629862453048877706999999999999 No 216 >cd00293 USP_Like Usp: Universal stress protein family. The universal stress protein Usp is a small cytoplasmic bacterial protein whose expression is enhanced when the cell is exposed to stress agents. Usp enhances the rate of cell survival during prolonged exposure to such conditions, and may provide a general "stress endurance" activity. The crystal structure of Haemophilus influenzae Usp reveals an alpha/beta fold similar to that of the Methanococcus jannaschii MJ0577 protein, which binds ATP, athough Usp lacks ATP-binding activity. Probab=41.28 E-value=25 Score=15.43 Aligned_cols=104 Identities=19% Similarity=0.277 Sum_probs=57.3 Q ss_pred EEEEEECCCHHHHHHHHHHH---HHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCC Q ss_conf 89999147725999999998---718986999994578711068999999973980799820089999987999973686 Q gi|254780234|r 7 KVVLAYSGGLDTSIILKWLQ---VEKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANA 83 (404) Q Consensus 7 kVvlaySGGLDTSv~i~~L~---~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana 83 (404) ||++++.+.-.+..++.|.. ...+.+++.+++.--............ .+-.++..+.+....-..+. T Consensus 1 ~Ilv~vd~s~~s~~al~~a~~la~~~~~~l~~v~v~~~~~~~~~~~~~~~----------~~~~~~~l~~~~~~~~~~~~ 70 (130) T cd00293 1 RILVAVDGSEESERALRWAARLARRLGAELVLLHVVDPPPSSAAELAELL----------EEEARALLEALREALAEAGV 70 (130) T ss_pred CEEEEECCCHHHHHHHHHHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHH----------HHHHHHHHHHHHHHHHCCCC T ss_conf 99999898989999999999999872997999999728865530144678----------99999999999998542798 Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCH Q ss_conf 5478742013012433599999998856892875001116821 Q gi|254780234|r 84 LYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGND 126 (404) Q Consensus 84 ~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGND 126 (404) -++- ....+.+ ++.+++.+++.++|.|.=|+.+++.- T Consensus 71 ~~~~----~~~~~~~--~~~i~~~a~~~~~dliV~G~~~~~~~ 107 (130) T cd00293 71 KVET----VVLEGDP--AEAILEAAEELGADLIVMGSRGRSGL 107 (130) T ss_pred CEEE----EEEECCH--HHHHHHHHHHCCCCEEEEECCCCCCC T ss_conf 3899----9994661--89998877761188999947999754 No 217 >PRK10653 D-ribose transporter subunit RbsB; Provisional Probab=41.20 E-value=26 Score=15.42 Aligned_cols=65 Identities=14% Similarity=0.151 Sum_probs=22.7 Q ss_pred HHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHCCCC Q ss_conf 99999988568928750011168215679999998587970774410068763799999999708967 Q gi|254780234|r 102 KYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDLIDFAEKHAIPI 169 (404) Q Consensus 102 ~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~i~ya~~~gIpv 169 (404) +.+-+.|++.|.+.+...+.+--+.|.+.=..+..-.++.-+++|.- -..-.+.+..+.+.|||+ T Consensus 46 ~gie~~a~~~Gy~l~v~~s~~d~~~q~~~i~~li~~~vdgiii~p~d---~~~~~~~~~~a~~~gIPv 110 (295) T PRK10653 46 DGAQKEADKLGYNLVVLDSQNNPAKELANVQDLTVRGTKILLINPTD---SDAVGNAVKMANQANIPV 110 (295) T ss_pred HHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHHHCCCCEEEECCCC---HHHHHHHHHHHHHCCCEE T ss_conf 99999999769989998399999999999999997699889984777---065699999999769819 No 218 >pfam00609 DAGK_acc Diacylglycerol kinase accessory domain. Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. This domain is assumed to be an accessory domain: its function is unknown. Probab=41.19 E-value=20 Score=16.18 Aligned_cols=20 Identities=25% Similarity=0.527 Sum_probs=17.8 Q ss_pred CEEEEEEEECCEEEEECCEE Q ss_conf 63899999502478751623 Q gi|254780234|r 220 PTTIRIDFQRGDPIAINGQV 239 (404) Q Consensus 220 pe~v~I~Fe~G~PVainG~~ 239 (404) ...|.|+|++..|+.||||+ T Consensus 138 ~~~v~i~~~~~~p~QvDGEP 157 (157) T pfam00609 138 GGPIRIETKKKIPMQVDGEP 157 (157) T ss_pred CCEEEEEECCCCEEEECCCC T ss_conf 77799998788247738999 No 219 >cd01320 ADA Adenosine deaminase (ADA) is a monomeric zinc dependent enzyme which catalyzes the irreversible hydrolytic deamination of both adenosine, as well as desoxyadenosine, to ammonia and inosine or desoxyinosine, respectively. ADA plays an important role in the purine pathway. Low, as well as high levels of ADA activity have been linked to several diseases. Probab=41.11 E-value=26 Score=15.41 Aligned_cols=21 Identities=24% Similarity=0.480 Sum_probs=16.2 Q ss_pred EECCCHHHHHHHHHHHHHCCCE Q ss_conf 9147725999999998718986 Q gi|254780234|r 11 AYSGGLDTSIILKWLQVEKGLE 32 (404) Q Consensus 11 aySGGLDTSv~i~~L~~e~g~e 32 (404) =.+|++..++++.|.+ ++|.+ T Consensus 10 HL~Gsi~~~tl~~la~-~~~~~ 30 (325) T cd01320 10 HLDGSLRPETILELAK-KNGIT 30 (325) T ss_pred CCCCCCCHHHHHHHHH-HCCCC T ss_conf 7778889999999999-71999 No 220 >TIGR01074 rep ATP-dependent DNA helicase Rep; InterPro: IPR005752 RepA hexameric DNA helicase contain ATP-binding domains similar to those seen in monomeric helicases but which are arranged in a ring. There is compelling evidence to suggest that a single ssDNA molecule passes through the centre of the hexameric ring. Activity of the enzyme is based upon two separate but coupled activities, ssDNA translocation and duplex destabilisation, and is driven by energy derived from the continuous ATP-binding and hydrolysis events that take place in the active-site cleft. The resulting conformational changes that accompany these events underpin the coupling process and allow the helicase to translocate along the DNA, destabilizing the duplex and separating the two strands in an active process ; GO: 0004003 ATP-dependent DNA helicase activity, 0006268 DNA unwinding during replication, 0005737 cytoplasm. Probab=41.07 E-value=26 Score=15.41 Aligned_cols=20 Identities=20% Similarity=0.316 Sum_probs=11.7 Q ss_pred HHHHHHHHHHHHCCCCCEEC Q ss_conf 99999988864216374100 Q gi|254780234|r 244 VLLEQLNQYGRCNGIGRIDI 263 (404) Q Consensus 244 ~li~~LN~igg~~GvGr~d~ 263 (404) --+++|++.|..+.+-.... T Consensus 419 aTL~KL~~~A~~~~~Slf~A 438 (677) T TIGR01074 419 ATLEKLGEYAAEKNKSLFEA 438 (677) T ss_pred HHHHHHHHHHHHCCCCHHHH T ss_conf 56888999997407978899 No 221 >smart00829 PKS_ER Enoylreductase. Enoylreductase in Polyketide synthases. Probab=41.01 E-value=26 Score=15.40 Aligned_cols=54 Identities=31% Similarity=0.553 Sum_probs=25.4 Q ss_pred HHHHHH-HCCCCEEEC-HHHCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHCCC Q ss_conf 999988-568928750-01116821567999999858797077441006876379999999970896 Q gi|254780234|r 104 LVDIAN-ETGADAIAH-GSTGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDLIDFAEKHAIP 168 (404) Q Consensus 104 lv~~a~-~~ga~~iaH-G~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~i~ya~~~gIp 168 (404) +.+.++ +.|-.++-| ||.|-|-=-+.+ .++++ .+||+-. +.++..+++++.|++ T Consensus 96 l~~~~~i~~g~~VlI~gg~G~vG~~aiql---ak~~G--a~Vi~t~------~s~~k~~~~~~lG~~ 151 (288) T smart00829 96 LVDLARLRPGESVLIHAAAGGVGQAAIQL---AQHLG--AEVFATA------GSPEKRDFLRELGIP 151 (288) T ss_pred HHHHCCCCCCCEEEEECCCCHHHHHHHHH---HHHCC--CCEEEEE------CCHHHHHHHHHCCCC T ss_conf 99750889999999978986777999999---99739--8300340------888999999976999 No 222 >pfam01156 IU_nuc_hydro Inosine-uridine preferring nucleoside hydrolase. Probab=40.77 E-value=26 Score=15.37 Aligned_cols=48 Identities=21% Similarity=0.293 Sum_probs=35.5 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHH Q ss_conf 7899991477259999999987189869999945787110689999999 Q gi|254780234|r 6 KKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQGEELKIASDKAR 54 (404) Q Consensus 6 kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~ 54 (404) +||++=--.|.|-.+++.++......||+++|+--|.. ..+...+.+. T Consensus 1 ~kvIiDtD~G~DDa~Al~~al~~~~~~l~git~~~Gn~-~~~~~~~n~~ 48 (303) T pfam01156 1 RKVILDTDPGIDDALALLLALASPELDLLGITTVAGNV-PLEKTTRNAL 48 (303) T ss_pred CCEEEECCCCHHHHHHHHHHHHCCCCEEEEEEEECCCC-CHHHHHHHHH T ss_conf 94999899974899999999669998899999925986-8999999999 No 223 >COG1564 THI80 Thiamine pyrophosphokinase [Coenzyme metabolism] Probab=40.08 E-value=11 Score=17.95 Aligned_cols=198 Identities=17% Similarity=0.129 Sum_probs=92.9 Q ss_pred EEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEE-ECHHHHHHHHHHHHHHHCCCCC Q ss_conf 899991477259999999987189869999945787110689999999739807998-2008999998799997368654 Q gi|254780234|r 7 KVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYV-KDLRREFVRDFVFPMFRANALY 85 (404) Q Consensus 7 kVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v-~D~r~ef~~~~i~~~I~ana~Y 85 (404) ..++---||.+.+.=.-+-..+ ... -++||=| ...+..+|..+-++ =|+- - +.+-.+..++-...+ T Consensus 2 ~~~~il~g~~~~~~~~~~~~~~-~~~--~v~aDgG--------a~~l~~~gl~P~~~vGDfD-S-v~~e~~~~~~~~~~~ 68 (212) T COG1564 2 SAVLILNGGILAPTDRLDYLWK-FDK--IVAADGG--------ANHLLELGLVPDLAVGDFD-S-VSEELLAYYKEKTVT 68 (212) T ss_pred CEEEEECCCCCCCHHHHHHCCC-CCE--EEEECCH--------HHHHHHCCCCCCEEEECCC-C-CCHHHHHHHHHCCCC T ss_conf 4699971765578256432045-666--9998708--------8899984998657872464-5-679999998645762 Q ss_pred CCCCC---CCCCHHHHHHHHHHHHHHHHCCC-CEEECHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHH Q ss_conf 78742---01301243359999999885689-287500111682156799999985879707744100687637999999 Q gi|254780234|r 86 EGYYL---LGTAIARPLIAKYLVDIANETGA-DAIAHGSTGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDLIDF 161 (404) Q Consensus 86 eg~Yp---l~tslaRplia~~lv~~a~~~ga-~~iaHG~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~i~y 161 (404) -.+| -.|- .+..+..|.++|+ +.+.-|++|. |+|-.+..+.==.+-.+--....+.++++++.+ T Consensus 69 -~~f~~eKd~TD------~elAl~~a~e~g~d~i~i~Ga~GG-----R~DH~l~nl~ll~~~~~~~~~i~l~~~~n~~~~ 136 (212) T COG1564 69 -IKFPAEKDSTD------LELALDEALERGADEIVILGALGG-----RLDHALANLFLLLRPAKSGFKITLISGQNLITP 136 (212) T ss_pred -EECCHHHCCCH------HHHHHHHHHHCCCCEEEEEECCCC-----HHHHHHHHHHHHHHHHHCCCEEEEECCCCEEEE T ss_conf -34683435466------999999999809977999815788-----189999999998754420646999637857998 Q ss_pred HHHHCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCCHHHCCCCCCHH--HCCCCCEEEEEEEECCEEEEE Q ss_conf 99708967566567886415724311545763245845768201001465812--279996389999950247875 Q gi|254780234|r 162 AEKHAIPIDKNKRGEAPFSIDTNLLHSSSEGRVLEDPSQPAPEYVYKMIVSPE--EAPDTPTTIRIDFQRGDPIAI 235 (404) Q Consensus 162 a~~~gIpv~~~~~~~~~yS~D~Nlwg~S~Egg~Ledp~~~~pe~~~~~t~~p~--~ap~~pe~v~I~Fe~G~PVai 235 (404) +..-..-++..+ ..+||+=. +-..++-.++.-..+..+..+....+.- +.-. -..++++|++|.-+-+ T Consensus 137 l~~g~~~i~~~~--~~~~s~f~---~~~~~~Lti~g~KypL~~~~~~~g~~~siSN~~~-~~~~~vs~~~G~~l~~ 206 (212) T COG1564 137 LPPGQHTIEKDP--GYLYSIFG---GEDVAGLTIKGAKYPLKNADLPFGSSRSISNEFI-GDPVTVSLKSGIVLVI 206 (212) T ss_pred ECCCCEEECCCC--CCCEEEEE---CCCCCCEEECCCEEECCCCCCCCCCEEEEEEEEE-CCCEEEEECCCEEEEE T ss_conf 479856732689--95189984---1531216861646225675321455068731041-7708999579789999 No 224 >PRK10310 galactitol-specific PTS system component IIB; Provisional Probab=40.05 E-value=27 Score=15.30 Aligned_cols=36 Identities=19% Similarity=0.336 Sum_probs=26.3 Q ss_pred CCEEEEEECCCHHHHHHHH-----HHHHHCCCEEEEEEEECC Q ss_conf 7789999147725999999-----998718986999994578 Q gi|254780234|r 5 VKKVVLAYSGGLDTSIILK-----WLQVEKGLEVIVFIADLG 41 (404) Q Consensus 5 ~kkVvlaySGGLDTSv~i~-----~L~~e~g~eVi~~~~d~G 41 (404) +|||++|.-.|.-||..+. |++ +.|.+|-..-+++. T Consensus 2 kKkIlVACGsGIATSTvva~kv~~~~~-e~gi~v~i~Q~~i~ 42 (94) T PRK10310 2 KRKIIVACGGAVATSTMAAEEIKELCQ-SHNIPVELIQCRVN 42 (94) T ss_pred CCEEEEECCCCHHHHHHHHHHHHHHHH-HCCCEEEEEEEEHH T ss_conf 955999858837599999999999999-85980689997478 No 225 >TIGR02064 dsrA sulfite reductase, dissimilatory-type alpha subunit; InterPro: IPR011806 Dissimilatory sulphite reductase catalyzes the six-electron reduction of sulphite to sulphide, as the terminal reaction in dissimilatory sulphate reduction. It remains unclear however, whether trithionate and thiosulphate serve as intermediate compounds to sulphide, or as end products of sulphite reduction . Sulphite reductase is a multisubunit enzyme composed of dimers of either alpha/beta or alpha/beta/gamma subunits, each containing a siroheme and iron sulphur cluster prosthetic centre . Found in sulphate-reducing bacteria, these genes are commonly located in a unidirectional gene cluster .; GO: 0018551 hydrogensulfite reductase activity, 0020037 heme binding, 0051539 4 iron 4 sulfur cluster binding. Probab=39.66 E-value=22 Score=15.83 Aligned_cols=26 Identities=23% Similarity=0.590 Sum_probs=15.6 Q ss_pred CCCCEEEEEECCCCCCCHHHHH-HHHHH Q ss_conf 8797077441006876379999-99997 Q gi|254780234|r 138 NSDIEIIAPWRHWSFKGRQDLI-DFAEK 164 (404) Q Consensus 138 ~P~l~viaP~Rd~~~~sRe~~i-~ya~~ 164 (404) ..|+.||=-||| +++--++++ +|.+. T Consensus 227 RsDfa~iGTWkd-d~kVDqe~vk~yi~~ 253 (430) T TIGR02064 227 RSDFAVIGTWKD-DIKVDQEAVKKYIAA 253 (430) T ss_pred HCCCEEECCCCC-CHHHHHHHHHHHHHC T ss_conf 135223224577-347649999999832 No 226 >PRK09701 D-allose transporter subunit; Provisional Probab=39.16 E-value=27 Score=15.20 Aligned_cols=66 Identities=18% Similarity=-0.008 Sum_probs=34.1 Q ss_pred HHHHHHHHHCCCCEEECHHHC--CCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHCCCCC Q ss_conf 999999885689287500111--682156799999985879707744100687637999999997089675 Q gi|254780234|r 102 KYLVDIANETGADAIAHGSTG--KGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDLIDFAEKHAIPID 170 (404) Q Consensus 102 ~~lv~~a~~~ga~~iaHG~Tg--kGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~i~ya~~~gIpv~ 170 (404) +.+-+.|+++|+++.--++.. --+.|+..=..+-+-.++.-+++|.- ...-..-++=|.+.||||- T Consensus 44 ~Ga~~aAk~~G~~v~v~~~~~~~D~~~Qi~~Ie~~I~~gvdaIii~p~d---~~a~~~~i~~A~~aGIpVV 111 (311) T PRK09701 44 KGIEDEAKTLGVSVDIFASPSEGDFQSQLQLFEDLSNKNYKGIAFAPLS---SVNLVMPVARAWKKGIYLV 111 (311) T ss_pred HHHHHHHHHHCCEEEEEECCCCCCHHHHHHHHHHHHHCCCCEEEEECCC---HHHHHHHHHHHHHCCCCEE T ss_conf 9999999986997999927987899999999999997599999991898---7788999999997799189 No 227 >COG1064 AdhP Zn-dependent alcohol dehydrogenases [General function prediction only] Probab=39.08 E-value=27 Score=15.19 Aligned_cols=139 Identities=19% Similarity=0.239 Sum_probs=73.7 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHH-HHHHHHHHHHHHHCCCC Q ss_conf 78999914772599999999871898699999457871106899999997398079982008-99999879999736865 Q gi|254780234|r 6 KKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLR-REFVRDFVFPMFRANAL 84 (404) Q Consensus 6 kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r-~ef~~~~i~~~I~ana~ 84 (404) ++|++.=-|||- ..++.+.+ -.|++|++++..- +-.+.|+++||. +++ |.+ .++.+. +... +++. T Consensus 168 ~~V~I~G~GGlG-h~avQ~Ak-a~ga~Via~~~~~-------~K~e~a~~lGAd-~~i-~~~~~~~~~~-~~~~--~d~i 233 (339) T COG1064 168 KWVAVVGAGGLG-HMAVQYAK-AMGAEVIAITRSE-------EKLELAKKLGAD-HVI-NSSDSDALEA-VKEI--ADAI 233 (339) T ss_pred CEEEEECCCHHH-HHHHHHHH-HCCCEEEEEECCH-------HHHHHHHHHCCC-EEE-ECCCCHHHHH-HHHH--CCEE T ss_conf 899998774899-99999999-8699699995787-------799999984882-899-7678116677-6734--7399 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHC-CCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHH Q ss_conf 47874201301243359999999885689287500111-68215679999998587970774410068763799999999 Q gi|254780234|r 85 YEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTG-KGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDLIDFAE 163 (404) Q Consensus 85 Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~Tg-kGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~i~ya~ 163 (404) +.|.- +......++..+..| ..+.=|--+ ...-++.. +....-+++|.--.-- ...--+|+++|+. T Consensus 234 ------i~tv~--~~~~~~~l~~l~~~G-~~v~vG~~~~~~~~~~~~---~~li~~~~~i~GS~~g-~~~d~~e~l~f~~ 300 (339) T COG1064 234 ------IDTVG--PATLEPSLKALRRGG-TLVLVGLPGGGPIPLLPA---FLLILKEISIVGSLVG-TRADLEEALDFAA 300 (339) T ss_pred ------EECCC--HHHHHHHHHHHHCCC-EEEEECCCCCCCCCCCCH---HHHHHCCCEEEEEECC-CHHHHHHHHHHHH T ss_conf ------99877--455999999864297-899978888766666787---7863267089997248-9999999999998 Q ss_pred HHCCCCCC Q ss_conf 70896756 Q gi|254780234|r 164 KHAIPIDK 171 (404) Q Consensus 164 ~~gIpv~~ 171 (404) +|+|.... T Consensus 301 ~g~Ikp~i 308 (339) T COG1064 301 EGKIKPEI 308 (339) T ss_pred HCCCEEEE T ss_conf 18943248 No 228 >cd06308 PBP1_sensor_kinase_like Periplasmic binding domain of two-component sensor kinase signaling systems. Periplasmic binding domain of two-component sensor kinase signaling systems, some of which are fused with a C-terminal histidine kinase A domain (HisK) and/or a signal receiver domain (REC). Members of this group share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily and are predicted to be involved in sensing of environmental stimuli; their substrate specificities, however, are not known in detail. Probab=38.70 E-value=28 Score=15.15 Aligned_cols=141 Identities=14% Similarity=0.244 Sum_probs=64.5 Q ss_pred EEECCCHHHHHHHHHHH--HHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 99147725999999998--7189869999945787110689999999739807998200899999879999736865478 Q gi|254780234|r 10 LAYSGGLDTSIILKWLQ--VEKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYEG 87 (404) Q Consensus 10 laySGGLDTSv~i~~L~--~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Yeg 87 (404) +..+...|...-+..+. ..++++.+.+... ..+.+..+-++|...|. +.+.+|-.-. -.++. ..|. -+ T Consensus 34 ~v~da~~d~~~Q~~~ie~li~~~vD~iiv~p~--d~~~~~~~l~~a~~agI-PVV~~d~~~~-~~~~~-~~V~-----~D 103 (270) T cd06308 34 IIADAADDNSKQVADIENFIRQGVDLLIISPN--EAAPLTPVVEEAYRAGI-PVILLDRKIL-SDKYT-AYIG-----AD 103 (270) T ss_pred EEECCCCCHHHHHHHHHHHHHCCCCEEEEECC--CCCHHHHHHHHHHHCCC-CEEEECCCCC-CCCEE-EEEE-----CC T ss_conf 99679899999999999999849999999648--85200899999998599-1999458888-88701-8995-----17 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHH--CCCCEEEEEE-CCCCCC-CHHHHHHHHH Q ss_conf 74201301243359999999885689287500111682156799999985--8797077441-006876-3799999999 Q gi|254780234|r 88 YYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSL--NSDIEIIAPW-RHWSFK-GRQDLIDFAE 163 (404) Q Consensus 88 ~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l--~P~l~viaP~-Rd~~~~-sRe~~i~ya~ 163 (404) .|..+ ...++.+++.....+--++-.|..+..+.+-|.+....+| .|++++.... -+|+.. +.+..-+.+. T Consensus 104 n~~~G-----~~~~~~l~~~~~~~~~i~~i~G~~~~~~~~~R~~Gf~~~l~~~~~~~vv~~~~~~~~~~~a~~~~~~~L~ 178 (270) T cd06308 104 NYEIG-----RQAGEYIANLLPGKGNILEIWGLEGSSPAIERHDGFKEALSKYPKIKIVAQQDGDWLKEKAEEKMEELLQ 178 (270) T ss_pred HHHHH-----HHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHHHHHHH T ss_conf 68889-----9999999985056854999627999854999999999999866895246686176679999999999985 Q ss_pred HH Q ss_conf 70 Q gi|254780234|r 164 KH 165 (404) Q Consensus 164 ~~ 165 (404) .| T Consensus 179 ~~ 180 (270) T cd06308 179 AN 180 (270) T ss_pred HC T ss_conf 08 No 229 >cd00640 Trp-synth-beta_II Tryptophan synthase beta superfamily (fold type II); this family of pyridoxal phosphate (PLP)-dependent enzymes catalyzes beta-replacement and beta-elimination reactions. This CD corresponds to aminocyclopropane-1-carboxylate deaminase (ACCD), tryptophan synthase beta chain (Trp-synth_B), cystathionine beta-synthase (CBS), O-acetylserine sulfhydrylase (CS), serine dehydratase (Ser-dehyd), threonine dehydratase (Thr-dehyd), diaminopropionate ammonia lyase (DAL), and threonine synthase (Thr-synth). ACCD catalyzes the conversion of 1-aminocyclopropane-1-carboxylate to alpha-ketobutyrate and ammonia. Tryptophan synthase folds into a tetramer, where the beta chain is the catalytic PLP-binding subunit and catalyzes the formation of L-tryptophan from indole and L-serine. CBS is a tetrameric hemeprotein that catalyzes condensation of serine and homocysteine to cystathionine. CS is a homodimer that catalyzes the formation of L-cysteine from O-acetyl-L-serine. Ser-dehy Probab=38.22 E-value=28 Score=15.10 Aligned_cols=52 Identities=25% Similarity=0.378 Sum_probs=22.2 Q ss_pred EEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEEC Q ss_conf 9999147725999999998718986999994578711068999999973980799820 Q gi|254780234|r 8 VVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKD 65 (404) Q Consensus 8 VvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D 65 (404) .|++.|+| ++...+.|.-...|+.++.|.-.--. ..-.+..+.+||. .+..+ T Consensus 52 ~vv~aSsG-N~g~a~A~~a~~~G~~~~i~~p~~~~----~~k~~~~~~~GA~-vi~~~ 103 (244) T cd00640 52 VIIESTGG-NTGIALAAAAARLGLKCTIVMPEGAS----PEKVAQMRALGAE-VVLVP 103 (244) T ss_pred EEEEECCC-HHHHHHHHHHHHCCCCCEEEECCHHH----HHHHHHHHHCCCE-EEEEC T ss_conf 89992785-78999999887469663377675156----7899988707977-99989 No 230 >TIGR02079 THD1 threonine dehydratase; InterPro: IPR011820 This entry represents threonine dehydratase, the first step in the pathway converting threonine into isoleucine. At least two other clades of biosynthetic threonine dehydratases are described by IPR005787 from INTERPRO and IPR005789 from INTERPRO. The sequences described by this entry are exclusively found in species containing the rest of the isoleucine pathway and which are generally lacking in members of the other two clades of threonine dehydratases. The sequences associated with this entry are often cluster with other elements of the isoleucine pathway.; GO: 0004794 L-threonine ammonia-lyase activity, 0030170 pyridoxal phosphate binding, 0009097 isoleucine biosynthetic process. Probab=37.95 E-value=29 Score=15.08 Aligned_cols=108 Identities=19% Similarity=0.305 Sum_probs=56.1 Q ss_pred EEEEECCCHHHHHHHHHHHHH-CCCEEEEEEEECCCCCH---HHHHHHHHHHC-CCEEEEEECHH------HHHHHHHH- Q ss_conf 999914772599999999871-89869999945787110---68999999973-98079982008------99999879- Q gi|254780234|r 8 VVLAYSGGLDTSIILKWLQVE-KGLEVIVFIADLGQGEE---LKIASDKARLL-GAKEVYVKDLR------REFVRDFV- 75 (404) Q Consensus 8 VvlaySGGLDTSv~i~~L~~e-~g~eVi~~~~d~Gq~~d---~~~~~~~A~~~-Ga~~~~v~D~r------~ef~~~~i- 75 (404) ||+==.|=|==+-+-.+-+++ .|-.|||+.- |..-| .++|++||+-. |.++++++++- .||+++.+ T Consensus 277 iVaEPAGALSiAAL~~~~r~ei~gKTVVC~vS--GGNNDi~R~~Ei~eRsl~y~GLKhYFiv~FPQRpGALreFln~vLG 354 (415) T TIGR02079 277 IVAEPAGALSIAALEELSREEIKGKTVVCVVS--GGNNDIERTEEIRERSLLYEGLKHYFIVRFPQRPGALREFLNDVLG 354 (415) T ss_pred EEEECCHHHHHHHHHHHCHHCCCCCEEEEEEE--CCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHHHHCC T ss_conf 69815136679998751220046972799952--7757533136789888873177036875278988636887654067 Q ss_pred -------HHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEE Q ss_conf -------999736865478742013012433599999998856892875 Q gi|254780234|r 76 -------FPMFRANALYEGYYLLGTAIARPLIAKYLVDIANETGADAIA 117 (404) Q Consensus 76 -------~~~I~ana~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~ia 117 (404) |.-.|=++-=+|.=..|--++--==-..|++.|+..+..+.. T Consensus 355 P~DDIT~FEY~KK~nRe~GpaliGiel~~~~D~~GLL~Rm~A~~i~Y~~ 403 (415) T TIGR02079 355 PNDDITKFEYTKKSNRETGPALIGIELSDKEDFEGLLERMAAADIHYEA 403 (415) T ss_pred CCCCCCCCEEEECCCCCCCCEEEEEECCCCCHHHHHHHHHHHCCCCEEE T ss_conf 8875233100020147878779987503610067799999855994164 No 231 >TIGR00506 ribB 3,4-dihydroxy-2-butanone 4-phosphate synthase; InterPro: IPR000422 3,4-Dihydroxy-2-butanone 4-phosphate is biosynthesized from ribulose 5-phosphate and serves as the biosynthetic precursor for the xylene ring of riboflavin . It is sometimes found as a bifunctional enzyme with GTP cyclohydrolase II that catalyses the first committed step in the biosynthesis of riboflavin (IPR000926 from INTERPRO). No sequences with significant homology to DHBP synthase are found in the metazoa.; GO: 0008686 34-dihydroxy-2-butanone-4-phosphate synthase activity, 0009231 riboflavin biosynthetic process. Probab=37.92 E-value=14 Score=17.27 Aligned_cols=102 Identities=25% Similarity=0.330 Sum_probs=50.6 Q ss_pred EEEEEEEECCCCCH-------HHHH-HHH-HHHCCCEEEEEECHHHHHHHHH-----HHHHHHCCCCCCCCCCCCCCHHH Q ss_conf 69999945787110-------6899-999-9973980799820089999987-----99997368654787420130124 Q gi|254780234|r 32 EVIVFIADLGQGEE-------LKIA-SDK-ARLLGAKEVYVKDLRREFVRDF-----VFPMFRANALYEGYYLLGTAIAR 97 (404) Q Consensus 32 eVi~~~~d~Gq~~d-------~~~~-~~~-A~~~Ga~~~~v~D~r~ef~~~~-----i~~~I~ana~Yeg~Ypl~tslaR 97 (404) +-.+||+++-+.++ -|.+ .=+ ++.-.+ +-+.+|-+++|-+++ |||++.++.. . |.| T Consensus 98 ~~~~fti~~~~r~et~TGIsa~DR~~Ti~~~l~~~~-~~vmmgr~~dFG~df~rPGHvfpLRAa~Gg---V------L~R 167 (230) T TIGR00506 98 EKSTFTIDVAHRKETTTGISANDRALTIRAALADVV-KPVMMGRKEDFGSDFRRPGHVFPLRAADGG---V------LKR 167 (230) T ss_pred CEEEEEEEEECCCCCEECCCHHHHHHHHHHHHCCCC-CEEECCCCCCCHHHCCCCCCCCHHHHCCCC---E------ECC T ss_conf 423789997477885005546478899988746766-422227624551143688865415432787---5------125 Q ss_pred HHHHHHHHHHH---HHCCCCEEECHHHCCCCHHHHHHHHHHHHCC--CCEEEEEECCCCCCCHHH--HHHHHHHHCCCC Q ss_conf 33599999998---8568928750011168215679999998587--970774410068763799--999999708967 Q gi|254780234|r 98 PLIAKYLVDIA---NETGADAIAHGSTGKGNDQVRFELSAYSLNS--DIEIIAPWRHWSFKGRQD--LIDFAEKHAIPI 169 (404) Q Consensus 98 plia~~lv~~a---~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P--~l~viaP~Rd~~~~sRe~--~i~ya~~~gIpv 169 (404) -=--+.-|+.| --.++.+|+- .++- +.+ +-+++| .++||+||||+. T Consensus 168 ~GHTEasVdL~~lAGl~pa~VICE-----------------~~nd~edG~---------mar~~ekf~~~yA~Kh~l~~ 220 (230) T TIGR00506 168 AGHTEASVDLAELAGLKPAGVICE-----------------IMNDEEDGT---------MARKPEKFLVEYAKKHGLKL 220 (230) T ss_pred CCCCHHHHHHHHHCCCCCCEEEEE-----------------ECCCCCCCC---------CCCCCCHHHHHHHHHHCCCC T ss_conf 773348889999708997289986-----------------237720331---------03666488999998819850 No 232 >PRK07998 gatY putative fructose-1,6-bisphosphate aldolase; Reviewed Probab=37.62 E-value=29 Score=15.04 Aligned_cols=121 Identities=16% Similarity=0.235 Sum_probs=60.1 Q ss_pred HHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHH Q ss_conf 99999871898699999457871106899999997398079982008999998799997368654787420130124335 Q gi|254780234|r 21 ILKWLQVEKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYEGYYLLGTAIARPLI 100 (404) Q Consensus 21 ~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Yeg~Ypl~tslaRpli 100 (404) ++..-+ +.+|-|-+|.+. .-+.+..+-+-|.+... +.++ ..-.-.++ |.| -..+ T Consensus 9 lL~~A~-~~~yAVgaFNv~--n~e~~~Avi~AAee~~s-PvIl-q~s~~~~~------------~~g---------~~~~ 62 (283) T PRK07998 9 LLDRIQ-EKHVLAGAFNTT--NLETTISILNAIERSGL-PNFI-QIAPTNAQ------------LSG---------YDYI 62 (283) T ss_pred HHHHHH-HCCCEEEEEEEC--CHHHHHHHHHHHHHHCC-CEEE-ECCHHHHH------------HCC---------HHHH T ss_conf 999999-889379987889--99999999999999786-9899-97750675------------559---------9999 Q ss_pred HHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEE---E---EECCCCCCCHHHHHHHHHHHCCCCCCCC Q ss_conf 99999998856892875001116821567999999858797077---4---4100687637999999997089675665 Q gi|254780234|r 101 AKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEII---A---PWRHWSFKGRQDLIDFAEKHAIPIDKNK 173 (404) Q Consensus 101 a~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~vi---a---P~Rd~~~~sRe~~i~ya~~~gIpv~~~~ 173 (404) +..+...|++....++.|= --|.| ||...+++.....-+ + |+-+ +..--.+..+||..+|++|.+.. T Consensus 63 ~~~~~~~a~~~~VPV~lHL--DH~~~---~e~i~~ai~~GftSVM~DgS~l~~ee-Ni~~Tk~vv~~Ah~~gv~VEaEl 135 (283) T PRK07998 63 YEIVKRHADKMDVPVSLHL--DHGKT---FEDVKQAVRAGFTSVMIDGAALPFEE-NIAFTQEAVDFCKSYGVPVEAEL 135 (283) T ss_pred HHHHHHHHHHCCCCEEEEC--CCCCC---HHHHHHHHHCCCCEEEEECCCCCHHH-HHHHHHHHHHHHHHCCCEEEEEE T ss_conf 9999999998699899975--88899---99999999739988986099899999-99999999999977699799985 No 233 >pfam10281 Ish1 Putative stress-responsive nuclear envelope protein. This family of proteins found in fungi is a putative stress-responsive nuclear envelope protein Ish1. Probab=37.55 E-value=26 Score=15.42 Aligned_cols=20 Identities=25% Similarity=0.620 Sum_probs=16.4 Q ss_pred CHHHHHHHHHHHCCCCCCCC Q ss_conf 37999999997089675665 Q gi|254780234|r 154 GRQDLIDFAEKHAIPIDKNK 173 (404) Q Consensus 154 sRe~~i~ya~~~gIpv~~~~ 173 (404) |-+++-+|+..||||++... T Consensus 5 s~~~Lk~wL~~~gi~~~~~~ 24 (38) T pfam10281 5 SDSDLKEWLDSHGIPVPQKA 24 (38) T ss_pred CHHHHHHHHHHCCCCCCCCH T ss_conf 48899999998499789713 No 234 >cd03014 PRX_Atyp2cys Peroxiredoxin (PRX) family, Atypical 2-cys PRX subfamily; composed of PRXs containing peroxidatic and resolving cysteines, similar to the homodimeric thiol specific antioxidant (TSA) protein also known as TRX-dependent thiol peroxidase (Tpx). Tpx is a bacterial periplasmic peroxidase which differs from other PRXs in that it shows substrate specificity toward alkyl hydroperoxides over hydrogen peroxide. As with all other PRXs, the peroxidatic cysteine (N-terminal) of Tpx is oxidized into a sulfenic acid intermediate upon reaction with peroxides. Tpx is able to resolve this intermediate by forming an intramolecular disulfide bond with a conserved C-terminal cysteine (the resolving cysteine), which can then be reduced by thioredoxin. This differs from the typical 2-cys PRX which resolves the oxidized cysteine by forming an intermolecular disulfide bond with the resolving cysteine from the other subunit of the homodimer. Atypical 2-cys PRX homodimers have a loop-based Probab=37.53 E-value=26 Score=15.42 Aligned_cols=34 Identities=21% Similarity=0.147 Sum_probs=17.4 Q ss_pred CEEEEEECCCHHHHHHHHHH-------HHHCCCEEEEEEEE Q ss_conf 78999914772599999999-------87189869999945 Q gi|254780234|r 6 KKVVLAYSGGLDTSIILKWL-------QVEKGLEVIVFIAD 39 (404) Q Consensus 6 kkVvlaySGGLDTSv~i~~L-------~~e~g~eVi~~~~d 39 (404) |++||.|-=..+|++|-.=+ .+-.+++|+++.+| T Consensus 27 k~~Vl~fyP~~~T~vC~~e~~~f~~~~~~f~~~~v~gIS~D 67 (143) T cd03014 27 KVKVISVFPSIDTPVCATQTKRFNKEAAKLDNTVVLTISAD 67 (143) T ss_pred CEEEEEEEECCCCCCCCCHHHHHHHHHHHCCCCEEEEEECC T ss_conf 86999998078998453159999999873589689999789 No 235 >cd00947 TBP_aldolase_IIB Tagatose-1,6-bisphosphate (TBP) aldolase and related Type B Class II aldolases. TBP aldolase is a tetrameric class II aldolase that catalyzes the reversible condensation of dihydroxyacetone phosphate with glyceraldehyde 3-phsophate to produce tagatose 1,6-bisphosphate. There is an absolute requirement for a divalent metal ion, usually zinc, and in addition the enzymes are activated by monovalent cations such as Na+. The type A and type B Class II FBPA's differ in the presence and absence of distinct indels in the sequence that result in differing loop lengths in the structures. Probab=37.43 E-value=29 Score=15.02 Aligned_cols=115 Identities=17% Similarity=0.277 Sum_probs=56.5 Q ss_pred HCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 18986999994578711068999999973980799820089999987999973686547874201301243359999999 Q gi|254780234|r 28 EKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYEGYYLLGTAIARPLIAKYLVDI 107 (404) Q Consensus 28 e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Yeg~Ypl~tslaRplia~~lv~~ 107 (404) +.+|-|-+|.+ -.-+.+..+-+-|....+ +.++.=....+ +|. | -..++..+... T Consensus 10 ~~~yAV~AfNv--~~~e~~~avi~AAee~~s-PvIlq~s~~~~--~~~-----------g---------~~~~~~~~~~~ 64 (276) T cd00947 10 EGGYAVGAFNI--NNLETLKAILEAAEETRS-PVILQISEGAI--KYA-----------G---------LELLVAMVKAA 64 (276) T ss_pred HCCEEEEEEEE--CCHHHHHHHHHHHHHHCC-CEEEECCHHHH--HHC-----------C---------HHHHHHHHHHH T ss_conf 88917998787--989999999999999689-99999786177--658-----------9---------99999999999 Q ss_pred HHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEE------EEECCCCCCCHHHHHHHHHHHCCCCCCCC Q ss_conf 8856892875001116821567999999858797077------44100687637999999997089675665 Q gi|254780234|r 108 ANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEII------APWRHWSFKGRQDLIDFAEKHAIPIDKNK 173 (404) Q Consensus 108 a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~vi------aP~Rd~~~~sRe~~i~ya~~~gIpv~~~~ 173 (404) |++....+..|-= -|.| ||...+++.-...-+ -|+-| ++.--.+..+||.++|++|.... T Consensus 65 a~~~~VPV~lHLD--H~~~---~e~i~~ai~~Gf~SVM~DgS~lp~ee-Ni~~T~~vv~~ah~~gv~VEaEl 130 (276) T cd00947 65 AERASVPVALHLD--HGSS---FELIKRAIRAGFSSVMIDGSHLPFEE-NVAKTKEVVELAHAYGVSVEAEL 130 (276) T ss_pred HHHCCCCEEEECC--CCCC---HHHHHHHHHCCCCEEEEECCCCCHHH-HHHHHHHHHHHHHHHCCEEEEEE T ss_conf 9977999999668--7678---99999999839987985289899999-99999999999998198489986 No 236 >PRK08063 enoyl-(acyl carrier protein) reductase; Provisional Probab=37.43 E-value=29 Score=15.02 Aligned_cols=65 Identities=15% Similarity=0.212 Sum_probs=41.0 Q ss_pred CCCCCCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCE-EEEEECHHHH Q ss_conf 98777789999147725999999998718986999994578711068999999973980-7998200899 Q gi|254780234|r 1 MSRDVKKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQGEELKIASDKARLLGAK-EVYVKDLRRE 69 (404) Q Consensus 1 M~~~~kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~-~~~v~D~r~e 69 (404) |.+.+.=+|..=|+|+-..++..+.+ .|++|+... ....++.+...+.....|.. ..+..|+.++ T Consensus 1 ~L~gKvalVTGas~GIG~aia~~la~--~Ga~Vvi~~--~~~~~~~~~~~~~~~~~g~~~~~~~~Dv~d~ 66 (250) T PRK08063 1 VFSGKVALVTGSSRGIGKAIALRLAK--EGYDIAINY--ARSRKAAEETANEIEQLGRKALVVKANVGDV 66 (250) T ss_pred CCCCCEEEEECCCCHHHHHHHHHHHH--CCCEEEEEC--CCCHHHHHHHHHHHHHCCCCEEEEECCCCCH T ss_conf 98949899958766999999999998--899899975--9998999999999995499589998479999 No 237 >TIGR01235 pyruv_carbox pyruvate carboxylase; InterPro: IPR005930 In the postabsorptive state, blood glucose concentration is kept constant by a combination of glycogenolysis and gluconeogenesis. During prolonged fasting, when hepatic glycogen is exhausted, gluconeogenesis becomes the only source of plasma glucose. Gluconeogenesis (the production of new glucose molecules) occurs mainly in liver and, to a small extent, in kidneys. Pyruvate carboxylase (6.4.1.1 from EC), a member of the biotin-dependent enzyme family, catalyses the ATP-dependent carboxylation of pyruvate to oxaloacetate, thus playing a crucial role in gluconeogenesis.ATP + pyruvate + HCO3- = ADP + phosphate + oxaloacetate Most well characterised forms of active enzyme consist of four identical subunits arranged in a tetrahedron-like structure. Each subunit contains three functional domains: the biotin carboxylation domain, the transcarboxylation domain and the biotin carboxyl carrier domain.; GO: 0004736 pyruvate carboxylase activity, 0006094 gluconeogenesis, 0005737 cytoplasm. Probab=37.24 E-value=29 Score=15.00 Aligned_cols=92 Identities=20% Similarity=0.233 Sum_probs=48.0 Q ss_pred HCCCCCC-CCCCC----CCCHHHHHHHHHHHHHHHHCCCCEEECH-------------HHC----------CCCHHHHHH Q ss_conf 3686547-87420----1301243359999999885689287500-------------111----------682156799 Q gi|254780234|r 80 RANALYE-GYYLL----GTAIARPLIAKYLVDIANETGADAIAHG-------------STG----------KGNDQVRFE 131 (404) Q Consensus 80 ~ana~Ye-g~Ypl----~tslaRplia~~lv~~a~~~ga~~iaHG-------------~Tg----------kGNDQvRFe 131 (404) ||...|. |+=|. .+++.=+|=-..++++|++.++|+|=.| |.- +-=||+==- T Consensus 41 KADEaY~vG~g~~lard~~Pv~AYL~I~eiI~vAk~~~vDaiHPGYGfLSE~~~Fa~~v~~aGi~FIGP~a~~~~~~GdK 120 (1169) T TIGR01235 41 KADEAYLVGEGPQLARDLKPVEAYLSIDEIIRVAKKNKVDAIHPGYGFLSENSEFADAVVKAGIVFIGPKAEVLDQLGDK 120 (1169) T ss_pred CCCEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCEEECCCCCCCCCHHHHHHHHHCCCEEECCCHHHHHHHCCH T ss_conf 23410011476310102687213047156989960078977708856224887899999868956737974777540576 Q ss_pred HHHHHHCCCCEE-EEEECCCCCCCHHHHHHHHHHHCCCCCC Q ss_conf 999985879707-7441006876379999999970896756 Q gi|254780234|r 132 LSAYSLNSDIEI-IAPWRHWSFKGRQDLIDFAEKHAIPIDK 171 (404) Q Consensus 132 ~~~~~l~P~l~v-iaP~Rd~~~~sRe~~i~ya~~~gIpv~~ 171 (404) ++.|-+|-+..| +-|=-|--...-||-.+|++++|.||=. T Consensus 121 V~AR~~A~~aGvPvvPgt~Gp~~t~eev~~f~~~~GYPvi~ 161 (1169) T TIGR01235 121 VAARNLAIKAGVPVVPGTDGPVETLEEVLDFAKAIGYPVII 161 (1169) T ss_pred HHHHHHHHHCCCCEECCCCCCHHHHHHHHHHHHHCCCCEEE T ss_conf 89998888778876368868752599999999756995898 No 238 >PRK08610 fructose-bisphosphate aldolase; Reviewed Probab=37.01 E-value=29 Score=14.97 Aligned_cols=33 Identities=18% Similarity=0.110 Sum_probs=15.8 Q ss_pred HCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEE Q ss_conf 189869999945787110689999999739807998 Q gi|254780234|r 28 EKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYV 63 (404) Q Consensus 28 e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v 63 (404) +.+|-|-+|.+. .-+.+..+-+-|....+ +.++ T Consensus 15 ~~~yAV~afNv~--n~e~~~avi~AAee~~s-PvIl 47 (286) T PRK08610 15 ENGYAVGQYNIN--NLEFTQAILEASQEENA-PVIL 47 (286) T ss_pred HCCEEEEEEEEC--CHHHHHHHHHHHHHHCC-CEEE T ss_conf 889179987669--89999999999999687-9999 No 239 >COG0108 RibB 3,4-dihydroxy-2-butanone 4-phosphate synthase [Coenzyme metabolism] Probab=36.72 E-value=30 Score=14.94 Aligned_cols=130 Identities=18% Similarity=0.234 Sum_probs=62.9 Q ss_pred EEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEE-------------EEEECHHHHHHHHHHH Q ss_conf 991477259999999987189869999945787110689999999739807-------------9982008999998799 Q gi|254780234|r 10 LAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQGEELKIASDKARLLGAKE-------------VYVKDLRREFVRDFVF 76 (404) Q Consensus 10 laySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~-------------~~v~D~r~ef~~~~i~ 76 (404) |.|....=|.-.+.||..+ +--.||++.+ .++|.++|... .+.+|.++- T Consensus 33 li~aAe~vT~e~i~fm~~~-a~GliC~~lt----------~e~~~~L~Lp~Mv~~n~~~~~taFtVsVd~~~t------- 94 (203) T COG0108 33 LIFAAEAVTPEQIAFMRRH-ASGLICVALT----------EERAKRLGLPPMVDNNTDAHGTAFTVSVDARET------- 94 (203) T ss_pred EEEEHHHCCHHHHHHHHHH-CCEEEEEECC----------HHHHHHCCCCCCCCCCCCCCCCCEEEEEECCCC------- T ss_conf 8988112899999999985-8915999589----------999985799500246888888764999804237------- Q ss_pred HHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHH------------HCCCCHHHHHHHHHHHHCCCCEEE Q ss_conf 99736865478742013012433599999998856892875001------------116821567999999858797077 Q gi|254780234|r 77 PMFRANALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGS------------TGKGNDQVRFELSAYSLNSDIEII 144 (404) Q Consensus 77 ~~I~ana~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~------------TgkGNDQvRFe~~~~~l~P~l~vi 144 (404) .-=.++..|.+..+.+++ ......+....|- ...|-...--+++=.+=-+...|| T Consensus 95 ------------~TGISa~DRa~TIr~l~~-~~~~~~df~~PGHVfpL~A~~ggVl~R~GHTEasVdLarlAGl~Pa~Vi 161 (203) T COG0108 95 ------------TTGISAADRALTIRALAD-PGAKPSDFRRPGHVFPLRAKDGGVLERRGHTEAAVDLARLAGLKPAGVI 161 (203) T ss_pred ------------CCCCCHHHHHHHHHHHHC-CCCCHHHCCCCCCEEEEEECCCCEECCCCHHHHHHHHHHHCCCCCCEEE T ss_conf ------------688588899999999865-7888888489997003650668763148807999999998399973799 Q ss_pred EEECCC--CCCCHHHHHHHHHHHCCCCC Q ss_conf 441006--87637999999997089675 Q gi|254780234|r 145 APWRHW--SFKGRQDLIDFAEKHAIPID 170 (404) Q Consensus 145 aP~Rd~--~~~sRe~~i~ya~~~gIpv~ 170 (404) +.+-+- .+-++++.++|+++||+|+- T Consensus 162 cEi~~~dG~mar~~~~~~fa~~h~l~~i 189 (203) T COG0108 162 CEIMNDDGTMARLPELEEFAKEHGLPVI 189 (203) T ss_pred EEEECCCCCCCCHHHHHHHHHHCCCCEE T ss_conf 9986798661573899999998399688 No 240 >cd01536 PBP1_ABC_sugar_binding_like Periplasmic sugar-binding domain of active transport systems that are members of the type I periplasmic binding protein (PBP1) superfamily. Periplasmic sugar-binding domain of active transport systems that are members of the type I periplasmic binding protein (PBP1) superfamily. The members of this family function as the primary receptors for chemotaxis and transport of many sugar based solutes in bacteria and archaea. The sugar binding domain is also homologous to the ligand-binding domain of eukaryotic receptors such as glutamate receptor (GluR) and DNA-binding transcriptional repressors such as LacI and GalR. Moreover, this periplasmic binding domain, also known as Venus flytrap domain, undergoes transition from an open to a closed conformational state upon the binding of ligands such as lactose, ribose, fructose, xylose, arabinose, galactose/glucose, and other sugars. This family also includes the periplasmic binding domain of autoinducer-2 (AI-2 Probab=36.61 E-value=30 Score=14.93 Aligned_cols=17 Identities=29% Similarity=0.216 Sum_probs=7.3 Q ss_pred HHHHHHHHHCCCCEEEC Q ss_conf 99999988568928750 Q gi|254780234|r 102 KYLVDIANETGADAIAH 118 (404) Q Consensus 102 ~~lv~~a~~~ga~~iaH 118 (404) +.+-+.+++.|.+.+-. T Consensus 19 ~gi~~~a~~~g~~~~~~ 35 (267) T cd01536 19 KGAEAAAKELGVELIVL 35 (267) T ss_pred HHHHHHHHHCCCEEEEE T ss_conf 99999999849999999 No 241 >TIGR00789 flhB_rel FlhB domain protein; InterPro: IPR004683 This group describes a short protein (80-93 residues) homologous to the C-terminus of the flagellar biosynthetic protein FlhB. It is found so far only in species that also have FlhB. In a phylogenetic tree based on alignment of both this family and the homologous region of FlhB and its homologs, the members of this family form a monophyletic set.. Probab=36.45 E-value=20 Score=16.14 Aligned_cols=42 Identities=31% Similarity=0.582 Sum_probs=25.5 Q ss_pred HHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCCHHHCCCC Q ss_conf 79999999970896756656788641572431154576324584576820100146 Q gi|254780234|r 155 RQDLIDFAEKHAIPIDKNKRGEAPFSIDTNLLHSSSEGRVLEDPSQPAPEYVYKMI 210 (404) Q Consensus 155 Re~~i~ya~~~gIpv~~~~~~~~~yS~D~Nlwg~S~Egg~Ledp~~~~pe~~~~~t 210 (404) =+.-|+-|++||||+.... - .=+-|.+. +...+.||++|... T Consensus 29 A~~II~~AK~~gipi~ed~-----~-L~~~L~~l--------~l~~~IPee~Y~~V 70 (84) T TIGR00789 29 AEKIIELAKKHGIPIKEDE-----D-LVDVLLKL--------DLDDEIPEELYEVV 70 (84) T ss_pred HHHHHHHHHHCCCCCCCCH-----H-HHHHHHHC--------CCCCCCCHHHHHHH T ss_conf 8899999976389976582-----8-99987440--------54576885788999 No 242 >PRK10217 dTDP-glucose 4,6-dehydratase; Provisional Probab=36.30 E-value=30 Score=14.90 Aligned_cols=73 Identities=15% Similarity=0.184 Sum_probs=39.4 Q ss_pred CCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHH Q ss_conf 778999914772599999999871898699999457871106899999997398079982008999998799997 Q gi|254780234|r 5 VKKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMF 79 (404) Q Consensus 5 ~kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I 79 (404) ||||++-=..|.--|.++..|.++ +.+++.+.-++--..++.........-.. .++..|+++.-.-+-++.-+ T Consensus 1 MKkILVTGg~GFIGs~Lv~~Ll~~-~~~~v~~~d~~~~~~~~~~~~~~~~~~~~-~~~~~Di~d~~~l~~~~~~~ 73 (355) T PRK10217 1 MRKILITGGAGFIGSALVRYIINE-TSDAVVVVDKLTYAGNLMSLAPVAQSERF-AFEKVDICDRAELARVFTEY 73 (355) T ss_pred CCEEEECCCCCHHHHHHHHHHHHC-CCCEEEEEECCCCCCCHHHHHHHHCCCCC-EEEECCCCCHHHHHHHHHHC T ss_conf 996999378757999999999976-99889998289876525444454127871-69980058899999999861 No 243 >cd06267 PBP1_LacI_sugar_binding_like Ligand binding domain of the LacI tanscriptional regulator family belonging to the type I periplasmic-binding fold protein superfamily. Ligand binding domain of the LacI tanscriptional regulator family belonging to the type I periplasmic-binding fold protein superfamily. In most cases, ligands are monosaccharide including lactose, ribose, fructose, xylose, arabinose, galactose/glucose, and other sugars. The LacI family of proteins consists of transcriptional regulators related to the lac repressor. In this case, the domain sugar binding changes the DNA binding activity of the repressor domain. Probab=36.26 E-value=30 Score=14.89 Aligned_cols=38 Identities=26% Similarity=0.158 Sum_probs=16.8 Q ss_pred HHHHHHHHHHHCCCCEEE--CHHHCCCCHHHHHHHHHHHH Q ss_conf 599999998856892875--00111682156799999985 Q gi|254780234|r 100 IAKYLVDIANETGADAIA--HGSTGKGNDQVRFELSAYSL 137 (404) Q Consensus 100 ia~~lv~~a~~~ga~~ia--HG~TgkGNDQvRFe~~~~~l 137 (404) .++.+++...+.|..-|+ .|.....+.+.|++....++ T Consensus 103 a~~~a~~~L~~~G~~~i~~i~~~~~~~~~~~R~~gf~~~~ 142 (264) T cd06267 103 GAYLAVEHLIELGHRRIAFIGGPPDLSTARERLEGYREAL 142 (264) T ss_pred HHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHH T ss_conf 9999999998718885689638987706999999999999 No 244 >pfam01070 FMN_dh FMN-dependent dehydrogenase. Probab=36.13 E-value=27 Score=15.20 Aligned_cols=174 Identities=16% Similarity=0.170 Sum_probs=77.5 Q ss_pred HHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHH----HCC Q ss_conf 9999999739807998200899999879999736865478742013012433599999998856892875001----116 Q gi|254780234|r 48 IASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGS----TGK 123 (404) Q Consensus 48 ~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~----Tgk 123 (404) .+.+-|.+.|. .+.+=.....-.|+.. ++.-....++ -|+.. .|- +.+.+++.|++.|++++.-.- .|+ T Consensus 78 ~lAraA~~~gi-~~~lss~~~~~~e~i~-~~~~~~~~fQ-ly~~~---d~~-~~~~~i~ra~~ag~~al~ltvD~~~~g~ 150 (301) T pfam01070 78 ALARAAAAAGI-PFVLSTVASTSLEEVA-AAAGGPLWFQ-LYVPK---DRE-LTEDLLERAEAAGYKALVLTVDTPVLGN 150 (301) T ss_pred HHHHHHHHHCC-CEECCCCCCCCHHHHH-HHCCCCEEEE-EEECC---CHH-HHHHHHHHHHHCCCCEEEEECCCCCCCC T ss_conf 99999998358-7004687655527889-8579976899-87458---889-9999999999749997999726876577 Q ss_pred CCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCC Q ss_conf 82156799999985879707744100687637999999997089675665678864157243115457632458457682 Q gi|254780234|r 124 GNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDLIDFAEKHAIPIDKNKRGEAPFSIDTNLLHSSSEGRVLEDPSQPAP 203 (404) Q Consensus 124 GNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~i~ya~~~gIpv~~~~~~~~~yS~D~Nlwg~S~Egg~Ledp~~~~p 203 (404) .....|.--.++.+.-..+ -|+--.... ..+-..-|.++|+.. +|=..|.|-.|+ +.++| T Consensus 151 r~~d~r~~~~i~~l~~~~~--~PvivKGI~-s~eDA~~a~~~Gv~~---------------I~VSnHGGRqlD--~~~~t 210 (301) T pfam01070 151 RERDLRNGDDLAWLRDQWK--GPLVLKGIL-SPEDAKRAVEAGVDG---------------IVVSNHGGRQLD--GAPAT 210 (301) T ss_pred CCCCCCCHHHHHHHHHHCC--CCEEEECCC-CHHHHHHHHHCCCCE---------------EEECCCCCCCCC--CCCCH T ss_conf 8532043999999998669--988998289-999999999859999---------------996499854468--88679 Q ss_pred HHHCCCCCCHHHCCCCCEEEEEEEECCEEEEECCEECCHHHHHHHHHHHHHHCCCCCE Q ss_conf 0100146581227999638999995024787516231899999999888642163741 Q gi|254780234|r 204 EYVYKMIVSPEEAPDTPTTIRIDFQRGDPIAINGQVMSPEVLLEQLNQYGRCNGIGRI 261 (404) Q Consensus 204 e~~~~~t~~p~~ap~~pe~v~I~Fe~G~PVainG~~~~~~~li~~LN~igg~~GvGr~ 261 (404) -++..-.. ++ .....||-+||---++.+++..|--=+-.-|+||- T Consensus 211 ~~~L~eI~---~~----------v~~~~~i~~DGGIR~G~DV~KAlALGA~~V~iGRp 255 (301) T pfam01070 211 IDALPEIV---AA----------VGGRIPVLVDGGIRRGTDVLKALALGADAVLLGRP 255 (301) T ss_pred HHHHHHHH---HH----------HCCCEEEEECCCCCCCHHHHHHHHCCCCEEEECHH T ss_conf 99999999---98----------56774899638747626899999808986655689 No 245 >PRK06849 hypothetical protein; Provisional Probab=36.04 E-value=30 Score=14.87 Aligned_cols=130 Identities=14% Similarity=0.087 Sum_probs=74.2 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECH-HHHHHHHHHHHHHHCCCC Q ss_conf 7899991477259999999987189869999945787110689999999739807998200-899999879999736865 Q gi|254780234|r 6 KKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDL-RREFVRDFVFPMFRANAL 84 (404) Q Consensus 6 kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~-r~ef~~~~i~~~I~ana~ 84 (404) +|-||-=.|..--+..+..+....|.+|+.. |.-. --+ - .-.-.+.+++.+.. +.+. +.|+ T Consensus 4 p~tvLiTg~r~~~aL~laR~l~~~Gh~V~~a--D~~~-~~l----~-r~Sr~v~~~~~vP~P~~d~-~~y~--------- 65 (387) T PRK06849 4 PKTVLITGARAPAALQLARSFHNAGHTVILA--DSLK-YPL----S-RFSRAVDGFYTIPSPKWDP-NAYI--------- 65 (387) T ss_pred CCEEEEECCCHHHHHHHHHHHHHCCCEEEEE--CCCC-CCC----C-CHHHCEEEEEECCCCCCCH-HHHH--------- T ss_conf 8779995886078999999998789979998--4898-775----4-2000011279869997898-9999--------- Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHH-HHHHCCCCEEEEEECC--CCCCCHHHHHHH Q ss_conf 4787420130124335999999988568928750011168215679999-9985879707744100--687637999999 Q gi|254780234|r 85 YEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQVRFELS-AYSLNSDIEIIAPWRH--WSFKGRQDLIDF 161 (404) Q Consensus 85 Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~-~~~l~P~l~viaP~Rd--~~~~sRe~~i~y 161 (404) ..|.+++++++.|.+--.| +++.+... -..|.+..+++.|=-+ ..+...-..+++ T Consensus 66 -----------------~~Ll~Iv~~e~idl~IP~~-----eev~~~a~~~~~l~~~~~~~~~d~~~l~~LhdK~~F~~~ 123 (387) T PRK06849 66 -----------------QALLSIVKRHNIDLLIPTC-----EEVFYLSLAKEELSAYCHVFHFDFDLLLMLHNKFEFIQQ 123 (387) T ss_pred -----------------HHHHHHHHHHCCCEEEECC-----CHHHHHHHHHHHCCCCCCEECCCHHHHHHHHHHHHHHHH T ss_conf -----------------9999999983899999777-----689999865764476763765899999986448999999 Q ss_pred HHHHCCCCCCCCCC Q ss_conf 99708967566567 Q gi|254780234|r 162 AEKHAIPIDKNKRG 175 (404) Q Consensus 162 a~~~gIpv~~~~~~ 175 (404) |++.|+|+|.+... T Consensus 124 a~~lGl~vP~T~~i 137 (387) T PRK06849 124 ARSLGLSAPKTYLI 137 (387) T ss_pred HHHCCCCCCCEEEE T ss_conf 99749999988981 No 246 >PRK09483 response regulator; Provisional Probab=35.95 E-value=31 Score=14.86 Aligned_cols=113 Identities=14% Similarity=0.155 Sum_probs=49.6 Q ss_pred HHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHH-------HHHCCCCEEECHHH Q ss_conf 99999973980799820089999987999973686547874201301243359999999-------88568928750011 Q gi|254780234|r 49 ASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYEGYYLLGTAIARPLIAKYLVDI-------ANETGADAIAHGST 121 (404) Q Consensus 49 ~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Yeg~Ypl~tslaRplia~~lv~~-------a~~~ga~~iaHG~T 121 (404) ...+|...||.-+...+...+-.. .+|++ ...|..+++..+++.+.....-.. -..+-..++.+=|. T Consensus 89 ~~~~al~~Ga~gyl~K~~~~~~l~----~ai~~--v~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~LT~RE~eVl~ll~~ 162 (216) T PRK09483 89 LPAKVMQAGAAGYLSKGAAPQEVV----SAIRS--VHSGQRYIASDIAQQMALSQFEPATENPFASLSERELQIMLMITK 162 (216) T ss_pred HHHHHHHCCCCEEEECCCCHHHHH----HHHHH--HHCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHC T ss_conf 899999748878994799999999----99999--985997069899999987323765456545689999999999987 Q ss_pred CCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHCC Q ss_conf 1682156799999985879707744100687637999999997089 Q gi|254780234|r 122 GKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDLIDFAEKHAI 167 (404) Q Consensus 122 gkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~i~ya~~~gI 167 (404) |+.|.++--++.+..-.=+-.+-.=.|-.+.++|.+.+.||.++|+ T Consensus 163 G~snkeIA~~L~iS~~TV~~h~~~i~~KL~v~~r~el~~~A~~~gl 208 (216) T PRK09483 163 GQKVNEISEQLNLSPKTVNSYRYRMFSKLNISGDVELTHLAIRHGL 208 (216) T ss_pred CCCHHHHHHHHCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCC T ss_conf 9999999999698999999999999998099999999999999599 No 247 >PRK13304 L-aspartate dehydrogenase; Reviewed Probab=35.76 E-value=31 Score=14.84 Aligned_cols=115 Identities=16% Similarity=0.176 Sum_probs=72.8 Q ss_pred CCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECC----------CC---CHHHHHHHHHHHCCCEEEEEECHHHHHH Q ss_conf 7789999147725999999998718986999994578----------71---1068999999973980799820089999 Q gi|254780234|r 5 VKKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLG----------QG---EELKIASDKARLLGAKEVYVKDLRREFV 71 (404) Q Consensus 5 ~kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~G----------q~---~d~~~~~~~A~~~Ga~~~~v~D~r~ef~ 71 (404) |.||.+.=-|-+-+.++-.+.+..-+.+++++...-- .. .++++..+ .. +.+|.=+-.+-+ T Consensus 1 M~rVgiIG~GaIG~~Va~~l~~g~~~~~L~~V~~r~~~~a~~~a~~~~~~~~~~ld~l~~-----~~-DlVVE~A~~~av 74 (265) T PRK13304 1 MLKIGIVGCGAIANLITKAIDSGRINAELLAFYDRNLEKAENLAEKTGAPACLSIDELVK-----DV-DLVVECASQKAV 74 (265) T ss_pred CCEEEEECCCHHHHHHHHHHHCCCCCEEEEEEECCCHHHHHHHHHCCCCCCCCCHHHHHH-----CC-CEEEECCCHHHH T ss_conf 949999865789999999986799864999997898789887764159971279899833-----89-999989898999 Q ss_pred HHHHHHHHHCCCCCCCCCCCCC-CHHHHHHHHHHHHHHHHCCCCE-EECHHHCCCCHHHH Q ss_conf 9879999736865478742013-0124335999999988568928-75001116821567 Q gi|254780234|r 72 RDFVFPMFRANALYEGYYLLGT-AIARPLIAKYLVDIANETGADA-IAHGSTGKGNDQVR 129 (404) Q Consensus 72 ~~~i~~~I~ana~Yeg~Ypl~t-slaRplia~~lv~~a~~~ga~~-iaHG~TgkGNDQvR 129 (404) .+|.-+.+.++. +-..+|+ +|+-+-+-..+.+.|++.|+.. |..|+-| |=|-+| T Consensus 75 ~~~~~~~L~~G~---dlvv~SvGALaD~~l~~~L~~~A~~~g~~i~ipsGAig-GlD~l~ 130 (265) T PRK13304 75 EDTVPKSLNNGK---DVIIMSVGALADKELFLKLYKLAKENGCKIYLPSGAIA-GIDGIK 130 (265) T ss_pred HHHHHHHHHCCC---CEEEECHHHHCCHHHHHHHHHHHHHCCCEEEECCCHHH-HHHHHH T ss_conf 999999997599---89998136855989999999999737977998261344-278899 No 248 >PRK13124 consensus Probab=35.74 E-value=31 Score=14.84 Aligned_cols=127 Identities=20% Similarity=0.313 Sum_probs=74.0 Q ss_pred CCCCEEEEEE-CCC---HHHHHHHHHHHHHCCCEEEEEEEECCC----CCHHHHHHHHHHHCCCEEEEEECHHHHHHHHH Q ss_conf 7777899991-477---259999999987189869999945787----11068999999973980799820089999987 Q gi|254780234|r 3 RDVKKVVLAY-SGG---LDTSIILKWLQVEKGLEVIVFIADLGQ----GEELKIASDKARLLGAKEVYVKDLRREFVRDF 74 (404) Q Consensus 3 ~~~kkVvlay-SGG---LDTSv~i~~L~~e~g~eVi~~~~d~Gq----~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~ 74 (404) ++.+|..++| ..| +|+|.-+.....+.|++++=+=+-.-. +.-+..+-.+|++.|.+-..+.+.-+++-+++ T Consensus 5 ~~~r~ali~yitaG~P~~e~s~~~~~~l~~~GaDiiElGiPfSDP~ADGpvIq~A~~~AL~~G~~~~~~~~~~~~~r~~~ 84 (257) T PRK13124 5 QENKKLFIPFITAGDPDPETTIDLVLALEEAGADILELGIPYSDPLADGPVIQRASKRALNGGMNIVKAMELVGKMRKKV 84 (257) T ss_pred HCCCCEEEEEECCCCCCHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCC T ss_conf 77995689886370899899999999999769999997898888776579999999999976996899999999852447 Q ss_pred HHHHHHCCCCCCCCCCCCCCHHHHHHH---HHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCCC Q ss_conf 999973686547874201301243359---99999988568928750011168215679999998587970774410068 Q gi|254780234|r 75 VFPMFRANALYEGYYLLGTAIARPLIA---KYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEIIAPWRHWS 151 (404) Q Consensus 75 i~~~I~ana~Yeg~Ypl~tslaRplia---~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~ 151 (404) -.|.+. . .|.+ |+.. +..++.+++.|++.+- -|||. +. T Consensus 85 ~~pivl-----M-~Y~N------~i~~~G~e~F~~~~~~~Gv~GvI--------------------ipDLP----~e--- 125 (257) T PRK13124 85 TIPIVY-----F-TYYN------PVLQYGLEKFFALARENGIDGLL--------------------IPDLP----LE--- 125 (257) T ss_pred CCCEEE-----E-EHHH------HHHHHHHHHHHHHHHHCCCCEEE--------------------CCCCC----HH--- T ss_conf 888899-----7-5007------89875799999999975998477--------------------78999----79--- Q ss_pred CCCHHHHHHHHHHHCCCCC Q ss_conf 7637999999997089675 Q gi|254780234|r 152 FKGRQDLIDFAEKHAIPID 170 (404) Q Consensus 152 ~~sRe~~i~ya~~~gIpv~ 170 (404) --++..+.|++|||..- T Consensus 126 --E~~~~~~~~~~~gl~~I 142 (257) T PRK13124 126 --ESGELQEICDKYGIYLI 142 (257) T ss_pred --HHHHHHHHHHHCCCCEE T ss_conf --99999999986687357 No 249 >cd01541 PBP1_AraR Ligand-binding domain of DNA transcription repressor specific for arabinose (AraR) which is a member of the LacI-GalR family of bacterial transcription regulators. Ligand-binding domain of DNA transcription repressor specific for arabinose (AraR) which is a member of the LacI-GalR family of bacterial transcription regulators. The ligand-binding domain of AraR is structurally homologous to the periplasmic sugar-binding domain of ABC-type transporters and both domains contain the type I periplasmic binding protein-like fold. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the type I periplasmic binding proteins. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which i Probab=35.65 E-value=31 Score=14.83 Aligned_cols=11 Identities=27% Similarity=0.501 Sum_probs=4.8 Q ss_pred HCCCEEEEEEE Q ss_conf 18986999994 Q gi|254780234|r 28 EKGLEVIVFIA 38 (404) Q Consensus 28 e~g~eVi~~~~ 38 (404) ++||.++-.+. T Consensus 27 ~~gy~~li~~s 37 (273) T cd01541 27 EKGYSLLLAST 37 (273) T ss_pred HCCCEEEEEEC T ss_conf 86998999978 No 250 >TIGR02025 BchH magnesium chelatase, H subunit; InterPro: IPR011771 This entry represents the H subunit of the magnesium chelatase complex responsible for magnesium insertion into the protoporphyrin IX ring in the biosynthesis of both chlorophyll and bacteriochlorophyll. In chlorophyll-utilizing species, this gene is known as ChlH, while in bacteriochlorophyll-utilizing spoecies it is called BchH. Subunit H is the largest (~140kDa) of the three subunits (the others being BchD/ChlD and BchI/ChlI), and is known to bind protoporphyrin IX. Subunit H is homologous to the CobN subunit of cobaltochelatase and by anology with that enzyme, subunit H is believed to also bind the magnesium ion which is inserted into the ring. In conjunction with the hydrolysis of ATP by subunits I and D, a conformation change is believed to happen in subunit H causing the magnesium ion insertion into the distorted protoporphyrin ring .; GO: 0016851 magnesium chelatase activity, 0015995 chlorophyll biosynthetic process. Probab=35.41 E-value=17 Score=16.67 Aligned_cols=21 Identities=24% Similarity=0.349 Sum_probs=10.1 Q ss_pred CCEECCHHHHHHHHHHHHHHC Q ss_conf 162318999999998886421 Q gi|254780234|r 236 NGQVMSPEVLLEQLNQYGRCN 256 (404) Q Consensus 236 nG~~~~~~~li~~LN~igg~~ 256 (404) =|++.+.-++|.+|-++..-. T Consensus 809 ~G~a~~~~~~~~~L~~~~~~~ 829 (1384) T TIGR02025 809 LGEAPSAESLVDTLMEIASVD 829 (1384) T ss_pred CCCCCCHHHHHHHHHHHHHCC T ss_conf 478976778999999998426 No 251 >cd00617 Tnase_like Tryptophanase family (Tnase). This family belongs to pyridoxal phosphate (PLP)-dependent aspartate aminotransferase superfamily (fold I). The major groups in this CD correspond to tryptophanase (Tnase) and tyrosine phenol-lyase (TPL). Tnase and TPL are active as tetramers and catalyze beta-elimination reactions. Tnase catalyzes degradation of L-tryptophan to yield indole, pyruvate and ammonia and TPL catalyzes degradation of L-tyrosine to yield phenol, pyruvate and ammonia. Probab=35.17 E-value=31 Score=14.78 Aligned_cols=147 Identities=19% Similarity=0.257 Sum_probs=81.0 Q ss_pred EECCCHHHHHHHHHHHHHCCCE-----EEEEEEEC--CCCC---HHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHH Q ss_conf 9147725999999998718986-----99999457--8711---068999999973980799820089999987999973 Q gi|254780234|r 11 AYSGGLDTSIILKWLQVEKGLE-----VIVFIADL--GQGE---ELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFR 80 (404) Q Consensus 11 aySGGLDTSv~i~~L~~e~g~e-----Vi~~~~d~--Gq~~---d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ 80 (404) -|-|.+|---+-..|+ ++|.+ ++++|.|. |||- ++..+++-|.+.|+ + .++|+ ..|+|+ + ..|+ T Consensus 131 pFKGN~Dl~kLe~~i~-~~g~~~I~~v~~tiTnN~~GGQPVSm~Nir~v~~la~~~gi-p-lvlDa-aRfaEN-A-yFIk 204 (431) T cd00617 131 PFKGNIDVAKLEKLID-EVGAENIPYIVLTITNNTAGGQPVSMANLREVRELAHKYGI-P-VVLDA-ARFAEN-A-YFIK 204 (431) T ss_pred CCCCCCCHHHHHHHHH-HHCCCCEEEEEEEEEECCCCCCCCCHHHHHHHHHHHHHCCC-C-EEEEH-HHHHHH-H-HHHH T ss_conf 8488889999999998-73866610899998636878856678999999999998099-7-99876-678876-8-9998 Q ss_pred CC-CCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCC-----CCHHHHHHHHHHHHCCCCEE---E--EEECC Q ss_conf 68-65478742013012433599999998856892875001116-----82156799999985879707---7--44100 Q gi|254780234|r 81 AN-ALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGK-----GNDQVRFELSAYSLNSDIEI---I--APWRH 149 (404) Q Consensus 81 an-a~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~Tgk-----GNDQvRFe~~~~~l~P~l~v---i--aP~Rd 149 (404) -. .-|.++-+ .=|++.+..+|--.-+++--.|+.-. -||+-=|+.. +.+.+-.+= | ---|| T Consensus 205 ~rE~gy~d~si-------~eI~rEmfs~aDg~tmSaKKD~~vniGG~ia~~D~~l~~~~-~~~~i~~EGf~TYGGlaGRD 276 (431) T cd00617 205 EREEGYRDKSI-------AEIAREMFSYADGCTMSAKKDGLVNIGGFLALRDDELYEEA-RQRVVLYEGFVTYGGMAGRD 276 (431) T ss_pred HCCCCCCCCCH-------HHHHHHHHHHHHEEEEEECCCCCCCCCCEEEECCHHHHHHH-HHHCCCCCCCCCCCCCCHHH T ss_conf 43530247999-------99999998540278997031453245567885779999999-75432005874435632534 Q ss_pred CC--------------CCCHHHHHHH----HHHHCCCCCC Q ss_conf 68--------------7637999999----9970896756 Q gi|254780234|r 150 WS--------------FKGRQDLIDF----AEKHAIPIDK 171 (404) Q Consensus 150 ~~--------------~~sRe~~i~y----a~~~gIpv~~ 171 (404) +. +..|-..++| +.+.||||-. T Consensus 277 ~eAlA~GL~E~~d~~yl~~Ri~qv~yL~~~L~~~GvPvv~ 316 (431) T cd00617 277 MEALAQGLREAVEEDYLRHRVEQVRYLGDRLDEAGVPIVE 316 (431) T ss_pred HHHHHCCHHHHCCHHHHHHHHHHHHHHHHHHHHCCCCEEC T ss_conf 9998654075436799997799999999999988996666 No 252 >PRK10443 rihA ribonucleoside hydrolase 1; Provisional Probab=35.16 E-value=31 Score=14.77 Aligned_cols=39 Identities=23% Similarity=0.319 Sum_probs=33.9 Q ss_pred CCCCCCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCC Q ss_conf 987777899991477259999999987189869999945787 Q gi|254780234|r 1 MSRDVKKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQ 42 (404) Q Consensus 1 M~~~~kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq 42 (404) |+. ||++=--+|.|-.+++.++......||+++|+--|. T Consensus 1 M~~---kvIiDtD~G~DDa~Al~lal~~~~~ev~gIT~v~GN 39 (311) T PRK10443 1 MAL---PIILDCDPGHDDAIALVLALASPELDVKAVTTSAGN 39 (311) T ss_pred CCC---CEEEECCCCHHHHHHHHHHHHCCCCEEEEEEEECCC T ss_conf 998---889989997589999999986899879999980799 No 253 >cd02971 PRX_family Peroxiredoxin (PRX) family; composed of the different classes of PRXs including many proteins originally known as bacterioferritin comigratory proteins (BCP), based on their electrophoretic mobility before their function was identified. PRXs are thiol-specific antioxidant (TSA) proteins also known as TRX peroxidases and alkyl hydroperoxide reductase C22 (AhpC) proteins. They confer a protective antioxidant role in cells through their peroxidase activity in which hydrogen peroxide, peroxynitrate, and organic hydroperoxides are reduced and detoxified using reducing equivalents derived from either TRX, glutathione, trypanothione and AhpF. They are distinct from other peroxidases in that they have no cofactors such as metals or prosthetic groups. The first step of catalysis, common to all PRXs, is the nucleophilic attack by the catalytic cysteine (also known as the peroxidatic cysteine) on the peroxide leading to cleavage of the oxygen-oxygen bond and the formation of a Probab=35.09 E-value=31 Score=14.77 Aligned_cols=41 Identities=22% Similarity=0.179 Sum_probs=15.9 Q ss_pred CCCEEEEEEEECCCCCHHHHHHHHHHHCC-CEEEEEECHHHHHHHHH Q ss_conf 89869999945787110689999999739-80799820089999987 Q gi|254780234|r 29 KGLEVIVFIADLGQGEELKIASDKARLLG-AKEVYVKDLRREFVRDF 74 (404) Q Consensus 29 ~g~eVi~~~~d~Gq~~d~~~~~~~A~~~G-a~~~~v~D~r~ef~~~~ 74 (404) .|.+|+++..| +.+..++-+.+.| ..--++.|...+|++.| T Consensus 55 ~g~~vv~Is~d-----~~~~~~~~~~~~~~~~f~~l~D~~~~~~~~y 96 (140) T cd02971 55 GGAEVLGVSVD-----SPFSHKAWAEKEGGLNFPLLSDPDGEFAKAY 96 (140) T ss_pred CCCEEEEECCC-----CHHHHHHHHHHHCCCCCEEEECCCHHHHHHC T ss_conf 77168776289-----9999999999845888658886868999983 No 254 >pfam01012 ETF Electron transfer flavoprotein domain. This family includes the homologous domain shared between the alpha and beta subunits of the electron transfer flavoprotein. Probab=34.75 E-value=32 Score=14.73 Aligned_cols=79 Identities=23% Similarity=0.277 Sum_probs=50.2 Q ss_pred HHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHH Q ss_conf 99999871898699999457871106899999997398079982008999998799997368654787420130124335 Q gi|254780234|r 21 ILKWLQVEKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYEGYYLLGTAIARPLI 100 (404) Q Consensus 21 ~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Yeg~Ypl~tslaRpli 100 (404) ....|+++.|.+|+++++ |..+ +...+.+...|+.+.++++-. .|. +|. + -.. T Consensus 23 ~A~~l~~~~g~~v~~v~~--G~~~--~~~~~~~~~~Gad~v~~~~~~-~~~-~~~--------------~-------~~~ 75 (161) T pfam01012 23 AARQLAEALGGEVTAVVV--GPVP--AAAEALLAAMGADKVLVVEDP-ALA-GYD--------------A-------EAY 75 (161) T ss_pred HHHHHHHHCCCEEEEEEE--CCCH--HHHHHHHHHCCCCEEEEECCH-HHC-CCC--------------H-------HHH T ss_conf 999999863985999995--3947--999999875599999997871-212-468--------------6-------889 Q ss_pred HHHHHHHHHHCCCCEEECHHHCCCCH Q ss_conf 99999998856892875001116821 Q gi|254780234|r 101 AKYLVDIANETGADAIAHGSTGKGND 126 (404) Q Consensus 101 a~~lv~~a~~~ga~~iaHG~TgkGND 126 (404) ++.+.++.++.+.+.|--|+|..|.| T Consensus 76 a~~l~~~i~~~~~~~Vl~~~s~~g~~ 101 (161) T pfam01012 76 AAALAALIKKEGPDLVLAGATSIGKD 101 (161) T ss_pred HHHHHHHHHCCCCCEEEECCHHHHHH T ss_conf 99999975413997999704078761 No 255 >TIGR02635 RhaI_grampos L-rhamnose isomerase; InterPro: IPR013457 The proteins in this entry are closely related to the L-rhamnose isomerases (IPR013451 from INTERPRO) found in Pseudomonas stutzeri and in a number of the Rhizobiales. They are encoded in similar genomic contexts, close to genes apparently involved in rhamnose catabolism such as rhamnulose-1-phosphate aldolase (IPR013454 from INTERPRO), sugar kinases, and sugar transporters.. Probab=34.72 E-value=32 Score=14.76 Aligned_cols=145 Identities=20% Similarity=0.266 Sum_probs=80.6 Q ss_pred HHHHCCCCCCCCCCCCCCHHHHH-------HHH--HHHHHHHHCCCCEE----ECHHHCCCCHHHHHH-----H----HH Q ss_conf 99736865478742013012433-------599--99999885689287----500111682156799-----9----99 Q gi|254780234|r 77 PMFRANALYEGYYLLGTAIARPL-------IAK--YLVDIANETGADAI----AHGSTGKGNDQVRFE-----L----SA 134 (404) Q Consensus 77 ~~I~ana~Yeg~Ypl~tslaRpl-------ia~--~lv~~a~~~ga~~i----aHG~TgkGNDQvRFe-----~----~~ 134 (404) =+|..|-.-..-|=+|+ |.-|= |.. ..|++|++.|-.-| |.|-..=|.|-+|-- = .+ T Consensus 86 GaiNpNlFQDDDYKfGS-lThp~~~iR~KAi~h~LeCvdIa~~tGSkdisLWLADGTnYPGQdD~R~Rk~RL~EsL~~vY 164 (382) T TIGR02635 86 GAINPNLFQDDDYKFGS-LTHPDKRIRRKAIDHLLECVDIAKKTGSKDISLWLADGTNYPGQDDIRKRKDRLEESLAEVY 164 (382) T ss_pred CCCCCCCCCCCCCCCCC-CCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHH T ss_conf 45378767685435567-65734799999999999899999860885256221057873588775447889988899998 Q ss_pred HHHCCCCEEEEEECCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHCCCC-CCCCC-------CCCCCCCCHHH Q ss_conf 98587970774410068763799999999708967566567886415724311545-76324-------58457682010 Q gi|254780234|r 135 YSLNSDIEIIAPWRHWSFKGRQDLIDFAEKHAIPIDKNKRGEAPFSIDTNLLHSSS-EGRVL-------EDPSQPAPEYV 206 (404) Q Consensus 135 ~~l~P~l~viaP~Rd~~~~sRe~~i~ya~~~gIpv~~~~~~~~~yS~D~Nlwg~S~-Egg~L-------edp~~~~pe~~ 206 (404) ..|.+++..+= |..| +.+ .=|.+|--=||++| -+.-| -|.-+.||- T Consensus 165 ~~l~~dmRlLi---EYKF------------------FEP---AfY~TD~pDWGtAY~~~~kLG~rA~VlVD~GHHa~G-- 218 (382) T TIGR02635 165 EHLGDDMRLLI---EYKF------------------FEP---AFYHTDIPDWGTAYALSKKLGDRALVLVDTGHHAQG-- 218 (382) T ss_pred HHCCCCCEEEE---EEEC------------------CCC---CCCCCCCCHHHHHHHHHHHCCCCEEEEEECCCCCCC-- T ss_conf 51698877853---2204------------------575---344578333579999998548923898625779877-- Q ss_pred CCCCCCHHHCCCCCEEEEEEEECCEEEE--ECCE----------ECCHHHHHHHHHHHHHHCCC Q ss_conf 0146581227999638999995024787--5162----------31899999999888642163 Q gi|254780234|r 207 YKMIVSPEEAPDTPTTIRIDFQRGDPIA--INGQ----------VMSPEVLLEQLNQYGRCNGI 258 (404) Q Consensus 207 ~~~t~~p~~ap~~pe~v~I~Fe~G~PVa--inG~----------~~~~~~li~~LN~igg~~Gv 258 (404) -|=.-+|-+=...|..=+ .|.+ +.+|.||+.-++||-...+- T Consensus 219 ----------TNIE~IVA~LL~e~KLGGFhfNsrkYADDDL~~Ga~nPyeLFlIf~e~~~a~~d 272 (382) T TIGR02635 219 ----------TNIEFIVATLLDEKKLGGFHFNSRKYADDDLIVGAINPYELFLIFKEIVRAGRD 272 (382) T ss_pred ----------CCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHCCHHHHHHHHHHHHCCCCC T ss_conf ----------748999998872676566334554444357887501757999999999706899 No 256 >COG3459 Cellobiose phosphorylase [Carbohydrate transport and metabolism] Probab=34.71 E-value=30 Score=14.91 Aligned_cols=19 Identities=26% Similarity=0.319 Sum_probs=13.8 Q ss_pred CCEEECHHHCCCCHHHHHH Q ss_conf 9287500111682156799 Q gi|254780234|r 113 ADAIAHGSTGKGNDQVRFE 131 (404) Q Consensus 113 a~~iaHG~TgkGNDQvRFe 131 (404) ...+.|+-+|.=-||.+=+ T Consensus 227 ~~~v~h~~~g~~~~ql~~~ 245 (1056) T COG3459 227 SRLVLHADNGFIGDQLEKE 245 (1056) T ss_pred HHEEECCCCEEEHHHHHCC T ss_conf 6425326643506555323 No 257 >cd02969 PRX_like1 Peroxiredoxin (PRX)-like 1 family; hypothetical proteins that show sequence similarity to PRXs. Members of this group contain a conserved cysteine that aligns to the first cysteine in the CXXC motif of TRX. This does not correspond to the peroxidatic cysteine found in PRXs, which aligns to the second cysteine in the CXXC motif of TRX. In addition, these proteins do not contain the other two conserved residues of the catalytic triad of PRX. PRXs confer a protective antioxidant role in cells through their peroxidase activity in which hydrogen peroxide, peroxynitrate, and organic hydroperoxides are reduced and detoxified using reducing equivalents derived from either thioredoxin, glutathione, trypanothione and AhpF. Probab=34.12 E-value=33 Score=14.66 Aligned_cols=68 Identities=16% Similarity=0.170 Sum_probs=44.2 Q ss_pred CEEEEEECCCHHHHHHHHHHH------H---HCCCEEEEEEEECC---CCCHHHHHHHHHHHCCCEEEEEECHHHHHHHH Q ss_conf 789999147725999999998------7---18986999994578---71106899999997398079982008999998 Q gi|254780234|r 6 KKVVLAYSGGLDTSIILKWLQ------V---EKGLEVIVFIADLG---QGEELKIASDKARLLGAKEVYVKDLRREFVRD 73 (404) Q Consensus 6 kkVvlaySGGLDTSv~i~~L~------~---e~g~eVi~~~~d~G---q~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~ 73 (404) |-|||.|. +-...+|-..+. . ++|.+++++..|-. +.+..+.+++.+...|..--++.|-..+.++. T Consensus 26 k~vvl~F~-~~~CP~c~~~~p~l~~l~~~~~~~gv~~lgVn~~~~~~~~~ds~~~~~~~~~~~~~~fPvL~D~~g~va~~ 104 (171) T cd02969 26 KALVVMFI-CNHCPYVKAIEDRLNRLAKEYGAKGVAVVAINSNDIEAYPEDSPENMKAKAKEHGYPFPYLLDETQEVAKA 104 (171) T ss_pred CEEEEEEE-CCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCHHHHH T ss_conf 98999995-89991788888765567767634782899996687443755699999999987023717999589749997 Q ss_pred H Q ss_conf 7 Q gi|254780234|r 74 F 74 (404) Q Consensus 74 ~ 74 (404) | T Consensus 105 y 105 (171) T cd02969 105 Y 105 (171) T ss_pred C T ss_conf 4 No 258 >PRK07094 biotin synthase; Provisional Probab=33.98 E-value=33 Score=14.65 Aligned_cols=23 Identities=26% Similarity=0.202 Sum_probs=15.2 Q ss_pred CEEEEEEEECCCCCCCHHHCCCC Q ss_conf 15999877278310883222301 Q gi|254780234|r 351 GNVMVVGRESNKSLYSDKLVTFE 373 (404) Q Consensus 351 G~~~v~gr~S~~sLy~~~~as~~ 373 (404) .|+++...+..|.||+.+..+-+ T Consensus 279 p~~tp~~~r~~y~ly~~k~~~~~ 301 (323) T PRK07094 279 PNLTPTEYRKLYQLYPGKICTGE 301 (323) T ss_pred CCCCCHHHCCCCCCCCCCCCCCC T ss_conf 88994676257526799876887 No 259 >cd06285 PBP1_LacI_like_7 Ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. This group includes the ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor. Probab=33.81 E-value=33 Score=14.63 Aligned_cols=17 Identities=18% Similarity=0.223 Sum_probs=6.6 Q ss_pred HHHHHHHHHCCCCEEEC Q ss_conf 99999988568928750 Q gi|254780234|r 102 KYLVDIANETGADAIAH 118 (404) Q Consensus 102 ~~lv~~a~~~ga~~iaH 118 (404) +.+-+.|++.|-+.+.+ T Consensus 19 ~~i~~~~~~~gy~~~i~ 35 (265) T cd06285 19 EGIEEAAAERGYSTFVA 35 (265) T ss_pred HHHHHHHHHCCCEEEEE T ss_conf 99999999869989999 No 260 >cd06388 PBP1_iGluR_AMPA_GluR4 N-terminal leucine/isoleucine/valine-binding protein (LIVBP)-like domain of the GluR4 subunit of the AMPA receptor. N-terminal leucine/isoleucine/valine-binding protein (LIVBP)-like domain of the GluR4 subunit of the AMPA (alpha-amino-3-hydroxy-5-methyl-4-isoxazolepropionic acid) receptor. The AMPA receptor is a member of the glutamate-receptor ion channels (iGluRs) which are the major mediators of excitatory synaptic transmission in the central nervous system. AMPA receptors are composed of four types of subunits (GluR1, GluR2, GluR3, and GluR4) which combine to form a tetramer and play an important role in mediating the rapid excitatory synaptic current. Furthermore, this N-terminal domain of the iGluRs has homology with LIVBP, a bacterial periplasmic binding protein, as well as with the structurally related glutamate-binding domain of the G-protein-coupled metabotropic receptors (mGluRs). Probab=33.78 E-value=33 Score=14.62 Aligned_cols=51 Identities=6% Similarity=-0.022 Sum_probs=27.8 Q ss_pred HHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEEEEEC Q ss_conf 4335999999988568928750011168215679999998587970774410 Q gi|254780234|r 97 RPLIAKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEIIAPWR 148 (404) Q Consensus 97 Rplia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~viaP~R 148 (404) ||-+.++++++.+..+=+-++= .--......|..--++...+.--.+++.+ T Consensus 108 ~P~l~~A~~dlV~~~~W~~f~i-iYdsd~gL~~Lq~ll~~~~~~~~~v~~~~ 158 (371) T cd06388 108 RPSLRGALLSLLDHYEWNRFVF-LYDTDRGYSILQAIMEKAGQNGWQVSAIC 158 (371) T ss_pred CCCHHHHHHHHHHHCCCCEEEE-EEECCHHHHHHHHHHHHHCCCCCEEEEEE T ss_conf 8767999999999679728999-99682778999999985332687368999 No 261 >PRK12337 2-phosphoglycerate kinase; Provisional Probab=33.67 E-value=33 Score=14.61 Aligned_cols=15 Identities=20% Similarity=0.031 Sum_probs=7.1 Q ss_pred HHHCCCCCCCCCCCC Q ss_conf 431154576324584 Q gi|254780234|r 184 NLLHSSSEGRVLEDP 198 (404) Q Consensus 184 Nlwg~S~Egg~Ledp 198 (404) -||-.||.++...+| T Consensus 312 ~Lh~SSy~Awk~L~~ 326 (492) T PRK12337 312 TLHASTFNAWEALVP 326 (492) T ss_pred HHHHHHHHHHHHCCC T ss_conf 577755688886087 No 262 >pfam00289 CPSase_L_chain Carbamoyl-phosphate synthase L chain, N-terminal domain. Carbamoyl-phosphate synthase catalyses the ATP-dependent synthesis of carbamyl-phosphate from glutamine or ammonia and bicarbonate. This important enzyme initiates both the urea cycle and the biosynthesis of arginine and/or pyrimidines. The carbamoyl-phosphate synthase (CPS) enzyme in prokaryotes is a heterodimer of a small and large chain. The small chain promotes the hydrolysis of glutamine to ammonia, which is used by the large chain to synthesize carbamoyl phosphate. See pfam00988. The small chain has a GATase domain in the carboxyl terminus. See pfam00117. Probab=33.56 E-value=33 Score=14.60 Aligned_cols=49 Identities=31% Similarity=0.396 Sum_probs=31.0 Q ss_pred HHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEEEE Q ss_conf 1243359999999885689287500111682156799999985879707744 Q gi|254780234|r 95 IARPLIAKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEIIAP 146 (404) Q Consensus 95 laRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~viaP 146 (404) ...++-...++++|++.||+++.-|- |-=+--.+|...+... .+..+-| T Consensus 56 ~~~Yl~~~~Ii~ia~~~~~~aihpGy-GflsEn~~fa~~~~~~--Gi~fiGP 104 (109) T pfam00289 56 SESYLNIERILDIAEKEGADAIHPGY-GFLSENAEFAEACEKA--GITFIGP 104 (109) T ss_pred CCCCCCHHHHHHHHHHHCCCEEECCC-CCCCCCHHHHHHHHHC--CCEEECC T ss_conf 21113799999999981889687797-6233599999999988--8989995 No 263 >cd06316 PBP1_ABC_sugar_binding_like_7 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=33.51 E-value=33 Score=14.59 Aligned_cols=141 Identities=13% Similarity=0.070 Sum_probs=66.2 Q ss_pred ECCCHHHHHHHHHHH--HHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 147725999999998--718986999994578711068999999973980799820089999987999973686547874 Q gi|254780234|r 12 YSGGLDTSIILKWLQ--VEKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYEGYY 89 (404) Q Consensus 12 ySGGLDTSv~i~~L~--~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Yeg~Y 89 (404) ..+..|.+--+.-+. ..+|++.|.++. ...+.+..+-++|...|+ +++++|-..+ +. .....|-+ T Consensus 36 ~da~~d~~~Q~~~i~~~I~~gvDaIii~p--~d~~a~~~~v~~A~~aGI-pVv~~d~~~~---~~-----~~~~~~~~-- 102 (294) T cd06316 36 TDAQFDPAKQVADIETTISQKPDIIISIP--VDPVSTAAAYKKVAEAGI-KLVFMDNVPS---GL-----EHGKDYAG-- 102 (294) T ss_pred CCCCCCHHHHHHHHHHHHHCCCCEEEEEC--CCHHHHHHHHHHHHHCCC-CEEEECCCCC---CC-----CCCCCEEE-- T ss_conf 69988999999999999985999999938--886787999999998199-6798326777---77-----77774799-- Q ss_pred CCCCC--HHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHH---HHHHHHHHCCCCEEEEEECCCCC-CCHHHHHHHHH Q ss_conf 20130--12433599999998856892875001116821567---99999985879707744100687-63799999999 Q gi|254780234|r 90 LLGTA--IARPLIAKYLVDIANETGADAIAHGSTGKGNDQVR---FELSAYSLNSDIEIIAPWRHWSF-KGRQDLIDFAE 163 (404) Q Consensus 90 pl~ts--laRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvR---Fe~~~~~l~P~l~viaP~Rd~~~-~sRe~~i~ya~ 163 (404) ..++- -+--+.++.+++.+...|--++-.|..+-.+.+-| |.-++..-.|+++|...--.|+. .+.+..-+++. T Consensus 103 ~v~~Dn~~~G~~aa~~l~~~~~~~g~v~ii~~~~~~~~~~~R~~Gf~~~l~~~~p~i~iv~~~~~~~~~~a~~~~~~~L~ 182 (294) T cd06316 103 IVTDDNYGNGQIAADALAKALPGKGKVGLIYHGADYFVTNQRDQGFKETIKKNYPDITIVAEKGIDGPSKAEDIANAMLT 182 (294) T ss_pred EEECCHHHHHHHHHHHHHHHCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHHHHHHH T ss_conf 99488899999999999962666880899975899768999999999999720897299984388888999999999997 Q ss_pred HH Q ss_conf 70 Q gi|254780234|r 164 KH 165 (404) Q Consensus 164 ~~ 165 (404) +| T Consensus 183 ~~ 184 (294) T cd06316 183 QN 184 (294) T ss_pred HC T ss_conf 39 No 264 >cd01540 PBP1_arabinose_binding Periplasmic L-arabinose-binding protein (ABP), a member of a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily. Periplasmic L-arabinose-binding protein (ABP), a member of a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily. ABP is only involved in transport contrary to other related sugar-binding proteins such as the glucose/galactose-binding protein (GGBP) and the ribose-binding protein (RBP), both of which are involved in chemotaxis as well as transport. The periplasmic ABP consists of two alpha/beta globular domains connected by a three-stranded hinge, a Venus flytrap-like domain, which undergoes a transition from an open to a closed conformational state upon ligand binding. Moreover, ABP is homologous to the ligand-binding domain of eukaryotic receptors such as metabotropic glutamate receptor (mGluR) and DNA-binding transcriptional repressors such a Probab=33.27 E-value=34 Score=14.57 Aligned_cols=67 Identities=12% Similarity=0.096 Sum_probs=33.8 Q ss_pred HHHHHHHHHHHHCC-----CCEEECHHHCCCCHHHHHHHHHHHH----CCCCEEEE-EECCCCCC-CHHHHHHHHHHH Q ss_conf 35999999988568-----9287500111682156799999985----87970774-41006876-379999999970 Q gi|254780234|r 99 LIAKYLVDIANETG-----ADAIAHGSTGKGNDQVRFELSAYSL----NSDIEIIA-PWRHWSFK-GRQDLIDFAEKH 165 (404) Q Consensus 99 lia~~lv~~a~~~g-----a~~iaHG~TgkGNDQvRFe~~~~~l----~P~l~via-P~Rd~~~~-sRe~~i~ya~~~ 165 (404) ..++.+++.+.+.| .-++..+..+-.+-+-|.+.....| .|+++|+. ++.+|+.. +.+.+-+.+.+| T Consensus 113 ~~g~~la~~l~~kg~~~~~~~vi~~~~~~~~~~~~R~~Gf~~~l~~~g~p~~~vv~~~~~~~~~~~a~~~~~~~L~a~ 190 (289) T cd01540 113 QVGEAIADEMKKRGWDPKEVGALRITYDELDTAKPRTDGALEALKAPGFPEANIFQAPQKTTDTEGAFDAAASTLTKN 190 (289) T ss_pred HHHHHHHHHHHHHCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHHHHHC T ss_conf 999999999987478866359999168888218899876999996359996079984568866999999999999758 No 265 >PRK10116 universal stress protein UspC; Provisional Probab=33.23 E-value=34 Score=14.56 Aligned_cols=109 Identities=13% Similarity=0.170 Sum_probs=60.1 Q ss_pred CCCCCCEEEEEECCCHHHHHHHHH---HHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHH Q ss_conf 987777899991477259999999---98718986999994578711068999999973980799820089999987999 Q gi|254780234|r 1 MSRDVKKVVLAYSGGLDTSIILKW---LQVEKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFP 77 (404) Q Consensus 1 M~~~~kkVvlaySGGLDTSv~i~~---L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~ 77 (404) |+. |+|.+|.-+.-+|-.++.. |.+..+.++.-++++.- ++-.. .. ..+ ...|.++.+-++ ... T Consensus 1 M~Y--khILVAvDlS~~S~~~i~kA~~lA~~~~AklslihV~~~-~~~y~-------~~-~~~-~~~d~~~~~~e~-a~~ 67 (142) T PRK10116 1 MSY--SNILVAVAVTPESQQLLAKAVSIARPVNGKISLITLASD-PEMYN-------QF-AAP-MLEDLRSVMQEE-TQS 67 (142) T ss_pred CCC--CEEEEEECCCHHHHHHHHHHHHHHHHCCCEEEEEEEECC-HHHHH-------HC-CCH-HHHHHHHHHHHH-HHH T ss_conf 998--689999418850699999999999981998999999357-66652-------10-221-378899999999-999 Q ss_pred HHHCCCCCCCCCCCCCC-HHHHHHHHHHHHHHHHCCCCEEECHHHCC Q ss_conf 97368654787420130-12433599999998856892875001116 Q gi|254780234|r 78 MFRANALYEGYYLLGTA-IARPLIAKYLVDIANETGADAIAHGSTGK 123 (404) Q Consensus 78 ~I~ana~Yeg~Ypl~ts-laRplia~~lv~~a~~~ga~~iaHG~Tgk 123 (404) .+..=+.+.+ ||.... +..--.++.+.+.|++.++|.|-=|+-|+ T Consensus 68 ~L~~l~~~~~-~pi~~~~v~~G~~~~~I~~~a~e~~~DLIV~G~Hg~ 113 (142) T PRK10116 68 FLDELIQKAD-YPIEKTFIAYGELSEHILEVCRKHHFDLVICGNHNH 113 (142) T ss_pred HHHHHHHHCC-CCCEEEEEEECCHHHHHHHHHHHHCCCEEEEECCCC T ss_conf 9999998749-981589999658899999999996999999905985 No 266 >PRK06824 translation initiation factor Sui1; Validated Probab=33.14 E-value=34 Score=14.55 Aligned_cols=80 Identities=15% Similarity=0.259 Sum_probs=50.7 Q ss_pred CCCCCCHHHHCCCCCCCCCCCCCCCCCHHHCCCCCCHHHCCCCCEEEEEEEE----CCEEEEE-CCEECCHHHHHHHHHH Q ss_conf 8641572431154576324584576820100146581227999638999995----0247875-1623189999999988 Q gi|254780234|r 177 APFSIDTNLLHSSSEGRVLEDPSQPAPEYVYKMIVSPEEAPDTPTTIRIDFQ----RGDPIAI-NGQVMSPEVLLEQLNQ 251 (404) Q Consensus 177 ~~yS~D~Nlwg~S~Egg~Ledp~~~~pe~~~~~t~~p~~ap~~pe~v~I~Fe----~G~PVai-nG~~~~~~~li~~LN~ 251 (404) --||+|.+=.+-. |+. |.+.-. ......|.....|.|.++ .|.+|.+ .|-.++..+|-..+-+ T Consensus 9 LVYSTd~griCp~--C~~--------p~~~C~--C~~~~~~~~d~~vri~~e~KGR~GK~VTvI~Gl~~~~~~lk~Lak~ 76 (118) T PRK06824 9 LVYSTEAGRHCPD--CGQ--------PVAACI--CKQQAAPAGDGIVRVRRETKGRGGKTVTVITGVPLAEDALKELAKE 76 (118) T ss_pred EEEECCCCCCCCC--CCC--------CHHHCC--CCCCCCCCCCCEEEEEEEECCCCCCEEEEEECCCCCHHHHHHHHHH T ss_conf 8798899976876--688--------133252--4444668668449999875377997189996887999999999999 Q ss_pred HHHHCCCCCEECCCCCCCC Q ss_conf 8642163741000773202 Q gi|254780234|r 252 YGRCNGIGRIDIVENRFVG 270 (404) Q Consensus 252 igg~~GvGr~d~vEnr~vG 270 (404) +=.+.|+|=. +.|..|- T Consensus 77 LK~~cG~GGt--vK~g~Ie 93 (118) T PRK06824 77 LKRRCGTGGT--LKDGVIE 93 (118) T ss_pred HHHHHCCCCE--EECCEEE T ss_conf 9997467961--6189899 No 267 >cd01575 PBP1_GntR Ligand-binding domain of DNA transcription repressor GntR specific for gluconate, a member of the LacI-GalR family of bacterial transcription regulators. This group represents the ligand-binding domain of DNA transcription repressor GntR specific for gluconate, a member of the LacI-GalR family of bacterial transcription regulators. The ligand-binding domain of GntR is structurally homologous to the periplasmic sugar-binding domain of ABC-type transporters and both domains contain the type I periplasmic binding protein-like fold. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the type I periplasmic binding proteins. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding, Probab=33.11 E-value=34 Score=14.55 Aligned_cols=111 Identities=11% Similarity=0.024 Sum_probs=51.1 Q ss_pred HHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHH--HHHH---HHHHHHHHHCCCCCCCCCCCCCCHHHHHHH Q ss_conf 71898699999457871106899999997398079982008--9999---987999973686547874201301243359 Q gi|254780234|r 27 VEKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLR--REFV---RDFVFPMFRANALYEGYYLLGTAIARPLIA 101 (404) Q Consensus 27 ~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r--~ef~---~~~i~~~I~ana~Yeg~Ypl~tslaRplia 101 (404) .++||.++-+..+--...+. ..-+.....++.-+++.... ++.. .+.=.|.+..+.........+....-.--+ T Consensus 26 ~~~gy~~~i~~s~~d~~~e~-~~i~~l~~~~vDGiIi~~~~~~~~~~~~l~~~~iPvV~~~~~~~~~~~~~V~~Dn~~~~ 104 (268) T cd01575 26 EAAGYQLLLGNTGYSPEREE-ELLRTLLSRRPAGLILTGLEHTERTRQLLRAAGIPVVEIMDLPPDPIDMAVGFSHAEAG 104 (268) T ss_pred HHCCCEEEEEECCCCHHHHH-HHHHHHHHCCCCEEEEECCCCCHHHHHHHHHCCCCEEEECCCCCCCCCCEEECCHHHHH T ss_conf 98699999997899989999-99999995699999994377998999999977996998488688888858973889999 Q ss_pred HHHHHHHHHCCCCEEEC--HHHCCCCHHHHHHHHHHHHC Q ss_conf 99999988568928750--01116821567999999858 Q gi|254780234|r 102 KYLVDIANETGADAIAH--GSTGKGNDQVRFELSAYSLN 138 (404) Q Consensus 102 ~~lv~~a~~~ga~~iaH--G~TgkGNDQvRFe~~~~~l~ 138 (404) +.+++...+.|+.-|+- |.+..-..+.|++...+++. T Consensus 105 ~~a~~~L~~~G~~~i~~i~~~~~~~~~~~R~~G~~~al~ 143 (268) T cd01575 105 RAMARHLLARGYRRIGFLGARMDDTRAQQRLEGFRAALR 143 (268) T ss_pred HHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHHHHHH T ss_conf 999999997499769998489887369999999999999 No 268 >TIGR02198 rfaE_dom_I rfaE bifunctional protein, domain I; InterPro: IPR011913 RfaE is a protein involved in the biosynthesis of ADP-L-glycero-D-manno-heptose, a precursor for LPS inner core biosynthesis. RfaE is a bifunctional protein in Escherichia coli, and separate proteins in some other genome. The longer, N-terminal domain I (this family) is suggested to act in D-glycero-D-manno-heptose 1-phosphate biosynthesis, while domain II (IPR011914 from INTERPRO) adds ADP to yield ADP-D-glycero-D-manno-heptose .; GO: 0016773 phosphotransferase activity alcohol group as acceptor, 0016779 nucleotidyltransferase activity, 0005975 carbohydrate metabolic process. Probab=32.98 E-value=27 Score=15.24 Aligned_cols=37 Identities=27% Similarity=0.412 Sum_probs=26.9 Q ss_pred HHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCCC Q ss_conf 7999999997089675665678864157243115457632458457 Q gi|254780234|r 155 RQDLIDFAEKHAIPIDKNKRGEAPFSIDTNLLHSSSEGRVLEDPSQ 200 (404) Q Consensus 155 Re~~i~ya~~~gIpv~~~~~~~~~yS~D~Nlwg~S~Egg~Ledp~~ 200 (404) -++.|+.|++||+||-..+|. +-||. |-|-.|-+|.. T Consensus 164 ~~~~I~~Ar~~~~pVlVDPKg-~df~~--------Y~GAtl~TPN~ 200 (321) T TIGR02198 164 VQEVIAAAREAGKPVLVDPKG-KDFSR--------YRGATLITPNR 200 (321) T ss_pred HHHHHHHHHHCCCCEEEECCC-CCHHH--------HCCCCCCCCCH T ss_conf 999999999668919980787-62345--------14664236687 No 269 >cd03466 Nitrogenase_NifN_2 Nitrogenase_nifN_2: A subgroup of the NifN subunit of the NifEN complex: NifN forms an alpha2beta2 tetramer with NifE. NifN and nifE are structurally homologous to nitrogenase MoFe protein beta and alpha subunits respectively. NifEN participates in the synthesis of the iron-molybdenum cofactor (FeMoco) of the MoFe protein. NifB-co (an iron and sulfur containing precursor of the FeMoco) from NifB is transferred to the NifEN complex where it is further processed to FeMoco. The nifEN bound precursor of FeMoco has been identified as a molybdenum-free, iron- and sulfur- containing analog of FeMoco. It has been suggested that this nifEN bound precursor also acts as a cofactor precursor in nitrogenase systems which require a cofactor other than FeMoco: i.e. iron-vanadium cofactor (FeVco) or iron only cofactor (FeFeco). This group also contains the Clostidium fused NifN-NifB protein. Probab=32.88 E-value=34 Score=14.52 Aligned_cols=11 Identities=9% Similarity=0.108 Sum_probs=7.5 Q ss_pred HHHHHHCCCCC Q ss_conf 99997089675 Q gi|254780234|r 160 DFAEKHAIPID 170 (404) Q Consensus 160 ~ya~~~gIpv~ 170 (404) ..+++.||++- T Consensus 175 ~ll~~~Gl~~~ 185 (429) T cd03466 175 EILREFGIEYI 185 (429) T ss_pred HHHHHCCCCEE T ss_conf 99998299579 No 270 >PRK13394 3-hydroxybutyrate dehydrogenase; Provisional Probab=32.86 E-value=34 Score=14.52 Aligned_cols=32 Identities=22% Similarity=0.109 Sum_probs=17.6 Q ss_pred HHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHCCCCC Q ss_conf 7999999997089675665678864157243115457 Q gi|254780234|r 155 RQDLIDFAEKHAIPIDKNKRGEAPFSIDTNLLHSSSE 191 (404) Q Consensus 155 Re~~i~ya~~~gIpv~~~~~~~~~yS~D~Nlwg~S~E 191 (404) |.-..+++ .+||-|-.- .|=.++..+|...++ T Consensus 169 k~lA~E~a-~~gIrVN~V----~PG~i~T~~~~~~~~ 200 (262) T PRK13394 169 RVLAKEGA-KHNVRSHVV----CPGFVRTPLVDKQIP 200 (262) T ss_pred HHHHHHHH-HCCEEEEEE----EECCCCCCHHHHHCH T ss_conf 99999852-319699999----758788702331365 No 271 >PRK09358 adenosine deaminase; Provisional Probab=32.82 E-value=34 Score=14.52 Aligned_cols=20 Identities=25% Similarity=0.551 Sum_probs=15.2 Q ss_pred ECCCHHHHHHHHHHHHHCCCE Q ss_conf 147725999999998718986 Q gi|254780234|r 12 YSGGLDTSIILKWLQVEKGLE 32 (404) Q Consensus 12 ySGGLDTSv~i~~L~~e~g~e 32 (404) .+|++..+++..|.+ ++|.. T Consensus 18 L~Gsi~~etl~~la~-~~~~~ 37 (333) T PRK09358 18 LDGSLRPETILELAR-RNGID 37 (333) T ss_pred CCCCCCHHHHHHHHH-HCCCC T ss_conf 768889999999999-82999 No 272 >PRK10565 hypothetical protein; Provisional Probab=32.71 E-value=34 Score=14.51 Aligned_cols=116 Identities=22% Similarity=0.317 Sum_probs=66.7 Q ss_pred CCEE-EEEECC--CHHHHHHHHHHHHHCCCEEEEEEEECCC--CCHHHHHHHHHHHCCCEEEEEECHHHHHHH--HHHHH Q ss_conf 7789-999147--7259999999987189869999945787--110689999999739807998200899999--87999 Q gi|254780234|r 5 VKKV-VLAYSG--GLDTSIILKWLQVEKGLEVIVFIADLGQ--GEELKIASDKARLLGAKEVYVKDLRREFVR--DFVFP 77 (404) Q Consensus 5 ~kkV-vlaySG--GLDTSv~i~~L~~e~g~eVi~~~~d~Gq--~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~--~~i~~ 77 (404) .++| ||+=+| |=|-=|+..+|. ++|++|..+...-.. .++....++.....|-. ....|.. +.+ |.+-+ T Consensus 60 ~~~v~VlcG~GNNGGDG~v~AR~L~-~~G~~V~v~~~~~~~~~~~~a~~~~~~~~~~~~~-i~~~~~~--~~~~~dliVD 135 (515) T PRK10565 60 ARHWLVLCGHGNNGGDGYVVARLAK-AAGIDVTLLAQESDKPLPEEAALAREAWLNAGGE-IHAADIV--WPESIDLIID 135 (515) T ss_pred CCEEEEEECCCCCHHHHHHHHHHHH-HCCCEEEEEEECCCCCCCHHHHHHHHHHHHCCCC-EECCCCC--CCCCCCEEEE T ss_conf 9979999889856999999999999-7798699999679888959999999999975991-6135545--6554778999 Q ss_pred HHHCCCCCCCCCCCCCCHHHHHHH--HHHHHHHHHCCCCEEE-------CHHHCCCCH-HHHHHHHHH Q ss_conf 973686547874201301243359--9999998856892875-------001116821-567999999 Q gi|254780234|r 78 MFRANALYEGYYLLGTAIARPLIA--KYLVDIANETGADAIA-------HGSTGKGND-QVRFELSAY 135 (404) Q Consensus 78 ~I~ana~Yeg~Ypl~tslaRplia--~~lv~~a~~~ga~~ia-------HG~TgkGND-QvRFe~~~~ 135 (404) =.++|.+.||+-. +.+++..+..++..|| ++-||.-.. -+|-+.++. T Consensus 136 -----------alfGtGl~~~l~~~~~~lI~~iN~~~~~vvAVDIPSGl~~dtG~~~g~av~Ad~TvT 192 (515) T PRK10565 136 -----------ALLGTGLRQAPRESYAQLIDHANRHPAPVVAVDIPSGLLAETGATPGAVINADHTVT 192 (515) T ss_pred -----------EEEECCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCCCEEECEEEEE T ss_conf -----------988637898888799999999985699889997678877777887885354207787 No 273 >PRK05301 pyrroloquinoline quinone biosynthesis protein PqqE; Provisional Probab=32.68 E-value=34 Score=14.50 Aligned_cols=120 Identities=17% Similarity=0.096 Sum_probs=57.7 Q ss_pred HHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCC-CCCCHHHHHHHHHHHHHHHHCCCCEEE---CHH Q ss_conf 0689999999739807998200899999879999736865478742-013012433599999998856892875---001 Q gi|254780234|r 45 ELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYEGYYL-LGTAIARPLIAKYLVDIANETGADAIA---HGS 120 (404) Q Consensus 45 d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Yeg~Yp-l~tslaRplia~~lv~~a~~~ga~~ia---HG~ 120 (404) ++..+-+.+..+|+. .+++-.-|-|...-++..++. +.-.|-+. +.|+- -+|.+..++..++.|.+.|. +|. T Consensus 51 e~~~~id~l~~~Gv~-~v~~tGGEPllr~D~~ei~~~-a~~~G~~~~l~TNG--~lit~~~a~~L~~~gl~~v~vSlDg~ 126 (375) T PRK05301 51 EWIRVLREARALGVL-QLHFSGGEPLLRKDLEELVAH-ARRLGLYTNLITSG--VGLTEARLAALKAAGLDHIQLSFQDS 126 (375) T ss_pred HHHHHHHHHHHCCCC-EEEECCCCCCCCCCHHHHHHH-HHHCCCEEEEEECC--CCCCHHHHHHHHHHCCCEEEEECCCC T ss_conf 999999999986998-899618652456689999999-99769758996067--45579999999850998899956779 Q ss_pred HCCCCHHHH-----HHHHHHHHC------CCCEEEEEECCCCCCCHHHHHHHHHHHCCC Q ss_conf 116821567-----999999858------797077441006876379999999970896 Q gi|254780234|r 121 TGKGNDQVR-----FELSAYSLN------SDIEIIAPWRHWSFKGRQDLIDFAEKHAIP 168 (404) Q Consensus 121 TgkGNDQvR-----Fe~~~~~l~------P~l~viaP~Rd~~~~sRe~~i~ya~~~gIp 168 (404) +-.-+|.+| |+.+++++. -.+.|-.-+--.++.--++.++++.+.|++ T Consensus 127 ~~e~hD~~rG~~G~f~~~~~~i~~l~~~Gi~v~i~~ti~r~N~~~l~~i~~la~~lGv~ 185 (375) T PRK05301 127 DPELADRIAGTRGAFAQKLEVARLVKAHGYPLTLNAVIHRHNIDQIPRIIELAVELGAD 185 (375) T ss_pred CHHHHHHHCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHHHHHCCCC T ss_conf 87787776378862999999999999749816999872305688899999999972998 No 274 >KOG2282 consensus Probab=32.51 E-value=23 Score=15.74 Aligned_cols=100 Identities=25% Similarity=0.272 Sum_probs=57.6 Q ss_pred EEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCC Q ss_conf 99999457871106899999997398079982008999998799997368654787420130124335999999988568 Q gi|254780234|r 33 VIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYEGYYLLGTAIARPLIAKYLVDIANETG 112 (404) Q Consensus 33 Vi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Yeg~Ypl~tslaRplia~~lv~~a~~~g 112 (404) .-|-.+|-|..-| +...++..|.......+.+....+.-+-|.+ -|-+.|.+.-...|+.|.+. T Consensus 127 ldcpicdqggecd---lqdq~m~fgsdr~rf~~~kravedknigplv------------ktimtrciqctrcvrfasei- 190 (708) T KOG2282 127 LDCPICDQGGECD---LQDQAMAFGSDRSRFTEGKRAVEDKNIGPLV------------KTIMTRCIQCTRCVRFASEI- 190 (708) T ss_pred CCCCCCCCCCCCC---CHHHHHHHCCCHHHHHHHHHHHHCCCCCHHH------------HHHHHHHHHHHHHHHHHHHH- T ss_conf 8987357888676---0676755247503454433443237653489------------99999877577888888864- Q ss_pred CCEEECHHHCCCCHHH-H--HHHHHH-HHCCCCEEEEEEC Q ss_conf 9287500111682156-7--999999-8587970774410 Q gi|254780234|r 113 ADAIAHGSTGKGNDQV-R--FELSAY-SLNSDIEIIAPWR 148 (404) Q Consensus 113 a~~iaHG~TgkGNDQv-R--Fe~~~~-~l~P~l~viaP~R 148 (404) |.+=--|-||+|||.- - -|.-|. -|--++-=|.|+- T Consensus 191 agv~dlgttgrg~d~qigtyvek~f~selsgniidicpvg 230 (708) T KOG2282 191 AGVDDLGTTGRGNDMQIGTYVEKLFMSELSGNIIDICPVG 230 (708) T ss_pred CCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCEEEECCCC T ss_conf 3875446666777631889999999876128746642345 No 275 >cd03375 TPP_OGFOR Thiamine pyrophosphate (TPP family), 2-oxoglutarate ferredoxin oxidoreductase (OGFOR) subfamily, TPP-binding module; OGFOR catalyzes the oxidative decarboxylation of 2-oxo-acids, with ferredoxin acting as an electron acceptor. In the TCA cycle, OGFOR catalyzes the oxidative decarboxylation of 2-oxoglutarate to succinyl-CoA. In the reductive tricarboxylic acid cycle found in the anaerobic autotroph Hydrogenobacter thermophilus, OGFOR catalyzes the reductive carboxylation of succinyl-CoA to produce 2-oxoglutarate. Thauera aromatica OGFOR has been shown to provide reduced ferredoxin to benzoyl-CoA reductase, a key enzyme in the anaerobic metabolism of aromatic compounds. OGFOR is dependent on TPP and a divalent metal cation for activity. Probab=32.26 E-value=26 Score=15.41 Aligned_cols=37 Identities=19% Similarity=0.188 Sum_probs=18.8 Q ss_pred HHHHCCCCEEEEEECCCCC-C-CHHHHHHHHHHHCCCCCC Q ss_conf 9985879707744100687-6-379999999970896756 Q gi|254780234|r 134 AYSLNSDIEIIAPWRHWSF-K-GRQDLIDFAEKHAIPIDK 171 (404) Q Consensus 134 ~~~l~P~l~viaP~Rd~~~-~-sRe~~i~ya~~~gIpv~~ 171 (404) ++.-+|+++|++=.-|=++ . +-++ +.-|..++.++.. T Consensus 63 ~k~a~p~~~Vva~~GDG~~~~iG~~~-l~~A~~rn~~i~~ 101 (193) T cd03375 63 VKLANPDLTVIVVSGDGDLAAIGGNH-FIHAARRNIDITV 101 (193) T ss_pred HHHHCCCCEEEEEEECCHHHHCCHHH-HHHHHHHCCCEEE T ss_conf 98759997399997065576456188-9999980997499 No 276 >pfam08546 ApbA_C Ketopantoate reductase PanE/ApbA C terminal. This is a family of 2-dehydropantoate 2-reductases also known as ketopantoate reductases, EC:1.1.1.169. The reaction catalysed by this enzyme is: (R)-pantoate + NADP(+) <= 2-dehydropantoate + NADPH. AbpA catalyses the NADPH reduction of ketopantoic acid to pantoic acid in the alternative pyrimidine biosynthetic (APB) pathway. ApbA and PanE are allelic. ApbA, the ketopantoate reductase enzyme is required for the synthesis of thiamine via the APB biosynthetic pathway. Probab=32.24 E-value=21 Score=15.98 Aligned_cols=99 Identities=22% Similarity=0.222 Sum_probs=42.8 Q ss_pred CHHHHHHHHHHHHHHHC--CCCCCCCC--CCCCCHHHHHHH---HHHHHHHHHCCCCEEECHHHCCCCHHHHHHH-HHHH Q ss_conf 00899999879999736--86547874--201301243359---9999998856892875001116821567999-9998 Q gi|254780234|r 65 DLRREFVRDFVFPMFRA--NALYEGYY--LLGTAIARPLIA---KYLVDIANETGADAIAHGSTGKGNDQVRFEL-SAYS 136 (404) Q Consensus 65 D~r~ef~~~~i~~~I~a--na~Yeg~Y--pl~tslaRplia---~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~-~~~~ 136 (404) |.+.+.++.+++-+... .|++.-.+ .+...-.|-++. +.++.+|+..|... .. |++...+ .+.. T Consensus 1 di~~~~W~Kl~~n~~~n~l~al~~~~~g~l~~~~~~~~l~~~l~~E~~~Va~a~Gi~l-------~~-~~~~~~~~~~~~ 72 (125) T pfam08546 1 DIRLARWEKLLFNAAINPLTALLGCTNGELLDSPEARELIRALMDEAVAVARAEGVAL-------PE-DDLVEAVLRVAR 72 (125) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHCHHHHHHHHHHHHHHHHHHHHCCCCC-------CH-HHHHHHHHHHHH T ss_conf 9779999999999978099999899489998495899999999999999999818999-------98-999999999985 Q ss_pred HCCCCEEEEEECCCCCCCHHHHH--------HHHHHHCCCCCCCC Q ss_conf 58797077441006876379999--------99997089675665 Q gi|254780234|r 137 LNSDIEIIAPWRHWSFKGRQDLI--------DFAEKHAIPIDKNK 173 (404) Q Consensus 137 l~P~l~viaP~Rd~~~~sRe~~i--------~ya~~~gIpv~~~~ 173 (404) ..|+.+ -+=++|+.- +|.-|+ ..+++||||+|... T Consensus 73 ~~~~~~-~SM~~D~~~-gr~tEid~i~G~vv~~a~~~gi~~P~~~ 115 (125) T pfam08546 73 KTPANK-SSMLQDIEA-GRPTEIDAINGYVVRLARKLGVPTPTNE 115 (125) T ss_pred HCCCCC-CHHHHHHHC-CCCCCHHHHHHHHHHHHHHHCCCCCHHH T ss_conf 179999-689999880-9975099999999999999788997899 No 277 >TIGR02090 LEU1_arch isopropylmalate/citramalate/homocitrate synthases; InterPro: IPR011830 Methanogenic archaea contain three closely related homologs of the 2-isopropylmalate synthases (LeuA) represented by IPR005671 from INTERPRO. Two of these in Methanococcus janaschii (MJ1392 - CimA ; MJ0503 - AksA ) have been characterised as catalyzing alternative reactions leaving the third (MJ1195) as the presumptive LeuA enzyme. CimA is citramalate (2-methylmalate) synthase, which condenses acetyl-CoA with pyruvate. This enzyme is believed to be involved in the biosynthesis of isoleucine in methanogens and possibly other species lacking threonine dehydratase. AksA is a homocitrate synthase which also produces (homo)2-citrate and (homo)3-citrate in the biosynthesis of Coenzyme B which is restricted solely to methanogenic archaea. Methanogens, then should and apparently do contain all three of these enzymes. Unfortunately, phylogenetic trees do not resolve into three unambiguous clades, making assignment of function to particular genes problematic. Other archaea, which lack a threonine dehydratase (mainly Euryarchaeota), should contain both CimA and LeuA genes. This is true for archaeoglobus fulgidis, but not for the Pyrococci which have none in this clade, but one in IPR005671 from INTERPRO and one in IPR005675 from INTERPRO which may fulfil these roles. Proteins from other species, which have only one hit to this entry and lack threonine dehydratase, are very likely to be LeuA enzymes.; GO: 0046912 transferase activity transferring acyl groups acyl groups converted into alkyl on transfer, 0019752 carboxylic acid metabolic process. Probab=31.78 E-value=23 Score=15.68 Aligned_cols=16 Identities=13% Similarity=0.538 Sum_probs=10.2 Q ss_pred HHHHHHHCCCCCCCCC Q ss_conf 9999970896756656 Q gi|254780234|r 159 IDFAEKHAIPIDKNKR 174 (404) Q Consensus 159 i~ya~~~gIpv~~~~~ 174 (404) ++||++||+=|..|.. T Consensus 120 vEYAKEHGLiVEfSAE 135 (371) T TIGR02090 120 VEYAKEHGLIVEFSAE 135 (371) T ss_pred HHHHHHCCCEEEECCC T ss_conf 9877525735531778 No 278 >PRK06940 short chain dehydrogenase; Provisional Probab=31.70 E-value=36 Score=14.39 Aligned_cols=58 Identities=22% Similarity=0.312 Sum_probs=34.3 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCE-EEEEECHHHH Q ss_conf 789999147725999999998718986999994578711068999999973980-7998200899 Q gi|254780234|r 6 KKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQGEELKIASDKARLLGAK-EVYVKDLRRE 69 (404) Q Consensus 6 kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~-~~~v~D~r~e 69 (404) |-+|+.=|||+-..++.. |. + |+.|+.. |.- .+.++...+.-...|.. ..+..|+.++ T Consensus 6 kV~v~tGa~GIG~aiA~~-la-~-Ga~vvi~--~~~-~~~l~~~~~~l~~~g~~~~~~~~Dvs~~ 64 (277) T PRK06940 6 EVVVVIGAGGMGQAIARR-VG-S-GKTVLLA--DYN-EENLQAVARTLREAGFDVITQQVDVSSR 64 (277) T ss_pred CEEEECCCCHHHHHHHHH-HH-H-CCEEEEE--ECC-HHHHHHHHHHHHHCCCEEEEEECCCCCH T ss_conf 299997816999999999-98-1-9989999--898-8999999999872288299998257998 No 279 >PRK13566 anthranilate synthase; Provisional Probab=31.67 E-value=36 Score=14.39 Aligned_cols=60 Identities=25% Similarity=0.300 Sum_probs=43.6 Q ss_pred HHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEEE--EECCCCCCCHHHHHHHHHHHCCCC Q ss_conf 99999988568928750011168215679999998587970774--410068763799999999708967 Q gi|254780234|r 102 KYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEIIA--PWRHWSFKGRQDLIDFAEKHAIPI 169 (404) Q Consensus 102 ~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~via--P~Rd~~~~sRe~~i~ya~~~gIpv 169 (404) .-|+.+-++.||++.-- -||.. .-.+..++||+-|++ |-+--++ +-...+..+.+.+||+ T Consensus 537 htLa~YlrqlGAeV~vv-----R~d~~--~~~l~~~~pD~vvlSPGPG~P~d~-g~~~~i~~~~~~~iPi 598 (724) T PRK13566 537 HTLANYFRQTGAEVTTV-----RYPVA--EEILDRVNPDLVVLSPGPGRPEDF-DCKATIDAALARNLPI 598 (724) T ss_pred HHHHHHHHHHCCEEEEE-----ECCCC--HHHHHHHCCCEEEECCCCCCCHHC-CCHHHHHHHHHCCCCE T ss_conf 99999999819879998-----18987--677763188989987999790005-5859999998579987 No 280 >pfam02634 FdhD-NarQ FdhD/NarQ family. Nitrate assimilation protein, NarQ, and FdhD are required for formate dehydrogenase activity. Probab=31.49 E-value=36 Score=14.37 Aligned_cols=21 Identities=19% Similarity=0.121 Sum_probs=14.3 Q ss_pred EEEEECCCHHHHHHHHHHHHH Q ss_conf 999914772599999999871 Q gi|254780234|r 8 VVLAYSGGLDTSIILKWLQVE 28 (404) Q Consensus 8 VvlaySGGLDTSv~i~~L~~e 28 (404) .++.-+-+-+--.++-||..| T Consensus 16 ~~~m~tP~~l~eLa~GfL~sE 36 (235) T pfam02634 16 ATLMATPADLEDLALGFLLSE 36 (235) T ss_pred EEEEECCCCHHHHHHHHHHHC T ss_conf 999847988699999998545 No 281 >COG1794 RacX Aspartate racemase [Cell envelope biogenesis, outer membrane] Probab=31.48 E-value=36 Score=14.37 Aligned_cols=17 Identities=29% Similarity=0.532 Sum_probs=8.1 Q ss_pred HHHHHHCCCCEEECHHH Q ss_conf 99988568928750011 Q gi|254780234|r 105 VDIANETGADAIAHGST 121 (404) Q Consensus 105 v~~a~~~ga~~iaHG~T 121 (404) ++...+.||+.|.-||| T Consensus 181 i~~l~~~Gae~vIlGCT 197 (230) T COG1794 181 IERLAERGAEGVILGCT 197 (230) T ss_pred HHHHHHCCCCEEEEECC T ss_conf 99999769988999433 No 282 >PRK11908 NAD-dependent epimerase/dehydratase family protein; Provisional Probab=31.45 E-value=36 Score=14.36 Aligned_cols=151 Identities=16% Similarity=0.117 Sum_probs=69.8 Q ss_pred CCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHH--HHHHHHHHH------ Q ss_conf 778999914772599999999871898699999457871106899999997398079982008--999998799------ Q gi|254780234|r 5 VKKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLR--REFVRDFVF------ 76 (404) Q Consensus 5 ~kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r--~ef~~~~i~------ 76 (404) ||||++-=..|.=-|-++..|.++.+++|+++-..... +.. .......+++-.|++ .++++..+. T Consensus 1 MKkILVTGgaGFIGs~Lv~~Ll~~~~~~V~~~d~~~~~---~~~----~~~~~~~~~~~gDi~~~~~~~~~~~~~~D~V~ 73 (347) T PRK11908 1 MKKVLILGVNGFIGHHLTKRILETTDWEVYGMDMQTDR---LGD----LVNHPRMHFFEGDITINKEWIEYHVKKCDVVL 73 (347) T ss_pred CCEEEEECCCCHHHHHHHHHHHHHCCCEEEEEECCCCC---HHH----HCCCCCEEEEECCCCCCHHHHHHHHCCCCEEE T ss_conf 98899975743899999999998289789999799763---677----55799859997754469999997660598897 Q ss_pred H--HHHCCCCCCCCCCCCCCHHHHHHH-HHHHHHHHHCCCCEEECHHHCC--CCH-HHHHHHHHHHHCCCCE--EEEEEC Q ss_conf 9--973686547874201301243359-9999998856892875001116--821-5679999998587970--774410 Q gi|254780234|r 77 P--MFRANALYEGYYLLGTAIARPLIA-KYLVDIANETGADAIAHGSTGK--GND-QVRFELSAYSLNSDIE--IIAPWR 148 (404) Q Consensus 77 ~--~I~ana~Yeg~Ypl~tslaRplia-~~lv~~a~~~ga~~iaHG~Tgk--GND-QvRFe~~~~~l~P~l~--viaP~R 148 (404) . ++.....+.. .|+.+- ..-+.+ ..+++.|++.+ .-+.+-||.. |+. ...| .|+-. ...|+- T Consensus 74 HlAa~~~~~~~~~-~p~~~~-~~nv~~t~~ll~~~~~~~-~r~if~SS~~VYG~~~~~~~-------~~~~~~~~~~p~~ 143 (347) T PRK11908 74 PLVAIATPATYVK-QPLRVF-ELDFEANLPIVRSAVKYG-KHLVFPSTSEVYGMCSDEEF-------DPEASALTYGPIN 143 (347) T ss_pred EEHHCCCHHHHHH-CHHHHH-HHHHHHHHHHHHHHHHHC-CEEEECCCCEEECCCCCCCC-------CCCCCCCCCCCCC T ss_conf 5200036488886-889999-999999999999999739-83896266126547899998-------9777876578877 Q ss_pred CCC----C---CCHHHHHHHHHHHCCCCCCC Q ss_conf 068----7---63799999999708967566 Q gi|254780234|r 149 HWS----F---KGRQDLIDFAEKHAIPIDKN 172 (404) Q Consensus 149 d~~----~---~sRe~~i~ya~~~gIpv~~~ 172 (404) .+. . .+-.-...|++++|+|+..- T Consensus 144 ~p~~~Y~~sK~~~E~l~~~y~~~~~l~~~il 174 (347) T PRK11908 144 KPRWIYACSKQLMDRVIWAYGMEEGLNFTLF 174 (347) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHCCCEEEE T ss_conf 8654778999999999999999858987999 No 283 >pfam03853 YjeF_N YjeF-related protein N-terminus. Probab=31.25 E-value=36 Score=14.34 Aligned_cols=53 Identities=30% Similarity=0.453 Sum_probs=33.0 Q ss_pred CEEE-EEECC--CHHHHHHHHHHHHHCCCEEEEEEEEC-CC-CCHHHHHHHHHHHCCCE Q ss_conf 7899-99147--72599999999871898699999457-87-11068999999973980 Q gi|254780234|r 6 KKVV-LAYSG--GLDTSIILKWLQVEKGLEVIVFIADL-GQ-GEELKIASDKARLLGAK 59 (404) Q Consensus 6 kkVv-laySG--GLDTSv~i~~L~~e~g~eVi~~~~d~-Gq-~~d~~~~~~~A~~~Ga~ 59 (404) ++|+ |+=+| |=|-=++..+|+ +.|++|..+...- -+ .+|.....+.+.++|.. T Consensus 29 ~~v~il~G~GNNGGDGlv~Ar~L~-~~g~~V~v~~~~~~~~~~~~~~~~~~~~~~~~~~ 86 (170) T pfam03853 29 KRVLVLCGPGNNGGDGLAAARHLA-QRGYKVTVLLLNPDEKLSEDARRALEIAKKLGGK 86 (170) T ss_pred CEEEEEECCCCCHHHHHHHHHHHH-HCCCCEEEEEECCCCCCCHHHHHHHHHHHHCCCC T ss_conf 869999879987288999999999-8799079999678555999999999999986996 No 284 >COG4943 Predicted signal transduction protein containing sensor and EAL domains [Signal transduction mechanisms] Probab=31.15 E-value=36 Score=14.33 Aligned_cols=29 Identities=28% Similarity=0.237 Sum_probs=18.3 Q ss_pred CCCCCCEEEEEECCCHHHHHHHHHHHHHC Q ss_conf 98777789999147725999999998718 Q gi|254780234|r 1 MSRDVKKVVLAYSGGLDTSIILKWLQVEK 29 (404) Q Consensus 1 M~~~~kkVvlaySGGLDTSv~i~~L~~e~ 29 (404) |+..-++-+++..|=+=-+++++-|..-. T Consensus 1 ~~~~~r~~~~~~~g~~i~~vl~pil~s~~ 29 (524) T COG4943 1 MLHRTRHLVLLIPGVLILSVLLPILLSLW 29 (524) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 94055678988434889999999999998 No 285 >PRK10867 signal recognition particle protein; Provisional Probab=31.00 E-value=36 Score=14.31 Aligned_cols=94 Identities=17% Similarity=0.138 Sum_probs=55.9 Q ss_pred HHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHH Q ss_conf 99987189869999945787110689999999739807998200899999879999736865478742013012433599 Q gi|254780234|r 23 KWLQVEKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYEGYYLLGTAIARPLIAK 102 (404) Q Consensus 23 ~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Yeg~Ypl~tslaRplia~ 102 (404) .||+..++..|.-+++|+=-+--+|.++.-|..+|+ ++|-.+... -| .-|++ T Consensus 122 ~~lk~k~~k~vllvaaDt~RpaA~eQL~~la~~~~v-~~~~~~~~~--------------------dp-------~~ia~ 173 (453) T PRK10867 122 KFLREKHKKKVLVVSADVYRPAAIKQLETLAEQVGV-DFFPSDVGQ--------------------KP-------VDIVN 173 (453) T ss_pred HHHHHCCCCEEEEECCCCCHHHHHHHHHHHHHHCCC-CEECCCCCC--------------------CH-------HHHHH T ss_conf 999973898379855887705899999999985198-043678899--------------------88-------99999 Q ss_pred HHHHHHHHCCCCEEEC-HHHCCCCHHHHHH---HHHHHHCCCCEEE Q ss_conf 9999988568928750-0111682156799---9999858797077 Q gi|254780234|r 103 YLVDIANETGADAIAH-GSTGKGNDQVRFE---LSAYSLNSDIEII 144 (404) Q Consensus 103 ~lv~~a~~~ga~~iaH-G~TgkGNDQvRFe---~~~~~l~P~l~vi 144 (404) ..++.|+..+.|+|-- .+-.-.+|+--|+ ...++..|+-.++ T Consensus 174 ~a~~~ak~~~~DvvivDTAGRl~~d~~Lm~El~~i~~~~~P~e~ll 219 (453) T PRK10867 174 AALKEAKLKFYDVLLVDTAGRLHVDEAMMDEIKQVHASINPVETLF 219 (453) T ss_pred HHHHHHHHCCCCEEEEECCCCHHCCHHHHHHHHHHHHHHCCCCEEE T ss_conf 9999999779999999787601210888999999987637871379 No 286 >PRK12738 kbaY tagatose-bisphosphate aldolase; Reviewed Probab=30.98 E-value=36 Score=14.31 Aligned_cols=122 Identities=12% Similarity=0.127 Sum_probs=59.9 Q ss_pred HHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHH Q ss_conf 99999987189869999945787110689999999739807998200899999879999736865478742013012433 Q gi|254780234|r 20 IILKWLQVEKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYEGYYLLGTAIARPL 99 (404) Q Consensus 20 v~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Yeg~Ypl~tslaRpl 99 (404) -++..-+ +.+|-|-+|.+. .-+.+..+-+-|....+ +.++.-....+ + |- +-.. T Consensus 8 ~lL~~A~-~~~yAV~AfNv~--n~e~~~Avi~AAee~~s-PvIlq~s~~~~--~-----------~~---------~~~~ 61 (286) T PRK12738 8 YLLQDAQ-ANGYAVPAFNIH--NAETIQAILEVCSEMRS-PVILAGTPGTF--K-----------HI---------ALEE 61 (286) T ss_pred HHHHHHH-HCCCEEEEEEEC--CHHHHHHHHHHHHHHCC-CEEEECCHHHH--H-----------HC---------CHHH T ss_conf 9999999-879579998889--99999999999999789-98999375377--6-----------66---------9999 Q ss_pred HHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEE------EEECCCCCCCHHHHHHHHHHHCCCCCCCC Q ss_conf 599999998856892875001116821567999999858797077------44100687637999999997089675665 Q gi|254780234|r 100 IAKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEII------APWRHWSFKGRQDLIDFAEKHAIPIDKNK 173 (404) Q Consensus 100 ia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~vi------aP~Rd~~~~sRe~~i~ya~~~gIpv~~~~ 173 (404) ++..+...|++....+..|= --|.| ||...+++.-...-+ -|+- -++.--.+..+||..+|++|.... T Consensus 62 ~~~~~~~~a~~~~VPV~lHL--DH~~~---~e~i~~ai~~GftSVM~DgS~lp~e-eNi~~Tk~vv~~Ah~~gv~VEaEl 135 (286) T PRK12738 62 IYALCSAYSTTYNMPLALHL--DHHES---LDDIRRKVHAGVRSAMIDGSHFPFA-ENVKLVKSVVDFCHSQDCSVEAEL 135 (286) T ss_pred HHHHHHHHHHHCCCCEEEEC--CCCCC---HHHHHHHHHCCCCEEEEECCCCCHH-HHHHHHHHHHHHHHHCCCEEEEEE T ss_conf 99999999998799999989--99999---9999999977998798738999999-999999999999847399788864 No 287 >PRK10936 periplasmic sensory protein associated with the TorRS two-component regulatory system; Provisional Probab=30.88 E-value=37 Score=14.30 Aligned_cols=46 Identities=17% Similarity=0.082 Sum_probs=19.0 Q ss_pred HHHHHHHHHCCCCEEECHHHC--CCCHHHHHHHHHHHHCCCCEEEEEE Q ss_conf 999999885689287500111--6821567999999858797077441 Q gi|254780234|r 102 KYLVDIANETGADAIAHGSTG--KGNDQVRFELSAYSLNSDIEIIAPW 147 (404) Q Consensus 102 ~~lv~~a~~~ga~~iaHG~Tg--kGNDQvRFe~~~~~l~P~l~viaP~ 147 (404) .++.+.|+++|.+..--++.| -=+.|.+.=-.+-.-.+|--+++|. T Consensus 65 ~G~~~~Ak~lGv~l~v~~a~~~~d~~~Q~~qie~~i~~~vDAIil~~v 112 (340) T PRK10936 65 YGLVQEAKRLGVDLKVLEAGGYYNLATQQQQLEQCVAWGADAILLGAV 112 (340) T ss_pred HHHHHHHHHHCCEEEEECCCCCCCHHHHHHHHHHHHHCCCCEEEECCC T ss_conf 999999998098699985899889999999999999759999998678 No 288 >cd06321 PBP1_ABC_sugar_binding_like_11 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. This group includes the periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consist of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=30.72 E-value=37 Score=14.28 Aligned_cols=65 Identities=15% Similarity=0.042 Sum_probs=24.4 Q ss_pred HHHHHHHHHCCCCE--EECHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHCCCC Q ss_conf 99999988568928--750011168215679999998587970774410068763799999999708967 Q gi|254780234|r 102 KYLVDIANETGADA--IAHGSTGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDLIDFAEKHAIPI 169 (404) Q Consensus 102 ~~lv~~a~~~ga~~--iaHG~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~i~ya~~~gIpv 169 (404) +.+-+.|++.|..+ +...+.+--+.|.+.=..+....+|.-+++|.. ...-+..+.-+.+.|||| T Consensus 19 ~gie~~a~~~g~~~~v~~~~~~~d~~~q~~~i~~li~~~vDgIii~~~~---~~~~~~~i~~a~~~gIpv 85 (271) T cd06321 19 KGAEAAAKKLNPGVKVTVVSADYDLNKQVSQIDNFIAAKVDLILLNAVD---SKGIAPAVKRAQAAGIVV 85 (271) T ss_pred HHHHHHHHHCCCCEEEEEECCCCCHHHHHHHHHHHHHCCCCEEEEECCC---CCHHHHHHHHHHHCCCEE T ss_conf 9999999971997699980798899999999999997499889993478---403599999999749918 No 289 >cd07200 cPLA2_Grp-IVA Group IVA cytosolic phospholipase A2; catalytic domain; Ca-dependent. Group IVA cPLA2, an 85 kDa protein, consists of two domains: the regulatory C2 domain and the alpha/beta hydrolase PLA2 domain. Group IVA cPLA2 is also referred to as cPLA2-alpha. The catalytic domain of cytosolic phospholipase A2 (cPLA2; EC 3.1.1.4) hydrolyzes the sn-2-acyl ester bond of phospholipids to release arachidonic acid. At the active site, cPLA2 contains a serine nucleophile through which the catalytic mechanism is initiated. The active site is partially covered by a solvent-accessible flexible lid. cPLA2 displays interfacial activation as it exists in both "closed lid" and "open lid" forms. Movement of the cPLA2 lid possibly exposes a greater hydrophobic surface and the active site. cPLA2 belongs to the alpha-beta hydrolase family which is identified by a characteristic nucleophile elbow with a consensus sequence of Sm-X-Nu-Sm (Sm = small residue, X = any residue and Nu = nucleophile Probab=30.66 E-value=37 Score=14.27 Aligned_cols=12 Identities=42% Similarity=0.589 Sum_probs=7.5 Q ss_pred CCCEEEEEECCC Q ss_conf 777899991477 Q gi|254780234|r 4 DVKKVVLAYSGG 15 (404) Q Consensus 4 ~~kkVvlaySGG 15 (404) ++-.|.++.||| T Consensus 42 eVPvIAv~gSGG 53 (505) T cd07200 42 EVPVIALLGSGG 53 (505) T ss_pred CCCEEEEECCCC T ss_conf 699899971684 No 290 >COG3867 Arabinogalactan endo-1,4-beta-galactosidase [Carbohydrate transport and metabolism] Probab=30.54 E-value=27 Score=15.29 Aligned_cols=199 Identities=17% Similarity=0.250 Sum_probs=94.2 Q ss_pred HHHHHHHHHHCCCEEEEEECHH-HHHHHHHHH---HHHHCCCCCCCCCCCCCCHHHHH--HHHHHHHHHHHCCCCEE--E Q ss_conf 6899999997398079982008-999998799---99736865478742013012433--59999999885689287--5 Q gi|254780234|r 46 LKIASDKARLLGAKEVYVKDLR-REFVRDFVF---PMFRANALYEGYYLLGTAIARPL--IAKYLVDIANETGADAI--A 117 (404) Q Consensus 46 ~~~~~~~A~~~Ga~~~~v~D~r-~ef~~~~i~---~~I~ana~Yeg~Ypl~tslaRpl--ia~~lv~~a~~~ga~~i--a 117 (404) .-++.+||..+|-+ + .+|+- .+|+.|=.. |.-+.+--| -.+-+.+ --+..+..++++|++.= - T Consensus 106 ~ieiakRAk~~GmK-V-l~dFHYSDfwaDPakQ~kPkaW~~l~f-------e~lk~avy~yTk~~l~~m~~eGi~pdmVQ 176 (403) T COG3867 106 AIEIAKRAKNLGMK-V-LLDFHYSDFWADPAKQKKPKAWENLNF-------EQLKKAVYSYTKYVLTTMKKEGILPDMVQ 176 (403) T ss_pred HHHHHHHHHHCCCE-E-EEECCCHHHCCCHHHCCCCHHHHHCCH-------HHHHHHHHHHHHHHHHHHHHCCCCCCCEE T ss_conf 99999998746867-9-864021110168010178477663489-------99999999999999999997489945167 Q ss_pred CH--H-------HCCCCHHHH----HH---HHHHHHCCCCEEEEEECCCCCCCHHHH-HHHHHHHCCCCCCCCCCCCCCC Q ss_conf 00--1-------116821567----99---999985879707744100687637999-9999970896756656788641 Q gi|254780234|r 118 HG--S-------TGKGNDQVR----FE---LSAYSLNSDIEIIAPWRHWSFKGRQDL-IDFAEKHAIPIDKNKRGEAPFS 180 (404) Q Consensus 118 HG--~-------TgkGNDQvR----Fe---~~~~~l~P~l~viaP~Rd~~~~sRe~~-i~ya~~~gIpv~~~~~~~~~yS 180 (404) -| . .|-|-+--| |. ++++-..|+++|.--.-+..-.||=.- .+-+-+++++.++--. +-|- T Consensus 177 VGNEtn~gflwp~Ge~~~f~k~a~L~n~g~~avrev~p~ikv~lHla~g~~n~~y~~~fd~ltk~nvdfDVig~--SyYp 254 (403) T COG3867 177 VGNETNGGFLWPDGEGRNFDKMAALLNAGIRAVREVSPTIKVALHLAEGENNSLYRWIFDELTKRNVDFDVIGS--SYYP 254 (403) T ss_pred ECCCCCCCEECCCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCHHHHHHHHHHHCCCCCEEEEE--ECCC T ss_conf 54546771324678876768999999977656532489716999934788875235888788773898407763--1254 Q ss_pred CCHHHHCCCCCCC--CCC---CCCC---CCCHHHCCCCCCHHHCCCCCEEE-EEEEECCEEEEECCEECCHHHHHHHHHH Q ss_conf 5724311545763--245---8457---68201001465812279996389-9999502478751623189999999988 Q gi|254780234|r 181 IDTNLLHSSSEGR--VLE---DPSQ---PAPEYVYKMIVSPEEAPDTPTTI-RIDFQRGDPIAINGQVMSPEVLLEQLNQ 251 (404) Q Consensus 181 ~D~Nlwg~S~Egg--~Le---dp~~---~~pe~~~~~t~~p~~ap~~pe~v-~I~Fe~G~PVainG~~~~~~~li~~LN~ 251 (404) -||-+++.- .|. +..+ -.-|-.|.+|.. ++-..+... .-+---|.|+.+-|+..-.-++|+..+. T Consensus 255 ----yWhgtl~nL~~nl~dia~rY~K~VmV~Etay~yTlE--dgDg~~Nt~~~~~~t~~ypitVQGQat~vrDvie~V~n 328 (403) T COG3867 255 ----YWHGTLNNLTTNLNDIASRYHKDVMVVETAYTYTLE--DGDGHENTFPSSEQTGGYPITVQGQATFVRDVIEAVKN 328 (403) T ss_pred ----CCCCCHHHHHHHHHHHHHHHCCEEEEEEECCEEEEC--CCCCCCCCCCCCCCCCCCCEEEECHHHHHHHHHHHHHH T ss_conf ----345748888868999988756747999811314612--57787775786455479735872334689999999972 Q ss_pred HHHHCCCCCE Q ss_conf 8642163741 Q gi|254780234|r 252 YGRCNGIGRI 261 (404) Q Consensus 252 igg~~GvGr~ 261 (404) .++.||.|.+ T Consensus 329 vp~~~GlGvF 338 (403) T COG3867 329 VPKSNGLGVF 338 (403) T ss_pred CCCCCCEEEE T ss_conf 7888835789 No 291 >cd06292 PBP1_LacI_like_10 Ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. This group includes the ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor. Probab=30.30 E-value=37 Score=14.23 Aligned_cols=110 Identities=11% Similarity=-0.011 Sum_probs=52.6 Q ss_pred HHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECH---HHHHHHHHHHHHHHCC-------CCCCC--CCCCCCC Q ss_conf 7189869999945787110689999999739807998200---8999998799997368-------65478--7420130 Q gi|254780234|r 27 VEKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDL---RREFVRDFVFPMFRAN-------ALYEG--YYLLGTA 94 (404) Q Consensus 27 ~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~---r~ef~~~~i~~~I~an-------a~Yeg--~Ypl~ts 94 (404) .+.||.++-++.+-....+ ...-+.....+..-.++.-. ..+...+.+..+.+.+ ..... .++ +.. T Consensus 26 ~~~Gy~~li~~s~~~~~~e-~~~i~~l~~~~vdGiIi~~~~~~~~~~~~~~~~~~~~~~iPvV~~d~~~~~~~~~~-~V~ 103 (273) T cd06292 26 AQYGYTVLLCNTYRGGVSE-ADYVEDLLARGVRGVVFISSLHADTHADHSHYERLAERGLPVVLVNGRAPPPLKVP-HVS 103 (273) T ss_pred HHCCCEEEEEECCCCHHHH-HHHHHHHHHCCCCEEEEECCCCCCCCCCHHHHHHHHHCCCCEEEEECCCCCCCCCC-EEE T ss_conf 9869989999789997999-99999999649984999247666331578999999966998899933579988874-899 Q ss_pred HHHHHHHHHHHHHHHHCCCCEEEC--HHHCCCCHHHHHHHHHHHHC Q ss_conf 124335999999988568928750--01116821567999999858 Q gi|254780234|r 95 IARPLIAKYLVDIANETGADAIAH--GSTGKGNDQVRFELSAYSLN 138 (404) Q Consensus 95 laRplia~~lv~~a~~~ga~~iaH--G~TgkGNDQvRFe~~~~~l~ 138 (404) ..-.--++.+++...+.|+.-++. |....-+-+-|.+....++. T Consensus 104 ~Dn~~a~~~a~~~L~~~G~~~i~~i~~~~~~~~~~~R~~G~~~a~~ 149 (273) T cd06292 104 TDDALAMRLAVRHLVALGHRRIGFASGPGRTVPRRRKIAGFRAALE 149 (273) T ss_pred ECHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHHHHHHH T ss_conf 6889999999999998599808995178777159999999999999 No 292 >PRK07105 pyridoxamine kinase; Validated Probab=30.25 E-value=37 Score=14.23 Aligned_cols=126 Identities=21% Similarity=0.291 Sum_probs=70.1 Q ss_pred CCCCEEEEEE----CCCHHHHHHHHHHHHHCCCEEEEE-EEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHH---HHHH Q ss_conf 7777899991----477259999999987189869999-9457871106899999997398079982008999---9987 Q gi|254780234|r 3 RDVKKVVLAY----SGGLDTSIILKWLQVEKGLEVIVF-IADLGQGEELKIASDKARLLGAKEVYVKDLRREF---VRDF 74 (404) Q Consensus 3 ~~~kkVvlay----SGGLDTSv~i~~L~~e~g~eVi~~-~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef---~~~~ 74 (404) ..||||.++- .||.--+..++-|. -.|.++.++ |+-+-|.-- |.......|+.+.+ .+++ T Consensus 2 ~~mK~il~i~d~~~~G~agl~A~i~~ls-a~G~~~~~vpTallS~~T~-----------g~~~~~~~~l~~~l~~~~~~w 69 (284) T PRK07105 2 NPMKRVAAAHDLSGFGRVALTASIPIMS-SMGLQVCPLPTALLSSHTG-----------GFQNPSILDLTDGMQAFLTHW 69 (284) T ss_pred CCCCEEEEECCCCCCCHHHHHHHHHHHH-HCCCCCEEEEEEEECCCCC-----------CCCCCEECCHHHHHHHHHHHH T ss_conf 9868699985878746779999999999-8699540656899456899-----------846450017289999999999 Q ss_pred HHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHH---HHHCC----CCEEECHHHCCC---CHHHHHHHHHHHHCCCCEEE Q ss_conf 999973686547874201301243359999999---88568----928750011168---21567999999858797077 Q gi|254780234|r 75 VFPMFRANALYEGYYLLGTAIARPLIAKYLVDI---ANETG----ADAIAHGSTGKG---NDQVRFELSAYSLNSDIEII 144 (404) Q Consensus 75 i~~~I~ana~Yeg~Ypl~tslaRplia~~lv~~---a~~~g----a~~iaHG~TgkG---NDQvRFe~~~~~l~P~l~vi 144 (404) -.--++..|.|-| |+-+.. +++.+.+. .+..+ +|-|. |.+|+. -|+--.+ +++-|.|.-.|| T Consensus 70 ~~~~i~~dai~tG-~Lgs~~-----~i~~v~~~l~~~~~~~~~iVvDPVM-gd~G~ly~~~~~~~v~-~~r~Llp~AdiI 141 (284) T PRK07105 70 KSLNLKFDAIYSG-YLGSPR-----QVQIVSEFIDDFKKKDLLVVVDPVM-GDNGKLYQGFDQKMVE-EMRKLIQKADVI 141 (284) T ss_pred HHCCCCCCEEEEC-CCCCHH-----HHHHHHHHHHHHCCCCCCEEECCCC-CCCCCCCCCCCHHHHH-HHHHHHCCCCCC T ss_conf 8669987989989-879999-----9999999999855789848969655-7688766778799999-999984407462 Q ss_pred EEEC Q ss_conf 4410 Q gi|254780234|r 145 APWR 148 (404) Q Consensus 145 aP~R 148 (404) +|=. T Consensus 142 TPN~ 145 (284) T PRK07105 142 TPNL 145 (284) T ss_pred CCCH T ss_conf 7999 No 293 >cd06273 PBP1_GntR_like_1 This group includes the ligand-binding domain of putative DNA transcription repressors which are highly similar to that of the repressor specific for gluconate (GntR), a member of the LacI-GalR family of bacterial transcription regulators. This group includes the ligand-binding domain of putative DNA transcription repressors which are highly similar to that of the repressor specific for gluconate (GntR), a member of the LacI-GalR family of bacterial transcription regulators. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational Probab=29.94 E-value=38 Score=14.19 Aligned_cols=113 Identities=14% Similarity=0.083 Sum_probs=52.2 Q ss_pred HHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHH--HHHHH---HHHHHHHHCCCCCCCC-CCCCCCHH Q ss_conf 999871898699999457871106899999997398079982008--99999---8799997368654787-42013012 Q gi|254780234|r 23 KWLQVEKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLR--REFVR---DFVFPMFRANALYEGY-YLLGTAIA 96 (404) Q Consensus 23 ~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r--~ef~~---~~i~~~I~ana~Yeg~-Ypl~tsla 96 (404) .-+ .+.||.++-.+.+--...+.+ .-+.....+..-.++.... ++..+ +.-.|.+..+....+. ++ +.... T Consensus 23 ~~~-~~~Gy~~~i~~s~~d~~~e~~-~i~~l~~~~vdGiIi~~~~~~~~~~~~l~~~~iP~V~~~~~~~~~~~~-~V~~D 99 (268) T cd06273 23 ETL-AAHGYTLLVASSGYDLDREYA-QARKLLERGVDGLALIGLDHSPALLDLLARRGVPYVATWNYSPDSPYP-CVGFD 99 (268) T ss_pred HHH-HHCCCEEEEEECCCCHHHHHH-HHHHHHHCCCCEEEECCCCCCHHHHHHHHHCCCCEEEECCCCCCCCCC-EEEEC T ss_conf 999-985999999958999899999-999999659999999189999899999997599899984657889997-89967 Q ss_pred HHHHHHHHHHHHHHCCCCEEECH-HHCCCC--HHHHHHHHHHHHC Q ss_conf 43359999999885689287500-111682--1567999999858 Q gi|254780234|r 97 RPLIAKYLVDIANETGADAIAHG-STGKGN--DQVRFELSAYSLN 138 (404) Q Consensus 97 Rplia~~lv~~a~~~ga~~iaHG-~TgkGN--DQvRFe~~~~~l~ 138 (404) -.--++.+++...+.|+.-++.= ..-..+ -+-|++....++. T Consensus 100 n~~~~~~~~~~L~~~G~~~i~~i~~~~~~~~~~~~R~~gf~~al~ 144 (268) T cd06273 100 NREAGRLAARHLIALGHRRIAMIFGPTQGNDRARARRAGVRAALA 144 (268) T ss_pred HHHHHHHHHHHHHHCCCCCEEEEECCCCCCHHHHHHHHHHHHHHH T ss_conf 799999999999872987479994687777789999999999999 No 294 >PRK12737 gatY tagatose-bisphosphate aldolase; Reviewed Probab=29.89 E-value=38 Score=14.19 Aligned_cols=120 Identities=15% Similarity=0.212 Sum_probs=62.7 Q ss_pred HHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHH Q ss_conf 99998718986999994578711068999999973980799820089999987999973686547874201301243359 Q gi|254780234|r 22 LKWLQVEKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYEGYYLLGTAIARPLIA 101 (404) Q Consensus 22 i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Yeg~Ypl~tslaRplia 101 (404) +..-+ +.+|-|-+|.+. .-+.+..+-+-|....+ +.++.=....+ +|. | -..++ T Consensus 10 L~~A~-~~~yAV~AfNv~--n~e~~~Avi~AAee~~s-PvIlq~s~~~~--~~~-----------g---------~~~~~ 63 (284) T PRK12737 10 LKKAQ-AEGYAVPAFNIH--NLETLQVVVETAAELRS-PVILAGTPGTI--SYA-----------G---------ADYII 63 (284) T ss_pred HHHHH-HCCCEEEEEEEC--CHHHHHHHHHHHHHHCC-CEEEECCHHHH--HHC-----------C---------HHHHH T ss_conf 99999-889379998889--99999999999999789-98999675388--777-----------9---------99999 Q ss_pred HHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEE---E---EECCCCCCCHHHHHHHHHHHCCCCCCCC Q ss_conf 9999998856892875001116821567999999858797077---4---4100687637999999997089675665 Q gi|254780234|r 102 KYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEII---A---PWRHWSFKGRQDLIDFAEKHAIPIDKNK 173 (404) Q Consensus 102 ~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~vi---a---P~Rd~~~~sRe~~i~ya~~~gIpv~~~~ 173 (404) ..+...|++.+..+..| .--|.| ||...+++.....-+ + |+.+ ++.--.+..+||.++|++|.... T Consensus 64 ~~~~~~a~~~~VPV~lH--LDH~~~---~e~~~~ai~~GftSVM~DgS~lp~ee-Ni~~T~~vv~~Ah~~gv~VEaEl 135 (284) T PRK12737 64 AIAEVAARKYNIPLALH--LDHHED---LDDIKKKVRAGIRSVMIDGSHLSFEE-NIAIVKEVVEFCHRYDASVEAEL 135 (284) T ss_pred HHHHHHHHHCCCCEEEE--CCCCCC---HHHHHHHHHCCCCEEEEECCCCCHHH-HHHHHHHHHHHHHHCCCEEEEEE T ss_conf 99999999879999998--899999---99999999819987987099999999-99999999998645288699963 No 295 >PRK12361 hypothetical protein; Provisional Probab=29.82 E-value=38 Score=14.18 Aligned_cols=88 Identities=23% Similarity=0.205 Sum_probs=48.1 Q ss_pred CEEEEEE---CCCHHHH----HHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHH Q ss_conf 7899991---4772599----99999987189869999945787110689999999739807998200899999879999 Q gi|254780234|r 6 KKVVLAY---SGGLDTS----IILKWLQVEKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPM 78 (404) Q Consensus 6 kkVvlay---SGGLDTS----v~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~ 78 (404) |+-.|-| |||=|-. -+..+|......+|..-+- ..+-+++.++|.+-|+. .+|.=.-|=-+.+.+ .+ T Consensus 242 ~~a~li~NPvsG~g~~~~~~~~i~~~L~~~~~l~v~~t~~----~~~a~~LArqAv~~GaD-lVVAaGGDGTVneVA-~a 315 (546) T PRK12361 242 KRAWLIANPVSGGGKWQQYGEQIIQELKAYFDLTVKLTTP----DISANTLAKQARKAGAD-IIIACGGDGTVTEVA-SE 315 (546) T ss_pred CCEEEEECCCCCCCCHHHHHHHHHHHHHHHEEEEEEECCC----CCCHHHHHHHHHHCCCC-EEEEECCCHHHHHHH-HH T ss_conf 7559998367789876889999999735317689996688----88999999999974999-999988852999999-99 Q ss_pred HHCCCCCCCCCCCCCC--HHHHH Q ss_conf 7368654787420130--12433 Q gi|254780234|r 79 FRANALYEGYYLLGTA--IARPL 99 (404) Q Consensus 79 I~ana~Yeg~Ypl~ts--laRpl 99 (404) +...-.--|.-|+||+ +||-| T Consensus 316 LvgTdv~LGIIPlGTaN~lAR~L 338 (546) T PRK12361 316 LVNTDITLGIIPLGTANALSHAL 338 (546) T ss_pred HCCCCCCEEEECCCCHHHHHHHC T ss_conf 71899824996177557889861 No 296 >COG3155 ElbB Uncharacterized protein involved in an early stage of isoprenoid biosynthesis [Secondary metabolites biosynthesis, transport, and catabolism] Probab=29.81 E-value=38 Score=14.18 Aligned_cols=45 Identities=22% Similarity=0.247 Sum_probs=31.1 Q ss_pred CCEEEEEECCC--H-----HHHHHHHHHHHHCCCEEEEEEEECCCCCHHHHH Q ss_conf 77899991477--2-----599999999871898699999457871106899 Q gi|254780234|r 5 VKKVVLAYSGG--L-----DTSIILKWLQVEKGLEVIVFIADLGQGEELKIA 49 (404) Q Consensus 5 ~kkVvlaySGG--L-----DTSv~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~ 49 (404) ||||.+-.||- + --||+..+-.++.|.+++||.-|.-|-.-+..+ T Consensus 1 mKkv~ViLSGCGV~DGaEIHEsVltllai~r~GA~~~cFAP~~~Q~hViNHl 52 (217) T COG3155 1 MKKVGVILSGCGVYDGAEIHESVLTLLAISRSGAQAVCFAPDKQQVHVINHL 52 (217) T ss_pred CCEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHC T ss_conf 9626999756755540788999999999975386258746992256666531 No 297 >cd03018 PRX_AhpE_like Peroxiredoxin (PRX) family, AhpE-like subfamily; composed of proteins similar to Mycobacterium tuberculosis AhpE. AhpE is described as a 1-cys PRX because of the absence of a resolving cysteine. The structure and sequence of AhpE, however, show greater similarity to 2-cys PRXs than 1-cys PRXs. PRXs are thiol-specific antioxidant (TSA) proteins that confer a protective role in cells through their peroxidase activity in which hydrogen peroxide, peroxynitrate, and organic hydroperoxides are reduced and detoxified using reducing equivalents derived from either thioredoxin, glutathione, trypanothione and AhpF. The first step of catalysis is the nucleophilic attack by the peroxidatic cysteine on the peroxide leading to the formation of a cysteine sulfenic acid intermediate. The absence of a resolving cysteine suggests that functional AhpE is regenerated by an external reductant. The solution behavior and crystal structure of AhpE show that it forms dimers and octamers. Probab=29.77 E-value=38 Score=14.17 Aligned_cols=27 Identities=30% Similarity=0.330 Sum_probs=12.3 Q ss_pred HHCCCCEEEEEECCCCCCCHHHHHHHHH Q ss_conf 8587970774410068763799999999 Q gi|254780234|r 136 SLNSDIEIIAPWRHWSFKGRQDLIDFAE 163 (404) Q Consensus 136 ~l~P~l~viaP~Rd~~~~sRe~~i~ya~ 163 (404) .++|+.+|.--|+..+-..| ++-+|++ T Consensus 119 iID~~G~V~~~~v~~~~~~r-~~~~~~~ 145 (149) T cd03018 119 VIDRDGIIRYAWVSDDGEPR-DLPDYDE 145 (149) T ss_pred EECCCCEEEEEEEECCCCCC-CHHHHHH T ss_conf 99799989999990799678-8579999 No 298 >cd05566 PTS_IIB_galactitol PTS_IIB_galactitol: subunit IIB of enzyme II (EII) of the galactitol-specific phosphoenolpyruvate:carbohydrate phosphotransferase system (PTS). In this system, EII is a galactitol-specific permease with two cytoplasmic domains (IIA and IIB) and a transmembrane channel IIC domain that are expressed on three distinct polypeptide chains, in contrast to other PTS sugar transporters. The three genes encoding these subunits (gatA, gatB, and gatC) comprise the gatCBA operon. Galactitol PTS permease takes up exogenous galactitol, releasing the phosphate ester into the cytoplasm in preparation for oxidation and further metabolism via a modified glycolytic pathway called the tagatose-6-phosphate glycolytic pathway. The IIB domain fold includes a central four-stranded parallel open twisted beta-sheet flanked by alpha-helices on both sides. The seven major PTS systems with this IIB fold include galactitol, chitobiose/lichenan, ascorbate, lactose, mannitol, fructose, and Probab=29.40 E-value=39 Score=14.13 Aligned_cols=35 Identities=31% Similarity=0.529 Sum_probs=22.1 Q ss_pred CEEEEEECCCHHHHHHHH-----HHHHHCCCEEEEEEEECC Q ss_conf 789999147725999999-----998718986999994578 Q gi|254780234|r 6 KKVVLAYSGGLDTSIILK-----WLQVEKGLEVIVFIADLG 41 (404) Q Consensus 6 kkVvlaySGGLDTSv~i~-----~L~~e~g~eVi~~~~d~G 41 (404) |||++|.-.|.-||..+. +|+ +.|.++....++++ T Consensus 1 KkIlVaCGsGiaTSt~v~~ki~~~l~-e~gi~~~i~~~~i~ 40 (89) T cd05566 1 KKILVACGTGVATSTVVASKVKELLK-ENGIDVKVEQCKIA 40 (89) T ss_pred CEEEEECCCCCHHHHHHHHHHHHHHH-HCCCCEEEEEECHH T ss_conf 94999989970399999999999999-86991599997599 No 299 >TIGR03569 NeuB_NnaB N-acetylneuraminate synthase. This family is a subset of the pfam03102 and is believed to include only authentic NeuB N-acetylneuraminate (sialic acid) synthase enzymes. The majority of the genes identified by this model are observed adjacent to both the NeuA and NeuC genes which together effect the biosynthesis of CMP-N-acetylneuraminate from UDP-N-acetylglucosamine. Probab=29.34 E-value=39 Score=14.12 Aligned_cols=216 Identities=14% Similarity=0.102 Sum_probs=97.2 Q ss_pred EEEECCCCC--HHHH---HHHHHHHCCCE--EEEEECHHHHHHHHHHHHH-HHCCCCCCC-CCCCCCCHHHH-HHHHHHH Q ss_conf 994578711--0689---99999973980--7998200899999879999-736865478-74201301243-3599999 Q gi|254780234|r 36 FIADLGQGE--ELKI---ASDKARLLGAK--EVYVKDLRREFVRDFVFPM-FRANALYEG-YYLLGTAIARP-LIAKYLV 105 (404) Q Consensus 36 ~~~d~Gq~~--d~~~---~~~~A~~~Ga~--~~~v~D~r~ef~~~~i~~~-I~ana~Yeg-~Ypl~tslaRp-lia~~lv 105 (404) +.|.+|+.- |++. +-+.|..+||. ++...+..+.+..+.-.+. ...+...+. .|=+-..+.-+ =--+.+. T Consensus 3 IIAEIg~NH~Gdl~~Ak~LI~~A~~sGadaVKFQ~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~kk~els~~~~~~L~ 82 (329) T TIGR03569 3 IIAEAGVNHNGSLELAKKLVDAAAEAGADAVKFQTFKAEDLVSKNAPKAEYQKINTGAEESQLEMLKKLELSEEDHRELK 82 (329) T ss_pred EEEEECCCCCCCHHHHHHHHHHHHHHCCCEEEEECCCCCHHCCCCCCHHHCCCCCCCCCCCHHHHHHHHCCCHHHHHHHH T ss_conf 99996787678099999999999994969999307871132781103033012689888619999987148999999999 Q ss_pred HHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHH Q ss_conf 99885689287500111682156799999985879707744100687637999999997089675665678864157243 Q gi|254780234|r 106 DIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDLIDFAEKHAIPIDKNKRGEAPFSIDTNL 185 (404) Q Consensus 106 ~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~i~ya~~~gIpv~~~~~~~~~yS~D~Nl 185 (404) +.+++.|.+..+.- -|.-.++.. ..++++.-=|+-. +++ --.+++++.+.+.|+=.+... T Consensus 83 ~~~~~~gi~~~~s~-----fd~~s~~~l-~~~~~~~~KIaS~---d~~-n~~Li~~i~~~~kpiiiStG~---------- 142 (329) T TIGR03569 83 EYCESKGIEFLSTP-----FDLESADFL-EDLGVPRFKIPSG---EIT-NAPLLKKIARFGKPVILSTGM---------- 142 (329) T ss_pred HHHHHHCCEEEECC-----CCHHHHHHH-HHCCCCEEEECCC---CCC-CHHHHHHHHHHCCCEEEECCC---------- T ss_conf 99997299299896-----989999999-8649997997923---105-489999999748978996786---------- Q ss_pred HCCCCCCCCCCCCCCCCCHHHCCCCCCHHHCCCCCEEEEEEEECCEEEEECCEECCHHHHHHHHHHHHHHCCCCCEECCC Q ss_conf 11545763245845768201001465812279996389999950247875162318999999998886421637410007 Q gi|254780234|r 186 LHSSSEGRVLEDPSQPAPEYVYKMIVSPEEAPDTPTTIRIDFQRGDPIAINGQVMSPEVLLEQLNQYGRCNGIGRIDIVE 265 (404) Q Consensus 186 wg~S~Egg~Ledp~~~~pe~~~~~t~~p~~ap~~pe~v~I~Fe~G~PVainG~~~~~~~li~~LN~igg~~GvGr~d~vE 265 (404) ..+++-.. +..+. ....+.....+-+.=-...|..++.-. +..|..|.+.-+ .=||-.||.. T Consensus 143 -------s~~~EI~~-----av~~~--~~~~~~~~~i~llhC~s~YP~~~~~~n---L~~i~~lk~~f~-~~iG~SDHt~ 204 (329) T TIGR03569 143 -------ATLEEIEA-----AVGVL--RDAGTPDSNITLLHCTTEYPAPFEDVN---LNAMDTLKEAFD-LPVGYSDHTL 204 (329) T ss_pred -------CHHHHHHH-----HHHHH--HHCCCCCCCEEEEEECCCCCCCHHHCC---HHHHHHHHHHCC-CCEEECCCCC T ss_conf -------23999999-----99999--974886557688643788999937647---899999999859-9868479789 Q ss_pred CCCCCCCCCCCCCCHHHHHHHHHHHHHH-HHHCCHH Q ss_conf 7320210001113737899999999998-6508999 Q gi|254780234|r 266 NRFVGIKSRGVYETPGGTILLHAHRAIE-SISLDSG 300 (404) Q Consensus 266 nr~vG~KsR~vYEaPga~iL~~Ahr~LE-~~~l~~~ 300 (404) + .++|-+++-+-|. -+| .+|+|+. T Consensus 205 g----------~~~~~~Ava~GA~-vIEKHfTldk~ 229 (329) T TIGR03569 205 G----------IEAPIAAVALGAT-VIEKHFTLDKN 229 (329) T ss_pred C----------CHHHHHHHHCCCE-EEEEEEECCCC T ss_conf 8----------3679999986996-99987776899 No 300 >TIGR01463 mtaA_cmuA methyltransferase, MtaA/CmuA family; InterPro: IPR006360 This entry represents a methyltransferase domain found in proteins which are involved in C1 metabolism in methanogenic archaea and methylotrophic bacteria. It is closely related to, yet is distinct from, uroporphyrinogen decarboxylase (4.1.1.37 from EC). In methanogens, this domain is found in single-domain proteins catalysing the transfer of the methyl group of methylcobalamin to coenzymeM . In Methanosarcina barkeri these proteins have been shown to be involved in methanogenesis from C1 compounds such as methylamine and dimethylsulphide ,CH3-cob(III)alamin + coenzymeM --> cob(I)alamin + CH3-coenzymeM In methylotrophic bacteria, this domain is found at the N-terminus of the bifunctional corrinoid-binding CmuA protein, which is involved in chloromethane utilisation. The pathway of chloromethane utilisation allows the microorganisms that possess it to grow with chloromethane as the sole carbon and energy source. It is initiated by a corrinoid-dependent methyltransferase system involving methyltransferase I (CmuA) and methyltransferase II (CmuB), which transfer the methyl group of chloromethane onto tetrahydrofolate . The methyl group of chloromethane is first transferred by the protein CmuA to its corrinoid moiety, from where it is transferred to tetrahydrofolate by CmuB, thereby yielding methyltetrahydrofolate , .CH3Cl + cob(I)alamin ---> CH3-cob(III)alamin + Cl- ; GO: 0008168 methyltransferase activity, 0006730 one-carbon compound metabolic process. Probab=28.98 E-value=39 Score=14.08 Aligned_cols=20 Identities=25% Similarity=0.145 Sum_probs=17.7 Q ss_pred HHHHHHHHHCCCCEEECHHH Q ss_conf 99999988568928750011 Q gi|254780234|r 102 KYLVDIANETGADAIAHGST 121 (404) Q Consensus 102 ~~lv~~a~~~ga~~iaHG~T 121 (404) |+++..++|.++-.|.|-|+ T Consensus 228 kr~~~~~KEi~~~~~lHiCg 247 (350) T TIGR01463 228 KRLVAEIKEIDGITVLHICG 247 (350) T ss_pred HHHHHHHHCCCCCEEEEECC T ss_conf 88987621126866884037 No 301 >PRK07945 hypothetical protein; Provisional Probab=28.85 E-value=39 Score=14.06 Aligned_cols=102 Identities=16% Similarity=0.260 Sum_probs=52.4 Q ss_pred EECCCHHHHHHHHHHHHHCCCEEEEEEEE-----CCCC---C----HHHHHHHHHHHCCCEEEE-----------EECHH Q ss_conf 91477259999999987189869999945-----7871---1----068999999973980799-----------82008 Q gi|254780234|r 11 AYSGGLDTSIILKWLQVEKGLEVIVFIAD-----LGQG---E----ELKIASDKARLLGAKEVY-----------VKDLR 67 (404) Q Consensus 11 aySGGLDTSv~i~~L~~e~g~eVi~~~~d-----~Gq~---~----d~~~~~~~A~~~Ga~~~~-----------v~D~r 67 (404) -||.|-.|=--..--..+.||+-+++|=- +..+ + .++.|.+-..++.-...+ -+|+. T Consensus 105 ~wSDG~~sieeMa~aA~~lGyeYlaITDHS~~l~vA~GLs~erL~~Q~~eI~~ln~~~~~~rIL~GiEvDIL~DGsLD~~ 184 (335) T PRK07945 105 DWSDGGSPIEEMMATAAALGHEYCALTDHSPRLTVANGLSPERLRKQLDVIAELNERLAPFRILTGIEVDILEDGSLDQE 184 (335) T ss_pred CCCCCCCHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCCC T ss_conf 84579786999999999839827998048853456689799999999999999985448927997535321489887899 Q ss_pred HHHHHH--HHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCC Q ss_conf 999998--79999736865478742013012433599999998856892875001116 Q gi|254780234|r 68 REFVRD--FVFPMFRANALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGK 123 (404) Q Consensus 68 ~ef~~~--~i~~~I~ana~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~Tgk 123 (404) +++-++ +|.-+|-.+ -.+.|--+-+.++..++.-..+.++| |||+ T Consensus 185 DevL~~LD~VVASVHs~----------f~~~~~~mT~Ril~Ai~nP~V~IlgH-PTGR 231 (335) T PRK07945 185 PELLDRLDVVVASVHSK----------LRMDRAAMTRRMLAAVANPHTDVLGH-CTGR 231 (335) T ss_pred HHHHHHCCEEEEEECCC----------CCCCHHHHHHHHHHHHHCCCCCEEEC-CCCC T ss_conf 89997579899970134----------57999999999999972998888537-7645 No 302 >PRK06361 hypothetical protein; Provisional Probab=28.74 E-value=40 Score=14.05 Aligned_cols=118 Identities=21% Similarity=0.267 Sum_probs=63.1 Q ss_pred HHHHHHHCCCCEE-ECHHHCCCCHHHHHHHHHHHHCCCCEEEE-EECCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCC Q ss_conf 9999885689287-50011168215679999998587970774-410068763799999999708967566567886415 Q gi|254780234|r 104 LVDIANETGADAI-AHGSTGKGNDQVRFELSAYSLNSDIEIIA-PWRHWSFKGRQDLIDFAEKHAIPIDKNKRGEAPFSI 181 (404) Q Consensus 104 lv~~a~~~ga~~i-aHG~TgkGNDQvRFe~~~~~l~P~l~via-P~Rd~~~~sRe~~i~ya~~~gIpv~~~~~~~~~yS~ 181 (404) ..+..++++...+ .||.....-.+....+ ...-+|...||+ | .+-. ++.+..|+++|.-+..+. +|.+. T Consensus 82 ~~~~l~~ld~~iasvH~~~~~~~~~~~~~~-~Aienp~VdII~HP----g~i~-~e~a~~Aae~gtaLEINa---~p~rl 152 (216) T PRK06361 82 LAKKARDLGAEIVVVHGETIVEPVAPGTNL-AAISCEDVDILAHP----GLIT-EEDAALAAENGVFLEITA---RKGHN 152 (216) T ss_pred HHHHHHHCCCEEEEEECCCCHHHHHHHHHH-HHHHCCCCCEEECC----CCCC-HHHHHHHHHHCCEEEEEC---CCCCC T ss_conf 578998679769998166201156768999-99858995587389----8425-899999998195899968---97764 Q ss_pred CHHHHCCCCCCCCCCCCCCCCCHHHCCCCCCHHHCCCCCEEEEEEEECCEEEEECCEECCHHH--HHHHHHHHHHHCCCC Q ss_conf 724311545763245845768201001465812279996389999950247875162318999--999998886421637 Q gi|254780234|r 182 DTNLLHSSSEGRVLEDPSQPAPEYVYKMIVSPEEAPDTPTTIRIDFQRGDPIAINGQVMSPEV--LLEQLNQYGRCNGIG 259 (404) Q Consensus 182 D~Nlwg~S~Egg~Ledp~~~~pe~~~~~t~~p~~ap~~pe~v~I~Fe~G~PVainG~~~~~~~--li~~LN~igg~~GvG 259 (404) |-| + ..+...-+.|+|++||-.+.++-+ -+.+-=.++.+-|+. T Consensus 153 d~~-------------------~----------------~~~r~A~~~Gv~i~I~sDAH~~~~l~~~~~gv~~Arra~L~ 197 (216) T PRK06361 153 LTN-------------------G----------------HVARLAREAGAPLVINTDTHAPSDLITYETARKVALGAGLT 197 (216) T ss_pred CCC-------------------H----------------HHHHHHHHCCCEEEEECCCCCCCCCCCHHHHHHHHHHCCCC T ss_conf 777-------------------9----------------99999998699699978999800036889999999984999 Q ss_pred CEECCC Q ss_conf 410007 Q gi|254780234|r 260 RIDIVE 265 (404) Q Consensus 260 r~d~vE 265 (404) .-|.+. T Consensus 198 ~~dv~n 203 (216) T PRK06361 198 EAELKE 203 (216) T ss_pred HHHHHH T ss_conf 999999 No 303 >PRK07313 phosphopantothenoylcysteine decarboxylase; Validated Probab=28.59 E-value=40 Score=14.03 Aligned_cols=133 Identities=22% Similarity=0.277 Sum_probs=61.7 Q ss_pred CEEEEEECCCHHHHHH---HHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHH--HCCCEEEEEECHHH-HHHH--HHHHH Q ss_conf 7899991477259999---999987189869999945787110689999999--73980799820089-9999--87999 Q gi|254780234|r 6 KKVVLAYSGGLDTSII---LKWLQVEKGLEVIVFIADLGQGEELKIASDKAR--LLGAKEVYVKDLRR-EFVR--DFVFP 77 (404) Q Consensus 6 kkVvlaySGGLDTSv~---i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~--~~Ga~~~~v~D~r~-ef~~--~~i~~ 77 (404) |||+|+.+||.-.--+ +..|+ +.|++|.++.-.-.+. -+..... ..|- +.+. |.-+ +.-. +++.- T Consensus 3 K~IllgvtGsIAayK~~~lir~L~-k~g~~V~vi~T~~A~~----fv~~~~~~~ls~~-~v~~-d~~~~~~~~~~~HI~L 75 (180) T PRK07313 3 KKILLAVSGSIAAYKAADLTSQLT-KIGYDVTVIMTKAATQ----FITPLTLQVLSKN-PVHL-DVMDEEDPKSMNHIEL 75 (180) T ss_pred CEEEEEEECHHHHHHHHHHHHHHH-HCCCEEEEEECCCHHH----HCCHHHHHHHCCC-EEEC-CCCCCCCCCCCCCEEE T ss_conf 769999954999999999999999-8899599997746997----6698789887299-2883-7886546787661212 Q ss_pred HHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCC-EEEEEECC---CCCC Q ss_conf 9736865478742013012433599999998856892875001116821567999999858797-07744100---6876 Q gi|254780234|r 78 MFRANALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDI-EIIAPWRH---WSFK 153 (404) Q Consensus 78 ~I~ana~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l-~viaP~Rd---~~~~ 153 (404) +=++. -.+ .+ |+.|-.+.+.|. |-.+|..- ..+.++.++- -+++|--. |+-. T Consensus 76 ~~waD------l~l-Ia---PaTANtiaK~A~------------GiaDnllt--~~~lA~~~~~Pv~iaPAMn~~Mw~~p 131 (180) T PRK07313 76 AKNTD------LFL-VA---PATANTIAKLAH------------GIADNLVT--SVALALPATTPKLIAPAMNTKMYQNP 131 (180) T ss_pred CCCCC------EEE-EE---CCCHHHHHHHHH------------CCCCCHHH--HHHHHHCCCCCEEEECCCCHHHHHCH T ss_conf 32038------889-81---086869999984------------12472999--99997268998899517788888798 Q ss_pred CHHHHHHHHHHHCCCC Q ss_conf 3799999999708967 Q gi|254780234|r 154 GRQDLIDFAEKHAIPI 169 (404) Q Consensus 154 sRe~~i~ya~~~gIpv 169 (404) .-++-++.+++.|+-+ T Consensus 132 ~tq~nl~~L~~~G~~i 147 (180) T PRK07313 132 ATQRNIKRLKEDGVTI 147 (180) T ss_pred HHHHHHHHHHHCCCEE T ss_conf 8999999999789899 No 304 >PRK02621 consensus Probab=28.51 E-value=40 Score=14.02 Aligned_cols=23 Identities=17% Similarity=0.137 Sum_probs=16.5 Q ss_pred HHHHHHHHHHHCCCEEEEEECHH Q ss_conf 06899999997398079982008 Q gi|254780234|r 45 ELKIASDKARLLGAKEVYVKDLR 67 (404) Q Consensus 45 d~~~~~~~A~~~Ga~~~~v~D~r 67 (404) |.-+..+.-...||.+.+++|+- T Consensus 31 dP~~~ak~~~~~gad~lhivDld 53 (254) T PRK02621 31 DPVELACRYSQAGADELVFLDIT 53 (254) T ss_pred CHHHHHHHHHHCCCCEEEEEECC T ss_conf 99999999998599999998266 No 305 >pfam11116 DUF2624 Protein of unknown function (DUF2624). This family is conserved in the Bacillaceae family. Several members are named as YqfT. The function is not known. Probab=28.13 E-value=41 Score=13.98 Aligned_cols=21 Identities=24% Similarity=0.605 Sum_probs=15.4 Q ss_pred CCCCHHHHHHHHHHHCCCCCCC Q ss_conf 8763799999999708967566 Q gi|254780234|r 151 SFKGRQDLIDFAEKHAIPIDKN 172 (404) Q Consensus 151 ~~~sRe~~i~ya~~~gIpv~~~ 172 (404) +++ -+|++.|++++|||+... T Consensus 14 ~iT-~~ELlkYakqy~i~iT~~ 34 (84) T pfam11116 14 QIT-ADELLKYAKQYGIPITRA 34 (84) T ss_pred CCC-HHHHHHHHHHHCCCCCHH T ss_conf 388-999999999918887889 No 306 >cd03332 LMO_FMN L-Lactate 2-monooxygenase (LMO) FMN-binding domain. LMO is a FMN-containing enzyme that catalyzes the conversion of L-lactate and oxygen to acetate, carbon dioxide, and water. LMO is a member of the family of alpha-hydroxy acid oxidases. It is thought to be a homooctamer with two- and four- fold axes in the center of the octamer. Probab=27.91 E-value=41 Score=13.95 Aligned_cols=33 Identities=15% Similarity=0.128 Sum_probs=21.5 Q ss_pred CCEEEEECCEECCHHHHHHHHHHHHHHCCCCCE Q ss_conf 024787516231899999999888642163741 Q gi|254780234|r 229 RGDPIAINGQVMSPEVLLEQLNQYGRCNGIGRI 261 (404) Q Consensus 229 ~G~PVainG~~~~~~~li~~LN~igg~~GvGr~ 261 (404) .-.||-+||---++.+++..|--=+-.-+|||- T Consensus 307 ~~~~V~~DgGIRrG~DV~KAlALGA~~V~iGRp 339 (383) T cd03332 307 DRLTVLFDSGVRTGADIMKALALGAKAVLIGRP 339 (383) T ss_pred CCCEEEEECCCCCHHHHHHHHHCCCCEEEECHH T ss_conf 998499979978679999999769998987789 No 307 >COG1363 FrvX Cellulase M and related proteins [Carbohydrate transport and metabolism] Probab=27.78 E-value=41 Score=13.94 Aligned_cols=38 Identities=26% Similarity=0.313 Sum_probs=29.1 Q ss_pred CHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHCCCCCCC Q ss_conf 3799999999708967566567886415724311545763 Q gi|254780234|r 154 GRQDLIDFAEKHAIPIDKNKRGEAPFSIDTNLLHSSSEGR 193 (404) Q Consensus 154 sRe~~i~ya~~~gIpv~~~~~~~~~yS~D~Nlwg~S~Egg 193 (404) -|+..++-|++|+||+..... +-|.+|..-.+.+-+|- T Consensus 271 l~~~L~~~A~~~~Ip~Q~~v~--~~ggTDA~a~~~~g~gv 308 (355) T COG1363 271 LRKFLLELAEKNNIPYQVDVS--PGGGTDAGAAHLTGGGV 308 (355) T ss_pred HHHHHHHHHHHCCCCEEEEEC--CCCCCCHHHHHHCCCCC T ss_conf 999999999985997699945--78875389998727997 No 308 >TIGR03106 trio_M42_hydro hydrolase, peptidase M42 family. This model describes a subfamily of MEROPS peptidase family M42, a glutamyl aminopeptidase family that also includes the cellulase CelM from Clostridium thermocellum and deblocking aminopeptidases that can remove acylated amino acids. Members of this family occur in a three gene cassette with an amidotransferase (TIGR03104)in the asparagine synthase (glutamine-hydrolyzing) family, and a probable acetyltransferase (TIGR03103) in the GNAT family. Probab=27.61 E-value=41 Score=13.92 Aligned_cols=19 Identities=21% Similarity=0.130 Sum_probs=12.1 Q ss_pred HHHHHHHHHHHHHCCCCCE Q ss_conf 9999999888642163741 Q gi|254780234|r 243 EVLLEQLNQYGRCNGIGRI 261 (404) Q Consensus 243 ~~li~~LN~igg~~GvGr~ 261 (404) ..+.++|-++|.+|||-.- T Consensus 266 ~~l~~~l~~~A~~~~Ip~Q 284 (343) T TIGR03106 266 YHLTRKLIRLCQDHGIPHR 284 (343) T ss_pred HHHHHHHHHHHHHCCCCCE T ss_conf 8999999999998699847 No 309 >PTZ00110 helicase; Provisional Probab=27.61 E-value=28 Score=15.17 Aligned_cols=149 Identities=18% Similarity=0.208 Sum_probs=77.1 Q ss_pred EEECCCHHHHHHHHHHHHHCCC---EEEEEEEECCC---------CCHHHHHHHHHHHC---CC-EEEEEECHHHHHHHH Q ss_conf 9914772599999999871898---69999945787---------11068999999973---98-079982008999998 Q gi|254780234|r 10 LAYSGGLDTSIILKWLQVEKGL---EVIVFIADLGQ---------GEELKIASDKARLL---GA-KEVYVKDLRREFVRD 73 (404) Q Consensus 10 laySGGLDTSv~i~~L~~e~g~---eVi~~~~d~Gq---------~~d~~~~~~~A~~~---Ga-~~~~v~D~r~ef~~~ 73 (404) -++|+|-|||-+=.-|+ .-+. +..-|--|.=+ .+|++..+++-... |- .+--|..+.+-=.-+ T Consensus 113 ~~~~~~~~~~~~g~~l~-~i~w~~~~l~~f~KnfY~eh~~v~~~s~~ev~~~r~~~~i~v~~G~~vP~Pi~~F~~~~fp~ 191 (602) T PTZ00110 113 NSSSNGYGSSTLGKRLM-PIDWNQVELVPFEKNFYKEHPEVSAMSTKEVDDIRKEREITIIAGRDVPKPVVSFEYTSFPD 191 (602) T ss_pred CCCCCCCCHHHHHCCCC-CCCCCHHHCCCCCCCCCCCCHHHHHCCHHHHHHHHHHCCCEEECCCCCCCCCCCHHHCCCCH T ss_conf 54467766102414688-88875300664135277769767618999999999977988927898994726650159999 Q ss_pred HHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHH----------HHCCCCEE Q ss_conf 79999736865478742013012433599999998856892875001116821567999999----------85879707 Q gi|254780234|r 74 FVFPMFRANALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQVRFELSAY----------SLNSDIEI 143 (404) Q Consensus 74 ~i~~~I~ana~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~----------~l~P~l~v 143 (404) ||..+|+.+. |---| |+++.. +.+|. .|-|.|+-.-||.|..-.=.=-.+. .-+|-.=| T Consensus 192 ~il~~i~~~G-----F~~PT----PIQ~qa-IPiaL-sGrDvIgiAqTGSGKTLAFlLP~l~hi~~q~~~~~~~gP~aLI 260 (602) T PTZ00110 192 YILKSIEAAG-----FTEPT----PIQVQG-WPIAL-SGRDMIGIAETGSGKTLAFLLPAIVHINAQPLLRPGDGPIVLV 260 (602) T ss_pred HHHHHHHHCC-----CCCCC----HHHHHH-HHHHH-CCCCEEEECCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCEEEE T ss_conf 9999999769-----99999----899999-87985-6986799878978899999999999985163436789976999 Q ss_pred EEEECCCCCCCHHHHHHHHHHHCCCCC Q ss_conf 744100687637999999997089675 Q gi|254780234|r 144 IAPWRHWSFKGRQDLIDFAEKHAIPID 170 (404) Q Consensus 144 iaP~Rd~~~~sRe~~i~ya~~~gIpv~ 170 (404) +||-||.-.+-.++...|.+..+|.+. T Consensus 261 LaPTRELA~QI~~e~~~~~~~~~ir~~ 287 (602) T PTZ00110 261 LAPTRELAEQIREQALQFGRSSKLKNS 287 (602) T ss_pred ECCHHHHHHHHHHHHHHHHCCCCCEEE T ss_conf 738399999999999997154785499 No 310 >COG0604 Qor NADPH:quinone reductase and related Zn-dependent oxidoreductases [Energy production and conversion / General function prediction only] Probab=27.54 E-value=42 Score=13.91 Aligned_cols=13 Identities=23% Similarity=0.404 Sum_probs=6.2 Q ss_pred CCEEEEEEEECCC Q ss_conf 9869999945787 Q gi|254780234|r 30 GLEVIVFIADLGQ 42 (404) Q Consensus 30 g~eVi~~~~d~Gq 42 (404) |.|+.....-+|. T Consensus 62 G~d~aG~V~avG~ 74 (326) T COG0604 62 GSEAAGVVVAVGS 74 (326) T ss_pred CEEEEEEEEECCC T ss_conf 0157899996378 No 311 >pfam01116 F_bP_aldolase Fructose-bisphosphate aldolase class-II. Probab=27.43 E-value=42 Score=13.89 Aligned_cols=121 Identities=14% Similarity=0.232 Sum_probs=65.7 Q ss_pred HHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHH Q ss_conf 99999871898699999457871106899999997398079982008999998799997368654787420130124335 Q gi|254780234|r 21 ILKWLQVEKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYEGYYLLGTAIARPLI 100 (404) Q Consensus 21 ~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Yeg~Ypl~tslaRpli 100 (404) ++.--+ +.+|-|-+|.+. .-+....+-+-|....+ +.++ .+-+--. +|. .-..+ T Consensus 7 ll~~A~-~~~yAV~afNv~--~~e~~~avi~AAee~~s-PvIl-q~s~~~~-~~~--------------------g~~~~ 60 (283) T pfam01116 7 LLDKAK-EGGYAVPAFNVN--NLETINAVLEAAEEANS-PVII-QVSPGAA-KYA--------------------GAEAL 60 (283) T ss_pred HHHHHH-HCCEEEEEEEEC--CHHHHHHHHHHHHHHCC-CEEE-ECCCCHH-HHC--------------------CHHHH T ss_conf 999999-889279987679--89999999999999699-9999-9684467-558--------------------99999 Q ss_pred HHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCE---EE---EEECCCCCCCHHHHHHHHHHHCCCCCCCC Q ss_conf 999999988568928750011168215679999998587970---77---44100687637999999997089675665 Q gi|254780234|r 101 AKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIE---II---APWRHWSFKGRQDLIDFAEKHAIPIDKNK 173 (404) Q Consensus 101 a~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~---vi---aP~Rd~~~~sRe~~i~ya~~~gIpv~~~~ 173 (404) +..+...|++-+..+..|= --|.| ||...+++.--.. += -|+- .++.--.+..+||.++|++|.... T Consensus 61 ~~~~~~~a~~~~VPV~lHL--DH~~~---~e~~~~ai~~GftSVM~DgS~l~~e-eNi~~T~~vv~~Ah~~gv~VEaEl 133 (283) T pfam01116 61 AAMVRAAAEKYSVPVALHL--DHGAS---FEGILEAIEAGFSSVMIDGSHLPFE-ENIAITKEVVEYAHARGVSVEAEL 133 (283) T ss_pred HHHHHHHHHHCCCCEEEEC--CCCCC---HHHHHHHHHHCCCEEEEECCCCCHH-HHHHHHHHHHHHHHHCCCEEEEEE T ss_conf 9999999997799899966--77799---9999999981998698638979999-999999999999987398489975 No 312 >KOG0625 consensus Probab=27.37 E-value=42 Score=13.89 Aligned_cols=22 Identities=18% Similarity=0.317 Sum_probs=12.1 Q ss_pred CCEEEEEECCCHHHHHHHHHHH Q ss_conf 7789999147725999999998 Q gi|254780234|r 5 VKKVVLAYSGGLDTSIILKWLQ 26 (404) Q Consensus 5 ~kkVvlaySGGLDTSv~i~~L~ 26 (404) ..|+++.--|=|-|..+-+.++ T Consensus 83 v~rlivGqnG~LSTPAvS~iIR 104 (558) T KOG0625 83 VGRLIVGQNGILSTPAVSCIIR 104 (558) T ss_pred CCEEEECCCCCCCCHHHHHHHH T ss_conf 5158863587036567888987 No 313 >KOG1198 consensus Probab=27.20 E-value=42 Score=13.87 Aligned_cols=25 Identities=16% Similarity=0.226 Sum_probs=11.9 Q ss_pred HHHHHHHHHHHHHHCCCCEEEEEEE Q ss_conf 8999999999997233673999999 Q gi|254780234|r 324 PEREMLQALIDKSQEYVEGSITLKL 348 (404) Q Consensus 324 p~~~~l~a~i~~~q~~VtG~V~l~L 348 (404) |+.+.-++|-+....+.+|.|.++. T Consensus 321 p~~~~~ea~~~~~~~~~~GK~vl~~ 345 (347) T KOG1198 321 PFSQAKEAFEKLEKSHATGKVVLEK 345 (347) T ss_pred EHHHHHHHHHHHHHCCCCCEEEEEE T ss_conf 5468899999887557864189985 No 314 >PRK13982 bifunctional SbtC-like/phosphopantothenoylcysteine decarboxylase/phosphopantothenate synthase; Provisional Probab=26.98 E-value=42 Score=13.84 Aligned_cols=37 Identities=27% Similarity=0.487 Sum_probs=25.8 Q ss_pred CCCCCEEEEEECCCHH---HHHHHHHHHHHCCCEEEEEEEE Q ss_conf 8777789999147725---9999999987189869999945 Q gi|254780234|r 2 SRDVKKVVLAYSGGLD---TSIILKWLQVEKGLEVIVFIAD 39 (404) Q Consensus 2 ~~~~kkVvlaySGGLD---TSv~i~~L~~e~g~eVi~~~~d 39 (404) +-.=|||+|+.|||.- +--++..|+ +.|++|.++--. T Consensus 68 ~l~GKkIlLgVtGsIAAYKa~~LvR~L~-k~Ga~V~vvmT~ 107 (476) T PRK13982 68 SLGSKRITLIIGGGIAAYKALDLIRRLK-ERGAEVRCVLTK 107 (476) T ss_pred CCCCCEEEEEECCHHHHHHHHHHHHHHH-HCCCEEEEEECH T ss_conf 8889979999674899999999999999-789989999787 No 315 >TIGR00406 prmA ribosomal protein L11 methyltransferase; InterPro: IPR004498 Ribosomal protein L11 methyltransferase (2.1.1 from EC) is required for the methylation of ribosomal protein L11. It forms a bifunctional operon in Escherichia coli with panF (pantothenate transport).; GO: 0008276 protein methyltransferase activity, 0006479 protein amino acid methylation. Probab=26.96 E-value=42 Score=13.84 Aligned_cols=59 Identities=29% Similarity=0.360 Sum_probs=37.1 Q ss_pred EEEE-CCCHH-HHHHHHHHHHHCCC-EEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHH Q ss_conf 9991-47725-99999999871898-6999994578711068999999973980799820089999 Q gi|254780234|r 9 VLAY-SGGLD-TSIILKWLQVEKGL-EVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFV 71 (404) Q Consensus 9 vlay-SGGLD-TSv~i~~L~~e~g~-eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~ 71 (404) .||| +|-.. |+.||-||- .-.. +-+-..+|+|-+.=+=.|. |.++||.+.+-+|. |-.+ T Consensus 170 GlAFGTGtH~TT~LCLe~L~-~~d~k~kh~~viD~GCGSGIL~IA--a~~LGa~~~~g~D~-DP~A 231 (330) T TIGR00406 170 GLAFGTGTHPTTSLCLELLE-DLDLKDKHKKVIDVGCGSGILSIA--ALKLGAAKVVGIDI-DPLA 231 (330) T ss_pred CCCCCCCCCHHHHHHHHHHH-CCCCCCCCCEEEEECCCHHHHHHH--HHHHHHHHEEEECC-CHHH T ss_conf 64568897457899999874-014777665478712671789999--99751231122137-7289 No 316 >PTZ00098 phosphoethanolamine N-methyltransferase; Provisional Probab=26.86 E-value=43 Score=13.83 Aligned_cols=149 Identities=9% Similarity=0.006 Sum_probs=74.1 Q ss_pred HHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHH Q ss_conf 99987189869999945787110689999999739807998200899999879999736865478742013012433599 Q gi|254780234|r 23 KWLQVEKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYEGYYLLGTAIARPLIAK 102 (404) Q Consensus 23 ~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Yeg~Ypl~tslaRplia~ 102 (404) .+|.+++|++|+++.+- +.-++.+.+++..-+...+.+.|..+.-..|--|..|.+.-. .+...+-.++-..+ T Consensus 68 ~~LA~~yg~~V~GiDls---~~~~~~A~er~~~~~~v~f~~~d~~~~~f~d~sFDvV~S~da----ilHip~~DK~~l~~ 140 (263) T PTZ00098 68 KYINEKYGAHTHGIDIC---EKIVNIAKERNQDKAKIEFEAKDILKKDFPENNFDLIYSRDA----ILHLSLADKKKLFE 140 (263) T ss_pred HHHHHHCCCEEEEEECC---HHHHHHHHHHCCCCCCEEEEECCCCCCCCCCCCEEEEEEEHH----HHCCCHHHHHHHHH T ss_conf 99999749879998588---999999998551258548996785367788674558987502----23088243999999 Q ss_pred HHHHHHHHCCCCEEECHHHCCCCH-HHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCC Q ss_conf 999998856892875001116821-5679999998587970774410068763799999999708967566567886415 Q gi|254780234|r 103 YLVDIANETGADAIAHGSTGKGND-QVRFELSAYSLNSDIEIIAPWRHWSFKGRQDLIDFAEKHAIPIDKNKRGEAPFSI 181 (404) Q Consensus 103 ~lv~~a~~~ga~~iaHG~TgkGND-QvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~i~ya~~~gIpv~~~~~~~~~yS~ 181 (404) .+.++-+--|.=+++.=|-+..++ --.|.-++++- ..++.|-+.--..+++.|+.. .--. T Consensus 141 e~~RvLKPGG~llitDw~~~~~~~~s~~~~~yi~~r-----------~~~l~s~~~Y~~~l~~aGF~~--------v~~~ 201 (263) T PTZ00098 141 KCYKWLKPNGILLITDYCADEIENWDDEFKAYIKKR-----------KYTLIPIEEYADIIKACKFKN--------VEAK 201 (263) T ss_pred HHHHHCCCCCEEEEEEECCCCCCCCCHHHHHHHHHC-----------CCCCCCHHHHHHHHHHCCCCE--------EEEE T ss_conf 999984688789997501357899739899999846-----------876679999999999779966--------7875 Q ss_pred CHHHHCCCCCCCCCCC Q ss_conf 7243115457632458 Q gi|254780234|r 182 DTNLLHSSSEGRVLED 197 (404) Q Consensus 182 D~Nlwg~S~Egg~Led 197 (404) |.|-|.+.+-..+|+. T Consensus 202 Drt~~~~~~~~~el~~ 217 (263) T PTZ00098 202 DISDYWLELLELEHKK 217 (263) T ss_pred CCHHHHHHHHHHHHHH T ss_conf 1428999999999999 No 317 >PTZ00308 ethanolamine-phosphate cytidylytransferase; Provisional Probab=26.77 E-value=43 Score=13.81 Aligned_cols=175 Identities=18% Similarity=0.251 Sum_probs=78.4 Q ss_pred HHHHHHCCCCEEECH----HHCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCC Q ss_conf 999885689287500----1116821567999999858797077441006876379999999970896756656788641 Q gi|254780234|r 105 VDIANETGADAIAHG----STGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDLIDFAEKHAIPIDKNKRGEAPFS 180 (404) Q Consensus 105 v~~a~~~ga~~iaHG----~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~i~ya~~~gIpv~~~~~~~~~yS 180 (404) .+.+.+.+||+|+|| ++.-|+|-.++ ++..+ -++++ -|-... |-.+.+.-.-.. ++ ..-.-+ T Consensus 88 ~~~l~~~~id~vvHgdd~~~~~~g~d~Y~~---~k~~g-rf~~~--kRT~gi-STtdlv~R~Ll~------tk-~h~~~~ 153 (353) T PTZ00308 88 LEDLERLECDFVVHGDDISVDLNGRNSYQE---IIDAG-KFKVV--KRTEGI-STTDLVGRMLLC------TK-SHLLKS 153 (353) T ss_pred HHHHHHCCCCEEEECCCCCCCCCCCCHHHH---HHHCC-CEEEE--ECCCCC-CHHHHHHHHHHH------CC-CCCCCC T ss_conf 999998098789607865228997682198---99759-84876--358996-788999999863------65-210245 Q ss_pred CCHHHHCCCCCCCCCCCCCCCCC------HHHCCCCCCHHHCCCCCEEEEEEEECCEEEEECCEE----CCHHHHHHHHH Q ss_conf 57243115457632458457682------010014658122799963899999502478751623----18999999998 Q gi|254780234|r 181 IDTNLLHSSSEGRVLEDPSQPAP------EYVYKMIVSPEEAPDTPTTIRIDFQRGDPIAINGQV----MSPEVLLEQLN 250 (404) Q Consensus 181 ~D~Nlwg~S~Egg~Ledp~~~~p------e~~~~~t~~p~~ap~~pe~v~I~Fe~G~PVainG~~----~~~~~li~~LN 250 (404) .+.++...+. .|+...+ ..+.+... ...|.. ...-|-+||-- ..-++.++++- T Consensus 154 ~~~~~~~~~~------~~~t~~~~~~~ts~~i~qfs~--~~~p~~---------~~kIVYvDG~FDLFH~GHi~~L~~ak 216 (353) T PTZ00308 154 VDEVQLESSL------FPYTPTSHCLTTSRKIVQFSN--NRSPKP---------GDRIVYVDGSFDLFHIGHIRVLQKAR 216 (353) T ss_pred CCCCCCCCCC------CCCCCCCCCCCCCHHHHHHCC--CCCCCC---------CCEEEEEECCHHCCCHHHHHHHHHHH T ss_conf 5422344456------777775412243047886147--889899---------99799980860226778999999998 Q ss_pred HHHHHCCCCCEECCCCCCCCCCCCCC-CCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHH Q ss_conf 88642163741000773202100011-13737899999999998650899999999999999999875351469899999 Q gi|254780234|r 251 QYGRCNGIGRIDIVENRFVGIKSRGV-YETPGGTILLHAHRAIESISLDSGSAHLKDDLMSRYASIVYQGFWFSPEREML 329 (404) Q Consensus 251 ~igg~~GvGr~d~vEnr~vG~KsR~v-YEaPga~iL~~Ahr~LE~~~l~~~~~~~K~~~~~~~a~lvy~G~wf~p~~~~l 329 (404) +.|. --+||+-+-++ -+--|. ---+|+-. -+.--.++-+|.+.|.-|-=|.+..+.| T Consensus 217 ~~g~-----------yLIVGv~~D~~v~~~KG~----------nyPImnl~-ER~l~vl~CkyVDeVi~~aP~~~t~~~i 274 (353) T PTZ00308 217 ELGD-----------YLIVGVHEDQVVNEQKGS----------NYPIMNLN-ERVLGVLSCRYVDEVVIGAPFDVTKEVI 274 (353) T ss_pred HCCC-----------EEEEEECCCHHHHHHCCC----------CCCCCCHH-HHHHHHHHHCCCCEEEECCCCCCCHHHH T ss_conf 5199-----------899995781898974489----------98866799-9999876202256266589986789999 Q ss_pred HHH Q ss_conf 999 Q gi|254780234|r 330 QAL 332 (404) Q Consensus 330 ~a~ 332 (404) +.| T Consensus 275 ~~~ 277 (353) T PTZ00308 275 DSL 277 (353) T ss_pred HHC T ss_conf 867 No 318 >cd02922 FCB2_FMN Flavocytochrome b2 (FCB2) FMN-binding domain. FCB2 (AKA L-lactate:cytochrome c oxidoreductase) is a respiratory enzyme located in the intermembrane space of fungal mitochondria which catalyzes the oxidation of L-lactate to pyruvate. FCB2 also participates in a short electron-transport chain involving cytochrome c and cytochrome oxidase which ultimately directs the reducing equivalents gained from L-lactate oxidation to oxygen, yielding one molecule of ATP for every L-lactate molecule consumed. FCB2 is composed of 2 domains: a C-terminal flavin-binding domain, which includes the active site for lacate oxidation, and an N-terminal b2-cytochrome domain, required for efficient cytochrome c reduction. FCB2 is a homotetramer and contains two noncovalently bound cofactors, FMN and heme per subunit. Probab=26.75 E-value=43 Score=13.81 Aligned_cols=33 Identities=21% Similarity=0.264 Sum_probs=22.2 Q ss_pred ECCEEEEECCEECCHHHHHHHHHHHHHHCCCCC Q ss_conf 502478751623189999999988864216374 Q gi|254780234|r 228 QRGDPIAINGQVMSPEVLLEQLNQYGRCNGIGR 260 (404) Q Consensus 228 e~G~PVainG~~~~~~~li~~LN~igg~~GvGr 260 (404) ....||-+||---++.+++..|--=+-.-+||| T Consensus 269 ~~~~~v~~DgGiR~G~DV~KAlALGA~aV~iGR 301 (344) T cd02922 269 FDKIEVYVDGGVRRGTDVLKALCLGAKAVGLGR 301 (344) T ss_pred CCCEEEEECCCCCCHHHHHHHHHCCCCEEEECH T ss_conf 887089971885757899999976999897678 No 319 >PRK00522 tpx thiol peroxidase; Provisional Probab=26.69 E-value=43 Score=13.80 Aligned_cols=35 Identities=26% Similarity=0.233 Sum_probs=21.4 Q ss_pred CEEEEEECCCHHHHHHHHHH-------HHHCCCEEEEEEEEC Q ss_conf 78999914772599999999-------871898699999457 Q gi|254780234|r 6 KKVVLAYSGGLDTSIILKWL-------QVEKGLEVIVFIADL 40 (404) Q Consensus 6 kkVvlaySGGLDTSv~i~~L-------~~e~g~eVi~~~~d~ 40 (404) |++||.|-=..||.+|-.=+ .+-.+++|+++++|. T Consensus 46 k~vVL~f~P~~~TpvC~~e~~~f~~~~~~~~~~~v~gIS~D~ 87 (168) T PRK00522 46 KRKVLNIFPSIDTGVCAASVRKFNQLAAELDNTVVLCVSADL 87 (168) T ss_pred CEEEEEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCC T ss_conf 859999973688865277898899999863597499996899 No 320 >pfam01177 Asp_Glu_race Asp/Glu/Hydantoin racemase. This family contains aspartate racemase, glutamate racemase, hydantoin racemase and arylmalonate decarboxylase. Probab=26.53 E-value=43 Score=13.78 Aligned_cols=15 Identities=20% Similarity=0.177 Sum_probs=6.2 Q ss_pred HHHHHCCCEEEEEECH Q ss_conf 9999739807998200 Q gi|254780234|r 51 DKARLLGAKEVYVKDL 66 (404) Q Consensus 51 ~~A~~~Ga~~~~v~D~ 66 (404) ++....|+ +++++-+ T Consensus 58 ~~l~~~g~-d~iviaC 72 (212) T pfam01177 58 RRLAKAGA-DAIVIAC 72 (212) T ss_pred HHHHHCCC-CEEEEEC T ss_conf 99986799-9999917 No 321 >PRK06701 short chain dehydrogenase; Provisional Probab=26.52 E-value=43 Score=13.78 Aligned_cols=56 Identities=13% Similarity=0.298 Sum_probs=20.6 Q ss_pred EEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCE-EEEEECHHH Q ss_conf 999147725999999998718986999994578711068999999973980-799820089 Q gi|254780234|r 9 VLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQGEELKIASDKARLLGAK-EVYVKDLRR 68 (404) Q Consensus 9 vlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~-~~~v~D~r~ 68 (404) |..=|.|+--.++..+. +.|++|+... ....++.+...+...+.|.. ..+..|+.+ T Consensus 50 VTGgs~GIG~aiA~~la--~~GA~V~i~~--~~~~~~a~~~~~~~~~~G~~~~~~~~Dv~d 106 (289) T PRK06701 50 ITGGDSGIGRAVAVAFA--KEGADIAIVY--LDEHEDANETKQRVEKEGVKCLLIPGDVSD 106 (289) T ss_pred EECCCCHHHHHHHHHHH--HCCCEEEEEE--CCCHHHHHHHHHHHHHCCCEEEEEECCCCC T ss_conf 96825799999999999--8799899982--894678999999999639908999847899 No 322 >TIGR00434 cysH phosophoadenylyl-sulfate reductase; InterPro: IPR004511 Phosphoadenosine phosphosulphate reductase (1.8.4.8 from EC), involved in the assimilation of inorganic sulphate, is designated cysH in bacteria and MET16 in Saccharomyces cerevisiae. Synonyms include phosphoadenosine phosphosulphate reductase and PAPS reductase. In a reaction requiring reduced thioredoxin and NADPH, it converts 3'-phosphoadenylylsulphate (PAPS) to sulphite and adenosine 3',5' diphosphate (PAP). A related family of plant enzymes, scoring below the trusted cut-off, differs in having a thioredoxin-like C-terminal domain, not requiring thioredoxin, and in having a preference for 5'-adenylylsulphate (APS) over PAPS.; GO: 0004604 phosphoadenylyl-sulfate reductase (thioredoxin) activity, 0019379 sulfate assimilation phosphoadenylyl sulfate reduction by phosphoadenylyl-sulfate reductase (thioredoxin). Probab=26.47 E-value=43 Score=13.78 Aligned_cols=168 Identities=20% Similarity=0.288 Sum_probs=104.2 Q ss_pred EEEEEECCCHHHHHHHHHH-HHHCCCEEEEEE-EECCC--CC---HHHHHHHHHH---HCCCEEEEEEC--H-HHHHHHH Q ss_conf 8999914772599999999-871898699999-45787--11---0689999999---73980799820--0-8999998 Q gi|254780234|r 7 KVVLAYSGGLDTSIILKWL-QVEKGLEVIVFI-ADLGQ--GE---ELKIASDKAR---LLGAKEVYVKD--L-RREFVRD 73 (404) Q Consensus 7 kVvlaySGGLDTSv~i~~L-~~e~g~eVi~~~-~d~Gq--~~---d~~~~~~~A~---~~Ga~~~~v~D--~-r~ef~~~ 73 (404) ..|+|-|=|+.--|.|.++ +....++||-+- +|+|= +| =+|.+++|-. -+-. ++|--| + .++|++. T Consensus 15 ~LV~~~SFGi~g~V~ldL~~ki~~~~~~i~lDfiDTG~hF~eTY~~~~~~~~ry~qPkglni-~vy~p~GClS~a~~~~~ 93 (226) T TIGR00434 15 ELVYSTSFGIEGAVLLDLVSKIKKDIPVIFLDFIDTGYHFPETYELIDEVKERYPQPKGLNI-KVYKPDGCLSLAEQAAK 93 (226) T ss_pred CCEEHHHHHHHHHHHHHHHHHHCCCCCEEEECCHHHCCCCHHHHHHHHHHHHHHCCCCCCEE-EEECCCCCCCHHHHHHH T ss_conf 10202267899999999988517885146530001126727789999999876168877506-77678888797899977 Q ss_pred HHHHHHHCCCCCCCCCCCCC---CHHHHHHHHHH---HHHHHHCCCCEEECHHHCCCCHHHH--HHHHHHHHCCCCEE-- Q ss_conf 79999736865478742013---01243359999---9998856892875001116821567--99999985879707-- Q gi|254780234|r 74 FVFPMFRANALYEGYYLLGT---AIARPLIAKYL---VDIANETGADAIAHGSTGKGNDQVR--FELSAYSLNSDIEI-- 143 (404) Q Consensus 74 ~i~~~I~ana~Yeg~Ypl~t---slaRplia~~l---v~~a~~~ga~~iaHG~TgkGNDQvR--Fe~~~~~l~P~l~v-- 143 (404) |.+.=|--- ++.|+..-+++ =++-+++.|.|+--| .=.||-+ =++.+-.+.-..++ T Consensus 94 -----------YGd~LWe~~dVE~~n~y~~lrKVEP~~rALkel~~~AW~~G---LRRdQ~~~R~nl~~~~~d~kf~~~K 159 (226) T TIGR00434 94 -----------YGDKLWEQEDVEDPNKYDYLRKVEPLKRALKELEASAWLTG---LRRDQGPSRANLKIVNIDEKFGILK 159 (226) T ss_pred -----------HCCHHHCCCCCCCHHHHCHHHCCHHHHHHCCCCCEEEEEEE---CCCCCCCCCCCCCHHHHHHHCCEEE T ss_conf -----------16300114577870121100010278875023705632320---3314354321375244362458035 Q ss_pred EEEECCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCCCCC Q ss_conf 74410068763799999999708967566567886415724311545763245845768 Q gi|254780234|r 144 IAPWRHWSFKGRQDLIDFAEKHAIPIDKNKRGEAPFSIDTNLLHSSSEGRVLEDPSQPA 202 (404) Q Consensus 144 iaP~Rd~~~~sRe~~i~ya~~~gIpv~~~~~~~~~yS~D~Nlwg~S~Egg~Ledp~~~~ 202 (404) |.|.-+|++. +--.|..+|++|.---- |+. -+|+.+-.--.|+.++ T Consensus 160 v~PlI~Wt~~---~v~~Y~~~h~LpYn~L~--------D~G--Y~SiGd~h~T~P~~~~ 205 (226) T TIGR00434 160 VLPLIDWTWK---DVYQYLDAHNLPYNPLH--------DQG--YPSIGDAHSTRPVKEG 205 (226) T ss_pred EECCCCCCHH---HHHHHHHHCCCCCCCCC--------CCC--CCCCCCCCCCCCCCCC T ss_conf 4042148768---99999875579887211--------278--7532675678888765 No 323 >TIGR02812 fadR_gamma fatty acid metabolism transcriptional regulator FadR; InterPro: IPR014178 Members of this entry are transcriptional regulators of fatty acid metabolism (FadR), including both biosynthesis and beta-oxidation reactions, which are found exclusively in a subset of Gammaproteobacteria with strictly one copy per genome. Proteins have an N-terminal DNA-binding domain and a less well conserved C-terminal long chain acyl-CoA-binding domain. FadR when expressed in Escherichia coli shows differences in regulatory response and fatty acid binding profiles. All members of this entry are expected to have the same function.; GO: 0000062 acyl-CoA binding, 0003677 DNA binding, 0003700 transcription factor activity, 0019217 regulation of fatty acid metabolic process. Probab=26.45 E-value=29 Score=15.00 Aligned_cols=54 Identities=20% Similarity=0.368 Sum_probs=28.9 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCCCH--HHHH---HH------HHHHHHHHHCCCCEEECHHHCCCCH Q ss_conf 9999987999973686547874201301--2433---59------9999998856892875001116821 Q gi|254780234|r 68 REFVRDFVFPMFRANALYEGYYLLGTAI--ARPL---IA------KYLVDIANETGADAIAHGSTGKGND 126 (404) Q Consensus 68 ~ef~~~~i~~~I~ana~Yeg~Ypl~tsl--aRpl---ia------~~lv~~a~~~ga~~iaHG~TgkGND 126 (404) .-|||+||-..|+ +|.||=||-| +|=| |+ +.+++.-=..|==.|=||===|=|| T Consensus 8 A~fAEeYIveSIW-----n~~FPPGSiLPAERELsElIGVTRTTLREVLQRLARDGWLTIQHGKPTKVNn 72 (275) T TIGR02812 8 AGFAEEYIVESIW-----NNRFPPGSILPAERELSELIGVTRTTLREVLQRLARDGWLTIQHGKPTKVNN 72 (275) T ss_pred HHHHHHHHHHHCC-----CCCCCCCCCCHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCC T ss_conf 4235443322101-----6434658987246657542376630378999887641134365889986276 No 324 >TIGR01830 3oxo_ACP_reduc 3-oxoacyl-(acyl-carrier-protein) reductase; InterPro: IPR011284 This entry represents 3-oxoacyl-[ACP] reductase, also called 3-ketoacyl-acyl carrier protein reductase, an enzyme of fatty acid biosynthesis found in many plant and bacterial species. This enzyme is involved in type II fatty acid biosynthesis, where the individual metabolic transformations are carried out by different enzymes rather than by a single enzyme as occurs in type I fatty acid biosynthesis . Structural studies show that the enzyme is a tetramer which forms a typical Rossman fold , . Unlike other members of the short-chain dehydrogenase/reductase superfamily, the enzyme undergoes a marked conformational change upon binding of the NADP(H)cofactor. This conformational change aligns the side chains of the catalytic triad at the active site in an active conformation and increases the affinity of the enzyme for its substrate.; GO: 0004316 3-oxoacyl-[acyl-carrier-protein] reductase activity, 0051287 NAD binding, 0006633 fatty acid biosynthetic process. Probab=26.42 E-value=43 Score=13.77 Aligned_cols=58 Identities=21% Similarity=0.413 Sum_probs=42.7 Q ss_pred EEEEEECCCHH-HHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHH-HCCCEEEEEECH Q ss_conf 89999147725-9999999987189869999945787110689999999-739807998200 Q gi|254780234|r 7 KVVLAYSGGLD-TSIILKWLQVEKGLEVIVFIADLGQGEELKIASDKAR-LLGAKEVYVKDL 66 (404) Q Consensus 7 kVvlaySGGLD-TSv~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~-~~Ga~~~~v~D~ 66 (404) +|++.|.-.=. ..-+..-|+ ++|.++.++-+|+-+.|+.+..-+++. .+| .++.|=++ T Consensus 24 ~V~i~y~~~e~~~~~~~~e~~-~~G~~a~~~~~dvs~~~~~~~~~~~~~~~~G-iDiLVNNA 83 (238) T TIGR01830 24 KVIITYRTSEEGAEEVVEEIK-ELGVKAAGVVLDVSDREDVKALVEEAEEELG-IDILVNNA 83 (238) T ss_pred EEEEECCCCHHHHHHHHHHHH-HCCCEEEEEEECCCCHHHHHHHHHHHHHHHC-CEEEEECC T ss_conf 599965982578889999998-5697599996038888999999999999829-90899787 No 325 >PTZ00124 adenosine deaminase; Provisional Probab=26.28 E-value=44 Score=13.75 Aligned_cols=38 Identities=11% Similarity=0.152 Sum_probs=22.3 Q ss_pred EECCEEEEEC--CEECCHHHHHHHHHHHHHHCCCCCEECC Q ss_conf 9502478751--6231899999999888642163741000 Q gi|254780234|r 227 FQRGDPIAIN--GQVMSPEVLLEQLNQYGRCNGIGRIDIV 264 (404) Q Consensus 227 Fe~G~PVain--G~~~~~~~li~~LN~igg~~GvGr~d~v 264 (404) ++.|.||+|| +-.+....|-.......-.+|.+..|+. T Consensus 298 ~~~G~~vtiNTDDp~vf~t~L~~Ey~~~~~~fgls~~d~~ 337 (362) T PTZ00124 298 YDAGVKVSVNSDDPGMFLTNIHDEYEELYTHLNFTLADFM 337 (362) T ss_pred HHCCCEEEEECCCCCCCCCCHHHHHHHHHHHHCCCHHHHH T ss_conf 9789969994899302089889999999998195999999 No 326 >PRK10490 sensor protein KdpD; Provisional Probab=26.15 E-value=44 Score=13.74 Aligned_cols=112 Identities=13% Similarity=-0.021 Sum_probs=69.5 Q ss_pred CCEEEEEECCCHHHHHHHHHH---HHHCCCEEEEEEEECCCC--------CHHHHHHHHHHHCCCEEEEEECHHHHHHHH Q ss_conf 778999914772599999999---871898699999457871--------106899999997398079982008999998 Q gi|254780234|r 5 VKKVVLAYSGGLDTSIILKWL---QVEKGLEVIVFIADLGQG--------EELKIASDKARLLGAKEVYVKDLRREFVRD 73 (404) Q Consensus 5 ~kkVvlaySGGLDTSv~i~~L---~~e~g~eVi~~~~d~Gq~--------~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~ 73 (404) .+||++|.|++=.+--++.+- ....+.+-++++++.... +.+....+-|..+||+ .+.+-. ++.++. T Consensus 250 ~e~ilv~~~~~~~~~~lir~~~r~a~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~LAe~LGa~-v~~l~G-~dva~~ 327 (895) T PRK10490 250 RDAILLCIGHNTGSEKLVRTAARLAARLGSVWHAVYVETPRLHRLPEKKRRAILSALRLAQELGAE-TATLSD-PAEEKA 327 (895) T ss_pred CCEEEEEECCCCCCHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHCCE-EEEEEC-CCHHHH T ss_conf 762999988998746999999999984799869999964444579877899999999999981998-999837-988999 Q ss_pred HHHHHHHCCCCC--CCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEC Q ss_conf 799997368654--787420130124335999999988568928750 Q gi|254780234|r 74 FVFPMFRANALY--EGYYLLGTAIARPLIAKYLVDIANETGADAIAH 118 (404) Q Consensus 74 ~i~~~I~ana~Y--eg~Ypl~tslaRplia~~lv~~a~~~ga~~iaH 118 (404) .+-.|-..|+.. -|+-.-..-.-|+-++..+++.+.......|.| T Consensus 328 il~~Ar~~nvT~IVlGrs~~rrw~~r~sl~drLi~~a~~IDVhiV~~ 374 (895) T PRK10490 328 VVRYAREHNLGKIIIGRPASRRWWRRETFADRLARLAPDLDQVIVAL 374 (895) T ss_pred HHHHHHHCCCCEEEECCCCCCCCCCCCCHHHHHHHHCCCCEEEEECC T ss_conf 99999983999899888788750337889999987189974999537 No 327 >pfam00582 Usp Universal stress protein family. The universal stress protein UspA is a small cytoplasmic bacterial protein whose expression is enhanced when the cell is exposed to stress agents. UspA enhances the rate of cell survival during prolonged exposure to such conditions, and may provide a general "stress endurance" activity. The crystal structure of Haemophilus influenzae UspA reveals an alpha/beta fold similar to that of the Methanococcus jannaschii MJ0577 protein, which binds ATP, though UspA lacks ATP-binding activity. Probab=25.81 E-value=44 Score=13.70 Aligned_cols=106 Identities=19% Similarity=0.186 Sum_probs=54.8 Q ss_pred CEEEEEECCCHHHHHHHHHHH---HHCCCEEEEEEEECCCC----CH--HHHHHHHHHHCCCEEEEEECHHHHHHHHHHH Q ss_conf 789999147725999999998---71898699999457871----10--6899999997398079982008999998799 Q gi|254780234|r 6 KKVVLAYSGGLDTSIILKWLQ---VEKGLEVIVFIADLGQG----EE--LKIASDKARLLGAKEVYVKDLRREFVRDFVF 76 (404) Q Consensus 6 kkVvlaySGGLDTSv~i~~L~---~e~g~eVi~~~~d~Gq~----~d--~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~ 76 (404) |||++++-|.-.+.-++.|.. ...+.+++.+++---.. .. .....+... ............ T Consensus 3 k~Ilv~vD~s~~s~~al~~A~~~a~~~~~~l~llhv~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~ 72 (140) T pfam00582 3 KRILVAVDGSEESERALEWALELAKRRGAELILLHVIDPEPSGAASEALEEEEEEELE----------EEEAEAEALAEA 72 (140) T ss_pred CEEEEEECCCHHHHHHHHHHHHHHHHCCCEEEEEEEECCCCCCCCCCCCHHHHHHHHH----------HHHHHHHHHHHH T ss_conf 8499998599999999999999998649979999996288755444333035799999----------999999999999 Q ss_pred HHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHH Q ss_conf 997368654787420130124335999999988568928750011168215 Q gi|254780234|r 77 PMFRANALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQ 127 (404) Q Consensus 77 ~~I~ana~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQ 127 (404) . .... ... .-....-+. ....+++++++.++|.|.=|+.|++.-. T Consensus 73 ~-~~~~--~~~-~~~~~~~g~--~~~~I~~~~~~~~~dliV~G~~~~~~~~ 117 (140) T pfam00582 73 A-AEAG--GVV-VEVVVRGGD--PAEAILEVAEEEDADLIVMGSRGRSGLR 117 (140) T ss_pred H-HHCC--CCC-EEEEEEEEC--HHHHHCEEECCCCCCEEEEECCCCCCCC T ss_conf 8-6357--986-799999607--0364210000567888999289998644 No 328 >PRK08227 aldolase; Validated Probab=25.75 E-value=45 Score=13.69 Aligned_cols=212 Identities=16% Similarity=0.132 Sum_probs=96.7 Q ss_pred EEEEEEEECCC---C----CHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHH- Q ss_conf 69999945787---1----106899999997398079982008999998799997368654787420130124335999- Q gi|254780234|r 32 EVIVFIADLGQ---G----EELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYEGYYLLGTAIARPLIAKY- 103 (404) Q Consensus 32 eVi~~~~d~Gq---~----~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Yeg~Ypl~tslaRplia~~- 103 (404) .-+.+..|=|- + +|++..-++... ++ ..+++.-- .++.+ .|.-...++--. .--+|++..+.-.+. T Consensus 50 R~v~VpiDHG~~~GPi~GLed~~~~I~~l~~-~a-DAi~l~kG--ilr~~-~~~~~~~~lil~-~s~~t~~~~~~~~k~l 123 (291) T PRK08227 50 RTVMLAFDHGYFQGPTTGLERIDINIAPLFE-YA-DVLMCTRG--ILRSV-VPPATNKPVVLR-ASGGNSILKELSNEAV 123 (291) T ss_pred CEEEEECCCCCCCCCCCCCCCHHHHHHHHHH-CC-CEEEECHH--HHHHH-CCCCCCCEEEEE-EECCCCCCCCCCCCEE T ss_conf 8899972676434886663167999999860-39-98997667--76320-467799407999-8068766678788755 Q ss_pred --HHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCC Q ss_conf --999988568928750011168215679999998587970774410068763799999999708967566567886415 Q gi|254780234|r 104 --LVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDLIDFAEKHAIPIDKNKRGEAPFSI 181 (404) Q Consensus 104 --lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~i~ya~~~gIpv~~~~~~~~~yS~ 181 (404) -|+-|-++||++|+-... -|.+..+- .+.--.+..+.|.++|+|+=+-....++.+. T Consensus 124 v~sVeeAvrlGAdAVsv~v~-iGs~~E~~--------------------~l~~lg~v~~e~~~~GmPlla~~~~g~~~~~ 182 (291) T PRK08227 124 AVDMEDAVRLNVCAVAAQVF-IGSEYETQ--------------------SIKNIIQLVDAGLRYGMPTMAVTAVGKDMVR 182 (291) T ss_pred EECHHHHHHCCCCEEEEEEE-CCCCHHHH--------------------HHHHHHHHHHHHHHCCCCEEEEECCCCCCCC T ss_conf 53499998679978999863-59932899--------------------9999999999999829987998346877777 Q ss_pred CHHHH-C-----CCCCCCCCCCCCCCCCHHHCCCCCCHHHCCCCCEEEEEEEECCEEEEECC-EECCHHHHHHHHHHHHH Q ss_conf 72431-1-----54576324584576820100146581227999638999995024787516-23189999999988864 Q gi|254780234|r 182 DTNLL-H-----SSSEGRVLEDPSQPAPEYVYKMIVSPEEAPDTPTTIRIDFQRGDPIAING-QVMSPEVLLEQLNQYGR 254 (404) Q Consensus 182 D~Nlw-g-----~S~Egg~Ledp~~~~pe~~~~~t~~p~~ap~~pe~v~I~Fe~G~PVainG-~~~~~~~li~~LN~igg 254 (404) |..+. | ....+.++-.++. + +.|.+.+ +.+| +||-|-| .+.+-.++++...+.-. T Consensus 183 d~~~va~aaRia~ELGADiVKt~yt--~-e~f~~Vv--~a~p-------------vPVliaGG~k~~d~e~L~~v~~a~~ 244 (291) T PRK08227 183 DARYFSLATRIAAEMGAQIIKTYYV--E-KGFERIT--AGCP-------------VPIVIAGGKKLPERDALEMCYQAID 244 (291) T ss_pred CHHHHHHHHHHHHHHCCCEEECCCC--H-HHHHHHH--HCCC-------------CCEEEECCCCCCHHHHHHHHHHHHH T ss_conf 7899999999999978998850697--3-4599999--6489-------------9789967998986999999999997 Q ss_pred HCCCCCEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCC Q ss_conf 21637410007732021000111373789999999999865089 Q gi|254780234|r 255 CNGIGRIDIVENRFVGIKSRGVYETPGGTILLHAHRAIESISLD 298 (404) Q Consensus 255 ~~GvGr~d~vEnr~vG~KsR~vYEaPga~iL~~Ahr~LE~~~l~ 298 (404) .-+-|.. =+|-||..|-..-+..|.+.+-.-=.+ T Consensus 245 aGa~Gv~----------~GRNVfQ~~~P~~~~~Al~~iVHe~~s 278 (291) T PRK08227 245 QGASGVD----------MGRNIFQSDAPVAMIKAVHAVVHHNET 278 (291) T ss_pred CCCCEEE----------ECCHHHCCCCHHHHHHHHHHHHCCCCC T ss_conf 6993687----------240023589989999999998659999 No 329 >PRK09856 fructoselysine 3-epimerase; Provisional Probab=25.74 E-value=45 Score=13.69 Aligned_cols=117 Identities=15% Similarity=0.115 Sum_probs=63.8 Q ss_pred HHHHHHHHHCCCEEEEEEEECC-------CCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCC Q ss_conf 9999998718986999994578-------711068999999973980799820089999987999973686547874201 Q gi|254780234|r 20 IILKWLQVEKGLEVIVFIADLG-------QGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYEGYYLLG 92 (404) Q Consensus 20 v~i~~L~~e~g~eVi~~~~d~G-------q~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Yeg~Ypl~ 92 (404) -++..++ +.||+=|-+.++-= ...+++.+++.+...|. ++... .| ....||.+ T Consensus 17 ~a~~~aa-~lGydgVEi~~~~ph~~~~~~~~~~~~~ik~l~~~~gl-~v~~~-----------~p-------~~~~~~~~ 76 (276) T PRK09856 17 HAFRDAS-ELGYDGIEIWGGRPHAFAPDLKAGGIKQIKALAQTYQM-PIIGY-----------TP-------ETNGYPYN 76 (276) T ss_pred HHHHHHH-HHCCCEEEECCCCCCCCCCCCCHHHHHHHHHHHHHCCC-EEEEE-----------CC-------CCCCCCCC T ss_conf 9999999-84999899737876546765465579999999998098-68995-----------53-------21377653 Q ss_pred CCHHHHH-------HHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCCCC--CCHHHHHHHHH Q ss_conf 3012433-------59999999885689287500111682156799999985879707744100687--63799999999 Q gi|254780234|r 93 TAIARPL-------IAKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEIIAPWRHWSF--KGRQDLIDFAE 163 (404) Q Consensus 93 tslaRpl-------ia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~--~sRe~~i~ya~ 163 (404) -+...+- -.|..++.|.++||..+.-++-..|++.-+.+ .|+. .+-.+..+||+ T Consensus 77 l~~~d~~~R~~~i~~~k~~id~A~~Lga~~v~v~~~~~~~~~~~~~-----------------~w~~~~e~l~~l~~~A~ 139 (276) T PRK09856 77 MMLGDEHMRRESLDMIKLAMDMAKEMNAGYTLISAAHAGYLTPPNV-----------------IWGRLAENLSELCEYAE 139 (276) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCHHH-----------------HHHHHHHHHHHHHHHHH T ss_conf 3679878989999999999999998499849993687778889799-----------------99999999999999999 Q ss_pred HHCCCCCCCC Q ss_conf 7089675665 Q gi|254780234|r 164 KHAIPIDKNK 173 (404) Q Consensus 164 ~~gIpv~~~~ 173 (404) ++||.+-... T Consensus 140 ~~Gv~l~lE~ 149 (276) T PRK09856 140 NIGMDLILEP 149 (276) T ss_pred HCCCEEEEEE T ss_conf 7399899951 No 330 >PRK07985 oxidoreductase; Provisional Probab=25.67 E-value=45 Score=13.68 Aligned_cols=58 Identities=16% Similarity=0.224 Sum_probs=28.4 Q ss_pred EEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEE-EEEECHHH Q ss_conf 99991477259999999987189869999945787110689999999739807-99820089 Q gi|254780234|r 8 VVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQGEELKIASDKARLLGAKE-VYVKDLRR 68 (404) Q Consensus 8 VvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~-~~v~D~r~ 68 (404) +|..=|+|+--.++..+. +.|++|+--..+-. .++.+.+.+.....|..- .+..|+.+ T Consensus 53 lVTGas~GIG~aiA~~lA--~~GA~Vvi~~~~~~-~~~a~~~~~~~~~~g~~~~~~~~Dvs~ 111 (294) T PRK07985 53 LVTGGDSGIGRAAAIAYA--REGADVAISYLPVE-EEDAQDVKKIIEECGRKAVLLPGDLSD 111 (294) T ss_pred EEECCCCHHHHHHHHHHH--HCCCEEEEECCCCC-HHHHHHHHHHHHHCCCCEEEEECCCCC T ss_conf 991726699999999999--87999999429966-678999999999729958999767899 No 331 >COG3962 Acetolactate synthase [Amino acid transport and metabolism] Probab=25.47 E-value=45 Score=13.65 Aligned_cols=42 Identities=26% Similarity=0.387 Sum_probs=32.4 Q ss_pred CHHHHHHHHHHHCCCCCCCCC--CCCCCCCCHHHHCCCCCCCCC Q ss_conf 379999999970896756656--788641572431154576324 Q gi|254780234|r 154 GRQDLIDFAEKHAIPIDKNKR--GEAPFSIDTNLLHSSSEGRVL 195 (404) Q Consensus 154 sRe~~i~ya~~~gIpv~~~~~--~~~~yS~D~Nlwg~S~Egg~L 195 (404) -|++++.|++.|||||..+.. ..-+|+-.-||=+.-+-|..- T Consensus 245 A~~~L~af~E~~~iPv~ETQaGKsal~~dhplNlG~vGvTGt~A 288 (617) T COG3962 245 AREALRAFAETHGIPVVETQAGKSALAWDHPLNLGGVGVTGTLA 288 (617) T ss_pred HHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCCCCCCCCCHHH T ss_conf 79999999986499657635786445655755555644456277 No 332 >PRK10754 quinone oxidoreductase, NADPH-dependent; Provisional Probab=25.45 E-value=45 Score=13.65 Aligned_cols=46 Identities=20% Similarity=0.168 Sum_probs=26.3 Q ss_pred CCEEEC-HHHCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHCCCC Q ss_conf 928750-011168215679999998587970774410068763799999999708967 Q gi|254780234|r 113 ADAIAH-GSTGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDLIDFAEKHAIPI 169 (404) Q Consensus 113 a~~iaH-G~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~i~ya~~~gIpv 169 (404) -.++-| |+.|-|.=-+.+ .++++ .+||+-. ++++..+++++.|... T Consensus 142 ~~vLi~gaaGgVG~~avQl---Ak~~G--a~Vi~t~------~s~~k~e~~~~lGA~~ 188 (327) T PRK10754 142 EQFLFHAAAGGVGLIACQW---AKALG--AKLIGTV------GSAQKAQRALKAGAWQ 188 (327) T ss_pred CEEEEECCCCCCCHHHHHH---HHHCC--CEEEEEE------CCHHHHHHHHHCCCCE T ss_conf 9999981776112689999---99869--9999998------9899999999669999 No 333 >cd02649 nuc_hydro_CeIAG nuc_hydro_CeIAG: Nucleoside hydrolases similar to the inosine-adenosine-guanosine-preferring nucleoside hydrolase from Caenorhabditis elegans. Nucleoside hydrolases cleave the N-glycosidic bond in nucleosides generating ribose and the respective base. These enzymes vary in their substrate specificity. This group contains eukaryotic, bacterial and archeal proteins similar to the purine-preferring nucleoside hydrolase (IAG-NH) from C. elegans and the salivary purine nucleosidase from Aedes aegypti. C. elegans IAG-NH exhibits a high affinity for the substrate analogue p-nitrophenylriboside (p-NPR). Probab=25.40 E-value=45 Score=13.64 Aligned_cols=37 Identities=19% Similarity=0.259 Sum_probs=32.8 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCC Q ss_conf 7899991477259999999987189869999945787 Q gi|254780234|r 6 KKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQ 42 (404) Q Consensus 6 kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq 42 (404) |||++=.-.|.|-.+++.++......||.++|.--|. T Consensus 1 rkvIiDtD~G~DDa~Al~~al~~~~~~l~git~v~GN 37 (306) T cd02649 1 RKLIIDTDCGGDDAWALLMALASPNVEVLAITCVHGN 37 (306) T ss_pred CCEEEECCCCHHHHHHHHHHHHCCCCEEEEEEEECCC T ss_conf 9399989998699999999986899859999994798 No 334 >pfam04309 G3P_antiterm Glycerol-3-phosphate responsive antiterminator. Intracellular glycerol is usually converted to glycerol-3-phosphate in an ATP-requiring phosphorylation reaction catalysed by glycerol kinase (GlpK) glycerol-3-phosphate activates the antiterminator GlpP. Probab=25.40 E-value=45 Score=13.64 Aligned_cols=98 Identities=24% Similarity=0.328 Sum_probs=62.4 Q ss_pred HHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECH--HHCCCCHHHHHHHHHHHHCCCCEEEEEECCCCC Q ss_conf 999973686547874201301243359999999885689287500--111682156799999985879707744100687 Q gi|254780234|r 75 VFPMFRANALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHG--STGKGNDQVRFELSAYSLNSDIEIIAPWRHWSF 152 (404) Q Consensus 75 i~~~I~ana~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG--~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~ 152 (404) +-.+++++ ++-.+.|... -+.-+.+++..++.|=.++-|= ..|-++|..-+|.-.+...|+ .||+ T Consensus 11 l~~al~s~--~~~vfll~g~---I~~l~~~v~~~k~~gK~vfVHiDli~GL~~d~~av~fL~~~~~~d-GIIS------- 77 (174) T pfam04309 11 LDRALESE--YQVVFLLTGH---ILNLKEIVKLLKQRGKTVFVHVDLVEGLSNDEEAIEFIKEVIEPD-GIIS------- 77 (174) T ss_pred HHHHHCCC--CCEEEEECCC---HHHHHHHHHHHHHCCCEEEEEEECCCCCCCCHHHHHHHHHHCCCC-EEEE------- T ss_conf 99998499--9899995580---878999999999879989998521677789889999999824899-9996------- Q ss_pred CCHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHCCCCC Q ss_conf 637999999997089675665678864157243115457 Q gi|254780234|r 153 KGRQDLIDFAEKHAIPIDKNKRGEAPFSIDTNLLHSSSE 191 (404) Q Consensus 153 ~sRe~~i~ya~~~gIpv~~~~~~~~~yS~D~Nlwg~S~E 191 (404) +|...+.+|++.|+.. .++-|=+|..-|-+++. T Consensus 78 -Tk~~~i~~Ak~~gl~t-----IqR~FliDS~al~~~~~ 110 (174) T pfam04309 78 -TKSKVIKKAKKLGLPA-----IQRLFLLDSIALERSIE 110 (174) T ss_pred -CCHHHHHHHHHCCCEE-----EEEEEEEEHHHHHHHHH T ss_conf -8899999999879928-----97764341778999999 No 335 >cd00439 Transaldolase Transaldolase. Enzymes found in the non-oxidative branch of the pentose phosphate pathway, that catalyze the reversible transfer of a dihydroxyacetone group from fructose-6-phosphate to erythrose-4-phosphate yielding sedoheptulose-7-phosphate and glyceraldehyde-3-phosphate. They are members of the class I aldolases, who are characterized by using a Schiff-base mechanism for stabilization of the reaction intermediates. Probab=25.12 E-value=32 Score=14.68 Aligned_cols=132 Identities=16% Similarity=0.084 Sum_probs=65.4 Q ss_pred CCCCEEEEEECCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCCHHHCCCCCCHHHCC Q ss_conf 87970774410068763799999999708967566567886415724311545763245845768201001465812279 Q gi|254780234|r 138 NSDIEIIAPWRHWSFKGRQDLIDFAEKHAIPIDKNKRGEAPFSIDTNLLHSSSEGRVLEDPSQPAPEYVYKMIVSPEEAP 217 (404) Q Consensus 138 ~P~l~viaP~Rd~~~~sRe~~i~ya~~~gIpv~~~~~~~~~yS~D~Nlwg~S~Egg~Ledp~~~~pe~~~~~t~~p~~ap 217 (404) .|++-|=-|.-.- -=+.++.+.+.||.+.+|. .||..+-+-+.-. |. T Consensus 115 ~~nv~IKIPaT~e----Gi~A~~~L~~~GI~vN~Tl----iFs~~Qa~~~aeA--ga----------------------- 161 (252) T cd00439 115 RRNIYIKIPATAE----GIPAIKDLIAAGISVNVTL----IFSIAQYEAVADA--GT----------------------- 161 (252) T ss_pred CCCEEEECCCCHH----HHHHHHHHHHCCCCEEEEE----ECCHHHHHHHHHH--CC----------------------- T ss_conf 8766885578677----9999999997697341455----2489999999980--97----------------------- Q ss_pred CCCEEEEEEEECCEEEEECCEECCHHHHHHHHHHHHHHCCCCCEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHC Q ss_conf 99638999995024787516231899999999888642163741000773202100011137378999999999986508 Q gi|254780234|r 218 DTPTTIRIDFQRGDPIAINGQVMSPEVLLEQLNQYGRCNGIGRIDIVENRFVGIKSRGVYETPGGTILLHAHRAIESISL 297 (404) Q Consensus 218 ~~pe~v~I~Fe~G~PVainG~~~~~~~li~~LN~igg~~GvGr~d~vEnr~vG~KsR~vYEaPga~iL~~Ahr~LE~~~l 297 (404) ..+ ||. |||+|-.-+....-++.+.-+-||......+|+..+..-. T Consensus 162 ---svi-----------------SpF--------------VgRiDd~~d~~l~~~~~~~~g~~GVa~a~~iy~~~~~~~~ 207 (252) T cd00439 162 ---SVA-----------------SPF--------------VSRIDTLMDKMLEQIGLDLRGKAGVAQVTLAYKLYKQKFK 207 (252) T ss_pred ---CEE-----------------EEE--------------CCHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCC T ss_conf ---656-----------------100--------------0237777650123345674457799999999999997399 Q ss_pred CHHHHH--HHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHC Q ss_conf 999999--99999999999875351469899999999999723 Q gi|254780234|r 298 DSGSAH--LKDDLMSRYASIVYQGFWFSPEREMLQALIDKSQE 338 (404) Q Consensus 298 ~~~~~~--~K~~~~~~~a~lvy~G~wf~p~~~~l~a~i~~~q~ 338 (404) ....+. |+. ...|.+|+.-..--.+--..|+||+|+-.+ T Consensus 208 ~trvL~AS~r~--~~yV~~LiG~dtVtt~P~~tl~a~~d~~~~ 248 (252) T cd00439 208 KQRVLWASFSD--TLYVAPLIGCDTVTTMPDQALEAGVDKFKK 248 (252) T ss_pred CCEEEEEECCC--HHHHHHHHCCCEEECCHHHHHHHHHHHCCC T ss_conf 87698765598--789999765890646918999998861657 No 336 >PRK11548 hypothetical protein; Provisional Probab=25.10 E-value=46 Score=13.61 Aligned_cols=30 Identities=20% Similarity=0.293 Sum_probs=21.7 Q ss_pred CCHHHCCCCCEEEEEEEE-CCEEEEECCEEC Q ss_conf 581227999638999995-024787516231 Q gi|254780234|r 211 VSPEEAPDTPTTIRIDFQ-RGDPIAINGQVM 240 (404) Q Consensus 211 ~~p~~ap~~pe~v~I~Fe-~G~PVainG~~~ 240 (404) ..|..-.-+...++|.|+ +|.-+.|||++- T Consensus 79 ~~~g~~~~~~~~ltl~Fd~~~ll~~Id~~p~ 109 (113) T PRK11548 79 QQPGHEKVTQQTLTLTFNSSGVLTNIDNKPA 109 (113) T ss_pred EECCCCCEEEEEEEEEECCCCCEEECCCCCC T ss_conf 9608983588899999839997883489767 No 337 >cd01467 vWA_BatA_type VWA BatA type: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses. In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if Probab=24.92 E-value=46 Score=13.58 Aligned_cols=15 Identities=13% Similarity=0.297 Sum_probs=7.5 Q ss_pred HHHHHHHHHHHCCCC Q ss_conf 799999999708967 Q gi|254780234|r 155 RQDLIDFAEKHAIPI 169 (404) Q Consensus 155 Re~~i~ya~~~gIpv 169 (404) ..+..+.++++||+| T Consensus 121 ~~~~~~~a~~~gi~v 135 (180) T cd01467 121 PATAAELAKNKGVRI 135 (180) T ss_pred HHHHHHHHHHCCCEE T ss_conf 999999999769989 No 338 >cd06299 PBP1_LacI_like_13 Ligand-binding domain of DNA-binding regulatory protein from Corynebacterium glutamicum which has a unique ability to produce significant amounts of L-glutamate directly from cheap sugar and ammonia. This group includes the ligand-binding domain of DNA-binding regulatory protein from Corynebacterium glutamicum which has a unique ability to produce significant amounts of L-glutamate directly from cheap sugar and ammonia. This regulatory protein is a member of the LacI-GalR family of bacterial transcription repressors. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial trans Probab=24.86 E-value=46 Score=13.57 Aligned_cols=110 Identities=15% Similarity=0.065 Sum_probs=54.6 Q ss_pred HHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEEC--HHHHHHHHHH---HHHHHCCCCCCCCCCCCCCHHHHHHH Q ss_conf 718986999994578711068999999973980799820--0899999879---99973686547874201301243359 Q gi|254780234|r 27 VEKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKD--LRREFVRDFV---FPMFRANALYEGYYLLGTAIARPLIA 101 (404) Q Consensus 27 ~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D--~r~ef~~~~i---~~~I~ana~Yeg~Ypl~tslaRplia 101 (404) .+.||.++-++.+-. .+.....-+.....+..-.++.. ..+++.+... .|.+..+..+.+...-.....-.--+ T Consensus 26 ~~~gy~~ll~~s~~~-~~~e~~~l~~l~~~~vDGiIi~~~~~~~~~~~~l~~~~iPvV~id~~~~~~~~~~V~~d~~~~~ 104 (265) T cd06299 26 SAAGYSTIIGNSDEN-PETENRYLDNLLSQRVDGIIVVPHEQSAEQLEDLLKRGIPVVFVDREITGSPIPFVTSDPQPGM 104 (265) T ss_pred HHCCCEEEEEECCCC-HHHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHCCCCEEEECCCCCCCCCCEEEECHHHHH T ss_conf 985998999978999-8999999999994799999998067998999999966998998378689998868996769999 Q ss_pred HHHHHHHHHCCCCEEEC--HHHCCCCHHHHHHHHHHHH Q ss_conf 99999988568928750--0111682156799999985 Q gi|254780234|r 102 KYLVDIANETGADAIAH--GSTGKGNDQVRFELSAYSL 137 (404) Q Consensus 102 ~~lv~~a~~~ga~~iaH--G~TgkGNDQvRFe~~~~~l 137 (404) +.+++...+.|..-|+. |....-+.+-|++....++ T Consensus 105 ~~a~~~L~~~G~~~i~~i~~~~~~~~~~~R~~gf~~a~ 142 (265) T cd06299 105 TEAVSLLVALGHKKIGYISGPQDTSTGRERLEAFRQAC 142 (265) T ss_pred HHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHHHHH T ss_conf 99999999749983886048988756999999999999 No 339 >PRK08185 hypothetical protein; Provisional Probab=24.75 E-value=46 Score=13.56 Aligned_cols=115 Identities=15% Similarity=0.227 Sum_probs=63.4 Q ss_pred HHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 71898699999457871106899999997398079982008999998799997368654787420130124335999999 Q gi|254780234|r 27 VEKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYEGYYLLGTAIARPLIAKYLVD 106 (404) Q Consensus 27 ~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Yeg~Ypl~tslaRplia~~lv~ 106 (404) .+.+|-|-+|.+. .-+.+..+-+-|...++ +.++. +-.-.. +| -| .-+...+.. T Consensus 9 ~~~~yAV~AfNv~--~~e~~~avi~AAee~~s-PvIiq-~s~~~~-~~-----------~g----------~~~~~~~~~ 62 (283) T PRK08185 9 KEHQFAVGAFNVA--DSCFLRAVVEEAEANNA-PAIIA-IHPNEL-DF-----------VG----------DDFFAYVRE 62 (283) T ss_pred HHCCCEEEEEEEC--CHHHHHHHHHHHHHHCC-CEEEE-ECCCHH-HH-----------HH----------HHHHHHHHH T ss_conf 9879679998889--99999999999999787-98999-381288-77-----------11----------999999999 Q ss_pred HHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEE------EEECCCCCCCHHHHHHHHHHHCCCCCCCC Q ss_conf 98856892875001116821567999999858797077------44100687637999999997089675665 Q gi|254780234|r 107 IANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEII------APWRHWSFKGRQDLIDFAEKHAIPIDKNK 173 (404) Q Consensus 107 ~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~vi------aP~Rd~~~~sRe~~i~ya~~~gIpv~~~~ 173 (404) .|++-.+.+..| .--|.| ||...+++..-..=+ -|.. -++.--.+..+||.++||.|.+.. T Consensus 63 ~a~~~~VpV~lH--LDH~~~---~e~~~~ai~~GFsSVM~DgS~lp~e-eNi~~Tk~vv~~ah~~gv~VEaEl 129 (283) T PRK08185 63 RAQRSPVPFVIH--LDHGAS---VEDVMRAIRCGFTSVMIDGSLLPYE-ENVALTKEVVELAHKVGVSVEGEL 129 (283) T ss_pred HHHHCCCCEEEE--CCCCCC---HHHHHHHHHCCCCEEEEECCCCCHH-HHHHHHHHHHHHHHHCCCEEEEEE T ss_conf 998779989998--999999---9999999982998798638989999-999999999999986598389986 No 340 >pfam02353 CMAS Cyclopropane-fatty-acyl-phospholipid synthase. This family consist of Cyclopropane-fatty-acyl-phospholipid synthase or CFA synthase EC:2.1.1.79 this enzyme catalyse the reaction: S-adenosyl-L-methionine + phospholipid olefinic fatty acid <= S-adenosyl-L-homocysteine + phospholipid cyclopropane fatty acid. Probab=24.73 E-value=46 Score=13.56 Aligned_cols=55 Identities=18% Similarity=0.315 Sum_probs=33.1 Q ss_pred EEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCE---EEEEECHHH Q ss_conf 999147725999999998718986999994578711068999999973980---799820089 Q gi|254780234|r 9 VLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQGEELKIASDKARLLGAK---EVYVKDLRR 68 (404) Q Consensus 9 vlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~---~~~v~D~r~ 68 (404) ||=.-.|-- +.+ .++.+++|+.|+++|.-. +..+-+++++...|.. ++.+.|.|+ T Consensus 66 vLDiGCGWG-~~a-~~~a~~~g~~v~giTlS~---~Q~~~~~~r~~~~gl~~~v~v~~~Dyrd 123 (273) T pfam02353 66 LLDIGCGWG-GLM-RRAAERYDVNVVGLTLSK---NQYKLARQRVAAEGLQRKVEVLLQDYRD 123 (273) T ss_pred EEEECCCCH-HHH-HHHHHHCCCEEEEEECCH---HHHHHHHHHHHHCCCCCCCCEEECCHHH T ss_conf 999788808-999-999984795189997978---9999999999870874321200062654 No 341 >CHL00059 atpA ATP synthase CF1 alpha subunit Probab=24.56 E-value=47 Score=13.54 Aligned_cols=106 Identities=13% Similarity=0.082 Sum_probs=61.8 Q ss_pred EEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCC-CCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 9991477259999999987189869999945787-110689999999739807998200899999879999736865478 Q gi|254780234|r 9 VLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQ-GEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYEG 87 (404) Q Consensus 9 vlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq-~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Yeg 87 (404) +++=||-=-|++++--+..+++.+|+|+.+=+|| ..+..++-+.-...|+-++-++=+-. +. --|.++--| T Consensus 168 I~Gd~gvGKTtl~~d~Iinqk~~~vicVyvaIGeR~reV~~~i~~l~~~gam~~TvvV~At--ad--epa~lr~~A---- 239 (501) T CHL00059 168 IIGDRQTGKTAVATDTILNQKGQNVICVYVAIGQKASSVAQVVTTLQERGAMEYTIVVAET--AD--SPATLQYLA---- 239 (501) T ss_pred EECCCCCCHHHHHHHHHHHHCCCCCEEEEEEEEEEHHHHHHHHHHHHHCCCCEEEEEEECC--CC--CCHHHHHHH---- T ss_conf 7557884477999999985246884899998556277899999999756971036999617--99--999999999---- Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHH Q ss_conf 74201301243359999999885689287500111682156799999985 Q gi|254780234|r 88 YYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSL 137 (404) Q Consensus 88 ~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l 137 (404) |..|-.+.|+-+..|-+++.-- .|--||-.+.+-+ T Consensus 240 ----------pyaa~aIAEyFrdqGkdVLlv~-----DdLTr~A~A~REi 274 (501) T CHL00059 240 ----------PYTGAALAEYFMYRERHTLIIY-----DDLSKQAQAYRQM 274 (501) T ss_pred ----------HHHHHHHHHHHHHCCCEEEEEE-----ECCHHHHHHHHHH T ss_conf ----------9999999999997798289998-----6826988999998 No 342 >COG2065 PyrR Pyrimidine operon attenuation protein/uracil phosphoribosyltransferase [Nucleotide transport and metabolism] Probab=24.46 E-value=21 Score=16.08 Aligned_cols=24 Identities=38% Similarity=0.455 Sum_probs=13.3 Q ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 320210001113737899999999998 Q gi|254780234|r 267 RFVGIKSRGVYETPGGTILLHAHRAIE 293 (404) Q Consensus 267 r~vG~KsR~vYEaPga~iL~~Ahr~LE 293 (404) .++|||+||+|= |.-|..-...+| T Consensus 34 vlvGIktrGv~l---A~rl~~~i~~~E 57 (179) T COG2065 34 VLVGIKTRGVPL---AERLAERIEELE 57 (179) T ss_pred EEEEEECCCHHH---HHHHHHHHHHHH T ss_conf 999674287899---999999998873 No 343 >PRK00910 ribB 3,4-dihydroxy-2-butanone 4-phosphate synthase; Provisional Probab=24.43 E-value=47 Score=13.52 Aligned_cols=29 Identities=14% Similarity=0.421 Sum_probs=15.9 Q ss_pred CEEEEEECC--CCCCCHHHHHHHHHHHCCCC Q ss_conf 707744100--68763799999999708967 Q gi|254780234|r 141 IEIIAPWRH--WSFKGRQDLIDFAEKHAIPI 169 (404) Q Consensus 141 l~viaP~Rd--~~~~sRe~~i~ya~~~gIpv 169 (404) ..||+-.-+ -++-.+++..+||++||+|+ T Consensus 170 ~avicEil~~dG~ma~~~~l~~fA~~h~l~~ 200 (218) T PRK00910 170 AGVLCELTNPDGTMAKTPEIIAFGKLHNMPV 200 (218) T ss_pred EEEEEEEECCCCCCCCHHHHHHHHHHCCCCE T ss_conf 5999998359998267799999999829978 No 344 >cd06282 PBP1_GntR_like_2 Ligand-binding domain of putative DNA transcription repressors highly similar to that of the repressor specific for gluconate (GntR) which is a member of the LacI-GalR family of bacterial transcription regulators. This group includes the ligand-binding domain of putative DNA transcription repressors highly similar to that of the repressor specific for gluconate (GntR) which is a member of the LacI-GalR family of bacterial transcription regulators. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding Probab=24.33 E-value=47 Score=13.51 Aligned_cols=40 Identities=15% Similarity=0.094 Sum_probs=18.3 Q ss_pred HHHHHHHHHHHHHCCCCEEECHH-HCCCCH--HHHHHHHHHHH Q ss_conf 33599999998856892875001-116821--56799999985 Q gi|254780234|r 98 PLIAKYLVDIANETGADAIAHGS-TGKGND--QVRFELSAYSL 137 (404) Q Consensus 98 plia~~lv~~a~~~ga~~iaHG~-TgkGND--QvRFe~~~~~l 137 (404) .--++.+++...+.|..-|+.-+ ....++ +.|++...+++ T Consensus 101 ~~~~~~~~~~L~~~G~~~i~~i~~~~~~~~~~~~R~~Gf~~a~ 143 (266) T cd06282 101 RAAARDVAQALAALGHRRIAMLAGRLAASDRARQRYAGYRAAM 143 (266) T ss_pred HHHHHHHHHHHHHCCCCEEEEEECCCCCCHHHHHHHHHHHHHH T ss_conf 9999999999997399569999777888868999999999999 No 345 >smart00532 LIGANc Ligase N family. Probab=24.32 E-value=47 Score=13.51 Aligned_cols=60 Identities=25% Similarity=0.279 Sum_probs=36.0 Q ss_pred CCCHHHCCCCCCHHHCCCCCEEEEEEEECCE-----------EEEECCEECCHHHHHHHHHHHHHHCCCCCEECC Q ss_conf 6820100146581227999638999995024-----------787516231899999999888642163741000 Q gi|254780234|r 201 PAPEYVYKMIVSPEEAPDTPTTIRIDFQRGD-----------PIAINGQVMSPEVLLEQLNQYGRCNGIGRIDIV 264 (404) Q Consensus 201 ~~pe~~~~~t~~p~~ap~~pe~v~I~Fe~G~-----------PVainG~~~~~~~li~~LN~igg~~GvGr~d~v 264 (404) ..|.-+..+..+++.+ +.....|.|.-|+ ||.|+|...+-+-|-..= + -.+.+||..|.| T Consensus 299 ~~PrwAiA~Kf~~e~~--~T~l~~I~~qVGRtG~itPva~lePV~l~G~~Vs~aTLhN~~-~-i~~~~i~iGd~V 369 (441) T smart00532 299 KAPRWAIAYKFPAEEA--ETKLLDIIVQVGRTGKITPVAELEPVFLAGSTVSRATLHNED-E-IEEKDIRIGDTV 369 (441) T ss_pred CCCCEEEEECCCCCEE--EEEEEEEEEECCCCEEEEEEEEEEEEEECCEEEEEECCCCHH-H-HHHCCCCCCCEE T ss_conf 8988348976886657--899999999737862876899997688678699970568989-9-997599999999 No 346 >COG1089 Gmd GDP-D-mannose dehydratase [Cell envelope biogenesis, outer membrane] Probab=24.23 E-value=47 Score=13.49 Aligned_cols=106 Identities=16% Similarity=0.114 Sum_probs=59.0 Q ss_pred EEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCCC-HHHHHHHHHHHCCC-EEEEEECHHHHHHHHHHHHHHHCCCC Q ss_conf 89999147725999999998718986999994578711-06899999997398-07998200899999879999736865 Q gi|254780234|r 7 KVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQGE-ELKIASDKARLLGA-KEVYVKDLRREFVRDFVFPMFRANAL 84 (404) Q Consensus 7 kVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~-d~~~~~~~A~~~Ga-~~~~v~D~r~ef~~~~i~~~I~ana~ 84 (404) ..++-==-|-|-|++..+|. ++||+|++..-.-.+-. ....+.+--..-+. -..+--|+.+...---+..-++-..+ T Consensus 4 ~ALITGITGQDGsYLa~lLL-ekGY~VhGi~Rrss~~n~~ri~L~~~~~~~~~~l~l~~gDLtD~~~l~r~l~~v~PdEI 82 (345) T COG1089 4 VALITGITGQDGSYLAELLL-EKGYEVHGIKRRSSSFNTPRIHLYEDPHLNDPRLHLHYGDLTDSSNLLRILEEVQPDEI 82 (345) T ss_pred EEEEECCCCCCHHHHHHHHH-HCCCEEEEEEECCCCCCCCCCHHHCCCCCCCCEEEEEECCCCCHHHHHHHHHHCCCHHH T ss_conf 69995445875389999998-56948987860335577653011116555786179996554356889999986094453 Q ss_pred CC--CCCCCCCCHHHHH--------HHHHHHHHHHHCCC Q ss_conf 47--8742013012433--------59999999885689 Q gi|254780234|r 85 YE--GYYLLGTAIARPL--------IAKYLVDIANETGA 113 (404) Q Consensus 85 Ye--g~Ypl~tslaRpl--------ia~~lv~~a~~~ga 113 (404) |+ .+-...+|...|. =+-.++|+.|.+|- T Consensus 83 YNLaAQS~V~vSFe~P~~T~~~~~iGtlrlLEaiR~~~~ 121 (345) T COG1089 83 YNLAAQSHVGVSFEQPEYTADVDAIGTLRLLEAIRILGE 121 (345) T ss_pred EECCCCCCCCCCCCCCCEEEEECHHHHHHHHHHHHHHCC T ss_conf 303432345530358640253100678899999997487 No 347 >PRK12815 carB carbamoyl phosphate synthase large subunit; Reviewed Probab=24.18 E-value=47 Score=13.49 Aligned_cols=149 Identities=26% Similarity=0.333 Sum_probs=90.1 Q ss_pred CCCEEEEEECCCH--------H--HHHHHHHHHHHCCCEEEEEEEECCCC-CHHHHHHHHHHHCCCEEEEEECHHHHHHH Q ss_conf 7778999914772--------5--99999999871898699999457871-10689999999739807998200899999 Q gi|254780234|r 4 DVKKVVLAYSGGL--------D--TSIILKWLQVEKGLEVIVFIADLGQG-EELKIASDKARLLGAKEVYVKDLRREFVR 72 (404) Q Consensus 4 ~~kkVvlaySGGL--------D--TSv~i~~L~~e~g~eVi~~~~d~Gq~-~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~ 72 (404) +.|||++-=||.. | .+=+++-|+ |.|++||-+--|--.- -|. .=|...|+..+.-+|++ T Consensus 6 dikkvLiiGsGpi~IGqa~EfDysg~qA~~aLk-eeG~~vvlvN~NPaTi~TD~---------~~ad~vY~ePlt~e~v~ 75 (1068) T PRK12815 6 DIKKILVIGSGPIIIGQAAEFDYSGTQACKALK-EEGYQVVLVNPNPATIMTDP---------EPADTVYFEPLTLEFVK 75 (1068) T ss_pred CCCEEEEECCCCCHHCCHHHHHHHHHHHHHHHH-HCCCEEEEECCCCCHHHCCC---------CCCCEEEECCCCHHHHH T ss_conf 788899989881531234465656999999999-86998999889832620898---------74443788589899999 Q ss_pred HHHHHHHHCCCC---CCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECH--HHCCCCHHHHHHHHHHHHCCCCEEEEEE Q ss_conf 879999736865---47874201301243359999999885689287500--1116821567999999858797077441 Q gi|254780234|r 73 DFVFPMFRANAL---YEGYYLLGTAIARPLIAKYLVDIANETGADAIAHG--STGKGNDQVRFELSAYSLNSDIEIIAPW 147 (404) Q Consensus 73 ~~i~~~I~ana~---Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG--~TgkGNDQvRFe~~~~~l~P~l~viaP~ 147 (404) +.+- .=+-.++ +.|+.+|-.+ .-|.... +-.+.|+..+..- +--+++|+.+|......++ +.+.. T Consensus 76 ~Ii~-~E~Pd~il~~~GGqtaLnla--~~L~~~G---iL~k~~v~llGt~~~~I~~aedR~~f~~~m~~ig--~~~~~-- 145 (1068) T PRK12815 76 RIIR-REKPDALLATLGGQTALNLA--VKLLEAG---ILEQYGVELLGTNVEAIQKGEDRERFRALMKELG--EPVPE-- 145 (1068) T ss_pred HHHH-HHCCCEEEECCCCHHHHHHH--HHHHHCC---CHHHCCCEEECCCHHHHHHHHCHHHHHHHHHHCC--CCCCC-- T ss_conf 9999-74999898677784799999--9999759---7786596496799999986459999999999769--99997-- Q ss_pred CCCCCCCHHHHHHHHHHHCCCCCCCC Q ss_conf 00687637999999997089675665 Q gi|254780234|r 148 RHWSFKGRQDLIDFAEKHAIPIDKNK 173 (404) Q Consensus 148 Rd~~~~sRe~~i~ya~~~gIpv~~~~ 173 (404) -|-..+-++.++.+++-|+||=..+ T Consensus 146 -s~~~~s~~ea~~~a~~ig~PvivRp 170 (1068) T PRK12815 146 -SEIVDSEEEALRFAEKIGFPIIVRP 170 (1068) T ss_pred -CHHCCCHHHHHHHHHHCCCCEEEEE T ss_conf -0521999999999986699899983 No 348 >PRK11634 ATP-dependent RNA helicase DeaD; Provisional Probab=24.05 E-value=48 Score=13.47 Aligned_cols=57 Identities=25% Similarity=0.256 Sum_probs=35.9 Q ss_pred EEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHH Q ss_conf 999147725999999998718986999994578711068999999973980799820089999 Q gi|254780234|r 9 VLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFV 71 (404) Q Consensus 9 vlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~ 71 (404) +++..||-+-..-+.-|+ .|.+||--|- +.=++.+++....+...+++|+|=-+++. T Consensus 106 v~~l~GG~~~~~q~~~L~--~g~~IVVgTP----GRL~d~l~~~~l~L~~l~~lVLDEAD~mL 162 (629) T PRK11634 106 VVALYGGQRYDVQLRALR--QGPQIVVGTP----GRLLDHLKRGTLDLSKLSGLVLDEADEML 162 (629) T ss_pred EEEEECCCCHHHHHHHHC--CCCCEEEECH----HHHHHHHHCCCCCHHHCCEEEEECHHHHC T ss_conf 999989977899999862--7999999698----99999997296412007678986715533 No 349 >TIGR02631 xylA_Arthro xylose isomerase; InterPro: IPR013453 This is an enzyme which as well as interconverting D-xylose and D-xylulose, is also active as a D-glucose isomerase. It is tetrameric and dependent on a divalent cation, either Mg2+, Co2+ or Mn2+, as characterised in Arthrobacter . Enzymes in this entry differ substantially from the D-xylose isomerases of IPR013452 from INTERPRO.; GO: 0009045 xylose isomerase activity. Probab=23.96 E-value=48 Score=13.46 Aligned_cols=153 Identities=22% Similarity=0.279 Sum_probs=76.3 Q ss_pred CCCHHHHHHHHHHHHHCCCEEEEEEEE----CCCC-CHHHHHH---HHHH-HCCCE-EEEEECHHHHHHHHHHHHHHHCC Q ss_conf 477259999999987189869999945----7871-1068999---9999-73980-79982008999998799997368 Q gi|254780234|r 13 SGGLDTSIILKWLQVEKGLEVIVFIAD----LGQG-EELKIAS---DKAR-LLGAK-EVYVKDLRREFVRDFVFPMFRAN 82 (404) Q Consensus 13 SGGLDTSv~i~~L~~e~g~eVi~~~~d----~Gq~-~d~~~~~---~~A~-~~Ga~-~~~v~D~r~ef~~~~i~~~I~an 82 (404) ---||--.+++-|. |-|+-=|+++=| .|.. .+.+.+- ++|+ ..|.+ ++..-++ |--|..|-+ T Consensus 29 R~~LDPv~~V~kLA-ElGAyGv~fHD~DLiPfg~~~~~R~~~v~~F~~ALd~TGl~VPMvTtNL-------F~~PvFKDG 100 (390) T TIGR02631 29 REALDPVEAVHKLA-ELGAYGVTFHDDDLIPFGASEAERDKIVKRFKKALDETGLKVPMVTTNL-------FSHPVFKDG 100 (390) T ss_pred CCCCCHHHHHHHHH-HHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCHHHHH-------CCCCCCCCC T ss_conf 14466489999988-6300245421366688988877899999999998884596544101100-------257743377 Q ss_pred CCCCCCCCCCCCHHHHHHHHH--HHHHHHHCCCCEEE--CHHHC---CCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCH Q ss_conf 654787420130124335999--99998856892875--00111---682156799999985879707744100687637 Q gi|254780234|r 83 ALYEGYYLLGTAIARPLIAKY--LVDIANETGADAIA--HGSTG---KGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGR 155 (404) Q Consensus 83 a~Yeg~Ypl~tslaRplia~~--lv~~a~~~ga~~ia--HG~Tg---kGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sR 155 (404) +.- -.-.++-|+.+.|. -++.+-|+||.... -|--| -+...+| .+| ||--..- T Consensus 101 gFT----snDr~vR~yAlrK~l~~~DL~aElGA~~~V~WgGREGaE~~~~kd~~-----~al-----------Dr~rEa~ 160 (390) T TIGR02631 101 GFT----SNDRSVRRYALRKVLRNIDLAAELGAETYVVWGGREGAEYDGAKDVR-----AAL-----------DRMREAL 160 (390) T ss_pred CCC----CCCHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCHHHHHHH-----HHH-----------HHHHHHH T ss_conf 756----88778999999999875202331154037653884543110157899-----999-----------9998999 Q ss_pred HHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCCHHHCCCCCCHHH Q ss_conf 999999997089675665678864157243115457632458457682010014658122 Q gi|254780234|r 156 QDLIDFAEKHAIPIDKNKRGEAPFSIDTNLLHSSSEGRVLEDPSQPAPEYVYKMIVSPEE 215 (404) Q Consensus 156 e~~i~ya~~~gIpv~~~~~~~~~yS~D~Nlwg~S~Egg~Ledp~~~~pe~~~~~t~~p~~ 215 (404) +-+.+|.+..|..++.... -|| |||--|+|.-|+-..- T Consensus 161 ~~~a~Y~~d~GY~~rfA~E-PKP---------------------NEPRGDI~lpTvG~~l 198 (390) T TIGR02631 161 DLLAEYVEDQGYGLRFALE-PKP---------------------NEPRGDILLPTVGHAL 198 (390) T ss_pred HHHHHHHHHCCCCCCEECC-CCC---------------------CCCCCCCCHHHHHHHH T ss_conf 9998777523777523215-786---------------------5787530143465789 No 350 >PRK11361 acetoacetate metabolism regulatory protein AtoC; Provisional Probab=23.87 E-value=48 Score=13.45 Aligned_cols=113 Identities=18% Similarity=0.143 Sum_probs=57.4 Q ss_pred HHHHHHHHHHHH-CCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCC---------- Q ss_conf 599999999871-898699999457871106899999997398079982008999998799997368654---------- Q gi|254780234|r 17 DTSIILKWLQVE-KGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALY---------- 85 (404) Q Consensus 17 DTSv~i~~L~~e-~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Y---------- 85 (404) |.--++..++.. ....||-+|+. .+.+.+ .+|.+.||.+++...+.-+=....+-.++....+. T Consensus 62 ~Glell~~ir~~~p~~pvIvlTa~----~~~~~a-v~A~k~GA~Dyl~KPf~~~~L~~~v~rAl~~~~l~~~~~~l~~~~ 136 (457) T PRK11361 62 DGIKALKEMRSHETRTPVILMTAY----AEVETA-VEALRCGAFDYVIKPFDLDELNLIVQRALQLQSMKKEIRHLHQAL 136 (457) T ss_pred CHHHHHHHHHHCCCCCCEEEEECC----CCHHHH-HHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999820989938999689----998999-999975966325699999999999999999988777776665432 Q ss_pred ----CCCCCCCCCHHHHHHH--HHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHH Q ss_conf ----7874201301243359--999999885689287500111682156799999985 Q gi|254780234|r 86 ----EGYYLLGTAIARPLIA--KYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSL 137 (404) Q Consensus 86 ----eg~Ypl~tslaRplia--~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l 137 (404) .....++.| |.+- ...++.+-.-.+.++-.|-||-|...+---.+.... T Consensus 137 ~~~~~~~~lig~S---~~m~~v~~~i~~~A~s~~~VLI~GEsGTGKe~~Ar~IH~~S~ 191 (457) T PRK11361 137 STSWQWGHILTNS---PAMMDICKDTAKIALSQASVLISGESGTGKELIARAIHYNSR 191 (457) T ss_pred CCCCCCCCCEECC---HHHHHHHHHHHHHHCCCCCEEEECCCCCCHHHHHHHHHHHCC T ss_conf 0124567745469---999999999999848899589988998578999999998379 No 351 >TIGR03461 pabC_Proteo aminodeoxychorismate lyase. Members of this protein family are aminodeoxychorismate lyase (ADC lyase), EC 4.1.3.38, the PabC protein of PABA biosynthesis. PABA (para-aminobenzoate) is a precursor of folate, needed for de novo purine biosynthesis. This enzyme is a pyridoxal-phosphate-binding protein in the class IV aminotransferase family (pfam01063). Probab=23.80 E-value=46 Score=13.58 Aligned_cols=32 Identities=13% Similarity=0.211 Sum_probs=19.4 Q ss_pred CCEEEEEECCCC-C--CCHHHHHHHHHHHCCCCCC Q ss_conf 970774410068-7--6379999999970896756 Q gi|254780234|r 140 DIEIIAPWRHWS-F--KGRQDLIDFAEKHAIPIDK 171 (404) Q Consensus 140 ~l~viaP~Rd~~-~--~sRe~~i~ya~~~gIpv~~ 171 (404) +-++++|--+.. + .+|+..++.|++.||+|.. T Consensus 182 ~g~l~TP~~~~~~L~GItR~~vi~~a~~~gi~v~e 216 (261) T TIGR03461 182 GNQVFTPDLSYCGVAGVMRQHVLALLPALGYEIEE 216 (261) T ss_pred CCEEECCCCCCCCCCCHHHHHHHHHHHHCCCEEEE T ss_conf 99998599866664777999999999986984999 No 352 >PRK06932 glycerate dehydrogenase; Provisional Probab=23.80 E-value=48 Score=13.44 Aligned_cols=10 Identities=20% Similarity=0.295 Sum_probs=3.6 Q ss_pred HHHHHCCCCC Q ss_conf 9997089675 Q gi|254780234|r 161 FAEKHAIPID 170 (404) Q Consensus 161 ya~~~gIpv~ 170 (404) +|+++||+|. T Consensus 83 aa~~~gI~V~ 92 (314) T PRK06932 83 AAKELGITVK 92 (314) T ss_pred HHHHCCEEEE T ss_conf 9974997999 No 353 >TIGR01862 N2-ase-Ialpha nitrogenase component I, alpha chain; InterPro: IPR010143 This entry represents all three varieties (Fe-Fe, Mo-Fe and V-Fe) of the component I alpha chain of nitrogenase. Nitrogenase (1.18.6.1 from EC) is the enzyme system responsible for biological nitrogen fixation. Nitrogenase is an oligomeric complex which consists of two components: component 2 is an homodimer of an iron-sulphur protein, while component 1 which contains the active site for the reduction of nitrogen to ammonia exists in three different forms: the molybdenum-iron containing protein (MoFe) is a hetero-tetramer consisting of two pairs of alpha (nifD) and beta (nifK) subunits; the vanadium-iron containing protein (VFe) is a hexamer of two pairs each of alpha (vnfD), beta (vnfK), and delta (vnfG) subunits; the third form seems to only contain iron and is a hexamer composed of alpha (anfD), beta (anfK), and delta (anfG) subunits. The alpha and beta chains of the three types of component 1 are evolutionary related and they are also related to proteins nifE and nifN, which are most probably involved in the iron-molybdenum cofactor biosynthesis .; GO: 0016163 nitrogenase activity, 0051536 iron-sulfur cluster binding, 0009399 nitrogen fixation. Probab=23.79 E-value=48 Score=13.44 Aligned_cols=147 Identities=16% Similarity=0.254 Sum_probs=70.3 Q ss_pred HHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCC-----CCC-CCCCCCHHHHHHHHHHHH-HHHHCCCCEEE Q ss_conf 06899999997398079982008999998799997368654-----787-420130124335999999-98856892875 Q gi|254780234|r 45 ELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALY-----EGY-YLLGTAIARPLIAKYLVD-IANETGADAIA 117 (404) Q Consensus 45 d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Y-----eg~-Ypl~tslaRplia~~lv~-~a~~~ga~~ia 117 (404) =.+.+++=|...|-. .+.+--+|.+++.+ +.++..--| +|+ -+|=+.=+|+ .++.. +-.++|.++|+ T Consensus 287 ~~e~LR~I~~~Fg~~--eI~~r~EevIae~~-a~w~p~ld~yKerL~GK~v~ly~GG~r~---wH~~~~~~~~lG~~VV~ 360 (510) T TIGR01862 287 IAESLRAIAAFFGDS--EIEKRAEEVIAEEI-AKWKPELDYYKERLQGKRVCLYVGGSRL---WHWIKSLEEDLGVEVVA 360 (510) T ss_pred HHHHHHHHHHHCCCC--CHHHHHHHHHHHHH-HHCCCHHHHHHHHHCCCEEEEECCCCCH---HHHHHHHHHHCCCEEEE T ss_conf 899899999645885--42268888778744-2104124541131149889997588800---11112577763945999 Q ss_pred CHHHCCCCHHHHHHH-----HHH-----HHCCCCEEEEEECC-CCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHH Q ss_conf 001116821567999-----999-----85879707744100-6876379999999970896756656788641572431 Q gi|254780234|r 118 HGSTGKGNDQVRFEL-----SAY-----SLNSDIEIIAPWRH-WSFKGRQDLIDFAEKHAIPIDKNKRGEAPFSIDTNLL 186 (404) Q Consensus 118 HG~TgkGNDQvRFe~-----~~~-----~l~P~l~viaP~Rd-~~~~sRe~~i~ya~~~gIpv~~~~~~~~~yS~D~Nlw 186 (404) -|.-.-=||- ||. +++ ---|++.| +|=-. ....--|+.++-+++.|+|+.. =..+. T Consensus 361 ~~~~F~H~dD--yEkreviPtikidadskniPeitv-tPdeqkyrvviPedkveelkkaGvPlss----------yGG~~ 427 (510) T TIGR01862 361 VGIEFAHEDD--YEKREVIPTIKIDADSKNIPEITV-TPDEQKYRVVIPEDKVEELKKAGVPLSS----------YGGVM 427 (510) T ss_pred EEEEECCCCC--CCCCEECCEEEECCCCCCCCEEEE-CCCCCEEEEECCCCHHHHHHHCCCCCCC----------CCCEE T ss_conf 7566434478--776211322566156666871264-5676413786274136788754873003----------47637 Q ss_pred CCCCCCCCC-CCCCCCCCHHHCCCC Q ss_conf 154576324-584576820100146 Q gi|254780234|r 187 HSSSEGRVL-EDPSQPAPEYVYKMI 210 (404) Q Consensus 187 g~S~Egg~L-edp~~~~pe~~~~~t 210 (404) -+-.||-.+ +||..---+++..+. T Consensus 428 kr~~eGT~~iDD~N~lE~ee~~e~~ 452 (510) T TIGR01862 428 KRMGEGTLLIDDPNELEFEEIIEKL 452 (510) T ss_pred EEECCCCEEEECCCCCCHHHHHHHH T ss_conf 8723540789678885077766651 No 354 >PRK07709 fructose-bisphosphate aldolase; Provisional Probab=23.59 E-value=49 Score=13.41 Aligned_cols=33 Identities=18% Similarity=-0.027 Sum_probs=16.9 Q ss_pred HCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEE Q ss_conf 189869999945787110689999999739807998 Q gi|254780234|r 28 EKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYV 63 (404) Q Consensus 28 e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v 63 (404) +.+|-|-+|.+. .-+.+..+-+-|....+ +.++ T Consensus 15 ~~~yAV~AfNv~--n~e~~~Avi~AAee~~s-PvIl 47 (285) T PRK07709 15 EGKYAVGQFNMN--NLEWTQAILAAAEEEKS-PVIL 47 (285) T ss_pred HCCEEEEEEEEC--CHHHHHHHHHHHHHHCC-CEEE T ss_conf 889079997889--99999999999999788-9999 No 355 >TIGR02080 O_succ_thio_ly O-succinylhomoserine (thiol)-lyase; InterPro: IPR011821 Proteins in this entry are important in the interconversion of the sulphur-containing amino acids L-cysteine and L-methionine in bacteria. In bacteria, inorganic sulphur is reduced to sulphide, which is then used to displace the acetyl group of O-acetyl-L-serine to form the essential amino acid L-cysteine. Biosynthesis of L-methionine then occurs via the transsulphuration pathway which involves the sequential action of two pyridoxal 5'-phosphate (PLP) enzymes, cystathionine gamma-synthase (CGS) and cystathionine beta-lyase (CBL). Firstly CGS catalyses the formation of the intermediate cystathione from a homoserine ester and L-cysteine. This intermediate is then converted by CBL to L-homoserine, which is subsequently methylated by methionine synthase to form L-methionine. For more information see . The CGS enzyme (P00935 from SWISSPROT) from Escherichia coli is, like other CGS enzymes, a homotetramer composed of two catalytically active dimers . The active site of each dimer is found at the interface between the monomers and involves residues from each monomer. Catalysis occurs by alpha,gamma-elimination/replacement reactions.. Probab=23.57 E-value=49 Score=13.41 Aligned_cols=122 Identities=21% Similarity=0.353 Sum_probs=51.7 Q ss_pred HHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHH-HHHCCCCEEEEEECCCCCCCHHH-HH-HHHHHHCCCCCC Q ss_conf 1243359999999885689287500111682156799999-98587970774410068763799-99-999970896756 Q gi|254780234|r 95 IARPLIAKYLVDIANETGADAIAHGSTGKGNDQVRFELSA-YSLNSDIEIIAPWRHWSFKGRQD-LI-DFAEKHAIPIDK 171 (404) Q Consensus 95 laRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~-~~l~P~l~viaP~Rd~~~~sRe~-~i-~ya~~~gIpv~~ 171 (404) -.|-+..+.+.+ .|-|+++|-. |||+ --.++.+ ..|.||==++||. |= - |=.. ++ +-|++--+.|.. T Consensus 51 PTRd~l~~aLA~--LE~G~~AVvT-~~GM----sAI~L~~~~lL~pdDLlvAPH-DC-Y-GGtYRLl~~lA~kG~~~v~f 120 (383) T TIGR02080 51 PTRDILQQALAE--LEGGAGAVVT-NSGM----SAIDLVTTALLGPDDLLVAPH-DC-Y-GGTYRLLNALAKKGQFKVQF 120 (383) T ss_pred CCHHHHHHHHHH--HHCCCCCEEC-CCHH----HHHHHHHHHEECCCCEEEECC-CC-C-CCHHHHHHHHHHCCCEEEEE T ss_conf 756589999999--8489982566-8568----988766431207897688043-26-7-70488886452078338998 Q ss_pred CCCCCCCCCCCHHHHCCCCCCC-C----CCCCCCCCCHHHCCCCCCHHHCCCCCEEEEEEEECCEEEEECCEECCHH Q ss_conf 6567886415724311545763-2----4584576820100146581227999638999995024787516231899 Q gi|254780234|r 172 NKRGEAPFSIDTNLLHSSSEGR-V----LEDPSQPAPEYVYKMIVSPEEAPDTPTTIRIDFQRGDPIAINGQVMSPE 243 (404) Q Consensus 172 ~~~~~~~yS~D~Nlwg~S~Egg-~----Ledp~~~~pe~~~~~t~~p~~ap~~pe~v~I~Fe~G~PVainG~~~~~~ 243 (404) . .+ .|...+-....-. + +|.|+||. ...++.+.= .++.=+.|.-|.||+.-|||+ T Consensus 121 V--DQ----~D~~al~~ALa~kdPKLVlIEtPSNPL-----lRVVDI~~l------c~La~~~G~~vvVDNTFLsP~ 180 (383) T TIGR02080 121 V--DQ----SDEQALEAALAQKDPKLVLIETPSNPL-----LRVVDIAKL------CKLAKAVGAVVVVDNTFLSPI 180 (383) T ss_pred E--EC----CHHHHHHHHHHHCCCCEEEEECCCCCC-----CHHHHHHHH------HHHHHHCCEEEEEECCCCCHH T ss_conf 6--07----628999999974698479986799885-----218889999------988761883899855216776 No 356 >KOG1023 consensus Probab=23.48 E-value=32 Score=14.72 Aligned_cols=20 Identities=15% Similarity=0.253 Sum_probs=8.0 Q ss_pred EEECCCCCCC-HHHHHHHHHH Q ss_conf 4410068763-7999999997 Q gi|254780234|r 145 APWRHWSFKG-RQDLIDFAEK 164 (404) Q Consensus 145 aP~Rd~~~~s-Re~~i~ya~~ 164 (404) .||..-.... +.+.+.+.++ T Consensus 156 ~~~~~~~~~~~~~eii~~~~~ 176 (484) T KOG1023 156 GPFDLRNLVEDPDEIILRVKK 176 (484) T ss_pred CCCCCCCCCCCHHHHHHHHHH T ss_conf 750123556775889999872 No 357 >LOAD_USPA consensus Probab=23.46 E-value=49 Score=13.39 Aligned_cols=104 Identities=19% Similarity=0.296 Sum_probs=56.0 Q ss_pred CEEEEEECCCHHHHHHHHHHH---HHCCCEEEEEEEECCCCCH-H--HHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHH Q ss_conf 789999147725999999998---7189869999945787110-6--899999997398079982008999998799997 Q gi|254780234|r 6 KKVVLAYSGGLDTSIILKWLQ---VEKGLEVIVFIADLGQGEE-L--KIASDKARLLGAKEVYVKDLRREFVRDFVFPMF 79 (404) Q Consensus 6 kkVvlaySGGLDTSv~i~~L~---~e~g~eVi~~~~d~Gq~~d-~--~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I 79 (404) |||++++-|.-.+..++.|-. ...+.+++.+++.-..... . ....+... +...+..+.+...+- T Consensus 1 k~Ilv~vd~s~~s~~a~~~A~~la~~~~a~l~~lhV~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~ 70 (135) T LOAD_USPA 1 KKILVAIDGSPESEKALRWAVDLAKRRGAELILLHVIPPSVSTAASPALDLALLL----------EEALKLLEEALELLE 70 (135) T ss_pred CEEEEEECCCHHHHHHHHHHHHHHHHCCCEEEEEEEECCCCCCCCCCCCHHHHHH----------HHHHHHHHHHHHHHH T ss_conf 9899998699899999999999998749989999996277644444420146899----------999999999999998 Q ss_pred HCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEECHHHCCCC Q ss_conf 3686547874201301243359999999885689287500111682 Q gi|254780234|r 80 RANALYEGYYLLGTAIARPLIAKYLVDIANETGADAIAHGSTGKGN 125 (404) Q Consensus 80 ~ana~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~iaHG~TgkGN 125 (404) ..+. ...+.+..--.+..+++++++.++|.|.=|+.|++. T Consensus 71 ~~~~------~~~~~i~~g~~~~~I~~~~~~~~~dliVmG~~~~~~ 110 (135) T LOAD_USPA 71 EAGV------KIDVEVEEGSPAEAILELAEESNADLIVVGSRGRGG 110 (135) T ss_pred HCCC------CEEEEEEEECCHHHHHHHHHHCCCCEEEEECCCCCC T ss_conf 6399------739999996428999999860577679994689975 No 358 >PRK06522 2-dehydropantoate 2-reductase; Reviewed Probab=23.43 E-value=49 Score=13.39 Aligned_cols=30 Identities=17% Similarity=0.248 Sum_probs=21.1 Q ss_pred EEEEEECCCHHHHHHHHHHHHHCCCEEEEEEE Q ss_conf 89999147725999999998718986999994 Q gi|254780234|r 7 KVVLAYSGGLDTSIILKWLQVEKGLEVIVFIA 38 (404) Q Consensus 7 kVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~ 38 (404) ||.+.=.|.+-+.++ .+|. +.|.+|.-+.- T Consensus 2 kI~IiGaGaiG~~~a-~~L~-~ag~~V~li~r 31 (307) T PRK06522 2 KIAILGAGAIGGLFG-ARLA-QAGHDVTLVAR 31 (307) T ss_pred EEEEECCCHHHHHHH-HHHH-HCCCCEEEEEC T ss_conf 899999149999999-9998-48998899978 No 359 >KOG4318 consensus Probab=23.23 E-value=14 Score=17.36 Aligned_cols=96 Identities=20% Similarity=0.233 Sum_probs=62.5 Q ss_pred HHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCCC----CCCHHHCCCCCCHHHCCCCCEEEEEEEECC Q ss_conf 7999999997089675665678864157243115457632458457----682010014658122799963899999502 Q gi|254780234|r 155 RQDLIDFAEKHAIPIDKNKRGEAPFSIDTNLLHSSSEGRVLEDPSQ----PAPEYVYKMIVSPEEAPDTPTTIRIDFQRG 230 (404) Q Consensus 155 Re~~i~ya~~~gIpv~~~~~~~~~yS~D~Nlwg~S~Egg~Ledp~~----~~pe~~~~~t~~p~~ap~~pe~v~I~Fe~G 230 (404) |.+.+.-+.+.-+|.... .+||--|--.||+-++-+..++|.. ..|..+|..+-.-.++.+--+.-.+--.+| T Consensus 608 rkdd~s~a~ea~e~~~qk---yk~~P~~~e~lcrlv~ke~td~~qk~mDls~~iq~f~k~g~~~~a~di~etpG~r~r~~ 684 (1088) T KOG4318 608 RKDDQSAAQEAPEPEEQK---YKPYPKDLEGLCRLVYKETTDSPQKTMDLSIPIQKFEKLGSCVDAGDITETPGVRCRNG 684 (1088) T ss_pred ECCCHHHHHHCCHHHHHH---HCCCHHHHHHHHHHHHHHCCCCHHHHHHHCCHHHHHHHCCCCCCHHHCCCCCCCCCCCC T ss_conf 026506666505078987---05886788999999986222358888754101588874621024221010686303687 Q ss_pred EEEEECCEECCHHH-HHHHHHHHH Q ss_conf 47875162318999-999998886 Q gi|254780234|r 231 DPIAINGQVMSPEV-LLEQLNQYG 253 (404) Q Consensus 231 ~PVainG~~~~~~~-li~~LN~ig 253 (404) .++.+|.-..-|++ +++.+|++| T Consensus 685 RDr~~de~e~~~lEll~elt~~lg 708 (1088) T KOG4318 685 RDRDTDEGEIVPLELLLELTHELG 708 (1088) T ss_pred CCCCCCCCCCCCHHHHHHHHHHHH T ss_conf 765454671124798999975757 No 360 >PRK08594 enoyl-(acyl carrier protein) reductase; Provisional Probab=23.01 E-value=50 Score=13.33 Aligned_cols=64 Identities=17% Similarity=0.137 Sum_probs=33.1 Q ss_pred CCCCCCEEEEEE--CC--CHHHHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHC-CCEE-EEEECHHHH Q ss_conf 987777899991--47--725999999998718986999994578711068999999973-9807-998200899 Q gi|254780234|r 1 MSRDVKKVVLAY--SG--GLDTSIILKWLQVEKGLEVIVFIADLGQGEELKIASDKARLL-GAKE-VYVKDLRRE 69 (404) Q Consensus 1 M~~~~kkVvlay--SG--GLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~-Ga~~-~~v~D~r~e 69 (404) |..-+.|++|-- || |+--.++. .|. +.|+.|+... .. .+..+.+++...++ |... .+..|+.++ T Consensus 1 m~~L~gK~~lVTGaag~rGIG~aiA~-~la-~~Ga~Vvi~~--~~-~~~~~~~~~~~~~~~~~~~~~~~~Dvs~~ 70 (256) T PRK08594 1 MLSLEGKTYVVMGVANKRSIAWGIAR-SLH-NAGAKLVFTY--AG-ERLEKEVRELAETLEQQESLVLPCDVTSD 70 (256) T ss_pred CCCCCCCEEEEECCCCCCHHHHHHHH-HHH-HCCCEEEEEC--CC-CHHHHHHHHHHHHHCCCCEEEEECCCCCH T ss_conf 98999988999899999639999999-999-8799999974--88-06699999999870799479999138999 No 361 >KOG1502 consensus Probab=23.01 E-value=50 Score=13.33 Aligned_cols=73 Identities=19% Similarity=0.142 Sum_probs=43.8 Q ss_pred CCCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCC-CEEEEEECHHHHHHHHHHHHHHHC Q ss_conf 777899991477259999999987189869999945787110689999999739-807998200899999879999736 Q gi|254780234|r 4 DVKKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQGEELKIASDKARLLG-AKEVYVKDLRREFVRDFVFPMFRA 81 (404) Q Consensus 4 ~~kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~G-a~~~~v~D~r~ef~~~~i~~~I~a 81 (404) ..++|++-=..|-=-|-+++.|. +.||.|++..=|.+-.+..+.+++ ....+ ..+.+..|+.++ +....+|.. T Consensus 5 ~~~~VcVTGAsGfIgswivk~LL-~rGY~V~gtVR~~~~~k~~~~L~~-l~~a~~~l~l~~aDL~d~---~sf~~ai~g 78 (327) T KOG1502 5 EGKKVCVTGASGFIGSWIVKLLL-SRGYTVRGTVRDPEDEKKTEHLRK-LEGAKERLKLFKADLLDE---GSFDKAIDG 78 (327) T ss_pred CCCEEEEECCCHHHHHHHHHHHH-HCCCEEEEEECCCCHHHHHHHHHH-CCCCCCCCEEEECCCCCC---CHHHHHHHC T ss_conf 87279994882089999999998-689989999708630565899986-515754425885243551---359999707 No 362 >PTZ00185 ATPase alpha subunit; Provisional Probab=22.84 E-value=50 Score=13.31 Aligned_cols=48 Identities=21% Similarity=0.135 Sum_probs=34.6 Q ss_pred HHHHHHHHHHHH--------CCCEEEEEEEECCCCC-HHHHHHHHHHHCCCEEEEEE Q ss_conf 599999999871--------8986999994578711-06899999997398079982 Q gi|254780234|r 17 DTSIILKWLQVE--------KGLEVIVFIADLGQGE-ELKIASDKARLLGAKEVYVK 64 (404) Q Consensus 17 DTSv~i~~L~~e--------~g~eVi~~~~d~Gq~~-d~~~~~~~A~~~Ga~~~~v~ 64 (404) -|++++.-+..+ .+-+|+|+.|-+||+. .+-.+-+.-.+.||-++-++ T Consensus 202 KTaIAiDTIINQk~~n~~~~k~~~V~CIYVAIGQK~StVA~iv~~L~e~gAm~yTiI 258 (574) T PTZ00185 202 KTSIAVSTIINQVRINQQILSKNAVISIYVSIGQRCSNVARIHRLLRSYGALRYTTV 258 (574) T ss_pred HHHHHHHHHHCCCCCCCCCCCCCCEEEEEEECCCCHHHHHHHHHHHHHCCCCCCEEE T ss_conf 689999999837643446577896699999854128999999999997697203089 No 363 >PRK05778 2-oxoglutarate ferredoxin oxidoreductase subunit beta; Validated Probab=22.77 E-value=50 Score=13.30 Aligned_cols=68 Identities=13% Similarity=0.205 Sum_probs=27.3 Q ss_pred HHHHHHHHHCCCCEEECHHHCCCCHHHH-HHHHHHHHCCC-CEEEEEECCCC-----CCCHHHHHHHHHHHCCCC Q ss_conf 9999998856892875001116821567-99999985879-70774410068-----763799999999708967 Q gi|254780234|r 102 KYLVDIANETGADAIAHGSTGKGNDQVR-FELSAYSLNSD-IEIIAPWRHWS-----FKGRQDLIDFAEKHAIPI 169 (404) Q Consensus 102 ~~lv~~a~~~ga~~iaHG~TgkGNDQvR-Fe~~~~~l~P~-l~viaP~Rd~~-----~~sRe~~i~ya~~~gIpv 169 (404) .-+++.|...||.+||.+.++.-.+..+ +..+++--++. +.|++|+--++ +++--+..+|-+++-+++ T Consensus 157 ~n~~~la~a~ga~fVAR~~~~~~~~l~~~ik~Ai~hkGfs~i~VlspC~Tfn~~~~~~~~~~~~~~~~~~~~~~l 231 (306) T PRK05778 157 IDPCALALAAGATFVARSFSGDVKQLKELIKKAISHKGFAFIDVLSPCVTFNGRRNSTKSPVETVEWYDKRVYKL 231 (306) T ss_pred CCHHHHHHHCCCCEEEEEECCCHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCCCCCCHHHHHHHHCCCEEEC T ss_conf 699999997699869987547889999999999669996799996789734586655455368887642675774 No 364 >cd03080 GST_N_Metaxin_like GST_N family, Metaxin subfamily, Metaxin-like proteins; a heterogenous group of proteins, predominantly uncharacterized, with similarity to metaxins and GSTs. Metaxin 1 is a component of a preprotein import complex of the mitochondrial outer membrane. It extends to the cytosol and is anchored to the mitochondrial membrane through its C-terminal domain. In mice, metaxin is required for embryonic development. In humans, alterations in the metaxin gene may be associated with Gaucher disease. One characterized member of this subgroup is a novel GST from Rhodococcus with toluene o-monooxygenase and gamma-glutamylcysteine synthetase activities. Also members are the cadmium-inducible lysosomal protein CDR-1 and its homologs from C. elegans, and the failed axon connections (fax) protein from Drosophila. CDR-1 is an integral membrane protein that functions to protect against cadmium toxicity and may also have a role in osmoregulation to maintain salt balance in C. ele Probab=22.58 E-value=51 Score=13.28 Aligned_cols=22 Identities=23% Similarity=0.540 Sum_probs=14.9 Q ss_pred EEECCEECC-HHHHHHHHHHHHH Q ss_conf 875162318-9999999988864 Q gi|254780234|r 233 IAINGQVMS-PEVLLEQLNQYGR 254 (404) Q Consensus 233 VainG~~~~-~~~li~~LN~igg 254 (404) +..||+.++ .--+|++|++.-| T Consensus 53 I~~~g~~i~DS~~Ii~~L~~~fg 75 (75) T cd03080 53 IELNGEKIADSELIIDHLEEKYG 75 (75) T ss_pred EEECCEEECCHHHHHHHHHHHCC T ss_conf 99899894599999999997609 No 365 >cd01981 Pchlide_reductase_B Pchlide_reductase_B: B protein of the NB protein complex of Protochlorophyllide (Pchlide)_reductase. Pchlide reductase catalyzes the reductive formation of chlorophyllide (chlide) from protochlorophyllide (pchlide) during biosynthesis of chlorophylls and bacteriochlorophylls. This group contains both the light-independent Pchlide reductase (DPOR) and light-dependent Pchlide reductase (LPOR). Angiosperms contain only LPOR, cyanobacteria, algae and gymnosperms contain both DPOR and LPOR, primitive anoxygenic photosynthetic bacteria contain only DPOR. NB is structurally similar to the FeMo protein of nitrogenase, forming an N2B2 heterotetramer. N and B are homologous to the FeMo alpha and beta subunits respectively. Also in common with nitrogenase in vitro DPOR activity requires ATP hydrolysis and dithoionite or ferredoxin as electron donor. The NB protein complex may serve as a catalytic site for Pchlide reduction similar to MoFe for nitrogen reduction. Probab=22.43 E-value=51 Score=13.26 Aligned_cols=29 Identities=14% Similarity=0.120 Sum_probs=16.9 Q ss_pred EEEECCEEEEECCEECCHHHHHHHHH-HHH Q ss_conf 99950247875162318999999998-886 Q gi|254780234|r 225 IDFQRGDPIAINGQVMSPEVLLEQLN-QYG 253 (404) Q Consensus 225 I~Fe~G~PVainG~~~~~~~li~~LN-~ig 253 (404) -.+=+|..|+|-|...-..-++..|- ++| T Consensus 296 ~~~l~GKrvaI~gd~~~~~~l~~fl~~ELG 325 (430) T cd01981 296 SQNLTGKRAFVFGDATHVAAATRILAREMG 325 (430) T ss_pred HHCCCCCEEEEECCCHHHHHHHHHHHHHCC T ss_conf 540379779998781589999999998569 No 366 >PRK07315 fructose-bisphosphate aldolase; Provisional Probab=22.43 E-value=51 Score=13.26 Aligned_cols=34 Identities=21% Similarity=0.090 Sum_probs=20.9 Q ss_pred HHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEE Q ss_conf 7189869999945787110689999999739807998 Q gi|254780234|r 27 VEKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYV 63 (404) Q Consensus 27 ~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v 63 (404) .+.+|-|-+|.+. .-+.+..+-+-|...++ +.++ T Consensus 14 ~~~~yAV~AfNv~--n~e~~~avi~AAee~~s-PvIl 47 (293) T PRK07315 14 RDNGYAVGGFNTN--NLEWTQAILRAAEAKKA-PVLI 47 (293) T ss_pred HHCCEEEEEEEEC--CHHHHHHHHHHHHHHCC-CEEE T ss_conf 9879079987879--99999999999999788-9999 No 367 >pfam09255 Antig_Caf1 Caf1 Capsule antigen. Members of this family are predominantly found in the F1 capsule antigen Caf1 synthesized by Yersinia bacteria. They adopt a structure consisting of a seven strands arranged in two beta-sheets, in a Greek-key topology, and mediate targeting of the bacterium to sites of infection. Probab=22.32 E-value=51 Score=13.24 Aligned_cols=15 Identities=33% Similarity=0.891 Sum_probs=5.7 Q ss_pred CEEEEEEEECCEEEE Q ss_conf 638999995024787 Q gi|254780234|r 220 PTTIRIDFQRGDPIA 234 (404) Q Consensus 220 pe~v~I~Fe~G~PVa 234 (404) |..++++|+.|-|+. T Consensus 3 PARit~T~~~GA~iT 17 (136) T pfam09255 3 PARITLTYKEGAPIT 17 (136) T ss_pred CCEEEEEECCCCCEE T ss_conf 507888843998258 No 368 >PRK03352 DNA polymerase IV; Validated Probab=22.29 E-value=24 Score=15.57 Aligned_cols=122 Identities=13% Similarity=0.054 Sum_probs=56.7 Q ss_pred EEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCC------HHHHHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHH Q ss_conf 982008999998799997368654787420130------12433599999998856892875001116821567999999 Q gi|254780234|r 62 YVKDLRREFVRDFVFPMFRANALYEGYYLLGTA------IARPLIAKYLVDIANETGADAIAHGSTGKGNDQVRFELSAY 135 (404) Q Consensus 62 ~v~D~r~ef~~~~i~~~I~ana~Yeg~Ypl~ts------laRplia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~ 135 (404) .-+|+.. |.. -| -..-|-.+.|+ |+... -.|-+|+. .--.||+.|...-.-. |+ .+ T Consensus 8 ~HvD~d~-Fya-Sv--E~~~~P~l~~k-PvvV~~~~d~~~~~gvv~a-asyeAr~~GV~~gmp~----------~~--a~ 69 (345) T PRK03352 8 LHVDLDQ-FLA-SV--ELLRRPELAGL-PVIVGGNGDPTEPRKVVTC-ASYEARAFGVRAGMPL----------RT--AA 69 (345) T ss_pred EEECCCC-HHH-HH--HHHHCCCCCCC-CEEEECCCCCCCCCEEEEE-CCHHHHHHCCCCCCCH----------HH--HH T ss_conf 9974674-899-89--93537253697-1899427887778669998-1999997499889849----------99--99 Q ss_pred HHCCCCEEEEEECCC--CCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCCHHHCCCCCC Q ss_conf 858797077441006--87637999999997089675665678864157243115457632458457682010014658 Q gi|254780234|r 136 SLNSDIEIIAPWRHW--SFKGRQDLIDFAEKHAIPIDKNKRGEAPFSIDTNLLHSSSEGRVLEDPSQPAPEYVYKMIVS 212 (404) Q Consensus 136 ~l~P~l~viaP~Rd~--~~~sRe~~i~ya~~~gIpv~~~~~~~~~yS~D~Nlwg~S~Egg~Ledp~~~~pe~~~~~t~~ 212 (404) -++|++.++.|=.+. +. |+ ..++.+++..-.+ .+|||||--...+. ...++-....-..++.+|.- T Consensus 70 ~~cP~li~~~~~~~~Y~~~-S~-~i~~il~~~~~~v-------E~~SIDEafld~~~--~~~~~~a~~ir~~I~~~tgl 137 (345) T PRK03352 70 RRCPDATFLPSDPAAYDAA-SE-EVMALLRDLGHPV-------EVWGWDEAYLGADT--DDPEEVAEEIRAVVLEETGL 137 (345) T ss_pred HHCCCCEECCCCHHHHHHH-HH-HHHHHHHHCCCHH-------EECCCCCCEECCCC--CCHHHHHHHHHHHHHHHHCC T ss_conf 8688766658727999999-99-9999998619164-------04146601115888--99999999999999998789 No 369 >cd01477 vWA_F09G8-8_type VWA F09G8.8 type: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of mo Probab=22.21 E-value=52 Score=13.23 Aligned_cols=24 Identities=17% Similarity=0.152 Sum_probs=10.1 Q ss_pred HHHHHHHCCCCEEECHHHCCCCHH Q ss_conf 999988568928750011168215 Q gi|254780234|r 104 LVDIANETGADAIAHGSTGKGNDQ 127 (404) Q Consensus 104 lv~~a~~~ga~~iaHG~TgkGNDQ 127 (404) +....+..|...|.-+++..++-| T Consensus 153 vA~rLK~~Gv~IiTVa~~q~~~~~ 176 (193) T cd01477 153 IAARLKSTGIAIITVAFTQDESSN 176 (193) T ss_pred HHHHHHHCCCEEEEEECCCCCCHH T ss_conf 999998769789999826887588 No 370 >TIGR02824 quinone_pig3 putative NAD(P)H quinone oxidoreductase, PIG3 family; InterPro: IPR014189 Members of this family are putative quinone oxidoreductases that belong to the broader superfamily of zinc-dependent alcohol (of medium chain length) dehydrogenases and quinone oxidoreductases. An alignment shows no motif of conserved Cys residues as are found in zinc-binding members of the superfamily, consequently members of this entry are likely to be quinone oxidoreductases. A member of this family in Homo sapiens (Human), PIG3, is induced by p53 but is otherwise uncharacterised.. Probab=22.20 E-value=52 Score=13.23 Aligned_cols=49 Identities=29% Similarity=0.449 Sum_probs=18.5 Q ss_pred CCCHHHHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEEEEEECHHHH-HH Q ss_conf 477259999999987189869999945787110689999999739807998200899-99 Q gi|254780234|r 13 SGGLDTSIILKWLQVEKGLEVIVFIADLGQGEELKIASDKARLLGAKEVYVKDLRRE-FV 71 (404) Q Consensus 13 SGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~~~v~D~r~e-f~ 71 (404) |+|.-|+-+ -.=| ..|+.|++-+ |+.|-. +.-+.+|||. ..||.|+| |+ T Consensus 154 aSGIGttAI-qLAK-A~Ga~V~~Ta---GS~eK~---~~a~~~LGAd--~aINY~e~DFv 203 (334) T TIGR02824 154 ASGIGTTAI-QLAK-AFGARVFTTA---GSDEKC---ATACEALGAD--IAINYREEDFV 203 (334) T ss_pred CCCHHHHHH-HHHH-HCCCEEEEEE---CCHHHH---HHHHHHCCCE--EEEECCCCCHH T ss_conf 673679999-9998-5697599982---898999---9999860980--78607773479 No 371 >pfam00926 DHBP_synthase 3,4-dihydroxy-2-butanone 4-phosphate synthase. 3,4-Dihydroxy-2-butanone 4-phosphate is biosynthesized from ribulose 5-phosphate and serves as the biosynthetic precursor for the xylene ring of riboflavin. Sometimes found as a bifunctional enzyme with pfam00925. Probab=22.02 E-value=52 Score=13.20 Aligned_cols=32 Identities=22% Similarity=0.364 Sum_probs=21.9 Q ss_pred CCCEEEEEECCC--CCCCHHHHHHHHHHHCCCCC Q ss_conf 797077441006--87637999999997089675 Q gi|254780234|r 139 SDIEIIAPWRHW--SFKGRQDLIDFAEKHAIPID 170 (404) Q Consensus 139 P~l~viaP~Rd~--~~~sRe~~i~ya~~~gIpv~ 170 (404) +...||+...+- ++-++++..+||++||+|+- T Consensus 151 ~P~avi~Eil~~dG~~~~~~~~~~fA~~~~lp~i 184 (193) T pfam00926 151 TPAAVICEILNDDGTMARLPDLEEFAKEHGLPLI 184 (193) T ss_pred CCEEEEEEEECCCCCCCCHHHHHHHHHHCCCCEE T ss_conf 9639999985699882688999999998399799 No 372 >PRK07454 short chain dehydrogenase; Provisional Probab=21.99 E-value=52 Score=13.20 Aligned_cols=187 Identities=16% Similarity=0.136 Sum_probs=92.4 Q ss_pred CCCCCCEEEE--EECCCHHHHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCCC-EEEEEECHHHHH-HHHHHH Q ss_conf 9877778999--914772599999999871898699999457871106899999997398-079982008999-998799 Q gi|254780234|r 1 MSRDVKKVVL--AYSGGLDTSIILKWLQVEKGLEVIVFIADLGQGEELKIASDKARLLGA-KEVYVKDLRREF-VRDFVF 76 (404) Q Consensus 1 M~~~~kkVvl--aySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga-~~~~v~D~r~ef-~~~~i~ 76 (404) |+....||+| .=|+|+--.++..+. +.|+.|+.+.-| .+.++.+.+.....|. ...+..|+.++= ++..+. T Consensus 1 m~~~~mKvalITGas~GIG~a~A~~la--~~G~~V~l~~R~---~~~l~~~~~e~~~~g~~~~~~~~Dvt~~~~v~~~~~ 75 (241) T PRK07454 1 MSLNSMPTALITGASRGIGKATALAFA--KAGWDLALVARS---QDALEALAEELRSTGVKVAAYSIDLSNPEAIAPGIA 75 (241) T ss_pred CCCCCCCEEEECCCCCHHHHHHHHHHH--HCCCEEEEEECC---HHHHHHHHHHHHHCCCCEEEEEEECCCHHHHHHHHH T ss_conf 998999889991758789999999999--879989999899---999999999999659928999951899999999999 Q ss_pred HHHH---------CCCCCCC-CCCCCCCHHH------------HHHHHHHHHHHHHCCCCEEEC-HHHCCCCHHHHHHHH Q ss_conf 9973---------6865478-7420130124------------335999999988568928750-011168215679999 Q gi|254780234|r 77 PMFR---------ANALYEG-YYLLGTAIAR------------PLIAKYLVDIANETGADAIAH-GSTGKGNDQVRFELS 133 (404) Q Consensus 77 ~~I~---------ana~Yeg-~Ypl~tslaR------------plia~~lv~~a~~~ga~~iaH-G~TgkGNDQvRFe~~ 133 (404) ..+. .||=... .-...++... -...+.+++.|++.+...|-. +|+ .| T Consensus 76 ~~~~~~G~iDiLVnNAG~~~~~~~~~~~~e~~~~~~~vNl~g~~~~~~~~lp~M~~~~~G~IinisS~-ag--------- 145 (241) T PRK07454 76 ELLEQFGCPSVLINNAGAAYTGPLLEMPLSDWQWVIQLNLTSVFQCCSAVLPGMRARGGGLIINVSSH-AA--------- 145 (241) T ss_pred HHHHHCCCCCEEEECCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECH-HH--------- T ss_conf 99997599889998898899999266999999999999869999999999999997399899998356-54--------- Q ss_pred HHHHCCCCEEEEEECC--CCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCCHHHCCC Q ss_conf 9985879707744100--687637999999997089675665678864157243115457632458457682010014 Q gi|254780234|r 134 AYSLNSDIEIIAPWRH--WSFKGRQDLIDFAEKHAIPIDKNKRGEAPFSIDTNLLHSSSEGRVLEDPSQPAPEYVYKM 209 (404) Q Consensus 134 ~~~l~P~l~viaP~Rd--~~~~sRe~~i~ya~~~gIpv~~~~~~~~~yS~D~Nlwg~S~Egg~Ledp~~~~pe~~~~~ 209 (404) ..-.|....|+.-.- ..|+ |.-..+ +..+||-|..- .|=.+|..+|-........+....-.|||+-.. T Consensus 146 -~~~~~~~~~Y~aSK~al~~lt-~~la~E-~~~~gIrVn~V----~PG~v~T~m~~~~~~~~~~~~~~~l~PedVA~~ 216 (241) T PRK07454 146 -RNAFPQWGAYCVSKAALAAFT-KCLAEE-ERSHGIRVCTL----TLGAVNTPLWDSETVQADFDRSAMLSPEQVAQT 216 (241) T ss_pred -CCCCCCCHHHHHHHHHHHHHH-HHHHHH-HCCCCCEEEEE----EECCCCCCCCCCCCCCCCCCCCCCCCHHHHHHH T ss_conf -477899757999999999999-999998-38459389999----738898898886333355455689999999999 No 373 >pfam06342 DUF1057 Alpha/beta hydrolase of unknown function (DUF1057). This family consists of several Caenorhabditis elegans specific proteins of unknown function. Members of this family have an alpha/beta hydrolase fold. Probab=21.88 E-value=34 Score=14.55 Aligned_cols=33 Identities=30% Similarity=0.475 Sum_probs=23.3 Q ss_pred HHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEE-ECHHHCCCCHHH Q ss_conf 73686547874201301243359999999885689287-500111682156 Q gi|254780234|r 79 FRANALYEGYYLLGTAIARPLIAKYLVDIANETGADAI-AHGSTGKGNDQV 128 (404) Q Consensus 79 I~ana~Yeg~Ypl~tslaRplia~~lv~~a~~~ga~~i-aHG~TgkGNDQv 128 (404) +--+|.|++.-|-++++ .++| -|||-|.-||=. T Consensus 19 v~v~A~y~D~~~~gs~~-----------------GTVV~~HGsPGSH~DFk 52 (297) T pfam06342 19 VEVQAVYEDSLTSGSPF-----------------GTVVAFHGSPGSHNDFK 52 (297) T ss_pred EEEEEEEEECCCCCCCC-----------------EEEEEECCCCCCCCCHH T ss_conf 88888998448999875-----------------27999538999742267 No 374 >PRK11705 cyclopropane fatty acyl phospholipid synthase; Provisional Probab=21.74 E-value=53 Score=13.16 Aligned_cols=48 Identities=2% Similarity=-0.042 Sum_probs=21.2 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHH----HHHHCCCCEEEEEEEEC Q ss_conf 999999999999998753514698999999999----99723367399999906 Q gi|254780234|r 301 SAHLKDDLMSRYASIVYQGFWFSPEREMLQALI----DKSQEYVEGSITLKLYK 350 (404) Q Consensus 301 ~~~~K~~~~~~~a~lvy~G~wf~p~~~~l~a~i----~~~q~~VtG~V~l~L~k 350 (404) ...+++.....|.+..- | +-+.-+++=+-++ .--...-.+.+.+.|-| T Consensus 321 L~~Wr~~F~~~~~~i~~-g-fde~F~RmW~~YL~~ceagF~~~~i~v~Q~vlsk 372 (383) T PRK11705 321 LMAWHENFEAAWPELAD-N-YSERFYRMWRYYLLSCAGAFRARDIQLWQVVFSP 372 (383) T ss_pred HHHHHHHHHHHHHHHHH-C-CCCHHHHHHHHHHHHHHHHHHCCCCCEEEEEEEC T ss_conf 99999999999999997-3-9926441899999999999967998679999836 No 375 >TIGR02855 spore_yabG sporulation peptidase YabG; InterPro: IPR008764 Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. The peptidases families associated with clan U- have an unknown catalytic mechanism as the protein fold of the active site domain and the active site residues have not been reported. This is a group of peptidases belong to MEROPS peptidase family U57 (clan U-). The type example is the YabG protein of Bacillus subtilis. This is a protease involved in the synthesis and maturation of the spore coat proteins SpoIVA and YrbA of Bacillus subtilis .. Probab=21.70 E-value=28 Score=15.12 Aligned_cols=43 Identities=12% Similarity=0.166 Sum_probs=17.3 Q ss_pred HHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEEE Q ss_conf 99999988568928750011168215679999998587970774 Q gi|254780234|r 102 KYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEIIA 145 (404) Q Consensus 102 ~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~via 145 (404) +..+++=+++|.+++-=-|.=+---.-=.+ -+....||+=||+ T Consensus 127 ~~C~~~Y~~~gv~V~G~~~~E~emPe~v~~-L~~~~~PDIlViT 169 (292) T TIGR02855 127 RKCLKLYKKLGVPVVGIHCKEKEMPEKVLD-LIEEVRPDILVIT 169 (292) T ss_pred HHHHHHHHHCCCCEEEEEEEECCCCHHHHH-HHHHHCCCEEEEE T ss_conf 999998866197279999841218088999-9973099789994 No 376 >PRK09376 rho transcription termination factor Rho; Provisional Probab=21.67 E-value=27 Score=15.24 Aligned_cols=31 Identities=16% Similarity=0.139 Sum_probs=16.5 Q ss_pred CCHHHHHHHHHHHCCCEEEEEECHHHHHHHHH Q ss_conf 11068999999973980799820089999987 Q gi|254780234|r 43 GEELKIASDKARLLGAKEVYVKDLRREFVRDF 74 (404) Q Consensus 43 ~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~ 74 (404) ...+.++.+.|..+|+. .+.---|+|.+... T Consensus 9 ~~~l~eL~~iA~~lgI~-~~~~~~K~eLI~~I 39 (416) T PRK09376 9 NKTLSELLELAEELGIE-NASRLRKQELIFAI 39 (416) T ss_pred CCCHHHHHHHHHHCCCC-CCCCCCHHHHHHHH T ss_conf 69999999999985998-76778999999999 No 377 >PRK07369 dihydroorotase; Provisional Probab=21.55 E-value=53 Score=13.14 Aligned_cols=53 Identities=19% Similarity=0.213 Sum_probs=26.2 Q ss_pred HHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEEECCEEEEEEEECCCCCCCHHH Q ss_conf 99999998753514698999999999997233673999999061599987727831088322 Q gi|254780234|r 308 LMSRYASIVYQGFWFSPEREMLQALIDKSQEYVEGSITLKLYKGNVMVVGRESNKSLYSDKL 369 (404) Q Consensus 308 ~~~~~a~lvy~G~wf~p~~~~l~a~i~~~q~~VtG~V~l~L~kG~~~v~gr~S~~sLy~~~~ 369 (404) +..-|..+|-+|+. +++ +.++. +...-..+ +.||+|. ..+|.+....+++++. T Consensus 335 lpll~~~~V~~g~l-sl~-~~v~~-~s~nPAki-----~gl~~g~-I~~G~dADlvi~Dp~~ 387 (419) T PRK07369 335 LPLLWQNLVETGEL-SAL-QLWQA-LSTNPARC-----LGLEPPS-LEPGHPAELILFDPQK 387 (419) T ss_pred HHHHHHHHHHCCCC-CHH-HHHHH-HHHHHHHH-----HCCCCCC-CCCCCCCCEEEEECCC T ss_conf 99999999874998-999-99999-77789998-----3999997-2799977789992999 No 378 >PRK12937 short chain dehydrogenase; Provisional Probab=21.45 E-value=53 Score=13.12 Aligned_cols=65 Identities=18% Similarity=0.215 Sum_probs=35.4 Q ss_pred CCCCCCE-EEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCE-EEEEECHHHH Q ss_conf 9877778-9999147725999999998718986999994578711068999999973980-7998200899 Q gi|254780234|r 1 MSRDVKK-VVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQGEELKIASDKARLLGAK-EVYVKDLRRE 69 (404) Q Consensus 1 M~~~~kk-VvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~-~~~v~D~r~e 69 (404) |+..-|. +|..=|.|+--.++..+. +.|++|+.. +..+.+..+.+.+.....|.. ..+..|+.++ T Consensus 1 M~~sgK~alVTGgs~GIG~aia~~la--~~Ga~V~i~--~~~~~~~~~~~~~~~~~~~~~~~~~~~Dv~~~ 67 (245) T PRK12937 1 MTDSNKVAIVTGASRGIGAAIARRLA--ADGFAVAVN--YAGSAAMADELVEEIEAAGGRAIAVQADVADA 67 (245) T ss_pred CCCCCCEEEEECCCCHHHHHHHHHHH--HCCCEEEEE--CCCCHHHHHHHHHHHHHCCCCEEEEECCCCCH T ss_conf 99999889994857789999999999--879999997--69986899999999996599589998378999 No 379 >PRK06128 oxidoreductase; Provisional Probab=21.44 E-value=53 Score=13.12 Aligned_cols=65 Identities=17% Similarity=0.332 Sum_probs=33.0 Q ss_pred EEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCC-CHHHHHHHHHHHCCCEE-EEEECHHHH-HHHHHHH Q ss_conf 999914772599999999871898699999457871-10689999999739807-998200899-9998799 Q gi|254780234|r 8 VVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQG-EELKIASDKARLLGAKE-VYVKDLRRE-FVRDFVF 76 (404) Q Consensus 8 VvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~-~d~~~~~~~A~~~Ga~~-~~v~D~r~e-f~~~~i~ 76 (404) +|..=|.|+--.++..+.+ .|++|+. .++... .+.+++.+.....|..- .+..|+.++ .+++.+. T Consensus 59 lVTGgssGIG~AiA~~lA~--eGA~Vvi--~~~~~~~~~a~~~~~~i~~~G~~a~~v~~Dvsd~~~~~~~v~ 126 (300) T PRK06128 59 LITGADSGIGRATAIAFAR--EGADIVL--NYLPEEEQDAAEVVQLIQAEGRKAVAVPGDLKDEAFCRQLVE 126 (300) T ss_pred EEECCCCHHHHHHHHHHHH--CCCEEEE--ECCCCCHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHHHH T ss_conf 9917366999999999998--6999999--429955678999999999659818999747899999999999 No 380 >TIGR02146 LysS_fung_arch homocitrate synthase; InterPro: IPR011872 This entry includes the yeast LYS21 gene which carries out the first step of the alpha-aminoadipate (AAA) lysine biosynthesis pathway. A related pathway is found in Thermus thermophilus . This enzyme is closely related to 2-isopropylmalate synthase (LeuA) and citramalate synthase (CimA), both of which are present in the euryarchaeota. Some archaea have a separate homocitrate synthase (AksA) which also synthesizes longer homocitrate analogs .; GO: 0046912 transferase activity transferring acyl groups acyl groups converted into alkyl on transfer, 0019752 carboxylic acid metabolic process. Probab=21.42 E-value=46 Score=13.56 Aligned_cols=19 Identities=16% Similarity=0.457 Sum_probs=8.6 Q ss_pred HHHHHHHHHHHC--CCCCCCC Q ss_conf 799999999708--9675665 Q gi|254780234|r 155 RQDLIDFAEKHA--IPIDKNK 173 (404) Q Consensus 155 Re~~i~ya~~~g--Ipv~~~~ 173 (404) =.|.|+|++.|| +.|.+|. T Consensus 116 i~e~I~y~K~~Gphv~VRFta 136 (355) T TIGR02146 116 IAEVIEYAKSHGPHVKVRFTA 136 (355) T ss_pred HHHHHHHHHHCCCEEEEEECC T ss_conf 999999997248824788647 No 381 >TIGR00329 gcp metalloendopeptidase, putative, glycoprotease family; InterPro: IPR000905 Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site . The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This group of metallopeptidases belong to MEROPS peptidase family M22 (clan MK). The type example being O-sialoglycoprotein endopeptidase from Pasteurella haemolytica (Mannheimia haemolytica). O-Sialoglycoprotein endopeptidase is secreted by the bacterium P. haemolytica, and digests only proteins that are heavily sialylated, in particular those with sialylated serine and threonine residues . Substrate proteins include glycophorin A and leukocyte surface antigens CD34, CD43, CD44 and CD45 , . Removal of glycosylation, by treatment with neuraminidase, completely negates susceptibility to O-sialoglycoprotein endopeptidase digestion , . Sequence similarity searches have revealed other members of the M22 family, from yeast, Mycobacterium, Haemophilus influenzae and the cyanobacterium Synechocystis . The zinc-binding and catalytic residues of this family have not been determined, although the motif HMEGH may be a zinc-binding region .; GO: 0008450 O-sialoglycoprotein endopeptidase activity, 0006508 proteolysis. Probab=21.41 E-value=33 Score=14.65 Aligned_cols=48 Identities=27% Similarity=0.204 Sum_probs=29.2 Q ss_pred CEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCC Q ss_conf 7899991477259999999987189869999945787110689999999739 Q gi|254780234|r 6 KKVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQGEELKIASDKARLLG 57 (404) Q Consensus 6 kkVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~G 57 (404) .=|+|-.|||-= .++ ..+.-..|||+|=|.|..-+|-+|.+.+ .+-+| T Consensus 133 P~~~LlVSGGHT-~~~--~~~~~~~~e~~geT~DdA~Ge~fDK~ar-~~gl~ 180 (337) T TIGR00329 133 PFVSLLVSGGHT-QII--AVKGIGDYEILGETLDDAVGEAFDKVAR-LLGLG 180 (337) T ss_pred CCEEEEEECCEE-EEE--EEECCCCEEEEEEEHHHHHHHHHHHHHH-HHCCC T ss_conf 731677754011-532--2304675789852022454244677777-61889 No 382 >COG4789 EscV Type III secretory pathway, component EscV [Intracellular trafficking and secretion] Probab=21.36 E-value=54 Score=13.11 Aligned_cols=72 Identities=14% Similarity=0.088 Sum_probs=49.0 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCC Q ss_conf 210001113737899999999998650899999999999999999875351469899999999999723367 Q gi|254780234|r 270 GIKSRGVYETPGGTILLHAHRAIESISLDSGSAHLKDDLMSRYASIVYQGFWFSPEREMLQALIDKSQEYVE 341 (404) Q Consensus 270 G~KsR~vYEaPga~iL~~Ahr~LE~~~l~~~~~~~K~~~~~~~a~lvy~G~wf~p~~~~l~a~i~~~q~~Vt 341 (404) |++.++.-+-=...+-+.-+|.-+.+.==.++.++-++|+.+|++||-+-+-.-|+-+..+-+-+-.+++|+ T Consensus 471 g~~~~~~~~~i~~~L~~~l~r~a~efiGiQETryll~~mE~~Y~dLvKEvqR~ip~qriaeiLqRLv~E~Is 542 (689) T COG4789 471 GITYYAPLDVITLHLSHVLRRNANEFIGIQETRYLLDQMERKYSDLVKEVQRQIPLQRIAEILQRLVEERIS 542 (689) T ss_pred CCCEECHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCC T ss_conf 981554789999999999998688751368899999998633689999999857787899999999856887 No 383 >TIGR00584 mug mismatch-specific thymine-DNA glycosylate (mug); InterPro: IPR003310 G:U mismatches resulting from deamination of cytosine are the most common promutagenic lesions occurring in DNA. Uracil is removed in a base-excision repair pathway by uracil DNA-glycosylase (UDG), which excises uracil from both single- and double-stranded DNA. A biochemically distinct family of DNA repair enzymes which excises both uracil and thymine, but only from mispairs with guanine has been identified . Crystal structures of the mismatch-specific uracil DNA-glycosylase (MUG) from Escherichia coli, and of a DNA complex, reveal structural and functional homology to UDG despite low sequence identity .; GO: 0016798 hydrolase activity acting on glycosyl bonds, 0030983 mismatched DNA binding, 0006298 mismatch repair. Probab=21.35 E-value=32 Score=14.68 Aligned_cols=78 Identities=19% Similarity=0.234 Sum_probs=47.8 Q ss_pred CCCCCCHHHCCCCCEEEEEEEECCEEEEECCEEC-CHHH----------------HHHHHH--HHHHHCCCCCEECCCCC Q ss_conf 0146581227999638999995024787516231-8999----------------999998--88642163741000773 Q gi|254780234|r 207 YKMIVSPEEAPDTPTTIRIDFQRGDPIAINGQVM-SPEV----------------LLEQLN--QYGRCNGIGRIDIVENR 267 (404) Q Consensus 207 ~~~t~~p~~ap~~pe~v~I~Fe~G~PVainG~~~-~~~~----------------li~~LN--~igg~~GvGr~d~vEnr 267 (404) ...|+|---+ .-|..|.|+.-=|+--++-|-.+ +|-. -+.+|| .+-|+||||.+.|||-. T Consensus 134 L~~~~PD~l~-~N~~~vI~G~NPGl~~~~KGH~y~~P~N~FWK~L~~S~L~EGn~~l~~l~D~~LPG~~G~G~TNl~~R~ 212 (347) T TIGR00584 134 LLKTLPDILT-FNLDIVIVGLNPGLMAAYKGHAYAGPGNRFWKCLFKSKLSEGNVELNYLDDKDLPGKYGLGITNLVERT 212 (347) T ss_pred HHCCCCHHHH-CCCCEEEEECCHHHHHHHCCCCCCCCCHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCCCEEECCCC T ss_conf 6504412322-376379981272567540355579884257666542354302235431057568743467600000048 Q ss_pred CCCC---CCCCCCCCHHHHHHHH Q ss_conf 2021---0001113737899999 Q gi|254780234|r 268 FVGI---KSRGVYETPGGTILLH 287 (404) Q Consensus 268 ~vG~---KsR~vYEaPga~iL~~ 287 (404) --|- .+.|..| |+-||+. T Consensus 213 ~~~~~DL~~~E~~~--G~RIL~~ 233 (347) T TIGR00584 213 TSGSKDLSKKEFRE--GARILVE 233 (347) T ss_pred CCCCCCHHHHHHHC--CHHHHHH T ss_conf 88611000565322--2056788 No 384 >PRK05764 aspartate aminotransferase; Provisional Probab=21.32 E-value=54 Score=13.10 Aligned_cols=143 Identities=17% Similarity=0.194 Sum_probs=70.6 Q ss_pred HHCCCEEEEEEE---ECCCCCHHHHHHHHHHHCCCEEEE-----EECHHHHHHHHHHHH---------HHHCCCCCCCCC Q ss_conf 718986999994---578711068999999973980799-----820089999987999---------973686547874 Q gi|254780234|r 27 VEKGLEVIVFIA---DLGQGEELKIASDKARLLGAKEVY-----VKDLRREFVRDFVFP---------MFRANALYEGYY 89 (404) Q Consensus 27 ~e~g~eVi~~~~---d~Gq~~d~~~~~~~A~~~Ga~~~~-----v~D~r~ef~~~~i~~---------~I~ana~Yeg~Y 89 (404) +.+|-+||-+.+ |..-++.+.++-.++..-|.. .| ...+|+..++.|-.. .+-.+..-++-+ T Consensus 27 ~~~G~dvi~l~~g~pd~~~p~~i~~a~~~~~~~~~~-~Y~~~~G~~~LR~aia~~~~~~~g~~v~~d~I~it~G~~~al~ 105 (389) T PRK05764 27 KAAGRDVISLGAGEPDFDTPEHIKEAAIAALDEGKT-KYTPAAGIPELREAIAEKLKRDNGLDYEPDQIIVTTGAKQALY 105 (389) T ss_pred HHCCCCEEECCCCCCCCCCCHHHHHHHHHHHHCCCC-CCCCCCCCHHHHHHHHHHHHHHHCCCCCHHHEEECCCHHHHHH T ss_conf 977998278889789999889999999999847999-9989988799999999999998689985797898888799999 Q ss_pred CCCCCHHHH---HHH-----HHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCCEEE---EEEC----CCCCCC Q ss_conf 201301243---359-----9999998856892875001116821567999999858797077---4410----068763 Q gi|254780234|r 90 LLGTAIARP---LIA-----KYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDIEII---APWR----HWSFKG 154 (404) Q Consensus 90 pl~tslaRp---lia-----~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l~vi---aP~R----d~~~~s 154 (404) .+..++..| ++. -.....++..|+..+.--+.-.+|=+.-++.--+++.+..+.+ .|-- -|+..- T Consensus 106 ~~~~~l~~pGD~Vlv~~P~Y~~y~~~~~~~g~~~v~vp~~~~~~~~~d~~~l~~~~~~~~k~i~l~~P~NPtG~v~s~~~ 185 (389) T PRK05764 106 NAFMALLNPGDEVIIPAPYWVSYPEMVKLAGGKPVFVPTGEENGFKLTPEQLEAAITPKTKALILNSPSNPTGAVYSKEE 185 (389) T ss_pred HHHHHHCCCCCEEEECCCCCHHHHHHHHHCCCEEEEECCCHHHCCCCCHHHHHHHCCCCCEEEEECCCCCCCCCCCCHHH T ss_conf 99999589999899857862458999996498655504362208768999999863846649998899899886757799 Q ss_pred HHHHHHHHHHHCCCCC Q ss_conf 7999999997089675 Q gi|254780234|r 155 RQDLIDFAEKHAIPID 170 (404) Q Consensus 155 Re~~i~ya~~~gIpv~ 170 (404) +++.+++|++|++.|= T Consensus 186 l~~l~~~a~~~~i~ii 201 (389) T PRK05764 186 LEAIADVAVEHDIWVL 201 (389) T ss_pred HHHHHHHHHHCCCEEE T ss_conf 9999999886353275 No 385 >TIGR01367 pyrE_Therm orotate phosphoribosyltransferase; InterPro: IPR006273 This group of sequences are a distinct clade of orotate phosphoribosyltransferases. Members include the experimentally determined example from Thermus aquaticus and additional examples from Caulobacter crescentus, Helicobacter pylori (Campylobacter pylori), Mesorhizobium loti, and related species. ; GO: 0004588 orotate phosphoribosyltransferase activity, 0019856 pyrimidine base biosynthetic process. Probab=21.22 E-value=52 Score=13.22 Aligned_cols=27 Identities=33% Similarity=0.464 Sum_probs=17.4 Q ss_pred HCCCCCCCCCCCCCC--HHHHHHHHHHHH Q ss_conf 368654787420130--124335999999 Q gi|254780234|r 80 RANALYEGYYLLGTA--IARPLIAKYLVD 106 (404) Q Consensus 80 ~ana~Yeg~Ypl~ts--laRplia~~lv~ 106 (404) +|+|+.||++.|++. ..+.++...+++ T Consensus 8 ~aGA~~EGhFlLsSG~hS~~flQ~~~ll~ 36 (205) T TIGR01367 8 KAGALLEGHFLLSSGKHSPYFLQSAKLLE 36 (205) T ss_pred HCCCCCCCCEEEECCCCCCHHHHHHHHHH T ss_conf 71742137122405761423665656761 No 386 >smart00045 DAGKa Diacylglycerol kinase accessory domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. DAG can be produced from the hydrolysis of phosphatidylinositol 4,5-bisphosphate (PIP2) by a phosphoinositide-specific phospholipase C and by the degradation of phosphatidylcholine (PC) by a phospholipase C or the concerted actions of phospholipase D and phosphatidate phosphohydrolase. This domain might either be an accessory domain or else contribute to the catalytic domain. Bacterial homologues are known. Probab=21.16 E-value=54 Score=13.08 Aligned_cols=21 Identities=19% Similarity=0.510 Sum_probs=15.8 Q ss_pred CCEEEEEEEE--CCEEEEECCEE Q ss_conf 9638999995--02478751623 Q gi|254780234|r 219 TPTTIRIDFQ--RGDPIAINGQV 239 (404) Q Consensus 219 ~pe~v~I~Fe--~G~PVainG~~ 239 (404) ....|.|+|+ +..|+.||||+ T Consensus 138 Q~~~v~i~l~~~~~~p~QvDGEP 160 (160) T smart00045 138 QCSEVRITIKTSKTIPMQVDGEP 160 (160) T ss_pred CCCEEEEEEECCCCEEEEECCCC T ss_conf 38678999954881378638999 No 387 >TIGR01520 FruBisAldo_II_A fructose-bisphosphate aldolase, class II; InterPro: IPR006411 Members of this family are class II examples of the glycolytic enzyme fructose-bisphosphate aldolase (FBA). They represent one of two deeply split, architecturally distinct clades of the family that includes class II fructose-bisphosphate aldolases, tagatose-bisphosphate aldolases, and related uncharacterised proteins. This family is well-conserved and includes characterised FBA from Saccharomyces cerevisiae, Escherichia coli, and Corynebacterium glutamicum.; GO: 0004332 fructose-bisphosphate aldolase activity, 0006096 glycolysis. Probab=21.11 E-value=47 Score=13.53 Aligned_cols=23 Identities=17% Similarity=0.186 Sum_probs=15.0 Q ss_pred HHHHHHHHHHHCCCCEEECHHHC Q ss_conf 59999999885689287500111 Q gi|254780234|r 100 IAKYLVDIANETGADAIAHGSTG 122 (404) Q Consensus 100 ia~~lv~~a~~~ga~~iaHG~Tg 122 (404) .+-+.+|.|++.+++.+-+-||| T Consensus 42 t~nAaLeaA~d~ksP~i~QfS~G 64 (365) T TIGR01520 42 TINAALEAARDAKSPIIIQFSNG 64 (365) T ss_pred HHHHHHHHHHHCCCCEEEEECCC T ss_conf 68999999987079869985164 No 388 >TIGR00609 recB exodeoxyribonuclease V, beta subunit; InterPro: IPR004586 Exodeoxyribonuclease V, or RecBCD holoenzyme, (3.1.11.5 from EC) is a multifunctional nuclease with potent ATP-dependent exodeoxyribonuclease activity. Ejection of RecD, as occurs at chi recombinational hotspots, cripples exonuclease activity in favor of recombinagenic helicase activity. All proteins in this family for which functions are known are DNA-DNA helicases that are used as part of an exonuclease-helicase complex (made up of RecBCD homologs) that function to generate substrates for the initiation of recombination and recombinational repair. The complex catalyses exonucleolytic cleavage in either the 5' to 3' or 3' to 5' direction to yield 5-phosphooligonucleotides in the presence of ATP. This is the beta subunit .; GO: 0005524 ATP binding, 0008854 exodeoxyribonuclease V activity, 0006281 DNA repair. Probab=21.06 E-value=54 Score=13.07 Aligned_cols=67 Identities=19% Similarity=0.342 Sum_probs=52.9 Q ss_pred HHHHHHHHHHHHHHHCC------HHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEEECC Q ss_conf 89999999999865089------99999999999999998753514698999999999997233673999999061 Q gi|254780234|r 282 GTILLHAHRAIESISLD------SGSAHLKDDLMSRYASIVYQGFWFSPEREMLQALIDKSQEYVEGSITLKLYKG 351 (404) Q Consensus 282 a~iL~~Ahr~LE~~~l~------~~~~~~K~~~~~~~a~lvy~G~wf~p~~~~l~a~i~~~q~~VtG~V~l~L~kG 351 (404) |++| |+.||.+-.+ .+.-.....+..+|-...-++.|-+++.+.|.+.+..--..+...+.|+.=.- T Consensus 1070 G~~l---H~~LE~~~~~~~d~~~~~~e~~~~~~~~~~~~~~~e~~w~~~l~~~l~~~~~tPL~~~~~~~~Ls~~~~ 1142 (1324) T TIGR00609 1070 GTLL---HAILESLKFSAADDQAADLEKQENLIAEKLKKTGLEEQWEEELAEWLEKILNTPLTFAEEAITLSQIDP 1142 (1324) T ss_pred HHHH---HHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCEECCCCH T ss_conf 7899---999850587750136787999999999999721630200799999999998377775545501021486 No 389 >PRK08936 glucose-1-dehydrogenase; Provisional Probab=20.98 E-value=55 Score=13.05 Aligned_cols=58 Identities=16% Similarity=0.244 Sum_probs=35.9 Q ss_pred EEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHCCCEE-EEEECHHHH Q ss_conf 99991477259999999987189869999945787110689999999739807-998200899 Q gi|254780234|r 8 VVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQGEELKIASDKARLLGAKE-VYVKDLRRE 69 (404) Q Consensus 8 VvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~~~Ga~~-~~v~D~r~e 69 (404) +|..=|+|+--.++..+. ++|+.|+. + +....++.+...+...+.|..- .+..|+.++ T Consensus 11 lVTGa~~GIG~aia~~la--~~Ga~V~i-~-~~~~~~~~~~~~~~~~~~g~~~~~~~~Dv~~~ 69 (261) T PRK08936 11 VITGGSTGLGRAMAVRFG--KEKAKVVI-N-YRSDESEANDVAEEIKKVGGEAIAVKGDVTVE 69 (261) T ss_pred EEECCCCHHHHHHHHHHH--HCCCEEEE-E-CCCCHHHHHHHHHHHHHCCCCEEEEECCCCCH T ss_conf 996847789999999999--87999999-7-28987899999999996599389998279999 No 390 >cd06306 PBP1_TorT-like TorT-like proteins, a periplasmic binding protein family that activates induction of the Tor respiratory system upon trimethylamine N-oxide (TMAO) electron-acceptor binding in bacteria. TorT-like proteins, a periplasmic binding protein family that activates induction of the Tor respiratory system upon trimethylamine N-oxide (TMAO) electron-acceptor binding in bacteria. The Tor respiratory system is consists of three proteins (TorC, TorA, and TorD) and is induced in the presence of TMAO. The TMAO control is tightly regulated by three proteins: TorS, TorT, and TorR. Thus, the disruption of any of these proteins can abolish the Tor respiratory induction. TorT shares homology with the sugar-binding domain of the type I periplasmic binding proteins. The members of TorT-like family bind TMAO or related compounds and are predicted to be involved in signal transduction and/or substrate transport. Probab=20.97 E-value=55 Score=13.05 Aligned_cols=46 Identities=15% Similarity=0.091 Sum_probs=17.4 Q ss_pred HHHHHHHHHCCCCEEECHHHCCCC--HHHHHHHHHHHHCCCCEEEEEE Q ss_conf 999999885689287500111682--1567999999858797077441 Q gi|254780234|r 102 KYLVDIANETGADAIAHGSTGKGN--DQVRFELSAYSLNSDIEIIAPW 147 (404) Q Consensus 102 ~~lv~~a~~~ga~~iaHG~TgkGN--DQvRFe~~~~~l~P~l~viaP~ 147 (404) +.+-+.|++.|+++.--.+.+..| .|+..=..+-+-++|.-|++|. T Consensus 19 ~Ga~~aA~e~G~~v~~~~~~~~~d~~~Qi~~ie~~i~~gvDaIii~p~ 66 (268) T cd06306 19 YGMVEEAKRLGVSLKLLEAGGYPNLAKQIAQLEDCAAWGADAILLGAV 66 (268) T ss_pred HHHHHHHHHHCCEEEEEECCCCCCHHHHHHHHHHHHHCCCCEEEECCC T ss_conf 999999999699799995899999999999999999839999998679 No 391 >cd01518 RHOD_YceA Member of the Rhodanese Homology Domain superfamily. This CD includes Escherichia coli YceA, Bacillus subtilis YbfQ, and similar uncharacterized proteins. Probab=20.96 E-value=55 Score=13.05 Aligned_cols=16 Identities=25% Similarity=0.603 Sum_probs=10.3 Q ss_pred CEEEEEECHHHHHHHHHH Q ss_conf 807998200899999879 Q gi|254780234|r 58 AKEVYVKDLRREFVRDFV 75 (404) Q Consensus 58 a~~~~v~D~r~ef~~~~i 75 (404) -.++.++|+|+.. +|- T Consensus 15 ~~~~~liDVR~~~--E~~ 30 (101) T cd01518 15 DPEVVLLDVRNDY--EYD 30 (101) T ss_pred CCCEEEEECCCHH--HHH T ss_conf 8991999896979--980 No 392 >pfam04373 DUF511 Protein of unknown function (DUF511). Bacterial protein of unknown function. Probab=20.95 E-value=55 Score=13.05 Aligned_cols=129 Identities=23% Similarity=0.257 Sum_probs=71.5 Q ss_pred HHHHHHHHHHHHHCCCCEEECHHHCCCCHHHHHHHHHHHHCCCC-EEEEEECCCCCCCHHHHHHHHHHHCCCCCCCCCCC Q ss_conf 33599999998856892875001116821567999999858797-07744100687637999999997089675665678 Q gi|254780234|r 98 PLIAKYLVDIANETGADAIAHGSTGKGNDQVRFELSAYSLNSDI-EIIAPWRHWSFKGRQDLIDFAEKHAIPIDKNKRGE 176 (404) Q Consensus 98 plia~~lv~~a~~~ga~~iaHG~TgkGNDQvRFe~~~~~l~P~l-~viaP~Rd~~~~sRe~~i~ya~~~gIpv~~~~~~~ 176 (404) |+.++.+-.- ...-|..|-|-.+.|... ..-+.|.||+ .+.-|..+|+ + +-+++++..+-.- . T Consensus 117 plL~~fl~~~-~~~~~ktI~he~S~k~~~-----g~n~Wl~PDiVgv~f~~~~~~---~-~~~~~~~~~~~~~--~---- 180 (310) T pfam04373 117 PLLVKYLYEN-LNFYCKTIDHERSSNKRG-----GGNKWLHPDIVGVEFLYKDWS---E-EVRDCVKQYNGKS--V---- 180 (310) T ss_pred HHHHHHHHHC-CCCEEEEEEHHHCCCCCC-----CCCCCCCCCEEEEECCCCCCC---H-HHHHHHHHCCCCC--E---- T ss_conf 9999999735-784143410343367778-----887447887788874500135---9-9999999739983--3---- Q ss_pred CCCCCCHHHHCCCCCCCCCCCCCCCCCHHHCCCCCCHHHCCCCCEEEEEEEECCEEEEECCEECCHHHHHHHHHHHHHHC Q ss_conf 86415724311545763245845768201001465812279996389999950247875162318999999998886421 Q gi|254780234|r 177 APFSIDTNLLHSSSEGRVLEDPSQPAPEYVYKMIVSPEEAPDTPTTIRIDFQRGDPIAINGQVMSPEVLLEQLNQYGRCN 256 (404) Q Consensus 177 ~~yS~D~Nlwg~S~Egg~Ledp~~~~pe~~~~~t~~p~~ap~~pe~v~I~Fe~G~PVainG~~~~~~~li~~LN~igg~~ 256 (404) + || |+|-...-+-. ..-+ .|--.++-..+.++ |..|+.+=. -.+++..|..+...| T Consensus 181 k-------L~--SFElK~~in~s-Nlre-~fFQAVSNSSWANe----------GYLVa~~i~---~~~~~~EL~rL~~~f 236 (310) T pfam04373 181 K-------LW--SFELKKELNRS-NLRE-SFFQAVSNSSWANE----------GYLVAAEID---DSDVLDELRRLSQSF 236 (310) T ss_pred E-------EE--EEEEEEECCCH-HHHH-HHHHHHHCCCCCCC----------CEEEEEECC---CHHHHHHHHHHHHHC T ss_conf 8-------99--78988643501-7899-99998604532366----------538998328---777999999998723 Q ss_pred CCCCEECC-CC Q ss_conf 63741000-77 Q gi|254780234|r 257 GIGRIDIV-EN 266 (404) Q Consensus 257 GvGr~d~v-En 266 (404) |||.+..= +| T Consensus 237 GIGvI~L~~~n 247 (310) T pfam04373 237 GIGVIKLDVEN 247 (310) T ss_pred CEEEEEECCCC T ss_conf 80389846899 No 393 >COG3414 SgaB Phosphotransferase system, galactitol-specific IIB component [Carbohydrate transport and metabolism] Probab=20.92 E-value=55 Score=13.05 Aligned_cols=35 Identities=29% Similarity=0.433 Sum_probs=24.1 Q ss_pred CCEEEEEECCCHHHHHHHH-----HHHHHCCCE--EEEEEEEC Q ss_conf 7789999147725999999-----998718986--99999457 Q gi|254780234|r 5 VKKVVLAYSGGLDTSIILK-----WLQVEKGLE--VIVFIADL 40 (404) Q Consensus 5 ~kkVvlaySGGLDTSv~i~-----~L~~e~g~e--Vi~~~~d~ 40 (404) |+||..|.-.|+-||..++ .|+ ++|++ +-...+|. T Consensus 1 ~~KIL~aCG~GvgSS~~ik~kve~~l~-~~gi~~~~~~~~v~~ 42 (93) T COG3414 1 MIKILAACGNGVGSSTMIKMKVEEVLK-ELGIDVDVEQCAVDE 42 (93) T ss_pred CCEEEEECCCCCCHHHHHHHHHHHHHH-HCCCCCEEEEEEECC T ss_conf 957999878980289999999999999-859984266677111 No 394 >COG0615 TagD Cytidylyltransferase [Cell envelope biogenesis, outer membrane / Lipid metabolism] Probab=20.83 E-value=55 Score=13.03 Aligned_cols=58 Identities=21% Similarity=0.146 Sum_probs=33.7 Q ss_pred HHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHH--------------H-HHHHHHCCCC Q ss_conf 999997398079982008999998799997368654787420130124335999--------------9-9998856892 Q gi|254780234|r 50 SDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYEGYYLLGTAIARPLIAKY--------------L-VDIANETGAD 114 (404) Q Consensus 50 ~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Yeg~Ypl~tslaRplia~~--------------l-v~~a~~~ga~ 114 (404) -++|.++| .+.+|+=++++++...- ++-|.-..-.|.-+.+. . .+...+.++| T Consensus 21 L~~Ak~lG-d~liVv~a~de~~~~~~-----------k~~pi~~~~qR~evl~s~ryVD~vi~~~p~~~~~~~i~~~k~D 88 (140) T COG0615 21 LRQAKKLG-DELIVVVARDETVIKRK-----------KRKPIMPEEQRAEVLESLRYVDEVILGAPWDIKFEDIEEYKPD 88 (140) T ss_pred HHHHHHHC-CEEEEEEECCHHHHHHC-----------CCCCCCCHHHHHHHHHCCCCHHEEEECCCCCCCHHHHHHHCCC T ss_conf 99999858-96999996457888850-----------8999888899999997276254042089654676899995999 Q ss_pred EEECH Q ss_conf 87500 Q gi|254780234|r 115 AIAHG 119 (404) Q Consensus 115 ~iaHG 119 (404) +|+|| T Consensus 89 iv~lG 93 (140) T COG0615 89 IVVLG 93 (140) T ss_pred EEEEC T ss_conf 99977 No 395 >TIGR01963 PHB_DH 3-hydroxybutyrate dehydrogenase; InterPro: IPR011294 This entry represents a subfamily of the short chain dehydrogenases. Characterised members so far as 3-hydroxybutyrate dehydrogenases and are found in species that accumulate ester polymers called polyhydroxyalkanoic acids (PHAs) under certain conditions. Several members of the family are from species not known to accumulate PHAs, including Oceanobacillus iheyensis and Bacillus subtilis. However, polymer formation is not required for there be a role for 3-hydroxybutyrate dehydrogenase; it may be members of this family have the same function in those species.; GO: 0003858 3-hydroxybutyrate dehydrogenase activity. Probab=20.81 E-value=55 Score=13.03 Aligned_cols=128 Identities=20% Similarity=0.274 Sum_probs=72.2 Q ss_pred HHHCCCEEEEEEEECCCCCHHHH-HHHHHHHCCCEEEEEECHH-------HHHHHHHHHHHHHCCCCCCCCCCCCCCHHH Q ss_conf 87189869999945787110689-9999997398079982008-------999998799997368654787420130124 Q gi|254780234|r 26 QVEKGLEVIVFIADLGQGEELKI-ASDKARLLGAKEVYVKDLR-------REFVRDFVFPMFRANALYEGYYLLGTAIAR 97 (404) Q Consensus 26 ~~e~g~eVi~~~~d~Gq~~d~~~-~~~~A~~~Ga~~~~v~D~r-------~ef~~~~i~~~I~ana~Yeg~Ypl~tslaR 97 (404) ..+.|++|+.+-+|+-..+|+++ +++-+..+|..++.|=|+- |||-.|-=...|.=| |+ = T Consensus 47 ~~~~G~~v~~~~~D~T~~~e~~~~~~~~~~~fG~~DiLVNNAG~QhVaPiEeFP~~~w~~iiav~--------Lt----s 114 (258) T TIGR01963 47 ATDAGGSVIYLVADVTKEEEIADMIQAVAAEFGGLDILVNNAGIQHVAPIEEFPPEKWDRIIAVM--------LT----S 114 (258) T ss_pred HHHCCCCEEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEECCCEEECCCCCCCCHHHHHHHHHHC--------CC----H T ss_conf 99618835775147888899999999999985688748844640141765477866787373021--------68----8 Q ss_pred HHHH-HHHHHHHHHCCCC-EEE-----CHH---HCC-----------CC-HHHHHH-----HHHHHHCCCCEEEEEECCC Q ss_conf 3359-9999998856892-875-----001---116-----------82-156799-----9999858797077441006 Q gi|254780234|r 98 PLIA-KYLVDIANETGAD-AIA-----HGS---TGK-----------GN-DQVRFE-----LSAYSLNSDIEIIAPWRHW 150 (404) Q Consensus 98 plia-~~lv~~a~~~ga~-~ia-----HG~---Tgk-----------GN-DQvRFe-----~~~~~l~P~l~viaP~Rd~ 150 (404) |..+ |+.++.||+.|=- -|- ||= -+| |= .=+=.| .+..++||- =|-+|.- T Consensus 115 aF~t~raAlP~Mk~~gwGGRIiNIAS~HGLvASp~KSAYVAAKHG~~GLTKv~ALE~A~~giT~NaiCPG-YV~TPLV-- 191 (258) T TIGR01963 115 AFHTIRAALPHMKKQGWGGRIINIASVHGLVASPFKSAYVAAKHGLIGLTKVLALEVAAHGITANAICPG-YVRTPLV-- 191 (258) T ss_pred HHHHHHHCCCCCCCCCCCCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEECCC-CCCCHHH-- T ss_conf 8999975064321378553799710100000353213456774302121155554204788758667287-5675546-- Q ss_pred CCCCHHHHHHHHHHHCCCCCCC Q ss_conf 8763799999999708967566 Q gi|254780234|r 151 SFKGRQDLIDFAEKHAIPIDKN 172 (404) Q Consensus 151 ~~~sRe~~i~ya~~~gIpv~~~ 172 (404) +.+--+=|+.||||.+.- T Consensus 192 ----~~Qi~DqAk~rGi~eE~V 209 (258) T TIGR01963 192 ----EKQIADQAKTRGIPEEQV 209 (258) T ss_pred ----HHHHHHHHHHCCCCHHHH T ss_conf ----765899986518899888 No 396 >COG0794 GutQ Predicted sugar phosphate isomerase involved in capsule formation [Cell envelope biogenesis, outer membrane] Probab=20.76 E-value=55 Score=13.02 Aligned_cols=65 Identities=25% Similarity=0.244 Sum_probs=32.1 Q ss_pred HHHHHHHHHHHHHCCCCE--EE-----CHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHH----HHHHHHC Q ss_conf 335999999988568928--75-----001116821567999999858797077441006876379999----9999708 Q gi|254780234|r 98 PLIAKYLVDIANETGADA--IA-----HGSTGKGNDQVRFELSAYSLNSDIEIIAPWRHWSFKGRQDLI----DFAEKHA 166 (404) Q Consensus 98 plia~~lv~~a~~~ga~~--ia-----HG~TgkGNDQvRFe~~~~~l~P~l~viaP~Rd~~~~sRe~~i----~ya~~~g 166 (404) -+|+|+++-...-.|..+ |- ||-||. +.|+=-+++ .+-.|+..++ .++++.| T Consensus 51 G~Igkk~Aa~L~s~G~~a~fv~p~ea~hgdlg~-------------i~~~Dvvia----iS~SGeT~el~~~~~~aK~~g 113 (202) T COG0794 51 GLIGKKFAARLASTGTPAFFVGPAEALHGDLGM-------------ITPGDVVIA----ISGSGETKELLNLAPKAKRLG 113 (202) T ss_pred HHHHHHHHHHHHCCCCCEEEECCCHHCCCCCCC-------------CCCCCEEEE----EECCCCHHHHHHHHHHHHHCC T ss_conf 899999999997359955996673010377567-------------898889999----808971779999999999759 Q ss_pred CCCCCCCCCCCCCCC Q ss_conf 967566567886415 Q gi|254780234|r 167 IPIDKNKRGEAPFSI 181 (404) Q Consensus 167 Ipv~~~~~~~~~yS~ 181 (404) +++-+-. ..|.|. T Consensus 114 ~~liaiT--~~~~Ss 126 (202) T COG0794 114 AKLIAIT--SNPDSS 126 (202) T ss_pred CCEEEEE--CCCCCH T ss_conf 9489995--899986 No 397 >COG3473 Maleate cis-trans isomerase [Secondary metabolites biosynthesis, transport, and catabolism] Probab=20.74 E-value=55 Score=13.02 Aligned_cols=84 Identities=20% Similarity=0.290 Sum_probs=47.9 Q ss_pred CCEEEEEECCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCC--CCCCHHHCCCCCCHHHCC Q ss_conf 970774410068763799999999708967566567886415724311545763245845--768201001465812279 Q gi|254780234|r 140 DIEIIAPWRHWSFKGRQDLIDFAEKHAIPIDKNKRGEAPFSIDTNLLHSSSEGRVLEDPS--QPAPEYVYKMIVSPEEAP 217 (404) Q Consensus 140 ~l~viaP~Rd~~~~sRe~~i~ya~~~gIpv~~~~~~~~~yS~D~Nlwg~S~Egg~Ledp~--~~~pe~~~~~t~~p~~ap 217 (404) .+.+++|+-+ .. -+-+++|.+++||.|-..+ .=|+.+|.+ ..+|..+|...+... +| T Consensus 120 ri~vlTPY~~-ev--n~~e~ef~~~~Gfeiv~~~-----------------~Lgi~dn~eigr~~P~~~y~lAk~~~-~~ 178 (238) T COG3473 120 RISVLTPYID-EV--NQREIEFLEANGFEIVDFK-----------------GLGITDNLEIGRQEPWAVYRLAKEVF-TP 178 (238) T ss_pred EEEEECCCHH-HH--HHHHHHHHHHCCEEEEEEE-----------------CCCCCCCCHHCCCCHHHHHHHHHHHC-CC T ss_conf 5898525305-44--1589999984892799754-----------------15776650011468389999999856-77 Q ss_pred CCCEEEEEEEECCEEEEECCEECCHHHHHHHHHHHHHH Q ss_conf 99638999995024787516231899999999888642 Q gi|254780234|r 218 DTPTTIRIDFQRGDPIAINGQVMSPEVLLEQLNQYGRC 255 (404) Q Consensus 218 ~~pe~v~I~Fe~G~PVainG~~~~~~~li~~LN~igg~ 255 (404) +. +-+-|. | ..+...++|++|.+--|+ T Consensus 179 ~~-DaiFiS-----C-----TnlRt~eii~~lE~~~G~ 205 (238) T COG3473 179 DA-DAIFIS-----C-----TNLRTFEIIEKLERDTGV 205 (238) T ss_pred CC-CEEEEE-----E-----ECCCCHHHHHHHHHHHCC T ss_conf 77-769997-----0-----045317889999997499 No 398 >TIGR01935 NOT-MenG RraA family; InterPro: IPR010203 This entry includes a number of closely related sequences bacteria and plants. The Escherichia coli member of this family has been characterised as a regulator of RNase E (see IPR004659 from INTERPRO), and its crystal structure has been analysed . E. coli RraA acts as a trans-acting modulator of RNA turnover, binding essential endonuclease RNase E and inhibiting RNA processing . RNase E forms the core of a large RNA-catalysis machine termed the degradosomes. RraA (and RraB) causes remodelling of degradosome composition, which is associated with alterations in RNA decay and global transcript abundance and as such is a bacterial mechanism for the regulation of RNA cleavage .; GO: 0008428 ribonuclease inhibitor activity, 0051252 regulation of RNA metabolic process. Probab=20.52 E-value=30 Score=14.93 Aligned_cols=72 Identities=24% Similarity=0.375 Sum_probs=48.0 Q ss_pred CCHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCC-CCEEECHHH Q ss_conf 1106899999997398079982008999998799997368654787420130124335999999988568-928750011 Q gi|254780234|r 43 GEELKIASDKARLLGAKEVYVKDLRREFVRDFVFPMFRANALYEGYYLLGTAIARPLIAKYLVDIANETG-ADAIAHGST 121 (404) Q Consensus 43 ~~d~~~~~~~A~~~Ga~~~~v~D~r~ef~~~~i~~~I~ana~Yeg~Ypl~tslaRplia~~lv~~a~~~g-a~~iaHG~T 121 (404) -||=-.+|+.+..=|+=++.|+|..= ||-|.|+.-.+.+.|.+.| .-+|-|||- T Consensus 43 FEDNslvr~~L~qpG~GrVLVVDGgG-------------------------Slr~ALlGd~lA~~A~~NGWeGviv~GcV 97 (155) T TIGR01935 43 FEDNSLVREVLEQPGAGRVLVVDGGG-------------------------SLRRALLGDNLAELAEENGWEGVIVHGCV 97 (155) T ss_pred EECCHHHHHHHHCCCCCCEEEEECCC-------------------------HHHHHCCCCHHHHHHHHCCCCEEEEECCH T ss_conf 70473688998259997279995885-------------------------01021016756788650595149996230 Q ss_pred CCCCHHHH-HHHHHHHHCCC Q ss_conf 16821567-99999985879 Q gi|254780234|r 122 GKGNDQVR-FELSAYSLNSD 140 (404) Q Consensus 122 gkGNDQvR-Fe~~~~~l~P~ 140 (404) +==|+.+ +++-++||++. T Consensus 98 -RDv~~L~~~~lGv~ALAa~ 116 (155) T TIGR01935 98 -RDVAELAELDLGVKALAAH 116 (155) T ss_pred -HHHHHHHCCCCCCEEECCC T ss_conf -1688860489010010088 No 399 >PRK12384 sorbitol-6-phosphate dehydrogenase; Provisional Probab=20.49 E-value=56 Score=12.98 Aligned_cols=58 Identities=21% Similarity=0.200 Sum_probs=32.1 Q ss_pred EEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHH-HCCCE--EEEEECHHHH Q ss_conf 899991477259999999987189869999945787110689999999-73980--7998200899 Q gi|254780234|r 7 KVVLAYSGGLDTSIILKWLQVEKGLEVIVFIADLGQGEELKIASDKAR-LLGAK--EVYVKDLRRE 69 (404) Q Consensus 7 kVvlaySGGLDTSv~i~~L~~e~g~eVi~~~~d~Gq~~d~~~~~~~A~-~~Ga~--~~~v~D~r~e 69 (404) =+|..=|+|+--.++..++ +.|+.|+.. |.- .++++...++.. ..|.. ..+..|+.++ T Consensus 5 alITG~s~GIG~aia~~la--~~Ga~V~i~--~~~-~~~~~~~~~~l~~~~~~~~~~~~~~Dv~~~ 65 (259) T PRK12384 5 AVVIGGGQTLGAFLCHGLA--EEGYRVAVA--DIN-SEKAANVAQEINAEYGEGMAYGFGADATSE 65 (259) T ss_pred EEEECCCCHHHHHHHHHHH--HCCCEEEEE--ECC-HHHHHHHHHHHHHHCCCCEEEEEECCCCCH T ss_conf 9994688689999999999--879999999--798-899999999998624886089998327999 No 400 >COG2205 KdpD Osmosensitive K+ channel histidine kinase [Signal transduction mechanisms] Probab=20.42 E-value=56 Score=12.97 Aligned_cols=60 Identities=15% Similarity=0.145 Sum_probs=37.1 Q ss_pred CEEEEEECCCHHHHHHHHH---HHHHCCCEEEEEEEECCCC-----CH---HHHHHHHHHHCCCEEEEEECH Q ss_conf 7899991477259999999---9871898699999457871-----10---689999999739807998200 Q gi|254780234|r 6 KKVVLAYSGGLDTSIILKW---LQVEKGLEVIVFIADLGQG-----EE---LKIASDKARLLGAKEVYVKDL 66 (404) Q Consensus 6 kkVvlaySGGLDTSv~i~~---L~~e~g~eVi~~~~d~Gq~-----~d---~~~~~~~A~~~Ga~~~~v~D~ 66 (404) ++|++|.||+-.+--.+.+ |.+..+.+-+++++|.+.. .+ ++...+-|..+||. +.++.. T Consensus 249 e~ilvcI~~~~~~e~liR~a~RlA~~~~a~~~av~v~~~~~~~~~~~~~~~l~~~~~Lae~lGae-~~~l~~ 319 (890) T COG2205 249 ERILVCISGSPGSEKLIRRAARLASRLHAKWTAVYVETPELHRLSEKEARRLHENLRLAEELGAE-IVTLYG 319 (890) T ss_pred CEEEEEECCCCCHHHHHHHHHHHHHHHCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHCCE-EEEEEC T ss_conf 61899978998638999999999997289769999825664444377999999999999983886-999837 No 401 >TIGR02627 rhamnulo_kin rhamnulokinase; InterPro: IPR013449 Rhamnulokinase (2.7.1.5 from EC) is an enzyme that catalyzes the second step in rhamnose catabolism.; GO: 0008993 rhamnulokinase activity, 0019301 rhamnose catabolic process. Probab=20.35 E-value=56 Score=12.96 Aligned_cols=19 Identities=5% Similarity=0.358 Sum_probs=11.2 Q ss_pred HHHHHHHHHHHCCCCCCCC Q ss_conf 7999999997089675665 Q gi|254780234|r 155 RQDLIDFAEKHAIPIDKNK 173 (404) Q Consensus 155 Re~~i~ya~~~gIpv~~~~ 173 (404) -+|--.||+||+-|||.|. T Consensus 346 ~~eI~~yCREt~Q~vP~s~ 364 (460) T TIGR02627 346 VEEIQAYCRETNQPVPESD 364 (460) T ss_pred HHHHHHHHHHCCCCCCCCH T ss_conf 9999997542378899965 No 402 >pfam00148 Oxidored_nitro Nitrogenase component 1 type Oxidoreductase. Probab=20.34 E-value=56 Score=12.96 Aligned_cols=12 Identities=25% Similarity=0.368 Sum_probs=5.3 Q ss_pred HHHHHHHHCCCC Q ss_conf 999999708967 Q gi|254780234|r 158 LIDFAEKHAIPI 169 (404) Q Consensus 158 ~i~ya~~~gIpv 169 (404) ....+++-||.+ T Consensus 164 i~~ll~~~Gi~v 175 (398) T pfam00148 164 IKRLLEKLGIEV 175 (398) T ss_pred HHHHHHHCCCCE T ss_conf 999999839935 No 403 >PRK08195 4-hydroxy-2-ketovalerate aldolase; Validated Probab=20.22 E-value=56 Score=12.95 Aligned_cols=16 Identities=19% Similarity=0.131 Sum_probs=9.3 Q ss_pred CCCHHHCCCCCCHHHC Q ss_conf 6820100146581227 Q gi|254780234|r 201 PAPEYVYKMIVSPEEA 216 (404) Q Consensus 201 ~~pe~~~~~t~~p~~a 216 (404) -.|+++........+. T Consensus 170 ~~P~~v~~~v~~l~~~ 185 (337) T PRK08195 170 LLPDDVRARVRALRAA 185 (337) T ss_pred CCHHHHHHHHHHHHHH T ss_conf 7999999999999986 No 404 >COG1013 PorB Pyruvate:ferredoxin oxidoreductase and related 2-oxoacid:ferredoxin oxidoreductases, beta subunit [Energy production and conversion] Probab=20.22 E-value=56 Score=12.95 Aligned_cols=47 Identities=21% Similarity=0.254 Sum_probs=38.6 Q ss_pred HHHHHHHHCCCCEEECHHHCCCCHHHH-HHHHHHHHCCC-CEEEEEECC Q ss_conf 999998856892875001116821567-99999985879-707744100 Q gi|254780234|r 103 YLVDIANETGADAIAHGSTGKGNDQVR-FELSAYSLNSD-IEIIAPWRH 149 (404) Q Consensus 103 ~lv~~a~~~ga~~iaHG~TgkGNDQvR-Fe~~~~~l~P~-l~viaP~Rd 149 (404) -++.+|...|+.+||.++.+.-+|+.+ ...+++.-+|. +.+++|+-- T Consensus 159 d~~~la~a~G~~yVAr~~~~~~~~l~~~i~~A~~~~Gps~I~v~sPC~t 207 (294) T COG1013 159 DPGLLAMAAGATYVARASVGDPKDLTEKIKKAAEHKGPSFIDVLSPCPT 207 (294) T ss_pred CHHHHHHHCCCCEEEEEECCCHHHHHHHHHHHHHCCCCEEEEEECCCCC T ss_conf 9999998769976999703598999999999985679769999668867 No 405 >COG0452 Dfp Phosphopantothenoylcysteine synthetase/decarboxylase [Coenzyme metabolism] Probab=20.19 E-value=57 Score=12.94 Aligned_cols=34 Identities=29% Similarity=0.426 Sum_probs=23.9 Q ss_pred CCEEEEEECCCHHH--HHHHHHHHHHCCCEEEEEEE Q ss_conf 77899991477259--99999998718986999994 Q gi|254780234|r 5 VKKVVLAYSGGLDT--SIILKWLQVEKGLEVIVFIA 38 (404) Q Consensus 5 ~kkVvlaySGGLDT--Sv~i~~L~~e~g~eVi~~~~ 38 (404) .|+|+|+.|||... |+.+..+....|++|.++-- T Consensus 4 ~k~ill~v~gsiaayk~~~l~r~L~~~ga~v~vvmt 39 (392) T COG0452 4 GKRILLGVTGSIAAYKSVELVRLLRRSGAEVRVVMT 39 (392) T ss_pred CCEEEEEECCCHHHHHHHHHHHHHHHCCCEEEEECC T ss_conf 763899941733344678999987627973689706 No 406 >PRK07424 bifunctional sterol desaturase/short chain dehydrogenase; Validated Probab=20.01 E-value=57 Score=12.92 Aligned_cols=29 Identities=31% Similarity=0.494 Sum_probs=19.8 Q ss_pred EEEEECCCHHHHHHHHHHHHHCCCEEEEEEE Q ss_conf 9999147725999999998718986999994 Q gi|254780234|r 8 VVLAYSGGLDTSIILKWLQVEKGLEVIVFIA 38 (404) Q Consensus 8 VvlaySGGLDTSv~i~~L~~e~g~eVi~~~~ 38 (404) .|..-||+|-.+.+ +.|. .+|+.||++|- T Consensus 184 ~VTGASG~LG~aL~-k~l~-~~GAKVIalTs 212 (410) T PRK07424 184 AVTGASGTLGQALL-KELH-QQGAKVIALTS 212 (410) T ss_pred EEECCCCHHHHHHH-HHHH-HCCCEEEEEEC T ss_conf 99547737789999-9999-77998999935 Done!