Query gi|254781031|ref|YP_003065444.1| exsB protein [Candidatus Liberibacter asiaticus str. psy62] Match_columns 240 No_of_seqs 113 out of 2303 Neff 10.1 Searched_HMMs 39220 Date Mon May 30 03:33:28 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254781031.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK11106 queuosine biosynthesi 100.0 0 0 322.1 17.5 211 9-237 2-216 (231) 2 COG0603 Predicted PP-loop supe 100.0 0 0 308.6 17.0 209 7-237 1-214 (222) 3 TIGR00364 TIGR00364 exsB prote 100.0 0 0 316.3 10.1 206 11-227 1-227 (227) 4 cd01995 ExsB ExsB is a transcr 100.0 1.4E-45 0 302.2 16.3 169 10-233 1-169 (169) 5 pfam06508 ExsB ExsB. This fami 99.9 2.6E-24 6.7E-29 169.8 9.6 133 10-152 1-137 (137) 6 pfam02568 ThiI Thiamine biosyn 99.7 2.5E-16 6.5E-21 120.3 14.0 166 8-197 3-169 (197) 7 PRK08349 hypothetical protein; 99.7 1.5E-16 3.8E-21 121.7 12.1 163 10-196 2-165 (198) 8 cd01712 ThiI ThiI is required 99.7 2.7E-16 6.9E-21 120.1 13.2 164 10-197 1-164 (177) 9 PRK01565 thiamine biosynthesis 99.7 3.8E-16 9.6E-21 119.2 13.5 170 7-203 175-344 (399) 10 PRK01269 thiamine biosynthesis 99.7 1.9E-15 4.8E-20 114.9 13.0 164 7-196 176-340 (483) 11 COG0301 ThiI Thiamine biosynth 99.7 3.3E-15 8.4E-20 113.4 13.8 168 7-203 174-344 (383) 12 TIGR00342 TIGR00342 thiamine b 99.7 6.1E-16 1.6E-20 117.9 9.5 172 7-207 185-361 (391) 13 PRK08384 thiamine biosynthesis 99.4 2E-12 5.1E-17 96.1 11.9 132 7-190 178-309 (310) 14 PRK00143 trmU tRNA (5-methylam 99.4 5.2E-12 1.3E-16 93.5 9.8 171 7-189 3-185 (355) 15 COG0482 TrmU Predicted tRNA(5- 99.3 8.4E-12 2.1E-16 92.2 9.5 192 7-206 2-198 (356) 16 PRK05370 argininosuccinate syn 99.3 1.4E-10 3.5E-15 84.7 12.4 166 5-190 8-182 (447) 17 cd01998 tRNA_Me_trans tRNA met 99.3 1.5E-11 3.9E-16 90.6 7.2 176 10-190 1-182 (349) 18 pfam03054 tRNA_Me_trans tRNA m 99.3 1.7E-11 4.4E-16 90.3 7.1 170 10-190 2-184 (354) 19 PRK00509 argininosuccinate syn 99.2 4.2E-10 1.1E-14 81.7 13.4 160 7-190 1-165 (398) 20 TIGR02432 lysidine_TilS_N tRNA 99.2 1.9E-10 4.9E-15 83.8 11.5 164 10-194 1-185 (204) 21 PRK13820 argininosuccinate syn 99.2 5.5E-10 1.4E-14 81.0 13.4 158 7-190 2-163 (395) 22 COG0037 MesJ tRNA(Ile)-lysidin 99.2 4.7E-10 1.2E-14 81.4 12.7 157 8-189 21-186 (298) 23 pfam01171 ATP_bind_3 PP-loop f 99.2 2.5E-10 6.5E-15 83.0 10.9 151 10-192 1-164 (182) 24 COG1606 ATP-utilizing enzymes 99.2 2.2E-10 5.7E-15 83.4 9.6 153 2-191 11-166 (269) 25 TIGR00032 argG argininosuccina 99.2 5.2E-10 1.3E-14 81.1 10.7 161 10-191 1-174 (420) 26 PRK04527 argininosuccinate syn 99.1 7E-10 1.8E-14 80.3 10.8 156 8-189 3-167 (397) 27 cd01992 PP-ATPase N-terminal d 99.1 3.9E-10 1E-14 81.9 9.5 151 10-191 1-163 (185) 28 cd01993 Alpha_ANH_like_II This 99.1 7.9E-10 2E-14 80.0 10.8 159 10-192 1-175 (185) 29 TIGR00268 TIGR00268 conserved 99.1 8.4E-10 2.2E-14 79.8 10.4 164 2-204 6-176 (263) 30 pfam00764 Arginosuc_synth Argi 99.1 2E-09 5.1E-14 77.5 11.9 157 12-190 1-159 (389) 31 COG0137 ArgG Argininosuccinate 99.1 1.3E-09 3.3E-14 78.7 10.7 165 5-192 1-169 (403) 32 cd01990 Alpha_ANH_like_I This 99.1 1.1E-09 2.7E-14 79.2 10.2 153 11-203 1-156 (202) 33 TIGR00884 guaA_Cterm GMP synth 99.1 1.6E-09 4.2E-14 78.0 10.7 164 7-192 15-189 (319) 34 cd01999 Argininosuccinate_Synt 99.1 6.5E-09 1.6E-13 74.3 13.1 157 11-190 1-162 (385) 35 TIGR00420 trmU tRNA (5-methyla 99.1 1.4E-09 3.6E-14 78.5 9.2 170 9-188 1-198 (394) 36 PRK10660 tilS tRNA(Ile)-lysidi 99.1 1.7E-09 4.4E-14 77.9 9.6 159 2-194 7-178 (433) 37 cd01997 GMP_synthase_C The C-t 98.9 2E-08 5E-13 71.3 10.0 161 10-192 1-167 (295) 38 PRK00919 GMP synthase subunit 98.9 6.8E-09 1.7E-13 74.2 7.2 157 8-192 20-179 (306) 39 PRK00074 guaA GMP synthase; Re 98.9 6E-08 1.5E-12 68.3 11.5 164 7-192 215-384 (513) 40 KOG2805 consensus 98.8 7.1E-09 1.8E-13 74.1 6.5 184 4-190 1-192 (377) 41 PRK10696 C32 tRNA thiolase; Pr 98.7 1.2E-07 3E-12 66.5 9.6 159 2-188 33-204 (311) 42 COG0519 GuaA GMP synthase, PP- 98.7 2.7E-07 7E-12 64.2 10.5 163 8-192 21-186 (315) 43 KOG1706 consensus 98.7 2.1E-07 5.4E-12 64.9 9.4 42 4-45 1-42 (412) 44 pfam01507 PAPS_reduct Phosphoa 98.6 1E-06 2.6E-11 60.7 10.5 152 10-188 1-152 (174) 45 PRK05253 sulfate adenylyltrans 98.6 1.8E-06 4.6E-11 59.1 11.5 160 1-186 19-199 (300) 46 PRK12563 sulfate adenylyltrans 98.5 4.8E-06 1.2E-10 56.6 11.9 159 2-186 31-210 (312) 47 cd01713 PAPS_reductase This do 98.5 2.4E-06 6E-11 58.4 9.9 155 10-188 1-158 (173) 48 pfam02540 NAD_synthase NAD syn 98.3 9.1E-06 2.3E-10 54.8 9.5 150 7-191 17-170 (243) 49 cd00553 NAD_synthase NAD+ synt 98.3 6.9E-06 1.8E-10 55.5 8.7 156 7-191 22-179 (248) 50 COG2117 Predicted subunit of t 98.2 1.1E-05 2.9E-10 54.2 8.1 144 11-187 3-148 (198) 51 KOG1622 consensus 98.2 6.7E-06 1.7E-10 55.6 6.6 38 8-45 230-269 (552) 52 PRK06850 hypothetical protein; 98.1 4.9E-05 1.2E-09 50.3 10.4 216 11-236 24-270 (488) 53 PRK13980 NAD synthetase; Provi 98.1 3.9E-05 1E-09 50.9 9.6 151 7-191 27-179 (264) 54 cd01986 Alpha_ANH_like Adenine 98.1 9.3E-06 2.4E-10 54.8 6.3 34 11-44 1-34 (103) 55 TIGR02039 CysD sulfate adenyly 98.1 7.7E-05 2E-09 49.1 10.9 159 1-186 12-192 (295) 56 TIGR03183 DNA_S_dndC putative 98.0 3.6E-05 9.2E-10 51.1 8.5 170 11-187 16-200 (447) 57 PRK00876 nadE NAD synthetase; 98.0 2.4E-05 6E-10 52.2 7.4 35 8-42 33-69 (325) 58 COG0175 CysH 3'-phosphoadenosi 98.0 7.9E-05 2E-09 49.0 9.9 156 7-188 38-193 (261) 59 PRK13981 NAD synthetase; Provi 98.0 5.3E-05 1.4E-09 50.1 8.8 150 8-186 281-432 (543) 60 PRK08576 hypothetical protein; 98.0 1.7E-05 4.4E-10 53.1 6.3 51 10-60 237-287 (439) 61 PRK08557 hypothetical protein; 98.0 1.8E-05 4.7E-10 52.9 5.7 50 10-59 185-234 (420) 62 PRK13794 hypothetical protein; 97.9 2.3E-05 5.9E-10 52.3 6.0 52 8-59 245-297 (473) 63 TIGR03573 WbuX N-acetyl sugar 97.9 5.7E-05 1.5E-09 49.9 7.7 110 10-138 61-171 (343) 64 cd01994 Alpha_ANH_like_IV This 97.9 0.00045 1.1E-08 44.3 12.0 133 10-190 1-139 (194) 65 cd01991 Asn_Synthase_B_C The C 97.8 5E-05 1.3E-09 50.2 6.3 56 5-60 12-68 (269) 66 pfam00733 Asn_synthase Asparag 97.8 4.2E-05 1.1E-09 50.7 5.7 54 7-60 16-70 (195) 67 COG0171 NadE NAD synthase [Coe 97.7 0.00025 6.5E-09 45.8 8.2 38 7-44 24-67 (268) 68 PRK13795 hypothetical protein; 97.7 0.00015 3.7E-09 47.3 6.6 50 8-57 245-294 (630) 69 cd01996 Alpha_ANH_like_III Thi 97.7 8.7E-05 2.2E-09 48.7 5.4 114 10-144 3-119 (154) 70 PRK00768 nadE NAD synthetase; 97.7 0.00029 7.3E-09 45.5 7.8 24 169-192 181-204 (274) 71 pfam01902 ATP_bind_4 ATP-bindi 97.6 0.0044 1.1E-07 38.2 13.0 128 10-188 2-134 (219) 72 TIGR01536 asn_synth_AEB aspara 97.6 0.00029 7.4E-09 45.5 6.6 126 4-147 308-453 (646) 73 PTZ00323 NAD+ synthase; Provis 97.4 0.0011 2.9E-08 41.8 7.8 26 166-191 187-212 (294) 74 PTZ00077 asparagine synthetase 97.3 0.0012 3.1E-08 41.6 7.5 26 4-30 234-259 (610) 75 TIGR03108 eps_aminotran_1 exos 97.3 0.0008 2E-08 42.8 6.1 50 10-59 260-310 (628) 76 COG1365 Predicted ATPase (PP-l 97.2 0.0015 3.9E-08 41.0 6.8 147 9-203 61-210 (255) 77 TIGR00289 TIGR00289 conserved 97.2 0.0005 1.3E-08 44.0 4.0 31 11-41 3-34 (227) 78 PRK02628 nadE NAD synthetase; 97.1 0.0037 9.4E-08 38.6 8.1 34 8-41 362-402 (678) 79 COG2102 Predicted ATPases of P 97.1 0.00079 2E-08 42.8 4.5 128 10-188 2-135 (223) 80 COG0367 AsnB Asparagine syntha 97.0 0.0023 5.8E-08 39.9 6.4 54 8-61 230-285 (542) 81 TIGR03104 trio_amidotrans aspa 97.0 0.0012 3E-08 41.7 4.8 33 11-43 263-296 (589) 82 PRK09431 asnB asparagine synth 96.7 0.0039 1E-07 38.5 5.5 26 5-31 225-250 (555) 83 cd01984 AANH_like Adenine nucl 96.7 0.0026 6.6E-08 39.6 4.2 31 11-41 1-34 (86) 84 COG3969 Predicted phosphoadeno 96.5 0.0037 9.5E-08 38.6 4.3 40 6-45 25-70 (407) 85 PRK02090 phosphoadenosine phos 96.5 0.036 9.1E-07 32.5 9.1 155 7-190 40-194 (243) 86 KOG0573 consensus 96.2 0.014 3.6E-07 35.0 5.7 60 10-71 252-322 (520) 87 KOG0571 consensus 95.9 0.01 2.6E-07 35.9 4.0 53 5-60 223-283 (543) 88 KOG2594 consensus 95.1 0.13 3.2E-06 29.1 7.2 173 3-186 57-260 (396) 89 TIGR00552 nadE NAD+ synthetase 95.0 0.11 2.7E-06 29.6 6.6 153 7-189 25-188 (286) 90 KOG2316 consensus 90.6 0.45 1.1E-05 25.7 4.3 32 10-41 2-33 (277) 91 TIGR00290 MJ0570_dom MJ0570-re 85.8 0.66 1.7E-05 24.7 2.6 33 10-42 2-34 (265) 92 KOG2840 consensus 79.8 2.1 5.3E-05 21.6 3.2 75 2-84 45-125 (347) 93 PRK05077 frsA fermentation/res 71.6 3.1 7.8E-05 20.5 2.3 33 10-42 194-230 (414) 94 TIGR02397 dnaX_nterm DNA polym 71.3 2.1 5.3E-05 21.6 1.4 23 197-226 56-78 (363) 95 KOG2303 consensus 69.5 2.6 6.5E-05 21.0 1.5 18 11-28 352-369 (706) 96 PRK12435 ferrochelatase; Provi 69.1 2.4 6.2E-05 21.2 1.3 22 5-26 1-23 (316) 97 pfam06500 DUF1100 Alpha/beta h 68.0 4.1 0.00011 19.7 2.3 34 9-42 189-226 (411) 98 cd01524 RHOD_Pyr_redox Member 66.1 9.1 0.00023 17.6 4.2 32 7-38 50-81 (90) 99 cd01534 4RHOD_Repeat_3 Member 64.0 10 0.00026 17.3 4.0 31 8-38 56-86 (95) 100 pfam10589 NADH_4Fe-4S NADH-ubi 61.2 2 5.2E-05 21.6 -0.3 14 217-230 15-28 (46) 101 PRK05707 DNA polymerase III su 61.1 5.6 0.00014 18.9 1.9 16 10-26 23-40 (328) 102 TIGR00276 TIGR00276 iron-sulfu 60.5 3.5 9E-05 20.1 0.9 11 216-226 172-182 (297) 103 TIGR00692 tdh L-threonine 3-de 60.1 2.2 5.7E-05 21.4 -0.2 20 176-195 214-233 (341) 104 PRK05818 DNA polymerase III su 58.6 5.8 0.00015 18.8 1.7 12 4-15 3-14 (262) 105 cd01523 RHOD_Lact_B Member of 56.6 13 0.00034 16.5 4.7 29 8-36 61-89 (100) 106 COG1751 Uncharacterized conser 56.1 14 0.00035 16.5 4.6 60 9-69 29-88 (186) 107 PRK13596 NADH dehydrogenase I 56.0 3.5 8.9E-05 20.2 0.2 16 216-231 343-358 (426) 108 KOG0795 consensus 55.5 6.3 0.00016 18.6 1.4 80 139-231 78-161 (262) 109 COG1856 Uncharacterized homolo 55.4 5.4 0.00014 19.0 1.1 31 4-35 51-86 (275) 110 KOG0317 consensus 55.1 6.2 0.00016 18.6 1.3 32 195-226 234-267 (293) 111 TIGR00678 holB DNA polymerase 55.1 7.6 0.00019 18.1 1.8 19 170-188 175-193 (216) 112 COG5273 Uncharacterized protei 55.0 3.6 9.1E-05 20.1 0.1 22 201-228 113-136 (309) 113 COG1062 AdhC Zn-dependent alco 52.2 3.1 7.8E-05 20.5 -0.6 29 13-41 191-219 (366) 114 TIGR02055 APS_reductase adenyl 50.7 17 0.00043 15.9 6.0 151 16-188 1-154 (200) 115 pfam04301 DUF452 Protein of un 50.4 17 0.00043 15.9 3.5 36 7-43 9-45 (213) 116 PTZ00304 NADH dehydrogenase (u 49.9 3.8 9.6E-05 20.0 -0.4 12 217-228 366-377 (483) 117 PTZ00082 L-lactate dehydrogena 49.8 17 0.00044 15.9 6.8 38 1-41 1-38 (322) 118 PRK10684 HCP oxidoreductase, N 49.8 17 0.00044 15.9 5.7 38 9-47 112-152 (335) 119 cd01518 RHOD_YceA Member of th 47.7 19 0.00048 15.7 4.3 24 10-33 63-86 (101) 120 PRK07276 DNA polymerase III su 46.5 14 0.00035 16.5 2.0 21 9-30 24-44 (290) 121 COG1150 HdrC Heterodisulfide r 45.8 13 0.00033 16.6 1.8 10 218-227 85-94 (195) 122 COG1894 NuoF NADH:ubiquinone o 45.8 4.4 0.00011 19.6 -0.6 11 217-227 348-358 (424) 123 PRK11278 NADH dehydrogenase I 45.4 5 0.00013 19.2 -0.4 16 216-231 352-367 (448) 124 pfam04396 consensus 45.1 20 0.00052 15.4 4.1 17 120-136 86-102 (149) 125 TIGR02196 GlrX_YruB Glutaredox 43.3 11 0.00028 17.1 1.1 19 221-239 10-28 (79) 126 COG1341 Predicted GTPase or GT 42.5 22 0.00057 15.2 4.1 42 7-48 70-117 (398) 127 PRK09590 celB cellobiose phosp 42.0 23 0.00058 15.1 4.6 34 9-42 2-41 (104) 128 TIGR03030 CelA cellulose synth 41.6 15 0.00037 16.3 1.5 19 215-235 305-323 (713) 129 cd03274 ABC_SMC4_euk Eukaryoti 41.0 22 0.00056 15.2 2.3 11 13-23 126-136 (212) 130 TIGR01389 recQ ATP-dependent D 40.8 7.6 0.00019 18.1 -0.0 36 196-231 364-402 (607) 131 TIGR01472 gmd GDP-mannose 4,6- 40.3 24 0.00062 15.0 3.6 27 13-39 3-31 (365) 132 TIGR01198 pgl 6-phosphoglucono 40.3 11 0.00029 17.0 0.8 41 1-41 20-67 (251) 133 pfam08366 LLGL LLGL2. This dom 40.2 16 0.0004 16.1 1.5 30 7-44 25-54 (105) 134 COG1158 Rho Transcription term 40.2 11 0.00028 17.1 0.7 15 168-182 355-369 (422) 135 pfam08981 consensus 40.2 24 0.00062 14.9 4.8 58 9-67 27-85 (181) 136 cd05565 PTS_IIB_lactose PTS_II 40.2 24 0.00062 14.9 4.3 35 10-44 2-40 (99) 137 KOG1315 consensus 39.4 12 0.0003 16.9 0.8 21 202-228 116-136 (307) 138 TIGR00725 TIGR00725 conserved 39.3 14 0.00037 16.4 1.2 37 3-39 28-66 (176) 139 cd01522 RHOD_1 Member of the R 39.3 25 0.00064 14.9 4.1 14 25-38 6-19 (117) 140 KOG0316 consensus 38.6 3.5 9E-05 20.2 -2.0 30 5-34 107-136 (307) 141 TIGR02108 PQQ_syn_pqqB coenzym 37.6 27 0.00068 14.7 2.5 22 10-31 82-103 (314) 142 COG1600 Uncharacterized Fe-S p 37.5 13 0.00034 16.6 0.8 12 216-227 185-196 (337) 143 PRK00162 glpE thiosulfate sulf 37.0 27 0.0007 14.6 4.6 18 15-32 65-82 (104) 144 PTZ00200 cysteine proteinase p 35.9 19 0.00048 15.6 1.3 12 212-223 250-261 (448) 145 TIGR00888 guaA_Nterm GMP synth 35.8 18 0.00046 15.8 1.2 19 3-22 36-54 (195) 146 TIGR03366 HpnZ_proposed putati 35.6 16 0.00042 16.0 1.0 32 8-40 121-153 (280) 147 PRK13685 hypothetical protein; 34.8 30 0.00075 14.4 4.4 14 174-187 273-286 (326) 148 PRK06305 DNA polymerase III su 34.0 30 0.00078 14.3 5.6 32 28-60 32-63 (462) 149 KOG0272 consensus 33.6 14 0.00035 16.5 0.3 17 14-30 318-334 (459) 150 COG2210 Peroxiredoxin family p 33.0 32 0.00081 14.2 4.4 37 7-43 2-42 (137) 151 KOG1372 consensus 32.7 32 0.00082 14.2 3.9 36 4-39 22-59 (376) 152 KOG0823 consensus 31.6 17 0.00044 15.9 0.6 22 8-29 57-78 (230) 153 pfam01799 Fer2_2 [2Fe-2S] bind 30.9 19 0.00048 15.6 0.7 12 146-157 29-40 (75) 154 KOG1312 consensus 30.5 19 0.00049 15.6 0.7 14 215-228 162-175 (341) 155 PRK09111 DNA polymerase III su 30.3 35 0.0009 14.0 6.2 16 15-30 284-299 (600) 156 KOG0270 consensus 30.2 4.3 0.00011 19.6 -2.7 19 13-31 258-276 (463) 157 TIGR02518 EutH_ACDH acetaldehy 30.0 31 0.0008 14.3 1.7 20 2-21 180-199 (528) 158 TIGR01488 HAD-SF-IB HAD-superf 29.8 36 0.00091 13.9 2.0 12 172-183 161-172 (204) 159 PRK11433 putative xanthine deh 29.7 24 0.00062 14.9 1.1 12 215-226 144-155 (217) 160 PRK09908 xanthine dehydrogenas 29.6 23 0.00059 15.1 1.0 18 215-232 100-118 (159) 161 PRK09376 rho transcription ter 29.1 20 0.0005 15.5 0.5 19 217-235 237-255 (416) 162 KOG0509 consensus 28.0 36 0.00093 13.9 1.7 57 164-228 391-447 (600) 163 TIGR03198 pucE xanthine dehydr 28.0 26 0.00067 14.7 1.0 12 215-226 95-106 (151) 164 cd04193 UDPGlcNAc_PPase UDPGlc 27.6 35 0.00089 14.0 1.6 21 3-23 9-29 (323) 165 cd01399 GlcN6P_deaminase GlcN6 27.5 33 0.00083 14.2 1.4 40 2-42 12-59 (232) 166 TIGR02057 PAPS_reductase phosp 26.2 41 0.0011 13.5 5.8 192 10-220 29-238 (239) 167 COG0607 PspE Rhodanese-related 26.2 42 0.0011 13.5 3.0 27 8-34 61-87 (110) 168 COG3349 Uncharacterized conser 25.7 42 0.0011 13.5 4.0 29 12-40 3-31 (485) 169 TIGR03193 4hydroxCoAred 4-hydr 25.5 29 0.00074 14.5 0.8 12 215-226 94-105 (148) 170 TIGR02374 nitri_red_nirB nitri 25.4 43 0.0011 13.4 2.3 47 169-228 430-476 (813) 171 TIGR02951 DMSO_dmsB dimethylsu 25.0 36 0.00092 13.9 1.3 30 198-229 103-133 (162) 172 pfam08240 ADH_N Alcohol dehydr 24.9 33 0.00085 14.1 1.1 13 216-228 64-76 (108) 173 KOG4283 consensus 23.9 6 0.00015 18.7 -2.9 19 13-31 58-76 (397) 174 cd01400 6PGL 6PGL: 6-Phosphogl 23.7 43 0.0011 13.4 1.4 40 2-42 16-60 (219) 175 TIGR02819 fdhA_non_GSH formald 23.6 18 0.00046 15.7 -0.5 10 218-227 106-115 (403) 176 KOG1311 consensus 23.5 17 0.00044 15.9 -0.6 15 214-228 126-140 (299) 177 PRK10509 bacterioferritin-asso 23.4 27 0.0007 14.6 0.4 41 174-227 8-48 (64) 178 TIGR01016 sucCoAbeta succinyl- 23.4 44 0.0011 13.4 1.4 25 161-185 306-332 (389) 179 KOG4533 consensus 22.2 48 0.0012 13.1 1.5 18 3-20 127-144 (317) 180 COG2080 CoxS Aerobic-type carb 22.1 37 0.00094 13.8 0.9 10 216-225 98-107 (156) 181 TIGR03451 mycoS_dep_FDH mycoth 22.1 42 0.0011 13.5 1.1 33 7-40 176-209 (358) 182 KOG1313 consensus 21.8 34 0.00087 14.0 0.6 15 213-227 114-128 (309) 183 cd06259 YdcF-like YdcF-like. Y 21.8 50 0.0013 13.0 4.6 31 10-40 1-41 (150) 184 TIGR02963 xanthine_xdhA xanthi 21.7 32 0.00082 14.2 0.5 15 166-180 375-389 (515) 185 COG5574 PEX10 RING-finger-cont 21.7 41 0.001 13.5 1.0 17 212-228 227-245 (271) 186 COG1063 Tdh Threonine dehydrog 21.6 32 0.00082 14.2 0.4 28 11-38 171-199 (350) 187 PTZ00038 ferredoxin; Provision 21.5 51 0.0013 12.9 2.0 41 164-222 77-122 (172) 188 PRK09880 L-idonate 5-dehydroge 21.4 42 0.0011 13.5 1.0 32 8-40 170-202 (343) 189 PRK06674 DNA polymerase III su 21.4 51 0.0013 12.9 5.4 14 16-29 273-286 (563) 190 PRK13339 malate:quinone oxidor 21.2 52 0.0013 12.9 3.0 27 5-31 2-28 (497) 191 PTZ00019 fructose-bisphosphate 20.9 53 0.0013 12.9 2.6 11 193-203 285-295 (356) 192 COG1832 Predicted CoA-binding 20.8 53 0.0013 12.9 3.8 39 3-41 11-52 (140) 193 PRK06647 DNA polymerase III su 20.4 54 0.0014 12.8 5.6 18 13-30 41-60 (560) 194 PRK13389 UTP--glucose-1-phosph 20.3 51 0.0013 13.0 1.2 20 1-21 1-20 (302) 195 KOG1615 consensus 20.2 54 0.0014 12.8 2.2 12 175-186 158-169 (227) 196 pfam04885 Stig1 Stigma-specifi 20.1 40 0.001 13.6 0.7 23 200-222 97-119 (136) 197 KOG2164 consensus 20.1 55 0.0014 12.8 1.5 16 217-232 203-220 (513) 198 pfam02008 zf-CXXC CXXC zinc fi 20.0 26 0.00067 14.7 -0.3 12 217-228 20-31 (48) No 1 >PRK11106 queuosine biosynthesis protein QueC; Provisional Probab=100.00 E-value=0 Score=322.11 Aligned_cols=211 Identities=44% Similarity=0.843 Sum_probs=177.4 Q ss_pred CCEEEEEECCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECHHHCCCCCC Q ss_conf 94899980965699999999974983999997677887772799999999999998887520356158750110322444 Q gi|254781031|r 9 PSALLLFSGGQDSSTCLSWALDRFDRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDHILPLAILGDISHS 88 (240) Q Consensus 9 ~~av~l~SGGlDS~~~l~~l~~~~~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 88 (240) ||||+|||||+||+++|+++++++.+|++|||||||||..+.+++..+++.+. ...+.+++++.++....+ T Consensus 2 kkavVLlSGGlDStt~L~~a~~~~~~v~alsfdYGQrh~~El~~A~~ia~~~g---------v~~h~vidl~~l~~~~~S 72 (231) T PRK11106 2 KRAVVVFSGGQDSTTCLIQALQQYDEVHCITFDYGQRHRAEIDVARELALKLG---------ARAHKVLDVTLLNELAVS 72 (231) T ss_pred CEEEEEECCCHHHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHHHCC---------CCCEEEECCHHHHHHCCC T ss_conf 80899907878999999999981994999987778671999999999999859---------840067532777775235 Q ss_pred CCCCCHHH----HHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCC Q ss_conf 33222011----00012331011111112233377777743399365114544443455544302477777677530643 Q gi|254781031|r 89 SLTKNVAM----KIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSGYPDCRHDTIRAIETAINLGMES 164 (240) Q Consensus 89 ~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 164 (240) ..+..... .......+..++|+||++++++++++|..+|+..|++|.+.+|++.+|||+++|.++++.++..+... T Consensus 73 aLt~~~i~vp~~~~~~~~~p~T~VP~RN~ifLsiAaa~Ae~~ga~~I~~G~~~~D~sgYPDCr~eFi~a~~~al~~g~~~ 152 (231) T PRK11106 73 SLTRDSIPVPDYEPEADGLPNTFVPGRNILFLTLAAIYAYQVKAEAVITGVCETDFSGYPDCRDEFVKALNHAVVLGMAK 152 (231) T ss_pred CCCCCCCCCCCCCCCCCCCCCCEEECCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHCCCC T ss_conf 66777877777654445787646716729999999999998599959980465566889989899999999999851789 Q ss_pred CEEEEEEECCCCHHHHHHHHHHHCCCHHCCCCHHHCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 2167753138988999999998282000143255372162888655556688998687789999999973064 Q gi|254781031|r 165 HVTVHTPLMWLKKYETWKLAQDIGGQDLVNLILEESHTCYLGKRDKRYEWGYGCNSCPACYLRQKGWMEFKEK 237 (240) Q Consensus 165 ~~~i~~P~~~~tK~eI~~~~~~~~~~~~~~~~~~~T~SC~~~~~~~~~~~~~hCG~C~~C~~R~~af~~~~~~ 237 (240) +++|++||++++|.|||+++.+.+ .+++|+..|||||+|.+ +.|||+|.+|++|++||.|++.. T Consensus 153 ~i~i~aPl~~l~K~eiv~l~~~~~---~L~~p~~~TwSCY~g~~------g~~CG~C~sC~~R~~gF~e~~~~ 216 (231) T PRK11106 153 DIRFETPLMWLDKAETWALADYYG---QLDLVRHETLTCYNGIK------GDGCGHCAACHLRANGLNHYLAN 216 (231) T ss_pred CEEEEECCCCCCHHHHHHHHHHCC---CCCCCCCCCCCCCCCCC------CCCCCCCCCHHHHHHHHHHHCCC T ss_conf 819983777788889999998616---66865023552759799------99899893288899999995558 No 2 >COG0603 Predicted PP-loop superfamily ATPase [General function prediction only] Probab=100.00 E-value=0 Score=308.57 Aligned_cols=209 Identities=44% Similarity=0.740 Sum_probs=181.2 Q ss_pred CCCCEEEEEECCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECHHHCCCC Q ss_conf 59948999809656999999999749839999976778877727999999999999988875203561587501103224 Q gi|254781031|r 7 KAPSALLLFSGGQDSSTCLSWALDRFDRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDHILPLAILGDIS 86 (240) Q Consensus 7 ~~~~av~l~SGGlDS~~~l~~l~~~~~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 86 (240) +.+|||++||||+||+++|+|+++++++|++|||||||||..+.+.+..+++++ ..++++++.+.++... T Consensus 1 ~~~kavvl~SGG~DStt~l~~a~~~~~ev~alsfdYGQrh~~Ele~A~~iak~l----------gv~~~iid~~~~~~~~ 70 (222) T COG0603 1 MMKKAVVLLSGGLDSTTCLAWAKKEGYEVHALTFDYGQRHRKELEAAKELAKKL----------GVPHHIIDVDLLGEIG 70 (222) T ss_pred CCCEEEEECCCCHHHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHHHC----------CCCEEEECHHHHHHCC T ss_conf 983499991688138999999996098899997507898689999999999981----------9982895416875358 Q ss_pred CCCCCCCHHHHHC----CCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHCC Q ss_conf 4433222011000----123310111111122333777777433993651145444434555443024777776775306 Q gi|254781031|r 87 HSSLTKNVAMKIQ----DNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSGYPDCRHDTIRAIETAINLGM 162 (240) Q Consensus 87 ~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 162 (240) .+..+......+. ....+..|+|+||.++++++.++|..+|+..|++|.+.+|++++|||+++|.++++.++..++ T Consensus 71 ~saLtd~~~~vp~~~~~~~~~p~t~VP~RN~iflsiA~~~Ae~~g~~~I~~Gv~~~D~sgYPDcrpefi~a~~~~~~l~~ 150 (222) T COG0603 71 GSALTDDSIDVPKYEFAEEEIPATFVPARNLIFLSIAAAYAEALGADAIIIGVNEEDFSGYPDCRPEFIEALNEALNLGT 150 (222) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCEEECCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHC T ss_conf 88675888636666666666864476066099999999999875998599975321047899888899999999998625 Q ss_pred CCCEE-EEEEECCCCHHHHHHHHHHHCCCHHCCCCHHHCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 43216-7753138988999999998282000143255372162888655556688998687789999999973064 Q gi|254781031|r 163 ESHVT-VHTPLMWLKKYETWKLAQDIGGQDLVNLILEESHTCYLGKRDKRYEWGYGCNSCPACYLRQKGWMEFKEK 237 (240) Q Consensus 163 ~~~~~-i~~P~~~~tK~eI~~~~~~~~~~~~~~~~~~~T~SC~~~~~~~~~~~~~hCG~C~~C~~R~~af~~~~~~ 237 (240) ...++ +.+|+++++|.|||+++.++| +++..|||||+|.... |||+|.+|.+|++||.++|.. T Consensus 151 ~~~~~~i~aPl~~l~Ka~iv~l~~elg------~~~~~T~SCY~g~~~~------~CG~C~sC~~R~~af~~~g~~ 214 (222) T COG0603 151 EKGVRIIHAPLMELTKAEIVKLADELG------VPLELTWSCYNGGEGD------HCGECESCVLRERAFKEAGLE 214 (222) T ss_pred CCCCCEEECCEEECCHHHHHHHHHHHC------CCCHHCEEEECCCCCC------CCCCCHHHHHHHHHHHHHCCC T ss_conf 578617767833134999999888868------8510026776799988------787777789999999983786 No 3 >TIGR00364 TIGR00364 exsB protein; InterPro: IPR004479 This protein family is represented by a single member in nearly every completed large (> 1000 genes) prokaryotic genome. In Sinorhizobium meliloti, a species in which the exo genes make succinoglycan, a symbiotically important exopolysaccharide, exsB is located nearby and affects succinoglycan levels, probably through polar effects on exsA expression or the same polycistronic mRNA , . In Arthrobacter viscosus, the homologous gene is designated alu1 and is associated with an aluminum tolerance phenotype. When expressed in Escherichia coli, it conferred aliminium tolerance . The entry also contains the gene queC, which is responsible for the conversion of GTP to 7-cyano-7-deazaguanine (preQ0). The biosynthesis of hypermodified tRNA nucleoside queuosine only occurs in eubacteria. It occupies the wobble position for all known tRNAs that are specific for Asp, Asn, His or Tyr .. Probab=100.00 E-value=0 Score=316.27 Aligned_cols=206 Identities=44% Similarity=0.779 Sum_probs=183.0 Q ss_pred EEEEEECCHHHHHHHHHHHHCC--CEEEEEEEECCCC-CCCCHHHHHHHHHHHHHHHHHHHHCCCCCE-EEECHHHCCCC Q ss_conf 8999809656999999999749--8399999767788-777279999999999999888752035615-87501103224 Q gi|254781031|r 11 ALLLFSGGQDSSTCLSWALDRF--DRVETLSFDYGQR-NKVELECRLCVRKKIVELMPKWKDSLGEDH-ILPLAILGDIS 86 (240) Q Consensus 11 av~l~SGGlDS~~~l~~l~~~~--~~v~~i~~~ygq~-~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~ 86 (240) ||++||||+||++++++++++| ++|++|||||||| |..+.+.|..+++.+. .+++ .++++.++... T Consensus 1 Av~~lSGG~DStT~~~~a~~~~GkyeV~a~TF~YGQR~H~~Ele~A~~ia~~Lg----------i~~~~~~Dl~~l~~l~ 70 (227) T TIGR00364 1 AVVVLSGGQDSTTVLLIALDEGGKYEVHAITFDYGQRAHSRELESARKIAEALG----------IRHHFVIDLSLLKQLG 70 (227) T ss_pred CEEEEECCHHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHC----------CCCEEEECHHHHHHHH T ss_conf 968743734689999999961795079985465013789999999999999808----------9707861779998554 Q ss_pred C-CCCCCCHHHHHC---C-------CCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCHHHHHHHH Q ss_conf 4-433222011000---1-------2331011111112233377777743399365114544443455544302477777 Q gi|254781031|r 87 H-SSLTKNVAMKIQ---D-------NNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSGYPDCRHDTIRAIE 155 (240) Q Consensus 87 ~-~~~~~~~~~~~~---~-------~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 155 (240) . +.++........ + ...+..|+|+||++|++++.++|.+.|+..|.+|+...|++++|||+++|.++++ T Consensus 71 ~YsaLt~~~~~~~~~~~e~d~~~~~E~~p~tfVPgRN~~fl~~a~~yA~~~gA~~v~~G~~~~DfSgYPDCr~EFvka~n 150 (227) T TIGR00364 71 KYSALTDEQEIPEQKSNEEDKEKALETLPNTFVPGRNLIFLSLAASYAEALGAEAVITGVCETDFSGYPDCRDEFVKALN 150 (227) T ss_pred CCCCCCCCCCCCCCCCCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCHHHHHHCCCCCCCCCCCCCCHHHHHHHH T ss_conf 04523578756766665320235653113333780479999999998886184252200100036579888688999999 Q ss_pred HHHHHC-CCCCEEE-----EEEECCCCHHHHHHHHHHHCCCHHCCCCHHHCCCCCCCCCCCCCCCCCCCCCCHHHHHH Q ss_conf 677530-6432167-----75313898899999999828200014325537216288865555668899868778999 Q gi|254781031|r 156 TAINLG-MESHVTV-----HTPLMWLKKYETWKLAQDIGGQDLVNLILEESHTCYLGKRDKRYEWGYGCNSCPACYLR 227 (240) Q Consensus 156 ~~~~~~-~~~~~~i-----~~P~~~~tK~eI~~~~~~~~~~~~~~~~~~~T~SC~~~~~~~~~~~~~hCG~C~~C~~R 227 (240) .++.++ +..+|+| .+|++++||.|||++|.++| ...++++++.|||||.|...-...+..|||+|.+|.+| T Consensus 151 ~~l~Lgn~~~~v~i~Piql~aPL~~ltKaeiv~la~elG-kk~l~lv~~~T~sCY~G~~~ilhad~~~Cg~C~sC~lR 227 (227) T TIGR00364 151 HALNLGNMLTPVEIRPIQLEAPLMDLTKAEIVKLADELG-KKLLDLVIKETYSCYAGGRKILHADGDGCGKCESCMLR 227 (227) T ss_pred HHHHHHCCCCCCEEEEEEEECCEEECCHHHHHHHHHHHC-CCEEEEEECCCCHHHHCCCEEEEECCCCCCCCCCCCCC T ss_conf 999974126881120246666701269899999988735-11231100011111316730564237887987435579 No 4 >cd01995 ExsB ExsB is a transcription regulator related protein. It is a subfamily of a Adenosine nucleotide binding superfamily of proteins. This protein family is represented by a single member in nearly every completed large ( 1000 genes) prokaryotic genome. In Rhizobium meliloti, a species in which the exo genes make succinoglycan, a symbiotically important exopolysaccharide, exsB is located nearby and affects succinoglycan levels, probably through polar effects on exsA expression or the same polycistronic mRNA. In Arthrobacter viscosus, the homologous gene is designated ALU1 and is associated with an aluminum tolerance phenotype. The function is unknown Probab=100.00 E-value=1.4e-45 Score=302.18 Aligned_cols=169 Identities=46% Similarity=0.813 Sum_probs=154.3 Q ss_pred CEEEEEECCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECHHHCCCCCCC Q ss_conf 48999809656999999999749839999976778877727999999999999988875203561587501103224443 Q gi|254781031|r 10 SALLLFSGGQDSSTCLSWALDRFDRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDHILPLAILGDISHSS 89 (240) Q Consensus 10 ~av~l~SGGlDS~~~l~~l~~~~~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 89 (240) |||+|||||+||++++++++++|++|++|||||||||..+.+++..+++.+. T Consensus 1 kavvllSGGlDSt~~l~~~~~~g~~v~~l~~dYGQr~~~E~~~a~~i~~~l~---------------------------- 52 (169) T cd01995 1 KAVVLLSGGLDSTTCLAWAKKEGYEVHALSFDYGQRHAKEEEAAKLIAEKLG---------------------------- 52 (169) T ss_pred CEEEEECCCHHHHHHHHHHHHCCCEEEEEEEECCCCHHHHHHHHHHHHHHHC---------------------------- T ss_conf 9899907878899999999984996999997569854999999999999968---------------------------- Q ss_pred CCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEE Q ss_conf 32220110001233101111111223337777774339936511454444345554430247777767753064321677 Q gi|254781031|r 90 LTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSGYPDCRHDTIRAIETAINLGMESHVTVH 169 (240) Q Consensus 90 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~ 169 (240) ...++|+||++++++++++|..+|++.|++|.+.+|++.+||++++|.+++++++..+.+.+++|+ T Consensus 53 --------------~~~~VP~RN~ifls~A~~~A~~~g~~~v~~G~~~~d~~~ypDc~~~F~~a~~~~~~~~~~~~v~i~ 118 (169) T cd01995 53 --------------PSTYVPARNLIFLSIAAAYAEALGAEAIIIGVNAEDYSGYPDCRPEFIEAMNKALNLGTENGIKIH 118 (169) T ss_pred --------------CCCEECCCHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHCCCCCCEEE T ss_conf --------------994652838999999999999829996899982477568999869999999999983087985898 Q ss_pred EEECCCCHHHHHHHHHHHCCCHHCCCCHHHCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 5313898899999999828200014325537216288865555668899868778999999997 Q gi|254781031|r 170 TPLMWLKKYETWKLAQDIGGQDLVNLILEESHTCYLGKRDKRYEWGYGCNSCPACYLRQKGWME 233 (240) Q Consensus 170 ~P~~~~tK~eI~~~~~~~~~~~~~~~~~~~T~SC~~~~~~~~~~~~~hCG~C~~C~~R~~af~~ 233 (240) +||++++|.||++++.++| +|+..|||||+|.+ .|||+|.+|++|++||.| T Consensus 119 aP~~~~~K~eiv~~g~~lg------v~~~~T~SCy~g~~-------~~CG~C~sC~~R~~aF~e 169 (169) T cd01995 119 APLIDLSKAEIVRLGGELG------VPLELTWSCYNGGE-------KHCGECDSCLLRKRAFEE 169 (169) T ss_pred ECCCCCCHHHHHHHHHHCC------CCHHHCCCCCCCCC-------CCCCCCHHHHHHHHHHCC T ss_conf 4333598999999998849------97998215779998-------989899779999999639 No 5 >pfam06508 ExsB ExsB. This family includes putative transcriptional regulators from Bacteria and Archaea. Probab=99.91 E-value=2.6e-24 Score=169.76 Aligned_cols=133 Identities=46% Similarity=0.801 Sum_probs=107.2 Q ss_pred CEEEEEECCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECHHHCCCCCCC Q ss_conf 48999809656999999999749839999976778877727999999999999988875203561587501103224443 Q gi|254781031|r 10 SALLLFSGGQDSSTCLSWALDRFDRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDHILPLAILGDISHSS 89 (240) Q Consensus 10 ~av~l~SGGlDS~~~l~~l~~~~~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 89 (240) |||++||||+||+++|++++++++++++|||||||+|..+.+++..+++.+ ..++++++++.++....+. T Consensus 1 Kavvl~SGG~DSt~~l~~a~~~~~~v~ait~dYGQ~~~~Ei~~A~~ia~~l----------~i~h~vidl~~l~~~~~sa 70 (137) T pfam06508 1 KAVVLLSGGLDSTTCLAWAKKEGYEVYALTFDYGQRHSKELECAKKIAKAL----------GVEHKIVDLDFLKQIGGSA 70 (137) T ss_pred CEEEEECCCHHHHHHHHHHHHCCCEEEEEEECCCCCHHHHHHHHHHHHHHC----------CCCCEEECCCHHHHHCCCC T ss_conf 989991787899999999998699689998147887399999999999982----------9975030330244416552 Q ss_pred CCCCHH----HHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCHHHHH Q ss_conf 322201----100012331011111112233377777743399365114544443455544302477 Q gi|254781031|r 90 LTKNVA----MKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSGYPDCRHDTIR 152 (240) Q Consensus 90 ~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~~~~~~~~~ 152 (240) +..... ...........++|+||+++++++.++|..+|++.|++|.+.+|+..+|||+++|.+ T Consensus 71 L~~~~~~vp~~~~~~~~~~~t~VP~RN~iflsiA~a~A~~~g~~~I~~G~~~~D~~~YPDcr~eFi~ 137 (137) T pfam06508 71 LTDDSIEVPDSELESEEIPNTYVPGRNLIFLSIAASYAEAIGANDIFIGVNEEDFSGYPDCRPEFVK 137 (137) T ss_pred CCCCCCCCCCCCCCCCCCCCCEEECCHHHHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCHHHCC T ss_conf 2368855566543456787765636549999999999998699979995655756889998867619 No 6 >pfam02568 ThiI Thiamine biosynthesis protein (ThiI). ThiI is required for thiazole synthesis, required for thiamine biosynthesis. Probab=99.72 E-value=2.5e-16 Score=120.26 Aligned_cols=166 Identities=19% Similarity=0.165 Sum_probs=105.0 Q ss_pred CCCEEEEEECCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHH-CCCCCEEEECHHHCCCC Q ss_conf 99489998096569999999997498399999767788777279999999999999888752-03561587501103224 Q gi|254781031|r 8 APSALLLFSGGQDSSTCLSWALDRFDRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKD-SLGEDHILPLAILGDIS 86 (240) Q Consensus 8 ~~~av~l~SGGlDS~~~l~~l~~~~~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~ 86 (240) ..|+|+|||||+||.+++|+++++|.+|++|||+-+.. ..+.+......+.+...++.. ....++.++....-. T Consensus 3 ~gk~l~LlSGGiDSpVAa~lmmkRG~~V~~vhf~~~p~---~~~~~~~k~~~l~~~l~~y~~~~~~~l~vv~~~~~~~-- 77 (197) T pfam02568 3 QGKVLALLSGGIDSPVAAYLMMRRGCRVVALHFINEPG---TSEEAIEKVRKLAELLAEYGTSCEGKLVVVDFTKVQK-- 77 (197) T ss_pred CCCEEEEECCCCHHHHHHHHHHHCCCEEEEEEEECCCC---CCHHHHHHHHHHHHHHHHHCCCCCCEEEEECCHHHHH-- T ss_conf 77189886687129999999998799799999879999---9899999999999999973788754499957499999-- Q ss_pred CCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCE Q ss_conf 44332220110001233101111111223337777774339936511454444345554430247777767753064321 Q gi|254781031|r 87 HSSLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSGYPDCRHDTIRAIETAINLGMESHV 166 (240) Q Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 166 (240) .. ................++.++..+|...|+..+.+|.+-++.+..+-. ++......... T Consensus 78 ------~i----~~~~~~~~~cv~cKr~M~r~A~~iA~~~ga~~IVTGEsLGQVaSQTl~---------nl~~i~~~~~~ 138 (197) T pfam02568 78 ------EI----IEKAPEKYRCVLCKRCMYRAAEKVAEEEGADALVTGESLGQVASQTLD---------NLRVISAATNL 138 (197) T ss_pred ------HH----HHCCCCCCEEHHHHHHHHHHHHHHHHHCCCCEEEECCCHHHHHHCCCC---------HHHHHHHHHCC T ss_conf ------99----962898765456999999999999998499899847303121004531---------06999876248 Q ss_pred EEEEEECCCCHHHHHHHHHHHCCCHHCCCCH Q ss_conf 6775313898899999999828200014325 Q gi|254781031|r 167 TVHTPLMWLKKYETWKLAQDIGGQDLVNLIL 197 (240) Q Consensus 167 ~i~~P~~~~tK~eI~~~~~~~~~~~~~~~~~ 197 (240) .|++|+++++|.||++.++++|.++....++ T Consensus 139 pilRPLig~DK~EIi~~Ar~IGt~~~s~~~~ 169 (197) T pfam02568 139 PILRPLIGLDKEEIINLAKEIGTYEISIEPY 169 (197) T ss_pred CCCCCCCCCCHHHHHHHHHHHCCHHHHCCCC T ss_conf 5313434699999999999948677662888 No 7 >PRK08349 hypothetical protein; Validated Probab=99.71 E-value=1.5e-16 Score=121.68 Aligned_cols=163 Identities=16% Similarity=0.160 Sum_probs=104.7 Q ss_pred CEEEEEECCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHC-CCCCEEEECHHHCCCCCC Q ss_conf 4899980965699999999974983999997677887772799999999999998887520-356158750110322444 Q gi|254781031|r 10 SALLLFSGGQDSSTCLSWALDRFDRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDS-LGEDHILPLAILGDISHS 88 (240) Q Consensus 10 ~av~l~SGGlDS~~~l~~l~~~~~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~ 88 (240) |||+|||||+||.+++|+++++|.+|+++||+.|.. ..+.+..+++.+ ..+... ....+.++..... T Consensus 2 Kvl~LlSGGiDSPVAa~~mmKRG~~V~~lhf~~~~~---~~~kv~~~~~~L----~~~~~~~~~~~~iv~~~~~~----- 69 (198) T PRK08349 2 KVVALLSSGIDSPVAIYLMLSRGVEIYPLHFRQDEK---KEHKARELVEIL----QEIHGGKVKDPVIVDAYEVQ----- 69 (198) T ss_pred EEEEEECCCCCHHHHHHHHHHCCCEEEEEEECCCHH---HHHHHHHHHHHH----HHHCCCCCCEEEEECCHHHH----- T ss_conf 499996588438999999997799799998638778---899999999999----99708887528997722532----- Q ss_pred CCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEE Q ss_conf 33222011000123310111111122333777777433993651145444434555443024777776775306432167 Q gi|254781031|r 89 SLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSGYPDCRHDTIRAIETAINLGMESHVTV 168 (240) Q Consensus 89 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i 168 (240) .................-.....++.++..+|...|+.++.+|.+-++.+..+ +.++.......+..| T Consensus 70 ---~~i~~~i~~~~~~~~~~vl~rr~M~riA~~iA~~~g~~aivTGEsLGQVASQT---------l~NL~~i~~~~~~pV 137 (198) T PRK08349 70 ---GPVFEKLREIGKEKWTCLFCKYTMYRVAERYAHEIGAKAIVTGDSLGQVASQT---------LDNLMVISTATDLPI 137 (198) T ss_pred ---HHHHHHHHHCCCCCCEEHHHHHHHHHHHHHHHHHCCCCEEEECCHHHHHHHHH---------HHHHHHHHHHHCCCC T ss_conf ---89999998607765130999999999999999985998898452167888999---------988999987506766 Q ss_pred EEEECCCCHHHHHHHHHHHCCCHHCCCC Q ss_conf 7531389889999999982820001432 Q gi|254781031|r 169 HTPLMWLKKYETWKLAQDIGGQDLVNLI 196 (240) Q Consensus 169 ~~P~~~~tK~eI~~~~~~~~~~~~~~~~ 196 (240) .+|+++++|.||++.++++|.++....+ T Consensus 138 lRPLig~DK~EII~~Ar~IGTye~S~~~ 165 (198) T PRK08349 138 LRPLIGLDKEEIVRIAKEIGTFEISIEP 165 (198) T ss_pred CCCCCCCCHHHHHHHHHHHCCHHHHCCC T ss_conf 4776679989999999981955343289 No 8 >cd01712 ThiI ThiI is required for thiazole synthesis in the thiamine biosynthesis pathway. It belongs to the Adenosine Nucleotide Hydrolysis suoerfamily and predicted to bind to Adenosine nucleotide. Probab=99.71 E-value=2.7e-16 Score=120.09 Aligned_cols=164 Identities=21% Similarity=0.187 Sum_probs=98.2 Q ss_pred CEEEEEECCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECHHHCCCCCCC Q ss_conf 48999809656999999999749839999976778877727999999999999988875203561587501103224443 Q gi|254781031|r 10 SALLLFSGGQDSSTCLSWALDRFDRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDHILPLAILGDISHSS 89 (240) Q Consensus 10 ~av~l~SGGlDS~~~l~~l~~~~~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 89 (240) |+|+|+|||+||++++|+++++|++|++|||+.+.-. .+.+...+ ......+..+.............. T Consensus 1 kvl~L~SGGiDS~VAa~ll~krG~~V~~l~f~~~~~~---~~~~~~~~---~~~~~~l~~~~~~~~~~~~~~~~~----- 69 (177) T cd01712 1 KALALLSGGIDSPVAAWLLMKRGIEVDALHFNSGPFT---SEKAREKV---EDLARKLARYSPGHKLVVIIFTFF----- 69 (177) T ss_pred CEEEEECCCCHHHHHHHHHHHCCCEEEEEEEECCCCC---CHHHHHHH---HHHHHHHHHHCCCCEEEEEECCHH----- T ss_conf 9899957781699999999987998999999899988---88999999---999999999489962899945389----- Q ss_pred CCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEE Q ss_conf 32220110001233101111111223337777774339936511454444345554430247777767753064321677 Q gi|254781031|r 90 LTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSGYPDCRHDTIRAIETAINLGMESHVTVH 169 (240) Q Consensus 90 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~ 169 (240) ... ............... +..++..+..+|..+|++.+++|...+.... +...++..........|. T Consensus 70 ~~~--~~~~~g~~~npcv~c--kr~m~r~a~~~A~~~ga~~IvTGe~lGQvas---------qt~~nl~~i~~~~~~~il 136 (177) T cd01712 70 VQK--EIYGYGKEKYRCILC--KRMMYRIAEKLAEELGADAIVTGESLGQVAS---------QTLENLLVISSGTDLPIL 136 (177) T ss_pred HHH--HHHHHCCCCCCCEEH--HHHHHHHHHHHHHHCCCCEEEECCCHHHHHH---------HHHHHHHHHHHHHCCCCC T ss_conf 999--999708888986362--9999999999999869989986652220447---------689999999874256421 Q ss_pred EEECCCCHHHHHHHHHHHCCCHHCCCCH Q ss_conf 5313898899999999828200014325 Q gi|254781031|r 170 TPLMWLKKYETWKLAQDIGGQDLVNLIL 197 (240) Q Consensus 170 ~P~~~~tK~eI~~~~~~~~~~~~~~~~~ 197 (240) +|+..+||.||++++++.|.++....+. T Consensus 137 RPL~~~dK~EI~~~A~~igt~~~s~~~~ 164 (177) T cd01712 137 RPLIGFDKEEIIGIARRIGTYDISIRPR 164 (177) T ss_pred CCCCCCCHHHHHHHHHHHCCHHHHCCCC T ss_conf 4888999899999999809674405895 No 9 >PRK01565 thiamine biosynthesis protein ThiI; Provisional Probab=99.71 E-value=3.8e-16 Score=119.19 Aligned_cols=170 Identities=22% Similarity=0.282 Sum_probs=111.9 Q ss_pred CCCCEEEEEECCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECHHHCCCC Q ss_conf 59948999809656999999999749839999976778877727999999999999988875203561587501103224 Q gi|254781031|r 7 KAPSALLLFSGGQDSSTCLSWALDRFDRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDHILPLAILGDIS 86 (240) Q Consensus 7 ~~~~av~l~SGGlDS~~~l~~l~~~~~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 86 (240) ..+++|+|+|||+||.+++|+++++|.++++|||+-+. ...+.+...+..+......+ ......+.++....... T Consensus 175 t~Gk~l~LlSGGIDSPVAa~~mmkRG~~v~~v~f~~~p---~t~~~a~~kv~~l~~~L~~y-~~~~kl~~v~f~~~~~~- 249 (399) T PRK01565 175 TSGKALLLLSGGIDSPVAGYLAMKRGVEIEAVHFHSPP---YTSERAKEKVIDLARILAKY-GGRIKLHVVPFTEIQEE- 249 (399) T ss_pred CCCCEEEEECCCCCHHHHHHHHHHCCCEEEEEEEECCC---CCCHHHHHHHHHHHHHHHHH-CCCCEEEEECCHHHHHH- T ss_conf 77857998617875399999998559879999986899---87789999999999999972-88855999663999999- Q ss_pred CCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCE Q ss_conf 44332220110001233101111111223337777774339936511454444345554430247777767753064321 Q gi|254781031|r 87 HSSLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSGYPDCRHDTIRAIETAINLGMESHV 166 (240) Q Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 166 (240) ...........-....+++.++...|...|+.++.+|.+-++.+..+ +.++.......+. T Consensus 250 -----------i~~~~~~~~~~vl~RR~M~riA~~iA~~~~~~alvTGESLGQVASQT---------l~Nl~~i~~~~~~ 309 (399) T PRK01565 250 -----------IKKKVPESYLMTLMRRFMMRIADKIAEKRGALAIVTGESLGQVASQT---------LESMQAINAVTNL 309 (399) T ss_pred -----------HHHCCCCCEEEHHHHHHHHHHHHHHHHHCCCCEEEECCHHHHHHHHH---------HHHHHHHHHHCCC T ss_conf -----------98608800385999999999999999985997898241355676755---------6778888752177 Q ss_pred EEEEEECCCCHHHHHHHHHHHCCCHHCCCCHHHCCCC Q ss_conf 6775313898899999999828200014325537216 Q gi|254781031|r 167 TVHTPLMWLKKYETWKLAQDIGGQDLVNLILEESHTC 203 (240) Q Consensus 167 ~i~~P~~~~tK~eI~~~~~~~~~~~~~~~~~~~T~SC 203 (240) .|.+|++.++|.||++.+++.|.++....|+. -+| T Consensus 310 pVlRPLIg~DK~EII~~Ar~IGTye~Si~p~e--dcC 344 (399) T PRK01565 310 PVLRPLIGMDKEEIIEIAKEIGTYDISILPYE--DCC 344 (399) T ss_pred CCCCCCCCCCHHHHHHHHHHCCCCHHHCCCCC--CEE T ss_conf 54466557898999999987096011217875--726 No 10 >PRK01269 thiamine biosynthesis protein ThiI; Provisional Probab=99.67 E-value=1.9e-15 Score=114.85 Aligned_cols=164 Identities=19% Similarity=0.158 Sum_probs=105.5 Q ss_pred CCCCEEEEEECCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHH-CCCCCEEEECHHHCCC Q ss_conf 599489998096569999999997498399999767788777279999999999999888752-0356158750110322 Q gi|254781031|r 7 KAPSALLLFSGGQDSSTCLSWALDRFDRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKD-SLGEDHILPLAILGDI 85 (240) Q Consensus 7 ~~~~av~l~SGGlDS~~~l~~l~~~~~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~ 85 (240) ..+++++|+|||+||.+++|+++++|.+++++|||+|... .+.....+++.+. .++.. ....++.++....-.. T Consensus 176 tqGkvL~LlSGGiDSPVAa~~mmKRG~~v~~l~F~lg~~~--~e~~V~~va~~L~---~~~~~~~~vr~~~V~f~~v~~e 250 (483) T PRK01269 176 TQEDVLSLISGGFDSGVASYMLMRRGSRVHYCFFNLGGAA--HEIGVKQVAHYLW---NRYGSSHRVRFISVDFEPVVGE 250 (483) T ss_pred CCCCEEEEECCCCCCHHHHHHHHHCCCEEEEEEECCCCHH--HHHHHHHHHHHHH---HHHCCCCCEEEEEECCHHHHHH T ss_conf 7786899952788838889999845987999995799868--8999999999999---9738888708999767999999 Q ss_pred CCCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCC Q ss_conf 44433222011000123310111111122333777777433993651145444434555443024777776775306432 Q gi|254781031|r 86 SHSSLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSGYPDCRHDTIRAIETAINLGMESH 165 (240) Q Consensus 86 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 165 (240) .........+.-....+++.++..+|...|+.++.+|.+-++-+..+ +.++.......+ T Consensus 251 ------------I~~~v~~~~~~vv~KR~M~R~A~~iA~~~g~~ALVTGESLGQVASQT---------L~NL~~i~~~~~ 309 (483) T PRK01269 251 ------------ILEKVDNGQMGVVLKRMMLRAASKVAERYGIQALVTGEALGQVSSQT---------LTNLRLIDNVTD 309 (483) T ss_pred ------------HHHHCCHHHEEHHHHHHHHHHHHHHHHHCCCCEEEECCCHHHHHHHH---------HHHHHHHHHHCC T ss_conf ------------98738763443999999999999999985998898555034567767---------878899887407 Q ss_pred EEEEEEECCCCHHHHHHHHHHHCCCHHCCCC Q ss_conf 1677531389889999999982820001432 Q gi|254781031|r 166 VTVHTPLMWLKKYETWKLAQDIGGQDLVNLI 196 (240) Q Consensus 166 ~~i~~P~~~~tK~eI~~~~~~~~~~~~~~~~ 196 (240) ..|.+|++.++|.||++.+++.|.+++.+.. T Consensus 310 ~pVlRPLIg~DK~eII~~Ar~IGTye~s~~~ 340 (483) T PRK01269 310 TLILRPLIAMDKEDIIDLARQIGTEDFAKTM 340 (483) T ss_pred CCEECCCCCCCHHHHHHHHHHHCHHHHHCCC T ss_conf 7400476679889999999983928776036 No 11 >COG0301 ThiI Thiamine biosynthesis ATP pyrophosphatase [Coenzyme metabolism] Probab=99.67 E-value=3.3e-15 Score=113.36 Aligned_cols=168 Identities=22% Similarity=0.218 Sum_probs=110.5 Q ss_pred CCCCEEEEEECCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHH-HHHHHCC--CCCEEEECHHHC Q ss_conf 5994899980965699999999974983999997677887772799999999999998-8875203--561587501103 Q gi|254781031|r 7 KAPSALLLFSGGQDSSTCLSWALDRFDRVETLSFDYGQRNKVELECRLCVRKKIVELM-PKWKDSL--GEDHILPLAILG 83 (240) Q Consensus 7 ~~~~av~l~SGGlDS~~~l~~l~~~~~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~-~~~~~~~--~~~~~~~~~~~~ 83 (240) ...|+|+|+|||+||.+++|+++++|.+|+++||+-.. ...+.+ ..+..... .....+. ...+.++....- T Consensus 174 t~Gk~l~LlSGGIDSPVA~~l~mkRG~~v~~v~f~~~p---~~~~~a---~~k~~~l~~~~~~~~~~~~~~~~v~f~~v~ 247 (383) T COG0301 174 TQGKVLLLLSGGIDSPVAAWLMMKRGVEVIPVHFGNPP---YTSEKA---REKVVALALLRLTSYGGKVRLYVVPFTEVQ 247 (383) T ss_pred CCCCEEEEEECCCCHHHHHHHHHHCCCEEEEEEECCCC---CCHHHH---HHHHHHHHHHHHCCCCCCEEEEEECHHHHH T ss_conf 67748999707877499999998569877999972799---851889---999999986430246872289997629999 Q ss_pred CCCCCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHCCC Q ss_conf 22444332220110001233101111111223337777774339936511454444345554430247777767753064 Q gi|254781031|r 84 DISHSSLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSGYPDCRHDTIRAIETAINLGME 163 (240) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 163 (240) . ............-.....++.++.-++...|+.++.+|..-++-+..+ +.+|...... T Consensus 248 ~------------~i~~~~~~~y~~v~~rR~M~riA~~iae~~g~~aIvtGEsLGQVASQT---------l~nL~~i~~~ 306 (383) T COG0301 248 E------------EILEKVPESYRCVLLKRMMYRIAEKLAEEFGAKAIVTGESLGQVASQT---------LENLRVIDSV 306 (383) T ss_pred H------------HHHHHCCCCCEEHHHHHHHHHHHHHHHHHHCCEEEEECCCCHHHHHHH---------HHHHHHHHHC T ss_conf 9------------998636753020478999999999999983983998368610466767---------8889999815 Q ss_pred CCEEEEEEECCCCHHHHHHHHHHHCCCHHCCCCHHHCCCC Q ss_conf 3216775313898899999999828200014325537216 Q gi|254781031|r 164 SHVTVHTPLMWLKKYETWKLAQDIGGQDLVNLILEESHTC 203 (240) Q Consensus 164 ~~~~i~~P~~~~tK~eI~~~~~~~~~~~~~~~~~~~T~SC 203 (240) .+..|++|++.++|.||++.++++|.++..-.++. .+| T Consensus 307 t~~pIlRPLI~~DK~eIi~~Ar~IgT~eiSi~p~e--~cc 344 (383) T COG0301 307 TNTPVLRPLIGLDKEEIIEIARRIGTYEISIEPPE--DCC 344 (383) T ss_pred CCCCEECCCCCCCHHHHHHHHHHHCCHHHHCCCCC--CCC T ss_conf 58742314457998999999998297443135788--885 No 12 >TIGR00342 TIGR00342 thiamine biosynthesis/tRNA modification protein ThiI; InterPro: IPR003720 Thiamine pyrophosphate (TPP) is synthesized de novo in many bacteria and is a required cofactor for many enzymes in the cell. ThiI is required for thiazole synthesis in the thiamine biosynthesis pathway . Almost all the proteins in this group have an N-terminal THUMP domain (see IPR004114 from INTERPRO).; GO: 0009228 thiamin biosynthetic process. Probab=99.66 E-value=6.1e-16 Score=117.87 Aligned_cols=172 Identities=20% Similarity=0.157 Sum_probs=113.0 Q ss_pred CCCCEEEEEECCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHH----HHHCCCCCEEEECHHH Q ss_conf 599489998096569999999997498399999767788777279999999999999888----7520356158750110 Q gi|254781031|r 7 KAPSALLLFSGGQDSSTCLSWALDRFDRVETLSFDYGQRNKVELECRLCVRKKIVELMPK----WKDSLGEDHILPLAIL 82 (240) Q Consensus 7 ~~~~av~l~SGGlDS~~~l~~l~~~~~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~ 82 (240) .-+|||+|+|||+||.+++|++++||++|++|||+-...| .+ ...+++..++.. -......++.++.... T Consensus 185 ~~Gkvl~LlSGGiDSPVAaf~~m~RGc~V~~vhf~~~~~~---~~---~~~~kv~~la~~~~~~~~~~~~~l~~~~F~~~ 258 (391) T TIGR00342 185 TQGKVLALLSGGIDSPVAAFLAMKRGCRVVAVHFFNEPAA---SE---KAREKVERLANLLSLNETGGSVKLYVVDFTDV 258 (391) T ss_pred CHHHHHHHHCCCCCHHHHHHHHHHCCCEEEEEEECCCCCC---CH---HHHHHHHHHHHHHHHCCCEEEEEEEEECHHHH T ss_conf 2536320411883516799999663877999973288552---46---89999999999885400037999998543899 Q ss_pred CCCCCCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHCC Q ss_conf 32244433222011000123310111111122333777777433993651145444434555443024777776775306 Q gi|254781031|r 83 GDISHSSLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSGYPDCRHDTIRAIETAINLGM 162 (240) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 162 (240) -... ..........-.....++.+|..++...|+.++.+|..-++-+..+ +.++..... T Consensus 259 ~~~~------------~~~~~e~~~~v~~rR~M~~~A~~~ae~~g~~A~VTGe~LGQVASQT---------L~Nl~vI~~ 317 (391) T TIGR00342 259 QEEI------------IEIIPEKYTMVLCRRLMLKIASKVAEKEGCLAIVTGESLGQVASQT---------LENLRVIQA 317 (391) T ss_pred HHHH------------HHCCCCCCEEEEHHHHHHHHHHHHHHHCCCCEEEECCCCCCHHHHH---------HHHHHHHHH T ss_conf 9999------------8418998788740667999999888860994899766342037778---------889999972 Q ss_pred CCCEEEEE-EECCCCHHHHHHHHHHHCCCHHCCCCHHHCCCCCCCC Q ss_conf 43216775-3138988999999998282000143255372162888 Q gi|254781031|r 163 ESHVTVHT-PLMWLKKYETWKLAQDIGGQDLVNLILEESHTCYLGK 207 (240) Q Consensus 163 ~~~~~i~~-P~~~~tK~eI~~~~~~~~~~~~~~~~~~~T~SC~~~~ 207 (240) ..+..|.. |+++++|.+|+++|+++|.++..-.+.. -+|+-+. T Consensus 318 ~~~~~iL~RPLIg~DK~~Ii~~Ak~IgTyeiSi~~~~--~Cc~~~~ 361 (391) T TIGR00342 318 VVNTPILRRPLIGMDKEEIIELAKEIGTYEISIEPHE--DCCTIFK 361 (391) T ss_pred CCCCCEEECCCCCCCHHHHHHHHHHCCCEEEEEECCC--CCCCEEC T ss_conf 3787424077554797899999741396214441257--8863002 No 13 >PRK08384 thiamine biosynthesis protein ThiI; Provisional Probab=99.45 E-value=2e-12 Score=96.08 Aligned_cols=132 Identities=17% Similarity=0.171 Sum_probs=86.6 Q ss_pred CCCCEEEEEECCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECHHHCCCC Q ss_conf 59948999809656999999999749839999976778877727999999999999988875203561587501103224 Q gi|254781031|r 7 KAPSALLLFSGGQDSSTCLSWALDRFDRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDHILPLAILGDIS 86 (240) Q Consensus 7 ~~~~av~l~SGGlDS~~~l~~l~~~~~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 86 (240) ...++++|+|||+ |.+++|+++++|.+++++||+.|.+ ..+ .++.+.....++. .....+.+... T Consensus 178 s~Gk~l~LlSGGi-SPVAa~~mmKRG~~v~~vhf~~~~~---~~~----kv~~l~~~L~~y~-~~~~~~~~~~~------ 242 (310) T PRK08384 178 TQGKMVGLLSDEL-SAVAIFLMMKRGVEVIPVHIGMGEK---NLE----KVRKLWNQLKKYS-YGSKGRLVVVK------ 242 (310) T ss_pred CCCCEEEEECCCC-CHHHHHHHHHCCCEEEEEEECCCHH---HHH----HHHHHHHHHHHHC-CCCCEEEEEEC------ T ss_conf 7884899953886-3999999985698799998568878---999----9999999999867-99846999966------ Q ss_pred CCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCE Q ss_conf 44332220110001233101111111223337777774339936511454444345554430247777767753064321 Q gi|254781031|r 87 HSSLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSGYPDCRHDTIRAIETAINLGMESHV 166 (240) Q Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 166 (240) ....+...|..+|+.++.+|.+-++.+..+ +.++.......+. T Consensus 243 ----------------------------~~~~a~~ia~~~~~~alvTGEsLGQVASQT---------l~nl~~i~~~~~~ 285 (310) T PRK08384 243 ----------------------------NFERVNKIIRDFGAKGVVKGLRPEQLASET---------LENIYEDSRMFDV 285 (310) T ss_pred ----------------------------HHHHHHHHHHHHCCCEEEECCCHHHHHHHH---------HHHHHHHHHCCCC T ss_conf ----------------------------599999999985998999774125545457---------9889999730488 Q ss_pred EEEEEECCCCHHHHHHHHHHHCCC Q ss_conf 677531389889999999982820 Q gi|254781031|r 167 TVHTPLMWLKKYETWKLAQDIGGQ 190 (240) Q Consensus 167 ~i~~P~~~~tK~eI~~~~~~~~~~ 190 (240) .|.+|++.++|.||++.+++.|.+ T Consensus 286 PVlRPLIg~DK~EII~~Ar~IGTy 309 (310) T PRK08384 286 PVYYPLIALPDEYIEKVKEKIGTF 309 (310) T ss_pred CCCCCCCCCCHHHHHHHHHHCCCC T ss_conf 730385689989999999863889 No 14 >PRK00143 trmU tRNA (5-methylaminomethyl-2-thiouridylate)-methyltransferase; Reviewed Probab=99.37 E-value=5.2e-12 Score=93.54 Aligned_cols=171 Identities=16% Similarity=0.047 Sum_probs=98.7 Q ss_pred CCCCEEEEEECCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECHHHCCCC Q ss_conf 59948999809656999999999749839999976778877727999999999999988875203561587501103224 Q gi|254781031|r 7 KAPSALLLFSGGQDSSTCLSWALDRFDRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDHILPLAILGDIS 86 (240) Q Consensus 7 ~~~~av~l~SGGlDS~~~l~~l~~~~~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 86 (240) .++|++|.+|||+||++.|++|+++|++|+.+|+..+....... ..+..+.+.+..........++++++.... .. T Consensus 3 sk~rV~VamSGGVDSsVaA~LL~~~Gy~V~Gv~m~~w~~~~~~~--~C~~~~d~~dA~~va~~LgIp~~v~d~~~~--f~ 78 (355) T PRK00143 3 SKKRVVVGMSGGVDSSVAAALLKEQGYDVIGLFMKLWDDDDGSG--GCCAEEDIADARAVADKLGIPLYVVDFAKE--FW 78 (355) T ss_pred CCCEEEEEECCCHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCC--CCCCHHHHHHHHHHHHHCCCCEEEECHHHH--HH T ss_conf 88889999167899999999999779958999998876887778--998578899999999986985799686998--76 Q ss_pred CCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCC------------CCCCCHHHHHHH Q ss_conf 44332220110001233101111111223337777774339936511454444345------------554430247777 Q gi|254781031|r 87 HSSLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSG------------YPDCRHDTIRAI 154 (240) Q Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~------------~~~~~~~~~~~~ 154 (240) ......-......+..+.-....++.. .+......|..+|++.+++|.-+.-... .+|.+-+ T Consensus 79 ~~Vi~~f~~~Y~~G~TPNPcv~CN~~I-KFg~l~~~A~~lgad~iATGHYAri~~~~~~~~L~r~~D~~KDQSYf----- 152 (355) T PRK00143 79 DNVFDYFLDEYKAGRTPNPCVLCNKEI-KFKAFLDYALELGADYIATGHYARIRDEDGRYELLRAVDPNKDQSYF----- 152 (355) T ss_pred HHHHHHHHHHHHCCCCCCCCHHCCCCC-CHHHHHHHHHHCCCCEECCCCEEEEEECCCEEEEEECCCCCCCCEEE----- T ss_conf 788999999997699999733408723-39999999987399842335259999159818998678898872057----- Q ss_pred HHHHHHCCCCCEEEEEEECCCCHHHHHHHHHHHCC Q ss_conf 76775306432167753138988999999998282 Q gi|254781031|r 155 ETAINLGMESHVTVHTPLMWLKKYETWKLAQDIGG 189 (240) Q Consensus 155 ~~~~~~~~~~~~~i~~P~~~~tK~eI~~~~~~~~~ 189 (240) |....+..--++..|+-.++|.||.+++++++. T Consensus 153 --L~~l~~~~L~~~~FPLG~~~K~eVR~iA~~~gL 185 (355) T PRK00143 153 --LYQLTQEQLAKLLFPLGELTKPEVREIAAEIGL 185 (355) T ss_pred --EECCCHHHHHHEECCCCCCCHHHHHHHHHHCCC T ss_conf --610579887420758767735899999997589 No 15 >COG0482 TrmU Predicted tRNA(5-methylaminomethyl-2-thiouridylate) methyltransferase, contains the PP-loop ATPase domain [Translation, ribosomal structure and biogenesis] Probab=99.34 E-value=8.4e-12 Score=92.24 Aligned_cols=192 Identities=16% Similarity=0.078 Sum_probs=105.8 Q ss_pred CCCCEEEEEECCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECHHHCCCC Q ss_conf 59948999809656999999999749839999976778877727999999999999988875203561587501103224 Q gi|254781031|r 7 KAPSALLLFSGGQDSSTCLSWALDRFDRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDHILPLAILGDIS 86 (240) Q Consensus 7 ~~~~av~l~SGGlDS~~~l~~l~~~~~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 86 (240) ..+|+|+++|||+||++.|++|.++|++|+.||++-+.-...... +..+.+.+..........+++.++... ..+ T Consensus 2 ~~~kV~v~mSGGVDSSVaA~lLk~QGyeViGl~m~~~~~~~~~~C---~s~~d~~da~~va~~LGIp~~~vdf~~--~y~ 76 (356) T COG0482 2 KKKKVLVGMSGGVDSSVAAYLLKEQGYEVIGLFMKNWDEDGGGGC---CSEEDLRDAERVADQLGIPLYVVDFEK--EFW 76 (356) T ss_pred CCCEEEEECCCCHHHHHHHHHHHHCCCEEEEEEEEEECCCCCCCC---CCHHHHHHHHHHHHHHCCCEEEECHHH--HHH T ss_conf 876799990577789999999997697499999996414788867---725678999999998099559984689--988 Q ss_pred CCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCC---CCCCCHHHHHHHHHHH--HHC Q ss_conf 44332220110001233101111111223337777774339936511454444345---5544302477777677--530 Q gi|254781031|r 87 HSSLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSG---YPDCRHDTIRAIETAI--NLG 161 (240) Q Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~---~~~~~~~~~~~~~~~~--~~~ 161 (240) ......-......+..+.-.....+. +.+.....++...|++.+++|........ ....+..=....+..+ ... T Consensus 77 ~~V~~~f~~~Y~~G~TPNPci~CN~~-iKF~~~l~~a~~lgad~iATGHYar~~~~~~~~~l~r~~D~~KDQsYfL~~~~ 155 (356) T COG0482 77 NKVFEYFLAEYKAGKTPNPCILCNKE-IKFKALLDYAKELGADYIATGHYARQREDEGIELLLRGVDLNKDQSYFLYALS 155 (356) T ss_pred HHHHHHHHHHHHCCCCCCCCHHCCHH-HHHHHHHHHHHHCCCCEEEEEEEEEEECCCCCCCCCCCCCCCCCHHHEECCCC T ss_conf 88888778998579999965104878-89999999999759985787446765437760001467874203000201268 Q ss_pred CCCCEEEEEEECCCCHHHHHHHHHHHCCCHHCCCCHHHCCCCCCC Q ss_conf 643216775313898899999999828200014325537216288 Q gi|254781031|r 162 MESHVTVHTPLMWLKKYETWKLAQDIGGQDLVNLILEESHTCYLG 206 (240) Q Consensus 162 ~~~~~~i~~P~~~~tK~eI~~~~~~~~~~~~~~~~~~~T~SC~~~ 206 (240) ...--++..|+-.++|.||-++|++.+....-+ -..|+=|..| T Consensus 156 ~~ql~~~lFPlG~l~K~evR~iA~~~gL~~a~K--kdS~~ICFi~ 198 (356) T COG0482 156 QEQLERLLFPLGDLEKLEVRPIAAEKGLPTAKK--KDSQGICFIG 198 (356) T ss_pred HHHHHHCCCCCCCCCHHHHHHHHHHCCCCCCCC--CCCCCCEECC T ss_conf 877753225677777789999999759976676--6677622137 No 16 >PRK05370 argininosuccinate synthase; Validated Probab=99.27 E-value=1.4e-10 Score=84.67 Aligned_cols=166 Identities=10% Similarity=0.029 Sum_probs=88.0 Q ss_pred HCCCCCEEEEEECCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECHHH-- Q ss_conf 135994899980965699999999974983999997677887772799999999999998887520356158750110-- Q gi|254781031|r 5 IKKAPSALLLFSGGQDSSTCLSWALDRFDRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDHILPLAIL-- 82 (240) Q Consensus 5 ~~~~~~av~l~SGGlDS~~~l~~l~~~~~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 82 (240) |..-||+|+.||||||+++++.||.++|++|++++.|.||..+...+....-+.++. .. ..++++.... T Consensus 8 ~~~gkKVvLAYSGGLDTSv~l~wL~e~g~eVia~taDvGQ~de~d~~~i~~kA~~~G-------A~--k~~v~D~reefv 78 (447) T PRK05370 8 LPVGQRVGIAFSGGLDTSAALLWMRQKGAVPYAYTANLGQPDEDDYDAIPRRAMEYG-------AE--NARLIDCRAQLV 78 (447) T ss_pred CCCCCEEEEEECCCCHHHHHHHHHHHCCCEEEEEEEECCCCCCHHHHHHHHHHHHHC-------CC--EEEEECCHHHHH T ss_conf 878888999958982799999999875987999999799986133899999999829-------97--899971799999 Q ss_pred CCCCCCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHCC Q ss_conf 32244433222011000123310111111122333777777433993651145444434555443024777776775306 Q gi|254781031|r 83 GDISHSSLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSGYPDCRHDTIRAIETAINLGM 162 (240) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 162 (240) ....+ ....................+..-..........+...|++.+..|...- .+-+..+... .... T Consensus 79 ~~~i~-ai~a~A~y~~~~g~~Y~l~t~laRplia~~lv~~a~~~ga~~iaHGaTGK------GNDQVRFe~~----~~~l 147 (447) T PRK05370 79 AEGIA-AIQCGAFHISTGGVTYFNTTPLGRAVTGTMLVAAMKEDGVNIWGDGSTYK------GNDIERFYRY----GLLT 147 (447) T ss_pred HHHHH-HHHHCCHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCEEEECCCCC------CCCHHHHHHH----HHHH T ss_conf 99999-99736233302576445877307889999999999983998995577678------7438899999----9852 Q ss_pred CCCEEEEEEECCC-------CHHHHHHHHHHHCCC Q ss_conf 4321677531389-------889999999982820 Q gi|254781031|r 163 ESHVTVHTPLMWL-------KKYETWKLAQDIGGQ 190 (240) Q Consensus 163 ~~~~~i~~P~~~~-------tK~eI~~~~~~~~~~ 190 (240) ...++|++|+..+ +..|.++.+++.|.+ T Consensus 148 ~P~lkIiaPwRd~~f~~el~sR~e~i~Ya~~~gIp 182 (447) T PRK05370 148 NPELKIYKPWLDQDFIDELGGRAEMSEFMIACGFD 182 (447) T ss_pred CCCCEEECCCCCHHHHHHCCCHHHHHHHHHHCCCC T ss_conf 99876971433466665415749999999984998 No 17 >cd01998 tRNA_Me_trans tRNA methyl transferase. This family represents tRNA(5-methylaminomethyl-2-thiouridine)-methyltransferase which is involved in the biosynthesis of the modified nucleoside 5-methylaminomethyl-2-thiouridine present in the wobble position of some tRNAs. This family of enzyme only presents in bacteria and eukaryote. The archaeal counterpart of this enzyme performs same function, but is completely unrelated in sequence. Probab=99.26 E-value=1.5e-11 Score=90.63 Aligned_cols=176 Identities=14% Similarity=0.041 Sum_probs=96.2 Q ss_pred CEEEEEECCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECHHHCCCCCCC Q ss_conf 48999809656999999999749839999976778877727999999999999988875203561587501103224443 Q gi|254781031|r 10 SALLLFSGGQDSSTCLSWALDRFDRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDHILPLAILGDISHSS 89 (240) Q Consensus 10 ~av~l~SGGlDS~~~l~~l~~~~~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 89 (240) |+++.+|||+||++.|++|+++|++|+.+|+..+....... .....+.+.+..........++++++.... ..... T Consensus 1 kV~v~mSGGVDSsvaA~LL~~~G~~V~gv~m~~w~~~~~~~--~C~~~~d~~dA~~va~~LgIp~~v~d~~~e--f~~~V 76 (349) T cd01998 1 KVVVAMSGGVDSSVAAALLKEQGYEVIGVFMKNWDEDDGKG--GCCSEEDLKDARRVADQLGIPHYVVNFEKE--YWEKV 76 (349) T ss_pred CEEEECCCCHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCC--CCCCHHHHHHHHHHHHHCCCCEEEECHHHH--HHHHH T ss_conf 98999077899999999998779957999999678876678--988677899999999986996799680998--86888 Q ss_pred CCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCC----C--HHHHHHHHHHHHHCCC Q ss_conf 322201100012331011111112233377777743399365114544443455544----3--0247777767753064 Q gi|254781031|r 90 LTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSGYPDC----R--HDTIRAIETAINLGME 163 (240) Q Consensus 90 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~~~----~--~~~~~~~~~~~~~~~~ 163 (240) ...-......+..+.-....++. +.+......|...|++.+++|.-+.-....... . ...-++.--+...... T Consensus 77 ~~~f~~~Y~~G~TPNPcv~CN~~-IKF~~l~~~A~~~g~d~iATGHYAri~~~~~~~~~L~r~~D~~KDQSYfL~~l~~~ 155 (349) T cd01998 77 FEPFLEEYKKGRTPNPDILCNKE-IKFGALLDYAKKLGADYIATGHYARIEEDNNGRYRLLRGVDPNKDQSYFLSQLSQE 155 (349) T ss_pred HHHHHHHHHCCCCCCCCHHCCCE-EEHHHHHHHHHHCCCCEECCCCEEEEEECCCCCEEEEECCCCCCCCCEEEECCCHH T ss_conf 99999999748999876211873-51999999998759986413514788975898489987689987820564012487 Q ss_pred CCEEEEEEECCCCHHHHHHHHHHHCCC Q ss_conf 321677531389889999999982820 Q gi|254781031|r 164 SHVTVHTPLMWLKKYETWKLAQDIGGQ 190 (240) Q Consensus 164 ~~~~i~~P~~~~tK~eI~~~~~~~~~~ 190 (240) .--++..|+-.++|.||.++|++++.. T Consensus 156 ~L~~~~FPLG~~~K~eVR~iA~~~gl~ 182 (349) T cd01998 156 QLSRLIFPLGDLTKPEVREIAKELGLP 182 (349) T ss_pred HHHHEECCCCCCCHHHHHHHHHHCCCC T ss_conf 661038788786538999999985999 No 18 >pfam03054 tRNA_Me_trans tRNA methyl transferase. This family represents tRNA(5-methylaminomethyl-2-thiouridine)-methyltransferase which is involved in the biosynthesis of the modified nucleoside 5-methylaminomethyl-2-thiouridine present in the wobble position of some tRNAs. Probab=99.25 E-value=1.7e-11 Score=90.30 Aligned_cols=170 Identities=15% Similarity=0.027 Sum_probs=93.7 Q ss_pred CEEEEEECCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECHHHCCCCCCC Q ss_conf 48999809656999999999749839999976778877727999999999999988875203561587501103224443 Q gi|254781031|r 10 SALLLFSGGQDSSTCLSWALDRFDRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDHILPLAILGDISHSS 89 (240) Q Consensus 10 ~av~l~SGGlDS~~~l~~l~~~~~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 89 (240) |++|.+|||+||++.|++|+++|++|+.+|+..+...... ....+....+.+..........++++++.... ..... T Consensus 2 kV~VamSGGVDSsVaA~LL~~~Gy~V~Gv~m~~w~~~~~~-~~~C~~~~d~~dA~~va~~LgIp~~v~d~~~~--f~~~V 78 (354) T pfam03054 2 KVVVAMSGGVDSSVAAYLLKEQGYEVIGVFMKNWDEEDEF-GHGCCSEEDLADAQRVCEQLGIPLYVVNFEKE--YWEKV 78 (354) T ss_pred EEEEEECCCHHHHHHHHHHHHCCCCEEEEEEECCCCCCCC-CCCCCCHHHHHHHHHHHHHCCCCEEEECHHHH--HHHHH T ss_conf 6999904789999999999977996399999956688766-78899889999999999986997899787998--75899 Q ss_pred CCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHH-HCCCCEEECCCCCCCCCC------------CCCCCHHHHHHHHH Q ss_conf 32220110001233101111111223337777774-339936511454444345------------55443024777776 Q gi|254781031|r 90 LTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAY-RLGITNIVIGVCETDYSG------------YPDCRHDTIRAIET 156 (240) Q Consensus 90 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~-~~g~~~i~~~~~~~~~~~------------~~~~~~~~~~~~~~ 156 (240) ...-......+..+.-....++..- +......|. .+|++.+++|.-+.-... .+|.+-+. T Consensus 79 i~~fv~~Y~~G~TPNPcv~CN~~IK-Fg~l~~~A~~~lGad~iATGHYAri~~~~~~~~L~r~~D~~KDQSYfL------ 151 (354) T pfam03054 79 FEPFLDEYKNGRTPNPDILCNREIK-FGALLDYAKQVLGADYLATGHYARVSLNENESELLRGKDKNKDQSYFL------ 151 (354) T ss_pred HHHHHHHHHCCCCCCCCHHHCCCCC-HHHHHHHHHHHCCCCCCCCCCEEEEEECCCEEEEEECCCCCCCCCEEE------ T ss_conf 9999999976999984144189643-899999999866998104474289990499389998999999801786------ Q ss_pred HHHHCCCCCEEEEEEECCCCHHHHHHHHHHHCCC Q ss_conf 7753064321677531389889999999982820 Q gi|254781031|r 157 AINLGMESHVTVHTPLMWLKKYETWKLAQDIGGQ 190 (240) Q Consensus 157 ~~~~~~~~~~~i~~P~~~~tK~eI~~~~~~~~~~ 190 (240) ....+..--++..|+-.++|.||.++++++|.. T Consensus 152 -~~l~q~~L~~~lFPLG~~~K~eVR~iA~~~gL~ 184 (354) T pfam03054 152 -SGLSQEQLEKLLFPLGDLTKEEVRKIAKEAGLP 184 (354) T ss_pred -ECCCHHHHHHEECCCCCCCHHHHHHHHHHCCCC T ss_conf -107898764318287688528999999976998 No 19 >PRK00509 argininosuccinate synthase; Provisional Probab=99.24 E-value=4.2e-10 Score=81.72 Aligned_cols=160 Identities=17% Similarity=0.074 Sum_probs=87.0 Q ss_pred CCCCEEEEEECCHHHHHHHHHHHHC-CCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECHHHCCC Q ss_conf 5994899980965699999999974-983999997677887772799999999999998887520356158750110322 Q gi|254781031|r 7 KAPSALLLFSGGQDSSTCLSWALDR-FDRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDHILPLAILGDI 85 (240) Q Consensus 7 ~~~~av~l~SGGlDS~~~l~~l~~~-~~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 85 (240) +.+|+|++||||||+++++.||.++ |++|++++.|.||.. +.+.. .+.+..... ...++++....-.. T Consensus 1 m~kKVvLAySGGLDTSv~l~wL~e~yg~eVia~~~d~Gq~~--d~~~i-------~~kA~~~GA--~~~~v~D~r~ef~~ 69 (398) T PRK00509 1 MVKKVVLAYSGGLDTSVIIKWLKEEYGCEVVAFTADVGQGE--ELEPI-------REKALASGA--SEIYVEDLREEFVR 69 (398) T ss_pred CCCEEEEEECCCCHHHHHHHHHHHHCCCEEEEEEEECCCHH--HHHHH-------HHHHHHHCC--CEEEEECCHHHHHH T ss_conf 98549999089845999999998753988999999798877--88999-------999998299--78999630999999 Q ss_pred CCCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCHH-HHHHHHHHHHHCCCC Q ss_conf 4443322201100012331011111112233377777743399365114544443455544302-477777677530643 Q gi|254781031|r 86 SHSSLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSGYPDCRHD-TIRAIETAINLGMES 164 (240) Q Consensus 86 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~ 164 (240) ..-.........+....+ ...+..-..........|...|++.+..|...- .+-+. |...+. .... T Consensus 70 ~~i~p~I~ana~Yeg~Yp--L~tslaRplia~~~ve~A~~~ga~aiaHG~TGk------GNDQvRFe~~~~-----aL~P 136 (398) T PRK00509 70 DYVFPAIRANALYEGKYP--LGTALARPLIAKKLVEIARKEGADAVAHGCTGK------GNDQVRFELAIA-----ALAP 136 (398) T ss_pred HHHHHHHHCCCCCCCCEE--EECCCHHHHHHHHHHHHHHHCCCEEEEECCCCC------CCCHHHHHHHHH-----HHCC T ss_conf 999999960854558545--522031889999999999971985998545567------860899999999-----8599 Q ss_pred CEEEEEEECC---CCHHHHHHHHHHHCCC Q ss_conf 2167753138---9889999999982820 Q gi|254781031|r 165 HVTVHTPLMW---LKKYETWKLAQDIGGQ 190 (240) Q Consensus 165 ~~~i~~P~~~---~tK~eI~~~~~~~~~~ 190 (240) .++|++|+.. ++..|.++.+++.|.+ T Consensus 137 ~l~iiaP~Rd~~~~sR~~~i~ya~~~gIp 165 (398) T PRK00509 137 DLKVIAPWREWDLMSREELIAYAEEHGIP 165 (398) T ss_pred CCEEEEEEECCCCCCHHHHHHHHHHCCCC T ss_conf 97686113213416899999999986998 No 20 >TIGR02432 lysidine_TilS_N tRNA(Ile)-lysidine synthetase; InterPro: IPR012795 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . The only examples in which the wobble position of a tRNA must discriminate between G and A of mRNA are AUA (Ile) vs. AUG (Met) and UGA (stop) vs. UGG (Trp). In all bacteria, the wobble position of the tRNA(Ile) recognizing AUA is lysidine, a lysine derivative of cytidine. This domain is found, apparently, in all bacteria in a single copy. Eukaryotic sequences appear to be organellar. The domain architecture of this protein is variable; some, including characterised proteins of Escherichia coli and Bacillus subtilis known to be tRNA(Ile)-lysidine synthetase, include a conserved 50-residue domain that many other members lack. This protein belongs to the ATP-binding PP-loop family. It appears in the literature and protein databases as TilS, YacA, and putative cell cycle protein MesJ (a misnomer).; GO: 0000166 nucleotide binding, 0005524 ATP binding, 0016879 ligase activity forming carbon-nitrogen bonds, 0008033 tRNA processing, 0005737 cytoplasm. Probab=99.23 E-value=1.9e-10 Score=83.81 Aligned_cols=164 Identities=18% Similarity=0.132 Sum_probs=92.6 Q ss_pred CEEEEEECCHHHHHHHHHHHHC----CCE-------EEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEE Q ss_conf 4899980965699999999974----983-------99999767788777279999999999999888752035615875 Q gi|254781031|r 10 SALLLFSGGQDSSTCLSWALDR----FDR-------VETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDHILP 78 (240) Q Consensus 10 ~av~l~SGGlDS~~~l~~l~~~----~~~-------v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (240) +.||.+|||+||+++++.|.+- +.+ +.++|+|++-|..+..+.+. .+...+.+. ..+.+... T Consensus 1 ~~lvAvSGG~DS~aLL~~L~~~~~~~~~~~~~~~~~~~a~h~nH~lR~~s~~~~~~----~v~~~c~~~---~~~~~~~~ 73 (204) T TIGR02432 1 KILVAVSGGVDSMALLHLLLKLQPKLKISWQIKDFKLIAAHVNHGLRPESAQEEAE----FVQQFCEKL---NIPLEIKK 73 (204) T ss_pred CEEEEEECCHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEEECCCCHHHHHHHHH----HHHHHHHHC---CCCEEEEE T ss_conf 97888628642799999999976632787788750689999617788415899999----999999961---89569984 Q ss_pred CHHHCCCCCCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCC-CCEEECCCCCCCCCC------CCCCCHHHH Q ss_conf 0110322444332220110001233101111111223337777774339-936511454444345------554430247 Q gi|254781031|r 79 LAILGDISHSSLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLG-ITNIVIGVCETDYSG------YPDCRHDTI 151 (240) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g-~~~i~~~~~~~~~~~------~~~~~~~~~ 151 (240) ............ ........-.|...+ ...+..+| +..|.++.+.+|..- .+.+...-. T Consensus 74 ~~~~~~~~~~~~----------~~~E~~AR~~RY~~f----~~~~~~~~~~~~i~tAHh~dDq~ET~L~rL~RG~~~~Gl 139 (204) T TIGR02432 74 LVDVKALAKGKK----------KNLEEAAREARYAFF----EEIAKKHGKADYILTAHHADDQAETILMRLLRGSGLRGL 139 (204) T ss_pred EECCCCCCCCCC----------CCHHHHHHHHHHHHH----HHHHHHCCCCEEEEECCCCCCHHHHHHHHHHHCCCCCHH T ss_conf 212411012247----------677899999999999----999997299408997248322799999987516870001 Q ss_pred HHHHHHHHHC---CCCCEEEEEEECCCCHHHHHHHHHHHCCCHHCC Q ss_conf 7777677530---643216775313898899999999828200014 Q gi|254781031|r 152 RAIETAINLG---MESHVTVHTPLMWLKKYETWKLAQDIGGQDLVN 194 (240) Q Consensus 152 ~~~~~~~~~~---~~~~~~i~~P~~~~tK~eI~~~~~~~~~~~~~~ 194 (240) ..+..+-..+ ......+++|++..+|.||.+.+++.+.+...| T Consensus 140 ~g~~~~~~~~~~g~~~~~~~~RPLL~~~k~ei~~y~~~~~l~~~eD 185 (204) T TIGR02432 140 SGMPEIRPLGSLGWYKGGQIIRPLLNISKSEIEEYLKENGLPYFED 185 (204) T ss_pred HHHCCCCCCCCCCCCCCEEEEECCCCCCHHHHHHHHHHCCCCEEEC T ss_conf 1101336633244225632770775677899999999668965657 No 21 >PRK13820 argininosuccinate synthase; Provisional Probab=99.22 E-value=5.5e-10 Score=80.96 Aligned_cols=158 Identities=15% Similarity=0.061 Sum_probs=85.5 Q ss_pred CCCCEEEEEECCHHHHHHHHHHHHC-CC-EEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECHHHCC Q ss_conf 5994899980965699999999974-98-399999767788777279999999999999888752035615875011032 Q gi|254781031|r 7 KAPSALLLFSGGQDSSTCLSWALDR-FD-RVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDHILPLAILGD 84 (240) Q Consensus 7 ~~~~av~l~SGGlDS~~~l~~l~~~-~~-~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 84 (240) ++||+|++||||||+++++.||.++ |+ +|++++.|.||.. ++... +.++ .. ......++++....-. T Consensus 2 ~~kKVvLAySGGLDTSv~l~wL~e~yg~~eVi~~~~d~Gq~~---e~~~~-~~~k----A~---~~G~~~~viD~r~ef~ 70 (395) T PRK13820 2 MKKKVVLAYSGGLDTSVCIPLLKEKYGYDEVITVAVDVGQPE---EEIKE-AEEK----AK---KLGVKHYTIDAKEEFA 70 (395) T ss_pred CCCEEEEEECCCCHHHHHHHHHHHHCCCCEEEEEEEECCCCH---HHHHH-HHHH----HH---HCCCCEEEEECHHHHH T ss_conf 987299996898489999999998519986999999899976---77699-9999----99---7099499981699999 Q ss_pred CCCCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCC Q ss_conf 24443322201100012331011111112233377777743399365114544443455544302477777677530643 Q gi|254781031|r 85 ISHSSLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSGYPDCRHDTIRAIETAINLGMES 164 (240) Q Consensus 85 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 164 (240) ...-.........+.. .....+..-..........|...|+..+..|...-+. ..-+|...+ . -. T Consensus 71 ~~~i~paI~ana~Yeg---YpL~tslaRplia~~~v~~A~~~ga~aiaHG~TGkGN-----DQvRFe~~~----~---ap 135 (395) T PRK13820 71 KDYIFPAIKANALYEG---YPLGTALARPLIAKKIVEVAKKEGASAIAHGCTGKGN-----DQLRFEAVF----R---AT 135 (395) T ss_pred HHHHHHHHHCCCCCCC---CCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCC-----HHHHHHHHH----H---CC T ss_conf 9999999970873267---3032410468999999999997598298307778987-----099999998----6---78 Q ss_pred CEEEEEEECC--CCHHHHHHHHHHHCCC Q ss_conf 2167753138--9889999999982820 Q gi|254781031|r 165 HVTVHTPLMW--LKKYETWKLAQDIGGQ 190 (240) Q Consensus 165 ~~~i~~P~~~--~tK~eI~~~~~~~~~~ 190 (240) .++|++|+.. ++..|.++.+++.|.+ T Consensus 136 ~~~iiaP~R~~~~~R~~~i~ya~~~gI~ 163 (395) T PRK13820 136 DLDVIAPIRELNLTREWEIEYAKEHGIP 163 (395) T ss_pred CCEEEEEECCHHHHHHHHHHHHHHCCCC T ss_conf 9868721045334589999999985998 No 22 >COG0037 MesJ tRNA(Ile)-lysidine synthase MesJ [Cell cycle control, cell division, chromosome partitioning] Probab=99.21 E-value=4.7e-10 Score=81.39 Aligned_cols=157 Identities=18% Similarity=0.101 Sum_probs=88.4 Q ss_pred CCCEEEEEECCHHHHHHHHHHHHCC--CEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECHHHCCC Q ss_conf 9948999809656999999999749--83999997677887772799999999999998887520356158750110322 Q gi|254781031|r 8 APSALLLFSGGQDSSTCLSWALDRF--DRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDHILPLAILGDI 85 (240) Q Consensus 8 ~~~av~l~SGGlDS~~~l~~l~~~~--~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 85 (240) ..++++.+|||.||+++|+.|++.+ .++.++|+|.|++.....+ ...++.+. ... ..+..+......-.. T Consensus 21 ~~~ilVavSGGkDS~~ll~~L~~l~~~~~~~a~~Vd~~~~~~~~~~--~~~~~~~~---~~~---~~~~~v~~~~~~~~~ 92 (298) T COG0037 21 EYKILVAVSGGKDSLALLHLLKELGRRIEVEAVHVDHGLRGYSDQE--AELVEKLC---EKL---GIPLIVERVTDDLGR 92 (298) T ss_pred CCCEEEEECCCHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCHHH--HHHHHHHH---HHC---CCCEEEEEHHHHHHH T ss_conf 7858999378789999999999842257389999708988643289--99999999---964---998488620454310 Q ss_pred CCCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCC------CCCCHHHHHHHHHHHH Q ss_conf 4443322201100012331011111112233377777743399365114544443455------5443024777776775 Q gi|254781031|r 86 SHSSLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSGY------PDCRHDTIRAIETAIN 159 (240) Q Consensus 86 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~------~~~~~~~~~~~~~~~~ 159 (240) . ... ........ ....-......+...|++.|++|.+.+|...+ +.........+.. T Consensus 93 ~--~~~--------~~~~c~~c----~~~R~~~l~~~a~~~g~~~i~tgH~~dD~~et~lm~l~~g~~~~~l~~~~~--- 155 (298) T COG0037 93 E--TLD--------GKSICAAC----RRLRRGLLYKIAKELGADKIATGHHLDDQAETFLMNLLRGSGLRGLRGMPP--- 155 (298) T ss_pred C--CCC--------CCCHHHHH----HHHHHHHHHHHHHHCCCCEEEECCCHHHHHHHHHHHHHCCCCHHHHHCCCC--- T ss_conf 0--246--------76787999----999999999999985999898567846899999999861752245640885--- Q ss_pred HCCCCCE-EEEEEECCCCHHHHHHHHHHHCC Q ss_conf 3064321-67753138988999999998282 Q gi|254781031|r 160 LGMESHV-TVHTPLMWLKKYETWKLAQDIGG 189 (240) Q Consensus 160 ~~~~~~~-~i~~P~~~~tK~eI~~~~~~~~~ 189 (240) ....... .+++|++..++.||...+...+. T Consensus 156 ~~~~~~~~~~iRPL~~~~~~ei~~~~~~~~l 186 (298) T COG0037 156 KRPFEGGLLIIRPLLYVREKEIELYAKEKGL 186 (298) T ss_pred CCCCCCCCEEECCCCCCCHHHHHHHHHHCCC T ss_conf 2445788645475746889999999997599 No 23 >pfam01171 ATP_bind_3 PP-loop family. This family of proteins belongs to the PP-loop superfamily. Probab=99.20 E-value=2.5e-10 Score=83.05 Aligned_cols=151 Identities=17% Similarity=0.104 Sum_probs=85.4 Q ss_pred CEEEEEECCHHHHHHHHHHHH----CCCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECHHHCCC Q ss_conf 489998096569999999997----4983999997677887772799999999999998887520356158750110322 Q gi|254781031|r 10 SALLLFSGGQDSSTCLSWALD----RFDRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDHILPLAILGDI 85 (240) Q Consensus 10 ~av~l~SGGlDS~~~l~~l~~----~~~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 85 (240) |.+|.+|||.||+++++.|.+ .+.++.++|+|+|.+..+..+. . . +...+.. ...+.++...... .. T Consensus 1 ki~vavSGG~DS~~Ll~~l~~~~~~~~~~~~~~hvnh~lr~~s~~~~-~-~---v~~~~~~---~~i~~~i~~~~~~-~~ 71 (182) T pfam01171 1 KILVAVSGGPDSMALLYLLKKLKPKFGIDLTAAHVDHGLREESDREA-Q-F---VKELCRQ---LNIPLEVLRVDVA-KK 71 (182) T ss_pred CEEEEECCCHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCHHHHH-H-H---HHHHHHH---CCCCEEEEEEECC-CC T ss_conf 99999678599999999999999975997899998799865405899-9-9---9999998---5997599997347-66 Q ss_pred CCCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCHHHHHHHH---------H Q ss_conf 4443322201100012331011111112233377777743399365114544443455544302477777---------6 Q gi|254781031|r 86 SHSSLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSGYPDCRHDTIRAIE---------T 156 (240) Q Consensus 86 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~---------~ 156 (240) .... ........|. ......+...|+..+++|.+.+|..- .++..+. . T Consensus 72 ----~~~~---------~e~~aR~~Ry----~~l~~~a~~~~~~~i~lgHh~DD~~E------T~lm~l~rG~~~~gl~g 128 (182) T pfam01171 72 ----SGLN---------LEEAAREARY----DFFEEIAKKNGAEVLLTAHHADDQAE------TFLMRLLRGSGLAGLAG 128 (182) T ss_pred ----CCCC---------HHHHHHHHHH----HHHHHHHHHHCCCEEEEECCCCHHHH------HHHHHHHCCCCCCCCCC T ss_conf ----7877---------7578999999----99999898617664887434232999------99999972898110338 Q ss_pred HHHHCCCCCEEEEEEECCCCHHHHHHHHHHHCCCHH Q ss_conf 775306432167753138988999999998282000 Q gi|254781031|r 157 AINLGMESHVTVHTPLMWLKKYETWKLAQDIGGQDL 192 (240) Q Consensus 157 ~~~~~~~~~~~i~~P~~~~tK~eI~~~~~~~~~~~~ 192 (240) +-......++++++|++..+|.||.+.+...+.+.. T Consensus 129 m~~~~~~~~~~iiRPLl~~~k~ei~~~a~~~~l~~~ 164 (182) T pfam01171 129 IAPVRPLAGGRIVRPLLKVTKSEIEEYLKEHGIPWV 164 (182) T ss_pred CCCCCCCCCCEEECCCCCCCHHHHHHHHHHCCCCEE T ss_conf 875134689438702114889999999998699368 No 24 >COG1606 ATP-utilizing enzymes of the PP-loop superfamily [General function prediction only] Probab=99.18 E-value=2.2e-10 Score=83.39 Aligned_cols=153 Identities=19% Similarity=0.168 Sum_probs=94.6 Q ss_pred CHHHCCCCCEEEEEECCHHHHHHHHHHHHC-CCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECH Q ss_conf 122135994899980965699999999974-9839999976778877727999999999999988875203561587501 Q gi|254781031|r 2 NDIIKKAPSALLLFSGGQDSSTCLSWALDR-FDRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDHILPLA 80 (240) Q Consensus 2 ~~~~~~~~~av~l~SGGlDS~~~l~~l~~~-~~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (240) +..||..+++++.||||+||+++|+.+.+. |.++.++|++...-.+.+.+-+...+++ ....+.+++.. T Consensus 11 ~~~ik~~~kv~vAfSGGvDSslLa~la~~~lG~~v~AvTv~sP~~p~~e~e~A~~~A~~----------iGi~H~~i~~~ 80 (269) T COG1606 11 KKAIKEKKKVVVAFSGGVDSSLLAKLAKEALGDNVVAVTVDSPYIPRREIEEAKNIAKE----------IGIRHEFIKMN 80 (269) T ss_pred HHHHHHCCEEEEEECCCCCHHHHHHHHHHHHCCCEEEEEEECCCCCHHHHHHHHHHHHH----------HCCCCEEEEHH T ss_conf 99986437399996588427999999999735646999971687776660678999999----------48762465500 Q ss_pred HHCCCCCCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 10322444332220110001233101111111223337777774339936511454444345554430247777767753 Q gi|254781031|r 81 ILGDISHSSLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSGYPDCRHDTIRAIETAINL 160 (240) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (240) -+. .. ..........+..-...+.....+...|.+.+..|....|...++. ... +..+ T Consensus 81 ~~~-~~-------------~~~n~~~rCY~CK~~v~~~l~~~a~~~Gyd~V~dGtNasDl~~~RP---G~r-----A~kE 138 (269) T COG1606 81 RMD-PE-------------FKENPENRCYLCKRAVYSTLVEEAEKRGYDVVADGTNASDLFDYRP---GLR-----ALKE 138 (269) T ss_pred HCC-HH-------------HCCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCHHHHCCCCC---CHH-----HHHH T ss_conf 025-23-------------3049987355778999999999999739988974775787337883---023-----5776 Q ss_pred CCCCCEEEEEEE--CCCCHHHHHHHHHHHCCCH Q ss_conf 064321677531--3898899999999828200 Q gi|254781031|r 161 GMESHVTVHTPL--MWLKKYETWKLAQDIGGQD 191 (240) Q Consensus 161 ~~~~~~~i~~P~--~~~tK~eI~~~~~~~~~~~ 191 (240) ..|..|| .++||.||.+.+..+|... T Consensus 139 -----~gi~sPl~e~gitk~eIre~a~~lgl~~ 166 (269) T COG1606 139 -----LGIRSPLAEFGITKKEIREIAKSLGLPT 166 (269) T ss_pred -----CCCCCHHHHHCCCHHHHHHHHHHCCCCC T ss_conf -----0777718883975999999999759984 No 25 >TIGR00032 argG argininosuccinate synthase; InterPro: IPR001518 Argininosuccinate synthase (6.3.4.5 from EC) (AS) is a urea cycle enzyme that catalyzes the penultimate step in arginine biosynthesis: the ATP-dependent ligation of citrulline to aspartate to form argininosuccinate, AMP and pyrophosphate , . In humans, a defect in the AS gene causes citrullinemia, a genetic disease characterised by severe vomiting spells and mental retardation. AS is a homotetrameric enzyme of chains of about 400 amino-acid residues. An arginine seems to be important for the enzyme's catalytic mechanism. The sequences of AS from various prokaryotes, archaebacteria and eukaryotes show significant similarity.; GO: 0004055 argininosuccinate synthase activity, 0005524 ATP binding, 0006526 arginine biosynthetic process. Probab=99.16 E-value=5.2e-10 Score=81.13 Aligned_cols=161 Identities=15% Similarity=0.137 Sum_probs=94.9 Q ss_pred CEEEEEECCHHHHHHHHHHHHC-CC-EEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECHHHCCCCC Q ss_conf 4899980965699999999974-98-399999767788777279999999999999888752035615875011032244 Q gi|254781031|r 10 SALLLFSGGQDSSTCLSWALDR-FD-RVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDHILPLAILGDISH 87 (240) Q Consensus 10 ~av~l~SGGlDS~~~l~~l~~~-~~-~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 87 (240) |+|+.||||||.++++.||.++ |+ +|++++.|-||.-+.+.+.+..-+.++....+ ..|+-. .+... T Consensus 1 KVvLAySGGLDTSv~l~wL~~kyG~~~Via~~~dvGQPDE~d~~~~~~kA~~~GA~~~---------~~iDak--~eFv~ 69 (420) T TIGR00032 1 KVVLAYSGGLDTSVCLKWLEEKYGYEEVIAVTADVGQPDEEDIDAIEEKALKYGAEKH---------YTIDAK--EEFVK 69 (420) T ss_pred CEEEEEECCHHHHHHHHHHHHHHCCCCEEEEEEECCCCCHHHHHHHHHHHHHHCCCEE---------EEEECH--HHHHH T ss_conf 9688860702599999998865289860799975679998888889999887367102---------577236--88999 Q ss_pred C--CCCCCHHHHHCC--CCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHCCC Q ss_conf 4--332220110001--233101111111223337777774339936511454444345554430247777767753064 Q gi|254781031|r 88 S--SLTKNVAMKIQD--NNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSGYPDCRHDTIRAIETAINLGME 163 (240) Q Consensus 88 ~--~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 163 (240) . ....+....+.. ........+..-+.........|...|+.++..|-. .+.+=+..+... + ...- T Consensus 70 dy~f~aiqanA~Ye~~GG~~Y~L~TaLaRPlIA~~lVe~Ak~~Ga~AvaHGCT------GKGNDQ~RFe~~---~-~~~~ 139 (420) T TIGR00032 70 DYLFAAIQANAVYEGTGGLVYPLSTALARPLIAKKLVEVAKKEGAEAVAHGCT------GKGNDQVRFERS---I-RALN 139 (420) T ss_pred HHHHHHHHCCCEECCCCCEECCCCCCCHHHHHHHHHHHHHHHCCCEEEECCCC------CCCCCHHHHHHH---H-HHHC T ss_conf 99888765066031787410347661145799999999984669549870768------777413576588---8-7526 Q ss_pred CCEEEEEEE--CCCC-----HHHHHHHHHHHCCCH Q ss_conf 321677531--3898-----899999999828200 Q gi|254781031|r 164 SHVTVHTPL--MWLK-----KYETWKLAQDIGGQD 191 (240) Q Consensus 164 ~~~~i~~P~--~~~t-----K~eI~~~~~~~~~~~ 191 (240) .+++|++|. ++++ ..|.++.+.+.|-+. T Consensus 140 p~LkviaP~R~~~~~e~lgGR~e~~eYa~~~Gip~ 174 (420) T TIGR00032 140 PDLKVIAPWRDLNLTEELGGREEEIEYAAQKGIPV 174 (420) T ss_pred CCCEEECCCCCHHHHHHHCCCHHHHHHHHHCCCCC T ss_conf 89858747346467764059488999999668887 No 26 >PRK04527 argininosuccinate synthase; Provisional Probab=99.14 E-value=7e-10 Score=80.31 Aligned_cols=156 Identities=18% Similarity=0.123 Sum_probs=81.7 Q ss_pred CCCEEEEEECCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCC-CCEEEECH--HHCC Q ss_conf 99489998096569999999997498399999767788777279999999999999888752035-61587501--1032 Q gi|254781031|r 8 APSALLLFSGGQDSSTCLSWALDRFDRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLG-EDHILPLA--ILGD 84 (240) Q Consensus 8 ~~~av~l~SGGlDS~~~l~~l~~~~~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~-~~~~~~~~--~~~~ 84 (240) +||+|+.||||||+++++-||+++|++|++++.|.||...-+.+....-+.+ ... ...+++.. +... T Consensus 3 ~kkVvLAySGGLDTSv~l~wL~e~g~~Vi~~~ad~G~~~~e~~~~i~~kA~~----------~GA~~~~v~D~~~ef~~~ 72 (397) T PRK04527 3 TKDIVLAFSGGLDTSFCIPYLQERGYAVHTVFADTGGVDAEERDFIEKRAAE----------LGAASHVTVDGGPAIWEG 72 (397) T ss_pred CCEEEEEECCCCHHHHHHHHHHHCCCCEEEEEEECCCCCCHHHHHHHHHHHH----------HCCCEEEEECCHHHHHHH T ss_conf 8759999089728999999998759947999997899861515789999998----------199779997289999999 Q ss_pred CCCCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCC Q ss_conf 24443322201100012331011111112233377777743399365114544443455544302477777677530643 Q gi|254781031|r 85 ISHSSLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSGYPDCRHDTIRAIETAINLGMES 164 (240) Q Consensus 85 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 164 (240) ...... .....+....+ .....| ..+.....-.|...|+..+..|...-+- ..-+|...... .. T Consensus 73 ~v~~~i--~ana~Yeg~Yp--L~tsaR-plIak~~ve~A~~~ga~~iaHG~TGkGN-----DQvRFe~~~~a-----l~- 136 (397) T PRK04527 73 FVKPFV--WAGEGYQGQYP--LLVSDR-YLIVDAALKRAEELGTRIIAHGCTGMGN-----DQVRFDLAVKA-----LG- 136 (397) T ss_pred HHHHHH--HHCHHHCCCCC--CCCCHH-HHHHHHHHHHHHHCCCEEEECCCCCCCC-----CHHHHHHHHHH-----CC- T ss_conf 999998--60645418555--632107-9999999999997298384047666787-----02567768886-----36- Q ss_pred CEEEEEEEC------CCCHHHHHHHHHHHCC Q ss_conf 216775313------8988999999998282 Q gi|254781031|r 165 HVTVHTPLM------WLKKYETWKLAQDIGG 189 (240) Q Consensus 165 ~~~i~~P~~------~~tK~eI~~~~~~~~~ 189 (240) .++|++|+. +.+..+.++.+++.+. T Consensus 137 ~~~viAP~R~~~~~~~~~R~~~~~ya~~~gi 167 (397) T PRK04527 137 DYQIVAPIREIQKEHTQTRAYEQKYLEERGF 167 (397) T ss_pred CCCCCCCHHHHHHCCHHHHHHHHHHHHHCCC T ss_conf 7654365565310222029999976786499 No 27 >cd01992 PP-ATPase N-terminal domain of predicted ATPase of the PP-loop faimly implicated in cell cycle control [Cell division and chromosome partitioning]. This is a subfamily of Adenine nucleotide alpha hydrolases superfamily.Adeninosine nucleotide alpha hydrolases superfamily includes N type ATP PPases and ATP sulphurylases. It forms a apha/beta/apha fold which binds to Adenosine group. This domain has a strongly conserved motif SGGXD at the N terminus. Probab=99.14 E-value=3.9e-10 Score=81.89 Aligned_cols=151 Identities=19% Similarity=0.087 Sum_probs=83.7 Q ss_pred CEEEEEECCHHHHHHHHHHHH----CCCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECHHHCCC Q ss_conf 489998096569999999997----4983999997677887772799999999999998887520356158750110322 Q gi|254781031|r 10 SALLLFSGGQDSSTCLSWALD----RFDRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDHILPLAILGDI 85 (240) Q Consensus 10 ~av~l~SGGlDS~~~l~~l~~----~~~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 85 (240) |.++.+|||.||+++++.|.+ .+.++.++|+|+|.+..+..+. . .+...+. ....+.++........ T Consensus 1 ki~vavSGG~DS~~Ll~~l~~~~~~~~~~l~a~hvdh~lr~~s~~~~-~----~v~~~~~---~~~i~~~i~~~~~~~~- 71 (185) T cd01992 1 KILVAVSGGPDSMALLHLLSELKPRLGLRLVAVHVDHGLRPESDEEA-A----FVADLCA---KLGIPLYILVVALAPK- 71 (185) T ss_pred CEEEEECCCHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCHHHH-H----HHHHHHH---HCCCCEEEEEEEECCC- T ss_conf 99999678499999999999999974994899998189888888999-9----9999999---8599889999775367- Q ss_pred CCCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCHHHHHHH--------HHH Q ss_conf 444332220110001233101111111223337777774339936511454444345554430247777--------767 Q gi|254781031|r 86 SHSSLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSGYPDCRHDTIRAI--------ETA 157 (240) Q Consensus 86 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~~~~~~~~~~~--------~~~ 157 (240) ..... .......|. ......+...++..+++|.+.+|..-+ .++... ..+ T Consensus 72 ----~~~~~---------e~~aR~~Ry----~~l~~~~~~~~~~~i~lgHh~dD~~ET-----~lm~l~rg~~~~gl~gm 129 (185) T cd01992 72 ----PGGNL---------EAAAREARY----DFFAEIAKEHGADVLLTAHHADDQAET-----VLMRLLRGSGLRGLAGM 129 (185) T ss_pred ----CCCCH---------HHHHHHHHH----HHHHHHHHHHCCCCEEECCCHHHHHHH-----HHHHHHCCCCCCCCCCC T ss_conf ----89999---------999999999----999999987354504203630368999-----99998718996415277 Q ss_pred HHHCCCCCEEEEEEECCCCHHHHHHHHHHHCCCH Q ss_conf 7530643216775313898899999999828200 Q gi|254781031|r 158 INLGMESHVTVHTPLMWLKKYETWKLAQDIGGQD 191 (240) Q Consensus 158 ~~~~~~~~~~i~~P~~~~tK~eI~~~~~~~~~~~ 191 (240) -.......+.+++|++..+|.||.+.+.+.+.+. T Consensus 130 ~~~~~~~~~~iiRPLL~~~k~ei~~~~~~~~i~~ 163 (185) T cd01992 130 PARIPFGGGRLIRPLLGITRAEIEAYLRENGLPW 163 (185) T ss_pred CCCCCCCCCEEEEHHHHHHHHHHHHHHHHCCCCE T ss_conf 8413579952871577853999999999849956 No 28 >cd01993 Alpha_ANH_like_II This is a subfamily of Adenine nucleotide alpha hydrolases superfamily.Adeninosine nucleotide alpha hydrolases superfamily includes N type ATP PPases and ATP sulphurylases. It forms a apha/beta/apha fold which binds to Adenosine group. This subfamily of proteins is predicted to bind ATP. This domainhas a strongly conserved motif SGGKD at the N terminus. Probab=99.14 E-value=7.9e-10 Score=80.00 Aligned_cols=159 Identities=19% Similarity=0.064 Sum_probs=82.7 Q ss_pred CEEEEEECCHHHHHHHHHHHHC------CCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECHHHC Q ss_conf 4899980965699999999974------9839999976778877727999999999999988875203561587501103 Q gi|254781031|r 10 SALLLFSGGQDSSTCLSWALDR------FDRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDHILPLAILG 83 (240) Q Consensus 10 ~av~l~SGGlDS~~~l~~l~~~------~~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 83 (240) |.||.+|||.||++++++|.+- +.++.++|+|+|.+..+..+. . .+ ....... ..+.+.+...... T Consensus 1 kilva~SGG~DS~~Ll~ll~~~~~~~~~~~~l~~~hvnh~~r~~s~~~~-~-~v---~~~~~~~---~i~~~~~~~~~~~ 72 (185) T cd01993 1 RILVALSGGKDSLVLLHVLKKLQRRYPYGFELEALTVDEGIPGYRDESL-E-VV---ERLAEEL---GIELEIVSFKEEY 72 (185) T ss_pred CEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCCCHHHH-H-HH---HHHHHHC---CCCEEEEEEEECC T ss_conf 9899965849999999999998886488956999995699888889999-9-99---9989962---9935998764036 Q ss_pred CCCCCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCHHHHHHH-HH-----H Q ss_conf 22444332220110001233101111111223337777774339936511454444345554430247777-76-----7 Q gi|254781031|r 84 DISHSSLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSGYPDCRHDTIRAI-ET-----A 157 (240) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~~~~~~~~~~~-~~-----~ 157 (240) ...................-.|.. .....+...+++.+++|.+.+|..- .+...+ .. + T Consensus 73 ------~~~~~~~~~~~~~~c~~c~r~Rr~----~l~~~~~~~~~~~i~~gHh~dD~~E------T~l~~l~rg~~~~~~ 136 (185) T cd01993 73 ------TDDIEVKKRGGKSPCSLCGVLRRG----LLNKIAKELGADKLATGHNLDDEAE------TLLMNLLRGGILRLM 136 (185) T ss_pred ------CCCHHHHHHHHCCHHHHHHHHHHH----HHHHHHHHCCCCEEEEECCHHHHHH------HHHHHHHCCCCHHHC T ss_conf ------763899998640634689999999----9999999819988975330768999------999999848970005 Q ss_pred ----HHHCCCCCEEEEEEECCCCHHHHHHHHHHHCCCHH Q ss_conf ----75306432167753138988999999998282000 Q gi|254781031|r 158 ----INLGMESHVTVHTPLMWLKKYETWKLAQDIGGQDL 192 (240) Q Consensus 158 ----~~~~~~~~~~i~~P~~~~tK~eI~~~~~~~~~~~~ 192 (240) ........+++++|++..+|.||.+.+.+.+.+.. T Consensus 137 ~~~~~~~~~~~~i~iiRPLL~~~k~ei~~y~~~~~l~~~ 175 (185) T cd01993 137 RPGPILYLDEGDVTRIRPLVYVREKEIVLYAELNGLPFV 175 (185) T ss_pred CCCCCCCCCCCCEEEEECCCCCCHHHHHHHHHHCCCCEE T ss_conf 676633356898479954988989999999998799878 No 29 >TIGR00268 TIGR00268 conserved hypothetical protein TIGR00268; InterPro: IPR005232 This family of conserved hypothetical proteins has no known function but is predicted to be part of the wider PP-loop superfamily .. Probab=99.12 E-value=8.4e-10 Score=79.81 Aligned_cols=164 Identities=18% Similarity=0.148 Sum_probs=104.4 Q ss_pred CHHHCC---CCCEEEEEECCHHHHHHHHHHHH-CCCEEEEEEEECCCCCC-CCHHHHHHHHHHHHHHHHHHHHCCCCCEE Q ss_conf 122135---99489998096569999999997-49839999976778877-72799999999999998887520356158 Q gi|254781031|r 2 NDIIKK---APSALLLFSGGQDSSTCLSWALD-RFDRVETLSFDYGQRNK-VELECRLCVRKKIVELMPKWKDSLGEDHI 76 (240) Q Consensus 2 ~~~~~~---~~~av~l~SGGlDS~~~l~~l~~-~~~~v~~i~~~ygq~~~-~~~~~a~~~~~~~~~~~~~~~~~~~~~~~ 76 (240) +..++. ++++|+.||||+||++++..+.+ .|.++.+||+-.=+-.+ -+.+-|... +......+.. T Consensus 6 ~~fl~~kf~nk~~~IAySGGvDS~lla~v~~~v~g~~~lait~~sP~~sp~~el~~A~~~----------A~~~g~~he~ 75 (263) T TIGR00268 6 RNFLKEKFKNKKVLIAYSGGVDSSLLAAVCSDVAGTEVLAITVVSPSISPRRELEDAKAI----------AKEIGVKHEL 75 (263) T ss_pred HHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHHHHCEEEEEEECCCCCCHHHHHHHHHH----------HHHHCCCEEE T ss_conf 889876764081699951746589999999875310113578762775735448999999----------9880830121 Q ss_pred EECHHHCCCCCCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCHHHHHHHHH Q ss_conf 75011032244433222011000123310111111122333777777433993651145444434555443024777776 Q gi|254781031|r 77 LPLAILGDISHSSLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSGYPDCRHDTIRAIET 156 (240) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (240) +....+... ....-.....++..-.++......+...|.+.+.-|...+|-..++..-. + T Consensus 76 ~~~d~~~n~-------------~f~~N~~~RCY~CK~~~~~~L~~~a~~~gy~~V~dGtN~dDL~~~RPG~~----A--- 135 (263) T TIGR00268 76 VKIDKMANP-------------NFRANVEERCYFCKKKVLSILVKLAEKRGYDVVVDGTNADDLEDHRPGLR----A--- 135 (263) T ss_pred ECCCCCCCC-------------CCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCHHH----H--- T ss_conf 100123685-------------00168854441548889999899998639957982346200023675135----6--- Q ss_pred HHHHCCCCCEEEEEEE--CCCCHHHHHHHHHHHCCCHHCCCCHHHCCCCC Q ss_conf 7753064321677531--38988999999998282000143255372162 Q gi|254781031|r 157 AINLGMESHVTVHTPL--MWLKKYETWKLAQDIGGQDLVNLILEESHTCY 204 (240) Q Consensus 157 ~~~~~~~~~~~i~~P~--~~~tK~eI~~~~~~~~~~~~~~~~~~~T~SC~ 204 (240) ..+..+ ...|| ...||.||.+.++.+| ....+-| +.+|- T Consensus 136 -~~E~~g----~~SP~aef~I~K~eir~ia~~lg-~~~~DKP---~~~CL 176 (263) T TIGR00268 136 -VKELNG----VYSPWAEFGITKKEIREIAKSLG-LSFYDKP---SEACL 176 (263) T ss_pred -HHHCCC----CCCCCCCCCCCHHHHHHHHHHCC-CCCCCCC---CCCHH T ss_conf -766078----86872002568799999999748-8988886---62102 No 30 >pfam00764 Arginosuc_synth Arginosuccinate synthase. This family contains a PP-loop motif. Probab=99.11 E-value=2e-09 Score=77.51 Aligned_cols=157 Identities=15% Similarity=0.153 Sum_probs=83.8 Q ss_pred EEEEECCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECHHHCCCCCCCCC Q ss_conf 99980965699999999974983999997677887772799999999999998887520356158750110322444332 Q gi|254781031|r 12 LLLFSGGQDSSTCLSWALDRFDRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDHILPLAILGDISHSSLT 91 (240) Q Consensus 12 v~l~SGGlDS~~~l~~l~~~~~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 91 (240) |+.||||||+++++.||++++++|++++.|.||..+ +.+....-+.++. . ...++++....-....-... T Consensus 1 VLAySGGLDTSv~l~wL~e~~~eVia~~~d~Gq~~e-d~~~i~~kA~~~G-------A--~~~~v~D~r~ef~~~~i~p~ 70 (389) T pfam00764 1 VLAYSGGLDTSVCIPWLKEKYYEVIAVAVDVGQGEE-DLDEAREKALKLG-------A--VKHYVIDAKEEFVEDYIFPA 70 (389) T ss_pred CEEECCCCHHHHHHHHHHHCCCEEEEEEEECCCCHH-HHHHHHHHHHHHC-------C--CEEEEECCHHHHHHHHHHHH T ss_conf 960367628999999999709928999997999778-7899999999829-------9--78999737999999989999 Q ss_pred CCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEE Q ss_conf 22011000123310111111122333777777433993651145444434555443024777776775306432167753 Q gi|254781031|r 92 KNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSGYPDCRHDTIRAIETAINLGMESHVTVHTP 171 (240) Q Consensus 92 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~P 171 (240) ......+.+. .....+..-..........|...|++.+..|...-+-. --+|...+. .....++|++| T Consensus 71 i~ana~Yeg~--Y~L~tslaRplia~~~v~~A~~~ga~aiaHG~TGkGND-----QvRFe~~~~-----aL~P~l~iiaP 138 (389) T pfam00764 71 IKANALYEGR--YPLGTALARPLIAKKLVEAAKKEGADAVAHGCTGKGND-----QVRFELSIR-----SLAPDLKVIAP 138 (389) T ss_pred HHCCCCCCCC--EEECCCCHHHHHHHHHHHHHHHCCCEEEEECCCCCCCC-----HHHHHHHHH-----HHCCCCEEECC T ss_conf 9668674464--78524321669999999998763976994366679974-----535478899-----74999758545 Q ss_pred E--CCCCHHHHHHHHHHHCCC Q ss_conf 1--389889999999982820 Q gi|254781031|r 172 L--MWLKKYETWKLAQDIGGQ 190 (240) Q Consensus 172 ~--~~~tK~eI~~~~~~~~~~ 190 (240) + ..++..+.++.+++.|.+ T Consensus 139 ~R~~~~sR~~~i~ya~~~gI~ 159 (389) T pfam00764 139 VRDLNLTREEEIEYAKEKGIP 159 (389) T ss_pred CCCCCCCHHHHHHHHHHCCCC T ss_conf 245320579999999985998 No 31 >COG0137 ArgG Argininosuccinate synthase [Amino acid transport and metabolism] Probab=99.10 E-value=1.3e-09 Score=78.70 Aligned_cols=165 Identities=15% Similarity=0.085 Sum_probs=91.3 Q ss_pred HCCCCCEEEEEECCHHHHHHHHHHHHCC-CEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCC-CEEEECHHH Q ss_conf 1359948999809656999999999749-83999997677887772799999999999998887520356-158750110 Q gi|254781031|r 5 IKKAPSALLLFSGGQDSSTCLSWALDRF-DRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGE-DHILPLAIL 82 (240) Q Consensus 5 ~~~~~~av~l~SGGlDS~~~l~~l~~~~-~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 82 (240) |...+|+|+.||||||.++++-||.+++ ++|++++.|.||. ..+.+.+..-+.. .... .++++...- T Consensus 1 ~~~~kkvvLAYSGGLDTSv~i~wL~e~~~~eVit~tadvGQ~-eed~~~i~eKA~~----------~Ga~~~~viD~ree 69 (403) T COG0137 1 MMKVKKVVLAYSGGLDTSVAIKWLKEKGGAEVIAVTADVGQP-EEDLDAIREKALE----------LGAEEAYVIDAREE 69 (403) T ss_pred CCCCCEEEEEECCCCCHHHHHHHHHHHCCCEEEEEEEECCCC-HHHHHHHHHHHHH----------HCCCEEEEEECHHH T ss_conf 987767999954882399999999976594699999758997-5775799999998----------18852899643899 Q ss_pred CCCCCCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHCC Q ss_conf 32244433222011000123310111111122333777777433993651145444434555443024777776775306 Q gi|254781031|r 83 GDISHSSLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSGYPDCRHDTIRAIETAINLGM 162 (240) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 162 (240) -....-.........+... .....+..-..........|...|+..+..|...-+-..- +|.... ... T Consensus 70 Fv~~yi~~~i~ana~Yeg~--YpL~TalaRPLIak~lVe~A~k~Ga~avaHGcTGKGNDQv-----RFE~~~-----~al 137 (403) T COG0137 70 FVEDYIFPAIKANALYEGV--YPLGTALARPLIAKKLVEAAKKEGADAVAHGCTGKGNDQV-----RFELAI-----LAL 137 (403) T ss_pred HHHHHHHHHHHHHCEEECC--CCCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCEE-----EEEEEH-----HHH T ss_conf 9999999999730512156--4145434677999999999997199699746788887535-----432004-----541 Q ss_pred CCCEEEEEEE--CCCCHHHHHHHHHHHCCCHH Q ss_conf 4321677531--38988999999998282000 Q gi|254781031|r 163 ESHVTVHTPL--MWLKKYETWKLAQDIGGQDL 192 (240) Q Consensus 163 ~~~~~i~~P~--~~~tK~eI~~~~~~~~~~~~ 192 (240) ..+++|++|. .+++..+.++.+++.|.+.. T Consensus 138 ~pdlkiiAP~Rew~~~R~~~i~Ya~~~gipv~ 169 (403) T COG0137 138 NPDLKIIAPWREWNLTREEEIEYAEEHGIPVK 169 (403) T ss_pred CCCCEEEEEHHHHCCCHHHHHHHHHHCCCCCC T ss_conf 89967985633413376999999998499766 No 32 >cd01990 Alpha_ANH_like_I This is a subfamily of Adenine nucleotide alpha hydrolases superfamily. Adenine nucleotide alpha hydrolases superfamily includes N type ATP PPases and ATP sulphurylases. It forms a apha/beta/apha fold which binds to Adenosine group. This subfamily of proteins probably binds ATP. This domain is about 200 amino acids long with a strongly conserved motif SGGKD at the N terminus. Probab=99.10 E-value=1.1e-09 Score=79.22 Aligned_cols=153 Identities=25% Similarity=0.256 Sum_probs=92.4 Q ss_pred EEEEEECCHHHHHHHHHHHHC-CCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECHHHCCCCCCC Q ss_conf 899980965699999999974-9839999976778877727999999999999988875203561587501103224443 Q gi|254781031|r 11 ALLLFSGGQDSSTCLSWALDR-FDRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDHILPLAILGDISHSS 89 (240) Q Consensus 11 av~l~SGGlDS~~~l~~l~~~-~~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 89 (240) .+|.||||+||+++|+.+.+. |.++++||++.+--...+.+.+...++ .....+..++...+.... T Consensus 1 V~vA~SGGVDSs~la~la~~alG~~v~aV~~~s~~~~~~E~e~a~~~a~----------~~gi~~~~i~~~~l~~~~--- 67 (202) T cd01990 1 VAVAFSGGVDSTLLLKAAVDALGDRVLAVTATSPLFPRRELEEAKRLAK----------EIGIRHEVIETDELDDPE--- 67 (202) T ss_pred CEEECCCCHHHHHHHHHHHHHHCCCEEEEEEECCCCCHHHHHHHHHHHH----------HCCCCEEEEECCHHHHHH--- T ss_conf 9898238599999999999976878799997279899789999999998----------639857997341110165--- Q ss_pred CCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEE Q ss_conf 32220110001233101111111223337777774339936511454444345554430247777767753064321677 Q gi|254781031|r 90 LTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSGYPDCRHDTIRAIETAINLGMESHVTVH 169 (240) Q Consensus 90 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~ 169 (240) ... . ...........+.......+...|...+..|...+|...++.. . .+.. ...|. T Consensus 68 ~~~--------N--~~~RCy~CK~~l~~~l~~~a~~~g~~~v~dGtn~dDl~d~RPG---l-~A~~---------e~~v~ 124 (202) T cd01990 68 FAK--------N--PPDRCYLCKKALYEALKEIAEELGLDVVLDGTNADDLGDYRPG---L-KALR---------ELGVR 124 (202) T ss_pred HCC--------C--CHHHHHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCC---H-HHHH---------HCCCC T ss_conf 545--------9--7556229989999999999996799789415757632355786---8-8998---------76998 Q ss_pred EEECC--CCHHHHHHHHHHHCCCHHCCCCHHHCCCC Q ss_conf 53138--98899999999828200014325537216 Q gi|254781031|r 170 TPLMW--LKKYETWKLAQDIGGQDLVNLILEESHTC 203 (240) Q Consensus 170 ~P~~~--~tK~eI~~~~~~~~~~~~~~~~~~~T~SC 203 (240) .|+.. ++|.||-++++++|.+. -+.| +.+| T Consensus 125 sPL~e~gl~K~eVR~la~~lgLp~-~~kp---~~~C 156 (202) T cd01990 125 SPLAEAGLGKAEIRELARELGLPT-WDKP---AMAC 156 (202) T ss_pred CCHHHCCCCHHHHHHHHHHCCCCC-CCCC---CCCC T ss_conf 852651978899999999839985-6689---9875 No 33 >TIGR00884 guaA_Cterm GMP synthase, C-terminal domain; InterPro: IPR001674 The amidotransferase family of enzymes utilises the ammonia derived from the hydrolysis of glutamine for a subsequent chemical reaction catalyzed by the same enzyme. The ammonia intermediate does not dissociate into solution during the chemical transformations . GMP synthetase is a glutamine amidotransferase from the de novo purine biosynthetic pathway. The C-terminal domain is specific to the GMP synthases 6.3.5.2 from EC. In prokaryotes this domain mediates dimerisation. Eukaryotic GMP synthases are monomers. This domain in eukaryotes includes several large insertions that may form globular domains .; GO: 0003922 GMP synthase (glutamine-hydrolyzing) activity, 0005524 ATP binding, 0006164 purine nucleotide biosynthetic process, 0006177 GMP biosynthetic process. Probab=99.09 E-value=1.6e-09 Score=78.01 Aligned_cols=164 Identities=21% Similarity=0.154 Sum_probs=107.2 Q ss_pred CCCCEEEEEECCHHHHHHHHHHHHC-CCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEE--ECHHHC Q ss_conf 5994899980965699999999974-9839999976778877727999999999999988875203561587--501103 Q gi|254781031|r 7 KAPSALLLFSGGQDSSTCLSWALDR-FDRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDHIL--PLAILG 83 (240) Q Consensus 7 ~~~~av~l~SGGlDS~~~l~~l~~~-~~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~ 83 (240) -+.+|++.+|||+|||++|.++-+. |.+++|||+|.|=--.++.+. +.+.+.+. ...+...+ ...+++ T Consensus 15 GD~~vi~ALSGGVDSsV~A~L~hrAIGD~L~~vFVD~GLlR~gE~E~---V~~~F~~~------lg~nl~~VDA~e~FL~ 85 (319) T TIGR00884 15 GDAKVIIALSGGVDSSVAAVLLHRAIGDRLTCVFVDHGLLRKGEAER---VVKTFSDK------LGLNLVVVDAKERFLS 85 (319) T ss_pred CCCEEEEEECCCHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHH---HHHHHHHH------HCCCCEEECCCHHHHH T ss_conf 88468998108815899999998642276048982278887663789---99998753------0898278761079887 Q ss_pred CCCCCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCC----CCEEECCCCCCCCCCC----CCCCHHHHHHHH Q ss_conf 22444332220110001233101111111223337777774339----9365114544443455----544302477777 Q gi|254781031|r 84 DISHSSLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLG----ITNIVIGVCETDYSGY----PDCRHDTIRAIE 155 (240) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g----~~~i~~~~~~~~~~~~----~~~~~~~~~~~~ 155 (240) .+..-. .++.-.-.-.-.|+.+..-.|..++ +..+.-|....|.--+ -......++..+ T Consensus 86 ~L~GV~-------------DPE~KRKIIG~~FI~VFE~~A~~~~~~~~a~yL~QGTlYPDvIES~~~kG~~~a~~IKsHH 152 (319) T TIGR00884 86 ALKGVT-------------DPEEKRKIIGRVFIEVFEREAEKIGDKKKAEYLVQGTLYPDVIESASSKGTSGAATIKSHH 152 (319) T ss_pred HCCCCC-------------CCHHHCCHHHHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCEEEECCCCCCCCCCCCCCCC T ss_conf 528898-------------7034131122566777888998507997799997343447464602304776476456310 Q ss_pred HHHHHCCCCCEEEEEEECCCCHHHHHHHHHHHCCCHH Q ss_conf 6775306432167753138988999999998282000 Q gi|254781031|r 156 TAINLGMESHVTVHTPLMWLKKYETWKLAQDIGGQDL 192 (240) Q Consensus 156 ~~~~~~~~~~~~i~~P~~~~tK~eI~~~~~~~~~~~~ 192 (240) |.-.+-....++++-|+..+=|-||-+++..||.+.. T Consensus 153 NVGGLP~~m~l~LvEPLR~LfKDEVR~lG~~LGlP~e 189 (319) T TIGR00884 153 NVGGLPEDMKLKLVEPLRELFKDEVRKLGKELGLPEE 189 (319) T ss_pred CCCCCCCCCEEEEECCCHHCCHHHHHHHHHHCCCCHH T ss_conf 6688854260258754023021789999987188679 No 34 >cd01999 Argininosuccinate_Synthase Argininosuccinate synthase. The Argininosuccinate synthase is a urea cycle enzyme that catalyzes the penultimate step in arginine biosynthesis: the ATP-dependent ligation of citrulline to aspartate to form argininosuccinate, AMP and pyrophosphate . In humans, a defect in the AS gene causes citrullinemia, a genetic disease characterized by severe vomiting spells and mental retardation. AS is a homotetrameric enzyme of chains of about 400 amino-acid residues. An arginine seems to be important for the enzyme's catalytic mechanism. The sequences of AS from various prokaryotes, archaebacteria and eukaryotes show significant similarity Probab=99.07 E-value=6.5e-09 Score=74.33 Aligned_cols=157 Identities=17% Similarity=0.132 Sum_probs=83.2 Q ss_pred EEEEEECCHHHHHHHHHHHHCC-CEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECHHHCCCCCCC Q ss_conf 8999809656999999999749-839999976778877727999999999999988875203561587501103224443 Q gi|254781031|r 11 ALLLFSGGQDSSTCLSWALDRF-DRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDHILPLAILGDISHSS 89 (240) Q Consensus 11 av~l~SGGlDS~~~l~~l~~~~-~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 89 (240) +|++||||||+++++.||.+++ ++|++++.|.||... ..+. +.++ ....... ..++++....-...... T Consensus 1 VvLAySGGLDTSv~l~wL~e~~~~eVia~~~d~Gq~~e-~~~~---i~~k----A~~~GA~--~~~v~D~r~ef~~~~i~ 70 (385) T cd01999 1 VVLAYSGGLDTSVILKWLKEKGGYEVIAVTADVGQPEE-EIEA---IEEK----ALKLGAK--KHVVVDLREEFVEDYIF 70 (385) T ss_pred CEEEECCCCHHHHHHHHHHHHCCCEEEEEEEECCCCHH-HHHH---HHHH----HHHHCCC--EEEEECHHHHHHHHHHH T ss_conf 97997788179999999997459849999998999667-7789---9999----9981997--89996129999999999 Q ss_pred CCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCHH-HHHHHHHHHHHCCCCCEEE Q ss_conf 322201100012331011111112233377777743399365114544443455544302-4777776775306432167 Q gi|254781031|r 90 LTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSGYPDCRHD-TIRAIETAINLGMESHVTV 168 (240) Q Consensus 90 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~i 168 (240) ........+.. ......+..-..........|...|+..+..|...-+ +-+. |.... .......+| T Consensus 71 pai~ana~Yeg--~Y~L~tslaRplIak~~ve~A~~~ga~aiaHG~TGkG------NDQvRFe~~~-----~aL~P~l~i 137 (385) T cd01999 71 PAIQANALYEG--TYPLGTALARPLIAKALVEVAKEEGADAVAHGCTGKG------NDQVRFELAF-----YALNPDLKI 137 (385) T ss_pred HHHHHCCCCCC--CEECCCCHHHHHHHHHHHHHHHHHCCEEEEECCCCCC------CCHHHHHHHH-----HHHCCCCEE T ss_conf 99960864337--5321350157899999999999809849974566688------8406899999-----985999758 Q ss_pred EEEECC---CCHHHHHHHHHHHCCC Q ss_conf 753138---9889999999982820 Q gi|254781031|r 169 HTPLMW---LKKYETWKLAQDIGGQ 190 (240) Q Consensus 169 ~~P~~~---~tK~eI~~~~~~~~~~ 190 (240) ++|+.. ++..|.++.+++.|.+ T Consensus 138 iAP~Rd~~~~sR~~~i~ya~~~gi~ 162 (385) T cd01999 138 IAPWRDWEFLSREEEIEYAEEHGIP 162 (385) T ss_pred EEEEECCCCCCHHHHHHHHHHCCCC T ss_conf 5147314448889999999985999 No 35 >TIGR00420 trmU tRNA (5-methylaminomethyl-2-thiouridylate)-methyltransferase; InterPro: IPR004506 tRNA (5-methylaminomethyl-2-thiouridylate)-methyltransferase (2.1.1.61 from EC) catalyses the addition of 5-methylaminomethyl-2-thiouridylate to tRNAs using S-adenosyl-L-methionine as a substrate and releasing S-adenosyl-L-homocysteine. The enzyme is cytoplasmic and is involved in tRNA processing.; GO: 0004808 tRNA (5-methylaminomethyl-2-thiouridylate)-methyltransferase activity, 0008033 tRNA processing, 0005737 cytoplasm. Probab=99.05 E-value=1.4e-09 Score=78.45 Aligned_cols=170 Identities=14% Similarity=0.036 Sum_probs=105.3 Q ss_pred CCEEEEEECCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCCH---------HHHHHHH-HHHHHHHHHHHHCCCCCEEEE Q ss_conf 948999809656999999999749839999976778877727---------9999999-999999888752035615875 Q gi|254781031|r 9 PSALLLFSGGQDSSTCLSWALDRFDRVETLSFDYGQRNKVEL---------ECRLCVR-KKIVELMPKWKDSLGEDHILP 78 (240) Q Consensus 9 ~~av~l~SGGlDS~~~l~~l~~~~~~v~~i~~~ygq~~~~~~---------~~a~~~~-~~~~~~~~~~~~~~~~~~~~~ 78 (240) +|++|++|||+||++.||+|.++|++|..|++.-+....... ..+.+.. +.+.+..+......++++.++ T Consensus 1 ~kv~VglSGGVDSSVsA~lL~~qg~~v~G~ylk~W~~~~~~d~~GH~~~N~~K~~C~~~~dl~~a~~ic~~LGi~l~~~n 80 (394) T TIGR00420 1 KKVIVGLSGGVDSSVSAYLLKQQGYEVVGVYLKNWDEDEKNDILGHKKINENKSGCSSAEDLRDAQAICEKLGIPLEKVN 80 (394) T ss_pred CEEEEECCCCHHHHHHHHHHHHCCCEEEEEEEEEEECCCCCCCCCHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCEEEEC T ss_conf 94899806854789999999714965899987765124666644316562467788368879999999985598189855 Q ss_pred CHHHCCCCCCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHH-C--CCCEEECCCCCCCC-CCCC---------- Q ss_conf 01103224443322201100012331011111112233377777743-3--99365114544443-4555---------- Q gi|254781031|r 79 LAILGDISHSSLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYR-L--GITNIVIGVCETDY-SGYP---------- 144 (240) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~-~--g~~~i~~~~~~~~~-~~~~---------- 144 (240) +.. .++......-....-.+..+.-....++..-|-.. .-++.. . |.+.+++|+-+.-. .... T Consensus 81 f~~--~Y~~~Vf~~~i~~y~~G~TPnPDi~CN~~iKFG~~-~e~~~~~~Gtg~~~~aTGHYAr~~q~~~~~~~~~L~~~~ 157 (394) T TIGR00420 81 FQK--EYWNKVFEPFIQEYKEGLTPNPDILCNKLIKFGLL-LEYAKQLLGTGNDKIATGHYARIAQEIENKSLFELLRAL 157 (394) T ss_pred HHH--HHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHH-HHHHHHHCCCCCCEEECCHHHHHHHHHCCCCHHHHHCCC T ss_conf 078--87888889999987367777862004712003789-999998528995434424476544342131014443046 Q ss_pred ----CCCHHHHHHHHHHHHHCCCCCEEEEEEECCCCHHHHHHHHHHHC Q ss_conf ----44302477777677530643216775313898899999999828 Q gi|254781031|r 145 ----DCRHDTIRAIETAINLGMESHVTVHTPLMWLKKYETWKLAQDIG 188 (240) Q Consensus 145 ----~~~~~~~~~~~~~~~~~~~~~~~i~~P~~~~tK~eI~~~~~~~~ 188 (240) |- ..|...+.... =-.++.|+..++|.+|-++|.+.+ T Consensus 158 D~~KDQ-SYFL~~l~~~~------~~~~~FPlG~L~K~~vR~iA~~~~ 198 (394) T TIGR00420 158 DKNKDQ-SYFLYHLSQEQ------LAKLLFPLGELLKPEVRQIAKNAG 198 (394) T ss_pred CCCCCC-HHHHHHHHHHH------HHHHCCCCCCCCCHHHHHHHHHCC T ss_conf 788861-17376401898------887505301036778999998648 No 36 >PRK10660 tilS tRNA(Ile)-lysidine synthetase; Provisional Probab=99.05 E-value=1.7e-09 Score=77.89 Aligned_cols=159 Identities=13% Similarity=0.030 Sum_probs=83.5 Q ss_pred CHHHCCCCCEEEEEECCHHHHHHHHHHH---HC--CCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEE Q ss_conf 1221359948999809656999999999---74--983999997677887772799999999999998887520356158 Q gi|254781031|r 2 NDIIKKAPSALLLFSGGQDSSTCLSWAL---DR--FDRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDHI 76 (240) Q Consensus 2 ~~~~~~~~~av~l~SGGlDS~~~l~~l~---~~--~~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~ 76 (240) |-|+.+.++.||.+|||+||+++|+.|. ++ +.+++++|+|.|-+.+...+.+ .+ .+.+. ...++.+. T Consensus 7 ~~~~~~~~~ilvavSGG~DS~~LL~~L~~l~~~~~~~~l~~~HvnHgl~~~ad~~~~--fv---~~~c~---~~~ip~~~ 78 (433) T PRK10660 7 NRQLLESRQILVAFSGGLDSTVLLHQLVQWRTQNPGVTLRAIHIHHGLSPNADSWVK--HC---EQVCQ---QWQVPLVV 78 (433) T ss_pred HHHCCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCHHHHHH--HH---HHHHH---HCCCCEEE T ss_conf 664699998999972809999999999999986689828999971897926699999--99---99999---75997899 Q ss_pred EECHHHCCCCCCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCHHHHHHHHH Q ss_conf 75011032244433222011000123310111111122333777777433993651145444434555443024777776 Q gi|254781031|r 77 LPLAILGDISHSSLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSGYPDCRHDTIRAIET 156 (240) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (240) ..... .. ... .........|...+.... .. ...+.++.+.+|..-+ .++..+-. T Consensus 79 ~~~~v-~~-----~~~---------~~E~aAR~~RY~~f~~~~----~~--~~~l~tAHh~dDQaET-----vLlrL~RG 132 (433) T PRK10660 79 ERVQL-AQ-----EGL---------GIEAAARQARYQAFARTL----LP--GEVLVTAQHLDDQCET-----FLLALKRG 132 (433) T ss_pred EEEEE-CC-----CCC---------CHHHHHHHHHHHHHHHHH----HH--CCEEEECCCCCHHHHH-----HHHHHHCC T ss_conf 99872-79-----997---------699999999999999987----43--8879962456519999-----99998658 Q ss_pred HHHHC-CCC-------CEEEEEEECCCCHHHHHHHHHHHCCCHHCC Q ss_conf 77530-643-------216775313898899999999828200014 Q gi|254781031|r 157 AINLG-MES-------HVTVHTPLMWLKKYETWKLAQDIGGQDLVN 194 (240) Q Consensus 157 ~~~~~-~~~-------~~~i~~P~~~~tK~eI~~~~~~~~~~~~~~ 194 (240) .-..| .+. +..+++|++..+|.||...+.+.+.....+ T Consensus 133 sG~~GL~gm~~~r~~~~~~liRPLL~~~r~eI~~Y~~~~~l~~~eD 178 (433) T PRK10660 133 SGPAGLSAMAEVSPFAGTQLLRPLLARTRGELEQWAQAHGLRWIED 178 (433) T ss_pred CCCCCCCCCCCCCCCCCCEEECCCHHHHHHHHHHHHHHCCCCCEEC T ss_conf 9964456775224478970773744665999999999849980389 No 37 >cd01997 GMP_synthase_C The C-terminal domain of GMP synthetase. It contains two subdomains; the ATP pyrophosphatase domain which closes to the N-termial and the dimerization domain at C-terminal end. The ATP-PPase is a twisted, five-stranded parallel beta-sheet sandwiched between helical layers. It has a signature nucleotide-binding motif, or P-loop, at the end of the first-beta strand.The dimerization domain formed by the C-terminal 115 amino acid for prokaryotic proteins. It is adjacent to teh ATP-binding site of the ATP-PPase subdomain. The largest difference between the primary sequence of prokaryotic and eukaryotic GMP synthetase map to the dimerization domain.Eukaryotic GMP synthetase has several large insertions relative to prokaryotes. Probab=98.89 E-value=2e-08 Score=71.35 Aligned_cols=161 Identities=22% Similarity=0.161 Sum_probs=81.2 Q ss_pred CEEEEEECCHHHHHHHHHHHHC-CCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECH--HHCCCC Q ss_conf 4899980965699999999974-9839999976778877727999999999999988875203561587501--103224 Q gi|254781031|r 10 SALLLFSGGQDSSTCLSWALDR-FDRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDHILPLA--ILGDIS 86 (240) Q Consensus 10 ~av~l~SGGlDS~~~l~~l~~~-~~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~ 86 (240) |+++++|||+||+++|.++-+. |..+++||+|.|---..+.+...... . +.. ......++.+ +++.+. T Consensus 1 kVi~~lSGGVDStV~A~Ll~kAig~~l~~vfVDnGlmRk~E~e~V~~~~---~----~~~--~~~~~~vdas~~Fl~~L~ 71 (295) T cd01997 1 KVILALSGGVDSTVAAVLLHKAIGDRLTCVFVDNGLLRKNEAERVEELF---S----KLL--GINLIVVDASERFLSALK 71 (295) T ss_pred CEEEEECCCHHHHHHHHHHHHHHHCCEEEEEECCCCCCCCCHHHHHHHH---H----HHC--CCCEEEECHHHHHHHHHC T ss_conf 9899905782899999999988650569999858978888299999999---8----706--997799566999999865 Q ss_pred CCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCC-CCEEECCCCCCCC--CCCCCCCHHHHHHHHHHHHHCCC Q ss_conf 44332220110001233101111111223337777774339-9365114544443--45554430247777767753064 Q gi|254781031|r 87 HSSLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLG-ITNIVIGVCETDY--SGYPDCRHDTIRAIETAINLGME 163 (240) Q Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g-~~~i~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~ 163 (240) . ..+ ++.-.-.-.-.++.+....+..++ +..+.-|....|. +.........+....+....... T Consensus 72 g-----v~D--------PE~KRKiIG~~Fi~vf~~~~~~~~~~~~L~QGTlyPDvIES~~~~~~a~~IKsHHNvggLp~~ 138 (295) T cd01997 72 G-----VTD--------PEEKRKIIGETFIEVFEEEAKKLGLAEYLAQGTLYPDVIESGSGKGSADTIKSHHNVGGLPED 138 (295) T ss_pred C-----CCC--------HHHHHHHHHHHHHHHHHHHHHHCCCCCEEEECCCCCCEEECCCCCCCCCEEEEECCCCCCCHH T ss_conf 8-----779--------256365653899999999997558861897243026257507888887514522456777232 Q ss_pred CCEEEEEEECCCCHHHHHHHHHHHCCCHH Q ss_conf 32167753138988999999998282000 Q gi|254781031|r 164 SHVTVHTPLMWLKKYETWKLAQDIGGQDL 192 (240) Q Consensus 164 ~~~~i~~P~~~~tK~eI~~~~~~~~~~~~ 192 (240) ...+++-|+..+-|.||-+++.+||.+.. T Consensus 139 ~~~kliEPLr~LfKDEVR~lG~~LGlp~~ 167 (295) T cd01997 139 MKLKLIEPLRDLFKDEVRELGRELGLPEE 167 (295) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHCCCHH T ss_conf 26540016788745999999998689788 No 38 >PRK00919 GMP synthase subunit B; Validated Probab=98.88 E-value=6.8e-09 Score=74.21 Aligned_cols=157 Identities=22% Similarity=0.169 Sum_probs=87.3 Q ss_pred CCCEEEEEECCHHHHHHHHHHHHC-CCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECHH--HCC Q ss_conf 994899980965699999999974-98399999767788777279999999999999888752035615875011--032 Q gi|254781031|r 8 APSALLLFSGGQDSSTCLSWALDR-FDRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDHILPLAI--LGD 84 (240) Q Consensus 8 ~~~av~l~SGGlDS~~~l~~l~~~-~~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~ 84 (240) .+|+++.+|||+||+++|.++-+. |..+++|++|.|---..+.+.. .+.+ +. ......++.+. ++. T Consensus 20 ~~kvi~~lSGGVDStV~A~Ll~kAig~~l~~v~VD~GllR~~E~~~V---~~~~----~~----~~~l~~vda~~~Fl~~ 88 (306) T PRK00919 20 DGKAIIALSGGVDSSVAAVLAHRAIGDRLLAVYVDTGLMRKGETERI---REIF----KD----GLNLRIVDASDRFLEA 88 (306) T ss_pred CCCEEEEECCCCHHHHHHHHHHHHHHCCEEEEEECCCCCCCCCHHHH---HHHH----HH----CCCCEEEEHHHHHHHH T ss_conf 98599991688479999999998864265999986898879869999---9999----83----5892898349999997 Q ss_pred CCCCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCC Q ss_conf 24443322201100012331011111112233377777743399365114544443455544302477777677530643 Q gi|254781031|r 85 ISHSSLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSGYPDCRHDTIRAIETAINLGMES 164 (240) Q Consensus 85 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 164 (240) +. .-.+ ++.-.-.-...|+.+....+..+++..+.-|....|.--+ ...+....+.-.+.... T Consensus 89 L~-----gv~D--------PE~KRKiIG~~Fi~vfe~~~~~~~~~~LaQGTlyPDvIES----~~~IKSHHNVggLp~~m 151 (306) T PRK00919 89 LK-----GVTD--------PEEKRKIIGETFIRVFEEVAKEIGAEYLVQGTIAPDWIES----EGGIKSHHNVGGLPEGM 151 (306) T ss_pred HC-----CCCC--------HHHCCCHHHHHHHHHHHHHHHHHCCCEEEECEECCCHHHC----CCCCCCCCCCCCCHHHC T ss_conf 37-----9989--------5674602579999999999998298768511073760204----67643416688773323 Q ss_pred CEEEEEEECCCCHHHHHHHHHHHCCCHH Q ss_conf 2167753138988999999998282000 Q gi|254781031|r 165 HVTVHTPLMWLKKYETWKLAQDIGGQDL 192 (240) Q Consensus 165 ~~~i~~P~~~~tK~eI~~~~~~~~~~~~ 192 (240) ..+++-|+..+-|.||-+++.+||.+.. T Consensus 152 ~~kliEPLr~LfKDEVR~lG~~LGlp~~ 179 (306) T PRK00919 152 VLKIVEPLRDLYKDEVREVARALGLPEE 179 (306) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHCCCHH T ss_conf 5631031899855999999999697888 No 39 >PRK00074 guaA GMP synthase; Reviewed Probab=98.86 E-value=6e-08 Score=68.32 Aligned_cols=164 Identities=21% Similarity=0.140 Sum_probs=82.1 Q ss_pred CCCCEEEEEECCHHHHHHHHHHHHC-CCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECH--HHC Q ss_conf 5994899980965699999999974-9839999976778877727999999999999988875203561587501--103 Q gi|254781031|r 7 KAPSALLLFSGGQDSSTCLSWALDR-FDRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDHILPLA--ILG 83 (240) Q Consensus 7 ~~~~av~l~SGGlDS~~~l~~l~~~-~~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~ 83 (240) ..+|++|.+|||+||+++|.++-+. |..+++|++|.|---..+.+. +.+.+.. . .......++.+ +++ T Consensus 215 g~~kVi~~lSGGVDStV~A~Ll~kAIGd~l~cvfVD~GllRknE~~~---V~~~~~~---~---lgl~~~~vdA~~~Fl~ 285 (513) T PRK00074 215 GDKKVILGLSGGVDSSVAAVLLHKAIGDQLTCVFVDHGLLRKNEAEQ---VMEMFRE---H---FGLNLIHVDASDRFLS 285 (513) T ss_pred CCCEEEEEECCCCCHHHHHHHHHHHHHHCEEEEEECCCCCCCCCHHH---HHHHHHH---H---CCCCEEEEEHHHHHHH T ss_conf 87628997238830899999999985112489995377111672999---9999988---6---1996799224899999 Q ss_pred CCCCCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHC-CCCEEECCCCCCCC--CCCCCCCHHHHHHHHHHHHH Q ss_conf 2244433222011000123310111111122333777777433-99365114544443--45554430247777767753 Q gi|254781031|r 84 DISHSSLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRL-GITNIVIGVCETDY--SGYPDCRHDTIRAIETAINL 160 (240) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~-g~~~i~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~ 160 (240) .+.. -.+ ++.-.-.-.-.|+.+....+..+ ++..+.-|....|. +.........+....+.-.+ T Consensus 286 ~L~g-----v~D--------PE~KRKIIG~~FI~vfe~~a~~~~~~~~L~QGTlYPDvIES~~~~~~a~~IKsHHNVggl 352 (513) T PRK00074 286 ALAG-----VTD--------PEEKRKIIGREFIEVFEEEAKKLGGVKFLAQGTLYPDVIESASGTGKAATIKSHHNVGGL 352 (513) T ss_pred HHCC-----CCC--------HHHHHHHHHHHHHHHHHHHHHHCCCCCEEECCCCCCCEEEECCCCCCCCCCCCCCCCCCC T ss_conf 7379-----989--------799678877899999999997579983996467655068847989998860487758788 Q ss_pred CCCCCEEEEEEECCCCHHHHHHHHHHHCCCHH Q ss_conf 06432167753138988999999998282000 Q gi|254781031|r 161 GMESHVTVHTPLMWLKKYETWKLAQDIGGQDL 192 (240) Q Consensus 161 ~~~~~~~i~~P~~~~tK~eI~~~~~~~~~~~~ 192 (240) ......++.-|+..+-|.||.+++.+||.+.. T Consensus 353 p~~~~~~lvEPlr~lfKDEVR~lg~~Lglp~~ 384 (513) T PRK00074 353 PEDMKLKLVEPLRELFKDEVRKLGLELGLPEE 384 (513) T ss_pred HHHCCCEEEEEHHHHCCHHHHHHHHHHCCCHH T ss_conf 56648754503175615899999999689777 No 40 >KOG2805 consensus Probab=98.85 E-value=7.1e-09 Score=74.07 Aligned_cols=184 Identities=16% Similarity=0.067 Sum_probs=84.6 Q ss_pred HHCCCCCEEEEEECCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECHHHC Q ss_conf 21359948999809656999999999749839999976778877727999999999999988875203561587501103 Q gi|254781031|r 4 IIKKAPSALLLFSGGQDSSTCLSWALDRFDRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDHILPLAILG 83 (240) Q Consensus 4 ~~~~~~~av~l~SGGlDS~~~l~~l~~~~~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 83 (240) ||++..++||.+|||+||++.|++|.++|+.+..|++.-++-.. +........+...+..+......++.+.+... . T Consensus 1 ~p~~~~~VvvamSgGVDSsVaa~Ll~~~g~~v~gv~M~nWd~~d-e~~s~cp~e~D~~da~~Vc~~LnI~~~~Vnf~--k 77 (377) T KOG2805 1 MPEKPDRVVVAMSGGVDSSVAARLLAARGYNVTGVFMKNWDSLD-EFGSQCPAERDWKDAKRVCKQLNIPLHQVNFV--K 77 (377) T ss_pred CCCCCCEEEEEECCCCHHHHHHHHHHHCCCCEEEEEEECCCCCC-CCCCCCCCHHHHHHHHHHHHHHCCEEEEEEEH--H T ss_conf 98666548999537711899999997418871699662201222-35668981120899999998709702787437--9 Q ss_pred CCCCCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCC-C-----HHHHHHHHHH Q ss_conf 224443322201100012331011111112233377777743399365114544443455544-3-----0247777767 Q gi|254781031|r 84 DISHSSLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSGYPDC-R-----HDTIRAIETA 157 (240) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~~~-~-----~~~~~~~~~~ 157 (240) +++......-.+....+..+.-.....+..-+-...---....|.+.++.|..+...-...++ . +.-....+.. T Consensus 78 EYW~~Vfs~~L~~Y~~G~TPNPDI~CN~~IKFg~~~~~a~en~~~d~latGHYAr~~~~~~~~~~~~l~~~~d~~KDQt~ 157 (377) T KOG2805 78 EYWNDVFSPFLEEYENGRTPNPDILCNKHIKFGKFFKHAIENLGYDWLATGHYARVVLEDEDNAESHLLISKDMVKDQTY 157 (377) T ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCCCCEEECCHHHHHHHHHCCCCEEEEEEEEEEECCCCCCCCEEEEECCCCCCCCEE T ss_conf 99999999998887658999997134451421178999987558873774212144037556762357612566677326 Q ss_pred HH--HCCCCCEEEEEEECCCCHHHHHHHHHHHCCC Q ss_conf 75--3064321677531389889999999982820 Q gi|254781031|r 158 IN--LGMESHVTVHTPLMWLKKYETWKLAQDIGGQ 190 (240) Q Consensus 158 ~~--~~~~~~~~i~~P~~~~tK~eI~~~~~~~~~~ 190 (240) +. .....--+...|+..++|.||-+++...|-+ T Consensus 158 FL~~in~~~L~r~lfPlg~~~K~eVk~lA~~~gf~ 192 (377) T KOG2805 158 FLSTINQTQLKRLLFPLGCLTKSEVKKLAKQAGFP 192 (377) T ss_pred EEECCCHHHHHHHHCCCCCCCHHHHHHHHHHCCCC T ss_conf 76303699998620667656779999999865986 No 41 >PRK10696 C32 tRNA thiolase; Provisional Probab=98.73 E-value=1.2e-07 Score=66.54 Aligned_cols=159 Identities=17% Similarity=0.125 Sum_probs=91.1 Q ss_pred CHHHCCCCCEEEEEECCHHHHHHHHHHHHC------CCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCE Q ss_conf 122135994899980965699999999974------98399999767788777279999999999999888752035615 Q gi|254781031|r 2 NDIIKKAPSALLLFSGGQDSSTCLSWALDR------FDRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDH 75 (240) Q Consensus 2 ~~~~~~~~~av~l~SGGlDS~~~l~~l~~~------~~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~ 75 (240) ++|+....+.+|-+|||.||.++|+.|..- ..++.++|+|.++.-. ..+. +.+.. .....+.+ T Consensus 33 y~MIedGDRVlVglSGGKDS~~LL~iL~~Lq~~api~FeLvAv~lD~~~pGf-~~~~-------L~~yl---~~lGvp~~ 101 (311) T PRK10696 33 FNMIEEGDRIMVCLSGGKDSYTMLEILRNLQQSAPINFSLVAVNLDQKQPGF-PEHI-------LPEYL---EKLGVEYK 101 (311) T ss_pred HCCCCCCCEEEEECCCCHHHHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCC-CHHH-------HHHHH---HHCCCCCE T ss_conf 5877899999998267888999999999999858998559999837899998-8068-------89999---97499707 Q ss_pred EEECHHHCCCCCCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCHHHHHHHH Q ss_conf 87501103224443322201100012331011111112233377777743399365114544443455544302477777 Q gi|254781031|r 76 ILPLAILGDISHSSLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSGYPDCRHDTIRAIE 155 (240) Q Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 155 (240) ++.....+ ... ..........+...-.|- .+....|...|++.|+.|.+.+|..-+- ++..+. T Consensus 102 i~~~d~~~-iv~-------~~~~egks~CslCsRlRR----g~Ly~~A~e~G~nKIALGHH~DDi~ETf-----LMNlf~ 164 (311) T PRK10696 102 IVEENTYG-IVK-------EKIPEGKTTCSLCSRLRR----GILYRTATELGATKIALGHHRDDILQTL-----FLNMFY 164 (311) T ss_pred EEEEEHHH-HHH-------HHCCCCCCHHHHHHHHHH----HHHHHHHHHCCCCEEEECCCHHHHHHHH-----HHHHHH T ss_conf 99840767-998-------754368863789999999----9999999986998798606505589999-----999996 Q ss_pred -----HH--HHHCCCCCEEEEEEECCCCHHHHHHHHHHHC Q ss_conf -----67--7530643216775313898899999999828 Q gi|254781031|r 156 -----TA--INLGMESHVTVHTPLMWLKKYETWKLAQDIG 188 (240) Q Consensus 156 -----~~--~~~~~~~~~~i~~P~~~~tK~eI~~~~~~~~ 188 (240) .| ..........|++|+.+-...+|.+.+...+ T Consensus 165 gG~LktM~Pkl~~d~g~~~VIRPL~y~~E~di~~~a~~~~ 204 (311) T PRK10696 165 GGKMKGMPPKLMSDDGKHIVIRPLAYCREKDIERFADAKA 204 (311) T ss_pred CCCCCCCCCEEECCCCCEEEEEECCCCCHHHHHHHHHHCC T ss_conf 6876367976674899759983042015999999999759 No 42 >COG0519 GuaA GMP synthase, PP-ATPase domain/subunit [Nucleotide transport and metabolism] Probab=98.69 E-value=2.7e-07 Score=64.24 Aligned_cols=163 Identities=21% Similarity=0.134 Sum_probs=99.2 Q ss_pred CCCEEEEEECCHHHHHHHHHHHHC-CCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEEC--HHHCC Q ss_conf 994899980965699999999974-983999997677887772799999999999998887520356158750--11032 Q gi|254781031|r 8 APSALLLFSGGQDSSTCLSWALDR-FDRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDHILPL--AILGD 84 (240) Q Consensus 8 ~~~av~l~SGGlDS~~~l~~l~~~-~~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~ 84 (240) .+|+++.+|||+||++++.++-+. |..+++|++|.|---..+.+. +.+.+.+. +......++. .+++. T Consensus 21 ~~kvi~alSGGVDSsv~a~L~~~AiGd~l~cvfVD~GLlR~~E~e~---V~~~f~~~------~~~nl~~VdA~~~Fl~~ 91 (315) T COG0519 21 DGKVILALSGGVDSSVAAVLAHRAIGDQLTCVFVDHGLLRKGEAEQ---VVEMFREH------LGLNLIVVDAKDRFLSA 91 (315) T ss_pred CCEEEEEECCCCCHHHHHHHHHHHHHCCEEEEEECCCCCCCCCHHH---HHHHHHHH------CCCCEEEECHHHHHHHH T ss_conf 8539998338780899999999974052489996377434785699---99998753------29845997549889988 Q ss_pred CCCCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCC Q ss_conf 24443322201100012331011111112233377777743399365114544443455544302477777677530643 Q gi|254781031|r 85 ISHSSLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSGYPDCRHDTIRAIETAINLGMES 164 (240) Q Consensus 85 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 164 (240) +..-. .+....-.-.-.++......+..+++..+.-|....|.--+-......+....+.-.+.-.. T Consensus 92 L~Gvt-------------DPE~KRKiIG~~Fi~VFe~ea~k~~~~~LaQGTiYpDvIES~~g~~~~IKSHHNVGGLP~~m 158 (315) T COG0519 92 LKGVT-------------DPEEKRKIIGREFIEVFEEEAKKLGAEFLAQGTIYPDVIESGTGKAGTIKSHHNVGGLPEDM 158 (315) T ss_pred HCCCC-------------CHHHHHHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCEEEECCCCCCCCCCCCCCCCCCCCC T ss_conf 43999-------------88998999879999999999985786068704434504650688888621356268895014 Q ss_pred CEEEEEEECCCCHHHHHHHHHHHCCCHH Q ss_conf 2167753138988999999998282000 Q gi|254781031|r 165 HVTVHTPLMWLKKYETWKLAQDIGGQDL 192 (240) Q Consensus 165 ~~~i~~P~~~~tK~eI~~~~~~~~~~~~ 192 (240) .++++-|+..+-|.||.+++.+||.+.. T Consensus 159 ~lkLvEPLr~LfKDEVR~lg~~LGlp~~ 186 (315) T COG0519 159 KLKLVEPLRELFKDEVRELGRELGLPEE 186 (315) T ss_pred CEEEEHHHHHHHHHHHHHHHHHHCCCHH T ss_conf 2223077898868999999998599888 No 43 >KOG1706 consensus Probab=98.67 E-value=2.1e-07 Score=64.92 Aligned_cols=42 Identities=29% Similarity=0.472 Sum_probs=39.9 Q ss_pred HHCCCCCEEEEEECCHHHHHHHHHHHHCCCEEEEEEEECCCC Q ss_conf 213599489998096569999999997498399999767788 Q gi|254781031|r 4 IIKKAPSALLLFSGGQDSSTCLSWALDRFDRVETLSFDYGQR 45 (240) Q Consensus 4 ~~~~~~~av~l~SGGlDS~~~l~~l~~~~~~v~~i~~~ygq~ 45 (240) ||+..+++|+.+|||||.++.+.||+++|++|++..-|-||+ T Consensus 1 ~~~~~~~vVLAySGgLDTscil~WLkeqGyeViay~AnvGQ~ 42 (412) T KOG1706 1 MMSSKKSVVLAYSGGLDTSCILAWLKEQGYEVIAYLANVGQK 42 (412) T ss_pred CCCCCCEEEEEECCCCCCHHHHHHHHHCCCEEEEEECCCCCH T ss_conf 987774599995388672343588886296589840015626 No 44 >pfam01507 PAPS_reduct Phosphoadenosine phosphosulfate reductase family. This domain is found in phosphoadenosine phosphosulfate (PAPS) reductase enzymes or PAPS sulfotransferase. PAPS reductase is part of the adenine nucleotide alpha hydrolases superfamily also including N type ATP PPases and ATP sulphurylases. The enzyme uses thioredoxin as an electron donor for the reduction of PAPS to phospho-adenosine-phosphate (PAP). It is also found in NodP nodulation protein P from Rizobium which has ATP sulfurylase activity (sulfate adenylate transferase). Probab=98.57 E-value=1e-06 Score=60.70 Aligned_cols=152 Identities=14% Similarity=0.066 Sum_probs=73.0 Q ss_pred CEEEEEECCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECHHHCCCCCCC Q ss_conf 48999809656999999999749839999976778877727999999999999988875203561587501103224443 Q gi|254781031|r 10 SALLLFSGGQDSSTCLSWALDRFDRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDHILPLAILGDISHSS 89 (240) Q Consensus 10 ~av~l~SGGlDS~~~l~~l~~~~~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 89 (240) +.|++||||.||+++|.++.+.+..+..+++|.|.-.+-..+....+.++ . ....+..... ...... T Consensus 1 ~v~vsfSGGKDS~vlL~L~~~~~~~~~vvf~Dtg~efpet~~~v~~~~~~-------~---~~~~~~~~~~-~~~~~~-- 67 (174) T pfam01507 1 ELVVSFSGGKDSLVLLHLASKAFPPGPVIFIDTGYEFPETYEFVDELEEK-------Y---GLNLKVYRPE-DSFAEG-- 67 (174) T ss_pred CEEEEECCHHHHHHHHHHHHHHCCCCCEEEEECCCCCHHHHHHHHHHHHH-------H---CCEEEEEECC-HHHHHH-- T ss_conf 99999576099999999999848997579997899868999999999998-------4---9818998086-779877-- Q ss_pred CCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEE Q ss_conf 32220110001233101111111223337777774339936511454444345554430247777767753064321677 Q gi|254781031|r 90 LTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSGYPDCRHDTIRAIETAINLGMESHVTVH 169 (240) Q Consensus 90 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~ 169 (240) ..................+....+ .......+......|..+++... +...+.+. ........++ . T Consensus 68 ---~~~~~~~~~~~~~~~~~~~K~~p~---~~~l~~~~~~~~i~GiR~~Es~~-R~~~~~~~------~~~~~~~~~~-~ 133 (174) T pfam01507 68 ---INPEGIPSKLWEDCPCRLRKVEPL---KRALKKLDFDAWFTGLRRDESPS-RAKLPIVS------IDGDFPKVIK-V 133 (174) T ss_pred ---HHHCCCCHHHCCCCCHHHHHHHHH---HHHHHHCCCCEEEEEEECCCHHH-HHHCCEEE------EECCCCCEEE-E T ss_conf ---631279501145452146651799---99998659968999510024456-64071674------3057888599-9 Q ss_pred EEECCCCHHHHHHHHHHHC Q ss_conf 5313898899999999828 Q gi|254781031|r 170 TPLMWLKKYETWKLAQDIG 188 (240) Q Consensus 170 ~P~~~~tK~eI~~~~~~~~ 188 (240) +|.+++|..+|+....+.+ T Consensus 134 ~PI~~Wt~~DVw~yi~~~~ 152 (174) T pfam01507 134 FPLLNWTETDVWQYILANN 152 (174) T ss_pred ECHHHCCHHHHHHHHHHCC T ss_conf 5005299999999999839 No 45 >PRK05253 sulfate adenylyltransferase subunit 2; Provisional Probab=98.56 E-value=1.8e-06 Score=59.14 Aligned_cols=160 Identities=18% Similarity=0.224 Sum_probs=86.5 Q ss_pred CCHHHCCCCCEEEEEECCHHHHHHHHHHHHCCC----EEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEE Q ss_conf 912213599489998096569999999997498----3999997677887772799999999999998887520356158 Q gi|254781031|r 1 MNDIIKKAPSALLLFSGGQDSSTCLSWALDRFD----RVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDHI 76 (240) Q Consensus 1 ~~~~~~~~~~av~l~SGGlDS~~~l~~l~~~~~----~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~ 76 (240) |+|-....++-|+|||||.||+++|+++.+..+ -+.+||+|.|.+=. .+.++..++........+ T Consensus 19 ~RE~~a~f~~pvmL~S~GKDS~VmLhLarKAF~p~~iPfPllHIDTg~kF~-----------Emi~fRD~~a~e~gl~Li 87 (300) T PRK05253 19 IREVAAEFENPVMLYSIGKDSSVMLHLARKAFYPGKLPFPLLHVDTGWKFP-----------EMIEFRDRRAKELGLELI 87 (300) T ss_pred HHHHHHHHCCCEEEEECCCHHHHHHHHHHHHHCCCCCCCCEEEEECCCCHH-----------HHHHHHHHHHHHHCCCEE T ss_conf 999999736967987167259999999999857889996348830799818-----------999999999998499789 Q ss_pred EECHHHCCCCCCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCC--------------- Q ss_conf 75011032244433222011000123310111111122333777777433993651145444434--------------- Q gi|254781031|r 77 LPLAILGDISHSSLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYS--------------- 141 (240) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~--------------- 141 (240) +.... ..... ...+... -....++.+............+.++++.|...+... T Consensus 88 V~~n~-e~i~~-------g~~p~~~----g~~~~~~~lKT~~L~~al~~~~fda~~gGaRRDEe~sRAKERiFS~R~~~~ 155 (300) T PRK05253 88 VHSNP-EGIAR-------GINPFTH----GSAKHTNIMKTEGLKQALEKYGFDAAFGGARRDEEKSRAKERIFSFRDEFG 155 (300) T ss_pred EEECH-HHHHC-------CCCCCCC----CCHHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCC T ss_conf 97576-77756-------9995668----822231578789999999964876565101234432121254247756468 Q ss_pred --CCCCCCHHHHHHHHHHHHHCCCCCEEEEEEECCCCHHHHHHHHHH Q ss_conf --555443024777776775306432167753138988999999998 Q gi|254781031|r 142 --GYPDCRHDTIRAIETAINLGMESHVTVHTPLMWLKKYETWKLAQD 186 (240) Q Consensus 142 --~~~~~~~~~~~~~~~~~~~~~~~~~~i~~P~~~~tK~eI~~~~~~ 186 (240) .+++.+|++.+.++.-+. .+..++|. |+.++|--+||+.... T Consensus 156 ~Wdpk~QrPElW~lyn~~~~--~geh~RVf-PlsnWTElDIW~YI~~ 199 (300) T PRK05253 156 QWDPKNQRPELWNLYNGRIN--KGEHIRVF-PLSNWTELDIWQYIYR 199 (300) T ss_pred CCCCCCCCCHHHHHHCCCCC--CCCEEEEE-CCCCCHHHHHHHHHHH T ss_conf 88856698026676355679--99647751-1202257789999998 No 46 >PRK12563 sulfate adenylyltransferase subunit 2; Provisional Probab=98.48 E-value=4.8e-06 Score=56.55 Aligned_cols=159 Identities=14% Similarity=0.183 Sum_probs=85.9 Q ss_pred CHHHCCCCCEEEEEECCHHHHHHHHHHHHCCC----EEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEE Q ss_conf 12213599489998096569999999997498----39999976778877727999999999999988875203561587 Q gi|254781031|r 2 NDIIKKAPSALLLFSGGQDSSTCLSWALDRFD----RVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDHIL 77 (240) Q Consensus 2 ~~~~~~~~~av~l~SGGlDS~~~l~~l~~~~~----~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~ 77 (240) +|-....++-|+|||||.||+++++++.+..+ -+.+||+|.|.+- +.+.++...+........++ T Consensus 31 RE~~a~f~~pvmL~S~GKDS~VmlhLA~KAF~p~~iPfPllHIDTg~kF-----------~Emi~fRD~~a~~~~l~LiV 99 (312) T PRK12563 31 REVVAECSKPVMLYSIGKDSSVMLHLAMKAFRPTRPPFPLLHVDTTWKF-----------REMIDFRDRRAKELGLDLVV 99 (312) T ss_pred HHHHHHHCCCEEEEECCCHHHHHHHHHHHHHCCCCCCCCEEEEECCCCH-----------HHHHHHHHHHHHHHCCEEEE T ss_conf 9999973696798616742999999999973778999735886368875-----------99999999999981986898 Q ss_pred ECHHHCCCCCCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCC---------------- Q ss_conf 5011032244433222011000123310111111122333777777433993651145444434---------------- Q gi|254781031|r 78 PLAILGDISHSSLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYS---------------- 141 (240) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~---------------- 141 (240) ....- .... ...+.. ......++.+.......-....+.++++-|...+... T Consensus 100 ~~n~~-~i~~-------g~~p~~----~~~~~~~~~lKT~~L~~ale~~~fDa~~gGaRRDEEksRAKERiFS~Rd~~~~ 167 (312) T PRK12563 100 HHNPD-GIAR-------GIVPFR----HGSALHTDVAKTQGLKQALDHHGFDAAIGGARRDEEKSRAKERIFSFRSAFHR 167 (312) T ss_pred ECCHH-HHHC-------CCCCCC----CCCHHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCEEECCCCCCCC T ss_conf 32878-8866-------999676----78377766898899999999739887850453213423443403066677889 Q ss_pred -CCCCCCHHHHHHHHHHHHHCCCCCEEEEEEECCCCHHHHHHHHHH Q ss_conf -555443024777776775306432167753138988999999998 Q gi|254781031|r 142 -GYPDCRHDTIRAIETAINLGMESHVTVHTPLMWLKKYETWKLAQD 186 (240) Q Consensus 142 -~~~~~~~~~~~~~~~~~~~~~~~~~~i~~P~~~~tK~eI~~~~~~ 186 (240) .+++.+|++.+.++.-+. .+..++| .|+.++|--+||+.... T Consensus 168 WDPknQrPElWnlyN~~~~--~Geh~RV-fPlsNWTElDIW~YI~~ 210 (312) T PRK12563 168 WDPKAQRPELWSLYNARLR--RGESLRV-FPLSNWTELDVWQYIAR 210 (312) T ss_pred CCCCCCCHHHHHHHCCCCC--CCCEEEE-CCCCCCCHHHHHHHHHH T ss_conf 8855588358776155679--9971442-23556617899999998 No 47 >cd01713 PAPS_reductase This domain is found in phosphoadenosine phosphosulphate (PAPS) reductase enzymes or PAPS sulphotransferase. PAPS reductase is part of the adenine nucleotide alpha hydrolases superfamily also including N type ATP PPases and ATP sulphurylases. A highly modified version of the P loop, the fingerprint peptide of mononucleotide-binding proteins, is present in the active site of the protein, which appears to be a positively charged cleft containing a number of conserved arginine and lysine residues. Although PAPS reductase has no ATPase activity, it shows a striking similarity to the structure of the ATP pyrophosphatase (ATP PPase) domain of GMP synthetase, indicating that both enzyme families have evolved from a common ancestral nucleotide-binding fold. The enzyme uses thioredoxin as an electron donor for the reduction of PAPS to phospho-adenosine-phosphate (PAP) . It is also found in NodP nodulation protein P from Rhizobium meliloti which has ATP sulphurylase acti Probab=98.45 E-value=2.4e-06 Score=58.43 Aligned_cols=155 Identities=15% Similarity=0.089 Sum_probs=71.1 Q ss_pred CEEEEEECCHHHHHHHHHHHHCCC---EEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECHHHCCCC Q ss_conf 489998096569999999997498---39999976778877727999999999999988875203561587501103224 Q gi|254781031|r 10 SALLLFSGGQDSSTCLSWALDRFD---RVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDHILPLAILGDIS 86 (240) Q Consensus 10 ~av~l~SGGlDS~~~l~~l~~~~~---~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 86 (240) +.+++||||.||+++|+++.+... .+..+|+|.|.-.+-..+....+. ..+ ........ ....... T Consensus 1 ~~~vsfSGGKDS~vll~L~~~~~~~~~~~~vvf~DTg~e~pet~~~~~~~~-------~~~---~~~~~~~~-~~~~~~~ 69 (173) T cd01713 1 NVVVSFSGGKDSTVLLHLALKALPELKPVPVIFLDTGYEFPETYEFVDRVA-------ERY---GLPLVVVR-PPDSPAE 69 (173) T ss_pred CEEEEECCHHHHHHHHHHHHHHCCCCCCEEEEEECCCCCCHHHHHHHHHHH-------HHC---CCEEEEEE-CCCCHHH T ss_conf 959996464999999999999665558837999688898989999999999-------873---99079996-8972899 Q ss_pred CCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCE Q ss_conf 44332220110001233101111111223337777774339936511454444345554430247777767753064321 Q gi|254781031|r 87 HSSLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSGYPDCRHDTIRAIETAINLGMESHV 166 (240) Q Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 166 (240) ..... ......... .............+ .......+......|..+++........ .. .......+. T Consensus 70 ~~~~~-~~~~~~~~~-~~~~c~~~~K~~P~---~~~~~~~~~~~~~~GiR~~Es~~R~~~~-~~-------~~~~~~~~~ 136 (173) T cd01713 70 GLALG-LKGFPLPSP-DRRWCCRILKVEPL---RRALKELGVVAWITGIRRDESARRALLP-VV-------WTDDGKGGI 136 (173) T ss_pred HHHHH-CCCCCCCCC-CHHHHHHHHHHHHH---HHHHHHCCCCEEEEECCCCCCHHHCCCC-CC-------CCCCCCCCE T ss_conf 99861-357799832-26889888807899---9999832982899941214624444376-23-------432688998 Q ss_pred EEEEEECCCCHHHHHHHHHHHC Q ss_conf 6775313898899999999828 Q gi|254781031|r 167 TVHTPLMWLKKYETWKLAQDIG 188 (240) Q Consensus 167 ~i~~P~~~~tK~eI~~~~~~~~ 188 (240) -.++|+.++|..+|+....+.+ T Consensus 137 ~~~~Pi~~Wt~~dVw~yi~~~~ 158 (173) T cd01713 137 LKVNPLLDWTYEDVWAYLARHG 158 (173) T ss_pred EEEECHHHCCHHHHHHHHHHCC T ss_conf 9993228599999999999839 No 48 >pfam02540 NAD_synthase NAD synthase. NAD synthase (EC:6.3.5.1) is involved in the de novo synthesis of NAD and is induced by stress factors such as heat shock and glucose limitation. Probab=98.28 E-value=9.1e-06 Score=54.79 Aligned_cols=150 Identities=15% Similarity=0.111 Sum_probs=71.2 Q ss_pred CCCCEEEEEECCHHHHHHHHHHHHC-C-CEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECHHHCC Q ss_conf 5994899980965699999999974-9-8399999767788777279999999999999888752035615875011032 Q gi|254781031|r 7 KAPSALLLFSGGQDSSTCLSWALDR-F-DRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDHILPLAILGD 84 (240) Q Consensus 7 ~~~~av~l~SGGlDS~~~l~~l~~~-~-~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 84 (240) ..+++|+-+|||+||++++.++.+. | ..|+++..-+.+......+.+...++ ........+++...-. T Consensus 17 g~~gvViGlSGGIDSav~a~La~~AlG~~~v~~v~mP~~~ss~~s~~~A~~la~----------~lgi~~~~i~I~~~~~ 86 (243) T pfam02540 17 GAKGVVLGLSGGIDSAVVAYLAVKALGKENVLALIMPSINSSEEDVQDALALAE----------NLGINYKTIDIKPIVR 86 (243) T ss_pred CCCEEEEECCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHH----------HHCCCEEEEECHHHHH T ss_conf 999199978887999999999999559742899964777898889999999999----------9689269998889999 Q ss_pred CCCCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCC--CCCCCCCCHHHHHHHHHHHHHCC Q ss_conf 2444332220110001233101111111223337777774339936511454444--34555443024777776775306 Q gi|254781031|r 85 ISHSSLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETD--YSGYPDCRHDTIRAIETAINLGM 162 (240) Q Consensus 85 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~ 162 (240) ........... . ...-.............++...+ .++.|...-. ..++- .. . T Consensus 87 ~~~~~~~~~~~-----~---~a~~Ni~aR~Rm~~ly~~A~~~~--~lVlgT~NksE~~~Gy~-------Tk--------y 141 (243) T pfam02540 87 AFSQLFQPAKD-----D---LAKGNLKARIRMIILYAHANKFN--RLVLGTGNKSELALGYF-------TK--------Y 141 (243) T ss_pred HHHHHHCCCHH-----H---HHHHHHHHHHHHHHHHHHHCCCC--EEEEECCCCCCEEEEEE-------EE--------C T ss_conf 99988532013-----7---78745578888999998623379--18981687042005432-------31--------2 Q ss_pred CCCEEEEEEECCCCHHHHHHHHHHHCCCH Q ss_conf 43216775313898899999999828200 Q gi|254781031|r 163 ESHVTVHTPLMWLKKYETWKLAQDIGGQD 191 (240) Q Consensus 163 ~~~~~i~~P~~~~tK~eI~~~~~~~~~~~ 191 (240) +...--++|...++|.||.++++.++.+. T Consensus 142 GD~~~di~Pi~dL~KteV~~la~~l~vP~ 170 (243) T pfam02540 142 GDGACDIAPIGDLYKTQVYELAKRLNVPE 170 (243) T ss_pred CCCCCCHHHHCCCCHHHHHHHHHHHCCCH T ss_conf 77765515416881899999999949888 No 49 >cd00553 NAD_synthase NAD+ synthase is a homodimer, which catalyzes the final step in de novo nicotinamide adenine dinucleotide (NAD+) biosynthesis, an amide transfer from either ammonia or glutamine to nicotinic acid adenine dinucleotide (NaAD). The conversion of NaAD to NAD+ occurs via an NAD-adenylate intermediate and requires ATP and Mg2+. The intemediate is subsequently cleaved into NAD+ and AMP. In many prokaryotes, such as E. coli , NAD synthetase consists of a single domain and is strictly ammonia dependent. In contrast, eukaryotes and other prokaryotes have an additional N-terminal amidohydrolase domain that prefer glutamine, Interestingly, NAD+ synthases in these prokaryotes, can also utilize ammonia as an amide source . Probab=98.27 E-value=6.9e-06 Score=55.54 Aligned_cols=156 Identities=19% Similarity=0.084 Sum_probs=72.0 Q ss_pred CCCCEEEEEECCHHHHHHHHHHHHC-C-CEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECHHHCC Q ss_conf 5994899980965699999999974-9-8399999767788777279999999999999888752035615875011032 Q gi|254781031|r 7 KAPSALLLFSGGQDSSTCLSWALDR-F-DRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDHILPLAILGD 84 (240) Q Consensus 7 ~~~~av~l~SGGlDS~~~l~~l~~~-~-~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 84 (240) ..+++|+-+|||+||++++.++.+. | .+|++++.-.........+.+..+++.+ ......+++...-. T Consensus 22 g~~~~viGlSGGIDSav~a~La~~Alg~~~v~~v~mP~~~ss~~s~~~A~~la~~l----------gi~~~~i~I~~~~~ 91 (248) T cd00553 22 GFKGVVLGLSGGIDSALVAALAVRALGRENVLALFMPSRYSSEETREDAKELAEAL----------GIEHVNIDIDPAVE 91 (248) T ss_pred CCCCEEEECCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHHHH----------CCCEEEECHHHHHH T ss_conf 99919995888899999999999972887599988999789875899999999995----------89168715599999 Q ss_pred CCCCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCC Q ss_conf 24443322201100012331011111112233377777743399365114544443455544302477777677530643 Q gi|254781031|r 85 ISHSSLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSGYPDCRHDTIRAIETAINLGMES 164 (240) Q Consensus 85 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 164 (240) .......................... .....-.++...+...+.++--.+...++- . -.+. T Consensus 92 ~~~~~~~~~~~~~~~~~~~~NiqaRl----Rm~~Ly~~An~~~~lVlgTgNksE~~~Gy~-----------T----kyGD 152 (248) T cd00553 92 AFLALLGESGGSELEDLALGNIQARL----RMVILYALANKLGGLVLGTGNKSELLLGYF-----------T----KYGD 152 (248) T ss_pred HHHHHHHHHCCCCHHHHHHHHHHHHH----HHHHHHHHHHHCCCEEECCCCHHHHHHCCC-----------E----ECCC T ss_conf 99998644048705567874248788----899999999744998962874607875721-----------0----0078 Q ss_pred CEEEEEEECCCCHHHHHHHHHHHCCCH Q ss_conf 216775313898899999999828200 Q gi|254781031|r 165 HVTVHTPLMWLKKYETWKLAQDIGGQD 191 (240) Q Consensus 165 ~~~i~~P~~~~tK~eI~~~~~~~~~~~ 191 (240) ..-=++|...+.|.||.++++-++.+. T Consensus 153 ~~~d~~Pi~~L~Kt~V~~la~~l~vp~ 179 (248) T cd00553 153 GAADINPIGDLYKTQVRELARYLGVPE 179 (248) T ss_pred CCCCCCCCCCCCHHHHHHHHHHHCCHH T ss_conf 775822136884999999999968889 No 50 >COG2117 Predicted subunit of tRNA(5-methylaminomethyl-2-thiouridylate) methyltransferase, contains the PP-loop ATPase domain [Translation, ribosomal structure and biogenesis] Probab=98.18 E-value=1.1e-05 Score=54.24 Aligned_cols=144 Identities=22% Similarity=0.178 Sum_probs=72.3 Q ss_pred EEEEEECCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECHHHCCCCCCCC Q ss_conf 89998096569999999997498399999767788777279999999999999888752035615875011032244433 Q gi|254781031|r 11 ALLLFSGGQDSSTCLSWALDRFDRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDHILPLAILGDISHSSL 90 (240) Q Consensus 11 av~l~SGGlDS~~~l~~l~~~~~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 90 (240) +-+|||||.||+..|++|.+-|++|.+||+|+|--. .-+.|...++ .+...|.+...+-....+ +. T Consensus 3 v~vLfSGGKDSSLaA~iL~klgyev~LVTvnFGv~d--~~k~A~~tA~-----------~lgF~h~vl~Ldr~ile~-A~ 68 (198) T COG2117 3 VYVLFSGGKDSSLAALILDKLGYEVELVTVNFGVLD--SWKYARETAA-----------ILGFPHEVLQLDREILED-AV 68 (198) T ss_pred EEEEECCCCCHHHHHHHHHHHCCCCEEEEEEECCCC--CHHHHHHHHH-----------HHCCCCCEECCCHHHHHH-HH T ss_conf 689844897105899999871787179998733466--4066899999-----------829970000168899999-99 Q ss_pred CCCHHHHHCCCCCCHHHHHHHHHHHHHHH--HHHHHHCCCCEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEE Q ss_conf 22201100012331011111112233377--7777433993651145444434555443024777776775306432167 Q gi|254781031|r 91 TKNVAMKIQDNNLPNTFVPGRNIIFLVFA--ATLAYRLGITNIVIGVCETDYSGYPDCRHDTIRAIETAINLGMESHVTV 168 (240) Q Consensus 91 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~a~~~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i 168 (240) ..-.+..+ |.+..-.+... ...|.. ..+.+..|..++|- .|..... ..-++....++.+ T Consensus 69 em~iedg~----------P~~aIq~iH~~alE~~A~r-~~~~iaDGTRRDDr------vP~ls~~--~~qSLEdR~nv~Y 129 (198) T COG2117 69 EMIIEDGY----------PRNAIQYIHEMALEALASR-EVDRIADGTRRDDR------VPKLSRS--EAQSLEDRLNVQY 129 (198) T ss_pred HHHHHCCC----------CCHHHHHHHHHHHHHHHHH-HHHHHCCCCCCCCC------CCCCCHH--HHHHHHHHCCCEE T ss_conf 99985389----------8508899999999999877-78887578744566------7642498--8751887518645 Q ss_pred EEEECCCCHHHHHHHHHHH Q ss_conf 7531389889999999982 Q gi|254781031|r 169 HTPLMWLKKYETWKLAQDI 187 (240) Q Consensus 169 ~~P~~~~tK~eI~~~~~~~ 187 (240) ++|+.++-++.|-+++..+ T Consensus 130 i~PL~G~G~kti~~Lv~~~ 148 (198) T COG2117 130 IRPLLGLGYKTIRRLVSAI 148 (198) T ss_pred ECCCCCCCHHHHHHHHHHH T ss_conf 5303344678999998877 No 51 >KOG1622 consensus Probab=98.16 E-value=6.7e-06 Score=55.63 Aligned_cols=38 Identities=32% Similarity=0.414 Sum_probs=33.2 Q ss_pred CCCEEEEEECCHHHHHHHHHHHHC-C-CEEEEEEEECCCC Q ss_conf 994899980965699999999974-9-8399999767788 Q gi|254781031|r 8 APSALLLFSGGQDSSTCLSWALDR-F-DRVETLSFDYGQR 45 (240) Q Consensus 8 ~~~av~l~SGGlDS~~~l~~l~~~-~-~~v~~i~~~ygq~ 45 (240) ..++|++.|||+||++++.++.+. + ..++++++|-|-. T Consensus 230 ~~~Vl~~vSGgvdStV~a~Ll~~alg~~R~~ai~vdNG~m 269 (552) T KOG1622 230 DYKVLVAVSGGVDSTVCAALLRRALGPDRVHAIHVDNGFM 269 (552) T ss_pred CCCEEEEECCCCHHHHHHHHHHHHHCCCCEEEEEECCCCH T ss_conf 4455998348850899999999863789448998126401 No 52 >PRK06850 hypothetical protein; Provisional Probab=98.12 E-value=4.9e-05 Score=50.27 Aligned_cols=216 Identities=14% Similarity=0.075 Sum_probs=98.0 Q ss_pred EEEEEECCHHHHHHHHHHH---------HCCCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECHH Q ss_conf 8999809656999999999---------7498399999767788777279999999999999888752035615875011 Q gi|254781031|r 11 ALLLFSGGQDSSTCLSWAL---------DRFDRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDHILPLAI 81 (240) Q Consensus 11 av~l~SGGlDS~~~l~~l~---------~~~~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 81 (240) =|+-||||-||++++.+.. ++...|+.|+-|.+--.+.-..+.....+.+...+... ..+.......+ T Consensus 24 WvIGySGGKDSTavLqLvw~Al~~Lp~e~r~K~v~VIssDTlVEnPiV~~~v~~sL~~i~~~A~~~---~LPI~~~~v~P 100 (488) T PRK06850 24 WVIGYSGGKDSTAVLQLVWNALAGLPPEKRHKPVYVISTDTLVENPVVVAWVNKSLERIKEAAKKQ---GLPITPHKLTP 100 (488) T ss_pred EEEECCCCHHHHHHHHHHHHHHHHCCHHHCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHC---CCCCEEEEECC T ss_conf 799758771999999999999983788766898899958987567799999999999999999865---99822688148 Q ss_pred HCCCCCCCCCCCHHHHHCCCCCCHHHHHHHH--HHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCHHHHH--HH-HH Q ss_conf 0322444332220110001233101111111--2233377777743399365114544443455544302477--77-76 Q gi|254781031|r 82 LGDISHSSLTKNVAMKIQDNNLPNTFVPGRN--IIFLVFAATLAYRLGITNIVIGVCETDYSGYPDCRHDTIR--AI-ET 156 (240) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~~~~~~~~~--~~-~~ 156 (240) ....+.-..-....++.....+-|...|. .............+|...+..|.....+.. +..+..... .. .. T Consensus 101 --~~~dtFWvnlIGrGYPaP~~~FRWCTdRLKI~P~~rFI~~~v~~~ge~IlvLGtR~~ES~~-Ra~~m~k~e~~~~r~~ 177 (488) T PRK06850 101 --KVKDTFWVNLIGKGYPAPRRKFRWCTERLKINPSNDFIRDKVSAFGEVIVVLGTRKAESAA-RAQVMAKHKIGGTRDR 177 (488) T ss_pred --CCCCCHHHHHCCCCCCCCCCCCCCCCHHHHCCHHHHHHHHHHHHCCCEEEEEECCHHHHHH-HHHHHHHHHCCCCCCC T ss_conf --8654433542047679998766547566514848999999987539789998545277799-9999998752473012 Q ss_pred HHHHCCCCCEEEEEEECCCCHHHHHHHHHHHCCCHHC-----CCCHHHC---CCCCCCCCCCCCCCCCCCC----CCHHH Q ss_conf 7753064321677531389889999999982820001-----4325537---2162888655556688998----68778 Q gi|254781031|r 157 AINLGMESHVTVHTPLMWLKKYETWKLAQDIGGQDLV-----NLILEES---HTCYLGKRDKRYEWGYGCN----SCPAC 224 (240) Q Consensus 157 ~~~~~~~~~~~i~~P~~~~tK~eI~~~~~~~~~~~~~-----~~~~~~T---~SC~~~~~~~~~~~~~hCG----~C~~C 224 (240) +..-....+..+++|...++-.+||........+-.. ..+|... -.| |-. ....+..|| -||-| T Consensus 178 Ls~h~~l~n~~vy~PI~~ws~ddVW~yL~~~~nPWg~~n~~L~~lY~~A~~~gEC--Plv--~d~stpSCG~SRFGCWvC 253 (488) T PRK06850 178 LSRHSTLPNSFVYTPIEDWSNDDVWKYLLQWENPWGGSNKDLFTLYRGATADGEC--PLV--VDTSTPSCGNSRFGCWVC 253 (488) T ss_pred CCCCCCCCCCEEECCHHHCCCCHHHHHHHHCCCCCCCCHHHHHHHHHCCCCCCCC--CEE--ECCCCCCCCCCCCCCEEE T ss_conf 5778888886897125537733799998618998899769999997057889967--736--579999877876661677 Q ss_pred HHHH-----HHHHHHHH Q ss_conf 9999-----99997306 Q gi|254781031|r 225 YLRQ-----KGWMEFKE 236 (240) Q Consensus 225 ~~R~-----~af~~~~~ 236 (240) -.=. .||++-|. T Consensus 254 TvV~~DKSm~~mI~nge 270 (488) T PRK06850 254 TVVTKDKSMEAMIQNDE 270 (488) T ss_pred EEECCCHHHHHHHHCCC T ss_conf 77643372999987786 No 53 >PRK13980 NAD synthetase; Provisional Probab=98.10 E-value=3.9e-05 Score=50.86 Aligned_cols=151 Identities=20% Similarity=0.155 Sum_probs=70.5 Q ss_pred CCCCEEEEEECCHHHHHHHHHHHHC-C-CEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECHHHCC Q ss_conf 5994899980965699999999974-9-8399999767788777279999999999999888752035615875011032 Q gi|254781031|r 7 KAPSALLLFSGGQDSSTCLSWALDR-F-DRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDHILPLAILGD 84 (240) Q Consensus 7 ~~~~av~l~SGGlDS~~~l~~l~~~-~-~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 84 (240) ..+++|+-+|||+||++++.++.+. | .+|++|..=+...+....+.|..+++. .......+++...-. T Consensus 27 g~kg~VlGlSGGIDSavva~La~~Alg~~~v~~v~mP~~~ss~~s~~dA~~la~~----------lgi~~~~i~I~~~~~ 96 (264) T PRK13980 27 GFKGVVLGLSGGIDSAVVAYLAVKALGKENVLALLMPYRVSPPEDLEDALLVAER----------LGIEYKVIEITPIVD 96 (264) T ss_pred CCCEEEEECCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHH----------HCCCEEEEECHHHHH T ss_conf 9980999798688899999999985496606899898999988789999999998----------699849982799999 Q ss_pred CCCCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCC Q ss_conf 24443322201100012331011111112233377777743399365114544443455544302477777677530643 Q gi|254781031|r 85 ISHSSLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSGYPDCRHDTIRAIETAINLGMES 164 (240) Q Consensus 85 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 164 (240) ......... +..... ....-......-.++...+ .++.|.. +..++..-+... .+. T Consensus 97 ~~~~~~~~~------~~~~~~---NiqaR~Rm~~Ly~~An~~~--~lVlgTg---------NksE~~~Gy~Tk----yGD 152 (264) T PRK13980 97 AFFSAVPDA------DRLRRG---NIMARTRMVLLYDYANRDN--RLVLGTS---------NKSELLLGYFTK----YGD 152 (264) T ss_pred HHHHHCCCC------CHHHHH---HHHHHHHHHHHHHHHHHCC--CEEECCC---------CHHHHHHHHCCC----CCC T ss_conf 999863100------066888---7589899999999864339--7765588---------674798721001----687 Q ss_pred CEEEEEEECCCCHHHHHHHHHHHCCCH Q ss_conf 216775313898899999999828200 Q gi|254781031|r 165 HVTVHTPLMWLKKYETWKLAQDIGGQD 191 (240) Q Consensus 165 ~~~i~~P~~~~tK~eI~~~~~~~~~~~ 191 (240) ..-=++|...+.|.||.++++.++.+. T Consensus 153 ~~~d~~Pi~dL~Kt~V~~La~~l~vP~ 179 (264) T PRK13980 153 GAVDINPIGDLYKTQVRELARHLGVPE 179 (264) T ss_pred CCCCHHHHCCCCHHHHHHHHHHHCCCH T ss_conf 665624525873999999999939966 No 54 >cd01986 Alpha_ANH_like Adenine nucleotide alpha hydrolases superfamily including N type ATP PPases and ATP sulphurylases. The domain forms a apha/beta/apha fold which binds to Adenosine group.. Probab=98.10 E-value=9.3e-06 Score=54.76 Aligned_cols=34 Identities=32% Similarity=0.372 Sum_probs=31.3 Q ss_pred EEEEEECCHHHHHHHHHHHHCCCEEEEEEEECCC Q ss_conf 8999809656999999999749839999976778 Q gi|254781031|r 11 ALLLFSGGQDSSTCLSWALDRFDRVETLSFDYGQ 44 (240) Q Consensus 11 av~l~SGGlDS~~~l~~l~~~~~~v~~i~~~ygq 44 (240) +++++|||+||++.+++|.+.++++.++++|.|. T Consensus 1 ~~v~~sgG~ds~~~~~~l~~~~~~~~~~~~~~~~ 34 (103) T cd01986 1 VLVAFSGGKDSSVAAALLKKLGYQVIAVTVDHGI 34 (103) T ss_pred CEEEECCCCHHHHHHHHHHHCCCCEEEEEEECCC T ss_conf 9897428822199999999869862899985376 No 55 >TIGR02039 CysD sulfate adenylyltransferase, small subunit; InterPro: IPR011784 Metabolic assimilation of sulphur from inorganic sulphate, requires sulphate activation by coupling to a nucleoside, for the production of high-energy nucleoside phosphosulphates. This pathway appears to be similar in all prokaryotic organisms. Activation is first achieved through reaction of sulphate with ATP by sulphate adenylyltransferase (ATP sulphurylase) to produce 5'-phosphosulphate (APS), coupled by GTP hydrolysis. Subsequently, APS is phosphorylated by an APS kinase to produce 3'-phosphoadenosine-5'-phosphosulphate (PAPS). In Escherichia coli, ATP sulphurylase is a heterodimer composed of two subunits encoded by cysD and cysN, with APS kinase encoded by cysC. These genes are located in a unidirectionally transcribed gene cluster, and have been shown to be required for the synthesis of sulphur-containing amino acids. Homologous to this E. coli activation pathway are nodPQH gene products found among members of the Rhizobiaceae family. These gene products have been shown to exhibit ATP sulphurase and APS kinase activity, yet are involved in Nod factor sulphation, and sulphation of other macromolecules.; GO: 0004781 sulfate adenylyltransferase (ATP) activity, 0019421 sulfate reduction APS pathway. Probab=98.09 E-value=7.7e-05 Score=49.05 Aligned_cols=159 Identities=18% Similarity=0.160 Sum_probs=84.5 Q ss_pred CCHHHCCCCCEEEEEECCHHHHHHHHHHHHCCC----EEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEE Q ss_conf 912213599489998096569999999997498----3999997677887772799999999999998887520356158 Q gi|254781031|r 1 MNDIIKKAPSALLLFSGGQDSSTCLSWALDRFD----RVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDHI 76 (240) Q Consensus 1 ~~~~~~~~~~av~l~SGGlDS~~~l~~l~~~~~----~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~ 76 (240) |+|.+....+=|+|||+|-||+++|+++.+..+ -..++|+|.|.+ -. .+...............+ T Consensus 12 ~REvaa~F~~PV~LyS~GKDS~VlLHLA~KaF~Pg~~PFplLHvDTgw~--f~---------E~~~fRD~~~~~~~~~L~ 80 (295) T TIGR02039 12 IREVAAEFERPVLLYSIGKDSSVLLHLARKAFYPGRLPFPLLHVDTGWK--FR---------EMIAFRDELVAKYGLELI 80 (295) T ss_pred HHHHHHCCCCCEEEEECCCHHHHHHHHHHHCCCCCCCCCCEEEEECCCC--HH---------HHHHHHHHHHHHHCCEEE T ss_conf 9987630379768986573479999987640588887973577406632--78---------999999999987097688 Q ss_pred EECHHHCCCCCCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHH-HCCCCEEECCCCCCCCC-------------- Q ss_conf 750110322444332220110001233101111111223337777774-33993651145444434-------------- Q gi|254781031|r 77 LPLAILGDISHSSLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAY-RLGITNIVIGVCETDYS-------------- 141 (240) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~-~~g~~~i~~~~~~~~~~-------------- 141 (240) +....-+.. .+..+..........++-+.+.-.|. ..+.++.+-|...+.-. T Consensus 81 v~~~~~~~~-------------~g~~P~~~~~~~~~~~~~T~~L~~Al~~~~FDAa~gGARRdEE~sRaKERifS~R~~~ 147 (295) T TIGR02039 81 VHSNEEGIA-------------EGINPFTEGSALHTDIMKTEALRQALDKNQFDAAFGGARRDEEKSRAKERIFSFRDAF 147 (295) T ss_pred EEECHHHHH-------------HCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCHHHHCCCCCCEEHHCCCC T ss_conf 850502354-------------1566201012455555631899999873388656316631344301201000100345 Q ss_pred ---CCCCCCHHHHHHHHHHHHHCCCCCEEEEEEECCCCHHHHHHHHHH Q ss_conf ---555443024777776775306432167753138988999999998 Q gi|254781031|r 142 ---GYPDCRHDTIRAIETAINLGMESHVTVHTPLMWLKKYETWKLAQD 186 (240) Q Consensus 142 ---~~~~~~~~~~~~~~~~~~~~~~~~~~i~~P~~~~tK~eI~~~~~~ 186 (240) .++..+|+.....+..+. .|..++++ |+-++|-.+||+.... T Consensus 148 h~WDPk~QRPELW~~YNg~~~--~GE~~RvF-PLSNwTE~DiW~Y~~~ 192 (295) T TIGR02039 148 HQWDPKKQRPELWNLYNGRIS--KGESVRVF-PLSNWTELDIWRYIAA 192 (295) T ss_pred CCCCCCCCCHHHHHHCCCCCC--CCCEEEEE-CCCCCCHHHHHHHHHH T ss_conf 788851257377875052346--88726872-2666504458988743 No 56 >TIGR03183 DNA_S_dndC putative sulfurtransferase DndC. Members of this protein family are the DndC protein from the dnd (degradation during electrophoresis) operon. The dnd phenotype reflects a sulfur-containing modification to DNA. This operon is sparsely and sporadically distributed among bactera; among the first eight examples are members from the Actinobacteria, Firmicutes, Gammaproteobacteria, Cyanobacteria. DndC is suggested to be a sulfurtransferase. Probab=98.04 E-value=3.6e-05 Score=51.09 Aligned_cols=170 Identities=12% Similarity=0.048 Sum_probs=79.4 Q ss_pred EEEEEECCHHHHHHHHHHH---------HCCCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEE-EECH Q ss_conf 8999809656999999999---------74983999997677887772799999999999998887520356158-7501 Q gi|254781031|r 11 ALLLFSGGQDSSTCLSWAL---------DRFDRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDHI-LPLA 80 (240) Q Consensus 11 av~l~SGGlDS~~~l~~l~---------~~~~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~-~~~~ 80 (240) =|+-||||-||++++.+.. ++...|+.|+-|.+--.+.-........+.+...+... ..+... +-.+ T Consensus 16 WvIGySGGKDSTavLqLvw~Al~~Lp~e~r~K~v~VIssDTlVEnP~V~~~v~~~L~~i~~~a~~~---~LPi~~~~~~P 92 (447) T TIGR03183 16 WVIGYSGGKDSTAVLQLVWNALSELPPEQRTKKIHVISTDTLVENPIVAAWVNVSLERMQEAAQKQ---GLPIEPHRLTP 92 (447) T ss_pred EEEECCCCHHHHHHHHHHHHHHHHCCHHHCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHC---CCCCEEEEECC T ss_conf 799758773999999999999984788757688899979987557799999999999999999974---99823689337 Q ss_pred HHCCCCCCCCCCCHHHHHCCCCCCHHHHHHHH--HHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCHHHHHH---HH Q ss_conf 10322444332220110001233101111111--22333777777433993651145444434555443024777---77 Q gi|254781031|r 81 ILGDISHSSLTKNVAMKIQDNNLPNTFVPGRN--IIFLVFAATLAYRLGITNIVIGVCETDYSGYPDCRHDTIRA---IE 155 (240) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~~~~~~~~~~---~~ 155 (240) .+. .+.-..-....++.....+-|...+. .............+|...+..|.....+.. +..+...... .. T Consensus 93 ~~~---dtFWvnlIGrGYPaP~~~FRWCTdrLKI~P~~~fi~~~v~~~ge~ilvLGtR~~ES~~-Ra~~i~k~e~~~~~~ 168 (447) T TIGR03183 93 AIE---DTFWVNLIGKGYPAPRQKFRWCTDRLKISPSNTFIRDKVAANGEVILVLGTRKNESQA-RAAVMEKHEGGSTRD 168 (447) T ss_pred CCC---CCHHHHHCCCCCCCCCCCCCCCCHHHHCCHHHHHHHHHHHCCCCEEEEEECCHHHHHH-HHHHHHHHHCCCCHH T ss_conf 854---4300010346307999876525576636828999998652149689999565277799-999999864467042 Q ss_pred HHHHHCCCCCEEEEEEECCCCHHHHHHHHHHH Q ss_conf 67753064321677531389889999999982 Q gi|254781031|r 156 TAINLGMESHVTVHTPLMWLKKYETWKLAQDI 187 (240) Q Consensus 156 ~~~~~~~~~~~~i~~P~~~~tK~eI~~~~~~~ 187 (240) .+..-....+..+++|...++-.|||...... T Consensus 169 ~ls~h~~l~n~~vy~PI~dws~ddVW~yL~~~ 200 (447) T TIGR03183 169 RLSRNSSLPNSWVYTPIEDWSNDDVWMYLLQN 200 (447) T ss_pred HCCCCCCCCCCEEECCHHHCCHHHHHHHHCCC T ss_conf 12768888883797306747640899986338 No 57 >PRK00876 nadE NAD synthetase; Reviewed Probab=98.03 E-value=2.4e-05 Score=52.23 Aligned_cols=35 Identities=34% Similarity=0.326 Sum_probs=28.1 Q ss_pred CCCEEEEEECCHHHHHHHHHHHHC-C-CEEEEEEEEC Q ss_conf 994899980965699999999974-9-8399999767 Q gi|254781031|r 8 APSALLLFSGGQDSSTCLSWALDR-F-DRVETLSFDY 42 (240) Q Consensus 8 ~~~av~l~SGGlDS~~~l~~l~~~-~-~~v~~i~~~y 42 (240) .|.+|+-+|||+||++++.++.+. | .+|+++..-. T Consensus 33 ~kG~VlGlSGGIDSAv~a~Lav~AlG~e~V~gl~MP~ 69 (325) T PRK00876 33 RRGVVLGLSGGIDSSVTLALCVRALGKDRVLGLLMPE 69 (325) T ss_pred CCEEEEECCCCHHHHHHHHHHHHHHCCCCEEEEECCC T ss_conf 8679996876888999999999974855169997886 No 58 >COG0175 CysH 3'-phosphoadenosine 5'-phosphosulfate sulfotransferase (PAPS reductase)/FAD synthetase and related enzymes [Amino acid transport and metabolism / Coenzyme metabolism] Probab=98.02 E-value=7.9e-05 Score=48.97 Aligned_cols=156 Identities=14% Similarity=0.029 Sum_probs=78.8 Q ss_pred CCCCEEEEEECCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECHHHCCCC Q ss_conf 59948999809656999999999749839999976778877727999999999999988875203561587501103224 Q gi|254781031|r 7 KAPSALLLFSGGQDSSTCLSWALDRFDRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDHILPLAILGDIS 86 (240) Q Consensus 7 ~~~~av~l~SGGlDS~~~l~~l~~~~~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 86 (240) ....+++++|||.||++++.++.+....+..+|+|.|...+-..+....+..+.. ....+......... T Consensus 38 ~~~~~~~~~S~Gkds~V~l~L~~k~~~~~~vif~DTg~~f~Et~~~~d~~~~~~~-----------~~l~~~~~~~~~~~ 106 (261) T COG0175 38 FSNPVVVSFSGGKDSTVLLHLAAKAFPDFPVIFLDTGYHFPETYEFRDRLAEEYG-----------LDLKVYRPDDEVAE 106 (261) T ss_pred CCCCEEEEECCCHHHHHHHHHHHHHCCCCCEEEEECCCCCHHHHHHHHHHHHHCC-----------CEEEEECCCHHHHH T ss_conf 4897599812746799999999860258718997077757889999999998729-----------71798167323443 Q ss_pred CCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCE Q ss_conf 44332220110001233101111111223337777774339936511454444345554430247777767753064321 Q gi|254781031|r 87 HSSLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSGYPDCRHDTIRAIETAINLGMESHV 166 (240) Q Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 166 (240) . .........+. ........+....+..+.. ..+..+.+.|..+.+... +...+.+-.. ....+. T Consensus 107 ~--~~~~~~~~~~~--~~r~c~~i~K~~pl~~al~---~~~~~a~~~G~Rrdes~~-Rak~~~~~~~-------~~~~~~ 171 (261) T COG0175 107 G--EKYGGKLWEPS--VERWCCDIRKVEPLKRALD---EYGFDAWFTGLRRDESPT-RAKLPVVSFD-------SEFGES 171 (261) T ss_pred H--HHCCCCCCCCC--CCHHHHHHHHHHHHHHHHH---HCCCCEEEEECCCCCCCC-CCCCCEECCC-------CCCCCC T ss_conf 4--42136679997--4023325685513999996---358624998612045632-0248645346-------787683 Q ss_pred EEEEEECCCCHHHHHHHHHHHC Q ss_conf 6775313898899999999828 Q gi|254781031|r 167 TVHTPLMWLKKYETWKLAQDIG 188 (240) Q Consensus 167 ~i~~P~~~~tK~eI~~~~~~~~ 188 (240) ..++|+.++|..+|+......+ T Consensus 172 ~rv~Pl~~Wt~~dVw~Yi~~~~ 193 (261) T COG0175 172 IRVNPLADWTELDVWLYILANN 193 (261) T ss_pred EEECCHHCCCHHHHHHHHHHHC T ss_conf 6871532099999999999958 No 59 >PRK13981 NAD synthetase; Provisional Probab=98.01 E-value=5.3e-05 Score=50.06 Aligned_cols=150 Identities=17% Similarity=0.140 Sum_probs=71.2 Q ss_pred CCCEEEEEECCHHHHHHHHHHHHC-C-CEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECHHHCCC Q ss_conf 994899980965699999999974-9-83999997677887772799999999999998887520356158750110322 Q gi|254781031|r 8 APSALLLFSGGQDSSTCLSWALDR-F-DRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDHILPLAILGDI 85 (240) Q Consensus 8 ~~~av~l~SGGlDS~~~l~~l~~~-~-~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 85 (240) .+++|+-+|||+||+++|.++.+. | .+|++|++=.-..+....+.|...++. ....++.+++...-.. T Consensus 281 f~~vviGLSGGiDSaL~aaiA~dALG~~nV~~v~MPs~~tS~~s~~dA~~La~~----------LGi~~~~i~I~~~~~~ 350 (543) T PRK13981 281 FPGVVLGLSGGIDSALVAAIAVDALGAENVRAVMMPSRYTSDESLDDAAALARN----------LGVRYDIIPIEPAFEA 350 (543) T ss_pred CCEEEEECCCCHHHHHHHHHHHHHHCHHHEEEEECCCCCCCHHHHHHHHHHHHH----------HCCCEEEEEHHHHHHH T ss_conf 974999767883599999999985383437886458777866569999999999----------5997487514999999 Q ss_pred CCCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCC Q ss_conf 44433222011000123310111111122333777777433993651145444434555443024777776775306432 Q gi|254781031|r 86 SHSSLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSGYPDCRHDTIRAIETAINLGMESH 165 (240) Q Consensus 86 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 165 (240) ..............+....-.+.-.|..+++ .++...|...+.+|--.+-..++- ...+.- T Consensus 351 ~~~~l~~~f~~~~~dvt~ENiQAR~R~~iLm----~laN~~g~lvl~TgnkSE~avGy~---------------TlyGD~ 411 (543) T PRK13981 351 FEAALAPLFAGTEPDITEENLQSRIRGTLLM----ALSNKFGSLVLTTGNKSEMAVGYA---------------TLYGDM 411 (543) T ss_pred HHHHHHHHHCCCCCCCCHHHHHHHHHHHHHH----HHHHCCCCEEECCCCHHHHHHCHH---------------HHCCCC T ss_conf 9997404416876665223454888999999----987338967964786727874746---------------533765 Q ss_pred EEEEEEECCCCHHHHHHHHHH Q ss_conf 167753138988999999998 Q gi|254781031|r 166 VTVHTPLMWLKKYETWKLAQD 186 (240) Q Consensus 166 ~~i~~P~~~~tK~eI~~~~~~ 186 (240) ..-++|.....|.+|..+++= T Consensus 412 ~g~~avi~dv~KT~V~~L~r~ 432 (543) T PRK13981 412 AGGFAPIKDVYKTLVYRLCRW 432 (543) T ss_pred CCCCCCCCCCCHHHHHHHHHH T ss_conf 567430368659999999999 No 60 >PRK08576 hypothetical protein; Provisional Probab=98.00 E-value=1.7e-05 Score=53.10 Aligned_cols=51 Identities=31% Similarity=0.426 Sum_probs=42.5 Q ss_pred CEEEEEECCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCCHHHHHHHHHHH Q ss_conf 489998096569999999997498399999767788777279999999999 Q gi|254781031|r 10 SALLLFSGGQDSSTCLSWALDRFDRVETLSFDYGQRNKVELECRLCVRKKI 60 (240) Q Consensus 10 ~av~l~SGGlDS~~~l~~l~~~~~~v~~i~~~ygq~~~~~~~~a~~~~~~~ 60 (240) .+||.||||-||++++.++++...++.+|++|.|.-++.+.+....+++++ T Consensus 237 ~iiVp~SGGKDStA~LlLA~ea~~dv~aVfvDTGlEfPeT~eyVe~va~kl 287 (439) T PRK08576 237 TIIVPWSGGKDSTAALLLALKAFDEVTAVYVDTGYEMPLTDEYVEKVAEKL 287 (439) T ss_pred CEEEECCCCHHHHHHHHHHHHHHCCEEEEEEECCCCCHHHHHHHHHHHHHH T ss_conf 889956887799999999998618836999707986575899999999985 No 61 >PRK08557 hypothetical protein; Provisional Probab=97.96 E-value=1.8e-05 Score=52.90 Aligned_cols=50 Identities=22% Similarity=0.224 Sum_probs=39.7 Q ss_pred CEEEEEECCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCCHHHHHHHHHH Q ss_conf 48999809656999999999749839999976778877727999999999 Q gi|254781031|r 10 SALLLFSGGQDSSTCLSWALDRFDRVETLSFDYGQRNKVELECRLCVRKK 59 (240) Q Consensus 10 ~av~l~SGGlDS~~~l~~l~~~~~~v~~i~~~ygq~~~~~~~~a~~~~~~ 59 (240) -+.|+||||-||+++|.++.+.+.++..++.|.|--.+-+.+....+++. T Consensus 185 pV~VSfSGGKDS~a~L~La~~a~~~~~vvF~DTGlEfPeT~eyve~~~~~ 234 (420) T PRK08557 185 AINASFSGGKDSAVSTLLSKEVIPDLDVVFIDTGLEYPETLNYVKDFAKK 234 (420) T ss_pred CEEEEECCCHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHHHH T ss_conf 46996068789999999999854881699962887772389999999998 No 62 >PRK13794 hypothetical protein; Provisional Probab=97.94 E-value=2.3e-05 Score=52.28 Aligned_cols=52 Identities=29% Similarity=0.289 Sum_probs=39.2 Q ss_pred CCCEEEEEECCHHHHHHHHHHHHCC-CEEEEEEEECCCCCCCCHHHHHHHHHH Q ss_conf 9948999809656999999999749-839999976778877727999999999 Q gi|254781031|r 8 APSALLLFSGGQDSSTCLSWALDRF-DRVETLSFDYGQRNKVELECRLCVRKK 59 (240) Q Consensus 8 ~~~av~l~SGGlDS~~~l~~l~~~~-~~v~~i~~~ygq~~~~~~~~a~~~~~~ 59 (240) +..+.|+||||-||+++|.++++.. .++..+|.|.|--.+-+.+....++++ T Consensus 245 ~~pv~VSfSGGKDS~v~L~La~ka~~~~~~vvF~DTglEfPeT~e~ve~v~~~ 297 (473) T PRK13794 245 GKPPSVAYSGGKDSLATLLLALKAFGNEFPVLFADTGLEFPETLENVEDVEEL 297 (473) T ss_pred CCCEEEEECCCHHHHHHHHHHHHHCCCCEEEEEEECCCCCHHHHHHHHHHHHH T ss_conf 99839970686899999999998608971699960888767699999999998 No 63 >TIGR03573 WbuX N-acetyl sugar amidotransferase. This enzyme has been implicated in the formation of the acetamido moiety (sugar-NC(=NH)CH3) which is found on some exopolysaccharides and is positively charged at neutral pH. The reaction involves ligation of ammonia with a sugar N-acetyl group, displacing water. In E. coli (O145 strain) and Pseudomonas aeruginosa (O12 strain) this gene is known as wbuX and ifnA respectively and likely acts on sialic acid. In Campylobacter jejuni, the gene is known as pseA and acts on pseudaminic acid in the process of flagellin glycosylation. In other Pseudomonas strains and various organisms it is unclear what the identity of the sugar substrate is, and in fact, the phylogenetic tree of this family sports a considerably deep branching suggestive of possible major differences in substrate structure. Nevertheless, the family is characterized by a conserved tetracysteine motif (CxxC.....[GN]xCxxC) possibly indicative of a metal binding site, as well as an Probab=97.92 E-value=5.7e-05 Score=49.87 Aligned_cols=110 Identities=15% Similarity=0.103 Sum_probs=61.4 Q ss_pred CEEEEEECCHHHHHHHHHHHHC-CCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECHHHCCCCCC Q ss_conf 4899980965699999999974-983999997677887772799999999999998887520356158750110322444 Q gi|254781031|r 10 SALLLFSGGQDSSTCLSWALDR-FDRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDHILPLAILGDISHS 88 (240) Q Consensus 10 ~av~l~SGGlDS~~~l~~l~~~-~~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 88 (240) .|||.+|||.||+-.|+.|+++ |.++.+||++-|...... .++...+...+..+++...........- T Consensus 61 DCiv~vSGGkDS~y~~~~l~~~~gl~pL~vt~d~~~~t~~g-----------~~Ni~~l~~~lgvD~i~~~~n~~~~k~l 129 (343) T TIGR03573 61 DCIIGVSGGKDSTYQAHVLKKKLGLNPLLVTVDPGWNTELG-----------VKNLNNLIKKLGFDLHTITINPETFRKL 129 (343) T ss_pred CEEEECCCCHHHHHHHHHHHHHHCCCEEEEEECCCCCCHHH-----------HHHHHHHHHHCCCCEEEECCCHHHHHHH T ss_conf 68986887728999999999982992599983598779899-----------9999999983699858746999999999 Q ss_pred CCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCC Q ss_conf 33222011000123310111111122333777777433993651145444 Q gi|254781031|r 89 SLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCET 138 (240) Q Consensus 89 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~ 138 (240) . .. .... .....++. .......+...|...++..|..|.... T Consensus 130 ~--k~---~~~~--~gd~~~~~-~~~i~~~~~~iA~k~~IplIi~Gen~~ 171 (343) T TIGR03573 130 Q--RA---YFKK--VGDPEWPQ-DHAIFASVYQVALKFNIPLIIWGENIA 171 (343) T ss_pred H--HH---HHHH--CCCHHHHH-HHHHHHHHHHHHHHCCCCEEEECCCCC T ss_conf 9--99---9986--68946999-999999999999981999899756701 No 64 >cd01994 Alpha_ANH_like_IV This is a subfamily of Adenine nucleotide alpha hydrolases superfamily.Adeninosine nucleotide alpha hydrolases superfamily includes N type ATP PPases and ATP sulphurylases. It forms a apha/beta/apha fold which binds to Adenosine group. This subfamily of proteins is predicted to bind ATP. This domainhas a strongly conserved motif SGGKD at the N terminus. Probab=97.91 E-value=0.00045 Score=44.32 Aligned_cols=133 Identities=16% Similarity=0.081 Sum_probs=68.9 Q ss_pred CEEEEEECCHHHHHHHHHHHHCCCEEEEEEEECCCC------CCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECHHHC Q ss_conf 489998096569999999997498399999767788------77727999999999999988875203561587501103 Q gi|254781031|r 10 SALLLFSGGQDSSTCLSWALDRFDRVETLSFDYGQR------NKVELECRLCVRKKIVELMPKWKDSLGEDHILPLAILG 83 (240) Q Consensus 10 ~av~l~SGGlDS~~~l~~l~~~~~~v~~i~~~ygq~------~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 83 (240) |.++|+|||.||+..++++.++|++|..+-.-.-+. |....+....+++ .-..++..+.... T Consensus 1 Kv~~l~SGGKDS~lAl~~a~~~g~~v~~L~~~~~~~~~~~~~H~~~~~l~~~qAe----------algiPl~~~~~~~-- 68 (194) T cd01994 1 KVVALISGGKDSCYALYRALEEGHEVVALLNLTPEEGSSMMYHTVNHELLELQAE----------AMGIPLIRIEISG-- 68 (194) T ss_pred CEEEEECCCHHHHHHHHHHHHCCCEEEEEEEEECCCCCEEECCCCCHHHHHHHHH----------HCCCCEEEEECCC-- T ss_conf 9899977859999999999986992599999963998805250557899999999----------8599669996689-- Q ss_pred CCCCCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHCCC Q ss_conf 22444332220110001233101111111223337777774339936511454444345554430247777767753064 Q gi|254781031|r 84 DISHSSLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSGYPDCRHDTIRAIETAINLGME 163 (240) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 163 (240) ..+ . ..-.... ...-+...|++.+..|....+. ...+... .... T Consensus 69 ---------~~~--------~-~~~~l~~------~L~~~k~~gi~~vv~GdI~s~~------qr~~~e~------~c~~ 112 (194) T cd01994 69 ---------EEE--------D-EVEDLKE------LLRKLKEEGVDAVVFGAILSEY------QRTRVER------VCER 112 (194) T ss_pred ---------CCC--------H-HHHHHHH------HHHHHHHCCCCEEEECCCCCHH------HHHHHHH------HHHH T ss_conf ---------871--------7-9999999------9999997599599999633288------9999999------9997 Q ss_pred CCEEEEEEECCCCHHHHHHHHHHHCCC Q ss_conf 321677531389889999999982820 Q gi|254781031|r 164 SHVTVHTPLMWLKKYETWKLAQDIGGQ 190 (240) Q Consensus 164 ~~~~i~~P~~~~tK~eI~~~~~~~~~~ 190 (240) .++..+.|++..+-.++++...+.|.. T Consensus 113 lgl~~~~PLW~~~~~~ll~e~i~~Gf~ 139 (194) T cd01994 113 LGLEPLAPLWGRDQEELLREMIEAGFK 139 (194) T ss_pred CCCEEECHHCCCCHHHHHHHHHHCCCE T ss_conf 398887001079999999999987990 No 65 >cd01991 Asn_Synthase_B_C The C-terminal domain of Asparagine Synthase B. This domain is always found associated n-terminal amidotransferase domain. Family members that contain this domain catalyse the conversion of aspartate to asparagine. Asparagine synthetase B catalyzes the assembly of asparagine from aspartate, Mg(2+)ATP, and glutamine. The three-dimensional architecture of the N-terminal domain of asparagine synthetase B is similar to that observed for glutamine phosphoribosylpyrophosphate amidotransferase while the molecular motif of the C-domain is reminiscent to that observed for GMP synthetase . Probab=97.85 E-value=5e-05 Score=50.22 Aligned_cols=56 Identities=27% Similarity=0.341 Sum_probs=36.4 Q ss_pred HCCCCCEEEEEECCHHHHHHHHHHHHCC-CEEEEEEEECCCCCCCCHHHHHHHHHHH Q ss_conf 1359948999809656999999999749-8399999767788777279999999999 Q gi|254781031|r 5 IKKAPSALLLFSGGQDSSTCLSWALDRF-DRVETLSFDYGQRNKVELECRLCVRKKI 60 (240) Q Consensus 5 ~~~~~~av~l~SGGlDS~~~l~~l~~~~-~~v~~i~~~ygq~~~~~~~~a~~~~~~~ 60 (240) +.....+-+++|||+||++++..+.+.+ ..+.+.++.++.....+.+.+..+++.+ T Consensus 12 l~sd~~vg~~LSGGlDSs~ia~~~~~~~~~~~~~~s~~~~~~~~de~~~a~~va~~~ 68 (269) T cd01991 12 LRSDVPVGVLLSGGLDSSLVAALAARLLPEPVKTFSIGFGFEGSDEREYARRVAEHL 68 (269) T ss_pred HCCCCCEEEEECCHHHHHHHHHHHHHHCCCCCCEEEEECCCCCCCHHHHHHHHHHHC T ss_conf 544885686513569999999999983689984799862899973789999999980 No 66 >pfam00733 Asn_synthase Asparagine synthase. This family is always found associated with pfam00310. Members of this family catalyse the conversion of aspartate to asparagine. Probab=97.83 E-value=4.2e-05 Score=50.69 Aligned_cols=54 Identities=22% Similarity=0.203 Sum_probs=36.5 Q ss_pred CCCCEEEEEECCHHHHHHHHHHHHC-CCEEEEEEEECCCCCCCCHHHHHHHHHHH Q ss_conf 5994899980965699999999974-98399999767788777279999999999 Q gi|254781031|r 7 KAPSALLLFSGGQDSSTCLSWALDR-FDRVETLSFDYGQRNKVELECRLCVRKKI 60 (240) Q Consensus 7 ~~~~av~l~SGGlDS~~~l~~l~~~-~~~v~~i~~~ygq~~~~~~~~a~~~~~~~ 60 (240) .....=++||||+||+++++++.+. +.++.+.++.+......+...+..+++.+ T Consensus 16 sdvpig~~LSGGlDSs~ia~l~~~~~~~~i~~~s~~~~~~~~~E~~~a~~~a~~~ 70 (195) T pfam00733 16 ADVPVGVLLSGGLDSSLIAALAARQSSPPLKTFSVGFEGSDYDEAPYAELVADHL 70 (195) T ss_pred CCCCEEEECCCCHHHHHHHHHHHHHCCCCCEEEECCCCCCCCCHHHHHHHHHHHH T ss_conf 6983675306746899999999985389953895578889975899999999651 No 67 >COG0171 NadE NAD synthase [Coenzyme metabolism] Probab=97.71 E-value=0.00025 Score=45.84 Aligned_cols=38 Identities=29% Similarity=0.436 Sum_probs=29.8 Q ss_pred CCCCEEEEEECCHHHHHHHHHHHHCC------CEEEEEEEECCC Q ss_conf 59948999809656999999999749------839999976778 Q gi|254781031|r 7 KAPSALLLFSGGQDSSTCLSWALDRF------DRVETLSFDYGQ 44 (240) Q Consensus 7 ~~~~av~l~SGGlDS~~~l~~l~~~~------~~v~~i~~~ygq 44 (240) ..+++|+=+|||+||++++.++.+.- ..+.++..-|+. T Consensus 24 ~~k~~VlGiSGGiDSa~~~~La~~A~~~~~~~~~~~av~mP~~~ 67 (268) T COG0171 24 GFKGVVLGLSGGIDSALVLALAVRALGKGDSKENVLAVRLPYGY 67 (268) T ss_pred CCCCEEEECCCCHHHHHHHHHHHHHHCCCCCHHHEEEEECCCCC T ss_conf 99986997666819999999999985656512432668678877 No 68 >PRK13795 hypothetical protein; Provisional Probab=97.68 E-value=0.00015 Score=47.34 Aligned_cols=50 Identities=22% Similarity=0.248 Sum_probs=38.5 Q ss_pred CCCEEEEEECCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCCHHHHHHHH Q ss_conf 99489998096569999999997498399999767788777279999999 Q gi|254781031|r 8 APSALLLFSGGQDSSTCLSWALDRFDRVETLSFDYGQRNKVELECRLCVR 57 (240) Q Consensus 8 ~~~av~l~SGGlDS~~~l~~l~~~~~~v~~i~~~ygq~~~~~~~~a~~~~ 57 (240) ...+.|+||||-||+++|.++.+.+.++..++.|.|---+-+.+....++ T Consensus 245 ~~pv~VsfSGGKDS~v~L~La~ka~~~~~~~f~dTglEfPeT~e~v~~~~ 294 (630) T PRK13795 245 NLPVVVSFSGGKDSLVVLDLAAEALRKFKAFFNNTGLEFPETVENVKEVA 294 (630) T ss_pred CCCEEEECCCCHHHHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHHHHHH T ss_conf 89759952686899999999998549828999507767646999999999 No 69 >cd01996 Alpha_ANH_like_III This is a subfamily of Adenine nucleotide alpha hydrolases superfamily.Adeninosine nucleotide alpha hydrolases superfamily includes N type ATP PPases and ATP sulphurylases. It forms a apha/beta/apha fold which binds to Adenosine group. This subfamily of proteins is predicted to bind ATP. This domain has a strongly conserved motif SGGKD at the N terminus. Probab=97.68 E-value=8.7e-05 Score=48.72 Aligned_cols=114 Identities=18% Similarity=0.213 Sum_probs=60.8 Q ss_pred CEEEEEECCHHHHHHHHHHHHC-CCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHC-CCCCEEEEC-HHHCCCC Q ss_conf 4899980965699999999974-983999997677887772799999999999998887520-356158750-1103224 Q gi|254781031|r 10 SALLLFSGGQDSSTCLSWALDR-FDRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDS-LGEDHILPL-AILGDIS 86 (240) Q Consensus 10 ~av~l~SGGlDS~~~l~~l~~~-~~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~-~~~~~~~~~-~~~~~~~ 86 (240) .||+.+|||.||+-.|+.+++. |.++.+||++-+...+... ++...+... +...+.... ....... T Consensus 3 DcIVpvSGGKDS~y~~~~lk~kyglnpL~Vt~~~~~~t~~g~-----------~Nl~nl~~~g~D~~~~~~~~~~~~~l~ 71 (154) T cd01996 3 DCIIGVSGGKDSSYALYLLKEKYGLNPLAVTVDNGFNSEEAV-----------KNIKNLIKKGLDLDHLVINPEEMKDLQ 71 (154) T ss_pred CEEEECCCCHHHHHHHHHHHHHHCCCEEEEEECCCCCCHHHH-----------HHHHHHHHCCCCEEEEECCHHHHHHHH T ss_conf 789968888189999999999949965999967987546899-----------999999974998169932888999999 Q ss_pred CCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCC Q ss_conf 4433222011000123310111111122333777777433993651145444434555 Q gi|254781031|r 87 HSSLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSGYP 144 (240) Q Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~ 144 (240) ....... .....++.- ......+.-.|...++..|..|.........+ T Consensus 72 --------~~~~~~~-~gd~~~~~~-~~i~~~~~~~A~k~~IplIi~Gen~~~e~gg~ 119 (154) T cd01996 72 --------LARFKAK-VGDPCWPCD-TAIFTSLYKVALKFGIPLIITGENPAQEFGGI 119 (154) T ss_pred --------HHHHHHC-CCCCHHHHH-HHHHHHHHHHHHHCCCCEEEECCCCCCCCCCC T ss_conf --------9999844-799348899-87998999999993999899727835514784 No 70 >PRK00768 nadE NAD synthetase; Reviewed Probab=97.66 E-value=0.00029 Score=45.52 Aligned_cols=24 Identities=25% Similarity=0.329 Sum_probs=20.6 Q ss_pred EEEECCCCHHHHHHHHHHHCCCHH Q ss_conf 753138988999999998282000 Q gi|254781031|r 169 HTPLMWLKKYETWKLAQDIGGQDL 192 (240) Q Consensus 169 ~~P~~~~tK~eI~~~~~~~~~~~~ 192 (240) ++|...+.|.||.++++.+|.+.. T Consensus 181 i~PI~dL~KteV~~lA~~LgvP~~ 204 (274) T PRK00768 181 LLPLFGLNKRQGRALLAALGAPEH 204 (274) T ss_pred HHHHCCCCHHHHHHHHHHHCCCHH T ss_conf 376325619999999999597999 No 71 >pfam01902 ATP_bind_4 ATP-binding region. This family of proteins probably binds ATP. This domain is about 200 amino acids long with a strongly conserved motif SGGKD at the N terminus.In some members of this family, this domain is associated with pfam01042. Probab=97.59 E-value=0.0044 Score=38.18 Aligned_cols=128 Identities=18% Similarity=0.148 Sum_probs=65.7 Q ss_pred CEEEEEECCHHHHHHHHHHHHCCCEEEEEEEEC--CCC---CCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECHHHCC Q ss_conf 489998096569999999997498399999767--788---777279999999999999888752035615875011032 Q gi|254781031|r 10 SALLLFSGGQDSSTCLSWALDRFDRVETLSFDY--GQR---NKVELECRLCVRKKIVELMPKWKDSLGEDHILPLAILGD 84 (240) Q Consensus 10 ~av~l~SGGlDS~~~l~~l~~~~~~v~~i~~~y--gq~---~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 84 (240) |.++|||||.||+..+|.+++...-+.++++.. ..+ |....+....++ .. -..+......+ T Consensus 2 K~~~l~SGGKDS~~Al~~a~~~~~v~~L~t~~~~~~ds~~~H~~~~~l~~~qA--------~a--lgiPl~~~~~~---- 67 (219) T pfam01902 2 KVAALYSGGKDSNYALYWALKEIEVPYLVSMKSENKESYMFHEPNLHLTKLLA--------EA--LGIPIIKLYTK---- 67 (219) T ss_pred CEEEEECCCHHHHHHHHHHHHCCCCEEEEEEEECCCCCCCCCCCCHHHHHHHH--------HH--CCCCEEEEECC---- T ss_conf 39999728699999999998719837999996379981026157889999999--------97--59968999679---- Q ss_pred CCCCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCC Q ss_conf 24443322201100012331011111112233377777743399365114544443455544302477777677530643 Q gi|254781031|r 85 ISHSSLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSGYPDCRHDTIRAIETAINLGMES 164 (240) Q Consensus 85 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 164 (240) .... ... ..........+++.+..|....+. +..+... ..... T Consensus 68 -------~~~e---------~~~---------~~L~~~l~~~~i~~vv~GdI~s~~------qr~~~e~------~c~~l 110 (219) T pfam01902 68 -------GEEE---------KEV---------EDLAGFLESLDVDALVAGAIYSEY------QKSRIES------VCREL 110 (219) T ss_pred -------CCHH---------HHH---------HHHHHHHHHCCCCEEEECCCCCHH------HHHHHHH------HHHHC T ss_conf -------8617---------999---------999999987598699998603688------9999999------99972 Q ss_pred CEEEEEEECCCCHHHHHHHHHHHC Q ss_conf 216775313898899999999828 Q gi|254781031|r 165 HVTVHTPLMWLKKYETWKLAQDIG 188 (240) Q Consensus 165 ~~~i~~P~~~~tK~eI~~~~~~~~ 188 (240) +...+.|++..+-.++++...+.| T Consensus 111 gl~~~~PLW~~d~~~ll~e~i~~G 134 (219) T pfam01902 111 GLKPFAPLWGRDPRKLAEEIVREG 134 (219) T ss_pred CCEEEECCCCCCHHHHHHHHHHCC T ss_conf 988971034899999999999879 No 72 >TIGR01536 asn_synth_AEB asparagine synthase (glutamine-hydrolyzing); InterPro: IPR006426 These sequences represent glutamine-hydrolysing asparagine synthase. The group have a poorly conserved C-terminal extension while bacterial members of the family tend to have a long, poorly conserved insert lacking from archaeal and eukaryotic sequences. Multiple isozymes have been demonstrated, such as in Bacillus subtilis .; GO: 0004066 asparagine synthase (glutamine-hydrolyzing) activity, 0006529 asparagine biosynthetic process. Probab=97.55 E-value=0.00029 Score=45.47 Aligned_cols=126 Identities=20% Similarity=0.201 Sum_probs=67.3 Q ss_pred HHCCCCCEEEEEECCHHHHHHHHHHHHCC--------CEEEEEEEECC--CCCCCCHH--------HHHHHHHHHHHHHH Q ss_conf 21359948999809656999999999749--------83999997677--88777279--------99999999999988 Q gi|254781031|r 4 IIKKAPSALLLFSGGQDSSTCLSWALDRF--------DRVETLSFDYG--QRNKVELE--------CRLCVRKKIVELMP 65 (240) Q Consensus 4 ~~~~~~~av~l~SGGlDS~~~l~~l~~~~--------~~v~~i~~~yg--q~~~~~~~--------~a~~~~~~~~~~~~ 65 (240) ||+..+-. ++|||||||+.++.++.+.- ..|+..++.+. +..-.+.+ +|..+++.+... T Consensus 308 l~aDvpvG-~~LSGGlDSS~vaa~a~~~~~~~~~~W~~~~~tFsigf~~~~~~~~Es~l~P~~D~P~A~~vA~~~G~~-- 384 (646) T TIGR01536 308 LVADVPVG-VLLSGGLDSSLVAAIARREAPSEGVDWKGPVKTFSIGFEGDSKDFDESKLRPDDDQPYARKVADELGTE-- 384 (646) T ss_pred CCCCCCEE-EEEECHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCEECCCCCCCCHHHHHHHHHHHCCC-- T ss_conf 20056068-887171679999999999767643311774347886517787653002578998618999999984983-- Q ss_pred HHHHCCCCCEEEECHHHCCCCCCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHC-CCCEEECCCCCCC-CCCC Q ss_conf 8752035615875011032244433222011000123310111111122333777777433-9936511454444-3455 Q gi|254781031|r 66 KWKDSLGEDHILPLAILGDISHSSLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRL-GITNIVIGVCETD-YSGY 143 (240) Q Consensus 66 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~-g~~~i~~~~~~~~-~~~~ 143 (240) ++.+.++.-.. .............+.....+..+.+....-.+... |...+..|-.++. +.+| T Consensus 385 --------h~~~~~s~~e~-------~~~l~~v~~~~e~y~p~~~~~~~P~Yl~s~~ar~~Tg~kV~LSGeGaDElFgGY 449 (646) T TIGR01536 385 --------HHEVLISEEEV-------LKALPEVIYHLETYDPTAIRASIPLYLLSKLAREDTGVKVVLSGEGADELFGGY 449 (646) T ss_pred --------CEEEEECHHHH-------HHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEECCCCHHHHHCCH T ss_conf --------24898458999-------999888765424885430311235899999999628838997486635553042 Q ss_pred CCCC Q ss_conf 5443 Q gi|254781031|r 144 PDCR 147 (240) Q Consensus 144 ~~~~ 147 (240) .-.+ T Consensus 450 ~~f~ 453 (646) T TIGR01536 450 EYFR 453 (646) T ss_pred HHHC T ss_conf 4421 No 73 >PTZ00323 NAD+ synthase; Provisional Probab=97.38 E-value=0.0011 Score=41.83 Aligned_cols=26 Identities=23% Similarity=0.266 Sum_probs=21.3 Q ss_pred EEEEEEECCCCHHHHHHHHHHHCCCH Q ss_conf 16775313898899999999828200 Q gi|254781031|r 166 VTVHTPLMWLKKYETWKLAQDIGGQD 191 (240) Q Consensus 166 ~~i~~P~~~~tK~eI~~~~~~~~~~~ 191 (240) .-=++|...++|.+|.++++.+|.+. T Consensus 187 agd~apI~DL~Kt~V~~Lar~LgiPe 212 (294) T PTZ00323 187 VVDVQLISDLHKSEVFLVAEELGVPE 212 (294) T ss_pred CCCCHHHCCCCHHHHHHHHHHCCCCH T ss_conf 54703304683999999999809989 No 74 >PTZ00077 asparagine synthetase; Provisional Probab=97.33 E-value=0.0012 Score=41.62 Aligned_cols=26 Identities=31% Similarity=0.466 Sum_probs=19.1 Q ss_pred HHCCCCCEEEEEECCHHHHHHHHHHHH Q ss_conf 213599489998096569999999997 Q gi|254781031|r 4 IIKKAPSALLLFSGGQDSSTCLSWALD 30 (240) Q Consensus 4 ~~~~~~~av~l~SGGlDS~~~l~~l~~ 30 (240) ||+..+-. ++|||||||++++.++.+ T Consensus 234 L~sDVPvG-~lLSGGLDSSlIaAia~k 259 (610) T PTZ00077 234 LMGDVPFG-ILLSGGLDSSIIAAILAK 259 (610) T ss_pred HHCCCCEE-EEECCCCHHHHHHHHHHH T ss_conf 22567236-663477058999999998 No 75 >TIGR03108 eps_aminotran_1 exosortase 1 system-associated amidotransferase 1. The predicted protein-sorting transpeptidase that we call exosortase (see TIGR02602) has distinct subclasses that associated with different types of exopolysaccharide production loci. This model represents a distinct clade among a set of amidotransferases largely annotated (not necessarily accurately) as glutatime-hydrolyzing asparagine synthases. Members of this clade are essentially restricted to the characteristic exopolysaccharide (EPS) regions that contain the exosortase 1 genome (xrtA), in genomes that also have numbers of PEP-CTERM domain (TIGR02595) proteins. Probab=97.28 E-value=0.0008 Score=42.76 Aligned_cols=50 Identities=22% Similarity=0.169 Sum_probs=31.8 Q ss_pred CEEEEEECCHHHHHHHHHHHHCC-CEEEEEEEECCCCCCCCHHHHHHHHHH Q ss_conf 48999809656999999999749-839999976778877727999999999 Q gi|254781031|r 10 SALLLFSGGQDSSTCLSWALDRF-DRVETLSFDYGQRNKVELECRLCVRKK 59 (240) Q Consensus 10 ~av~l~SGGlDS~~~l~~l~~~~-~~v~~i~~~ygq~~~~~~~~a~~~~~~ 59 (240) ..=+++|||+||++++.++.+.. ..+.+.|+.+......+...+..+++. T Consensus 260 pvG~~LSGGlDSS~I~a~~~~~~~~~i~tfsi~f~~~~~DE~~~A~~vA~~ 310 (628) T TIGR03108 260 PLGAFLSGGVDSSAVVALMAGLSDTPVNTCSIAFDDPAFDESAYARQVAER 310 (628) T ss_pred CEEEEECCCCCHHHHHHHHHHHCCCCCCEEEEECCCCCCCHHHHHHHHHHH T ss_conf 568764688636899999998429986436653488875436999999986 No 76 >COG1365 Predicted ATPase (PP-loop superfamily) [General function prediction only] Probab=97.19 E-value=0.0015 Score=41.01 Aligned_cols=147 Identities=19% Similarity=0.072 Sum_probs=71.9 Q ss_pred CCEEEEEECCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECHHHCCCCCC Q ss_conf 94899980965699999999974983999997677887772799999999999998887520356158750110322444 Q gi|254781031|r 9 PSALLLFSGGQDSSTCLSWALDRFDRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDHILPLAILGDISHS 88 (240) Q Consensus 9 ~~av~l~SGGlDS~~~l~~l~~~~~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 88 (240) .|++|.+|||.||++.+..|..-+.+|.+.+.-. +.......... ...++........+++....+ T Consensus 61 ~kiaVA~SGG~DSsas~iilR~~g~~v~p~t~~L----p~~ir~n~~~l----------~~~lg~~p~yveedl~~i~kG 126 (255) T COG1365 61 PKIAVAYSGGVDSSASAIILRWAGFTVDPGTAIL----PDHIRRNKEEL----------ETLLGEVPEYVEEDLEDIEKG 126 (255) T ss_pred CEEEEEECCCCCHHHHHHHHHHHCEEECCCCCCC----CHHHHHHHHHH----------HHHHCCCHHHHHHHHHHHHHH T ss_conf 4389984278642788999886210414632138----77886789999----------899725777899899999863 Q ss_pred CCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCC--CCCCCCHHHHHHHHHHHHHCCCCCE Q ss_conf 33222011000123310111111122333777777433993651145444434--5554430247777767753064321 Q gi|254781031|r 89 SLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYS--GYPDCRHDTIRAIETAINLGMESHV 166 (240) Q Consensus 89 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~ 166 (240) .... + +....-.+.+..+.....+-..+++.+..|..-..+. .++ -..-+ T Consensus 127 alnG--------R---fhpCGRCh~~I~~~V~~k~re~di~~vafGDlLs~G~~svy~-----------------eD~i~ 178 (255) T COG1365 127 ALNG--------R---FHPCGRCHSMIENAVMDKARELDIDVVAFGDLLSTGYGSVYR-----------------EDGIF 178 (255) T ss_pred HCCC--------C---CCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCEEC-----------------CCCEE T ss_conf 2058--------7---787523789999999999985387089974632346632120-----------------58879 Q ss_pred EEEEEE-CCCCHHHHHHHHHHHCCCHHCCCCHHHCCCC Q ss_conf 677531-3898899999999828200014325537216 Q gi|254781031|r 167 TVHTPL-MWLKKYETWKLAQDIGGQDLVNLILEESHTC 203 (240) Q Consensus 167 ~i~~P~-~~~tK~eI~~~~~~~~~~~~~~~~~~~T~SC 203 (240) ++..|. +.+||.|........+ .+ +...+-| T Consensus 179 rlnlPAflAltK~Elr~il~~~~----~e--~~~kygC 210 (255) T COG1365 179 RLNLPAFLALTKDELRSILKWNG----YE--LEMKYGC 210 (255) T ss_pred EECCHHHHHHCCHHHHHHHHHCC----CC--CHHCCCC T ss_conf 97258888517199999987138----43--0011588 No 77 >TIGR00289 TIGR00289 conserved hypothetical protein TIGR00289; InterPro: IPR005237 This family of conserved hypothetical proteins has no known function. Homologous proteins related to MJ0570 of Methanococcus jannaschii include both the apparent orthologs in the family, the much longer protein YLR143W from Saccharomyces cerevisiae (Baker's yeast), and second homologous proteins from Archaeoglobus fulgidus and Pyrococcus horikoshii that appear to represent a second orthologous group.. Probab=97.16 E-value=0.0005 Score=44.03 Aligned_cols=31 Identities=32% Similarity=0.299 Sum_probs=24.9 Q ss_pred EEEEEECCHHHHHHHHHHHHCCCEE-EEEEEE Q ss_conf 8999809656999999999749839-999976 Q gi|254781031|r 11 ALLLFSGGQDSSTCLSWALDRFDRV-ETLSFD 41 (240) Q Consensus 11 av~l~SGGlDS~~~l~~l~~~~~~v-~~i~~~ 41 (240) +-+|+|||-||..+||+++++|.+| ++|.+- T Consensus 3 vAvLySGGKDS~~ALy~al~eG~eV~~LV~v~ 34 (227) T TIGR00289 3 VAVLYSGGKDSILALYKALEEGFEVKYLVGVI 34 (227) T ss_pred EEEEECCCHHHHHHHHHHHHCCCCEEEEEEEE T ss_conf 68985187136899999986288547851000 No 78 >PRK02628 nadE NAD synthetase; Reviewed Probab=97.12 E-value=0.0037 Score=38.63 Aligned_cols=34 Identities=21% Similarity=0.221 Sum_probs=22.8 Q ss_pred CCCEEEEEECCHHHHHH---HHHH---HHCC-CEEEEEEEE Q ss_conf 99489998096569999---9999---9749-839999976 Q gi|254781031|r 8 APSALLLFSGGQDSSTC---LSWA---LDRF-DRVETLSFD 41 (240) Q Consensus 8 ~~~av~l~SGGlDS~~~---l~~l---~~~~-~~v~~i~~~ 41 (240) .+++|+-+|||+||+.. +..+ +..+ ..|++|++= T Consensus 362 ~~~vViGLSGGiDSaLaLLVaa~A~d~Lg~~r~~V~~vtMP 402 (678) T PRK02628 362 IKKVVIGISGGLDSTLALLVAAKAFDRLGLPRKNILAYTMP 402 (678) T ss_pred CCEEEEECCCCHHHHHHHHHHHHHHHHHCCCHHCEEEEECC T ss_conf 98189967776447999999999999848971224899778 No 79 >COG2102 Predicted ATPases of PP-loop superfamily [General function prediction only] Probab=97.09 E-value=0.00079 Score=42.78 Aligned_cols=128 Identities=21% Similarity=0.161 Sum_probs=66.3 Q ss_pred CEEEEEECCHHHHHHHHHHHHCCCEEEEEEEECCCC------CCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECHHHC Q ss_conf 489998096569999999997498399999767788------77727999999999999988875203561587501103 Q gi|254781031|r 10 SALLLFSGGQDSSTCLSWALDRFDRVETLSFDYGQR------NKVELECRLCVRKKIVELMPKWKDSLGEDHILPLAILG 83 (240) Q Consensus 10 ~av~l~SGGlDS~~~l~~l~~~~~~v~~i~~~ygq~------~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 83 (240) |.++|+|||.||+-.+|++++.|.+|..+-.-.-.+ |....+.+..+++.+. .+......+. T Consensus 2 k~~aL~SGGKDS~~Al~~a~~~G~eV~~Ll~~~p~~~dS~m~H~~n~~~~~~~Ae~~g----------i~l~~~~~~g-- 69 (223) T COG2102 2 KVIALYSGGKDSFYALYLALEEGHEVVYLLTVKPENGDSYMFHTPNLELAELQAEAMG----------IPLVTFDTSG-- 69 (223) T ss_pred CEEEEEECCCHHHHHHHHHHHCCCEEEEEEEEECCCCCEEEEECCCHHHHHHHHHHCC----------CCEEEEECCC-- T ss_conf 6899981671889999999975983689999815998704422225679999987448----------8359983476-- Q ss_pred CCCCCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHCCC Q ss_conf 22444332220110001233101111111223337777774339936511454444345554430247777767753064 Q gi|254781031|r 84 DISHSSLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSGYPDCRHDTIRAIETAINLGME 163 (240) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 163 (240) ..+... .........++.+.+..|........ .. +.+ .... T Consensus 70 ---------~~e~ev------------------e~L~~~l~~l~~d~iv~GaI~s~yqk------~r---ve~---lc~~ 110 (223) T COG2102 70 ---------EEEREV------------------EELKEALRRLKVDGIVAGAIASEYQK------ER---VER---LCEE 110 (223) T ss_pred ---------CCHHHH------------------HHHHHHHHHCCCCEEEECHHHHHHHH------HH---HHH---HHHH T ss_conf ---------305569------------------99999997376337997343309999------99---999---9997 Q ss_pred CCEEEEEEECCCCHHHHHHHHHHHC Q ss_conf 3216775313898899999999828 Q gi|254781031|r 164 SHVTVHTPLMWLKKYETWKLAQDIG 188 (240) Q Consensus 164 ~~~~i~~P~~~~tK~eI~~~~~~~~ 188 (240) .+.+.+.|+...+-.++++...+.| T Consensus 111 lGl~~~~PLWg~d~~ell~e~~~~G 135 (223) T COG2102 111 LGLKVYAPLWGRDPEELLEEMVEAG 135 (223) T ss_pred HCCEEEECCCCCCHHHHHHHHHHCC T ss_conf 2978851251789899999999739 No 80 >COG0367 AsnB Asparagine synthase (glutamine-hydrolyzing) [Amino acid transport and metabolism] Probab=97.03 E-value=0.0023 Score=39.92 Aligned_cols=54 Identities=19% Similarity=0.230 Sum_probs=34.1 Q ss_pred CCCEEEEEECCHHHHHHHHHHHHCCC-EEE-EEEEECCCCCCCCHHHHHHHHHHHH Q ss_conf 99489998096569999999997498-399-9997677887772799999999999 Q gi|254781031|r 8 APSALLLFSGGQDSSTCLSWALDRFD-RVE-TLSFDYGQRNKVELECRLCVRKKIV 61 (240) Q Consensus 8 ~~~av~l~SGGlDS~~~l~~l~~~~~-~v~-~i~~~ygq~~~~~~~~a~~~~~~~~ 61 (240) ...+-++||||+||++++..+.+..- ... ..++.+......+.+++..+++.+. T Consensus 230 dvpvg~~lSGGlDSS~Iaa~a~~~~~~~~~~~fsvg~~~~~~~D~~~a~~~A~~lg 285 (542) T COG0367 230 DVPVGVFLSGGLDSSLIAAIAAEELGKEGKTTFTVGFEDSDSPDAKYARAVAKFLG 285 (542) T ss_pred CCCEEEEECCCHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCHHHHHHHHHHHHC T ss_conf 88789996776229999999998623455215898548999707999999999859 No 81 >TIGR03104 trio_amidotrans asparagine synthase family amidotransferase. Members of this protein family are closely related to several isoforms of asparagine synthetase (glutamine amidotransferase) and typically have been given this name in genome annotation to date. Each is part of a conserved three-gene cassette sparsely distributed across at least twenty different species known so far, including alpha, beta, and gamma Proteobacteria, Mycobacterium, and Prosthecochloris, which is a member of the Chlorobi. The other two members of the cassette are a probable protease and a member of the GNAT family of acetyltransferases. Probab=97.02 E-value=0.0012 Score=41.71 Aligned_cols=33 Identities=30% Similarity=0.385 Sum_probs=24.3 Q ss_pred EEEEEECCHHHHHHHHHHHHCC-CEEEEEEEECC Q ss_conf 8999809656999999999749-83999997677 Q gi|254781031|r 11 ALLLFSGGQDSSTCLSWALDRF-DRVETLSFDYG 43 (240) Q Consensus 11 av~l~SGGlDS~~~l~~l~~~~-~~v~~i~~~yg 43 (240) .=+++|||+||++++..+.+.+ .++.+.++.|- T Consensus 263 vg~~LSGGlDSS~i~a~~~~~~~~~i~tfs~~f~ 296 (589) T TIGR03104 263 VGVLLSGGLDSSLIVGLLAEAGVDGLRTFSIGFE 296 (589) T ss_pred CCEECCCCCCHHHHHHHHHHHCCCCCCEEEEEEC T ss_conf 3000178754389999999956999852788724 No 82 >PRK09431 asnB asparagine synthetase B; Provisional Probab=96.71 E-value=0.0039 Score=38.46 Aligned_cols=26 Identities=35% Similarity=0.436 Sum_probs=19.0 Q ss_pred HCCCCCEEEEEECCHHHHHHHHHHHHC Q ss_conf 135994899980965699999999974 Q gi|254781031|r 5 IKKAPSALLLFSGGQDSSTCLSWALDR 31 (240) Q Consensus 5 ~~~~~~av~l~SGGlDS~~~l~~l~~~ 31 (240) ++..+-+ ++|||||||++++.++.+. T Consensus 225 ~sDvpvG-~~LSGGLDSSlIaala~k~ 250 (555) T PRK09431 225 MSDVPYG-VLLSGGLDSSLISAIAKKY 250 (555) T ss_pred CCCCCEE-EEECCCHHHHHHHHHHHHH T ss_conf 5688624-3205770679999999985 No 83 >cd01984 AANH_like Adenine nucleotide alpha hydrolases superfamily including N type ATP PPases, ATP sulphurylases Universal Stress Response protein and electron transfer flavoprotein (ETF). The domain forms a apha/beta/apha fold which binds to Adenosine nucleotide. Probab=96.65 E-value=0.0026 Score=39.58 Aligned_cols=31 Identities=35% Similarity=0.269 Sum_probs=25.2 Q ss_pred EEEEEECCHHHHHHHHHHHHC---CCEEEEEEEE Q ss_conf 899980965699999999974---9839999976 Q gi|254781031|r 11 ALLLFSGGQDSSTCLSWALDR---FDRVETLSFD 41 (240) Q Consensus 11 av~l~SGGlDS~~~l~~l~~~---~~~v~~i~~~ 41 (240) .++.+|||+||++++.++.+. +.+++.++++ T Consensus 1 ilv~~Sgg~dS~~~~~la~~~~~~~~~~~~~~~~ 34 (86) T cd01984 1 ILVALSGGLDSSVLLHLAKRLKSGGPEVVALVVV 34 (86) T ss_pred CEEEECCCCCHHHHHHHHHHHHCCCCEEEEEEHH T ss_conf 9788617853899999999852599808976389 No 84 >COG3969 Predicted phosphoadenosine phosphosulfate sulfotransferase [General function prediction only] Probab=96.52 E-value=0.0037 Score=38.62 Aligned_cols=40 Identities=28% Similarity=0.428 Sum_probs=30.4 Q ss_pred CCCCCEEEEEECCHHHHHHHHHHHH----CCC-EEEEEEEEC-CCC Q ss_conf 3599489998096569999999997----498-399999767-788 Q gi|254781031|r 6 KKAPSALLLFSGGQDSSTCLSWALD----RFD-RVETLSFDY-GQR 45 (240) Q Consensus 6 ~~~~~av~l~SGGlDS~~~l~~l~~----~~~-~v~~i~~~y-gq~ 45 (240) ..-+.+.|+||||.||.+.|.++++ .+. ++..+|+|+ ||. T Consensus 25 ~~f~~VcVSFSGGKDS~lmLhL~~~~ar~~~~~~i~VlfiD~E~QY 70 (407) T COG3969 25 NTFPRVCVSFSGGKDSGLMLHLVAEVARENGRDKISVLFIDWEAQY 70 (407) T ss_pred HCCCEEEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEEEECCHHHH T ss_conf 4388389992378740589999999999819886579997004555 No 85 >PRK02090 phosphoadenosine phosphosulfate reductase; Provisional Probab=96.47 E-value=0.036 Score=32.52 Aligned_cols=155 Identities=12% Similarity=0.029 Sum_probs=76.5 Q ss_pred CCCCEEEEEECCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECHHHCCCC Q ss_conf 59948999809656999999999749839999976778877727999999999999988875203561587501103224 Q gi|254781031|r 7 KAPSALLLFSGGQDSSTCLSWALDRFDRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDHILPLAILGDIS 86 (240) Q Consensus 7 ~~~~av~l~SGGlDS~~~l~~l~~~~~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 86 (240) ..++.++.-|-|.+|.+++.++.+-+.++..+++|.|.-..-+.+.+..+.+.+. ...+++ ........ T Consensus 40 ~~~~~~~tsSFG~es~Vllhli~~~~~~ipV~flDTG~~f~ETy~~~d~l~~~~~----------l~i~~~-~P~~~~~~ 108 (243) T PRK02090 40 FGGGLALASSFGAESAVLLHLVAQVDPDIPVIFLDTGYLFPETYRFIDQLTERLL----------LNLKVY-RPDASAAE 108 (243) T ss_pred CCCCEEEEECCCHHHHHHHHHHHHHCCCCCEEEEECCCCCHHHHHHHHHHHHHHC----------CCEEEE-CCCCHHHH T ss_conf 5997699945867899999999941999967986579888899999999999979----------977997-78814999 Q ss_pred CCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCE Q ss_conf 44332220110001233101111111223337777774339936511454444345554430247777767753064321 Q gi|254781031|r 87 HSSLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSGYPDCRHDTIRAIETAINLGMESHV 166 (240) Q Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 166 (240) ........ ..............|....+..+.. +.+.-..|. +.+.+.++...+.+- ...+. T Consensus 109 ~~~~~g~l--~~~~~~d~~~CC~irKvePL~raL~-----~~daWitG~-Rr~Qs~~Ra~l~~ve----------~d~g~ 170 (243) T PRK02090 109 QEARYGGL--WEQSVEDPDECCRIRKVEPLNRALA-----GLDAWITGL-RREQSGTRAELPVLE----------IDRGR 170 (243) T ss_pred HHHHHCCC--CCCCCCCHHHHHHHHCCHHHHHHHH-----HCCEEEECC-CHHHCHHHCCCCCEE----------ECCCE T ss_conf 99982997--6676324898767765027999860-----177587447-144263342587345----------23998 Q ss_pred EEEEEECCCCHHHHHHHHHHHCCC Q ss_conf 677531389889999999982820 Q gi|254781031|r 167 TVHTPLMWLKKYETWKLAQDIGGQ 190 (240) Q Consensus 167 ~i~~P~~~~tK~eI~~~~~~~~~~ 190 (240) --++|++++|+.||+....+...+ T Consensus 171 ~KvnPL~~Ws~~dv~~Yi~~~~LP 194 (243) T PRK02090 171 FKINPLADWTNEDVWAYLKEHDLP 194 (243) T ss_pred EEECCCHHCCHHHHHHHHHHCCCC T ss_conf 886700119999999999985999 No 86 >KOG0573 consensus Probab=96.20 E-value=0.014 Score=35.05 Aligned_cols=60 Identities=25% Similarity=0.256 Sum_probs=34.1 Q ss_pred CEEEEEECCHHHHHHHHHHHH---CC--CEEEEEEEECCCC------CCCCHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 489998096569999999997---49--8399999767788------77727999999999999988875203 Q gi|254781031|r 10 SALLLFSGGQDSSTCLSWALD---RF--DRVETLSFDYGQR------NKVELECRLCVRKKIVELMPKWKDSL 71 (240) Q Consensus 10 ~av~l~SGGlDS~~~l~~l~~---~~--~~v~~i~~~ygq~------~~~~~~~a~~~~~~~~~~~~~~~~~~ 71 (240) +.-++||||+||+++|.++-. .+ .+++-|.| |.. .-...+.+..-.+.+....+....++ T Consensus 252 ~VcVlfSGGvDs~vvA~l~h~~vp~ne~IdLINVaF--~n~e~~~~~~~PDRktgr~g~~eL~s~~P~R~~nl 322 (520) T KOG0573 252 NVCVLFSGGVDSTVVAVLAHYVVPENEPIDLINVAF--GNPEGSKEQNVPDRKTGRRGLEELQSLYPKRSWNL 322 (520) T ss_pred CEEEEECCCCHHHHHHHHHHHHCCCCCCEEEEEEEC--CCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEE T ss_conf 679996488408999999986069888636898602--58776665579607778888999987587633789 No 87 >KOG0571 consensus Probab=95.95 E-value=0.01 Score=35.93 Aligned_cols=53 Identities=25% Similarity=0.303 Sum_probs=29.0 Q ss_pred HCCCCCEEEEEECCHHHHHHHHHHHHC--------CCEEEEEEEECCCCCCCCHHHHHHHHHHH Q ss_conf 135994899980965699999999974--------98399999767788777279999999999 Q gi|254781031|r 5 IKKAPSALLLFSGGQDSSTCLSWALDR--------FDRVETLSFDYGQRNKVELECRLCVRKKI 60 (240) Q Consensus 5 ~~~~~~av~l~SGGlDS~~~l~~l~~~--------~~~v~~i~~~ygq~~~~~~~~a~~~~~~~ 60 (240) |...+-- +|+||||||+.+|..+.+. |..++...+.. .......++..+++.+ T Consensus 223 M~d~p~G-vLLSGGLDSSLvAsia~R~lk~~~~~~~~~lhsFaIGl--e~SPDL~aarkVAd~i 283 (543) T KOG0571 223 MTDVPFG-VLLSGGLDSSLVASIAARELKKAQAARGSKLHSFAIGL--EDSPDLLAARKVADFI 283 (543) T ss_pred HCCCCEE-EEEECCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEECC--CCCHHHHHHHHHHHHH T ss_conf 3168615-89507731899999999999876641278754788528--9985378889999874 No 88 >KOG2594 consensus Probab=95.12 E-value=0.13 Score=29.11 Aligned_cols=173 Identities=17% Similarity=0.069 Sum_probs=78.1 Q ss_pred HHHCCCCCEEEEEECCHHHHHHHHHH---HHC--------CC-EEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHH- Q ss_conf 22135994899980965699999999---974--------98-399999767788777279999999999999888752- Q gi|254781031|r 3 DIIKKAPSALLLFSGGQDSSTCLSWA---LDR--------FD-RVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKD- 69 (240) Q Consensus 3 ~~~~~~~~av~l~SGGlDS~~~l~~l---~~~--------~~-~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~- 69 (240) .|..+.+.++++||||.-|+++|-++ +++ +. .++++++..+... ..+.+++.+...+... T Consensus 57 ~~~~~sk~vLlv~~G~ssSlvlLd~vh~~l~q~k~~~~~~~~tv~v~~~~~~~~~~-------~~v~e~lq~l~~~~~~~ 129 (396) T KOG2594 57 KMGTPSKNVLLVFDGGSSSLVLLDFVHLALKQLKNKRLRRDFTVLVLVVFQEFTDS-------TAVFEALQELIIDNIEW 129 (396) T ss_pred HCCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEEECCCH-------HHHHHHHHHHHHHHCCC T ss_conf 30885534699951884047699999999998666532768854899998852511-------88899999999840555 Q ss_pred --CCCCCEEEECHHHC-CCCCCCCCCCHHHHHCCCCCC-HHH------HHHHHHHHHHHHHHHHHHCCCCEEECCCCCCC Q ss_conf --03561587501103-224443322201100012331-011------11111223337777774339936511454444 Q gi|254781031|r 70 --SLGEDHILPLAILG-DISHSSLTKNVAMKIQDNNLP-NTF------VPGRNIIFLVFAATLAYRLGITNIVIGVCETD 139 (240) Q Consensus 70 --~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~-~~~------~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~ 139 (240) +.+...-.+..+.. .+...... +.+......... ... -.....+.......+|..+|...+..|.+..+ T Consensus 130 ~~~~V~~la~~~~~~~~~~~~~~~~-n~El~ak~~kl~~~~sv~~~srqDLl~~lk~kll~~vA~~~g~~~i~~g~~~t~ 208 (396) T KOG2594 130 VRYVVSCLAPPEKDNHVVPVESING-NDELIAKDRKLKLSDSVPDDSRQDLLLHLKMKLLQKVAAENGYNRIVLGDSTTD 208 (396) T ss_pred CCEEEEECCCHHHHCCCCCCCCCCC-CCEEECCHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCEEEECCCHHH T ss_conf 4268873373576457561100378-700113243304644566401789999999999999999859987983673527 Q ss_pred CCCCC--------CCCHHHHHHHHHHHHHCCCCCEEEEEEECCCCHHHHHHHHHH Q ss_conf 34555--------443024777776775306432167753138988999999998 Q gi|254781031|r 140 YSGYP--------DCRHDTIRAIETAINLGMESHVTVHTPLMWLKKYETWKLAQD 186 (240) Q Consensus 140 ~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~i~~P~~~~tK~eI~~~~~~ 186 (240) .+..- ...-.++- ...+.-...++++.+|+..+++.||-..... T Consensus 209 la~~vlt~v~~GRG~sis~~v---~~~d~r~~~d~~llrPLrDl~~~Ei~~y~~l 260 (396) T KOG2594 209 LASHVLTAVVKGRGGSISTDV---QVVDKRPKGDVKLLRPLRDLLSLEITSYCLL 260 (396) T ss_pred HHHHHHHHHHHCCCCCCEEHH---HHHCCCCCCCCEEEHHHHHHHHHHHHHHHHH T ss_conf 999999999836676510054---4533465787200002778889999999976 No 89 >TIGR00552 nadE NAD+ synthetase; InterPro: IPR003694 NAD+ synthase (6.3.5.1 from EC) catalyzes the last step in the biosynthesis of nicotinamide adenine dinucleotide and is induced by stress factors such as heat shock and glucose limitation. The three-dimensional structure of NH3-dependent NAD+ synthetase from Bacillus subtilis, in its free form and in complex with ATP shows that the enzyme consists of a tight homodimer with alpha/beta subunit topology .; GO: 0003952 NAD+ synthase (glutamine-hydrolyzing) activity, 0005524 ATP binding, 0009435 NAD biosynthetic process. Probab=95.02 E-value=0.11 Score=29.55 Aligned_cols=153 Identities=16% Similarity=0.120 Sum_probs=72.2 Q ss_pred CCCCEEEEEECCHHH--------HHHHHHHHHCCC--EEEEEEEECCCC-CCCCHHHHHHHHHHHHHHHHHHHHCCCCCE Q ss_conf 599489998096569--------999999997498--399999767788-777279999999999999888752035615 Q gi|254781031|r 7 KAPSALLLFSGGQDS--------STCLSWALDRFD--RVETLSFDYGQR-NKVELECRLCVRKKIVELMPKWKDSLGEDH 75 (240) Q Consensus 7 ~~~~av~l~SGGlDS--------~~~l~~l~~~~~--~v~~i~~~ygq~-~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~ 75 (240) ..+.+|+-+|||+|| ++++++..+... +++++-+-++.. .+...+.+...++.+. .... T Consensus 25 ~~~g~v~GlsGG~ds~l~g~l~~~~v~~~~~~~~~~~~~~~~~~p~~~~~~~~d~~~a~~~~~~~~----------~~~~ 94 (286) T TIGR00552 25 GAKGVVLGLSGGVDSTLAGKLCQAVVAALCVEALGKEQNHALLLPHSNTTPEDDVQDALALAEPLG----------INYK 94 (286) T ss_pred CCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCHHHHEEEEECCCCCCCHHHHHHHHHHHHHCC----------CCCE T ss_conf 765046730476203567789999999999850461142033214667884121799999887405----------4421 Q ss_pred EEECHHHCCCCCCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCHHHHHHHH Q ss_conf 87501103224443322201100012331011111112233377777743399365114544443455544302477777 Q gi|254781031|r 76 ILPLAILGDISHSSLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSGYPDCRHDTIRAIE 155 (240) Q Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 155 (240) .+.....-........ ....... ..........+.....+.+....+.+..|..... ...-.+... T Consensus 95 ~~~~~~~~~~~~~~~~------~~~~~~~-~~~~gn~~~r~r~~~~y~~a~~~~~lv~gt~n~~-----e~~~Gy~t~-- 160 (286) T TIGR00552 95 TIDIAPIAASFQAQTE------TGDPLAD-FLAEGNLKARLRMALLYAHANKHNLLVLGTGNKS-----ELLLGYFTK-- 160 (286) T ss_pred ECCCHHHHHHHHHHHC------CCCCCCC-HHHHHHHHHHHHHHHHHHHHHCCCCEEEECCCHH-----HHHHHHHHH-- T ss_conf 1130467788776420------3443331-0233222478899999887310163564046134-----454320010-- Q ss_pred HHHHHCCCCCEEEEEEECCCCHHHHHHHHHHHCC Q ss_conf 6775306432167753138988999999998282 Q gi|254781031|r 156 TAINLGMESHVTVHTPLMWLKKYETWKLAQDIGG 189 (240) Q Consensus 156 ~~~~~~~~~~~~i~~P~~~~tK~eI~~~~~~~~~ 189 (240) .+..-.=.+|..++.|.++.+++..++. T Consensus 161 ------~Gdg~~d~~p~~~l~k~~~~~l~~~l~~ 188 (286) T TIGR00552 161 ------YGDGGCDILPLGGLFKTEVYELAKRLGV 188 (286) T ss_pred ------CCCCCHHHHHHHHHHHHHHHHHHHHHCC T ss_conf ------1442000345532237889999998175 No 90 >KOG2316 consensus Probab=90.61 E-value=0.45 Score=25.71 Aligned_cols=32 Identities=22% Similarity=0.224 Sum_probs=27.7 Q ss_pred CEEEEEECCHHHHHHHHHHHHCCCEEEEEEEE Q ss_conf 48999809656999999999749839999976 Q gi|254781031|r 10 SALLLFSGGQDSSTCLSWALDRFDRVETLSFD 41 (240) Q Consensus 10 ~av~l~SGGlDS~~~l~~l~~~~~~v~~i~~~ 41 (240) +.|.|.|||.||.-...-..+.|++++++--- T Consensus 2 rvvaLiSGGKDScynmm~cv~~gHeiVaLanl 33 (277) T KOG2316 2 RVVALISGGKDSCYNMMCCVRLGHEIVALANL 33 (277) T ss_pred CEEEEEECCHHHHHHHHHHHHCCCEEEEEECC T ss_conf 18999717807899999999848743564324 No 91 >TIGR00290 MJ0570_dom MJ0570-related uncharacterized domain; InterPro: IPR002761 This domain is about 200 amino acids long with a strongly conserved motif SGGKD at the N-terminal. The structure of Q8U2K6 from Pyrococcus furiosus has been resolved to 2.7A and is suggested to be a putative N-type pytophosphatase. In some members of the family e.g. Q12429 from SWISSPROT, this domain is associated with IPR006175 from INTERPRO, another domain of unknown function. Proteins with this uncharacterised domain include two apparent ortholog families in the archaea, one of which is universal among the first four completed archaeal genomes. The domain comprises the full length of the archaeal proteins and the first third of fungal proteins. . Probab=85.82 E-value=0.66 Score=24.67 Aligned_cols=33 Identities=33% Similarity=0.390 Sum_probs=26.9 Q ss_pred CEEEEEECCHHHHHHHHHHHHCCCEEEEEEEEC Q ss_conf 489998096569999999997498399999767 Q gi|254781031|r 10 SALLLFSGGQDSSTCLSWALDRFDRVETLSFDY 42 (240) Q Consensus 10 ~av~l~SGGlDS~~~l~~l~~~~~~v~~i~~~y 42 (240) +.++++|||-||...+++.++++.++..+.--+ T Consensus 2 ~~~~~~~gg~d~~~~~~~~~~~~~~~~~l~~~~ 34 (265) T TIGR00290 2 KVAALYSGGKDSCLALYHALKEGHEVISLVNVF 34 (265) T ss_pred CEEEEEECCCHHHHHHHHHHHCCCCCHHHHHHC T ss_conf 247875077414789999874176100121101 No 92 >KOG2840 consensus Probab=79.81 E-value=2.1 Score=21.56 Aligned_cols=75 Identities=20% Similarity=0.018 Sum_probs=44.1 Q ss_pred CHHHCCCCCEEEEEECCHHHHHHHHHHHH--C----CCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCE Q ss_conf 12213599489998096569999999997--4----98399999767788777279999999999999888752035615 Q gi|254781031|r 2 NDIIKKAPSALLLFSGGQDSSTCLSWALD--R----FDRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDH 75 (240) Q Consensus 2 ~~~~~~~~~av~l~SGGlDS~~~l~~l~~--~----~~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~ 75 (240) |+++....+.-+--|||-||+++++.+.. + |.+..+++++-|-+= .. +....+. ..+ ...+..++. T Consensus 45 n~l~~rge~v~igasGgkdstvlA~v~~~Ln~r~~~g~~l~Lls~degi~g-yr-d~sl~av---krn---~~~~~lPL~ 116 (347) T KOG2840 45 NKLFARGERVAIGASGGKDSTVLAYVLDALNERHDYGLRLFLLSIDEGIRG-YR-DDSLEAV---KRN---GVQYGLPLC 116 (347) T ss_pred CCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCC-EE-CCHHHHH---HHH---HHHCCCCEE T ss_conf 686678883132555660277999999984030178841342101003642-00-4299999---986---142078638 Q ss_pred EEECHHHCC Q ss_conf 875011032 Q gi|254781031|r 76 ILPLAILGD 84 (240) Q Consensus 76 ~~~~~~~~~ 84 (240) ++...++-. T Consensus 117 ivs~~dl~~ 125 (347) T KOG2840 117 IVSYKDLYG 125 (347) T ss_pred EECHHHHHC T ss_conf 841999725 No 93 >PRK05077 frsA fermentation/respiration switch protein; Reviewed Probab=71.60 E-value=3.1 Score=20.53 Aligned_cols=33 Identities=24% Similarity=0.304 Sum_probs=22.1 Q ss_pred CEEEEEECCHHHHHHHHHHHH-CCC---EEEEEEEEC Q ss_conf 489998096569999999997-498---399999767 Q gi|254781031|r 10 SALLLFSGGQDSSTCLSWALD-RFD---RVETLSFDY 42 (240) Q Consensus 10 ~av~l~SGGlDS~~~l~~l~~-~~~---~v~~i~~~y 42 (240) --+|++||||||+=.=++-+= ++. .+--+|+|. T Consensus 194 ~PtViv~GgLDsLQ~D~~rLF~dylaP~giAMLTlDM 230 (414) T PRK05077 194 FPTVLMCGGLDSLQTDYYRLFRDYLAPRGIAMLTIDM 230 (414) T ss_pred CCEEEEECCHHHHHHHHHHHHHHHCCCCCCEEEEECC T ss_conf 8759995672477888999999852646733797037 No 94 >TIGR02397 dnaX_nterm DNA polymerase III, subunits gamma and tau; InterPro: IPR012763 This entry represents the well-conserved first N-terminal domain of DnaX, approx. 365 aa. The full-length product of the dnaX gene in Escherichia coli encodes the DNA polymerase III tau subunit. A translational frameshift leads to early termination and a truncated protein subunit gamma, about 1/3 shorter than tau and present in roughly equal amounts. This frameshift mechanism is not necessarily universal for species with DNA polymerase III but appears conserved in the extreme thermophile Thermus thermophilis.; GO: 0003887 DNA-directed DNA polymerase activity, 0005524 ATP binding, 0006260 DNA replication, 0009360 DNA polymerase III complex. Probab=71.30 E-value=2.1 Score=21.58 Aligned_cols=23 Identities=30% Similarity=0.725 Sum_probs=12.8 Q ss_pred HHHCCCCCCCCCCCCCCCCCCCCCCHHHHH Q ss_conf 553721628886555566889986877899 Q gi|254781031|r 197 LEESHTCYLGKRDKRYEWGYGCNSCPACYL 226 (240) Q Consensus 197 ~~~T~SC~~~~~~~~~~~~~hCG~C~~C~~ 226 (240) |++.-=|- |. .+.|||.|.+|.+ T Consensus 56 fAKaLNC~-~~------~~~PCn~C~~C~~ 78 (363) T TIGR02397 56 FAKALNCQ-GP------DGEPCNECESCKE 78 (363) T ss_pred HHHHHCCC-CC------CCCCCCCCCHHHH T ss_conf 99986588-78------7787777502277 No 95 >KOG2303 consensus Probab=69.47 E-value=2.6 Score=21.01 Aligned_cols=18 Identities=39% Similarity=0.458 Sum_probs=13.8 Q ss_pred EEEEEECCHHHHHHHHHH Q ss_conf 899980965699999999 Q gi|254781031|r 11 ALLLFSGGQDSSTCLSWA 28 (240) Q Consensus 11 av~l~SGGlDS~~~l~~l 28 (240) -.+-+|||+||++.|.+. T Consensus 352 fflPLSGG~DSsatA~iV 369 (706) T KOG2303 352 FFLPLSGGVDSSATAAIV 369 (706) T ss_pred EEEECCCCCCCHHHHHHH T ss_conf 477647886405899999 No 96 >PRK12435 ferrochelatase; Provisional Probab=69.13 E-value=2.4 Score=21.17 Aligned_cols=22 Identities=27% Similarity=0.395 Sum_probs=17.4 Q ss_pred HCCCCCEEEEEE-CCHHHHHHHH Q ss_conf 135994899980-9656999999 Q gi|254781031|r 5 IKKAPSALLLFS-GGQDSSTCLS 26 (240) Q Consensus 5 ~~~~~~av~l~S-GGlDS~~~l~ 26 (240) |+++|.+|+|++ ||.||.--.- T Consensus 1 M~k~K~gVLL~nlG~P~s~~dV~ 23 (316) T PRK12435 1 MKKKKIGLLVMAYGTPESLDDVE 23 (316) T ss_pred CCCCCEEEEEEECCCCCCHHHHH T ss_conf 99885899998789999876899 No 97 >pfam06500 DUF1100 Alpha/beta hydrolase of unknown function (DUF1100). This family consists of several hypothetical bacterial proteins of unknown function. Members of this family have an alpha/beta hydrolase fold. Probab=67.97 E-value=4.1 Score=19.72 Aligned_cols=34 Identities=32% Similarity=0.542 Sum_probs=22.1 Q ss_pred CCEEEEEECCHHHHHHHHHHHHC-C---CEEEEEEEEC Q ss_conf 94899980965699999999974-9---8399999767 Q gi|254781031|r 9 PSALLLFSGGQDSSTCLSWALDR-F---DRVETLSFDY 42 (240) Q Consensus 9 ~~av~l~SGGlDS~~~l~~l~~~-~---~~v~~i~~~y 42 (240) +--+|++||||||+=.=++-+=+ + ..+--+|+|. T Consensus 189 P~PtViv~ggLDsLQ~D~~rLF~dyLaP~giAMLTlDM 226 (411) T pfam06500 189 PFPVVLVSAGLDSLQTDMWRLFRDYLAPRDIAMLTIDM 226 (411) T ss_pred CCCEEEEECCHHHHHHHHHHHHHHHCCCCCCEEEEECC T ss_conf 88869995772577888999999853745733797047 No 98 >cd01524 RHOD_Pyr_redox Member of the Rhodanese Homology Domain superfamily. Included in this CD are the Lactococcus lactis NADH oxidase, Bacillus cereus NADH dehydrogenase, and Bacteroides thetaiotaomicron pyridine nucleotide-disulphide oxidoreductase, and similar rhodanese-like domains found C-terminal of the pyridine nucleotide-disulphide oxidoreductase (Pyr-redox) domain and the Pyr-redox dimerization domain. Probab=66.12 E-value=9.1 Score=17.59 Aligned_cols=32 Identities=13% Similarity=0.044 Sum_probs=20.6 Q ss_pred CCCCEEEEEECCHHHHHHHHHHHHCCCEEEEE Q ss_conf 59948999809656999999999749839999 Q gi|254781031|r 7 KAPSALLLFSGGQDSSTCLSWALDRFDRVETL 38 (240) Q Consensus 7 ~~~~av~l~SGGlDS~~~l~~l~~~~~~v~~i 38 (240) +.++.|+..-.|.-|..++..|.+.|++++-| T Consensus 50 k~~~ivv~C~~G~rS~~aa~~L~~~G~~v~~l 81 (90) T cd01524 50 KDKEIIVYCAVGLRGYIAARILTQNGFKVKNL 81 (90) T ss_pred CCCEEEEECCCCCHHHHHHHHHHHCCCCEEEE T ss_conf 58709998899805999999999869988996 No 99 >cd01534 4RHOD_Repeat_3 Member of the Rhodanese Homology Domain superfamily, repeat 3. This CD includes putative rhodanese-related sulfurtransferases which contain 4 copies of the Rhodanese Homology Domain. This CD aligns the 3rd repeat which does not contain the putative catalytic Cys residue. Probab=63.96 E-value=10 Score=17.34 Aligned_cols=31 Identities=19% Similarity=0.137 Sum_probs=19.9 Q ss_pred CCCEEEEEECCHHHHHHHHHHHHCCCEEEEE Q ss_conf 9948999809656999999999749839999 Q gi|254781031|r 8 APSALLLFSGGQDSSTCLSWALDRFDRVETL 38 (240) Q Consensus 8 ~~~av~l~SGGlDS~~~l~~l~~~~~~v~~i 38 (240) ..+.|+....|.-|...+.+|.+.|++|+.+ T Consensus 56 ~~~ivl~C~~G~RS~~AA~~L~~~G~~v~~L 86 (95) T cd01534 56 GARIVLADDDGVRADMTASWLAQMGWEVYVL 86 (95) T ss_pred CCEEEEECCCCCCHHHHHHHHHHCCCCEEEE T ss_conf 9839998599987999999999869976995 No 100 >pfam10589 NADH_4Fe-4S NADH-ubiquinone oxidoreductase-F iron-sulfur binding region. Probab=61.17 E-value=2 Score=21.61 Aligned_cols=14 Identities=29% Similarity=0.700 Sum_probs=10.3 Q ss_pred CCCCCHHHHHHHHH Q ss_conf 99868778999999 Q gi|254781031|r 217 GCNSCPACYLRQKG 230 (240) Q Consensus 217 hCG~C~~C~~R~~a 230 (240) -||+|.||.+=-.. T Consensus 15 SCGqC~PCR~Gt~~ 28 (46) T pfam10589 15 SCGKCTPCREGTKW 28 (46) T ss_pred CCCCCCCCCCCHHH T ss_conf 58999884051999 No 101 >PRK05707 DNA polymerase III subunit delta'; Validated Probab=61.14 E-value=5.6 Score=18.92 Aligned_cols=16 Identities=25% Similarity=0.364 Sum_probs=7.9 Q ss_pred CEEEEEEC--CHHHHHHHH Q ss_conf 48999809--656999999 Q gi|254781031|r 10 SALLLFSG--GQDSSTCLS 26 (240) Q Consensus 10 ~av~l~SG--GlDS~~~l~ 26 (240) .| .||+| |+=-..+|. T Consensus 23 HA-~Lf~G~~G~GK~~lA~ 40 (328) T PRK05707 23 HA-YLLHGPAGIGKRALAE 40 (328) T ss_pred EE-EEEECCCCCCHHHHHH T ss_conf 04-6447999867999999 No 102 >TIGR00276 TIGR00276 iron-sulfur cluster binding protein, putative; InterPro: IPR004453 This is a family of putative iron-sulphur cluster binding proteins. These proteins contain the 4Fe-4S ferredoxins iron-sulphur binding domain.. Probab=60.54 E-value=3.5 Score=20.13 Aligned_cols=11 Identities=27% Similarity=0.818 Sum_probs=9.7 Q ss_pred CCCCCCHHHHH Q ss_conf 89986877899 Q gi|254781031|r 216 YGCNSCPACYL 226 (240) Q Consensus 216 ~hCG~C~~C~~ 226 (240) .+||.|..|++ T Consensus 172 ~~CG~C~~C~~ 182 (297) T TIGR00276 172 ERCGKCTKCLD 182 (297) T ss_pred CCCCCCHHHHH T ss_conf 57873224663 No 103 >TIGR00692 tdh L-threonine 3-dehydrogenase; InterPro: IPR004627 L-threonine 3-dehydrogenase (1.1.1.103 from EC) is a tetrameric, zinc-binding, NAD-dependent enzyme of threonine catabolism. It catalyses the conversion of L-threonine and NAD+ to L-2-amino-3-oxobutanoate and NADH. In Escherichia coli His-90 modulates substrate specificity and is believed part of the active site. Closely related proteins include sorbitol dehydrogenase, xylitol dehydrogenase, and benzyl alcohol dehydrogenase. Eukaryotic examples of this enzyme have been demonstrated experimentally but are not detected by this HMM. ; GO: 0008270 zinc ion binding, 0008743 L-threonine 3-dehydrogenase activity, 0006567 threonine catabolic process. Probab=60.07 E-value=2.2 Score=21.39 Aligned_cols=20 Identities=15% Similarity=0.224 Sum_probs=8.5 Q ss_pred CHHHHHHHHHHHCCCHHCCC Q ss_conf 88999999998282000143 Q gi|254781031|r 176 KKYETWKLAQDIGGQDLVNL 195 (240) Q Consensus 176 tK~eI~~~~~~~~~~~~~~~ 195 (240) .|.++++...+++..+..++ T Consensus 214 ~kEd~~k~v~~lt~geG~Dv 233 (341) T TIGR00692 214 AKEDLVKVVAELTSGEGVDV 233 (341) T ss_pred CCCCHHHHHHHHCCCCCEEE T ss_conf 01476898884038896479 No 104 >PRK05818 DNA polymerase III subunit delta'; Validated Probab=58.63 E-value=5.8 Score=18.80 Aligned_cols=12 Identities=33% Similarity=0.614 Sum_probs=6.1 Q ss_pred HHCCCCCEEEEE Q ss_conf 213599489998 Q gi|254781031|r 4 IIKKAPSALLLF 15 (240) Q Consensus 4 ~~~~~~~av~l~ 15 (240) .|+.+..|+++. T Consensus 3 ~~~n~~halLl~ 14 (262) T PRK05818 3 LMKNKTHPLLLI 14 (262) T ss_pred CCCCCCCEEEEE T ss_conf 334898505664 No 105 >cd01523 RHOD_Lact_B Member of the Rhodanese Homology Domain superfamily. This CD includes predicted proteins with rhodanese-like domains found N-terminal of the metallo-beta-lactamase domain. Probab=56.58 E-value=13 Score=16.54 Aligned_cols=29 Identities=10% Similarity=0.029 Sum_probs=15.4 Q ss_pred CCCEEEEEECCHHHHHHHHHHHHCCCEEE Q ss_conf 99489998096569999999997498399 Q gi|254781031|r 8 APSALLLFSGGQDSSTCLSWALDRFDRVE 36 (240) Q Consensus 8 ~~~av~l~SGGlDS~~~l~~l~~~~~~v~ 36 (240) .++.|+.+.+|--|..++..|.+.|+++. T Consensus 61 d~~ivv~C~~G~rS~~aa~~L~~~G~~v~ 89 (100) T cd01523 61 DQEVTVICAKEGSSQFVAELLAERGYDVD 89 (100) T ss_pred CCEEEEEECCCHHHHHHHHHHHHCCCEEE T ss_conf 98399992896569999999998798379 No 106 >COG1751 Uncharacterized conserved protein [Function unknown] Probab=56.08 E-value=14 Score=16.49 Aligned_cols=60 Identities=18% Similarity=0.025 Sum_probs=42.3 Q ss_pred CCEEEEEECCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 9489998096569999999997498399999767788777279999999999999888752 Q gi|254781031|r 9 PSALLLFSGGQDSSTCLSWALDRFDRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKD 69 (240) Q Consensus 9 ~~av~l~SGGlDS~~~l~~l~~~~~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~ 69 (240) ++.|+.-|-| +++.-+..+.+...+++.||+++|+..+.+.+....+.+.+.+....+.. T Consensus 29 k~~vVAS~tG-~tA~k~lemveg~lkvVvVthh~Gf~e~g~~e~~~E~~~~L~erGa~v~~ 88 (186) T COG1751 29 KHIVVASSTG-YTALKALEMVEGDLKVVVVTHHAGFEEKGTQEMDEEVRKELKERGAKVLT 88 (186) T ss_pred CEEEEEECCC-HHHHHHHHHCCCCCEEEEEEEECCCCCCCCEECCHHHHHHHHHCCCEEEE T ss_conf 1599984365-78999998442673599998322655688520589999999970860043 No 107 >PRK13596 NADH dehydrogenase I subunit F; Provisional Probab=56.02 E-value=3.5 Score=20.18 Aligned_cols=16 Identities=19% Similarity=0.449 Sum_probs=11.7 Q ss_pred CCCCCCHHHHHHHHHH Q ss_conf 8998687789999999 Q gi|254781031|r 216 YGCNSCPACYLRQKGW 231 (240) Q Consensus 216 ~hCG~C~~C~~R~~af 231 (240) .-||+|.||.+=-... T Consensus 343 ESCGqCtPCReGt~~l 358 (426) T PRK13596 343 ESCGQCTPCREGTGWM 358 (426) T ss_pred CCCCCCCCCHHHHHHH T ss_conf 7899998645159999 No 108 >KOG0795 consensus Probab=55.47 E-value=6.3 Score=18.59 Aligned_cols=80 Identities=15% Similarity=0.144 Sum_probs=46.2 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHCCCCCEEEE---EEECCCCHHHHHHHHHHHCCCHHCCCCHHHCCCCCCCCCCCCCCCC Q ss_conf 4345554430247777767753064321677---5313898899999999828200014325537216288865555668 Q gi|254781031|r 139 DYSGYPDCRHDTIRAIETAINLGMESHVTVH---TPLMWLKKYETWKLAQDIGGQDLVNLILEESHTCYLGKRDKRYEWG 215 (240) Q Consensus 139 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~---~P~~~~tK~eI~~~~~~~~~~~~~~~~~~~T~SC~~~~~~~~~~~~ 215 (240) +...+|+.+++|.+....-+......+ .+. +|-.+.+|+ ||++.-+ .+.++ .| +|+++++-..+ T Consensus 78 ~Ry~sPdE~PFfpd~lpepilP~~~yp-~~Lhp~a~svNiNkk-Iw~~Yf~--------~lvP~-iv--kpGDDgNygSt 144 (262) T KOG0795 78 RRYESPDEHPFFPDNLPEPILPPIQYP-QVLHPYAPSVNINKK-IWNMYFK--------ELVPL-IV--KPGDDGNYGST 144 (262) T ss_pred HCCCCCCCCCCCCCCCCCCCCCCCCCC-CCCCCCCCCCCHHHH-HHHHHHH--------HHHHH-HC--CCCCCCCCCHH T ss_conf 502697658886111875557877787-445667754555389-9999999--------88566-15--87888876237 Q ss_pred CCCCC-CHHHHHHHHHH Q ss_conf 89986-87789999999 Q gi|254781031|r 216 YGCNS-CPACYLRQKGW 231 (240) Q Consensus 216 ~hCG~-C~~C~~R~~af 231 (240) .-|.. |-+|+-||--| T Consensus 145 a~cD~~CLQ~LSrRIHy 161 (262) T KOG0795 145 AVCDIECLQSLSRRIHY 161 (262) T ss_pred HHHHHHHHHHHHHHHHH T ss_conf 88789999999888552 No 109 >COG1856 Uncharacterized homolog of biotin synthetase [Function unknown] Probab=55.41 E-value=5.4 Score=19.02 Aligned_cols=31 Identities=32% Similarity=0.296 Sum_probs=16.5 Q ss_pred HHCCCCCEEEEEECCHHHH--HHHH---HHHHCCCEE Q ss_conf 2135994899980965699--9999---999749839 Q gi|254781031|r 4 IIKKAPSALLLFSGGQDSS--TCLS---WALDRFDRV 35 (240) Q Consensus 4 ~~~~~~~av~l~SGGlDS~--~~l~---~l~~~~~~v 35 (240) |.|+--.- +|+|||+||- +-++ .++++..+- T Consensus 51 l~kkGy~g-~llSGGm~srg~VPl~kf~d~lK~lke~ 86 (275) T COG1856 51 LEKKGYEG-CLLSGGMDSRGKVPLWKFKDELKALKER 86 (275) T ss_pred HHHCCCEE-EEEECCCCCCCCCCHHHHHHHHHHHHHH T ss_conf 98457605-8975786879974289999999998775 No 110 >KOG0317 consensus Probab=55.13 E-value=6.2 Score=18.64 Aligned_cols=32 Identities=25% Similarity=0.479 Sum_probs=18.6 Q ss_pred CCHHHCCCCCCCCCCCCCCCCCCCCC--CHHHHH Q ss_conf 32553721628886555566889986--877899 Q gi|254781031|r 195 LILEESHTCYLGKRDKRYEWGYGCNS--CPACYL 226 (240) Q Consensus 195 ~~~~~T~SC~~~~~~~~~~~~~hCG~--C~~C~~ 226 (240) ....-|+=|..=-+.+.....++||. ||+|+. T Consensus 234 ~i~~a~~kC~LCLe~~~~pSaTpCGHiFCWsCI~ 267 (293) T KOG0317 234 SIPEATRKCSLCLENRSNPSATPCGHIFCWSCIL 267 (293) T ss_pred CCCCCCCCEEEEECCCCCCCCCCCCCHHHHHHHH T ss_conf 5778887347871678798768672117999999 No 111 >TIGR00678 holB DNA polymerase III, delta' subunit; InterPro: IPR004622 DNA-directed DNA polymerase (2.7.7.7 from EC) catalyzes DNA-template-directed extension of the 3'-end of an RNA strand by one nucleotide at a time. DNA polymerase III is a complex, multi-chain enzyme responsible for most of the replicative synthesis in bacteria. The enzyme also has 3' to 5' exonuclease activity. It has a core composed of alpha, epsilon and theta chains, that associate with a tau subunit which allows the core dimerisation to form the PolIII' complex. PolIII' associates with the gamma complex (gamma, delta, delta', psi and chi chains) and with the beta chain. This domain is the N-terminal half of the delta' subunit of DNA polymerase III. Delta' is homologous to the gamma and tau subunits, which form an outgroup for phylogenetic comparison. The gamma/tau branch of the tree is much more tightly conserved than the delta' branch, and some members of that branch score more highly against this model than some proteins classified as delta'. The noise cut-off is set to detect weakly scoring delta' subunits rather than to exclude gamma/tau subunits.; GO: 0003887 DNA-directed DNA polymerase activity, 0008408 3'-5' exonuclease activity, 0006260 DNA replication. Probab=55.11 E-value=7.6 Score=18.08 Aligned_cols=19 Identities=16% Similarity=0.118 Sum_probs=10.5 Q ss_pred EEECCCCHHHHHHHHHHHC Q ss_conf 5313898899999999828 Q gi|254781031|r 170 TPLMWLKKYETWKLAQDIG 188 (240) Q Consensus 170 ~P~~~~tK~eI~~~~~~~~ 188 (240) -+|.-.+..++.++-.+-| T Consensus 175 ~~f~~l~~~~~~~~L~~~g 193 (216) T TIGR00678 175 LPFPPLSEEALLQWLIEQG 193 (216) T ss_pred EEECCCCHHHHHHHHHHCC T ss_conf 8625998899999999708 No 112 >COG5273 Uncharacterized protein containing DHHC-type Zn finger [General function prediction only] Probab=54.99 E-value=3.6 Score=20.13 Aligned_cols=22 Identities=36% Similarity=0.816 Sum_probs=16.0 Q ss_pred CCC--CCCCCCCCCCCCCCCCCCHHHHHHH Q ss_conf 216--2888655556688998687789999 Q gi|254781031|r 201 HTC--YLGKRDKRYEWGYGCNSCPACYLRQ 228 (240) Q Consensus 201 ~SC--~~~~~~~~~~~~~hCG~C~~C~~R~ 228 (240) .+| ++|.+ ..||+.|-.|++|- T Consensus 113 ~~C~~~KP~R------S~HC~~Cn~CV~k~ 136 (309) T COG5273 113 STCNIYKPPR------SHHCSICNRCVLKF 136 (309) T ss_pred CCCCCCCCCC------CCCCHHHCCCCCCC T ss_conf 0243755896------75043306102045 No 113 >COG1062 AdhC Zn-dependent alcohol dehydrogenases, class III [Energy production and conversion] Probab=52.22 E-value=3.1 Score=20.52 Aligned_cols=29 Identities=10% Similarity=0.095 Sum_probs=13.3 Q ss_pred EEEECCHHHHHHHHHHHHCCCEEEEEEEE Q ss_conf 99809656999999999749839999976 Q gi|254781031|r 13 LLFSGGQDSSTCLSWALDRFDRVETLSFD 41 (240) Q Consensus 13 ~l~SGGlDS~~~l~~l~~~~~~v~~i~~~ 41 (240) +.-=||+-=++..--....-..+++|-+| T Consensus 191 V~GlGgVGlaaI~gA~~agA~~IiAvD~~ 219 (366) T COG1062 191 VFGLGGVGLAAIQGAKAAGAGRIIAVDIN 219 (366) T ss_pred EEECCHHHHHHHHHHHHCCCCEEEEEECC T ss_conf 99042766999998987488549999377 No 114 >TIGR02055 APS_reductase adenylylsulfate reductase, thioredoxin dependent; InterPro: IPR011798 This entry describes recently identified adenosine 5'-phosphosulphate (APS) reductase activity found in sulphate-assimilatory prokaryotes, thus separating it from the traditionally described phosphoadenosine 5'-phosphosulphate (PAPS) reductases found in bacteria and fungi. Homologous to PAPS reductase in enterobacteria, cyanobacteria, and yeast, APS reductase here clusters with, and demonstrates greater homology to plant APS reductase. Additionally, the presence of two conserved C-terminal motifs (CCXXRKXXPL and SXGCXXCT) distinguishes APS substrate specificity and serves as a FeS cluster.; GO: 0004604 phosphoadenylyl-sulfate reductase (thioredoxin) activity, 0019344 cysteine biosynthetic process. Probab=50.66 E-value=17 Score=15.95 Aligned_cols=151 Identities=13% Similarity=0.043 Sum_probs=76.9 Q ss_pred ECCHHHHHHHHHHHHCCCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECHHHCCCCCCCCCCCHH Q ss_conf 09656999999999749839999976778877727999999999999988875203561587501103224443322201 Q gi|254781031|r 16 SGGQDSSTCLSWALDRFDRVETLSFDYGQRNKVELECRLCVRKKIVELMPKWKDSLGEDHILPLAILGDISHSSLTKNVA 95 (240) Q Consensus 16 SGGlDS~~~l~~l~~~~~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 95 (240) |=|+.=++++-++.+...++-.++.|.|-=|.=+.+.. .++.........+.................. T Consensus 1 Sl~~ED~VL~dl~~k~~~~~~v~~LDTGrlf~ET~~~i-----------dqv~E~Y~~~~~~~~~~~~~~~e~v~~~G~n 69 (200) T TIGR02055 1 SLGAEDLVLVDLIAKVRKDVEVFFLDTGRLFKETYELI-----------DQVEERYDILIKVLVPPELTVEEQVEEYGLN 69 (200) T ss_pred CCHHHHHHHHHHHHHCCCCCCEEEECCCCCHHHHHHHH-----------HHHHCCCCCEEEECCCCHHHHHHHHHHHCHH T ss_conf 91012489999998607997278853888616779887-----------6532016710120168877888999884735 Q ss_pred HHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHCCC---CCEEEEEEE Q ss_conf 10001233101111111223337777774339936511454444345554430247777767753064---321677531 Q gi|254781031|r 96 MKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSGYPDCRHDTIRAIETAINLGME---SHVTVHTPL 172 (240) Q Consensus 96 ~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~i~~P~ 172 (240) ......-........|....|..++. |..+=.+|..+.++..++.+.+..- +++..+ .++--+||+ T Consensus 70 lF~~~ve~~~~CC~~RKV~PL~raL~-----G~~AW~TGLRR~q~P~TRA~~~~~e------~De~~~~PekPl~K~nPL 138 (200) T TIGR02055 70 LFYRSVELRKECCRIRKVEPLKRALS-----GVSAWITGLRREQSPPTRAQAPFLE------IDEAFGSPEKPLVKVNPL 138 (200) T ss_pred HHHHHCCCHHHCCCCCCCHHHHHCCC-----CCCCCEEECCCCCCCCCHHHHHHHH------CCCCCCCCCCCEEEECCC T ss_conf 55540211010661454214574048-----8762030010067687554415542------022367898761443660 Q ss_pred CCCCHHHHHHHHHHHC Q ss_conf 3898899999999828 Q gi|254781031|r 173 MWLKKYETWKLAQDIG 188 (240) Q Consensus 173 ~~~tK~eI~~~~~~~~ 188 (240) +.+|-.||++..++.. T Consensus 139 a~Wt~~~vw~Yi~~~~ 154 (200) T TIGR02055 139 ADWTLEDVWEYIADNE 154 (200) T ss_pred CCCCHHHHHHHHHHCC T ss_conf 0046689999999707 No 115 >pfam04301 DUF452 Protein of unknown function (DUF452). Probab=50.41 E-value=17 Score=15.92 Aligned_cols=36 Identities=19% Similarity=0.425 Sum_probs=25.7 Q ss_pred CCCCEEEEEEC-CHHHHHHHHHHHHCCCEEEEEEEECC Q ss_conf 59948999809-65699999999974983999997677 Q gi|254781031|r 7 KAPSALLLFSG-GQDSSTCLSWALDRFDRVETLSFDYG 43 (240) Q Consensus 7 ~~~~av~l~SG-GlDS~~~l~~l~~~~~~v~~i~~~yg 43 (240) ..++.++.|+| |+|..+...+.++.++++ .+.+||- T Consensus 9 ~~~~LiLfF~GWg~~~~~f~hL~~p~~~D~-l~~yDYr 45 (213) T pfam04301 9 QGDHLILYFAGWGTPPDAVEHLILPENHDL-CICYDYQ 45 (213) T ss_pred CCCEEEEEEECCCCCHHHHHHHCCCCCCEE-EEEECCC T ss_conf 897599998167899789866336778649-9991466 No 116 >PTZ00304 NADH dehydrogenase (ubiquinone); Provisional Probab=49.90 E-value=3.8 Score=19.96 Aligned_cols=12 Identities=25% Similarity=0.847 Sum_probs=9.6 Q ss_pred CCCCCHHHHHHH Q ss_conf 998687789999 Q gi|254781031|r 217 GCNSCPACYLRQ 228 (240) Q Consensus 217 hCG~C~~C~~R~ 228 (240) .||+|.||.+=- T Consensus 366 SCGqCTPCReGT 377 (483) T PTZ00304 366 SCGQCTPCREGS 377 (483) T ss_pred CCCCCCCCCCHH T ss_conf 788997861728 No 117 >PTZ00082 L-lactate dehydrogenase; Provisional Probab=49.83 E-value=17 Score=15.87 Aligned_cols=38 Identities=18% Similarity=0.199 Sum_probs=26.9 Q ss_pred CCHHHCCCCCEEEEEECCHHHHHHHHHHHHCCCEEEEEEEE Q ss_conf 91221359948999809656999999999749839999976 Q gi|254781031|r 1 MNDIIKKAPSALLLFSGGQDSSTCLSWALDRFDRVETLSFD 41 (240) Q Consensus 1 ~~~~~~~~~~av~l~SGGlDS~~~l~~l~~~~~~v~~i~~~ 41 (240) |.+||+++|-+|+= +|.+=|+++. .+...+.. ..+-+| T Consensus 1 ~~~~~~~~KV~IIG-aG~VG~~~A~-~l~~~~l~-eivLiD 38 (322) T PTZ00082 1 MAKMIPRKKISLIG-SGNIGGVMAY-LIQLKNLA-DVVLFD 38 (322) T ss_pred CCCCCCCCCEEEEC-CCHHHHHHHH-HHHCCCCC-EEEEEE T ss_conf 91101798299989-6989999999-99638997-799997 No 118 >PRK10684 HCP oxidoreductase, NADH-dependent; Provisional Probab=49.78 E-value=17 Score=15.86 Aligned_cols=38 Identities=13% Similarity=0.173 Sum_probs=21.4 Q ss_pred CCEEEEEECCHHHHHHHHHH---HHCCCEEEEEEEECCCCCC Q ss_conf 94899980965699999999---9749839999976778877 Q gi|254781031|r 9 PSALLLFSGGQDSSTCLSWA---LDRFDRVETLSFDYGQRNK 47 (240) Q Consensus 9 ~~av~l~SGGlDS~~~l~~l---~~~~~~v~~i~~~ygq~~~ 47 (240) .+-++|++||.==+=.+-|+ ++++. -..|++-|+.|.. T Consensus 112 ~~~~vliAgGiGITPilSMl~~l~~~~~-~~~v~l~y~~r~~ 152 (335) T PRK10684 112 EDKYLLLAAGCGVTPIMSMRRWLAKNRP-QADVQVIFNVRTP 152 (335) T ss_pred CCCEEEEECCCCCCHHHHHHHHHHHHCC-CCCEEEEEEECCC T ss_conf 9868999654121569999999996199-9978999971771 No 119 >cd01518 RHOD_YceA Member of the Rhodanese Homology Domain superfamily. This CD includes Escherichia coli YceA, Bacillus subtilis YbfQ, and similar uncharacterized proteins. Probab=47.66 E-value=19 Score=15.66 Aligned_cols=24 Identities=17% Similarity=0.239 Sum_probs=9.3 Q ss_pred CEEEEEECCHHHHHHHHHHHHCCC Q ss_conf 489998096569999999997498 Q gi|254781031|r 10 SALLLFSGGQDSSTCLSWALDRFD 33 (240) Q Consensus 10 ~av~l~SGGlDS~~~l~~l~~~~~ 33 (240) +.|+-..+|.-|..++..|.+.|+ T Consensus 63 ~ivvyC~~G~RS~~Aa~~L~~~Gf 86 (101) T cd01518 63 KVLMYCTGGIRCEKASAYLKERGF 86 (101) T ss_pred EEEEECCCCHHHHHHHHHHHHCCC T ss_conf 699985998279999999998498 No 120 >PRK07276 DNA polymerase III subunit delta'; Validated Probab=46.47 E-value=14 Score=16.52 Aligned_cols=21 Identities=29% Similarity=0.248 Sum_probs=16.1 Q ss_pred CCEEEEEECCHHHHHHHHHHHH Q ss_conf 9489998096569999999997 Q gi|254781031|r 9 PSALLLFSGGQDSSTCLSWALD 30 (240) Q Consensus 9 ~~av~l~SGGlDS~~~l~~l~~ 30 (240) ..| -||+||.-+...|.++.+ T Consensus 24 ~HA-yLf~G~~Gk~~~A~~~A~ 44 (290) T PRK07276 24 NHA-YLFSGDFASFEMALFLAQ 44 (290) T ss_pred CEE-EECCCCCCHHHHHHHHHH T ss_conf 505-421698687999999999 No 121 >COG1150 HdrC Heterodisulfide reductase, subunit C [Energy production and conversion] Probab=45.83 E-value=13 Score=16.61 Aligned_cols=10 Identities=40% Similarity=1.042 Sum_probs=4.6 Q ss_pred CCCCHHHHHH Q ss_conf 9868778999 Q gi|254781031|r 218 CNSCPACYLR 227 (240) Q Consensus 218 CG~C~~C~~R 227 (240) |=+|.+|.+| T Consensus 85 C~tCytC~eR 94 (195) T COG1150 85 CVTCYTCTER 94 (195) T ss_pred EEECHHHHHH T ss_conf 4101204301 No 122 >COG1894 NuoF NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit [Energy production and conversion] Probab=45.76 E-value=4.4 Score=19.57 Aligned_cols=11 Identities=27% Similarity=0.915 Sum_probs=8.9 Q ss_pred CCCCCHHHHHH Q ss_conf 99868778999 Q gi|254781031|r 217 GCNSCPACYLR 227 (240) Q Consensus 217 hCG~C~~C~~R 227 (240) .||+|.+|.+= T Consensus 348 SCGkCtPCReG 358 (424) T COG1894 348 SCGKCTPCREG 358 (424) T ss_pred CCCCCCCCCCC T ss_conf 68888786687 No 123 >PRK11278 NADH dehydrogenase I subunit F; Provisional Probab=45.43 E-value=5 Score=19.22 Aligned_cols=16 Identities=19% Similarity=0.280 Sum_probs=11.2 Q ss_pred CCCCCCHHHHHHHHHH Q ss_conf 8998687789999999 Q gi|254781031|r 216 YGCNSCPACYLRQKGW 231 (240) Q Consensus 216 ~hCG~C~~C~~R~~af 231 (240) .-||+|.+|.+=-.-. T Consensus 352 ESCGqCtPCReGt~~l 367 (448) T PRK11278 352 ESCGWCTPCRDGLPWS 367 (448) T ss_pred CCCCCCCCCCCCHHHH T ss_conf 8899998751719999 No 124 >pfam04396 consensus Probab=45.09 E-value=20 Score=15.41 Aligned_cols=17 Identities=6% Similarity=-0.158 Sum_probs=7.0 Q ss_pred HHHHHHCCCCEEECCCC Q ss_conf 77774339936511454 Q gi|254781031|r 120 ATLAYRLGITNIVIGVC 136 (240) Q Consensus 120 ~~~a~~~g~~~i~~~~~ 136 (240) ..++..+.+........ T Consensus 86 ~~~a~~np~Pati~LIS 102 (149) T pfam04396 86 LLWALDNPPPANLMLIS 102 (149) T ss_pred HHHHHHCCCCEEEEEEE T ss_conf 99997099970799995 No 125 >TIGR02196 GlrX_YruB Glutaredoxin-like protein, YruB-family; InterPro: IPR011911 Glutaredoxins , , , also known as thioltransferases (disulphide reductases, are small proteins of approximately one hundred amino-acid residues which utilise glutathione and NADPH as cofactors. Oxidized glutathione is regenerated by glutathione reductase. Together these components compose the glutathione system . Glutaredoxin functions as an electron carrier in the glutathione-dependent synthesis of deoxyribonucleotides by the enzyme ribonucleotide reductase. Like thioredoxin, which functions in a similar way, glutaredoxin possesses an active centre disulphide bond . It exists in either a reduced or an oxidized form where the two cysteine residues are linked in an intramolecular disulphide bond. Glutaredoxin has been sequenced in a variety of species. On the basis of extensive sequence similarity, it has been proposed that Vaccinia virus protein O2L is most probably a glutaredoxin. Finally, it must be noted that Bacteriophage T4 thioredoxin seems also to be evolutionary related. In position 5 of the pattern T4 thioredoxin has Val instead of Pro. This glutaredoxin-like protein family contains the conserved CxxC motif and includes the Clostridium pasteurianum protein YruB which has been cloned from a rubredoxin operon . Somewhat related to NrdH, it is unknown whether this protein actually interacts with glutathione/glutathione reducatase, or, like NrdH, some other reductant system.. Probab=43.31 E-value=11 Score=17.11 Aligned_cols=19 Identities=16% Similarity=0.419 Sum_probs=13.3 Q ss_pred CHHHHHHHHHHHHHHHHHC Q ss_conf 8778999999997306407 Q gi|254781031|r 221 CPACYLRQKGWMEFKEKYQ 239 (240) Q Consensus 221 C~~C~~R~~af~~~~~~~~ 239 (240) |++|..=|+.|...|..|+ T Consensus 10 Cp~C~~aK~yL~~~gv~f~ 28 (79) T TIGR02196 10 CPPCKKAKEYLTSKGVAFE 28 (79) T ss_pred CCCHHHHHHHHHHCCCCEE T ss_conf 8207899999995499457 No 126 >COG1341 Predicted GTPase or GTP-binding protein [General function prediction only] Probab=42.51 E-value=22 Score=15.17 Aligned_cols=42 Identities=24% Similarity=0.268 Sum_probs=31.9 Q ss_pred CCCCEEEEEECCHHH--HHHHHHHH----HCCCEEEEEEEECCCCCCC Q ss_conf 599489998096569--99999999----7498399999767788777 Q gi|254781031|r 7 KAPSALLLFSGGQDS--STCLSWAL----DRFDRVETLSFDYGQRNKV 48 (240) Q Consensus 7 ~~~~av~l~SGGlDS--~~~l~~l~----~~~~~v~~i~~~ygq~~~~ 48 (240) ..+..++|+=||.|| ++++.+|. ++|+++--|--|-||..-. T Consensus 70 ~~~~~~vmvvG~vDSGKSTLt~~LaN~~l~rG~~v~iiDaDvGQ~ei~ 117 (398) T COG1341 70 AGKVGVVMVVGPVDSGKSTLTTYLANKLLARGRKVAIIDADVGQSEIG 117 (398) T ss_pred CCCCCEEEEECCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCC T ss_conf 068738999898676788999999988764474189996899976667 No 127 >PRK09590 celB cellobiose phosphotransferase system IIB component; Reviewed Probab=42.02 E-value=23 Score=15.12 Aligned_cols=34 Identities=21% Similarity=0.237 Sum_probs=24.2 Q ss_pred CCEEEEEECCHHHHHHHHHHHH----CCC--EEEEEEEEC Q ss_conf 9489998096569999999997----498--399999767 Q gi|254781031|r 9 PSALLLFSGGQDSSTCLSWALD----RFD--RVETLSFDY 42 (240) Q Consensus 9 ~~av~l~SGGlDS~~~l~~l~~----~~~--~v~~i~~~y 42 (240) ||.++..|||+-|+.++--+.+ +|. ++.++.... T Consensus 2 kKILLvCaaGMSTSmlv~km~~~A~~~G~dveI~Av~~~e 41 (104) T PRK09590 2 AKALIICAAGMSSSLMAKKTTEFLKGKGKDIEVDAISATE 41 (104) T ss_pred CEEEEEECCCCCHHHHHHHHHHHHHHCCCCEEEEEEEHHH T ss_conf 5599996899879999999999999769836999848889 No 128 >TIGR03030 CelA cellulose synthase catalytic subunit (UDP-forming). Cellulose synthase catalyzes the beta-1,4 polymerization of glucose residues in the formation of cellulose. In bacteria, the substrate is UDP-glucose. The synthase consists of two subunits (or domains in the frequent cases where it is encoded as a single polypeptide), the catalytic domain modelled here and the regulatory domain (pfam03170). The regulatory domain binds the allosteric activator cyclic di-GMP. The protein is membrane-associated and probably assembles into multimers such that the individual cellulose strands can self-assemble into multi-strand fibrils. Probab=41.56 E-value=15 Score=16.32 Aligned_cols=19 Identities=26% Similarity=0.459 Sum_probs=16.2 Q ss_pred CCCCCCCHHHHHHHHHHHHHH Q ss_conf 889986877899999999730 Q gi|254781031|r 215 GYGCNSCPACYLRQKGWMEFK 235 (240) Q Consensus 215 ~~hCG~C~~C~~R~~af~~~~ 235 (240) ..-||.| |+.||.|+.|.| T Consensus 305 af~cGS~--aV~RR~AL~eIG 323 (713) T TIGR03030 305 AFFCGSA--AVLRREALDEIG 323 (713) T ss_pred EEECCCH--HHHHHHHHHHCC T ss_conf 7982617--987499999807 No 129 >cd03274 ABC_SMC4_euk Eukaryotic SMC4 proteins; SMC proteins are large (approximately 110 to 170 kDa), and each is arranged into five recognizable domains. Amino-acid sequence homology of SMC proteins between species is largely confined to the amino- and carboxy-terminal globular domains. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T, in the single-letter amino-acid code), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif, and a motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. In eukaryotic cells, the proteins are found as heterodimers of SMC1 paired with SMC3, SMC2 with SMC4, and SMC5 with SMC6 (fo Probab=40.96 E-value=22 Score=15.23 Aligned_cols=11 Identities=27% Similarity=0.468 Sum_probs=3.7 Q ss_pred EEEECCHHHHH Q ss_conf 99809656999 Q gi|254781031|r 13 LLFSGGQDSST 23 (240) Q Consensus 13 ~l~SGGlDS~~ 23 (240) .++|||=-|++ T Consensus 126 s~LSGGEKsLt 136 (212) T cd03274 126 SNLSGGEKTLS 136 (212) T ss_pred HHCCCHHHHHH T ss_conf 56275189999 No 130 >TIGR01389 recQ ATP-dependent DNA helicase RecQ; InterPro: IPR006293 The ATP-dependent DNA helicase RecQ (3.6.1 from EC) is involved in genome maintenance . All homologues tested to date unwind paired DNA, translocating in a 3' to 5' direction and several have a preference for forked or 4-way DNA structures (e.g. Holliday junctions) or for G-quartet DNA. The yeast protein, Sgs1, is present in numerous foci that coincide with sites of de novo synthesis DNA, such as the replication fork, and protein levels peak during S-phase. A model has been proposed for Sgs1p action in the S-phase checkpoint response, both as a 'sensor' for damage during replication and a 'resolvase' for structures that arise at paused forks, such as the four-way 'chickenfoot' structure. The action of Sgs1p may serve to maintain the proper amount and integrity of ss DNA that is necessary for the binding of RPA (replication protein A, the eukaryotic ss DNA-binding protein)DNA pol complexes. Sgs1p would thus function by detecting (or resolving) aberrant DNA structures, and would thus contribute to the full activation of the DNA-dependent protein kinase, Mec1p and the effector kinase, Rad53p. Its ability to bind both the large subunit of RPA and the RecA-like protein Rad51p, place it in a unique position to resolve inappropriate fork structures that can occur when either the leading or lagging strand synthesis is stalled. Thus, RecQ helicases integrate checkpoint activation and checkpoint response. ; GO: 0004003 ATP-dependent DNA helicase activity, 0006310 DNA recombination, 0009432 SOS response. Probab=40.76 E-value=7.6 Score=18.07 Aligned_cols=36 Identities=22% Similarity=0.338 Sum_probs=22.8 Q ss_pred CHHHCCCCCCC---CCCCCCCCCCCCCCCHHHHHHHHHH Q ss_conf 25537216288---8655556688998687789999999 Q gi|254781031|r 196 ILEESHTCYLG---KRDKRYEWGYGCNSCPACYLRQKGW 231 (240) Q Consensus 196 ~~~~T~SC~~~---~~~~~~~~~~hCG~C~~C~~R~~af 231 (240) -|..|.+|..- +.-+......|||.|..|+.=...+ T Consensus 364 ~y~~~~~Cr~~~~l~YFGE~~~~e~Cg~CdnCl~p~~~~ 402 (607) T TIGR01389 364 AYCETQTCRRALILRYFGENEEVEPCGNCDNCLDPPKSY 402 (607) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCHH T ss_conf 988789999999886418888898888665557984100 No 131 >TIGR01472 gmd GDP-mannose 4,6-dehydratase; InterPro: IPR006368 This family represent GDP-mannose 4,6-dehydratase, also known as GDP-D-mannose dehydratase. This enzyme converts GDP-mannose to GDP-4-dehydro-6-deoxy-D-mannose, the first of three steps for the conversion of GDP-mannose to GDP-fucose in animals, plants, and bacteria. In bacteria, GDP-L-fucose acts as a precursor of surface antigens such as the extracellular polysaccharide colanic acid of Escherichia coli. Excluded from this family are members of the clade that are poorly related because of highly dervied (phylogenetically long-branch) sequences, e.g. Aneurinibacillus thermoaerophilus Gmd, described as a bifunctional GDP-mannose 4,6-dehydratase/GDP-6-deoxy-D-lyxo-4-hexulose reductase .; GO: 0008446 GDP-mannose 46-dehydratase activity, 0019673 GDP-mannose metabolic process, 0005622 intracellular. Probab=40.30 E-value=24 Score=14.95 Aligned_cols=27 Identities=33% Similarity=0.411 Sum_probs=21.5 Q ss_pred EEEEC--CHHHHHHHHHHHHCCCEEEEEE Q ss_conf 99809--6569999999997498399999 Q gi|254781031|r 13 LLFSG--GQDSSTCLSWALDRFDRVETLS 39 (240) Q Consensus 13 ~l~SG--GlDS~~~l~~l~~~~~~v~~i~ 39 (240) .|.|| |=|=+=+|=+|+++||+||.|- T Consensus 3 ALiTGiTGQDGSYLAE~LL~~GYeVHG~~ 31 (365) T TIGR01472 3 ALITGITGQDGSYLAEFLLEKGYEVHGLI 31 (365) T ss_pred EEEECCCCCCHHHHHHHHHHCCCEEEEEE T ss_conf 68834555767899999872696876458 No 132 >TIGR01198 pgl 6-phosphogluconolactonase; InterPro: IPR005900 6-phosphogluconolactonase (3.1.1.31 from EC) is the enzyme responsible for the hydrolysis of 6-phosphogluconolactone to 6-phosphogluconate, the second step in the pentose phosphate pathway.6-phospho-D-glucono-1,5-lactone + H2O = 6-phospho-D-gluconate. ; GO: 0017057 6-phosphogluconolactonase activity, 0006098 pentose-phosphate shunt. Probab=40.26 E-value=11 Score=17.00 Aligned_cols=41 Identities=20% Similarity=0.240 Sum_probs=25.1 Q ss_pred CCHHHCCCCCEEEEEECCHHHHHHHHHHHHC----C---CEEEEEEEE Q ss_conf 9122135994899980965699999999974----9---839999976 Q gi|254781031|r 1 MNDIIKKAPSALLLFSGGQDSSTCLSWALDR----F---DRVETLSFD 41 (240) Q Consensus 1 ~~~~~~~~~~av~l~SGGlDS~~~l~~l~~~----~---~~v~~i~~~ 41 (240) |.+.++.+.++.|.||||===..++--|-.+ . .+++....| T Consensus 20 ~~~Al~~~~~~~~alSGGr~p~~L~~~La~~~~~~~~~W~~~~~~l~D 67 (251) T TIGR01198 20 LQKALAERGQFLLALSGGRSPIALLEALAAQEAGSKLDWSRIHLFLGD 67 (251) T ss_pred HHHHHHHHCCEEEEEECCCCHHHHHHHHHHHHCCCCCCCCCEEEEEEE T ss_conf 999999768968998248745789999778634365880215788632 No 133 >pfam08366 LLGL LLGL2. This domain is found in lethal giant larvae homolog 2 (LLGL2) proteins and syntaxin-binding proteins like tomosyn. It has been identified in eukaryotes and tends to be found together with WD repeats (pfam00400). Probab=40.24 E-value=16 Score=16.15 Aligned_cols=30 Identities=17% Similarity=0.450 Sum_probs=18.5 Q ss_pred CCCCEEEEEECCHHHHHHHHHHHHCCCEEEEEEEECCC Q ss_conf 59948999809656999999999749839999976778 Q gi|254781031|r 7 KAPSALLLFSGGQDSSTCLSWALDRFDRVETLSFDYGQ 44 (240) Q Consensus 7 ~~~~av~l~SGGlDS~~~l~~l~~~~~~v~~i~~~ygq 44 (240) +...-+++|||||=. ..+-+-++||+--|. T Consensus 25 ~~g~~~iiFSGGmpr--------~~ygdr~~vTV~~gk 54 (105) T pfam08366 25 RSGEPFIIFSGGMPR--------ASYGDRHCITVMHGK 54 (105) T ss_pred CCCCCEEEEECCCCC--------CCCCCCCEEEEEECC T ss_conf 789967998489876--------535897239999798 No 134 >COG1158 Rho Transcription termination factor [Transcription] Probab=40.22 E-value=11 Score=17.07 Aligned_cols=15 Identities=27% Similarity=0.125 Sum_probs=8.0 Q ss_pred EEEEECCCCHHHHHH Q ss_conf 775313898899999 Q gi|254781031|r 168 VHTPLMWLKKYETWK 182 (240) Q Consensus 168 i~~P~~~~tK~eI~~ 182 (240) -+.|.++..|...-+ T Consensus 355 RifPAIdi~kSGTRk 369 (422) T COG1158 355 RIFPAIDINKSGTRK 369 (422) T ss_pred CCCCCEECCCCCCCH T ss_conf 245302035677615 No 135 >pfam08981 consensus Probab=40.20 E-value=24 Score=14.94 Aligned_cols=58 Identities=22% Similarity=0.111 Sum_probs=37.6 Q ss_pred CCEEEEEECCHHHHHHHHHHHH-CCCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 9489998096569999999997-4983999997677887772799999999999998887 Q gi|254781031|r 9 PSALLLFSGGQDSSTCLSWALD-RFDRVETLSFDYGQRNKVELECRLCVRKKIVELMPKW 67 (240) Q Consensus 9 ~~av~l~SGGlDS~~~l~~l~~-~~~~v~~i~~~ygq~~~~~~~~a~~~~~~~~~~~~~~ 67 (240) ++.|+.-|-| ++...++.+++ .+.+++.|++.+|.+.+..-+......+++.+...++ T Consensus 27 ~~iVvASssG-~TA~k~~e~~~~~~~~lvvVth~~GF~~pg~~e~~~e~~~~L~~~G~~V 85 (181) T pfam08981 27 KHIVVASSSG-ETALKAAEALEGTNLNVVVVTHHAGFSEPGEQEMDPEVRKELEERGVKV 85 (181) T ss_pred CEEEEEECCC-HHHHHHHHHHCCCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHCCCEE T ss_conf 7699980787-7999999871358955999946457789985638999999999779889 No 136 >cd05565 PTS_IIB_lactose PTS_IIB_lactose: subunit IIB of enzyme II (EII) of the lactose-specific phosphoenolpyruvate:carbohydrate phosphotransferase system (PTS) found in Firmicutes as well as Actinobacteria. In this system, EII is a lactose-specific permease with two cytoplasmic domains (IIA and IIB) and a transmembrane channel IIC domain. The IIC and IIB domains are expressed as a single protein from the lac operon. The IIB domain fold includes a central four-stranded parallel open twisted beta-sheet flanked by alpha-helices on both sides. The seven major PTS systems with this IIB fold include lactose, chitobiose/lichenan, ascorbate, galactitol, mannitol, fructose, and a sensory system with similarity to the bacterial bgl system. Probab=40.20 E-value=24 Score=14.94 Aligned_cols=35 Identities=26% Similarity=0.251 Sum_probs=24.3 Q ss_pred CEEEEEECCHHHHHHHHHHHH----CCCEEEEEEEECCC Q ss_conf 489998096569999999997----49839999976778 Q gi|254781031|r 10 SALLLFSGGQDSSTCLSWALD----RFDRVETLSFDYGQ 44 (240) Q Consensus 10 ~av~l~SGGlDS~~~l~~l~~----~~~~v~~i~~~ygq 44 (240) |.+++.|||+-|+.++.-+.+ +|.++..--+.||+ T Consensus 2 kVLl~Ca~GmSSs~la~~m~k~A~~~gi~~~i~A~~~~~ 40 (99) T cd05565 2 NVLVLCAGGGTSGLLANALNKGAKERGVPLEAAAGAYGS 40 (99) T ss_pred EEEEEECCCCCHHHHHHHHHHHHHHCCCCEEEEEEECHH T ss_conf 099990798858999999999999819976999510344 No 137 >KOG1315 consensus Probab=39.43 E-value=12 Score=16.90 Aligned_cols=21 Identities=29% Similarity=0.730 Sum_probs=13.6 Q ss_pred CCCCCCCCCCCCCCCCCCCCHHHHHHH Q ss_conf 162888655556688998687789999 Q gi|254781031|r 202 TCYLGKRDKRYEWGYGCNSCPACYLRQ 228 (240) Q Consensus 202 SC~~~~~~~~~~~~~hCG~C~~C~~R~ 228 (240) .|++|.+. .||..|.-|+++- T Consensus 116 ~~iKPdRa------HHCsvC~rCvLKm 136 (307) T KOG1315 116 KCIKPDRA------HHCSVCNRCVLKM 136 (307) T ss_pred CCCCCCCC------CCCHHHHHHHHCC T ss_conf 65268856------6650645320035 No 138 >TIGR00725 TIGR00725 conserved hypothetical protein TIGR00725; InterPro: IPR005268 This family of conserved hypothetical proteins has no known function. . Probab=39.27 E-value=14 Score=16.36 Aligned_cols=37 Identities=24% Similarity=0.316 Sum_probs=21.1 Q ss_pred HHHCCCCCEEEEEECCHHHHH--HHHHHHHCCCEEEEEE Q ss_conf 221359948999809656999--9999997498399999 Q gi|254781031|r 3 DIIKKAPSALLLFSGGQDSST--CLSWALDRFDRVETLS 39 (240) Q Consensus 3 ~~~~~~~~av~l~SGGlDS~~--~l~~l~~~~~~v~~i~ 39 (240) ++.|+.+.-++|++||+|=.- ..--++++|=-|+.|- T Consensus 28 ~~AK~gPsd~~l~~GGr~GvME~~s~G~~~~GGlVVGIL 66 (176) T TIGR00725 28 ELAKKGPSDHILINGGRGGVMEAVSKGAREKGGLVVGIL 66 (176) T ss_pred HHHHHCCCCEEEEECCCCCEEEEEEECEECCCCEEEEEC T ss_conf 998619997899947998445424522010678799980 No 139 >cd01522 RHOD_1 Member of the Rhodanese Homology Domain superfamily, subgroup 1. This CD includes the putative rhodanese-related sulfurtransferases of several uncharacterized proteins. Probab=39.25 E-value=25 Score=14.85 Aligned_cols=14 Identities=7% Similarity=-0.235 Sum_probs=5.4 Q ss_pred HHHHHHCCCEEEEE Q ss_conf 99999749839999 Q gi|254781031|r 25 LSWALDRFDRVETL 38 (240) Q Consensus 25 l~~l~~~~~~v~~i 38 (240) |+.+++++-+.++| T Consensus 6 A~e~L~~~~~~~lI 19 (117) T cd01522 6 AWALLQADPQAVLV 19 (117) T ss_pred HHHHHHHCCCEEEE T ss_conf 99999849983999 No 140 >KOG0316 consensus Probab=38.63 E-value=3.5 Score=20.16 Aligned_cols=30 Identities=23% Similarity=0.089 Sum_probs=21.3 Q ss_pred HCCCCCEEEEEECCHHHHHHHHHHHHCCCE Q ss_conf 135994899980965699999999974983 Q gi|254781031|r 5 IKKAPSALLLFSGGQDSSTCLSWALDRFDR 34 (240) Q Consensus 5 ~~~~~~av~l~SGGlDS~~~l~~l~~~~~~ 34 (240) .+.++.+-+++||++||++-+|.-..+..+ T Consensus 107 V~fNeesSVv~SgsfD~s~r~wDCRS~s~e 136 (307) T KOG0316 107 VRFNEESSVVASGSFDSSVRLWDCRSRSFE 136 (307) T ss_pred EEECCCCEEEEECCCCCEEEEEECCCCCCC T ss_conf 996575308995344641688872368987 No 141 >TIGR02108 PQQ_syn_pqqB coenzyme PQQ biosynthesis protein B; InterPro: IPR011842 This entry describes coenzyme PQQ biosynthesis protein B, a gene required for the biosynthesis of pyrrolo-quinoline-quinone (coenzyme PQQ). PQQ is required for some glucose dehydrogenases and alcohol dehydrogenases. Note that this gene appears to be required for PQQ in biosynthesis in Methylobacterium extorquens (under the name pqqG) and in Klebsiella pneumoniae but that the equivalent pqqV in Acinetobacter calcoaceticus is not necessary for heterologous expression of PQQ biosynthesis in Escherichia coli. Based on this latter finding, it is suggested that PqqB might be a transporter or a PQQ-dependent enzyme rather than a PQQ biosynthesis enzyme .; GO: 0006810 transport, 0018189 pyrroloquinoline quinone biosynthetic process. Probab=37.59 E-value=27 Score=14.69 Aligned_cols=22 Identities=23% Similarity=0.175 Sum_probs=20.3 Q ss_pred CEEEEEECCHHHHHHHHHHHHC Q ss_conf 4899980965699999999974 Q gi|254781031|r 10 SALLLFSGGQDSSTCLSWALDR 31 (240) Q Consensus 10 ~av~l~SGGlDS~~~l~~l~~~ 31 (240) .+|||.+|=+|-+++|..|.|. T Consensus 82 ~~V~Lt~g~iDH~tGLL~LREg 103 (314) T TIGR02108 82 EGVVLTDGEIDHVTGLLSLREG 103 (314) T ss_pred EEEEECCCCHHHHHHHHHHCCC T ss_conf 4788206622456412333037 No 142 >COG1600 Uncharacterized Fe-S protein [Energy production and conversion] Probab=37.55 E-value=13 Score=16.55 Aligned_cols=12 Identities=33% Similarity=0.772 Sum_probs=10.4 Q ss_pred CCCCCCHHHHHH Q ss_conf 899868778999 Q gi|254781031|r 216 YGCNSCPACYLR 227 (240) Q Consensus 216 ~hCG~C~~C~~R 227 (240) .|||.|..|++. T Consensus 185 ~~Cg~C~~Cida 196 (337) T COG1600 185 DHCGSCTRCLDA 196 (337) T ss_pred CCCHHHHHHHHH T ss_conf 757216899863 No 143 >PRK00162 glpE thiosulfate sulfurtransferase; Validated Probab=37.04 E-value=27 Score=14.64 Aligned_cols=18 Identities=17% Similarity=0.161 Sum_probs=5.8 Q ss_pred EECCHHHHHHHHHHHHCC Q ss_conf 809656999999999749 Q gi|254781031|r 15 FSGGQDSSTCLSWALDRF 32 (240) Q Consensus 15 ~SGGlDS~~~l~~l~~~~ 32 (240) .-+|..|..++..|.+.| T Consensus 65 C~~G~rS~~aa~~L~~~G 82 (104) T PRK00162 65 CYHGNSSQGAAQYLLQQG 82 (104) T ss_pred ECCCHHHHHHHHHHHHCC T ss_conf 598818999999999737 No 144 >PTZ00200 cysteine proteinase precursor; Provisional Probab=35.90 E-value=19 Score=15.65 Aligned_cols=12 Identities=42% Similarity=0.775 Sum_probs=7.6 Q ss_pred CCCCCCCCCCHH Q ss_conf 566889986877 Q gi|254781031|r 212 YEWGYGCNSCPA 223 (240) Q Consensus 212 ~~~~~hCG~C~~ 223 (240) .+++.+||-||+ T Consensus 250 KdQG~~CGSCWA 261 (448) T PTZ00200 250 KDQGLNCGSCWA 261 (448) T ss_pred CCCCCCCCHHHH T ss_conf 338999747799 No 145 >TIGR00888 guaA_Nterm GMP synthase, N-terminal domain; InterPro: IPR004739 The N-terminal region of GMP synthase 6.3.5.2 from EC is recognised by this signature. GMP synthase catalyzes the synthesis of GMP from XMP , .ATP + xanthosine 5'-phosphate + L-glutamine + H(2)O = AMP + diphosphate + GMP + L-glutamate ; GO: 0003922 GMP synthase (glutamine-hydrolyzing) activity, 0005524 ATP binding, 0006177 GMP biosynthetic process. Probab=35.78 E-value=18 Score=15.77 Aligned_cols=19 Identities=37% Similarity=0.712 Sum_probs=14.2 Q ss_pred HHHCCCCCEEEEEECCHHHH Q ss_conf 22135994899980965699 Q gi|254781031|r 3 DIIKKAPSALLLFSGGQDSS 22 (240) Q Consensus 3 ~~~~~~~~av~l~SGGlDS~ 22 (240) ++-+..+|.| -||||.+|. T Consensus 36 eI~~~~P~Gi-ILSGGP~sv 54 (195) T TIGR00888 36 EIKEKNPKGI-ILSGGPASV 54 (195) T ss_pred HHHHCCCCEE-EEECCCCCC T ss_conf 9861298679-972788764 No 146 >TIGR03366 HpnZ_proposed putative phosphonate catabolism associated alcohol dehydrogenase. This clade of zinc-binding alcohol dehydrogenases (members of pfam00107) are repeatedly associated with genes proposed to be involved with the catabolism of phosphonate compounds. Probab=35.60 E-value=16 Score=16.01 Aligned_cols=32 Identities=19% Similarity=-0.009 Sum_probs=17.1 Q ss_pred CCCEEEEEECCHHHHHHHHHHHHCCC-EEEEEEE Q ss_conf 99489998096569999999997498-3999997 Q gi|254781031|r 8 APSALLLFSGGQDSSTCLSWALDRFD-RVETLSF 40 (240) Q Consensus 8 ~~~av~l~SGGlDS~~~l~~l~~~~~-~v~~i~~ 40 (240) .+++++.=.|++= +..+..++..|. +|+.+-. T Consensus 121 g~~V~V~G~G~iG-l~~~~~a~~~Ga~~Vi~~d~ 153 (280) T TIGR03366 121 GRRVLVVGAGMLG-LTAAAAAAAAGAARVVAADP 153 (280) T ss_pred CCEEEEEECCHHH-HHHHHHHHHCCCCEEEEEEC T ss_conf 9989999078689-99999999849987999919 No 147 >PRK13685 hypothetical protein; Provisional Probab=34.81 E-value=30 Score=14.42 Aligned_cols=14 Identities=7% Similarity=-0.163 Sum_probs=5.4 Q ss_pred CCCHHHHHHHHHHH Q ss_conf 89889999999982 Q gi|254781031|r 174 WLKKYETWKLAQDI 187 (240) Q Consensus 174 ~~tK~eI~~~~~~~ 187 (240) -.+..+.-+...++ T Consensus 273 A~d~~~L~~Iy~~i 286 (326) T PRK13685 273 AASLEELRAVYATL 286 (326) T ss_pred CCCHHHHHHHHHHH T ss_conf 19999999999996 No 148 >PRK06305 DNA polymerase III subunits gamma and tau; Validated Probab=34.02 E-value=30 Score=14.34 Aligned_cols=32 Identities=19% Similarity=0.097 Sum_probs=12.0 Q ss_pred HHHCCCEEEEEEEECCCCCCCCHHHHHHHHHHH Q ss_conf 997498399999767788777279999999999 Q gi|254781031|r 28 ALDRFDRVETLSFDYGQRNKVELECRLCVRKKI 60 (240) Q Consensus 28 l~~~~~~v~~i~~~ygq~~~~~~~~a~~~~~~~ 60 (240) +.+.+.-.|+.-| .|.+--+..-.|...++.+ T Consensus 32 ai~~~ri~HAyLF-~GprGtGKTT~ArilAkaL 63 (462) T PRK06305 32 ALRFNRAAHAYLF-SGIRGTGKTTLARIFAKAL 63 (462) T ss_pred HHHCCCCCCEEEE-ECCCCCCHHHHHHHHHHHH T ss_conf 9984997623430-3899859999999999996 No 149 >KOG0272 consensus Probab=33.64 E-value=14 Score=16.49 Aligned_cols=17 Identities=24% Similarity=0.022 Sum_probs=8.9 Q ss_pred EEECCHHHHHHHHHHHH Q ss_conf 98096569999999997 Q gi|254781031|r 14 LFSGGQDSSTCLSWALD 30 (240) Q Consensus 14 l~SGGlDS~~~l~~l~~ 30 (240) ..|||+||+.-.|++.. T Consensus 318 ~~tGGlD~~~RvWDlRt 334 (459) T KOG0272 318 AATGGLDSLGRVWDLRT 334 (459) T ss_pred EECCCCCCHHHEEECCC T ss_conf 65267631010022156 No 150 >COG2210 Peroxiredoxin family protein [General function prediction only] Probab=33.05 E-value=32 Score=14.24 Aligned_cols=37 Identities=16% Similarity=0.053 Sum_probs=27.0 Q ss_pred CCCCEEEEEECCHHHHHHHHHHH----HCCCEEEEEEEECC Q ss_conf 59948999809656999999999----74983999997677 Q gi|254781031|r 7 KAPSALLLFSGGQDSSTCLSWAL----DRFDRVETLSFDYG 43 (240) Q Consensus 7 ~~~~av~l~SGGlDS~~~l~~l~----~~~~~v~~i~~~yg 43 (240) ..|-+++++||=+|=...+..+. ..|++|.-.+..+| T Consensus 2 ~~k~~IIl~SG~~dk~~~a~iias~A~A~G~EV~VF~TfwG 42 (137) T COG2210 2 DKKLGIILASGTLDKAYAALIIASGAAAMGYEVTVFFTFWG 42 (137) T ss_pred CCEEEEEEECCCHHHHHHHHHHHHHHHHCCCEEEEEEEHHH T ss_conf 84579999579788999999999999976975899986788 No 151 >KOG1372 consensus Probab=32.65 E-value=32 Score=14.20 Aligned_cols=36 Identities=25% Similarity=0.303 Sum_probs=27.6 Q ss_pred HHCCCCCEEEEEEC--CHHHHHHHHHHHHCCCEEEEEE Q ss_conf 21359948999809--6569999999997498399999 Q gi|254781031|r 4 IIKKAPSALLLFSG--GQDSSTCLSWALDRFDRVETLS 39 (240) Q Consensus 4 ~~~~~~~av~l~SG--GlDS~~~l~~l~~~~~~v~~i~ 39 (240) +-....+-|.|..| |-|-+-++-.|+++||+||.|- T Consensus 22 ~~~~r~rkvALITGItGQDGSYLaEfLL~KgYeVHGii 59 (376) T KOG1372 22 LGAFRPRKVALITGITGQDGSYLAEFLLSKGYEVHGII 59 (376) T ss_pred HCCCCCCEEEEEECCCCCCCHHHHHHHHCCCCEEEEEE T ss_conf 05766541799962368872699999870885676788 No 152 >KOG0823 consensus Probab=31.63 E-value=17 Score=15.85 Aligned_cols=22 Identities=23% Similarity=0.218 Sum_probs=9.5 Q ss_pred CCCEEEEEECCHHHHHHHHHHH Q ss_conf 9948999809656999999999 Q gi|254781031|r 8 APSALLLFSGGQDSSTCLSWAL 29 (240) Q Consensus 8 ~~~av~l~SGGlDS~~~l~~l~ 29 (240) .+..||-+=|=|===-++|.++ T Consensus 57 akdPVvTlCGHLFCWpClyqWl 78 (230) T KOG0823 57 AKDPVVTLCGHLFCWPCLYQWL 78 (230) T ss_pred CCCCEEEECCCCEEHHHHHHHH T ss_conf 5787774145430348888877 No 153 >pfam01799 Fer2_2 [2Fe-2S] binding domain. Probab=30.91 E-value=19 Score=15.65 Aligned_cols=12 Identities=17% Similarity=0.335 Sum_probs=4.3 Q ss_pred CCHHHHHHHHHH Q ss_conf 430247777767 Q gi|254781031|r 146 CRHDTIRAIETA 157 (240) Q Consensus 146 ~~~~~~~~~~~~ 157 (240) ++|.+......+ T Consensus 29 CtPG~ims~~~l 40 (75) T pfam01799 29 CTPGMVMAATAL 40 (75) T ss_pred CCHHHHHHHHHH T ss_conf 750499999999 No 154 >KOG1312 consensus Probab=30.53 E-value=19 Score=15.60 Aligned_cols=14 Identities=29% Similarity=0.669 Sum_probs=12.0 Q ss_pred CCCCCCCHHHHHHH Q ss_conf 88998687789999 Q gi|254781031|r 215 GYGCNSCPACYLRQ 228 (240) Q Consensus 215 ~~hCG~C~~C~~R~ 228 (240) ..||-.|-.|.+|= T Consensus 162 SKHCsiCNrCV~rf 175 (341) T KOG1312 162 SKHCSICNRCVHRF 175 (341) T ss_pred CCCCHHHHHHHHHH T ss_conf 34133888999874 No 155 >PRK09111 DNA polymerase III subunits gamma and tau; Validated Probab=30.29 E-value=35 Score=13.95 Aligned_cols=16 Identities=19% Similarity=0.325 Sum_probs=8.9 Q ss_pred EECCHHHHHHHHHHHH Q ss_conf 8096569999999997 Q gi|254781031|r 15 FSGGQDSSTCLSWALD 30 (240) Q Consensus 15 ~SGGlDS~~~l~~l~~ 30 (240) +..|+|-...+-.|++ T Consensus 284 ~~~G~Dp~~iL~dLLe 299 (600) T PRK09111 284 YDAGADPVVVLRDLAE 299 (600) T ss_pred HHHCCCHHHHHHHHHH T ss_conf 9859899999999999 No 156 >KOG0270 consensus Probab=30.23 E-value=4.3 Score=19.64 Aligned_cols=19 Identities=26% Similarity=0.118 Sum_probs=16.1 Q ss_pred EEEECCHHHHHHHHHHHHC Q ss_conf 9980965699999999974 Q gi|254781031|r 13 LLFSGGQDSSTCLSWALDR 31 (240) Q Consensus 13 ~l~SGGlDS~~~l~~l~~~ 31 (240) +|.|||.|+++.+|.+-.. T Consensus 258 VLaSgsaD~TV~lWD~~~g 276 (463) T KOG0270 258 VLASGSADKTVKLWDVDTG 276 (463) T ss_pred EEEECCCCCEEEEEECCCC T ss_conf 6773478743899876778 No 157 >TIGR02518 EutH_ACDH acetaldehyde dehydrogenase (acetylating); InterPro: IPR013357 Aldehyde dehydrogenases (1.2.1.3 from EC and 1.2.1.5 from EC) are enzymes which oxidize a wide variety of aliphatic and aromatic aldehydes using NAD(P) as a cofactor. In eukaryotes, at least twenty distinct aldehyde dehydrogenase families have been classified . Many aldehyde dehydrogenases have also been found in prokaryotic species. A glutamic acid and a cysteine residue have been implicated in the catalytic activity of mammalian aldehyde dehydrogenase, though these residues are not necessarily conserved in all the enzymes of this family. The acetaldehyde dehydrogenase family (1.2.1.10 from EC) of bacterial enzymes catalyse the formation of acetyl-CoA from acetaldehyde in the degradation of phenols, cresols and catechol . Proteins in this entry occur in bacterial species and are annotated as probable acetaldehyde dehydrogenases, though this function has not been experimentally verified.. Probab=30.03 E-value=31 Score=14.26 Aligned_cols=20 Identities=30% Similarity=0.675 Sum_probs=17.0 Q ss_pred CHHHCCCCCEEEEEECCHHH Q ss_conf 12213599489998096569 Q gi|254781031|r 2 NDIIKKAPSALLLFSGGQDS 21 (240) Q Consensus 2 ~~~~~~~~~av~l~SGGlDS 21 (240) |+|||.++.+|+|=.||-== T Consensus 180 neLMK~kdts~IlATGG~aM 199 (528) T TIGR02518 180 NELMKNKDTSLILATGGEAM 199 (528) T ss_pred HHHHCCCCEEEEEECCCHHH T ss_conf 55402468068996077025 No 158 >TIGR01488 HAD-SF-IB HAD-superfamily hydrolase, subfamily IB (PSPase-like); InterPro: IPR006383 This group represents a subfamily of the haloacid dehalogenase superfamily of aspartate-nucleophile hydrolases. Subfamily IA, B, C and D are distinguished from the rest of the superfamily by the presence of a variable domain between the first and second conserved catalytic motifs. In subfamilies IA and IB, this domain consists of an alpha-helical bundle. It was necessary to model these two subfamilies separately, breaking them at an apparent phylogenetic bifurcation, so that the resulting model(s) are not so broadly defined that members of subfamily III (which lack the variable domain) are included. ; GO: 0016791 phosphoric monoester hydrolase activity, 0008152 metabolic process. Probab=29.83 E-value=36 Score=13.90 Aligned_cols=12 Identities=17% Similarity=-0.155 Sum_probs=5.7 Q ss_pred ECCCCHHHHHHH Q ss_conf 138988999999 Q gi|254781031|r 172 LMWLKKYETWKL 183 (240) Q Consensus 172 ~~~~tK~eI~~~ 183 (240) -.+..|..+++. T Consensus 161 ~~~~~K~~~~~~ 172 (204) T TIGR01488 161 VEGEEKGKVLKE 172 (204) T ss_pred CCCCCHHHHHHH T ss_conf 552203899999 No 159 >PRK11433 putative xanthine dehydrogenase iron-sulfur-binding subunit; Provisional Probab=29.74 E-value=24 Score=14.95 Aligned_cols=12 Identities=25% Similarity=0.880 Sum_probs=8.2 Q ss_pred CCCCCCCHHHHH Q ss_conf 889986877899 Q gi|254781031|r 215 GYGCNSCPACYL 226 (240) Q Consensus 215 ~~hCG~C~~C~~ 226 (240) ..|||+|.|=+. T Consensus 144 a~QCGyCTPG~i 155 (217) T PRK11433 144 GFQCGYCTPGQI 155 (217) T ss_pred CCCCCCCHHHHH T ss_conf 997988848999 No 160 >PRK09908 xanthine dehydrogenase subunit XdhC; Provisional Probab=29.57 E-value=23 Score=15.06 Aligned_cols=18 Identities=11% Similarity=0.388 Sum_probs=9.5 Q ss_pred CCCCCCCHHHHHH-HHHHH Q ss_conf 8899868778999-99999 Q gi|254781031|r 215 GYGCNSCPACYLR-QKGWM 232 (240) Q Consensus 215 ~~hCG~C~~C~~R-~~af~ 232 (240) ..|||+|.|=+.= -+|++ T Consensus 100 a~QCGyCtPG~im~~~alL 118 (159) T PRK09908 100 AVQCGFCTPGLIMATTAML 118 (159) T ss_pred CCCCCCCHHHHHHHHHHHH T ss_conf 9808897389999999999 No 161 >PRK09376 rho transcription termination factor Rho; Provisional Probab=29.09 E-value=20 Score=15.51 Aligned_cols=19 Identities=11% Similarity=0.039 Sum_probs=7.3 Q ss_pred CCCCCHHHHHHHHHHHHHH Q ss_conf 9986877899999999730 Q gi|254781031|r 217 GCNSCPACYLRQKGWMEFK 235 (240) Q Consensus 217 hCG~C~~C~~R~~af~~~~ 235 (240) |+..-.-+++|-.-+.|.| T Consensus 237 H~~vae~~lerAkRlvE~G 255 (416) T PRK09376 237 HVQVAEMVIEKAKRLVEHG 255 (416) T ss_pred HHHHHHHHHHHHHHHHHCC T ss_conf 9999999999999998769 No 162 >KOG0509 consensus Probab=28.02 E-value=36 Score=13.86 Aligned_cols=57 Identities=21% Similarity=0.176 Sum_probs=29.2 Q ss_pred CCEEEEEEECCCCHHHHHHHHHHHCCCHHCCCCHHHCCCCCCCCCCCCCCCCCCCCCCHHHHHHH Q ss_conf 32167753138988999999998282000143255372162888655556688998687789999 Q gi|254781031|r 164 SHVTVHTPLMWLKKYETWKLAQDIGGQDLVNLILEESHTCYLGKRDKRYEWGYGCNSCPACYLRQ 228 (240) Q Consensus 164 ~~~~i~~P~~~~tK~eI~~~~~~~~~~~~~~~~~~~T~SC~~~~~~~~~~~~~hCG~C~~C~~R~ 228 (240) .+..+.-+.-...+..|.++. +.+..+... -|=.|-.+.+|.+ ..||+.|-.|+.|= T Consensus 391 sDPg~i~~~~~~~~~tIs~l~-d~gkf~~en-~FC~~clirKp~r------Skhc~vcnrcVarf 447 (600) T KOG0509 391 SDPGFIPTSTEVGRETISQLI-DFGKFDLEN-RFCLTCLIRKPLR------SKHCSVCNRCVARF 447 (600) T ss_pred CCCCCCCCCHHHHHHHHHHHH-CCCCCCCCC-CCEEEEEEECCCC------CCHHHHHHHHHHCC T ss_conf 588987774236788999765-005323323-4120566314774------30415667777422 No 163 >TIGR03198 pucE xanthine dehydrogenase E subunit. This gene has been characterized in B. subtilis as the Iron-sulfur cluster binding-subunit of xanthine dehydrogenase (pucE), acting in conjunction with pucC, the FAD-binding subunit and pucD, the molybdopterin binding subunit. The more common XDH complex (GenProp0640) includes the xdhA gene as the Fe-S cluster binding component. Probab=28.00 E-value=26 Score=14.74 Aligned_cols=12 Identities=25% Similarity=0.894 Sum_probs=7.8 Q ss_pred CCCCCCCHHHHH Q ss_conf 889986877899 Q gi|254781031|r 215 GYGCNSCPACYL 226 (240) Q Consensus 215 ~~hCG~C~~C~~ 226 (240) ..|||+|.|=+. T Consensus 95 a~QCGyCtPG~v 106 (151) T TIGR03198 95 GFQCGYCTPGMV 106 (151) T ss_pred CCCCCCCHHHHH T ss_conf 983898738999 No 164 >cd04193 UDPGlcNAc_PPase UDPGlcNAc pyrophosphorylase catalayzes the synthesis of UDPGlcNAc. UDP-N-acetylglucosamine (UDPGlcNAc) pyrophosphorylase (UAP) (also named GlcNAc1P uridyltransferase), catalyzes the reversible conversion of UTP and GlcNAc1 to PPi and UDPGlcNAc. UDP-N-acetylglucosamine (UDPGlcNAc), the activated form of GlcNAc, is a key precursor of N- and O-linked glycosylations. It is essential for the synthesis of chitin (a major component of the fungal cell wall) and of the glycosylphosphatidylinositol (GPI) linker which anchors a variety of cell surface proteins to the plasma membrane. In bacteria, UDPGlcNAc represents an essential precursor for both peptidoglycan and lipopolysaccharide biosynthesis. Human UAP has two isoforms, resulting from alternative splicing of a single gene and differing by the presence or absence of 17 amino acids. UDPGlcNAc pyrophosphorylase shares significant sequence and structure conservation with UDPglucose pyrophosphorylase. Probab=27.59 E-value=35 Score=13.96 Aligned_cols=21 Identities=33% Similarity=0.551 Sum_probs=17.9 Q ss_pred HHHCCCCCEEEEEECCHHHHH Q ss_conf 221359948999809656999 Q gi|254781031|r 3 DIIKKAPSALLLFSGGQDSST 23 (240) Q Consensus 3 ~~~~~~~~av~l~SGGlDS~~ 23 (240) +.+++.|-||+++.||+=+.. T Consensus 9 ~~i~~gkvavv~laGG~GTrl 29 (323) T cd04193 9 KAIAEGKVAVLLLAGGQGTRL 29 (323) T ss_pred HHHHCCCEEEEEECCCCCCCC T ss_conf 999749989999578874556 No 165 >cd01399 GlcN6P_deaminase GlcN6P_deaminase: Glucosamine-6-phosphate (GlcN6P) deaminase subfamily; GlcN6P deaminase catalyzes the reversible conversion of GlcN6P to D-fructose-6-phosphate (Fru6P) and ammonium. The reaction is an aldo-keto isomerization coupled with an amination or deamination. It is the last step of the metabolic pathway of N-acetyl-D-glucosamine-6-phosphate (GlcNAc6P). GlcN6P deaminase is a hexameric enzyme that is allosterically activated by GlcNAc6P. Probab=27.53 E-value=33 Score=14.15 Aligned_cols=40 Identities=23% Similarity=0.346 Sum_probs=25.6 Q ss_pred CHHHCCCCCEEEEEECCHHHHHHHHHHH-H---C-C---CEEEEEEEEC Q ss_conf 1221359948999809656999999999-7---4-9---8399999767 Q gi|254781031|r 2 NDIIKKAPSALLLFSGGQDSSTCLSWAL-D---R-F---DRVETLSFDY 42 (240) Q Consensus 2 ~~~~~~~~~av~l~SGGlDS~~~l~~l~-~---~-~---~~v~~i~~~y 42 (240) .+.+++++++++.+|||-- ...+|..+ + + . .+++..+.|- T Consensus 12 ~~~i~~~~~~~ialsGG~t-P~~~y~~L~~~~~~~~i~w~~v~~f~~DE 59 (232) T cd01399 12 AELIREKPPAVLGLATGST-PLGVYEELIELHKEGGLSFSNVTTFNLDE 59 (232) T ss_pred HHHHHHCCCEEEEECCCCC-HHHHHHHHHHHHHHCCCCHHHEEEEECCC T ss_conf 9999977986999899878-99999999998863499757938992742 No 166 >TIGR02057 PAPS_reductase phosphoadenosine phosphosulfate reductase; InterPro: IPR011800 Requiring thioredoxin as an electron donor, phosphoadenosine phosphosulphate reductase catalyzes the reduction of 3'-phosphoadenylylsulphate (PAPS) to sulphite and phospho-adenosine-phosphate (PAP). Found in enterobacteria, cyanobacteria, and yeast, PAPS reductase is related to a group of plant (IPR004508 from INTERPRO) and bacterial (IPR011798 from INTERPRO) enzymes preferring 5'-adenylylsulphate (APS) over PAPS as a substrate for reduction to sulphite.; GO: 0004604 phosphoadenylyl-sulfate reductase (thioredoxin) activity, 0019344 cysteine biosynthetic process. Probab=26.25 E-value=41 Score=13.51 Aligned_cols=192 Identities=9% Similarity=-0.085 Sum_probs=85.6 Q ss_pred CEEEEEECCHHHHHHHHHHHHCC----CEEEEEEEECCCCCCCCHHHHHHHHHHHH-HHHHHHHHCCCCCEEEECHHHCC Q ss_conf 48999809656999999999749----83999997677887772799999999999-99888752035615875011032 Q gi|254781031|r 10 SALLLFSGGQDSSTCLSWALDRF----DRVETLSFDYGQRNKVELECRLCVRKKIV-ELMPKWKDSLGEDHILPLAILGD 84 (240) Q Consensus 10 ~av~l~SGGlDS~~~l~~l~~~~----~~v~~i~~~ygq~~~~~~~~a~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 84 (240) .-|..=|=|+|.+|.+.++-+-- ..|..|++|.|.--.-+.+....+.+++. .....+.-+......-....... T Consensus 29 ~Lv~tsaFg~~~lV~~hll~~~~keky~~~pvif~DTly~F~~Tl~l~~~~~~~y~qPkn~~L~~~~~~~~~s~a~~~Ak 108 (239) T TIGR02057 29 ELVQTSAFGIQALVILHLLSSILKEKYPDIPVIFIDTLYLFPQTLELVDELTEKYKQPKNLDLNLYKYDDVESEAAFEAK 108 (239) T ss_pred CEEECCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECCCCCCHHHHHHHHHHHHHHCCCCCCCCEEEECCCCCCHHHHHHH T ss_conf 74530102047899999999987621799867862111362789999999999862765563303316888878989744 Q ss_pred CCCCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCC Q ss_conf 24443322201100012331011111112233377777743399365114544443455544302477777677530643 Q gi|254781031|r 85 ISHSSLTKNVAMKIQDNNLPNTFVPGRNIIFLVFAATLAYRLGITNIVIGVCETDYSGYPDCRHDTIRAIETAINLGMES 164 (240) Q Consensus 85 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 164 (240) +....-.. +..... -+-..+..-.+.-+ --.+++.+.++|..++.+ .+|.+-+.+.. ....+ T Consensus 109 YG~~l~~K--D~~~~~-----~Y~~~~KvEP~~RA---lkEL~~~Aw~tG~Rr~Q~-~~R~nL~v~~~-------D~~~G 170 (239) T TIGR02057 109 YGKLLWQK--DVEKLE-----KYDDIAKVEPMQRA---LKELNAVAWFTGRRRDQA-SARANLPVLEL-------DLQNG 170 (239) T ss_pred HHHHHHCC--CHHHHH-----HHCCCCCCCHHHHH---HHHHHHHHHHHCCCHHHH-CCCCCCCCEEE-------ECCCC T ss_conf 20465405--722233-----21343443623355---688889998742554342-02236861000-------01478 Q ss_pred CEEEEEEECCCCHHHHHHHHHHHCCCHHCCCC-----HHHCCCC---CCC---C-CCCCCC-CCCCCCC Q ss_conf 21677531389889999999982820001432-----5537216---288---8-655556-6889986 Q gi|254781031|r 165 HVTVHTPLMWLKKYETWKLAQDIGGQDLVNLI-----LEESHTC---YLG---K-RDKRYE-WGYGCNS 220 (240) Q Consensus 165 ~~~i~~P~~~~tK~eI~~~~~~~~~~~~~~~~-----~~~T~SC---~~~---~-~~~~~~-~~~hCG~ 220 (240) +--++|++++|..+|.+...+.+...-.-.+ .+.++|- ..+ + ..++.+ .++-||. T Consensus 171 -~lKv~Pl~~W~~~~~~~Y~~~h~v~Y~pL~D~GY~SiGD~H~t~k~~~G~~e~r~gRf~G~~K~ECGl 238 (239) T TIGR02057 171 -ILKVNPLIDWTRKQVYQYLAAHKVPYNPLFDQGYLSIGDEHSTRKVKEGEDERRAGRFKGKLKTECGL 238 (239) T ss_pred -CCEECCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCCC T ss_conf -10320210265022799998678846842227866507757888877763122213532787663578 No 167 >COG0607 PspE Rhodanese-related sulfurtransferase [Inorganic ion transport and metabolism] Probab=26.15 E-value=42 Score=13.50 Aligned_cols=27 Identities=7% Similarity=-0.113 Sum_probs=13.1 Q ss_pred CCCEEEEEECCHHHHHHHHHHHHCCCE Q ss_conf 994899980965699999999974983 Q gi|254781031|r 8 APSALLLFSGGQDSSTCLSWALDRFDR 34 (240) Q Consensus 8 ~~~av~l~SGGlDS~~~l~~l~~~~~~ 34 (240) .+..|+..+.|.-|...+..|++.|++ T Consensus 61 ~~~ivv~C~~G~rS~~aa~~L~~~G~~ 87 (110) T COG0607 61 DDPIVVYCASGVRSAAAAAALKLAGFT 87 (110) T ss_pred CCEEEEECCCCCCHHHHHHHHHHCCCC T ss_conf 986999999975589999999986997 No 168 >COG3349 Uncharacterized conserved protein [Function unknown] Probab=25.74 E-value=42 Score=13.45 Aligned_cols=29 Identities=10% Similarity=-0.089 Sum_probs=25.2 Q ss_pred EEEEECCHHHHHHHHHHHHCCCEEEEEEE Q ss_conf 99980965699999999974983999997 Q gi|254781031|r 12 LLLFSGGQDSSTCLSWALDRFDRVETLSF 40 (240) Q Consensus 12 v~l~SGGlDS~~~l~~l~~~~~~v~~i~~ 40 (240) |+..-||+=++++|+.|.+.|++|...-- T Consensus 3 Vai~GaG~AgL~~a~~La~~g~~vt~~ea 31 (485) T COG3349 3 VAIAGAGLAGLAAAYELADAGYDVTLYEA 31 (485) T ss_pred EEEECCCHHHHHHHHHHHHCCCCEEEEEC T ss_conf 89974627888999999868985399842 No 169 >TIGR03193 4hydroxCoAred 4-hydroxybenzoyl-CoA reductase, gamma subunit. 4-hydroxybenzoyl-CoA reductase converts 4-hydroxybenzoyl-CoA to benzoyl-CoA, a common intermediate in the degradation of aromatic compounds. This protein family represents the gamma chain of this three-subunit enzyme. Probab=25.46 E-value=29 Score=14.48 Aligned_cols=12 Identities=25% Similarity=0.703 Sum_probs=7.1 Q ss_pred CCCCCCCHHHHH Q ss_conf 889986877899 Q gi|254781031|r 215 GYGCNSCPACYL 226 (240) Q Consensus 215 ~~hCG~C~~C~~ 226 (240) ..|||+|.|=+. T Consensus 94 a~QCGfCtPG~v 105 (148) T TIGR03193 94 GTQCGFCTPGMI 105 (148) T ss_pred CCCCCCCCHHHH T ss_conf 987798844799 No 170 >TIGR02374 nitri_red_nirB nitrite reductase [NAD(P)H], large subunit; InterPro: IPR012744 This entry describes NirB, the large subunit of nitrite reductase [NAD(P)H] (the assimilatory nitrite reductase), which associates with NirD, the small subunit (IPR012748 from INTERPRO). In a few bacteria such as Klebsiella pneumoniae and in fungi, the two regions are fused.; GO: 0008942 nitrite reductase [NAD(P)H] activity, 0050660 FAD binding, 0050661 NADP binding, 0042128 nitrate assimilation. Probab=25.37 E-value=43 Score=13.41 Aligned_cols=47 Identities=15% Similarity=0.145 Sum_probs=22.5 Q ss_pred EEEECCCCHHHHHHHHHHHCCCHHCCCCHHHCCCCCCCCCCCCCCCCCCCCCCHHHHHHH Q ss_conf 753138988999999998282000143255372162888655556688998687789999 Q gi|254781031|r 169 HTPLMWLKKYETWKLAQDIGGQDLVNLILEESHTCYLGKRDKRYEWGYGCNSCPACYLRQ 228 (240) Q Consensus 169 ~~P~~~~tK~eI~~~~~~~~~~~~~~~~~~~T~SC~~~~~~~~~~~~~hCG~C~~C~~R~ 228 (240) +.-=-+.||++|..-..+-+....-++ -+| ...++-||-|.|=++-. T Consensus 430 IC~Cn~VtKG~l~~Ai~~~~c~~v~~~-----K~C--------T~A~~SCGGC~PLv~~~ 476 (813) T TIGR02374 430 ICSCNDVTKGALIAAIQEGSCSTVEEL-----KAC--------TKAGTSCGGCKPLVEQL 476 (813) T ss_pred CCCCCCCCHHHHHHHHHCCCCCCHHHH-----HHH--------CCCCCCCCCCCHHHHHH T ss_conf 126788577899999842789876998-----600--------26788998762368999 No 171 >TIGR02951 DMSO_dmsB dimethylsulfoxide reductase, chain B; InterPro: IPR014297 This family consists of the iron-sulphur subunit, or chain B, of the anaerobic dimethyl sulphoxide reductase enzyme. Chains A and B are catalytic, while chain C is a membrane anchor.. Probab=24.98 E-value=36 Score=13.87 Aligned_cols=30 Identities=17% Similarity=0.141 Sum_probs=21.5 Q ss_pred HHCCCCCCCCCCCC-CCCCCCCCCCHHHHHHHH Q ss_conf 53721628886555-566889986877899999 Q gi|254781031|r 198 EESHTCYLGKRDKR-YEWGYGCNSCPACYLRQK 229 (240) Q Consensus 198 ~~T~SC~~~~~~~~-~~~~~hCG~C~~C~~R~~ 229 (240) .-.||| |..+++ ..+..|=++|--|++|.+ T Consensus 103 yC~waC--PYgAPQy~~~~G~m~KCDgC~D~~~ 133 (162) T TIGR02951 103 YCVWAC--PYGAPQYDEQQGVMSKCDGCVDRVE 133 (162) T ss_pred CCCCCC--CCCCCCCCCCCCEEEECCCCHHHHH T ss_conf 456458--7887673334774442134367884 No 172 >pfam08240 ADH_N Alcohol dehydrogenase GroES-like domain. This is the catalytic domain of alcohol dehydrogenases. Many of them contain an inserted zinc binding domain. This domain has a GroES-like structure. Probab=24.90 E-value=33 Score=14.11 Aligned_cols=13 Identities=31% Similarity=0.802 Sum_probs=9.3 Q ss_pred CCCCCCHHHHHHH Q ss_conf 8998687789999 Q gi|254781031|r 216 YGCNSCPACYLRQ 228 (240) Q Consensus 216 ~hCG~C~~C~~R~ 228 (240) .+||.|..|..-+ T Consensus 64 ~~Cg~C~~C~~G~ 76 (108) T pfam08240 64 IPCGKCAACREGR 76 (108) T ss_pred CCCCCCHHHHCCC T ss_conf 4899897895909 No 173 >KOG4283 consensus Probab=23.91 E-value=6 Score=18.69 Aligned_cols=19 Identities=26% Similarity=0.349 Sum_probs=14.9 Q ss_pred EEEECCHHHHHHHHHHHHC Q ss_conf 9980965699999999974 Q gi|254781031|r 13 LLFSGGQDSSTCLSWALDR 31 (240) Q Consensus 13 ~l~SGGlDS~~~l~~l~~~ 31 (240) .|+|||-|+++.+|+|... T Consensus 58 ymlSGgadgsi~v~Dl~n~ 76 (397) T KOG4283 58 YMLSGGADGSIAVFDLQNA 76 (397) T ss_pred EEEECCCCCCEEEEEECCC T ss_conf 8860477762799970246 No 174 >cd01400 6PGL 6PGL: 6-Phosphogluconolactonase (6PGL) subfamily; 6PGL catalyzes the second step of the oxidative phase of the pentose phosphate pathway, the hydrolyzation of 6-phosphoglucono-1,5-lactone (delta form) to 6-phosphogluconate. 6PGL is thought to guard against the accumulation of the delta form of the lactone, which may be toxic through its reaction with endogenous cellular nucleophiles. Probab=23.69 E-value=43 Score=13.41 Aligned_cols=40 Identities=23% Similarity=0.092 Sum_probs=25.6 Q ss_pred CHHHCCCCCEEEEEECCHHHHHHHHH-HHHCC----CEEEEEEEEC Q ss_conf 12213599489998096569999999-99749----8399999767 Q gi|254781031|r 2 NDIIKKAPSALLLFSGGQDSSTCLSW-ALDRF----DRVETLSFDY 42 (240) Q Consensus 2 ~~~~~~~~~av~l~SGGlDS~~~l~~-l~~~~----~~v~~i~~~y 42 (240) ++.++.++++++.+|||-- ...+|. |.++. .+++....|- T Consensus 16 ~~~i~~~~~~~l~lsGGst-p~~~y~~L~~~~~ldw~~v~~~~~DE 60 (219) T cd01400 16 AAAIAKRGRFSLALSGGST-PKPLYELLAAAPALDWSKVHVFLGDE 60 (219) T ss_pred HHHHHHCCCEEEEECCCCC-HHHHHHHHHHCCCCCCHHEEEEEEEC T ss_conf 9999977988999848977-99999998554589903638996003 No 175 >TIGR02819 fdhA_non_GSH formaldehyde dehydrogenase, glutathione-independent; InterPro: IPR014184 Members of this entry represent a distinct clade within the larger family of zinc-dependent dehydrogenases of medium chain alcohols. This entry includes the so-called glutathione-dependent formaldehyde dehydrogenase. Members of this entry have a tightly bound NAD that can act as a true cofactor, rather than a co-substrate in dehydrogenase reactions and in dismutase reactions for some aldehydes. The name given to this family, however, is formaldehyde dehydrogenase, glutathione-independent.. Probab=23.56 E-value=18 Score=15.73 Aligned_cols=10 Identities=30% Similarity=1.002 Sum_probs=3.7 Q ss_pred CCCCHHHHHH Q ss_conf 9868778999 Q gi|254781031|r 218 CNSCPACYLR 227 (240) Q Consensus 218 CG~C~~C~~R 227 (240) ||+|..|.++ T Consensus 106 CGrC~~C~~~ 115 (403) T TIGR02819 106 CGRCRNCKEQ 115 (403) T ss_pred CCCCCCCCCC T ss_conf 5620011245 No 176 >KOG1311 consensus Probab=23.51 E-value=17 Score=15.88 Aligned_cols=15 Identities=33% Similarity=0.749 Sum_probs=11.0 Q ss_pred CCCCCCCCHHHHHHH Q ss_conf 688998687789999 Q gi|254781031|r 214 WGYGCNSCPACYLRQ 228 (240) Q Consensus 214 ~~~hCG~C~~C~~R~ 228 (240) ...||=.|--|++|. T Consensus 126 Rs~HCsvC~~CV~rf 140 (299) T KOG1311 126 RSSHCSVCNNCVLRF 140 (299) T ss_pred CCCCCCCCCCEECCC T ss_conf 775361014700787 No 177 >PRK10509 bacterioferritin-associated ferredoxin; Provisional Probab=23.42 E-value=27 Score=14.63 Aligned_cols=41 Identities=12% Similarity=0.178 Sum_probs=20.3 Q ss_pred CCCHHHHHHHHHHHCCCHHCCCCHHHCCCCCCCCCCCCCCCCCCCCCCHHHHHH Q ss_conf 898899999999828200014325537216288865555668899868778999 Q gi|254781031|r 174 WLKKYETWKLAQDIGGQDLVNLILEESHTCYLGKRDKRYEWGYGCNSCPACYLR 227 (240) Q Consensus 174 ~~tK~eI~~~~~~~~~~~~~~~~~~~T~SC~~~~~~~~~~~~~hCG~C~~C~~R 227 (240) ..|-.+|.+...+.+...+-++ ..+ -.-+.+||+|..+... T Consensus 8 gVTDk~Ir~AV~~~g~~S~r~l-----r~~--------l~vgsqCGKC~~~Ar~ 48 (64) T PRK10509 8 GVSDKKIRQAVRQFHPQSFQQL-----RKF--------IPVGNQCGKCIRAARE 48 (64) T ss_pred CCCHHHHHHHHHHCCCCCHHHH-----HHH--------CCCCCCCCHHHHHHHH T ss_conf 8458999999998297669999-----987--------4754656088999999 No 178 >TIGR01016 sucCoAbeta succinyl-CoA synthetase, beta subunit; InterPro: IPR005809 There are four different enzymes that share a similar catalytic mechanism which involves the phosphorylation by ATP (or GTP) of a specific histidine residue in the active site. These enzymes are: ATP citrate-lyase (4.1.3.8 from EC) , the primary enzyme responsible for the synthesis of cytosolic acetyl-CoA in many tissues, catalyzes the formation of acetyl-CoA and oxaloacetate from citrate and CoA with the concomitant hydrolysis of ATP to ADP and phosphate. ATP-citrate lyase is a tetramer of identical subunits; Succinyl-CoA ligase (GDP-forming) (6.2.1.4 from EC) is a mitochondrial enzyme that catalyzes the substrate level phosphorylation step of the tricarboxylic acid cycle: the formation of succinyl-CoA from succinate with a concomitant hydrolysis of GTP to GDP and phosphate. This enzyme is a dimer composed of an alpha and a beta subunits; Succinyl-CoA ligase (ADP-forming) (6.2.1.5 from EC) is a bacterial enzyme that during aerobic metabolism functions in the citric acid cycle, coupling the hydrolysis of succinyl-CoA to the synthesis of ATP. It can also function in the other direction for anabolic purposes. This enzyme is a tetramer composed of two alpha and two beta subunits; and Malate-CoA ligase (6.2.1.9 from EC) (malyl-CoA synthetase) , is a bacterial enzyme that forms malyl-CoA from malate and CoA with the concomitant hydrolysis of ATP to ADP and phosphate. Malate-CoA ligase is composed of two different subunits. This entry corresponds to a conserved region located in the first half of ATP citrate lyase and in the beta subunits of succinyl-CoA ligases and malate-CoA ligase. ; GO: 0003824 catalytic activity, 0008152 metabolic process. Probab=23.39 E-value=44 Score=13.36 Aligned_cols=25 Identities=4% Similarity=-0.102 Sum_probs=17.0 Q ss_pred CCCCCEE--EEEEECCCCHHHHHHHHH Q ss_conf 0643216--775313898899999999 Q gi|254781031|r 161 GMESHVT--VHTPLMWLKKYETWKLAQ 185 (240) Q Consensus 161 ~~~~~~~--i~~P~~~~tK~eI~~~~~ 185 (240) .+..+|+ ++|=|=+.+..+.|..+- T Consensus 306 LsD~~VKvvfiNIFGGI~RCD~vA~G~ 332 (389) T TIGR01016 306 LSDKSVKVVFINIFGGITRCDEVAKGL 332 (389) T ss_pred HCCCCCEEEEEEECCCEECHHHHHHHH T ss_conf 359982089997068600187887889 No 179 >KOG4533 consensus Probab=22.25 E-value=48 Score=13.11 Aligned_cols=18 Identities=44% Similarity=0.720 Sum_probs=12.7 Q ss_pred HHHCCCCCEEEEEECCHH Q ss_conf 221359948999809656 Q gi|254781031|r 3 DIIKKAPSALLLFSGGQD 20 (240) Q Consensus 3 ~~~~~~~~av~l~SGGlD 20 (240) |...+....++|||||-- T Consensus 127 d~~~~DdeslLlfsgg~t 144 (317) T KOG4533 127 DVPNFDDESLLLFSGGKT 144 (317) T ss_pred CCCCCCCCEEEEEECCCC T ss_conf 666778621355307631 No 180 >COG2080 CoxS Aerobic-type carbon monoxide dehydrogenase, small subunit CoxS/CutS homologs [Energy production and conversion] Probab=22.12 E-value=37 Score=13.82 Aligned_cols=10 Identities=20% Similarity=0.743 Sum_probs=4.5 Q ss_pred CCCCCCHHHH Q ss_conf 8998687789 Q gi|254781031|r 216 YGCNSCPACY 225 (240) Q Consensus 216 ~hCG~C~~C~ 225 (240) .+||+|.|=. T Consensus 98 ~QCGyCtpG~ 107 (156) T COG2080 98 FQCGYCTPGQ 107 (156) T ss_pred CCCCCCCHHH T ss_conf 7578884889 No 181 >TIGR03451 mycoS_dep_FDH mycothiol-dependent formaldehyde dehydrogenase. Members of this protein family are mycothiol-dependent formaldehyde dehydrogenase (EC 1.2.1.66). This protein is found, so far, only in the Actinobacteria (Mycobacterium sp., Streptomyces sp., Corynebacterium sp., and related species), where mycothione replaces glutathione. Probab=22.06 E-value=42 Score=13.47 Aligned_cols=33 Identities=12% Similarity=0.148 Sum_probs=16.1 Q ss_pred CCCCEEEEEECCHHHHHHHHHHHHCCC-EEEEEEE Q ss_conf 599489998096569999999997498-3999997 Q gi|254781031|r 7 KAPSALLLFSGGQDSSTCLSWALDRFD-RVETLSF 40 (240) Q Consensus 7 ~~~~av~l~SGGlDS~~~l~~l~~~~~-~v~~i~~ 40 (240) +.+++++.=.|++=. ..+.+++..|. +|+.+.. T Consensus 176 ~g~~VlV~GaG~iGl-~a~~~ak~~Ga~~Vi~~d~ 209 (358) T TIGR03451 176 RGDSVAVIGCGGVGD-AAIAGAALAGASKIIAVDI 209 (358) T ss_pred CCCEEEEECCCHHHH-HHHHHHHHCCCCEEEEEEC T ss_conf 998899967376999-9999999839918999919 No 182 >KOG1313 consensus Probab=21.81 E-value=34 Score=14.04 Aligned_cols=15 Identities=27% Similarity=0.691 Sum_probs=7.4 Q ss_pred CCCCCCCCCHHHHHH Q ss_conf 668899868778999 Q gi|254781031|r 213 EWGYGCNSCPACYLR 227 (240) Q Consensus 213 ~~~~hCG~C~~C~~R 227 (240) ..+.||-.|..|++| T Consensus 114 prTHHCsiC~kCVL~ 128 (309) T KOG1313 114 PRTHHCSICNKCVLK 128 (309) T ss_pred CCCCHHHHHHHHHHC T ss_conf 975310277667544 No 183 >cd06259 YdcF-like YdcF-like. YdcF-like is a large family of mainly bacterial proteins, with a few members found in fungi, plants, and archaea. Escherichia coli YdcF has been shown to bind S-adenosyl-L-methionine (AdoMet), but a biochemical function has not been idenitified. The family also includes Escherichia coli sanA and Salmonella typhimurium sfiX, which are involved in vancomycin resistance; sfiX may also be involved in murein synthesis. Probab=21.79 E-value=50 Score=12.98 Aligned_cols=31 Identities=10% Similarity=0.087 Sum_probs=19.1 Q ss_pred CEEEEEECCHH----------HHHHHHHHHHCCCEEEEEEE Q ss_conf 48999809656----------99999999974983999997 Q gi|254781031|r 10 SALLLFSGGQD----------SSTCLSWALDRFDRVETLSF 40 (240) Q Consensus 10 ~av~l~SGGlD----------S~~~l~~l~~~~~~v~~i~~ 40 (240) .++++|.||++ =.-.+..|.+++.....++. T Consensus 1 daIvVLG~~~~~~~~~~~~~~R~~~a~~L~~~~~~~~ii~s 41 (150) T cd06259 1 DAIVVLGGGVNGDGPSPILAERLDAAAELYRAGPAPKLIVS 41 (150) T ss_pred CEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEE T ss_conf 98998999879999897999999999999981999889995 No 184 >TIGR02963 xanthine_xdhA xanthine dehydrogenase, small subunit; InterPro: IPR014307 Members of this protein are the small subunit (or, in eukaryotes, the N-terminal domain) of xanthine dehydrogenase, an enzyme of purine catabolism via urate. The small subunit contains both an FAD and a 2Fe-2S cofactor. Aldehyde oxidase (retinal oxidase) appears to have arisen as a neofunctionalisation among xanthine dehydrogenases in eukaryotes. ; GO: 0004854 xanthine dehydrogenase activity, 0004855 xanthine oxidase activity, 0050660 FAD binding. Probab=21.73 E-value=32 Score=14.20 Aligned_cols=15 Identities=7% Similarity=0.058 Sum_probs=6.8 Q ss_pred EEEEEEECCCCHHHH Q ss_conf 167753138988999 Q gi|254781031|r 166 VTVHTPLMWLKKYET 180 (240) Q Consensus 166 ~~i~~P~~~~tK~eI 180 (240) +.++..|+.+-|.++ T Consensus 375 lpLEdfFi~Y~k~~~ 389 (515) T TIGR02963 375 LPLEDFFIDYGKQDR 389 (515) T ss_pred CHHHHHHHHHHHEEC T ss_conf 117777467202010 No 185 >COG5574 PEX10 RING-finger-containing E3 ubiquitin ligase [Posttranslational modification, protein turnover, chaperones] Probab=21.70 E-value=41 Score=13.55 Aligned_cols=17 Identities=24% Similarity=0.489 Sum_probs=12.7 Q ss_pred CCCCCCCCC--CHHHHHHH Q ss_conf 566889986--87789999 Q gi|254781031|r 212 YEWGYGCNS--CPACYLRQ 228 (240) Q Consensus 212 ~~~~~hCG~--C~~C~~R~ 228 (240) ...-++||. |++|+.=- T Consensus 227 ~ps~t~CgHlFC~~Cl~~~ 245 (271) T COG5574 227 VPSCTPCGHLFCLSCLLIS 245 (271) T ss_pred CCCCCCCCCHHHHHHHHHH T ss_conf 7310466313669888778 No 186 >COG1063 Tdh Threonine dehydrogenase and related Zn-dependent dehydrogenases [Amino acid transport and metabolism / General function prediction only] Probab=21.59 E-value=32 Score=14.20 Aligned_cols=28 Identities=11% Similarity=-0.005 Sum_probs=11.5 Q ss_pred EEEEEECCHHHHHHHHHHHHCCC-EEEEE Q ss_conf 89998096569999999997498-39999 Q gi|254781031|r 11 ALLLFSGGQDSSTCLSWALDRFD-RVETL 38 (240) Q Consensus 11 av~l~SGGlDS~~~l~~l~~~~~-~v~~i 38 (240) .|+.+=-|.==+..+.++.-.|. +|+.+ T Consensus 171 ~V~V~GaGpIGLla~~~a~~~Ga~~Viv~ 199 (350) T COG1063 171 TVVVVGAGPIGLLAIALAKLLGASVVIVV 199 (350) T ss_pred EEEEECCCHHHHHHHHHHHHCCCCEEEEE T ss_conf 89998888999999999987698279997 No 187 >PTZ00038 ferredoxin; Provisional Probab=21.46 E-value=51 Score=12.94 Aligned_cols=41 Identities=27% Similarity=0.400 Sum_probs=20.2 Q ss_pred CCEEEEEEECCC-----CHHHHHHHHHHHCCCHHCCCCHHHCCCCCCCCCCCCCCCCCCCCCCH Q ss_conf 321677531389-----88999999998282000143255372162888655556688998687 Q gi|254781031|r 164 SHVTVHTPLMWL-----KKYETWKLAQDIGGQDLVNLILEESHTCYLGKRDKRYEWGYGCNSCP 222 (240) Q Consensus 164 ~~~~i~~P~~~~-----tK~eI~~~~~~~~~~~~~~~~~~~T~SC~~~~~~~~~~~~~hCG~C~ 222 (240) ..|++.+|--.. .-.-|+..+++-| +++| +||..| .||.|. T Consensus 77 y~v~l~tp~g~~~ieC~eDE~ILDAAer~G----I~LP----ySCR~G----------~CgTCk 122 (172) T PTZ00038 77 YNVKLRTPEGEKVIECDEDEYILEAAERQG----VELP----YSCRGG----------SCSTCA 122 (172) T ss_pred EEEEEECCCCCEEEECCCCCHHHHHHHHCC----CCCC----CCCCCC----------CCCCCE T ss_conf 999998899977686899865999999849----9877----777786----------188881 No 188 >PRK09880 L-idonate 5-dehydrogenase; Provisional Probab=21.39 E-value=42 Score=13.47 Aligned_cols=32 Identities=6% Similarity=-0.098 Sum_probs=15.5 Q ss_pred CCCEEEEEECCHHHHHHHHHHHHCCC-EEEEEEE Q ss_conf 99489998096569999999997498-3999997 Q gi|254781031|r 8 APSALLLFSGGQDSSTCLSWALDRFD-RVETLSF 40 (240) Q Consensus 8 ~~~av~l~SGGlDS~~~l~~l~~~~~-~v~~i~~ 40 (240) .+++++.=.|++= ...+..++..|. +|+.+-. T Consensus 170 g~~VlV~G~G~iG-l~~~~~ak~~Ga~~Vi~~d~ 202 (343) T PRK09880 170 GKRVFISGVGPIG-CLIVSAVKTLGAAEIVCADL 202 (343) T ss_pred CCEEEEECCCHHH-HHHHHHHHHCCCCEEEEEEC T ss_conf 9889998477679-99999999869987999979 No 189 >PRK06674 DNA polymerase III subunits gamma and tau; Validated Probab=21.38 E-value=51 Score=12.93 Aligned_cols=14 Identities=14% Similarity=0.197 Sum_probs=5.3 Q ss_pred ECCHHHHHHHHHHH Q ss_conf 09656999999999 Q gi|254781031|r 16 SGGQDSSTCLSWAL 29 (240) Q Consensus 16 SGGlDS~~~l~~l~ 29 (240) ..|.|..-.+-.|+ T Consensus 273 ~~G~d~~~~~~dl~ 286 (563) T PRK06674 273 GQGKDPVRLMEDFI 286 (563) T ss_pred HCCCCHHHHHHHHH T ss_conf 87999999999999 No 190 >PRK13339 malate:quinone oxidoreductase; Reviewed Probab=21.16 E-value=52 Score=12.90 Aligned_cols=27 Identities=19% Similarity=0.107 Sum_probs=21.9 Q ss_pred HCCCCCEEEEEECCHHHHHHHHHHHHC Q ss_conf 135994899980965699999999974 Q gi|254781031|r 5 IKKAPSALLLFSGGQDSSTCLSWALDR 31 (240) Q Consensus 5 ~~~~~~av~l~SGGlDS~~~l~~l~~~ 31 (240) -...+.-|+|.=||+=|++++++|.+- T Consensus 2 ~~~~~~DVvLIGaGIMsaTL~~lL~el 28 (497) T PRK13339 2 AKSESKDVVLVGAGILSTTFGVLLKEL 28 (497) T ss_pred CCCCCCCEEEECCHHHHHHHHHHHHHC T ss_conf 977777889989528899999999976 No 191 >PTZ00019 fructose-bisphosphate aldolase; Provisional Probab=20.90 E-value=53 Score=12.87 Aligned_cols=11 Identities=9% Similarity=-0.272 Sum_probs=7.1 Q ss_pred CCCCHHHCCCC Q ss_conf 14325537216 Q gi|254781031|r 193 VNLILEESHTC 203 (240) Q Consensus 193 ~~~~~~~T~SC 203 (240) ...|+.+|+|= T Consensus 285 ~~~PW~LSFSY 295 (356) T PTZ00019 285 GPHPWALSFSY 295 (356) T ss_pred CCCCCEEEEEH T ss_conf 99996487635 No 192 >COG1832 Predicted CoA-binding protein [General function prediction only] Probab=20.83 E-value=53 Score=12.86 Aligned_cols=39 Identities=18% Similarity=0.093 Sum_probs=29.2 Q ss_pred HHHC-CCCCEEEEEECCH--HHHHHHHHHHHCCCEEEEEEEE Q ss_conf 2213-5994899980965--6999999999749839999976 Q gi|254781031|r 3 DIIK-KAPSALLLFSGGQ--DSSTCLSWALDRFDRVETLSFD 41 (240) Q Consensus 3 ~~~~-~~~~av~l~SGGl--DS~~~l~~l~~~~~~v~~i~~~ 41 (240) ++++ .++-||+-+|.-. +|-.++-.|.+.||+|+||-=. T Consensus 11 ~iL~~~K~IAvVG~S~~P~r~sy~V~kyL~~~GY~ViPVNP~ 52 (140) T COG1832 11 EILKSAKTIAVVGASDKPDRPSYRVAKYLQQKGYRVIPVNPK 52 (140) T ss_pred HHHHHCCEEEEEECCCCCCCCHHHHHHHHHHCCCEEEEECCC T ss_conf 999857669999457999861889999999789989955855 No 193 >PRK06647 DNA polymerase III subunits gamma and tau; Validated Probab=20.43 E-value=54 Score=12.80 Aligned_cols=18 Identities=22% Similarity=0.338 Sum_probs=9.7 Q ss_pred EEEEC--CHHHHHHHHHHHH Q ss_conf 99809--6569999999997 Q gi|254781031|r 13 LLFSG--GQDSSTCLSWALD 30 (240) Q Consensus 13 ~l~SG--GlDS~~~l~~l~~ 30 (240) .|||| |.=-+++|..+.+ T Consensus 41 yLFsGprG~GKTt~ArilAk 60 (560) T PRK06647 41 YIFSGPRGVGKTSSARAFAR 60 (560) T ss_pred EEEECCCCCCHHHHHHHHHH T ss_conf 66328998789999999999 No 194 >PRK13389 UTP--glucose-1-phosphate uridylyltransferase subunit GalU; Provisional Probab=20.26 E-value=51 Score=12.96 Aligned_cols=20 Identities=25% Similarity=0.287 Sum_probs=15.4 Q ss_pred CCHHHCCCCCEEEEEECCHHH Q ss_conf 912213599489998096569 Q gi|254781031|r 1 MNDIIKKAPSALLLFSGGQDS 21 (240) Q Consensus 1 ~~~~~~~~~~av~l~SGGlDS 21 (240) |.+|-|+-+||| .+-+|+=+ T Consensus 1 ~~~m~kki~KaV-I~AAG~GT 20 (302) T PRK13389 1 MAAINTKVKKAV-IPVAGLGT 20 (302) T ss_pred CCCCCCCCCEEE-EECCCCCC T ss_conf 964357756799-98986753 No 195 >KOG1615 consensus Probab=20.24 E-value=54 Score=12.78 Aligned_cols=12 Identities=25% Similarity=0.050 Sum_probs=6.4 Q ss_pred CCHHHHHHHHHH Q ss_conf 988999999998 Q gi|254781031|r 175 LKKYETWKLAQD 186 (240) Q Consensus 175 ~tK~eI~~~~~~ 186 (240) .-|.++++.-.+ T Consensus 158 ggKa~~i~~lrk 169 (227) T KOG1615 158 GGKAEVIALLRK 169 (227) T ss_pred CCCHHHHHHHHH T ss_conf 861899999980 No 196 >pfam04885 Stig1 Stigma-specific protein, Stig1. This family represents the Stig1 cysteine rich plant protein. The STIG1 gene is developmentally regulated and expressed specifically in the stigmatic secretory zone. Probab=20.15 E-value=40 Score=13.57 Aligned_cols=23 Identities=22% Similarity=0.375 Sum_probs=10.4 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCH Q ss_conf 72162888655556688998687 Q gi|254781031|r 200 SHTCYLGKRDKRYEWGYGCNSCP 222 (240) Q Consensus 200 T~SC~~~~~~~~~~~~~hCG~C~ 222 (240) +++|=.+.=........|||.|. T Consensus 97 ~~~CC~G~Cvd~~sd~~hCG~Cg 119 (136) T pfam04885 97 GQTCCGGQCVDLNSDRRNCGKCN 119 (136) T ss_pred CCCCCCCEEECCCCCCCHHCCCC T ss_conf 66056888377777810121244 No 197 >KOG2164 consensus Probab=20.05 E-value=55 Score=12.75 Aligned_cols=16 Identities=25% Similarity=0.760 Sum_probs=12.1 Q ss_pred CCCC--CHHHHHHHHHHH Q ss_conf 9986--877899999999 Q gi|254781031|r 217 GCNS--CPACYLRQKGWM 232 (240) Q Consensus 217 hCG~--C~~C~~R~~af~ 232 (240) -||. |++|+++=|-.- T Consensus 203 ~CGHiFC~~CiLqy~~~s 220 (513) T KOG2164 203 NCGHIFCGPCILQYWNYS 220 (513) T ss_pred CCCCEEEHHHHHHHHHHH T ss_conf 667331068999998553 No 198 >pfam02008 zf-CXXC CXXC zinc finger domain. This domain contains eight conserved cysteine residues that bind to two zinc ions. The CXXC domain is found in a variety of chromatin-associated proteins. This domain binds to nonmethyl-CpG dinucleotides. The domain is characterized by two CGXCXXC repeats. The RecQ helicase has a single repeat that also binds to zinc, but this has not been included in this family. The DNA binding interface has been identified by NMR. Probab=20.01 E-value=26 Score=14.75 Aligned_cols=12 Identities=25% Similarity=0.711 Sum_probs=9.6 Q ss_pred CCCCCHHHHHHH Q ss_conf 998687789999 Q gi|254781031|r 217 GCNSCPACYLRQ 228 (240) Q Consensus 217 hCG~C~~C~~R~ 228 (240) -||.|..|+++. T Consensus 20 dCG~C~~C~d~~ 31 (48) T pfam02008 20 DCGQCSFCRDMP 31 (48) T ss_pred CCCCCHHHHCCC T ss_conf 787772020767 Done!